Publication | Closed Access
Trusted Guidance Pyramid Network for Human Parsing
46
Citations
38
References
2018
Year
Unknown Venue
Artificial IntelligenceStructured PredictionSyntactic ParsingEngineeringMachine LearningHuman ParsingNatural Language ProcessingMultimodal LlmImage AnalysisVisual GroundingData ScienceComputational LinguisticsGuidance Pyramid NetworkVisual Question AnsweringMachine TranslationMachine VisionVision Language ModelComputer ScienceDeep LearningSemantic ParsingComputer VisionTreebanksPyramid Residual Pooling
Human parsing, which segments a human-centric image into pixel-wise categorization, has a wide range of applications. However, none of the existing methods can productively solve the issue of label parsing fragmentation due to confused and complicated annotations. In this paper, we propose a novel Trusted Guidance Pyramid Network (TGPNet) to address this limitation. Based on a pyramid architecture, we design a Pyramid Residual Pooling (PRP) module setting at the end of a bottom-up approach to capture both global and local level context. In the top-down approach, we propose a Trusted Guidance Multi-scale Supervision (TGMS) that efficiently integrates and supervises multi-scale contextual information. Furthermore, we present a simple yet powerful Trusted Guidance Framework (TGF) which imposes global-level semantics into parsing results directly without extra ground truth labels in model training. Extensive experiments on two public human parsing benchmarks well demonstrate that our TGPNet has a strong ability in solving label parsing fragmentation problem and has an obtained improvement than other methods.
| Year | Citations | |
|---|---|---|
Page 1
Page 1