Skip to main navigation Skip to search Skip to main content

Globally-Robust Instance Identification and Locally-Accurate Keypoint Alignment for Multi-Person Pose Estimation

  • Fangzheng Tian
  • , Sungchan Kim*
  • *Corresponding author for this work
    • Jeonbuk National University

    Research output: Contribution to conferenceConference paperpeer-review

    Abstract

    Scenes with a large number of human instances are characterized by significant overlap of the instances with similar appearance, occlusion, and scale variation. We propose GRAPE, a novel method that leverages both Globally Robust human instance identification and locally Accurate keypoint alignment for 2D Pose Estimation. GRAPE predicts instance center and keypoint heatmaps, as global identifications of instance location and scale, and keypoint offset vectors from instance centers, as representations of accurate local keypoint positions. We use Transformer to jointly learn the global and local contexts, which allows us to robustly detect instance centers even in difficult cases such as crowded scenes, and align instance offset vectors with relevant keypoint heatmaps, resulting in refined final poses. GRAPE also predicts keypoint visibility, which is crucial for estimating centers of partially visible instances in crowded scenes. We demonstrate that GRAPE achieves state-of-the-art performance on the CrowdPose, OCHuman, and COCO datasets. The benefit of GRAPE is more apparent on crowded scenes (CrowdPose and OCHuman), where our model significantly outperforms previous methods, especially on hard examples.

    Original languageEnglish
    Title of host publicationMM 2023 - Proceedings of the 31st ACM International Conference on Multimedia
    PublisherAssociation for Computing Machinery, Inc
    Pages4816-4827
    Number of pages12
    ISBN (Electronic)9798400701085
    DOIs
    StatePublished - 2023.10.27
    Event31st ACM International Conference on Multimedia, MM 2023 - Ottawa, Canada
    Duration: 2023.10.292023.11.3

    Publication series

    NameMM 2023 - Proceedings of the 31st ACM International Conference on Multimedia

    Conference

    Conference31st ACM International Conference on Multimedia, MM 2023
    Country/TerritoryCanada
    CityOttawa
    Period23.10.2923.11.3

    Keywords

    • crowded scene
    • human pose estimation
    • single-stage
    • transformer

    Quacquarelli Symonds(QS) Subject Topics

    • Computer Science & Information Systems
    • Data Science

    Fingerprint

    Dive into the research topics of 'Globally-Robust Instance Identification and Locally-Accurate Keypoint Alignment for Multi-Person Pose Estimation'. Together they form a unique fingerprint.

    Cite this