PoseTED: a Novel Regression-Based Technique for Recognizing Multiple Pose Instances

EasyChair Preprint 6791

12 pages • Date: October 6, 2021

Afsana Ahsan Jeny, Masum Shah Junayed and Md Baharul Islam

Abstract

Pose estimation for multiple people can be viewed as a hierarchical set prediction problem: algorithms must correctly identify every person together with their body parts. Pose estimation methods fall into two categories: (1) heatmap-based and (2) regression-based. Heatmap-based techniques depend on various heuristic designs and are not end-to-end trainable, while regression-based methods involve fewer intermediate non-differentiable stages. This paper presents a novel regression-based multi-instance human pose recognition network called PoseTED. It uses the well-known object detector YOLOv4 for person detection and a spatial transformer network (STN) as a cropping filter. A CNN-based backbone then extracts deep features, and an encoder-decoder transformer with positional encoding is applied for keypoint detection, which addresses the heuristic design problem of previous regression-based techniques and improves overall performance. A prediction-based feed-forward network (FFN) predicts the locations of several keypoints as a group and outputs the body components. PoseTED is evaluated on two public datasets, COCO and MPII, and achieves an average precision (AP) of 73.7% on the COCO val set, 72.7% on the COCO test-dev set, and 89.7% on the MPII dataset. These results are comparable to state-of-the-art methods.
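The cropping and regression steps described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the STN is parameterized or how predicted keypoints are mapped back to the image, so everything below is an assumption: the person box is taken to be axis-aligned, the crop is expressed as a 2x3 affine matrix over coordinates normalized to [-1, 1], and the function names `crop_theta` and `crop_to_image` are hypothetical, not taken from the paper.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2) in pixels (assumed layout)


def crop_theta(box: Box, width: float, height: float) -> List[List[float]]:
    """Build a 2x3 affine matrix that maps normalized crop coordinates
    in [-1, 1] to normalized coordinates of the full image.

    Assumption: -1 and +1 correspond to the image borders (0 and width/height).
    """
    x1, y1, x2, y2 = box
    return [
        [(x2 - x1) / width, 0.0, (x1 + x2) / width - 1.0],
        [0.0, (y2 - y1) / height, (y1 + y2) / height - 1.0],
    ]


def crop_to_image(
    theta: List[List[float]],
    point: Tuple[float, float],
    width: float,
    height: float,
) -> Tuple[float, float]:
    """Map one regressed keypoint from normalized crop coordinates
    back to pixel coordinates in the full image."""
    u, v = point
    xn = theta[0][0] * u + theta[0][1] * v + theta[0][2]
    yn = theta[1][0] * u + theta[1][1] * v + theta[1][2]
    # Undo the [-1, 1] normalization to get pixels.
    return ((xn + 1.0) * width / 2.0, (yn + 1.0) * height / 2.0)


# Hypothetical detection: one person box in a 640x480 image.
theta = crop_theta((100.0, 50.0, 300.0, 450.0), 640.0, 480.0)
# A keypoint regressed at the center of the crop lands at the box center.
center = crop_to_image(theta, (0.0, 0.0), 640.0, 480.0)
```

Under these assumptions, a detector such as YOLOv4 would supply the box, and a regression head would output keypoints in crop coordinates; because the mapping is a plain affine transform, it stays differentiable, which is the property the abstract attributes to regression-based pipelines.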

Keyphrases: FFN, STN, Transformer encoder-decoder, person detection, pose recognition

Links:

https://easychair.org/publications/preprint/WhW9

BibTeX entry

BibTeX does not have the right entry for preprints. This is a hack for producing the correct reference:

@booklet{EasyChair:6791,
  author    = {Afsana Ahsan Jeny and Masum Shah Junayed and Md Baharul Islam},
  title     = {PoseTED: a Novel Regression-Based Technique for Recognizing Multiple Pose Instances},
  howpublished = {EasyChair Preprint 6791},
  year      = {EasyChair, 2021}}
