MixSTE: Seq2seq Mixed Spatio-Temporal Encoder for 3D Human Pose Estimation in Video

Jinlu Zhang, Zhigang Tu, Jianyu Yang, Yujin Chen, Junsong Yuan. MixSTE: Seq2seq Mixed Spatio-Temporal Encoder for 3D Human Pose Estimation in Video. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, LA, USA, June 18-24, 2022, pages 13222-13232. IEEE, 2022. doi:10.1109/CVPR52688.2022.01288

@inproceedings{Zhang0YCY22,
  title = {MixSTE: Seq2seq Mixed Spatio-Temporal Encoder for 3D Human Pose Estimation in Video},
  author = {Jinlu Zhang and Zhigang Tu and Jianyu Yang and Yujin Chen and Junsong Yuan},
  year = {2022},
  doi = {10.1109/CVPR52688.2022.01288},
  url = {https://doi.org/10.1109/CVPR52688.2022.01288},
  researchr = {https://researchr.org/publication/Zhang0YCY22},
  pages = {13222--13232},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, LA, USA, June 18-24, 2022},
  publisher = {IEEE},
  isbn = {978-1-6654-6946-3},
}