Multi-level Multi-modal Feature Fusion for Action Recognition in Videos

Xinghang Hu, Yanli Ji, Kumie Alemu Gedamu. Multi-level Multi-modal Feature Fusion for Action Recognition in Videos. In Dingwen Zhang, Chaowei Fang, Wu Liu, Xinchen Liu, Jingkuan Song, Hongyuan Zhu, Wenbing Huang 0001, John Smith, editors, HCMA@MM 2022: Proceedings of the 3rd International Workshop on Human-Centric Multimedia Analysis, Lisboa, Portugal, October 10, 2022. pages 25-33, ACM, 2022. [doi]

@inproceedings{HuJG22,
  title = {Multi-level Multi-modal Feature Fusion for Action Recognition in Videos},
  author = {Xinghang Hu and Yanli Ji and Kumie Alemu Gedamu},
  year = {2022},
  doi = {10.1145/3552458.3556449},
  url = {https://doi.org/10.1145/3552458.3556449},
  researchr = {https://researchr.org/publication/HuJG22},
  cites = {0},
  citedby = {0},
  pages = {25-33},
  booktitle = {HCMA@MM 2022: Proceedings of the 3rd International Workshop on Human-Centric Multimedia Analysis, Lisboa, Portugal, October 10, 2022},
  editor = {Dingwen Zhang and Chaowei Fang and Wu Liu and Xinchen Liu and Jingkuan Song and Hongyuan Zhu and Wenbing Huang 0001 and John Smith},
  publisher = {ACM},
  isbn = {978-1-4503-9492-5},
}