The following publications are possibly variants of this publication:
- OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video GenerationKepan Nan, Rui Xie, Penghao Zhou, Tiehan Fan, Zhenheng Yang, Zhijie Chen, Xiang Li 0041, Jian Yang 0003, Ying Tai. iclr 2025: [doi]
- HOIGen-1M: A Large-scale Dataset for Human-Object Interaction Video GenerationKun Liu 0016, Qi Liu 0081, Xinchen Liu, Jie Li, Yongdong Zhang 0001, Jiebo Luo 0001, Xiaodong He, Wu Liu. cvpr 2025: 24001-24010 [doi]
- Scalable video object coding & QoS control for next generation space internetGuofang Tu, Can Zhang, Heinrich Niemann, Jie Xu, Weiren Wu. chinaf, 51(5):599-608, 2008. [doi]
- VaTeX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language ResearchXin Wang, Jiawei Wu, Junkun Chen, Lei Li, Yuan-Fang Wang, William Yang Wang. iccv 2019: 4580-4590 [doi]
- InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and GenerationYi Wang, Yinan He, Yizhuo Li 0001, Kunchang Li 0002, Jiashuo Yu, Xin Ma, Xinhao Li, Guo Chen 0006, Xinyuan Chen, Yaohui Wang 0004, Ping Luo 0002, Ziwei Liu 0002, Yali Wang 0001, Limin Wang 0002, Yu Qiao 0001. iclr 2024: [doi]