The following publications are possibly variants of this publication:
- MaskGIT: Masked Generative Image Transformer. Huiwen Chang, Han Zhang, Lu Jiang, Ce Liu, William T. Freeman. cvpr 2022: 11305-11315 [doi]
- Accelerated masked transformer for dense video captioning. Zhou Yu, Nanjia Han. ijon, 445:72-80, 2021. [doi]
- Muse: Text-To-Image Generation via Masked Generative Transformers. Huiwen Chang, Han Zhang, Jarred Barber, Aaron Maschinot, José Lezama, Lu Jiang 0004, Ming-Hsuan Yang 0001, Kevin Patrick Murphy, William T. Freeman, Michael Rubinstein, Yuanzhen Li, Dilip Krishnan. icml 2023: 4055-4075 [doi]
- An Empirical Study of End-to-End Video-Language Transformers with Masked Visual Modeling. Tsu-Jui Fu, Linjie Li, Zhe Gan, Kevin Lin, William Yang Wang, Lijuan Wang, Zicheng Liu 0001. cvpr 2023: 22898-22909 [doi]
- Masked Face Transformer. Weisong Zhao, Xiangyu Zhu, Kaiwen Guo, Haichao Shi, Xiaoyu Zhang 0002, Zhen Lei 0001. tifs, 19:265-279, 2024. [doi]