The following publications are possibly variants of this publication:
- MaskGIT: Masked Generative Image TransformerHuiwen Chang, Han Zhang, Lu Jiang, Ce Liu, William T. Freeman. cvpr 2022: 11305-11315 [doi]
- Text-Conditioned Sampling Framework for Text-to-Image Generation with Masked Generative ModelsJaewoong Lee, Sangwon Jang, Jaehyeong Jo, Jaehong Yoon, Yunji Kim, Jin-Hwa Kim, Jung-Woo Ha 0001, Sung Ju Hwang. iccv 2023: 23195-23205 [doi]
- MaskGCT: Zero-Shot Text-to-Speech with Masked Generative Codec TransformerYuancheng Wang, Haoyue Zhan, Liwei Liu, Ruihong Zeng, Haotian Guo, Jiachen Zheng, Qiang Zhang, Xueyao Zhang, Shunsi Zhang, Zhizheng Wu 0001. iclr 2025: [doi]
- CogView: Mastering Text-to-Image Generation via TransformersMing Ding, Zhuoyi Yang, Wenyi Hong, Wendi Zheng, Chang Zhou, Da Yin, Junyang Lin, Xu Zou, Zhou Shao, Hongxia Yang, Jie Tang 0001. nips 2021: 19822-19835 [doi]
- Knowledge based natural answer generation via masked-graph transformerXiangyu Li, Sen Hu, Lei Zou 0001. www, 25(3):1403-1423, 2022. [doi]
- MAGVIT: Masked Generative Video TransformerLijun Yu, Yong Cheng, Kihyuk Sohn, José Lezama, Han Zhang 0010, Huiwen Chang, Alexander G. Hauptmann, Ming-Hsuan Yang 0001, Yuan-Hao, Irfan Essa, Lu Jiang 0004. cvpr 2023: 10459-10469 [doi]
- Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image SynthesisJinbin Bai, Tian Ye 0001, Wei Chow, Enxin Song, Qing-Guo Chen, Xiangtai Li, Zhen Dong 0003, Lei Zhu 0003, Shuicheng Yan. iclr 2025: [doi]
- Muses: 3D-Controllable Image Generation via Multi-Modal Agent CollaborationYanbo Ding, Shaobin Zhuang, Kunchang Li 0002, Zhengrong Yue, Yu Qiao 0001, Yali Wang. AAAI 2025: 2753-2761 [doi]
- Masked cross-attention and multi-head channel attention guiding single-stage generative adversarial networks for text-to-image generationShouming Hou, Ziying Li, Kuikui Wu, Yinggang Zhao, Hui Li. vc, 40(12):8639-8651, December 2024. [doi]