Shilin Yan, Renrui Zhang, Ziyu Guo, Wenchao Chen, Wei Zhang, Hongyang Li 0001, Yu Qiao, Hao Dong 0003, Zhongjiang He, Peng Gao 0007. Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation. In Michael J. Wooldridge, Jennifer G. Dy, Sriraam Natarajan, editors, Thirty-Eigth AAAI Conference on Artificial Intelligence, AAAI 2024, Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence, IAAI 2024, Fourteenth Symposium on Educational Advances in Artificial Intelligence, EAAI 2014, February 20-27, 2024, Vancouver, Canada. pages 6449-6457, AAAI Press, 2024. [doi]
Abstract is missing.