Abstract is missing.
- VisionLLaMA: A Unified LLaMA Backbone for Vision TasksXiangxiang Chu, Jianlin Su, Bo Zhang 0046, Chunhua Shen. 1-18 [doi]
- Frugal 3D Point Cloud Model Training via Progressive Near Point Filtering and Fused AggregationDonghyun Lee, Yejin Lee 0001, Jae W. Lee, Hongil Yoon. 19-35 [doi]
- HVCLIP: High-Dimensional Vector in CLIP for Unsupervised Domain AdaptationNoranart Vesdapunt, Kah Kuen Fu, Yue Wu, Xu Zhang, Pradeep Natarajan. 36-54 [doi]
- Improving 3D Semi-supervised Learning by Effectively Utilizing All Unlabelled DataSneha Paul, Zachary Patterson, Nizar Bouguila. 55-71 [doi]
- PRET: Planning with Directed Fidelity Trajectory for Vision and Language NavigationRenjie Lu, Jingke Meng, Wei-Shi Zheng 0001. 72-88 [doi]
- MART: MultiscAle Relational Transformer Networks for Multi-agent Trajectory PredictionSeongju Lee, Junseok Lee, Yeonguk Yu, Taeri Kim, Kyoobin Lee. 89-107 [doi]
- Expanding Scene Graph Boundaries: Fully Open-Vocabulary Scene Graph Generation via Visual-Concept Alignment and RetentionZuyao Chen, Jinlin Wu, Zhen Lei 0001, Zhaoxiang Zhang 0001, Chang Wen Chen. 108-124 [doi]
- Few-Shot NeRF by Adaptive Rendering Loss RegularizationQingshan Xu 0001, Xuanyu Yi, Jianyao Xu, Wenbing Tao, Yew-Soon Ong, Hanwang Zhang. 125-142 [doi]
- Investigating Style Similarity in Diffusion ModelsGowthami Somepalli, Anubhav Gupta, Kamal Gupta 0002, Shramay Palta, Micah Goldblum, Jonas Geiping, Abhinav Shrivastava, Tom Goldstein. 143-160 [doi]
- JDT3D: Addressing the Gaps in LiDAR-Based Tracking-by-AttentionBrian Cheong, Jiachen Zhou, Steven Lake Waslander. 161-177 [doi]
- MagicMirror: Fast and High-Quality Avatar Generation with a Constrained Search SpaceArmand Comas Massague, Di Qiu, Menglei Chai, Marcel C. Bühler, Amit Raj, RuiQi Gao, Qiangeng Xu, Mark Matthews, Paulo F. U. Gotardo, Sergio Orts-Escolano, Thabo Beeler. 178-196 [doi]
- EntAugment: Entropy-Driven Adaptive Data Augmentation Framework for Image ClassificationSuorong Yang, Furao Shen, Jian Zhao 0013. 197-214 [doi]
- Timestep-Aware Correction for Quantized Diffusion ModelsYuzhe Yao, Feng Tian 0002, Jun Chen 0023, Haonan Lin, Guang Dai, Yong Liu 0007, Jingdong Wang 0001. 215-232 [doi]
- SPARO: Selective Attention for Robust and Compositional Transformer Encodings for VisionAnkit Vani, Bac Nguyen, Samuel Lavoie, Ranjay Krishna, Aaron C. Courville. 233-251 [doi]
- Towards Compact Reversible Image Representations for Neural Style TransferXiyao Liu 0001, Siyu Yang, Jian Zhang 0048, Gerald Schaefer, Jiya Li, Xunli Fan, Songtao Wu, Hui Fang 0003. 252-268 [doi]
- Out-of-Bounding-Box Triggers: A Stealthy Approach to Cheat Object DetectorsTao Lin, Lijia Yu, Gaojie Jin, Renjue Li, Peng Wu, Lijun Zhang. 269-287 [doi]
- GTMS: A Gradient-Driven Tree-Guided Mask-Free Referring Image Segmentation MethodHaoxin Lyu, Tianxiong Zhong, Sanyuan Zhao. 288-304 [doi]
- Long-Term Temporal Context Gathering for Neural Video CompressionLinfeng Qi, Zhaoyang Jia, Jiahao Li, Bin Li 0012, Houqiang Li, Yan Lu 0001. 305-322 [doi]
- VQA-Diff: Exploiting VQA and Diffusion for Zero-Shot Image-to-3D Vehicle Asset Generation in Autonomous DrivingYibo Liu, Zheyuan Yang, Guile Wu, Yuan Ren, Kejian Lin, Bingbing Liu, Yang Liu, Jinjun Shan. 323-340 [doi]
- From Pixels to Objects: A Hierarchical Approach for Part and Object Segmentation Using Local and Global AggregationYunfei Xie, Cihang Xie, Alan L. Yuille, Jieru Mei. 341-356 [doi]
- Leveraging Text Localization for Scene Text Removal via Text-Aware Masked Image ModelingZixiao Wang 0002, Hongtao Xie, Yuxin Wang 0002, Yadong Qu, Fengjun Guo, Pengwei Liu. 357-373 [doi]
- Unmasking Bias in Diffusion Model TrainingHu Yu, Li Shen, Jie Huang, Hongsheng Li, Feng Zhao. 374-390 [doi]
- Multimodal Label Relevance Ranking via Reinforcement LearningTaian Guo, Taolin Zhang 0003, Haoqian Wu, Hanjun Li 0002, Ruizhi Qiao, Xing Sun. 391-408 [doi]
- Animate Your Motion: Turning Still Images into Dynamic VideosMingxiao Li, Bo Wan, Marie-Francine Moens, Tinne Tuytelaars. 409-425 [doi]
- Layered Rendering Diffusion Model for Controllable Zero-Shot Image SynthesisZipeng Qi, Guoxi Huang, Chenyang Liu, Fei Ye. 426-443 [doi]
- CIC-BART-SSA: Controllable Image Captioning with Structured Semantic AugmentationKalliopi Basioti, Mohamed A. Abdelsalam, Federico Fancellu, Vladimir Pavlovic 0001, Afsaneh Fazly. 444-461 [doi]
- A Simple Background Augmentation Method for Object Detection with Diffusion ModelYuhang Li, Xin Dong 0009, Chen Chen 0043, Weiming Zhuang, Lingjuan Lyu. 462-479 [doi]