Improving the generalization of ViTs for action understanding with VLM pre-training

Hui Lu, Albert Ali Salah, Ronald Poppe. Improving the generalization of ViTs for action understanding with VLM pre-training. Pattern Recognition, 179:113794, 2026.
