VILA: Learning Image Aesthetics from User Comments with Vision-Language Pretraining

Junjie Ke, Keren Ye, Jiahui Yu, Yonghui Wu, Peyman Milanfar, Feng Yang. VILA: Learning Image Aesthetics from User Comments with Vision-Language Pretraining. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023, Vancouver, BC, Canada, June 17-24, 2023. pages 10041-10051, IEEE, 2023. [doi]

@inproceedings{KeYYWMY23,
  title = {VILA: Learning Image Aesthetics from User Comments with Vision-Language Pretraining},
  author = {Junjie Ke and Keren Ye and Jiahui Yu and Yonghui Wu and Peyman Milanfar and Feng Yang},
  year = {2023},
  doi = {10.1109/CVPR52729.2023.00968},
  url = {https://doi.org/10.1109/CVPR52729.2023.00968},
  researchr = {https://researchr.org/publication/KeYYWMY23},
  cites = {0},
  citedby = {0},
  pages = {10041-10051},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023, Vancouver, BC, Canada, June 17-24, 2023},
  publisher = {IEEE},
  isbn = {979-8-3503-0129-8},
}