Sound to Visual Scene Generation by Audio-to-Visual Latent Alignment

Kim Sung-Bin, Arda Senocak, Hyunwoo Ha, Andrew Owens, Tae Hyun Oh. Sound to Visual Scene Generation by Audio-to-Visual Latent Alignment. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023, Vancouver, BC, Canada, June 17-24, 2023, pages 6430-6440. IEEE, 2023.

Authors

Kim Sung-Bin

Arda Senocak

Hyunwoo Ha

Andrew Owens

Tae Hyun Oh
