The following publications are possibly variants of this publication:
- Latent semantic retrieval of personal photos with sparse user annotation by fused image/speech/text featuresYi-Sheng Fu, Chia-Yu Wan, Lin-Shan Lee. icassp 2009: 1969-1972 [doi]
- Semantic retrieval of personal photos using matrix factorization and two-layer random walk fusing sparse speech annotations with visual featuresYuan-ming Liou, Yi-Sheng Fu, Hung-yi Lee, Lin-Shan Lee. interspeech 2014: 1762-1766 [doi]
- Enhancing sparse voice annotation for semantic retrieval of personal photos by continuous space word representationsYuan-ming Liou, Hung-tsung Lu, Yi-Sheng Fu, Winston Hsu, Lin-Shan Lee. icassp 2015: 5341-5345 [doi]
- Semantic retrieval of personal photos using a deep autoencoder fusing visual features with speech annotations represented as word/paragraph vectorsHung-tsung Lu, Yuan-ming Liou, Hung-yi Lee, Lin-Shan Lee. interspeech 2015: 140-144 [doi]