Gaze-assisted automatic captioning of fetal ultrasound videos using three-way multi-modal deep neural networks

Mohammad Alsharid, Yifan Cai, Harshita Sharma, Lior Drukker, Aris T. Papageorghiou, J. Alison Noble. Gaze-assisted automatic captioning of fetal ultrasound videos using three-way multi-modal deep neural networks. Medical Image Analysis, 82:102630, 2022.

@article{AlsharidCSDPN22,
  title = {Gaze-assisted automatic captioning of fetal ultrasound videos using three-way multi-modal deep neural networks},
  author = {Mohammad Alsharid and Yifan Cai and Harshita Sharma and Lior Drukker and Aris T. Papageorghiou and J. Alison Noble},
  year = {2022},
  doi = {10.1016/j.media.2022.102630},
  url = {https://doi.org/10.1016/j.media.2022.102630},
  researchr = {https://researchr.org/publication/AlsharidCSDPN22},
  cites = {0},
  citedby = {0},
  journal = {Medical Image Analysis},
  volume = {82},
  pages = {102630},
}