Hybrid CNN-Transformer Architecture for Object Detection and Multimodal Captioning in Educational Contexts

Leila Habibi, Madjid Maidi, Larbi Boubchir, Boubaker Daachi. Hybrid CNN-Transformer Architecture for Object Detection and Multimodal Captioning in Educational Contexts. In IEEE International Conference on Big Data, BigData 2025, Macau, China, December 8-11, 2025. pages 4576-4585, IEEE, 2025. [doi]

@inproceedings{HabibiMBD25,
  title = {Hybrid CNN-Transformer Architecture for Object Detection and Multimodal Captioning in Educational Contexts},
  author = {Leila Habibi and Madjid Maidi and Larbi Boubchir and Boubaker Daachi},
  year = {2025},
  doi = {10.1109/BigData66926.2025.11401600},
  url = {https://doi.org/10.1109/BigData66926.2025.11401600},
  researchr = {https://researchr.org/publication/HabibiMBD25},
  cites = {0},
  citedby = {0},
  pages = {4576-4585},
  booktitle = {IEEE International Conference on Big Data, BigData 2025, Macau, China, December 8-11, 2025},
  publisher = {IEEE},
  isbn = {979-8-3315-9447-3},
}