Docs2Synth: A Synthetic Data Tuned Retriever Framework for Documents Understanding

Yihao Ding, Qiang Sun 0006, Puzhen Wu, Sirui Li, Siwen Luo, Wei Liu 0006. Docs2Synth: A Synthetic Data Tuned Retriever Framework for Documents Understanding. In Companion Proceedings of the ACM Web Conference 2026, WWW Companion 2026, Dubai, United Arab Emirates, 29 June 2026 - 3 July 2026. pages 124-127, ACM, 2026.

Authors

Yihao Ding

Qiang Sun 0006

Puzhen Wu

Sirui Li

Siwen Luo

Wei Liu 0006
