SDSP: Scalable and Diverse Synthetic Pairwise Text Generation from Web Corpus Using Large Language Model

Xiaoxu Wu, Xi Li, Wentao Wu, Aleksei Timofeev, Yinfei Yang, Meng Cao, Ping Huang, Si Liu, Jiulong Shan. SDSP: Scalable and Diverse Synthetic Pairwise Text Generation from Web Corpus Using Large Language Model. In Tadahiro Taniguchi, Andrew Chi-Sing Leung, Tadashi Kozuno, Junichiro Yoshimoto, Mufti Mahmud, Maryam Doborjeh, Kenji Doya, editors, Neural Information Processing - 32nd International Conference, ICONIP 2025, Okinawa, Japan, November 20-24, 2025, Proceedings, Part I. Volume 16309 of Lecture Notes in Computer Science, pages 3-16, Springer, 2025.
