- Wei Huang 0039, Anda Cheng, Yinggui Wang, Lei Wang 0251, Tao Wei 0002. LLM-AutoDP: Automatic Data Processing via LLM Agents for Model Fine-tuning. PVLDB, 19(5):794-807, January 2026.
- Ferdinand Kossmann, Ziniu Wu, Alex Turk, Nesime Tatbul, Lei Cao 0004, Samuel Madden 0001. Ken: An Execution Engine for Unstructured Database Systems. PVLDB, 19(5):902-916, January 2026.
- Danni Wu, Yuanyuan Xu 0002, Xuemin Lin 0001, Wenjie Zhang 0001, Ying Zhang 0001. Understanding Evolving Graph Structures for Large Discrete-Time Dynamic Graph Representation. PVLDB, 19(5):862-875, January 2026.
- Zixin Wei, Yucan Guo, Jinyang Li 0003, Xiaolin Han 0002, Xiaolong Jin, Chenhao Ma 0001. Revisiting Task-Oriented Dataset Search in the Era of Large Language Models: Challenges, Benchmark, and Solution. PVLDB, 19(5):973-986, January 2026.
- Jian Zhou, Luna Wang, Shuaihua Zhao, Chen Zhong 0002, Song Jiang 0001. LiBox: A Learned Index as an Array to Minimize Last-Mile Search. PVLDB, 19(5):836-848, January 2026.
- Pengyu Chen, Zizheng Guo, Jianwei Yang, Dongjing Miao. Towards Efficient Random-Order Enumeration for Join Queries. PVLDB, 19(5):889-901, January 2026.
- Shengkun Zhu, Jinshan Zeng, Yuan Sun 0003, Sheng Wang 0007, Yiming Wang, Yushuai Ji, Feiping Nie, Xiaodong Li 0001, Zhiyong Peng 0001. Highly-Efficient Large-Scale k-means with Individual Fairness. PVLDB, 19(5):808-821, January 2026.
- Matthew Russo, Chunwei Liu, Sivaprasad Sudhir, Gerardo Vitagliano, Michael J. Cafarella, Tim Kraska, Samuel Madden 0001. Abacus: A Cost-Based Optimizer for Semantic Operator Systems. PVLDB, 19(5):1060-1073, January 2026.
- Tatsuhiro Nakamori, Hideyuki Kawashima. Libra: One-Shot Parameter Sensitivity Estimation for Transfer Learning in Database Performance Prediction. PVLDB, 19(5):945-957, January 2026.
- Cong Yu, Tuo Shi, Matthias Weidlich 0001, Bo Zhao 0019. SHARP: Shared State Reduction for Efficient Matching of Sequential Patterns. PVLDB, 19(5):987-1000, January 2026.