Abstract is missing.
- The Limitations of Data, Machine Learning and UsRicardo Baeza-Yates. 1-2 [doi]
- The Journey to a Knowledgeable Assistant with Retrieval-Augmented Generation (RAG)Xin Luna Dong. 3 [doi]
- Making Data Management Better with Vectorized Query ProcessingPeter A. Boncz. 4 [doi]
- Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query EngineAndrew Lamb, Yijie Shen, Daniël Heres, Jayjeet Chakraborty, Mehmet Ozan Kabak, Liang Chi Hsieh, Chao Sun. 5-17 [doi]
- Unified Query Optimization in the Fabric Data WarehouseNicolas Bruno, César A. Galindo-Legaria, Milind Joshi, Esteban Calvo Vargas, Kabita Mahapatra, Sharon Ravindran, Guoheng Chen, Ernesto Cervantes Juárez, Beysim Sezgin. 18-30 [doi]
- Measures in SQLJulian Hyde, John Fremlin. 31-40 [doi]
- ByteCard: Enhancing ByteDance's Data Warehouse with Learned Cardinality EstimationYuxing Han 0002, Haoyu Wang, Lixiang Chen, Yifeng Dong, Xing Chen, Benquan Yu, Chengcheng Yang, Weining Qian. 41-54 [doi]
- Automated Multidimensional Data Layouts in Amazon RedshiftJialin Ding, Matt Abrams, Sanghita Bandyopadhyay, Luciano Di Palma, Yanzhu Ji, Davide Pagano, Gopal Paliwal, Panos Parchas, Pascal Pfeil, Orestis Polychroniou, Gaurav Saxena, Aamer Shah, Amina Voloder, Sherry Xiao, Davis Zhang, Tim Kraska. 55-67 [doi]
- Automated Clustering Recommendation With Database Zone MapsSuratna Budalakoti, Mohamed Ziauddin, Andrew Witkowski, You Jung Kim, Ramarajan Krishnamachari, Alan Wood. 68-79 [doi]
- Similarity Joins of Sparse FeaturesAhmed Metwally 0001, Michael Shum. 80-92 [doi]
- FinSQL: Model-Agnostic LLMs-based Text-to-SQL Framework for Financial AnalysisChao Zhang, Yuren Mao, Yijiang Fan, Yu Mi, Yunjun Gao, Lu Chen 0001, Dongfang Lou, Jinshu Lin. 93-105 [doi]
- Rock: Cleaning Data by Embedding ML in Logic RulesXianchun Bao, Zian Bao, Bie Binbin, Qingsong Duan, Wenfei Fan, Hui Lei, Daji Li, Wei Lin, Peng Liu, Zhicong Lv, Mingliang Ouyang, Shuai Tang, Yaoshu Wang, Qiyuan Wei, Min Xie, Jing Zhang, Xin Zhang, Runxiao Zhao, Shuping Zhou. 106-119 [doi]
- Data-Juicer: A One-Stop Data Processing System for Large Language ModelsDaoyuan Chen, Yilun Huang 0004, Zhijian Ma, Hesen Chen, Xuchen Pan, Ce Ge, Dawei Gao, Yuexiang Xie, Zhaoyang Liu, Jinyang Gao, Yaliang Li, Bolin Ding, Jingren Zhou. 120-134 [doi]
- The Hopsworks Feature Store for Machine LearningJavier de la Rúa Martínez, Fabio Buso, Antonios Kouzoupis, Alexandru A. Ormenisan, Salman Niazi, Davit Bzhalava, Kenneth Mak, Victor Jouffrey, Mikael Ronström, Raymond Cunningham, Ralfs Zangis, Dhananjay Mukhedkar, Ayushman Khazanchi, Vladimir Vlassov, Jim Dowling. 135-147 [doi]
- COSMO: A Large-Scale E-commerce Common Sense Knowledge Generation and Serving System at AmazonChanglong Yu, Xin Liu, Jefferson Maia, Yang Li, Tianyu Cao 0001, YiFan Gao, Yangqiu Song, Rahul Goutam, Haiyang Zhang, Bing Yin, Zheng Li. 148-160 [doi]
- LETUS: A Log-Structured Efficient Trusted Universal BlockChain StorageShikun Tian, Zhonghao Lu, Haizhen Zhuo, Xiaojing Tang, Peiyi Hong, Shenglong Chen, Dayi Yang, Ying Yan 0002, Zhiyong Jiang, Hui Zhang, Guofei Jiang. 161-174 [doi]
- Vortex: A Stream-oriented Storage Engine For Big Data AnalyticsPavan Edara, Jonathan Forbesj, Bigang Li. 175-187 [doi]
- Native Cloud Object Storage in Db2 Warehouse: Implementing a Fast and Cost-Efficient Cloud Storage ArchitectureDavid Kalmuk, Christian Garcia-Arellano, Ronald Barber, Richard Sidle, Kostas Rakopoulos, Hamdi Roumani, William Minor, Alexander Cheung, Robert C. Hooper, Matthew Emmerton, Zach Hoggard, Scott Walkty, Patrick Pérez, Aleksandrs Santars, Michael Chen, Matthew Olan, Daniel C. Zilio, Imran Sayyid, Humphrey Li, Ketan Rampurkar, Krishna K. Ramachandran, Yiren Shen. 188-200 [doi]
- ESTELLE: An Efficient and Cost-effective Cloud Log EngineYupu Zhang, Guanglin Cong, Jihan Qu, Ran Xu, Yuan Fu, Weiqi Li, Feiran Hu, Jing Liu, Wenliang Zhang, Kai Zheng 0001. 201-213 [doi]
- TimeCloth: Fast Point-in-Time Database Recovery in The CloudJianjun Deng, Jianan Lu, Hua Fan 0002, Chaoyang Liu, Shi Cheng, Cuiyun Fu, Wenchao Zhou. 214-226 [doi]
- Proactive Resume and Pause of Resources for Microsoft Azure SQL Database ServerlessOlga Poppe, Pankaj Arora, Sakshi Sharma, Jie Chen, Sachin Pandit, Rahul Sawhney, Vaishali Jhalani, Willis Lang, Qun Guo, Anupriya Inumella, Sanjana Dulipeta Sridhar, Dheren Gala, Nilesh Rathi, Morgan Oslake, Alexandru Chirica, Sarika Iyer, Prateek Goel, Ajay Kalhan. 227-240 [doi]
- Vertically Autoscaling Monolithic Applications with CaaSPER: Scalable Container-as-a-Service Performance Enhanced Resizing Algorithm for the CloudAnna Pavlenko, Joyce Cahoon, Yiwen Zhu, Brian Kroth, Michael Nelson, Andrew Carter, David Liao, Travis Wright, Jesús Camacho-Rodríguez, Karla Saur. 241-254 [doi]
- Flux: Decoupled Auto-Scaling for Heterogeneous Query Workload in Alibaba AnalyticDBWei Li 0226, Jiachi Zhang 0002, Ye Yin, Yan Li, Zhanyang Zhu, Wenchao Zhou, Liang Lin, Feifei Li 0001. 255-268 [doi]
- Intelligent Scaling in Amazon RedshiftVikram Nathan, Vikramank Y. Singh, Zhengchun Liu, Mohammad Rahman, Andreas Kipf, Dominik Horn, Davide Pagano, Gaurav Saxena, Balakrishnan Narayanaswamy, Tim Kraska. 269-279 [doi]
- Stage: Query Execution Time Prediction in Amazon RedshiftZiniu Wu, Ryan Marcus, Zhengchun Liu, Parimarjan Negi, Vikram Nathan, Pascal Pfeil, Gaurav Saxena, Mohammad Rahman, Balakrishnan Narayanaswamy, Tim Kraska. 280-294 [doi]
- PolarDB-MP: A Multi-Primary Cloud-Native Database via Disaggregated Shared MemoryXinjun Yang, Yingqiang Zhang, Hao Chen, Feifei Li 0001, Bo Wang, Jing Fang, Chuan Sun, Yuhui Wang. 295-308 [doi]
- Amazon MemoryDB: A Fast and Durable Memory-First Cloud DatabaseYacine Taleb, Kevin McGehee, Nan Yan, Shawn Wang, Stefan C. Müller, Allen Samuels. 309-320 [doi]
- Extending Polaris to Support TransactionsJosep Aguilar-Saborit, Raghu Ramakrishnan 0001, Kevin Bocksrocker, Alan Halverson, Konstantin Kosinsky, Ryan O'Connor, Nadejda Poliakova, Moe Shafiei, Haris Mahmood Ansari, Bogdan Crivat, Conor Cunningham, Taewoo Kim, Phil Kon-Kim, Ishan Rajesh Madan, Blazej Matuszyk, Matt Miles, Sumin Mohanan, Cristian Petculescu, Emma Rose-Wirshing, Elias Yousefi, Amin Abadi. 321-333 [doi]
- BigLake: BigQuery's Evolution toward a Multi-Cloud LakehouseJustin J. Levandoski, Garrett Casto, Mingge Deng, Rushabh Desai, Pavan Edara, Thibaud Hottelier, Amir Hormati, Anoop Johnson, Jeff Johnson, Dawid Kurzyniec, Sam McVeety, Prem Ramanathan, Gaurav Saxena, Vidya Shanmugan, Yuri Volobuev. 334-346 [doi]
- Predicate Caching: Query-Driven Secondary Indexing for Cloud Data WarehousesTobias Schmidt, Andreas Kipf, Dominik Horn, Gaurav Saxena, Tim Kraska. 347-359 [doi]
- BG3: A Cost Effective and I/O Efficient Graph Database in BytedanceWei Zhang, Cheng Chen, Qiange Wang, Wei Wang, Shijiao Yang, BingYu Zhou, Huiming Zhu, Chao Chen, Yongjun Zhao, Yingqian Hu, MiaoMiao Cheng, Meng Li, Hongfei Tan, Mengjin Liu, Hexiang Lin, Shuai Zhang, Lei Zhang. 360-372 [doi]
- PG-Triggers: Triggers for Property GraphsStefano Ceri, Anna Bernasconi 0002, Alessia Gagliardi, Davide Martinenghi, Luigi Bellomarini, Davide Magnanimi. 373-385 [doi]
- GraphScope Flex: LEGO-like Graph Computing StackTao He, Shuxian Hu, Longbin Lai, Dongze Li, Neng Li, Xue Li, Lexiao Liu, Xiaojian Luo, Bingqing Lyu, Ke Meng, Sijie Shen, Li Su, Lei Wang, Jingbo Xu 0001, Wenyuan Yu, Weibin Zeng, Lei Zhang, Siyuan Zhang, Jingren Zhou, Xiaoli Zhou, Diwen Zhu. 386-399 [doi]
- Bouncer: Admission Control with Response Time Objectives for Low-latency Online Data SystemsHao Xu, Juan A. Colmenares. 400-413 [doi]
- NPA: Improving Large-scale Graph Neural Networks with Non-parametric AttentionWentao Zhang, Guochen Yan, Yu Shen, Yang Ling, Yangyu Tao, Bin Cui 0001, Jian Tang. 414-427 [doi]
- Demonstration of Ver: View Discovery in the WildKevin Dharmawan, Chirag A. Kawediya, Yue Gong, Zaki Indra Yudhistira, Zhiru Zhu, Sainyam Galhotra, Adila Alfa Krisnadhi, Raul Castro Fernandez. 428-431 [doi]
- Comquest: Large Scale User Comment Crawling and IntegrationZhijia Chen, Lihong He 0001, Arjun Mukherjee, Eduard C. Dragut. 432-435 [doi]
- QueryShield: Cryptographically Secure Analytics in the CloudEthan Seow, Yan Tong, Eli Baum, Sam Buxbaum, Muhammad Faisal, John Liagouris, Vasiliki Kalavri, Mayank Varia. 436-439 [doi]
- SIERRA: A Counterfactual Thinking-based Visual Interface for Property Graph Query ConstructionJiebing Ma, Sourav S. Bhowmick, Lester Tay, Byron Choi. 440-443 [doi]
- Sawmill: From Logs to Causal Diagnosis of Large SystemsMarkos Markakis, An Bo Chen, Brit Youngmann, Trinity Gao, Ziyu Zhang, Rana Shahout, Peter Baile Chen, Chunwei Liu, Ibrahim Sabek, Michael J. Cafarella. 444-447 [doi]
- Demonstrating REmatch: A Novel RegEx Engine for Finding all MatchesKyle Bossonney, Vicente Calisto, Cristian Riveros, Gustavo Toro, Nicolás Van Sint Jan, Domagoj Vrgoc. 448-451 [doi]
- ASQP-RL Demo: Learning Approximation Sets for Exploratory QueriesSusan B. Davidson, Tova Milo, Kathy Razmadze, Gal Zeevi. 452-455 [doi]
- IMBridge: Impedance Mismatch Mitigation between Database Engine and Prediction Query ExecutionChenyang Zhang, Junxiong Peng, Chen Xu, Quanqing Xu, Chuanhui Yang. 456-459 [doi]
- ASM in Action: Fast and Practical Learned Cardinality EstimationSangoh Lee, Kyoungmin Kim 0002, Wook-Shin Han. 460-463 [doi]
- The Game Of Recourse: Simulating Algorithmic Recourse over Time to Improve Its Reliability and FairnessAndrew Bell, João Fonseca, Julia Stoyanovich. 464-467 [doi]
- RobOpt: A Tool for Robust Workload Optimization Based on Uncertainty-Aware Machine LearningAmin Kamali, Verena Kantere, Calisto Zuzarte, Vincent Corvinelli. 468-471 [doi]
- Demonstrating CAESURA: Language Models as Multi-Modal Query PlannersMatthias Urban, Carsten Binnig. 472-475 [doi]
- Demonstration of Udon: Line-by-line Debugging of User-Defined Functions in Data WorkflowsYicong Huang 0002, Zuozhi Wang, Chen Li 0001. 476-479 [doi]
- UniTS: A Universal Time Series Analysis Framework Powered by Self-Supervised Representation LearningZhiyu Liang, Chen Liang, Zheng Liang, Hongzhi Wang, Bo Zheng. 480-483 [doi]
- ChatPipe: Orchestrating Data Preparation Pipelines by Optimizing Human-ChatGPT InteractionsSibei Chen, Hanbing Liu, Waiting Jin, Xiangyu Sun, Xiaoyao Feng, Ju Fan, Xiaoyong Du 0001, Nan Tang 0001. 484-487 [doi]
- Responsible Model Selection with Virny and VirnyViewDenys Herasymuk, Falaah Arif Khan, Julia Stoyanovich. 488-491 [doi]
- Property Graph Stream Processing In Action with SeraphRiccardo Tommasini 0001, Christopher Rost, Angela Bonifati, Emanuele Della Valle, Erhard Rahm, Keith W. Hare, Stefan Plantikow, Petra Selmer, Hannes Voigt. 492-495 [doi]
- MillenniumDB: A Multi-modal, Multi-model Graph DatabaseDomagoj Vrgoc, Carlos Rojas, Renzo Angles, Marcelo Arenas, Vicente Calisto, Benjamín Farias, Sebastián Ferrada, Tristan Heuer, Aidan Hogan, Gonzalo Navarro 0001, Alexander Pinto, Juan L. Reutter, Henry Rosales-Méndez, Etienne Toussaint. 496-499 [doi]
- IDE: A System for Iterative Mislabel DetectionYuhao Deng, Deng Qiyan, Chengliang Chai, Lei Cao 0004, Nan Tang 0001, Ju Fan, Jiayi Wang 0002, Ye Yuan, Guoren Wang. 500-503 [doi]
- A Demonstration of GPTuner: A GPT-Based Manual-Reading Database Tuning SystemJiale Lao, Yibo Wang, Yufei Li, Jianping Wang, Yunjia Zhang, Zhiyuan Cheng 0010, Wanghu Chen, Yuanchun Zhou, MingJie Tang, Jianguo Wang 0001. 504-507 [doi]
- Demonstrating λ-Tune: Exploiting Large Language Models for Workload-Adaptive Database System TuningVictor Giannakouris, Immanuel Trummer. 508-511 [doi]
- User-friendly, Interactive, and Configurable Explanations for Graph Neural Networks with Graph ViewsTingyang Chen, Dazhuo Qiu, Yinghui Wu, Arijit Khan 0001, Xiangyu Ke, Yunjun Gao. 512-515 [doi]
- OpenIVM: a SQL-to-SQL Compiler for Incremental ComputationsIlaria Battiston, Kriti Kathuria, Peter A. Boncz. 516-519 [doi]
- Building Reactive Large Language Model Pipelines with MotionShreya Shankar, Aditya G. Parameswaran. 520-523 [doi]
- Demonstrating Nexus for Correlation Discovery over Collections of Spatio-Temporal Tabular DataYue Gong, Raul Castro Fernandez. 524-527 [doi]
- PLUTUS: Understanding Data Distribution Tailoring for Machine LearningJiwon Chang, Christina Dionysio, Fatemeh Nargesian, Matthias Boehm 0001. 528-531 [doi]
- Multi-Backend Zonal Statistics Execution with RavenGereon Dusella, Haralampos Gavriilidis, Laert Nuhu, Volker Markl, Eleni Tzirita Zacharatou. 532-535 [doi]
- ShiftScope: Adapting Visualization Recommendations to Users' Dynamic Data FocusSanad Saha, Nischal Aryal, Leilani Battle, Arash Termehchy. 536-539 [doi]
- Demonstration of ElasticNotebook: Migrating Live Computational Notebook StatesZhaoheng Li, Supawit Chockchowwat, Hanxi Fang, Ribhav Sahu, Sumay Thakurdesai, Kantanat Pridaphatrakun, Yongjoo Park. 540-543 [doi]
- The Future of Graph AnalyticsAngela Bonifati, M. Tamer Özsu, Yuanyuan Tian, Hannes Voigt, Wenyuan Yu, Wenjie Zhang 0001. 544-545 [doi]
- AI for SystemsCarlo Curino, Raghu Ramakrishnan 0001. 546 [doi]
- Demystifying Data Management for Large Language ModelsXupeng Miao, Zhihao Jia, Bin Cui 0001. 547-555 [doi]
- SmartNICs in the Cloud: The Why, What and How of In-network Processing for Data-Intensive ApplicationsFaeze Faghih, Tobias Ziegler 0001, Zsolt István, Carsten Binnig. 556-560 [doi]
- Learned Query Optimizer: What is New and What is NextRong Zhu, Lianggui Weng, Bolin Ding, Jingren Zhou. 561-569 [doi]
- Distributed Transaction Processing in Untrusted EnvironmentsMohammad Javad Amiri, Divyakant Agrawal, Amr El Abbadi, Boon Thau Loo. 570-579 [doi]
- Responsible Sharing of Spatiotemporal DataRaul Castro Fernandez, Arnab Nandi 0001. 580-584 [doi]
- Querying Graph Databases at ScaleAidan Hogan, Domagoj Vrgoc. 585-589 [doi]
- Cognitive Psychology Meets Data Management: State of the Art and Future DirectionsSourav S. Bhowmick, S. H. Annabel Chen, Divesh Srivastava. 590-596 [doi]
- Vector Database Management Techniques and SystemsJames Jie Pan, Jianguo Wang 0001, Guoliang Li. 597-604 [doi]
- An Overview of Continuous Querying in (Modern) Data SystemsAngela Bonifati, Riccardo Tommasini 0001. 605-612 [doi]
- SIMDified Data Processing - Foundations, Abstraction, and Advanced TechniquesDirk Habich, Johannes Pietrzyk. 613-621 [doi]
- Machine Learning for Databases: Foundations, Paradigms, and Open problemsGao Cong, Jingyi Yang, Yue Zhao. 622-629 [doi]
- Applications and Computation of the Shapley Value in Databases and Machine LearningXuan Luo, Jian Pei. 630-635 [doi]
- Beyond Bloom: A Tutorial on Future Feature-Rich FiltersPrashant Pandey 0001, Martín Farach-Colton, Niv Dayan, Huanchen Zhang. 636-644 [doi]
- International Workshop on Data Management on New Hardware (DaMoN)Carsten Binnig, Nesime Tatbul. 645-646 [doi]
- Second Workshop on Simplicity in Management of Data (SiMoD)Danica Porobic, Tianzheng Wang 0001. 647-648 [doi]
- Seventh International Workshop on Exploiting Artificial Intelligence Techniques for Data Management (aiDM)Rajesh Bordawekar, Oded Shmueli, Yael Amsterdamer, Renata Borovica-Gajic, Donatella Firmani. 649-650 [doi]
- Eighth Workshop on Data Management for End-to-End Machine Learning (DEEM)Madelon Hulsebos, Matteo Interlandi, Shreya Shankar. 651-652 [doi]
- GRADES-NDA'24: 7th Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)Olaf Hartig, Zoi Kaoudi. 653-654 [doi]
- Fourth International Workshop on Big Data in Emergent Distributed Environments (BiDEDE)Philippe Cudré-Mauroux, Andrea Ko, Robert Wrembel. 655-656 [doi]
- Eighth Workshop on Human-In-the-Loop Data Analytics (HILDA)Jean-Daniel Fekete, Kexin Rong 0001, Behrooz Omidvar Tehrani, Roee Shraga. 657-658 [doi]
- Third International Workshop on Data Systems Education (DataEd'24)Daphne Miedema, Sourav S. Bhowmick, Michael Liut. 659-660 [doi]
- First Workshop on Governance, Understanding and Integration of Data for Effective and Responsible AI (GUIDE-AI)Abolfazl Asudeh, Sainyam Galhotra, Amir Gilad, Babak Salimi, Brit Youngmann. 661-662 [doi]
- First Workshop on Quantum Computing and Quantum-Inspired Technology for Data-Intensive Systems and Applications (Q-Data)Ibrahim Sabek, Immanuel Trummer, Stefan Prestel. 663-664 [doi]
- Tenth International Workshop on Testing Database Systems (DBTest)Anja Gruenheid, Manuel Rigger. 665-666 [doi]