| 0 | -- | 0 | Lei Chen 0002, Fatma Özcan. Front Matter |
| 1778 | -- | 1781 | Guilherme Damasio, Spencer Bryson, Vincent Corvinelli, Parke Godfrey, Piotr Mierzejewski, Jaroslaw Szlichta, Calisto Zuzarte. GALO: Guided Automated Learning for re-Optimization |
| 1782 | -- | 1785 | Yuanyuan Tian, Suijun Tong, Mir Hamid Pirahesh, Wen Sun, En Liang Xu, Wei Zhao. Synergistic Graph and SQL Analytics Inside IBM Db2 |
| 1786 | -- | 1789 | Xiaoou Ding, Hongzhi Wang, Jiaxuan Su, Zijue Li, Jianzhong Li, Hong Gao. Cleanits: A Data Cleaning System for Industrial Time Series |
| 1790 | -- | 1793 | Yipeng Zhang, Zhifeng Bao, Songsong Mo, Yuchen Li, Yanghao Zhou. ITAA: An Intelligent Trajectory-driven Outdoor Advertising Deployment Assistant |
| 1794 | -- | 1797 | Kun Qian 0002, Lucian Popa 0001, Prithviraj Sen. SystemER: A Human-in-the-loop System for Explainable Entity Resolution |
| 1798 | -- | 1801 | Viet-Phi Huynh, Paolo Papotti. Buckle: Evaluating Fact Checking Algorithms Built on Knowledge Bases |
| 1802 | -- | 1805 | Peng Gao, Xusheng Xiao, Zhichun Li, Kangkook Jee, Fengyuan Xu, Sanjeev R. Kulkarni, Prateek Mittal. A Query System for Efficiently Investigating Complex Attack Behaviors for Enterprise Security |
| 1806 | -- | 1809 | Zhengjie Miao, Qitian Zeng, Chenjie Li, Boris Glavic, Oliver Kennedy, Sudeepa Roy. CAPE: Explaining Outliers by Counterbalancing |
| 1810 | -- | 1813 | Karthik Ramachandra 0002, Kwanghyun Park. BlackMagic: Automatic Inlining of Scalar UDFs into SQL Queries with Froid |
| 1814 | -- | 1817 | Lukas Berg, Tobias Ziegler 0001, Carsten Binnig, Uwe Röhm. ProgressiveDB - Progressive Data Analytics as a Middleware |
| 1818 | -- | 1821 | Kaan Kara, Zeke Wang, Ce Zhang, Gustavo Alonso. doppioDB 2.0: Hardware Techniques for Improved Integration of Machine Learning into Databases |
| 1822 | -- | 1825 | Cícero A. L. Pahins, Behrooz Omidvar Tehrani, Sihem Amer-Yahia, Valérie Siroux, Jean Louis Pépin, Jean-Christian Borel, João Comba. COVIZ: A System for Visual Formation and Exploration of Patient Cohorts |
| 1826 | -- | 1829 | Martin Franke, Ziad Sehili, Erhard Rahm. PRIMAT: A Toolbox for Fast Privacy-preserving Matching |
| 1830 | -- | 1833 | Ryan Marcus, Chi Zhang, Shuai Yu, Geoffrey Kao, Olga Papaemmanouil. NashDB: Fragmentation, Replication, and Provisioning using Economic Methods |
| 1834 | -- | 1837 | Ibrahim Sabek, Mashaal Musleh, Mohamed F. Mokbel. Flash in Action: Scalable Spatial Data Analysis Using Markov Logic Networks |
| 1838 | -- | 1841 | Lucas Kuhring, Zsolt István. I Can't Believe It's Not (Only) Software! Bionic Distributed Storage for Parquet Files |
| 1842 | -- | 1845 | Hyewon Choi, Erkang Zhu, Arsala Bangash, Renée J. Miller. VISE: Vehicle Image Search Engine with Traffic Camera |
| 1846 | -- | 1849 | Stephan Goldberg, Tova Milo, Slava Novgorodov, Kathy Razmadze. WiClean: A System for Fixing Wikipedia Interlinks Using Revision History Patterns |
| 1850 | -- | 1853 | Abhishek Roy, Alekh Jindal, Hiren Patel, Ashit Gosalia, Subru Krishnan, Carlo Curino. SparkCruise: Handsfree Computation Reuse in Spark |
| 1854 | -- | 1857 | Sandeep Singh Sandha, Wellington Cabrera, Mohammed Al-Kateb, Sanjay Nair, Mani B. Srivastava. In-database Distributed Machine Learning: Demonstration using Teradata SQL Engine |
| 1858 | -- | 1861 | Zhao Li, Xia Chen, Xuming Pan, Pengcheng Zou, Yuchen Li, Guoxian Yu. SHOAL: Large-scale Hierarchical Taxonomy via Graph-based Query Coalition in E-commerce |
| 1862 | -- | 1865 | Min Xu, Tianhao Wang 0001, Bolin Ding, Jingren Zhou, Cheng Hong, Zhicong Huang. DPSAaS: Multi-Dimensional Data Sharing and Analytics as Services under Local Differential Privacy |
| 1866 | -- | 1869 | Yang Cao 0011, Yonghui Xiao, Li Xiong 0001, Liquan Bai, Masatoshi Yoshikawa. PriSTE: Protecting Spatiotemporal Event Privacy in Continuous Location-Based Services |
| 1870 | -- | 1873 | Daniel Deutch, Evgeny Marants, Yuval Moskovitch. Datalignment: Ontology Schema Alignment Through Datalog Containment |
| 1874 | -- | 1877 | Congcong Ge, Yunjun Gao, Xiaoye Miao, Lu Chen, Christian S. Jensen, Ziyuan Zhu. IHCS: An Integrated Hybrid Cleaning System |
| 1878 | -- | 1881 | Constantinos Costa, Xiaoyu Ge, Panos K. Chrysanthis. CAPRIO: Graph-based Integration of Indoor and Outdoor Data for Path Discovery |
| 1882 | -- | 1885 | Yingjun Wu, Jia Yu, Yuanyuan Tian, Richard Sidle, Ronald Barber. HERMIT in Action: Succinct Secondary Indexing Mechanism via Correlation Exploration |
| 1886 | -- | 1889 | Julien Loudet, Iulian Sandu Popa, Luc Bouganim. DISPERS: Securing Highly Distributed Queries on Personal Data Management Systems |
| 1890 | -- | 1893 | Adil Akhter, Marios Fragkoulis, Asterios Katsifodimos. Stateful Functions as a Service in Action |
| 1894 | -- | 1897 | Allen Ordookhanians, Xin Li, Supun Nakandala, Arun Kumar. Demonstration of Krypton: Optimized CNN Inference for Occlusion-based Deep CNN Explanations |
| 1898 | -- | 1901 | Zhengjie Miao, Andrew Lee, Sudeepa Roy. LensXPlain: Visualizing and Explaining Contributing Subsets for Aggregate Query Answers |
| 1902 | -- | 1905 | Yi Zhang, Zachary G. Ives. Juneau: Data Lake Management for Jupyter |
| 1906 | -- | 1909 | Sona Hasani, Faezeh Ghaderi, Shohedul Hasan, Saravanan Thirumuruganathan, Abolfazl Asudeh, Nick Koudas, Gautam Das 0001. ApproxML: Efficient Approximate Ad-Hoc ML Models Through Materialization and Reuse |
| 1910 | -- | 1913 | Grégory M. Essertel, Ruby Y. Tahboub, Fei Wang, James M. Decker, Tiark Rompf. Flare & Lantern: Efficiently Swapping Horses Midstream |
| 1914 | -- | 1917 | Ruben Martins, Jia Chen, Yanju Chen, Yu Feng, Isil Dillig. Trinity: An Extensible Synthesis Framework for Data Science |
| 1918 | -- | 1921 | Zhiqi Huang, Ryan McKenna, George Bissias, Gerome Miklau, Michael Hay, Ashwin Machanavajjhala. PSynDB: Accurate and Accessible Private Data Generation |
| 1922 | -- | 1925 | Badrish Chandramouli, Dong Xie 0001, Yinan Li, Donald Kossmann. FishStore: Fast Ingestion and Indexing of Raw Data |
| 1926 | -- | 1929 | Yanlei Diao, Pawel Guzewicz, Ioana Manolescu, Mirjana Mazuran. Spade: A Modular Framework for Analytical Exploration of RDF Graphs |
| 1930 | -- | 1933 | Joseph Vinish D'silva, Florestan De Moor, Bettina Kemme. Making an RDBMS Data Scientist Friendly: Advanced In-database Interactive Analytics with Visualization Support |
| 1934 | -- | 1937 | Khaled Zaouk, Fei Song, Chenghao Lyu, Arnab Sinha, Yanlei Diao, Prashant J. Shenoy. UDAO: A Next-Generation Unified Data Analytics Optimizer |
| 1938 | -- | 1941 | Saehan Jo, Immanuel Trummer, Weicheng Yu, Xuezhi Wang 0002, Cong Yu 0001, Daniel Liu, Niyati Mehta. AggChecker: A Fact-Checking System for Text Summaries of Relational Data Sets |
| 1942 | -- | 1945 | Hanzhang Wang, Phuong Nguyen, Jun Li, Selcuk Kopru, Gene Zhang, Sanjeev Katariya, Sami Ben-Romdhane. GRANO: Interactive Graph-based Root Cause Analysis for Cloud-Native Distributed Data Platform |
| 1946 | -- | 1949 | Davide Frey, Marc X. Makkes, Pierre-Louis Roman, François Taïani, Spyros Voulgaris. Dietcoin: Hardening Bitcoin Transaction Verification Process For Mobile Devices |
| 1950 | -- | 1953 | Samriddhi Singla, Ahmed Eldawy, Rami Alghamdi, Mohamed F. Mokbel. Raptor: Large Scale Analysis of Big Raster and Vector Data |
| 1954 | -- | 1957 | El Kindi Rezig, Lei Cao, Michael Stonebraker, Giovanni Simonini, Wenbo Tao, Samuel Madden, Mourad Ouzzani, Nan Tang 0001, Ahmed K. Elmagarmid. Data Civilizer 2.0: A Holistic Framework for Data Preparation and Analytics |
| 1958 | -- | 1961 | Leonhard F. Spiegelberg, Tim Kraska. Tuplex: Robust, Efficient Analytics When Python Rules |
| 1962 | -- | 1965 | Cédric Renggli, Frances Ann Hubis, Bojan Karlas, Kevin Schawinski, Wentao Wu 0001, Ce Zhang. Ease.ml/ci and Ease.ml/meter in Action: Towards Data Management for Statistical Generalization |
| 1966 | -- | 1969 | Han Xueran, Jun Chen, Jiaheng Lu, Yueguo Chen, Xiaoyong Du. PivotE: Revealing and Visualizing the Underlying Entity Structures for Exploration |
| 1970 | -- | 1973 | Jiaheng Lu, Yuxing Chen, Herodotos Herodotou, Shivnath Babu. Speedup Your Analytics: Automatic Parameter Tuning for Databases and Big Data Systems |
| 1974 | -- | 1977 | Yu Meng, Jiaxin Huang, Jingbo Shang, Jiawei Han 0001. TextCube: Automated Construction and Multidimensional Exploration |
| 1978 | -- | 1981 | Sihem Amer-Yahia, Senjuti Basu Roy. The Ever Evolving Online Labor Market: Overview, Challenges and Opportunities |
| 1982 | -- | 1985 | Ibrahim Sabek, Mohamed F. Mokbel. Machine Learning Meets Big Spatial Data |
| 1986 | -- | 1989 | Fatemeh Nargesian, Erkang Zhu, Renée J. Miller, Ken Q. Pu, Patricia C. Arocena. Data Lake Management: Challenges and Opportunities |
| 1990 | -- | 1993 | Laks V. S. Lakshmanan, Michael Simpson, Saravanan Thirumuruganathan. Combating Fake News: A Data Management and Mining Perspective |
| 1994 | -- | 1997 | Nicolas Anciaux, Luc Bouganim, Philippe Pucheral, Iulian Sandu Popa, Guillaume Scerri. Personal Database Security and Trusted Execution Environments: A Tutorial at the Crossroads |
| 1998 | -- | 2009 | Stephan Kessler, Jens Hoff, Johann Christoph Freytag. SAP HANA goes private - From Privacy Research to Privacy Aware Enterprise Analytics |
| 2010 | -- | 2021 | Guilherme Damasio, Vincent Corvinelli, Parke Godfrey, Piotr Mierzejewski, Alexandar Mihaylov, Jaroslaw Szlichta, Calisto Zuzarte. Guided automated learning for query workload re-optimization |
| 2022 | -- | 2034 | Biswapesh Chattopadhyay, Priyam Dutta, Weiran Liu, Ott Tinn, Andrew McCormick, Aniket Mokashi, Paul Harvey, Hector Gonzalez, David Lomax, Sagar Mittal, Roee Ebenstein, Nikita Mikhaylin, Hung-Ching Lee, Xiaoyan Zhao, Tony Xu, Luis Perez, Farhad Shahmohammadi, Tran Bui, Neil McKay, Selcuk Aya, Vera Lychagina, Brett Elliott. Procella: Unifying serving and analytical data at YouTube |
| 2035 | -- | 2046 | Wei Lu, Zhanhao Zhao, Xiaoyu Wang, Haixiang Li, Zhenmiao Zhang, Zhiyu Shui, Sheng Ye, Anqun Pan, Xiaoyong Du. A Lightweight and Efficient Temporal Database Management System in TDSQL |
| 2047 | -- | 2058 | Reza Sherkat, Colin Florendo, Mihnea Andrei, Rolando Blanco, Adrian Dragusanu, Amit Pathak, Pushkar Khadilkar, Neeraj Kulkarni, Christian Lemke, Sebastian Seifert, Sarika Iyer, Sasikanth Gottapu, Robert Schulze, Chaitanya Gottipati, Nirvik Basak, Yanhong Wang, Vivek Kandiyanallur, Santosh Pendap, Dheren Gala, Rajesh Almeida, Prasanta Ghosh. Native Store Extension for SAP HANA |
| 2059 | -- | 2070 | Chaoqun Zhan, Maomeng Su, Chuangxian Wei, Xiaoqiang Peng, Liang Lin, Sheng Wang, Zhe Chen, Feifei Li 0001, Yue Pan, Fang Zheng, Chengliang Chai. AnalyticDB: Real-time OLAP Database System at Alibaba Cloud |
| 2071 | -- | 2081 | William Schultz, Tess Avitabile, Alyson Cabral. Tunable Consistency in MongoDB |
| 2082 | -- | 2093 | Shaosheng Cao, Xinxing Yang, Cen Chen, Jun Zhou, Xiaolong Li, Yuan Qi 0001. TitAnt: Online Real-time Transaction Fraud Detection in Ant Financial |
| 2094 | -- | 2105 | Rong Zhu, Kun Zhao, Hongxia Yang, Wei Lin, Chang Zhou, Baole Ai, Yong Li, Jingren Zhou. AliGraph: A Comprehensive Graph Neural Network Platform |
| 2106 | -- | 2117 | Zhimin Chen, Yue Wang, Vivek R. Narasayya, Surajit Chaudhuri. Customizable and Scalable Fuzzy Join for Big Data |
| 2118 | -- | 2130 | Guoliang Li, Xuanhe Zhou, Shifu Li, Bo Gao. QTune: A Query-Aware Database Tuning System with Deep Reinforcement Learning |
| 2131 | -- | 2142 | Srikanth Kandula, Kukjin Lee, Surajit Chaudhuri, Marc Friedman. Experiences with Approximating Queries in Microsoft's Production Big-Data Clusters |
| 2143 | -- | 2154 | Panagiotis Antonopoulos, Peter Byrne, Wayne Chen, Cristian Diaconu, Raghavendra Thallam Kodandaramaih, Hanuma Kodavalla, Prashanth Purnananda, Adrian-Leonard Radu, Chaitanya Sreenivas Ravella, Girish Mittur Venkataramanappa. Constant Time Recovery in Azure SQL Database |
| 2155 | -- | 2169 | Yuzhen Huang, Yingjie Shi, Zheng Zhong, Yihui Feng, James Cheng, Jiwei Li, Haochuan Fan, Chao Li, Tao Guan, Jingren Zhou. Yugong: Geo-Distributed Data and Job Placement at Scale |
| 2170 | -- | 2182 | Junjay Tan, Thanaa Ghanem, Matthew Perron, Xiangyao Yu, Michael Stonebraker, David J. DeWitt, Marco Serafini, Ashraf Aboulnaga, Tim Kraska. Choosing A Cloud DBMS: Architectures and Tradeoffs |
| 2183 | -- | 2194 | Jingtian Zhang, Sai Wu, Zeyuan Tan, Gang Chen, Zhushi Cheng, Wei Cao, Yusong Gao, Xiaojie Feng. S3: A Scalable In-memory Skip-List Index for Key-Value Store |
| 2195 | -- | 2205 | Charles Masson, Jee E. Rim, Homin K. Lee. DDSketch: A Fast and Fully-Mergeable Quantile Sketch with Relative-Error Guarantees |
| 2206 | -- | 2217 | Qiang Long, Wei Wang, Jinfu Deng, Song Liu, Wenhao Huang, Fangying Chen, SiFan Liu. A Distributed System for Large-scale n-gram Language Models at Tencent |
| 2218 | -- | 2229 | Kayhan Dursun, Carsten Binnig, Ugur Çetintemel, Garret Swart, Weiwei Gong. A Morsel-Driven Query Execution Engine for Heterogeneous Multi-Cores |
| 2230 | -- | 2241 | Lei Cao, Wenbo Tao, Sungtae An, Jing Jin, Yizhou Yan, Xiaoyu Liu, Wendong Ge, Adam Sah, Leilani Battle, Jimeng Sun, Remco Chang, M. Brandon Westover, Samuel Madden, Michael Stonebraker. Smile: A System to Support Machine Learning on EEG Data at Scale |
| 2242 | -- | 2253 | Alastair Green, Paolo Guagliardo, Leonid Libkin, Tobias Lindaaker, Victor Marsault, Stefan Plantikow, Martin Schuster, Petra Selmer, Hannes Voigt. Updating Graph Databases with Cypher |
| 2254 | -- | 2262 | Asya Kamsky. Adapting TPC-C Benchmark to Measure Performance of Multi-Document Transactions in MongoDB |
| 2263 | -- | 2272 | Feifei Li 0001. Cloud native database systems at Alibaba: Opportunities and Challenges |
| 2273 | -- | 2274 | Alexander Boehm 0002. In-Memory for the masses: Enabling cost-efficient deployments of in-memory data management platforms for business applications |
| 2275 | -- | 2286 | Murtadha Al Hubail, Ali Alsuliman, Michael Blow, Michael J. Carey 0001, Dmitry Lychagin, Ian Maxon, Till Westmann. Couchbase Analytics: NoETL for Scalable NoSQL Data Analysis |
| 2287 | -- | 2289 | Adrian Colyer. Performance in the spotlight |
| 2290 | -- | 2299 | Azza Abouzied, Daniel J. Abadi, Kamil Bajda-Pawlikowski, Avi Silberschatz. Integration of Large-Scale Data Processing Systems and Traditional Parallel Database Technology |
| 2300 | -- | 2307 | Brian F. Cooper, P. P. S. Narayan, Raghu Ramakrishnan, Utkarsh Srivastava, Adam Silberstein, Philip Bohannon, Hans-Arno Jacobsen, Nick Puz, Daniel Weaver, Ramana Yerneni. PNUTS to Sherpa: Lessons from Yahoo!'s Cloud Database |
| 2308 | -- | 0 | Wang Chiew Tan. What I probably did right and what I think I could have done better |
| 2309 | -- | 2322 | Aditya Parameswaran. Enabling Data Science for the Majority |
| 2323 | -- | 2324 | Theodoros Rekatsinas, Sudeepa Roy, Manasi Vartak, Ce Zhang, Neoklis Polyzotis. Opportunities for Data Management Research in the Era of Horizontal AI/ML |