Journal: PVLDB

Volume 14, Issue 9

0 -- 0Rainer Gemulla. Front Matter
1454 -- 1466Jianwen Zhao, Yufei Tao. Minimum Vertex Augmentation
1467 -- 1480Kevin P. Gaffney, Robert K. Claus, Jignesh Patel. Database Isolation By Scheduling
1481 -- 1488Jong-Hyeok Park, Soyee Choi, Gihwan Oh, Sang-Won Lee. SaS: SSD as SQL Database System
1489 -- 1502Rong Zhu, Ziniu Wu, Yuxing Han, Kai Zeng, Andreas Pfadler, Zhengping Qian, Jingren Zhou, Bin Cui 0001. FLAT: Fast, Lightweight and Accurate Method for Cardinality Estimation
1503 -- 1516Tsz Nam Chan, Zhe Li 0011, Leong Hou U, Jianliang Xu, Reynold Cheng. Fast Augmentation Algorithms for Network Kernel Density Visualization
1517 -- 1530Jiawei Wang, Cheng Li, Kai Ma, Jingze Huo, Feng Yan, Xinyu Feng, Yinlong Xu. AutoGR: Automated Geo-Replication with Fast System Performance and Preserved Application Semantics
1531 -- 1543Qing Liu, Xuliang Zhu, Xin Huang 0001, Jianliang Xu. Local Algorithms for Distance-generalized Core Decomposition over Large Dynamic Graphs
1544 -- 1556Lawrence Benson, Hendrik Makait, Tilmann Rabl. Viper: An Efficient Hybrid PMem-DRAM Key-Value Store
1557 -- 1569Sepanta Zeighami, Cyrus Shahabi, John Krumm. Estimating Spread of Contact-Based Contagions in a Population Through Sub-Sampling
1570 -- 1582Herodotos Herodotou, Elena Kakoulli. Trident: Task Scheduling over Tiered Storage Systems in Big Data Platforms
1583 -- 1596Zicun Cong, Lingyang Chu, Yu Yang 0001, Jian Pei. Comprehensible Counterfactual Explanation on Kolmogorov-Smirnov Test
1597 -- 1605Hongkuan Zhou, Ajitesh Srivastava, Hanqing Zeng, Rajgopal Kannan, Viktor K. Prasanna. Accelerating Large Scale Real-Time GNN Inference using Channel Pruning
1606 -- 1612Viktor Leis, Maximilian Kuschewski. Towards Cost-Optimal Query Processing in the Cloud
1613 -- 1625Shufeng Gong, Chao Tian 0001, Qiang Yin, Wenyuan Yu, Yanfeng Zhang, Liang Geng, Song Yu, Ge Yu, Jingren Zhou. Automating Incremental Graph Processing with Flexible Memoization
1626 -- 1639Theo Jepsen, Alberto Lerner, Fernando Pedone, Robert Soulé, Philippe Cudré-Mauroux. In-Network Support for Transaction Triaging
1640 -- 1654Xiaoying Wang, Changbo Qu, Weiyuan Wu, Jiannan Wang, QingQing Zhou. Are We Ready For Learned Cardinality Estimation?
1655 -- 1667Jakub Lemiesz. On the algebra of data sketches
1668 -- 1680Guanhao Hou, Xingguang Chen, Sibo Wang 0001, Zhewei Wei. Massively Parallel Algorithms for Personalized PageRank
1681 -- 1693Maximilian Schleich, Zixuan Geng, Yihong Zhang, Dan Suciu. GeCo: Quality Counterfactual Explanations in Real Time
1694 -- 1702Ricardo Salazar, Felix Neutatz, Ziawasch Abedjan. Automated Feature Engineering for Algorithmic Fairness

Volume 14, Issue 8

0 -- 0Floris Geerts. Front Matter
1254 -- 1261Nan Tang 0001, Ju Fan, Fangyi Li, Jianhong Tu, Xiaoyong Du 0001, Guoliang Li 0001, Samuel Madden, Mourad Ouzzani. RPT: Relational Pre-trained Transformer Is Almost All You Need towards Democratizing Data Preparation
1262 -- 1275Jia Zou 0001, Amitabh Das, Pratik Barhate, Arun Iyengar, Binhang Yuan, Dimitrije Jankov, Chris Jermaine. Lachesis: Automated Partitioning for UDF-Centric Analytics (Revision of Paper 270)
1276 -- 1288Jiacheng Wu, Yong Zhang, Shimin Chen, Yu Chen, Jin Wang, Chunxiao Xing. Updatable Learned Index with Precise Positions
1289 -- 1297Ziquan Fang, Lu Pan, Lu Chen 0001, YunTao Du, Yunjun Gao. MDTP: A Multi-source Deep Traffic Prediction Framework over Spatio-Temporal Trajectory Data
1298 -- 1310Seunghwan Min, Sung Gwan Park, Kunsoo Park, Dora Giammarresi, Giuseppe F. Italiano, Wook-Shin Han. Symmetric Continuous Subgraph Matching with Bidirectional Dynamic Programming
1311 -- 1324Tomoya Suzuki, Kazuhiro Hiwada, Hirotsugu Kajihara, Shintaro Sano, Shuou Nomura, Tatsuo Shiozawa. Approaching DRAM performance by using microsecond-latency flash memory for small-sized random read accesses: a new access method and its graph applications
1325 -- 1337Abdelghny Orogat, Isabelle Liu, Ahmed El-Roby. CBench: Towards Better Evaluation of Question Answering Over Knowledge Graphs
1338 -- 1350Binhang Yuan, Dimitrije Jankov, Jia Zou 0001, Yuxin Tang, Daniel Bourgeois, Chris Jermaine. Tensor Relational Algebra for Distributed Machine Learning System Design
1351 -- 1364Wenfei Fan, Chao Tian 0001, Yanghao Wang, Qiang Yin. Parallel Discrepancy Detection and Incremental Detection
1365 -- 1377Tiantian Liu 0003, Huan Li 0003, Hua Lu 0001, Muhammad Aamir Cheema, Lidan Shou. Towards Crowd-aware Indoor Path Planning
1378 -- 1391Surabhi Gupta, Karthik Ramachandra 0002. Procedural Extensions of SQL: Understanding their usage in the wild
1392 -- 1400Sagar Bharadwaj, Praveen Gupta, Ranjita Bhagwan, Saikat Guha. Discovering Related Data At Scale
1401 -- 1413Stefano Cereda, Stefano Valladares, Paolo Cremonesi, Stefano Doni. CGPTuner: a Contextual Gaussian Process Bandit Approach for the Automatic Tuning of IT Configurations Under Varying Workload Conditions
1414 -- 1426Filippo Schiavio, Daniele Bonetta, Walter Binder. Language-Agnostic Integrated Queries in a Managed Polyglot Runtime
1427 -- 1440Chinmay Kulkarni 0002, Badrish Chandramouli, Ryan Stutsman. Achieving High Throughput and Elasticity in a Larger-than-Memory Store
1441 -- 1453Kai Yao, Lijun Chang. Efficient Size-Bounded Community Search over Large Networks

Volume 14, Issue 7

0 -- 0Arun Kumar, Alon Y. Halevy, Nesime Tatbul. Front Matter
1124 -- 1136Dimitris Tsaras, George Trimponias, Lefteris Ntaflos, Dimitris Papadias. Collective Influence Maximization for Multiple Competing Products with an Awareness-to-Influence Model
1137 -- 1149Yahui Sun, Xiaokui Xiao, Bin Cui 0001, Saman K. Halgamuge, Theodoros Lappas, Jun Luo 0001. Finding Group Steiner Trees in Graphs with both Vertex and Edge Weights
1150 -- 1158Tenindra Abeywickrama, Victor Liang, Kian-Lee Tan. Optimizing Bipartite Matching in Real-World Applications by Incremental Cost Computation
1159 -- 1165Immanuel Trummer. The Case for NLP-Enhanced Database Tuning: Towards Tuning Tools that "Read the Manual"
1166 -- 0Sujaya Maiyya, Faisal Nawab, Divy Agrawal, Amr El Abbadi. Errata for "Unifying Consensus and Atomic Commitment for Effective Cloud Data Management"
1167 -- 1174Zsolt István, Soujanya Ponnapalli, Vijay Chidambaram. Software-Defined Data Protection: Low Overhead Policy Compliance at the Storage Layer is Within Reach!
1175 -- 1187Tianyi Li, Lu Chen 0001, Christian S. Jensen, Torben Bach Pedersen. TRACE: Real-time Compression of Streaming Trajectories in Road Networks
1188 -- 1201Arkaprava Saha, K. Ruben Brokkelkamp, Yllka Velaj, Arijit Khan, Francesco Bonchi. Shortest Paths and Centrality in Uncertain Networks
1202 -- 1214Tongyu Liu, Yinqing Luo, Ju Fan, Nan Tang 0001, Guoliang Li 0001, Xiaoyong Du 0001. Adaptive Data Augmentation for Supervised Learning over Missing Data
1215 -- 1227Fuheng Zhao, Sujaya Maiyya, Ryan Weiner, Divy Agrawal, Amr El Abbadi. KLL±: Approximate Quantile Sketches over Dynamic Datasets
1228 -- 1240Dimitrije Jankov, Binhang Yuan, Shangyu Luo, Chris Jermaine. Distributed Numerical and Machine Learning Computations via Two-Phase Execution of Aggregated Join Trees
1241 -- 1253Dana Van Aken, Dongsheng Yang, Sebastien Brillard, Ari Fiorino, Bohan Zhang, Christian Billian, Andrew Pavlo. An Inquiry into Machine Learning-based Automatic Configuration Tuning Services on Real-World Database Management Systems

Volume 14, Issue 6

0 -- 0Hannes Mühleisen, Thorsten Papenbrock. Front Matter
863 -- 0Supun Nakandala, Yuhao Zhang, Arun Kumar 0001. Errata for "Cerebro: A Data System for Optimized Deep Learning Model Selection"
864 -- 877Lujia Yin, Yiming Zhang 0003, Zhaoning Zhang 0001, Yuxing Peng, Peng Zhao. ParaX: Boosting Deep Learning for Big Data Analytics on Many-Core CPUs
878 -- 889Walter Cai, Philip A. Bernstein, Wentao Wu 0001, Badrish Chandramouli. Optimization of Threshold Functions over Streams
890 -- 902Xuliang Zhu, Xin Huang 0001, Byron Choi, Jiaxin Jiang, Zhaonian Zou, Jianliang Xu. Budget Constrained Interactive Search for Multiple Targets
903 -- 915Yangjun Chen, Hoang Hai Nguyen. On the String Matching with k Differences in DNA Databases
916 -- 928Yasuhiro Fujiwara, Sekitoshi Kanai, Yasutoshi Ida, Atsutoshi Kumagai, Naonori Ueda. Fast Algorithm for Anchor Graph Hashing
929 -- 942Wangda Zhang, Junyoung Kim, Kenneth A. Ross, Eric Sedlar, Lukas Stadler. Adaptive Code Generation for Data-Intensive Analytics
943 -- 956Efthymia Tsamoura, David Carral, Enrico Malizia, Jacopo Urbani. Materializing Knowledge Bases via Trigger Graphs
957 -- 969Jinfei Liu, Jian Lou, Junxu Liu, Li Xiong 0001, Jian Pei, Jimeng Sun. Dealer: An End-to-End Model Marketplace with Differential Privacy
970 -- 983Sajjadur Rahman, Mangesh Bendre, Yuyang Liu, Shichu Zhu, Zhaoyuan Su, Karrie Karahalios, Aditya G. Parameswaran. NOAH: Interactive Spreadsheet Exploration with Dynamic Hierarchical Overviews
984 -- 996Yixing Yang, Yixiang Fang, Maria E. Orlowska, Wenjie Zhang 0001, Xuemin Lin 0001. Efficient Bi-triangle Counting for Large Bipartite Networks
997 -- 1005Sandeep Tata, Navneet Potti, James B. Wendt, Lauro Beltrão Costa, Marc Najork, Beliz Gunel. Glean: Structured Extractions from Templatic Documents
1006 -- 1018Jun Gao, Jiazun Chen, Zhao Li, Ji Zhang. ICS-GNN: Lightweight Interactive Community Search via Graph Neural Network
1019 -- 1032Yuanyuan Sun, Sheng Wang, Huorong Li, Feifei Li 0001. Building Enclave-Native Storage Engines for Practical Encrypted Databases
1033 -- 1039James Thorne, Majid Yazdani, Marzieh Saeidi, Fabrizio Silvestri, Sebastian Riedel 0001, Alon Y. Halevy. From Natural Language Processing to Neural Databases
1040 -- 1052Haibo Wang, Chaoyi Ma, Olufemi O. Odegbile, Shigang Chen, Jih-Kwon Peir. Randomized Error Removal for Online Spread Estimation in Data Streaming
1053 -- 1066Dean De Leo, Peter A. Boncz. Teseo and the Analysis of Structural Dynamic Graphs
1067 -- 1079Tim Gubner, Peter A. Boncz. Charting the Design Space of Query Execution using VOILA
1080 -- 1092Zhiqi Wang, Jin Xue, Zili Shao. Heracles: An Efficient Storage Model And Data Flushing For Performance Monitoring Timeseries
1093 -- 1101Stephen Macke, Aditya G. Parameswaran, Hongpu Gong, Doris Jung Lin Lee, Doris Xin, Andrew Head. Fine-Grained Lineage for Safer Notebook Interactions
1102 -- 1110Anton Tsitsulin, Marina Munkhoeva, Davide Mottin, Panagiotis Karras, Ivan V. Oseledets, Emmanuel Müller. FREDE: Anytime Graph Embeddings
1111 -- 1123Xiaodong Li 0009, Reynold Cheng, Kevin Chen-Chuan Chang, Caihua Shan, Chenhao Ma, Hongtai Cao. On Analyzing Graphs with Motif-Paths

Volume 14, Issue 5

0 -- 0Ashraf Aboulnaga. Front Matter
721 -- 729Shuyuan Yan, Bolin Ding, Wei Guo, Jingren Zhou, Zhewei Wei, Xiaowei Jiang, Sheng Xu. FlashP: An Analytical Pipeline for Real-time Forecasting of Time-Series Relational Data
730 -- 742Chi Thang Duong, Dung Hoang, Hongzhi Yin, Matthias Weidlich, Quoc Viet Hung Nguyen, Karl Aberer. Efficient Streaming Subgraph Isomorphism with Graph Neural Networks
743 -- 756Yi Lu 0010, Xiangyao Yu, Lei Cao 0004, Samuel Madden. Epoch-based Commit and Replication in Distributed OLTP Databases
757 -- 770Zhe Lin, Fan Zhang 0036, Xuemin Lin 0001, Wenjie Zhang 0001, Zhihong Tian. Hierarchical Core Maintenance on Large Dynamic Graphs
771 -- 784Jayashree Mohan, Amar Phanishayee, Ashish Raniwala, Vijay Chidambaram. Analyzing and Mitigating Data Stalls in DNN Training
785 -- 798Daokun Hu, Zhiwen Chen, Jianbing Wu, Jianhua Sun 0002, Hao Chen 0002. Persistent Memory Hash Indexes: An Experimental Evaluation
799 -- 812Cheng Chen 0008, Jun Yang 0022, Mian Lu, Taize Wang, Zhao Zheng, Yuqiang Chen, Wenyuan Dai, Bingsheng He, Weng-Fai Wong, Guoan Wu, Yuping Zhao, Andy Rudoff. Optimizing An In-memory Database System For AI-powered On-line Decision Augmentation Using Persistent Memory
813 -- 821Arif Usta, Akifhan Karakayali, Özgür Ulusoy. DBTagger: Multi-Task Learning for Keyword Mapping in NLIDBs Using Bi-Directional Recurrent Neural Networks
822 -- 834Ritesh Sarkhel, Arnab Nandi 0001. Improving Information Extraction from Visually Rich Documents using Visual Span Representations
835 -- 848Gang Liu, Leying Chen, Shimin Chen. Zen: a High-Throughput Log-Free OLTP Engine for Non-Volatile Main Memory
849 -- 862Tianxi Ji, Pan Li 0001, Emre Yilmaz, Erman Ayday, Yanfang Ye, Jinyuan Sun. Differentially Private Binary- and Matrix-Valued Data Query: An XOR Mechanism

Volume 14, Issue 4

0 -- 0Angela Bonifati, Jorge-Arnulfo Quiané-Ruiz. Front Matter
458 -- 470Long Gong, Ziheng Liu, Liang Liu 0013, Jun Xu 0014, Mitsunori Ogihara, Tong Yang 0003. Space- and Computationally-Efficient Set Reconciliation via Parity Bitmap Sketch (PBS)
471 -- 484Suraj Shetiya, Saravanan Thirumuruganathan, Nick Koudas, Gautam Das 0001. Astrid: Accurate Selectivity Estimation for String Predicates using Deep Learning
485 -- 497Nan Zheng, Zack Ives. Compact, Tamper-Resistant Archival of Fine-Grained Provenance
498 -- 506Ingo Müller 0002, Ghislain Fourny, Stefan Irimescu, Can Berker Cikis, Gustavo Alonso. Rumble: Data Independence for Large Messy Data Sets
507 -- 520Adriane Chapman, Paolo Missier, Giulia Simonelli, Riccardo Torlone. Capturing and querying fine-grained provenance of preprocessing pipelines in data science
521 -- 533Victor A. E. de Farias, Felipe T. Brito, Cheryl Flynn, Javam C. Machado, Subhabrata Majumdar, Divesh Srivastava. Local Dampening: Differential Privacy for Non-numeric Queries via Local Sensitivity
534 -- 546Tianyu Li, Matthew Butrovich, Amadou Ngom, Wan Shen Lim, Wes McKinney, Andrew Pavlo. Mainlining Databases: Supporting Fast Transactional Workloads on Universal Columnar Data File Formats
547 -- 559Shengliang Lu, Bingsheng He, Yuchen Li 0001, Hao Fu. Accelerating Exact Constrained Shortest Paths on GPUs
560 -- 572Songsong Mo, Zhifeng Bao, Ping Zhang, Zhiyong Peng. Towards an Efficient Weighted Random Walk Domination
573 -- 585Guimu Guo, Da Yan 0001, M. Tamer Özsu, Zhe Jiang 0001, Jalal Khalil. Scalable Mining of Maximal Quasi-Cliques: An Algorithm-System Codesign Approach
586 -- 599Eleftherios Kokoris-Kogias, Enis Ceyhun Alp, Linus Gasser, Philipp Jovanovic, Ewa Syta, Bryan Ford. CALYPSO: Private Data Management for Decentralized Ledgers
600 -- 612Brian Hentschel, Stratos Idreos, Kyle Deeds. Stacked Filters: Learning to Filter by Structure
613 -- 625Prithu Banerjee, Laks V. S. Lakshmanan, Wei Chen 0013. Maximizing Social Welfare in a Competitive Diffusion Model
626 -- 639Shashank Gugnani, Arjun Kashyap, Xiaoyi Lu. Understanding the Idiosyncrasies of Real Persistent Memory
640 -- 652Abraham Gale, Amélie Marian. Explaining Ranking Functions
653 -- 667Laxman Dhulipala, Changwan Hong, Julian Shun. ConnectIt: A Framework for Static and Incremental Parallel Graph Connectivity Algorithms
668 -- 681Wissam Maamar Kouadri, Mourad Ouziri, Salima Benbernou, Karima Echihabi, Themis Palpanas, Iheb Ben Amor. Quality of Sentiment Analysis Tools: The Reasons of Inconsistency
682 -- 693Rolando Garcia, Eric Liu, Vikram Sreekanti, Bobby Yan, Anusha Dandamudi, Joseph Gonzalez 0001, Joseph M. Hellerstein, Koushik Sen. Hindsight Logging for Model Training
694 -- 707Lin Jiang, Junqiao Qiu, Zhijia Zhao 0001. Scalable Structural Index Construction for JSON Analytics
708 -- 720Ran Rui, Hao Li, Yi-Cheng Tu. Efficient Join Algorithms For Large Database Tables in a Multi-GPU Environment

Volume 14, Issue 3

0 -- 0Anastasia Ailamaki. Front Matter
241 -- 254Chen Luo, Michael J. Carey 0001. Breaking Down Memory Walls: Adaptive Memory Management in LSM-based Storage Systems
255 -- 267Bojan Karlas, Peng Li, Renzhi Wu, Nezihe Merve Gürel, Xu Chu, Wentao Wu 0001, Ce Zhang 0001. Nearest Neighbor Classifiers over Incomplete Information: From Certain Answers to Certain Predictions
268 -- 280Peter Alvaro, Kyle Kingsbury. Elle: Inferring Isolation Anomalies from Experimental Observations
281 -- 293Martin Kiefer, Ilias Poulakis, Sebastian Breß, Volker Markl. Scotch: Generating FPGA-Accelerators for Sketching at Line Rate
294 -- 306Mourad Khayati, Ines Arous, Zakhar Tymchenko, Philippe Cudré-Mauroux. ORBITS: Online Recovery of Missing Values in Multiple Time Series Streams
307 -- 319Xiang Deng, Huan Sun, Alyssa Lees, You Wu 0001, Cong Yu 0001. TURL: Table Understanding through Representation Learning
320 -- 328Long Guo, Lifeng Hua, Rongfei Jia, Fei Fang, Binqiang Zhao, Bin Cui 0001. EdgeDIPN: a Unified Deep Intent Prediction Network Deployed at the Edge
329 -- 341Yiming Lin, Daokun Jiang, Roberto Yus, Georgios Bouloukakis, Andrew Chio, Sharad Mehrotra, Nalini Venkatasubramanian. LOCATER: Cleaning WiFi Connectivity Datasets for Semantic Localization
342 -- 350Hao Liu 0026, Jindong Han, Yanjie Fu, Jingbo Zhou, Xinjiang Lu, Hui Xiong 0001. Multi-Modal Transportation Recommendation with Unified Route Representation Learning
351 -- 363Yue Wang 0012, Ruiqi Xu, Zonghao Feng, Yulin Che, Lei Chen 0002, Qiong Luo 0001, Rui Mao 0001. DISK: A Distributed Framework for Single-Source SimRank with Accuracy Guarantee
364 -- 377Diego Didona, Nikolas Ioannou, Radu Stoica, Kornilios Kourtis. Toward a Better Understanding and Evaluation of Tree Structures on Flash SSDs
378 -- 390Jianyu Yang, Tianhao Wang 0001, Ninghui Li, Xiang Cheng 0003, Sen Su. Answering Multi-Dimensional Range Queries under Local Differential Privacy
391 -- 403Dimitris Palyvos-Giannas, Bastian Havers, Marina Papatriantafilou, Vincenzo Gulisano. Ananke: A Streaming Framework for Live Forward Provenance
404 -- 417Kartik Lakhotia, Rajgopal Kannan, Viktor K. Prasanna, César A. F. De Rose. RECEIPT: REfine CoarsE-grained IndePendent Tasks for Parallel Tip decomposition of Bipartite Graphs
418 -- 430Shaleen Deep, Anja Gruenheid, Paraschos Koutris, Jeffrey F. Naughton, Stratis Viglas. Comprehensive and Efficient Workload Compression
431 -- 444Yongjun He, Jiacheng Lu, Tianzheng Wang 0001. CoroBase: Coroutine-Oriented Main-Memory Database Engine
445 -- 457Jaclyn Smith, Michael Benedikt, Milos Nikolic, Amir Shaikhha. Scalable Querying of Nested Data

Volume 14, Issue 2

0 -- 0Yufei Tao. Front Matter
74 -- 86Jialin Ding, Vikram Nathan, Mohammad Alizadeh, Tim Kraska. Tsunami: A Learned Multi-dimensional Index for Correlated Data and Skewed Workloads
87 -- 100Daniel Kang, Ankit Mathur, Teja Veeramacheneni, Peter Bailis, Matei Zaharia. Jointly Optimizing Preprocessing and Inference for DNN-based Visual Analytics
101 -- 113Prashanth Menon, Amadou Ngom, Todd C. Mowry, Andrew Pavlo, Lin Ma 0006. Permutable Compiled Queries: Dynamically Adapting Compiled Queries without Recompiling
114 -- 127Seungwon Min, Vikram Sharma Mailthody, Zaid Qureshi, Jinjun Xiong, Eiman Ebrahimi, Wen-mei Hwu. EMOGI: Efficient Memory-access for Out-of-memory Graph-traversal In GPUs
128 -- 140Yinda Zhang, Jinyang Li, Yutian Lei, Tong Yang 0003, Zhetao Li, Gong Zhang, Bin Cui 0001. On-Off Sketch: A Fast and Accurate Sketch on Persistence
141 -- 153Luan Tran, Minyoung Mun, Cyrus Shahabi. Real-Time Distance-Based Outlier Detection in Data Streams
154 -- 162Olga Poppe, Tayo Amuneke, Dalitso Banda, Aritra De, Ari Green, Manon Knoertzer, Ehi Nosakhare, Karthik Rajendran, Deepak Shankargouda, Meina Wang, Alan Au, Carlo Curino, Qun Guo, Alekh Jindal, Ajay Kalhan, Morgan Oslake, Sonia Parchani, Vijay Ramani, Raj Sellappan, Saikat Sen, Sheetal Shrotri, Soundararajan Srinivasan, Ping Xia, Shize Xu, Alicia Yang, Yiwen Zhu. Seagull: An Infrastructure for Load Prediction and Optimized Resource Allocation
163 -- 175Sheng Wang, Yuan Sun, Zhifeng Bao. On the Efficiency of K-Means Clustering: Evaluation, Optimization, and Algorithm Selection
176 -- 188Shixuan Sun, Xibo Sun, Yulin Che, Qiong Luo 0001, Bingsheng He. RapidMatch: A Holistic Approach to Subgraph Query Processing
189 -- 201Yu Xia, Xiangyao Yu, Andrew Pavlo, Srinivas Devadas. Taurus: Lightweight Parallel Logging for In-Memory Database Management Systems
202 -- 214Johns Paul, Bingsheng He, Shengliang Lu, Chiew Tong Lau. Improving Execution Efficiency of Just-in-time Compilation based Query Processing on GPUs
215 -- 227Shuang Wang, Hakan Ferhatosmanoglu. PPQ-Trajectory: Spatio-temporal Quantization for Querying in Large Trajectory Repositories
228 -- 240Xiao Hu, Shouzhuo Sun, Shweta Patwa, Debmalya Panigrahi, Sudeepa Roy. Aggregated Deletion Propagation for Counting Conjunctive Query Answers

Volume 14, Issue 13

0 -- 0Yi Chen. Front Matter
3253 -- 3266Jian Liu, Kefei Wang, Feng Chen 0005. TSCache: An Efficient Flash-based Caching Scheme for Time-series Data Workloads
3267 -- 3280Huayi Wang, Jingfan Meng, Long Gong, Jun Xu 0014, Mitsunori Ogihara. MP-RW-LSH: An Efficient Multi-Probe LSH Solution to ANNS-L_1
3281 -- 3294Theofilos Mailis, Yannis Kotidis, Stamatis Christoforidis, Evgeny Kharlamov, Yannis E. Ioannidis. View Selection over Knowledge Graphs in Triple Stores
3295 -- 3307Dongjie Li, Siyi Lv, Yanyu Huang, Yijing Liu, Tong Li, Zheli Liu, Liang Guo. Frequency-Hiding Order-Preserving Encryption with Small Client Storage
3308 -- 3321Dimitrios Koutsoukos, Ingo Müller 0002, Renato Marroquin, Ana Klimovic, Gustavo Alonso. Modularis: Modular Relational Analytics over Heterogeneous Distributed Platforms
3322 -- 3334Yunkai Lou, Chaokun Wang, Tiankai Gu, Hao Feng, Jun Chen, Jeffrey Xu Yu. Time-Topology Analysis
3335 -- 3347Daniel Bernau, Günther Eibl, Philip-William Grassal, Hannah Keller, Florian Kerschbaum. Quantifying identifiability to choose and audit epsilon in differentially private deep learning
3348 -- 3361Rodrigo N. Laigner, Yongluan Zhou, Marcos Antonio Vaz Salles, Yijian Liu, Marcos Kalinowski. Data Management in Microservices: State of the Practice, Challenges, and Research Directions
3362 -- 3375H. M. Sajjad Hossain, Marc T. Friedman, Hiren Patel, Shi Qiao 0001, Soundar Srinivasan, Markus Weimer, Remmelt Ammerlaan, Lucas Rosenblatt, Gilbert Antonius, Peter Orenberg, Vijay Ramani, Abhishek Roy, Irene Shaffer, Alekh Jindal. PerfGuard: Deploying ML-for-Systems without Performance Regressions, Almost!
3376 -- 3388Bailu Ding, Surajit Chaudhuri, Johannes Gehrke, Vivek R. Narasayya. DSB: A Decision Support Benchmark for Workload-Driven and Traditional Database Systems
3389 -- 3401Daniel Hernández 0002, Luis Galárraga, Katja Hose. Computing How-Provenance for SPARQL Queries via Query Rewriting
3402 -- 3414Junxiong Wang, Immanuel Trummer, Debabrota Basu. UDO: Universal Database Optimization using Reinforcement Learning
3415 -- 0Anja Feldmann. Internet Traffic Analysis at Scale
3416 -- 0Danai Koutra. The Power of Summarization in Graph Mining and Learning: Smaller Data, Faster Methods, More Interpretability
3417 -- 0Nigam Shah. Summarizing Patients Like Mine via an On-demand Consultation Service
3418 -- 0Joaquin Vanschoren. Towards Scalable Online Machine Learning Collaborations with OpenML
3419 -- 0Manasi Vartak. From ML Models to Intelligent Applications: The Rise of MLOps
3420 -- 0Matei Zaharia. Designing Production-Friendly Machine Learning

Volume 14, Issue 12

0 -- 0Xin Luna Dong, Felix Naumann. Front Matter
2655 -- 2658Tsz Nam Chan, Pak Lon Ip, Leong Hou U, Weng Hou Tong, Shivansh Mittal, Ye Li, Reynold Cheng. KDV-Explorer: A Near Real-Time Kernel Density Visualization System for Spatial Analysis
2659 -- 2662Zhebin Zhang, Dajie Dong, Yuhang Ma, Yilong Ying, Dawei Jiang, Ke Chen 0005, Lidan Shou, Gang Chen 0001. Refiner: A Reliable Incentive-Driven Federated Learning System Powered by Blockchain
2663 -- 2666Valter Uotila, Jiaheng Lu, Dieter Gawlick, Zhen Hua Liu, Souripriya Das, Gregory Pogossiants. MultiCategory: Multi-model Query Processing Meets Category Theory and Functional Programming
2667 -- 2670Qichen Wang, Chaoqi Zhang, Danish Alsayed, Ke Yi 0001, Bin Wu 0003, Feifei Li 0001, Chaoqun Zhan. Cquirrel: Continuous Query Processing over Acyclic Relational Schemas
2671 -- 2674Yuetian Mao, Shuai Yuan, Nan Cui, Tianjiao Du, Beijun Shen, Yuting Chen. DeFiHap: Detecting and Fixing HiveQL Anti-Patterns
2675 -- 2678Ahmed Helal, Mossad Helali, Khaled Ammar, Essam Mansour. A Demonstration of KGLac: A Data Discovery and Enrichment Platform for Data Science
2679 -- 2682Pierre Faure-Giovagnoli, Marie Le Guilly, Vasile-Marian Scuturici, Jean-Marc Petit. Assessing the Existence of a Model in your Data with ADESIT
2683 -- 2686Yinzhao Yan, Raymond Chi-Wing Wong. Path Advisor: A Multi-Functional Campus Map Tool for Shortest Path
2687 -- 2690Liangde Li, Supun Chathuranga Nakandala, Arun Kumar 0001. Intermittent Human-in-the-Loop Model Selection using Cerebro: A Demonstration
2691 -- 2694Henning Funke, Jens Teubner. Low-Latency Compilation of SQL Queries to Machine Code
2695 -- 2698Sven Groppe, Rico Klinckenberg, Benjamin Warnke. Sound of Databases: Sonification of a Semantic Web Database Engine
2699 -- 2702Zihao Chen, Zhizhen Xu, Chen Xu 0001, Juan Soto 0001, Volker Markl, Weining Qian, Aoying Zhou. HyMAC: A Hybrid Matrix Computation System
2703 -- 2706Jingbo Xu, Zhanning Bai, Wenfei Fan, Longbin Lai, Xue Li, Zhao Li, Zhengping Qian, Lei Wang, Yanyan Wang, Wenyuan Yu, Jingren Zhou. GraphScope: A One-Stop Large Graph Processing System
2707 -- 2710Alexander Renz-Wieland, Tobias Drobisch, Zoi Kaoudi, Rainer Gemulla, Volker Markl. Just Move It! Dynamic Parameter Allocation in Action
2711 -- 2714Abdelghny Orogat, Ahmed El-Roby. CBench: Demonstrating Comprehensive Evaluation of Question Answering Systems over Knowledge Graphs Through Deep Analysis of Benchmarks
2715 -- 2718Lucas Woltmann, Dominik Olwig, Claudio Hartmann, Dirk Habich, Wolfgang Lehner. PostCENN: PostgreSQL with Machine Learning Models for Cardinality Estimation
2719 -- 2722Jinyang Li, Yuval Moskovitch, H. V. Jagadish. DENOUNCER: Detection of Unfairness in Classifiers
2723 -- 2726Sofiane Abbar, Rade Stanojevic, Mashaal Musleh, Mohamed M. Elshrif, Mohamed F. Mokbel. A Demonstration of QARTA: An ML-based System for Accurate Map Services
2727 -- 2730Jaclyn Smith, Michael Benedikt, Brandon Moore, Milos Nikolic. TraNCE: Transforming Nested Collections Efficiently
2731 -- 2734Ralf Diestelkämper, Seokki Lee, Boris Glavic, Melanie Herschel. Debugging Missing Answers for Spark Queries over Nested Data with Breadcrumb
2735 -- 2738Renzhi Wu, Prem Sakala, Peng Li, Xu Chu, Yeye He. Demonstration of Panda: A Weakly Supervised Entity Matching System
2739 -- 2742Jiabin Liu, Fu Zhu, Chengliang Chai, Yuyu Luo, Nan Tang 0001. Automatic Data Acquisition for Deep Learning
2743 -- 2746Xuanhe Zhou, Lianyuan Jin, Ji Sun, Xinyang Zhao, Xiang Yu, Shifu Li, Tianqing Wang, Kun Li, Luyang Liu. DBMind: A Self-Driving Platform in openGauss
2747 -- 2750Jinfei Liu, Qiongqiong Lin, Jiayao Zhang, Kui Ren, Jian Lou 0001, Junxu Liu, Li Xiong 0001, Jian Pei, Jimeng Sun. Demonstration of Dealer: An End-to-End Model Marketplace with Differential Privacy
2751 -- 2754Tianyu Mu, Hongzhi Wang 0001, Shenghe Zheng, Shaoqing Zhang, Cheng Liang, Haoyun Tang. Assassin: an Automatic claSSificAtion system baSed on algorithm SelectIoN
2755 -- 2758Lei Cao 0004, Dongqing Xiao, Yizhou Yan, Samuel Madden, Guoliang Li 0001. ATLANTIC: Making Database Differentially Private and Faster with Accuracy Guarantee
2759 -- 2762Anders Carlsson, Anze Xie, Jason Mohoney, Roger Waleffe, Shanan Peters, Theodoros Rekatsinas, Shivaram Venkataraman. Demonstration of Marius: Graph Embeddings with a Single Machine
2763 -- 2766Heiko Mueller 0001, Sonia Castelo, Munaf A. Qazi, Juliana Freire. From Papers to Practice: The openclean Open-Source Data Cleaning Library
2767 -- 2770Vanessa Lin, Yongming Ge, Maureen Daum, Alvin Cheung, Brandon Haynes, Magdalena Balazinska. Demonstration of Apperception: A Database Management System for Geospatial Video Data
2771 -- 2774Mary Karatzoglidi, Paraskevas Kerasiotis, Verena Kantere. Automated energy consumption forecasting with EnForce
2775 -- 2778Myung-Hwan Jang, Yong-Yeon Jo, Sang-Wook Kim. RealGraph-Web: A Graph Analysis Platform on the Web
2779 -- 2782Arthita Ghosh, Arpit Narechania, Visweswara Sai Prashanth Dintyala, Su Timurturkan, Joy Arulraj, Deven Bansod. Interactive Demonstration of SQLCHECK
2783 -- 2786Yiming Lin, Pramod P. Khargonekar, Sharad Mehrotra, Nalini Venkatasubramanian. T-Cove: An exposure tracing System based on Cleaning Wi-Fi Events on Organizational Premises
2787 -- 2790Paul Y. Wang, Sainyam Galhotra, Romila Pradhan, Babak Salimi. Demonstration of Generating Explanations for Black-Box Algorithms Using Lewis
2791 -- 2794Sonia Castelo, Rémi Rampin, Aécio S. R. Santos, Aline Bessa, Fernando Chirigati, Juliana Freire. Auctus: A Dataset Search Engine for Data Discovery and Augmentation
2795 -- 2798Mohammed Suhail Rehman, Silu Huang, Aaron J. Elmore. A Demonstration of Relic: A System for REtrospective Lineage InferenCe of Data Workflows
2799 -- 2802Zhihao Chen, Haizhen Zhuo, Quanqing Xu, Xiaodong Qi, Chengyu Zhu, Zhao Zhang 0009, Cheqing Jin, Aoying Zhou, Ying Yan, Hui Zhang. SChain: A Scalable Consortium Blockchain Exploiting Intra- and Inter-Block Concurrency
2803 -- 2806Chrysovalantis Anastasiou, Constantinos Costa, Panos K. Chrysanthis, Cyrus Shahabi. EPICGen: An Experimental Platform for Indoor Congestion Generation and Forecasting
2807 -- 2810Hiba Arnaout, Simon Razniewski, Gerhard Weikum, Jeff Z. Pan. Wikinegata: a Knowledge Base with Interesting Negative Statements
2811 -- 2814Liang Guo, Jinwei Zhu, Jiayang Liu, Kun Cheng. Full Encryption: An end to end encryption mechanism in GaussDB
2815 -- 2818Antonis Mandamadiotis, Georgia Koutrika, Stavroula Eleftherakis, Apostolis Glenis, Dimitrios Skoutas, Yannis Stavrakas. DatAgent: The Imminent Age of Intelligent Data Assistants
2819 -- 2822El Kindi Rezig, Anshul Bhandari, Anna Fariha, Benjamin Price, Allan Vanterpool, Vijay Gadepally, Michael Stonebraker. DICE: Data Discovery by Example
2823 -- 2826Felix Martin Schuhknecht, Aaron Priesterroth, Justus Henneberg, Reza Salkhordeh. AnyOLAP: Analytical Processing of Arbitrary Data-Intensive Applications without ETL
2827 -- 2830Vincent Jacob, Fei Song, Arnaud Stiegler, Bijan Rad, Yanlei Diao, Nesime Tatbul. A Demonstration of the Exathlon Benchmarking Platform for Explainable Anomaly Detection
2831 -- 2834Amir Shaikhha, Maximilian Schleich, Dan Olteanu. An Intermediate Representation for Hybrid Database and Machine Learning Workloads
2835 -- 2838Eliana Pastor, Andrew Gavgavian, Elena Baralis, Luca de Alfaro. How Divergent Is Your Data?
2839 -- 2842Auday Berro, Mohammad-ali Yaghub Zade Fard, Marcos Baez, Boualem Benatallah, Khalid Benabdeslem. An Extensible and Reusable Pipeline for Automated Utterance Paraphrases
2843 -- 2846Kaustubh Beedkar, David Brekardin, Jorge-Arnulfo Quiané-Ruiz, Volker Markl. Compliant Geo-distributed Data Processing in Action
2847 -- 2850Piyush Yadav, Dhaval Salwala, Felipe Pontes, Praneet Dhingra, Edward Curry. Query-Driven Video Event Processing for the Internet of Multimedia Things
2851 -- 2854Nikolaos Koutroumanis, Kousathanas Nikolaos, Christos Doulkeridis, Akrivi Vlachou. A Demonstration of NoDA: Unified Access to NoSQL Stores
2855 -- 2858Rathijit Sen, Abhishek Roy, Alekh Jindal, Rui Fang, Jeff Zheng, Xiaolei Liu, Ruiping Li. AutoExecutor: Predictive Parallelism for Spark SQL Queries
2859 -- 2862Jiaxiang Liu, Karl Knopf, Yiqing Tan, Bolin Ding, Xi He 0001. Catch a Blowfish Alive: A Demonstration of Policy-Aware Differential Privacy for Interactive Data Exploration
2863 -- 2866Paul Ouellette, Aidan Sciortino, Fatemeh Nargesian, Bahar Ghadiri Bashardoost, Erkang Zhu, Ken Pu, Renée J. Miller. RONIN: Data Lake Exploration
2867 -- 2870Paul Boniol, John Paparrizos, Themis Palpanas, Michael J. Franklin. SAND in Action: Subsequence Anomaly Detection for Streams
2871 -- 2874Christos Koutras, Kyriakos Psarakis, George Siachamis, Andra Ionescu, Marios Fragkoulis, Angela Bonifati, Asterios Katsifodimos. Valentine in Action: Matching Tabular Data at Scale
2875 -- 2878Sheng Guan, Hanchao Ma, Sutanay Choudhury, Yinghui Wu. GEDet: Detecting Erroneous Nodes with A Few Examples
2879 -- 2892Wenfei Fan, Tao He, Longbin Lai, Xue Li, Yong Li, Zhao Li, Zhengping Qian, Chao Tian 0001, Lei Wang, Jingbo Xu, Youyang Yao, Qiang Yin, Wenyuan Yu, Kai Zeng, Kun Zhao, Jingren Zhou, Diwen Zhu, Rong Zhu. GraphScope: A Unified Engine For Big Graph Processing
2893 -- 2905Zeyuan Shang, Emanuel Zgraggen, Benedetto Buratti, Philipp Eichmann, Navid Karimeddiny, Charlie Meyer, Wesley Runnels, Tim Kraska. Davos: A System for Interactive Data-Driven Decision Making
2906 -- 2917Mengbai Xiao, An Qin 0001, Yongwei Wu, Xinjie Huang, Xiaodong Zhang 0001. Mixer: Efficiently Understanding and Retrieving Visual Content at Web-Scale
2918 -- 2931David Justo, Shaoqing Yi, Lukas Stadler, Nadia Polikarpova, Arun Kumar. Towards A Polyglot Framework for Factorized ML
2932 -- 2944Niv Dayan, Yuval Rochman, Iddo Naiss, Shmuel Dashevsky, Noam Rabinovich, Edward Bortnikov, Igal Maly, Ofer Frishman, Itai Ben Zion, Avraham, Moshe Twitto, Uri Beitler, Evgeni Ginzburg, Mark Mokryn. The End of Moore's Law and the Rise of The Data Processor
2945 -- 2958Derek Gordon Murray, Jiri Simsa, Ana Klimovic, Ihor Indyk. tf.data: A Machine Learning Data Processing Framework
2959 -- 2971Mohamed Y. Eltabakh, Anantha Subramanian, Awny Alomari, Mohammed Al-Kateb, Sanjay Nair, Mahbub Hasan, Wellington Cabrera, Charles Zhang, Amit Kishore, Snigdha Prasad. Not Black-Box Anymore! Enabling Analytics-Aware Optimizations in Teradata Vantage
2972 -- 2985Yingda Chen, Jiamang Wang, Yifeng Lu, Ying Han, Zhiqiang Lv, Xuebin Min, Hua Cai, Wei Zhang, Haochuan Fan, Chao Li, Tao Guan, Wei Lin 0016, Yangqing Jia, Jingren Zhou. Fangorn: Adaptive Execution Framework for Heterogeneous Workloads on Shared Clusters
2986 -- 2998Ankur Agiwal, Kevin Lai, Gokul Nath Babu Manoharan, Indrajit Roy, Jagan Sankaranarayanan, Hao Zhang, Tao Zou, Jim Chen, Min Chen, Ming Dai, Thanh Do, Haoyu Gao, Haoyan Geng, Raman Grover, Bo Huang, Yanlai Huang, Adam Li, Jianyi Liang, Tao Lin, Li Liu, Yao Liu, Xi Mao, Maya Meng, Prashant Mishra, Jay Patel, Rajesh Sr, Vijayshankar Raman, Sourashis Roy, Mayank Singh Shishodia, Tianhang Sun, Justin Tang, Jun Tatemura, Sagar Trehan, Ramkumar Vadali, Prasanna Venkatasubramanian, Joey Zhang, Kefei Zhang, Yupu Zhang, Zeleng Zhuang, Goetz Graefe, Divy Agrawal, Jeff Naughton, Sujata Kosalge, Hakan Hacigumus. Napa: Powering Scalable Data Warehousing with Robust Query Performance at Google
2999 -- 3013Rubao Lee, Minghong Zhou, Chi Li, Shenggang Hu, Jianping Teng, Dongyang Li, Xiaodong Zhang. The Art of Balance: A RateupDB Experience of Building a CPU/GPU Hybrid Database Product
3014 -- 3027Audrey Cheng, Xiao Shi, Lu Pan, Anthony Simpson, Neil Wheaton, Shilpa Lawande, Nathan Bronson, Peter Bailis, Natacha Crooks, Ion Stoica. RAMP-TAO: Layering Atomic Transactions on Facebook's Online TAO Data Store
3028 -- 3041Guoliang Li 0001, Xuanhe Zhou, Ji Sun, Xiang Yu, Yue Han, Lianyuan Jin, Wenbo Li, Tianqing Wang, Shifu Li. openGauss: An Autonomous Database System
3043 -- 3055Rahul Potharaju, Terry Kim, Eunjin Song, Wentao Wu 0001, Lev Novik, Apoorve Dave, Pouria Pirzadeh, Andrew Fogarty, Gurleen Dhody, Jiying Li, Vidip Acharya, Sinduja Ramanujam, Nicolas Bruno, César A. Galindo-Legaria, Vivek R. Narasayya, Surajit Chaudhuri, Anil Nori, Tomas Talius, Raghu Ramakrishnan. Hyperspace: The Indexing Subsystem of Azure Synapse
3056 -- 3068Bolong Zheng, Lei Bi, Juan Cao, Hua Chai, Jun Fang, Lu Chen 0001, Yunjun Gao, Xiaofang Zhou 0001, Christian S. Jensen. SpeakNav: Voice-based Route Description Language Understanding for Template Driven Path Search
3069 -- 3082Ana Sofia Gomes, João Oliveirinha, Pedro Cardoso, Pedro Bizarro. Railgun: managing large streaming windows under MAD requirements
3083 -- 3095Pavan Edara, Mosha Pasumansky. Big Metadata: When Metadata is Big Data
3096 -- 3109Joshua F. Stoddard, Adam Mustafa, Naveen Goela. Tanium Reveal: A Federated Search Engine for Querying Unstructured File Data on Large Enterprise Networks
3110 -- 3121Can Gencer, Marko Topolnik, Viliam Durina, Emin Demirci, Ensar B. Kahveci, Ali Gürbüz, József Bartók, Grzegorz Gierlach, Frantisek Hartman, Ufuk Yilmaz, Ondrej Lukás, Mehmet Dogan, Mohamed Mandouh, Marios Fragkoulis, Asterios Katsifodimos. Hazelcast Jet: Low-latency Stream Processing at the 99.99th Percentile
3122 -- 3134Abhishek Roy, Alekh Jindal, Priyanka Gomatam, Xiating Ouyang, Ashit Gosalia, Nishkam Ravi, Swinky Mann, Prakhar Jain. SparkCruise: Workload Optimization in Managed Spark Clusters at Microsoft
3135 -- 3147Edmon Begoli, Tyler Akidau, Slava Chernyak, Fabian Hueske, Kathryn Knight, Kenneth Knowles, Daniel Mills, Dan Sotolongo. Watermarks in Stream Processing Systems: Semantics and Comparative Analysis of Apache Flink and Google Cloud Dataflow
3148 -- 3161Conor Power, Hiren Patel, Alekh Jindal, Jyoti Leeka, Bob Jenkins, Michael Rys, Ed Triou, Dexin Zhu, Lucky Katahanas, Chakrapani Bhat Talapady, Josh Rowe, Fan Zhang, Rich Draves, Ivan Santa, Amrish Kumar. The Cosmos Big Data Platform at Microsoft: Over a Decade of Progress and a Decade to Look Forward
3162 -- 3163Ippokratis Pandis. The evolution of Amazon Redshift
3175 -- 3177Simon Razniewski, Hiba Arnaout, Shrestha Ghosh, Fabian M. Suchanek. On the Limits of Machine Knowledge: Completeness, Recall and Negation in Web-scale Knowledge Bases
3178 -- 3181Laurel J. Orr, Atindriyo Sanyal, Xiao Ling, Karan Goel, Megan Leszczynski. Managing ML Pipelines: Feature Stores and the Coming Wave of Embedding Ecosystems
3182 -- 3185Yuliang Li 0001, Xiaolan Wang 0001, Zhengjie Miao, Wang Chiew Tan. Data Augmentation for ML-driven Data Preparation and Integration
3186 -- 3189Ramon Antonio Rodriges Zalipynis. Array DBMS: Past, Present, and (Near) Future
3190 -- 3193Guoliang Li 0001, Xuanhe Zhou, Lei Cao 0004. Machine Learning for Databases
3194 -- 3197Saeed Kargar, Faisal Nawab. Extending the Lifetime of NVM: Challenges and Opportunities
3198 -- 3201Karima Echihabi, Themis Palpanas, Kostas Zoumpatianos. New Trends in High-D Vector Similarity Search: AI-driven, Progressive, and Distributed
3202 -- 3205Alekh Jindal, Matteo Interlandi. Machine Learning for Cloud Data Systems: the Promise, the Progress, and the Path Forward
3206 -- 0Susan Davidson. It's not just Cookies and Tea
3207 -- 3210Thomas Neumann 0001. Evolution of a Compiling Query Engine
3211 -- 3221Andy Pavlo. Make Your Database System Dream of Electric Sheep: Towards Self-Driving Operation
3222 -- 3232Tim Kraska. Towards instance-optimized data systems
3233 -- 3238Gerhard Weikum. Knowledge Graphs 2021: A Data Odyssey
3239 -- 3240Zachary Ives. The future of data(base) education: Is the "cow book" dead?
3240 -- 3252Luis Remis, Chaunte W. Lacewell. Using VDMS to Index and Search 100M Images

Volume 14, Issue 11

0 -- 0Stratos Idreos, Zack Ives. Front Matter
1922 -- 1936Maciej Besta, Zur Vonarburg-Shmaria, Yannick Schaffner, Leonardo Schwarz, Grzegorz Kwasniewski, Lukas Gianinazzi, Jakub Beránek, Kacper Janda, Tobias Holenstein, Sebastian Leisinger, Peter Tatkowski, Esref Özdemir, Adrian Balla, Marcin Copik, Philipp Lindenberger, Marek Konieczny, Onur Mutlu, Torsten Hoefler. GraphMineSuite: Enabling High-Performance and Programmable Graph Mining Algorithms with Set Algebra
1937 -- 1949Keita Takenouchi, Takashi Ishio, Joji Okada, Yuji Sakata. PATSQL: Efficient Synthesis of SQL Queries from Example Tables with Quick Inference of Projected Columns
1950 -- 1963Jie Liu, Wenqian Dong, Dong Li 0001, QingQing Zhou. Fauce: Fast and Accurate Deep Ensembles with Uncertainty for Cardinality Estimation
1964 -- 1978Mengzhao Wang, Xiaoliang Xu, Qiang Yue, Yuxiang Wang 0001. A Comprehensive Survey and Experimental Comparison of Graph-Based Approximate Nearest Neighbor Search
1979 -- 1991Zifeng Yuan, Huey-Eng Chua, Sourav S. Bhowmick, Zekun Ye, Wook-Shin Han, Byron Choi. Towards Plug-and-Play Visual Graph Query Interfaces: Data-driven Canned Pattern Selection for Large Networks
1992 -- 2005Shixuan Sun, Yuhang Chen, Shengliang Lu, Bingsheng He, Yuchen Li 0001. ThunderRW: An In-Memory Graph Random Walk Engine
2006 -- 2018Zheng Dong, Xin Huang, Guorui Yuan, Hengshu Zhu, Hui Xiong 0001. Butterfly-Core Community Search over Labeled Graphs
2019 -- 2032Parimarjan Negi, Ryan C. Marcus, Andreas Kipf, Hongzi Mao, Nesime Tatbul, Tim Kraska, Mohammad Alizadeh. Flow-Loss: Learning Cardinality Estimates That Matter
2033 -- 2045Michael Yu, Dong Wen, Lu Qin, Ying Zhang 0001, Wenjie Zhang 0001, Xuemin Lin 0001. On Querying Historical K-Cores
2046 -- 2058Graham Cormode, Samuel Maddock, Carsten Maple. Frequency Estimation under Local Differential Privacy
2059 -- 2072Fatjon Zogaj, José Pablo Cambronero, Martin Rinard, Jürgen Cito. Doing More with Less: Characterizing Dataset Downsampling for AutoML
2073 -- 2086Yifan Li, Xiaohui Yu 0001, Nick Koudas. LES3: Learning-based exact set similarity search
2087 -- 2100Seungwon Min, Kun Wu, Sitao Huang, Mert Hidayetoglu, Jinjun Xiong, Eiman Ebrahimi, Deming Chen, Wen-mei W. Hwu. Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture
2101 -- 2113Yifei Yang, Matt Youill, Matthew E. Woicik, Yizhou Liu, Xiangyao Yu, Marco Serafini, Ashraf Aboulnaga, Michael Stonebraker. FlexPushdownDB: Hybrid Pushdown and Caching in a Cloud DBMS
2114 -- 2126Zhiwei Chen, Shaoxu Song, Ziheng Wei, Jingyun Fang, Jiang Long. Approximating Median Absolute Deviation with Bounded Error
2127 -- 2140Mengxuan Zhang, Lei Li 0003, Xiaofang Zhou 0001. An Experimental Evaluation and Guideline for Path Finding in Weighted Dynamic Network
2141 -- 2153Brecht Vandevoort, Bas Ketsman, Christoph Koch 0001, Frank Neven. Robustness against Read Committed for Transaction Templates
2154 -- 2166Huayi Zhang, Lei Cao 0004, Samuel Madden, Elke A. Rundensteiner. LANCET: Labeling Complex Data at Scale
2167 -- 2176Yang Li, Yu Shen, Wentao Zhang, Jiawei Jiang, Yaliang Li, Bolin Ding, Jingren Zhou, Zhi Yang, Wentao Wu 0001, Ce Zhang 0001, Bin Cui 0001. VolcanoML: Speeding up End-to-End AutoML via Scalable Search Space Decomposition
2177 -- 2189Peng Cheng 0003, Jiabao Jin, Lei Chen 0002, Xuemin Lin 0001, Libin Zheng. A Queueing-Theoretic Framework for Vehicle Dispatching in Dynamic Car-Hailing
2190 -- 2202Kuntai Cai, Xiaoyu Lei, Jianxin Wei, Xiaokui Xiao. Data Synthesis via Differentially Private Markov Random Field
2203 -- 2215Michael Whittaker, Ailidani Ailijiang, Aleksey Charapko, Murat Demirbas, Neil Giridharan, Joseph M. Hellerstein, Heidi Howard, Ion Stoica, Adriana Szekeres. Scaling Replicated State Machines with Compartmentalization
2216 -- 2229Subhadeep Sarkar, Dimitris Staratzis, Zichen Zhu, Manos Athanassoulis. Constructing and Analyzing the LSM Compaction Design Space
2230 -- 2243Jelle Hellings, Mohammad Sadoghi. ByShard: Sharding in a Byzantine Environment
2244 -- 2257Otmar Ertl. SetSketch: Filling the Gap between MinHash and HyperLogLog
2258 -- 2270Ergute Bao, Yin Yang, Xiaokui Xiao, Bolin Ding. CGM: An Enhanced Mechanism for Streaming Data Collection with Local Differential Privacy
2271 -- 2272Dean De Leo, Per Fuchs, Peter A. Boncz. Errata for "Teseo and the Analysis of Structural Dynamic Graphs"
2273 -- 2282Mashaal Musleh, Sofiane Abbar, Rade Stanojevic, Mohamed F. Mokbel. QARTA: An ML-based System for Accurate Map Services
2283 -- 2295Teddy Cunningham, Graham Cormode, Hakan Ferhatosmanoglu, Divesh Srivastava. Real-World Trajectory Sharing with Local Differential Privacy
2296 -- 2304Phanwadee Sinthong, Michael J. Carey 0001. PolyFrame: A Retargetable Query-based Approach to Scaling Dataframes
2305 -- 2313Jessica Shi, Laxman Dhulipala, David Eisenstat, Jakub Lacki, Vahab S. Mirrokni. Scalable Community Detection via Parallel Correlation Clustering
2314 -- 2326Cheng Xu 0004, Ce Zhang, Jianliang Xu, Jian Pei. SlimChain: Scaling Blockchain Transactions through Off-Chain Storage and Parallel Processing
2327 -- 2340Side Li, Arun Kumar 0001. Towards an Optimized GROUP BY Abstraction for Large-Scale Machine Learning
2341 -- 2354Daniel Kang, John Guibas, Peter D. Bailis, Tatsunori Hashimoto, Yi Sun, Matei Zaharia. Accelerating Approximate Aggregation Queries with Expensive Predicates
2355 -- 2368Tobias Schmidt, Maximilian Bandle, Jana Giceva. A four-dimensional Analysis of Partitioned Approximate Filters
2369 -- 2382Monica Chiosa, Thomas Preußer, Gustavo Alonso. SKT: A One-Pass Multi-Sketch Data Analytics Accelerator
2383 -- 2396Philipp Fent, Thomas Neumann 0001. A Practical Approach to Groupjoin and Nested Aggregates
2397 -- 2409Ziyun Wei, Immanuel Trummer, Connor Anderson. Robust Voice Querying with MUVE: Optimally Visualizing Results of Phonetically Similar Queries
2410 -- 2418Yinjun Wu, James Weimer, Susan B. Davidson. CHEF: A Cheap and Fast Pipeline for Iteratively Cleaning Label Uncertainties
2419 -- 2431Tarique Siddiqui, Surajit Chaudhuri, Vivek R. Narasayya. COMPARE: Accelerating Groupwise Comparison in Relational Databases for Data Analytics
2432 -- 2444Dominik Durner, Badrish Chandramouli, Yinan Li. Crystal: A Unified Cache Storage System for Analytical Databases
2445 -- 2458Valerio Cetorelli, Paolo Atzeni, Valter Crescenzi, Franco Milicchio. The Smallest Extraction Problem
2459 -- 2472Saravanan Thirumuruganathan, Han Li, Nan Tang 0001, Mourad Ouzzani, Yash Govind, Derek Paulsen, Glenn M. Fung, AnHai Doan. Deep Learning for Blocking in Entity Matching: A Design Space Exploration
2473 -- 2482Wentao Zhang, Zhi Yang, Yexin Wang, Yu Shen, Yang Li, Liang Wang, Bin Cui 0001. Grain: Improving Data Efficiency of Graph Neural Networks via Diversified Influence Maximization
2483 -- 2490Maximilian Bandle, Jana Giceva. Database Technology for the Masses: Sub-Operators as First-Class Entities
2491 -- 2504Pranjal Gupta, Amine Mhedhbi, Semih Salihoglu. Columnar Storage and List-based Processing for Graph Database Management Systems
2505 -- 2518Yiwen Zhu, Matteo Interlandi, Abhishek Roy, Krishnadhan Das, Hiren Patel, Malay Bag, Hitesh Sharma, Alekh Jindal. Phoebe: A Learning-based Checkpoint Optimizer
2519 -- 2532Fatemeh Nargesian, Abolfazl Asudeh, H. V. Jagadish. Tailoring Data Source Distributions for Fairness-aware Data Integration
2533 -- 2545Parikshit Bansal, Prathamesh Deshpande, Sunita Sarawagi. Missing Value Imputation on Multidimensional Time Series
2546 -- 2554El Kindi Rezig, Mourad Ouzzani, Walid G. Aref, Ahmed K. Elmagarmid, Ahmed R. Mahmood, Michael Stonebraker. Horizon: Scalable Dependency-driven Data Cleaning
2555 -- 2562Ted Shaowang, Nilesh Jain, Dennis Matthews, Sanjay Krishnan. Declarative Data Serving: The Future of Machine Learning Inference on the Edge
2563 -- 2575Junwen Yang, Yeye He, Surajit Chaudhuri. Auto-Pipeline: Synthesize Data Pipelines By-Target Using Reinforcement Learning and Search
2576 -- 2585Brandon Lockhart, Jinglin Peng, Weiyuan Wu, Jiannan Wang, Eugene Wu 0002. Explaining Inference Queries with Bayesian Optimization
2586 -- 2598Chunwei Liu, Hao Jiang, John Paparrizos, Aaron J. Elmore. Decomposed Bounded Floats for Fast Compression and Queries
2599 -- 2612Nikolaos Tziavelis, Wolfgang Gatterbauer, Mirek Riedewald. Beyond Equi-joins: Ranking, Enumeration and Factorization
2613 -- 2626Vincent Jacob, Fei Song, Arnaud Stiegler, Bijan Rad, Yanlei Diao, Nesime Tatbul. Exathlon: A Benchmark for Explainable Anomaly Detection over Time Series
2627 -- 2641Michael Kuchnik, George Amvrosiadis, Virginia Smith. Progressive Compressed Records: Taking a Byte out of Deep Learning Data
2642 -- 2654Abdulrahman Alsaudi, Yasser Altowim, Sharad Mehrotra, Yaming Yu. TQEL: Framework for Query-Driven Linking of Top-K Entities in Social Media Blogs

Volume 14, Issue 10

0 -- 0Stefan Manegold. Front Matter
1703 -- 1716Raghavendra Addanki, Sainyam Galhotra, Barna Saha. How to Design Robust Algorithms using Noisy Comparison Oracle
1717 -- 1729Paul Boniol, John Paparrizos, Themis Palpanas, Michael J. Franklin. SAND: Streaming Subsequence Anomaly Detection
1730 -- 1742Yingtai Xiao, Zeyu Ding, Yuxin Wang, Danfeng Zhang, Daniel Kifer. Optimizing Fitness-For-Use of Differentially Private Linear Queries
1743 -- 1755Xinle Cao, Jian Liu, Hao Lu, Kui Ren 0001. Cryptanalysis of An Encrypted Database in SIGMOD '14
1756 -- 1768Tianyuan Jin, Yu Yang 0001, Renchi Yang, Jieming Shi, Keke Huang, Xiaokui Xiao. Unconstrained Submodular Maximization with Modular Costs: Tight Approximation and Application to Profit Maximization
1769 -- 1782Yuhao Zhang, Frank Mcquillan, Nandish Jayaram, Nikhil Kak, Ekta Khanna, Orhan Kislal, Domino Valdano, Arun Kumar. Distributed Deep Learning on Data Systems: A Comparative Analysis of Approaches
1783 -- 1796Siyuan Sheng, Qun Huang 0001, Sa Wang, Yungang Bao. PR-Sketch: Monitoring Per-key Aggregation of Streaming Data with Nearly Full Accuracy
1797 -- 1804Dimitrios Koutsoukos, Supun Nakandala, Konstantinos Karanasos, Karla Saur, Gustavo Alonso, Matteo Interlandi. Tensors: An abstraction for general data processing
1805 -- 1817David Pujol, Yikai Wu 0001, Brandon Fain, Ashwin Machanavajjhala. Budget Sharing for Multi-Analyst Differential Privacy
1818 -- 1831Rudi Poepsel Lemaitre, Martin Kiefer, Joscha Von Hein, Jorge-Arnulfo Quiané-Ruiz, Volker Markl. In the Land of Data Streams where Synopses are Missing, One Framework to Bring Them All
1832 -- 1844Yifan Li, Xiaohui Yu 0001, Nick Koudas. Data Acquisition for Improving Machine Learning Models
1845 -- 1858Xiaoshuang Chen, Kai Wang 0037, Xuemin Lin 0001, Wenjie Zhang 0001, Lu Qin, Ying Zhang. Efficiently Answering Reachability and Path Queries on Temporal Bipartite Graphs
1859 -- 1871Paolo Ciaccia, Davide Martinenghi, Riccardo Torlone. Preference Queries over Taxonomic Domains
1872 -- 1885Baoyue Yan, Xuntao Cheng, Bo Jiang, Shibin Chen, Canfang Shang, Jianying Wang, Kenry Huang, Xinjun Yang, Wei Cao, Feifei Li 0001. Revisiting the Design of LSM-tree Based OLTP Storage Engine with Persistent Memory
1886 -- 1899Chang Ge 0002, Shubhankar Mohapatra, Xi He 0001, Ihab F. Ilyas. Kamino: Constraint-Aware Differentially Private Data Synthesis
1900 -- 1912Yingqiang Zhang, Chaoyi Ruan, Cheng Li, Jimmy Yang, Wei Cao, Feifei Li 0001, Bo Wang, Jing Fang, Yuhui Wang, Jingze Huo, Chao Bi. Towards Cost-Effective and Elastic Cloud Database Deployment via Memory Disaggregation
1913 -- 1921Ralph Peeters, Christian Bizer. Dual-Objective Fine-Tuning of BERT for Entity Matching

Volume 14, Issue 1

0 -- 0Xin Luna Dong, Felix Naumann. Front Matter
1 -- 13Ryan Marcus, Andreas Kipf, Alexander van Renen, Mihail Stoian, Sanchit Misra, Alfons Kemper, Thomas Neumann 0001, Tim Kraska. Benchmarking Learned Indexes
14 -- 27Zuozhi Wang, Kai Zeng, Botong Huang, Wei Chen, Xiaozong Cui, Bo Wang, Ji Liu, Liya Fan, Dachuan Qu, Zhenyu Hou, Tao Guan, Chen Li, Jingren Zhou. Tempura: A General Cost-Based Optimizer Framework for Incremental Data Processing
28 -- 36Geon Heo, Yuji Roh, Seonghyeon Hwang, Dayun Lee, Steven Whang. Inspector Gadget: A Data Programming-based Labeling System for Industrial Images
37 -- 49Renchi Yang, Jieming Shi, Xiaokui Xiao, Yin Yang 0001, Juncheng Liu, Sourav S. Bhowmick. Scaling Attributed Network Embedding to Massive Graphs
50 -- 60Yuliang Li 0001, Jinfeng Li, Yoshihiko Suhara, AnHai Doan, Wang Chiew Tan. Deep Entity Matching with Pre-Trained Language Models
61 -- 73Zongheng Yang, Amog Kamsetty, Sifei Luan, Eric Liang, Yan Duan, Peter Chen, Ion Stoica. NeuroCard: One Cardinality Estimator for All Tables