Journal: PVLDB

Volume 12, Issue 9

0 -- 0Lei Chen 0002, Fatma Özcan. Front Matter
948 -- 960Faisal Orakzai, Toon Calders, Torben Bach Pedersen. k/2-hop: Fast Mining of Convoy Patterns With Effective Pruning
961 -- 974Ji Sun, Zeyuan Shang, Guoliang Li 0001, Zhifeng Bao, Dong Deng. Balance-Aware Distributed String Similarity-Based Query Processing System
975 -- 988Pingcheng Ruan, Gang Chen 0001, Anh Dinh, Qian Lin, Beng Chin Ooi, Meihui Zhang. Fine-Grained, Secure and Efficient Data Provenance for Blockchain
989 -- 1001Dalsu Choi, Chang-Sup Park, Yon Dohn Chung. Progressive Top-k Subarray Query Processing in Array Databases
1002 -- 1015Moritz Hoffmann, Andrea Lattuada, Frank McSherry, Vasiliki Kalavri, John Liagouris, Timothy Roscoe. Megaphone: Latency-conscious state migration for distributed streaming dataflows
1016 -- 1029Thanh-Tam Nguyen, Matthias Weidlich, Bolong Zheng, Hongzhi Yin, Quoc Viet Hung Nguyen, Bela Stantic. From Anomaly Detection to Rumour Detection using Data Streams of Social Platforms
1030 -- 1043Peeush Gupta, Yin Li, Sharad Mehrotra, Nisha Panwar, Shantanu Sharma 0001, Sumaya Almanee. Obscure: Information-Theoretic Oblivious and Verifiable Aggregation Queries
1044 -- 1057Anshuman Dutt, Chi Wang 0001, Azade Nazi, Srikanth Kandula, Vivek R. Narasayya, Surajit Chaudhuri. Selectivity Estimation for Range Predicates using Lightweight Models

Volume 12, Issue 8

0 -- 0Lei Chen 0002, Fatma Özcan. Front Matter
836 -- 849Shahram Ghandeharizadeh, Hieu Nguyen. Design, Implementation, and Evaluation of Write-Back Policy with Cache Augmented Data Stores
850 -- 863Thanh-Tam Nguyen, Hongzhi Yin, Matthias Weidlich, Bolong Zheng, Quoc Viet Hung Nguyen, Bela Stantic. User Guidance for Efficient Fact Checking
864 -- 876Xiangyu Ke, Arijit Khan, Leroy Lim Hong Quan. An In-Depth Comparison of s-t Reliability Algorithms over Uncertain Graphs
877 -- 890Wenfei Fan, Chunming Hu, Muyang Liu, Ping Lu, Qiang Yin, Jingren Zhou. Dynamic Scaling for Parallel Graph Computations
891 -- 905Dongsheng Li, Yiming Zhang, Jinyan Wang, Kian-Lee Tan. TopoX: Topology Refactorization for Efficient Graph Partitioning and Processing
906 -- 919Dmitrii Avdiukhin, Sergey Pupyrev, Grigory Yaroslavtsev. Multi-Dimensional Balanced Graph Partitioning via Projected Gradient Descent
920 -- 932Lei Cao, Yizhou Yan, Samuel Madden, Elke A. Rundensteiner, Mathan Gopalsamy. Efficient Discovery of Sequence Outlier Patterns
933 -- 947Dmytro Bogatov, George Kollios, Leonid Reyzin. A Comparative Evaluation of Order-Revealing Encryption Schemes and Secure Range-Query Protocols

Volume 12, Issue 7

0 -- 0Lei Chen 0002, Fatma Özcan. Front Matter
724 -- 737Michael Borkowski, Christoph Hochreiner, Stefan Schulte 0002. Minimizing Cost by Reducing Scaling Operations in Distributed Stream Processing
738 -- 751Yinjun Wu, Abdussalam Alawini, Daniel Deutch, Tova Milo, Susan B. Davidson. ProvCite: Provenance-based Data Citation
752 -- 765Wenfei Fan, Ping Lu, Chao Tian, Jingren Zhou. Deducing Certain Fixes to Graphs
766 -- 778Matteo Ceccarello, Andrea Pietracaprina, Geppino Pucci. Solving k-center Clustering (with Outliers) in MapReduce and Streaming, almost as Accurately as Sequentially
779 -- 792Xiaolan Wang, Alexandra Meliou. Explain3D: Explaining Disagreements in Disjoint Datasets
793 -- 806Youjip Won, Sundoo Kim, Juseong Yun, Damquang Tuan, Jiwon Seo. DASH: Database Shadowing for Mobile DBMS
807 -- 821Zeke Wang, Kaan Kara, Hantian Zhang, Gustavo Alonso, Ce Zhang, Onur Mutlu. Accelerating Generalized Linear Models with MLWeaving: A One-Size-Fits-All System for Any-precision Learning
822 -- 835Dimitrije Jankov, Shangyu Luo, Binhang Yuan, Zhuhua Cai, Jia Zou 0001, Chris Jermaine, Zekai J. Gao. Declarative Recursive Computation on an RDBMS

Volume 12, Issue 6

0 -- 0Lei Chen 0002, Fatma Özcan. Front Matter
624 -- 638Chenggang Wu, Vikram Sreekanti, Joseph M. Hellerstein. Autoscaling Tiered Cloud Storage in Anna
639 -- 652Anton Dignös, Boris Glavic, Xing Niu, Johann Gamper, Michael H. Böhlen. Snapshot Semantics for Temporal Multiset Relations
653 -- 666Selasi Kwashie, Jixue Liu, Jiuyong Li, Lin Liu 0003, Markus Stumptner, Lujing Yang. Certus: An Effective Entity Resolution Approach with Graph Differential Dependencies (GDDs)
667 -- 680Kai Han, Fei Gui, Xiaokui Xiao, Jing Tang, Yuntian He, Zongmai Cao, He Huang 0001. Efficient and Effective Algorithms for Clustering Uncertain Graphs
681 -- 694Jia Zou 0001, Arun Iyengar, Chris Jermaine. Pangea: Monolithic Distributed Storage for Data Analytics
695 -- 708Zhiwei Fan, Jianqiao Zhu, Zuyu Zhang, Aws Albarghouthi, Paraschos Koutris, Jignesh M. Patel. Scaling-Up In-Memory Datalog Processing: Observations and Techniques
709 -- 723Aaron Archer, Kevin Aydin, MohammadHossein Bateni, Vahab S. Mirrokni, Aaron Schild, Ray Yang, Richard Zhuang. Cache-aware load balancing of data center applications

Volume 12, Issue 5

0 -- 0Lei Chen 0002, Fatma Özcan. Front Matter
461 -- 474Cong Fu, Chao Xiang, Changxu Wang, Deng Cai. Fast Approximate Nearest Neighbor Search With The Navigating Spreading-out Graph
475 -- 487Qi Wang, Torsten Suel. Document Reordering for Faster Intersection
488 -- 501Xiaofei Zhang, Tamer Özsu. Correlation Constraint Shortest Path over Large Multi-Relation Graphs
502 -- 515Harald Lang, Thomas Neumann 0001, Alfons Kemper, Peter A. Boncz. Performance-Optimal Filtering: Bloom overtakes Cuckoo at High-Throughput
516 -- 530Steffen Zeuch, Sebastian Breß, Tilmann Rabl, Bonaventura Del Monte, Jeyhun Karimov, Clemens Lutz, Manuel Renz, Jonas Traub, Volker Markl. Analyzing Efficient Stream Processing on Modern Hardware
531 -- 543Chen Luo, Michael J. Carey 0001. Efficient Data Ingestion and Query Processing for LSM-Based Storage Systems
544 -- 556Periklis Chrysogelos, Manos Karpathiotakis, Raja Appuswamy, Anastasia Ailamaki. HetExchange: Encapsulating heterogeneous CPU-GPU parallelism in JIT compiled engines
557 -- 569Paolo Atzeni, Luigi Bellomarini, Paolo Papotti, Riccardo Torlone. Meta-Mappings for Schema Mapping Reuse
570 -- 583Lijie Xu, Tian Guo, Wensheng Dou, Wei Wang 0049, Jun Wei 0001. An Experimental Evaluation of Garbage Collectors on Big Data Applications
584 -- 596Jinwei Guo, Peng Cai, Jiahao Wang, Weining Qian, Aoying Zhou. Adaptive Optimistic Concurrency Control for Heterogeneous Workloads
597 -- 610Yu-Shan Lin, Shao-Kan Pi, Meng-Kai Liao, Ching Tsai, Aaron Elmore, Shan-Hung Wu. MgCrab: Transaction Crabbing for Live Migration in Deterministic Database Systems
611 -- 623Sujaya Maiyya, Faisal Nawab, Divy Agrawal, Amr El Abbadi. Unifying Consensus and Atomic Commitment for Effective Cloud Data Management

Volume 12, Issue 4

0 -- 0Lei Chen 0002, Fatma Özcan. Front Matter
321 -- 334Gurbinder Gill, Roshan Dathathri, Loc Hoang, Keshav Pingali. A Study of Partitioning Policies for Graph Analytics on Large-scale Distributed Platforms
335 -- 347K. Ashwin Kumar, Petros Efstathopoulos. Utility-Driven Graph Summarization
348 -- 361Kaan Kara, Ken Eguro, Ce Zhang, Gustavo Alonso. ColumnML: Column-Store Machine Learning with On-The-Fly Data Transformation
362 -- 375Yanying Li, Haipei Sun, Boxiang Dong, Wendy Hui Wang. Cost-efficient Data Acquisition on Online Data Marketplaces for Correlation Analysis
376 -- 389Mohamad Dolatshah, Mathew Teoh, Jiannan Wang, Jian Pei. Cleaning Crowdsourced Labels Using Oracles For Statistical Classification
390 -- 403Matteo Lissandrini, Martin Brugnara, Yannis Velegrakis. Beyond Macrobenchmarks: Microbenchmark-based Graph Database Evaluation
404 -- 418Valter Balegas, Sérgio Duarte, Carla Ferreira 0001, Rodrigo Rodrigues, Nuno M. Preguiça. IPA: Invariant-preserving Applications for Weakly consistent Replicated Databases
419 -- 432Firas Abuzaid, Peter Kraft, Sahaana Suri, Edward Gan, Eric Xu, Atul Shenoy, Asvin Ananthanarayan, John Sheu, Erik Meijer 0001, Xi Wu 0001, Jeffrey F. Naughton, Peter Bailis, Matei Zaharia. DIFF: A Relational Interface for Large-Scale Data Explanation
433 -- 445Ran Ben-Basat, Roy Friedman, Rana Shahout. Stream Frequency Over Interval Queries
446 -- 460Doris Xin, Stephen Macke, Litian Ma, Jialin Liu, Shuchen Song, Aditya G. Parameswaran. Helix: Holistic Optimization for Accelerating Iterative Machine Learning

Volume 12, Issue 3

183 -- 196Ting Xie, Varun Chandola, Oliver Kennedy. Query Log Compression for Workload Analytics
197 -- 209Mohammed Eunus Ali, Shadman Saqib Eusuf, Kaysar Abdullah, Farhana Murtaza Choudhury, J. Shane Culpepper, Timos Sellis. The Maximum Trajectory Coverage Query in Spatial Databases
210 -- 222Chenggang Wu, Alekh Jindal, Saeed Amizadeh, Hiren Patel, Wangchao Le, Shi Qiao, Sriram Rao. Towards a Learning Optimizer for Shared Clouds
223 -- 236Paroma Varma, Christopher Ré. Snuba: Automating Weak Supervision to Label Training Data
237 -- 250Abolfazl Asudeh, H. V. Jagadish, Gerome Miklau, Julia Stoyanovich. On Obtaining Stable Rankings
251 -- 264Shuping Ji, Hans-Arno Jacobsen. PS-Tree-based Efficient Boolean Expression Matching for High Dimensional and Dense Workloads
265 -- 277Yizhou Yan, Lei Cao, Samuel Madden, Elke A. Rundensteiner. SWIFT: Mining Representative Patterns from Large Event Streams
278 -- 291Paul Suganthan G. C., Adel Ardalan, AnHai Doan, Aditya Akella. Smurf: Self-Service String Matching Using Random Forests
292 -- 306Feilong Liu, Ario Salmasi, Spyros Blanas, Anastasios Sidiropoulos. Chasing Similarity: Distribution-aware Aggregation Scheduling
307 -- 320Johes Bater, Xi He, William Ehrich, Ashwin Machanavajjhala, Jennie Rogers. ShrinkWrap: Efficient SQL Query Processing in Differentially Private Data Federations

Volume 12, Issue 2

0 -- 0Lei Chen 0002, Fatma Özcan. Front Matter
85 -- 98Tobias Bleifuß, Leon Bornemann, Theodore Johnson, Dmitri V. Kalashnikov, Felix Naumann, Divesh Srivastava. Exploring Change - A New Dimension of Data Analytics
99 -- 111Bishwamittra Ghosh, Mohammed Eunus Ali, Farhana Murtaza Choudhury, Sajid Hasan Apon, Timos Sellis, Jianxin Li. The Flexible Socio Spatial Group Queries
112 -- 127Karima Echihabi, Kostas Zoumpatianos, Themis Palpanas, Houda Benbrahim. The Lernaean Hydra of Data Series Similarity Search: An Experimental Evaluation of the State of the Art
128 -- 140Wei Wang 0059, Sheng Wang, Jinyang Gao, Meihui Zhang, Gang Chen 0001, Teck Khim Ng, Beng Chin Ooi, Jie Shao. Rafiki: Machine Learning as an Analytics Service System
141 -- 153Pavle Subotic, Herbert Jordan, Lijun Chang, Alan Fekete, Bernhard Scholz. Automatic Index Selection for Large-Scale Datalog Computation
154 -- 168Shuang Song, Xu Liu 0001, Qinzhe Wu, Andreas Gerstlauer, Tao Li, Lizy K. John. Start Late or Finish Early: A Distributed Graph Processing System with Redundancy Reduction
169 -- 182Bailu Ding, Lucja Kot, Johannes Gehrke. Improving Optimistic Concurrency Control Through Transaction Batching and Operation Reordering

Volume 12, Issue 13

0 -- 0Lei Chen 0002, Fatma Özcan. Front Matter
2325 -- 2338Claude Barthels, Ingo Müller 0002, Konstantin Taranov, Gustavo Alonso, Torsten Hoefler. Strong consistency is not hard to get: Two-Phase Locking and Two-Phase Commit on Thousands of Cores
2339 -- 2352Ziheng Wei, Uwe Leck, Sebastian Link. Discovery and Ranking of Embedded Uniqueness Constraints
2353 -- 2365Lingyang Chu, Yanyan Zhang, Yu Yang 0001, Lanjun Wang, Jian Pei. Online Density Bursting Subgraph Detection from Temporal Graphs
2366 -- 2378Pedro Holanda, Stefan Manegold, Hannes Mühleisen, Mark Raasveldt. Progressive Indexes: Indexing for Interactive Data Analysis
2379 -- 2392Masatoshi Hanai, Toyotaro Suzumura, Wen Jun Tan, Elvis S. Liu, Georgios Theodoropoulos 0001, Wentong Cai. Distributed Edge Partitioning for Trillion-edge Graphs
2393 -- 2407Manos Athanassoulis, Kenneth S. Bøgh, Stratos Idreos. Optimal Column Layout for Hybrid Workloads
2408 -- 2421Stavros Sintos, Pankaj Agarwal, Jun Yang 0001. Selecting Data to Clean for Fact Checking: Minimizing Uncertainty vs. Maximizing Surprise

Volume 12, Issue 12

0 -- 0Lei Chen 0002, Fatma Özcan. Front Matter
1778 -- 1781Guilherme Damasio, Spencer Bryson, Vincent Corvinelli, Parke Godfrey, Piotr Mierzejewski, Jaroslaw Szlichta, Calisto Zuzarte. GALO: Guided Automated Learning for re-Optimization
1782 -- 1785Yuanyuan Tian, Suijun Tong, Mir Hamid Pirahesh, Wen Sun, En Liang Xu, Wei Zhao. Synergistic Graph and SQL Analytics Inside IBM Db2
1786 -- 1789Xiaoou Ding, Hongzhi Wang, Jiaxuan Su, Zijue Li, Jianzhong Li, Hong Gao. Cleanits: A Data Cleaning System for Industrial Time Series
1790 -- 1793Yipeng Zhang, Zhifeng Bao, Songsong Mo, Yuchen Li, Yanghao Zhou. ITAA: An Intelligent Trajectory-driven Outdoor Advertising Deployment Assistant
1794 -- 1797Kun Qian 0002, Lucian Popa 0001, Prithviraj Sen. SystemER: A Human-in-the-loop System for Explainable Entity Resolution
1798 -- 1801Viet-Phi Huynh, Paolo Papotti. Buckle: Evaluating Fact Checking Algorithms Built on Knowledge Bases
1802 -- 1805Peng Gao, Xusheng Xiao, Zhichun Li, Kangkook Jee, Fengyuan Xu, Sanjeev R. Kulkarni, Prateek Mittal. A Query System for Efficiently Investigating Complex Attack Behaviors for Enterprise Security
1806 -- 1809Zhengjie Miao, Qitian Zeng, Chenjie Li, Boris Glavic, Oliver Kennedy, Sudeepa Roy. CAPE: Explaining Outliers by Counterbalancing
1810 -- 1813Karthik Ramachandra 0002, Kwanghyun Park. BlackMagic: Automatic Inlining of Scalar UDFs into SQL Queries with Froid
1814 -- 1817Lukas Berg, Tobias Ziegler 0001, Carsten Binnig, Uwe Röhm. ProgressiveDB - Progressive Data Analytics as a Middleware
1818 -- 1821Kaan Kara, Zeke Wang, Ce Zhang, Gustavo Alonso. doppioDB 2.0: Hardware Techniques for Improved Integration of Machine Learning into Databases
1822 -- 1825Cícero A. L. Pahins, Behrooz Omidvar Tehrani, Sihem Amer-Yahia, Valérie Siroux, Jean Louis Pépin, Jean-Christian Borel, João Comba. COVIZ: A System for Visual Formation and Exploration of Patient Cohorts
1826 -- 1829Martin Franke, Ziad Sehili, Erhard Rahm. PRIMAT: A Toolbox for Fast Privacy-preserving Matching
1830 -- 1833Ryan Marcus, Chi Zhang, Shuai Yu, Geoffrey Kao, Olga Papaemmanouil. NashDB: Fragmentation, Replication, and Provisioning using Economic Methods
1834 -- 1837Ibrahim Sabek, Mashaal Musleh, Mohamed F. Mokbel. Flash in Action: Scalable Spatial Data Analysis Using Markov Logic Networks
1838 -- 1841Lucas Kuhring, Zsolt István. I Can't Believe It's Not (Only) Software! Bionic Distributed Storage for Parquet Files
1842 -- 1845Hyewon Choi, Erkang Zhu, Arsala Bangash, Renée J. Miller. VISE: Vehicle Image Search Engine with Traffic Camera
1846 -- 1849Stephan Goldberg, Tova Milo, Slava Novgorodov, Kathy Razmadze. WiClean: A System for Fixing Wikipedia Interlinks Using Revision History Patterns
1850 -- 1853Abhishek Roy, Alekh Jindal, Hiren Patel, Ashit Gosalia, Subru Krishnan, Carlo Curino. SparkCruise: Handsfree Computation Reuse in Spark
1854 -- 1857Sandeep Singh Sandha, Wellington Cabrera, Mohammed Al-Kateb, Sanjay Nair, Mani B. Srivastava. In-database Distributed Machine Learning: Demonstration using Teradata SQL Engine
1858 -- 1861Zhao Li, Xia Chen, Xuming Pan, Pengcheng Zou, Yuchen Li, Guoxian Yu. SHOAL: Large-scale Hierarchical Taxonomy via Graph-based Query Coalition in E-commerce
1862 -- 1865Min Xu, Tianhao Wang 0001, Bolin Ding, Jingren Zhou, Cheng Hong, Zhicong Huang. DPSAaS: Multi-Dimensional Data Sharing and Analytics as Services under Local Differential Privacy
1866 -- 1869Yang Cao 0011, Yonghui Xiao, Li Xiong 0001, Liquan Bai, Masatoshi Yoshikawa. PriSTE: Protecting Spatiotemporal Event Privacy in Continuous Location-Based Services
1870 -- 1873Daniel Deutch, Evgeny Marants, Yuval Moskovitch. Datalignment: Ontology Schema Alignment Through Datalog Containment
1874 -- 1877Congcong Ge, Yunjun Gao, Xiaoye Miao, Lu Chen, Christian S. Jensen, Ziyuan Zhu. IHCS: An Integrated Hybrid Cleaning System
1878 -- 1881Constantinos Costa, Xiaoyu Ge, Panos K. Chrysanthis. CAPRIO: Graph-based Integration of Indoor and Outdoor Data for Path Discovery
1882 -- 1885Yingjun Wu, Jia Yu, Yuanyuan Tian, Richard Sidle, Ronald Barber. HERMIT in Action: Succinct Secondary Indexing Mechanism via Correlation Exploration
1886 -- 1889Julien Loudet, Iulian Sandu Popa, Luc Bouganim. DISPERS: Securing Highly Distributed Queries on Personal Data Management Systems
1890 -- 1893Adil Akhter, Marios Fragkoulis, Asterios Katsifodimos. Stateful Functions as a Service in Action
1894 -- 1897Allen Ordookhanians, Xin Li, Supun Nakandala, Arun Kumar. Demonstration of Krypton: Optimized CNN Inference for Occlusion-based Deep CNN Explanations
1898 -- 1901Zhengjie Miao, Andrew Lee, Sudeepa Roy. LensXPlain: Visualizing and Explaining Contributing Subsets for Aggregate Query Answers
1902 -- 1905Yi Zhang, Zachary G. Ives. Juneau: Data Lake Management for Jupyter
1906 -- 1909Sona Hasani, Faezeh Ghaderi, Shohedul Hasan, Saravanan Thirumuruganathan, Abolfazl Asudeh, Nick Koudas, Gautam Das 0001. ApproxML: Efficient Approximate Ad-Hoc ML Models Through Materialization and Reuse
1910 -- 1913Grégory M. Essertel, Ruby Y. Tahboub, Fei Wang, James M. Decker, Tiark Rompf. Flare & Lantern: Efficiently Swapping Horses Midstream
1914 -- 1917Ruben Martins, Jia Chen, Yanju Chen, Yu Feng, Isil Dillig. Trinity: An Extensible Synthesis Framework for Data Science
1918 -- 1921Zhiqi Huang, Ryan McKenna, George Bissias, Gerome Miklau, Michael Hay, Ashwin Machanavajjhala. PSynDB: Accurate and Accessible Private Data Generation
1922 -- 1925Badrish Chandramouli, Dong Xie 0001, Yinan Li, Donald Kossmann. FishStore: Fast Ingestion and Indexing of Raw Data
1926 -- 1929Yanlei Diao, Pawel Guzewicz, Ioana Manolescu, Mirjana Mazuran. Spade: A Modular Framework for Analytical Exploration of RDF Graphs
1930 -- 1933Joseph Vinish D'silva, Florestan De Moor, Bettina Kemme. Making an RDBMS Data Scientist Friendly: Advanced In-database Interactive Analytics with Visualization Support
1934 -- 1937Khaled Zaouk, Fei Song, Chenghao Lyu, Arnab Sinha, Yanlei Diao, Prashant J. Shenoy. UDAO: A Next-Generation Unified Data Analytics Optimizer
1938 -- 1941Saehan Jo, Immanuel Trummer, Weicheng Yu, Xuezhi Wang 0002, Cong Yu 0001, Daniel Liu, Niyati Mehta. AggChecker: A Fact-Checking System for Text Summaries of Relational Data Sets
1942 -- 1945Hanzhang Wang, Phuong Nguyen, Jun Li, Selcuk Kopru, Gene Zhang, Sanjeev Katariya, Sami Ben-romdhane. GRANO: Interactive Graph-based Root Cause Analysis for Cloud-Native Distributed Data Platform
1946 -- 1949Davide Frey, Marc X. Makkes, Pierre-Louis Roman, François Taïani, Spyros Voulgaris. Dietcoin: Hardening Bitcoin Transaction Verification Process For Mobile Devices
1950 -- 1953Samriddhi Singla, Ahmed Eldawy, Rami Alghamdi, Mohamed F. Mokbel. Raptor: Large Scale Analysis of Big Raster and Vector Data
1954 -- 1957El Kindi Rezig, Lei Cao, Michael Stonebraker, Giovanni Simonini, Wenbo Tao, Samuel Madden, Mourad Ouzzani, Nan Tang 0001, Ahmed K. Elmagarmid. Data Civilizer 2.0: A Holistic Framework for Data Preparation and Analytics
1958 -- 1961Leonhard F. Spiegelberg, Tim Kraska. Tuplex: Robust, Efficient Analytics When Python Rules
1962 -- 1965Cédric Renggli, Frances Ann Hubis, Bojan Karlas, Kevin Schawinski, Wentao Wu 0001, Ce Zhang. Ease.ml/ci and Ease.ml/meter in Action: Towards Data Management for Statistical Generalization
1966 -- 1969Han Xueran, Jun Chen, Jiaheng Lu, Yueguo Chen, Xiaoyong Du. PivotE: Revealing and Visualizing the Underlying Entity Structures for Exploration
1970 -- 1973Jiaheng Lu, Yuxing Chen, Herodotos Herodotou, Shivnath Babu. Speedup Your Analytics: Automatic Parameter Tuning for Databases and Big Data Systems
1974 -- 1977Yu Meng, Jiaxin Huang, Jingbo Shang, Jiawei Han 0001. TextCube: Automated Construction and Multidimensional Exploration
1978 -- 1981Sihem Amer-Yahia, Senjuti Basu Roy. The Ever Evolving Online Labor Market: Overview, Challenges and Opportunities
1982 -- 1985Ibrahim Sabek, Mohamed F. Mokbel. Machine Learning Meets Big Spatial Data
1986 -- 1989Fatemeh Nargesian, Erkang Zhu, Renée J. Miller, Ken Q. Pu, Patricia C. Arocena. Data Lake Management: Challenges and Opportunities
1990 -- 1993Laks V. S. Lakshmanan, Michael Simpson, Saravanan Thirumuruganathan. Combating Fake News: A Data Management and Mining Perspective
1994 -- 1997Nicolas Anciaux, Luc Bouganim, Philippe Pucheral, Iulian Sandu Popa, Guillaume Scerri. Personal Database Security and Trusted Execution Environments: A Tutorial at the Crossroads
1998 -- 2009Stephan Kessler, Jens Hoff, Johann Christoph Freytag. SAP HANA goes private - From Privacy Research to Privacy Aware Enterprise Analytics
2010 -- 2021Guilherme Damasio, Vincent Corvinelli, Parke Godfrey, Piotr Mierzejewski, Alexandar Mihaylov, Jaroslaw Szlichta, Calisto Zuzarte. Guided automated learning for query workload re-optimization
2022 -- 2034Biswapesh Chattopadhyay, Priyam Dutta, Weiran Liu, Ott Tinn, Andrew McCormick, Aniket Mokashi, Paul Harvey, Hector Gonzalez, David Lomax, Sagar Mittal, Roee Ebenstein, Nikita Mikhaylin, Hung-Ching Lee, Xiaoyan Zhao, Tony Xu, Luis Perez, Farhad Shahmohammadi, Tran Bui, Neil Mckay, Selcuk Aya, Vera Lychagina, Brett Elliott. Procella: Unifying serving and analytical data at YouTube
2035 -- 2046Wei Lu, Zhanhao Zhao, Xiaoyu Wang, Haixiang Li, Zhenmiao Zhang, Zhiyu Shui, Sheng Ye, Anqun Pan, Xiaoyong Du. A Lightweight and Efficient Temporal Database Management System in TDSQL
2047 -- 2058Reza Sherkat, Colin Florendo, Mihnea Andrei, Rolando Blanco, Adrian Dragusanu, Amit Pathak, Pushkar Khadilkar, Neeraj Kulkarni, Christian Lemke, Sebastian Seifert, Sarika Iyer, Sasikanth Gottapu, Robert Schulze, Chaitanya Gottipati, Nirvik Basak, Yanhong Wang, Vivek Kandiyanallur, Santosh Pendap, Dheren Gala, Rajesh Almeida, Prasanta Ghosh. Native Store Extension for SAP HANA
2059 -- 2070Chaoqun Zhan, Maomeng Su, Chuangxian Wei, Xiaoqiang Peng, Liang Lin, Sheng Wang, Zhe Chen, Feifei Li 0001, Yue Pan, Fang Zheng, Chengliang Chai. AnalyticDB: Real-time OLAP Database System at Alibaba Cloud
2071 -- 2081William Schultz, Tess Avitabile, Alyson Cabral. Tunable Consistency in MongoDB
2082 -- 2093Shaosheng Cao, Xinxing Yang, Cen Chen, Jun Zhou, Xiaolong Li, Yuan Qi 0001. TitAnt: Online Real-time Transaction Fraud Detection in Ant Financial
2094 -- 2105Rong Zhu, Kun Zhao, Hongxia Yang, Wei Lin, Chang Zhou, Baole Ai, Yong Li, Jingren Zhou. AliGraph: A Comprehensive Graph Neural Network Platform
2106 -- 2117Zhimin Chen, Yue Wang, Vivek R. Narasayya, Surajit Chaudhuri. Customizable and Scalable Fuzzy Join for Big Data
2118 -- 2130Guoliang Li, Xuanhe Zhou, Shifu Li, Bo Gao. QTune: A Query-Aware Database Tuning System with Deep Reinforcement Learning
2131 -- 2142Srikanth Kandula, Kukjin Lee, Surajit Chaudhuri, Marc Friedman. Experiences with Approximating Queries in Microsoft's Production Big-Data Clusters
2143 -- 2154Panagiotis Antonopoulos, Peter Byrne, Wayne Chen, Cristian Diaconu, Raghavendra Thallam Kodandaramaih, Hanuma Kodavalla, Prashanth Purnananda, Adrian-Leonard Radu, Chaitanya Sreenivas Ravella, Girish Mittur Venkataramanappa. Constant Time Recovery in Azure SQL Database
2155 -- 2169Yuzhen Huang, Yingjie Shi, Zheng Zhong, Yihui Feng, James Cheng, Jiwei Li, Haochuan Fan, Chao Li, Tao Guan, Jingren Zhou. Yugong: Geo-Distributed Data and Job Placement at Scale
2170 -- 2182Junjay Tan, Thanaa Ghanem, Matthew Perron, Xiangyao Yu, Michael Stonebraker, David J. DeWitt, Marco Serafini, Ashraf Aboulnaga, Tim Kraska. Choosing A Cloud DBMS: Architectures and Tradeoffs
2183 -- 2194Jingtian Zhang, Sai Wu, Zeyuan Tan, Gang Chen, Zhushi Cheng, Wei Cao, Yusong Gao, Xiaojie Feng. S3: A Scalable In-memory Skip-List Index for Key-Value Store
2195 -- 2205Charles Masson, Jee E. Rim, Homin K. Lee. DDSketch: A Fast and Fully-Mergeable Quantile Sketch with Relative-Error Guarantees
2206 -- 2217Qiang Long, Wei Wang, Jinfu Deng, Song Liu, Wenhao Huang, Fangying Chen, SiFan Liu. A Distributed System for Large-scale n-gram Language Models at Tencent
2218 -- 2229Kayhan Dursun, Carsten Binnig, Ugur Çetintemel, Garret Swart, Weiwei Gong. A Morsel-Driven Query Execution Engine for Heterogeneous Multi-Cores
2230 -- 2241Lei Cao, Wenbo Tao, Sungtae An, Jing Jin, Yizhou Yan, Xiaoyu Liu, Wendong Ge, Adam Sah, Leilani Battle, Jimeng Sun, Remco Chang, M. Brandon Westover, Samuel Madden, Michael Stonebraker. Smile: A System to Support Machine Learning on EEG Data at Scale
2242 -- 2253Alastair Green, Paolo Guagliardo, Leonid Libkin, Tobias Lindaaker, Victor Marsault, Stefan Plantikow, Martin Schuster, Petra Selmer, Hannes Voigt. Updating Graph Databases with Cypher
2254 -- 2262Asya Kamsky. Adapting TPC-C Benchmark to Measure Performance of Multi-Document Transactions in MongoDB
2263 -- 2272Feifei Li 0001. Cloud native database systems at Alibaba: Opportunities and Challenges
2273 -- 2274Alexander Boehm 0002. In-Memory for the masses: Enabling cost-efficient deployments of in-memory data management platforms for business applications
2275 -- 2286Murtadha Al Hubail, Ali Alsuliman, Michael Blow, Michael J. Carey 0001, Dmitry Lychagin, Ian Maxon, Till Westmann. Couchbase Analytics: NoETL for Scalable NoSQL Data Analysis
2287 -- 2289Adrian Colyer. Performance in the spotlight
2290 -- 2299Azza Abouzied, Daniel J. Abadi, Kamil Bajda-Pawlikowski, Avi Silberschatz. Integration of Large-Scale Data Processing Systems and Traditional Parallel Database Technology
2300 -- 2307Brian F. Cooper, P. P. S. Narayan, Raghu Ramakrishnan, Utkarsh Srivastava, Adam Silberstein, Philip Bohannon, Hans-Arno Jacobsen, Nick Puz, Daniel Weaver, Ramana Yerneni. PNUTS to Sherpa: Lessons from Yahoo!'s Cloud Database
2308 -- 0Wang Chiew Tan. What I probably did right and what I think I could have done better
2309 -- 2322Aditya Parameswaran. Enabling Data Science for the Majority
2323 -- 2324Theodoros Rekatsinas, Sudeepa Roy, Manasi Vartak, Ce Zhang, Neoklis Polyzotis. Opportunities for Data Management Research in the Era of Horizontal AI/ML

Volume 12, Issue 11

0 -- 0Lei Chen 0002, Fatma Özcan. Front Matter
1235 -- 1248Michael J. Whittaker, Nick Edmonds, Sandeep Tata, James Bradley Wendt, Marc Najork. Online Template Induction for Machine-Generated Emails
1249 -- 1261Yong Wang, Guoliang Li 0001, Nan Tang 0001. Querying Shortest Paths on Time Dependent Road Networks
1262 -- 1275Anna Fariha, Alexandra Meliou. Example-Driven Query Intent Discovery: Abductive Reasoning using Semantic Similarity
1276 -- 1288Qi Zhou, Joy Arulraj, Shamkant B. Navathe, William Harris, Dong Xu. Automated Verification of Query Equivalence Using Satisfiability Modulo Theories
1289 -- 1302Pengfei Xu, Jiaheng Lu. Towards a Unified Framework for String Similarity Joins
1303 -- 1315Susik Yoon, Jae-Gil Lee 0001, Byung Suk Lee. NETS: Extremely Fast Outlier Detection from a Data Stream via Set-Based Processing
1316 -- 1329Yi Lu, Xiangyao Yu, Samuel Madden. STAR: Scaling Transactions through Asymmetric Replication
1330 -- 1343Yuliang Li, Aaron Feng, Jinfeng Li, Saran Mumick, Alon Y. Halevy, Vivian Li, Wang Chiew Tan. Subjective Databases
1344 -- 1356Xuguang Ren, Junhu Wang, Wook-Shin Han, Jeffrey Xu Yu. Fast and Robust Distributed Subgraph Enumeration
1357 -- 1370Fangcheng Fu, Jiawei Jiang, Yingxia Shao, Bin Cui 0001. An Experimental Evaluation of Large Scale GBDT Systems
1371 -- 1384Ios Kotsogiannis, Yuchao Tao, Xi He, Maryam Fanaeepour, Ashwin Machanavajjhala, Michael Hay, Gerome Miklau. PrivateSQL: A Differentially Private SQL Query Engine
1385 -- 1398Mohammad Javad Amiri, Divyakant Agrawal, Amr El Abbadi. CAPER: A Cross-Application Permissioned Blockchain
1399 -- 1413Alexandros Koliousis, Pijika Watcharapichat, Matthias Weidlich, Luo Mai, Paolo Costa, Peter R. Pietzuch. Crossbow: Scaling Deep Learning with Small Batch Sizes on Multi-GPU Servers
1414 -- 1426Kaiyu Feng, Gao Cong, Christian S. Jensen, Tao Guo. Finding Attribute-Aware Similar Region for Data Analysis
1427 -- 1441Dixin Tang, Zechao Shang, Aaron J. Elmore, Sanjay Krishnan, Michael J. Franklin. Intermittent Query Processing
1442 -- 1457Mihai Budiu, Parikshit Gopalan, Lalith Suresh, Udi Wieder, Han Kruiger, Marcos K. Aguilera. Hillview: A trillion-cell spreadsheet for big data
1458 -- 1470Ziheng Wei, Sebastian Link. Embedded Functional Dependencies and Data-completeness Tailored Database Design
1471 -- 1484Hua Fan, Wojciech Golab. Ocean Vista: Gossip-Based Visibility Control for Speedy Geo-Distributed Transactions
1485 -- 1498Xikui Wang, Michael Carey. An IDEA: An Ingestion Framework for Data Enrichment in AsterixDB
1499 -- 1512Alexey Karyakin, Kenneth Salem. DimmStore: Memory Power Optimization for Database Systems
1513 -- 1525Cong Yan, Alvin Cheung. Generating Application-specific Data Layouts for In-memory Databases
1526 -- 1538Rihan Hai, Christoph Quix. Rewriting of Plain SO Tgds into Nested Tgds
1539 -- 1552Senthil Nathan, Chander Govindarajan, Adarsh Saraf, Manish Sethi, Praveen Jayachandran. Blockchain Meets Database: Design and Implementation of a Blockchain Relational Database
1553 -- 1567Andreas Kunft, Asterios Katsifodimos, Sebastian Schelter, Sebastian Breß, Tilmann Rabl, Volker Markl. An Intermediate Representation for Optimizing Machine Learning Pipelines
1568 -- 1582Yuanwei Fang, Chen Zou, Andrew A. Chien. Accelerating Raw Data Analysis with the ACCORDA Software and Hardware Architecture
1583 -- 1596A. B. Siddique, Ahmed Eldawy, Vagelis Hristidis. Comparing Synopsis Techniques for Approximate Spatial Data Analysis
1597 -- 1609Muhammad El-Hindi, Carsten Binnig, Arvind Arasu, Donald Kossmann, Ravi Ramamurthy. BlockchainDB - A Shared Database on Blockchains
1610 -- 1623Ruoxi Jia, David Dao, Boxin Wang, Frances Ann Hubis, Nezihe Merve Gürel, Bo Li 0026, Ce Zhang, Costas J. Spanos, Dawn Song. Efficient Task-Specific Data Valuation for Nearest Neighbor Algorithms
1624 -- 1636Hemant Saxena, Lukasz Golab, Ihab F. Ilyas. Distributed Implementations of Dependency Discovery Algorithms
1637 -- 1650Erfan Zamanian, Xiangyao Yu, Michael Stonebraker, Tim Kraska. Rethinking Database High Availability with RDMA Networks
1651 -- 1663Marco Bressan 0002, Stefano Leucci 0001, Alessandro Panconesi. Motivo: Fast Motif Counting via Succinct Color Coding and Adaptive Sampling
1664 -- 1678Rishabh Poddar, Tobias Boelter, Raluca Ada Popa. Arx: An Encrypted Database using Semantically Secure Encryption
1679 -- 1691Junyang Gao, Xian Li, Yifan Ethan Xu, Bunyamin Sisman, Xin Luna Dong, Jun Yang. Efficient Knowledge Graph Accuracy Evaluation
1692 -- 1704Amine Mhedhbi, Semih Salihoglu. Optimizing Subgraph Queries by Combining Binary and Worst-Case Optimal Joins
1705 -- 1718Ryan C. Marcus, Parimarjan Negi, Hongzi Mao, Chi Zhang, Mohammad Alizadeh, Tim Kraska, Olga Papaemmanouil, Nesime Tatbul. Neo: A Learned Query Optimizer
1719 -- 1732Yixiang Fang, Kaiqiang Yu, Reynold Cheng, Laks V. S. Lakshmanan, Xuemin Lin. Efficient Algorithms for Densest Subgraph Discovery
1733 -- 1746Ryan C. Marcus, Olga Papaemmanouil. Plan-Structured Deep Neural Network Models for Query Performance Prediction
1747 -- 1761Kun Ren, Dennis Li, Daniel J. Abadi. SLOG: Serializable, Low-latency, Geo-replicated Transactions
1762 -- 1777John Paparrizos, Michael J. Franklin. GRAIL: Efficient Time-Series Representation Learning

Volume 12, Issue 10

0 -- 0Lei Chen 0002, Fatma Özcan. Front Matter
1058 -- 1070Ye Yuan 0001, Xiang Lian, Guoren Wang, Yuliang Ma, Yishu Wang. Constrained Shortest Path Query in a Large Time-Dependent Graph
1071 -- 1084Lingyang Chu, Zhefeng Wang, Jian Pei, Yanyan Zhang, Yu Yang 0001, Enhong Chen. Finding Theme Communities from Database Networks
1085 -- 1098James Pan, Guoliang Li 0001, Juntao Hu. Ridesharing: Simulator, Benchmark, and Evaluation
1099 -- 1112Longbin Lai, Zhu Qing, Zhengyi Yang, Xin Jin, Zhengmin Lai, Ran Wang, Kongzhang Hao, Xuemin Lin, Lu Qin, Wenjie Zhang 0001, Ying Zhang 0001, Zhengping Qian, Jingren Zhou. Distributed Subgraph Matching on Timely Dataflow
1113 -- 1125Shi Qiao, Adrian Nicoara, Jin Sun, Marc Friedman, Hiren Patel, Jaliya Ekanayake. Hyper Dimension Shuffle: Efficient Data Repartition at Petabyte Scale in Scope
1126 -- 1138Graham Cormode, Tejas Kulkarni, Divesh Srivastava. Answering Range Queries Under Local Differential Privacy
1139 -- 1152Kai Wang, Xuemin Lin, Lu Qin, Wenjie Zhang 0001, Ying Zhang 0001. Vertex Priority Based Butterfly Counting for Large-scale Bipartite Networks
1153 -- 1166Yang Cao 0012, Wenfei Fan, Tengfei Yuan. Block as a Value for SQL over NoSQL
1167 -- 1180Kanat Tangwongsan, Martin Hirzel, Scott Schneider 0001. Optimal and General Out-of-Order Sliding-Window Aggregation
1181 -- 1194Bo Tang, Kyriakos Mouratidis, Man Lung Yiu, Zhenyu Chen. Creating Top Ranking Options in the Continuous Option and Preference Space
1195 -- 1207Hanchao Ma, Morteza Alipour Langouri, Yinghui Wu, Fei Chiang, Jiaxing Pi. Ontology-based Entity Matching in Attributed Graphs
1208 -- 1220Lu Chen, Yunjun Gao, Ziquan Fang, Xiaoye Miao, Christian S. Jensen, Chenjuan Guo. Real-time Distributed Co-Movement Pattern Detection on Streaming Trajectories
1221 -- 1234Jian Tan, Tieying Zhang, Feifei Li 0001, Jie Chen, Qixing Zheng, Ping Zhang, Honglin Qiao, Yue Shi, Wei Cao, Rui Zhang. iBTune: Individualized Buffer Tuning for Large-scale Cloud Databases

Volume 12, Issue 1

0 -- 0Lei Chen 0002, Fatma Özcan. Front Matter
1 -- 13Sunghwan Kim, Taesung Lee, Seung-won Hwang, Sameh Elnikety. List Intersection for Web Search: Algorithms, Cost Models, and Optimizations
14 -- 27Michael Whittaker, Joseph M. Hellerstein. Interactive Checks for Coordination Avoidance
28 -- 42Jianbin Qin, Chuan Xiao. Pigeonring: A Principle for Faster Thresholded Similarity Search
43 -- 56Ahmet Erdem Sariyüce, C. Seshadhri, Ali Pinar. Local Algorithms for Hierarchical Dense Subgraph Discovery
57 -- 70Jingru Yang, Ju Fan, Zhewei Wei, Guoliang Li 0001, Tongyu Liu, Xiaoyong Du. Cost-Effective Data Annotation using Game-Based Crowdsourcing
71 -- 84Enhui Huang, Liping Peng, Luciano Di Palma, Ahmed Abdelkafi, Anna Liu, Yanlei Diao. Optimization for Active Learning-based Interactive Database Exploration