| 0 | -- | 0 | Chen Li 0001, Volker Markl. Front Matter |
| 1334 | -- | 1345 | Weimo Liu, Md Farhadur Rahman, Saravanan Thirumuruganathan, Nan Zhang 0004, Gautam Das. Aggregate Estimations over Location Based Services |
| 1346 | -- | 1357 | Souvik Bhattacherjee, Amit Chavan, Silu Huang, Amol Deshpande, Aditya G. Parameswaran. Principles of Dataset Versioning: Exploring the Recreation/Storage Tradeoff |
| 1358 | -- | 1369 | Yeye He, Kris Ganjam, Xu Chu. SEMA-JOIN: Joining Semantically-Related Tables Using Big Table Corpora |
| 1370 | -- | 1381 | Sanjay Krishnan, Jiannan Wang, Michael J. Franklin, Ken Goldberg, Tim Kraska. Stale View Cleaning: Getting Fresh Answers from Stale Materialized Views |
| 1382 | -- | 1393 | Parth Nagarkar, K. Selçuk Candan, Aneesha Bhat. Compressed Spatial Hierarchical Bitmap (cSHB) Indexes for Efficiently Processing Spatial Range Query Workloads |
| 1394 | -- | 1405 | Daniel Deutch, Amir Gilad, Yuval Moskovitch. Selective Provenance for Datalog Programs Using Top-K Queries |
| 1406 | -- | 1417 | Yoonjae Park, Jun-Ki Min, Kyuseok Shim. Processing of Probabilistic Skyline Queries Using MapReduce |
| 1418 | -- | 1429 | Xiaofei Zhang, Hong Cheng, Lei Chen 0002. Bonding Vertex Sets Over Distributed Graph: A Betweenness Aware Approach |
| 1430 | -- | 1441 | Yael Amsterdamer, Anna Kukliansky, Tova Milo. A Natural Language Interface for Querying General and Individual Knowledge |
| 1442 | -- | 1453 | Iraklis Psaroudakis, Tobias Scheuer, Norman May, Abdelkader Sellami, Anastasia Ailamaki. Scaling Up Concurrent Main-Memory Column-Store Scans: Towards Adaptive NUMA-aware Data and Task Placement |
| 1454 | -- | 1465 | Gihwan Oh, Sangchul Kim, Sang-Won Lee, Bongki Moon. SQLite Optimization with Phase Change Memory for Mobile Applications |
| 1466 | -- | 1477 | Andrew Crotty, Alex Galakatos, Kayhan Dursun, Tim Kraska, Carsten Binnig, Ugur Çetintemel, Stan Zdonik. An Architecture for Compiling UDF-centric Workflows |
| 1478 | -- | 1489 | Daniel W. Margo, Margo I. Seltzer. A Scalable Distributed Graph Partitioner |
| 1490 | -- | 1501 | Artyom Sharov, Alexander Shraer, Arif Merchant, Murray Stokely. Take me to your leader! Online Optimization of Distributed Storage Configurations |
| 1502 | -- | 1513 | Wenfei Fan, Xin Wang, Yinghui Wu, Jingbo Xu. Association Rules with Graph Patterns |
| 1514 | -- | 1525 | Ben Kimmett, Venkatesh Srinivasan, Alex Thomo. Fuzzy Joins in MapReduce: An Experimental Study |
| 1518 | -- | 1529 | Minsik Cho, Daniel Brand, Rajesh Bordawekar, Ulrich Finkler, Vincent KulandaiSamy, Ruchir Puri. PARADIS: An Efficient Parallel Algorithm for In-place Radix Sort |
| 1530 | -- | 1541 | David Vengerov, Andre Cavalheiro Menck, Mohamed Zaït, Sunil Chakkappen. Join Size Estimation Subject to Filter Conditions |
| 1542 | -- | 1553 | Jingjing Wang, Magdalena Balazinska, Daniel Halperin. Asynchronous and Fault-Tolerant Recursive Datalog Evaluation in Shared-Nothing Engines |
| 1554 | -- | 1565 | Kyriakos Mouratidis, Jilian Zhang, HweeHwa Pang. Maximum Rank Query |
| 1566 | -- | 1577 | Foteini Katsarou, Nikos Ntarmos, Peter Triantafillou. Performance and Scalability of Indexed Subgraph Query Processing Methods |
| 1578 | -- | 1589 | Ying Yang, Niccolo' Meneghetti, Ronny Fehling, Zhen Hua Liu, Oliver Kennedy. Lenses: An On-Demand Approach to ETL |
| 1590 | -- | 1601 | Wenfei Fan, Zhe Fan, Chao Tian, Xin Luna Dong. Keys for Graphs |
| 1602 | -- | 1613 | Ahmed Eldawy, Louai Alarabi, Mohamed F. Mokbel. Spatial Partitioning Techniques in Spatial Hadoop |
| 1606 | -- | 1617 | Tomohiro Manabe, Keishi Tajima. Extracting Logical Hierarchical Structure of HTML Documents Based on Headings |
| 1618 | -- | 1629 | Bilegsaikhan Naidan, Leonid Boytsov, Eric Nyberg. Permutation Search Methods are Efficient, Yet Faster Search is Possible |
| 1630 | -- | 1641 | Niloy Mukherjee, Shasank Chavan, Maria Colgan, Dinesh Das, Mike Gleeson, Sanket Hase, Allison Holloway, Hui Jin, Jesse Kamp, Kartik Kulkarni, Tirthankar Lahiri, Juan Loaiza, Neil MacNaughton, Vineet Marwah, Atrayee Mullick, Andy Witkowski, Jiaqi Yan, Mohamed Zaït. Distributed Architecture of Oracle Database In-memory |
| 1642 | -- | 1653 | Daniel Haas, Jason Ansel, Lydia Gu, Adam Marcus. Argonaut: Macrotask Crowdsourcing for Complex Data Processing |
| 1654 | -- | 1665 | Guozhang Wang, Joel Koshy, Sriram Subramanian, Kartik Paramasivam, Mammad Zadeh, Neha Narkhede, Jun Rao, Jay Kreps, Joe Stein. Building a Replicated Logging System with Apache Kafka |
| 1656 | -- | 1667 | Alessandra Loro, Anja Gruenheid, Donald Kossmann, Damien Profeta, Philippe Beaudequin. Indexing and Selecting Hierarchical Business Logic |
| 1668 | -- | 1679 | Dharma Shukla, Shireesh Thota, Karthik Raman, Madhan Gajendran, Ankur Shah, Sergii Ziuzin, Krishnan Sundaram, Miguel Gonzalez Guajardo, Anna Wawrzyniak, Samer Boshra, Renato Ferreira, Mohamed Nassar, Michael Koltachev, Ji Huang, Sudipta Sengupta, Justin J. Levandoski, David B. Lomet. Schema-Agnostic Indexing with Azure DocumentDB |
| 1680 | -- | 1691 | Eric Boutin, Paul Brett, Xiaoyu Chen, Jaliya Ekanayake, Tao Guan, Anna Korsun, Zhicheng Yin, Nan Zhang, Jingren Zhou. JetScope: Reliable and Interactive Analytics at Cloud Scale |
| 1692 | -- | 1703 | Xueyang Hu, Mingxuan Yuan, Jianguo Yao, Yu Deng, Lei Chen, Qiang Yang, Haibing Guan, Jia Zeng. Differential Privacy in Telco Big Data Platform |
| 1704 | -- | 1715 | Amr El-Helw, Venkatesh Raghavan, Mohamed A. Soliman, George C. Caragea, Zhongxian Gu, Michalis Petropoulos. Optimization of Common Table Expressions in MPP Database Systems |
| 1716 | -- | 1727 | Anil K. Goel, Jeffrey Pound, Nathan Auch, Peter Bumbulis, Scott MacLean, Franz Färber, Francis Gropengießer, Christian Mathis, Thomas Bodner, Wolfgang Lehner. Towards Scalable Real-time Analytics: An Architecture for Scale-out of OLxP Workloads |
| 1729 | -- | 1740 | Tamraparni Dasu, Vladislav Shkapenyuk, Divesh Srivastava, Deborah F. Swayne. FIT to Monitor Feed Quality |
| 1740 | -- | 1751 | Per-Åke Larson, Adrian Birka, Eric N. Hanson, Weiyun Huang, Michal Nowakiewicz, Vassilis Papadimos. Real-Time Analytical Processing with SQL Server |
| 1752 | -- | 1763 | You Wu, Boulos Harb, Jun Yang, Cong Yu. Efficient Evaluation of Object-Centric Exploration Queries for Visualization |
| 1764 | -- | 1775 | Lin Qiao, Yinan Li, Sahil Takiar, Ziyang Liu, Narasimha Veeramreddy, Min Tu, Ying Dai, Issac Buenrostro, Kapil Surlaker, Shirshanka Das, Chavdar Botev. Gobblin: Unifying Data Ingestion for Hadoop |
| 1770 | -- | 1781 | Dinesh Das, Jiaqi Yan, Mohamed Zaït, Satyanarayana R. Valluri, Nirav Vyas, Ramarajan Krishnamachari, Prashant Gaharwar, Jesse Kamp, Niloy Mukherjee. Query Optimization in Oracle 12c Database In-Memory |
| 1782 | -- | 1793 | Todd J. Green, Dan Olteanu, Geoffrey Washburn. Live Programming in the LogicBlox System: A MetaLogiQL Approach |
| 1792 | -- | 1803 | Tyler Akidau, Robert Bradshaw, Craig Chambers, Slava Chernyak, Rafael Fernández-Moctezuma, Reuven Lax, Sam McVeety, Daniel Mills, Frances Perry, Eric Schmidt, Sam Whittle. The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive-Scale, Unbounded, Out-of-Order Data Processing |
| 1804 | -- | 1815 | Avery Ching, Sergey Edunov, Maja Kabiljo, Dionysios Logothetis, Sambavi Muthukrishnan. One Trillion Edges: Graph Processing at Facebook-Scale |
| 1816 | -- | 1827 | Tuomas Pelkonen, Scott Franklin, Paul Cavallaro, Qi Huang, Justin Meza, Justin Teller, Kaushik Veeraraghavan. Gorilla: A Fast, Scalable, In-Memory Time Series Database |
| 1828 | -- | 1839 | Rahul Potharaju, Joseph Chan, Luhui Hu, Cristina Nita-Rotaru, MingShi Wang, Liyuan Zhang, Navendu Jain. ConfSeer: Leveraging Customer Support Knowledge Bases for Automated Misconfiguration Detection |
| 1840 | -- | 1851 | Michael Armbrust, Tathagata Das, Aaron Davidson, Ali Ghodsi, Andrew Or, Josh Rosen, Ion Stoica, Patrick Wendell, Reynold Xin, Matei Zaharia. Scaling Spark in the Real World: Performance and Usability |
| 1844 | -- | 1855 | Majed Sahli, Essam Mansour, Panos Kalnis. StarDB: A Large-Scale DBMS for Strings |
| 1848 | -- | 1859 | Razen Harbi, Ibrahim Abdelaziz, Panos Kalnis, Nikos Mamoulis. Evaluating SPARQL Queries on Massive RDF Datasets |
| 1852 | -- | 1863 | Ngai Meng Kou, Leong Hou U, Nikos Mamoulis, Yuhong Li, Ye Li, Zhiguo Gong. A Topic-based Reviewer Assignment System |
| 1856 | -- | 1867 | Miguel Liroz-Gistau, Reza Akbarinia, Patrick Valduriez. FP-Hadoop: Efficient Execution of Parallel Jobs Over Skewed Data |
| 1860 | -- | 1871 | Thorsten Papenbrock, Tanja Bergmann, Moritz Finke, Jakob Zwiener, Felix Naumann. Data Profiling with Metanome |
| 1864 | -- | 1875 | Arun Kumar, Mona Jalal, Boqun Yan, Jeffrey F. Naughton, Jignesh M. Patel. Demonstration of Santoku: Optimizing Machine Learning over Normalized Data |
| 1868 | -- | 1879 | Boon-Siew Seah, Sourav S. Bhowmick, Aixin Sun. PRISM: Concept-preserving Summarization of Top-K Social Image Search Results |
| 1872 | -- | 1883 | Tobias Muller, Torsten Grust. Provenance for SQL through Abstract Interpretation: Value-less, but Worthwhile |
| 1876 | -- | 1887 | Zhian He, Wai Kit Wong, Ben Kao, David Wai-Lok Cheung, Rongbin Li, Siu-Ming Yiu, Eric Lo. SDB: A Secure Query Processing System with Data Interoperability |
| 1880 | -- | 1891 | Ibrahim Abdelaziz, Razen Harbi, Semih Salihoglu, Panos Kalnis, Nikos Mamoulis. SPARTex: A Vertex-Centric Framework for RDF Data Analytics |
| 1884 | -- | 1895 | Lu Chen, Yunjun Gao, Zhihao Xing, Christian S. Jensen, Gang Chen. I2RS: A Distributed Geo-Textual Image Retrieval and Recommendation System |
| 1888 | -- | 1899 | Damian Bursztyn, François Goasdoué, Ioana Manolescu. Reformulation-based query answering in RDF: alternatives and performance |
| 1892 | -- | 1903 | Marc Bux, Jörgen Brandt, Carsten Lipka, Kamal Hakimzadeh, Jim Dowling, Ulf Leser. SAASFEE: Scalable Scientific Workflow Execution Engine |
| 1896 | -- | 1907 | Ahmed Eldawy, Mohamed F. Mokbel, Christopher Jonathan. A Demonstration of HadoopViz: An Extensible MapReduce System for Visualizing Big Spatial Data |
| 1900 | -- | 1911 | Moria Bergman, Tova Milo, Slava Novgorodov, Wang Chiew Tan. QOCO: A Query Oriented Data Cleaning System with Oracles |
| 1904 | -- | 1915 | Shanshan Ying, Flip Korn, Barna Saha, Divesh Srivastava. TreeScope: Finding Structural Anomalies In Semi-Structured Data |
| 1908 | -- | 1919 | Aaron J. Elmore, Jennie Duggan, Mike Stonebraker, Magdalena Balazinska, Ugur Çetintemel, Vijay Gadepally, J. Heer, Bill Howe, Jeremy Kepner, Tim Kraska, Samuel Madden, David Maier, Timothy G. Mattson, S. Papadopoulos, J. Parkhurst, Nesime Tatbul, Manasi Vartak, Stan Zdonik. A Demonstration of the BigDAWG Polystore System |
| 1912 | -- | 1923 | Kostas Zoumpatianos, Stratos Idreos, Themis Palpanas. RINSE: Interactive Data Series Exploration with ADS+ |
| 1916 | -- | 1927 | Anant P. Bhardwaj, Amol Deshpande, Aaron J. Elmore, David R. Karger, Sam Madden, Aditya G. Parameswaran, Harihar Subramanyam, Eugene Wu 0002, Rebecca Zhang. Collaborative Data Analytics with DataHub |
| 1920 | -- | 1931 | Jaeho Shin, Christopher Ré, Michael J. Cafarella. Mindtagger: A Demonstration of Data Labeling in Knowledge Base Construction |
| 1924 | -- | 1935 | Danai Koutra, Di Jin, Yuanshi Ning, Christos Faloutsos. Perseus: An Interactive Large-Scale Graph Mining and Visualization Tool |
| 1928 | -- | 1939 | Manas Joglekar, Hector Garcia-Molina, Aditya G. Parameswaran. Smart Drill-Down: A New Data Exploration Operator |
| 1932 | -- | 1943 | Curtis E. Dyreson, Sourav S. Bhowmick, Ryan Grapp. Virtual eXist-db: Liberating Hierarchical Queries from the Shackles of Access Path Dependence |
| 1936 | -- | 1947 | Eli Cortez, Philip A. Bernstein, Yeye He, Lev Novik. Annotating Database Schemas to Help Enterprise Search |
| 1940 | -- | 1951 | Nandish Jayaram, Sidharth Goyal, Chengkai Li. VIIQ: Auto-Suggestion Enabled Visual Interface for Interactive Graph Query Formulation |
| 1944 | -- | 1955 | Qingyuan Liu, Eduard C. Dragut, Arjun Mukherjee, Weiyi Meng. FLORIN - A System to Support (Near) Real-Time Applications on User Generated Content on Daily News |
| 1948 | -- | 1959 | Yunyao Li, Elmer Kim, Marc A. Touchette, Ramiya Venkatachalam, Hao Wang. VINERy: A Visual IDE for Information Extraction |
| 1952 | -- | 1963 | Xu Chu, Mourad Ouzzani, John Morcos, Ihab F. Ilyas, Paolo Papotti, Nan Tang 0001, Yin Ye. KATARA: Reliable Data Cleaning with Knowledge Bases and Crowdsourcing |
| 1956 | -- | 1967 | Foteini Alvanaki, Romulo Goncalves, Milena Ivanova, Martin L. Kersten, Kostis Kyzirakos. GIS Navigation Boosted by Column Stores |
| 1960 | -- | 1971 | Patricia C. Arocena, Radu Ciucanu, Boris Glavic, Renée J. Miller. Gain Control over your Integration Evaluations |
| 1964 | -- | 1975 | Yanlei Diao, Kyriaki Dimitriadou, Zhan Li, Wenzhao Liu, Olga Papaemmanouil, Kemi Peng, Liping Peng. AIDE: An Automatic User Navigation System for Interactive Data Exploration |
| 1968 | -- | 1979 | Ahmed M. Aly, Ahmed S. Abdelhamid, Ahmed R. Mahmood, Walid G. Aref, Mohamed S. Hassan, Hazem Elmeleegy, Mourad Ouzzani. A Demonstration of AQWA: Adaptive Query-Workload-Aware Partitioning of Big Spatial Data |
| 1972 | -- | 1983 | Jens Dittrich, Patrick Bender. Janiform Intra-Document Analytics for Reproducible Research |
| 1976 | -- | 1987 | Erich Schubert, Alexander Koos, Tobias Emrich, Andreas Züfle, Klaus Arthur Schmid, Arthur Zimek. A Framework for Clustering Uncertain Data |
| 1980 | -- | 1991 | Nicole Bidoit, Melanie Herschel, Katerina Tzompanaki. EFQ: Why-Not Answer Polynomials in Action |
| 1984 | -- | 1995 | Xiaolan Wang, Mary Feng, Yue Wang, Xin Luna Dong, Alexandra Meliou. Error Diagnosis and Data Profiling with Data X-Ray |
| 1988 | -- | 1999 | Quan Pham, Severin Thaler, Tanu Malik, Ian T. Foster, Boris Glavic. Sharing and Reproducing Database Applications |
| 1992 | -- | 2003 | Marcin Wylot, Philippe Cudré-Mauroux, Paul T. Groth. A Demonstration of TripleProv: Tracking and Querying Provenance over Web Data |
| 1996 | -- | 2007 | Stefano Ortona, Giorgio Orsi, Marcello Buoncristiano, Tim Furche. WADaR: Joint Wrapper and Data Repair |
| 2000 | -- | 2011 | Mangesh Bendre, Bofan Sun, Ding Zhang, Xinyan Zhou, Kevin Chen-Chuan Chang, Aditya Parameswaran. DATASPREAD: Unifying Databases and Spreadsheets |
| 2004 | -- | 2015 | Daniel Haas, Sanjay Krishnan, Jiannan Wang, Michael J. Franklin, Eugene Wu 0002. Wisteria: Nurturing Scalable Data Cleaning Infrastructure |
| 2008 | -- | 2019 | Ashoke S., Jayant R. Haritsa. CODD: A Dataless Approach to Big Data Testing |
| 2012 | -- | 2023 | Sejla Cebiric, François Goasdoué, Ioana Manolescu. Query-Oriented Summarization of RDF Graphs |
| 2016 | -- | 2027 | Yodsawalai Chodpathumwan, Amirhossein Aleyasen, Arash Termehchy, Yizhou Sun. Universal-DB: Towards Representation Independent Graph Analytics |
| 2020 | -- | 2031 | Ahmed R. Mahmood, Ahmed M. Aly, Thamir Qadah, El Kindi Rezig, Anas Daghistani, Amgad Madkour, Ahmed S. Abdelhamid, Mohamed S. Hassan, Walid G. Aref, Saleh Basalamah. Tornado: A Distributed Spatio-Textual Stream Processing System |
| 2024 | -- | 2035 | Andrew Crotty, Alex Galakatos, Emanuel Zgraggen, Carsten Binnig, Tim Kraska. Vizdom: Interactive Analytics through Pen and Touch |
| 2028 | -- | 2039 | Mariano P. Consens, Valeria Fionda, Shahan Khatchadourian, Giuseppe Pirrò. S+EPPs: Construct and Explore Bisimulation Summaries, plus Optimize Navigational Queries; all on Existing SPARQL Systems |
| 2032 | -- | 2043 | Konstantinos Xirogiannopoulos, Udayan Khurana, Amol Deshpande. GraphGen: Exploring Interesting Graphs in Relational Data |
| 2036 | -- | 2047 | Dong Young Yoon, Barzan Mozafari, Douglas P. Brown. DBSeer: Pain-free Database Administration through Workload Intelligence |
| 2040 | -- | 2051 | Arun Kejariwal, Sanjeev Kulkarni, Karthik Ramasamy. Real Time Analytics: Algorithms and Systems |
| 2042 | -- | 2053 | Arijit Khan, Lei Chen. On Uncertain Graphs Modeling and Queries |
| 2044 | -- | 2055 | Xin Luna Dong, Wang Chiew Tan. A Time Machine for Information: Looking Back to Look Forward |
| 2046 | -- | 2057 | Mahashweta Das, Gautam Das. Structured Analytics in Social Media |
| 2048 | -- | 2059 | Jing Gao, Qi Li, Bo Zhao, Wei Fan, Jiawei Han. Truth Discovery and Crowdsourcing Aggregation: A Unified Perspective |
| 2050 | -- | 2061 | Daniel Abadi, Shivnath Babu, Fatma Ozcan, Ippokratis Pandis. Tutorial: SQL-on-Hadoop Systems |
| 2052 | -- | 2063 | Juan Loaiza. Engineering Database Hardware and Software Together |
| 2053 | -- | 2064 | Magdalena Balazinska. Big Data Research: Will Industry Solve all the Problems? |
| 2057 | -- | 2068 | Todd Walter. Big Plateaus of Big Data on the Big Island |
| 2058 | -- | 2069 | Anastasia Ailamaki. Databases and Hardware: The Beginning and Sequel of a Beautiful Friendship |