Journal: PVLDB

Volume 8, Issue 12

0 -- 0Chen Li 0001, Volker Markl. Front Matter
1334 -- 1345Weimo Liu, Md Farhadur Rahman, Saravanan Thirumuruganathan, Nan Zhang 0004, Gautam Das. Aggregate Estimations over Location Based Services
1346 -- 1357Souvik Bhattacherjee, Amit Chavan, Silu Huang, Amol Deshpande, Aditya G. Parameswaran. Principles of Dataset Versioning: Exploring the Recreation/Storage Tradeoff
1358 -- 1369Yeye He, Kris Ganjam, Xu Chu. SEMA-JOIN: Joining Semantically-Related Tables Using Big Table Corpora
1370 -- 1381Sanjay Krishnan, Jiannan Wang, Michael J. Franklin, Ken Goldberg, Tim Kraska. Stale View Cleaning: Getting Fresh Answers from Stale Materialized Views
1382 -- 1393Parth Nagarkar, K. Selçuk Candan, Aneesha Bhat. Compressed Spatial Hierarchical Bitmap (cSHB) Indexes for Efficiently Processing Spatial Range Query Workloads
1394 -- 1405Daniel Deutch, Amir Gilad, Yuval Moskovitch. Selective Provenance for Datalog Programs Using Top-K Queries
1406 -- 1417Yoonjae Park, Jun-Ki Min, Kyuseok Shim. Processing of Probabilistic Skyline Queries Using MapReduce
1418 -- 1429Xiaofei Zhang, Hong Cheng, Lei Chen 0002. Bonding Vertex Sets Over Distributed Graph: A Betweenness Aware Approach
1430 -- 1441Yael Amsterdamer, Anna Kukliansky, Tova Milo. A Natural Language Interface for Querying General and Individual Knowledge
1442 -- 1453Iraklis Psaroudakis, Tobias Scheuer, Norman May, Abdelkader Sellami, Anastasia Ailamaki. Scaling Up Concurrent Main-Memory Column-Store Scans: Towards Adaptive NUMA-aware Data and Task Placement
1454 -- 1465Gihwan Oh, Sangchul Kim, Sang-Won Lee, Bongki Moon. SQLite Optimization with Phase Change Memory for Mobile Applications
1466 -- 1477Andrew Crotty, Alex Galakatos, Kayhan Dursun, Tim Kraska, Carsten Binnig, Ugur Çetintemel, Stan Zdonik. An Architecture for Compiling UDF-centric Workflows
1478 -- 1489Daniel W. Margo, Margo I. Seltzer. A Scalable Distributed Graph Partitioner
1490 -- 1501Artyom Sharov, Alexander Shraer, Arif Merchant, Murray Stokely. Take me to your leader! Online Optimization of Distributed Storage Configurations
1502 -- 1513Wenfei Fan, Xin Wang, Yinghui Wu, Jingbo Xu. Association Rules with Graph Patterns
1514 -- 1525Ben Kimmett, Venkatesh Srinivasan, Alex Thomo. Fuzzy Joins in MapReduce: An Experimental Study
1518 -- 1529Minsik Cho, Daniel Brand, Rajesh Bordawekar, Ulrich Finkler, Vincent KulandaiSamy, Ruchir Puri. PARADIS: An Efficient Parallel Algorithm for In-place Radix Sort
1530 -- 1541David Vengerov, Andre Cavalheiro Menck, Mohamed Zaït, Sunil Chakkappen. Join Size Estimation Subject to Filter Conditions
1542 -- 1553Jingjing Wang, Magdalena Balazinska, Daniel Halperin. Asynchronous and Fault-Tolerant Recursive Datalog Evaluation in Shared-Nothing Engines
1554 -- 1565Kyriakos Mouratidis, Jilian Zhang, HweeHwa Pang. Maximum Rank Query
1566 -- 1577Foteini Katsarou, Nikos Ntarmos, Peter Triantafillou. Performance and Scalability of Indexed Subgraph Query Processing Methods
1578 -- 1589Ying Yang, Niccolo' Meneghetti, Ronny Fehling, Zhen Hua Liu, Oliver Kennedy. Lenses: An On-Demand Approach to ETL
1590 -- 1601Wenfei Fan, Zhe Fan, Chao Tian, Xin Luna Dong. Keys for Graphs
1602 -- 1613Ahmed Eldawy, Louai Alarabi, Mohamed F. Mokbel. Spatial Partitioning Techniques in Spatial Hadoop
1606 -- 1617Tomohiro Manabe, Keishi Tajima. Extracting Logical Hierarchical Structure of HTML Documents Based on Headings
1618 -- 1629Bilegsaikhan Naidan, Leonid Boytsov, Eric Nyberg. Permutation Search Methods are Efficient, Yet Faster Search is Possible
1630 -- 1641Niloy Mukherjee, Shasank Chavan, Maria Colgan, Dinesh Das, Mike Gleeson, Sanket Hase, Allison Holloway, Hui Jin, Jesse Kamp, Kartik Kulkarni, Tirthankar Lahiri, Juan Loaiza, Neil MacNaughton, Vineet Marwah, Atrayee Mullick, Andy Witkowski, Jiaqi Yan, Mohamed Zaït. Distributed Architecture of Oracle Database In-memory
1642 -- 1653Daniel Haas, Jason Ansel, Lydia Gu, Adam Marcus. Argonaut: Macrotask Crowdsourcing for Complex Data Processing
1654 -- 1665Guozhang Wang, Joel Koshy, Sriram Subramanian, Kartik Paramasivam, Mammad Zadeh, Neha Narkhede, Jun Rao, Jay Kreps, Joe Stein. Building a Replicated Logging System with Apache Kafka
1656 -- 1667Alessandra Loro, Anja Gruenheid, Donald Kossmann, Damien Profeta, Philippe Beaudequin. Indexing and Selecting Hierarchical Business Logic
1668 -- 1679Dharma Shukla, Shireesh Thota, Karthik Raman, Madhan Gajendran, Ankur Shah, Sergii Ziuzin, Krishnan Sundaram, Miguel Gonzalez Guajardo, Anna Wawrzyniak, Samer Boshra, Renato Ferreira, Mohamed Nassar, Michael Koltachev, Ji Huang, Sudipta Sengupta, Justin J. Levandoski, David B. Lomet. Schema-Agnostic Indexing with Azure DocumentDB
1680 -- 1691Eric Boutin, Paul Brett, Xiaoyu Chen, Jaliya Ekanayake, Tao Guan, Anna Korsun, Zhicheng Yin, Nan Zhang, Jingren Zhou. JetScope: Reliable and Interactive Analytics at Cloud Scale
1692 -- 1703Xueyang Hu, Mingxuan Yuan, Jianguo Yao, Yu Deng, Lei Chen, Qiang Yang, Haibing Guan, Jia Zeng. Differential Privacy in Telco Big Data Platform
1704 -- 1715Amr El-Helw, Venkatesh Raghavan, Mohamed A. Soliman, George C. Caragea, Zhongxian Gu, Michalis Petropoulos. Optimization of Common Table Expressions in MPP Database Systems
1716 -- 1727Anil K. Goel, Jeffrey Pound, Nathan Auch, Peter Bumbulis, Scott MacLean, Franz Färber, Francis Gropengießer, Christian Mathis, Thomas Bodner, Wolfgang Lehner. Towards Scalable Real-time Analytics: An Architecture for Scale-out of OLxP Workloads
1729 -- 1740Tamraparni Dasu, Vladislav Shkapenyuk, Divesh Srivastava, Deborah F. Swayne. FIT to Monitor Feed Quality
1740 -- 1751Per-Åke Larson, Adrian Birka, Eric N. Hanson, Weiyun Huang, Michal Nowakiewicz, Vassilis Papadimos. Real-Time Analytical Processing with SQL Server
1752 -- 1763You Wu, Boulos Harb, Jun Yang, Cong Yu. Efficient Evaluation of Object-Centric Exploration Queries for Visualization
1764 -- 1775Lin Qiao, Yinan Li, Sahil Takiar, Ziyang Liu, Narasimha Veeramreddy, Min Tu, Ying Dai, Issac Buenrostro, Kapil Surlaker, Shirshanka Das, Chavdar Botev. Gobblin: Unifying Data Ingestion for Hadoop
1770 -- 1781Dinesh Das, Jiaqi Yan, Mohamed Zaït, Satyanarayana R. Valluri, Nirav Vyas, Ramarajan Krishnamachari, Prashant Gaharwar, Jesse Kamp, Niloy Mukherjee. Query Optimization in Oracle 12c Database In-Memory
1782 -- 1793Todd J. Green, Dan Olteanu, Geoffrey Washburn. Live Programming in the LogicBlox System: A MetaLogiQL Approach
1792 -- 1803Tyler Akidau, Robert Bradshaw, Craig Chambers, Slava Chernyak, Rafael Fernández-Moctezuma, Reuven Lax, Sam McVeety, Daniel Mills, Frances Perry, Eric Schmidt, Sam Whittle. The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive-Scale, Unbounded, Out-of-Order Data Processing
1804 -- 1815Avery Ching, Sergey Edunov, Maja Kabiljo, Dionysios Logothetis, Sambavi Muthukrishnan. One Trillion Edges: Graph Processing at Facebook-Scale
1816 -- 1827Tuomas Pelkonen, Scott Franklin, Paul Cavallaro, Qi Huang, Justin Meza, Justin Teller, Kaushik Veeraraghavan. Gorilla: A Fast, Scalable, In-Memory Time Series Database
1828 -- 1839Rahul Potharaju, Joseph Chan, Luhui Hu, Cristina Nita-Rotaru, MingShi Wang, Liyuan Zhang, Navendu Jain. ConfSeer: Leveraging Customer Support Knowledge Bases for Automated Misconfiguration Detection
1840 -- 1851Michael Armbrust, Tathagata Das, Aaron Davidson, Ali Ghodsi, Andrew Or, Josh Rosen, Ion Stoica, Patrick Wendell, Reynold Xin, Matei Zaharia. Scaling Spark in the Real World: Performance and Usability
1844 -- 1855Majed Sahli, Essam Mansour, Panos Kalnis. StarDB: A Large-Scale DBMS for Strings
1848 -- 1859Razen Harbi, Ibrahim Abdelaziz, Panos Kalnis, Nikos Mamoulis. Evaluating SPARQL Queries on Massive RDF Datasets
1852 -- 1863Ngai Meng Kou, Leong Hou U, Nikos Mamoulis, Yuhong Li, Ye Li, Zhiguo Gong. A Topic-based Reviewer Assignment System
1856 -- 1867Miguel Liroz-Gistau, Reza Akbarinia, Patrick Valduriez. FP-Hadoop: Efficient Execution of Parallel Jobs Over Skewed Data
1860 -- 1871Thorsten Papenbrock, Tanja Bergmann, Moritz Finke, Jakob Zwiener, Felix Naumann. Data Profiling with Metanome
1864 -- 1875Arun Kumar, Mona Jalal, Boqun Yan, Jeffrey F. Naughton, Jignesh M. Patel. Demonstration of Santoku: Optimizing Machine Learning over Normalized Data
1868 -- 1879Boon-Siew Seah, Sourav S. Bhowmick, Aixin Sun. PRISM: Concept-preserving Summarization of Top-K Social Image Search Results
1872 -- 1883Tobias Muller, Torsten Grust. Provenance for SQL through Abstract Interpretation: Value-less, but Worthwhile
1876 -- 1887Zhian He, Wai Kit Wong, Ben Kao, David Wai-Lok Cheung, Rongbin Li, Siu-Ming Yiu, Eric Lo. SDB: A Secure Query Processing System with Data Interoperability
1880 -- 1891Ibrahim Abdelaziz, Razen Harbi, Semih Salihoglu, Panos Kalnis, Nikos Mamoulis. SPARTex: A Vertex-Centric Framework for RDF Data Analytics
1884 -- 1895Lu Chen, Yunjun Gao, Zhihao Xing, Christian S. Jensen, Gang Chen. I2RS: A Distributed Geo-Textual Image Retrieval and Recommendation System
1888 -- 1899Damian Bursztyn, François Goasdoué, Ioana Manolescu. Reformulation-based query answering in RDF: alternatives and performance
1892 -- 1903Marc Bux, Jörgen Brandt, Carsten Lipka, Kamal Hakimzadeh, Jim Dowling, Ulf Leser. SAASFEE: Scalable Scientific Workflow Execution Engine
1896 -- 1907Ahmed Eldawy, Mohamed F. Mokbel, Christopher Jonathan. A Demonstration of HadoopViz: An Extensible MapReduce System for Visualizing Big Spatial Data
1900 -- 1911Moria Bergman, Tova Milo, Slava Novgorodov, Wang Chiew Tan. QOCO: A Query Oriented Data Cleaning System with Oracles
1904 -- 1915Shanshan Ying, Flip Korn, Barna Saha, Divesh Srivastava. TreeScope: Finding Structural Anomalies In Semi-Structured Data
1908 -- 1919Aaron J. Elmore, Jennie Duggan, Mike Stonebraker, Magdalena Balazinska, Ugur Çetintemel, Vijay Gadepally, J. Heer, Bill Howe, Jeremy Kepner, Tim Kraska, Samuel Madden, David Maier, Timothy G. Mattson, S. Papadopoulos, J. Parkhurst, Nesime Tatbul, Manasi Vartak, Stan Zdonik. A Demonstration of the BigDAWG Polystore System
1912 -- 1923Kostas Zoumpatianos, Stratos Idreos, Themis Palpanas. RINSE: Interactive Data Series Exploration with ADS+
1916 -- 1927Anant P. Bhardwaj, Amol Deshpande, Aaron J. Elmore, David R. Karger, Sam Madden, Aditya G. Parameswaran, Harihar Subramanyam, Eugene Wu 0002, Rebecca Zhang. Collaborative Data Analytics with DataHub
1920 -- 1931Jaeho Shin, Christopher Ré, Michael J. Cafarella. Mindtagger: A Demonstration of Data Labeling in Knowledge Base Construction
1924 -- 1935Danai Koutra, Di Jin, Yuanshi Ning, Christos Faloutsos. Perseus: An Interactive Large-Scale Graph Mining and Visualization Tool
1928 -- 1939Manas Joglekar, Hector Garcia-Molina, Aditya G. Parameswaran. Smart Drill-Down: A New Data Exploration Operator
1932 -- 1943Curtis E. Dyreson, Sourav S. Bhowmick, Ryan Grapp. Virtual eXist-db: Liberating Hierarchical Queries from the Shackles of Access Path Dependence
1936 -- 1947Eli Cortez, Philip A. Bernstein, Yeye He, Lev Novik. Annotating Database Schemas to Help Enterprise Search
1940 -- 1951Nandish Jayaram, Sidharth Goyal, Chengkai Li. VIIQ: Auto-Suggestion Enabled Visual Interface for Interactive Graph Query Formulation
1944 -- 1955Qingyuan Liu, Eduard C. Dragut, Arjun Mukherjee, Weiyi Meng. FLORIN - A System to Support (Near) Real-Time Applications on User Generated Content on Daily News
1948 -- 1959Yunyao Li, Elmer Kim, Marc A. Touchette, Ramiya Venkatachalam, Hao Wang. VINERy: A Visual IDE for Information Extraction
1952 -- 1963Xu Chu, Mourad Ouzzani, John Morcos, Ihab F. Ilyas, Paolo Papotti, Nan Tang 0001, Yin Ye. KATARA: Reliable Data Cleaning with Knowledge Bases and Crowdsourcing
1956 -- 1967Foteini Alvanaki, Romulo Goncalves, Milena Ivanova, Martin L. Kersten, Kostis Kyzirakos. GIS Navigation Boosted by Column Stores
1960 -- 1971Patricia C. Arocena, Radu Ciucanu, Boris Glavic, Renée J. Miller. Gain Control over your Integration Evaluations
1964 -- 1975Yanlei Diao, Kyriaki Dimitriadou, Zhan Li, Wenzhao Liu, Olga Papaemmanouil, Kemi Peng, Liping Peng. AIDE: An Automatic User Navigation System for Interactive Data Exploration
1968 -- 1979Ahmed M. Aly, Ahmed S. Abdelhamid, Ahmed R. Mahmood, Walid G. Aref, Mohamed S. Hassan, Hazem Elmeleegy, Mourad Ouzzani. A Demonstration of AQWA: Adaptive Query-Workload-Aware Partitioning of Big Spatial Data
1972 -- 1983Jens Dittrich, Patrick Bender. Janiform Intra-Document Analytics for Reproducible Research
1976 -- 1987Erich Schubert, Alexander Koos, Tobias Emrich, Andreas Züfle, Klaus Arthur Schmid, Arthur Zimek. A Framework for Clustering Uncertain Data
1980 -- 1991Nicole Bidoit, Melanie Herschel, Katerina Tzompanaki. EFQ: Why-Not Answer Polynomials in Action
1984 -- 1995Xiaolan Wang, Mary Feng, Yue Wang, Xin Luna Dong, Alexandra Meliou. Error Diagnosis and Data Profiling with Data X-Ray
1988 -- 1999Quan Pham, Severin Thaler, Tanu Malik, Ian T. Foster, Boris Glavic. Sharing and Reproducing Database Applications
1992 -- 2003Marcin Wylot, Philippe Cudré-Mauroux, Paul T. Groth. A Demonstration of TripleProv: Tracking and Querying Provenance over Web Data
1996 -- 2007Stefano Ortona, Giorgio Orsi, Marcello Buoncristiano, Tim Furche. WADaR: Joint Wrapper and Data Repair
2000 -- 2011Mangesh Bendre, Bofan Sun, Ding Zhang, Xinyan Zhou, Kevin Chen-Chuan Chang, Aditya Parameswaran. DATASPREAD: Unifying Databases and Spreadsheets
2004 -- 2015Daniel Haas, Sanjay Krishnan, Jiannan Wang, Michael J. Franklin, Eugene Wu 0002. Wisteria: Nurturing Scalable Data Cleaning Infrastructure
2008 -- 2019Ashoke S., Jayant R. Haritsa. CODD: A Dataless Approach to Big Data Testing
2012 -- 2023Sejla Cebiric, François Goasdoué, Ioana Manolescu. Query-Oriented Summarization of RDF Graphs
2016 -- 2027Yodsawalai Chodpathumwan, Amirhossein Aleyasen, Arash Termehchy, Yizhou Sun. Universal-DB: Towards Representation Independent Graph Analytics
2020 -- 2031Ahmed R. Mahmood, Ahmed M. Aly, Thamir Qadah, El Kindi Rezig, Anas Daghistani, Amgad Madkour, Ahmed S. Abdelhamid, Mohamed S. Hassan, Walid G. Aref, Saleh Basalamah. Tornado: A Distributed Spatio-Textual Stream Processing System
2024 -- 2035Andrew Crotty, Alex Galakatos, Emanuel Zgraggen, Carsten Binnig, Tim Kraska. Vizdom: Interactive Analytics through Pen and Touch
2028 -- 2039Mariano P. Consens, Valeria Fionda, Shahan Khatchadourian, Giuseppe Pirrò. S+EPPs: Construct and Explore Bisimulation Summaries, plus Optimize Navigational Queries; all on Existing SPARQL Systems
2032 -- 2043Konstantinos Xirogiannopoulos, Udayan Khurana, Amol Deshpande. GraphGen: Exploring Interesting Graphs in Relational Data
2036 -- 2047Dong Young Yoon, Barzan Mozafari, Douglas P. Brown. DBSeer: Pain-free Database Administration through Workload Intelligence
2040 -- 2051Arun Kejariwal, Sanjeev Kulkarni, Karthik Ramasamy. Real Time Analytics: Algorithms and Systems
2042 -- 2053Arijit Khan, Lei Chen. On Uncertain Graphs Modeling and Queries
2044 -- 2055Xin Luna Dong, Wang Chiew Tan. A Time Machine for Information: Looking Back to Look Forward
2046 -- 2057Mahashweta Das, Gautam Das. Structured Analytics in Social Media
2048 -- 2059Jing Gao, Qi Li, Bo Zhao, Wei Fan, Jiawei Han. Truth Discovery and Crowdsourcing Aggregation: A Unified Perspective
2050 -- 2061Daniel Abadi, Shivnath Babu, Fatma Ozcan, Ippokratis Pandis. Tutorial: SQL-on-Hadoop Systems
2052 -- 2063Juan Loaiza. Engineering Database Hardware and Software Together
2053 -- 2064Magdalena Balazinska. Big Data Research: Will Industry Solve all the Problems?
2057 -- 2068Todd Walter. Big Plateaus of Big Data on the Big Island
2058 -- 2069Anastasia Ailamaki. Databases and Hardware: The Beginning and Sequel of a Beautiful Friendship