Journal: PVLDB

Volume 6, Issue 9

0 -- 0Yanlei Diao, Thomas Neumann 0001. Front Matter
601 -- 612Flip Korn, Barna Saha, Divesh Srivastava, Shanshan Ying. On Repairing Structural Problems In Semi-structured Data
613 -- 624Faraz Makari Manshadi, Baruch Awerbuch, Rainer Gemula, Rohit Khandekar, Julián Mestre, Mauro Sozio. A Distributed Algorithm for Large-Scale Generalized Matching
625 -- 636Floris Geerts, Giansalvatore Mecca, Paolo Papotti, Donatello Santoro. The LLUNATIC Data-Cleaning Framework
637 -- 648Iraklis Psaroudakis, Manos Athanassoulis, Anastasia Ailamaki. Sharing Data and Work Across Concurrent Analytical Queries
649 -- 660Haichuan Shang, Masaru Kitsuregawa. Skyline Operator on Anti-correlated Distributions
661 -- 672Hatem A. Mahmoud, Faisal Nawab, Alexander Pucher, Divyakant Agrawal, Amr El Abbadi. Low-Latency Multi-Datacenter Databases using Replicated Commit
673 -- 684Yun Chi, Hakan Hacigümüs, Wang-Pin Hsiung, Jeffrey F. Naughton. Distribution-Based Query Scheduling
685 -- 696Wenfei Fan, Floris Geerts, Frank Neven. Making Queries Tractable on Big Data with Preprocessing
697 -- 708Haim Kaplan, Ilia Lotosh, Tova Milo, Slava Novgorodov. Answering Planning Queries with the Crowd
709 -- 720Max Heimel, Michael Saecker, Holger Pirk, Stefan Manegold, Volker Markl. Hardware-Oblivious Parallelism for In-Memory Column-Stores
721 -- 732Risi Thonangi, Jun Yang 0001. Permuting Data on Random-Access Block Storage
733 -- 744Radu Stoica, Anastasia Ailamaki. Improving Flash Write Performance by Using Update Frequency
745 -- 756Lu Li, Chee Yong Chan. Efficient Indexing for Diverse Query Results
757 -- 768Chen Jason Zhang, Lei Chen, H. V. Jagadish, Caleb Chen Cao. Reducing Uncertainty of Schema Matching via Crowdsourcing
769 -- 780Bin Yang 0002, Chenjuan Guo, Christian S. Jensen. Travel Cost Inference from Sparse, Spatio-Temporally Correlated Time Series Using Markov Models

Volume 6, Issue 8

0 -- 0Sihem Amer-Yahia, Stefan Manegold. Front Matter
541 -- 552Xin Liu, Kenneth Salem. Hybrid Storage Management for Database Systems
553 -- 564Eugene Wu 0002, Samuel Madden. Scorpion: Explaining Away Outliers in Aggregate Queries
565 -- 576Rajeev Gupta, Krithi Ramamritham, Mukesh K. Mohania. Ratio Threshold Queries over Distributed Data Sources
577 -- 588Ting Deng, Wenfei Fan. On the Complexity of Query Result Diversification
589 -- 600Sourav Dutta, Ankur Narang, Suman K. Bera. Streaming Quotient Filter: A Near Optimal Approximate Duplicate Detection Approach for Data Streams

Volume 6, Issue 7

0 -- 0Johannes Gehrke, Nikos Mamoulis. Front Matter
493 -- 504Weiguo Zheng, Lei Zou, Yansong Feng, Lei Chen 0002, Dongyan Zhao. Efficient SimRank-based Similarity Join Over Large Graphs
505 -- 516Guimei Liu, Andre Suchitra, Limsoon Wong. A Performance Study of Three Disk-based Structures for Indexing and Querying Frequent Itemsets
517 -- 528Pingpeng Yuan, Pu Liu, Buwen Wu, Hai Jin, Wenya Zhang, Ling Liu. TripleBit: a Fast and Compact System for Large Scale RDF Data
529 -- 540Sumeet Bajaj, Radu Sion. CorrectDB: SQL Engine with Practical Query Authentication

Volume 6, Issue 6

0 -- 0Jayant R. Haritsa, Jens Teubner. Front Matter
349 -- 360Steven Euijong Whang, Peter Lofgren, Hector Garcia-Molina. Question Selection for Crowd Entity Resolution
361 -- 372Alekh Jindal, Endre Palatinus, Vladimir Pavlov, Jens Dittrich. A Comparison of Knives for Bread Slicing
373 -- 384Chuan Xiao, Jianbin Qin, Wei Wang 0011, Yoshiharu Ishikawa, Koji Tsuda, Kunihiko Sadakane. Efficient Error-tolerant Query Autocompletion
385 -- 396Alexander Shraer, Maxim Gurevich, Marcus Fontoura, Vanja Josifovski. Top-k Publish-Subscribe for Social Annotation of News
397 -- 408Phokion G. Kolaitis, Enela Pema, Wang Chiew Tan. Efficient Querying of Inconsistent Databases with Binary Integer Programming
409 -- 420Aristides Gionis, Flavio Junqueira, Vincent Leroy, Marco Serafini, Ingmar Weber. Piggybacking on Social Networks
421 -- 432Marco D. Adelfio, Hanan Samet. Schema Extraction for Tabular Data on the Web
433 -- 444Ahmet Erdem Sariyüce, Bugra Gedik, Gabriela Jacques-Silva, Kun-Lung Wu, Ümit V. Çatalyürek. Streaming Algorithms for k-core Decomposition
444 -- 456Oktie Hassanzadeh, Ken Q. Pu, Soheil Hassas Yeganeh, Renée J. Miller, Lucian Popa, Mauricio A. Hernández, Howard Ho. Discovering Linkage Points over Web Data
457 -- 468Ada Wai-Chee Fu, Huanhuan Wu, James Cheng, Raymond Chi-Wing Wong. IS-LABEL: an Independent-Set based Labeling Scheme for Point-to-Point Distance Querying
469 -- 480Thanh T. L. Tran, Yanlei Diao, Charles A. Sutton, Anna Liu. Supporting User-Defined Functions on Uncertain Data
481 -- 492Fanwei Zhu, Yuan Fang, Kevin Chen-Chuan Chang, Jing Ying. Incremental and Accuracy-Aware Personalized PageRank through Scheduled Approximation

Volume 6, Issue 5

0 -- 0Dan Olteanu, Divesh Srivastava. Front Matter
289 -- 300Stephen Tu, M. Frans Kaashoek, Samuel Madden, Nickolai Zeldovich. Processing Analytical Queries over Encrypted Data
301 -- 312Georgios Kellaris, Stavros Papadopoulos. Practical Differential Privacy via Grouping and Smoothing
313 -- 324Yupeng Fu, Raghav Kaushik, Ravishankar Ramamurthy. On Scaling Up Sensitive Data Auditing
325 -- 336Maheswaran Sathiamoorthy, Megasthenis Asteris, Dimitris S. Papailiopoulos, Alexandros G. Dimakis, Ramkumar Vadali, Scott Chen, Dhruba Borthakur. XORing Elephants: Novel Erasure Codes for Big Data
337 -- 348Steffen Rendle. Scaling Factorization Machines to Relational Data

Volume 6, Issue 4

0 -- 0Ashraf Aboulnaga, Chee Yong Chan. Front Matter
229 -- 240Milad Eftekhar, Nick Koudas. Partitioning and Ranking Tagged Data Sources
241 -- 252Bin Cao, Antonio Badia. Efficient Implementation of Generalized Quantification in Relational Query Languages
253 -- 264Rui Liu, Ashraf Aboulnaga, Kenneth Salem. DAX: A Widely Distributed Multi-tenant Storage Service for DBMS Hosting
265 -- 276Kai Zeng, Jiacheng Yang, Haixun Wang, Bin Shao, Zhongyuan Wang. A Distributed Graph Engine for Web Scale RDF Data
277 -- 288Foto N. Afrati, Anish Das Sarma, Semih Salihoglu, Jeffrey D. Ullman. Upper and Lower Bounds on the Cost of a Map-Reduce Computation

Volume 6, Issue 3

0 -- 0Ada Wai-Chee Fu, Alon Y. Halevy. Front Matter
157 -- 168Ye Zhang, Wai Kit Wong, Siu-Ming Yiu, Nikos Mamoulis, David W. Cheung. Lightweight Privacy-Preserving Peer-to-Peer Data Integration
169 -- 180Yang Li, Pegah Kamousi, Fangqiu Han, Shengqi Yang, Xifeng Yan, Subhash Suri. Memory Efficient Minimum Substring Partitioning
181 -- 192Arijit Khan, Yinghui Wu, Charu C. Aggarwal, Xifeng Yan. NeMa: Fast Graph Search with Label Similarity
193 -- 204Xika Lin, Abhishek Mukherji, Elke A. Rundensteiner, Carolina Ruiz, Matthew O. Ward. PARAS: A Parameter Space Framework for Online Association Mining
205 -- 216Zhepeng Yan, Nan Zheng, Zachary G. Ives, Partha Pratim Talukdar, Cong Yu. Actively Soliciting Feedback for Query Answers in Keyword Search-Based Data Integration
217 -- 228Lisi Chen, Gao Cong, Christian S. Jensen, Dingming Wu. Spatial Keyword Query Processing: An Experimental Evaluation

Volume 6, Issue 2

0 -- 0Peer Kröger, Stratis Viglas. Front Matter
37 -- 48Xin Luna Dong, Barna Saha, Divesh Srivastava. Less is More: Selecting Sources Wisely for Integration
49 -- 60Wenchao Zhou, Suyog Mapara, Yiqing Ren, Yang Li, Andreas Haeberlen, Zachary G. Ives, Boon Thau Loo, Micah Sherr. Distributed Time-aware Provenance
61 -- 72Diego Calvanese, Giuseppe De Giacomo, Maurizio Lenzerini, Moshe Y. Vardi. Query Processing under GLAV Mappings for Relational and Graph Databases
73 -- 84Kyriakos Mouratidis, HweeHwa Pang. Computing Immutable Regions for Subspace Top-k Queries
85 -- 96Feng Zhao, Anthony K. H. Tung. Large Scale Cohesive Subgraphs Discovery for Social Network Visual Analysis
97 -- 108Xian Li, Xin Luna Dong, Kenneth Lyons, Weiyi Meng, Divesh Srivastava. Truth Finding on the Deep Web: Is the Problem Solved?
109 -- 120Adam Marcus 0002, David R. Karger, Samuel Madden, Rob Miller, Sewoong Oh. Counting with the Crowd
109 -- 120Tao Zou, Ronan Le Bras, Marcos Antonio Vaz Salles, Alan J. Demers, Johannes Gehrke. ClouDiA: A Deployment Advisor for Public Clouds
133 -- 144Jinsoo Lee, Wook-Shin Han, Romans Kasperovics, Jeong-Hoon Lee. An In-depth Comparison of Subgraph Isomorphism Algorithms in Graph Databases
145 -- 156Kun Ren, Alexander Thomson, Daniel J. Abadi. Lightweight Locking for Main Memory Database Systems

Volume 6, Issue 14

1642 -- 1653Lei Zhang 0007, Thanh Tran, Achim Rettinger. Probabilistic Query Rewriting for Efficient and Effective Keyword Search on Graph Data
1654 -- 1665Martin Schäler, Alexander Grebhahn, Reimar Schröter, Sandro Schulze, Veit Köppen, Gunter Saake. QuEval: Beyond high-dimensional indexing a la carte
1666 -- 1677Yuhong Li, Leong Hou U, Man Lung Yiu, Zhiguo Gong. Discovering Longest-lasting Correlation in Sequence Databases
1678 -- 1689Adrian Daniel Popescu, Andrey Balmin, Vuk Ercegovac, Anastasia Ailamaki. PREDIcT: Towards Predicting the Runtime of Large Scale Iterative Analytics
1690 -- 1701Xiaohan Zhao, Adelbert Chang, Atish Das Sarma, Haitao Zheng, Ben Y. Zhao. On the Embeddability of Random Walk Distances
1702 -- 1713Tobias Mühlbauer, Wolf Rödiger, Robert Seilbeck, Angelika Reiser, Alfons Kemper, Thomas Neumann 0001. Instant Loading for Main Memory Databases
1714 -- 1725Karolina Alexiou, Donald Kossmann, Per-Ake Larson. Adaptive Range Filters for Cold Data: Avoiding Trips to Siberia
1726 -- 1737Badrish Chandramouli, Jonathan Goldstein, Abdul Quamar. Scalable Progressive Analytics on Big Data in the Cloud
1738 -- 1749Peter Ogden, David Thomas, Peter Pietzuch. Scalable XML Query Processing using Parallel Pushdown Transducers
1750 -- 1761Yin Huai, Siyuan Ma, Rubao Lee, Owen O'Malley, Xiaodong Zhang 0001. Understanding Insights into the Basic Structure and Essential Issues of Table Placement Methods in Clusters
1762 -- 1773Davide Mottin, Alice Marascu, Senjuti Basu Roy, Gautam Das, Themis Palpanas, Yannis Velegrakis. A Probabilistic Optimization Framework for the Empty-Answer Problem
1774 -- 1785Yinghui Wu, Shengqi Yang, Mudhakar Srivatsa, Arun Iyengar, Xifeng Yan. Summarizing Answer Graphs Induced by Keyword Queries
1786 -- 1797Huizhong Duan, ChengXiang Zhai, Jinxing Cheng, Abhishek Gattani. Supporting Keyword Search in Product Database: A Probabilistic Approach
1798 -- 1809Supriya Nirkhiwale, Alin Dobra, Christopher M. Jermaine. A Sampling Algebra for Aggregate Estimation
1810 -- 1821Maximilian Dylla, Iris Miliaraki, Martin Theobald. A Temporal-Probabilistic Database Model for Information Extraction
1822 -- 1833Pit Fender, Guido Moerkotte. Counter Strike: Generic Top-Down Join Enumeration for Hypergraphs
1834 -- 1845Daniar Achakeev, Bernhard Seeger. Efficient Bulk Updates on Multiversion B-trees
1846 -- 1857Hotham Altwaijry, Dmitri V. Kalashnikov, Sharad Mehrotra. Query-Driven Approach to Entity Resolution
1858 -- 1869Jaroslaw Szlichta, Parke Godfrey, Jarek Gryz, Calisto Zuzarte. Expressiveness and Complexity of Order Dependencies
1870 -- 1881A. Pavan, Kanat Tangwongsan, Srikanta Tirthapura, Kun-Lung Wu. Counting and Sampling Triangles from a Graph Stream
1882 -- 1893Benjamin Sowell, Marcos Antonio Vaz Salles, Tuan Cao, Alan J. Demers, Johannes Gehrke. An Experimental Analysis of Iterated Spatial Joins in Main Memory
1894 -- 1905Kisung Lee, Ling Liu. Scaling Queries over Big RDF Graphs with Semantic Hash Partitioning
1906 -- 1917Jiwon Seo, JongSoo Park, Jaeho Shin, Monica S. Lam. Distributed SociaLite: A Datalog-Based Language for Large-Scale Graph Analysis
1918 -- 1929Mohamed Sarwat, Sameh Elnikety, Yuxiong He, Mohamed F. Mokbel. Horton+: A Distributed System for Processing Declarative Reachability Queries over Partitioned Graphs
1930 -- 1941Narayanan Sundaram, Aizana Turmukhametova, Nadathur Satish, Todd Mostak, Piotr Indyk, Samuel Madden, Pradeep Dubey. Streaming Similarity Search over one Billion Tweets using Parallel Locality-Sensitive Hashing
1942 -- 1953Justin DeBrabant, Andrew Pavlo, Stephen Tu, Michael Stonebraker, Stanley B. Zdonik. Anti-Caching: A New Approach to Database Management System Architecture
1954 -- 1965Wahbeh H. Qardaji, Weining Yang, Ninghui Li. Understanding Hierarchical Methods for Differentially Private Histograms
1966 -- 1977Rui Li, Shengjie Wang, Kevin Chen-Chuan Chang. Towards Social Data Platform: Automatic Topic-focused Monitor for Twitter Stream
1978 -- 1989Ruoming Jin, Guan Wang. Simple, Fast, and Scalable Reachability Oracle
1990 -- 2001Nurzhan Bakibayev, Tomás Kociský, Dan Olteanu, Jakub Zavodny. Aggregation and Ordering in Factorised Databases
2002 -- 2013Yoonjae Park, Jun-Ki Min, Kyuseok Shim. Parallel Computation of Skyline and Reverse Skyline Queries Using MapReduce
2014 -- 2025Wenlei Xie, Guozhang Wang, David Bindel, Alan J. Demers, Johannes Gehrke. Fast Iterative Graph Computation with Block Updates

Volume 6, Issue 13

0 -- 0Peer Kröger, Stratis Viglas. Front Matter
1462 -- 1473Gonçalo Simões, Helena Galhardas, Luis Gravano. When Speed Has a Price: Fast Information Extraction Using Approximate Algorithms
1474 -- 1485Craig Chasseur, Jignesh M. Patel. Design and Evaluation of Storage Organizations for Read-Optimized Main Memory Databases
1486 -- 1497Luying Chen, Stefano Ortona, Giorgio Orsi, Michael Benedikt. ggregating Semantic Annotators
1498 -- 1509Xu Chu, Ihab F. Ilyas, Paolo Papotti. Discovering Denial Constraints
1510 -- 1521Wenfei Fan, Xin Wang, Yinghui Wu. Diversified Top-k Graph Pattern Matching
1522 -- 1533Weixiong Rao, Lei Chen 0002, Pan Hui, Sasu Tarkoma. Bitlist: New Full-text Index for Low Space Cost and Efficient Keyword Search
1534 -- 1545Sebastian Wandelt, Johannes Starlinger, Marc Bux, Ulf Leser. RCSI: Scalable similarity search in thousand(s) of genomes
1546 -- 1557Yufei Tao, Xiaocheng Hu, Dong-Wan Choi, Chin-Wan Chung. Approximate MaxRS in Spatial Databases
1558 -- 1569Benny Kimelfeld, Jan Vondrák, David P. Woodruff. Multi-Tuple Deletion Propagation: Approximations and Complexity
1570 -- 1581Badrish Chandramouli, Suman Nath, Wenchao Zhou. Supporting Distributed Feed-Following Apps over Edge Devices
1582 -- 1593Saravanan Thirumuruganathan, Nan Zhang 0004, Gautam Das. Rank Discovery From Web Databases
1594 -- 1605Theodoros Rekatsinas, Amol Deshpande, Ashwin Machanavajjhala. A SPARSI: Partitioning Sensitive Data amongst Multiple Adversaries
1606 -- 1617Dong Deng, Yu Jiang, Guoliang Li, Jian Li, Cong Yu. Scalable Column Concept Determination for Web Tables Using Large Knowledge Bases
1618 -- 1629Xin Huang, Hong Cheng, Rong-Hua Li, Lu Qin, Jeffrey Xu Yu. Top-K Structural Diversity Search in Large Networks
1630 -- 1641Federico Cavalieri, Alessandro Solimando, Giovanna Guerrini. Synthetising Changes in XML Documents as PULs

Volume 6, Issue 12

0 -- 0Dimitrios Gunopoulos, Letizia Tanca, Jun Yang 0001. Front Matter
1198 -- 1201Andy Yuan Xue, Rui Zhang, Yu Zheng, Xing Xie, Jianhui Yu, Yong Tang, Sapna Jain, Jingren Zhou. DesTeller: A System for Destination Prediction Based on Trajectories with Privacy Protection
1202 -- 1205Zhe Chen, Mike Cafarella, Jun Chen, Daniel Prevo, Junfeng Zhuang. Senbazuru: A Prototype Spreadsheet Database Management System
1206 -- 1209Grégory Smits, Olivier Pivert, Thomas Girault. ReqFlex: Fuzzy Queries for Everyone
1210 -- 1213Martin Kaufmann, Panagiotis Vagenas, Peter M. Fischer, Donald Kossmann, Franz Färber. Comprehensive and Interactive Temporal Query Processing with SAP HANA
1214 -- 1217Torsten Grust, Nils Schweinsberg, Alexander Ulrich. Functions Are Data Too (Defunctionalization for PL/SQL)
1218 -- 1221Amr Ebaid, Ahmed K. Elmagarmid, Ihab F. Ilyas, Mourad Ouzzani, Jorge-Arnulfo Quiané-Ruiz, Nan Tang 0001, Si Yin. NADEEF: A Generalized Data Cleaning System
1222 -- 1225Sonia Bergamaschi, Francesco Guerra, Matteo Interlandi, Raquel Trillo Lado, Yannis Velegrakis. QUEST: A Keyword Search System for Relational Data based on Semantic and Machine Learning Techniques
1226 -- 1229Kenneth Bøgh, Anders Skovsgaard, Christian S. Jensen. GroupFinder: A New Approach to Top-K Point-of-Interest Group Retrieval
1230 -- 1233Ahmed Eldawy, Mohamed F. Mokbel. A Demonstration of SpatialHadoop: An Efficient MapReduce Framework for Spatial Data
1234 -- 1237Mehmet Ali Abbasoglu, Bugra Gedik, Hakan Ferhatosmanoglu. Aggregate Profile Clustering for Telco Analytics
1238 -- 1241Luying Chen, Stefano Ortona, Giorgio Orsi, Michael Benedikt. ROSeAnn: Reconciling Opinions of Semantic Annotators
1242 -- 1245Mohamed Sarwat, James Avery, Mohamed F. Mokbel. A RecDB in Action: Recommendation Made Easy in Relational Databases
1246 -- 1249Marina Drosou, Evaggelia Pitoura. POIKILO: A Tool for Evaluating the Results of Diversification Models and Algorithms
1250 -- 1253Yael Amsterdamer, Yael Grossman, Tova Milo, Pierre Senellart. CrowdMiner: Mining association rules from the crowd
1254 -- 1257Chen Chen, Hongzhi Yin, Junjie Yao, Bin Cui. TeRec: A Temporal Recommender System Over Tweet Stream
1258 -- 1261Alexander Shkapsky, Kai Zeng, Carlo Zaniolo. Graph Queries in a Next-Generation Datalog System
1262 -- 1265Abdeltawab M. A. Hendawi, Jie Bao 0003, Mohamed F. Mokbel. iRoad: A Framework For Scalable Predictive Query Processing On Road Networks
1266 -- 1269Mithila Nagendra, K. Selçuk Candan. SkySuite: A Framework of Skyline-Join Operators for Static and Stream Environments
1270 -- 1273Jianlong Zhong, Bingsheng He. Parallel Graph Processing on Graphics Processors Made Easy
1274 -- 1277Stefan Richter 0007, Jens Dittrich, Stefan Schuh, Tobias Frey. Mosquito: Another One Bites the Data Upload STream
1278 -- 1281Sergey Hardock, Ilia Petrov, Robert Gottstein, Alejandro P. Buchmann. NoFTL: Database Systems on FTL-less Flash Storage
1282 -- 1285Dimitrios Kotsakos, Panos Sakkos, Vana Kalogeraki, Dimitrios Gunopulos. SmartMonitor: Using Smart Devices to Perform Structural Health Monitoring
1286 -- 1289Yagiz Karæz, Milena Ivanova, Ying Zhang, Stefan Manegold, Martin L. Kersten. Lazy ETL in Action: ETL Technology Dates Scientific Data
1290 -- 1293Niv Dayan, Martin Kjær Svendsen, Matias Bjørling, Philippe Bonnet, Luc Bouganim. EagleTree: Exploring the Design Space of SSD-Based Algorithms
1294 -- 1297Saket Sathe, Arthur Oviedo, Dipanjan Chakraborty 0001, Karl Aberer. EnviroMeter: A Platform for Querying Community-Sensed Data
1298 -- 1301Alper Okcan, Mirek Riedewald, Biswanath Panda, Daniel Fink. Scolopax: Exploratory Analysis of Scientific Data
1302 -- 1305Daniel Deutch, Yuval Moskovitch, Val Tannen. PROPOLIS: Provisioned Analysis of Data-Centric Processes
1306 -- 1309Pradap Konda, Arun Kumar, Christopher Re, Vaishnavi Sashikanth. Feature Selection in Enterprise Analytics: A Demonstration using an R-based Data Analytics System
1310 -- 1313Mohammedreza Najafi, Mohammad Sadoghi, Hans-Arno Jacobsen. Flexible Query Processor on FPGAs
1314 -- 1317Cristina Civili, Marco Console, Giuseppe De Giacomo, Domenico Lembo, Maurizio Lenzerini, Lorenzo Lepore, Riccardo Mancini, Antonella Poggi, Riccardo Rosati, Marco Ruzzi, Valerio Santarelli, Domenico Fabio Savo. MASTRO STUDIO: Managing Ontology-Based Data Access applications
1318 -- 1321David Fuhry, Yang Zhang, Venu Satuluri, Arnab Nandi, Srinivasan Parthasarathy. PLASMA-HD: Probing the LAttice Structure and MAkeup of High-dimensional Data
1322 -- 1325Matthew Moyers, Emad Soroush, Spencer Wallace, K. Simon Krughoff, Jake VanderPlas, Magdalena Balazinska, Andrew Connolly. A Demonstration of Iterative Parallel Array Processing in Support of Telescope Image Analysis
1326 -- 1329Hamed Abdelhaq, Christian Sengstock, Michael Gertz. EvenTweet: Online Localized Event Detection from Twitter
1330 -- 1333Hamid Mousavi, Shi Gao, Carlo Zaniolo. IBminer: A Text Mining Tool for Constructing and Populating InfoBox Databases and Knowledge Bases
1334 -- 1337Nicholas L. Farnan, Adam J. Lee, Panos Chyrsanthis, Ting Yu. PAQO: A Preference-Aware Query Optimizer for PostgreSQL
1338 -- 1341Suvarna Bothe, Panagiotis Karras, Akrivi Vlachou. eSkyline: Processing Skyline Queries over Encrypted Data
1342 -- 1345Lilong Jiang, Michael Mandel, Arnab Nandi. GestureQuery: A Multitouch Database Query Interface
1346 -- 1349Di Yang, Kaiyu Zhao, Maryam Hasan, Hanyuan Lu, Elke A. Rundensteiner, Matthew O. Ward. Mining and Linking Patterns across Live Data Streams and Stream Archives
1350 -- 1353Hanan Samet, Marco D. Adelfio, Brendan C. Fruin, Michael D. Lieberman, Jagan Sankaranarayanan. PhotoStand: A Map Query Interface for a Database of News Photos
1354 -- 1357K. Ashwin Kumar, Jonathan Gluck, Amol Deshpande, Jimmy Lin. Hone: "Scaling Down" Hadoop on Shared-Memory Systems
1358 -- 1361Dolan Antenucci, Erdong Li, Shaobo Liu, Bochun Zhang, Michael J. Cafarella, Christopher Re. Ringtail: A Generalized Nowcasting System
1362 -- 1365Min Xie 0002, Laks V. S. Lakshmanan, Peter T. Wood. IPS: An Interactive Package Configuration System for Trip Planning
1366 -- 1369Jingbo Zhou, Anthony K. H. Tung, Wei Wu, Wee Siong Ng. R2-D2: a System to Support Probabilistic Path Prediction in Dynamic Environments via "Semi-Lazy" Learning
1370 -- 1373Byung-Gon Chun, Tyson Condie, Carlo Curino, Raghu Ramakrishnan, Russell Sears, Markus Weimer. REEF: Retainable Evaluator Execution Framework
1374 -- 1377Shuhao Zhang, Jiong He, Bingsheng He, Mian Lu. OmniDB: Towards Portable and Efficient Query Processing on Parallel CPU/GPU Architectures
1378 -- 1381Ognjen Savkovic, Mirza Paramita, Alex Tomasi, Werner Nutt. Complete Approximations of Incomplete Queries
1382 -- 1385Georgia Koutrika, Qian Lin, Jerry Liu. User Analytics with UbeOne: Insights into Web Printing
1386 -- 1389Ivo Santos, Marcel Tilly, Badrish Chandramouli, Jonathan Goldstein. DiAl: Distributed Streaming Analytics Anywhere, Anytime
1390 -- 1391Rada Chirkova, Jun Yang. Big and Useful: What's in the Data for Me?
1392 -- 1397Tomás Bartos. Universal Indexing of Arbitrary Similarity Models
1398 -- 1403Sebastian Breß. Why it is time for a HyPE: A Hybrid Query Processing Engine for Efficient GPU Coprocessing in DBMS
1404 -- 1409Alireza Rezaei Mahdiraji. Database Support for Unstructured Meshes
1410 -- 1415Aastha Madaan. Domain Specific Multi-stage Query Language for Medical Document Repositories
1416 -- 1421Io Taxidou. Realtime Analysis of Information Diffusion in Social Media
1422 -- 1427Luca Bonomi. Mining Frequent Patterns with Differential Privacy
1428 -- 1433Anett Hoppe. Automatic ontology-based User Profile Learning from heterogeneous Web Resources in a Big Data Context
1434 -- 1439Akon Dey. Scalable Transactions across Heterogeneous NoSQL Key-Value Data Stores
1440 -- 1443Nhung Ngo. Getting Unique Solution in Data Exchange
1444 -- 1449Martin Kaufmann. Storing and Processing Temporal Data in a Main Memory Column Store
1450 -- 1455Stepan Kozak. Efficiency and Security in Similarity Cloud Services
1456 -- 1461Thibault Sellam. Fast Cartography for Data Explorers

Volume 6, Issue 11

0 -- 0Min Wang, Cong Yu. Front Matter
961 -- 972Nicolas Bruno, Sapna Jain, Jingren Zhou. Continuous Cloud-Scale Query Optimization and Processing
973 -- 984Andrii Cherniak, Huma Zaidi, Vladimir Zadorozhny. Optimization Strategies for A/B Testing on HADOOP
985 -- 996Khaled Elmeleegy. Piranha: Optimizing Short Jobs in Hadoop
997 -- 1008Yuan Yuan, Rubao Lee, Xiaodong Zhang 0001. Making Updates Disk-I/O Friendly Using SSDs
1009 -- 1020Ablimit Aji, Fusheng Wang, Hoang Vo, Rubao Lee, Qiaoling Liu, Xiaodong Zhang 0001, Joel H. Saltz. Hadoop-GIS: A High Performance Spatial Data Warehousing System over MapReduce
1021 -- 1032Bhuvan Bamba, Siva Ravada, Ying Hu, Richard Anderson. Statistics Collection in Oracle Spatial and Graph: Fast Histogram Construction for Complex Geometry Objects
1033 -- 1044Tyler Akidau, Alex Balikov, Kaya Bekiroglu, Slava Chernyak, Josh Haberman, Reuven Lax, Sam McVeety, Daniel Mills, Paul Nordstrom, Sam Whittle. MillWheel: Fault-Tolerant Stream Processing at Internet Scale
1045 -- 1056Ian Rae, Eric Rollins, Jeff Shute, Sukhdeep Sodhi, Radek Vingralek. Online, Asynchronous Schema Change in F1
1057 -- 1067Lior Abraham, John Allen, Oleksandr Barykin, Vinayak R. Borkar, Bhuwan Chopra, Ciprian Gerea, Daniel Merl, Josh Metzler, David Reiss, Subbu Subramanian, Janet L. Wiener, Okay Zed. Scuba: Diving into Data at Facebook
1068 -- 1079Jeff Shute, Radek Vingralek, Bart Samwel, Ben Handy, Chad Whipkey, Eric Rollins, Mircea Oancea, Kyle Littlefield, David Menestrina, Stephan Ellner, John Cieslewicz, Ian Rae, Traian Stancescu, Himani Apte. F1: A Distributed SQL Database That Scales
1080 -- 1091Vijayshankar Raman, Gopi K. Attaluri, Ronald Barber, Naresh Chainani, David Kalmuk, Vincent KulandaiSamy, Jens Leenstra, Sam Lightstone, Shaorong Liu, Guy M. Lohman, Tim Malkemus, René Müller, Ippokratis Pandis, Berni Schiefer, David Sharpe, Richard Sidle, Adam J. Storm, Liping Zhang. DB2 with BLU Acceleration: So Much More than Just a Column Store
1092 -- 1101Michael Ovsiannikov, Silvius Rus, Damian Reeves, Paul Sutter, Sriram Rao, Jim Kelly. A The Quantcast File System
1102 -- 1113Srikanth Bellamkonda, Hua-Gang Li, Unmesh Jagtap, Yali Zhu, Vince Liang, Thierry Cruanes. Adaptive and Big Data Scale Parallel Execution in Oracle
1114 -- 1125Kedar Bellare, Carlo Curino, Ashwin Machanavajihala, Peter Mika, Mandar Rahurkar, Aamod Sane. WOO: A Scalable and Multi-tenant Platform for Continuous Knowledge Base Synthesis
1126 -- 1137Abhishek Gattani, Digvijay S. Lamba, Nikesh Garera, Mitul Tiwari, Xiaoyong Chai, Sanjib Das, Sri Subramaniam, Anand Rajaraman, Venky Harinarayan, AnHai Doan. Entity Extraction, Linking, Classification, and Tagging for Social Media: A Wikipedia-Based Approach
1138 -- 1149Hazem Elmeleegy, Yinan Li, Yan Qi 0002, Peter Wilmot, Mingxi Wu, Santanu Kolay, Ali Dasdan, Songting Chen. Overview of Turn Data Management Platform for Digital Advertising
1150 -- 1161Michael Curtiss, Iain Becker, Tudor Bosman, Sergey Doroshenko, Lucian Grijincu, Tom Jackson, Sandhya Kunnatur, Soren Lassen, Philip Pronin, Sriram Sankar, Guanghao Shen, Gintaras Woss, Chao Yang, Ning Zhang. Unicorn: A System for Searching the Social Graph
1162 -- 1163Sergio Ramazzina, Chiara L. Ballari, Daniela Somenzi. A New Service for Customer Care Based on the TrentoRise BigData Platform
1164 -- 1165Fabrizio Antonelli, Antonino Casella, Cristiana Chitic, Roberto Larcher, Giovanni Torrisi. Exploiting the Diversity, Mass and Speed of Territorial Data by TELCO Operator for Better User Services
1166 -- 1167Ivan Bedini, Benedikt Elser, Yannis Velegrakis. The Trento Big Data Platform for Public Administration and Large Companies: Use cases and Opportunities
1168 -- 1169Nga Tran, Sreenath Bodagala, Jaimin Dave. Designing Query Optimizers for Big Data Problems of The Future
1170 -- 1171Monica Franceschini. How to maximize the value of big data with the open source SpagoBI suite through a comprehensive approach
1172 -- 1173Edward Y. Chang. Context-Aware Computing: Opportunities and Open Issues
1174 -- 1175Oktie Hassanzadeh, Anastasios Kementsietsidis, Benny Kimelfeld, Rajasekar Krishnamurthy, Fatma Ozcan, Ippokratis Pandis. Next Generation Data Analytics at IBM Research
1176 -- 1177Mauro Brunato, Roberto Battiti. Learning and Intelligent Optimization (LION): One Ring to Rule Them All
1178 -- 1179David B. Lomet. Microsoft SQL Server's Integrated Database Approach for Modern Applications and Hardware
1180 -- 1181Hakan Hacigümüs, Jagan Sankaranarayanan, Jun'ichi Tatemura, Jeff LeFevre, Neoklis Polyzotis. Odyssey: A Multi-Store System for Evolutionary Analytics
1182 -- 1183Paolo Bouquet, Andrea Molinari. A global Entity Name System (ENS) for data ecosystems
1184 -- 1185Vishal Sikka, Franz Färber, Anil K. Goel, Wolfgang Lehner. SAP HANA: The Evolution from a Modern Main-Memory Data Platform to an Enterprise Application Platform
1186 -- 1187Raghunath Othayoth Nambiar, Meikel Poess. Keeping the TPC Relevant!
1188 -- 1189Xin Luna Dong, Divesh Srivastava. Big Data Integration
1190 -- 1191Stratis Viglas. Just-in-time compilation for SQL query processing
1192 -- 1193Anastasia Ailamaki, Ryan Johnson, Ippokratis Pandis, Pinar Tözün. Toward Scalable Transaction Processing
1194 -- 1195Aaron J. Elmore, Carlo Curino, Divyakant Agrawal, Amr El Abbadi. Towards Database Virtualization for Database as a Service
1196 -- 1197Mohamed F. Mokbel, Mohamed Sarwat. Mobility and Social Networking: A Data Management Perspective

Volume 6, Issue 10

0 -- 0Themis Palpanas, Yannis Velegrakis. Front Matter
781 -- 792Hyunjung Park, Jennifer Widom. Query Optimization over Crowdsourced Data
793 -- 804Yang Wang, Peng Wang, Jian Pei, Wei Wang, Sheng Huang. A Data-adaptive and Dynamic Segmentation Index for Whole Matching on Time Series
805 -- 816Mirko Bronzi, Valter Crescenzi, Paolo Merialdo, Paolo Papotti. Extraction and Integration of Partially Overlapping Web Sources
817 -- 828Yuan Yuan, Rubao Lee, Xiaodong Zhang 0001. The Yin and Yang of Processing Data Warehousing Queries on GPU Devices
829 -- 840Dayu Yuan, Prasenjit Mitra, C. Lee Giles. Mining and Indexing Graphs for Supergraph Search
841 -- 852Jianmin Wang 0001, Shaoxu Song, Xiaochen Zhu, Xuemin Lin. Efficient Recovery of Missing Events
853 -- 864Kai Ren, YongChul Kwon, Magdalena Balazinska, Bill Howe. Hadoop's Adolescence
865 -- 876Essam Mansour, Ahmed El-Roby, Panos Kalnis, Aron Ahmadia, Ashraf Aboulnaga. RACE: A Scalable and Elastic Parallel System for Discovering Repeats in Very Long Sequences
877 -- 888Justin J. Levandoski, David B. Lomet, Sudipta Sengupta. LLAMA: A Cache/Storage Subsystem for Modern Hardware
889 -- 900Jiong He, Mian Lu, Bingsheng He. Revisiting Co-Processing for Hash Joins on the Coupled CPU-GPU Architecture
901 -- 912Miao Qiao, Lu Qin, Hong Cheng, Jeffrey Xu Yu, Wentao Tian. Top-K Nearest Keyword Search on Large Graphs
913 -- 924Nikos Armenatzoglou, Stavros Papadopoulos, Dimitris Papadias. A General Framework for Geo-Social Query Processing
925 -- 936Wentao Wu, Yun Chi, Hakan Hacigümüs, Jeffrey F. Naughton. Towards Predicting Query Execution Time for Concurrent and Dynamic Database Workloads
937 -- 948Minos N. Garofalakis, Daniel Keren, Vasilis Samoladas. Sketch-based Geometric Monitoring of Distributed Stream Queries
949 -- 960Cheng Long, Raymond Chi-Wing Wong, Chenjuan Guo, H. V. Jagadish. Direction-Preserving Trajectory Simplification

Volume 6, Issue 1

0 -- 0Michael H. Böhlen, Christoph Koch. Front Matter
1 -- 12Panagiotis Bouros, Shen Ge, Nikos Mamoulis. Spatio-textual similarity joins
13 -- 24Marina Drosou, Evaggelia Pitoura. DisC diversity: result diversification based on dissimilarity and coverage
25 -- 36Chen Zeng, Jeffrey F. Naughton, Jin-yi Cai. On differentially private frequent itemset mining