Accepted Research Papers
- "Explanation-Based Auditing" by Daniel Fabbri, Kristen LeFevre.
- "Human-powered Sorts and Joins" by Adam Marcus, Eugene Wu, David Karger, Samuel Madden, Robert Miller.
- "Verifying Computations with Streaming Interactive Proofs" by Graham Cormode, Justin Thaler, Ke Yi.
- "A MovingObject Index for Efficient Query Processing with Peer-Wise Location Privacy" by Dan Lin, Christian S. Jensen, Rui Zhang, Lu Xiao, Jiaheng Lu.
- "ERA: Efficient Serial and Parallel Suffix Tree Construction for Very Long Strings" by Essam Mansour, Amin Allam, Spiros Skiadopoulos, Panos Kalnis.
- "Fast Updates on Read-Optimized Databases Using Multi-Core CPUs" by Jens Krueger, Changkyu Kim, Martin Grund, Nadathur Satish, David Schwalb, Jatin Chhugani, Hasso Plattner, Pradeep Dubey, Alexander Zeier.
- "A Data-Based Approach to Social Influence Maximization" by Amit Goyal, Francesco Bonchi, Laks V. S. Lakshmanan.
- "On Predictive Modeling for Optimizing Transaction Execution in Parallel OLTP Systems" by Andrew Pavlo, Evan P.C. Jones, Stanley Zdonik.
- "View Selection in Semantic Web Databases" by François Goasdoué, Konstantinos Karanasos, Julien Leblay, Ioana Manolescu.
- "Building Wavelet Histograms on Large Data in MapReduce" by Jeffrey Jestes, Ke Yi, Feifei Li.
- "Summarization and Matching of Density-Based Clusters in Streaming Environments" by Di Yang, Elke A. Rundensteiner, Matthew O. Ward.
- "Multilingual Schema Matching for Wikipedia Infoboxes" by Thanh Nguyen, Viviane Moreira, Huong Nguyen, Hoa Nguyen, Juliana Freire.
- "Controlling False Positives in Association Rule Mining" by Guimei Liu, Haojun Zhang, Limsoon Wong.
- "PARIS: Probabilistic Alignment of Relations, Instances, and Schema" by Fabian M. Suchanek, Serge Abiteboul, Pierre Senellart.
- "Answering Top-k Queries Over a Mixture of Attractive and Repulsive Dimensions" by Sayan Ranu, Ambuj K. Singh.
- "PIQL: Success-Tolerant Query Processing in the Cloud" by Michael Armbrust, Kristal Curtis, Tim Kraska, Armando Fox, Michael J. Franklin, David A. Patterson.
- "gSketch: On Query Estimation in Graph Streams" by Peixiang Zhao, Charu C. Aggarwal, Min Wang.
- "Indexing the Earth Mover's Distance Using Normal Distributions" by Brian E. Ruttenberg, Ambuj K. Singh.
- "Generating Exact- and Ranked Partially-Matched Answers to Questions in Advertisements" by Rani Qumsiyeh, Maria S. Pera, Yiu-Kai Ng.
- "Size-l Object Summaries for Relational Keyword Search" by Georgios J. Fakas, Zhi Cai, Nikos Mamoulis.
- "REX: Explaining Relationships between Entity Pairs" by Lujun Fang, Anish Das Sarma, Cong Yu, Philip Bohannon.
- "PASS-JOIN: A Partition-based Method for Similarity Joins" by Guoliang Li, Dong Deng, Jiannan Wang, Jianhua Feng.
- "Relative Lempel-Ziv Factorization for Efficient Storage and Retrieval of Web Collections" by Christopher Hoobin, Simon J. Puglisi, Justin Zobel.
- "Towards Cost-Effective Storage Provisioning for DBMSs" by Ning Zhang, Junichi Tatemura, Jignesh M. Patel, Hakan Hacıgümüş.
- "B+-tree Index Optimization by Exploiting Internal Parallelism of Flash-based Solid State Drives" by Hongchan Roh, Sanghyun Park, Sungho Kim, Mincheol Shin, Sang-Won Lee.
- "High-Performance Concurrency Control Mechanisms for Main-Memory Databases" by Per-Åke Larson, Spyros Blanas, Cristian Diaconu, Craig Freedman, Jignesh M. Patel, Mike Zwilling.
- "Capturing Topology in Graph Pattern Matching" by Shuai Ma, Yang Cao, Wenfei Fan, Jinpeng Huai, Tianyu Wo.
- "Probabilistic Management of OCR Data using an RDBMS" by Arun Kumar, Christopher Ré.
- "RTED: A Robust Algorithm for the Tree Edit Distance" by Mateusz Pawlik, Nikolaus Augsten.
- "Putting Lipstick on Pig: Enabling Database-style Workflow Provenance" by Yael Amsterdamer, Susan B. Davidson, Daniel Deutch, Tova Milo, Julia Stoyanovich, Val Tannen.
- "Relational Approach for Shortest Path Discovery over Large Graphs" by Jun Gao, Ruoming Jin, Jiashuai Zhou, Jeffrey Xu Yu, Xiao Jiang, Tengjiao Wang.
- "Mining Flipping Correlations from Large Datasets with Taxonomies" by Marina Barsky, Sangkyum Kim, Tim Weninger, Jiawei Han.
- "A Statistical Approach Towards Robust Progress Estimation" by Arnd Christian König, Bolin Ding, Surajit Chaudhuri, Vivek Narasayya.
- "Relation Strength-Aware Clustering of Heterogeneous Information Networks with Incomplete Attributes" by Yizhou Sun, Charu C. Aggarwal, Jiawei Han.
- "Shortest Path and Distance Queries on Road Networks: An Experimental Evaluation" by Lingkun Wu, Xiaokui Xiao, Dingxiong Deng, Gao Cong, Andy Diwen Zhu, Shuigeng Zhou.
- "The Filter-Placement Problem and its Application to Minimizing Information Multiplicity" by Dóra Erdös, Vatche Ishakian, Andrei Lapets, Evimaria Terzi, Azer Bestavros.
- "Bayesian Locality Sensitive Hashing for Fast Similarity Search" by Venu Satuluri, Srinivasan Parthasarathy.
- "Fast and Exact Top-k Search for Random Walk with Restart" by Yasuhiro Fujiwara, Makoto Nakatsuji, Makoto Onizuka, Masaru Kitsuregawa.
- "Densest Subgraph in Streaming and MapReduce" by Bahman Bahmani, Ravi Kumar, Sergei Vassilvitskii.
- "Mining Attribute-structure Correlated Patterns in Large Attributed Graphs" by Arlei Silva, Wagner Meira Jr., Mohammed J. Zaki.
- "Semi-Automatic Index Tuning: Keeping DBAs in the Loop" by Karl Schnaitter, Neoklis Polyzotis.
- "Aggregation in Probabilistic Databases via Knowledge Compilation" by Robert Fink, Larisa Han, Dan Olteanu.
- "Stochastic Database Cracking: Towards Robust Adaptive Indexing in Main-Memory Column-Stores" by Felix Halim, Stratos Idreos, Panagiotis Karras, Roland H. C. Yap.
- "An Adaptive Mechanism for Accurate Query Answering under Differential Privacy" by Chao Li, Gerome Miklau.
- "SharedDB: Killing One Thousand Queries With One Stone" by Georgios Giannikis, Gustavo Alonso, Donald Kossmann.
- "Pushing the Boundaries of Crowd-enabled Databases with Query-driven Schema Expansion" by Joachim Selke, Christoph Lofi, Wolf-Tilo Balke.
- "A Bayesian Approach to Discovering Truth from Conflicting Sources for Data Integration" by Bo Zhao, Benjamin I. P. Rubinstein, Jim Gemmell, Jiawei Han.
- "How to Price Shared Optimizations in the Cloud" by Prasang Upadhyaya, Magdalena Balazinska, Dan Suciu.
- "Dense Subgraph Maintenance under Streaming Edge Weight Updates for Real-time Story Identification" by Albert Angel, Nick Koudas, Nikos Sarkas, Divesh Srivastava.
- "ReStore: Reusing Results of MapReduce Jobs" by Iman Elghandour, Ashraf Aboulnaga.
- "PerfXplain: Debugging MapReduce Job Performance" by Nodira Khoussainova, Magdalena Balazinska, Dan Suciu.
- "Uncertain Centroid based Partitional Clustering of Uncertain Data" by Francesco Gullo, Andrea Tagarelli.
- "Scalable K-Means++" by Bahman Bahmani, Benjamin Moseley, Andrea Vattani, Ravi Kumar, Sergei Vassilvitskii.
- "Querying Schemas With Access Restrictions" by Michael Benedikt, Pierre Bourhis, Clemens Ley.
- "Definition, Detection, and Recovery of Single-Page Failures, a Fourth Class of Database Failures" by Goetz Graefe, Harumi Kuno.
- "Concurrency Control for Adaptive Indexing" by Goetz Graefe, Felix Halim, Stratos Idreos, Harumi Kuno, Stefan Manegold.
- "Comments on "Stack-based Algorithms for Pattern Matching on DAGs"" by Qiang Zeng, Zhuge Hai.
- "An Analysis of Structured Data on the Web" by Nilesh Dalvi, Ashwin Machanavajjhala, Bo Pang.
- "Shortest Path Computation with No Information Leakage" by Kyriakos Mouratidis, Man Lung Yiu.
- "V-SMART-Join: A Scalable MapReduce Framework for All-Pair Similarity Joins of Multisets and Vectors" by Ahmed Metwally, Christos Faloutsos.
- "Distributed GraphLab: A Framework for Machine Learning in the Cloud" by Yucheng Low, Joseph Gonzalez, Aapo Kyrola, Danny Bickson, Carlos Guestrin, Joseph M. Hellerstein.
- "Adding Logical Operators to Tree Pattern Queries on Graph-Structured Data" by Qiang Zeng, Xiaorui Jiang, Hai Zhuge.
- "Learning Semantic String Transformations from Examples" by Rishabh Singh, Sumit Gulwani.
- "Cologne: A Declarative Distributed Constraint Optimization Platform" by Changbin Liu, Lu Ren, Boon Thau Loo, Yun Mao, Prithwish Basu.
- "Optimizing I/O for Big Array Analytics" by Yi Zhang, Jun Yang.
- "Probabilistically Bounded Staleness for Practical Partial Quorums" by Peter Bailis, Shivaram Venkataraman, Michael J. Franklin, Joseph M. Hellerstein, Ion Stoica.
- "Efficient Subgraph Matching on Billion Node Graphs" by Zhao Sun, Hongzhi Wang, Haixun Wang, Bin Shao, Jianzhong Li.
- "Efficient Subgraph Similarity Search on Large Probabilistic Graph Databases" by Ye Yuan, Guoren Wang, Lei Chen, Haixun Wang.
- "Truss Decomposition in Massive Networks" by Jia Wang, James Cheng.
- "SEAL: Spatio-Textual Similarity Search" by Ju Fan, Guoliang Li, Lizhu Zhou, Shanshan Chen, Jun Hu.
- "On The Spatiotemporal Burstiness of Terms" by Theodoros Lappas, Marcos R. Vieira, Dimitrios Gunopulos, Vassilis J. Tsotras.
- "Efficient Reachability Query Evaluation in Large Spatiotemporal Contact Datasets" by Houtan Shirani-Mehr, Farnoush Banaei Kashani, Cyrus Shahabi.
- "Boosting Moving Object Indexing through Velocity Partitioning" by Thi Nguyen, Zhen He, Rui Zhang, Phillip Ward.
- "Type-Based Detection of XML Query-Update Independence" by Nicole Bidoit-Tollu, Dario Colazzo, Federico Ulliana.
- "Minuet: A Scalable Distributed Multiversion B-Tree" by Benjamin Sowell, Wojciech Golab, Mehul A. Shah.
- "Challenging the Long Tail Recommendation" by Hongzhi Yin, Bin Cui, Jing Li, Junjie Yao, Chen Chen.
- "hStorage-DB: Heterogeneity-aware Data Management to Exploit the Full Capability of Hybrid Storage Systems" by Tian Luo, Rubao Li, Michael Mesnier, Feng Chen, Xiaodong Zhang.
- "Privacy Preservation by Disassociation" by Manolis Terrovitis, John Liagouris, Nikos Mamoulis, Spiros Skiadopoulos.
- "Real Time Discovery of Dense Clusters in Highly Dynamic Graphs: Identifying Real World Events in Highly Dynamic Environments" by Manoj Agarwal, Krithi Ramamritham, Manish Bhide.
- "Only Aggressive Elephants are Fast Elephants" by Jens Dittrich, Alekh Jindal, Jorge Quiané, Stefan Richter, Jörg Schad, Stefan Schuh.
- "Uncertain Time-Series Similarity: Return to the Basics" by Michele Dallachiesa, Besmira Nushi, Katsiaryna Mirylenka, Themis Palpanas.
- "Towards Energy-Efficient Database Cluster Design" by Willis Lang, Stavros Harizopoulos, Jignesh Patel, Mehul Shah, Dimitris Tsirogiannis.
- "Mining Frequent Itemsets over Uncertain Databases" by Yongxin Tong, Yurong Cheng, Lei Chen, Philip Yu.
- "Statistical Distortion: Consequences of Data Cleaning" by Tamraparni Dasu, Ji Meng Loh.
- "Answering Table Queries on the Web using Column Keywords" by Rakesh Pimplikar, Sunita Sarawagi.
- "ALAE: Accelerating Local Alignment with Affine Gap Exactly in Biosequence Databases" by Xiaochun Yang, Honglei Liu, Bin Wang.
- "A Quality-Sensitive Model for Crowdsourcing-based Data Analytic System" by Xuan Liu, Meiyu Lu, Beng Chin Ooi, Sai Wu, Meihui ZHANG, Yanyan Shen.
- "Massively Parallel Sort-Merge Joins in Main Memory Multi-Core Database Systems" by Martina-Cezara Albutiu, Alfons Kemper, Thomas Neumann.
- "LogBase: Scalable Log-Structured Storage System for Write-heavy Environments" by Hoang Tam Vo, Sheng Wang, Divyakant Agrawal, Gang Chen, Beng Chin Ooi.
- "Sketch-based Querying of Distributed Sliding-Window Data Streams" by Odysseas Papapetrou, Minos Garofalakis, Antonios Deligiannakis.
- "Answering Queries using Views over Probabilistic XML: Complexity and Tractability" by Bogdan Cautis, Evgeny Kharlamov.
- "Efficient Processing of K Nearest Neighbor Joins using MapReduce" by Wei Lu, Yanyan Shen, Su Chen, Beng Chin Ooi.
- "Optimal Algorithms for Crawling a Hidden Database in the Web" by Cheng Sheng, Nan Zhang, Yufei Tao, Xin Jin.
- "Flash-based Extended Cache for Higher Throughput and Faster Recovery" by Woon-Hak Kang, Sang-Won Lee, Bongki Moon.
- "Optimization of Analytic Window Functions" by Yu Cao, Chee-Yong Chan, Jie Li, Kian-Lee Tan.
- "Labeling Workflow Views with Fine-Grained Dependencies" by Zhuowei Bao, Susan Davidson, Tova Milo.
- "Opening the Black Boxes in Data Flow Optimization" by Fabian Hueske, Mathias Peters, Matthias Sax, Astrid Rheinländer, Rico Bergmann, Aljoscha Krettek, Kostas Tzoumas.
- "FDB: A Query Engine for Factorised Relational Databases" by Dan Olteanu, Jakub Zavodny.
- "OLTP on Hardware Islands" by Danica Porobic, Ippokratis Pandis, Miguel Branco, Pinar Tozun, Anastasia Ailamaki.
- "Efficient Multi-way Theta-Join Processing Using MapReduce" by Xiaofei Zhang, Lei Chen, Min Wang.
- "K-Reach: Who is In Your Small World" by James Cheng, Zechao Shang, Hong Cheng, Haixun Wang, Jeffrey Xu Yu.
- "Low-Rank Mechanism: Optimizing Batch Queries under Differential Privacy" by Ganzhao Yuan, Zhenjie Zhang, Marianne Winslett, Xiaokui Xiao, Yin Yang, Zhifeng Hao.
- "SCOUT: StructureAware Prefetching for Latent Feature Following Queries" by Farhan Tauheed, Thomas Heinis, Anastasia Ailamaki, Felix Shurmann, Henry Markram.
- "Spatial Queries with Two kNN Predicates" by Ahmed Aly, Walid Aref, Mourad Ouzzani.
- "Performance Guarantees for Distributed Reachability Queries" by Wenfei Fan, Xin Wang, Yinghui Wu.
- "Learning Expressive Linkage Rules using Genetic Programming" by Robert Isele, Christian Bizer.
- "sDTW: Computing DTW Distances using Locally Relevant Constraints based on Salient Feature Alignments" by Selcuk Candan, Rosaria Rossini, Maria Sapino, Xiaolan Wang.
- "A Scalable Algorithm for Maximizing Range Sum in Spatial Databases" by Dong-Wan Choi, Chin-Wan Chung, Yufei Tao.
- "Spinning Fast Iterative Data Flows" by Stephan Ewen, Kostas Tzoumas, Moritz Kaufmann, Volker Markl.
- "Ranking Large Temporal Data" by Jeffrey Jestes, Jeff Phillips, Feifei Li, Mingwang Tang.
- "Compacting Transactional Data in Hybrid OLTP & OLAP Databases" by Florian Funke, Alfons Kemper, Thomas Neumann.
- "REX: Recursive, Delta-Based Data-Centric Computation" by Svilen Mihaylov, Zachary Ives, Sudipto Guha.
- "Measuring Two-Event Structural Correlations on Graphs" by Ziyu Guan, Xifeng Yan, Lance Kaplan.
- "Processing a Trillion Cells per Mouse Click" by Alexander Hall, Olaf Bachmann, Robert Buessow, Silviu Ganceanu, Marc Nunkesser.
- "DBToaster: Higher-order delta processing for dynamic, frequently fresh views" by Yanif Ahmad, Oliver Kennedy, Christoph Koch, Milos Nikolic.
- "Early Accurate Results for Advanced Analytics on MapReduce" by Nikolay Laptev, Kai Zeng, Carlo Zaniolo.
- "Fundamentals of Order Dependencies" by Jaroslaw Szlichta, Parke Godfrey, Jarek Gryz.
- "CrowdER: Crowdsourcing Entity Resolution" by Jiannan Wang, Tim Kraska, Michael Franklin, Jianhua Feng.
- "The Complexity of Social Coordination" by Sigal Oren, Konstantinos Mamouras, Lior Seeman, Lucja Kot, Johannes Gehrke.
- "Probabilistic Databases with MarkoViews" by Dan Suciu, Abhay Jha.
- "Functional Mechanism: Regression Analysis under Differential Privacy" by Jun Zhang, Zhenjie Zhang, Xiaokui Xiao, Yin Yang, Marianne Winslett.
- "Keyword-aware Optimal Route Search" by Xin Cao, Lisi Chen, Gao Cong, Xiaokui Xiao.
- "Efficient Indexing and Querying over Syntactically Annotated Trees" by Pirooz Chubak, Davood Rafiei.
- "Who Tags What? An Analysis Framework" by Mahashweta Das, Saravanan Thirumuruganathan, Sihem Amer-Yahia, Gautam Das, Cong Yu.
- "Queries with Guarded Negation" by Vince Barany, Balder Ten Cate, Martin Otto.
- "PrivBasis: Frequent Itemset Mining with Differential Privacy" by Ninghui Li, Wahbeh Qardaji, Dong Su, Jianneng Cao.
- "Automatic Partitioning of Database Applications" by Alvin Cheung, Owen Arden, Samuel Madden, Andrew Myers.
- "Diversifying Top-K Results" by Lu Qin, Jeffrey Xu Yu, Lijun Chang.
- "Injecting Uncertainty in Graphs for Identity Obfuscation" by Paolo Boldi, Francesco Bonchi, Aris Gionis, Tamir Tassa.
- "Whom to Ask? Jury Selection for Decision Making Tasks on Micro-blog Services" by Chen CAO, Jieying She, Yongxin Tong, Lei Chen.
- "Don?t Thrash: How to Cache Your Hash on Flash" by Rob Johnson, Michael Bender, Martin Farach-Colton, Russell Kraner, Bradley Kuszmaul, Dzejla Medjedovic, Pablo Montes, Pradeep Shetty, Richard Spillane, Erez Zadok.
- "Serializability, not Serial: Concurrency Control and Availability in Multi-Datacenter Datastores" by Stacy Patterson, Aaron Elmore, Faisal Nawab, Divyakant Agrawal, Amr El Abbadi.
- "A Joint Multiple Location Model for Profiling Users' Locations from Social Network and Content" by Rui Li, Kevin Chang, Shengjie Wang.
- "A Generic Framework for Efficient and Effective Subsequence Retrieval" by George Kollios, Haohan Zhu, Vassilis Athitsos.
- "SODA: Generating SQL for Business Users" by Lukas Blunschi, Claudio Jossen, Donald Kossman, Magdalini Mori, Kurt Stockinger.
- "Mining Statistically Significant Substrings using the Chi-Square Statistic" by Mayank Sachan, Arnab Bhattacharya.
- "Supercharging Recommender Systems using Taxonomies for Learning User Purchase Behavior" by Bhargav Kanagal, Amr Ahmed, Sandeep Pandey, Vanja Josifovski, Jeff Yuan, Lluis Garcia.
- "Robust Estimation of Resource Consumption for SQL Queries using Statistical Techniques" by Jiexing Li, Arnd König, Vivek Narasayya, Surajit Chaudhuri.
- "Accelerating Pathology Image Data Cross Comparison on CPU-GPU Hybrid Systems" by Kaibo Wang, Yin Huai, Rubao Li, Fusheng Wang, Xiaodong Zhang, Joel Saltz.
- "Stubby: A Transformation-based Optimizer for MapReduce Workflows" by Harold Lim, Herodotos Herodotou, Shivnath Babu.
- "Efficient Verification of Web-Content Searching Through Authenticated Web Crawlers" by Michael Goodrich, Duy Nguyen, Olga Ohrimenko, Charalampos Papamanthou, Roberto Tamassia, Nikos Triandopoulos, Cristina Lopes.