Original publisher: OpenProceedings.org, ISBN: 978-3-89318-067-7, Electronic Edition
Front Matter
Research Sessions
Systems
Chariots: A Scalable Shared Log for Data Management in Multi-Datacenter Cloud Environments
Faisal Nawab, Vaibhav Arora, Divyakant Agrawal, Amr El Abbadi
pp. 13-24
Hermes: Dynamic Partitioning for Distributed Social Network Graph
Databases
Daniel Nicoara, Shahin Kamali, Khuzaima Daudjee, Lei Chen
pp. 25-36
Data Curation, Annotation and Provenance
On Debugging Non-Answers in Keyword Search Systems
Akanksha Baid, Wentao Wu, Chong Sun, AnHai Doan, Jeffrey F. Naughton
pp. 37-48
Elevating Annotation Summaries To First-Class Citizens In InsightNotes
Karim Ibrahim, Dongqing Xiao, Mohamed Eltabakh
pp. 49-60
Estimating Data Integration and Cleaning Effort
Paolo Papotti, Felix Naumann, Sebastian Kruse
pp. 61-72
Data Mining and Integration
Mining Frequent Co-occurrence Patterns across Multiple Data Streams
Ziqiang Yu, Xiaohui Yu, Yang Liu, Wenzhu Li, Jian Pei
pp. 73-84
Extracting Aggregate Answer Statistics for Integration
Zainab Zolaktaf, Jian Xu, Rachel Pottinger
pp. 85-96
Discovering Recurring Patterns in Time Series
R. Uday Kiran, Haichuan Shang, Masashi Toyoda, Masaru Kitsuregawa
pp. 97-108
Graphs Session 1
TimeReach: Historical Reachability Queries on Evolving Graphs
Konstantinos Semertzidis, Evaggelia Pitoura, Kostas Lillis
pp. 121-132
Efficiently Computing Top-K Shortest Path Join
Lijun Chang, Xuemin Lin, Lu Qin, Jeffrey Xu Yu, Jian Pei
pp. 133-144
Graphs Session 2
SIEF: Efficiently Answering Distance Queries for Failure Prone Graphs
Yongrui Qin, Quan Z. Sheng, Wei Emma Zhang
pp. 145-156
A Selectivity based approach to Continuous Pattern Detection in Streaming Graphs
Sutanay Choudhury, Lawrence Holder, George Chin, Khushbu Agarwal, John Feo
pp. 157-168
Scaling Unbound-Property Queries on Big RDF Data Warehouses using MapReduce
Padmashree Ravindra, Kemafor Anyanwu
pp. 169-180
Crowd 1
Reaching a desired set of users via different paths: an online advertising technique on micro-blogging platforms
Milad Eftekhar, Nick Koudas, Yashar Ganjali
pp. 181-192
On Optimality of Jury Selection in Crowdsourcing
Yudian Zheng, Reynold Cheng, Silviu Maniu, Luyi Mo
pp. 193-204
Finding the Most Diverse Products using Preference Queries
Orestis Gkorgkas, Akrivi Vlachou, Christos Doulkeridis, Kjetil Norvag
pp. 205-216
Indexing and Classification
Index Design for Enforcing Partial Referential Integrity Efficiently
Mozhgan Memari, Sebastian Link
pp. 217-228
Learning to Rank Adaptively for Scalable Information Extraction
Pablo Barrio, Gonçalo Simões, Helena Galhardas, Luis Gravano
pp. 241-252
Query Processing 1
The Sweet Spot between Inverted Indices and Metric-Space Indexing for Top-K-List Similarity Search
Evica Milchevski, Avishek Anand, Sebastian Michel
pp. 253-264
Optimizing Reformulation-based Query Answering in RDF
Damian Bursztyn, François Goasdoué, Ioana Manolescu
pp. 265-276
Resolving XML Semantic Ambiguity
Nathalie Charbel, Joe Tekli, Richard Chbeir, Gilbert Tekli
pp. 277-288
New Applications
SpMacho - Optimizing Sparse Linear Algebra Expressions with Probabilistic Density Estimation
David Kernert, Frank Köhler, Wolfgang Lehner
pp. 289-300
Efficient evaluation of threshold queries of derived fields in a numerical simulation database
Kalin Kanov, Randal Burns, Cristian C. Lalescu
pp. 301-312
Identifying Converging Pairs of Nodes on a Budget
Konstantina Lazaridou, Konstantinos Semertzidis, Evaggelia Pitoura, Panayiotis Tsaparas
pp. 313-324
Query Processing 2
Query-Based Outlier Detection in Heterogeneous Information Networks
Jonathan Kuck, Honglei Zhuang, Xifeng Yan, Hasan Cam, Jiawei Han
pp. 325-336
Efficient caching for constrained skyline queries
Michael Lind Mortensen, Sean Chester, Ira Assent, Matteo Magnani
pp. 337-348
Parallel Processing
Efficient Processing of Hamming-Distance-Based Similarity-Search Queries Over MapReduce
Mingjie Tang, Yongyang Yu, Walid G. Aref, Qutaibah M. Malluhi, Mourad Ouzzani
pp. 361-372
Joins for Hybrid Warehouses: Exploiting Massive Parallelism in Hadoop and Enterprise Data Warehouses
Yuanyuan Tian, Tao Zou, Fatma Ozcan, Romulo Goncalves, Hamid Pirahesh
pp. 373-384
Benchmarking Smart Meter Data Analytics
Xiufeng Liu, Lukasz Golab, Wojciech Golab, Ihab F. Ilyas
pp. 385-396
Crowd 2
Crowd-Selection Query Processing in Crowdsourcing Databases: A Task-Driven Approach
Zhou Zhao, Furu Wei, Ming Zhou, Weikeng Chen, Wilfred Ng
pp. 397-408
Group Recommendation with Temporal Affinities
Sihem Amer-Yahia, Behrooz Omidvar-Tehrani, Senjuti Basu Roy, Nafiseh Shabib
pp. 421-432
Spatial Queries
On Processing Top-k Spatio-Textual Preference Queries
George Tsatsanifos, Akrivi Vlachou
pp. 433-444
Probabilistic Resource Route Queries with Reappearance
Gregor Jossé, Klaus Arthur Schmid, Matthias Schubert
pp. 445-456
Cost Estimation of Spatial k-Nearest-Neighbor Operators
Ahmed M. Aly, Walid G. Aref, Mourad Ouzzani
pp. 457-468
Temporal Queries and Privacy
Reconstruction Privacy: Enabling Statistical Learning
Ke Wang, Chao Han, Ada Waichee Fu, Raymond Chi Wing Wong, Philip S. Yu
pp. 469-480
Time series anomaly discovery with grammar-based compression
Pavel Senin, Jessica Lin, Xing Wang, Tim Oates, Sunil Gandhi, Arnold P. Boedihardjo, Crystal Chen, Susan Frankenstein
pp. 481-492
Visionary Sessions
Visionary
A Generic Solution to Integrate SQL and Analytics for Big Data
Nick R. Katsipoulakis, Yuanyuan Tian, Fatma Ozcan, Hamid Pirahesh, Berthold Reinwald
pp. 671-676
ECCO- A Framework for Ecological Data Collection and Management Involving Human Workers
Senjuti Basu Roy, Sihem Amer-Yahia, Lucas Joppa
pp. 677-682
ligDB---Online Query Processing Without (almost) any Storage
Evica Milchevski, Sebastian Michel
pp. 683-688
Procrastination Beats Prevention: Timely Sufficient Persistence for Efficient Crash Resilience
Faisal Nawab, Dhruva R. Charkrabarti, Terence Kelly, Charles B. Morrey III
pp. 689-694
Demonstration Sessions
Demo
Interactive Path Query Specification on Graph Databases
Angela Bonifati, Radu Ciucanu, Aurélien Lemay
pp. 505-508
Flexible Analysis of Plant Genomes in a Database Management System
Sebastian Dorok, Sebastian Breß, Jens Teubner, Gunter Saake
pp. 509-512
Demonstrating Transfer-Efficient Sample Maintenance on Graphics Cards
Max Heimel, Martin Kiefer, Volker Markl
pp. 513-516
NoFTL for Real: Databases on Real Native Flash Storage
Sergey Hardock, Ilia Petrov, Robert Gottstein, Alejandro Buchmann
pp. 517-520
WAVEGUIDE: Evaluating SPARQL Property Path Queries
Nikolay Yakovets, Parke Godfrey, Jarek Gryz
pp. 525-528
Meta-Stars: Dynamic, Schemaless, and Semantically-Rich Topic Hierarchies in Social BI
Enrico Gallinucci, Matteo Golfarelli, Stefano Rizzi
pp. 529-532
“I would like to watch something like ‘The Terminator’…” Cooperative Query Personalization Based on Perceptual Similarity
Christoph Lofi, Christian Nieke
pp. 533-536
Liquid Benchmarking: A Platform for Democratizing the Performance Evaluation Process
Sherif Sakr, Amin Shafaat, Fuad Bajaber, Ahmed Barnawi, Omar Batarfi, Abdulrahman Altalhi
pp. 537-540
Natural Language Specification and Violation Reporting of Business Rules over ER-modeled Databases
Michael Minock, Daniel Oskarsson, Björn Pelzer, Mika Cohen
pp. 541-544
POIESIS: a Tool for Quality-aware ETL Process Redesign
Vasileios Theodorou, Alberto Abelló, Maik Thiele, Wolfgang Lehner
pp. 545-548
Quarry: Digging Up the Gems of Your Data Treasury
Petar Jovanovic, Oscar Romero, Alkis Simitsis, Alberto Abelló, Héctor Candón, Sergi Nadal
pp. 549-552
QaRS: A User-Friendly Graphical Tool for Semantic Query Design and Relaxation
Géraud Fokou, Stephane Jean, Allel Hadjali, Mickaël Baron
pp. 553-556
Industry and Applications
Industry
Using Object-Awareness to Optimize Join Processing in the SAP HANA Aggregate Cache
Stephan Müller, Anisoara Nica, Lars Butzmann, Stefan Klauck, Hasso Plattner
pp. 557-568
Transactional Replication in Hybrid Data Store Architectures
Hojjat Jafarpour, Junichi Tatemura, Hakan Hacigumus
pp. 569-580
SAP HANA – From Relational OLAP Database to Big Data Infrastructure
Norman May, Wolfgang Lehner, Shahul Hameed P., Nitesh Maheshwari, Carsten Müller, Sudipto Chowdhuri, Anil Goel
pp. 581-592
Taxi Queue, Passenger Queue or No Queue? - A Queue Detection and Analysis System using Taxi State Transition
YU LU, Shili Xiang, Wei Wu
pp. 593-604
The NPD Benchmark: Reality Check for OBDA Systems
Davide Lanti, Martin Rezk, Guohui Xiao, Diego Calvanese
pp. 617-628
Event Recognition for Maritime Surveillance
Kostas Patroumpas, Alexander Artikis, Nikos Katzouris, Marios Vodas, Yannis Theodoridis, Nikos Pelekis
pp. 629-640
Identifying User Interests within the Data Space - a Case Study with SkyServer
Hoang Vu Nguyen, Klemens Boehm, Florian Becker, Bertrand Goldman, Georg Hinkel, Emmanuel Müller
pp. 641-652
Insights on a Scalable and Dynamic Traffic Management System
Nikolas Zygouras, Nikos Zacheilas, Vana Kalogeraki, Dermot Kinane, Dimitrios Gunopulos
pp. 653-664