Original publisher: OpenProceedings.org, ISBN: 978-3-89318-073-8, Electronic Edition
Series ISSN: 2367-2005
Front Matter
Research Sessions
Graph Databases
Parallel Array-Based Single- and Multi-Source Breadth First Searches on Large Dense Graphs
Moritz Kaufmann, Manuel Then, Alfons Kemper, Thomas Neumann
pp. 1-12
GraphCache: A Caching System for Graph Queries
Jing Wang, Nikos Ntarmos, Peter Triantafillou
pp. 13-24
Subgraph Querying with Parallel Use of Query Rewritings and Alternative Algorithms
Foteini Katsarou, Nikos Ntarmos, Peter Triantafillou
pp. 25-36
Reverse k-Ranks Queries on Large Graphs
Yuqiu Qian, Hui Li, Nikos Mamoulis, Yu Liu, David Cheung
pp. 37-48
Main Memory DB and Emerging Hardware
Analytics on Fast Data: Main-Memory Database Systems versus Modern Streaming Systems
Andreas Kipf, Varun Pandey, Jan Böttcher, Lucas Braun, Thomas Neumann, Alfons Kemper
pp. 49-60
Self-managed collections: Off-heap memory management for scalable query-dominated collections
Fabian Nagel, Gavin Bierman, Aleksandar Dragojevic, Stratis Viglas
pp. 61-71
Lightweight Data Compression Algorithms: An Experimental Survey (Experiments and Analyses)
Patrick Damme, Dirk Habich, Juliana Hildebrandt, Wolfgang Lehner
pp. 72-83
SQL- and Operator-centric Data Analytics in Relational Main-Memory Databases
Linnea Passing, Manuel Then, Nina Hubig, Harald Lang, Michael Schreier, Stephan Günnemann, Alfons Kemper, Thomas Neumann
pp. 84-95
Data Mining
Data Exploration with SQL using Machine Learning Techniques
Julien Cumin, Jean-Marc Petit, Vasile-Marian Scuturici, Sabina Surdu
pp. 96-107
Task-Optimized Group Search for Social Internet of Things
Chih-Ya Shen, Hong-Han Shuai, Kuo-Feng Hsu, Ming-Syan Chen
pp. 108-119
Finding Socio-Textual Associations Among Locations
Paras Mehta, Dimitris Sacharidis, Dimitrios Skoutas, Agnes Voisard
pp. 120-131
COP: Planning Conflicts for Faster Parallel Transactional Machine Learning
Faisal Nawab, Divy Agrawal, Amr El Abbadi, Sanjay Chawla
pp. 132-143
CEP and Streaming
EXstream: Explaining Anomalies in Event Stream Monitoring
Haopeng Zhang, Yanlei Diao, Alexandra Meliou
pp. 156-167
Real Time Contextual Summarization of Highly Dynamic Data Streams
Manoj Agarwal, Krithi Ramamritham
pp. 168-179
An Effective and Efficient Truth Discovery Framework over Data Streams
Tianyi Li, Yu Gu, Xiangmin Zhou, Qian Ma, Ge Yu
pp. 180-191
Data Warehousing and Integration
Maritime data integration and analysis: recent progress and research challenges
Christophe Claramunt, Cyril Ray, Elena Camossi, Anne-Laure Jousselme, Melita Hadzagic, Andrienko Gennady, Natalia Andrienko, Yannis Theodoridis, George Vouros, Loïc Salmon
pp. 192-197
DeepSea: Progressive Workload-Aware Partitioning of Materialized Views in Scalable Data Analytics
Jiang Du, Renée Miller, Boris Glavic, Wei Tan
pp. 198-209
Matching Web Tables To DBpedia - A Feature Utility Study
Dominique Ritze, Christian Bizer
pp. 210-221
Schema Inference for Massive JSON Datasets
Mohamed-Amine Baazizi, Houssem Ben Lahmar, Dario Colazzo, Giorgio Ghelli, Carlo Sartiani
pp. 222-233
Social Networks and Semantic Web
Motivation-Aware Task Assignment in Crowdsourcing
Julien Pilourdault, Sihem Amer-Yahia, Dongwon Lee, Senjuti Basu Roy
pp. 246-257
A Probabilistic Framework for Estimating Pairwise Distances Through Crowdsourcing
Habibur Rahman, Senjuti Basu Roy, Gautam Das
pp. 258-269
Indexing and Query Processing
Grid-Index Algorithm for Reverse Rank Queries
Yuyang Dong, Hanxiong Chen, Jeffrey Xu Yu, Kazutaka Furuse, Hiroyuki Kitagawa
pp. 306-317
Parallel and Distributed Processing of Spatial Preference Queries using Keywords
Christos Doulkeridis, Akrivi Vlachou, Dimitris Mpestas, Nikos Mamoulis
pp. 318-329
Heterogeneous Databases
Continuous Imputation of Missing Values in Streams of Pattern-Determining Time Series
Kevin Wellenzohn, Michael Böhlen, Anton Dignös, Johann Gamper, Hannes Mitterer
pp. 330-341
Towards Interactive Debugging of Rule-based Entity Matching
Fatemah Panahi, Wentao Wu, AnHai Doan, Jeffrey Naughton
pp. 354-365
Spatial, Temporal, and Geographic Databases
Efficient Motif Discovery in Spatial Trajectories Using Discrete Fréchet Distance
Bo Tang, Man Lung Yiu, Kyriakos Mouratidis, Kai Wang
pp. 378-389
Group Trip Scheduling (GTS) Queries in Spatial Databases
Roksana Jahan, Tanzima Hashem, Sukarna Barua
pp. 390-401
Towards Efficient Maintenance of Continuous MaxRS Query for Trajectories
Muhammed Mas-ud Hussain, Kazi Ashik Islam, Goce Trajcevski, Mohammed Eunus Ali
pp. 402-413
Exact and Approximate Algorithms for Finding k-Shortest Paths with Limited Overlap
Theodoros Chondrogiannis, Panagiotis Bouros, Johann Gamper, Ulf Leser
pp. 414-425
Efficient Parallel Spatial Skyline Evaluation Using MapReduce
Wenlu Wang, Ji Zhang, MinTe Sun, Wei-Shinn Ku
pp. 426-437
Demonstration Sessions
Demonstration
I²: Interactive Real-Time Visualization for Streaming Data
Jonas Traub, Nikolaas Steenbergen, Philipp Grulich, Tilmann Rabl, Volker Markl
pp. 526-529
HDM: Optimized Big Data Processing with Data Provenance
Dongyao Wu, Sherif Sakr, Liming Zhu
pp. 530-533
ChaseFUN: a Data Exchange Engine for Functional Dependencies at Scale
Angela Bonifati, Ioana Ileana, Michele Linardi
pp. 534-537
GnosisMiner: Reading Order Recommendations over Document Collections
Georgia Koutrika, Alkis Simitsis, Yannis Ioannidis
pp. 538-541
MovieFinder: A Movie Search System via Graph Pattern Matching
Xin Wang, Chengye Yu, Enyang Zhang, Tong Du
pp. 542-545
VAT: A System for Data-Driven Biodiversity Research
Christian Beilschmidt, Johannes Drönner, Michael Mattig, Bernhard Seeger
pp. 546-549
SDOS: Using Trusted Platform Modules for Secure Cryptographic Deletion in the Swift Object Store
Tim Waizenegger, Frank Wagner, Cataldo Mega
pp. 550-553
Come and crash our database! -- Instant recovery in action
Caetano Sauer, Gilson Souza, Goetz Graefe, Theo Härder
pp. 554-557
μTOP: Spatio-Temporal Detection and Summarization of Locally Trending Topics in Microblog Posts
Paras Mehta, Manuel Kotlarski, Dimitrios Skoutas, Dimitris Sacharidis, Kostas Patroumpas, Agnes Voisard
pp. 558-561
Insights into the Comparative Evaluation of Lightweight Data Compression Algorithms
Patrick Damme, Dirk Habich, Juliana Hildebrandt, Wolfgang Lehner
pp. 562-565
Context-Aware Proactive Personalization of Linear Audio Content
Paolo Casagranda, Maria Luisa Sapino, K. Selcuk Candan
pp. 574-577
In Search for Relevant, Diverse and Crowd-screen Points of Interests
Xiaoyu Ge, Samanvoy Panati, Konstantinos Pelechrinis, Panos Chrysanthis, Mohamed Sharaf
pp. 578-581
SIMDMS: Data Management and Analysis to Support Decision Making through Large Simulation Ensembles
Silvestro Poccia, Maria Luisa Sapino, Sicong Liu, Xilun Chen, Yash Garg, Shengyu Huang, Jung Kim, Xinsheng Li, Parth Nagarkar, K. Selcuk Candan
pp. 582-585
In-Place Appends for Real: DBMS Overwrites on Flash without Erase
Sergey Hardock, Ilia Petrov, Robert Gottstein, Alejandro Buchmann
pp. 586-589
CAESAR: Context-Aware Event Stream Analytics for Urban Transportation Services
Olga Poppe, Chuan Lei, Elke Rundensteiner, Dan Dougherty, Goutham Deva, Nicholas Fajardo, James Owens, Thomas Schweich, MaryAnn VanValkenburg, Sarun Paisarnsrisomsuk, Pitchaya Wiratchotisatian, George Gettel, Robert Hollinger, Devin Roberts, Daniel Tocco
pp. 590-593
Building Multi-Resolution Event-Enriched Maps From Social Data
Faizan Ur Rehman, Imad Afyouni, Ahmed Lbath, Sohaib Khan, Saleh Basalamah, Mohamed Mokbel
pp. 594-597
Tutorials
Tutorials
Declarative Graph Querying in Practice and Theory
George Fletcher, Hannes Voigt, Nikolay Yakovets
pp. 598-601
Industry and Applications
Industrial and Applications Paper
Improving Company Recognition from Unstructured Text by using Dictionaries
Michael Loster, Zuo Zhe, Felix Naumann, Oliver Maspfuhl, Dirk Thomas
pp. 610-619
Temporal group linkage and evolution analysis for census data
Victor Christen, Anika Groß, Jeffrey Fisher, Qing Wang, Peter Christen, Erhard Rahm
pp. 620-631
In-DBMS Sampling-based Sub-trajectory Clustering
Nikos Pelekis, Panagiotis Tampakis, Marios Vodas, Costas Panagiotakis, Yannis Theodoridis
pp. 632-643
Powering Archive Store Query Processing via Join Indices
Joseph D'silva, Bettina Kemme, Richard Grondin, Evgueni Fadeitchev
pp. 644-655
RDF Keyword-based Query Technology Meets a Real-World Dataset
Grettel García, Yenier Izquierdo, Elisa Menendez, Frederic Dartayre, Marco Casanova
pp. 656-667
Hi-WAY: Execution of Scientific Workflows on Hadoop YARN
Marc Bux, Jörgen Brandt, Carl Witt, Jim Dowling, Ulf Leser
pp. 668-679
Buddy Instance - A Mechanism for Increasing Availability in Shared-Disk Clusters
Anjan Kumar Amirishetty, Yunrui Li, Tolga Yurek, Mahesh Girkar, Wilson Chan, Graham Ivey, Vsevolod Panteleen, Ken Wong
pp. 680-686
DBaaS Cloud Capacity Planning - Accounting for Dynamic RDBMS System that Employ Clustering and Standby Architectures
Antony Higginson, Norman Paton, Suzanne Embury, Clive Bostock
pp. 687-698
Herding the elephants: Workload-level optimization strategies for Hadoop
Sandeep Akinapelli, Ravi Shetye, Sangeeta T
pp. 699-710
Poster Sessions
Poster Paper
Crowdsourcing Strategies for Text Creation Tasks
Ria Mae Borromeo, Maha Alsaysneh, Sihem Amer-Yahia, Vincent Leroy
pp. 450-453
SPST-Index: A Self-Pruning Splay Tree Index for Caching Database Cracking
Pedro Holanda, Eduardo Cunha de Almeida
pp. 458-461
Fairness and Transparency in Crowdsourcing
Ria Mae Borromeo, Thomas Laurent, Motomichi Toyama, Sihem Amer-Yahia
pp. 466-469
TASWEET: Optimizing Disjunctive Path Queries in Graph Databases
Zahid Abul-Basher, Nikolay Yakovets, Parke Godfrey, Shadi Ghajar-Khosrav, Mark Chignell
pp. 470-473
A New Division Operator to Handle Complex Objects in Very Large Relational Datasets
Andre Gonzaga, Robson Leonardo Ferreira Cordeiro
pp. 474-477
Protecting Location Privacy in Spatial Crowdsourcing using Encrypted Data
Bozhong Liu, Ling Chen, Xingquan Zhu, Ying Zhang, Chengqi Zhang, Weidong Qiu
pp. 478-481
Break the Windows: Explicit State Management for Stream Processing Systems
Alessandro Margara, Daniele Dell'Aglio, Abraham Bernstein
pp. 482-485
Stability notions in synthetic graph generation: a preliminary study
Wilco van Leeuwen, Angela Bonifati, George Fletcher, Nikolay Yakovets
pp. 486-489
Big Spatial Data Processing Frameworks: Feature and Performance Evaluation
Stefan Hagedorn, Philipp Götze, Kai-Uwe Sattler
pp. 490-493
Implementation and Evaluation of Genome Type Processing for Disease-Causal Gene Studies on DBMS
Yoshifumi Ujibashi, Lilian Harada
pp. 494-497
Authority-based Team Discovery in Social Networks
Morteza Zihayat, Aijun An, Lukasz Golab, Mehdi Kargar, Jaroslaw Szlichta
pp. 498-501
Correlation-Aware Distance Measures for Data Series
Katsiaryna Mirylenka, Michele Dallachiesa, Themis Palpanas
pp. 502-505
Load balancing for Key Value Data Stores
Ainhoa Azqueta-Alzúaz, Ivan Brondino, Marta Patino-Martinez, Ricardo Jimenez-Peris
pp. 506-509
Entity Matching on Web Tables: a Table Embeddings approach for Blocking
Anna Lisa Gentile, Petar Ristoski, Steffen Eckel, Dominique Ritze, Heiko Paulheim
pp. 510-513
Fast Subsequence Search on Time Series Data
Yuhong Li, Bo Tang, Leong Hou U, Man Lung Yiu, Zhiguo Gong
pp. 514-517
Progressive Recovery of Correlated Failures in Distributed Stream Processing Engines
Li Su, Yongluan Zhou
pp. 518-521
Optimal Obstructed Sequenced Route Queries in Spatial Databases
Anika Anwar, Tanzima Hashem
pp. 522-525