Advances in Database Technology - EDBT 2017, 20th International Conference on Extending Database Technology, Venice, Italy, March 21-24, 2017, Proceedings

Original publisher:, ISBN: 978-3-89318-073-8, Electronic Edition
Series ISSN: 2367-2005

Front Matter

Research Sessions

Graph Databases
Parallel Array-Based Single- and Multi-Source Breadth First Searches on Large Dense Graphs
Moritz Kaufmann, Manuel Then, Alfons Kemper, Thomas Neumann   pp. 1-12
GraphCache: A Caching System for Graph Queries
Jing Wang, Nikos Ntarmos, Peter Triantafillou   pp. 13-24
Subgraph Querying with Parallel Use of Query Rewritings and Alternative Algorithms
Foteini Katsarou, Nikos Ntarmos, Peter Triantafillou   pp. 25-36
Reverse k-Ranks Queries on Large Graphs
Yuqiu Qian, Hui Li, Nikos Mamoulis, Yu Liu, David Cheung   pp. 37-48
Main Memory DB and Emerging Hardware
Analytics on Fast Data: Main-Memory Database Systems versus Modern Streaming Systems
Andreas Kipf, Varun Pandey, Jan Böttcher, Lucas Braun, Thomas Neumann, Alfons Kemper   pp. 49-60
Self-managed collections: Off-heap memory management for scalable query-dominated collections
Fabian Nagel, Gavin Bierman, Aleksandar Dragojevic, Stratis Viglas   pp. 61-71
Lightweight Data Compression Algorithms: An Experimental Survey (Experiments and Analyses)
Patrick Damme, Dirk Habich, Juliana Hildebrandt, Wolfgang Lehner   pp. 72-83
SQL- and Operator-centric Data Analytics in Relational Main-Memory Databases
Linnea Passing, Manuel Then, Nina Hubig, Harald Lang, Michael Schreier, Stephan Günnemann, Alfons Kemper, Thomas Neumann   pp. 84-95
Data Mining
Data Exploration with SQL using Machine Learning Techniques
Julien Cumin, Jean-Marc Petit, Vasile-Marian Scuturici, Sabina Surdu   pp. 96-107
Task-Optimized Group Search for Social Internet of Things
Chih-Ya Shen, Hong-Han Shuai, Kuo-Feng Hsu, Ming-Syan Chen   pp. 108-119
Finding Socio-Textual Associations Among Locations
Paras Mehta, Dimitris Sacharidis, Dimitrios Skoutas, Agnes Voisard   pp. 120-131
COP: Planning Conflicts for Faster Parallel Transactional Machine Learning
Faisal Nawab, Divy Agrawal, Amr El Abbadi, Sanjay Chawla   pp. 132-143
CEP and Streaming
ChronicleDB: A High-Performance Event Store
Marc Seidemann, Bernhard Seeger   pp. 144-155
EXstream: Explaining Anomalies in Event Stream Monitoring
Haopeng Zhang, Yanlei Diao, Alexandra Meliou   pp. 156-167
Real Time Contextual Summarization of Highly Dynamic Data Streams
Manoj Agarwal, Krithi Ramamritham   pp. 168-179
An Effective and Efficient Truth Discovery Framework over Data Streams
Tianyi Li, Yu Gu, Xiangmin Zhou, Qian Ma, Ge Yu   pp. 180-191
Data Warehousing and Integration
Maritime data integration and analysis: recent progress and research challenges
Christophe Claramunt, Cyril Ray, Elena Camossi, Anne-Laure Jousselme, Melita Hadzagic, Gennady Andrienko, Natalia Andrienko, Yannis Theodoridis, George Vouros, Loïc Salmon   pp. 192-197
Matching Web Tables To DBpedia - A Feature Utility Study
Dominique Ritze, Christian Bizer   pp. 210-221
Schema Inference for Massive JSON Datasets
Mohamed-Amine Baazizi, Houssem Ben Lahmar, Dario Colazzo, Giorgio Ghelli, Carlo Sartiani   pp. 222-233
Social Networks and Semantic Web
Motivation-Aware Task Assignment in Crowdsourcing
Julien Pilourdault, Sihem Amer-Yahia, Dongwon Lee, Senjuti Basu Roy   pp. 246-257
A Probabilistic Framework for Estimating Pairwise Distances Through Crowdsourcing
Habibur Rahman, Senjuti Basu Roy, Gautam Das   pp. 258-269
Information Propagation in Interaction Networks
Rohit Kumar, Toon Calders   pp. 270-281
Indexing and Query Processing
Flexible Caching in Trie Joins
Oren Kalinsky, Yoav Etsion, Benny Kimelfeld   pp. 282-293
Querying Improvement Strategies
Guolei Yang, Ying Cai   pp. 294-305
Grid-Index Algorithm for Reverse Rank Queries
Yuyang Dong, Hanxiong Chen, Jeffrey Xu Yu, Kazutaka Furuse, Hiroyuki Kitagawa   pp. 306-317
Parallel and Distributed Processing of Spatial Preference Queries using Keywords
Christos Doulkeridis, Akrivi Vlachou, Dimitris Mpestas, Nikos Mamoulis   pp. 318-329
Heterogeneous Databases
Continuous Imputation of Missing Values in Streams of Pattern-Determining Time Series
Kevin Wellenzohn, Michael Böhlen, Anton Dignös, Johann Gamper, Hannes Mitterer   pp. 330-341
Data-driven Schema Normalization
Thorsten Papenbrock, Felix Naumann   pp. 342-353
Towards Interactive Debugging of Rule-based Entity Matching
Fatemah Panahi, Wentao Wu, AnHai Doan, Jeffrey Naughton   pp. 354-365
Convergent Interactive Inference with Leaky Joins
Ying Yang, Oliver Kennedy   pp. 366-377
Spatial, Temporal, and Geographic Databases
Efficient Motif Discovery in Spatial Trajectories Using Discrete Fréchet Distance
Bo Tang, Man Lung Yiu, Kyriakos Mouratidis, Kai Wang   pp. 378-389
Group Trip Scheduling (GTS) Queries in Spatial Databases
Roksana Jahan, Tanzima Hashem, Sukarna Barua   pp. 390-401
Towards Efficient Maintenance of Continuous MaxRS Query for Trajectories
Muhammed Mas-ud Hussain, Kazi Ashik Islam, Goce Trajcevski, Mohammed Eunus Ali   pp. 402-413
Exact and Approximate Algorithms for Finding k-Shortest Paths with Limited Overlap
Theodoros Chondrogiannis, Panagiotis Bouros, Johann Gamper, Ulf Leser   pp. 414-425
Efficient Parallel Spatial Skyline Evaluation Using MapReduce
Wenlu Wang, Ji Zhang, MinTe Sun, Wei-Shinn Ku   pp. 426-437

Demonstration Sessions

I²: Interactive Real-Time Visualization for Streaming Data
Jonas Traub, Nikolaas Steenbergen, Philipp Grulich, Tilmann Rabl, Volker Markl   pp. 526-529
HDM: Optimized Big Data Processing with Data Provenance
Dongyao Wu, Sherif Sakr, Liming Zhu   pp. 530-533
ChaseFUN: a Data Exchange Engine for Functional Dependencies at Scale
Angela Bonifati, Ioana Ileana, Michele Linardi   pp. 534-537
GnosisMiner: Reading Order Recommendations over Document Collections
Georgia Koutrika, Alkis Simitsis, Yannis Ioannidis   pp. 538-541
MovieFinder: A Movie Search System via Graph Pattern Matching
Xin Wang, Chengye Yu, Enyang Zhang, Tong Du   pp. 542-545
VAT: A System for Data-Driven Biodiversity Research
Christian Beilschmidt, Johannes Drönner, Michael Mattig, Bernhard Seeger   pp. 546-549
Come and crash our database! -- Instant recovery in action
Caetano Sauer, Gilson Souza, Goetz Graefe, Theo Härder   pp. 554-557
μTOP: Spatio-Temporal Detection and Summarization of Locally Trending Topics in Microblog Posts
Paras Mehta, Manuel Kotlarski, Dimitrios Skoutas, Dimitris Sacharidis, Kostas Patroumpas, Agnes Voisard   pp. 558-561
Insights into the Comparative Evaluation of Lightweight Data Compression Algorithms
Patrick Damme, Dirk Habich, Juliana Hildebrandt, Wolfgang Lehner   pp. 562-565
Multi-workflow optimization in PAW
Maxim Filatov, Verena Kantere   pp. 566-569
Efficient spatio-temporal event processing with STARK
Stefan Hagedorn, Timo Räth   pp. 570-573
Context-Aware Proactive Personalization of Linear Audio Content
Paolo Casagranda, Maria Luisa Sapino, K. Selcuk Candan   pp. 574-577
In Search for Relevant, Diverse and Crowd-screen Points of Interests
Xiaoyu Ge, Samanvoy Panati, Konstantinos Pelechrinis, Panos Chrysanthis, Mohamed Sharaf   pp. 578-581
SIMDMS: Data Management and Analysis to Support Decision Making through Large Simulation Ensembles
Silvestro Poccia, Maria Luisa Sapino, Sicong Liu, Xilun Chen, Yash Garg, Shengyu Huang, Jung Kim, Xinsheng Li, Parth Nagarkar, K. Selcuk Candan   pp. 582-585
In-Place Appends for Real: DBMS Overwrites on Flash without Erase
Sergey Hardock, Ilia Petrov, Robert Gottstein, Alejandro Buchmann   pp. 586-589
CAESAR: Context-Aware Event Stream Analytics for Urban Transportation Services
Olga Poppe, Chuan Lei, Elke Rundensteiner, Dan Dougherty, Goutham Deva, Nicholas Fajardo, James Owens, Thomas Schweich, MaryAnn VanValkenburg, Sarun Paisarnsrisomsuk, Pitchaya Wiratchotisatian, George Gettel, Robert Hollinger, Devin Roberts, Daniel Tocco   pp. 590-593
Building Multi-Resolution Event-Enriched Maps From Social Data
Faizan Ur Rehman, Imad Afyouni, Ahmed Lbath, Sohaib Khan, Saleh Basalamah, Mohamed Mokbel   pp. 594-597

Tutorials

Declarative Graph Querying in Practice and Theory
George Fletcher, Hannes Voigt, Nikolay Yakovets   pp. 598-601
Multi-model Data Management: What's New and What's Next?
Jiaheng Lu, Irena Holubová   pp. 602-605
Data Security and Privacy for Outsourced Data In the Cloud
Cetin Sahin, Amr El Abbadi   pp. 606-609

Industry and Applications

Industrial and Applications Paper
Improving Company Recognition from Unstructured Text by using Dictionaries
Michael Loster, Zuo Zhe, Felix Naumann, Oliver Maspfuhl, Dirk Thomas   pp. 610-619
Temporal group linkage and evolution analysis for census data
Victor Christen, Anika Groß, Jeffrey Fisher, Qing Wang, Peter Christen, Erhard Rahm   pp. 620-631
In-DBMS Sampling-based Sub-trajectory Clustering
Nikos Pelekis, Panagiotis Tampakis, Marios Vodas, Costas Panagiotakis, Yannis Theodoridis   pp. 632-643
Powering Archive Store Query Processing via Join Indices
Joseph D'silva, Bettina Kemme, Richard Grondin, Evgueni Fadeitchev   pp. 644-655
RDF Keyword-based Query Technology Meets a Real-World Dataset
Grettel García, Yenier Izquierdo, Elisa Menendez, Frederic Dartayre, Marco Casanova   pp. 656-667
Hi-WAY: Execution of Scientific Workflows on Hadoop YARN
Marc Bux, Jörgen Brandt, Carl Witt, Jim Dowling, Ulf Leser   pp. 668-679
Buddy Instance - A Mechanism for Increasing Availability in Shared-Disk Clusters
Anjan Kumar Amirishetty, Yunrui Li, Tolga Yurek, Mahesh Girkar, Wilson Chan, Graham Ivey, Vsevolod Panteleenko, Ken Wong   pp. 680-686
Herding the elephants: Workload-level optimization strategies for Hadoop
Sandeep Akinapelli, Ravi Shetye, Sangeeta T   pp. 699-710

Poster Sessions

Poster Paper
Top-k Skyline Groups Queries
Haoyang Zhu, Peidong Zhu, Xiaoyong Li, Qiang Liu   pp. 442-445
Towards sequenced semantics for evolving graphs
Vera Zaychik Moffitt, Julia Stoyanovich   pp. 446-449
Crowdsourcing Strategies for Text Creation Tasks
Ria Mae Borromeo, Maha Alsayasneh, Sihem Amer-Yahia, Vincent Leroy   pp. 450-453
SPST-Index: A Self-Pruning Splay Tree Index for Caching Database Cracking
Pedro Holanda, Eduardo Cunha de Almeida   pp. 458-461
In-Memory Spatial Join: The Data Matters!
Sadegh Nobari, Qiang Qu, Christian Jensen   pp. 462-465
Fairness and Transparency in Crowdsourcing
Ria Mae Borromeo, Thomas Laurent, Motomichi Toyama, Sihem Amer-Yahia   pp. 466-469
TASWEET: Optimizing Disjunctive Path Queries in Graph Databases
Zahid Abul-Basher, Nikolay Yakovets, Parke Godfrey, Shadi Ghajar-Khosravi, Mark Chignell   pp. 470-473
A New Division Operator to Handle Complex Objects in Very Large Relational Datasets
Andre Gonzaga, Robson Leonardo Ferreira Cordeiro   pp. 474-477
Protecting Location Privacy in Spatial Crowdsourcing using Encrypted Data
Bozhong Liu, Ling Chen, Xingquan Zhu, Ying Zhang, Chengqi Zhang, Weidong Qiu   pp. 478-481
Break the Windows: Explicit State Management for Stream Processing Systems
Alessandro Margara, Daniele Dell'Aglio, Abraham Bernstein   pp. 482-485
Stability notions in synthetic graph generation: a preliminary study
Wilco van Leeuwen, Angela Bonifati, George Fletcher, Nikolay Yakovets   pp. 486-489
Big Spatial Data Processing Frameworks: Feature and Performance Evaluation
Stefan Hagedorn, Philipp Götze, Kai-Uwe Sattler   pp. 490-493
Authority-based Team Discovery in Social Networks
Morteza Zihayat, Aijun An, Lukasz Golab, Mehdi Kargar, Jaroslaw Szlichta   pp. 498-501
Correlation-Aware Distance Measures for Data Series
Katsiaryna Mirylenka, Michele Dallachiesa, Themis Palpanas   pp. 502-505
Load balancing for Key Value Data Stores
Ainhoa Azqueta-Alzúaz, Ivan Brondino, Marta Patino-Martinez, Ricardo Jimenez-Peris   pp. 506-509
Entity Matching on Web Tables: a Table Embeddings approach for Blocking
Anna Lisa Gentile, Petar Ristoski, Steffen Eckel, Dominique Ritze, Heiko Paulheim   pp. 510-513
Fast Subsequence Search on Time Series Data
Yuhong Li, Bo Tang, Leong Hou U, Man Lung Yiu, Zhiguo Gong   pp. 514-517