Original publisher: OpenProceedings.org, ISBN: 978-3-89318-081-3, Electronic Edition
Series ISSN: 2367-2005
Front Matter
Research Papers
Umzi: Unified Multi-Zone Indexing for Large-Scale HTAP
Chen Luo, Pinar Tozun, Yuanyuan Tian, Ronald Barber, Vijayshankar Raman, Richard Sidle
pp. 1-12
A Highly Scalable Labelling Approach for Exact Distance Queries in Complex Networks
Muhammad Farhan, Qing Wang, Yu Lin, Brendan Mckay
pp. 13-24
Stratified Random Sampling over Streaming and Stored Data
Trong Nguyen, Ming-Hung Shih, Divesh Srivastava, Srikanta Tirthapura, Bojian Xu
pp. 25-36
Leveraging Bitmap Indexing for Subgraph Searching
David Luaces, José R.R. Viqueira, Tomás F. Pena, José M. Cotos
pp. 49-60
Spec-QP: Speculative Query Planning for Joins over Knowledge Graphs
Madhulika Mohanty, Maya Ramanath, Mohamed Yahya, Gerhard Weikum
pp. 61-72
Iterative Estimation of Mutual Information with Error Bounds
Michael Vollmer, Klemens Böhm
pp. 73-84
Functional Geometric Monitoring for Distributed Streams
Vasileios Samoladas, Minos Garofalakis
pp. 85-96
Efficient Window Aggregation with General Stream Slicing.
Jonas Traub, Philipp M. Grulich, Alejandro Rodriguez Cuellar, Sebastian Breß, Asterios Katsifodimos, Tilmann Rabl, Volker Markl
pp. 97-108
An Efficient Sliding Window Approach for Approximate Entity Extraction with Synonyms
Jin Wang, Chunbin Lin, Mingda Li, Carlo Zaniolo
pp. 109-120
Attendance Maximization for Successful Social Event Planning
Nikos Bikakis, Vana Kalogeraki, Dimitrios Gunopulos
pp. 121-132
GroupTravel: Customizing Travel Packages for Groups
Sihem Amer-Yahia, Shady Elbassuoni, Behrooz Omidvar-Tehrani, Ria Mae Borromeo, Mehrdad Farokhnejad
pp. 133-144
SEP2P: Secure and Efficient P2P Personal Data Processing
Julien Loudet, Iulian Sandu-Popa, Luc Bouganim
pp. 145-156
Indexing Trajectories for Travel-Time Histogram Retrieval
Robert Waury, Christian S. Jensen, Satoshi Koide, Yoshiharu Ishikawa, Chuan Xiao
pp. 157-168
BB-Tree: A practical and efficient main-memory index structure for multidimensional workloads
Stefan Sprenger, Patrick Schäfer, Ulf Leser
pp. 169-180
Semantic and Influence aware k-Representative Queries over Social Streams
Yanhao Wang, Yuchen Li, Kian-Lee Tan
pp. 181-192
Comparative Analysis of Content-based Personalized Microblog Recommendations
Efi Karra Taniskidou, George Papadakis, George Giannakopoulos, Manolis Koubarakis
pp. 193-204
Crowdsourced Truth Discovery in the Presence of Hierarchies for Knowledge Fusion
Woohwan Jung, Younghoon Kim, Kyuseok Shim
pp. 205-216
A Utility-Preserving and Scalable Technique for Protecting Location Data with Geo-Indistinguishability
Ritesh Ahuja, Gabriel Ghinita, Cyrus Shahabi
pp. 217-228
Inves: Incremental Partitioning-Based Verification for Graph Similarity Search
Jongik Kim, Dong-Hoon Choi, Chen Li
pp. 229-240
Flow Motifs in Interaction Networks
Chrysanthi Kosyfaki, Nikos Mamoulis, Evaggelia Pitoura, Panayiotis Tsaparas
pp. 241-252
DynFD: Functional Dependency Discovery in Dynamic Datasets
Philipp Schirmer, Thorsten Papenbrock, Sebastian Kruse, Felix Naumann, Dennis Hempfing, Torben Mayer, Daniel Neuschäfer-Rube
pp. 253-264
CLX: Towards verifiable PBE data transformation
Zhongjun Jin, Michael Cafarella, H. V. Jagadish, Sean Kandel, Michael Minar, Joseph M. Hellerstein
pp. 265-276
Reverse-Engineering Conjunctive Queries from Provenance Examples
Daniel Deutch, Amir Gilad
pp. 277-288
A Six-dimensional Analysis of In-memory Aggregation
Puya Memarzia, Suprio Ray, Virendra C. Bhavsar
pp. 289-300
BionicDB: Fast and Power-Efficient OLTP on FPGA
Kangnyeon Kim, Ryan Johnson, Ippokratis Pandis
pp. 301-312
Hyrise Re-engineered: An Extensible Database System for Research in Relational In-Memory Data Management
Markus Dreseler, Jan Kossmann, Martin Boissier, Stefan Klauck, Matthias Uflacker, Hasso Plattner
pp. 313-324
Efficient Computation of Probabilistic Core Decomposition at Web-Scale
Fatemeh Esfahani, Venkatesh Srinivasan, Alex Thomo, Kui Wu
pp. 325-336
Efficient Network Reliability Computation in Uncertain Graphs
Yuya Sasaki, Yasuhiro Fujiwara, Makoto Onizuka
pp. 337-348
Scalable Parallelization of RDF Joins on Multicore Architectures
Dimitris Bilidas, Manolis Koubarakis
pp. 349-360
Pivoted Subgraph Isomorphism: The Optimist, the Pessimist and the Realist
Ehab Abdelhamid, Ibrahim Abdelaziz, Zuhair Khayyat, Panos Kalnis
pp. 361-372
MinoanER: Schema-Agnostic, Non-Iterative, Massively Parallel Resolution of Web Entities
Vasilis Efthymiou, George Papadakis, Kostas Stefanidis, Vassilis Christophides
pp. 373-384
Extending Cross-Domain Knowledge Bases with Long Tail Entities using Web Table Data
Yaser Oulabi, Christian Bizer
pp. 385-396
Continuous Deployment of Machine Learning Pipelines
Behrouz Derakhshan, Alireza Rezaei Mahdiraji, Tilmann Rabl, Volker Markl
pp. 397-408
Discovering Order Dependencies through Order Compatibility
Cristian Consonni, Paolo Sottovia, Alberto Montresor, Yannis Velegrakis
pp. 409-420
+ Errata
+ Errata
Scalable Kernel Density Estimation-based Local Outlier Detection over Large Data Streams
Xiao Qin, Lei Cao, Elke Rundensteiner, Samuel Madden
pp. 421-432
Tutorial
RDF graph summarization: principles, techniques and applications
Haridimos Kondylakis, Dimitris Kotzinos, Ioana Manolescu
pp. 433-436
Schemas And Types For JSON Data
Mohamed-Amine Baazizi, Dario Colazzo, Giorgio Ghelli, Carlo Sartiani
pp. 437-439
Influence Maximization Revisited: The State of the Art and the Gaps that Remain
Akhil Arora, Sainyam Galhotra, Sayan Ranu
pp. 440-443
Industry and Applications Paper
Finding Meaningful Contrast Patterns for Quantitative Data
Rohan Khade, Jessica Lin, Nital Patel
pp. 444-455
Predicting "What is Interesting" by Mining Interactive-Data-Analysis Session Logs
Amit Somech, Tova Milo, Chai Ozeri
pp. 456-467
Hidden Layer Models for Company Representations and Product Recommendations
Katsiaryna Mirylenka, Paolo Scotton, Christoph Miksovic, Jeff Dillon
pp. 468-476
Big POI data integration with Linked Data technologies
Spiros Athanasiou, Giorgos Giannopoulos, Damien Graux, Nikos Karagiannakis, Jens Lehmann, Axel-Cyrille Ngonga Ngomo, Kostas Patroumpas, Mohamed Ahmed Sherif, Dimitrios Skoutas
pp. 477-488
Executing Entity Matching End to End: A Case Study
Pradap Konda, Sanjay Seshadri, Elan Segarra, Brent Hueth, Anhai Doan
pp. 489-500
The Copernicus App Lab project: Easy Access to Copernicus Data
Konstantina Bereta, Herve Caumont, Ulrike Daniels, Erwin Goor, Manolis Koubarakis, Despina-Athanasia Pantazi, George Stamoulis, Sam Ubels, Valentijn Venus, Firman Wahyudi
pp. 501-511
Modeling and Building IoT Data Platforms with Actor-Oriented Databases
Yiwen Wang, Julio Cesar Dos Reis, Kasper Myrtue Borggren, Marcos Antonio Vaz Salles, Claudia Bauzer Medeiros, Yongluan Zhou
pp. 512-523
Demonstration
The Power of SQL Lambda Functions
Maximilian Schüle, Dimitri Vorona, Linnea Passing, Harald Lang, Alfons Kemper, Stephan Günnemann, Thomas Neumann
pp. 534-537
MINARET: A Recommendation Framework for Scientific Reviewers
Sherif Sakr, Mohamed Ragab, Mohamed Maher, Ahmed Awad
pp. 538-541
SparkTune: tuning Spark SQL through query cost modeling
Enrico Gallinucci, Matteo Golfarelli
pp. 546-549
HOTMapper: Historical Open Data Table Mapper
Henrique Varella Ehrenfried, Rudolf Eckelberg, Hamer Iboshi, Eduardo Todt, Daniel Weingaertner, Marcos Didonet Del Fabro
pp. 550-553
SmartML: A Meta Learning-Based Framework for Automated Selection and Hyperparameter Tuning for Machine Learning Algorithms
Mohamed Maher, Sherif Sakr
pp. 554-557
devUDF: Increasing UDF development efficiency through IDE Integration. It works like a PyCharm!
Mark Raasveldt, Pedro Holanda, Stefan Manegold
pp. 558-561
ML2SQL - Compiling a Declarative Machine Learning Language to SQL and Python
Maximilian Schüle, Matthias Bungeroth, Dimitri Vorona, Alfons Kemper, Stephan Günnemann, Thomas Neumann
pp. 562-565
Incremental structural summarization of RDF graphs
Francois Goasdoue, Paweł Guzewicz, Ioana Manolescu
pp. 566-569
VISTA: A visual analytics platform for semantic annotation of trajectories
Amílcar Soares, Jordan Rose, Mohammad Etemad, Chiara Renso, Stan Matwin
pp. 570-573
SLIPO: Large-Scale Data Integration for Points of Interest
Spiros Athanasiou, Michail Alexakis, Giorgos Giannopoulos, Nikos Karagiannakis, Yannis Kouvaras, Pantelis Mitropoulos, Kostas Patroumpas, Dimitrios Skoutas
pp. 574-577
A Map Search System based on a Spatial Query Language
Yuanyuan Wang, Panote Siriaraya, Haruka Sakara, Yukiko Kawai, Keishi Tajima
pp. 578-581
FaiRank: An Interactive System to Explore Fairness of Ranking in Online Job Marketplaces
Ahmad Ghizzawi, Julien Marinescu, Shady Elbassuoni, Sihem Amer-Yahia, Gilles Bisson
pp. 582-585
MM-evolver: A Multi-model Evolution Management Tool
Michal Vavrek, Irena Holubova, Stefanie Scherzinger
pp. 586-589
Resense: Transparent Record and Replay of Sensor Data in the Internet of Things
Dimitrios Giouroukis, Julius Hülsmann, Janis von Bleichert, Morgan Geldenhuys, Tim Stullich, Felipe Gutierrez, Jonas Traub, Kaustubh Beedkar, Volker Markl
pp. 590-593
Improving Named Entity Recognition using Deep Learning with Human in the Loop
Ticiana Coelho Da Silva, Régis Magalhães, José de Macêdo, David Araújo, Natanael Araújo, Vinicius de Melo, Pedro Olímpio, Paulo Rego, Aloisio Neto
pp. 594-597
Demonstrating data collections curation and exploration with CURARE
Genoveva Vargas-Solar, Gavin Kemp, Irving Hernandez Gallegos, Javier-Alfonso Espinosa-Oviedo, Catarina Ferrera da Silva, Parisa Ghodous
pp. 598-601
SparkER: Scaling Entity Resolution in Spark
Luca Gagliardelli, Giovanni Simonini, Domenico Beneventano, Sonia Bergamaschi
pp. 602-605
Exploring Interpretable Features for Large Time Series with SE4TeC
Jingwei Zuo, Karine Zeitouni, Yehia Taher
pp. 606-609
Query-Driven Data Minimization with the DataEconomist
Peter K. Schwab, Julian Matschinske, Andreas M. Wahl, Klaus Meyer-Wegener
pp. 614-617
Short Paper
ITGC: Information-theoretic grid-based clustering
Sahar Behzadi Soheil, Hermann Hinterhauser, Claudia Plant
pp. 618-621
Adaptive Watermarks: A Concept Drift-based Approach for Predicting Event-Time Progress in Data Streams
Ahmed Awad, Jonas Traub, Sherif Sakr
pp. 622-625
Rock - Let the points roam to their clusters themselves
Anna Beer, Daniyal Kazempour, Thomas Seidl
pp. 630-633
Fast Trajectory Range Query with Discrete Frechet Distance
Jiahao Zhang, Bo Tang, Yiu Man Lung
pp. 634-637
Repairing of Record Linkage: Turning Errors into Insight
Quyen Bui-Nguyen, Qing Wang, Jingyu Shao, Dinusha Vatsalan
pp. 638-641
Streaming HyperCube: A Massively Parallel Stream Join Algorithm
Yuan Qiu, Serafeim Papadias, Ke Yi
pp. 642-645
Exploring Fairness of Ranking in Online Job Marketplaces
Shady Elbassuoni, Sihem Amer-Yahia, Christine El Atie, Ahmad Ghizzawi, Bilel Oualha
pp. 646-649
Recurrent Neural Networks for Dynamic User Intent Prediction in Human-Database Interaction
Venkata Vamsikrishna Meduri, Kanchan Chowdhury, Mohamed Sarwat
pp. 654-657
Optimal Algorithm for Profiling Dynamic Arrays with Finite Values
Dingcheng Yang, Wenjian Yu, Junhui Deng, Shenghua Liu
pp. 658-661
Publishing Differentially Private Datasets via Stable Microaggregation
Masooma Iftikhar, Qing Wang, Yu Lin
pp. 662-665
Range Query Processing for Monitoring Applications over Untrustworthy Clouds
Hoang Van Tran, Tristan Allard, Laurent D'Orazio, Amr El Abbadi
pp. 666-669
Towards Augmented Database Schemes by Discovery of Latent Visual Attributes
Tomas Grosup, Ladislav Peska, Tomas Skopal
pp. 670-673
Workload-Driven and Robust Selection of Compression Schemes for Column Stores
Martin Boissier, Max Jendruk
pp. 674-677
CLRL: Feature Engineering for Cross-Language Record Linkage
Öykü Özlem Çakal, Mohammad Mahdavi, Ziawasch Abedjan
pp. 678-681
Operational Stream Processing: Towards Scalable and Consistent Event-Driven Applications
Asterios Katsifodimos, Marios Fragkoulis
pp. 682-685
Metropolis-Hastings Algorithms for Estimating Betweenness Centrality
Mostafa Haghir Chehreghani, Talel Abdessalem, Albert Bifet
pp. 686-689
From Copernicus Big Data to Extreme Earth Analytics
Manolis Koubarakis, Konstantina Bereta, Dimitris Bilidas, Konstantinos Giannousis, Theofilos Ioannidis, Despina-Athanasia Pantazi, George Stamoulis, Jim Dowling, Seif Haridi, Vladimir Vlassov, Lorenzo Bruzzone, Claudia Paris, Torbjørn Eltoft, Thomas Krämer, Angelos Charalabidis, Vangelis Karkaletsis, Stasinos Konstantopoulos, Theofilos Kakantousis, Mihai Datcu, Corneliu Octavian Dumitru, Florian Appel, Heike Bach, Silke Migdall, Nick Hughes, David Arthurs, Andrew Fleming
pp. 690-693
Neuromorphic Hardware As Database Co-Processors: Potential and Limitations
Thomas Heinis
pp. 694-697
Query Driven Data Labeling with Experts: Why Pay Twice?
Eyal Dushkin, Shay Gershtein, Tova Milo, Slava Novgorodov
pp. 698-701
Insights into a running clockwork: On interactive process-aware clustering
Daniyal Kazempour, Thomas Seidlz
pp. 706-709
Interpolation-friendly B-trees: Bridging the Gap Between Algorithmic and Learned Indexes
Ali Hadian, Thomas Heinis
pp. 710-713
SynthEdit: Format transformations by example using edit operations
Alex Teodor Bogatu, Alvaro A. A. Fernandes, Norman W. Paton, Nikolaos Konstantinou
pp. 714-717
Triad Enumeration at Trillion-Scale Using a Single Commodity Machine
Yudi Santoso, Alex Thomo, Venkatesh Srinivasan, Sean Chester
pp. 718-721
Fast Truss Decomposition in Large-scale Probabilistic Graphs
Fatemeh Esfahani, Jian Wu, Venkatesh Srinivasan, Alex Thomo, Kui Wu
pp. 722-725