Original publisher: OpenProceedings.org, ISBN: 978-3-89318-070-7, Electronic Edition
Series ISSN: 2367-2005
Front Matter
Keynotes
Research Sessions
Social networks
Finding Users of Interest in Micro-blogging Systems
Camelia Constantin, Ryadh Dahimene, Quentin Grossetti, Cedric Du Mouza
pp. 5-16
Slowing the Firehose: Multi-Dimensional Diversity on Social Post Streams
Shiwen Cheng, Marek Chrobak, Vagelis Hristidis
pp. 17-28
Social, Structured and Semantic Search
Raphaël Bonaque, Bogdan Cautis, François Goasdoué, Ioana Manolescu
pp. 29-40
Graphs
Indexing Query Graphs to Speedup Graph Query Processing
Jing Wang, Nikos Ntarmos, Peter Triantafillou
pp. 41-52
Providing Serializability for Pregel-like Graph Processing Systems
Minyang Han, Khuzaima Daudjee
pp. 77-88
Exploratory search
DBExplorer: Exploratory Search in Databases
Manish Singh, Michael Cafarella, Hosagrahar Visvesvar Jagadish
pp. 89-100
Refinement Driven Processing of Aggregation Constrained Queries
Manasi Vartak, Venkatesh Raghavan, Elke Rundensteiner, Samuel Madden
pp. 101-112
CrowdSky: Skyline Computation with Crowdsourcing
Jongwuk Lee, Dongwon Lee, Sang-Wook Kim
pp. 125-136
Keyword search
Cohesive Keyword Search on Tree Data
Aggeliki Dimitriou, Ananya Dass, Dimitri Theodoratos, Yannis Vassiliou
pp. 137-148
Generic Keyword Search over XML Data
Manoj Agarwal, Krithi Ramamritham, Prashant Agarwal
pp. 149-160
Answering Keyword Queries involving Aggregates and GROUPBY on Relational Databases
Zhong Zeng, Mong Li Lee, Tok Wang Ling
pp. 161-172
Data mining and analytics
Finding All Maximal Cliques in Very Large Social Networks
Alessio Conte, Roberto De Virgilio, Antonio Maccioni, Maurizio Patrignani, Riccardo Torlone
pp. 173-184
RPM: Representative Pattern Mining for Efficient Time Series Classification
Xing Wang, Jessica Lin, Pavel Senin, Tim Oates, Sunil Gandhi, Arnold Boedihardjo, Crystal Chen, Susan Frankenstein
pp. 185-196
Interactive Temporal Association Analytics
Xiao Qin, Ramoza Ahsan, Xika Lin, Elke Rundensteiner, Matthew Ward
pp. 197-208
Integration: record linkage, data exchange, entity resolution
Efficient Record Linkage Using a Compact Hamming Space
Dimitrios Karapiperis, Dinusha Vatsalan, Vassilios Verykios, Peter Christen
pp. 209-220
Scaling Entity Resolution to Large, Heterogeneous Data with Enhanced Meta-blocking
George Papadakis, George Papastefanatos, Themis Palpanas, Manolis Koubarakis
pp. 221-232
Practical Query Answering in Data Exchange Under Inconsistency-Tolerant Semantics
Balder ten Cate, Richard Halpert, Phokion Kolaitis
pp. 233-244
Querying RDF data
Querying RDF Data Using A Multigraph-based Approach
Vijay Ingalalli, Dino Ienco, Pascal Poncelet, Serena Villata
pp. 245-256
Optimization of Complex SPARQL Analytical Queries
Padmashree Ravindra, HyeongSik Kim, Kemafor Anyanwu
pp. 257-268
RDF-TX: A Fast, User-Friendly System for Querying the History of RDF Knowledge Bases
Shi Gao, Jiaqi Gu, Carlo Zaniolo
pp. 269-280
Semantics, dependencies and relationships
Efficient Computation of Containment and Complementarity in RDF Data Cubes
Marios Meimaris, George Papastefanatos, Panos Vassiliadis, Ioannis Anagnostopoulos
pp. 281-292
Semi-automatic support for evolving functional dependencies
Mirjana Mazuran, Elisa Quintarelli, Letizia Tanca, Stefania Ugolini
pp. 293-304
Holistic Data Profiling: Simultaneous Discovery of Various Metadata
Jens Ehrlich, Mandy Roick, Lukas Schulze, Jakob Zwiener, Thorsten Papenbrock, Felix Naumann
pp. 305-316
Spatial data processing
Similarity Search on Spatio-Textual Point Sets
Christodoulos Efstathiades, Alexandros Belesiotis, Dimitrios Skoutas, Dieter Pfoser
pp. 329-340
Nearest Window Cluster Queries
Chen-Che Huang, Jiun-Long Huang, Tsung-Ching Liang, Jun-Zhe Wang, Wen-Yuah Shih, Wang-Chien Lee
pp. 341-352
Distribution and parallelism
Adaptive query parallelization in multi-core column stores
Mrunal Gawade, Martin Kersten
pp. 353-364
PARAGON: Parallel Architecture-Aware Graph Partition Refinement Algorithm
Angen Zheng, Alexandros Labrinidis, Patrick Pisciuneri, Panos Chrysanthis, Peyman Givi
pp. 365-376
Query Workload-based RDF Graph Fragmentation and Allocation
Peng Peng, Lei Zou, Lei Chen, Dongyan Zhao
pp. 377-388
Advanced query processing: Non-traditional data and streams
Efficient Query Processing using the Earth’s Mover Distance in Video Databases
Merih Seran Uysal, Christian Beecks, Daniel Sabinasz, Jochen Schmuecking, Thomas Seidl
pp. 389-400
Probabilistic Threshold Indexing for Uncertain Strings
Sudip Biswas, Manish Patil, Sharma Thankachan, Rahul Shah
pp. 401-412
Context-Aware Event Stream Analytics
Olga Poppe, Chuan Lei, Elke Rundensteiner, Dan Dougherty
pp. 413-424
Spatial and mobile data
Who Cares about Others’ Privacy: Personalized Anonymization of Moving Object Trajectories
Despina Kopanaki, Vasilis Theodossopoulos, Nikos Pelekis, Ioannis Kopanakis, Yannis Theodoridis
pp. 425-436
Identifying and Describing Streets of Interest
Dimitrios Skoutas, Dimitris Sacharidis, Kostas Stamatoukos
pp. 437-448
Finding Frequently Visited Indoor POIs Using Symbolic Indoor Tracking Data
Hua Lu, Chenjuan Guo, Bin Yang, Chrisitian Jensen
pp. 449-460
Visionary Sessions
Visionary
Designing Access Methods: The RUM Conjecture
Manos Athanassoulis, Michael Kester, Lukas Maas, Radu Stoica, Stratos Idreos, Anastasia Ailamaki, Mark Callaghan
pp. 461-466
Self-Curating Databases
Mohammad Sadoghi, Kavitha Srinivas, Oktie Hassanzadeh, Yuan-Chi Chang, Mustafa Canim, Achille Fokoue, Yishai Feldman
pp. 467-472
Data Wrangling for Big Data: Challenges and Opportunities
Tim Furche, Georg Gottlob, Leonid Libkin, Giorgio Orsi, Norman Paton
pp. 473-478
Road to Freedom in Big Data Analytics
Divy Agrawal, Sanjay Chawla, Ahmed Elmagarmid, Zoi Kaoudi, Mourad Ouzzani, Paolo Papotti, Jorge Quiane, Nan Tang, Mohammed Zaki
pp. 479-484
Data Management for Next Generation Genomic Computing
Stefano Ceri, Abdulrahaman Kaitoua, Marco Masseroli, Pietro Pinoli, Francesco Venco
pp. 485-490
Demonstration Sessions
Demonstration Track
GROM: a General Rewriter of Semantic Mappings
Giansalvatore Mecca, Guillem Rull, Donatello Santoro, Ernest Teniente
pp. 592-595
PowerQ: An Interactive Keyword Search Engine for Aggregate Queries on Relational Databases
Zhong Zeng, Mong Li Lee, Tok Wang Ling
pp. 596-599
Visualization Through Inductive Aggregation
Parke Godfrey, Jarek Gryz, Piotr Lasek, Nasim Razavi
pp. 600-603
Contextual Event Search: Finding Contextual Messages in Dynamic microblog Data Stream in Real Time
Manoj Agarwal, Divyam Bansal, Mridul Garg, Krithi Ramamritham
pp. 604-607
Answering Controlled Natural Language Questions on RDF Knowledge Bases
Giuseppe Mazzeo, Carlo Zaniolo
pp. 608-611
tPredictor: A Micro-blog Based System for Teenagers' Stress Prediction
Jing Huang, Qi Li, Zhuonan Feng, Yiping Li, Ling Feng
pp. 612-615
OSNI: Searching for Needles in a Haystack of Social Network Data
Shiwen Cheng, James Fang, Vagelis Hristidis, Harsha Madhyastha, Niluthpol Chowdhury Mithun, Dorian Perkins, Amit Roy-Chowdhury, Moloud Shahbazi, Vassilis Tsotras
pp. 616-619
PROX: Approximated Summarization of Data Provenance
Eleanor Ainy, Pierre Bourhis, Susan Davidson, Daniel Deutch, Tova Milo
pp. 620-623
streamLoader: An Event-Driven ETL System for the On-line Processing of Heterogeneous Sensor Data
Marco Mesiti, Luca Ferrari, Stefano Valtolina, Giacomo Licari, GIanluca Galliani, Minh-San Dao, Koji Zettsu
pp. 628-631
TINTIN: a Tool for INcremental INTegrity checking of Assertions in SQL Server
Xavier Oriol, Ernest Teniente, Guillem Rull
pp. 632-635
Efficient regular path query evaluation using path indexes
George Fletcher, Jeroen Peters, Alexandra Poulovassilis
pp. 636-639
Galaxy: A Platform for Explorative Analysis of Open Data Sources
Seyed-Mehdi-Reza Beheshti, Boualem Benatallah, Hamid Reza Motahari Nezhad
pp. 640-643
OAPT: A Tool for Ontology Analysis and Partitioning
Alsayed Algergawy, Samira Babalou, Friederike Klan, Birgitta König-Ries
pp. 644-647
ShapeExplorer: Querying and Exploring Shapes using Visual Knowledge
Tong Ge, Yafang Wang, Gerard de Melo, Zengguang Hao, Andrei Sharf, Baoquan Chen
pp. 648-651
Distributed Secure Search in the Personal Cloud
Thu Le, Nicolas Anciaux, Sebastien Guilloton, Saliha Lallali, Philippe Pucheral, Iulian Sandu Popa, Chao Chen
pp. 652-655
Tutorials
Tutorials
Data Responsibly: Fairness, Neutrality and Transparency in Data Analysis
Julia Stoyanovich, Serge Abiteboul, Gerome Miklau
pp. 718-719
Core Decomposition in Graphs: Concepts, Algorithms and Applications
Fragkiskos D. Malliaros, Apostolos N. Papadopoulos, Michalis Vazirgiannis
pp. 720-721
Industry and Applications
Industrial and Applications Sessions
Exploring Text Classification for Messy Data: An Industry Use Case for Domain-Specific Analytics
Laura Kassner, Bernhard Mitschang
pp. 491-502
Discovering Correlations in Annotated Databases
Xuebin He, Stephen Donohue, Mohamed Eltabakh
pp. 503-514
Query Performance Problem Determination with Knowledge Base in Semantic Web System OptImatch
Guilherme Damasio, Piotr Mierzejewski, Jaroslaw Szlichta, Calisto Zuzarte
pp. 515-526
Characterizing Home Device Usage From Wireless Traffic Time Series
Katsiaryna Mirylenka, Vassilis Christophides, Themis Palpanas, Ioannis Pefkianakis, Martin May
pp. 539-550
Parallel Duplicate Detection in Adverse Drug Reaction Databases with Spark
Chen Wang, Sarvnaz Karimi
pp. 551-562
e#: Sharper Expertise Detection from Microblogs
Thibault Sellam, Martin Hentschel, Vasilis Kandylas, Omar Alonso
pp. 563-572
DECT: Distributed Evolving Context Tree for Mining Web Behavior Evolution
Xiaokui Shu, Nikolay Laptev, Danfeng Yao
pp. 573-579
Strudel: A Framework for Transaction Performance Analyses on SQL/NoSQL Systems
Junichi Tatemura, Oliver Po, Zheng Li, Hakan Hacigumus
pp. 580-591
Poster Sessions
Poster
Quantifying Likelihood of Change through Update Propagation across Top-k Rankings
Evica Milchevski, Sebastian Michel
pp. 660-661
Optimizing B+-Tree for PCM-Based Hybrid Memory
Lu Li, Peiquan Jin, Chengcheng Yang, Zhanglin Wu, Lihua Yue
pp. 662-663
A Data Mining Approach to Choosing Categorical Attributes for Ranked Lists
Koninika Pal, Sebastian Michel
pp. 664-665
Efficient Implementation of Joins over Cassandra DBs
Haridimos Kondylakis, Antonis Fountouris, Dimitris Plexousakis
pp. 666-667
Double Chain-Star: an RDF indexing scheme for fast processing of SPARQL joins
Marios Meimaris, George Papastefanatos
pp. 668-669
Minoan ER: Progressive Entity Resolution in the Web of Data
Vasilis Efthymiou, Kostas Stefanidis, Vassilis Christophides
pp. 670-671
Proposal of a Database Type and Aggregation Function for Accelerating Medical Genomics Study on RDBMS
Yoshifumi Ujibashi, Motoyuki Kawaba, Lilian Harada
pp. 672-673
A Way to Automatically Enrich Biomedical Ontologies
Juan Antonio Lossio-Ventura, Mathieu Roche, Clement Jonquet, Maguelonne Teisseire
pp. 676-677
Sweet KIWI: Statistics-Driven OLAP Acceleration using Query Column Sets
Sung-Soo Kim, Taewhi Lee, Moonyoung Chung, Jongho Won
pp. 680-681
On-Line Mobility Pattern Discovering using Trajectory Data
Ticiana Coelho da Silva, Karine Zeitouni, José Fernandes de Macêdo, Marco Casanova
pp. 682-683
Summarizing Linked Data RDF Graphs Using Approximate Graph Pattern Mining
Mussab Zneika, Claudio Lucchese, Dan Vodislav, Dimitris Kotzinos
pp. 684-685
Understanding Customer Attrition at an Individual Level: a New Model in Grocery Retail Context
Clément Gautrais, Peggy Cellier, Thomas Guyet, René Quiniou, Alexandre Termier
pp. 686-687
Towards an Efficient Ranking of Interval-Based Patterns
Marwan Hassani, Yifeng Lu, Thomas Seidl
pp. 688-689
SOFYA: Semantic on-the-fly Relation Alignment
Maria Koutraki, Nicoleta Preda, Dan Vodislav
pp. 690-691
Model Kit for Lightweight Data Compression Algorithms
Juliana Hildebrandt, Dirk Habich, Patrick Damme, Wolfgang Lehner
pp. 692-693
Revisiting DBMS Space Management for Native Flash
Sergey Hardock, Ilia Petrov, Robert Gottstein, Alejandro Buchmann
pp. 694-695
A Two Phase Deep Learning Model for Identifying Discrimination from Tweets
Shuhan Yuan, Xintao Wu, Yang Xiang
pp. 696-697
Top-k Dominating Queries, in Parallel, in Memory
Sean Chester, Orestis Gkorgkas, Kjetil Nørvåg
pp. 698-699
Snapshot Isolation for Neo4j
Marta Patino, Ricardo Jimenez-PEris, Diego Burgos-Sancho, Ivan Brondino, Valerio Vianello, Rohit Dhamane
pp. 700-701
An On-Line Approximation Algorithm for Mining Frequent Closed Itemsets Based on Incremental Intersection
Koji Iwanuma, Yoshitaka Yamamoto, Shoshi Fukuda
pp. 704-705
Extending Database Accelerators for Data Transformations and Predictive Analytics
Felix Beier, Knut Stolze, Daniel Martin
pp. 706-707
Privacy Protection through Query Rewriting in Smart Environments
Hannes Grunert, Andreas Heuer
pp. 708-709
DatShA :A Data Sharing Algebra for access control plans
Luc Bouganim, Athanasia Katsouraki, Benjamin Nguyen
pp. 710-711
Empirical evaluation of guarded structural indexing
Erik Agterdenbos, George Fletcher, Chee-Yong Chan, Stijn Vansummeren
pp. 714-715
Context-Dependent Quality-Aware Source Selection for Live Queries on Linked Data
Barbara Catania, Giovanna Guerrini, Beyza Yaman
pp. 716-717