Original publisher: OpenProceedings.org, ISBN: 978-3-89318065-3, Electronic Edition
Front Matter
Research Sessions
Hadoop Optimization
PStorM: Profile Storage and Matching for Feedback-Based Tuning of MapReduce Jobs.
Mostafa Ead, Herodotos Herodotou, Ashraf Aboulnaga, Shivnath Babu
pp. 1-12
Optimization Techniques for "Scaling Down" Hadoop on Multi-Core, Shared-Memory Systems.
K. Ashwin Kumar, Jonathan Gluck, Amol Deshpande, Jimmy Lin
pp. 13-24
Redoop: Supporting Recurring Queries in Hadoop.
Chuan Lei, Elke A. Rundensteiner, Mohamed Y. Eltabakh
pp. 25-36
MapReduce Computation
Efficient Skyline Computation in MapReduce.
Kasper Mullesgaard, Jens Laurits Pederseny, Hua Lu, Yongluan Zhou
pp. 37-48
Projected Clustering for Huge Data Sets in MapReduce.
Sergej Fries, Stephan Wels, Thomas Seidl
pp. 49-60
Efficient and Flexible Index Access in MapReduce.
Zhao Cao, Shimin Chen, Dongzhe Ma, Jianhua Feng, Min Wang
pp. 61-72
Steam and Parallel Processing
JISC: Adaptive Stream Processing Using Just-In-Time State Completion.
Ahmed M. Aly, Walid G. Aref, Mourad Ouzzani, Hosam M. Mahmoud
pp. 73-84
Adaptive Fault-Tolerance for Dynamic Resource Provisioning in Distributed Stream Processing Systems.
Paolo Bellavista, Antonio Corradi, Spyros Kotoulas, Andrea Reale
pp. 85-96
Adapting Tree Structures for Processing with SIMD Instructions.
Steffen Zeuch, Johann-Christoph Freytag, Frank Huber
pp. 97-108
Multi-Queries and Concurrent Queries
Contender: A Resource Modeling Approach for Concurrent Query Performance Prediction.
Jennie Duggan, Olga Papaemmanouil, Ugur Çetintemel, Eli Upfal
pp. 109-120
CAQE: A Contract Driven Approach to Processing Concurrent Decision Support Queries.
Venkatesh Raghavan, Elke A. Rundensteiner
pp. 121-132
Multi-Query Diversification in Microblogging Posts.
Shiwen Cheng, Anastasios Arvanitis, Marek Chrobak, Vagelis Hristidis
pp. 133-144
Provenance, Corroboration and Information Extraction
Query-Based Why-Not Provenance with NedExplain.
Nicole Bidoit, Melanie Herschel, Katerina Tzompanaki
pp. 145-156
Overcoming Semantic Drift in Information Extraction.
Zhixu Li, Hongsong Li, Haixun Wang, Yi Yang, Xiangliang Zhang, Xiaofang Zhou
pp. 169-180
Text and Sequence Mining
COLARM: Cost-based Optimization for Localized Association Rule Mining.
Abhishek Mukherji, Elke A. Rundensteiner, Matthew O. Ward
pp. 181-192
Fast Mining of Interesting Phrases from Subsets of Text Corpora.
Deepak P, Atreyee Dey, Debapriyo Majumdar
pp. 193-204
Sequence Pattern Matching over Time-Series Data with Temporal Uncertainty.
Yongluan Zhou, Chunyang Ma, Qingsong Guo, Lidan Shou, Gang Chen
pp. 205-216
EDBT Vision Track
Spatial Data Management Challenges in the Simulation Sciences.
Thomas Heinis, Farhan Tauheed, Anastasia Ailamaki
pp. 217-222
What Can Programming Languages Say About Data Exchange?
Michael Johnson, Jorge Pérez, James F. Terwilliger
pp. 223-228
Toward Hardware-Sensitive Database Operations.
David Broneske, Sebastian Breß, Max Heimel, Gunter Saake
pp. 229-234
Distributed Query Processing
Distributed Spatial Keyword Querying on Road Networks.
Siqiang Luo, Yifeng Luo, Shuigeng Zhou, Gao Cong, Jihong Guan
pp. 235-246
Continuous Quantile Query Processing in Wireless Sensor Networks.
Johannes Niedermayer, Mario A. Nascimento, Matthias Renz, Peer Kröger, Hans-Peter Kriegel
pp. 247-258
RIPPLE: A Scalable Framework for Distributed Processing of Rank Queries.
George Tsatsanifos, Dimitris Sacharidis, Timos Sellis
pp. 259-270
Indexing and Cost Statistics
HCS: Hierarchical Cut Selection for Efficiently Processing Queries on Data Columns using Hierarchical Bitmap Indices.
Parth Nagarkar, K. Selçuk Candan
pp. 271-282
Adaptive String Dictionary Compression in In-Memory Column-Store Database Systems.
Ingo Müller, Cornelius Ratsch, Franz Färber
pp. 283-294
Online Topic-aware Influence Maximization Queries.
Çigdem Aslay, Nicola Barbieri, Francesco Bonchi, Ricardo A. Baeza-Yates
pp. 295-306
Determining Essential Statistics for Cost Based Optimization of an ETL Workflow.
Ramanujam Halasipuram, Prasad M. Deshpande, Sriram Padmanabhan
pp. 307-318
Matrix Factorization, Clustering and Probabilistic Data
CLUDE: An Efficient Algorithm for LU Decomposition Over a Sequence of Evolving Graphs.
Chenghui Ren, Luyi Mo, Ben Kao, Reynold Cheng, David W. Cheung
pp. 319-330
Model Selection for Semi-Supervised Clustering.
Mojgan Pourrajabi, Davoud Moulavi, Ricardo J. G. B. Campello, Arthur Zimek, Jörg Sander, Randy Goebel
pp. 331-342
Spatial Partitioning of Large Urban Road Networks.
Tarique Anwar, Chengfei Liu, Hai L. Vu, Christopher Leckie
pp. 343-354
ENFrame: A Platform for Processing Probabilistic Data.
Sebastiaan J. van Schaik, Dan Olteanu, Robert Fink
pp. 355-366
Keyword Search and Diversity
Diversified Spatial Keyword Search On Road Networks.
Chengyuan Zhang, Ying Zhang, Wenjie Zhang, Xuemin Lin, Muhammad Aamir Cheema, Xiaoyang Wang
pp. 367-378
Cleaning trajectory data of RFID-monitored objects through conditioning under integrity constraints.
Bettina Fazzinga, Sergio Flesca, Filippo Furfaro, Francesco Parisi
pp. 379-390
Multi-Criteria Optimal Location Query with Overlapping Voronoi Diagrams.
Ji Zhang, Wei-Shinn Ku, Min-Te Sun, Xiao Qin, Hua Lu
pp. 391-402
Ranking
Efficient Concept-based Document Ranking.
Anastasios Arvanitis, Matthew T. Wiley, Vagelis Hristidis
pp. 403-414
Metric-Based Top-k Dominating Queries.
Eleftherios Tiakas, George Valkanas, Apostolos N. Papadopoulos, Yannis Manolopoulos
pp. 415-426
A Unified Framework for Efficiently Processing Ranking Related Queries.
Muhammad Aamir Cheema, Zhitao Shen, Xuemin Lin, Wenjie Zhang
pp. 427-438
Joins
Exploiting the query structure for efficient join ordering in SPARQL queries.
Andrey Gubichev, Thomas Neumann
pp. 439-450
Processing Interval Joins On Map-Reduce.
Bhupesh Chawda, Himanshu Gupta, Sumit Negi, Tanveer A. Faruquie, L. Venkata Subramaniam, Mukesh K. Mohania
pp. 463-474
Privacy-Aware Data Processing
Differentially Private Synthesization of Multi-Dimensional Data using Copula Functions.
Haoran Li, Li Xiong, Xiaoqian Jiang
pp. 475-486
Privacy-Preserving Query Execution using a Decentralized Architecture and Tamper Resistant Hardware.
Quoc-Cuong To, Benjamin Nguyen, Philippe Pucheral
pp. 487-498
Privacy Preserving Similarity Evaluation of Time Series Data.
Haohan Zhu, Xianrui Meng, George Kollios
pp. 499-510
Graph Queries and Analytics
Reachability Queries in Very Large Graphs: A Fast Refined Online Search Approach.
Renê Rodrigues Veloso, Loïc Cerf, Wagner Meira Junior, Mohammed J. Zaki
pp. 511-522
Fast Reliability Search in Uncertain Graphs.
Arijit Khan, Francesco Bonchi, Aristides Gionis, Francesco Gullo
pp. 535-546
Distance oracles in edge-labeled graphs.
Francesco Bonchi, Aristides Gionis, Francesco Gullo, Antti Ukkonen
pp. 547-558
Privacy in Networks
A Privacy-Preserving Framework for Personalized, Social Recommendations.
Zach Jorgensen, Ting Yu
pp. 571-582
L-opacity: Linkage-Aware Graph Anonymization.
Sadegh Nobari, Panagiotis Karras, Hweehwa Pang, Stéphane Bressan
pp. 583-594
Privacy Risk in Anonymized Heterogeneous Information Networks.
Aston Zhang, Xing Xie, Kevin Chen-Chuan Chang, Carl A. Gunter, Jiawei Han, XiaoFeng Wang
pp. 595-606
Awards
Test of time award
Demonstration Sessions
Demonstration Sessions
Mindmap-Inspired Semantic Personal Information Management.
Jenny Rompa, Christos Tryfonopoulos, Costas Vassilakis, George Lepouras
pp. 608-611
READ: Rapid data Exploration, Analysis and Discovery.
Udayan Khurana, Srinivasan Parthasarathy, Deepak S. Turaga
pp. 612-615
Demonstrating Self-Learning Algorithm Adaptivity in a Hardware-Oblivious Database Engine.
Max Heimel, Filip Haase, Martin Meinke, Sebastian Breß, Michael Saecker, Volker Markl
pp. 616-619
SECRETA: A System for Evaluating and Comparing RElational and Transaction Anonymization algorithms.
Giorgos Poulis, Aris Gkoulalas-Divanis, Grigorios Loukides, Spiros Skiadopoulos, Christos Tryfonopoulos
pp. 620-623
SIAS-V in Action: Snapshot Isolation Append Storage - Vectors on Flash.
Robert Gottstein, Thorsten Peter, Ilia Petrov, Alejandro P. Buchmann
pp. 624-627
inWalk: Interactive and Thematic Walks inside the Web of Data.
Silvana Castano, Alfio Ferrara, Stefano Montanelli
pp. 628-631
KIEV: a Tool for Extracting Semantic Relations from the World Wide Web.
Naimdjon Takhirov, Fabien Duchateau, Trond Aalberg, Ingeborg T. Sølvberg
pp. 632-635
AGGREGO SEARCH: Interactive Keyword Query Construction.
Grégory Smits, Olivier Pivert, Hélène Jaudoin, François Paulus
pp. 636-639
R2G: a Tool for Migrating Relations to Graphs.
Roberto De Virgilio, Antonio Maccioni, Riccardo Torlone
pp. 640-643
ALIAS: Author Disambiguation in Microsoft Academic Search Engine Dataset.
Michael Pitts, Swapna Savvana, Senjuti Basu Roy, Vani Mandava
pp. 648-651
gst-Store: An Engine for Large RDF Graph Integrating Spatiotemporal Information.
Dong Wang, Lei Zou, Dongyan Zhao
pp. 652-655
Learn2Learn: A Visual Educational System for Study Planning.
Jishang Wei, Georgia Koutrika, Shanchan Wu
pp. 656-659
Helping Teenagers Relieve Psychological Pressures: A Micro-blog Based System.
Qi Li, Yuanyuan Xue, Jia Jia, Ling Feng
pp. 660-663
WePIGE: The WebLab Provenance Information Generator and Explorer.
Clément Caron, Bernd Amann, Camélia Constantin, Patrick Giroux
pp. 664-667
Learning Event Patterns for Gesture Detection.
Felix Beier, Nedal Alaqraa, Yuting Lai, Kai-Uwe Sattler
pp. 668-671
Tutorials
Tutorial Sessions
Tutorial: Managing Personal Data with Strong Privacy Guarantees.
Nicolas Anciaux, Benjamin Nguyen, Iulian Sandu Popa
pp. 672-673
Industry and Applications
Industry and Applications
SMILE: A Data Sharing Platform for Mobile Apps in the Cloud.
Jagan Sankaranarayanan, Hakan Hacigümüs, Haopeng Zhang, Mohamed Sarwat
pp. 688-699
Diff-Index: Differentiated Index in Distributed Log-Structured Data Stores.
Wei Tan, Sandeep Tata, Yuzhe Tang, Liana L. Fong
pp. 700-711
Heterogeneous Stream Processing and Crowdsourcing for Urban Traffic Management.
Alexander Artikis, Matthias Weidlich, François Schnitzler, Ioannis Boutsis, Thomas Liebig, Nico Piatkowski, Christian Bockermann, Katharina Morik, Vana Kalogeraki, Jakub Marecek, Avigdor Gal, Shie Mannor, Dimitrios Gunopulos, Dermot Kinane
pp. 712-723
On Assigning Implicit Reputation Scores in an Online Labor Marketplace.
Maria Daltayanni, Luca de Alfaro, Panagiotis Papadimitriou, Panayiotis Tsaparas
pp. 724-725
Annotating the Behavior of Scientific Modules Using Data Examples: A Practical Approach.
Khalid Belhajjame
pp. 726-737
Benchmarking Bitemporal Database Systems: Ready for the Future or Stuck in the Past?
Martin Kaufmann, Peter M. Fischer, Norman May, Donald Kossmann
pp. 738-749
Business-Intelligence Queries with Order Dependencies in DB2.
Jaroslaw Szlichta, Parke Godfrey, Jarek Gryz, Wenbin Ma, Weinan Qiu, Calisto Zuzarte
pp. 750-761
A Tale of Two Graphs: Property Graphs as RDF in Oracle.
Souripriya Das, Jagannathan Srinivasan, Matthew Perry, Eugene Inseok Chong, Jayanta Banerjee
pp. 762-773