Series ISSN: 2367-2005
Number 1 (November 15, 2021)
Original publisher: OpenProceedings.org, ISBN: 978-3-89318-086-8, Electronic Edition
Research Track
Revisiting Runtime Dynamic Optimization for Join Queries in Big Data Management Systems
Christina Pavlopoulou, Michael Carey, Vassilis J. Tsotras
pp. 1–12
SAHARA: Memory Footprint Reduction of Cloud Databases with Automated Table Partitioning
Michael Brendle, Nick Weber, Mahammad Valiyev, Norman May, Robert Schulze, Alexander Böhm, Guido Moerkotte, Michael Grossniklaus
pp. 13–26
Bandwidth-optimal Relational Joins on FPGAs
Robert Lasch, Mehdi Moghaddamfar, Norman May, Suleyman Demirsoy, Christian Färber, Kai-Uwe Sattler
pp. 27–39
ArrayQL Integration into Code-Generating Database Systems
Maximilian E Schüle, Tobias Götz, Alfons Kemper, Thomas Neumann
pp. 40–51
RENUVER: A Missing Value Imputation Algorithm based on Relaxed Functional Dependencies
Bernardo Breve, Loredana Caruccio, Vincenzo Deufemia, Giuseppe Polese
pp. 52–64
Workload-Aware Materialization of Junction Trees
Martino Ciaperoni, Cigdem Aslay, Aristides Gionis, Michael Mathioudakis
pp. 65–77
Experiments & Analysis Track
Implementing Distributed Approximate Similarity Joins using Locality Sensitive Hashing
Martin Aumüller, Matteo Ceccarello
pp. 78–90
Number 2 (March 23, 2022)
Original publisher: OpenProceedings.org, ISBN: 978-3-89318-085-7, Electronic Edition
Research Track
Mining Change Rules
Daniel Lindner, Franziska Schumann, Nicolas Alder, Tobias Bleifuß, Leon Bornemann, Felix Naumann
pp. 91–103
3DPro: Querying Complex Three-Dimensional Data with Progressive Compression and Refinement
Dejun Teng, Furqan Baig, Vo Hoang, Yanhui Liang, Jun Kong, Fusheng Wang
pp. 104–117
TransER: Homogeneous Transfer Learning for Entity Resolution
Nishadi Kirielle, Peter Christen, Thilina Ranbaduge
pp. 118–130
Differentially-Private Publication of Origin-Destination Matrices with Intermediate Stops
Sina Shaham, Gabriel Ghinita, Cyrus Shahabi
pp. 131–142
SURAGH: Syntactic Pattern Matching to Identify Ill-Formed Records
Mazhar Hameed, Gerardo Vitagliano, Lan Jiang, Felix Naumann
pp. 143–154
SWIRL: Selection of Workload-aware Indexes using Reinforcement Learning
Jan Kossmann, Alexander Kastius, Rainer Schlosser
pp. 155–168
LMKG: Learned Models for Cardinality Estimation in Knowledge Graphs
Angjela Davitkova, Damjan Gjurovski, Sebastian Michel
pp. 169–182
An Indexable Time Series Dimensionality Reduction Method for Maximum Deviation Reduction and Similarity Search
Ruidong Xue, Weiren Yu, Hongxia Wang
pp. 183–195
GPU-FAST-PROCLUS: A Fast GPU-parallelized Approach to Projected Clustering
Jakob Rødsgaard Jørgensen, Katrine Scheel, Ira Assent, Ajeet Ram Pathak, Anne C. Elster
pp. 196–206
Aggregation Detection in CSV Files
Lan Jiang, Gerardo Vitagliano, Mazhar Hameed, Felix Naumann
pp. 207–219
A Supervised Skyline-Based Algorithm for Spatial Entity Linkage
Suela Isaj, Vassilis Kaffes, Torben Bach Pedersen, Giorgos Giannopoulos
pp. 220–233
Efficiently Managing Deep Learning Models in a Distributed Environment
Nils Strassenburg, Ilin Tolovski, Tilmann Rabl
pp. 234–246
Unsupervised Selectivity Estimation by Integrating Gaussian Mixture Models and an Autoregressive Model
Zizhong Meng, Peizhi Wu, Gao Cong, Rong Zhu, Shuai Ma
pp. 247–259
Gamma Probabilistic Databases: Learning from Exchangeable Query-Answers
Niccolò Meneghetti, Ouael Ben Amara
pp. 260–273
Automatic generation of comparison notebooks for interactive data exploration
Alexandre Chanson, Nicolas Labroche, Patrick Marcel, Stefano Rizzi, Vincent T'Kindt
pp. 274–284
Cardinality Estimation using Label Probability Propagation for Subgraph Matching in Property Graph Databases
Leonard Wörteler, Moritz Renftle, Theodoros Chondrogiannis, Michael Grossniklaus
pp. 285–297
RingBFT: Resilient Consensus over Sharded Ring Topology
Sajjad Rahnama, Suyash Gupta, Rohan Sogani, Dhruv Krishnan, Mohammad Sadoghi
pp. 298–311
Elastic Indexes: Dynamic Space vs. Query Efficiency Tuning for In-Memory Database Indexing
Moshik Hershcovitch, Artem Khyzha, Daniel Waddington, Adam Morrison
pp. 312–324
Backbone Index to Support Skyline Path Queries over Multi-cost Road Networks
Qixu Gong, Huiping Cao
pp. 325–337
Towards Optimal Resource Allocation for Big Data Analytics
Anish Pimpley, Shuo Li, Rathijit Sen, Soundararajan Srinivasan, Alekh Jindal
pp. 338–350
Efficient Dynamic Clustering: Capturing Patterns from Historical Cluster Evolution
Binbin Gu, Saeed Kargar, Faisal Nawab
pp. 351–363
Short Paper Track
Detecting Errors in Databases with Bidirectional Recurrent Neural Networks
Severin Holzer, Kurt Stockinger
pp. 364–367
Evaluating In-Memory Hash Joins on Persistent Memory
Tobias Maltenberger, Till Lehmann, Lawrence Benson, Tilmann Rabl
pp. 368–372
DataGossip: A Data Exchange Extension for Distributed Machine Learning Algorithms
Phillip Wenig, Thorsten Papenbrock
pp. 373–377
Masked Language Models as Stereotype Detectors?
Yacine Gaci, Boualem Benatallah, Fabio Casati, Khalid Benabdeslem
pp. 383–387
Implementing Linear Bandits in Off-the-Shelf SQLite
Radu Ciucanu, Marta Soare, Sihem Amer-Yahia
pp. 388–392
Fine-Tuning Dependencies with Parameters
Alireza Vezvaei, Lukasz Golab, Mehdi Kargar, Divesh Srivastava, Jaroslaw Szlichta, Morteza Zihayat
pp. 393–397
RoleSim+: A Fast Algorithm for RoleSim Similarity Search
Weiren Yu, Sima Iranmanesh, Xuming Hong, Jianxun Xu
pp. 398–402
Similarity-driven Schema Transformation for Test Data Generation
Fabian Panse, Meike Klettke, Johannes Schildgen, Wolfram Wingerath
pp. 408–413
Towards A General SIMD Concurrent Approach to Accelerating Integer Compression Algorithms
Juliana Hildebrandt, Dirk Habich, Wolfgang Lehner
pp. 414–418
Columnar Storage Optimization and Caching for Data Lakes
Guodong Jin, Haoqiong Bian, Yueguo Chen, Xiaoyong Du
pp. 419–423
Cache-Coherent Shared Locking for Transactionally Consistent Updates in Near-Data Processing DBMS on Smart Storage
Arthur Bernhardt, Sajjad Tamimi, Florian Stock, Tobias Vincon, Andreas Koch, Ilia Petrov
pp. 424–428
Bias-aware Fair Neural Ranking for Addressing Stereotypical Gender Biases
Shirin Seyedsalehi, Amin Bigdeli, Negar Arabzadeh, Bhaskar Mitra, Morteza Zihayat, Ebrahim Bagheri
pp. 435–439
A Neural Approach to Forming Coherent Teams in Collaboration Networks
Radin Hamidi Rad, Shirin Seyedsalehi, Mehdi Kargar, Morteza Zihayat, Ebrahim Bagheri
pp. 440–444
Diversified Top-r Community Search in Geo-Social Network: A K-Truss Based Model
Renjie Sun, Yanping Wu, Xiaoyang Wang
pp. 445–448
Hierarchical Clustering for Property Graph Schema Discovery
Angela Bonifati, Stefania Dumbrava, Nicolas Mir
pp. 449–453
Vision Papers
PReVer: Towards Private Regulated Verified Data
Mohammad Javad Amiri, Tristan Allard, Divy Agrawal, Amr El Abbadi
pp. 454–461
Experiments & Analysis Track
Bipartite Graph Matching Algorithms for Clean-Clean Entity Resolution: An Empirical Evaluation
George Papadakis, Vasilis Efthymiou, Emmanouil Thanos, Oktie Hassanzadeh
pp. 462–474
Evaluation of Algorithms for Interaction-Sparse Recommendations: Neural Networks don't Always Win
Yasamin Klingler, Claude Lehmann, João Pedro Monteiro, Carlo Saladin, Abraham Bernstein, Kurt Stockinger
pp. 475–486
Industry & Applications Track
Placement of Workloads from Advanced RDBMS Architectures into Complex Cloud Infrastructure
Antony S. Higginson, Clive Bostock, Norman W. Paton, Suzanne M. Embury
pp. 487–497
Unsupervised Graph-based Entity Resolution for Accurate and Efficient Family Pedigree Search
Nishadi Kirielle, Charini Nanayakkara, Peter Christen, Chris Dibben, Lee Williamson, Eilidh Garrett, Clair Manson
pp. 498–510
Integrating the Orca Optimizer into MySQL
Arunprasad P. Marathe, Shu Lin, Weidong Yu, Kareem El Gebaly, Per-Åke Larson, Calvin Sun
pp. 511–523
Model-Independent Design of Knowledge Graphs --- Lessons Learnt From Complex Financial Graphs
Luigi Bellomarini, Andrea Gentili, Eleonora Laurenza, Emanuel Sallinger
pp. 524–536
Voyager: Data Discovery and Integration for Onboarding in Data Science
Alex Bogatu, Norman W. Paton, Mark Douthwaite, André Freitas
pp. 537–548
Distributed Training of Knowledge Graph Embedding Models using Ray
Nasrullah Sheikh, Xiao Qin, Yaniv Gur, Berthold Reinwald
pp. 549–553
Demonstrations
JupySim: Jupyter Notebook Similarity Search System
Misato Horiuchi, Yuya Sasaki, Chuan Xiao, Makoto Onizuka
pp. 554–557
SheerMP: Optimized Streaming Analytics-as-a-Service over Multi-site and Multi-platform Settings
George Stamatakis, Antonis Kontaxakis, Alkis Simitsis, Nikos Giatrakos, Antonios Deligiannakis
pp. 558–561
AVID: GPU-enabled Visual Analytics with GPU-FAST-PROCLUS
Jakob Rødsgaard Jørgensen, Ira Assent, Hans-Jörg Schulz
pp. 562–565
MM-infer: A Tool for Inference of Multi-Model Schemas
Pavel Koupil, Sebastián Hricko, Irena Holubová
pp. 566–569
An Extensive and Secure Personal Data Management System Using SGX
Robin Carpentier, Floris Thiant, Iulian Sandu Popa, Nicolas Anciaux, Luc Bouganim
pp. 570–573
MAGE: Discovering Mixture-based Areas of Interest over Geolocated Entities
Kostas Patroumpas, Dimitrios Skoutas, Dimitris Sacharidis
pp. 574–577
DP-Shield: Face Obfuscation with Differential Privacy
Muhammad Usama Saleem, Dominick Reilly, Liyue Fan
pp. 578–581