Series ISSN: 2367-2005
Number 1 (July 8, 2024)
Original publisher: OpenProceedings.org, ISBN: 978-3-98318-097-4, Electronic Edition
Research Track
GraLMatch: Matching Groups of Entities with Graphs and Language Models
Fernando De Meer Pardo, Claude Lehmann, Dennis Gehrig, Andrea Nagy, Stefano Nicoli, Branka Hadji Misheva, Martin Braschler, Kurt Stockinger
pp. 1–12
Fast Geosocial Reachability Queries
Panagiotis Bouros, Theodoros Chondrogiannis, Daniel Kowalski
pp. 25–38
Efficient Enumeration of Large Maximal k-Plexes
Qihao Cheng, Da Yan, Tianhao Wu, Lyuheng Yuan, Ji Cheng, Zhongyi Huang, Yang Zhou
pp. 53–65
Ensembling Object Detectors for Effective Video Query Processing
Daren Chao, Nick Koudas, Xiaohui Yu, Yueting Chen
pp. 66–79
OmniMatch: Overcoming the Cold-Start Problem in Cross-Domain Recommendations using Auxiliary Reviews
Yingjun Dai, Ahmed El-Roby, Elmira Adeeb, Vivek Thaker
pp. 80–91
Tabular Embeddings for Tables with Bi-Dimensional Hierarchical Metadata and Nesting
Gyanendra Shrestha, Chutian Jiang, Sai Akula, Vivek Yannam, Anna Pyayt, Michael Gubanov
pp. 92–105
Progressive Querying on Knowledge Graphs
Angela Bonifati, Stefania Dumbrava, Haridimos Kondylakis, Georgia Troullinou, Giannis Vassilliou
pp. 106–118
QueryER: A Framework for Fast Analysis-Aware Deduplication over Dirty Data
Giorgos Alexiou, George Papastefanatos, Vassilis Stamatopoulos, Georgia Koutrika, Nectarios Koziris
pp. 119–131
Private Approximate Query over Horizontal Data Federation
Ala Eddine Laouir, Abdessamad Imine
pp. 132–144
SPO-Join: Efficient Stream Inequality Join
Adeel Aslam, Kaustubh Beedkar, Giovanni Simonini
pp. 145–157
Experiments & Analyses Track
Evaluating the Data Model Robustness of Text-to-SQL Systems Based on Real User Queries
Jonathan Fürst, Catherine Kosten, Farhad Nooralahzadeh, Yi Zhang, Kurt Stockinger
pp. 158–170
An Experimental Comparison of Partitioning Strategies for Distributed Graph Neural Network Training
Nikolai Merkel, Daniel Stoll, Ruben Mayer, Hans-Arno Jacobsen
pp. 171–184
Evaluating the Feasibility of Sampling-Based Techniques for Training Multilayer Perceptrons
Sana Ebrahimi, Rishi Advani, Abolfazl Asudeh
pp. 185–198
Analysis of Text-to-SQL Benchmarks: Limitations, Challenges and Opportunities
Anna Mitsopoulou, Georgia Koutrika
pp. 199–212
Number 2 (November 11, 2024)
Original publisher: OpenProceedings.org, ISBN: 978-3-89318-098-1, Electronic Edition
Research Track
Differentially Private Publication of Smart Electricity Grid Data
Sina Shaham, Gabriel Ghinita, Bhaskar Krishnamachari, Cyrus Shahabi
pp. 213–225
DataSculpt: Cost-Efficient Label Function Design via Prompting Large Language Models
Naiqing Guan, Kaiwen Chen, Nick Koudas
pp. 226–232
RASP: Robust Mining of Frequent Temporal Sequential Patterns under Temporal Variations
Hyunjin Choo, Minho Eom, Gyuri Kim, Young-Gyu Yoon, Kijung Shin
pp. 233–245
Modifying an existing sort order with offset-value codes
Goetz Graefe, Marius Kuhrt, Bernhard Seeger
pp. 246–254
MEMPHIS: Holistic Lineage-based Reuse and Memory Management for Multi-backend ML Systems
Arnab Phani, Matthias Boehm
pp. 255–269
LINX: A Language Driven Generative System for Goal-Oriented Automated Data Exploration
Tavor Lipman, Tova Milo, Amit Somech, Tomer Wolfson, Oz Zafar
pp. 270–283
Synopses for Summarizing Spatial Data Streams
Jacco JE Kiezebrink, Wieger R. Punter, Odysseas Papapetrou, Kevin Verbeek
pp. 284–296
PRISMA: A Privacy-Preserving Schema Matcher using Functional Dependencies
Jan-Eric Hellenberg, Fabian Dustin Mahling, Lukas Laskowski, Felix Naumann, Matteo Paganelli, Fabian Panse
pp. 297–309
Taste: Towards Practical Deep Learning-based Approaches for Semantic Type Detection in the Cloud
Tao Li, Feng Liang, Jinqi Quan, Huang Chuang, Teng Wang, Runhuai Huang, Jie Wu, Xiping Hu
pp. 324–336
Evaluation of Dataframe Libraries for Data Preparation on a Single Machine
Angelo Mozzillo, Luca Zecchini, Luca Gagliardelli, Adeel Aslam, Sonia Bergamaschi, Giovanni Simonini
pp. 337–349
MaTElDa: Multi-Table Error Detection
Fatemeh Ahmadi, Marc Speckmann, Malte F. Kuhlmann, Ziawasch Abedjan
pp. 364–376
Metadata Unification in Open Data with Gnomon
Christina Christodoulakis, Moshe Gabel, Angela Demke Brown
pp. 377–383
Pythia: A Neural Model for Data Prefetching
Akshay A Bapat, Saravanan Thirumuruganathan, Nick Koudas
pp. 384–396
Fantastic Tables and Where to Find Them: Table Search in Semantic Data Lakes
Martin P Christensen, Aristotelis Leventidis, Matteo Lissandrini, Laura Di Rocco, Renée J. Miller, Katja Hose
pp. 397–410
Communication-Efficient Distributed Deep Learning via Federated Dynamic Averaging
Michail Theologitis, Georgios Frangias, Georgios Anestis, Vasilis Samoladas, Antonios Deligiannakis
pp. 411–424
Benchmarking, Analyzing, and Optimizing WA of Partial Compaction in RocksDB
Ran Wei, Zichen Zhu, Andrew J Kryczka, Jay Zhuang, Manos Athanassoulis
pp. 425–437
No Time to Halt: In-Situ Analysis for Large-Scale Data Processing via Virtual Snapshotting
Reza Salkhordeh, Felix M Schuhknecht, Hossein Asadi, Steffen Eiden, André Brinkmann
pp. 438–450
QuIT your B+-tree for the Quick Insertion Tree
Aneesh Raman, Konstantinos Karatsenidis, Shaolin Xie, Matthaios Olma, Subhadeep Sarkar, Manos Athanassoulis
pp. 451–463
Parallel Spatial Join Processing with Adaptive Replication
Nikolaos Koutroumanis, Christos Doulkeridis, Akrivi Vlachou
pp. 464–476
Stable Tree Labelling for Accelerating Distance Queries on Dynamic Road Networks
Henning Koehler, Muhammad Farhan, Qing Wang
pp. 477–489
PEG: Local Differential Privacy for Edge-Labeled Graphs
André Mendonça, Felipe Brito, Javam C Machado
pp. 490–502
Template-based Explainable Inference over High-Stakes Financial Knowledge Graphs
Andrea Colombo, Teodoro Baldazzi, Luigi Bellomarini, Emanuel Sallinger, Stefano Ceri
pp. 503–515
Experiments & Analyses Track
Benchmarking Analytical Query Processing in Intel SGXv2
Adrian Lutsch, Muhammad El-Hindi, Matthias Heinrich, Daniel Ritter, Zsolt István, Carsten Binnig
pp. 516–528
Entity Matching using Large Language Models
Ralph Peeters, Aaron Steiner, Christian Bizer
pp. 529–541