Series ISSN: 2367-2005
Number 1 (June 26, 2025)
Original publisher: OpenProceedings.org, ISBN: 978-3-98318-102-5, Electronic Edition
Research Track
ALOG: Adaptive Longitudinal Grids for Geospatial Data using Local Differential Privacy
Eduardo R. Duarte Neto, José S. Costa Filho, Antonio A. Marreiras Neto, Javam C. Machado
pp. 1–14
CAMEO: Autocorrelation-Preserving Line Simplification for Lossy Time Series Compression
Carlos Enrique Muñiz-Cuza, Matthias Boehm, Torben Bach Pedersen
pp. 15–28
Collaborative Scoping: Self-Supervised Linkability Assessment for Schema Matching
Leonard Traeger, Andreas Behrend, George Karabatis
pp. 29–41
Diverse Unionable Tuple Search: Novelty-Driven Discovery in Data Lakes
Aamod Khatiwada, Roee Shraga, Renée J. Miller
pp. 42–55
In-memory Incremental Maintenance of Provenance Sketches
Pengyuan Li, Boris Glavic, Dieter Gawlick, Vasudha Krishnaswamy, Zhen Hua Liu, Danica Porobic, Xing Niu
pp. 56–68
LearnedWMP: Workload Memory Prediction Using Distribution of Query Templates
Shaikh Quader, Andres Jaramillo, Sumona Mukhopadhyay, Ghadeer Abuoda, Calisto Zuzarte, David Kalmuk, Marin Litoiu, Manos Papagelis
pp. 69–81
Thunderbolt: Concurrent Smart Contract Execution with Non-blocking Reconfiguration for Sharded DAGs
Junchao Chen, Alberto Sonnino, Lefteris Kokoris-Kogias, Mohammad Sadoghi
pp. 82–95
Query Performance Explanation through Large Language Model for HTAP Systems
Haibo Xiu, Li Zhang, Tieying Zhang, Jun Yang, Jianjun Chen
pp. 103–109
Experiments & Analyses Track
In-depth Analysis of LLM-based Schema Linking
George Katsogiannis-Meimarakis, Katsiaryna Mirylenka, Paolo Scotton, Francesco Fusco, Abdel Labbi
pp. 117–130
Number 2 (November 7, 2025)
Original publisher: OpenProceedings.org, ISBN: 978-3-98318-103-2, Electronic Edition
Research Track
AdCache: Adaptive Cache Management with Admission Control for LSM-tree Key-Value Stores
Jiarui Ye, Junfeng Liu, Siqiang Luo
pp. 131–143
DendroTime: Progressive Hierarchical Clustering for Variable-Length Time Series
Sebastian Schmidl, Ferdinand Rewicki, Felix Naumann, Thorsten Papenbrock
pp. 144–156
Efficient Dataframe Systems: Lazy Fat Pandas on a Diet
Bhushan Pal Singh, Priyesh Kumar, Chiranmoy Bhattacharya, S. Sudarshan
pp. 157–169
Efficient Locality-based Indexing for Cohesive Subgraphs Discovery in Hypergraphs
Song Kim, Dahee Kim, Taejoon Han, Junghoon Kim, Hyun Ji Jeong, Jungeun Kim
pp. 170–182
Evaluating Learned Indexes in LSM-tree Systems: Benchmarks, Insights and Design Choices
Junfeng Liu, Jiarui Ye, Mengshi Chen, Meng Li, Siqiang Luo
pp. 183–195
Fair Center Clustering in Sliding Windows
Matteo Ceccarello, Andrea Pietracaprina, Geppino Pucci, Francesco Visonà
pp. 196–207
Fast Landmark Reconfiguration for Highway Cover Indexes
David Coudert, Andrea D'Ascenzo, Mattia D'Emidio, Giuseppe F. Italiano
pp. 208–220
In-Database Text Classification with BornSQL
Emanuele Guidotti, Darya Shlyk, Stefano Montanelli, Alfio Ferrara
pp. 221–232
LITHE: A Query Rewrite Advisor using LLMs
Sriram Dharwada, Himanshu Devrani, Jayant R Haritsa, Harish Doraiswamy
pp. 233–246
Optimizing UDF Queries in SQL Data Engines
Konstantinos Chasialis, Yannis Foufoulas, Alkis Simitsis, Yannis Ioannidis
pp. 247–260
ReFRED: Reliable Function REtrieval for Data transformation
Yueting Chen, Nick Koudas, Xiaohui Yu
pp. 261–273
Scalable Convergence Queries on Time Series Compounds
Noura Alghamdi, Khalid Alnuaim, Xiaoshuai Li, Elke A. Rundensteiner, Mohamed Y. Eltabakh
pp. 274–292
Number 3 (March 16, 2026)
Original publisher: OpenProceedings.org, ISBN: 978-3-98318-104-9, Electronic Edition
Research Track
Accelerating Graph Construction for MIPS without Search Accuracy Loss
Yasuhiro Fujiwara, Ángel López García-Arias, Yu Mitsuzumi, Yasutoshi Ida, Atsutoshi Kumagai, Masahiro Nakano, Makoto Nakatsuji, Akisato Kimura
pp. 293–304
Accelerating K-Core Computation in Temporal Graphs
Zhuo Ma, Dong Wen, Hanchen Wang, Wentao Li, Wenjie Zhang, Lu Qin
pp. 305–316
C-SHIFT: Efficient Cluster-based Model Fairness Control under Data Drift
Yijie Li, Huiping Chen, Paolo Missier
pp. 317–331
CAMEL Hash Table: Striking a Balance Between CPU and Memory Efficiency in Main-Memory Hash Join
Sudip Chatterjee, Xiaozheng Zhang, Suprio Ray, Ian Finlay, Calisto Zuzarte, Mark Stoodley
pp. 332–344
CMiner: An Algorithm to Discover Frequent Structures in Conceptual Models
Simone Avellino, Emanuele Valore, Giovanni Micale, Antonio Di Maria, Mattia Fumagalli, Tiago Prince Sales, Alfredo Pulvirenti, Diego Calvanese
pp. 345–358
DCSR: A Fast Data Structure with Leaf-Oriented Locks for Streaming Graph Processing
Yue Shen, Jie Zhang, Huawei Cao, Yuan Zhang, Xuejun An
pp. 359–371
Efficient Crawling for Scalable Web Data Acquisition
Antoine Gauquier, Ioana Manolescu, Pierre Senellart
pp. 372–385
Efficient Model Repository for Entity Resolution: Construction, Search, and Integration
Victor Christen, Peter Christen
pp. 386–398
Khatri-Rao Clustering for Data Summarization
Martino Ciaperoni, Collin Leiber, Aristides Gionis, Heikki Mannila
pp. 399–412
Meta-Property Graphs in Practice: Implementation and Evaluation
Stijn Nooijen, Sepehr Sadoughi, Nikolay Yakovets
pp. 413–419
Mobility Trajectory Data Stream Processing Beyond the Cloud
Mariana M. Garcez Duarte, Dwi P. A. Nugroho, Georges Tod, Evert Bevernage, Pieter Moelans, Elias Saerens, Esteban Zimányi, Mahmoud Sakr, Steffen Zeuch
pp. 420–432
Nova: Scalable Streaming Join Placement and Parallelization in Resource-Constrained Geo-Distributed Environments
Xenofon Chatziliadis, Eleni Tzirita Zacharatou, Samira Akili, Alphan Eracar, Volker Markl
pp. 433–446
PG-HIVE: Hybrid Incremental Schema Discovery for Property Graphs
Sophia Sideri, Georgia Troullinou, Elisjana Ymeralli, Vasilis Efthymiou, Dimitris Plexousakis, Haridimos Kondylakis
pp. 447–461
RecDB: An LSM-Tree based Storage System for Training Large Recommendation Model in Low-Resource Scenarios
Ming Gao, Qingyin Lin, Zhitao Chen, Yunling Chen, Zhiguang Chen
pp. 462–474
RTCM: A Distributed Snapshot-Based Framework for Real-Time Co-Movement Mining
Chenxu Wang, Jiaxing Wei, Tianyi Li, Hongzhen Xiang, Junzhou Zhao, Pinghui Wang, Tao Qin, Yushuai Li, Christian S. Jensen
pp. 475–487
SPIND: Scalable Partial Inclusion Dependency Discovery
Jakob Leander Müller, Marcian Seeger, Thorsten Papenbrock
pp. 503–515
Unleashing Data Dependency-based Query Optimization
Daniel Lindner, Daniel Ritter, Felix Naumann
pp. 516–529
Adaptive Query-Aware Hybrid Search in Vector Databases
Adeel Aslam, Rizwan Khan, Giovanni Simonini, George Konstantinidis
pp. 530–536
Efficient Vector-Based Louvain Algorithm for Massive Low-Rank Graphs
Tengkai Yu, Venkatesh Srinivasan, Alex Thomo
pp. 537–543
Reliable End-to-End Text-to-SQL Generation
Kaiwen Chen, Yueting Chen, Nick Koudas, Xiaohui Yu
pp. 544–550
Experiments & Analyses Track
Benchmarking Large Language Models for Knowledge Graph Validation
Farzad Shami, Stefano Marchesin, Gianmaria Silvello
pp. 551–565
Distance Comparison Operation Optimization in ANNS: A Survey and Experimental Evaluation
Bohai Wang, Yanhao Wang, Huiqi Hu, Weichen Zhao, Minghao Zhao
pp. 578–591
Understanding the Performance of Native Execution in Big Data Engines: The Good, the Bad, and How to Fix It
Haikai Zhao, Zhenman Fang
pp. 592–604
What Drives Learned Optimizer Performance? A Systematic Evaluation
Kostas Mparmparousis, Christos Tsapelas, Georgia Koutrika
pp. 605–618
Vision Track
Towards Multimodal Stream Processing Systems
Uélison Jean Lopes dos Santos, Alessandro Ferri, Szilard Nistor, Riccardo Tommasini, Carsten Binnig, Manisha Luthra
pp. 627–633
Industrial & Applications Track
A Declarative, Recursive SQL Framework for Composable Machine Learning Ensembles
Jason Arnold, Knut Stolze, Dan Zollers, Poojan Khanpara
pp. 634–640
Automatic Detail Extraction from Sustainability Objectives Using Weak Supervision
Mohammad Mahdavi, Tom Debus
pp. 641–648
Automating Efficient Data Collection through the Synergy of Agentic AI and Active Learning
Yael Einy, Guy Dar, Slava Novgorodov, Tova Milo, Nave Frost
pp. 649–658
Data-Driven Trajectory Imputation for Vessel Mobility Analysis
Giannis Spiliopoulos, Alexandros Troupiotis-Kapeliaris, Kostas Patroumpas, Nikolaos Liapis, Dimitrios Skoutas, Dimitris Zissis, Nikos Bikakis
pp. 659–667
Dynamic Prefiltering for Massive-Scale Equi- and Geospatial Joins
Jason Arnold, Knut Stolze, Ellis Saupe
pp. 668–675
Efficient Retrieval Scaling with Hierarchical Indexing for Large Scale Recommendation
Dongqi Fu, Kaushik Rangadurai, Haiyu Lu, Yunchen Pu, Siyang Yuan, Minhui Huang, Yiqun Liu, Golnaz Ghasemiesfeh, Xingfeng He, Fangzhou Xu, Andrew Cui, Vidhoon Viswanathan, Lin Yang, Liang Wang, Jiyan Yang, Chonglin Sun
pp. 676–686
Exploring Dynamic Memory Allocation of CXL Memory Pools in Enterprise In-Memory Database Management Systems
Donghun Lee, Minseon Ahn, Jungmin Kim, Jaemin Jung, Norman May, Daniel Ritter, Jongmin Gim, Heekwon Park, Changho Choi, Yang Seok Ki
pp. 687–695
SchemaTune: A Hybrid Framework Combining Transformer Fine-Tuning and Large Language Models for Schema Matching
Sayandeep Mitra, Manasi Patwardhan, Raveendra Kumar Medicherla
pp. 696–703
Demonstration Track
AnonymEx: An Interactive Platform for Exploring and Evaluating Anonymization Techniques through Re-identification Attacks
Andrea Fieschi, Christoph Stach, Pascal Hirmer
pp. 704–707
Click, Share, Learn: Teaching Data Science Using Apache Texera
Sarah Asad, Xiaozhen Liu, Kun Woo Park, Xinyuan Lin, Jiadong Bai, Shengquan Ni, Yicong Huang, Chen Li
pp. 708–711
CoDeC: Constraints-Guided Diverse Counterfactuals
Avia Asael, Nave Frost, Amir Gilad, Daniel Deutch
pp. 712–715
DEDALUS: A Quantum-Enhanced End-to-End Framework for Cost-Aware Join Order Optimization with Search Space Pruning
Emmanouil Limnaios, Markos Stergiopoulos, George T. Stamatiou, Efthymios Papageorgiou, Vasilis Efthymiou, Dimitrios Loupas, Dimitrios Tsourounis, Kostas Blekos, Aggelos Tsikas, Dimitris Plexousakis, Kostas Magoutis, Yannis Tzitzikas, Haridimos Kondylakis
pp. 716–719
Demonstrating PIPE-X: Supporting Iterative Pipeline Development Through Explanations
Nadja Geisler, Benjamin Hättasch, Carsten Binnig
pp. 720–723
HistoricRAG: Evidence-Centered Newspaper Retrieval for Misinformation-Resilient Question Answering
Stergios Konstantinidis, Min-Yen Kan, Michalis Vlachos
pp. 724–727
MetricLib: A Modular and Extensible Toolkit for Evaluation of Medical ML Datasets
Martin Seyferth, Katinka Becker, Tobias Schaeffter, Daniel Schwabe, Matthias Boehm
pp. 728–731
MIMOSA: A Tool for Fairness Exploration Through Explanations
Vasiliki Papanikou, Danae Pla Karidi, Evaggelia Pitoura, Emmanouil Panagiotou, Eirini Ntoutsi
pp. 732–735
MM-mapsearch: Workload-Aware Mapping Selection
Pavel Koupil, Bedřich Mazourek, Jáchym Bártík, Irena Holubová
pp. 736–739
NavLLM: Interactive LLM-Assisted Navigation over Multidimensional Data Cubes
Xiufeng Liu, Ruyu Liu, Yanyan Yang
pp. 740–743
PG-HIVE: Schema Discovery for Property Graphs
Sophia Sideri, Ioannis Chiras, Myron Giakoumakis, Georgia Troullinou, Elisjana Ymeralli, Vasillis Efthymiou, Dimitris Plexousakis, Haridimos Kondylakis
pp. 744–747
PolyBooks: A System for Interactive Multi-Model Querying with Provenance and Reuse
David Lengweiler, Tobias Weber, Heiko Schuldt, Marco Vogt
pp. 748–751
PolyPipe: Merging Data Pipelines and Multi-Model Databases
David Lengweiler, Tobias Weber, Heiko Schuldt, Marco Vogt
pp. 752–755
Practical Spreadsheet Parsing with SheetReader
Haralampos Gavriilidis, Felix Henze, Joel Ziegler, Jonas Benn, Eleni Tzirita Zacharatou, Volker Markl
pp. 756–759
TerseTS: A Framework for Time Series Compression
Carlos Enrique Muñiz-Cuza, Søren Kejser Jensen, Tom Louis Klein, Sabina Bakhtiiarova, Matthias Boehm, Torben Bach Pedersen
pp. 760–763
TwinDB: Interactive What-If Analysis for Digital Twins
Xiufeng Liu, Ruyu Liu, Per Sieverts Nielsen, Hua Lu
pp. 764–767
Where Graphs Meet Fuzzy Logic — A DBMS-Centered Engine for Polyphonic Music Matching in Score Databases
Adel Aly, Olivier Pivert, Virginie Thion
pp. 768–771
Tutorial Track
Disaggregated Data System Architecture — State-of-the-Art and Open Challenges
Alexander Krause, Johannes Pietrzyk, Alexander Boehm
pp. 772–775
Private LLM Inference with Homomorphic Encryption
Lawrence Lim, Divyakant Agrawal, Amr El Abbadi
pp. 776–780
Privacy-preserving Record Linkage: Past, Present and Yet-to-Come
Lefteris Stetsikas, Dimitrios Karapiperis, George Papadakis, Manolis Koubarakis
pp. 781–784
Streams Meet Semantics: Foundations and Systems of RDF Stream Processing
Haridimos Kondylakis, Pieter Bonte, Olivier Curé, Riccardo Tommasini
pp. 785–788
The Quest for Faster ANN Vector Search
Manos Chatzakis, Francesca Del Gaudio, Sophia Sideri, Themis Palpanas
pp. 789–792