Original publisher: ACM New York, ICPS - International Conference Proceedings Series, Volume: 360, ISBN: 978-1-60558-422-5, Electronic Edition
Front and Back Matter
Invited papers
Data integration flows for business intelligence.
Umeshwar Dayal, Malú Castellanos, Alkis Simitsis, Kevin Wilkinson
pp. 1-11
Research Sessions
Research sessions:
System architectures
Shore-MT: a scalable storage manager for the multicore era.
Ryan Johnson, Ippokratis Pandis, Nikos Hardavellas, Anastasia Ailamaki, Babak Falsafi
pp. 24-35
Workload-aware data partitioning in community-driven data grids.
Tobias Scholl, Bernhard Bauer, Jessica Müller, Benjamin Gufler, Angelika Reiser, Alfons Kemper
pp. 36-47
Research sessions:
Spatio-temporal
Sequenced spatio-temporal aggregation in road networks.
Igor Timko, Michael H. Böhlen, Johann Gamper
pp. 48-59
Processing probabilistic spatio-temporal range queries over moving objects with uncertainty.
Bruce S. E. Chung, Wang-Chien Lee, Arbee L. P. Chen
pp. 60-71
Anonymizing moving objects: how to hide a MOB in a crowd?
Roman Yarovoy, Francesco Bonchi, Laks V. S. Lakshmanan, Wendy Hui Wang
pp. 72-83
Research sessions:
Database summarization
Type-based categorization of relational attributes.
Babak Ahmadi, Marios Hadjieleftheriou, Thomas Seidl, Divesh Srivastava, Suresh Venkatasubramanian
pp. 84-95
AlphaSum: size-constrained table summarization using value lattices.
K. Selçuk Candan, Huiping Cao, Yan Qi, Maria Luisa Sapino
pp. 96-107
Answering aggregate keyword queries on relational databases using minimal group-bys.
Bin Zhou, Jian Pei
pp. 108-119
Research sessions:
Query processing
Rule-based multi-query optimization.
Mingsheng Hong, Mirek Riedewald, Christoph Koch, Johannes Gehrke, Alan J. Demers
pp. 120-131
Managing long-running queries.
Stefan Krompass, Harumi A. Kuno, Janet L. Wiener, Kevin Wilkinson, Umeshwar Dayal, Alfons Kemper
pp. 132-143
Continuous visible nearest neighbor queries.
Yunjun Gao, Baihua Zheng, Wang-Chien Lee, Gencai Chen
pp. 144-155
Research sessions:
XML, XPath, XQuery
On rewriting XPath queries using views.
Foto N. Afrati, Rada Chirkova, Manolis Gergatsoulis, Benny Kimelfeld, Vassia Pavlaki, Yehoshua Sagiv
pp. 168-179
Parallelization of XPath queries using multi-core processors: challenges and experiences.
Rajesh Bordawekar, Lipyeow Lim, Oded Shmueli
pp. 180-191
Research sessions:
Graph techniques
GADDI: distance index based subgraph matching in biological networks.
Shijie Zhang, Shirong Li, Jiong Yang
pp. 192-203
A novel approach for efficient supergraph query processing on graph databases.
Shuo Zhang, Jianzhong Li, Hong Gao, Zhaonian Zou
pp. 204-215
Flexible query answering on graph-modeled data.
Federica Mandreoli, Riccardo Martoglia, Giorgio Villani, Wilma Penzo
pp. 216-227
Research sessions:
Privacy & security
Privacy-preserving data mashup.
Noman Mohammed, Benjamin C. M. Fung, Ke Wang, Patrick C. K. Hung
pp. 228-239
On the comparison of microdata disclosure control algorithms.
Rinku Dewri, Indrajit Ray, Indrakshi Ray, Darrell Whitley
pp. 240-251
Detecting privacy violations in database publishing using disjoint queries.
Millist W. Vincent, Mukesh K. Mohania, Mizuho Iwaihara
pp. 252-262
Research sessions:
Data models
On keys, foreign keys and nullable attributes in relational mapping systems.
Luca Cabibbo
pp. 263-274
A runtime approach to model-independent schema and data translation.
Paolo Atzeni, Luigi Bellomarini, Francesca Bugiotti, Giorgio Gianforme
pp. 275-286
A methodology for preference-based personalization of contextual data.
Antonio Miele, Elisa Quintarelli, Letizia Tanca
pp. 287-298
Research sessions:
Stream processing
Scalable stream join processing with expensive predicates: workload distribution and adaptation by time-slicing.
Song Wang, Elke A. Rundensteiner
pp. 299-310
Indexing density models for incremental learning and anytime classification on data streams.
Thomas Seidl, Ira Assent, Philipp Kranen, Ralph Krieger, Jennifer Herrmann
pp. 311-322
Exploiting the power of relational databases for efficient stream processing.
Erietta Liarou, Romulo Goncalves, Stratos Idreos
pp. 323-334
Research sessions:
XML, XPath, XQuery
A sampling approach for XML query selectivity estimation.
Cheng Luo, Zhewei Jiang, Wen-Chi Hou, Feng Yu, Qiang Zhu
pp. 335-344
Recursion in XQuery: put your distributivity safety belt on.
Loredana Afanasiev, Torsten Grust, Maarten Marx, Jan Rittinger, Jens Teubner
pp. 345-356
Research sessions:
Database summarization
It takes variety to make a world: diversification in recommender systems.
Cong Yu, Laks V. S. Lakshmanan, Sihem Amer-Yahia
pp. 368-378
Supporting annotations on relations.
Mohamed Y. Eltabakh, Walid G. Aref, Ahmed K. Elmagarmid, Mourad Ouzzani, Yasin N. Silva
pp. 379-390
Data clouds: summarizing keyword search results over structured data.
Georgia Koutrika, Zahra Mohammadi Zadeh, Hector Garcia-Molina
pp. 391-402
Research sessions:
Query processing
Sample synopses for approximate answering of group-by queries.
Philipp Rösch, Wolfgang Lehner
pp. 403-414
A query processor for prediction-based monitoring of data streams.
Sergio Ilarri, Ouri Wolfson, Eduardo Mena, Arantza Illarramendi, A. Prasad Sistla
pp. 415-426
Flower-CDN: a hybrid P2P overlay for efficient query processing in CDN.
Manal El Dick, Esther Pacitti, Bettina Kemme
pp. 427-438
Research sessions:
Top-k techniques
Zerber: top-k retrieval from a confidential index.
Sergej Zerr, Daniel Olmedilla, Wolfgang Nejdl, Wolf Siberski
pp. 439-449
Efficient top-k count queries over imprecise duplicates.
Sunita Sarawagi, Vinay S. Deshpande, Sourabh Kasliwal
pp. 450-461
The C-ND tree: a multidimensional index for hybrid continuous and non-ordered discrete data spaces.
Changqing Chen, Sakti Pramanik, Qiang Zhu, Alok Watve, Gang Qian
pp. 462-471
Research sessions:
Graph techniques
G-hash: towards fast kernel-based similarity search in large graph databases.
Xiaohong Wang, Aaron M. Smalter, Jun Huan, Gerald H. Lushington
pp. 472-480
Efficiently indexing shortest paths by exploiting symmetry in graphs.
Yanghua Xiao, Wentao Wu, Jian Pei, Wei Wang, Zhenying He
pp. 493-504
Research sessions:
Data mining
Estimating the number of frequent itemsets in a large database.
Ruoming Jin, Scott McCallen, Yuri Breitbart, David Fuhry, Dong Wang
pp. 505-516
FOGGER: an algorithm for graph generator discovery.
Zhiping Zeng, Jianyong Wang, Jun Zhang, Lizhu Zhou
pp. 517-528
Neighbor-based pattern detection for windows over streaming data.
Di Yang, Elke A. Rundensteiner, Matthew O. Ward
pp. 529-540
Efficient constraint evaluation in categorical sequential pattern mining for trajectory databases.
Leticia I. Gómez, Alejandro A. Vaisman
pp. 541-552
Research sessions:
Heterogeneous & distributed
Flexible and efficient querying and ranking on hyperlinked data sources.
Ramakrishna Varadarajan, Vagelis Hristidis, Louiqa Raschid, Maria-Esther Vidal, Luis Daniel Ibáñez, Héctor Rodríguez-Drumond
pp. 553-564
RankClus: integrating clustering with ranking for heterogeneous information network analysis.
Yizhou Sun, Jiawei Han, Peixiang Zhao, Zhijun Yin, Hong Cheng, Tianyi Wu
pp. 565-576
Research sessions:
System architectures
A sequential indexing scheme for flash-based embedded systems.
Shaoyi Yin, Philippe Pucheral, Xiaofeng Meng
pp. 588-599
Secondary bitmap indexes with vertical and horizontal partitioning.
Guadalupe Canahuate, Tan Apaydin, Ahmet Sacan, Hakan Ferhatosmanoglu
pp. 600-611
Automating the loading of business process data warehouses.
Malú Castellanos, Alkis Simitsis, Kevin Wilkinson, Umeshwar Dayal
pp. 612-623
Research sessions:
Privacy & security
Hiding distinguished ones into crowd: privacy-preserving publishing data with outliers.
Wendy Hui Wang, Ruilin Liu
pp. 624-635
An efficient online auditing approach to limit private data disclosure.
Haibing Lu, Yingjiu Li, Vijayalakshmi Atluri, Jaideep Vaidya
pp. 636-647
Continuous privacy preserving publishing of data streams.
Bin Zhou, Yi Han, Jian Pei, Bin Jiang, Yufei Tao, Yan Jia
pp. 648-659
Research sessions:
Uncertainty
Evaluating probability threshold k-nearest-neighbor queries over uncertain data.
Reynold Cheng, Lei Chen, Jinchuan Chen, Xike Xie
pp. 672-683
PROUD: a probabilistic approach to processing similarity queries over uncertain data streams.
Mi-Yen Yeh, Kun-Lung Wu, Philip S. Yu, Ming-Syan Chen
pp. 684-695
Research sessions:
Workflow techniques
Fair, effective, efficient and differentiated scheduling in an enterprise data warehouse.
Chetan Gupta, Abhay Mehta, Song Wang, Umeshwar Dayal
pp. 696-707
Efficient identification of starters and followers in social media.
Michael Mathioudakis, Nick Koudas
pp. 708-719
A data damage tracking quarantine and recovery (DTQR) scheme for mission-critical database systems.
Kun Bai, Peng Liu
pp. 720-731
Research sessions:
Multi-dimensional
Unrestricted wavelet synopses under maximum error bound.
Chaoyi Pang, Qing Zhang, David P. Hansen, Anthony J. Maeder
pp. 732-743
Distributed similarity search in high dimensions using locality sensitive hashing.
Parisa Haghani, Sebastian Michel, Karl Aberer
pp. 744-755
LCS-Hist: taming massive high-dimensional data cube compression.
Alfredo Cuzzocrea, Paolo Serafino
pp. 768-779
Research sessions:
Caching techniques
Caching content-based queries for robust and efficient image retrieval.
Fabrizio Falchi, Claudio Lucchese, Salvatore Orlando, Raffaele Perego, Fausto Rabitti
pp. 780-790
An approach to detecting relevant updates to cached data using XML and active databases.
Essam Mansour, Hagen Höpfner
pp. 791-802
Self-tuning query mesh for adaptive multi-route query processing.
Rimma V. Nehme, Elke A. Rundensteiner, Elisa Bertino
pp. 803-814
Research sessions:
Information retrieval
Retrieving meaningful relaxed tightest fragments for XML keyword search.
Lingbo Kong, Rémi Gilleron, Aurélien Lemay
pp. 815-826
Finding frequent co-occurring terms in relational keyword search.
Yufei Tao, Jeffrey Xu Yu
pp. 839-850
Research sessions:
Query processing
Time-completeness trade-offs in record linkage using adaptive query processing.
Roald Lengu, Paolo Missier, Alvaro A. A. Fernandes, Giovanna Guerrini, Marco Mesiti
pp. 851-861
Continuous probabilistic nearest-neighbor queries for uncertain trajectories.
Goce Trajcevski, Roberto Tamassia, Hui Ding, Peter Scheuermann, Isabel F. Cruz
pp. 874-885
Research sessions:
Top-k techniques
Reverse k-nearest neighbor search in dynamic and general metric databases.
Elke Achtert, Hans-Peter Kriegel, Peer Kröger, Matthias Renz, Andreas Züfle
pp. 886-897
Top-dominant web services under multi-criteria matching.
Dimitrios Skoutas, Dimitris Sacharidis, Alkis Simitsis, Verena Kantere, Timos K. Sellis
pp. 898-909
Ranking objects based on relationships and fixed associations.
Albert Angel, Surajit Chaudhuri, Gautam Das, Nick Koudas
pp. 910-921
Research sessions:
Potpourri
Towards integrated and efficient scientific sensor data processing: a database approach.
Ji Wu, Yongluan Zhou, Karl Aberer, Kian-Lee Tan
pp. 922-933
Flexible and scalable storage management for data-intensive stream processing.
Irina Botan, Gustavo Alonso, Peter M. Fischer, Donald Kossmann, Nesime Tatbul
pp. 934-945
A view selection algorithm with performance guarantee.
Nicolas Hanusse, Sofian Maabout, Radu Tofan
pp. 946-957
Research sessions:
Provenance
Efficient provenance storage over nested data collections.
Manish Kumar Anand, Shawn Bowers, Timothy M. McPhillips, Bertram Ludäscher
pp. 958-969
Schema-conscious filtering of XML documents.
Panu Silvasti, Seppo Sippu, Eljas Soisalon-Soininen
pp. 970-981
Research sessions:
Spatio-temporal
A data model for trip planning in multimodal transportation systems.
Joel Booth, A. Prasad Sistla, Ouri Wolfson, Isabel F. Cruz
pp. 994-1005
Parsimonious temporal aggregation.
Juozas Gordevicius, Johann Gamper, Michael H. Böhlen
pp. 1006-1017
Research sessions:
Skylines
Finding the influence set through skylines.
Xiaobing Wu, Yufei Tao, Raymond Chi-Wing Wong, Ling Ding, Jeffrey Xu Yu
pp. 1030-1041
Efficient skyline computation in metric space.
David Fuhry, Ruoming Jin, Donghui Zhang
pp. 1042-1051
Efficient skyline retrieval with arbitrary similarity measures.
Deepak Padmanabhan, Prasad M. Deshpande, Debapriyo Majumdar, Raghu Krishnapuram
pp. 1052-1063
Research sessions:
Transaction processing
Transactions on the multiversion B-tree.
Tuukka Haapasalo, Ibrahim Jaluta, Bernhard Seeger, Seppo Sippu, Eljas Soisalon-Soininen
pp. 1064-1075
Efficient maintenance techniques for views over active documents.
Serge Abiteboul, Pierre Bourhis, Bogdan Marinoiu
pp. 1076-1087
Towards materialized view selection for distributed databases.
Leonardo Weiss Ferreira Chaves, Erik Buchmann, Fabian Hueske, Klemens Böhm
pp. 1088-1099
Demonstration Sessions
Demonstrations:
Demo group 1
BaseX & DeepFS joint storage for filesystem and database.
Alexander Holupirek, Christian Grün, Marc H. Scholl
pp. 1108-1111
Xoom: a tool for zooming in and out of XML documents.
Maya Ramanath, Kondreddi Sarath Kumar
pp. 1112-1115
HIDE: heterogeneous information DE-identification.
James J. Gardner, Li Xiong, Kanwei Li, James J. Lu
pp. 1116-1119
MVT: a schema mapping validation tool.
Guillem Rull, Carles Farré, Ernest Teniente, Toni Urpí
pp. 1120-1123
A tool for mapping discovery over revealing schemas.
Verena Kantere, Dimos Bousounis, Timos K. Sellis
pp. 1124-1127
GCIP: exploiting the generation and optimization of integration processes.
Matthias Böhm, Uwe Wloka, Dirk Habich, Wolfgang Lehner
pp. 1128-1131
Demonstrations:
Demo group 2
CourseCloud: summarizing and refining keyword searches over structured data.
Georgia Koutrika, Zahra Mohammadi Zadeh, Hector Garcia-Molina
pp. 1132-1135
EventSummarizer: a tool for summarizing large event sequences.
Jerry Kiernan, Evimaria Terzi
pp. 1136-1139
High-performance information extraction with AliBaba.
Peter Palaga, Long Nguyen, Ulf Leser, Jörg Hakenberg
pp. 1140-1143
Exploiting similarity-aware grouping in decision support systems.
Yasin N. Silva, Muhammad U. Arshad, Walid G. Aref
pp. 1144-1147
MarcoPolo: a community system for sharing and integrating travel information on maps.
Yueguo Chen, Su Chen, Yu Gu, Mei Hui, Feng Li, Chen Liu, Liangxu Liu, Beng Chin Ooi, Xiaoyan Yang, Dongxiang Zhang, Yuan Zhou
pp. 1148-1151
NNexus: an automatic linker for collaborative web-based corpora.
James J. Gardner, Aaron Krowne, Li Xiong
pp. 1152-1155
Tutorials
Performance evaluation in database research: principles and experience.
Stefan Manegold, Ioana Manolescu
p. 1156
Geographic privacy-aware knowledge discovery and delivery.
Fosca Giannotti, Dino Pedreschi, Yannis Theodoridis
pp. 1157-1158
Industry and Applications
Industrial sessions:
Industrial session
Personalizing entity detection and recommendation with a fusion of web log mining techniques.
Kathleen Tsoukalas, Bin Zhou, Jian Pei, Davor Cubranic
pp. 1100-1103
Estimating aggregates in time-constrained approximate queries in Oracle.
Ying Hu, Seema Sundara, Jagannathan Srinivasan
pp. 1104-1107