Invited Talks
Workshop Program
8:20-8:45 Opening Remarks and Flash Session
8:45-10:00 Keynote 1
- Erhard Rahm (University of Leipzig, Germany): Scalable Matching of Real-world Data (abstract)
10:00 – 10:30 Coffee Break
10:30-12:00 Research Session 1: Performance and efficiency of entity resolution.
- Bill McNeill (Intelius), Hakan Kardes (Intelius), Andrew Borthwick (Intelius): Dynamic Record Blocking: Efficient Linking of Massive Databases in MapReduce.
- Tobias Vogel (Hasso-Plattner-Institut), Felix Naumann (Hasso-Plattner-Institut): Automatic Blocking Key Selection for Duplicate Detection based on Unigram Combinations.
- Jie Chen (East China Normal University), Cheqing Jin (East China Normal University), Rong Zhang (East China Normal University), Aoying Zhou (East China Normal University): A Learning Method for Entity Matching.
- Panel by morning keynote speaker and presenters.
12:00 – 13:30 Lunch
13:30 – 15:30 Keynote 2 and Session 2: Data cleaning and truth discovery
- Ihab Ilyas (Qatar Computing Research Institute): Non-destructive Cleaning: Modeling and Querying Possible Data Repairs (abstract)
- Bo Zhao (UIUC), Jiawei Han (UIUC): A Probabilistic Model for Estimating Real-valued Truth from Conflicting Sources.
15:30 – 16:00 Coffee Break
16:00 – 17:30 Session 3: War stories in data quality.
- Thierno Diallo (University of Lyon - LIRIS), Jean-Marc Petit (Universite Lyon - LIRIS), Sylvie Servigne (Universite Lyon - LIRIS): Discovering Editing Rules For Data Cleaning.
- Julianna Göbölös-Szabó (MTA SZTAKI), Natalia Prytkova (MPI für Informatik), Marc Spaniol (MPI für Informatik), Gerhard Weikum (Max Planck Institute for Informatics): Cross-Lingual Data Quality for Knowledge Base Acceleration across Wikipedia Editions.
- Cinzia Cappiello (Politecnico di Milano), Fabio Schreiber (Politecnico di Milano): Quality- and Energy-Aware Data Aggregation in WSN Data Streams.
- Panel by afternoon keynote speaker and presenters.
