Eindhoven University of Technology (TU/e): Assistant Professor, Data Mining Group
Open University of Cyprus (OUC): Lecturer, Faculty of Pure and Applied Sciences
École polytechnique fédérale de Lausanne (EPFL): Adjunct Faculty
European Commission: Independent Expert
Technical University of Crete: Research Collaborator, SoftNet Lab
Open University of Cyprus: Adjunct Faculty
L3S Research Center: Researcher
PhD, Leibniz Universität Hannover, Germany
MSc in Computer Science, Saarland University, Germany
MSc in Advanced Information Technologies, University of Cyprus, Cyprus
BSc in Computer Science, University of Cyprus, Cyprus
Tilburg University, 2020-2021:
Quartile 4 → 320099-M-6 – Interactive Data Transformation
Quartile 3 → 320092-M-6 – Business Intelligence and Data Management, co-lecturer with E.A.M. Caron
Completed Courses:
• 320092-M-6 – Business Intelligence and Data Management: Tilburg University (co-lecturer with E.A.M. Caron), 2019-2020, Quartile 3
• JBI030 – Data Mining: TU/e (co-lecturer with D. Vidotto), 2018-2019, Quartile 3
• 2IMM15 – Web information retrieval and data mining: TU/e (co-lecturer with J. Vanschoren), 2018-2019, Quartile 3
• JBG040 – Data Challenge 1: TU/e, 2018-2019, Quartile 2
• 2ID50 – Data modelling and databases: TU/e (responsible for instructions), 2018-2019, Quartile 2
• ENV-342 – Geographic information systems: EPFL (responsible for the database lectures), 2017-2018
• KPS510 – Web Technologies: OUC, 2017-2018, Winter Semester
• PES521 – Research Methods: OUC, 2017-2018 & 2016-2017, Spring Semester
PDEng (Professional Doctorate in Engineering):
• A. Laponin, “Extracting information from unstructured service logbook texts” (in collaboration with
Océ Technologies B.V.), TU/e, Oct. 2019.
M.Sc. Dissertations:
• Ronald van Asseldonk, “Designing a Method for Extracting Invoice Files using Automation
Technologies”, UvT, Aug. 2020.
• Sem P. L. Nijssen, “An approach for organizations to embed security within a DevOps environment
in order to effectively manage information risks” (in collaboration with CGI), UvT, June 2020.
• Linde M. A. Koolen, “Using IoT Field Data to Add Value to the Business” (in collaboration with
Signify Holding B.V.), UvT, June 2020.
• Tim W. A. van Lier, “Process mining in practice: learning from experts”, UvT, June 2020.
• N. van Son, “Integration of various financial data sources” (in collaboration with NIC), TU/e, Nov. 2019.
• R. Coenders, “Search term clustering” (in collaboration with company ADchieve), TU/e, Aug. 2019.
• U. Biswas, “Normalization of Extracted Named-Entities in Text Mining” (in collaboration with ZyLAB), TU/e, Aug. 2019.
• M. Nikolaou, “Data for evaluating entity-related methodologies”, OUC, June 2017.
• E. Routzouni, “Detecting and monitoring entity evolution”, OUC, June 2013.
Data Integration and Cleaning
Ekaterini Ioannou
SIKS Course on Data Science, 5-6 October 2020.
Entity Resolution: Past, Present and Yet-to-Come
George Papadakis,
Ekaterini Ioannou, and
Themis Palpanas
In Proceedings of the 23rd International Conference on Extending Database Technology (EDBT), April 2020, Denmark.
The Four Generations of Entity Resolution
George Papadakis,
Ekaterini Ioannou,
Emmanouil Thanos, and
Themis Palpanas
Synthesis Lectures on Data Management, Morgan & Claypool Publishers, 2021.
Management of Inconsistencies in Data Integration
Ekaterini Ioannou and
Slawek Staworko
In book: "Data Exchange, Information, and Streams, Dagstuhl Follow-Ups",
Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik, Chapter 9, pp. 217-225, 2013.
Embracing Uncertainty in Entity Linking
Ekaterini Ioannou,
Wolfgang Nejdl,
Claudia Niederée,
Yannis Velegrakis
In book: "Semantic Search Over the Web", Springer, Chapter 9, pp. 223-251, 2012.
Query Analytics over Probabilistic Databases with Unmerged Duplicates
Ekaterini Ioannou and
Minos Garofalakis
In IEEE Transactions on Knowledge and Data Engineering 27(8), pages 2245-2260, 2015.
Searching Web 2.0 Data through Entity-Based Aggregation
Ekaterini Ioannou and Yannis Velegrakis
In Journal of Transactions on Computational Collective Intelligence, pages 159-174, 2016.
Data Management Research at the Technical University of Crete
Stavros Christodoulakis, Minos Garofalakis, Euripides Petrakis, Antonios Deligiannakis, Vasilis Samoladas, Ekaterini Ioannou,
Odysseas Papapetrou and Stelios Sotiriadis.
In SIGMOD Record 42(4), pages 61-66, December 2013.
A Blocking Framework for Entity Resolution in Highly Heterogeneous Information Spaces
George Papadakis,
Ekaterini Ioannou,
Themis Palpanas,
Claudia Niederée,
Wolfgang Nejdl.
In IEEE Transactions on Knowledge and Data Engineering 25(12), pages 2665-2682, 2013.
On Generating Benchmark Data for Entity Matching
Ekaterini Ioannou,
Nataliya Rassadko,
Yannis Velegrakis.
In Journal on Data Semantics, Springer, March 2013, Volume 2, Issue 1, pp 37-56.
Leveraging Personal Metadata for Desktop Search - The Beagle++ System
Enrico Minack, Raluca Paiu, Stefania Costache,
Gianluca Demartini, Julien Gaugaz, Ekaterini Ioannou,
Paul-Alexandru Chirita, and Wolfgang Nejdl
In Journal of Web Semantics, 2010, Volume 8, Issue 1, pp 37-54.
Entity Resolution in Large Patent Databases: An Optimization Approach
Emiel Caron and Ekaterini Ioannou
In Proceedings of the International Conference on Enterprise Information Systems (ICEIS), April 2021.
Support of Part-whole Relations in Query Answering
Piotr Kozikowski, Ekaterini Ioannou, Yannis Velegrakis, and Francesco Guerra
In Proceedings of the 1st KEYSTONE Conference (IKC), September 2015, Portugal.
Analytics over Probabilistic Unmerged Duplicates
Ekaterini Ioannou and
Minos Garofalakis
In Proceedings of the 8th Conference on Scalable Uncertainty Management (SUM), September 2014, London.
Beyond 100 million entities: large-scale blocking-based resolution for heterogeneous data
George Papadakis,
Ekaterini Ioannou,
Claudia Niederée,
Themis Palpanas,
Wolfgang Nejdl.
In Proceedings of the 5th ACM International Conference on Web Search and Data Mining (WSDM), Feb. 2012, Seattle.
LinkDB: A Probabilistic Linkage Database System
Ekaterini Ioannou,
Wolfgang Nejdl,
Claudia Niederée,
Yannis Velegrakis.
In Proceedings of the ACM SIGMOD International Conference on Management of Data, 12-16 June 2011, Athens, Greece.
Efficient Discovery of Frequent Subgraph Patterns in Uncertain Graph Databases
Odysseas Papapetrou,
Ekaterini Ioannou, and
Dimitrios Skoutas.
In Proceedings of the 14th International Conference on Extending Database Technology (EDBT), Mar. 2011, Sweden.
Efficient Entity Resolution for Large Heterogeneous Information Spaces
George Papadakis,
Ekaterini Ioannou,
Claudia Niederée, and
Peter Fankhauser.
In Proceedings of the 4th ACM International Conference on Web Search and Data Mining (WSDM), Feb. 2011, Hong Kong.
Eliminating the Redundancy in Blocking-based Entity Resolution Methods
George Papadakis,
Ekaterini Ioannou,
Claudia Niederée,
Themis Palpanas,
Wolfgang Nejdl.
In Proceedings of the ACM/IEEE Joint Conference on Digital Libraries (JCDL), 13-17 June 2011, Ottawa, Canada.
To Compare or Not to Compare: Making Entity Resolution more Efficient
George Papadakis,
Ekaterini Ioannou,
Claudia Niederée,
Themis Palpanas,
Wolfgang Nejdl.
In Semantic Web Information Management Workshop, co-located with SIGMOD, June 2011, Athens, Greece.
On-the-Fly Entity-Aware Query Processing in the Presence of Linkage
Ekaterini Ioannou,
Wolfgang Nejdl,
Claudia Niederée,
Yannis Velegrakis
In Proceedings of the VLDB Endowment (PVLDB), Vol. 3, No. 1, 13-17 Sep. 2010, Singapore.
Enabling Entity-Based Aggregators for Web 2.0 data
Ekaterini Ioannou,
Claudia Niederée,
Yannis Velegrakis
In Proceedings of the 19th International World Wide Web Conference (WWW), 26-30 April 2010, Raleigh, NC, USA.
Efficient Semantic-Aware Detection of Near Duplicate Resources
Ekaterini Ioannou,
Odysseas Papapetrou,
Dimitrios Skoutas,
Wolfgang Nejdl
In Proceedings of the 7th Extended Semantic Web Conference (ESWC), 30 May - 03 June 2010, Heraklion, Greece.
From Web Data to Entities and Back
Z. Miklos,
N. Bonvin,
P. Bouquet, M. Catasta, D. Cordioli, P. Fankhauser, J. Gaugaz,
E. Ioannou, H. Koshutanski, A. Mana,
C. Niederée,
T. Palpanas, and H. Stoermer
In 22nd International Conference on Advanced Information Systems Engineering (CAiSE), June 2010, Hammamet, Tunisia.
Efficient Term Cloud Generation for Streaming Web Content
Odysseas Papapetrou,
George Papadakis, Ekaterini Ioannou, and
Dimitrios Skoutas
In 10th International Conference on Web Engineering (ICWE), July 2010, Vienna, Austria.
Detecting Contexts on the Desktop Using Bayesian Networks
Stefania Costache, Julien Gaugaz, Ekaterini Ioannou,
Claudia Niederée, and
Wolfgang Nejdl
In DESKTOP Search Workshop, co-located with SIGIR,
July 2010, Geneva, Switzerland.
Entity-Aware Query Processing for Heterogeneous Data with Uncertainty and Correlations
Ekaterini Ioannou
In Joint EDBT/ICDT Ph.D. Workshop, March 2009,
St.-Petersburg, Russia. (Best Submission Award)
Entity Search with NECESSITY
Ekaterini Ioannou, Saket Sathe,
Nicolas Bonvin,
Anshul Jain, Srikanth Bondalapati,
Gleb Skobeltsyn,
Claudia Niederée,
Zoltan Miklos
In Web and Databases Workshop (WebDB) co-located with ACM SIGMOD, June 2009, Providence, Rhode Island.
Probabilistic Entity Linkage for Heterogeneous Information Spaces
Ekaterini Ioannou,
Claudia Niederée,
Wolfgang Nejdl
In Advanced Information Systems Engineering, 20th International Conference (CAiSE), June 2008, Montpellier, France.
Access Control for sharing Semantic Data Across Desktops
Ekaterini Ioannou, Juri Luca De Coi, Arne Kösling, Daniel Olmedilla,
Wolfgang Nejdl
In 1st International Workshop on Privacy Enforcement and Accountability with Semantics (PEAS), International Semantic Web Conference, November 2007, Busan, Korea.
The Beagle++ Toolbox: Towards an Extendable Desktop Search Architecture
Ingo Brunkhorst, Paul-Alexandru Chirita, Stefania
Costache, Julien Gaugaz, Ekaterini Ioannou, Tereza
Iofciu, Enrico Minack, Wolfgang Nejdl, Raluca Paiu
In Semantic Desktop Workshop (SemDesk-2006), International Semantic Web Conference, November 2006, Athens, GA, USA.
EMBench++ is a principled system for the evaluation of entity matching techniques. It offers a unique test case generation approach that combines different levels of types, complexity and scales, allowing a complete and accurate evaluation of the different aspects of a matching technique.
Provides a holistic and systematic view of the evolution of Entity Resolution methods by categorizing them into four generations, from those crafted to maximize Veracity over structured data to those tackling Veracity, Volume, Variety and Velocity over semi-structured data.
• Synthetic Entity-related Collections: used in the evaluation of the PVLDB 2018 paper.
• Entity Request Collections: can be used to evaluate methodologies for entity linkage as well as entity search.
• News Articles Collection: RDF data describing the entities from news articles.