Themis Palpanas's Books

Data Exploration Using Example-Based Methods

By: Matteo Lissandrini, Davide Mottin, Themis Palpanas, Yannis Velegrakis

Data usually comes in a plethora of formats and dimensions, rendering the information extraction and exploration processes challenging. Thus, being able to perform exploratory analyses of the data with the intent of having an immediate glimpse of some of the data properties is becoming crucial. Exploratory analyses should be simple enough to avoid complicated declarative languages (such as SQL) and mechanisms, while at the same time retaining the flexibility and expressiveness of such languages. Recently, we have witnessed a rediscovery of the so-called example-based methods, in which the user, or analyst, circumvents query languages by using examples as input. An example is a representative of the intended results or, in other words, an item from the result set. Example-based methods exploit inherent characteristics of the data to infer the results that the user has in mind but may not be able to (easily) express. They can be useful in cases where a user is looking for information in an unfamiliar dataset, when they are performing a particularly challenging task like finding duplicate items, or when they are simply exploring the data. In this book, we present an excursus over the main methods for exploratory analysis, with a particular focus on example-based methods. We show how different data types require different techniques and present algorithms that are specifically designed for relational, textual, and graph data. The book also presents the challenges and new frontiers of machine learning in online settings that have recently attracted the attention of the database community. The book concludes with a vision for further research and applications in this area.

Page Count: 166

Publisher: Morgan & Claypool Publishers

ISBN 10: 1681734567

ISBN 13: 9781681734569

The Four Generations of Entity Resolution

By: George Papadakis, Ekaterini Ioannou, Emanouil Thanos, Themis Palpanas

Entity Resolution (ER) lies at the core of data integration and cleaning and, thus, the bulk of the research examines ways of improving its effectiveness and time efficiency. The initial ER methods primarily target Veracity in the context of structured (relational) data that are described by a schema of well-known quality and meaning. To achieve high effectiveness, they leverage schema, expert, and/or external knowledge. Some of these methods have been extended to address Volume, processing large datasets through multi-core or massive parallelization approaches, such as the MapReduce paradigm. However, these early schema-based approaches are inapplicable to Web Data, which abound in voluminous, noisy, semi-structured, and highly heterogeneous information. To address the additional challenge of Variety, recent works on ER adopt a novel, loosely schema-aware functionality that emphasizes scalability and robustness to noise. Another line of present research focuses on the additional challenge of Velocity, aiming to process data collections of a continuously increasing volume. The latest works, though, take advantage of the significant breakthroughs in Deep Learning and Crowdsourcing, incorporating external knowledge to enhance the existing works to a significant extent. This synthesis lecture organizes ER methods into four generations based on the challenges posed by these four Vs. For each generation, we outline the corresponding ER workflow, discuss the state-of-the-art methods per workflow step, and present current research directions. The discussion of these methods takes into account a historical perspective, explaining the evolution of the methods over time along with their similarities and differences. The lecture also discusses the available ER tools and benchmark datasets that allow expert as well as novice users to make use of the available solutions.

Page Count: 152

Publisher: Springer Nature

ISBN 10: 3031018788

ISBN 13: 9783031018787

Data Exploration Using Example-Based Methods

By: Matteo Lissandrini, Davide Mottin, Themis Palpanas, Yannis Velegrakis

Data usually comes in a plethora of formats and dimensions, rendering the exploration and information extraction processes challenging. Thus, being able to perform exploratory analyses of the data with the intent of having an immediate glimpse of some of the data properties is becoming crucial. Exploratory analyses should be simple enough to avoid complicated declarative languages (such as SQL) and mechanisms, and at the same time retain the flexibility and expressiveness of such languages. Recently, we have witnessed a rediscovery of the so-called example-based methods, in which the user, or the analyst, circumvents query languages by using examples as input. An example is a representative of the intended results or, in other words, an item from the result set. Example-based methods exploit inherent characteristics of the data to infer the results that the user has in mind but may not be able to (easily) express. They can be useful in cases where a user is looking for information in an unfamiliar dataset, when the task is particularly challenging, like finding duplicate items, or simply when they are exploring the data. In this book, we present an excursus over the main methods for exploratory analysis, with a particular focus on example-based methods. We show how different data types require different techniques, and present algorithms that are specifically designed for relational, textual, and graph data. The book also presents the challenges and the new frontiers of machine learning in online settings, which have recently attracted the attention of the database community. The lecture concludes with a vision for further research and applications in this area.

Page Count: 146

Publisher: Springer Nature

ISBN 10: 3031018664

ISBN 13: 9783031018664
