1st Edition

Analysis of Distributional Data

Edited By Paula Brito, Sonia Dias Copyright 2022
404 Pages 30 Color & 80 B/W Illustrations
by Chapman & Hall

404 Pages 30 Color & 80 B/W Illustrations
by Chapman & Hall

404 Pages 30 Color & 80 B/W Illustrations
by Chapman & Hall

In a time when increasingly larger and complex data collections are being produced, it is clear that new and adaptive forms of data representation and analysis have to be conceived and implemented. Distributional data, i.e., data where a distribution rather than a single value is recorded for each descriptor, on each unit, come into this framework. Distributional data may result from the... Read more

I Data Representation and Exploratory Analysis
1. Fundamental Concepts about Distributional Data
Sónia Dias and Paula Brito
2. Descriptive Statistics based on Frequency Distributions
Sónia Dias and Paula Brito
3. Descriptive Statistics for Numeric Distributional Data
Antonio Irpino and Rosanna Verde
4. The Quantile Methods to Analyze Distributional Data
Manabu Ichino and Paula Brito

II Clustering and Classification
5. Partitive and Hierarchical Clustering of Distributional Data using the Wasserstein Distance
Rosanna Verde and Antonio Irpino
6. Divisive Clustering of Histogram Data
Marie Chavent and Paula Brito
7. Clustering of Modal Valued Data
Vladimir Batagelj, Simona Korenjak-Černe, and Nataša Kejžar
8. Mixture Models for Distributional Data
Richard Emilion
9. Classification of Continuous Distributional Data Using the Logratio Approach
Ivana Pavlu, Peter Filzmoser, Alessandra Menafoglio, and Karel Hron

III Dimension Reduction
10. Principal Component Analysis of Distributional Data
Sun Makosso-Kallyth and Edwin Diday
11. Principal Component Analysis of Numeric Distributional Data
Meiling Chen and Huiwen Wang
12. Multidimensional Scaling of Distributional Data
Yoshikazu Terada and Patrick J.F. Groenen
IV Regression and Forecasting
13. Regression Analysis with the Distribution and Symmetric Distribution Model
Sónia Dias and Paula Brito
14. Regression Analysis of Distributional Data Based on a Two-Component Model
Antonio Irpino and Rosanna Verde
15. Forecasting Distributional Time Series
Javier Arroyo

Biography

Paula Brito is a Professor at the Faculty of Economics of the University of Porto, and a member of the Artificial Intelligence and Decision Support Research Group (LIAAD) of INESC TEC, Portugal. She holds a doctorate degree in Applied Mathematics from the University Paris Dauphine, and an Habilitation in Applied Mathematics from the University of Porto. Her current research focuses on the analysis of multidimensional complex data, known as symbolic data, for which she develops statistical approaches and multivariate analysis methodologies. In this context, she has been involved in two European research projects. Paula Brito has been president of the International Association for Statistical Computing (IASC-ISI) in 2013–2015, and of the Portuguese Association for Classification and Data Analysis for the term 2021-2023. She has been invited speaker at several international conferences, and is a regularly member of international program committees. Paula Brito has been chair of COMPSTAT 2008 and will co-chair the IFCS 2022 conference.

Sónia Dias is a Professor in the area of Mathematics at the School of Technology and Management of the Polytechnic Institute of Viana do Castelo, and a member of the Laboratory in Artificial Intelligence and Decision Support (LIAAD) of INESC TEC, Portugal. She holds a PhD in Applied Mathematics from the University of Porto (2014). Her main scientific areas of research are Data Analysis, Symbolic Data Analysis (analysis of multidimensional complex data) and Statistical/Mathematical Applications. Under this context, she has participated in several conferences and published articles in international journals and proceedings. She was a member of the organizing committee of the international Symbolic Data Analysis Workshop - SDA2018 and is a member of the organizing committee of the IFCS 2022 conference.

" . . . this book will interest those who would like to expand their understanding regarding distributional data and its application in data science and to have a solid mathematical background on the different concepts under symbolic data analysis. This book also provides illustrative examples based on R package and open data which can contribute to the understanding on how to apply these methods to distributional data. This book can also benefit academic researchers who would like apply these types of approaches in their fields."
~Sébastien Bailly, ISCB Book Reviews