ENBIS-24 Leuven Conference

Name: ENBIS-24 Leuven Conference
Start: 2024-09-15T09:00:00+02:00
End: 2024-09-19T22:00:00+02:00
Location: Leuven, Belgium

15–19 Sept 2024

Leuven, Belgium

Europe/Berlin timezone

Chair of the Local Organising Committee

Evaluating classifier performance in hard classification tasks

17 Sept 2024, 10:25

20m

Conference room 2

Data Science Data Science

Amalia Vanacore (Department of Industrial Engineering University of Naples Federico II)Dr Armando Ciardiello (Dept. of Industrial Engineering, University of Naples Federico II)

Different data difficulty factors (e.g., class imbalance, class overlapping, presence of outliers and noisy observations and difficult border decisions) make classification tasks challenging in many practical applications and are hot topics in the domain of pattern recognition, machine learning and deep learning. Data complexity factors have been widely discussed in specialized literature from a model-based or a data-based perspective, conversely less research efforts have been devoted to investigating their effect on the behavior of classifier predictive performance measures. Our study tries to address this issue by investigating the impact of data complexity on the behavior of several measures of classifier predictive performance. The investigation has been conducted via an extensive study based on numerical experiments using artificial data sets. The data generation process has been controlled through a set of parameters (e.g., number of features; class frequency distributions; frequency distributions of safe and unsafe instances) defining the characteristics of generated data. The artificial data sets have been classified using several algorithms whose predictive performances have been evaluated through the measures under study. Study results highlight that, although the investigated performance measures quite agree for easy classification tasks (i.e., with balanced datasets containing only safe instances), their behavior significantly differs when dealing with difficult classification tasks (i.e., increasing data complexity) which is a rule in many real-word classification problems.

Acknowledgements
This study was carried out within the MICS (Made in Italy– Circular and Sustainable) Extended Partnership and received funding from the European Union Next-Generation EU (PIANO NAZIONALE DI RIPRESA E RESILIENZA (PNRR) – MISSIONE 4 COMPONENTE 2, INVESTIMENTO 1.3 –D.D. 1551.11-10-2022, PE00000004). This manuscript reflects only the authors’ views and opinions, neither the European Union nor the European Commission can be considered responsible for them.

Type of presentation	Talk
Classification	Both methodology and application
Keywords	Classifier predictive performance, Data Complexity, Artificial data

Amalia Vanacore (Department of Industrial Engineering University of Naples Federico II)

Dr Armando Ciardiello (Dept. of Industrial Engineering, University of Naples Federico II)

There are no materials yet.

ENBIS-24 Leuven Conference

Chair of the Local Organising Committee

Evaluating classifier performance in hard classification tasks

Conference room 2

Speakers

Description

Primary author

Co-author

Presentation materials

Choose timezone

ENBIS-24 Leuven Conference

Chair of the Local Organising Committee

Speakers

Description

Primary author

Co-author

Presentation materials