ENBIS-26 Conference

Name: ENBIS-26 Conference
Start: 2026-09-06T09:00:00+02:00
End: 2026-09-10T18:00:00+02:00
Location: Centro Didattico Morgagni

Sep 6 – 10, 2026

Centro Didattico Morgagni

Europe/Rome timezone

Chair of the Local Organising Committee

Clustering for Detecting Careless Respondents: A Case Study

Sep 9, 2026, 2:50 PM

20m

Conference Room 107

Data Analytics and Data Science: Case Studies Data Analytics and Data Science: Case Studies

Silvia Villanova (University of Padova)

The presence of careless respondents represents a well-known threat to the quality of survey data. Respondents who provide inattentive or random answers can distort statistical analyses, reduce measurement reliability, and bias substantive conclusions. A variety of indicators have been proposed in the literature to detect such respondents, including response pattern measures such as longstring indices, intra-individual response variability (IRV), and multivariate distance measures such as Mahalanobis distance. However, these indicators are often applied individually and require arbitrary thresholds, which may limit their effectiveness in complex datasets.
This work proposes a data-driven framework for identifying careless respondents that integrates several classical diagnostic indicators with clustering algorithms. First, multiple indicators capturing different aspects of response behavior are computed for each respondent, including measures of response consistency, variability, and multivariate outlyingness. These indicators are then jointly analyzed using unsupervised clustering techniques to identify groups of respondents with similar response patterns. In this way, respondents exhibiting atypical combinations of diagnostic indicators can be detected without relying on predefined cutoffs.
The proposed approach allows the simultaneous exploitation of complementary information contained in multiple quality indicators and provides a flexible and scalable tool for detecting potentially careless respondents. An empirical study based on survey data illustrates how the integration of traditional indicators with clustering methods can improve the identification of problematic response patterns and support more reliable data cleaning procedures in questionnaire-based research.

Classification	Both methodology and application
Keywords	careless responding, machine learning, smart data

Elena Barzizza (University of Padova) Ms Marta Disegna (University of Padova) Ms Sara Naziri (Stat4Value) Silvia Villanova (University of Padova)

There are no materials yet.

ENBIS-26 Conference

Chair of the Local Organising Committee

Clustering for Detecting Careless Respondents: A Case Study

Conference Room 107

Speaker

Description

Authors

Presentation materials

Choose timezone

ENBIS-26 Conference

Chair of the Local Organising Committee

Speaker

Description

Authors

Presentation materials