Use of Data Mining for Intelligent Evaluation of Imputation Methods
Autor:
la Red, David L.
; Primorac, Carlos R.
Fecha:
03/2023Palabra clave:
Revista / editorial:
International Journal of Interactive Multimedia and Artificial IntelligenceTipo de Ítem:
articleDirección web:
https://www.ijimai.org/journal/bibcite/reference/3291Resumen:
In real-world situations, researchers frequently face the difficulty of missing values (MV), i.e., values not observed in a data set. Data imputation techniques allow the estimation of MV using different algorithms, by means of which important data can be imputed for a particular instance. Most of the literature in this field deals with different imputation methods. However, few studies deal with a comparative evaluation of the different methods as to provide more appropriate guidelines for the selection of the method to be applied to impute data for specific situations. The objective of this work is to show a methodology for evaluating the performance of imputation methods by means of new metrics derived from data mining processes, using quality metrics of data mining models. We started from the complete dataset that was amputated with different amputation mechanisms to generate 63 datasets with MV; these were imputed using Median, k-NN, k-Means and Hot-Deck imputation methods. The performance of the imputation methods was evaluated using new metrics derived from quality metrics of the data mining processes, performed with the original full file and with the imputed files. This evaluation is not based on measuring the error when imputing (usual operation), but on considering the similarity of the values of the quality metrics of the data mining processes obtained with the original file and with the imputed files. The results show that –globally considered and according to the new proposed metric, the imputation methods that showed the best performance were k-NN and k-Means. An additional advantage of the proposed methodology is that it provides predictive data mining models that can be used a posteriori.
Ficheros en el ítem
Este ítem aparece en la(s) siguiente(s) colección(es)
Estadísticas de uso
Año |
2012 |
2013 |
2014 |
2015 |
2016 |
2017 |
2018 |
2019 |
2020 |
2021 |
2022 |
2023 |
2024 |
Vistas |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
72 |
68 |
Descargas |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
38 |
42 |
Ítems relacionados
Mostrando ítems relacionados por Título, autor o materia.
-
Finger Flexor Force Influences Performance in Senior Male Air Pistol Olympic Shooting
Mon, Daniel; Zakynthinaki, María S; Cordente, Carlos A; Monroy Anton, Antonio ; Rodríguez Rodríguez, Barbara; López Jiménez, David (PLOS One, 06/2015)The ability to stabilize the gun is crucial for performance in Olympic pistol shooting and is thought to be related to the shooters muscular strength. The present study examines the relation between performance and finger ... -
Music Boundary Detection using Convolutional Neural Networks: A Comparative Analysis of Combined Input Features
Hernandez-Olivan, Carlos; Beltran, Jose R.; Diaz-Guerra, David (International Journal of Interactive Multimedia and Artificial Intelligence (IJIMAI), 12/2021)The analysis of the structure of musical pieces is a task that remains a challenge for Artificial Intelligence, especially in the field of Deep Learning. It requires prior identification of the structural boundaries of the ... -
Lifetime Mental Health Problems in Adult Lower Secondary Education: A Student Survey
Aznárez-Sanado, Maite; Artuch-Garde, Raquel ; Carrica-Ochoa, Sarah; García-Roda, Carlos; Arellano, Araceli; Ramírez-Castillo, David; Arrondo, Gonzalo (Frontiers in Psychology, 06/07/2020)Background/Objective: Adult Lower Secondary Education is an education program for basic qualifications for the labor market. Our study aimed to compare lifetime mental health problems between current Adult Lower Secondary ...