Improving Pipelining Tools for Pre-processing Data
Authors: Novo-Lourés, María; Lage, Yeray; Pavón, Reyes; Laza, Rosalía; Ruano-Ordás, David; Méndez, José Ramón
Date: 06/2022
Journal / publisher: International Journal of Interactive Multimedia and Artificial Intelligence (IJIMAI)
Item type: article
Web address: https://www.ijimai.org/journal/bibcite/reference/3028
Abstract:
The last several years have seen the emergence of data mining and its transformation into a powerful tool that adds value to business and research. Data mining makes it possible to explore and find unseen connections between variables and facts observed in different domains, helping us to better understand reality. The programming methods and frameworks used to analyse data have evolved over time. Currently, the use of pipelining schemes is the most reliable way of analysing data and, as a result, several important companies now offer this kind of service. Moreover, several frameworks compatible with different programming languages are available for the development of computational pipelines, and many research studies have addressed the optimization of data processing speed. However, as this study shows, early error detection techniques and developer support mechanisms are very limited in these frameworks. In this context, this study introduces different improvements, such as the design of different types of constraints for the early detection of errors, the creation of functions to facilitate debugging of concrete tasks included in a pipeline, the invalidation of erroneous instances and/or the introduction of the burst-processing scheme. By adding these functionalities, we developed Big Data Pipelining for Java (BDP4J, https://github.com/sing-group/bdp4j), a fully functional new pipelining framework that shows the potential of these features.
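The ideas named in the abstract (constraints checked before execution, invalidation of erroneous instances, and burst processing) can be illustrated with a minimal Java sketch. It is not the BDP4J API: every class and method name below (`PipelineSketch`, `Task`, `Instance`, `checkTypes`, `runBurst`) is hypothetical, and "burst processing" is read here as running each task over the whole batch of instances before the next task starts, which is an assumption rather than a description of the framework.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Function;

// Illustrative sketch only. All names are hypothetical and NOT taken from BDP4J.
final class PipelineSketch {

    /** A data item flowing through the pipeline; it can be marked invalid. */
    static final class Instance {
        Object data;
        boolean valid = true;

        Instance(Object data) {
            this.data = data;
        }
    }

    /** A task declares the type it accepts and the type it produces. */
    static final class Task {
        final String name;
        final Class<?> inputType;
        final Class<?> outputType;
        final Function<Object, Object> body;

        Task(String name, Class<?> inputType, Class<?> outputType,
             Function<Object, Object> body) {
            this.name = name;
            this.inputType = inputType;
            this.outputType = outputType;
            this.body = body;
        }
    }

    /** Early constraint check: each task must accept what the previous one produces. */
    static void checkTypes(List<Task> tasks) {
        for (int i = 1; i < tasks.size(); i++) {
            Task prev = tasks.get(i - 1);
            Task next = tasks.get(i);
            if (!next.inputType.isAssignableFrom(prev.outputType)) {
                throw new IllegalStateException("Task '" + next.name
                        + "' cannot consume the output of '" + prev.name + "'");
            }
        }
    }

    /**
     * Assumed burst scheme: each task is applied to every instance of the
     * batch before the next task starts. An instance whose task throws is
     * invalidated and skipped by later tasks instead of aborting the run.
     */
    static List<Instance> runBurst(List<Task> tasks, List<Instance> burst) {
        checkTypes(tasks); // fail before any data is processed
        for (Task task : tasks) {
            for (Instance instance : burst) {
                if (!instance.valid) {
                    continue;
                }
                try {
                    instance.data = task.body.apply(instance.data);
                } catch (RuntimeException e) {
                    instance.valid = false;
                }
            }
        }
        List<Instance> survivors = new ArrayList<>();
        for (Instance instance : burst) {
            if (instance.valid) {
                survivors.add(instance);
            }
        }
        return survivors;
    }
}
```

In this sketch a pipeline whose second task declares an input type incompatible with the first task's output is rejected by `checkTypes` before any instance is touched, while a single malformed instance is dropped without stopping the rest of the batch.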
Usage statistics
Year | Views | Downloads
2012 | 0 | 0
2013 | 0 | 0
2014 | 0 | 0
2015 | 0 | 0
2016 | 0 | 0
2017 | 0 | 0
2018 | 0 | 0
2019 | 0 | 0
2020 | 0 | 0
2021 | 0 | 0
2022 | 18 | 3
2023 | 55 | 24
2024 | 127 | 46
Related items
Showing items related by title, author, or subject.
- Proportion of conflict, contingency learning, and recency effects in a Stroop task
  Jiménez, Luis; Gallego, David; Agra, Oscar; Lorda, María José; Méndez Paz, Cástor (SAGE Publications Ltd, 2022)
  Recent research on the relation between learning and cognitive control has assumed that conflict modulates learning, either by increasing arousal and hence improving learning in high-conflict situations, or by inducing ...
- Percepción del estilo parental y calidad de vida relacionada con la salud entre adolescentes
  Jódar Martínez, Rosalía; Martín Chaparro, María Del Pilar; Hidalgo Montesinos, María Dolores; Martínez Ramón, Juan Pedro (Revista Española de Pedagogía, 2022)
  The interaction between health-related quality of life and parenting styles can give rise to perceptions that influence adolescents' behaviour. It is thought that it may affect elements ...
- Ortega y Gasset sobre la supuesta inconveniencia de leer Don Quijote en la escuela
  Ariso Salgado, José María; Díaz Lage, José María (Logos (Spain), 2022)
  This article presents the position held by José Ortega y Gasset in the debate that took place in early twentieth-century Spain on the advisability of reading Don Quixote in schools. To this end, we begin ...