A Clustering Algorithm Based on an Ensemble of Dissimilarities: An Application in the Bioinformatics Domain
Autor:
Martín Merino, Manuel
; López Rivero, Alfonso José
; Alonso, Vidal
; Vallejo, Marcelo
; Ferreras, Antonio
Fecha:
09/2022Palabra clave:
Revista / editorial:
International Journal of Interactive Multimedia and Artificial Intelligence (IJIMAI)Tipo de Ítem:
articleDirección web:
https://ijimai.org/journal/bibcite/reference/3181Resumen:
Clustering algorithms such as k-means depend heavily on choosing an appropriate distance metric that reflect accurately the object proximities. A wide range of dissimilarities may be defined that often lead to different clustering results. Choosing the best dissimilarity is an ill-posed problem and learning a general distance from the data is a complex task, particularly for high dimensional problems. Therefore, an appealing approach is to learn an ensemble of dissimilarities. In this paper, we have developed a semi-supervised clustering algorithm that learns a linear combination of dissimilarities considering incomplete knowledge in the form of pairwise constraints. The minimization of the loss function is based on a robust and efficient quadratic optimization algorithm. Besides, a regularization term is considered that controls the complexity of the distance metric learned avoiding overfitting. The algorithm has been applied to the identification of tumor samples using the gene expression profiles, where domain experts provide often incomplete knowledge in the form of pairwise constraints. We report that the algorithm proposed outperforms a standard semi-supervised clustering technique available in the literature and clustering results based on a single dissimilarity. The improvement is particularly relevant for applications with high level of noise.
Ficheros en el ítem
Este ítem aparece en la(s) siguiente(s) colección(es)
Estadísticas de uso
Año |
2012 |
2013 |
2014 |
2015 |
2016 |
2017 |
2018 |
2019 |
2020 |
2021 |
2022 |
2023 |
2024 |
Vistas |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
11 |
78 |
136 |
Descargas |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
9 |
56 |
43 |
Ítems relacionados
Mostrando ítems relacionados por Título, autor o materia.
-
Empirical Analysis of Ethical Principles Applied to Different AI Uses Cases
López Rivero, Alfonso José; Beato, M. Encarnación; Muñoz Martínez, César; Cortiñas Vázquez, Pedro Gonzalo (International Journal of Interactive Multimedia and Artificial Intelligence (IJIMAI), 12/2022)In this paper, we present an empirical study on the perception of the ethical challenges of artificial intelligence groups in the classification made by the European Union (EU). The study seeks to identify the ethical ... -
Improvement of Memory and Motivation in Language Learning in Primary Education through the Interactive Digital Whiteboard (IDW): The Future in a Post-Pandemic Period
Bautista Vallejo, José Manuel; Hernández-Carrera, Rafael M. ; Moreno-Rodríguez, Ricardo; López-Bastias, José Luis (Sustainability, 10/2020)This paper presents an analysis of the use of an interactive digital whiteboard (IDW) and a computer application called Action Manager (AM), with a sample of 158 sixth-grade students in primary education. Subsequently, a ... -
Introducing the CYSAS-S3 Dataset for Operationalizing a Mission-Oriented Cyber Situational Awareness
Medenou Choumanof, Roumen Daton; Llopis Sánchez, Salvador; Calzado Mayo, Victor Manuel; Garcia Balufo, Miriam; Páramo Castrillo, Miguel; González Garrido, Francisco José; Luis Martinez, Alvaro; Nevado Catalán, David; Hu, Ao; Rodriguez-Bermejo, David Sandoval; Pasqual De Riquelme, Gerardo Ramis; Sotelo Monge, Marco Antonio; Berardi, Antonio; De Santis, Paolo; Torelli, Francesco; Maestre Vidal, Jorge (Sensors, 2022)The digital transformation of the defence sector is not exempt from innovative requirements and challenges, with the lack of availability of reliable, unbiased and consistent data for training automatisms (machine learning ...