Efficient and Robust Model Benchmarks with Item Response Theory and Adaptive Testing

Song, Hao; Flach, Peter

dc.contributor.author	Song, Hao
dc.contributor.author	Flach, Peter
dc.date	2021-03
dc.date.accessioned	2022-04-25T08:26:43Z
dc.date.available	2022-04-25T08:26:43Z
dc.identifier.issn	1989-1660
dc.identifier.uri	https://reunir.unir.net/handle/123456789/12915
dc.description.abstract	Progress in predictive machine learning is typically measured on the basis of performance comparisons on benchmark datasets. Traditionally these kinds of empirical evaluation are carried out on large numbers of datasets, but this is becoming increasingly hard due to computational requirements and the often large number of alternative methods to compare against. In this paper we investigate adaptive approaches to achieve better efficiency on model benchmarking. For a large collection of datasets, rather than training and testing a given approach on every individual dataset, we seek methods that allow us to pick only a few representative datasets to quantify the model’s goodness, from which to extrapolate to performance on other datasets. To this end, we adapt existing approaches from psychometrics: specifically, Item Response Theory and Adaptive Testing. Both are well-founded frameworks designed for educational tests. We propose certain modifications following the requirements of machine learning experiments, and present experimental results to validate the approach.	es_ES
dc.language.iso	eng	es_ES
dc.publisher	International Journal of Interactive Multimedia and Artificial Intelligence (IJIMAI)	es_ES
dc.relation.ispartofseries	;vol. 6, nº 5
dc.relation.uri	https://www.ijimai.org/journal/bibcite/reference/2901	es_ES
dc.rights	openAccess	es_ES
dc.subject	item response theory	es_ES
dc.subject	adaptive testing	es_ES
dc.subject	model evaluation	es_ES
dc.subject	benchmark	es_ES
dc.subject	IJIMAI	es_ES
dc.title	Efficient and Robust Model Benchmarks with Item Response Theory and Adaptive Testing	es_ES
dc.type	article	es_ES
reunir.tag	~IJIMAI	es_ES
dc.identifier.doi	https://doi.org/10.9781/ijimai.2021.02.009

Ficheros en el ítem

Nombre:: ijimai_6_5_11_0.pdf
Tamaño:: 1.481Mb
Formato:: PDF

Ver/Abrir

Este ítem aparece en la(s) siguiente(s) colección(ones)

vol. 6, nº 5, march 2021

Mostrar el registro sencillo del ítem

Efficient and Robust Model Benchmarks with Item Response Theory and Adaptive Testing

Ficheros en el ítem

Este ítem aparece en la(s) siguiente(s) colección(ones)

Ítems relacionados

Preface ﻿

An Ensemble Classifier for Stock Trend Prediction Using Sentence-Level Chinese News Sentiment and Technical Indicators ﻿

A Generalized Wine Quality Prediction Framework by Evolutionary Algorithms ﻿

Preface

An Ensemble Classifier for Stock Trend Prediction Using Sentence-Level Chinese News Sentiment and Technical Indicators

A Generalized Wine Quality Prediction Framework by Evolutionary Algorithms