ChatGPT, Generative AI, Mathematical Problems, Wolfram Mathematica

dc.contributor.author: García Navarro, Alejandro L.
dc.contributor.author: Koneva, Nataliia
dc.contributor.author: Hernández, José Alberto
dc.contributor.author: Sánchez-Macián, Alfonso
dc.date: 2025-11-28
dc.date.accessioned: 2026-02-25T16:07:52Z
dc.date.available: 2026-02-25T16:07:52Z
dc.description.abstract: In November 2022, ChatGPT v3.5 was announced to the world. Since then, Generative Artificial Intelligence (GAI) has appeared in the news almost daily, showing impressive capabilities across many tasks that have surprised the research community and the world in general. Indeed, the number of tasks that ChatGPT and other Large Language Models (LLMs) can perform is remarkably large, especially when dealing with natural text. Text generation, summarisation, translation, and transformation (into poems, songs, or other styles) are some of their strengths. However, when it comes to reasoning or mathematical calculations, ChatGPT struggles. In this work, we compare different flavors of ChatGPT (v3.5, v4, and Wolfram GPT) on 20 mathematical tasks drawn from high school and first-year engineering courses. We show that GPT-4 is far more powerful than ChatGPT-3.5, and further that Wolfram GPT can slightly improve on the results obtained with GPT-4 on these mathematical tasks. [es_ES]
dc.identifier.citation: A. L. García Navarro, N. Koneva, J. A. Hernández, A. Sánchez-Macián. On the use of Large Language Models at Solving Math Problems: A Comparison Between GPT-4, LlaMA-2 and Gemini, International Journal of Interactive Multimedia and Artificial Intelligence, vol. 9, no. 5, pp. 40-50, 2025 [es_ES]
dc.identifier.doi: http://dx.doi.org/10.9781/ijimai.2025.03.001
dc.identifier.uri: https://reunir.unir.net/handle/123456789/19077
dc.language.iso: eng [es_ES]
dc.publisher: UNIR [es_ES]
dc.relation.uri: https://www.ijimai.org/index.php/ijimai/article/view/858 [es_ES]
dc.rights: openAccess [es_ES]
dc.subject: chat gpt [es_ES]
dc.subject: generative ai [es_ES]
dc.subject: mathematical problems [es_ES]
dc.subject: wolfram mathematics [es_ES]
dc.title: ChatGPT, Generative AI, Mathematical Problems, Wolfram Mathematica [es_ES]
dc.title.alternative: A Comparison Between GPT-4, LlaMA-2 and Gemini [es_ES]
dc.type: article [es_ES]
reunir.tag: ~IJIMAI [es_ES]

Files

Original bundle

Name:
On the Use of Large Language Models at Solving Math Problems.pdf
Size:
406.65 KB
Format:
Adobe Portable Document Format

License bundle

Name:
license.txt
Size:
1.27 KB
Format:
Item-specific license agreed upon to submission