Commonsense Reasoning

Challenging the Abilities of Large Language Models in Italian: a Community Initiative featured image

Challenging the Abilities of Large Language Models in Italian: a Community Initiative

The rapid progress of Large Language Models (LLMs) has transformed natural language processing and broadened its impact across research and society. Yet, systematic evaluation of …

Malvina Nissim
,
Danilo Croce
,
Viviana Patti
,
Pierpaolo Basile
,
Giuseppe Attanasio
,
Elio Musacchio
,
Matteo Rinaldi
,
Federico Borazio
,
Maria Francis
,
Jacopo Gili
,
others
GITA4CALAMITA - Evaluating the Physical Commonsense Understanding of Italian LLMs in a Multi-layered Approach: A CALAMITA Challenge featured image

GITA4CALAMITA - Evaluating the Physical Commonsense Understanding of Italian LLMs in a Multi-layered Approach: A CALAMITA Challenge

In the context of the CALAMITA Challenge, we investigate the physical commonsense reasoning capabilities of large language models (LLMs) and introduce a methodology to assess their …

Giulia Pensa
,
Ekhi Azurmendi
,
Julen Etxaniz
,
Begoña Altuna
,
Itziar Gonzalez-Dios