Assessing Wordplay-Pun classification from JOKER dataset with pretrained BERT humorous models

Victor Manuel Palma Preciado, Grigori Sidorov, Carolina Palma Preciado

Producción científica: Contribución a una revistaArtículo de la conferenciarevisión exhaustiva

1 Cita (Scopus)

Resumen

Humor is one of the most subjective matters of human behavior since it includes a wide range of variables: sentiments, wordplay, double meanings structurally or phonetic, all of this within the construction of written humor. It is important to assess the humor from a different point of view since this variability tends to provide insight into the true structure or the main core of the humoristic dilemma, as we know the range of humor is so diverse that it presents a high skilled problem even on the simplest tasks. Pre-trained base Bert and DistilBert models trained with a humorous one-liners dataset were used, these trained models were tested with a merged dataset from JOKER from data of tasks 1 and task 3, the collected data was trimmed from duplicated records and special characters to create a final dataset with 3,601 humorous sentences. Under this experiment we try to see if our models were able to detect a different humor from the initial type with which they were trained, it was noted that both methods are able to successfully classify another type of humor. On the one hand, it was expected that the pre-trained models would be able to classify at least a portion of the humor in the data set, the results obtained were much better than anticipated, obtaining 95.64% for BERT and 92.58% for DistilBERT, the models were really able to identify humor, an analysis of the worst and best cases were taken into account.

Idioma originalInglés
Páginas (desde-hasta)1828-1833
Número de páginas6
PublicaciónCEUR Workshop Proceedings
Volumen3180
EstadoPublicada - 2022
Evento2022 Conference and Labs of the Evaluation Forum, CLEF 2022 - Bologna, Italia
Duración: 5 sep. 20228 sep. 2022

Huella

Profundice en los temas de investigación de 'Assessing Wordplay-Pun classification from JOKER dataset with pretrained BERT humorous models'. En conjunto forman una huella única.

Citar esto