I2C-UHU at EXIST2024: Learning from Divergence and Perspectivism for Sexism Identification and Source Intent Classification

Guerrero García, Manuel; Cerrejón Naranjo, Manuel; Mata Vázquez, Jacinto; Pachón Álvarez, Victoria

dc.contributor.author	Guerrero García, Manuel
dc.contributor.author	Cerrejón Naranjo, Manuel
dc.contributor.author	Mata Vázquez, Jacinto
dc.contributor.author	Pachón Álvarez, Victoria
dc.date.accessioned	2024-12-10T12:43:43Z
dc.date.available	2024-12-10T12:43:43Z
dc.date.issued	2024
dc.description.abstract	In this paper, we present the contributions of the I2C-UHU team to the EXIST2024 Lab at CLEF 2024, focusing on the identification of sexism and the classification of source intent in social media texts. State-of-the-art transformer models are employed to address the complex and nuanced nature of sexist language. We adopt a two-fold approach: firstly, classifying tweets as sexist or non-sexist, and secondly, categorizing sexist tweets based on intent. Our innovative approach, employing Learning with Disagreement, incorporates diverse perspectives from multiple annotators, enhancing the robustness and accuracy of our models. We detail our data preprocessing, augmentation techniques, and hyperparameter optimization strategies. Our results in the competition demonstrated effectiveness, with our entries achieving positive rankings in the two tasks in which we participated. In Task 1, we secured the 10th position out of 70 participants on the hard labels leaderboard and the 13th position out of 40 for soft labels. In Task 2, we achieved the 11th position out of 46 participants for hard labels and the 17th position out of 35 in the best run for soft labels. Our findings provide a foundation for future research and practical applications in social media moderation and policy-making.	es_ES
dc.description.department	Tecnologías de la Información	es_ES
dc.description.sponsorship	This paper is part of the I+D+i Project titled “Conspiracy Theories and hate speech online: Comparison of patterns in narratives and social networks about COVID-19, immigrants, refugees and LGBTI people [NON-CONSPIRA-HATE!]”, PID2021-123983OB-I00, funded by MCIN/AEI/10.13039/501100011033/ and by “ERDF/EU”.	es_ES
dc.identifier.citation	Guerrero-García, M., Cerrejón-Naranjo, M., Mata-Vázquez, J., & Pachón-Álvarez, V. (2024). I2C-UHU at EXIST2024: Learning from Divergence and Perspectivism for Sexism Identification and Source Intent Classification. CEUR Workshop Proceedings, 3740, 1026-1042.	es_ES
dc.identifier.issn	1613-0073
dc.identifier.uri	https://hdl.handle.net/10272/24655
dc.language.iso	eng	es_ES
dc.publisher	CEUR-WS	es_ES
dc.rights	Attribution-NonCommercial-NoDerivs 3.0 Spain	*
dc.rights.accessRights	open access	es_ES
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/es/	*
dc.subject.other	Sexism identification	es_ES
dc.subject.other	Learning with disagreement	es_ES
dc.subject.other	Transformer models	es_ES
dc.subject.other	Natural language processing	es_ES
dc.subject.unesco	3304 Computer Technology	es_ES
dc.title	I2C-UHU at EXIST2024: Learning from Divergence and Perspectivism for Sexism Identification and Source Intent Classification	es_ES
dc.type	conference output	es_ES
dspace.entity.type	Publication
relation.isAuthorOfPublication	ac76819b-d91a-4158-b947-4a9e827e5e9d
relation.isAuthorOfPublication	47cb4892-3513-4d33-953c-8521bc9cb187
relation.isAuthorOfPublication.latestForDiscovery	ac76819b-d91a-4158-b947-4a9e827e5e9d

Files

Original bundle

Name: EXIST-2024_paper96.pdf
Size: 2.37 MB
Format: Adobe Portable Document Format
Description: Publisher's version

Collections

Conference papers, communications, and posters