Archivio istituzionale della ricerca dell'Università degli Studi di Palermo

With this work we propose an application of the ELECTRA Transformer, fine-tuned on two augmented version of the same training dataset. Our team developed the novel framework for taking part at the Profiling Cryptocurrency Influencers with Few-shot Learning task hosted at PAN@CLEF2023. Our proposed strategy consists of an early data augmentation stage followed by a fine-tuning of ELECTRA. At the first stage we augment the original training dataset provided by the organizers using backtranslation. Using this augmented version of the training dataset, we perform a fine tuning of ELECTRA. Finally, using the fine-tuned version of ELECTRA, we inference the labels of the samples provided in the test set. To develop and test our model we used a two-ways validation on the training set. Firstly, we evaluate all the metrics on the augmented training set, and then we evaluate on the original training set. The metrics we considered span from accuracy to Macro F1, to Micro F1, to Recall and Precision. According to the official evaluator, our best submission reached a Macro F1 value equal to 0.3762.

Siino M., Tesconi M., Tinnirello I. (2023). Profiling Cryptocurrency Influencers with Few-Shot Learning Using Data Augmentation and ELECTRA. In Working Notes of the Conference and Labs of the Evaluation Forum (CLEF-WN 2023), Thessaloniki, Greece, September 18th to 21st, 2023 (pp. 2772-2781). CEUR-WS.

Profiling Cryptocurrency Influencers with Few-Shot Learning Using Data Augmentation and ELECTRA

Siino M.;Tesconi M.;Tinnirello I.

2023-01-01

Abstract

With this work we propose an application of the ELECTRA Transformer, fine-tuned on two augmented version of the same training dataset. Our team developed the novel framework for taking part at the Profiling Cryptocurrency Influencers with Few-shot Learning task hosted at PAN@CLEF2023. Our proposed strategy consists of an early data augmentation stage followed by a fine-tuning of ELECTRA. At the first stage we augment the original training dataset provided by the organizers using backtranslation. Using this augmented version of the training dataset, we perform a fine tuning of ELECTRA. Finally, using the fine-tuned version of ELECTRA, we inference the labels of the samples provided in the test set. To develop and test our model we used a two-ways validation on the training set. Firstly, we evaluate all the metrics on the augmented training set, and then we evaluate on the original training set. The metrics we considered span from accuracy to Macro F1, to Micro F1, to Recall and Precision. According to the official evaluator, our best submission reached a Macro F1 value equal to 0.3762.

Scheda breve

Scheda completa

Scheda completa (DC)

	Data
	
				2023
			
	URL dell'editore (Open access ove possibile)
	
				https://ceur-ws.org/Vol-3497/paper-232.pdf
			
	Citazione
	
				Siino M.,  Tesconi M.,  Tinnirello I. (2023). Profiling Cryptocurrency Influencers with Few-Shot Learning Using Data Augmentation and ELECTRA. In Working Notes of the Conference and Labs of the Evaluation Forum (CLEF-WN 2023), Thessaloniki, Greece, September 18th to 21st, 2023 (pp. 2772-2781). CEUR-WS.
			
	Appare nelle tipologie:
	
				2.07 Contributo in atti di convegno pubblicato in volume

File in questo prodotto:

File	Dimensione	Formato
10 - Profiling cryptocurrency influencers with few-shot learning using data augmentation and electra.pdf accesso aperto Tipologia: Versione Editoriale Dimensione 1.37 MB Formato Adobe PDF Visualizza/Apri	1.37 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/621073

Citazioni

ND

19

ND

social impact