IRIS - Res&Arch Institutional Research Information System - Research & Archive

Classification of Text Writing Proficiency of L2 Learners

Biondi G. (Member of the Collaboration Group); Franzoni V. (Supervision); Milani A. (Project Administration); Santucci V. (Member of the Collaboration Group)

2023

Abstract

In this study, we present a novel system for the automatic classification of text complexity in the Italian language, focusing on the phraseological dimension. This quantitative assessment of text complexity is crucial for various applications, including text readability measurement, text simplification, and support for educators during evaluation processes. We use a dataset comprising texts written by Italian L2 learners and classified according to the levels of the Common European Framework of Reference for Languages. The dataset texts serve as a basis for calculating phraseological features, which are then used as input for multiple machine-learning classifiers to compare their performance in predicting proficiency levels. Our experimental results demonstrate that the proposed framework effectively harnesses phraseological complexity features to achieve high classification accuracy in determining proficiency levels.

Year: 2023
Series: LECTURE NOTES IN COMPUTER SCIENCE
ISBN: 978-3-031-37104-2; 978-3-031-37105-9
Type: 4.1 Contribution in conference proceedings

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11391/1561956
