This study describes the outcomes of a POS-based method for the automatic extraction of V-N Italian oral academic collocations from an annotated corpus. A frequency statistical measure is applied to automatically extract the collocations from the POS-tagged corpus. The results reveal that frequency alone is not sufficient to measure the degree of association that connects the two elements of a word pair. In order to detect the real-attested Italian collocations, the data has been further evaluated by 50 Italian native speakers. The results indicate that these combinations are tightly linked to their context of usage. Thus, native speakers should be exposed to these phrasal contexts to activate their mechanisms of explicit reflection and assess the degree of collocativity of these combinations.
A corpus-based study of the automatic extraction and validation of V-N Italian oral academic collocations
Diana Peppoloni
2018
Abstract
This study describes the outcomes of a POS-based method for the automatic extraction of V-N Italian oral academic collocations from an annotated corpus. A frequency statistical measure is applied to automatically extract the collocations from the POS-tagged corpus. The results reveal that frequency alone is not sufficient to measure the degree of association that connects the two elements of a word pair. In order to detect the real-attested Italian collocations, the data has been further evaluated by 50 Italian native speakers. The results indicate that these combinations are tightly linked to their context of usage. Thus, native speakers should be exposed to these phrasal contexts to activate their mechanisms of explicit reflection and assess the degree of collocativity of these combinations.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.