Building a Corpus of 2L English for Automatic Assessment: the CLEC Corpus

Zarco Tejada, María Ángeles; Noya Gallardo, María Del Carmen; Merino Ferradá, María del Carmen; Calderón López, María Isabel

doi:10.1016/j.sbspro.2015.07.474

Author/s

Zarco Tejada, María Ángeles; Noya Gallardo, María Del Carmen; Merino Ferradá, María del Carmen; Calderón López, María Isabel

Date

2015-01-01

Department

Filología Francesa e Inglesa

Source

Procedia - Social and Behavioral Sciences 198 (2015) 515-525

Abstract

In this paper we describe the CLEC corpus, an ongoing project at the University of Cádiz that aims to build a large corpus of 2L English, classified according to CEFR proficiency levels and designed to train statistical models for automatic proficiency assessment. The corpus serves two purposes: on the one hand, it will be used as a data resource for the development of automatic text classification systems; on the other, it has been used as a means of teaching innovation techniques.

Subjects

automatic assessment; CEFR proficiency labels; teaching innovation techniques; corpus linguistics; automatic linguistic profile
