Type Paper
License CC BY 4.0

A corpus-based system of error detection and revision suggestion for Spanish learners in Taiwan: A case study

Abstract

Compared with English learners, Spanish learners have fewer resources for automatic error detection and revision and following the current integrative Computer Assisted Language Learning (CALL), we combined corpus-based approach and CALL to create the System of Error Detection and Revision Suggestion (SEDRS) for learning Spanish. Through corpus-based data training and related applications, this system was designed specially for learners of Spanish in Taiwan. The Corpus of Taiwanese Learners of Spanish (CTLS) was used as a database to facilitate the development of the system. The learners’ corpus was tagged with part-of-speech (POS) and lemma information, and it was also annotated by native Spanish speakers with revisions corresponding to errors made by students in their original texts. The system can, in real-time, identify tri-gram errors based on training data extracted from the revised texts of the learners’ corpus and provide revision suggestions listed according to usage frequency for users. The system was evaluated by 25 Spanish learners and eight experienced programmers to quantify the system’s practical effectiveness. In addition, feedback from learners’ was collected to improve the system in the future.

Citation

Lu, H., Chu, Y., & Chang, C. (2013). A corpus-based system of error detection and revision suggestion for Spanish learners in Taiwan: A case study. The JALT CALL Journal, 9(2), 115-130. https://doi.org/10.29140/jaltcall.v9n2.151

DOI

https://doi.org/10.29140/jaltcall.v9n2.151