The suitability of cloud-based speech recognition engines for language learning
As online automatic speech recognition (ASR) engines become more accurate and more widely integrated into CALL software, it becomes important to evaluate the effectiveness and accuracy of these engines using authentic speech samples. This study investigates two of the most prominent cloud-based speech recognition engines, Apple’s Siri and Google Speech Recognition (GSR), to determine which engine transcribes L2 learners’ speech more accurately. The average recognition accuracy of Siri and GSR is reported using speech samples from Japanese learners speaking English. The study also presents a series of computerized speech assessment tasks that the researchers developed using a cloud-based speech recognition engine in conjunction with Moodle, a widely used course management system.
Daniels, P., & Iwago, K. (2017). The suitability of cloud-based speech recognition engines for language learning. The JALT CALL Journal, 13(3), 211-221. https://doi.org/10.29140/jaltcall.v13n3.220