Detecting and correcting automatic speech recognition errors with a new model

Küçük Resim Yok

Tarih

2021

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

Tubitak Scientific & Technological Research Council Turkey

Erişim Hakkı

info:eu-repo/semantics/openAccess

Özet

The purpose of automatic speech recognition (ASR) systems is to recognize speech signals obtained from people and convert them into text so that they can be processed by a computer. Although many ASR applications are versatile and widely used in the real world, they still generate relatively inaccurate results. They tend to generate spelling errors in recognized words, especially in noisy environments, in situations where the vocabulary size is increased, and at times when the input speech is of poor quality. The permanent presence of errors in ASR systems has led to the need to find alternative methods for automatic detection and correction of such errors. In this study, the basic principles of ASR evaluation are first summarized, and then a new approach based on the suggestion of an alternative hypothesis is proposed for the detection and correction of these errors generated by ASR systems. The proposed method involves a series of processes such as identifying incorrect words, selecting the ones that can be corrected, and identifying candidate words to replace these words. As a result of the tests carried out by creating different test environments, significant performance improvements for Turkish were achieved and an average of 4.60 % performance improvement was provided.

Açıklama

Anahtar Kelimeler

Automatic Speech Recognition, Automatic Speech Recognition Error Correction, Artificial Intelligence, Alternative Hypothesis Suggestion, Natural Language Processing

Kaynak

Turkish Journal Of Electrical Engineering And Computer Sciences

WoS Q Değeri

Q4

Scopus Q Değeri

Q3

Cilt

29

Sayı

5

Künye