TY - GEN
T1 - Assessing agreement level between forced alignment models with data from endangered language documentation corpora
AU - DiCanio, Christian T.
AU - Nam, Hosung
AU - Whalen, D. H.
AU - Bunnell, H. Timothy
AU - Amith, Jonathan D.
AU - García, Rey Castillo
PY - 2012
Y1 - 2012
N2 - Automatic forced alignment between transcriptions has achieved high levels of agreement for languages with large corpora, but the technique holds great promise for work on all languages. Here, we apply two forced alignment programs to data from an endangered Mixtecan language of Mexico. Both yielded a majority of boundaries within 20 ms of hand-labeled ones. Phonemes with fairly steady-state elements (e.g. nasals, fricatives) were more accurately labeled than others. Forced alignment thus may increase efficiency of labeling texts from smaller languages, at least in cases where the phoneme inventories are similar to those of the languages of the training.
AB - Automatic forced alignment between transcriptions has achieved high levels of agreement for languages with large corpora, but the technique holds great promise for work on all languages. Here, we apply two forced alignment programs to data from an endangered Mixtecan language of Mexico. Both yielded a majority of boundaries within 20 ms of hand-labeled ones. Phonemes with fairly steady-state elements (e.g. nasals, fricatives) were more accurately labeled than others. Forced alignment thus may increase efficiency of labeling texts from smaller languages, at least in cases where the phoneme inventories are similar to those of the languages of the training.
KW - Linguistics
KW - Phonetics
KW - Speech recognition
UR - http://www.scopus.com/inward/record.url?scp=84878384249&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84878384249&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84878384249
SN - 9781622767595
T3 - 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
SP - 130
EP - 133
BT - 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
T2 - 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
Y2 - 9 September 2012 through 13 September 2012
ER -