Investigating model performance in language identification: beyond simple error statistics

Suzy J. Styles; Victoria Y.H. Chua; Fei Ting Woon; Hexin Liu; Leibny Paola Garcia Perera; Sanjeev Khudanpur; Andy W.H. Khong; Justin Dauwels

doi:10.21437/Interspeech.2023-1707

Research output: Contribution to journal › Conference article › peer-review

1 Citation (Scopus)

Abstract

Language development experts need tools that can automatically identify languages from fluent, conversational speech and provide reliable estimates of usage rates at the level of an individual recording. However, LID systems are typically evaluated on metrics such as equal error rate and balanced accuracy, applied at the level of an entire speech corpus. These overview metrics do not provide information about model performance at the level of individual speakers, recordings, or units of speech with different linguistic characteristics. Overview statistics may mask systematic errors in model performance for some subsets of the data, and consequently, have worse performance on data derived from some subsets of human speakers, creating a kind of algorithmic bias. Here, we investigate how well a number of LID systems perform on individual recordings and speech units with different linguistic properties in the MERLIon CCS Challenge featuring accented code-switched child-directed speech.
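The contrast the abstract draws between corpus-level and recording-level evaluation can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the segment data is invented toy data, the labels `lang1` and `lang2` and the recording names are hypothetical, and the `balanced_accuracy` helper is a plain mean of per-class recall, not the evaluation code used in the paper or the challenge.

```python
from collections import defaultdict


def balanced_accuracy(pairs):
    """Mean of per-class recall over (true_label, predicted_label) pairs."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for true, pred in pairs:
        total[true] += 1
        if true == pred:
            correct[true] += 1
    return sum(correct[c] / total[c] for c in total) / len(total)


# Hypothetical toy data: (recording_id, true_language, predicted_language).
segments = [
    ("rec_A", "lang1", "lang1"), ("rec_A", "lang1", "lang1"), ("rec_A", "lang1", "lang1"),
    ("rec_A", "lang2", "lang2"), ("rec_A", "lang2", "lang2"), ("rec_A", "lang2", "lang2"),
    ("rec_B", "lang1", "lang1"), ("rec_B", "lang1", "lang1"),
    ("rec_B", "lang2", "lang1"), ("rec_B", "lang2", "lang1"),
]

# Corpus-level score: pool every segment, ignoring which recording it came from.
corpus_score = balanced_accuracy([(t, p) for _, t, p in segments])

# Recording-level scores: group segments by recording, then score each group.
by_recording = defaultdict(list)
for rec, true, pred in segments:
    by_recording[rec].append((true, pred))
per_recording = {rec: balanced_accuracy(pairs) for rec, pairs in by_recording.items()}

print(f"corpus: {corpus_score:.2f}")
for rec, score in sorted(per_recording.items()):
    print(f"{rec}: {score:.2f}")
```

In this toy case the pooled score looks acceptable, while one recording sits at chance level because every `lang2` segment in it is mislabeled. That is the kind of subset-level failure an overview statistic can hide.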

Original language	English
Pages (from-to)	4129-4133
Number of pages	5
Journal	Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Volume	2023-August
DOIs	https://doi.org/10.21437/Interspeech.2023-1707
Publication status	Published - 2023
Externally published	Yes
Event	24th International Speech Communication Association, Interspeech 2023 - Dublin, Ireland
Duration	Aug 20 2023 → Aug 24 2023

Bibliographical note

Publisher Copyright:
© 2023 International Speech Communication Association. All rights reserved.

ASJC Scopus Subject Areas

Language and Linguistics
Human-Computer Interaction
Signal Processing
Software
Modelling and Simulation

Keywords

child-directed speech
code-switching
language diarization
language identification

Access to Document

10.21437/Interspeech.2023-1707

Cite this

Styles, S. J., Chua, V. Y. H., Woon, F. T., Liu, H., Perera, L. P. G., Khudanpur, S., Khong, A. W. H., & Dauwels, J. (2023). Investigating model performance in language identification: beyond simple error statistics. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2023-August, 4129-4133. https://doi.org/10.21437/Interspeech.2023-1707

@article{a9f35c2056974040bdfa3f7c2143688d,

title = "Investigating model performance in language identification: beyond simple error statistics",

abstract = "Language development experts need tools that can automatically identify languages from fluent, conversational speech and provide reliable estimates of usage rates at the level of an individual recording. However, LID systems are typically evaluated on metrics such as equal error rate and balanced accuracy, applied at the level of an entire speech corpus. These overview metrics do not provide information about model performance at the level of individual speakers, recordings, or units of speech with different linguistic characteristics. Overview statistics may mask systematic errors in model performance for some subsets of the data, and consequently, have worse performance on data derived from some subsets of human speakers, creating a kind of algorithmic bias. Here, we investigate how well a number of LID systems perform on individual recordings and speech units with different linguistic properties in the MERLIon CCS Challenge featuring accented code-switched child-directed speech.",

keywords = "child-directed speech, code-switching, language diarization, language identification",

author = "Styles, {Suzy J.} and Chua, {Victoria Y.H.} and Woon, {Fei Ting} and Hexin Liu and Perera, {Leibny Paola Garcia} and Sanjeev Khudanpur and Khong, {Andy W.H.} and Justin Dauwels",

note = "Publisher Copyright: {\textcopyright} 2023 International Speech Communication Association. All rights reserved.; 24th International Speech Communication Association, Interspeech 2023 ; Conference date: 20-08-2023 Through 24-08-2023",

year = "2023",

doi = "10.21437/Interspeech.2023-1707",

language = "English",

volume = "2023-August",

pages = "4129--4133",

journal = "Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH",

issn = "2308-457X",

}

Styles, SJ, Chua, VYH, Woon, FT, Liu, H, Perera, LPG, Khudanpur, S, Khong, AWH & Dauwels, J 2023, 'Investigating model performance in language identification: beyond simple error statistics', Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, vol. 2023-August, pp. 4129-4133. https://doi.org/10.21437/Interspeech.2023-1707

Investigating model performance in language identification: beyond simple error statistics. / Styles, Suzy J.; Chua, Victoria Y.H.; Woon, Fei Ting et al.
In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 2023-August, 2023, p. 4129-4133.

TY - JOUR

T1 - Investigating model performance in language identification: beyond simple error statistics

T2 - 24th International Speech Communication Association, Interspeech 2023

AU - Styles, Suzy J.

AU - Chua, Victoria Y.H.

AU - Woon, Fei Ting

AU - Liu, Hexin

AU - Perera, Leibny Paola Garcia

AU - Khudanpur, Sanjeev

AU - Khong, Andy W.H.

AU - Dauwels, Justin

PY - 2023

Y1 - 2023

N2 - Language development experts need tools that can automatically identify languages from fluent, conversational speech and provide reliable estimates of usage rates at the level of an individual recording. However, LID systems are typically evaluated on metrics such as equal error rate and balanced accuracy, applied at the level of an entire speech corpus. These overview metrics do not provide information about model performance at the level of individual speakers, recordings, or units of speech with different linguistic characteristics. Overview statistics may mask systematic errors in model performance for some subsets of the data, and consequently, have worse performance on data derived from some subsets of human speakers, creating a kind of algorithmic bias. Here, we investigate how well a number of LID systems perform on individual recordings and speech units with different linguistic properties in the MERLIon CCS Challenge featuring accented code-switched child-directed speech.

AB - Language development experts need tools that can automatically identify languages from fluent, conversational speech and provide reliable estimates of usage rates at the level of an individual recording. However, LID systems are typically evaluated on metrics such as equal error rate and balanced accuracy, applied at the level of an entire speech corpus. These overview metrics do not provide information about model performance at the level of individual speakers, recordings, or units of speech with different linguistic characteristics. Overview statistics may mask systematic errors in model performance for some subsets of the data, and consequently, have worse performance on data derived from some subsets of human speakers, creating a kind of algorithmic bias. Here, we investigate how well a number of LID systems perform on individual recordings and speech units with different linguistic properties in the MERLIon CCS Challenge featuring accented code-switched child-directed speech.

KW - child-directed speech

KW - code-switching

KW - language diarization

KW - language identification

UR - http://www.scopus.com/inward/record.url?scp=85162743570&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85162743570&partnerID=8YFLogxK

U2 - 10.21437/Interspeech.2023-1707

DO - 10.21437/Interspeech.2023-1707

M3 - Conference article

AN - SCOPUS:85162743570

SN - 2308-457X

VL - 2023-August

SP - 4129

EP - 4133

JO - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

JF - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Y2 - 20 August 2023 through 24 August 2023

ER -
