Feedback utterances for computer-adied language learning using accent reduction and voice conversion method

Sixuan Zhao; Soo Ngee Koh; Soon Ing Yann; Kang Kwong Luke

doi:10.1109/ICASSP.2013.6639265

Feedback utterances for computer-adied language learning using accent reduction and voice conversion method

Sixuan Zhao, Soo Ngee Koh, Soon Ing Yann, Kang Kwong Luke

Nanyang Technological University

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

5 Citations (Scopus)

Abstract

This paper considers the generation of feedback utterances for speaking skills training of non-native English learners. The proposed feedback is in the form of a combination of the learner's voice and the linguistic gestures, i.e., the prosody or pronunciation, of a native speaker. Both accent reduction method and voice conversion method are employed to generate feedback stimuli. For accent reduction, three speech synthesis methods, namely pitch-synchronous overlap and add (PSOLA), harmonic stochastic model (HSM), and speech transformation and representation by adaptive interpolation of weighted spectrogram (STRAIGHT) are used to reduce the accent of the utterances of English learners. For voice conversion, the teacher's voice is converted to that of the learner and the converted speech is used as a feedback. Objective measurements are employed to assess the nativeness and acoustic quality of the generated stimuli. A feedback scheme which combines the accent reduction and voice conversion methods is also proposed.

Original language	English
Title of host publication	2013 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013 - Proceedings
Pages	8208-8212
Number of pages	5
DOIs	https://doi.org/10.1109/ICASSP.2013.6639265
Publication status	Published - Oct 18 2013
Externally published	Yes
Event	2013 38th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013 - Vancouver, BC, Canada Duration: May 26 2013 → May 31 2013

Publication series

Name	ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN (Print)	1520-6149

Conference

Conference	2013 38th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013
Country/Territory	Canada
City	Vancouver, BC
Period	5/26/13 → 5/31/13

ASJC Scopus Subject Areas

Software
Signal Processing
Electrical and Electronic Engineering

Keywords

accent reduction
CALL
feedback utterances
voice conversion

Access to Document

10.1109/ICASSP.2013.6639265

Cite this

Zhao, S., Koh, S. N., Yann, S. I., & Luke, K. K. (2013). Feedback utterances for computer-adied language learning using accent reduction and voice conversion method. In 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013 - Proceedings (pp. 8208-8212). Article 6639265 (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings). https://doi.org/10.1109/ICASSP.2013.6639265

Zhao, Sixuan ; Koh, Soo Ngee ; Yann, Soon Ing et al. / Feedback utterances for computer-adied language learning using accent reduction and voice conversion method. 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013 - Proceedings. 2013. pp. 8208-8212 (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings).

@inproceedings{0749514a9a55432094ed743545afdb93,

title = "Feedback utterances for computer-adied language learning using accent reduction and voice conversion method",

abstract = "This paper considers the generation of feedback utterances for speaking skills training of non-native English learners. The proposed feedback is in the form of a combination of the learner's voice and the linguistic gestures, i.e., the prosody or pronunciation, of a native speaker. Both accent reduction method and voice conversion method are employed to generate feedback stimuli. For accent reduction, three speech synthesis methods, namely pitch-synchronous overlap and add (PSOLA), harmonic stochastic model (HSM), and speech transformation and representation by adaptive interpolation of weighted spectrogram (STRAIGHT) are used to reduce the accent of the utterances of English learners. For voice conversion, the teacher's voice is converted to that of the learner and the converted speech is used as a feedback. Objective measurements are employed to assess the nativeness and acoustic quality of the generated stimuli. A feedback scheme which combines the accent reduction and voice conversion methods is also proposed.",

keywords = "accent reduction, CALL, feedback utterances, voice conversion",

author = "Sixuan Zhao and Koh, {Soo Ngee} and Yann, {Soon Ing} and Luke, {Kang Kwong}",

year = "2013",

month = oct,

day = "18",

doi = "10.1109/ICASSP.2013.6639265",

language = "English",

isbn = "9781479903566",

series = "ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings",

pages = "8208--8212",

booktitle = "2013 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013 - Proceedings",

note = "2013 38th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013 ; Conference date: 26-05-2013 Through 31-05-2013",

}

Zhao, S, Koh, SN, Yann, SI & Luke, KK 2013, Feedback utterances for computer-adied language learning using accent reduction and voice conversion method. in 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013 - Proceedings., 6639265, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 8208-8212, 2013 38th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013, Vancouver, BC, Canada, 5/26/13. https://doi.org/10.1109/ICASSP.2013.6639265

Feedback utterances for computer-adied language learning using accent reduction and voice conversion method. / Zhao, Sixuan; Koh, Soo Ngee; Yann, Soon Ing et al.
2013 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013 - Proceedings. 2013. p. 8208-8212 6639265 (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

TY - GEN

T1 - Feedback utterances for computer-adied language learning using accent reduction and voice conversion method

AU - Zhao, Sixuan

AU - Koh, Soo Ngee

AU - Yann, Soon Ing

AU - Luke, Kang Kwong

PY - 2013/10/18

Y1 - 2013/10/18

N2 - This paper considers the generation of feedback utterances for speaking skills training of non-native English learners. The proposed feedback is in the form of a combination of the learner's voice and the linguistic gestures, i.e., the prosody or pronunciation, of a native speaker. Both accent reduction method and voice conversion method are employed to generate feedback stimuli. For accent reduction, three speech synthesis methods, namely pitch-synchronous overlap and add (PSOLA), harmonic stochastic model (HSM), and speech transformation and representation by adaptive interpolation of weighted spectrogram (STRAIGHT) are used to reduce the accent of the utterances of English learners. For voice conversion, the teacher's voice is converted to that of the learner and the converted speech is used as a feedback. Objective measurements are employed to assess the nativeness and acoustic quality of the generated stimuli. A feedback scheme which combines the accent reduction and voice conversion methods is also proposed.

AB - This paper considers the generation of feedback utterances for speaking skills training of non-native English learners. The proposed feedback is in the form of a combination of the learner's voice and the linguistic gestures, i.e., the prosody or pronunciation, of a native speaker. Both accent reduction method and voice conversion method are employed to generate feedback stimuli. For accent reduction, three speech synthesis methods, namely pitch-synchronous overlap and add (PSOLA), harmonic stochastic model (HSM), and speech transformation and representation by adaptive interpolation of weighted spectrogram (STRAIGHT) are used to reduce the accent of the utterances of English learners. For voice conversion, the teacher's voice is converted to that of the learner and the converted speech is used as a feedback. Objective measurements are employed to assess the nativeness and acoustic quality of the generated stimuli. A feedback scheme which combines the accent reduction and voice conversion methods is also proposed.

KW - accent reduction

KW - CALL

KW - feedback utterances

KW - voice conversion

UR - http://www.scopus.com/inward/record.url?scp=84890498048&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84890498048&partnerID=8YFLogxK

U2 - 10.1109/ICASSP.2013.6639265

DO - 10.1109/ICASSP.2013.6639265

M3 - Conference contribution

AN - SCOPUS:84890498048

SN - 9781479903566

T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

SP - 8208

EP - 8212

BT - 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013 - Proceedings

T2 - 2013 38th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013

Y2 - 26 May 2013 through 31 May 2013

ER -

Zhao S, Koh SN, Yann SI, Luke KK. Feedback utterances for computer-adied language learning using accent reduction and voice conversion method. In 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013 - Proceedings. 2013. p. 8208-8212. 6639265. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings). doi: 10.1109/ICASSP.2013.6639265

Feedback utterances for computer-adied language learning using accent reduction and voice conversion method

Abstract

Publication series

Conference

ASJC Scopus Subject Areas

Keywords

Access to Document

Other files and links

Fingerprint

Cite this