Important and new features with analysis for disfluency interruption point (IP) detection in spontaneous Mandarin speech

Publication Type:

Conference Paper


The 4th Workshop on Disfluency in Spontaneous Speech, Aix-en-Provence, France, p.117-121 (2005)





This paper presents a whole set of new features, some duration-related and some pitch-related, to be used in disfluency interruption point (IP) detection for spontaneous Mandarin speech, considering the special linguistic characteristics of Mandarin Chinese. Decision tree is incorporated into the maximum entropy model to perform the IP detection. By examining performance degradation when each specific feature was missing from the whole set, the most important features for IP detection for each disfluency type were analyzed in detail. The experiments were conducted on the Mandarin Conversational Dialogue Corpus (MCDC) developed by the Institute of Linguistics of Academia Sinica in Taiwan.


Université de Provence; September 10-12, 2005