Detection and assessment of hypernasality in repaired cleft palate speech using vocal tract and residual features

J Acoust Soc Am. 2019 Dec;146(6):4211. doi: 10.1121/1.5134433.

Abstract

The presence of hypernasality in repaired cleft palate (CP) speech is a consequence of velopharyngeal insufficiency. The coupling of the nasal tract with the oral tract adds nasal formant and antiformant pairs in the hypernasal speech spectrum. This addition deviates the spectral and linear prediction (LP) residual characteristics of hypernasal speech compared to normal speech. In this work, the vocal tract constriction feature, peak to side-lobe ratio feature, and spectral moment features augmented by low-order cepstral coefficients are used to capture the spectral and residual deviations for hypernasality detection. The first feature captures the lower-frequencies prominence in speech due to the presence of nasal formants, the second feature captures the undesirable signal components in the residual signal due to the nasal antiformants, and the third feature captures the information about formants and antiformants in the spectrum along with the spectral envelope. The combination of three features gives normal versus hypernasal speech detection accuracies of 87.76%, 91.13%, and 93.70% for /a/, /i/, and /u/ vowels, respectively, and hypernasality severity detection accuracies of 80.13% and 81.25% for /i/ and /u/ vowels, respectively. The speech data are collected from 30 control normal and 30 repaired CP children between the ages of 7 and 12.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Child
  • Cleft Palate / surgery*
  • Female
  • Humans
  • Male
  • Speech / physiology*
  • Speech Acoustics
  • Speech Production Measurement / methods
  • Velopharyngeal Insufficiency / physiopathology
  • Velopharyngeal Insufficiency / surgery*
  • Voice / physiology*