home..

Acoustic Unit Discovery

While most aspects of ASR are heavily data-driven, the lexicon and acoustic units are typically derived by experts. For low-resource languages these expert-defined lexicons may not be available. We propose methods for both discovering acoustic units and learning pronunciations using the discovered units. Our initial work relies on the use of an initial grapheme-based lexicon. Acoustic models representing the graphemes are clustered to generate a new set of acoustic units. Pronunciations are generated using a statistical machine translation based approach.

Comments? Send me an email.
© 2023 William Hartmann   •  Theme  Moonwalk