William Hartmann


Publications

Below is a list of my publications. For a full list, see my Google Scholar profile.

2022

“Training Autoregressive Speech Recognition Models with Limited in-domain Supervision”, Chak-Fai Li, Francis Keith, William Hartmann, and Matthew Snover, arXiv preprint arXiv:2210.15135, 2022. [arxiv] [bib] [post]
“Combining Unsupervised and Text Augmented Semi-Supervised Learning for Low Resourced Autoregressive Speech Recognition”, Chak-Fai Li, Francis Keith, William Hartmann, and Matthew Snover, in Proceedings of IEEE ICASSP, 2022. [publication] [arxiv] [bib] [post]

2021

“Overcoming Domain Mismatch in Low Resource Sequence-to-Sequence ASR Models using Hybrid Generated Pseudotranscripts”, Chak-Fai Li, Francis Keith, William Hartmann, Matthew Snover, and Owen Kimball, arXiv preprint arXiv:2106.07716, 2021. [arxiv] [bib] [post]
“Using Heterogeneity in Semi-Supervised Transcription Hypotheses to Improve Code-Switched Speech Recognition”, Andrew Slottje, Shannon Wotherspoon, William Hartmann, Matthew Snover, and Owen Kimball, arXiv preprint arXiv:2106.07699, 2021. [arxiv] [bib] [post]
“Improved Data Selection for Domain Adaptation in ASR”, Shannon Wotherspoon, William Hartmann, Matthew Snover, and Owen Kimball, in Proceedings of IEEE ICASSP, pp. 7018-7022, 2021. [publication] [bib]

2020

“Godec: An Open-Source Data Processing Framework for Deploying ML Data Flows in Edge-Computing Environments”, Ralf Meermeier, Le Zhang, Francis Keith, William Hartmann, Stavros Tsakalidis, and Andrew Tabarez, International Conference on Edge Computing, pp. 84-93, 2020. [publication]
“Towards a New Understanding of the Training of Neural Networks with Mislabeled Training Data”, Herbert Gish, Jan Silovsky, Man-Ling Sung, Man-Hung Siu, William Hartmann, Zhuolin Jiang, in Proceedings of IEEE ICASSP, pp. 8394-8398, 2020. [publication] [arxiv]
“Reformulating Information Retrieval from Speech and Text as a Detection Problem”, Damianos Karakos, Rabih Zbib, William Hartmann, Richard Schwartz, John Makhoul, in Proceedings of the Workshop on Cross-Language Search and Summarization of Text and Speech, pp. 38-43, 2020. [publication]
“The 2019 BBN Cross-Lingual Information Retrieval System”, Le Zhang, Damianos Karakos, William Hartmann, Manaj Srivastava, Lee Tarlin, David Akodes, Sanjay Krishna Gouda, Numra Bathool, Lingjun Zhao, Zhuolin Jiang, Richard Schwartz, John Makhoul, in Proceedings of the Workshop on Cross-Language Search and Summarization of Text and Speech, pp. 44-51, 2020. [publication]
“Cross-Lingual Information Retrieval with BERT”, Zhuolin Jiang, Amro El-Jaroudi, William Hartmann, Damianos Karakos, Lingjun Zhao, arXiv preprint arXiv:2004.13005, 2020. [arxiv]

2019

“Neural-Network Lexical Translation for Cross-Lingual IR from Text and Speech”, Rabih Zbib, Lingjun Zhao, Damianos Karakos, William Hartmann, Jay DeYoung, Zhongqiang Huang, Zhuolin Jiang, Noah Rivkin, Le Zhang, Richard Schwartz, John Makhoul, in Proceedings of ACM SIGIR, pp. 645-654, 2019. [publication]
“Learning from the Best: A Teacher-Student Multilingual Framework for Low-Resource Languages”, Deblin Bagchi, William Hartmann, in Proceedings of IEEE ICASSP, pp. 6051-6055, 2019. [publication]

2018

“Optimizing Multilingual Knowledge Transfer for Time-Delay Neural Networks with Low-Rank Factorization”, Francis Keith, William Hartmann, Man-Hung Siu, Jeff Z. Ma, Owen Kimball, in Proceedings of IEEE ICASSP, pp. 4924-4928, 2018. [publication]
“Individual Ship Detection Using Underwater Acoustics”, Damianos Karakos, Jan Silovsky, Richard Schwartz, William Hartmann, John Makhoul, in Proceedings of IEEE ICASSP, pp. 2121-2125, 2018. [publication]

2017

“Analysis of keyword spotting performance across IARPA babel languages”, William Hartmann, Damianos Karakos, Roger Hsiao, Le Zhang, Tanel Alumäe, Stavros Tsakalidis, Richard Schwartz, in Proceedings of IEEE ICASSP, pp. 5765-5769, 2017. [publication] [poster]
“The 2016 BBN Georgian telephone speech keyword spotting system”, Tanel Alumäe, Damianos Karakos, William Hartmann, Roger Hsiao, Le Zhang, Long Nguyen, Stavros Tsakalidis, Richard Schwartz, in Proceedings of IEEE ICASSP, pp. 5755-5759, 2017. [publication]
“Improved Single System Conversational Telephone Speech Recognition with VGG Bottleneck Features”, William Hartmann, Roger Hsiao, Tim Ng, Jeff Z. Ma, Francis Keith, Man-Hung Siu, in Proceedings of Interspeech, pp. 112-116, 2017. [publication]
“Applying speech technology to the ship-type classification problem”, Damianos Karakos, William Hartmann, Richard Schwartz, John Makhoul, Stavros Tsakalidis, Edin Insanic, George Shepard, in Proceedings of IEEE OCEANS, pp. 1-8, 2017. [publication]
“Alternative networks for monolingual bottleneck features”, William Hartmann, Roger Hsiao, Stavros Tsakalidis, in Proceedings of IEEE ICASSP, pp. 5290-5294, 2017. [publication]

2016

“Sage: The New BBN Speech Processing Platform”, Roger Hsiao, Ralf Meermeier, Tim Ng, Zhongqiang Huang, Maxwell Jordan, Enoch Kan, Tanel Alumäe, Jan Silovsky, William Hartmann, Francis Keith, Omer Lang, Manhung Siu, Owen Kimball, in Proceedings of Interspeech, pp. 3022-3026, 2016. [publication]
“Two-Stage Data Augmentation for Low-Resourced Speech Recognition”, William Hartmann, Tim Ng, Roger Hsiao, Stavros Tsakalidis, Richard Schwartz, in Proceedings of Interspeech, pp. 2378-2382, 2016. [publication] [slides]
“Comparison of Multiple System Combination Techniques for Keyword Spotting”, William Hartmann, Le Zhang, Kerri Barnes, Roger Hsiao, Stavros Tsakalidis, Richard Schwartz, in Proceedings of Interspeech, pp. 1913-1917, 2016. [publication] [poster]

2015

“Enhancing low resource keyword spotting with automatically retrieved web documents”, Le Zhang, Damianos Karakos, William Hartmann, Roger Hsiao, Richard Schwartz, Stavros Tsakalidis, in Proceedings of Interspeech, pp. 839-843, 2015. [publication]
“Exploring minimal pronunciation modeling for low resource languages”, Marelie Davel, Etienne Barnard, Charl van Heerden, William Hartmann, Damianos Karakos, Richard Schwartz, Stavros Tsakalidis, in Proceedings of Interspeech, pp. 538-542, 2015. [publication]
“Robust speech recognition in unknown reverberant and noisy conditions”, Roger Hsiao, Jeff Ma, William Hartmann, Martin Karafiát, František Grézl, Lukáš Burget, Igor Szöke, Jan Honza Černocký, Shinji Watanabe, Zhuo Chen, Sri Harish Mallidi, Hynek Hermanský, Stavros Tsakalidis, Richard Schwartz, in Proceedings of IEEE ASRU, pp. 533-538, 2015. [publication]
“Lexical speaker identification in TV shows”, Anindya Roy, Hervé Bredin, William Hartmann, Viet Bac Le, Claude Barras, Jean-Luc Gauvain, Multimedia Tools and Applications (74) 4, pp. 1377-1396, 2015. [publication]

2014

“Developing STT and KWS systems using limited language resources”, Viet-Bac Le, Lori Lamel, Abdel Messaoudi, William Hartmann, Jean-Luc Gauvain, Cécile Woehrling, Julien Despres, Anindya Roy, in Proceedings of Interspeech, pp. 2484-2488, 2014. [publication]
“Comparing decoding strategies for subword-based keyword spotting in low-resourced languages”, William Hartmann, Viet-Bac Le, Abdel Messaoudi, Lori Lamel, Jean-Luc Gauvain, in Proceedings of Interspeech, pp. 2764-2768, 2014. [publication] [post]
“Cross-word sub-word units for low-resource keyword spotting”, William Hartmann, Lori Lamel, Jean-Luc Gauvain, in Proceedings of SLTU, pp. 112-117, 2014. [publication] [post]
“Efficient rule scoring for improved grapheme-based lexicons”, William Hartmann, Lori Lamel, Jean-Luc Gauvain, in Proceedings of IEEE EUSIPCO, pp. 1477-1481, 2014. [publication] [poster]
“Unsupervised acoustic model training for the Korean language”, Antoine Laurent, William Hartmann, Lori Lamel, in Proceedings of IEEE ISCSLP, pp. 469-473, 2014. [publication]

2013

“A Direct Masking Approach to Robust ASR”, William Hartmann, Arun Narayanan, Eric Fosler-Lussier, DeLiang Wang, IEEE Transactions on Audio, Speech, and Language Processing (21) 10, pp. 1993-2005, 2013. [publication] [preprint] [post]
“Acoustic unit discovery and pronunciation generation from a grapheme-based lexicon”, William Hartmann, Anindya Roy, Lori Lamel, Jean-Luc Gauvain, in Proceedings of IEEE ASRU, pp. 380-385, 2013. [publication] [poster] [post]

2012

“Improved model selection for the ASR-driven binary mask”, William Hartmann, Eric Fosler-Lussier, in Proceedings of Interspeech, pp. 1203-1206, 2012. [publication] [poster]
“ASR-Driven Binary Mask Estimation using Spectral Priors”, William Hartmann, Eric Fosler-Lussier, in Proceedings of IEEE ICASSP, pp. 4685-4688, 2012. [publication] [poster] [post]

2011

“Investigations into the incorporation of the ideal binary mask in ASR”, William Hartmann, Eric Fosler-Lussier, in Proceedings of IEEE ICASSP, pp. 4804-4807, 2011. [publication]

2010

“Investigations into the Crandem approach to word recognition”, Rohit Prabhavalkar, Preethi Jyothi, William Hartmann, Jeremy Morris, Eric Fosler-Lussier, in Proceedings of NAACL-HLT, pp. 725-728, 2010. [publication]

2009

“Investigating phonetic information reduction and lexical confusability”, William Hartmann, Eric Fosler-Lussier, in Proceedings of Interspeech, pp. 1659-1662, 2009. [publication]

Comments? Send me an email.