Thomas Hain

Thomas Hain
Head of Group

Thomas Hain

Head of Group

Thomas holds the degree ‘Dipl.-Ing’ from the University of Technology, Vienna, and a PhD from Cambridge University. After work at Philips Speech Processing, Vienna he joined the Cambridge University Engineering Department in 1997, and moved to SpandH in 2004. He was promoted to Professor in 2012 He is leading the Machine Intelligence for Natural Interfaces subgroup. Thomas has published on machine learning and speech recognition topics in more than 100 publications in international conferences, journals and books (h-index 26). Apart from membership of many technical committees, including speech recognition area chair at ICASSP and Interspeech, he served on the IEEE Speech Technical Committee from 2007-2009, and as organizing committee member of Interspeech 2009, IEEE ASRU 2011 and 2013. He is currently member of the editorial board of Computer Speech and Language (CSL) and Associate Editor of the ACM Transactions on Speech and Language Processing. His recent research has its focus on recognition of natural speech in realistic environments, and on integration of speech technology with downstream processes such as content linking, summarisation or machine translation.

Contact

Contact Information

Office G041, Regent Court, 211 Portobello, Sheffield S1 4DP, UK

+44 114 222 1836

t.hain@sheffield.ac.uk

Publications

127 entries « 3 of 3 »

2014

Inigo Casanueva; Heidi Christensen; Thomas Hain; Phil D. Green: Adaptive speech recognition and dialogue management for users with speech disorders. In: Li, Haizhou; Meng, Helen M.; Ma, Bin; Chng, Engsiong; Xie, Lei (Ed.): INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014, pp. 1033–1037, ISCA, 2014. (Type: Proceedings Article | Links)
Heidi Christensen; Inigo Casanueva; Stuart P. Cunningham; Phil D. Green; Thomas Hain: Automatic selection of speakers for improved acoustic modelling: recognition of disordered speech with sparse data. In: 2014 IEEE Spoken Language Technology Workshop, SLT 2014, South Lake Tahoe, NV, USA, December 7-10, 2014, pp. 254–259, IEEE, 2014. (Type: Proceedings Article | Links)
Erfan Loweimi; Jon Barker; Thomas Hain: Compression of model-based group delay function for robust speech recognition. In: University of Sheffield Engineering Symposium, Sheffield, UK, 2014. (Type: Proceedings Article | )

2013

Charles Fox; Thomas Hain: Lightly supervised learning from a damaged natural speech corpus. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2013, Vancouver, BC, Canada, May 26-31, 2013, pp. 8086–8090, IEEE, 2013. (Type: Proceedings Article | Links)
Raymond W. M. Ng; Thomas Hain; Trevor Cohn: Adaptation of lecture speech recognition system with machine translation output. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2013, Vancouver, BC, Canada, May 26-31, 2013, pp. 8401–8405, IEEE, 2013. (Type: Proceedings Article | Links)
Pierre Lanchantin; Peter Bell; Mark J. F. Gales; Thomas Hain; Xunying Liu; Yanhua Long; Jennifer Quinnell; Steve Renals; Oscar Saz; Matthew Stephen Seigel; Pawel Swietojanski; Philip C. Woodland: Automatic Transcription of Multi-genre Media Archives. In: Gravier, Guillaume; Béchet, Frédéric (Ed.): Proceedings of the First Workshop on Speech, Language and Audio in Multimedia, Marseille, France, August 22-23, 2013, pp. 26–31, CEUR-WS.org, 2013. (Type: Proceedings Article | Links)
Charles Fox; Yulan Liu; Erich Zwyssig; Thomas Hain: The sheffield wargames corpus. In: Bimbot, Frédéric; Cerisara, Christophe; Fougeron, Cécile; Gravier, Guillaume; Lamel, Lori; Pellegrino, François; Perrier, Pascal (Ed.): INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013, pp. 1116–1120, ISCA, 2013. (Type: Proceedings Article | Links)
Heidi Christensen; Phil D. Green; Thomas Hain: Learning speaker-specific pronunciations of disordered speech. In: Bimbot, Frédéric; Cerisara, Christophe; Fougeron, Cécile; Gravier, Guillaume; Lamel, Lori; Pellegrino, François; Perrier, Pascal (Ed.): INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013, pp. 1159–1163, ISCA, 2013. (Type: Proceedings Article | Links)
Oscar Saz; Thomas Hain: Asynchronous factorisation of speaker and background with feature transforms in speech recognition. In: Bimbot, Frédéric; Cerisara, Christophe; Fougeron, Cécile; Gravier, Guillaume; Lamel, Lori; Pellegrino, François; Perrier, Pascal (Ed.): INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013, pp. 1238–1242, ISCA, 2013. (Type: Proceedings Article | Links)
Heidi Christensen; M. B. Aniol; Peter Bell; Phil D. Green; Thomas Hain; Simon King; Pawel Swietojanski: Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech. In: Bimbot, Frédéric; Cerisara, Christophe; Fougeron, Cécile; Gravier, Guillaume; Lamel, Lori; Pellegrino, François; Perrier, Pascal (Ed.): INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013, pp. 3642–3645, ISCA, 2013. (Type: Proceedings Article | Links)
Charles Fox; Yulan Liu; Erich Zwyssig; Thomas Hain: The Sheffield Wargames Corpus. In: Proc. Interspeech 2013, ISCA, 2013. (Type: Book Chapter | )
Heidi Christensen; Iñigo Casanueva; Stuart P. Cunningham; Phil D. Green; Thomas Hain: homeService: Voice-enabled assistive technology in the home using cloud-based automatic speech recognition. In: Alexandersson, Jan; Ljunglöf, Peter; McCoy, Kathleen F.; Portet, François; Roark, Brian; Rudzicz, Frank; Vacher, Michel (Ed.): Proceedings of the Fourth Workshop on Speech and Language Processing for Assistive Technologies, SLPAT 2013, Grenoble, France, August 21-22, 2013, pp. 29–34, Association for Computational Linguistics, 2013. (Type: Proceedings Article | Links)

2012

Heidi Christensen; Stuart P. Cunningham; Charles Fox; Phil D. Green; Thomas Hain: A comparative study of adaptive, automatic recognition of disordered speech. Proc Interspeech 2012, Portland, Oregon, US, 2012. (Type: Conference | Links)
Thomas Hain; Lukás Burget; John Dines; Philip N. Garner; Frantisek Grézl; Asmaa El Hannani; Marijn Huijbregts; Martin Karafiát; Mike Lincoln; Vincent Wan: Transcribing Meetings With the AMIDA Systems. In: IEEE Trans. Speech Audio Process., vol. 20, no. 2, pp. 486–498, 2012. (Type: Journal Article | Links)
Matthew Gibson; Thomas Hain: Correctness-Adjusted Unsupervised Discriminative Acoustic Model Adaptation. In: IEEE Trans. Speech Audio Process., vol. 20, no. 10, pp. 2648–2656, 2012. (Type: Journal Article | Links)
Sarah Al-Shareef; Thomas Hain: CRF-based Diacritisation of Colloquial Arabic for Automatic Speech Recognition. In: INTERSPEECH, 2012. (Type: Proceedings Article | )
Sarah Al-Shareef; Thomas Hain: Conditional Random Fields Based Diacritisation of Colloquial Arabic. In: Saudi International Conference, 2012. (Type: Proceedings Article | )
Charles Fox; Heidi Christensen; Thomas Hain: Studio report: Linux audio for multi-speaker natural speech technology.. Proc. Linux Audio Conference, 2012. (Type: Conference | Links)

2011

Sarah Al-Shareef; Thomas Hain: An Investigation in Speech Recognition for Colloquial Arabic. In: INTERSPEECH, 2011. (Type: Proceedings Article | )
Davide Marino; Thomas Hain: An Analysis of Automatic Speech Recognition with Multiple Microphones. In: INTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association, Florence, Italy, August 27-31, 2011, pp. 1281–1284, ISCA, 2011. (Type: Proceedings Article | Links)
Stuart N. Wrigley; Thomas Hain: Web-Based Automatic Speech Recognition Service - webASR. In: INTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association, Florence, Italy, August 27-31, 2011, pp. 3265–3268, ISCA, 2011. (Type: Proceedings Article | Links)
Stuart N. Wrigley; Thomas Hain: Making an Automatic Speech Recognition Service Freely Available on the Web. In: INTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association, Florence, Italy, August 27-31, 2011, pp. 3325–3326, ISCA, 2011. (Type: Proceedings Article | Links)
Roger C. F. Tucker; Dan Fry; Vincent Wan; Stuart N. Wrigley; Thomas Hain: Extending Audio Notetaker to Browse WebASR Transcriptions. In: INTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association, Florence, Italy, August 27-31, 2011, pp. 3329–3330, ISCA, 2011. (Type: Proceedings Article | Links)

2008

Thomas Hain; Asmaa El Hannani; Stuart N. Wrigley; Vincent Wan: Automatic speech recognition for scientific purposes - webASR. In: INTERSPEECH 2008, 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008, pp. 504–507, ISCA, 2008. (Type: Proceedings Article | Links)

2006

Jean Carletta; Simone Ashby; Sebastien Bourban; Mike Flynn; Mael Guillemot; Thomas Hain; Jaroslav Kadlec; Vasilis Karaiskos; Wessel Kraaij; Melissa Kronenthal; Guillaume Lathoud; Mike Lincoln; Agnes Lisowka; Iain McCowan; Wilfried Post; Denns Reidsma; Pierre Wellner: The AMI meeting corpus: A pre-announcement. In: Machine learning for multimodal interaction, pp. 28–39, Springer, 2006. (Type: Book Section | )
Steve Young; Gunnar Evermann; Mark J. F. Gales; Thomas Hain; Dan Kershaw; XA Liu; Gareth Moore; Julian Odell; Dave Ollason; Dan Povey; Valtcho Valtchev; Philip C. Woodland: The HTK book (Version 3.3, Version 3.4). In: 2006. (Type: Journal Article | )

2005

Iain McCowan; Jean Carletta; Wessel Kraaij; Simone Ashby; Sebastien Bourban; Mike Flynn; Maël Guillemot; Thomas Hain; Jaroslav Kadlec; Vasilis Karaiskos; others: The AMI meeting corpus. In: Proceedings of the 5th International Conference on Methods and Techniques in Behavioral Research, vol. 88, 2005. (Type: Journal Article | )
127 entries « 3 of 3 »

Back to Top