Thomas Hain

Head of Group

Thomas Hain

Head of Group

Thomas holds the degree ‘Dipl.-Ing’ from the University of Technology, Vienna, and a PhD from Cambridge University. After work at Philips Speech Processing, Vienna he joined the Cambridge University Engineering Department in 1997, and moved to SpandH in 2004. He was promoted to Professor in 2012 He is leading the Machine Intelligence for Natural Interfaces subgroup. Thomas has published on machine learning and speech recognition topics in more than 100 publications in international conferences, journals and books (h-index 26). Apart from membership of many technical committees, including speech recognition area chair at ICASSP and Interspeech, he served on the IEEE Speech Technical Committee from 2007-2009, and as organizing committee member of Interspeech 2009, IEEE ASRU 2011 and 2013. He is currently member of the editorial board of Computer Speech and Language (CSL) and Associate Editor of the ACM Transactions on Speech and Language Processing. His recent research has its focus on recognition of natural speech in realistic environments, and on integration of speech technology with downstream processes such as content linking, summarisation or machine translation.

Contact

Contact Information

Where to find Us?

Office G041, Regent Court, 211 Portobello, Sheffield S1 4DP, UK

+44 114 222 1836

t.hain@sheffield.ac.uk

Publications

145 entries « ‹ 3 of 3 › »

2015

Madina Hasan; Rama Doddipatla; Thomas Hain: Noise-matched training of CRF based sentence end detection models. In: INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015, pp. 349–353, ISCA, 2015. (Type: Proceedings Article | Links)

Erfan Loweimi; Jon Barker; Thomas Hain: Source-filter separation of speech signal in the phase domain. In: INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015, pp. 598–602, ISCA, 2015. (Type: Proceedings Article | Links)

Raymond W. M. Ng; Kashif Shah; Lucia Specia; Thomas Hain: A study on the stability and effectiveness of features in quality estimation for spoken language translation. In: INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015, pp. 2257–2261, ISCA, 2015. (Type: Proceedings Article | Links)

Mortaza Doulaty; Oscar Saz; Thomas Hain: Data-selective transfer learning for multi-domain speech recognition. In: INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015, pp. 2897–2901, ISCA, 2015. (Type: Proceedings Article | Links)

Mortaza Doulaty; Oscar Saz; Thomas Hain: Unsupervised domain discovery using latent dirichlet allocation for acoustic modelling in speech recognition. In: INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015, pp. 3640–3644, ISCA, 2015. (Type: Proceedings Article | Links)

Iñigo Casanueva; Thomas Hain; Heidi Christensen; Ricard Marxer; Phil D. Green: Knowledge transfer between speakers for personalised dialogue management. In: Proceedings of the SIGDIAL 2015 Conference, The 16th Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2-4 September 2015, Prague, Czech Republic, pp. 12–21, The Association for Computer Linguistics, 2015. (Type: Proceedings Article | Links)

Erfan Loweimi; Mortaza Doulaty; Jon Barker; Thomas Hain: Long-Term Statistical Feature Extraction from Speech Signal and Its Application in Emotion Recognition. In: Dediu, Adrian-Horia; Martín-Vide, Carlos; Vicsi, Klára (Ed.): Statistical Language and Speech Processing - Third International Conference, SLSP 2015, Budapest, Hungary, November 24-26, 2015, Proceedings, pp. 173–184, Springer, 2015. (Type: Proceedings Article | Links)

Ghada AlHarbi; Raymond W. M. Ng; Thomas Hain: Annotating meta-discourse in academic lectures from different disciplines. In: Steidl, Stefan; Batliner, Anton; Jokisch, Oliver (Ed.): ISCA International Workshop on Speech and Language Technology in Education, SLaTE 2015, Leipzig, Germany, September 4-5, 2015, pp. 161–166, ISCA, 2015. (Type: Proceedings Article | Links)

2014

Pengyuan Zhang; Yulan Liu; Thomas Hain: Semi-Supervised DNN Training in Meeting Recognition. In: 2014 IEEE Spoken Language Technology Workshop (SLT 2014), South Lake Tahoe, USA, 2014. (Type: Proceedings Article | Links)

Yulan Liu; Pengyuan Zhang; Thomas Hain: Using neural network front-ends on far field multiple microphones based speech recognition. In: Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on, pp. 5542-5546, 2014. (Type: Proceedings Article | Links)

Yulan Liu; Pengyuan Zhang; Thomas Hain: Using neural network front-ends on far field multiple microphones based speech recognition. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2014, Florence, Italy, May 4-9, 2014, pp. 5542–5546, IEEE, 2014. (Type: Proceedings Article | Links)

Oscar Saz; Thomas Hain: Using contextual information in joint factor eigenspace MLLR for speech recognition in diverse scenarios. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2014, Florence, Italy, May 4-9, 2014, pp. 6314–6318, IEEE, 2014. (Type: Proceedings Article | Links)

Rama Doddipatla; Madina Hasan; Thomas Hain: Speaker dependent bottleneck layer training for speaker adaptation in automatic speech recognition. In: Li, Haizhou; Meng, Helen M.; Ma, Bin; Chng, Engsiong; Xie, Lei (Ed.): INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014, pp. 2199–2203, ISCA, 2014. (Type: Proceedings Article | Links)

Charles Fox; Thomas Hain: Extending Limabeam with discrimination and coarse gradients. In: Li, Haizhou; Meng, Helen M.; Ma, Bin; Chng, Engsiong; Xie, Lei (Ed.): INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014, pp. 2440–2444, ISCA, 2014. (Type: Proceedings Article | Links)

Madina Hasan; Rama Doddipatla; Thomas Hain: Multi-pass sentence-end detection of lecture speech. In: Li, Haizhou; Meng, Helen M.; Ma, Bin; Chng, Engsiong; Xie, Lei (Ed.): INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014, pp. 2902–2906, ISCA, 2014. (Type: Proceedings Article | Links)

Oscar Saz; Mortaza Doulaty; Thomas Hain: Background-tracking acoustic features for genre identification of broadcast shows. In: 2014 IEEE Spoken Language Technology Workshop, SLT 2014, South Lake Tahoe, NV, USA, December 7-10, 2014, pp. 118–123, IEEE, 2014. (Type: Proceedings Article | Links)

Pengyuan Zhang; Yulan Liu; Thomas Hain: Semi-supervised DNN training in meeting recognition. In: 2014 IEEE Spoken Language Technology Workshop, SLT 2014, South Lake Tahoe, NV, USA, December 7-10, 2014, pp. 141–146, IEEE, 2014. (Type: Proceedings Article | Links)

Raymond W. M. Ng; Mortaza Doulaty; Rama Doddipatla; Wilker Aziz; Kashif Shah; Oscar Saz; Madina Hasan; Ghada AlHarbi; Lucia Specia; Thomas Hain: The USFD SLT system for IWSLT 2014. In: Federico, Marcello; Stüker, Sebastian; Yvon, François (Ed.): Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2014, Lake Tahoe, CA, USA, December 4-5, 2014, 2014. (Type: Proceedings Article | Links)

Inigo Casanueva; Heidi Christensen; Thomas Hain; Phil D. Green: Adaptive speech recognition and dialogue management for users with speech disorders. In: Li, Haizhou; Meng, Helen M.; Ma, Bin; Chng, Engsiong; Xie, Lei (Ed.): INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014, pp. 1033–1037, ISCA, 2014. (Type: Proceedings Article | Links)

Heidi Christensen; Inigo Casanueva; Stuart P. Cunningham; Phil D. Green; Thomas Hain: Automatic selection of speakers for improved acoustic modelling: recognition of disordered speech with sparse data. In: 2014 IEEE Spoken Language Technology Workshop, SLT 2014, South Lake Tahoe, NV, USA, December 7-10, 2014, pp. 254–259, IEEE, 2014. (Type: Proceedings Article | Links)

Erfan Loweimi; Jon Barker; Thomas Hain: Compression of model-based group delay function for robust speech recognition. In: University of Sheffield Engineering Symposium, Sheffield, UK, 2014. (Type: Proceedings Article | )

2013

Charles Fox; Thomas Hain: Lightly supervised learning from a damaged natural speech corpus. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2013, Vancouver, BC, Canada, May 26-31, 2013, pp. 8086–8090, IEEE, 2013. (Type: Proceedings Article | Links)

Raymond W. M. Ng; Thomas Hain; Trevor Cohn: Adaptation of lecture speech recognition system with machine translation output. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2013, Vancouver, BC, Canada, May 26-31, 2013, pp. 8401–8405, IEEE, 2013. (Type: Proceedings Article | Links)

Pierre Lanchantin; Peter Bell; Mark J. F. Gales; Thomas Hain; Xunying Liu; Yanhua Long; Jennifer Quinnell; Steve Renals; Oscar Saz; Matthew Stephen Seigel; Pawel Swietojanski; Philip C. Woodland: Automatic Transcription of Multi-genre Media Archives. In: Gravier, Guillaume; Béchet, Frédéric (Ed.): Proceedings of the First Workshop on Speech, Language and Audio in Multimedia, Marseille, France, August 22-23, 2013, pp. 26–31, CEUR-WS.org, 2013. (Type: Proceedings Article | Links)

Charles Fox; Yulan Liu; Erich Zwyssig; Thomas Hain: The sheffield wargames corpus. In: Bimbot, Frédéric; Cerisara, Christophe; Fougeron, Cécile; Gravier, Guillaume; Lamel, Lori; Pellegrino, François; Perrier, Pascal (Ed.): INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013, pp. 1116–1120, ISCA, 2013. (Type: Proceedings Article | Links)

Heidi Christensen; Phil D. Green; Thomas Hain: Learning speaker-specific pronunciations of disordered speech. In: Bimbot, Frédéric; Cerisara, Christophe; Fougeron, Cécile; Gravier, Guillaume; Lamel, Lori; Pellegrino, François; Perrier, Pascal (Ed.): INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013, pp. 1159–1163, ISCA, 2013. (Type: Proceedings Article | Links)

Oscar Saz; Thomas Hain: Asynchronous factorisation of speaker and background with feature transforms in speech recognition. In: Bimbot, Frédéric; Cerisara, Christophe; Fougeron, Cécile; Gravier, Guillaume; Lamel, Lori; Pellegrino, François; Perrier, Pascal (Ed.): INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013, pp. 1238–1242, ISCA, 2013. (Type: Proceedings Article | Links)

Heidi Christensen; M. B. Aniol; Peter Bell; Phil D. Green; Thomas Hain; Simon King; Pawel Swietojanski: Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech. In: Bimbot, Frédéric; Cerisara, Christophe; Fougeron, Cécile; Gravier, Guillaume; Lamel, Lori; Pellegrino, François; Perrier, Pascal (Ed.): INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013, pp. 3642–3645, ISCA, 2013. (Type: Proceedings Article | Links)

Charles Fox; Yulan Liu; Erich Zwyssig; Thomas Hain: The Sheffield Wargames Corpus. In: Proc. Interspeech 2013, ISCA, 2013. (Type: Book Chapter | )

Heidi Christensen; Iñigo Casanueva; Stuart P. Cunningham; Phil D. Green; Thomas Hain: homeService: Voice-enabled assistive technology in the home using cloud-based automatic speech recognition. In: Alexandersson, Jan; Ljunglöf, Peter; McCoy, Kathleen F.; Portet, François; Roark, Brian; Rudzicz, Frank; Vacher, Michel (Ed.): Proceedings of the Fourth Workshop on Speech and Language Processing for Assistive Technologies, SLPAT 2013, Grenoble, France, August 21-22, 2013, pp. 29–34, Association for Computational Linguistics, 2013. (Type: Proceedings Article | Links)

2012

Heidi Christensen; Stuart P. Cunningham; Charles Fox; Phil D. Green; Thomas Hain: A comparative study of adaptive, automatic recognition of disordered speech. Proc Interspeech 2012, Portland, Oregon, US, 2012. (Type: Conference | Links)

Thomas Hain; Lukás Burget; John Dines; Philip N. Garner; Frantisek Grézl; Asmaa El Hannani; Marijn Huijbregts; Martin Karafiát; Mike Lincoln; Vincent Wan: Transcribing Meetings With the AMIDA Systems. In: IEEE Trans. Speech Audio Process., vol. 20, no. 2, pp. 486–498, 2012. (Type: Journal Article | Links)

Matthew Gibson; Thomas Hain: Correctness-Adjusted Unsupervised Discriminative Acoustic Model Adaptation. In: IEEE Trans. Speech Audio Process., vol. 20, no. 10, pp. 2648–2656, 2012. (Type: Journal Article | Links)

Sarah Al-Shareef; Thomas Hain: CRF-based Diacritisation of Colloquial Arabic for Automatic Speech Recognition. In: INTERSPEECH, 2012. (Type: Proceedings Article | )

Sarah Al-Shareef; Thomas Hain: Conditional Random Fields Based Diacritisation of Colloquial Arabic. In: Saudi International Conference, 2012. (Type: Proceedings Article | )

Charles Fox; Heidi Christensen; Thomas Hain: Studio report: Linux audio for multi-speaker natural speech technology.. Proc. Linux Audio Conference, 2012. (Type: Conference | Links)

2011

Sarah Al-Shareef; Thomas Hain: An Investigation in Speech Recognition for Colloquial Arabic. In: INTERSPEECH, 2011. (Type: Proceedings Article | )

Davide Marino; Thomas Hain: An Analysis of Automatic Speech Recognition with Multiple Microphones. In: INTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association, Florence, Italy, August 27-31, 2011, pp. 1281–1284, ISCA, 2011. (Type: Proceedings Article | Links)

Stuart N. Wrigley; Thomas Hain: Web-Based Automatic Speech Recognition Service - webASR. In: INTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association, Florence, Italy, August 27-31, 2011, pp. 3265–3268, ISCA, 2011. (Type: Proceedings Article | Links)

Stuart N. Wrigley; Thomas Hain: Making an Automatic Speech Recognition Service Freely Available on the Web. In: INTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association, Florence, Italy, August 27-31, 2011, pp. 3325–3326, ISCA, 2011. (Type: Proceedings Article | Links)

Roger C. F. Tucker; Dan Fry; Vincent Wan; Stuart N. Wrigley; Thomas Hain: Extending Audio Notetaker to Browse WebASR Transcriptions. In: INTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association, Florence, Italy, August 27-31, 2011, pp. 3329–3330, ISCA, 2011. (Type: Proceedings Article | Links)

2008

Thomas Hain; Asmaa El Hannani; Stuart N. Wrigley; Vincent Wan: Automatic speech recognition for scientific purposes - webASR. In: INTERSPEECH 2008, 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008, pp. 504–507, ISCA, 2008. (Type: Proceedings Article | Links)

2006

Jean Carletta; Simone Ashby; Sebastien Bourban; Mike Flynn; Mael Guillemot; Thomas Hain; Jaroslav Kadlec; Vasilis Karaiskos; Wessel Kraaij; Melissa Kronenthal; Guillaume Lathoud; Mike Lincoln; Agnes Lisowka; Iain McCowan; Wilfried Post; Denns Reidsma; Pierre Wellner: The AMI meeting corpus: A pre-announcement. In: Machine learning for multimodal interaction, pp. 28–39, Springer, 2006. (Type: Book Section | )

Steve Young; Gunnar Evermann; Mark J. F. Gales; Thomas Hain; Dan Kershaw; XA Liu; Gareth Moore; Julian Odell; Dave Ollason; Dan Povey; Valtcho Valtchev; Philip C. Woodland: The HTK book (Version 3.3, Version 3.4). In: 2006. (Type: Journal Article | )

2005

Iain McCowan; Jean Carletta; Wessel Kraaij; Simone Ashby; Sebastien Bourban; Mike Flynn; Maël Guillemot; Thomas Hain; Jaroslav Kadlec; Vasilis Karaiskos; others: The AMI meeting corpus. In: Proceedings of the 5th International Conference on Methods and Techniques in Behavioral Research, vol. 88, 2005. (Type: Journal Article | )

145 entries « ‹ 3 of 3 › »