Thomas Hain

Thomas Hain
Head of Group

Thomas Hain

Head of Group

Thomas holds the degree ‘Dipl.-Ing’ from the University of Technology, Vienna, and a PhD from Cambridge University. After work at Philips Speech Processing, Vienna he joined the Cambridge University Engineering Department in 1997, and moved to SpandH in 2004. He was promoted to Professor in 2012 He is leading the Machine Intelligence for Natural Interfaces subgroup. Thomas has published on machine learning and speech recognition topics in more than 100 publications in international conferences, journals and books (h-index 26). Apart from membership of many technical committees, including speech recognition area chair at ICASSP and Interspeech, he served on the IEEE Speech Technical Committee from 2007-2009, and as organizing committee member of Interspeech 2009, IEEE ASRU 2011 and 2013. He is currently member of the editorial board of Computer Speech and Language (CSL) and Associate Editor of the ACM Transactions on Speech and Language Processing. His recent research has its focus on recognition of natural speech in realistic environments, and on integration of speech technology with downstream processes such as content linking, summarisation or machine translation.

Contact

Contact Information

Office G041, Regent Court, 211 Portobello, Sheffield S1 4DP, UK

+44 114 222 1836

t.hain@sheffield.ac.uk

Publications

127 entries « 2 of 3 »

2017

Salil Deena; Raymond W. M. Ng; Pranava Swaroop Madhyastha; Lucia Specia; Thomas Hain: Semi-Supervised Adaptation of RNNLMs by Fine-Tuning with Domain-Specific Auxiliary Features. In: Lacerda, Francisco (Ed.): Interspeech 2017, 18th Annual Conference of the International Speech Communication Association, Stockholm, Sweden, August 20-24, 2017, pp. 2715–2719, ISCA, 2017. (Type: Proceedings Article | Links)
Chenhao Wu; Raymond W. M. Ng; Oscar Saz; Thomas Hain: Analysing acoustic model changes for active learning in automatic speech recognition. In: International Conference on Systems, Signals and Image Processing, IWSSIP 2017, Poznań, Poland, May 22-24, 2017, pp. 1–5, IEEE, 2017. (Type: Proceedings Article | Links)
Erfan Loweimi; Jon Barker; Oscar Saz; Thomas Hain: Robust Source-Filter Separation of Speech Signal in the Phase Domain. In: Lacerda, Francisco (Ed.): Interspeech 2017, 18th Annual Conference of the International Speech Communication Association, Stockholm, Sweden, August 20-24, 2017, pp. 414–418, ISCA, 2017. (Type: Proceedings Article | Links)
Raymond W. M. Ng; Alvin C. M. Kwan; Tan Lee; Thomas Hain: Shefce: A Cantonese-English bilingual speech corpus for pronunciation assessment. In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2017, New Orleans, LA, USA, March 5-9, 2017, pp. 5825–5829, IEEE, 2017. (Type: Proceedings Article | Links)

2016

Raymond W. M. Ng; Kashif Shah; Lucia Specia; Thomas Hain: Groupwise learning for ASR k-best list reranking in spoken langauge translation. In: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2016. (Type: Proceedings Article | )
Rosanna Milner; Thomas Hain: Segment-oriented evaluation of speaker diarisation performance. In: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2016, Shanghai, China, March 20-25, 2016, pp. 5460–5464, IEEE, 2016. (Type: Proceedings Article | Links)
Raymond W. M. Ng; Kashif Shah; Lucia Specia; Thomas Hain: Groupwise learning for ASR k-best list reranking in spoken language translation. In: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2016, Shanghai, China, March 20-25, 2016, pp. 6120–6124, IEEE, 2016. (Type: Proceedings Article | Links)
Sarah Al-Shareef; Thomas Hain: Colloquialising Modern Standard Arabic Text for Improved Speech Recognition. In: Morgan, Nelson (Ed.): Interspeech 2016, 17th Annual Conference of the International Speech Communication Association, San Francisco, CA, USA, September 8-12, 2016, pp. 1345–1349, ISCA, 2016. (Type: Proceedings Article | Links)
Thomas Hain; Jeremy Christian; Oscar Saz; Salil Deena; Madina Hasan; Raymond W. M. Ng; Rosanna Milner; Mortaza Doulaty; Yulan Liu: webASR 2 - Improved Cloud Based Speech Technology. In: Morgan, Nelson (Ed.): Interspeech 2016, 17th Annual Conference of the International Speech Communication Association, San Francisco, CA, USA, September 8-12, 2016, pp. 1613–1617, ISCA, 2016. (Type: Proceedings Article | Links)
Julia Olcoz; Oscar Saz; Thomas Hain: Error Correction in Lightly Supervised Alignment of Broadcast Subtitles. In: Morgan, Nelson (Ed.): Interspeech 2016, 17th Annual Conference of the International Speech Communication Association, San Francisco, CA, USA, September 8-12, 2016, pp. 2110–2114, ISCA, 2016. (Type: Proceedings Article | Links)
Mortaza Doulaty; Oscar Saz; Raymond W. M. Ng; Thomas Hain: Automatic Genre and Show Identification of Broadcast Media. In: Morgan, Nelson (Ed.): Interspeech 2016, 17th Annual Conference of the International Speech Communication Association, San Francisco, CA, USA, September 8-12, 2016, pp. 2115–2119, ISCA, 2016. (Type: Proceedings Article | Links)
Rosanna Milner; Thomas Hain: DNN-Based Speaker Clustering for Speaker Diarisation. In: Morgan, Nelson (Ed.): Interspeech 2016, 17th Annual Conference of the International Speech Communication Association, San Francisco, CA, USA, September 8-12, 2016, pp. 2185–2189, ISCA, 2016. (Type: Proceedings Article | Links)
Salil Deena; Madina Hasan; Mortaza Doulaty; Oscar Saz; Thomas Hain: Combining Feature and Model-Based Adaptation of RNNLMs for Multi-Genre Broadcast Speech Recognition. In: Morgan, Nelson (Ed.): Interspeech 2016, 17th Annual Conference of the International Speech Communication Association, San Francisco, CA, USA, September 8-12, 2016, pp. 2343–2347, ISCA, 2016. (Type: Proceedings Article | Links)
Iñigo Casanueva; Thomas Hain; Phil D. Green: Improving Generalisation to New Speakers in Spoken Dialogue State Tracking. In: Morgan, Nelson (Ed.): Interspeech 2016, 17th Annual Conference of the International Speech Communication Association, San Francisco, CA, USA, September 8-12, 2016, pp. 2726–2730, ISCA, 2016. (Type: Proceedings Article | Links)
Raymond W. M. Ng; Bhusan Chettri; Thomas Hain: Combining Weak Tokenisers for Phonotactic Language Recognition in a Resource-Constrained Setting. In: Morgan, Nelson (Ed.): Interspeech 2016, 17th Annual Conference of the International Speech Communication Association, San Francisco, CA, USA, September 8-12, 2016, pp. 2939–2943, ISCA, 2016. (Type: Proceedings Article | Links)
Erfan Loweimi; Jon Barker; Thomas Hain: Use of Generalised Nonlinearity in Vector Taylor Series Noise Compensation for Robust Speech Recognition. In: Morgan, Nelson (Ed.): Interspeech 2016, 17th Annual Conference of the International Speech Communication Association, San Francisco, CA, USA, September 8-12, 2016, pp. 3798–3802, ISCA, 2016. (Type: Proceedings Article | Links)
Yulan Liu; Charles Fox; Madina Hasan; Thomas Hain: The Sheffield Wargame Corpus - Day Two and Day Three. In: Morgan, Nelson (Ed.): Interspeech 2016, 17th Annual Conference of the International Speech Communication Association, San Francisco, CA, USA, September 8-12, 2016, pp. 3833–3837, ISCA, 2016. (Type: Proceedings Article | Links)
Ghada AlHarbi; Thomas Hain: The OpenCourseWare Metadiscourse (OCWMD) Corpus. In: Calzolari, Nicoletta; Choukri, Khalid; Declerck, Thierry; Goggi, Sara; Grobelnik, Marko; Maegaard, Bente; Mariani, Joseph; Mazo, Hélène; Moreno, Asunción; Odijk, Jan; Piperidis, Stelios (Ed.): Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, Portorož, Slovenia, May 23-28, 2016, European Language Resources Association (ELRA), 2016. (Type: Proceedings Article | Links)
Raymond W. M. Ng; Mauro Nicolao; Oscar Saz; Madina Hasan; Bhusan Chettri; Mortaza Doulaty; Tan Lee; Thomas Hain: The Sheffield language recognition system in NIST LRE 2015. In: Rodríguez-Fuentes, Luis Javier; Lleida, Eduardo (Ed.): Odyssey 2016: The Speaker and Language Recognition Workshop, Bilbao, Spain, June 21-24, 2016, pp. 181–187, ISCA, 2016. (Type: Proceedings Article | Links)
Iñigo Casanueva; Thomas Hain; Mauro Nicolao; Phil D. Green: Using phone features to improve dialogue state tracking generalisation to unseen states. In: Proceedings of the SIGDIAL 2016 Conference, The 17th Annual Meeting of the Special Interest Group on Discourse and Dialogue, 13-15 September 2016, Los Angeles, CA, USA, pp. 80–89, The Association for Computer Linguistics, 2016. (Type: Proceedings Article | Links)
Mauro Nicolao; Heidi Christensen; Stuart P. Cunningham; Phil D. Green; Thomas Hain: A Framework for Collecting Realistic Recordings of Dysarthric Speech - the homeService Corpus. In: Calzolari, Nicoletta; Choukri, Khalid; Declerck, Thierry; Goggi, Sara; Grobelnik, Marko; Maegaard, Bente; Mariani, Joseph; Mazo, Hélène; Moreno, Asunción; Odijk, Jan; Piperidis, Stelios (Ed.): Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, Portorož, Slovenia, May 23-28, 2016, European Language Resources Association (ELRA), 2016. (Type: Proceedings Article | Links)

2015

Heidi Christensen; Mauro Nicolao; Stuart P. Cunningham; Salil Deena; Phil D. Green; Thomas Hain: Speech-Enabled Environmental Control in an AAL setting for people with Speech Disorders: a Case Study. In: IET International Conference on Technologies for Active and Assisted Living, TechAAL 2015, London, UK, 2015. (Type: Proceedings Article | )
Yulan Liu; Penny Karanasou; Thomas Hain: An Investigation Into Speaker Informed DNN Front-end for LVCSR. In: Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on, 2015. (Type: Proceedings Article | )
Erfan Loweimi; Jon Barker; Thomas Hain: Emotion Recognition from Speech Signal by Effective Combination of the Generative and Discriminative Models. In: University of Sheffield Engineering Symposium, Sheffield, UK, 2015. (Type: Proceedings Article | )
Mortaza Doulaty; Oscar Saz; Raymond W. M. Ng; Thomas Hain: Latent Dirichlet Allocation based organisation of broadcast media archives for deep neural network adaptation. In: 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015, Scottsdale, AZ, USA, December 13-17, 2015, pp. 130–136, IEEE, 2015. (Type: Proceedings Article | Links)
Oscar Saz; Mortaza Doulaty; Salil Deena; Rosanna Milner; Raymond W. M. Ng; Madina Hasan; Yulan Liu; Thomas Hain: The 2015 sheffield system for transcription of Multi-Genre Broadcast media. In: 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015, Scottsdale, AZ, USA, December 13-17, 2015, pp. 624–631, IEEE, 2015. (Type: Proceedings Article | Links)
Rosanna Milner; Oscar Saz; Salil Deena; Mortaza Doulaty; Raymond W. M. Ng; Thomas Hain: The 2015 sheffield system for longitudinal diarisation of broadcast media. In: 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015, Scottsdale, AZ, USA, December 13-17, 2015, pp. 632–638, IEEE, 2015. (Type: Proceedings Article | Links)
Peter Bell; Mark J. F. Gales; Thomas Hain; Jonathan Kilgour; Pierre Lanchantin; Xunying Liu; Andrew McParland; Steve Renals; Oscar Saz; Mirjam Wester; Philip C. Woodland: The MGB challenge: Evaluating multi-genre broadcast media recognition. In: 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015, Scottsdale, AZ, USA, December 13-17, 2015, pp. 687–693, IEEE, 2015. (Type: Proceedings Article | Links)
Ghada AlHarbi; Thomas Hain: Using Topic Segmentation Models for the Automatic Organisation of MOOCs resources. In: Santos, Olga C.; Boticario, Jesus; Romero, Cristóbal; Pechenizkiy, Mykola; Merceron, Agathe; Mitros, Piotr; Luna, José María; Mihaescu, Marian Cristian; Moreno, Pablo; Hershkovitz, Arnon; Ventura, Sebastián; Desmarais, Michel C. (Ed.): Proceedings of the 8th International Conference on Educational Data Mining, EDM 2015, Madrid, Spain, June 26-29, 2015, pp. 524–527, International Educational Data Mining Society (IEDMS), 2015. (Type: Proceedings Article | Links)
Yulan Liu; Penny Karanasou; Thomas Hain: An investigation into speaker informed DNN front-end for LVCSR. In: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2015, South Brisbane, Queensland, Australia, April 19-24, 2015, pp. 4300–4304, IEEE, 2015. (Type: Proceedings Article | Links)
Raymond W. M. Ng; Kashif Shah; Wilker Aziz; Lucia Specia; Thomas Hain: Quality estimation for asr k-best list rescoring in spoken language translation. In: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2015, South Brisbane, Queensland, Australia, April 19-24, 2015, pp. 5226–5230, IEEE, 2015. (Type: Proceedings Article | Links)
Mauro Nicolao; Amy V. Beeston; Thomas Hain: Automatic assessment of English learner pronunciation using discriminative classifiers. In: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2015, South Brisbane, Queensland, Australia, April 19-24, 2015, pp. 5351–5355, IEEE, 2015. (Type: Proceedings Article | Links)
Madina Hasan; Rama Doddipatla; Thomas Hain: Noise-matched training of CRF based sentence end detection models. In: INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015, pp. 349–353, ISCA, 2015. (Type: Proceedings Article | Links)
Erfan Loweimi; Jon Barker; Thomas Hain: Source-filter separation of speech signal in the phase domain. In: INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015, pp. 598–602, ISCA, 2015. (Type: Proceedings Article | Links)
Raymond W. M. Ng; Kashif Shah; Lucia Specia; Thomas Hain: A study on the stability and effectiveness of features in quality estimation for spoken language translation. In: INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015, pp. 2257–2261, ISCA, 2015. (Type: Proceedings Article | Links)
Mortaza Doulaty; Oscar Saz; Thomas Hain: Data-selective transfer learning for multi-domain speech recognition. In: INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015, pp. 2897–2901, ISCA, 2015. (Type: Proceedings Article | Links)
Mortaza Doulaty; Oscar Saz; Thomas Hain: Unsupervised domain discovery using latent dirichlet allocation for acoustic modelling in speech recognition. In: INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015, pp. 3640–3644, ISCA, 2015. (Type: Proceedings Article | Links)
Iñigo Casanueva; Thomas Hain; Heidi Christensen; Ricard Marxer; Phil D. Green: Knowledge transfer between speakers for personalised dialogue management. In: Proceedings of the SIGDIAL 2015 Conference, The 16th Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2-4 September 2015, Prague, Czech Republic, pp. 12–21, The Association for Computer Linguistics, 2015. (Type: Proceedings Article | Links)
Erfan Loweimi; Mortaza Doulaty; Jon Barker; Thomas Hain: Long-Term Statistical Feature Extraction from Speech Signal and Its Application in Emotion Recognition. In: Dediu, Adrian-Horia; Martín-Vide, Carlos; Vicsi, Klára (Ed.): Statistical Language and Speech Processing - Third International Conference, SLSP 2015, Budapest, Hungary, November 24-26, 2015, Proceedings, pp. 173–184, Springer, 2015. (Type: Proceedings Article | Links)
Ghada AlHarbi; Raymond W. M. Ng; Thomas Hain: Annotating meta-discourse in academic lectures from different disciplines. In: Steidl, Stefan; Batliner, Anton; Jokisch, Oliver (Ed.): ISCA International Workshop on Speech and Language Technology in Education, SLaTE 2015, Leipzig, Germany, September 4-5, 2015, pp. 161–166, ISCA, 2015. (Type: Proceedings Article | Links)

2014

Pengyuan Zhang; Yulan Liu; Thomas Hain: Semi-Supervised DNN Training in Meeting Recognition. In: 2014 IEEE Spoken Language Technology Workshop (SLT 2014), South Lake Tahoe, USA, 2014. (Type: Proceedings Article | Links)
Yulan Liu; Pengyuan Zhang; Thomas Hain: Using neural network front-ends on far field multiple microphones based speech recognition. In: Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on, pp. 5542-5546, 2014. (Type: Proceedings Article | Links)
Yulan Liu; Pengyuan Zhang; Thomas Hain: Using neural network front-ends on far field multiple microphones based speech recognition. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2014, Florence, Italy, May 4-9, 2014, pp. 5542–5546, IEEE, 2014. (Type: Proceedings Article | Links)
Oscar Saz; Thomas Hain: Using contextual information in joint factor eigenspace MLLR for speech recognition in diverse scenarios. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2014, Florence, Italy, May 4-9, 2014, pp. 6314–6318, IEEE, 2014. (Type: Proceedings Article | Links)
Rama Doddipatla; Madina Hasan; Thomas Hain: Speaker dependent bottleneck layer training for speaker adaptation in automatic speech recognition. In: Li, Haizhou; Meng, Helen M.; Ma, Bin; Chng, Engsiong; Xie, Lei (Ed.): INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014, pp. 2199–2203, ISCA, 2014. (Type: Proceedings Article | Links)
Charles Fox; Thomas Hain: Extending Limabeam with discrimination and coarse gradients. In: Li, Haizhou; Meng, Helen M.; Ma, Bin; Chng, Engsiong; Xie, Lei (Ed.): INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014, pp. 2440–2444, ISCA, 2014. (Type: Proceedings Article | Links)
Madina Hasan; Rama Doddipatla; Thomas Hain: Multi-pass sentence-end detection of lecture speech. In: Li, Haizhou; Meng, Helen M.; Ma, Bin; Chng, Engsiong; Xie, Lei (Ed.): INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014, pp. 2902–2906, ISCA, 2014. (Type: Proceedings Article | Links)
Oscar Saz; Mortaza Doulaty; Thomas Hain: Background-tracking acoustic features for genre identification of broadcast shows. In: 2014 IEEE Spoken Language Technology Workshop, SLT 2014, South Lake Tahoe, NV, USA, December 7-10, 2014, pp. 118–123, IEEE, 2014. (Type: Proceedings Article | Links)
Pengyuan Zhang; Yulan Liu; Thomas Hain: Semi-supervised DNN training in meeting recognition. In: 2014 IEEE Spoken Language Technology Workshop, SLT 2014, South Lake Tahoe, NV, USA, December 7-10, 2014, pp. 141–146, IEEE, 2014. (Type: Proceedings Article | Links)
Raymond W. M. Ng; Mortaza Doulaty; Rama Doddipatla; Wilker Aziz; Kashif Shah; Oscar Saz; Madina Hasan; Ghada AlHarbi; Lucia Specia; Thomas Hain: The USFD SLT system for IWSLT 2014. In: Federico, Marcello; Stüker, Sebastian; Yvon, François (Ed.): Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2014, Lake Tahoe, CA, USA, December 4-5, 2014, 2014. (Type: Proceedings Article | Links)
127 entries « 2 of 3 »

Back to Top