Publications

Erfan Loweimi; Mortaza Doulaty; Jon Barker; Thomas Hain: Long-Term Statistical Feature Extraction from Speech Signal and Its Application in Emotion Recognition. In: Dediu, Adrian-Horia; Martín-Vide, Carlos; Vicsi, Klára (Ed.): Statistical Language and Speech Processing - Third International Conference, SLSP 2015, Budapest, Hungary, November 24-26, 2015, Proceedings, pp. 173–184, Springer, 2015. (Type: Proceedings Article | Links)

Madina Hasan; Rama Doddipatla; Thomas Hain: Noise-matched training of CRF based sentence end detection models. In: INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015, pp. 349–353, ISCA, 2015. (Type: Proceedings Article | Links)

Raymond W. M. Ng; Kashif Shah; Wilker Aziz; Lucia Specia; Thomas Hain: Quality estimation for ASR k-best list rescoring in spoken language translation. In: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2015, South Brisbane, Queensland, Australia, April 19-24, 2015, pp. 5226–5230, IEEE, 2015. (Type: Proceedings Article | Links)

Erfan Loweimi; Jon Barker; Thomas Hain: Source-filter separation of speech signal in the phase domain. In: INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015, pp. 598–602, ISCA, 2015. (Type: Proceedings Article | Links)

Rosanna Milner; Oscar Saz; Salil Deena; Mortaza Doulaty; Raymond W. M. Ng; Thomas Hain: The 2015 Sheffield system for longitudinal diarisation of broadcast media. In: 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015, Scottsdale, AZ, USA, December 13-17, 2015, pp. 632–638, IEEE, 2015. (Type: Proceedings Article | Links)

Oscar Saz; Mortaza Doulaty; Salil Deena; Rosanna Milner; Raymond W. M. Ng; Madina Hasan; Yulan Liu; Thomas Hain: The 2015 Sheffield system for transcription of Multi-Genre Broadcast media. In: 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015, Scottsdale, AZ, USA, December 13-17, 2015, pp. 624–631, IEEE, 2015. (Type: Proceedings Article | Links)

Peter Bell; Mark J. F. Gales; Thomas Hain; Jonathan Kilgour; Pierre Lanchantin; Xunying Liu; Andrew McParland; Steve Renals; Oscar Saz; Mirjam Wester; Philip C. Woodland: The MGB challenge: Evaluating multi-genre broadcast media recognition. In: 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015, Scottsdale, AZ, USA, December 13-17, 2015, pp. 687–693, IEEE, 2015. (Type: Proceedings Article | Links)

Mortaza Doulaty; Oscar Saz; Thomas Hain: Unsupervised domain discovery using latent Dirichlet allocation for acoustic modelling in speech recognition. In: INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015, pp. 3640–3644, ISCA, 2015. (Type: Proceedings Article | Links)

Ghada AlHarbi; Thomas Hain: Using Topic Segmentation Models for the Automatic Organisation of MOOCs resources. In: Santos, Olga C.; Boticario, Jesus; Romero, Cristóbal; Pechenizkiy, Mykola; Merceron, Agathe; Mitros, Piotr; Luna, José María; Mihaescu, Marian Cristian; Moreno, Pablo; Hershkovitz, Arnon; Ventura, Sebastián; Desmarais, Michel C. (Ed.): Proceedings of the 8th International Conference on Educational Data Mining, EDM 2015, Madrid, Spain, June 26-29, 2015, pp. 524–527, International Educational Data Mining Society (IEDMS), 2015. (Type: Proceedings Article | Links)

Inigo Casanueva; Heidi Christensen; Thomas Hain; Phil D. Green: Adaptive speech recognition and dialogue management for users with speech disorders. In: Li, Haizhou; Meng, Helen M.; Ma, Bin; Chng, Engsiong; Xie, Lei (Ed.): INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014, pp. 1033–1037, ISCA, 2014. (Type: Proceedings Article | Links)

Heidi Christensen; Inigo Casanueva; Stuart P. Cunningham; Phil D. Green; Thomas Hain: Automatic selection of speakers for improved acoustic modelling: recognition of disordered speech with sparse data. In: 2014 IEEE Spoken Language Technology Workshop, SLT 2014, South Lake Tahoe, NV, USA, December 7-10, 2014, pp. 254–259, IEEE, 2014. (Type: Proceedings Article | Links)

Oscar Saz; Mortaza Doulaty; Thomas Hain: Background-tracking acoustic features for genre identification of broadcast shows. In: 2014 IEEE Spoken Language Technology Workshop, SLT 2014, South Lake Tahoe, NV, USA, December 7-10, 2014, pp. 118–123, IEEE, 2014. (Type: Proceedings Article | Links)

Erfan Loweimi; Jon Barker; Thomas Hain: Compression of model-based group delay function for robust speech recognition. In: University of Sheffield Engineering Symposium, Sheffield, UK, 2014. (Type: Proceedings Article)

Charles Fox; Thomas Hain: Extending Limabeam with discrimination and coarse gradients. In: Li, Haizhou; Meng, Helen M.; Ma, Bin; Chng, Engsiong; Xie, Lei (Ed.): INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014, pp. 2440–2444, ISCA, 2014. (Type: Proceedings Article | Links)

Madina Hasan; Rama Doddipatla; Thomas Hain: Multi-pass sentence-end detection of lecture speech. In: Li, Haizhou; Meng, Helen M.; Ma, Bin; Chng, Engsiong; Xie, Lei (Ed.): INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014, pp. 2902–2906, ISCA, 2014. (Type: Proceedings Article | Links)

Pengyuan Zhang; Yulan Liu; Thomas Hain: Semi-supervised DNN training in meeting recognition. In: 2014 IEEE Spoken Language Technology Workshop, SLT 2014, South Lake Tahoe, NV, USA, December 7-10, 2014, pp. 141–146, IEEE, 2014. (Type: Proceedings Article | Links)

Rama Doddipatla; Madina Hasan; Thomas Hain: Speaker dependent bottleneck layer training for speaker adaptation in automatic speech recognition. In: Li, Haizhou; Meng, Helen M.; Ma, Bin; Chng, Engsiong; Xie, Lei (Ed.): INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014, pp. 2199–2203, ISCA, 2014. (Type: Proceedings Article | Links)

Raymond W. M. Ng; Mortaza Doulaty; Rama Doddipatla; Wilker Aziz; Kashif Shah; Oscar Saz; Madina Hasan; Ghada AlHarbi; Lucia Specia; Thomas Hain: The USFD SLT system for IWSLT 2014. In: Federico, Marcello; Stüker, Sebastian; Yvon, François (Ed.): Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2014, Lake Tahoe, CA, USA, December 4-5, 2014, 2014. (Type: Proceedings Article | Links)

Oscar Saz; Thomas Hain: Using contextual information in joint factor eigenspace MLLR for speech recognition in diverse scenarios. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2014, Florence, Italy, May 4-9, 2014, pp. 6314–6318, IEEE, 2014. (Type: Proceedings Article | Links)

Yulan Liu; Pengyuan Zhang; Thomas Hain: Using neural network front-ends on far field multiple microphones based speech recognition. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2014, Florence, Italy, May 4-9, 2014, pp. 5542–5546, IEEE, 2014. (Type: Proceedings Article | Links)

Mauro Nicolao; Fabio Tesser; Roger K. Moore: A phonetic-contrast motivated adaptation to control the degree-of-articulation on Italian HMM-based synthetic voices. In: 8th ISCA Workshop on Speech Synthesis, pp. 127–132, Barcelona, Spain, 2013. (Type: Proceedings Article)

Erfan Loweimi; Seyed Mohammad Ahadi; Thomas Drugman: A new phase-based feature representation for robust speech recognition. In: IEEE International Conference on Acoustics, Speech and Signal Processing, Vancouver, Canada, 2013. (Type: Proceedings Article)

Raymond W. M. Ng; Thomas Hain; Trevor Cohn: Adaptation of lecture speech recognition system with machine translation output. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2013, Vancouver, BC, Canada, May 26-31, 2013, pp. 8401–8405, IEEE, 2013. (Type: Proceedings Article | Links)

Oscar Saz; Thomas Hain: Asynchronous factorisation of speaker and background with feature transforms in speech recognition. In: Bimbot, Frédéric; Cerisara, Christophe; Fougeron, Cécile; Gravier, Guillaume; Lamel, Lori; Pellegrino, François; Perrier, Pascal (Ed.): INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013, pp. 1238–1242, ISCA, 2013. (Type: Proceedings Article | Links)

Pierre Lanchantin; Peter Bell; Mark J. F. Gales; Thomas Hain; Xunying Liu; Yanhua Long; Jennifer Quinnell; Steve Renals; Oscar Saz; Matthew Stephen Seigel; Pawel Swietojanski; Philip C. Woodland: Automatic Transcription of Multi-genre Media Archives. In: Gravier, Guillaume; Béchet, Frédéric (Ed.): Proceedings of the First Workshop on Speech, Language and Audio in Multimedia, Marseille, France, August 22-23, 2013, pp. 26–31, CEUR-WS.org, 2013. (Type: Proceedings Article | Links)

Heidi Christensen; M. B. Aniol; Peter Bell; Phil D. Green; Thomas Hain; Simon King; Pawel Swietojanski: Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech. In: Bimbot, Frédéric; Cerisara, Christophe; Fougeron, Cécile; Gravier, Guillaume; Lamel, Lori; Pellegrino, François; Perrier, Pascal (Ed.): INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013, pp. 3642–3645, ISCA, 2013. (Type: Proceedings Article | Links)

Sarah Al-Shareef: Conversational Arabic Automatic Speech Recognition: Literature Review. The University of Sheffield, 2013. (Type: Technical Report)

D. M. González; Phil D. Green; Heidi Christensen: Dysarthria Intelligibility Assessment in a Factor Analysis Total Variability Space. Interspeech’13, 2013. (Type: Conference | Links)

Heidi Christensen; Iñigo Casanueva; Stuart P. Cunningham; Phil D. Green; Thomas Hain: homeService: Voice-enabled assistive technology in the home using cloud-based automatic speech recognition. In: Alexandersson, Jan; Ljunglöf, Peter; McCoy, Kathleen F.; Portet, François; Roark, Brian; Rudzicz, Frank; Vacher, Michel (Ed.): Proceedings of the Fourth Workshop on Speech and Language Processing for Assistive Technologies, SLPAT 2013, Grenoble, France, August 21-22, 2013, pp. 29–34, Association for Computational Linguistics, 2013. (Type: Proceedings Article | Links)

Heidi Christensen; Phil D. Green; Thomas Hain: Learning speaker-specific pronunciations of disordered speech. In: Bimbot, Frédéric; Cerisara, Christophe; Fougeron, Cécile; Gravier, Guillaume; Lamel, Lori; Pellegrino, François; Perrier, Pascal (Ed.): INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013, pp. 1159–1163, ISCA, 2013. (Type: Proceedings Article | Links)

Charles Fox; Thomas Hain: Lightly supervised learning from a damaged natural speech corpus. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2013, Vancouver, BC, Canada, May 26-31, 2013, pp. 8086–8090, IEEE, 2013. (Type: Proceedings Article | Links)

Erfan Loweimi; Seyed Mohammad Ahadi; Thomas Drugman; Samira Loveymi: On the importance of pre-emphasis and window shape in phase-based speech recognition. In: Lecture Notes in Computer Science, Advances in Non-Linear Speech Processing (NOLISP), Mons, Belgium, 2013. (Type: Proceedings Article)

Charles Fox; Yulan Liu; Erich Zwyssig; Thomas Hain: The Sheffield Wargames Corpus. In: Bimbot, Frédéric; Cerisara, Christophe; Fougeron, Cécile; Gravier, Guillaume; Lamel, Lori; Pellegrino, François; Perrier, Pascal (Ed.): INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013, pp. 1116–1120, ISCA, 2013. (Type: Proceedings Article | Links)

Heidi Christensen; Stuart P. Cunningham; Charles Fox; Phil D. Green; Thomas Hain: A comparative study of adaptive, automatic recognition of disordered speech. Proc. Interspeech 2012, Portland, Oregon, US, 2012. (Type: Conference | Links)

Mauro Nicolao; Javier Latorre; Roger K. Moore: C2H: A Computational Model of H&H-based Phonetic Contrast in Synthetic Speech. In: Proceedings of 13th Annual Conference of the International Speech Communication Association, INTERSPEECH 2012, Portland, OR, 2012. (Type: Proceedings Article)

Mauro Nicolao; Roger K. Moore: Establishing some principles of human speech production through two-dimensional computational models. In: SAPA-SCALE workshop 2012, Portland, OR, 2012. (Type: Proceedings Article)

Sarah Al-Shareef; Thomas Hain: Conditional Random Fields Based Diacritisation of Colloquial Arabic. In: Saudi International Conference, 2012. (Type: Proceedings Article)

Matthew Gibson; Thomas Hain: Correctness-Adjusted Unsupervised Discriminative Acoustic Model Adaptation. In: IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 10, pp. 2648–2656, 2012. (Type: Journal Article | Links)

Sarah Al-Shareef; Thomas Hain: CRF-based Diacritisation of Colloquial Arabic for Automatic Speech Recognition. In: INTERSPEECH, 2012. (Type: Proceedings Article)

Heidi Christensen; Siddharth Sehgal; Peter O’Neill; Zoe Clarke; Simon Judge; Stuart P. Cunningham; Mark S. Hawley: SPECS - an embedded platform, speech-driven environmental control system evaluated in a virtuous circle framework. Proc. Workshop on Innovation and Applications in Speech Technology, 2012. (Type: Conference | Links)

Charles Fox; Heidi Christensen; Thomas Hain: Studio report: Linux audio for multi-speaker natural speech technology. Proc. Linux Audio Conference, 2012. (Type: Conference | Links)

Thomas Hain; Lukás Burget; John Dines; Philip N. Garner; Frantisek Grézl; Asmaa El Hannani; Marijn Huijbregts; Martin Karafiát; Mike Lincoln; Vincent Wan: Transcribing Meetings With the AMIDA Systems. In: IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 2, pp. 486–498, 2012. (Type: Journal Article | Links)

Roger K. Moore; Mauro Nicolao: Reactive Speech Synthesis: Actively Managing Phonetic Contrast Along an H&H Continuum. In: Proceedings of the 17th International Congress of Phonetic Sciences, ICPhS 2011, pp. 1422–1425, Hong Kong, China, 2011. (Type: Proceedings Article)

Erfan Loweimi; Seyed Mohammad Ahadi: A new group delay-based feature for robust speech recognition. In: IEEE International Conference on Multimedia and Expo (ICME), Barcelona, Spain, 2011. (Type: Proceedings Article)

Davide Marino; Thomas Hain: An Analysis of Automatic Speech Recognition with Multiple Microphones. In: INTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association, Florence, Italy, August 27-31, 2011, pp. 1281–1284, ISCA, 2011. (Type: Proceedings Article | Links)

Sarah Al-Shareef; Thomas Hain: An Investigation in Speech Recognition for Colloquial Arabic. In: INTERSPEECH, 2011. (Type: Proceedings Article)

Roger C. F. Tucker; Dan Fry; Vincent Wan; Stuart N. Wrigley; Thomas Hain: Extending Audio Notetaker to Browse WebASR Transcriptions. In: INTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association, Florence, Italy, August 27-31, 2011, pp. 3329–3330, ISCA, 2011. (Type: Proceedings Article | Links)