Publications

136 entries « 2 of 3 »

2016

Rosanna Milner; Thomas Hain: DNN-Based Speaker Clustering for Speaker Diarisation. In: Morgan, Nelson (Ed.): Interspeech 2016, 17th Annual Conference of the International Speech Communication Association, San Francisco, CA, USA, September 8-12, 2016, pp. 2185–2189, ISCA, 2016. (Type: Inproceedings | Links)
Raymond W. M. Ng; Kashif Shah; Lucia Specia; Thomas Hain: Groupwise learning for ASR k-best list reranking in spoken language translation. In: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2016, Shanghai, China, March 20-25, 2016, pp. 6120–6124, IEEE, 2016. (Type: Inproceedings | Links)
Sarah Al-Shareef; Thomas Hain: Colloquialising Modern Standard Arabic Text for Improved Speech Recognition. In: Morgan, Nelson (Ed.): Interspeech 2016, 17th Annual Conference of the International Speech Communication Association, San Francisco, CA, USA, September 8-12, 2016, pp. 1345–1349, ISCA, 2016. (Type: Inproceedings | Links)
Julia Olcoz; Oscar Saz; Thomas Hain: Error Correction in Lightly Supervised Alignment of Broadcast Subtitles. In: Morgan, Nelson (Ed.): Interspeech 2016, 17th Annual Conference of the International Speech Communication Association, San Francisco, CA, USA, September 8-12, 2016, pp. 2110–2114, ISCA, 2016. (Type: Inproceedings | Links)
Mortaza Doulaty; Oscar Saz; Raymond W. M. Ng; Thomas Hain: Automatic Genre and Show Identification of Broadcast Media. In: Morgan, Nelson (Ed.): Interspeech 2016, 17th Annual Conference of the International Speech Communication Association, San Francisco, CA, USA, September 8-12, 2016, pp. 2115–2119, ISCA, 2016. (Type: Inproceedings | Links)
Salil Deena; Madina Hasan; Mortaza Doulaty; Oscar Saz; Thomas Hain: Combining Feature and Model-Based Adaptation of RNNLMs for Multi-Genre Broadcast Speech Recognition. In: Morgan, Nelson (Ed.): Interspeech 2016, 17th Annual Conference of the International Speech Communication Association, San Francisco, CA, USA, September 8-12, 2016, pp. 2343–2347, ISCA, 2016. (Type: Inproceedings | Links)
Iñigo Casanueva; Thomas Hain; Phil D. Green: Improving Generalisation to New Speakers in Spoken Dialogue State Tracking. In: Morgan, Nelson (Ed.): Interspeech 2016, 17th Annual Conference of the International Speech Communication Association, San Francisco, CA, USA, September 8-12, 2016, pp. 2726–2730, ISCA, 2016. (Type: Inproceedings | Links)
Raymond W. M. Ng; Bhusan Chettri; Thomas Hain: Combining Weak Tokenisers for Phonotactic Language Recognition in a Resource-Constrained Setting. In: Morgan, Nelson (Ed.): Interspeech 2016, 17th Annual Conference of the International Speech Communication Association, San Francisco, CA, USA, September 8-12, 2016, pp. 2939–2943, ISCA, 2016. (Type: Inproceedings | Links)
Erfan Loweimi; Jon Barker; Thomas Hain: Use of Generalised Nonlinearity in Vector Taylor Series Noise Compensation for Robust Speech Recognition. In: Morgan, Nelson (Ed.): Interspeech 2016, 17th Annual Conference of the International Speech Communication Association, San Francisco, CA, USA, September 8-12, 2016, pp. 3798–3802, ISCA, 2016. (Type: Inproceedings | Links)
Yulan Liu; Charles Fox; Madina Hasan; Thomas Hain: The Sheffield Wargame Corpus - Day Two and Day Three. In: Morgan, Nelson (Ed.): Interspeech 2016, 17th Annual Conference of the International Speech Communication Association, San Francisco, CA, USA, September 8-12, 2016, pp. 3833–3837, ISCA, 2016. (Type: Inproceedings | Links)
Ghada AlHarbi; Thomas Hain: The OpenCourseWare Metadiscourse (OCWMD) Corpus. In: Calzolari, Nicoletta; Choukri, Khalid; Declerck, Thierry; Goggi, Sara; Grobelnik, Marko; Maegaard, Bente; Mariani, Joseph; Mazo, Hélène; Moreno, Asunción; Odijk, Jan; Piperidis, Stelios (Ed.): Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, Portorož, Slovenia, May 23-28, 2016, European Language Resources Association (ELRA), 2016. (Type: Inproceedings | Links)
Mauro Nicolao; Heidi Christensen; Stuart P. Cunningham; Phil D. Green; Thomas Hain: A Framework for Collecting Realistic Recordings of Dysarthric Speech - the homeService Corpus. In: Calzolari, Nicoletta; Choukri, Khalid; Declerck, Thierry; Goggi, Sara; Grobelnik, Marko; Maegaard, Bente; Mariani, Joseph; Mazo, Hélène; Moreno, Asunción; Odijk, Jan; Piperidis, Stelios (Ed.): Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, Portorož, Slovenia, May 23-28, 2016, European Language Resources Association (ELRA), 2016. (Type: Inproceedings | Links)
Raymond W. M. Ng; Mauro Nicolao; Oscar Saz; Madina Hasan; Bhusan Chettri; Mortaza Doulaty; Tan Lee; Thomas Hain: The Sheffield language recognition system in NIST LRE 2015. In: Rodríguez-Fuentes, Luis Javier; Lleida, Eduardo (Ed.): Odyssey 2016: The Speaker and Language Recognition Workshop, Bilbao, Spain, June 21-24, 2016, pp. 181–187, ISCA, 2016. (Type: Inproceedings | Links)
Iñigo Casanueva; Thomas Hain; Mauro Nicolao; Phil D. Green: Using phone features to improve dialogue state tracking generalisation to unseen states. In: Proceedings of the SIGDIAL 2016 Conference, The 17th Annual Meeting of the Special Interest Group on Discourse and Dialogue, 13-15 September 2016, Los Angeles, CA, USA, pp. 80–89, The Association for Computer Linguistics, 2016. (Type: Inproceedings | Links)

2015

Heidi Christensen; Mauro Nicolao; Stuart P. Cunningham; Salil Deena; Phil D. Green; Thomas Hain: Speech-Enabled Environmental Control in an AAL setting for people with Speech Disorders: a Case Study. In: IET International Conference on Technologies for Active and Assisted Living, TechAAL 2015, London, UK, 2015. (Type: Inproceedings | )
Kashif Shah; Raymond W. M. Ng; Fethi Bougares; Lucia Specia: Investigating continuous space language models for machine translation quality estimation. In: 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2015. (Type: Inproceedings | )
Yulan Liu; Penny Karanasou; Thomas Hain: An Investigation Into Speaker Informed DNN Front-end for LVCSR. In: Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on, 2015. (Type: Inproceedings | )
Erfan Loweimi; Jon Barker; Thomas Hain: Emotion Recognition from Speech Signal by Effective Combination of the Generative and Discriminative Models. In: University of Sheffield Engineering Symposium, Sheffield, UK, 2015. (Type: Inproceedings | )
David Mart'inez; Eduardo Lleida; Phil D. Green; Heidi Christensen; Alfonso Ortega; Antonio Miguel: Intelligibility Assessment and Speech Recognizer Word Accuracy Rate Prediction for Dysarthric Speakers in a Factor Analysis Subspace. In: ACM Transactions on Accessible Computing (TACCESS), vol. 6, no. 3, pp. 10, 2015. (Type: Journal Article | )
Oscar Saz; Mortaza Doulaty; Salil Deena; Rosanna Milner; Raymond W. M. Ng; Madina Hasan; Yulan Liu; Thomas Hain: The 2015 sheffield system for transcription of Multi-Genre Broadcast media. In: 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015, Scottsdale, AZ, USA, December 13-17, 2015, pp. 624–631, IEEE, 2015. (Type: Inproceedings | Links)
Rosanna Milner; Oscar Saz; Salil Deena; Mortaza Doulaty; Raymond W. M. Ng; Thomas Hain: The 2015 sheffield system for longitudinal diarisation of broadcast media. In: 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015, Scottsdale, AZ, USA, December 13-17, 2015, pp. 632–638, IEEE, 2015. (Type: Inproceedings | Links)
Mortaza Doulaty; Oscar Saz; Raymond W. M. Ng; Thomas Hain: Latent Dirichlet Allocation based organisation of broadcast media archives for deep neural network adaptation. In: 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015, Scottsdale, AZ, USA, December 13-17, 2015, pp. 130–136, IEEE, 2015. (Type: Inproceedings | Links)
Peter Bell; Mark J. F. Gales; Thomas Hain; Jonathan Kilgour; Pierre Lanchantin; Xunying Liu; Andrew McParland; Steve Renals; Oscar Saz; Mirjam Wester; Philip C. Woodland: The MGB challenge: Evaluating multi-genre broadcast media recognition. In: 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015, Scottsdale, AZ, USA, December 13-17, 2015, pp. 687–693, IEEE, 2015. (Type: Inproceedings | Links)
Ghada AlHarbi; Thomas Hain: Using Topic Segmentation Models for the Automatic Organisation of MOOCs resources. In: Santos, Olga C.; Boticario, Jesus; Romero, Cristóbal; Pechenizkiy, Mykola; Merceron, Agathe; Mitros, Piotr; Luna, José María; Mihaescu, Marian Cristian; Moreno, Pablo; Hershkovitz, Arnon; Ventura, Sebastián; Desmarais, Michel C. (Ed.): Proceedings of the 8th International Conference on Educational Data Mining, EDM 2015, Madrid, Spain, June 26-29, 2015, pp. 524–527, International Educational Data Mining Society (IEDMS), 2015. (Type: Inproceedings | Links)
Yulan Liu; Penny Karanasou; Thomas Hain: An investigation into speaker informed DNN front-end for LVCSR. In: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2015, South Brisbane, Queensland, Australia, April 19-24, 2015, pp. 4300–4304, IEEE, 2015. (Type: Inproceedings | Links)
Raymond W. M. Ng; Kashif Shah; Wilker Aziz; Lucia Specia; Thomas Hain: Quality estimation for asr k-best list rescoring in spoken language translation. In: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2015, South Brisbane, Queensland, Australia, April 19-24, 2015, pp. 5226–5230, IEEE, 2015. (Type: Inproceedings | Links)
Mauro Nicolao; Amy V. Beeston; Thomas Hain: Automatic assessment of English learner pronunciation using discriminative classifiers. In: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2015, South Brisbane, Queensland, Australia, April 19-24, 2015, pp. 5351–5355, IEEE, 2015. (Type: Inproceedings | Links)
Madina Hasan; Rama Doddipatla; Thomas Hain: Noise-matched training of CRF based sentence end detection models. In: INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015, pp. 349–353, ISCA, 2015. (Type: Inproceedings | Links)
Erfan Loweimi; Jon Barker; Thomas Hain: Source-filter separation of speech signal in the phase domain. In: INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015, pp. 598–602, ISCA, 2015. (Type: Inproceedings | Links)
Raymond W. M. Ng; Kashif Shah; Lucia Specia; Thomas Hain: A study on the stability and effectiveness of features in quality estimation for spoken language translation. In: INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015, pp. 2257–2261, ISCA, 2015. (Type: Inproceedings | Links)
Mortaza Doulaty; Oscar Saz; Thomas Hain: Data-selective transfer learning for multi-domain speech recognition. In: INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015, pp. 2897–2901, ISCA, 2015. (Type: Inproceedings | Links)
Mortaza Doulaty; Oscar Saz; Thomas Hain: Unsupervised domain discovery using latent dirichlet allocation for acoustic modelling in speech recognition. In: INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015, pp. 3640–3644, ISCA, 2015. (Type: Inproceedings | Links)
Iñigo Casanueva; Thomas Hain; Heidi Christensen; Ricard Marxer; Phil D. Green: Knowledge transfer between speakers for personalised dialogue management. In: Proceedings of the SIGDIAL 2015 Conference, The 16th Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2-4 September 2015, Prague, Czech Republic, pp. 12–21, The Association for Computer Linguistics, 2015. (Type: Inproceedings | Links)
Erfan Loweimi; Mortaza Doulaty; Jon Barker; Thomas Hain: Long-Term Statistical Feature Extraction from Speech Signal and Its Application in Emotion Recognition. In: Dediu, Adrian-Horia; Martín-Vide, Carlos; Vicsi, Klára (Ed.): Statistical Language and Speech Processing - Third International Conference, SLSP 2015, Budapest, Hungary, November 24-26, 2015, Proceedings, pp. 173–184, Springer, 2015. (Type: Inproceedings | Links)
Ghada AlHarbi; Raymond W. M. Ng; Thomas Hain: Annotating meta-discourse in academic lectures from different disciplines. In: Steidl, Stefan; Batliner, Anton; Jokisch, Oliver (Ed.): ISCA International Workshop on Speech and Language Technology in Education, SLaTE 2015, Leipzig, Germany, September 4-5, 2015, pp. 161–166, ISCA, 2015. (Type: Inproceedings | Links)

2014

Pengyuan Zhang; Yulan Liu; Thomas Hain: Semi-Supervised DNN Training in Meeting Recognition. In: 2014 IEEE Spoken Language Technology Workshop (SLT 2014), South Lake Tahoe, USA, 2014. (Type: Inproceedings | Links)
Yulan Liu; Pengyuan Zhang; Thomas Hain: Using neural network front-ends on far field multiple microphones based speech recognition. In: Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on, pp. 5542-5546, 2014. (Type: Inproceedings | Links)
Charles Fox; Thomas Hain: Extending Limabeam with discrimination and coarse gradients. In: Li, Haizhou; Meng, Helen M.; Ma, Bin; Chng, Engsiong; Xie, Lei (Ed.): INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014, pp. 2440–2444, ISCA, 2014. (Type: Inproceedings | Links)
Erfan Loweimi; Jon Barker; Thomas Hain: Compression of model-based group delay function for robust speech recognition. In: University of Sheffield Engineering Symposium, Sheffield, UK, 2014. (Type: Inproceedings | )
Yulan Liu; Pengyuan Zhang; Thomas Hain: Using neural network front-ends on far field multiple microphones based speech recognition. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2014, Florence, Italy, May 4-9, 2014, pp. 5542–5546, IEEE, 2014. (Type: Inproceedings | Links)
Oscar Saz; Thomas Hain: Using contextual information in joint factor eigenspace MLLR for speech recognition in diverse scenarios. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2014, Florence, Italy, May 4-9, 2014, pp. 6314–6318, IEEE, 2014. (Type: Inproceedings | Links)
Inigo Casanueva; Heidi Christensen; Thomas Hain; Phil D. Green: Adaptive speech recognition and dialogue management for users with speech disorders. In: Li, Haizhou; Meng, Helen M.; Ma, Bin; Chng, Engsiong; Xie, Lei (Ed.): INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014, pp. 1033–1037, ISCA, 2014. (Type: Inproceedings | Links)
Rama Doddipatla; Madina Hasan; Thomas Hain: Speaker dependent bottleneck layer training for speaker adaptation in automatic speech recognition. In: Li, Haizhou; Meng, Helen M.; Ma, Bin; Chng, Engsiong; Xie, Lei (Ed.): INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014, pp. 2199–2203, ISCA, 2014. (Type: Inproceedings | Links)
Madina Hasan; Rama Doddipatla; Thomas Hain: Multi-pass sentence-end detection of lecture speech. In: Li, Haizhou; Meng, Helen M.; Ma, Bin; Chng, Engsiong; Xie, Lei (Ed.): INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014, pp. 2902–2906, ISCA, 2014. (Type: Inproceedings | Links)
Raymond W. M. Ng; Mortaza Doulaty; Rama Doddipatla; Wilker Aziz; Kashif Shah; Oscar Saz; Madina Hasan; Ghada AlHarbi; Lucia Specia; Thomas Hain: The USFD SLT system for IWSLT 2014. In: Federico, Marcello; Stüker, Sebastian; Yvon, François (Ed.): Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2014, Lake Tahoe, CA, USA, December 4-5, 2014, 2014. (Type: Inproceedings | Links)
Oscar Saz; Mortaza Doulaty; Thomas Hain: Background-tracking acoustic features for genre identification of broadcast shows. In: 2014 IEEE Spoken Language Technology Workshop, SLT 2014, South Lake Tahoe, NV, USA, December 7-10, 2014, pp. 118–123, IEEE, 2014. (Type: Inproceedings | Links)
Pengyuan Zhang; Yulan Liu; Thomas Hain: Semi-supervised DNN training in meeting recognition. In: 2014 IEEE Spoken Language Technology Workshop, SLT 2014, South Lake Tahoe, NV, USA, December 7-10, 2014, pp. 141–146, IEEE, 2014. (Type: Inproceedings | Links)
Heidi Christensen; Inigo Casanueva; Stuart P. Cunningham; Phil D. Green; Thomas Hain: Automatic selection of speakers for improved acoustic modelling: recognition of disordered speech with sparse data. In: 2014 IEEE Spoken Language Technology Workshop, SLT 2014, South Lake Tahoe, NV, USA, December 7-10, 2014, pp. 254–259, IEEE, 2014. (Type: Inproceedings | Links)

2013

Mauro Nicolao; Fabio Tesser; Roger K. Moore: A phonetic-contrast motivated adaptation to control the degree-of-articulation on Italian HMM-based synthetic voices. In: 8th ISCA Workshop on Speech Synthesis, pp. 127–132, Barcelona, Spain, 2013. (Type: Inproceedings | )
Erfan Loweimi; Seyed Mohammad Ahadi; Thomas Drugman: A new phase-based feature representation for robust speech recognition. In: IEEE International conference on Acoustics, Speech and Signal Processing, Vancouver, Canada, 2013. (Type: Inproceedings | )
136 entries « 2 of 3 »

Back to Top