Publications

144 entries « 3 of 3 »

2014

Madina Hasan; Rama Doddipatla; Thomas Hain: Multi-pass sentence-end detection of lecture speech. In: Li, Haizhou; Meng, Helen M.; Ma, Bin; Chng, Engsiong; Xie, Lei (Ed.): INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014, pp. 2902–2906, ISCA, 2014. (Type: Proceedings Article | Links)
Pengyuan Zhang; Yulan Liu; Thomas Hain: Semi-supervised DNN training in meeting recognition. In: 2014 IEEE Spoken Language Technology Workshop, SLT 2014, South Lake Tahoe, NV, USA, December 7-10, 2014, pp. 141–146, IEEE, 2014. (Type: Proceedings Article | Links)
Rama Doddipatla; Madina Hasan; Thomas Hain: Speaker dependent bottleneck layer training for speaker adaptation in automatic speech recognition. In: Li, Haizhou; Meng, Helen M.; Ma, Bin; Chng, Engsiong; Xie, Lei (Ed.): INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014, pp. 2199–2203, ISCA, 2014. (Type: Proceedings Article | Links)
Raymond W. M. Ng; Mortaza Doulaty; Rama Doddipatla; Wilker Aziz; Kashif Shah; Oscar Saz; Madina Hasan; Ghada AlHarbi; Lucia Specia; Thomas Hain: The USFD SLT system for IWSLT 2014. In: Federico, Marcello; Stüker, Sebastian; Yvon, François (Ed.): Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2014, Lake Tahoe, CA, USA, December 4-5, 2014, 2014. (Type: Proceedings Article | Links)
Oscar Saz; Thomas Hain: Using contextual information in joint factor eigenspace MLLR for speech recognition in diverse scenarios. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2014, Florence, Italy, May 4-9, 2014, pp. 6314–6318, IEEE, 2014. (Type: Proceedings Article | Links)
Yulan Liu; Pengyuan Zhang; Thomas Hain: Using neural network front-ends on far field multiple microphones based speech recognition. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2014, Florence, Italy, May 4-9, 2014, pp. 5542–5546, IEEE, 2014. (Type: Proceedings Article | Links)

2013

Mauro Nicolao; Fabio Tesser; Roger K. Moore: A phonetic-contrast motivated adaptation to control the degree-of-articulation on Italian HMM-based synthetic voices. In: 8th ISCA Workshop on Speech Synthesis, pp. 127–132, Barcelona, Spain, 2013. (Type: Proceedings Article | )
Erfan Loweimi; Seyed Mohammad Ahadi; Thomas Drugman: A new phase-based feature representation for robust speech recognition. In: IEEE International Conference on Acoustics, Speech and Signal Processing, Vancouver, Canada, 2013. (Type: Proceedings Article | )
Raymond W. M. Ng; Thomas Hain; Trevor Cohn: Adaptation of lecture speech recognition system with machine translation output. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2013, Vancouver, BC, Canada, May 26-31, 2013, pp. 8401–8405, IEEE, 2013. (Type: Proceedings Article | Links)
Oscar Saz; Thomas Hain: Asynchronous factorisation of speaker and background with feature transforms in speech recognition. In: Bimbot, Frédéric; Cerisara, Christophe; Fougeron, Cécile; Gravier, Guillaume; Lamel, Lori; Pellegrino, François; Perrier, Pascal (Ed.): INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013, pp. 1238–1242, ISCA, 2013. (Type: Proceedings Article | Links)
Pierre Lanchantin; Peter Bell; Mark J. F. Gales; Thomas Hain; Xunying Liu; Yanhua Long; Jennifer Quinnell; Steve Renals; Oscar Saz; Matthew Stephen Seigel; Pawel Swietojanski; Philip C. Woodland: Automatic Transcription of Multi-genre Media Archives. In: Gravier, Guillaume; Béchet, Frédéric (Ed.): Proceedings of the First Workshop on Speech, Language and Audio in Multimedia, Marseille, France, August 22-23, 2013, pp. 26–31, CEUR-WS.org, 2013. (Type: Proceedings Article | Links)
Heidi Christensen; M. B. Aniol; Peter Bell; Phil D. Green; Thomas Hain; Simon King; Pawel Swietojanski: Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech. In: Bimbot, Frédéric; Cerisara, Christophe; Fougeron, Cécile; Gravier, Guillaume; Lamel, Lori; Pellegrino, François; Perrier, Pascal (Ed.): INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013, pp. 3642–3645, ISCA, 2013. (Type: Proceedings Article | Links)
Sarah Al-Shareef: Conversational Arabic Automatic Speech Recognition: Literature Review. The University of Sheffield, 2013. (Type: Technical Report | )
D. M González; Phil D. Green; Heidi Christensen: Dysarthria Intelligibility Assessment in a Factor Analysis Total Variability Space. Interspeech’13, 2013. (Type: Conference | Links)
Heidi Christensen; Iñigo Casanueva; Stuart P. Cunningham; Phil D. Green; Thomas Hain: homeService: Voice-enabled assistive technology in the home using cloud-based automatic speech recognition. In: Alexandersson, Jan; Ljunglöf, Peter; McCoy, Kathleen F.; Portet, François; Roark, Brian; Rudzicz, Frank; Vacher, Michel (Ed.): Proceedings of the Fourth Workshop on Speech and Language Processing for Assistive Technologies, SLPAT 2013, Grenoble, France, August 21-22, 2013, pp. 29–34, Association for Computational Linguistics, 2013. (Type: Proceedings Article | Links)
Heidi Christensen; Phil D. Green; Thomas Hain: Learning speaker-specific pronunciations of disordered speech. In: Bimbot, Frédéric; Cerisara, Christophe; Fougeron, Cécile; Gravier, Guillaume; Lamel, Lori; Pellegrino, François; Perrier, Pascal (Ed.): INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013, pp. 1159–1163, ISCA, 2013. (Type: Proceedings Article | Links)
Charles Fox; Thomas Hain: Lightly supervised learning from a damaged natural speech corpus. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2013, Vancouver, BC, Canada, May 26-31, 2013, pp. 8086–8090, IEEE, 2013. (Type: Proceedings Article | Links)
Erfan Loweimi; Seyed Mohammad Ahadi; Thomas Drugman; Samira Loveymi: On the importance of pre-emphasis and window shape in phase-based speech recognition. In: Lecture Notes in Computer Science, Advances in Non-Linear Speech Processing (NOLISP), Mons, Belgium, 2013. (Type: Proceedings Article | )
Charles Fox; Yulan Liu; Erich Zwyssig; Thomas Hain: The Sheffield Wargames Corpus. In: Proc. Interspeech 2013, ISCA, 2013. (Type: Book Chapter | )
Charles Fox; Yulan Liu; Erich Zwyssig; Thomas Hain: The Sheffield Wargames Corpus. In: Bimbot, Frédéric; Cerisara, Christophe; Fougeron, Cécile; Gravier, Guillaume; Lamel, Lori; Pellegrino, François; Perrier, Pascal (Ed.): INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013, pp. 1116–1120, ISCA, 2013. (Type: Proceedings Article | Links)

2012

Heidi Christensen; Stuart P. Cunningham; Charles Fox; Phil D. Green; Thomas Hain: A comparative study of adaptive, automatic recognition of disordered speech. Proc Interspeech 2012, Portland, Oregon, US, 2012. (Type: Conference | Links)
Mauro Nicolao; Javier Latorre; Roger K. Moore: C2H: A Computational Model of H&H-based Phonetic Contrast in Synthetic Speech. In: Proceedings of 13th Annual Conference of the International Speech Communication Association, INTERSPEECH 2012, Portland, OR, 2012. (Type: Proceedings Article | )
Mauro Nicolao; Roger K. Moore: Establishing some principles of human speech production through two-dimensional computational models. In: SAPA-SCALE workshop 2012, Portland, OR, 2012. (Type: Proceedings Article | )
Sarah Al-Shareef; Thomas Hain: Conditional Random Fields Based Diacritisation of Colloquial Arabic. In: Saudi International Conference, 2012. (Type: Proceedings Article | )
Matthew Gibson; Thomas Hain: Correctness-Adjusted Unsupervised Discriminative Acoustic Model Adaptation. In: IEEE Trans. Speech Audio Process., vol. 20, no. 10, pp. 2648–2656, 2012. (Type: Journal Article | Links)
Sarah Al-Shareef; Thomas Hain: CRF-based Diacritisation of Colloquial Arabic for Automatic Speech Recognition. In: INTERSPEECH, 2012. (Type: Proceedings Article | )
Heidi Christensen; Siddharth Sehgal; Peter O’Neill; Zoe Clarke; Simon Judge; Stuart P. Cunningham; Mark S. Hawley: SPECS - an embedded platform, speech-driven environmental control system evaluated in a virtuous circle framework. Proc. Workshop on Innovation and Applications in Speech Technology, 2012. (Type: Conference | Links)
Charles Fox; Heidi Christensen; Thomas Hain: Studio report: Linux audio for multi-speaker natural speech technology. Proc. Linux Audio Conference, 2012. (Type: Conference | Links)
Thomas Hain; Lukás Burget; John Dines; Philip N. Garner; Frantisek Grézl; Asmaa El Hannani; Marijn Huijbregts; Martin Karafiát; Mike Lincoln; Vincent Wan: Transcribing Meetings With the AMIDA Systems. In: IEEE Trans. Speech Audio Process., vol. 20, no. 2, pp. 486–498, 2012. (Type: Journal Article | Links)

2011

Roger K. Moore; Mauro Nicolao: Reactive Speech Synthesis: Actively Managing Phonetic Contrast Along an H&H Continuum. In: Proceedings of the 17th International Congress of Phonetic Sciences, ICPhS 2011, pp. 1422–1425, Hong Kong, China, 2011. (Type: Proceedings Article | )
Erfan Loweimi; Seyed Mohammad Ahadi: A new group delay-based feature for robust speech recognition. In: IEEE International Conference on Multimedia and Expo (ICME), Barcelona, Spain, 2011. (Type: Proceedings Article | )
Davide Marino; Thomas Hain: An Analysis of Automatic Speech Recognition with Multiple Microphones. In: INTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association, Florence, Italy, August 27-31, 2011, pp. 1281–1284, ISCA, 2011. (Type: Proceedings Article | Links)
Sarah Al-Shareef; Thomas Hain: An Investigation in Speech Recognition for Colloquial Arabic. In: INTERSPEECH, 2011. (Type: Proceedings Article | )
Roger C. F. Tucker; Dan Fry; Vincent Wan; Stuart N. Wrigley; Thomas Hain: Extending Audio Notetaker to Browse WebASR Transcriptions. In: INTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association, Florence, Italy, August 27-31, 2011, pp. 3329–3330, ISCA, 2011. (Type: Proceedings Article | Links)
Stuart N. Wrigley; Thomas Hain: Making an Automatic Speech Recognition Service Freely Available on the Web. In: INTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association, Florence, Italy, August 27-31, 2011, pp. 3325–3326, ISCA, 2011. (Type: Proceedings Article | Links)
Erfan Loweimi; Seyed Mohammad Ahadi; Samira Loveymi: On the importance of phase and magnitude spectra in speech enhancement. In: Iranian Conference on Electrical Engineering (ICEE), Tehran, Iran, 2011. (Type: Proceedings Article | )
Erfan Loweimi; Seyed Mohammad Ahadi; Hamid Sheikhzadeh: Phase-only speech reconstruction using very short frames. In: Proceedings of the 12th Annual Conference of the International Speech Communication Association (INTERSPEECH), Florence, Italy, 2011. (Type: Proceedings Article | )
Stuart N. Wrigley; Thomas Hain: Web-Based Automatic Speech Recognition Service - webASR. In: INTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association, Florence, Italy, August 27-31, 2011, pp. 3265–3268, ISCA, 2011. (Type: Proceedings Article | Links)

2010

Erfan Loweimi; Seyed Mohammad Ahadi: Objective evaluation of magnitude and phase only spectrum-based reconstruction of the speech signal. In: International Symposium on Communications, Control and Signal Processing, Limassol, Cyprus, 2010. (Type: Proceedings Article | )
Erfan Loweimi; Seyed Mohammad Ahadi: Objective evaluation of phase and magnitude only reconstructed speech: new considerations. In: Information Sciences Signal Processing and their Applications (ISSPA), Kuala Lumpur, Malaysia, 2010. (Type: Proceedings Article | )

2008

Thomas Hain; Asmaa El Hannani; Stuart N. Wrigley; Vincent Wan: Automatic speech recognition for scientific purposes - webASR. In: INTERSPEECH 2008, 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008, pp. 504–507, ISCA, 2008. (Type: Proceedings Article | Links)

2006

Jean Carletta; Simone Ashby; Sebastien Bourban; Mike Flynn; Mael Guillemot; Thomas Hain; Jaroslav Kadlec; Vasilis Karaiskos; Wessel Kraaij; Melissa Kronenthal; Guillaume Lathoud; Mike Lincoln; Agnes Lisowska; Iain McCowan; Wilfried Post; Dennis Reidsma; Pierre Wellner: The AMI meeting corpus: A pre-announcement. In: Machine learning for multimodal interaction, pp. 28–39, Springer, 2006. (Type: Book Section | )
Steve Young; Gunnar Evermann; Mark J. F. Gales; Thomas Hain; Dan Kershaw; XA Liu; Gareth Moore; Julian Odell; Dave Ollason; Dan Povey; Valtcho Valtchev; Philip C. Woodland: The HTK book (Version 3.3, Version 3.4). 2006. (Type: Journal Article | )

2005

Iain McCowan; Jean Carletta; Wessel Kraaij; Simone Ashby; Sebastien Bourban; Mike Flynn; Maël Guillemot; Thomas Hain; Jaroslav Kadlec; Vasilis Karaiskos; et al.: The AMI meeting corpus. In: Proceedings of the 5th International Conference on Methods and Techniques in Behavioral Research, vol. 88, 2005. (Type: Journal Article | )