Natural Speech Technology

Closed

Natural Speech Technology (NST) is an EPSRC Programme Grant with the aim of significantly advancing the state of the art in speech technology by making it more natural, approaching human levels of reliability, adaptability and conversational richness. NST is a collaboration between the Centre for Speech Technology Research (CSTR) at the University of Edinburgh, the Speech Group at the University of Cambridge and the Speech and Hearing Research Group (SpandH) at the University of Sheffield.

Publications

  • [PDF] M. Doulaty, O. Saz, and T. Hain, “Data-selective Transfer Learning for Multi-Domain Speech Recognition,” in Proceedings of the 16th Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, 2015.
    [Bibtex]
    @inproceedings{doulaty15,
    address = {Dresden, Germany},
    author = {Mortaza Doulaty and Oscar Saz and Thomas Hain},
    booktitle = {{Proceedings of the 16th Annual Conference of the International Speech Communication Association (Interspeech)}},
    project = {nst},
    title = {{Data-selective Transfer Learning for Multi-Domain Speech Recognition}},
    year = {2015}
    }
  • [PDF] M. Doulaty, O. Saz, and T. Hain, “Unsupervised Domain Discovery using Latent Dirichlet Allocation for Acoustic Modelling in Speech Recognition,” in Proceedings of the 16th Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, 2015.
    [Bibtex]
    @inproceedings{doulaty15b,
    address = {Dresden, Germany},
    author = {Mortaza Doulaty and Oscar Saz and Thomas Hain},
    booktitle = {{Proceedings of the 16th Annual Conference of the International Speech Communication Association (Interspeech)}},
    project = {nst},
    title = {{Unsupervised Domain Discovery using Latent Dirichlet Allocation for Acoustic Modelling in Speech Recognition}},
    year = {2015}
    }
  • [PDF] M. Doulaty, O. Saz, R. W. M. Ng, and T. Hain, “Latent Dirichlet Allocation Based Organisation of Broadcast Media Archives for Deep Neural Network Adaptation,” in Proceedings of the 2015 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2015), Scottsdale, Arizona, USA, 2015.
    [Bibtex]
    @inproceedings{doulaty15c,
    address = {Scottsdale, Arizona, USA},
    author = {Mortaza Doulaty and Oscar Saz and Raymond W. M. Ng and Thomas Hain},
    booktitle = {{Proceedings of the 2015 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2015)}},
    project = {nst},
    title = {{Latent Dirichlet Allocation Based Organisation of Broadcast Media Archives for Deep Neural Network Adaptation}},
    year = {2015}
    }
  • M. Doulaty, O. Saz, R. W. M. Ng, and T. Hain, “Automatic Genre and Show Identification of Broadcast Media,” in Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech), San Francisco, California, USA, 2016.
    [Bibtex]
    @inproceedings{doulaty16a,
    address = {San Francisco, California, USA},
    author = {Mortaza Doulaty and Oscar Saz and Raymond W. M. Ng and Thomas Hain},
    booktitle = {{Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech)}},
    project = {nst},
    title = {{Automatic Genre and Show Identification of Broadcast Media}},
    year = {2016}
    }
  • R. Milner, O. Saz, S. Deena, M. Doulaty, R. Ng, and T. Hain, “The 2015 Sheffield System for Longitudinal Diarisation of Broadcast Media,” in Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), Scottsdale, AZ, 2015.
    [Bibtex]
    @inproceedings{milner_ASRU2015,
    address = {Scottsdale, AZ},
    author = {Rosanna Milner and Oscar Saz and Salil Deena and Mortaza Doulaty and Raymond Ng and Thomas Hain},
    booktitle = {{Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)}},
    project = {nst},
    title = {{The 2015 Sheffield System for Longitudinal Diarisation of Broadcast Media}},
    year = {2015}
    }
  • M. Hasan, R. Doddipatla, and T. Hain, “Noise-matched training of CRF based sentence end detection models,” in Interspeech 2015, 2015.
    [Bibtex]
    @inproceedings{madinainterspeech2015,
    author = {Madina Hasan and Rama Doddipatla and Thomas Hain},
    booktitle = {{Interspeech 2015}},
    project = {nst},
    title = {{Noise-matched training of CRF based sentence end detection models}},
    year = {2015}
    }
  • H. Christensen, M. B. Aniol, P. Bell, P. Green, T. Hain, S. King, and P. Swietojanski, “Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech,” in Interspeech’13, 2013.
    [Bibtex]
    @conference{Christensen2013,
    author = {H. Christensen and M. B. Aniol and P. Bell and P. Green and T. Hain and S. King and P. Swietojanski},
    booktitle = {{Interspeech{\textquoteright}13}},
    link = {http://staffwww.dcs.shef.ac.uk/people/H.Christensen/pubs/christensen_is13_2.pdf},
    project = {nst,nst-homeservice},
    title = {{Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech}},
    year = {2013}
    }
  • H. Christensen, I. Casanueva, S. Cunningham, P. Green, and T. Hain, “Automatic Selection of Speakers for Improved Acoustic Modelling : Recognition of Disordered Speech with Sparse Data,” in Spoken Language Technology Workshop, SLT’14, Lake Tahoe, 2014.
    [Bibtex]
    @conference{Christensen2014_spoken,
    address = {Lake Tahoe},
    author = {H. Christensen and I. Casanueva and S. Cunningham and P. Green and T. Hain},
    booktitle = {{Spoken Language Technology Workshop, SLT{\textquoteright}14}},
    project = {nst,nst-homeservice},
    title = {{Automatic Selection of Speakers for Improved Acoustic Modelling : Recognition of Disordered Speech with Sparse Data}},
    year = {2014}
    }
  • H. Christensen, S. Siddharth, P. O’Neill, Z. Clarke, S. Judge, S. Cunningham, and M. Hawley, “SPECS – an embedded platform, speech-driven environmental control system evaluated in a virtuous circle framework,” in Proc. Workshop on Innovation and Applications in Speech Technology, 2012.
    [Bibtex]
    @conference{Christensen_iast2012,
    author = {H. Christensen and S. Siddharth and P. O{\textquoteright}Neill and Z. Clarke and S. Judge and S. Cunningham and M. Hawley},
    booktitle = {{Proc. Workshop on Innovation and Applications in Speech Technology}},
    link = {http://www.dcs.shef.ac.uk/~heidi/pubs/iast-abstract.pdf},
    project = {nst,nst-homeservice},
    title = {{SPECS - an embedded platform, speech-driven environmental control system evaluated in a virtuous circle framework}},
    year = {2012}
    }
  • H. Christensen, S. Cunningham, C. Fox, P. Green, and T. Hain, “A comparative study of adaptive, automatic recognition of disordered speech,” in Proc Interspeech 2012, Portland, Oregon, US, 2012.
    [Bibtex]
    @conference{christensen_is12,
    address = {Portland, Oregon, US},
    author = {H. Christensen and S. Cunningham and Charles Fox and P. Green and T. Hain},
    booktitle = {{Proc Interspeech 2012}},
    link = {http://staffwww.dcs.shef.ac.uk/people/H.Christensen/pubs/christensen_is12.pdf},
    month = {Sep},
    project = {nst,nst-homeservice},
    title = {{A comparative study of adaptive, automatic recognition of disordered speech}},
    year = {2012}
    }
  • H. Christensen, P. Green, and T. Hain, “Learning speaker-specific pronunciations of disordered speech,” in Interspeech’13, 2013.
    [Bibtex]
    @conference{christensen_pron_is13,
    author = {H. Christensen and P. Green and T. Hain},
    booktitle = {{Interspeech{\textquoteright}13}},
    link = {http://staffwww.dcs.shef.ac.uk/people/H.Christensen/pubs/christensen_is13_1.pdf},
    project = {nst,nst-homeservice},
    title = {{Learning speaker-specific pronunciations of disordered speech}},
    year = {2013}
    }
  • H. Christensen, S. Cunningham, P. Green, and T. Hain, “homeService: Voice-enabled assistive technology in the home using cloud-based automatic speech recognition,” in 4th Workshop on Speech and Language Processing (SLPAT), 2013.
    [Bibtex]
    @conference{christensen_slpat13,
    author = {H. Christensen and S. Cunningham and P. Green and T. Hain},
    booktitle = {{4th Workshop on Speech and Language Processing (SLPAT)}},
    link = {http://staffwww.dcs.shef.ac.uk/people/H.Christensen/pubs/christensen_slpat13.pdf},
    project = {nst,nst-homeservice},
    title = {{homeService: Voice-enabled assistive technology in the home using cloud-based automatic speech recognition}},
    year = {2013}
    }
  • [PDF] H. Christensen, M. Nicolao, S. Cunningham, S. Deena, P. Green, and T. Hain, “Speech-Enabled Environmental Control in an AAL setting for people with Speech Disorders: a Case Study,” in IET International Conference on Technologies for Active and Assisted Living, TechAAL 2015, London, UK, 2015.
    [Bibtex]
    @inproceedings{christensen_techaal15,
    address = {London, UK},
    author = {Christensen, Heidi and Nicolao, Mauro and Cunningham, Stuart and Deena, Salil and Green, Phil and Hain, Thomas},
    booktitle = {{IET International Conference on Technologies for Active and Assisted Living, TechAAL 2015}},
    project = {nst,nst-homeservice},
    title = {{Speech-Enabled Environmental Control in an AAL setting for people with Speech Disorders: a Case Study}},
    year = {2015}
    }
  • C. Fox, H. Christensen, and T. Hain, “Studio report: Linux audio for multi-speaker natural speech technology,” in Proc. Linux Audio Conference, 2012.
    [Bibtex]
    @conference{FOX-LAC2012,
    author = {Charles Fox and H. Christensen and T. Hain},
    booktitle = {{Proc. Linux Audio Conference}},
    link = {http://staffwww.dcs.shef.ac.uk/people/C.Fox/fox_lac2012.pdf},
    project = {nst},
    title = {{Studio report: Linux audio for multi-speaker natural speech technology}},
    year = {2012}
    }
  • D. M. González, P. Green, and H. Christensen, “Dysarthria Intelligibility Assessment in a Factor Analysis Total Variability Space,” in Interspeech’13, 2013.
    [Bibtex]
    @conference{Gonzalez2013,
    author = {D. M. Gonz{\'a}lez and P. Green and H. Christensen},
    booktitle = {{Interspeech{\textquoteright}13}},
    link = {http://staffwww.dcs.shef.ac.uk/people/H.Christensen/pubs/christensen_is13_3.pdf},
    project = {nst,nst-homeservice},
    title = {{Dysarthria Intelligibility Assessment in a Factor Analysis Total Variability Space}},
    year = {2013}
    }
  • [PDF] D. Martínez, E. Lleida, P. Green, H. Christensen, A. Ortega, and A. Miguel, “Intelligibility Assessment and Speech Recognizer Word Accuracy Rate Prediction for Dysarthric Speakers in a Factor Analysis Subspace,” ACM Transactions on Accessible Computing (TACCESS), vol. 6, iss. 3, p. 10, 2015.
    [Bibtex]
    @article{martinez2015intelligibility,
    author = {Mart{\'\i}nez, David and Lleida, Eduardo and Green, Phil and Christensen, Heidi and Ortega, Alfonso and Miguel, Antonio},
    journal = {ACM Transactions on Accessible Computing (TACCESS)},
    number = {3},
    pages = {10},
    project = {nst,nst-homeservice},
    publisher = {ACM},
    title = {{Intelligibility Assessment and Speech Recognizer Word Accuracy Rate Prediction for Dysarthric Speakers in a Factor Analysis Subspace}},
    volume = {6},
    year = {2015}
    }
  • [PDF] C. W. Fox, Y. Liu, E. Zwyssig, and T. Hain, “The Sheffield Wargames Corpus,” in Proc. Interspeech 2013, ISCA, 2013.
    [Bibtex]
    @inproceedings{fox2013,
    author = {Charles W. Fox and Yulan Liu and Erich Zwyssig and Thomas Hain},
    booktitle = {{Proc. Interspeech 2013}},
    pdf = {http://staffwww.dcs.shef.ac.uk/people/Y.Liu/publications/pdf/Fox2013.pdf},
    project = {NST},
    publisher = {ISCA},
    title = {{The Sheffield Wargames Corpus}},
    year = {2013}
    }
  • [PDF] [DOI] Y. Liu, P. Zhang, and T. Hain, “Using neural network front-ends on far field multiple microphones based speech recognition,” in Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on, 2014, pp. 5542-5546.
    [Bibtex]
    @inproceedings{liu2014,
    author = {Yulan Liu and Pengyuan Zhang and Thomas Hain},
    booktitle = {{Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on}},
    doi = {10.1109/ICASSP.2014.6854663},
    month = {May},
    pages = {5542--5546},
    pdf = {http://staffwww.dcs.shef.ac.uk/people/Y.Liu/publications/pdf/Liu2014.pdf},
    project = {NST},
    title = {{Using neural network front-ends on far field multiple microphones based speech recognition}},
    year = {2014}
    }
  • [PDF] Y. Liu, P. Karanasou, and T. Hain, “An Investigation Into Speaker Informed DNN Front-end for LVCSR,” in Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on, 2015.
    [Bibtex]
    @inproceedings{liu2015,
    author = {Yulan Liu and Penny Karanasou and Thomas Hain},
    booktitle = {{Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on}},
    keyword = {speech recognition; deep neural network; speaker adaptation; speaker informed training, bias adaptation},
    month = {April},
    pdf = {http://staffwww.dcs.shef.ac.uk/people/Y.Liu/publications/pdf/Liu2015.pdf},
    project = {NST},
    title = {{An Investigation Into Speaker Informed DNN Front-end for {LVCSR}}},
    year = {2015}
    }
  • [PDF] [DOI] P. Zhang, Y. Liu, and T. Hain, “Semi-Supervised DNN Training in Meeting Recognition,” in 2014 IEEE Spoken Language Technology Workshop (SLT 2014), South Lake Tahoe, USA, 2014.
    [Bibtex]
    @inproceedings{zhang2014,
    address = {South Lake Tahoe, USA},
    author = {Pengyuan Zhang and Yulan Liu and Thomas Hain},
    booktitle = {{2014 IEEE Spoken Language Technology Workshop (SLT 2014)}},
    doi = {10.1109/SLT.2014.7078564},
    month = {December},
    pdf = {http://staffwww.dcs.shef.ac.uk/people/Y.Liu/publications/pdf/Zhang2014.pdf},
    project = {NST},
    title = {{Semi-Supervised DNN Training in Meeting Recognition}},
    year = {2014}
    }
  • C. Fox and T. Hain, “Extending Limabeam with discrimination and coarse gradients,” in INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014, 2014, pp. 2440-2444.
    [Bibtex]
    @inproceedings{DBLP:conf/interspeech/FoxH14,
    author = {Charles Fox and
    Thomas Hain},
    bibsource = {dblp computer science bibliography, http://dblp.org},
    biburl = {http://dblp.uni-trier.de/rec/bib/conf/interspeech/FoxH14},
    booktitle = {{{INTERSPEECH} 2014, 15th Annual Conference of the International Speech
    Communication Association, Singapore, September 14-18, 2014}},
    crossref = {DBLP:conf/interspeech/2014},
    link = {http://www.isca-speech.org/archive/interspeech_2014/i14_2440.html},
    pages = {2440--2444},
    project = {nst},
    timestamp = {Wed, 18 Feb 2015 08:38:47 +0100},
    title = {{Extending Limabeam with discrimination and coarse gradients}},
    year = {2014}
    }
  • C. Fox and T. Hain, “Lightly supervised learning from a damaged natural speech corpus,” in Proc. IEEE ICASSP 2013, 2013.
    [Bibtex]
    @conference{WILDCAT,
    author = {Charles Fox and T. Hain},
    booktitle = {{Proc. IEEE ICASSP 2013}},
    link = {http://staffwww.dcs.shef.ac.uk/people/C.Fox/fox_icassp13.pdf},
    project = {nst},
    title = {{Lightly supervised learning from a damaged natural speech corpus}},
    year = {2013}
    }
  • [PDF] M. Nicolao, H. Christensen, S. Cunningham, P. Green, and T. Hain, “A framework for collecting realistic recordings of dysarthric speech – the homeService corpus,” in The International Conference on Language Resources and Evaluation – LREC 2016, Portorož, SLO, 2016.
    [Bibtex]
    @inproceedings{nicolao_lrec2016,
    address = {Portorož, SLO},
    author = {Nicolao, Mauro and Christensen, Heidi and Cunningham, Stuart and Green, Phil and Hain, Thomas},
    booktitle = {{The International Conference on Language Resources and Evaluation - LREC 2016}},
    project = {nst,nst-homeservice},
    title = {{A framework for collecting realistic recordings of dysarthric speech - the homeService corpus}},
    year = {2016}
    }
  • P. Bell, M. Gales, T. Hain, J. Kilgour, P. Lanchantin, A. Liu, A. McParland, S. Renals, O. Saz, M. Wester, and P. Woodland, “The MGB Challenge: Evaluating Multi-genre Broadcast Media Recognition,” in Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), Scottsdale, AZ, 2015.
    [Bibtex]
    @inproceedings{Bell_ASRU,
    address = {Scottsdale, AZ},
    author = {Peter Bell and Mark Gales and Thomas Hain and Jonathan Kilgour and Pierre Lanchantin and Andrew Liu and Andrew McParland and Steve Renals and Oscar Saz and Mirjam Wester and Phil Woodland},
    booktitle = {{Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)}},
    project = {nst},
    title = {{The MGB Challenge: Evaluating Multi-genre Broadcast Media Recognition}},
    year = {2015}
    }
  • [PDF] O. Saz and T. Hain, “Asynchronous Factorisation of Speaker and Background with Feature Transforms in Speech Recognition,” in Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech), Lyon, France, 2013, pp. 1238-1242.
    [Bibtex]
    @inproceedings{Saz13,
    address = {Lyon, France},
    author = {Oscar Saz and Thomas Hain},
    booktitle = {{Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech)}},
    pages = {1238--1242},
    project = {nst},
    title = {{Asynchronous Factorisation of Speaker and Background with Feature Transforms in Speech Recognition}},
    year = {2013}
    }
  • P. Lanchantin, P. J. Bell, M. J. F. Gales, T. Hain, X. Liu, Y. Long, J. Quinnell, S. Renals, O. Saz, M. S. Seigel, P. Swietojanski, and P. C. Woodland, “Automatic Transcription of Multi-Genre Media Archives,” in Proceedings of the First Workshop on Speech, Language and Audio in Multimedia, Marseille, France, 2013, pp. 26-31.
    [Bibtex]
    @inproceedings{Saz13b,
    address = {Marseille, France},
    author = {P. Lanchantin and P.J. Bell and M.J.F. Gales and Thomas Hain and X. Liu and Y. Long and J. Quinnell and S. Renals and Oscar Saz and M.S. Seigel and P. Swietojanski and P.C. Woodland},
    booktitle = {{Proceedings of the First Workshop on Speech, Language and Audio in Multimedia}},
    pages = {26--31},
    project = {nst},
    title = {{Automatic Transcription of Multi-Genre Media Archives}},
    year = {2013}
    }
  • O. Saz and T. Hain, “Using Contextual Information in Joint Factor Eigenspace MLLR for Speech Recognition in Diverse Scenarios,” in Proceedings of the 2014 International Conference on Acoustic, Speech and Signal Processing (ICASSP), Florence, Italy, 2014, pp. 6314-6318.
    [Bibtex]
    @inproceedings{Saz14,
    address = {Florence, Italy},
    author = {Oscar Saz and Thomas Hain},
    booktitle = {{Proceedings of the 2014 International Conference on Acoustic, Speech and Signal Processing (ICASSP)}},
    pages = {6314--6318},
    project = {nst},
    title = {{Using Contextual Information in Joint Factor Eigenspace MLLR for Speech Recognition in Diverse Scenarios}},
    year = {2014}
    }
  • [PDF] O. Saz, M. Doulaty, and T. Hain, “Background-Tracking Acoustic Features for Genre Identification of Broadcast Shows,” in Proceedings of the 2014 Spoken Language Technology (SLT) Workshop, South Lake Tahoe NV, USA, 2014, pp. 118-123.
    [Bibtex]
    @inproceedings{Saz14b,
    address = {South Lake Tahoe NV, USA},
    author = {Oscar Saz and Mortaza Doulaty and Thomas Hain},
    booktitle = {{Proceedings of the 2014 Spoken Language Technology (SLT) Workshop}},
    pages = {118--123},
    project = {nst},
    title = {{Background-Tracking Acoustic Features for Genre Identification of Broadcast Shows}},
    year = {2014}
    }
  • O. Saz, M. Doulaty, S. Deena, R. Milner, R. Ng, M. Hasan, Y. Liu, and T. Hain, “The 2015 Sheffield System for Transcription of Multi–Genre Broadcast Media,” in Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), Scottsdale, AZ, 2015.
    [Bibtex]
    @inproceedings{Saz_ASRU,
    address = {Scottsdale, AZ},
    author = {Oscar Saz and Mortaza Doulaty and Salil Deena and Rosanna Milner and Raymond Ng and Madina Hasan and Yulan Liu and Thomas Hain},
    booktitle = {{Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)}},
    project = {nst},
    title = {{The 2015 Sheffield System for Transcription of Multi--Genre Broadcast Media}},
    year = {2015}
    }
  • R. W. M. Ng, K. Shah, W. Aziz, L. Specia, and T. Hain, “Quality estimation for ASR k-best list rescoring in spoken language translation,” in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2015.
    [Bibtex]
    @inproceedings{ng_icassp15,
    author = {Raymond W. M. Ng and Kashif Shah and Wilker Aziz and Lucia Specia and Thomas Hain},
    booktitle = {{2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)}},
    keyword = {spoken language translation, quality estimation, system integration},
    month = {April},
    project = {WFST,NST},
    title = {{Quality estimation for {ASR} k-best list rescoring in spoken language translation}},
    year = {2015}
    }
  • [PDF] R. W. M. Ng, K. Shah, L. Specia, and T. Hain, “Groupwise learning for ASR k-best list reranking in spoken language translation,” in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2016.
    [Bibtex]
    @inproceedings{ng_icassp16,
    author = {Raymond W. M. Ng and Kashif Shah and Lucia Specia and Thomas Hain},
    booktitle = {{2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)}},
    keyword = {groupwise learning, spoken language translation},
    month = {March},
    project = {WFST,NST},
    title = {{Groupwise learning for ASR k-best list reranking in spoken language translation}},
    year = {2016}
    }
  • [PDF] R. W. M. Ng, K. Shah, L. Specia, and T. Hain, “A study on the stability and effectiveness of features in quality estimation for spoken language translation,” in the 16th Annual Conference of the International Speech Communication Association (Interspeech), 2015.
    [Bibtex]
    @inproceedings{ng_is15,
    author = {Raymond W. M. Ng and Kashif Shah and Lucia Specia and Thomas Hain},
    booktitle = {{the 16th Annual Conference of the International Speech Communication Association (Interspeech)}},
    keyword = {spoken language translation, quality estimation, system robustness},
    month = {September},
    project = {WFST,NST},
    title = {{A study on the stability and effectiveness of features in quality estimation for spoken language translation}},
    year = {2015}
    }
  • [PDF] R. W. M. Ng, B. Chettri, and T. Hain, “Combining weak tokenisers for phonotactic language recognition in a resource-constrained setting,” in the 17th Annual Conference of the International Speech Communication Association (Interspeech), 2016.
    [Bibtex]
    @inproceedings{ng_is16,
    author = {Raymond W. M. Ng and Bhusan Chettri and Thomas Hain},
    booktitle = {{the 17th Annual Conference of the International Speech Communication Association (Interspeech)}},
    keyword = {language recognition},
    month = {September},
    project = {NST},
    title = {{Combining weak tokenisers for phonotactic language recognition in a resource-constrained setting}},
    year = {2016}
    }
  • [PDF] R. W. M. Ng, M. Nicolao, O. Saz, M. Hasan, B. Chettri, M. Doulaty, T. Lee, and T. Hain, “Sheffield LRE 2015 System Description,” in Odyssey: The Speaker and Language Recognition Workshop, 2016.
    [Bibtex]
    @inproceedings{ng_odyssey16,
    author = {Raymond W. M. Ng and Mauro Nicolao and Oscar Saz and Madina Hasan and Bhusan Chettri and Mortaza Doulaty and Tan Lee and Thomas Hain},
    booktitle = {{Odyssey: The Speaker and Language Recognition Workshop}},
    month = {June},
    project = {NST},
    title = {{Sheffield {LRE} 2015 System Description}},
    year = {2016}
    }
