The homeService Corpus

The homeService corpus is an English speech database gathered as part of the homeService project. homeService is the impact showcase for Natural Speech Technology (NST), a UK EPSRC Programme Grant project and a collaboration between the Universities of Edinburgh, Cambridge and Sheffield. The project is concerned with how speech technology can be of use to people with speech disorders and restricted upper-limb mobility.

The audio recorded during users' interactions with the homeService system consists of realistic speech data from speakers with severe dysarthria. The majority of the homeService corpus is recorded in real home environments, where voice control is often the normal means by which users interact with their devices.

The homeService corpus v1.1

The homeService corpus v1.1 is the second release of the audio recorded within the homeService project and it consists of audio recordings of dysarthric speech from 5 different subjects (three male, two female).

Speaker  Type of data  Vocabulary  Number of interactions  Duration    Annotated
F01      ER01train     32          97                      2'19"       yes
F02      ER01train     31          314                     11'58"      yes
F02      ID01train     32          364                     30'02"      yes
F02      ID01test      20          143                     9'58"       yes
M01      ER01train     31          230                     6'34"       yes
M02      ER01train     31          130                     3'16"       yes
M02      ID01test      40          1571                    1h44'44"    yes
M02      ID01train     47          5807                    6h29'40"    yes
M03      ER01train     12          114                     2'47"       yes
M03      ID01train     25          472                     36'41"      yes
M03      ID01test      14          133                     11'05"      yes
TOTAL                  131         9360                    10h07'32"

The homeService corpus v1.0

The homeService corpus v1.0 is the first release of the audio recorded within the homeService project and it consists of audio recordings of dysarthric speech from 5 different subjects (three male, two female).

Speaker  Type of data  Vocabulary  Number of interactions  Duration    Annotated
F01      ER01train     32          97                      2'19"       yes
F02      ER01train     31          314                     11'58"      yes
F02      ID01train     30          314                     25'52"      yes
F02      ID01test      16          85                      5'40"       yes
M01      ER01train     31          230                     6'34"       yes
M02      ER01train     31          130                     3'16"       yes
M02      ID01test      40          1571                    1h44'44"    yes
M02      ID01train     47          5807                    6h29'40"    yes
M03      ER01train     12          114                     2'47"       yes
M03      ID01train     18          169                     11'26"      yes
M03      ID01test      11          36                      3'00"       yes
TOTAL                  131         8867                    9h27'20"    yes
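As a sanity check on the v1.0 figures, the per-subset interaction counts and durations can be summed and compared with the TOTAL row. The sketch below is only illustrative: the row values are copied from the table under the assumption that each row splits into a two-digit vocabulary size, an interaction count and a duration, and the names `ROWS_V1_0`, `to_seconds` and `format_duration` are hypothetical helpers, not part of the corpus distribution.

```python
import re

# (speaker, type of data, number of interactions, duration) for v1.0.
# The column split is an assumption made when reading the table.
ROWS_V1_0 = [
    ("F01", "ER01train", 97, "2'19\""),
    ("F02", "ER01train", 314, "11'58\""),
    ("F02", "ID01train", 314, "25'52\""),
    ("F02", "ID01test", 85, "5'40\""),
    ("M01", "ER01train", 230, "6'34\""),
    ("M02", "ER01train", 130, "3'16\""),
    ("M02", "ID01test", 1571, "1h44'44\""),
    ("M02", "ID01train", 5807, "6h29'40\""),
    ("M03", "ER01train", 114, "2'47\""),
    ("M03", "ID01train", 169, "11'26\""),
    ("M03", "ID01test", 36, "3'00\""),
]

DURATION = re.compile(r"(?:(\d+)h)?(\d+)'(\d+)\"")


def to_seconds(duration: str) -> int:
    """Convert a string such as 1h44'44" or 2'19" to whole seconds."""
    match = DURATION.fullmatch(duration)
    if match is None:
        raise ValueError(f"unrecognised duration: {duration!r}")
    hours, minutes, seconds = match.groups()
    return int(hours or 0) * 3600 + int(minutes) * 60 + int(seconds)


def format_duration(total_seconds: int) -> str:
    """Format whole seconds in the table's h/'/" notation."""
    hours, remainder = divmod(total_seconds, 3600)
    minutes, seconds = divmod(remainder, 60)
    return f"{hours}h{minutes:02d}'{seconds:02d}\""


total_interactions = sum(row[2] for row in ROWS_V1_0)
total_seconds = sum(to_seconds(row[3]) for row in ROWS_V1_0)

print(total_interactions)              # 8867
print(format_duration(total_seconds))  # 9h27'16"
```

The interaction count reproduces the listed total of 8867. The summed duration comes out a few seconds below the listed 9h27'20", which is presumably (an assumption, not stated in the source) an effect of rounding the per-row durations to whole seconds.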

 

Each subject’s set is composed of two subsets: enrolment data (ER) and interaction data (ID).

  • ER is obtained by the user reading lists of the words that they have chosen as commands in their system. To match the acoustic conditions in the user’s home, the recording takes place in the same environment in which the system is supposed to function. As the user is reading from a list, the resulting speech is less natural but still effective for initial training.
  • ID is recorded as the user operates the electronic devices in their house with the homeService speech-enabled interface. Recording starts after the user presses a switch, and the microphone stays open for a predefined number of seconds. In contrast with the ER data, each word produced is chosen autonomously by the user.

Project team

Mauro Nicolao, Heidi Christensen, Stuart Cunningham, Phil Green, Thomas Hain

Data example

Annotation

The annotation is provided in HTK STM format, one segment per line:

Filename Mic SpeakerID startTime endTime <Mic,SesId,Lang,Impair,level,intel,purpose> Transcription

hom-F01ER01MCW0000003000003 MC F01 0.00 2.55 <MC,ER01,GBEng,CP,SE,LL,a55,ER01train> delete
hom-M02ID01MC20150309104753 MC M02 0.00 3.00 <MC,ID01,GBEng,MND,MO,MM,a75,ID01train> skysportone
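A minimal sketch of how such annotation lines could be read in Python follows. It assumes the layout shown above (five whitespace-separated fields, a comma-separated label set in angle brackets, then the transcription). Note that the header names seven label fields while the example lines carry eight values, so the sketch keeps the labels as a plain list rather than guessing a name for each position. `StmSegment` and `parse_stm_line` are hypothetical names introduced for illustration; they are not part of any tool shipped with the corpus.

```python
import re
from dataclasses import dataclass
from typing import List

# Assumed line layout:
# filename mic speaker start end <label,label,...> transcription
STM_LINE = re.compile(
    r"^(?P<filename>\S+)\s+(?P<mic>\S+)\s+(?P<speaker>\S+)\s+"
    r"(?P<start>\d+(?:\.\d+)?)\s+(?P<end>\d+(?:\.\d+)?)\s+"
    r"<(?P<labels>[^>]*)>\s*(?P<text>.*)$"
)


@dataclass
class StmSegment:
    filename: str
    mic: str
    speaker: str
    start: float
    end: float
    labels: List[str]  # raw label values, in file order
    transcription: str

    @property
    def duration(self) -> float:
        """Segment length in seconds."""
        return self.end - self.start


def parse_stm_line(line: str) -> StmSegment:
    """Parse one annotation line; raise ValueError if it does not match."""
    match = STM_LINE.match(line.strip())
    if match is None:
        raise ValueError(f"not a recognised annotation line: {line!r}")
    return StmSegment(
        filename=match["filename"],
        mic=match["mic"],
        speaker=match["speaker"],
        start=float(match["start"]),
        end=float(match["end"]),
        labels=[label.strip() for label in match["labels"].split(",")],
        transcription=match["text"].strip(),
    )


EXAMPLE_LINES = [
    "hom-F01ER01MCW0000003000003 MC F01 0.00 2.55 "
    "<MC,ER01,GBEng,CP,SE,LL,a55,ER01train> delete",
    "hom-M02ID01MC20150309104753 MC M02 0.00 3.00 "
    "<MC,ID01,GBEng,MND,MO,MM,a75,ID01train> skysportone",
]

segments = [parse_stm_line(line) for line in EXAMPLE_LINES]
for seg in segments:
    # The last label is the data subset in both example lines.
    print(seg.speaker, seg.labels[-1], f"{seg.duration:.2f}s", seg.transcription)
```

Keeping the labels as a list avoids committing to field names that the header does not fully specify; a caller who knows the exact label order can map positions to names afterwards.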

Audio

Audio data is provided in the standard MS-WAVE mono format at 16 kHz and 16 bit. It was recorded with a 6-channel Microcone microphone array at a 48 kHz sampling rate and 32-bit resolution (these streams are available but not distributed in the current release). The 16 kHz signal is the beam-formed combination of the 6 channels, computed by the beamformer embedded in the Microcone hardware.

All audio (ER and ID) was recorded in a real home environment.
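Before feeding the distributed files to a recogniser, it can be useful to confirm that each one matches the stated format (mono, 16 kHz, 16 bit). The following is a small sketch using Python's standard `wave` module; the function name `check_wav_format` and its return structure are illustrative choices, and the path in the usage note is a placeholder.

```python
import wave


def check_wav_format(source, rate=16000, sample_width_bytes=2, channels=1):
    """Read a WAV header and compare it with the expected format.

    `source` may be a file path or an open binary file object.
    Returns a dict with the header values, the duration in seconds,
    and an `ok` flag that is True when all three properties match.
    """
    with wave.open(source, "rb") as wav:
        found_rate = wav.getframerate()
        found_width = wav.getsampwidth()  # bytes per sample: 2 means 16 bit
        found_channels = wav.getnchannels()
        n_frames = wav.getnframes()
    return {
        "rate": found_rate,
        "sample_width_bytes": found_width,
        "channels": found_channels,
        "duration_s": n_frames / found_rate,
        "ok": (found_rate, found_width, found_channels)
        == (rate, sample_width_bytes, channels),
    }
```

A call such as `check_wav_format("path/to/file.wav")` returns the header values together with `ok`, so files that deviate from the expected format can be listed before training or evaluation.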

License

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License

An agreement with the University of Sheffield has to be signed to use the data.

Due to the sensitive nature of the data and the obligation to participant confidentiality, the audio of the homeService corpus cannot be redistributed under any circumstances.

Download

To download the homeService corpus please send a request to homeservice-group@sheffield.ac.uk

Figshare link

Download page

Citation

M. Nicolao, H. Christensen, S. Cunningham, P. Green, and T. Hain, The homeService corpus v. 1.0, University of Sheffield at http://mini.dcs.shef.ac.uk/resources/homeservice-corpus, 2016, doi: 10.15131/shef.data.3116833

Publications

2016

  • [PDF] M. Nicolao, H. Christensen, S. Cunningham, P. Green, and T. Hain, “A framework for collecting realistic recordings of dysarthric speech – the homeService corpus,” in The International Conference on Language Resources and Evaluation – LREC 2016, Portorož, SLO, 2016.
    [Bibtex]
    @inproceedings{nicolao_lrec2016,
    address = {Portorož, SLO},
    author = {Nicolao, Mauro and Christensen, Heidi and Cunningham, Stuart and Green, Phil and Hain, Thomas},
    booktitle = {{The International Conference on Language Resources and Evaluation - LREC 2016}},
    project = {nst,nst-homeservice},
    title = {{A framework for collecting realistic recordings of dysarthric speech - the homeService corpus}},
    year = {2016}
    }

2015

  • [PDF] H. Christensen, M. Nicolao, S. Cunningham, S. Deena, P. Green, and T. Hain, “Speech-Enabled Environmental Control in an AAL setting for people with Speech Disorders: a Case Study,” in IET International Conference on Technologies for Active and Assisted Living, TechAAL 2015, London, UK, 2015.
    [Bibtex]
    @inproceedings{christensen_techaal15,
    address = {London, UK},
    author = {Christensen, Heidi and Nicolao, Mauro and Cunningham, Stuart and Deena, Salil and Green, Phil and Hain, Thomas},
    booktitle = {{IET International Conference on Technologies for Active and Assisted Living, TechAAL 2015}},
    project = {nst,nst-homeservice},
    title = {{Speech-Enabled Environmental Control in an AAL setting for people with Speech Disorders: a Case Study}},
    year = {2015}
    }
  • [PDF] D. Martínez, E. Lleida, P. Green, H. Christensen, A. Ortega, and A. Miguel, “Intelligibility Assessment and Speech Recognizer Word Accuracy Rate Prediction for Dysarthric Speakers in a Factor Analysis Subspace,” ACM Transactions on Accessible Computing (TACCESS), vol. 6, iss. 3, p. 10, 2015.
    [Bibtex]
    @article{martinez2015intelligibility,
    author = {Mart{\'\i}nez, David and Lleida, Eduardo and Green, Phil and Christensen, Heidi and Ortega, Alfonso and Miguel, Antonio},
    journal = {ACM Transactions on Accessible Computing (TACCESS)},
    number = {3},
    pages = {10},
    project = {nst,nst-homeservice},
    publisher = {ACM},
    title = {{Intelligibility Assessment and Speech Recognizer Word Accuracy Rate Prediction for Dysarthric Speakers in a Factor Analysis Subspace}},
    volume = {6},
    year = {2015}
    }
  • [PDF] I. Casanueva, T. Hain, H. Christensen, R. Marxer, and P. Green, “Knowledge transfer between speakers for personalised dialogue management,” in Proceedings of SIGDial, Prague, Czech Republic, 2015.
    [Bibtex]
    @inproceedings{casanueva:15,
    address = {Prague, Czech Republic},
    author = {Inigo Casanueva and Thomas Hain and Heidi Christensen and Ricard Marxer and Phil Green},
    booktitle = {{Proceedings of SIGDial}},
    pdf = {http://staffwww.dcs.shef.ac.uk/people/i.casanueva/pdfs/SIGDIAL15_casanueva.pdf},
    project = {nst-homeService},
    title = {{Knowledge transfer between speakers for personalised dialogue management}},
    year = {2015}
    }

2014

  • H. Christensen, I. Casanueva, S. Cunningham, P. Green, and T. Hain, “Automatic Selection of Speakers for Improved Acoustic Modelling: Recognition of Disordered Speech with Sparse Data,” in Spoken Language Technology Workshop, SLT’14, Lake Tahoe, 2014.
    [Bibtex]
    @conference{Christensen2014_spoken,
    address = {Lake Tahoe},
    author = {H. Christensen and I. Casanueva and S. Cunningham and P. Green and T. Hain},
    booktitle = {{Spoken Language Technology Workshop, SLT{\textquoteright}14}},
    project = {nst,nst-homeservice},
    title = {{Automatic Selection of Speakers for Improved Acoustic Modelling : Recognition of Disordered Speech with Sparse Data}},
    year = {2014}
    }
  • [PDF] I. Casanueva, H. Christensen, T. Hain, and P. Green, “Adaptive speech recognition and dialogue management for users with speech disorders,” in Proceedings of Interspeech, Singapore, Singapore, 2014.
    [Bibtex]
    @inproceedings{casanueva:14,
    address = {Singapore, Singapore},
    author = {Inigo Casanueva and Heidi Christensen and Thomas Hain and Phil Green},
    booktitle = {{Proceedings of Interspeech}},
    pdf = {http://staffwww.dcs.shef.ac.uk/people/i.casanueva/pdfs/IS14_casanueva.pdf},
    project = {nst-homeService},
    title = {{Adaptive speech recognition and dialogue management for users with speech disorders}},
    year = {2014}
    }
  • [PDF] H. Christensen, I. Casanueva, S. Cunningham, P. Green, and T. Hain, “Automatic selection of speakers for improved acoustic modelling: recognition of disordered speech with sparse data,” in Proceedings of SLT, Nevada, USA, 2014.
    [Bibtex]
    @inproceedings{christensen:14,
    address = {Nevada, USA},
    author = {Heidi Christensen and Inigo Casanueva and Stuart Cunningham and Phil Green and Thomas Hain},
    booktitle = {{Proceedings of SLT}},
    pdf = {http://staffwww.dcs.shef.ac.uk/people/i.casanueva/pdfs/SLT14_hc.pdf},
    project = {nst-homeService},
    title = {{Automatic selection of speakers for improved acoustic modelling: recognition of disordered speech with sparse data}},
    year = {2014}
    }

2013

  • H. Christensen, M. B. Aniol, P. Bell, P. Green, T. Hain, S. King, and P. Swietojanski, “Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech,” in Interspeech’13, 2013.
    [Bibtex]
    @conference{Christensen2013,
    author = {H. Christensen and M. B. Aniol and Bell, P. and P. Green and T. Hain and S. King and P Swietojanski},
    booktitle = {{Interspeech{\textquoteright}13}},
    link = {http://staffwww.dcs.shef.ac.uk/people/H.Christensen/pubs/christensen_is13_2.pdf},
    project = {nst,nst-homeservice},
    title = {{Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech}},
    year = {2013}
    }
  • H. Christensen, P. Green, and T. Hain, “Learning speaker-specific pronunciations of disordered speech,” in Interspeech’13, 2013.
    [Bibtex]
    @conference{christensen_pron_is13,
    author = {H. Christensen and P. Green and T. Hain},
    booktitle = {{Interspeech{\textquoteright}13}},
    link = {http://staffwww.dcs.shef.ac.uk/people/H.Christensen/pubs/christensen_is13_1.pdf},
    project = {nst,nst-homeservice},
    title = {{Learning speaker-specific pronunciations of disordered speech}},
    year = {2013}
    }
  • H. Christensen, S. Cunningham, P. Green, and T. Hain, “homeService: Voice-enabled assistive technology in the home using cloud-based automatic speech recognition,” in 4th Workshop on Speech and Language Processing (SLPAT), 2013.
    [Bibtex]
    @conference{christensen_slpat13,
    author = {H. Christensen and S. Cunningham and P. Green and T. Hain},
    booktitle = {{4th Workshop on Speech and Language Processing (SLPAT)}},
    link = {http://staffwww.dcs.shef.ac.uk/people/H.Christensen/pubs/christensen_slpat13.pdf},
    project = {nst,nst-homeservice},
    title = {{homeService: Voice-enabled assistive technology in the home using cloud-based automatic speech recognition}},
    year = {2013}
    }
  • D. M. González, P. Green, and H. Christensen, “Dysarthria Intelligibility Assessment in a Factor Analysis Total Variability Space,” in Interspeech’13, 2013.
    [Bibtex]
    @conference{Gonzalez2013,
    author = {D. M Gonz{\'a}lez and P. Green and H. Christensen},
    booktitle = {{Interspeech{\textquoteright}13}},
    link = {http://staffwww.dcs.shef.ac.uk/people/H.Christensen/pubs/christensen_is13_3.pdf},
    project = {nst,nst-homeservice},
    title = {{Dysarthria Intelligibility Assessment in a Factor Analysis Total Variability Space}},
    year = {2013}
    }
  • [PDF] H. Christensen, I. Casanueva, S. Cunningham, P. Green, and T. Hain, “HomeService: Voice-enabled assistive technology in the home using cloud-based automatic speech recognition,” in Proceedings of SLPAT, Grenoble, France, 2013.
    [Bibtex]
    @inproceedings{christensen:13,
    address = {Grenoble, France},
    author = {Heidi Christensen and Inigo Casanueva and Stuart Cunningham and Phil Green and Thomas Hain},
    booktitle = {{Proceedings of SLPAT}},
    pdf = {http://staffwww.dcs.shef.ac.uk/people/i.casanueva/pdfs/SLPAT13_hc.pdf},
    project = {nst-homeService},
    title = {{HomeService: Voice-enabled assistive technology in the home using cloud-based automatic speech recognition}},
    year = {2013}
    }

2012

  • H. Christensen, S. Siddharth, P. O’Neill, Z. Clarke, S. Judge, S. Cunningham, and M. Hawley, “SPECS – an embedded platform, speech-driven environmental control system evaluated in a virtuous circle framework,” in Proc. Workshop on Innovation and Applications in Speech Technology, 2012.
    [Bibtex]
    @conference{Christensen_iast2012,
    author = {H. Christensen and S. Siddharth and P. O{\textquoteright}Neill and Z. Clarke and S. Judge and S. Cunningham and M. Hawley},
    booktitle = {{Proc. Workshop on Innovation and Applications in Speech Technology}},
    link = {http://www.dcs.shef.ac.uk/~heidi/pubs/iast-abstract.pdf},
    project = {nst,nst-homeservice},
    title = {{SPECS - an embedded platform, speech-driven environmental control system evaluated in a virtuous circle framework}},
    year = {2012}
    }
  • H. Christensen, S. Cunningham, C. Fox, P. Green, and T. Hain, “A comparative study of adaptive, automatic recognition of disordered speech,” in Proc Interspeech 2012, Portland, Oregon, US, 2012.
    [Bibtex]
    @conference{christensen_is12,
    address = {Portland, Oregon, US},
    author = {H. Christensen and S. Cunningham and Charles Fox and P. Green and T. Hain},
    booktitle = {{Proc Interspeech 2012}},
    link = {http://staffwww.dcs.shef.ac.uk/people/H.Christensen/pubs/christensen_is12.pdf},
    month = {Sep},
    project = {nst,nst-homeservice},
    title = {{A comparative study of adaptive, automatic recognition of disordered speech}},
    year = {2012}
    }
