PhonBank Clinical English PERCEPT-R Corpus
|
Nina Benway
Communication Sciences and Disorders
Syracuse University
nbenway@syr.edu
website
|
|
Jonathan Preston
Communication Sciences and Disorders
Syracuse University
jopresto@syr.edu
website
|
|
Elaine Hitchcock
Communication Sciences and Disorders
Montclair State University
hitchcocke@montclair.edu
website
|
|
Tara McAllister
Communication Sciences and Disorders
New York University
tkm214@nyu.edu
website
|
Participants: | 280, ages 6-17, some adults |
Type of Study: | clinical, cross-sectional |
Location: | USA |
Media type: | audio |
DOI: | doi:10.21415/0JPJ-X403 |
Browsable transcripts
Phon data
CHAT data
Link to media folder
Citation information
- Benway, N. R., Preston, J. L., Hitchcock, E. R., Salekin, A.,
Sharma, H., & McAllister, T. (2022). PERCEPT-R: An Open-Access American
English Child/Clinical Speech Corpus Specialized for the Audio
Classification of /ɹ/. INTERSPEECH 2022: Proceedings of the 23rd Annual
Conference of the International Speech Communication Association (ISCA),
Incheon, Republic of Korea.
- McAllister Byun, T., Hitchcock, E. R., & Ferron, J. (2017). Masked
visual analysis: Minimizing type I error in visually guided single-case
design for communication disorders. Journal of Speech, Language, and
Hearing Research, 60(6), 1455-1466.
- McAllister Byun, T., & Campbell, H. (2016). Differential effects of
visual-acoustic biofeedback intervention for residual speech errors.
Frontiers in Human Neuroscience, 10, 567.
- McAllister Byun, T. (2017). Efficacy of visual–acoustic biofeedback
intervention for residual rhotic errors: A single-subject randomization
study. Journal of Speech, Language, and Hearing Research, 60(5),
1175-1193.
- McAllister, T., Hitchcock, E. R., & Ortiz, J. A. (2021).
Computer-assisted challenge point intervention for residual speech
errors. Perspectives of the ASHA special interest groups, 6(1), 214-229.
- Ayala, S.A., Eads, A., Kabakoff, H., Swartz, M., Shiller, D.M.,
Hill, J., Hitchcock, E.R., Preston, J.L., & McAllister, T. (Under
review). Auditory and Somatosensory Development for Speech in Later
Childhood. DOI 10.17605/OSF.IO/BKASM
- McAllister, T., Preston, J. L., Hitchcock, E. R., & Hill, J.
(2020). Protocol for correcting residual errors with spectral,
ultrasound, traditional speech therapy randomized controlled trial
(C-RESULTS RCT). BMC Pediatrics, 20(1), 1-14.
- Benway, N. R., Hitchcock, E. R., McAllister, T., Feeny, G. T.,
Hill, J., & Preston, J. L. (2021). Comparing biofeedback types for
children with residual/ɹ/errors in American English: A single-case
randomization design. American Journal of Speech-Language Pathology,
30(4), 1819-1845.
- McAllister Byun, T., Swartz, M. T., Halpin, P. F., Szeredi, D., &
Maas, E. (2016). Direction of attentional focus in biofeedback treatment
for /r/ misarticulation. International Journal of Language &
Communication Disorders, 51(4), 384-401.
- Hitchcock, E. R., McAllister Byun, T., Swartz, M., & Lazarus, R.
(2017). Efficacy of electropalatography for treating misarticulation of
/r/. American Journal of Speech-Language Pathology, 26(4), 1141-1158.
- Campbell, H., Harel, D., Hitchcock, E., & McAllister Byun, T.
(2017). Selecting an acoustic correlate for automated measurement of
American English rhotic production in children. International Journal of
Speech-Language Pathology, 20(6), 635-643.
- Hitchcock, E. R., & McAllister Byun, T. (2015). Enhancing
generalisation in biofeedback intervention using the challenge point
framework: A case study. Clinical linguistics & phonetics, 29(1), 59-75.
- Preston, J. L., Hitchcock, E. R., & Leece, M. C. (2020). Auditory
perception and ultrasound biofeedback treatment outcomes for children
with residual/ɹ/distortions: A randomized controlled trial. Journal of
Speech, Language, and Hearing Research, 63(2), 444-455.
- Preston, J. L., & Edwards, M. L. (2007). Phonological processing
skills of adolescents with residual speech sound errors. Language,
Speech, and Hearing Services in Schools, 38(4), 297-308.
- Preston, J. L., Hull, M., & Edwards, M. L. (2013). Preschool speech
error patterns predict articulation and phonological awareness outcomes
in children with histories of speech sound disorders. American Journal
of Speech-Language Pathology, 22(2), 173-184.
- McAllister, T., Eads, A., Kabakoff, H., Scott, M., Boyce, S.,
Whalen, D. H., & Preston, J. L. (2022). Baseline Stimulability Predicts
Patterns of Response to Traditional and Ultrasound Biofeedback Treatment
for Residual Speech Sound Disorder. Journal of Speech, Language, and
Hearing Research, 1-21.
- Sjolie, G. M., Leece, M. C., & Preston, J. L. (2016). Acquisition,
retention, and generalization of rhotics with and without ultrasound
visual feedback. Journal of Communication Disorders, 64, 62-77.
- Peterson, L., Savarese, C., Campbell, T., Ma, Z., Simpson, K. O., &
McAllister, T. (2022). telepractice treatment of residual rhotic errors
using app-based biofeedback: A pilot study. Language, Speech, and
Hearing Services in Schools, 53(2), 256-274.
- McAllister Byun, T., Hitchcock, E. R., & Swartz, M. T. (2014).
Retroflex versus bunched in treatment for rhotic misarticulation:
Evidence from ultrasound biofeedback intervention. Journal of Speech,
Language, and Hearing Research, 57(6), 2116-2130.
- McAllister Byun, T., & Hitchcock, E. R. (2012). Investigating the
use of traditional and spectral biofeedback approaches to intervention
for /r/ misarticulation. American Journal of Speech-Language Pathology,
21(3), 207-221.
In accordance with TalkBank rules, any use of data from this corpus
must be accompanied by at least one of the above references.
Please cite all papers relevant to the portion of the corpus you used in your own research.
Project Description
This corpus consists of 22 sub-corpora drawing on 19 published/under
review studies and three as yet unpublished studies of the development
and treatment of American English rhotics. The majority of talkers in
this corpus were children and adolescents aged 6-17; a small number of
adult talkers are also included. In some cases, participants are
characterized by typical speech production; other participants were
recruited for suspected speech sound disorder (SSD) affecting rhotic
production or for a history of SSD at preschool age. Participants were
required to speak American English as a dominant or equally dominant
language. They were recruited from sites in New York and New Jersey,
USA. Information about the number of participants in each study, their
classification, and their range of ages can be found in
these tables .
Individual-level age and sex information are provided as part of the
corpus records.
The recordings presented here are isolated syllables or words, as
well as some multi-word phrases, elicited in the context of probe lists.
Table 1 contains information about the nature of elicitation (e.g., via
imitation, reading, or picture naming). Most of the component studies
were conducted in the lab setting, with audio recorded using a
participant-worn headset (e.g., AKG C520) or lavalier mic (e.g.,
Sennheiser MKE 2). Each record in the corpus contains an orthographic
and IPA target record for the utterance in question. In lieu of IPA
transcription of the participant’s actual production, the records
contain quantitative data representing listeners’ perceptual ratings of
the accuracy of the rhotic in each utterance (maximum one rhotic per
utterance). In some cases, these ratings were provided by untrained
listeners recruited online through the Amazon Mechanical Turk
crowdsourcing platform (see McAllister Byun, 2017, for an example). In
other cases, the ratings were provided by lab-based listeners, either
speech-language pathologists or students in speech-language pathology
programs (see example in Benway et al., 2021).
Table 1 .indicates the
rater type for each component study.
In a subset of cases, participants were engaged in intervention
targeting production of American English rhotics. In these cases,
recordings are labeled to reflect the time point in the study (e.g.,
before, during, and after treatment). Various types of treatment were
provided across the component studies, including motor-based treatment
and various types of biofeedback treatment (ultrasound, visual-acoustic,
and/or electropalatographic). For studies with a treatment component,
Table 2 contains information about the type and duration of the
intervention provided.
For additional detail, see publication (1) listed below, or any of
the publications linked to sub-corpora, as indicated in
these tables .
The files that make up this corpus are formatted for use with Phon
software. An additional version of the same corpus, formatted for ease
of use in training of Automated Speech Recognition (ASR) systems, can be
found in the Derived Corpora section at
https://phon.talkbank.org/derived/.