Infrastructural theory of human language and its origins
Oller, D. K. The Emergence of the Speech Capacity (2000). Erlbaum. This project and its publications provide a theoretical framework for all the other
projects in the laboratory, and a basis for collaboration with a wide variety of additional
institutions and scholars. The key idea is that language is founded in capabilities
that emerge systematically and in an ordered fashion starting in very early infancy.
Primitive capabilities to vocalize and use vocalization as well as to express affect
facially and through bodily movements form the basis for more advanced capabilities
that are built upon them.
Oller, D. K., Dale, R., Griebel, U. (2016). New frontiers in language evolution and
development: the Editors’ Introduction. In New Frontiers in Language Evolution and
Development Edited by: Wayne D. Gray. Topics in Cognitive Science (topiCS), 8(2),
353-360. Abstract: This article introduces the Special Issue and its focus on research in language
evolution with emphasis on theory as well as computational and robotic modeling. A
key theme is based on the growth of evolutionary developmental biology or evo-devo.
The Special Issue consists of 13 articles organized in two sections: A) Theoretical
foundations and B) Modeling and simulation studies. All the papers are interdisciplinary
in nature, encompassing work in biological and linguistic foundations for the study
of language evolution as well as a variety of computational and robotic modeling efforts
shedding light on how language may be developed and may have evolved.
Oller, D. K., Griebel, U., & Warlaumont, A. S. (2016). Vocal development as a guide
to modeling the evolution of language. In New Frontiers in Language Evolution and
Development Edited by: Wayne D. Gray. Topics in Cognitive Science (topiCS), 8(2),
382-392. Abstract: Modeling of evolution and development of language has principally utilized
mature units of spoken language, phonemes and words, as both targets and inputs. This
approach cannot address the earliest phases of development because young infants are
unable to produce such language features. We argue that units of early vocal development—protophones
and their primitive illocutionary/perlocutionary forces—should be targeted in evolutionary
modeling because they suggest likely units of hominin vocalization/communication shortly
after the split from the chimpanzee/bonobo lineage, and because early development
of spontaneous vocal capability is a logically necessary step toward vocal language,
a root capability without which other crucial steps toward vocal language capability
are impossible. Modeling of language evolution/development must account for dynamic
change in early communicative units of form/function across time. We argue for interactive
contributions of sender/infants and receiver/caregivers in a feedback loop involving
both development and evolution and propose to begin computational modeling at the
hominin break from the primate communicative background.
Empirical work on origins of communication in humans and animals (human infants and
children, bonobos, dog, wolf, and squid)
Griebel, U. & Oller, D. K. (2008). Evolutionary forces favoring contextual flexibility.
In Oller, D.K. and Griebel, U. (editors) Evolution of Communicative Flexibility: Complexity,
Creativity, and Adaptability in Human and Animal Communication, MIT Press pp. 9-40.
This chapter defines “signal flexibility” and “functional flexibility” as two kinds
of contextual flexibility in communication. It evaluates the environmental/social
conditions and communication types that seem to favor selection for variability or
complexity in communication systems and which may result in signal and functional
flexibility. The chapter also presents clear cases of actions such as camouflage and
deception, and proposes that some degree of signal flexibility is a prerequisite for
functional flexibility. It analyzes the occurrence of the various mapping options
for signals and functions in nonhumans from invertebrates to primates, and suggests
that human communicative flexibility first emerged under pressures of social cohesion.
Griebel, U., & Oller, D. K. (2014). Origins of language in a comparative perspective.
In P. J. Lafreniere & G. Weisfeld (Eds.), Evolutionary Science of Human Behavior:
An Interdisciplinary Approach. Linus Learning, Rokokoma, NY. 257-280. Since our cognitive abilities have evolved from those of our ancestors, we suggest
a comparative approach to enlightening us about language origins, examining animal
language learning studies to determine what animals actually can come to understand
about human language. This approach may lead to better understanding of the cognitive
underpinnings of language. It will definitely help to pinpoint features that seem
to be uniquely involved in human language, ones that may be present to lesser extents
in animal communication, as well as ones that are shared across many species.
Oller, D. K. & Griebel, U. (2014). On Quantitative Comparative Research in Communication
and Language Evolution. Biological Theory, 9, 2, 296-308. doi: 10.1007/s13752-014-0186-7.
NIHMSID #613148. PMCID: PMC4179202. Abstract: Quantitative comparison of human language and natural animal communication
requires improved conceptualizations. We argue that an infrastructural approach to
development and evolution incorporating an extended interpretation of the distinctions
among illocution, perlocution, and meaning (Austin 1962; Oller and Griebel 2008) can
help place the issues relevant to quantitative comparison in perspective. The approach
can illuminate the controversy revolving around the notion of functional referentiality
as applied to alarm calls, for example in the vervet monkey. We argue that referentiality
offers a poor point of quantitative comparison across language and animal communication
in the wild. Evidence shows that even newborn human cry could be deemed to show functional
referentiality according to the criteria typically invoked by advocates of referentiality
in animal communication. Exploring the essence of the idea of illocution, we illustrate
an important realm of commonality among animal communication systems and human language,
a commonality that opens the door to more productive, quantifiable comparisons. Finally,
we delineate two examples of infrastructural communicative capabilities that should
be particularly amenable to direct quantitative comparison across humans and our closest
relatives.
Griebel, U., Pepperberg, I., Oller, D. K. (2016). Developmental Plasticity and Language:
A Comparative Perspective. In New Frontiers in Language Evolution and Development
Edited by: Wayne D. Gray. Topics in Cognitive Science (topiCS). 8(2), 435-445. Abstract: The growing field of evo-devo is increasingly demonstrating the complexity
of steps involved in genetic, intracellular regulatory, and extracellular environmental
control of the development of phenotypes. A key result of such work is an account
for the remarkable plasticity of organismal form in many species based on relatively
minor changes in regulation of highly conserved genes and genetic processes. Accounting
for behavioral plasticity is of similar potential interest but has received far less
attention. Of particular interest is plasticity in communication systems, where human
language represents an ultimate target for research. The present paper considers plasticity
of language capabilities in a comparative framework, focusing attention on examples
of a remarkable fact: Whereas there exist design features of mature human language
that have never been observed to occur in non-humans in the wild, many of these features
can be developed to notable extents when non-humans are enculturated through human
training (especially with intensive social interaction). These examples of enculturated
developmental plasticity across extremely diverse taxa suggest, consistent with the
evo-devo theme of highly conserved processes in evolution, that human language is
founded in part on cognitive capabilities that are indeed ancient and that even modern
humans show self-organized emergence of many language capabilities in the context
of rich enculturation, built on the special social/ecological history of the hominin
line. Human culture can thus be seen as a regulatory system encouraging language development
in the context of a cognitive background with many highly conserved features.
Tchernichovski, O. & Oller, D. K. (2016) Vocal Development: How marmoset infants express
their feelings. Current Biology. 26(10), R422-R424. Abstract: A new study shows that vocal sequences produced by newborn marmoset monkeys
are driven by slow fluctuations in physiological state; the results shed light on
the evolution of vocal communication between newborns and parents.
Griebel U., Oller D.K. (2012) Vocabulary learning in a Yorkshire terrier: Slow mapping
of spoken words. PLoS ONE. 7(2): e30182. doi:10.1371/journal.pone.0030182. Abstract: Rapid vocabulary learning in children has been attributed to "fast mapping",
with new words often claimed to be learned through a single presentation. As reported
in 2004 in Science a border collie (Rico) not only learned to identify more than 200
words, but fast mapped the new words, remembering meanings after just one presentation.
Our research tests the fast mapping interpretation of the Science paper based on Rico's
results, while extending the demonstration of large vocabulary recognition to a lap
dog. We tested a Yorkshire terrier (Bailey) with the same procedures as Rico, illustrating
that Bailey accurately retrieved randomly selected toys from a set of 117 on voice
command of the owner. Second we tested her retrieval based on two additional voices,
one male, one female, with different accents that had never been involved in her training,
again showing she was capable of recognition by voice command. Third, we did both
exclusion-based training of new items (toys she had never seen before with names she
had never heard before) embedded in a set of known items, with subsequent retention
tests designed as in the Rico experiment. After Bailey succeeded on exclusion and
retention tests, a crucial evaluation of true mapping tested items previously successfully
retrieved in exclusion and retention, but now pitted against each other in a two-choice
task. Bailey failed on the true mapping task repeatedly, illustrating that the claim
of fast mapping in Rico had not been proven, because no true mapping task had ever
been conducted with him. It appears that the task called retention in the Rico study
only demonstrated success in retrieval by a process of extended exclusion.
Studies of communication development in typically developing human infants
Oller, D. K., Buder, E. H., Ramsdell, H. L., Warlaumont, A. S., Chorna, L., & Bakeman,
R. (2013). Functional flexibility of infant vocalization and the emergence of language.
Proceedings of the National Academy of Sciences. 110 (16) 6318-6323. doi:10.1073/pnas.1300337110/-/DCSupplemental.
PMCID: PMC3631625. Abstract: We report on the emergence of functional flexibility in vocalizations of
human infants. This vastly underappreciated capability becomes apparent when prelinguistic
vocalizations express a full range of emotional content—positive, neutral, and negative.
The data show that at least three types of infant vocalizations (squeals, vowel-like
sounds, and growls) occur with this full range of expression by 3–4 months of age.
In contrast, infant cry and laughter, which are species-specific signals apparently
homologous to vocal calls in other primates, show functional stability, with cry overwhelmingly
expressing negative and laughter positive emotional states. Functional flexibility
is a sine qua non in spoken language, because all words or sentences can be produced
as expressions of varying emotional states and because learning conventional "meanings"
requires the ability to produce sounds that are free of any predetermined function.
Functional flexibility is a defining characteristic of language, and empirically it
appears before syntax, word learning, and even earlier-developing features presumed
to be critical to language (e.g., joint attention, syllable imitation, and canonical
babbling). The appearance of functional flexibility early in the first year of human
life is a critical step in the development of vocal language and may have been a critical
step in the evolution of human language, preceding protosyntax and even primitive
single words. Such flexible affect expression of vocalizations has not yet been reported
for any non-human primate but if found to occur would suggest deep roots for functional
flexibility of vocalization in our primate heritage.
Nathani Iyer, S., Denson, H., Lazar, N. & Oller, D. K. (2016) Volubility of the human
infant: Effects of parental interaction (or lack of it). Clinical Linguistic & Phonetics.
DOI:10.3109/02699206.2016.1147082, 80747, PMC4902155. Abstract: Although parental volubility, or amount of talk, has received considerable
recent attention, infant volubility has received comparatively little attention despite
its potential significance for communicative risk status and later linguistic and
cognitive outcomes. Volubility of 16 typically developing infants from 2 to 11 months
of age was longitudinally investigated in the present study across three social circumstances:
parent talking to infant, parent not talking to infant and parent talking to interviewer
while the infant was in the room. Results indicated that volubility was least in the
Interview circumstance. There were no significant differences in volubility between
the parent Talk and No Talk circumstances. Volubility was found to reduce with age.
These results suggest that infants vocalize in a variety of circumstances, even when
no one talks to or interacts with them. The presence of a stranger or perhaps overhearing
adults speaking to each other, however, may significantly reduce infant volubility.
Jhang. Y., & Oller, D. K (2017). Emergence of Functional Flexibility in Early Infant
Vocalization of the First Three Months: Analyses of laboratory recordings. Frontiers
in Psychology. 8, DOI:10.3389/fpsyg.2017.00300. Abstract: Functional flexibility, as manifest in the use of any word or sentence to
express different affective valences on different occasions, is required in linguistic
communication and can be said to be an infrastructural property of language. Early
infant vocalizations (protophones), believed to be precursors to speech, occur in
the first month and are functionally different from non-speech-like signals (e.g.,
cries and laughs). Oller et al. (2013) showed that infants by 3 months used three
different protophone types with a full range of affect as manifest in facial expression,
from positive to neutral to negative. These differences in affect were also shown
to correspond to different illocutionary functions, unlike fixed signals, or vegetative
sounds, which showed functional rigidity. The present study investigated whether infants
show functional flexibility in protophones even earlier than the ages studied by Oller
et al. (2013). Data were obtained from 6 infants across the first 3 months. Results
showed that as early as the first month, infant protophones were already accompanied
by variable facial affect valences and continued to be affectively flexible at the
later ages. The present study thus documents the very early emergence of an infrastructural
property of human communication.
Yoo, H., Bidelman, G., Buder, E. H., van Mersbergen, M., Oller, D. K. (2017). Differentiating
infant cry from non-cry vocalizations based on perception of negativity and acoustic
features. The Journal of the Acoustical Society of America. 141. 3748-3749. Abstract: This study seeks to determine how human listeners discriminate cry vs. non-cry
sounds by investigating acoustic factors that may contribute to the perception of
negativity in infant vocalizations (e.g., cry, whine, and vowel-like sounds). The
assumption is that identification of cry is self-evident; therefore, there has been
no attempt to systematically differentiate cry from non-cry vocalizations. Twelve
exemplars each of cry, whine, and vowel-like sound segments (36 total) were selected
from archival audio recordings of infant vocalizations. Categories were selected from
expert-judged audio signals of vocal development. Adult listeners identified each
utterance as either cry, whine, or vowel-like sound as quickly and accurately as possible.
They also judged the extent of negativity of each utterance. Acoustic features of
each utterance were analyzed in association with the categories and degrees of negativity.
Results suggest a continuum of negativity from cries (most negative) to vowel-like
sounds (least negative), and that acoustic variables are gradated across the negativity
continuum. However, preliminary results suggest that peak F0, peak RMS, and spectral
slope best differentiate the categories.
Franklin, B., Warlaumont, A. S., Messinger, D. S., Bene, E., Nathani Iyer, S., Lee,
C-C., Lambert, B., Oller, D. K. (2014). Effects of Parental Interaction on Infant
Vocalization Rate, Variability and Vocal Type. Language Learning and Development.
10(3):279–296. Abstract: Examination of infant vocalization patterns across interactive and noninteractive
contexts may facilitate better understanding of early communication development. In
the current study, with 24 infant-parent dyads, infant volubility increased significantly
when parent interaction ceased (presenting a “still face,” or SF) after a period of
normal interaction (“face-to-face,” or FF). Infant volubility continued at the higher
rate than in FF when the parent re-engaged (“reunion,” or RE). Additionally, during
SF, the variability in volubility across infants decreased, suggesting the infants
adopted relatively similar rates of vocalization to re-engage the parent. The pattern
of increasing volubility in SF was seen across all of the most common speech-like
vocal types of the first half-year of life (e.g., full vowels, quasivowels, squeals,
growls). Parent and infant volubility levels were not significantly correlated. The
findings suggest that by six months of age infants have learned that their vocalizations
have social value and that changes in volubility can affect parental engagement.
Studies of vocal development in infants with or at risk for disorders (autism, premature
birth, deafness, Down syndrome, Fragile X, low socio-economic status)
Oller, D. K., & Eilers, R. E. (1988). The role of audition in infant babbling. Child
Development, 59, 441 449. Abstract: The traditional belief that audition plays only a minor role in infant vocal
development depends upon evidence that deaf infants produce the same kinds of babbling
sounds as hearing infants. Evidence in support of this position has been very limited.
A more extensive comparison of vocal development in deaf and hearing infants indicates
that the traditional belief is in error. Well-formed syllable production is established
in the first 10 months of life by hearing infants but not by deaf infants, indicating
that audition plays an important role in vocal development. The difference between
babbling in the deaf and hearing is apparent if infant vocal sounds are observed from
a metaphonological perspective, a view that takes account of the articulatory/acoustic
patterns of speech sounds in all mature spoken languages.
Eilers, R. E., & Oller, D. K. (1994). Infant vocalizations and the early diagnosis
of severe hard of hearing. Journal of Pediatrics, 124, 199-203. Abstract: To determine whether late onset of canonical babbling could be used as a
criterion to determine risk of hard of hearing, we obtained vocalization samples longitudinally
from 94 infants with normal hearing and 37 infants with severe to profound hard of
hearing. Parents were instructed to report the onset of canonical babbling (the production
of well-formed syllables such as "da," "na," "bee," "yaya"). Verification that the
infants were producing canonical syllables was collected in laboratory audio recordings.
Infants with normal hearing produced canonical vocalizations before 11 months of age
(range, 3 to 10 months; mode, 7 months); infants who were deaf failed to produce canonical
syllables until 11 months of age or older, often well into the third year of life
(range, 11 to 49 months; mode, 24 months). The correlation between age at onset of
the canonical stage and age at auditory amplification was 0.68, indicating that early
identification and fitting of hearing aids is of significant benefit to infants learning
language. The fact that there is no overlap in the distribution of the onset of canonical
babbling between infants with normal hearing and infants with hard of hearing means
that the failure of otherwise healthy infants to produce canonical syllables before
11 months of age should be considered a serious risk factor for hard of hearing and,
when observed, should result in immediate referral for audiologic evaluation.
Belardi, K., Watson, L. R., Faldowski, R. A., Hazlett, H., Crais, E. R., Baranek,
G. T., McComish, C., Patten, E., Oller, D. K. (2017). A retrospective video analysis
of canonical babbling and volubility in infants with Fragile-X syndrome at 9-12 months
of age. Journal of Autism and Developmental Disorders, 47(4), 1193-1206. doi: 10.1007/s10803-017-3033-4.
Abstract: An infant’s vocal capacity develops significantly during the first year
of life. Research suggests early measures of pre-speech development, such as canonical
babbling and volubility, can differentiate typical versus disordered development.
This study offers a new contribution by comparing early vocal development in 10 infants
with Fragile X syndrome and 14 with typical development. Results suggest infants with
Fragile X syndrome produce fewer syllables and have significantly lower canonical
babbling ratios compared to infants who are typically developing. Furthermore, the
particular measures of babbling were strong predictors of group membership, adding
evidence regarding the possible utility of these markers in early identification.
Patten, E., Belardi, K., Baranek, G., Oller, D. K. (2014). Vocal patterns in infants
with Autism Spectrum Disorder: Canonical babbling status and vocalization frequency.
Journal of Autism and Developmental Disorders, 44(10), 2413–2428. doi:10.1007/s10803-014-2047-4.
Abstract: Canonical babbling is a critical milestone for speech development and is
usually well in place by 10 months. The possibility that infants with autism spectrum
disorder (ASD) show late onset of canonical babbling has so far eluded evaluation.
Rate of vocalization or "volubility" has also been suggested as possibly aberrant
in infants with ASD. We conducted a retrospective video study examining vocalizations
of 37 infants at 9-12 and 15-18 months. Twenty-three of the 37 infants were later
diagnosed with ASD and indeed produced low rates of canonical babbling and low volubility
by comparison with the 14 typically developing infants. The study thus supports suggestions
that very early vocal patterns may prove to be a useful component of early screening
and diagnosis of ASD.
Multicultural studies of vocal development
Lee, C-C., Jhang, Y., Chen, L., Relyea, G., Oller, D. K. (2016). Subtlety of ambient-language
effects in babbling: A study of English- and Chinese-speaking infants at 8, 10, and
12 months. Language Learning and Development. DOI: 10.1080/15475441.2016.1180983.
Abstract: Prior research on ambient-language effects in babbling has often suggested
infants produce language-specific phonological features within the first year. These
results have been questioned in research failing to find such effects and challenging
the positive findings on methodological grounds. We studied English- and Chinese-learning
infants at 8, 10, and 12 months and found listeners could not detect ambient-language
effects in the vast majority of infant utterances, but only in items deemed to be
words or to contain canonical syllables that may have made them sound like words with
language-specific shapes. Thus, the present research suggests the earliest ambient-language
effects may be found in emerging lexical items or in utterances influenced by language-specific
features of lexical items. Even the ambient-language effects for infant canonical
syllables and words were very small compared with ambient-language effects for meaningless
but phonotactically well-formed syllable sequences spoken by adult native speakers
of English and Chinese.
Farran, L. K., Lee, C-C., Yoo, H., Oller, D. K. (2016). Cross-cultural register differences
in infant-directed speech: An initial study. PLoS ONE. 11(3). e0151518. doi:10.1371/journal.pone.0151518.
Abstract: Infant-directed speech (IDS) provides an environment that appears to play
a significant role in the origins of language in the human infant. Differences have
been reported in the use of IDS across cultures, suggesting different styles of infant
language-learning. Importantly, both cross-cultural and intra-cultural research suggest
there may be a positive relationship between the use of IDS and rates of language
development, underscoring the need to investigate cultural differences more deeply.
The majority of studies, however, have conceptualized IDS monolithically, granting
little attention to a potentially key distinction in how IDS manifests across cultures
during the first two years. This study examines and quantifies for the first time
differences within IDS in the use of baby register (IDS/BR), an acoustically identifiable
type of IDS that includes features such as high pitch, long duration, and smooth intonation
(the register that is usually assumed to occur in IDS), and adult register (IDS/AR),
the type of IDS that does not include such features and thus sounds as if it could
have been addressed to an adult. We studied IDS across 19 American and 19 Lebanese
mother-infant dyads, with particular focus on the differential use of registers within
IDS as mothers interacted with their infants ages 0–24 months. Our results showed
considerable usage of IDS/AR (>30% of utterances) and a tendency for Lebanese mothers
to use more IDS than American mothers. Implications for future research on IDS and
its role in elucidating how language evolves across cultures are explored.
Automated computational analysis of infant vocal development and vocal disorders
Oller, D. K., Niyogi, P., S. Gray, J. A. Richards, J. Gilkerson, D. Xu, U. Yapanel,
S. F. Warren (2010). Automated Vocal Analysis of Naturalistic Recordings from Children
with Autism, Language Delay and Typical Development. Proceedings of the National Academy
of Sciences, 107, 30, 13354-13359. PMCID: PMC2922144. Abstract: For generations the study of vocal development and its role in language
has been conducted laboriously, with human transcribers and analysts coding and taking
measurements from small recorded samples. Our research illustrates a method to obtain
measures of early speech development through automated analysis of massive quantities
of day-long audio recordings collected naturalistically in children's homes. A primary
goal is to provide insights into the development of infant control over infrastructural
characteristics of speech through large-scale statistical analysis of strategically
selected acoustic parameters. In pursuit of this goal we have discovered that the
first automated approach we implemented is not only able to track children's development
on acoustic parameters known to play key roles in speech, but also is able to differentiate
vocalizations from typically developing children and children with autism or language
delay. The method is totally automated, with no human intervention, allowing efficient
sampling and analysis at unprecedented scales. The work shows the potential to fundamentally
enhance research in vocal development and to add a fully objective measure to the
battery used to detect speech-related disorders in early childhood. Thus, automated
analysis should soon be able to contribute to screening and diagnosis procedures for
early disorders, and more generally, the findings suggest fundamental methods for
the study of language in natural environments.
Warren, S. F., Gilkerson, J., Richards, J., Oller, D. K., Xu, D., Yapanel, U. (2010).
What automated vocal analysis reveals about the language learning environment of young
children with autism. Journal of Autism and Developmental Disorders, 40, 555-569.
Abstract: The study compared the vocal production and language learning environments
of 26 young children with autism spectrum disorder (ASD) to 78 typically developing
children using measures derived from automated vocal analysis. A digital language
processor and audio-processing algorithms measured the amount of adult words to children
and the amount of vocalizations they produced during 12-h recording periods in their
natural environments. The results indicated significant differences between typically
developing children and children with ASD in the characteristics of conversations,
the number of conversational turns, and in child vocalizations that correlated with
parent measures of various child characteristics. Automated measurement of the language
learning environment of young children with ASD reveals important differences from
the environments experienced by typically developing children.
Woynaroski, T. G., Oller, D. K., Kaysili, B. K., Yoder, P. (2016). The stability and
validity of automated vocal analysis in preverbal preschoolers with autism spectrum
disorder: Automated Vocal Analysis in ASD. Autism Research. DOI: 10.1002/aur.1667
DOI: 10.1002/aur.1667. Abstract: Theory and research suggest that vocal development predicts "useful speech"
in preschoolers with autism spectrum disorder (ASD), but conventional methods for
measurement of vocal development are costly and time consuming. This longitudinal
correlational study examines the reliability and validity of several automated indices
of vocalization development relative to an index derived from human coded, conventional
communication samples in a sample of preverbal preschoolers with ASD. Automated indices
of vocal development were derived using software that is presently "in development"
and/or only available for research purposes and using commercially available Language
ENvironment Analysis (LENA) software. Indices of vocal development that could be derived
using the software available for research purposes: (a) were highly stable with a
single day-long audio recording, (b) predicted future spoken vocabulary to a degree
that was nonsignificantly different from the index derived from conventional communication
samples, and (c) continued to predict future spoken vocabulary even after controlling
for concurrent vocabulary in our sample. The score derived from standard LENA software
was similarly stable, but was not significantly correlated with future spoken vocabulary.
Findings suggest that automated vocal analysis is a valid and reliable alternative
to time intensive and expensive conventional communication samples for measurement
of vocal development of preverbal preschoolers with ASD in research and clinical practice.
Warlaumont, A. S., Richards, J. A., Gilkerson, J. & Oller, D. K. (2014). A Social
Feedback Loop for Speech Development and Its Reduction in Autism. Psychological Science,
25: 1314-1324. DOI: 10.1177/0956797614531023 NIHMS577758. Abstract: We analyzed the microstructure of child-adult interaction during naturalistic,
daylong, automatically labeled audio recordings (13,836 hr total) of children (8-
to 48-month-olds) with and without autism. We found that an adult was more likely
to respond when the child's vocalization was speech related rather than not speech
related. In turn, a child's vocalization was more likely to be speech related if the
child's previous speech-related vocalization had received an immediate adult response
rather than no response. Taken together, these results are consistent with the idea
that there is a social feedback loop between child and caregiver that promotes speech
development. Although this feedback loop applies in both typical development and autism,
children with autism produced proportionally fewer speech-related vocalizations, and
the responses they received were less contingent on whether their vocalizations were
speech related. We argue that such differences will diminish the strength of the social
feedback loop and have cascading effects on speech development over time. Differences
related to socioeconomic status are also reported.
Abney, D. H., Warlaumont, A. S., Oller, D. K., Wallot, S., & Kello, C. T. (2016).
Multiple Coordination Patterns in Infant and Adult Vocalizations. Infancy, 9, 1. doi:10.1111/infa.12165.
Abstract: The study of vocal coordination between infants and adults has led to important
insights into the development of social, cognitive, emotional, and linguistic abilities.
We used an automatic system to identify vocalizations produced by infants and adults
over the course of the day for fifteen infants studied longitudinally during the first
2 years of life. We measured three different types of vocal coordination: coincidence-based,
rate-based, and cluster-based. Coincidence-based coordination and rate-based coordination
are established measures in the developmental literature. Cluster-based coordination
is new and measures the strength of matching in the degree to which vocalization events
occur in hierarchically nested clusters. We investigated whether various coordination
patterns differ as a function of vocalization type, whether different coordination
patterns provide unique information about the dynamics of vocal interaction, and how
the various coordination patterns each relate to infant age. All vocal coordination
patterns displayed greater coordination for infant speech-related vocalizations, adults
adapted the hierarchical clustering of their vocalizations to match that of infants,
and each of the three coordination patterns had unique associations with infant age.
Altogether, our results indicate that vocal coordination between infants and adults
is multifaceted, suggesting a complex relationship between vocal coordination and
the development of vocal communication.
Gilkerson, J., Richards, J.A., Warren, S. F., Greenwood, C. R., Oller, D. K., Montgomery,
J. K., Hansen, J. H. L., Xu, D. & Paul, T. D. (2017). Mapping the early language environment
using all-day recordings and automated analysis. American Journal of Speech Language
Pathology. doi:10.1044/2016. Abstract: Purpose: This research provided a first-generation standardization of automated
language environment estimates, validated these estimates against standard language
assessments, and extended on previous research reporting language behavior differences
across socioeconomic groups. Method: Typically developing children between 2 to 48
months of age completed monthly, daylong recordings in their natural language environments
over a span of approximately 6-38 months. The resulting data set contained 3,213 12-hr
recordings automatically analyzed by using the Language Environment Analysis (LENA)
System to generate estimates of (a) the number of adult words in the child's environment,
(b) the amount of caregiver-child interaction, and (c) the frequency of child vocal
output. Results: Child vocalization frequency and turn-taking increased with age,
whereas adult word counts were age independent after early infancy. Child vocalization
and conversational turn estimates predicted 7%-16% of the variance observed in child
language assessment scores. Lower socioeconomic status (SES) children produced fewer
vocalizations, engaged in fewer adult-child interactions, and were exposed to fewer
daily adult words compared with their higher socioeconomic status peers, but within-group
variability was high. Conclusions: The results offer new insight into the landscape
of the early language environment, with clinical implications for identification of
children at-risk for impoverished language environments.
Warlaumont, A. S., Oller, D. K., Buder, E. H., & Westermann, G. (2013). Prespeech
motor learning in a neural network using reinforcement. Neural Networks, 38, 64–75.
PMCID: PMC3541464. doi:10.1016.j.neunet.2012.11.012. Abstract: Vocal motor development in infancy provides a crucial foundation for language
development. Some significant early accomplishments include learning to control the
process of phonation (the production of sound at the larynx) and learning to produce
the sounds of one's language. Previous work has shown that social reinforcement shapes
the kinds of vocalizations infants produce. We present a neural network model that
provides an account of how vocal learning may be guided by reinforcement. The model
consists of a self-organizing map that outputs to muscles of a realistic vocalization
synthesizer. Vocalizations are spontaneously produced by the network. If a vocalization
meets certain acoustic criteria, it is reinforced, and the weights are updated to
make similar muscle activations increasingly likely to recur. We ran simulations of
the model under various reinforcement criteria and tested the types of vocalizations
it produced after learning in the different conditions. When reinforcement was contingent
on the production of phonated (i.e. voiced) sounds, the network's post-learning productions
were almost always phonated, whereas when reinforcement was not contingent on phonation,
the network's post-learning productions were almost always not phonated. When reinforcement
was contingent on both phonation and proximity to English vowels as opposed to Korean
vowels, the model's post-learning productions were more likely to resemble the English
vowels and vice versa.