Reverse engineering great ape vocal tract configurations with implications for evolving speech biomechanics
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Speech, Music and Hearing, TMH. ORCID iD: 0000-0002-6739-0838
(English) Manuscript (preprint) (Other academic)
Abstract [en]

Great ape call production may inform research on the evolution of speech but remains poorly understood. The vowel-like qualities of long-distance vocalizations of nonhuman great apes appear both acoustically and perceptually comparable to the human close back vowel [u]. However, nonhuman great ape vocal tract morphology, including the species-typical tongue and lack of an expanded pharynx, precludes comparable articulation. Here, we explore possible vocal tract configurations underlying chimpanzee (Pan troglodytes) pant hoots, gorilla (Gorilla gorilla) hoots, and orangutan (Pongo abelii) long calls. We present the results of computer simulations of acoustic tube vocal tract models based on great ape MRI articulator data and behavior. Predicted first and second formant simulation data were compared against data collected from adult male chimpanzees, silverback male gorillas, and flanged male orangutans in the wild. We explored the explanatory value of four sets of models corresponding to (i) uniform tubes, (ii) narrowed lip passage, (iii) narrowed and extremely protruded lip passage, and (iv) a “retracted model”, with dorsal oral tract constriction achieved via tongue retraction, combined with a narrowed lip passage. Our results show that great ape hoot data are most consistent with an articulatory model assuming dorsal oral tract stricture through tongue retraction (the only model to achieve a fit for the second formant). Our work indicates that articulatory configurations employed in great ape call production may exist in continuity with speech production, without being identical to those observed in modern humans.
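As a rough illustration of the simplest model class named in the abstract, (i) uniform tubes, the sketch below applies the textbook quarter-wavelength formula for a lossless tube closed at the glottis and open at the lips, F_n = (2n - 1) c / (4 L). It is not the authors' simulation code. The function name, the speed of sound (350 m/s), and the tube lengths are assumptions chosen for illustration only and are not values from the manuscript.

```python
def uniform_tube_formants(length_m, n_formants=2, speed_of_sound=350.0):
    """Resonances (Hz) of a lossless uniform tube, closed at one end and open at the other.

    Uses the quarter-wavelength relation F_n = (2n - 1) * c / (4 * L).
    All default values are illustrative assumptions, not data from the manuscript.
    """
    if length_m <= 0:
        raise ValueError("tube length must be positive")
    return [
        (2 * n - 1) * speed_of_sound / (4.0 * length_m)
        for n in range(1, n_formants + 1)
    ]


if __name__ == "__main__":
    # Hypothetical tube lengths in metres; a longer tube (for example, one
    # lengthened by lip protrusion) lowers every resonance proportionally.
    for length in (0.175, 0.20):
        f1, f2 = uniform_tube_formants(length)
        print(f"L = {length:.3f} m: F1 = {f1:.0f} Hz, F2 = {f2:.0f} Hz")
```

In a uniform tube the second resonance is always three times the first, so this model class cannot lower F2 independently of F1. A low second formant of the kind associated with [u]-like qualities requires a constriction somewhere along the tube, which is the role the abstract assigns to models (ii) through (iv).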

Keywords [en]
Phonetics, Primatology
National Category
Zoology
Research subject
Speech and Music Communication
Identifiers
URN: urn:nbn:se:kth:diva-351244
OAI: oai:DiVA.org:kth-351244
DiVA, id: diva2:1886738
Note

QC 20240805

Available from: 2024-08-04 Created: 2024-08-04 Last updated: 2024-08-05 Bibliographically approved
In thesis
1. Phonetic potential in the extant apes and extinct hominins
2024 (English) Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Several novel claims with bearing on the evolution of speech production are made. It is shown through a series of theoretical, empirical, and computational works that the vocal anatomy of nonhuman apes, such as gibbons, orangutans, and chimpanzees, allows for the production of variable vowel-like contrasts. These phenomena in extant nonhuman primates are likely consistent with the animals’ retracting the tongue, potentially homologous with aspects of speech production. However, biomechanical relationships inherent to the primate vocal production apparatus render fluid speech unrealistic. The articulatory configurations necessary to achieve these contrasts likely recruit lingual gestures disparate from those of humans, reflecting disparate anatomy. Novel evidence is also presented, illustrating elementary vocal production learning capacities in chimpanzees. These capacities are thus unlikely to have emerged de novo in our lineage. Building on these two sources of evidence, it is argued that the evolution of speech is not straightforwardly reducible to “neural evolution”. Rather, additional evolutionary pressures must have acted upon hominin ancestors to ultimately trigger the evolution of spoken language. Toward this end, paleoanthropological evidence of articulator evolution in the hominin lineage is explored. The introduction of increasingly complex food processing and tool use, typically argued to have led to widespread anatomical changes in the face and guts of human ancestors, appears simultaneously with changes in the hominin would-be articulatory complex. Potential articulatory benefits of these changes in ancestral hominins are explored. An efficient articulatory apparatus, and the neural substrates by which to efficiently control it, likely evolved simultaneously with the human genus itself.

Abstract [sv] (English translation)

The thesis presents several arguments bearing on the evolution of speech. The anatomy of nonhuman primates such as gibbons, orangutans, and chimpanzees allows the production of a number of vowel-like vocalizations. These phenomena are shown to be consistent with the animals retracting the tongue, a possible homolog of speech production. However, the lingual gestures recruited to achieve these sound qualities likely differ from those studied in human speech, reflecting anatomical limitations of the nonhuman primate vocal tract. For primates, the inherent biomechanics of the vocal tract appear to prevent fluent, efficient speech sequences. New evidence is also presented, demonstrating a basic capacity for learning speech-like sounds in chimpanzees. This capacity is therefore unlikely to have evolved only in the human lineage. The evolution of speech thus cannot be reduced to “neural evolution” alone. Additional and unique evolutionary pressures must have acted on human ancestors to ultimately enable the evolution of spoken language. Paleoanthropological evidence of the evolution of the speech apparatus in extinct humans is explored. Evidence of increasingly complex tool manufacture appears together with widespread anatomical changes in the face of human ancestors. The thesis examines the phonetic consequences of these changes. An efficient speech apparatus, and the neural substrates for controlling it, likely evolved together in what would become modern humans.

Place, publisher, year, edition, pages
Stockholm, Sweden: KTH Royal Institute of Technology, 2024. p. 71
Series
TRITA-EECS-AVL ; 55
Keywords
Evolution of speech, speech acoustics, source/filter theory, primatology, evolutionary anthropology
National Category
General Language Studies and Linguistics
Research subject
Speech and Music Communication
Identifiers
urn:nbn:se:kth:diva-351250 (URN)
978-91-8040-967-4 (ISBN)
Public defence
2024-09-26, Fantum, Lindstedtsvägen 24, Stockholm, 15:00 (English)
Note

QC 20240805

Available from: 2024-08-05 Created: 2024-08-04 Last updated: 2024-08-14 Bibliographically approved

Open Access in DiVA

No full text in DiVA

Authority records

Ekström, Axel G.; Edlund, Jens
