Archive for Issues

Rhythm in Music and Speech

Rhythm appears to be a fundamental capacity of humans. Rhythm plays a role in the prenatal environment and the early socialization of infants (Bertoncini, et al., 1995; Fassbender, 1996; Hargreaves, 1986; Papoušek, 1996). It has been implicated in the coordination of motor activity and locomotion (Iverson & Thelen, 1999). Rhythmic processing is a late deteriorating function in neurodegenerative diseases, such as Alzheimer’s (Beatty, et al., 1999). Rhythm appears to be a basic element in the construction of more complex human behaviors and interactions, such as music and language (Iverson & Thelen, 1999; Patel, et al., 1998), and has been implicated in aspects of memory (Brower, 1993; Payne & Holzman, 1986; Patel, et al., 1998).

A greater understanding of rhythm processing will therefore benefit from joint explorations across these domains of human behavior, in particular in music and language because of their universal presence across cultures and throughout the lifespan. Both music and speech share the same acoustic medium. Both are processed by the same perceptual apparatus. I find it reasonable to assume that the cognitive heuristics used for making sense of music and speech are at least similar, because we lack sufficient evidence to suggest that humans have evolved two entirely different mental modules for music and for language. To the contrary, there is great evidence to suggest that the distinction between music and speech is only achieved at higher levels of processing (Patel, et al., 1998).

There are many aspects of temporal processing that are relevant for this examination, and which necessarily impact an understanding of the subject. Unfortunately, well-formed and agreed upon definitions are in short supply. Paul Fraisse (1982), for instance, has written: “The task of those who study rhythm is a difficult one, because a precise, generally accepted definition of rhythm does not exist.” (149) What’s more, the definitions that occasionally arise lack consistency in what they describe. In an attempt to clarify and tease apart the various aspects of temporal organization, I provide my own definitions of certain aspects, which I trust are no less nor more arbitrary than most. I make no attempt however to be exhaustive in these definitions, in part because there appear to be many equally valid ways to divide up the temporal domain. I merely seek a first approximation of terms to address those aspects which will most facilitate questions dealing jointly with music and speech. Read the rest of this entry »

Comments

Foreign accent syndrome

[Update pending. Look for review of Kurowski, Blumstein, and Alexander (1996).]

What has been dubbed foreign accent syndrome was first described by Monrad-Krohn in 1947,1 in which he presented the case of a woman who suffered a shrapnel wound in WWII, that damaged portions of the left hemisphere of her brain. Her ability to produce and comprehend language was mostly spared, except for the odd effect to her speech prosody that others perceived as a foreign accent. In that particular case, sounding German in Oslo just following WWII was not an easy thing.

What must be pointed out however is that no one ever has been reported in the neurological literature spontaneously, or as a result of head injury, to have begun speaking a foreign tongue. The term foreign accent syndrome, as well as some of the descriptions that have accompanied the term, is a bit of a misnomer, in that it implies the patients of FAS somehow acquire the accent of a particular foreign language. Rather, the perception of hearers is that the prosody is somehow off, leading them to entertain the theory that the speaker is non-native in the language. Read the rest of this entry »

Comments

Priming and polysemy

It has been observed in psychological studies of lexical priming, that polysemous words (in English, “bug”, “pit”, “fall”) sometimes force a processing delay, as the mind entertains several meanings. It would be interesting to consider these polysemous words as pivots. In musical modulation, pivot chords often serve similarly ambiguous (polysemous) roles. They are valid (diatonic) chords in the two keys. It is their dual roles that permits them to serve as pivots. A study of polysemous words in discourse as serving such a modulating role, in with the polysemous nature of pivot chords in musical modulation, might be a fruitful avenue to pursue.

Comments (1)

Thoughts on child-directed speech

It has been noted that child-directed speech (CDS) is often characterized by higher pitch and wider pitch range. However, these features are not universal. In Mayan society for example it has been reported that child-directed speech is characterized by a low-whispery murmuring quality. What could explain this difference?

One factor that has been largely unconsidered is the influence of sound environment on the specific choices made by speakers. An empirically verifiable theory would be that for instance the sound environment of Mayan children is more characterized by sounds within the higher pitched range, thus leading caregivers to modify their vocalisms toward the lower (and whispery) ends. It is a question of perceptual salience. The child is naturally equipped with the ability to pick out the human voice from its surrounding (cf. Belin, et al., 2000), likely by means of timbre recognition. Auditory scene analysis permits the child’s mind to pick out these features. However, if the sound environment muddies the soundscape in a particular range of frequencies, caregivers will likely veer in a different direction in order to aid the child in isolating the voice from surrounding sounds.

Reduplication and use of diminutives is often noted in CDS. Various explanations have been proposed. However, one that has yet to gain prominence regards the value of extra syllables in permitting intonational variety and contrast. What do we gain, in English for example, in transforming dog into doggy; in Czech, by rendering chlapec into chlapeček? Are we not adding greater phonetic complexity? Shouldn’t this be more difficult for the child? But this is in line with Slobin’s proposition that morphemes are more easily acquired which contain more than one phoneme. Further, we can not discount the value of intonation as contributing to the signal. It can be observed in Taiwanese Mandarin, for instance, that CDS not only contains reduplication, but that the tone is modified as well. An example that was given to me was the word /gu/ (high level tone) meaning brother. The CDS version is often /gu-gu/ (with a low followed by a high tone). Thus there is modification not only in the reduplication, but also in the change in tone, which is not lexically or phonetically determined.

Comments

Infant Sound Environment Project (ISEP)

Draft

The Infant Sound Environment Project (ISEP) is a longitudinal study of the sound inputs to infants and the relationship of these inputs to the sound production of these same children as they emerge from infancy. While previous studies have often addressed the acquisition of words and grammar—how meaning and form emerge in the human mind—the present study will address a different aspect of this experience. The focus will be on the melodic and rhythmic elements of sound—both those encountered by the child, and those which the child eventually produces. These aspects in communicative behaviors such as language and music carry a fundamental level of emotion that has heretofore gone largely unexplored. Further their foregrounding in this study will permit us to explore patterning, imitation, and creativity, without prejudicing the linguistic context.

It is often posited that language is unique to our species, and that what it contributes to our being is nothing less than defining of our nature. The intent of this study is not directly to challenge this notion, but rather to put question to what fundamentally characterizes language in this vein. If language is the defining element of humanity, what is language? The answer to this question underlies all of this research. It is quite possible that the “what” that will emerge is not exclusive to the domain of language.

We will study the origins of sound comprehension and sound production in the individual, with particular attention paid to the rhythmic and melodic features of this sound. We will retain an interest in what this ontogenesis may have to say about the phylogenesis of these same qualities in the human species. Existing theories of the origins of human language, and of the uniqueness of the human animal with regards to language, have mostly focused on the higher level domain of words and grammar. By focusing on these lower-level elements of sound, we can begin to test the point at which our species’ uniqueness arises1.

The present attempt will begin with the data, rather than any presupposed theoretical framework, with a broad interest in sounds of all variety, without prejudicing language or any other behavior a priori. One key feature of the current project is its methodology. This is a painstaking effort to gather raw data that can be analyzed and dissected to yield fruitful information. No hypothesis is being tested. Rather the evidence is being gathered first, with an eye toward constructing a theory from this evidence. The basic research in this domain is yet to be done. This effort is an attempt to redress this failing.

While studies have been done both of the prenatal environment, and of the capacities for sound perception of neonates and young infants, no longitudinal study has yet been done to correlate the sound production of preschoolers with the sounds they have been exposed to earlier in life. It seems a reasonable assumption for example that young children will model the prosodic patterns that they have encountered. Yet so far this remains mostly an assumption. What can be empirically established regarding the relationship between the sound environment of infants and their own later sound productions? This study is intended to establish these relationships.

It is clear that infants (as the meaning of the word connotes) are born without speech. Their earliest indulgences with sound are described first as cries, then also as babbling. At what point does this babbling become intentional experimentation? What is the nature of this experimentation? If imitative, what is being imitated? These are some of the questions that we wish to elucidate. Previous studies have focused exclusively on the vocal output of the infants and children, or have considered the primarily linguistic utterances of parents and caregivers, described nowadays as infant-directed speech, but little work has been done to compare these facets, and what has been done presupposes an often ill-defined distinction between linguistic and nonlinguistic behaviors.

However, as noted by Trehub et al2, distinguishing language from other sounds is a non-trivial task for the infant. Put another way, until language has been acquired, the infant is tasked with making sense of a mass of meaningless sounds. Only by stipulating a hard-wired language module in the infant brain (for which at the moment there is scant empirical evidence) are we able to view their task as language acquisition. Yet language, to the infant, shares many features in common with nonlinguistic sounds in their environment.

Even to an adult, there remains overlap between musical and linguistic sound production. What are the features that distinguish language from other sound-producing behaviors? How does the infant come to understand these differences? To what extent are these features universal or culture-specific? Rather than beginning with the assumption that an infant’s task is to acquire language, perhaps we should seek to understand the child’s behavior in more general terms. If patterns emerge in the sound environment of infants, how do infants go about recognizing, understanding, and reproducing these patterns? Only a study that seeks to examine sound without presupposing the prominence of language will be capable of answering these questions.

The methodology we will adopt is based in a naturalistic approach. Rather than a laboratory setting that trades off naturalism for control in the environment, ISEP will collect data directly from the natural environment of the child. Daylong digital audio recordings will be made of the children within their natural home, play, and school contexts. The length of recordings will ensure a sufficient quantity of usable data, and will help mitigate against the self-consciousness of caregivers and others. These recordings will be captured at intervals of about once a month for the first two to three years of life.

Every effort will be made to maintain the confidentiality and privacy of the participants. However the data collected will have much broader application than the focus of this study, and thus we hope to make this data available to other researchers involved in complementary work. Similarly, it is likely that data collected for other projects, especially from diverse cultural contexts, will be useful for the present purpose. Thus efforts will be made to find outside collaborators for this project.

The question, which by some is supposed to have been settled in 1959 by the suppositions and arguments of Noam Chomsky regarding the unlearnability of many aspects of the linguistic system3, nonetheless remains: to what extent is language acquired from the environment; and to what extent is it innate? More broadly, what influence on a child’s cognitive development is directly played by the inputs to that child during maturation? Since the focus here in on the rhythmic and melodic aspects of sound, it is now possible to gather data which can be used to answer this question. Until recently, with the advent of long-term recordings of sound, this question has been constrained by the unreliability of report, or by extrapolations from limited data gathered in laboratories and elsewhere.

It is well-known that all normal children acquire the language(s) to which they are exposed, and no other. Little is known however about the acquisition of aspects of language and interactive behaviors beyond the domains of words and grammar, as for instance intonation, timing, stress. It has generally been supposed that language socialization (both socialization to learn language, and further socialization through language) is the dominant, even all-encompassing task of the prelinguistic infant. Additionally, it has been assumed that the principal or sole function of these behaviors has been communicative, while their possible role as part of the child’s mental construction of the world and oneself has mostly remained unconsidered.

The current study will dispose with any such prejudice, and suppose merely that sound exists in the environment in which an infant dwells; that this sound assuredly contains remarkable features in terms of its melodic and rhythmic content, and likely in other aspects such as timbre; that the infant’s brain is in some way predisposed by nature and biology to selectively attend to that sound4; and further that through some process, which is yet to be clearly understood, the child as it emerges from infancy begins to contribute its own sounds to that environment, which likely bear some resemblance to those it has experienced. The process by which the child’s own sound manipulations reflect natural dispositions or exhibit cultural or idiosyncratic features is little understood. The intent of the present study is to provide a means for measuring the impact of the sound environment of early life on the child’s own later productions of sound.

Surely, the patterning through time that rhythm presents can be found in a soundless environment as well. Other features, such as melody and timbre, may also be found to have silent correlates5. Later research should allow us to compare the normal development of hearing children to the cognitive, linguistic, and social development of the congenitally deaf in ways that have so far eluded comparison. Through the establishment of a baseline for development at the lower levels that will be exposed through this study, we will be able to more readily and more early identify certain deficits which may affect a child’s cognitive development and language acquisition.

Further, this study will permit us to uncover other elements in the maturation and socialization of the child through sound, which are extralinguistic, such as music acquisition. More broadly, by focusing not on the lexicon and syntax, but rather on melody, rhythm, and timbre, we will be able to discuss in finer-grained detail the child’s sound perception and production than has previously been the norm. In this way we will be able to see what features are shared among a variety of behaviors, without presuming their function and significance as linguistic or otherwise.

1 Don Hodges has presented a video of a captive and socialized chimpanzee purportedly engaged in musical improvisation at a keyboard. Is it possible that lower-level manipulations of melody and rhythm are shared behaviors with our primate cousins?

2 Trehub, et al. “Music and speech processing in the first year of life,” Advances in Child Development and Behavior 24 (1993).

3 For example, Boysson-Bardies (1999) writes: “By the middle of the twentieth century, after a brief interval in which Anglo-American psychologists held the acquisition of language to be the exclusive result of learning and imitation, it came to be recognized that language development could not be reduced to a mechanism of elementary linkages between images or sensations and sounds. In 1959, Noam Chomsky demonstrated the impossibility of acquiring language with approaches of this type. … Only a powerful innate system could allow the child to extract a model of language from adult speech … This endowment consists of a universal system that belongs to the human brain, which he called a universal grammar. This grammar is the basic schema that grounds the grammars of all human languages. A mental circuitry, inscribed in the biological constraints governing the development of the brain, underlies this schema and permits it to select the sounds, signs, and sign combinations of the mature language.”

4 Belin, et al. (2000) have identified what they term “voice selective areas” of the brain, neurons that respond specifically to the vocal sounds of conspecifics.

5 Sherman Wilcox has argued that sign languages exhibit features which should properly be interpreted as prosodic, an affective layering on top of the lexical and syntactic foundation, which might likely correspond to melodic and timbral aspects of spoken language.

Comments

Dichotic listening (tba)

Need to add a critical commentary on dichotic listening experiments. Why are these studies still so often cited? From my review of the literature most of them are fatally flawed by too many confounds, especially in their assumptions regarding what consititutes linguistic (or “verbal”) vs. non-verbal stimuli. But see Kimura’s (1967) comments, p. 167, acknowledging the difficulty of determining “what verbal activity consists in”.

Comments

« Previous Page « Previous Page Next entries »
Register Login
Locations of visitors to this page