Statistical learning, cross-linguistic constraints, and the acquisition of speech categories

Posted on 1 Nov 2005 by Joe Toscano

A computational approach

Toscano, J. C. and McMurray, B. (2005, November). Paper presented at the 11th Midcontinental Workshop on Phonology, University of Michigan, Ann Arbor, MI.

Abstract:

Infants learning the phonetic categories of their native language must recognize which distinctions are relevant to their language and which are not. While they initially discriminate both native and non-native phoneme contrasts, infants quickly learn to discriminate only those contrasts that are present in their language (Werker & Tees, 1984), and eventually form language-appropriate phonetic categories. One way they might do this is to take advantage of the statistics available in their linguistic environment. Previous work has shown that infants are indeed sensitive to and make use of the distributional statistics for speech sounds (Maye et al, 2002). Infants exposed a series of sound in which phonetic cues formed two clusters learned two categories. Infants exposed to a unimodal distribution learned only one.

We implemented this hypothesis in a computational model. Data representing the distribution of Voice Onset Times (VOTs) for one of several languages were fed into a statistical learning model. These data were based on the statistical distributions of VOT measured by Lisker and Abramson (1964). The model began with a set of Gaussian distributions located at random locations in VOT-space. On each generation, it was given a particular VOT. The model then adjusted the distributions, giving a greater weight to the distribution that best matched the input. Over successive generations, the model was able to fit the input distributions for a variety of languages differing in VOT boundaries and categories. Thus, this form of statistical learning, as implemented in a relatively simple learning device, and can learn actual phonetic categories.

We next used the model to examine the role of cross-linguistic patterns on learning. Cross-linguistic similarities may place constraints on the properties of the phoneme categories that must be learned (see Newport & Aslin, 2000, for a similar argument). By varying the starting states of the distributions in the model and evaluating their effect on successful learning, we can determine the relative importance of the initial category locations on the model’s performance. If the starting states correspond to categories that are common across languages, they may yield better performance by the model. However, if these starting states provide no advantage, the model’s performance will be similar to the condition in which its initial categories are random. This would suggest that statistical learning is a sufficiently powerful mechanism for the acquisition of speech categories without the cross-linguistic constraints. Findings suggest that while statistical learning is sufficient for a most learning-situations, there may be a small benefit to cross-linguistic constraints.

PDF of abstract

Tagged with: computational modeling, cross-linguistic constraints, gaussian mixture models, language acquisition, speech development
Posted in Presentations