Cogprints

Neural Network Models of Categorical Perception

Damper, R.I. and Harnad, S.R. (2000) Neural Network Models of Categorical Perception. [Journal (Paginated)]

Abstract

Studies of the categorical perception (CP) of sensory continua have a long and rich history in psychophysics. In 1977, Macmillan et al. introduced the use of signal detection theory to CP studies. Anderson et al. simultaneously proposed the first neural model for CP, yet this line of research has been less well explored. In this paper, we assess the ability of neural-network models of CP to predict the psychophysical performance of real observers with speech sounds and artificial/novel stimuli. We show that a variety of neural mechanisms is capable of generating the characteristics of categorical perception. Hence, CP may not be a special mode of perception but an emergent property of any sufficiently powerful general learning system.

Item Type: Journal (Paginated)
Keywords: categorical perception, neural networks
Subjects: Computer Science > Neural Nets; Psychology > Perceptual Cognitive Psychology
ID Code: 1620
Deposited By: Harnad, Stevan
Deposited On: 19 Jun 2001
Last Modified: 11 Mar 2011 08:54

References in Article

Abramson, A., & Lisker, L. (1970). Discrimination along the voicing continuum: Cross-language tests. In Proceedings of 6th international congress of phonetic sciences, Prague, 1967 (pp. 569-573). Academia, Prague.

Amari, S., & Takeuchi, A. (1978). A mathematical theory on formation of category detecting neurons. Biological Cybernetics, 29, 127-136.

Anderson, J. A. (1968). A memory storage model utilizing spatial correlation functions. Kybernetik, 5, 113-119.

Anderson, J. A. (1977). Neural models with cognitive implications. In D. LaBerge & S. J. Samuels (Eds.), Basic processes in reading: Perception and comprehension (pp. 27-90). Hillsdale, NJ: Lawrence Erlbaum.

Anderson, J. A. (1995). An introduction to neural networks. Cambridge, MA: MIT Press.

Anderson, J. A., & Silverstein, J. W. (1978). Reply to Grossberg. Psychological Review, 85, 597-603.

Anderson, J. A., Silverstein, J. W., & Ritz, S. A. (1977). Vowel pre-processing with a neurally based model. In Proceedings of IEEE international conference on acoustics, speech and signal processing, ICASSP'77 (pp. 265-269). Hartford, CT.

Anderson, J. A., Silverstein, J. W., Ritz, S. A., & Jones, R. S. (1977). Distinctive features, categorical perception, and probability learning: Some applications for a neural model. Psychological Review, 84, 413-451.

Arbib, M. A. (1995). Handbook of brain theory and neural networks. Cambridge, MA: MIT Press.

Baldi, P., & Hornik, K. (1989). Neural networks and principal component analysis: Learning from examples without local minima. Neural Networks, 2, 53-58.

Beale, J. M., & Keil, F. C. (1995). Categorical effects in the perception of faces. Cognition, 57, 217-239.

Bégin, J., & Proulx, R. (1996). Categorization in unsupervised neural networks: The Eidos model. IEEE Transactions on Neural Networks, 7, 147-154.

Bornstein, M. H. (1987). Perceptual categories in vision and audition. In S. Harnad (Ed.), Categorical perception: the groundwork of cognition (pp. 287-300). Cambridge, UK: Cambridge University Press.

Bourlard, H., & Kamp, Y. (1988). Auto-association by multilayer perceptrons and singular value decomposition. Biological Cybernetics, 59, 291-294.

Brady, S. A., & Darwin, C. J. (1978). Range effects in the perception of voicing. Journal of the Acoustical Society of America, 63, 1556-1558.

Braida, L. D., & Durlach, N. I. (1972). Intensity perception: II. Resolution in one-interval paradigms. Journal of the Acoustical Society of America, 51, 483-502.

Carpenter, G. A., Grossberg, S., & Rosen, D. B. (1991a). ART 2-A: An adaptive resonance algorithm for rapid category learning and recognition. Neural Networks, 4, 493-504.

Carpenter, G. A., Grossberg, S., & Rosen, D. B. (1991b). Fuzzy ART: Fast stable learning and categorization of analog patterns by an adaptive resonance system. Neural Networks, 4, 759-771.

Damper, R. I. (1998). Auditory representations of speech sounds in a neural model: The role of peripheral processing. In Proceedings of international joint conference on neural networks (IJCNN 98) (pp. 2196-2201). Anchorage, AK.

Damper, R. I., Gunn, S. R., & Gore, M. O. (2000). Extracting phonetic knowledge from learning systems: Perceptrons, support vector machines and linear discriminants. Applied Intelligence, 12, 43-62.

Damper, R. I., Pont, M. J., & Elenius, K. (1990). Representation of initial stop consonants in a computational model of the dorsal cochlear nucleus (Tech. Rep. No. STL-QPSR 4/90). Speech Transmission Laboratory Quarterly Progress and Status Report, Royal Institute of Technology (KTH), Stockholm. (Also published in W. A. Ainsworth (Ed.), Advances in Speech, Hearing and Language Processing, Vol. 3 (Part B), pp. 497-546. Greenwich, CT: JAI Press, 1996)

Diehl, R. L., Elman, J. L., & McCusker, S. B. (1978). Contrast effects on stop consonant identification. Journal of Experimental Psychology: Human Perception and Performance, 4, 599-609.

Diehl, R. L., & Kluender, K. R. (1987). On the categorization of speech sounds. In S. Harnad (Ed.), Categorical perception: the groundwork of cognition (pp. 226-253). Cambridge, UK: Cambridge University Press.

Dreyfus, H. L., & Dreyfus, S. E. (1988). Making a mind versus modeling the brain: Artificial intelligence back at a branchpoint. Daedalus, 117, 15-43.

Eijkman, E. G. J. (1992). Neural nets tested by psychophysical methods. Neural Networks, 5, 153-162.

Elman, J. L. (1979). Perceptual origins of the phoneme boundary effect and selective adaptation to speech: A signal detection theory analysis. Journal of the Acoustical Society of America, 65, 190-207.

Elman, J. L. (1990). Finding structure in time. Cognitive Science, 14(2), 179-211.

Elman, J. L., & McClelland, J. L. (1986). Exploiting lawful variability in the speech wave. In J. S. Perkell & D. H. Klatt (Eds.), Invariance and variability in speech processes (pp. 360-380). Hillsdale, NJ: Lawrence Erlbaum.

Elman, J. L., & Zipser, D. (1988). Learning the hidden structure of speech. Journal of the Acoustical Society of America, 83, 1615-1626.

Finney, D. J. (1975). Probit analysis: A statistical treatment of the sigmoid response curve (3rd edition). Cambridge, UK: Cambridge University Press.

Fry, D. B., Abramson, A. S., Eimas, P. D., & Liberman, A. M. (1962). The identification and discrimination of synthetic vowels. Language and Speech, 5, 171-189.

Fujisaki, H., & Kawashima, T. (1969). On the modes and mechanisms of speech perception. Annual Report of the Engineering Research Institute, Faculty of Engineering, University of Tokyo, 28, 67-73.

Fujisaki, H., & Kawashima, T. (1970). Some experiments on speech perception and a model for the perceptual mechanism. Annual Report of the Engineering Research Institute, Faculty of Engineering, University of Tokyo, 29, 207-214.

Fujisaki, H., & Kawashima, T. (1971). A model of the mechanisms for speech perception - quantitative analysis of categorical effects in discrimination. Annual Report of the Engineering Research Institute, Faculty of Engineering, University of Tokyo, 30, 59-68.

Goldstone, R. L. (1994). Influences of categorization on perceptual discrimination. Journal of Experimental Psychology: General, 123, 178-200.

Goldstone, R. L. (1998). Perceptual learning. Annual Review of Psychology, 49, 585-612.

Goldstone, R. L., Steyvers, M., & Larimer, K. (1996). Categorical perception of novel dimensions. In Proceedings of the 18th annual conference of the cognitive science society (pp. 243-248). Hillsdale, NJ: Lawrence Erlbaum.

Green, D. M., & Swets, J. (1966). Signal detection theory and psychophysics. New York: Wiley. (1988 reprint edition published by Peninsula Press, Los Altos, CA)

Greenwood, D. D. (1961). Critical bandwidth and the frequency coordinates on the basilar membrane. Journal of the Acoustical Society of America, 33, 780-801.

Grossberg, S. (1968a). Some nonlinear networks capable of learning a spatial pattern of arbitrary complexity. Proceedings of the National Academy of Sciences, USA, 59, 368-372.

Grossberg, S. (1968b). Some physiological and biological consequences of psychological postulates. Proceedings of the National Academy of Sciences, USA, 60, 758-765.

Grossberg, S. (1969). Embedding fields: A theory of learning with physiological implications. Journal of Mathematical Psychology, 6, 209-239.

Grossberg, S. (1978). Do all neural models really look alike? A comment on Anderson, Silverstein, Ritz and Jones. Psychological Review, 85, 592-596.

Grossberg, S. (1986). The adaptive self-organization of serial order in behavior: Speech, language and motor control. In E. C. Schwab & H. C. Nusbaum (Eds.), Pattern recognition by humans and machines, vol. 1: Speech perception (pp. 187-294). London, UK: Academic.

Grossberg, S. (1987). Competitive learning: From interactive activation to adaptive resonance. Cognitive Science, 11, 23-63.

Guenther, F. H., & Gjaja, M. N. (1996). The perceptual magnet effect as an emergent property of neural map formation. Journal of the Acoustical Society of America, 100, 1111-1121.

Hanson, S. J., & Burr, D. J. (1990). What connectionist models learn: Learning and representation in connectionist networks. Behavioral and Brain Sciences, 13, 471-518.

Harnad, S. (1982). Metaphor and mental duality. In T. Simon & R. Scholes (Eds.), Language, mind and brain (pp. 189-211). Hillsdale, NJ: Lawrence Erlbaum.

Harnad, S. (Ed.). (1987). Categorical perception: the groundwork of cognition. Cambridge, UK: Cambridge University Press.

Harnad, S., Hanson, S. J., & Lubin, J. (1991). Categorical perception and the evolution of supervised learning in neural nets. In D. W. Powers & L. Reeker (Eds.), Working papers of the AAAI spring symposium on machine learning of natural language and ontology (pp. 65-74). Stanford, CA.

Harnad, S., Hanson, S. J., & Lubin, J. (1995). Learned categorical perception in neural nets: Implications for symbol grounding. In V. Honavar & L. Uhr (Eds.), Symbol processors and connectionist network models in artificial intelligence and cognitive modeling: Steps towards principled integration (pp. 191-206). London: Academic Press.

Hary, J. M., & Massaro, D. W. (1982). Categorical results do not imply categorical perception. Perception and Psychophysics, 32, 409-418.

Healy, A. F., & Repp, B. H. (1982). Context independence and phonetic mediation in categorical perception. Journal of Experimental Psychology: Human Perception and Performance, 8, 68-80.

Hinton, G. E., & Anderson, J. A. (Eds.). (1981). Parallel models of associative memory. Hillsdale, NJ: Lawrence Erlbaum.

Holland, J. H. (1998). Emergence: From chaos to order. Reading, MA: Addison-Wesley.

Howell, P., Rosen, S., Laing, H., & Sackin, S. (1992). The role of F1 transitions in the perception of voicing in initial plosives (Tech. Rep. No. 6). Speech, Hearing and Language: Work in Progress, University College London.

Iverson, P., & Kuhl, P. K. (1995). Mapping the perceptual magnet effect for speech using signal detection theory and multidimensional scaling. Journal of the Acoustical Society of America, 97, 553-562.

Jordan, M. I. (1986). Attractor dynamics and parallelism in a connectionist sequential machine. In Proceedings of the 8th annual conference of the cognitive science society (pp. 531-546). Hillsdale, NJ: Lawrence Erlbaum.

Kaplan, H. L., Macmillan, N. A., & Creelman, C. D. (1978). Tables of d′ for variable standard discrimination paradigms. Behavioral Research Methods and Instrumentation, 10, 796-813.

Kohonen, T. (1977). Associative memory: A system theoretic approach. Berlin: Springer-Verlag.

Kruschke, J. K. (1992). ALCOVE: An exemplar-based connectionist model of category learning. Psychological Review, 99, 22-44.

Kuhl, P. K. (1987). The special-mechanisms debate in speech research: Categorization tests on animals and infants. In S. Harnad (Ed.), Categorical perception: the groundwork of cognition. Cambridge, UK: Cambridge University Press.

Kuhl, P. K. (1991). Human adults and human infants show a "perceptual magnet effect" for the prototypes of speech categories, monkeys do not. Perception and Psychophysics, 50, 93-107.

Kuhl, P. K., & Miller, J. D. (1978). Speech perception by the chinchilla: Identification functions for synthetic VOT stimuli. Journal of the Acoustical Society of America, 63, 905-917.

Kuhl, P. K., & Padden, D. M. (1982). Enhanced discriminability at the phonetic boundaries for the voicing feature in macaques. Perception and Psychophysics, 32, 542-550.

Kuhl, P. K., & Padden, D. M. (1983). Enhanced discriminability at the phonetic boundaries for the place feature in macaques. Journal of the Acoustical Society of America, 73, 1003-1010.

Lacerda, F. (1998). An exemplar-based account of emergent phonetic categories. Journal of the Acoustical Society of America, 103, 2980(A).

Liberman, A. M. (1996). Speech: A special code. Cambridge, MA: MIT Press.

Liberman, A. M., Cooper, F. S., Shankweiler, D. P., & Studdert-Kennedy, M. (1967). Perception of the speech code. Psychological Review, 74, 431-461.

Liberman, A. M., Delattre, P. C., & Cooper, F. S. (1958). Some cues for the distinction between voiced and voiceless stops in initial position. Language and Speech, 1, 153-167.

Liberman, A. M., Harris, K. S., Hoffman, H. S., & Griffith, B. C. (1957). The discrimination of speech sounds within and across phoneme boundaries. Journal of Experimental Psychology, 54, 358-368.

Liberman, A. M., & Mattingly, I. (1985). The motor theory of speech perception revised. Cognition, 21, 1-36.

Liberman, A. M., & Mattingly, I. (1989). A specialization for speech perception. Science, 243, 489-494.

Lisker, L., & Abramson, A. (1964). A cross-language study of voicing in initial stops. Word, 20, 384-422.

Lisker, L., & Abramson, A. (1970). The voicing dimension: Some experiments in comparative phonetics. In Proceedings of 6th international congress of phonetic sciences, Prague, 1967 (pp. 563-567). Academia, Prague.

Livingston, K. R., Andrews, J. K., & Harnad, S. (1998). Categorical perception effects induced by category learning. Journal of Experimental Psychology: Learning, Memory and Cognition, 24, 732-753.

Lloyd, D. (1989). Simple minds. Cambridge, MA: Bradford Books/MIT Press.

Lotto, A. J., Kluender, K. R., & Holt, L. L. (1998). Depolarizing the perceptual magnet effect. Journal of the Acoustical Society of America, 103, 3648-3655.

Luce, R. D. (1959). Individual choice behavior. New York, NY: John Wiley.

Luce, R. D. (1963). A threshold theory for simple detection experiments. Psychological Review, 70, 61-79.

Macmillan, N. A. (1987). Beyond the categorical/continuous distinction: A psychophysical approach to processing modes. In S. Harnad (Ed.), Categorical perception: the groundwork of cognition (pp. 53-85). Cambridge, UK: Cambridge University Press.

Macmillan, N. A., Braida, L. D., & Goldberg, R. F. (1987). Central and peripheral effects in the perception of speech and nonspeech sounds. In M. E. H. Schouten (Ed.), The psychophysics of speech perception (pp. 28-45). Dordrecht, The Netherlands: Martinus Nijhoff.

Macmillan, N. A., & Creelman, C. D. (1991). Detection theory: A user's guide. New York, NY: Cambridge University Press.

Macmillan, N. A., Goldberg, R. F., & Braida, L. D. (1988). Resolution for speech sounds: Basic sensitivity and context memory on vowel and consonant continua. Journal of the Acoustical Society of America, 84, 1262-1280.

Macmillan, N. A., Kaplan, H. L., & Creelman, C. D. (1977). The psychophysics of categorical perception. Psychological Review, 84, 452-471.

Massaro, D. W. (1987a). Categorical partition: A fuzzy logical model of categorical behavior. In S. Harnad (Ed.), Categorical perception: the groundwork of cognition (pp. 254-283). Cambridge, UK: Cambridge University Press.

Massaro, D. W. (1987b). Speech perception by ear and eye: A paradigm for psychological inquiry. Hillsdale, NJ: Lawrence Erlbaum Associates.

Massaro, D. W., & Oden, G. C. (1980). Speech perception: A framework for research and theory. In N. Lass (Ed.), Speech and language, vol. 3: Advances in basic research and practice (pp. 129-165). New York, NY: Academic Press.

McClelland, J. L., & Elman, J. L. (1986). The TRACE model of speech perception. Cognitive Psychology, 18, 1-86.

Miller, G. A. (1956). The magical number seven, plus or minus two: Some limits on our capacity for processing information. Psychological Review, 63, 81-97.

Miller, J. L. (1994). On the internal structure of phonetic categories: A progress report. Cognition, 50, 271-285.

Nearey, T. M. (1997). Speech perception as pattern recognition. Journal of the Acoustical Society of America, 101, 3241-3254.

Nossair, Z. B., & Zahorian, S. A. (1991). Dynamic spectral shape features as acoustic correlates for initial stop consonants. Journal of the Acoustical Society of America, 89, 2978-2991.

Pastore, R. E. (1987). Categorical perception: Some psychophysical models. In S. Harnad (Ed.), Categorical perception: the groundwork of cognition (pp. 29-52). Cambridge, UK: Cambridge University Press.

Pastore, R. E. (1987a). Possible acoustic bases for the perception of voicing contrasts. In M. E. H. Schouten (Ed.), The psychophysics of speech perception (pp. 188-198). Dordrecht, The Netherlands: Martinus Nijhoff.

Pastore, R. E., Szczesiul, R., Wielgus, V., Nowikas, K., & Logan, R. (1984). Categorical perception, category boundary effects, and continuous perception: A reply to Hary and Massaro. Perception and Psychophysics, 35, 583-585.

Pevtzow, R., & Harnad, S. (1997). Warping similarity space in category learning by human subjects: The role of task difficulty. In Proceedings of SimCat 1997: Interdisciplinary workshop on similarity and categorisation (pp. 189-195). Department of Artificial Intelligence, University of Edinburgh.

Pisoni, D. B., & Lazarus, J. H. (1974). Categorical and non-categorical modes of speech perception along the voicing continuum. Journal of the Acoustical Society of America, 55, 328-333.

Pisoni, D. B., & Tash, J. (1974). Reaction time to comparisons within and across phonetic categories. Perception and Psychophysics, 15, 285-290.

Pont, M. J., & Damper, R. I. (1991). A computational model of afferent neural activity from the cochlea to the dorsal acoustic stria. Journal of the Acoustical Society of America, 89, 1213-1228.

Port, R. F. (1990). Representation and recognition of temporal patterns. Connection Science, 2, 151-176.

Quinlan, P. (1991). Connectionism and psychology: A psychological perspective on new connectionist research. Hemel Hempstead, UK: Harvester Wheatsheaf.

Repp, B. H. (1984). Categorical perception: Issues, methods and findings. In N. Lass (Ed.), Speech and language, vol. 10: Advances in basic research and practice (pp. 244-335). Orlando, FL: Academic Press.

Repp, B. H., Healy, A. F., & Crowder, R. G. (1979). Categories and context in the perception of isolated steady-state vowels. Journal of Experimental Psychology: Human Perception and Performance, 5, 129-145.

Repp, B. H., & Liberman, A. M. (1987). Phonetic category boundaries are flexible. In S. Harnad (Ed.), Categorical perception: the groundwork of cognition (pp. 89-112). Cambridge, UK: Cambridge University Press.

Rosen, S. M. (1979). Range and frequency effects in consonant categorization. Journal of Phonetics, 7, 393-402.

Rumelhart, D. E., Hinton, G. E., & Williams, R. (1986). Learning representations by back-propagating errors. Nature, 323, 533-536.

Rumelhart, D. E., & McClelland, J. L. (Eds.). (1986). Parallel distributed processing: Explorations in the microstructure of cognition (2 volumes). Cambridge, MA: Bradford Books/MIT Press.

Rumelhart, D. E., & Zipser, D. (1985). Feature discovery by competitive learning. Cognitive Science, 9, 75-112.

Schyns, P. G. (1991). A modular neural network model of concept acquisition. Cognitive Science, 15, 461-508.

Sinex, D. G., & McDonald, L. P. (1988). Average discharge rate representation of voice-onset time in the chinchilla auditory nerve. Journal of the Acoustical Society of America, 83, 1817-1827.

Steels, L. (1991). Towards a theory of emergent functionality. In J.-A. Meyer & S. W. Wilson (Eds.), From animals to animats: Proceedings of the 1st international conference on simulation of adaptive behavior (pp. 451-461). Cambridge, MA: Bradford Books/MIT Press.

Stevenage, S. V. (1995). Which twin are you? A demonstration of induced category learning. British Journal of Psychology, 89, 39-57.

Studdert-Kennedy, M., Liberman, A. M., Harris, K. S., & Cooper, F. S. (1970). Motor theory of speech perception: A reply to Lane's critical review. Psychological Review, 77, 234-239.

Takagi, N. (1995). Signal detection modeling of Japanese listeners' /r/-/l/ labeling behavior in a one-interval identification task. Journal of the Acoustical Society of America, 97, 563-574.

Tijsseling, A., & Harnad, S. (1997). Warping similarity space in category learning by backprop nets. In Proceedings of SimCat 1997: Interdisciplinary workshop on similarity and categorisation (pp. 263-269). Department of Artificial Intelligence, University of Edinburgh.

Treisman, M., Faulkner, A., Naish, P. L. N., & Rosner, B. S. (1995). Voice-onset time and tone-onset time: The role of criterion-setting mechanisms in categorical perception. Quarterly Journal of Experimental Psychology, 48A, 334-366.

Treisman, M., & Williams, T. C. (1984). A theory of criterion setting with an application to sequential dependencies. Psychological Review, 91, 68-111.

Volaitis, L. E., & Miller, J. L. (1992). Phonetic prototypes: Influence of place of articulation and speaking rate on the internal structure of voicing categories. Journal of the Acoustical Society of America, 92, 723-735.

Wood, C. C. (1976). Discriminability, response bias, and phoneme categories in discrimination of voice onset time. Journal of the Acoustical Society of America, 60, 1381-1389.

Wood, C. C. (1978). Variations on a theme by Lashley: Lesion experiments with the neural model of Anderson, Silverstein, Ritz and Jones. Psychological Review, 85, 582-591.
