Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Phonetic acquisition in cortical dynamics, a computational approach

Journal Article · · PLoS ONE
 [1];  [2];  [3];  [4];  [5];  [6]
  1. Univ. de Buenos Aires, Buenos Aires (Argentina)
  2. Argonne National Lab. (ANL), Lemont, IL (United States)
  3. Argonne National Lab. (ANL), Lemont, IL (United States); Loyola Univ. Chicago, Chicago, IL (United States)
  4. Inst. de Ciencias Humanas, Mendoza (Argentina)
  5. Univ. de Buenos Aires, Buenos Aires (Argentina); Inst. de Biología y Medicina Experimental-CONICET, Ciudad Autónoma de Buenos Aires (Argentina)
  6. Fraunhofer-Inst. fur Nachrichtentechnik Heinrich-Hertz-Inst. (Germany)

Many computational theories have been developed to improve artificial phonetic classification performance from linguistic auditory streams. However, less attention has been given to psycholinguistic data and neurophysiological features recently found in cortical tissue. We focus on a context in which basic linguistic units-such as phonemes-are extracted and robustly classified by humans and other animals from complex acoustic streams in speech data. We are especially motivated by the fact that 8-month-old human infants can accomplish segmentation of words from fluent audio streams based exclusively on the statistical relationships between neighboring speech sounds without any kind of supervision. In this paper, we introduce a biologically inspired and fully unsupervised neurocomputational approach that incorporates key neurophysiological and anatomical cortical properties, including columnar organization, spontaneous micro-columnar formation, adaptation to contextual activations and Sparse Distributed Representations (SDRs) produced by means of partial N-Methyl-D-aspartic acid (NMDA) depolarization. Its feature abstraction capabilities show promising phonetic invariance and generalization attributes. Our model improves the performance of a Support Vector Machine (SVM) classifier for monosyllabic, disyllabic and trisyllabic word classification tasks in the presence of environmental disturbances such as white noise, reverberation, and pitch and voice variations. Furthermore, our approach emphasizes potential self-organizing cortical principles achieving improvement without any kind of optimization guidance which could minimize hypothetical loss functions by means of-for example-backpropagation. Thus, our computational model outperforms multiresolution spectro-temporal auditory feature representations using only the statistical sequential structure immerse in the phonotactic rules of the input stream.

Research Organization:
Argonne National Lab. (ANL), Argonne, IL (United States)
Sponsoring Organization:
Argonne National Laboratory - Argonne Leadership Computing Facility; USDOE
Grant/Contract Number:
AC02-06CH11357
OSTI ID:
1559970
Journal Information:
PLoS ONE, Journal Name: PLoS ONE Journal Issue: 6 Vol. 14; ISSN 1932-6203
Publisher:
Public Library of ScienceCopyright Statement
Country of Publication:
United States
Language:
English

References (72)

Brainstem inputs to the ferret medial geniculate nucleus and the effect of early deafferentation on novel retinal projections to the auditory thalamus journal October 1998
The decade of the dendritic NMDA spike journal June 2010
Self-organized formation of topologically correct feature maps journal January 1982
Sensitivity of cat primary auditory cortex (Al) neurons to the direction and rate of frequency modulation journal February 1985
Sensitivity of neurons in cat primary auditory cortex to tones and frequency-modulated stimuli. I: Effects of variation of stimulus parameters journal November 1992
Top-down knowledge supports the retrieval of lexical information from degraded speech journal June 2007
Integration of iconic gestures and speech in left superior temporal areas boosts speech comprehension under adverse listening conditions journal January 2010
Multiple brain signatures of integration in the comprehension of degraded speech journal March 2011
Computational modeling of phonetic and lexical learning in early language acquisition: Existing models and future directions journal November 2012
Contextual modulation and stimulus selectivity in extrastriate cortex journal November 2014
The effects of distributional learning on rats' sensitivity to phonetic information. journal January 2006
Induction of visual orientation modules in auditory cortex journal April 2000
The functional organization of cortical feedback inputs to primary visual cortex journal April 2018
Adaptive shift in the domain of negative stiffness during spontaneous oscillation by hair bundles from the internal ear journal November 2005
Mechanisms of noise robust representation of speech in primary auditory cortex journal April 2014
Role of cortical N-methyl-D-aspartate receptors in auditory sensory memory and mismatch negativity generation: implications for schizophrenia. journal October 1996
Two mechanisms for transducer adaptation in vertebrate hair cells journal October 2000
A stagewise rejective multiple test procedure based on a modified Bonferroni test journal January 1988
Columnar Transformations in Auditory Cortex? A Comparison to Visual and Somatosensory Cortices journal January 2003
Expectancy Constraints in Degraded Speech Modulate the Language Comprehension Network journal June 2009
Spectral shape analysis in the central auditory system journal January 1995
Receptive fields, binocular interaction and functional architecture in the cat's visual cortex journal January 1962
Receptive fields and functional architecture of monkey striate cortex journal March 1968
Multiresolution spectrotemporal analysis of complex sounds journal August 2005
Enhanced discriminability at the phonetic boundaries for the place feature in macaques journal March 1983
Vowel discrimination in cats: Acquisition, effects of stimulus level, and performance in noise journal June 1996
Perceptual compensation for coarticulation by Japanese quail ( Coturnix coturnix japonica ) journal August 1997
Role of experience for language-specific functional mappings of vowel sounds journal December 1998
Speech perception by the chinchilla: voiced-voiceless distinction in alveolar plosive consonants journal October 1975
A map of visual space induced in primary auditory cortex journal November 1990
Experimentally induced visual projections into auditory thalamus and cortex journal December 1988
Active Properties of Neocortical Pyramidal Neuron Dendrites journal July 2013
Adaptation in Hair Cells journal March 2000
Modality and Topographic Properties of Single Neurons of Cat'S Somatic Sensory Cortex journal July 1957
Functional topography of cat primary auditory cortex: distribution of integrated excitation journal November 1990
Organization of response areas in ferret primary auditory cortex journal February 1993
Experimental evidence for sparse firing in the neocortex. text January 2012
Visual projections routed to the auditory pathway in ferrets: receptive fields of visual neurons in primary auditory cortex journal September 1992
Functional Integration across Brain Regions Improves Speech Perception under Adverse Listening Conditions journal February 2007
Why Neurons Have Thousands of Synapses, A Theory of Sequence Memory in Neocortex text January 2015
Continuous online sequence learning with an unsupervised neural network model text January 2015
Complementary control of sensory adaptation by two types of cortical interneurons journal October 2015
Towards deep learning with segregated dendrites journal December 2017
Brainstem inputs to the ferret medial geniculate nucleus and the effect of early deafferentation on novel retinal projections to the auditory thalamus journal October 1998
Self-organized formation of topologically correct feature maps journal January 1982
Sensitivity of neurons in cat primary auditory cortex to tones and frequency-modulated stimuli. II: Organization of response properties along the ‘isofrequency’ dimension journal November 1992
Distributional regularity and phonotactic constraints are useful for segmentation journal October 1996
Experimental evidence for sparse firing in the neocortex journal June 2012
Random synaptic feedback weights support error backpropagation for deep learning journal November 2016
Cellular organization of cortical barrel columns is whisker-specific journal October 2013
The columnar organization of the neocortex journal April 1997
The Design and Implementation of FFTW3 journal February 2005
Phoneme representation and classification in primary auditory cortex journal February 2008
Perception of synthetic /ba/–/wa/ speech continuum by budgerigars ( Melopsittacus undulatus )
  • Dent, Micheal L.; Brittan-Powell, Elizabeth F.; Dooling, Robert J.
  • The Journal of the Acoustical Society of America, Vol. 102, Issue 3 https://doi.org/10.1121/1.420111
journal September 1997
Statistical Learning by 8-Month-Old Infants journal December 1996
LIBSVM: A library for support vector machines journal April 2011
Continuous Online Sequence Learning with an Unsupervised Neural Network Model journal November 2016
Specificity and timescales of cortical adaptation as inferences about natural movie statistics journal October 2016
Experimental evidence for sparse firing in the neocortex. text January 2018
Perceptual Compensation for Coarticulation by Japanese Quail (Coturnix coturnix japonica) text January 1997
Perceptual compensation for coarticulation by Japanese quail (Coturnix coturnix japonica) text January 1997
Role of Experience for Language-Specific Functional Mappings of Vowel Sounds text January 2018
Role of experience for language-specific functional mappings of vowel sounds text January 1998
Visual projections routed to the auditory pathway in ferrets: receptive fields of visual neurons in primary auditory cortex journal September 1992
Functional Integration across Brain Regions Improves Speech Perception under Adverse Listening Conditions journal February 2007
Why Neurons Have Thousands of Synapses, a Theory of Sequence Memory in Neocortex journal March 2016
Toward an Integration of Deep Learning and Neuroscience journal September 2016
Towards an integration of deep learning and neuroscience text January 2016
Datasets used to train and test the Cortical Spectro-Temporal Model (CSTM). dataset January 2019
Datasets used to train and test the Cortical Spectro-Temporal Model (CSTM). dataset January 2019
Experimental Results and Appendices: Cortical Spectro-Temporal Model (CSTM). dataset January 2019
Experimental Results and Appendices: Cortical Spectro-Temporal Model (CSTM). dataset January 2019

Similar Records

A Computational Theory for the Emergence of Grammatical Categories in Cortical Dynamics
Journal Article · Thu Apr 16 00:00:00 EDT 2020 · Frontiers in Neural Circuits · OSTI ID:1817166

The ''neural'' phonetic typewriter
Journal Article · Mon Feb 29 23:00:00 EST 1988 · Computer; (United States) · OSTI ID:5263327

Application of backpropagation neural networks to phonetic element classification
Conference · Sun Dec 31 23:00:00 EST 1989 · OSTI ID:6321246