skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Phonetic acquisition in cortical dynamics, a computational approach

Journal Article · · PLoS ONE
ORCiD logo [1]; ORCiD logo [2];  [3];  [4];  [5];  [6]
  1. Univ. de Buenos Aires, Buenos Aires (Argentina)
  2. Argonne National Lab. (ANL), Lemont, IL (United States)
  3. Argonne National Lab. (ANL), Lemont, IL (United States); Loyola Univ. Chicago, Chicago, IL (United States)
  4. Inst. de Ciencias Humanas, Mendoza (Argentina)
  5. Univ. de Buenos Aires, Buenos Aires (Argentina); Inst. de Biología y Medicina Experimental-CONICET, Ciudad Autónoma de Buenos Aires (Argentina)
  6. Fraunhofer-Inst. fur Nachrichtentechnik Heinrich-Hertz-Inst. (Germany)

Many computational theories have been developed to improve artificial phonetic classification performance from linguistic auditory streams. However, less attention has been given to psycholinguistic data and neurophysiological features recently found in cortical tissue. We focus on a context in which basic linguistic units-such as phonemes-are extracted and robustly classified by humans and other animals from complex acoustic streams in speech data. We are especially motivated by the fact that 8-month-old human infants can accomplish segmentation of words from fluent audio streams based exclusively on the statistical relationships between neighboring speech sounds without any kind of supervision. In this paper, we introduce a biologically inspired and fully unsupervised neurocomputational approach that incorporates key neurophysiological and anatomical cortical properties, including columnar organization, spontaneous micro-columnar formation, adaptation to contextual activations and Sparse Distributed Representations (SDRs) produced by means of partial N-Methyl-D-aspartic acid (NMDA) depolarization. Its feature abstraction capabilities show promising phonetic invariance and generalization attributes. Our model improves the performance of a Support Vector Machine (SVM) classifier for monosyllabic, disyllabic and trisyllabic word classification tasks in the presence of environmental disturbances such as white noise, reverberation, and pitch and voice variations. Furthermore, our approach emphasizes potential self-organizing cortical principles achieving improvement without any kind of optimization guidance which could minimize hypothetical loss functions by means of-for example-backpropagation. Thus, our computational model outperforms multiresolution spectro-temporal auditory feature representations using only the statistical sequential structure immerse in the phonotactic rules of the input stream.

Research Organization:
Argonne National Laboratory (ANL), Argonne, IL (United States)
Sponsoring Organization:
Argonne National Laboratory - Argonne Leadership Computing Facility; USDOE
Grant/Contract Number:
AC02-06CH11357
OSTI ID:
1559970
Journal Information:
PLoS ONE, Vol. 14, Issue 6; ISSN 1932-6203
Publisher:
Public Library of ScienceCopyright Statement
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 2 works
Citation information provided by
Web of Science

References (67)

Perception of synthetic /ba/–/wa/ speech continuum by budgerigars ( Melopsittacus undulatus )
  • Dent, Micheal L.; Brittan-Powell, Elizabeth F.; Dooling, Robert J.
  • The Journal of the Acoustical Society of America, Vol. 102, Issue 3 https://doi.org/10.1121/1.420111
journal September 1997
Active Properties of Neocortical Pyramidal Neuron Dendrites journal July 2013
Distributional regularity and phonotactic constraints are useful for segmentation journal October 1996
Specificity and timescales of cortical adaptation as inferences about natural movie statistics journal October 2016
Enhanced discriminability at the phonetic boundaries for the place feature in macaques journal March 1983
Speech perception by the chinchilla: voiced-voiceless distinction in alveolar plosive consonants journal October 1975
Self-organized formation of topologically correct feature maps journal January 1982
Columnar Transformations in Auditory Cortex? A Comparison to Visual and Somatosensory Cortices journal January 2003
Phoneme representation and classification in primary auditory cortex journal February 2008
Adaptive shift in the domain of negative stiffness during spontaneous oscillation by hair bundles from the internal ear journal November 2005
The decade of the dendritic NMDA spike journal June 2010
The effects of distributional learning on rats' sensitivity to phonetic information. journal January 2006
Spectral shape analysis in the central auditory system journal January 1995
Adaptation in Hair Cells journal March 2000
Role of experience for language-specific functional mappings of vowel sounds journal December 1998
Organization of response areas in ferret primary auditory cortex journal February 1993
Two mechanisms for transducer adaptation in vertebrate hair cells journal October 2000
Random synaptic feedback weights support error backpropagation for deep learning journal November 2016
Receptive fields, binocular interaction and functional architecture in the cat's visual cortex journal January 1962
Vowel discrimination in cats: Acquisition, effects of stimulus level, and performance in noise journal June 1996
Cellular organization of cortical barrel columns is whisker-specific journal October 2013
The columnar organization of the neocortex journal April 1997
Receptive fields and functional architecture of monkey striate cortex journal March 1968
Sensitivity of cat primary auditory cortex (Al) neurons to the direction and rate of frequency modulation journal February 1985
Modality and Topographic Properties of Single Neurons of Cat'S Somatic Sensory Cortex journal July 1957
Top-down knowledge supports the retrieval of lexical information from degraded speech journal June 2007
Role of cortical N-methyl-D-aspartate receptors in auditory sensory memory and mismatch negativity generation: implications for schizophrenia. journal October 1996
Experimentally induced visual projections into auditory thalamus and cortex journal December 1988
Perceptual compensation for coarticulation by Japanese quail ( Coturnix coturnix japonica ) journal August 1997
Why Neurons Have Thousands of Synapses, a Theory of Sequence Memory in Neocortex journal March 2016
Integration of iconic gestures and speech in left superior temporal areas boosts speech comprehension under adverse listening conditions journal January 2010
Expectancy Constraints in Degraded Speech Modulate the Language Comprehension Network journal June 2009
Functional topography of cat primary auditory cortex: distribution of integrated excitation journal November 1990
Statistical Learning by 8-Month-Old Infants journal December 1996
Experimental evidence for sparse firing in the neocortex journal June 2012
Contextual modulation and stimulus selectivity in extrastriate cortex journal November 2014
A stagewise rejective multiple test procedure based on a modified Bonferroni test journal January 1988
A map of visual space induced in primary auditory cortex journal November 1990
LIBSVM: A library for support vector machines journal April 2011
Computational modeling of phonetic and lexical learning in early language acquisition: Existing models and future directions journal November 2012
Toward an Integration of Deep Learning and Neuroscience journal September 2016
Induction of visual orientation modules in auditory cortex journal April 2000
Sensitivity of neurons in cat primary auditory cortex to tones and frequency-modulated stimuli. II: Organization of response properties along the ‘isofrequency’ dimension journal November 1992
Multiresolution spectrotemporal analysis of complex sounds journal August 2005
Continuous Online Sequence Learning with an Unsupervised Neural Network Model journal November 2016
The Design and Implementation of FFTW3 journal February 2005
Multiple brain signatures of integration in the comprehension of degraded speech journal March 2011
Experimental evidence for sparse firing in the neocortex. text January 2012
Perceptual Compensation for Coarticulation by Japanese Quail (Coturnix coturnix japonica) text January 1997
Role of Experience for Language-Specific Functional Mappings of Vowel Sounds text January 2018
Role of experience for language-specific functional mappings of vowel sounds text January 1998
Experimental evidence for sparse firing in the neocortex. text January 2018
Perceptual compensation for coarticulation by Japanese quail (Coturnix coturnix japonica) text January 1997
Datasets used to train and test the Cortical Spectro-Temporal Model (CSTM). dataset January 2019
Why Neurons Have Thousands of Synapses, A Theory of Sequence Memory in Neocortex text January 2015
Towards an integration of deep learning and neuroscience text January 2016
Brainstem inputs to the ferret medial geniculate nucleus and the effect of early deafferentation on novel retinal projections to the auditory thalamus journal October 1998
Sensitivity of neurons in cat primary auditory cortex to tones and frequency-modulated stimuli. I: Effects of variation of stimulus parameters journal November 1992
The functional organization of cortical feedback inputs to primary visual cortex journal April 2018
Mechanisms of noise robust representation of speech in primary auditory cortex journal April 2014
Visual projections routed to the auditory pathway in ferrets: receptive fields of visual neurons in primary auditory cortex journal September 1992
Functional Integration across Brain Regions Improves Speech Perception under Adverse Listening Conditions journal February 2007
Continuous online sequence learning with an unsupervised neural network model text January 2015
Complementary control of sensory adaptation by two types of cortical interneurons journal October 2015
Towards deep learning with segregated dendrites journal December 2017
Datasets used to train and test the Cortical Spectro-Temporal Model (CSTM). dataset January 2019
Experimental Results and Appendices: Cortical Spectro-Temporal Model (CSTM). dataset January 2019

Similar Records

A Computational Theory for the Emergence of Grammatical Categories in Cortical Dynamics
Journal Article · Thu Apr 16 00:00:00 EDT 2020 · Frontiers in Neural Circuits · OSTI ID:1559970

The ''neural'' phonetic typewriter
Journal Article · Tue Mar 01 00:00:00 EST 1988 · Computer; (United States) · OSTI ID:1559970

Neural speech recognition: continuous phoneme decoding using spatiotemporal representations of human cortical activity
Journal Article · Wed Aug 03 00:00:00 EDT 2016 · Journal of Neural Engineering · OSTI ID:1559970