Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Deep learning as a tool for neural data analysis: Speech classification and cross-frequency coupling in human sensorimotor cortex

Journal Article · · PLoS Computational Biology (Online)
 [1];  [1];  [2]
  1. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States); Univ. of California, Berkeley, CA (United States)
  2. Univ. of California, San Francisco, CA (United States)
A fundamental challenge in neuroscience is to understand what structure in the world is represented in spatially distributed patterns of neural activity from multiple single-trial measurements. This is often accomplished by learning a simple, linear transformations between neural features and features of the sensory stimuli or motor task. While successful in some early sensory processing areas, linear mappings are unlikely to be ideal tools for elucidating nonlinear, hierarchical representations of higher-order brain areas during complex tasks, such as the production of speech by humans. Here, we apply deep networks to predict produced speech syllables from a dataset of high gamma cortical surface electric potentials recorded from human sensorimotor cortex. We find that deep networks had higher decoding prediction accuracy compared to baseline models. Having established that deep networks extract more task relevant information from neural data sets relative to linear models (i.e., higher predictive accuracy), we next sought to demonstrate their utility as a data analysis tool for neuroscience. We first show that deep network's confusions revealed hierarchical latent structure in the neural data, which recapitulated the underlying articulatory nature of speech motor control. We next broadened the frequency features beyond high-gamma and identified a novel high-gamma-to-beta coupling during speech production. Finally, we used deep networks to compare task-relevant information in different neural frequency bands, and found that the high-gamma band contains the vast majority of information relevant for the speech prediction task, with little-to-no additional contribution from lower-frequency amplitudes. Together, these results demonstrate the utility of deep networks as a data analysis tool for basic and applied neuroscience.
Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
USDOE Office of Science (SC)
Grant/Contract Number:
AC02-05CH11231
OSTI ID:
1572056
Journal Information:
PLoS Computational Biology (Online), Journal Name: PLoS Computational Biology (Online) Journal Issue: 9 Vol. 15; ISSN 1553-7358
Publisher:
Public Library of ScienceCopyright Statement
Country of Publication:
United States
Language:
English

References (62)

Cortical gamma responses: Searching high and low journal January 2011
Decoding spoken phonemes from sensorimotor cortex with high-density ECoG grids journal October 2018
Let the Rhythm Guide You: Non-invasive Tracking of Cortical Communication Channels journal January 2016
Brain–computer interfaces for communication and control journal June 2002
A back-propagation programmed network that simulates response properties of a subset of posterior parietal neurons journal February 1988
Functional organization of human sensorimotor cortex for speech articulation journal February 2013
Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning journal July 2015
Perceptual restoration of masked speech in human cortex journal December 2016
Large-scale spatiotemporal spike patterning consistent with wave propagation in motor cortex journal May 2015
Propagating waves mediate information transfer in the motor cortex journal November 2006
The origin of extracellular fields and currents — EEG, ECoG, LFP and spikes journal May 2012
Oscillatory phase coupling coordinates anatomically dispersed functional cell assemblies journal September 2010
Direct classification of all American English phonemes using signals from functional speech motor cortex journal May 2014
Decoding flexion of individual fingers using electrocorticographic signals in humans journal October 2009
Decoding spoken words using local field potentials recorded from the cortical surface journal September 2010
Using the electrocorticographic speech network to control a brain–computer interface in humans journal April 2011
Modeling electroencephalography waveforms with semi-supervised deep belief nets: fast classification and anomaly measurement journal April 2011
Decoding vowels and consonants in spoken and imagined words using electrocorticographic signals in humans journal July 2011
Functional mapping of human sensorimotor cortex with electrocorticographic spectral analysis. I. Alpha and beta event- related desynchronization journal December 1998
Functional mapping of human sensorimotor cortex with electrocorticographic spectral analysis. II. Event-related synchronization in the gamma band journal December 1998
Somatic Motor and Sensory Representation in the Cerebral Cortex of man as Studied by Electrical Stimulation journal January 1937
Networks for approximation and learning journal January 1990
Emergence of Invariance and Disentanglement in Deep Representations conference February 2018
Single-trial spike trains in parietal cortex reveal discrete steps during decision-making journal July 2015
Note on Information Transfer Rates in Human Communication journal October 1998
Spike-triggered neural characterization journal February 2006
Different Origins of Gamma Rhythm and High-Gamma Activity in Macaque Visual Cortex journal April 2011
Power-Law Scaling in the Brain Surface Electric Potential journal December 2009
A Wireless Brain-Machine Interface for Real-Time Speech Synthesis journal December 2009
Random forests in non-invasive sensorimotor rhythm brain-computer interfaces: a practical and convenient non-linear classifier journal February 2016
Control of Spoken Vowel Acoustics and the Influence of Phonetic Context in Human Speech Sensorimotor Cortex journal September 2014
Spectral Changes in Cortical Surface Potentials during Motor Movement journal February 2007
Decoupling the Cortical Power Spectrum Reveals Real-Time Representation of Individual Finger Movements in Humans journal March 2009
Electrocorticographic representations of segmental features in continuous speech journal February 2015
Scikit-learn: Machine Learning in Python text January 2012
Convergent Learning: Do different neural networks learn the same representations? preprint January 2015
Opening the Black Box of Deep Neural Networks via Information preprint January 2017
Brain-to-text: Decoding spoken phrases from phone representations in the brain text January 2015
Visualizing and Understanding Convolutional Networks book January 2014
Comparison of neuronal responses in primate inferior-temporal cortex and feed-forward deep neural network model with regard to information processing of faces journal February 2021
Brain–computer interfaces for communication and control journal June 2002
Event-related EEG/MEG synchronization and desynchronization: basic principles journal November 1999
Beta-band oscillations—signalling the status quo? journal April 2010
Alpha-Beta and Gamma Rhythms Subserve Feedback and Feedforward Influences among Human Visual Cortical Areas journal January 2016
Performance-optimized hierarchical models predict neural responses in higher visual cortex journal May 2014
Enhanced Higgs Boson to τ + τ − Search with Deep Learning journal March 2015
Deep Residual Learning for Image Recognition conference June 2016
Feature extraction with stacked autoencoders for epileptic seizure detection conference August 2014
Pattern learning with deep neural networks in EMG-based speech recognition conference August 2014
Neural decoding of spoken vowels from human sensory-motor cortex with high-density electrocorticography conference August 2014
Emergence of Invariance and Disentanglement in Deep Representations conference February 2018
Learning to Control a Brain–Machine Interface for Reaching and Grasping by Primates journal October 2003
Control of Spoken Vowel Acoustics and the Influence of Phonetic Context in Human Speech Sensorimotor Cortex journal September 2014
Spectral-Temporal Receptive Fields of Nonlinear Auditory Neurons Obtained Using Natural Sounds journal March 2000
Broadband Shifts in Local Field Potential Power Spectra Are Correlated with Single-Neuron Spiking in Humans journal October 2009
Do We Know What the Early Visual System Does? journal November 2005
Spectral Changes in Cortical Surface Potentials during Motor Movement journal February 2007
Decoupling the Cortical Power Spectrum Reveals Real-Time Representation of Individual Finger Movements in Humans journal March 2009
Speech reconstruction from human auditory cortex with deep neural networks conference September 2015
Exploring how deep neural networks form phonemic categories conference September 2015
Brain-to-text: decoding spoken phrases from phone representations in the brain journal June 2015
Deep Residual Learning for Image Recognition preprint January 2015

Cited By (3)

Speech synthesis from ECoG using densely connected 3D convolutional neural networks journal April 2019
Decoding Movement From Electrocorticographic Activity: A Review journal December 2019
Neural ensemble dynamics in dorsal motor cortex during speech in people with paralysis journal December 2019

Similar Records

A Task-Optimized Neural Network Replicates Human Auditory Behavior, Predicts Brain Responses, and Reveals a Cortical Processing Hierarchy
Journal Article · Wed Apr 18 20:00:00 EDT 2018 · Neuron · OSTI ID:1538638

Deep learning approaches for neural decoding across architectures and recording modalities
Journal Article · Mon Dec 28 19:00:00 EST 2020 · Briefings in Bioinformatics · OSTI ID:1826323

Human Sensorimotor Cortex Control of Directly Measured Vocal Tract Movements during Vowel Production
Journal Article · Tue Mar 20 20:00:00 EDT 2018 · Journal of Neuroscience · OSTI ID:1485087

Related Subjects