Automated classification of acoustic startle reflex waveforms in young CBA/CaJ mice using machine learning

doi:10.1016/j.jneumeth.2020.108853

Journal of Neuroscience Methods

Volume 344, 1 October 2020, 108853

https://doi.org/10.1016/j.jneumeth.2020.108853 Get rights and content

Abstract

Background

The acoustic startle response (ASR) is a simple reflex that results in a whole body motor response after animals hear a brief loud sound and is used as a multisensory tool across many disciplines. Unfortunately, a method of how to record, process, and analyze ASRs has yet to be standardized, leading to high variability in the collection, analysis, and interpretation of ASRs within and between laboratories.

New method

ASR waveforms collected from young adult CBA/CaJ mice were normalized with features extracted from the waveform, the resulting power spectral density estimates, and the continuous wavelet transforms. The features were then partitioned into training and test/validation sets. Machine learning methods from different families of algorithms were used to combine startle-related features into robust predictive models to predict whether an ASR waveform is a startle or non-startle.

Results

An ensemble of several machine learning models resulted in an extremely robust model to predict whether an ASR waveform is a startle or non-startle with a mean ROC of 0.9779, training accuracy of 0.9993, and testing accuracy of 0.9301.

Comparison with existing methods

ASR waveforms analyzed using the threshold and RMS techniques resulted in over 80% of accepted startles actually being non-startles when manually classified versus 2.2% for the machine learning method, resulting in statistically significant differences in ASR metrics (such as startle amplitude and pre-pulse inhibition) between classification methods.

Conclusions

The machine learning approach presented in this paper can be adapted to nearly any ASR paradigm to accurately process, sort, and classify startle responses.

Introduction

The acoustic startle reflex (ASR) and modification using pre-pulse stimuli has consistently been one of the most used diagnostic tools for assessing laboratory animals’ internal state over the past several decades (Davis, 1984, Koch, 1999). Modification of this simple reflex resulting from a brief loud sound has been used in many neuropsychological disciplines for evaluating hearing (Lauer et al., 2017), tinnitus (Galazyuk and Hébert, 2015, Gerum et al., 2019, Turner et al., 2006), many neuropsychiatric disorders such as schizophrenia, bi-polar disorder, autism, and many other disorders that disrupt sensory-motor gating (Braff et al., 2001, Kohl et al., 2013, Kohl et al., 2014). Unfortunately, a standardized method of how to record, measure, process, and analyze startle reflex waveforms has yet to be standardized, which leads to high variability within and between laboratories using this assessment tool. Reasons for this variability within a lab are numerous but include animal awareness, habituation/sensitization rates, neural plasticity, anxiety/stress levels, and neuro-muscular interactions. Even more explanations for variability exists between laboratories and include: species/strain variations, loudspeaker/wav file sound quality, recording platform type, platform sensitivity, and mode of assessment which varies between whole body startle in mice/rats (Horlington, 1968, Grimsley et al., 2015), to the Preyer reflex (small ear movements) in guinea pigs (Berger et al., 2013, Böhmer, 1988), and eye blink reflex in humans and primate research (Säring and von Cramon, 1981, Filion et al., 1993, Grillon et al., 1997, Winslow et al., 2002). These factors result in variable ASR waveform measurements, which if processed/analyzed with the same methodology, could result in verifiable comparisons between experiments and laboratories using completely different techniques. This is due to likely differences in the characteristics of the startle waveforms which are included in the analysis.

Standardization of ASR waveform classification is extremely important for many reasons (Lauer et al., 2017). Previous work has included the elimination of the highest and lowest startle responses for each frequency (Longenecker and Galazyuk, 2011), using Grubb's test for outliers (Longenecker and Galazyuk, 2012), elimination of startle responses with maximum magnitude after startle presentation less than that before startle presentation, elimination of startle responses whose RMS after startle presentation is less than that before startle presentation, template matching (Grimsley et al., 2015), and discarding invalid trials containing movement in excess of a threshold prior to stimulus presentation (Schilling et al., 2017) as effective procedures for cleaning startle data by removing “non-startles,” which occur frequently in animal or humans continually presented with loud sounds. Non-startles occur more often when animals are presented low intensity startle stimuli as well as when an intense pre-pulse is presented prior to the startle stimulus (Longenecker et al., 2016). Since pre-pulse inhibition using stimuli placed before the startle elicitor is one of the most critical aspects of the startle reflex studied, proper classification of startles when pre-pulses are presented can dramatically influence the results. However, because each laboratory might utilize different hardware (sensors, filters, etc.) and/or waveform filter configurations which records animals startle-related movements, it makes it problematic to suggest a standardized template. Thus, an alternative approach should be used to classify startle response waveforms.

Machine learning is an evolving field of computational algorithms which learn from data in order to improve their performance on a particular classification or prediction task (Mjolsness and DeCoste, 2001, El Naqa and Murphy, 2015, Kotsiantis et al., 2006). Machine learning has been successfully used in genomics (Libbrecht and Noble, 2015), medical imaging and pathology (Komura and Ishikawa, 2019, Shen et al., 2017), as well as in the diagnosis and treatment of cancer (Goldenberg et al., 2019, Bejnordi et al., 2017). Machine learning could even be used in clinical settings to aid the navigation of the complex health trajectory of an individual patient through machine learning performed on data from many patients (Rajkomar et al., 2019). In this paper, we describe an automated, supervised machine learning approach to classify ASR waveforms acquired using various stimulus protocols and levels, eliciting ASRs of various magnitudes and shapes.

Section snippets

Experimental procedure

Mice were individually tested in wire mesh cages resting on a custom-built platform connected to piezoelectric transducers, located inside one of eight identical sound attenuated chambers. The custom-built 3D-printed platforms consist of a base and four piezoelectric transducers, one in each of the four quadrants of the platform. The piezoelectric transducers are in physical contact with the top animal compartment via four 3D printed rods. Acoustic stimuli were presented through Fostex model

Acoustic startle response waveform preprocessing

Due to the variability in the ASR waveforms shown in Fig. 1, all ASR waveforms were centered (by subtracting the mean) and scaled (by dividing the centered waveform by the standard deviation) using the mean and standard deviation of the ASR waveform before the SES is presented (t < 0), producing normalized ASR waveforms with units of the number of standard deviations from the mean before the SES is presented (Halaki and Gi, 2012, Lee et al., 2019, Lara-Cueva et al., 2016, Hartmann et al., 2019

Results

Several classification performance metrics are presented in Table 2 including the mean receiver operating characteristics (ROC) and the area under the ROC curve (AUC) when predicting a normalized ASR waveform to be a startle or non-startle with the training dataset as well as the accuracy of predicting the correct classification on the testing dataset for each of the individual machine learning methods described above. All mean ROCs were over 0.95 with random forests demonstrating the greatest

Discussion

The startle reflex has been measured for almost 100 years (Landis and Hunt, 1939) and is still used in many disciplines across hundreds of laboratories as a quick and effective measure of the internal state of animals and humans. However, ASR-related data between these disciplines, or even between researchers in the same discipline, are not easily compared. Since resource and data sharing is a key component for making progress in any scientific field, it was our goal to develop a universal

Conflict of interest

The authors declare that there is no conflict of interest.

Acknowledgements

This work was supported by the National Institutes of Health [NIH-NIA AG00954]. The authors would like to acknowledge the use of the services provided by Research Computing at the University of South Florida. We thank Dimitri Brunnell and Mary Reith for their technical assistance and oversight of behavioral experiments and Rachal Love for oversight of animal care.

References (76)

J. Allen
Short term spectral analysis, synthesis, and modification by discrete Fourier transform
IEEE Trans. Acoust. Speech Signal Process.
(1977)
B.E. Bejnordi et al.
Diagnostic assessment of deep learning algorithms for detection of lymph node metastases in women with breast cancer
JAMA
(2017)
J.I. Berger et al.
A novel behavioural approach to detecting tinnitus in the guinea pig
J. Neurosci. Methods
(2013)
J. Biesiada et al.
Feature selection for high-dimensional data – a Pearson redundancy based filter
Adv. Soft Comput.
(2006)
P. Bloomfield
Fourier Analysis of Time Series: An Introduction
Wiley Series in Probability and Statistics
(2011)
A. Böhmer
The Preyer reflex – an easy estimate of hearing function in guinea pigs
Acta Oto-Laryngol.
(1988)
D.L. Braff et al.
Impact of prepulse characteristics on the detection of sensorimotor gating deficits in schizophrenia
Schizophr. Res.
(2001)
L. Breiman
Bagging predictors
Mach. Learn.
(1995)
L. Breiman
Prediction games and arcing algorithms
Neural Comput.
(1999)
L. Breiman
Random forests
Mach. Learn.
(2000)

L. Breiman

Population theory for boosting ensembles

Ann. Stat.

(2003)

J. Cai et al.

Feature selection in machine learning: a new perspective

Neurocomputing

(2018)

R. Caruana et al.

Ensemble Selection from Libraries of Models

(2002)

T. Chen et al.

XGBoost

(2014)

T. Chen et al.

xgboost: Extreme Gradient Boosting. Technical Report

(2018)

C. Cortes et al.

Support-vector networks

Mach. Learn.

(1995)

A. Cutler et al.

Tree-based methods

High-Dimensional Data Analysis in Cancer Research

(2008)

M. Davis

The mammalian startle response

Neural Mechanisms of Startle Behavior

(1984)

Z.A. Deane-Mayer et al.

caretEnsemble: Ensembles of Caret Models

(2016)

G.T. Dietterich

An experimental comparison of three methods for constructing ensembles of decision trees: bagging, boosting, and randomization

Mach. Learn.

(1996)

J.A. Dobson et al.

An Introduction to Generalized Linear Models

(2018)

I. El Naqa et al.

What is machine learning?

Machine Learning in Radiation Oncology

(2015)

D.L. Filion et al.

Modification of the acoustic startle-reflex eyeblink: a tool for investigating early and late attentional processes

Biol. Psychol.

(1993)

A. Galazyuk et al.

Gap-prepulse inhibition of the acoustic startle reflex (GPIAS) for tinnitus assessment: current status and future directions

Front. Neurol.

(2015)

R. Gerum et al.

Open(G)PIAS: an open-source solution for the construction of a high-precision acoustic startle response setup for tinnitus screening and threshold estimation in rodents

Front. Behav. Neurosci.

(2019)

S.L. Goldenberg et al.

A new era: artificial intelligence and machine learning in prostate cancer

Nat. Rev. Urol.

(2019)

A. Golibagh Mahyari

Spectral estimation using modified Daniell method

Int. J. Electron.

(2010)

C. Grillon et al.

Darkness facilitates the acoustic startle reflex in humans

Biol. Psychiatry

(1997)

C.A. Grimsley et al.

An improved approach to separating startle data from noise

J. Neurosci. Methods

(2015)

I. Guyon et al.

An introduction to variable and feature selection

Mach. Learn. Res.

(2003)

M. Halaki et al.

Normalization of EMG signals: to normalize or not to normalize and what to normalize to?

Computational Intelligence in Electromyography Analysis – A Perspective on Current Applications and Future Challenges

(2012)

V. Hartmann et al.

Quantitative comparison of photoplethysmographic waveform characteristics: effect of measurement site

Front. Physiol.

(2019)

T. Hastie et al.

The Elements of Statistical Learning

Springer Series in Statistics

(2009)

M. Horlington

A method for measuring acoustic startle response latency and magnitude in rats: detection of a single stimulus effect using latency measurements

Physiol. Behav.

(1968)

W.D. Hosmer et al.

Applied Logistic Regression

(2000)

J.E. Jeskey et al.

Modulation of prepulse inhibition by an augmented acoustic environment in DBA/2J mice

Behav. Neurosci.

(2000)

R. Joober et al.

Provisional mapping of quantitative trait loci modulating the acoustic startle response and prepulse inhibition of acoustic startle

Neuropsychopharmacology

(2002)

M. Koch

The neurobiology of startle

Prog. Neurobiol.

(1999)

Cited by (5)

Universal automated classification of the acoustic startle reflex using machine learning
2023, Hearing Research
Citation Excerpt :
Thus, a more systematic and generalized approach to SR analysis would aid in comparing results from different laboratories and species and advance the discipline. To address this issue, we have adapted a machine learning model to automatically classify SR waveforms from various species, paradigms, and modalities (Fawcett et al., 2020; 2021). The relative magnitude of neuro-muscular responses varies between species; thus, the stereotypical startle response waveform varies between species.
The startle reflex (SR), a robust, motor response elicited by an intense auditory, visual, or somatosensory stimulus has been widely used as a tool to assess psychophysiology in humans and animals for almost a century in diverse fields such as schizophrenia, bipolar disorder, hearing loss, and tinnitus. Previously, SR waveforms have been ignored, or assessed with basic statistical techniques and/or simple template matching paradigms. This has led to considerable variability in SR studies from different laboratories, and species. In an effort to standardize SR assessment methods, we developed a machine learning algorithm and workflow to automatically classify SR waveforms in virtually any animal model including mice, rats, guinea pigs, and gerbils obtained with various paradigms and modalities from several laboratories. The universal features common to SR waveforms of various species and paradigms are examined and discussed in the context of each animal model. The procedure describes common results using the SR across species and how to fully implement the open-source R implementation. Since SR is widely used to investigate toxicological or pharmaceutical efficacy, a detailed and universal SR waveform classification protocol should be developed to aid in standardizing SR assessment procedures across different laboratories and species. This machine learning-based method will improve data reliability and translatability between labs that use the startle reflex paradigm.
Machine learning, waveform preprocessing and feature extraction methods for classification of acoustic startle waveforms
2021, MethodsX
Citation Excerpt :
Fig. 22 shows the mean CWT power across all times at 2048 Hz versus that at 100 Hz. A total of 17 features were extracted from the ASR waveforms as described in Fawcett et al. [[4], Table 1] with their distributions presented in Figs. 11,14,16–20, and 22. Feature variability was assessed to ascertain which, if any, features have little to no variability.
The acoustic startle response (ASR) is an involuntary muscle reflex that occurs in response to a transient loud sound and is a highly-utilized method of assessing hearing status in animal models. Currently, a high level of variability exists in the recording and interpretation of ASRs due to the lack of standardization for collecting and analyzing these measures. An ensembled machine learning model was trained to predict whether an ASR waveform is a startle or non-startle using highly-predictive features extracted from normalized ASR waveforms collected from young adult CBA/CaJ mice. Features were extracted from the normalized waveform as well as the power spectral density estimates and continuous wavelet transforms of the normalized waveform. Machine learning models utilizing methods from different families of algorithms were individually trained and then ensembled together, resulting in an extremely robust model.
- •
  ASR waveforms were normalized using the mean and standard deviation computed before the startle elicitor was presented
- •
  9 machine learning algorithms from 4 different families of algorithms were individually trained using features extracted from the normalized ASR waveforms
- •
  Trained machine learning models were ensembled to produce an extremely robust classifier
Signal-in-Noise Detection Across the Lifespan in a Mouse Model of Presbycusis
2023, SSRN
Implementation of Ensemble Deep Learning Coupled with Remote Sensing for the Quantitative Analysis of Changes in Arable Land Use in a Mining Area
2021, Journal of the Indian Society of Remote Sensing
Congenital Deafness and Recent Advances Towards Restoring Hearing Loss
2021, Current Protocols

View full text

Automated classification of acoustic startle reflex waveforms in young CBA/CaJ mice using machine learning

Abstract

Background

New method

Results

Comparison with existing methods

Conclusions

Introduction

Section snippets

Experimental procedure

Acoustic startle response waveform preprocessing

Results

Discussion

Conflict of interest

Acknowledgements

Short term spectral analysis, synthesis, and modification by discrete Fourier transform

IEEE Trans. Acoust. Speech Signal Process.

Diagnostic assessment of deep learning algorithms for detection of lymph node metastases in women with breast cancer

JAMA

A novel behavioural approach to detecting tinnitus in the guinea pig

J. Neurosci. Methods

Feature selection for high-dimensional data – a Pearson redundancy based filter

Adv. Soft Comput.

Fourier Analysis of Time Series: An Introduction

Wiley Series in Probability and Statistics

The Preyer reflex – an easy estimate of hearing function in guinea pigs

Acta Oto-Laryngol.

Impact of prepulse characteristics on the detection of sensorimotor gating deficits in schizophrenia

Schizophr. Res.

Bagging predictors

Mach. Learn.

Prediction games and arcing algorithms

Neural Comput.

Random forests

Mach. Learn.

Population theory for boosting ensembles

Ann. Stat.

Feature selection in machine learning: a new perspective

Neurocomputing

Ensemble Selection from Libraries of Models

XGBoost

xgboost: Extreme Gradient Boosting. Technical Report

Support-vector networks

Mach. Learn.

Tree-based methods

High-Dimensional Data Analysis in Cancer Research

The mammalian startle response

Neural Mechanisms of Startle Behavior

caretEnsemble: Ensembles of Caret Models

An experimental comparison of three methods for constructing ensembles of decision trees: bagging, boosting, and randomization

Mach. Learn.

An Introduction to Generalized Linear Models

What is machine learning?

Machine Learning in Radiation Oncology

Modification of the acoustic startle-reflex eyeblink: a tool for investigating early and late attentional processes

Biol. Psychol.

Gap-prepulse inhibition of the acoustic startle reflex (GPIAS) for tinnitus assessment: current status and future directions

Front. Neurol.

Open(G)PIAS: an open-source solution for the construction of a high-precision acoustic startle response setup for tinnitus screening and threshold estimation in rodents

Front. Behav. Neurosci.

A new era: artificial intelligence and machine learning in prostate cancer

Nat. Rev. Urol.

Spectral estimation using modified Daniell method

Int. J. Electron.

Darkness facilitates the acoustic startle reflex in humans

Biol. Psychiatry

An improved approach to separating startle data from noise

J. Neurosci. Methods

An introduction to variable and feature selection

Mach. Learn. Res.

Normalization of EMG signals: to normalize or not to normalize and what to normalize to?

Computational Intelligence in Electromyography Analysis – A Perspective on Current Applications and Future Challenges

Quantitative comparison of photoplethysmographic waveform characteristics: effect of measurement site

Front. Physiol.

The Elements of Statistical Learning

Springer Series in Statistics

A method for measuring acoustic startle response latency and magnitude in rats: detection of a single stimulus effect using latency measurements

Physiol. Behav.

Applied Logistic Regression

Modulation of prepulse inhibition by an augmented acoustic environment in DBA/2J mice