Method Article

Applying machine learning EEG signal classification to emotion‑related brain anticipatory activity

[version 1; peer review: 2 approved with reservations]
PUBLISHED 10 Mar 2020

Abstract

Machine learning approaches have been fruitfully applied to several neurophysiological signal classification problems. Given the relevance of emotion in human cognition and behaviour, an important application of machine learning is emotion identification based on neurophysiological activity. Nonetheless, results in the literature vary widely depending on how neuronal activity is measured and on the signal features and classifier types used. The present work aims to provide new methodological insight into machine learning applied to emotion identification based on electrophysiological brain activity. To this end, we analysed previously recorded EEG activity measured while emotional stimuli of high and low arousal (auditory and visual) were presented to a group of healthy participants. Our target signal to classify was the pre-stimulus-onset brain activity. Classification performance of three different classifiers (linear discriminant analysis, support vector machine and k-nearest neighbour) was compared using both spectral and temporal features. Furthermore, we also contrasted the classifiers' performance with static and dynamic (time-evolving) features. The results show a clear increase in classification accuracy with temporal dynamic features. In particular, the support vector machine classifier with temporal features showed the best accuracy (63.8%) in classifying high vs low arousal auditory stimuli.

Keywords

EEG, brain anticipatory activity, machine learning, emotion

Introduction

In recent decades, the view of the brain has shifted from that of a passive elaborator of stimuli to that of an active builder of reality. In other words, the brain extracts information from the environment and builds internal models of external reality. These models are used to optimize the behavioural outcome when reacting to upcoming stimuli1–4.

One of the main theoretical models assumes that, in order to regulate bodily reactions, the brain runs an internal model of the body in the world, as described by the embodied simulation framework5. A much-investigated hypothesis is that the brain functions as a Bayesian filter for incoming sensory input; that is, it generates predictions, based on previous experience, about what to expect from interaction with the social and natural environment, including emotion6. In this light, emotions can be considered not only as reactions to the external world, but also as partially shaped by our internal representation of the environment, which helps us to anticipate possible scenarios and therefore to regulate our behaviour.

The construction model of emotion7 argues that human beings actively build their emotions in relation to everyday life and the social context in which they are placed. We actively generate a familiar range of emotions in our reality, based on their usefulness and relevance in our environment. In this scenario, in a familiar context we are able to anticipate which emotions will probably be elicited, depending on our model. Consequently, the study of the anticipation of, and preparation for, forthcoming stimuli may represent a precious window onto the individual's internal model and emotion construction process, resulting in a better understanding of human behaviour.

A strategy for studying preparatory activity relies on experimental paradigms in which cues are provided about the forthcoming stimuli, allowing the investigation of the brain activity dedicated to the elaboration of incoming stimuli8,9. A cueing experiment in which the emotional valence of the forthcoming stimuli could be predicted showed that the brain's anticipatory activation facilitates, for example, successful reappraisal via reduced anticipatory prefrontal cognitive elaboration and better integration of affective information in the paralimbic and subcortical systems10. Furthermore, preparation for forthcoming emotional stimuli also has relevant implications for clinical psychological conditions, such as mood disorders or anxiety11,12.

Recently, the study of brain anticipatory activity has been extended to statistically unpredictable stimuli13–15, providing experimental hints of specific anticipatory activity before stimuli are randomly presented. Starting from the abovementioned studies, we focused on extending brain anticipatory activity to statistically unpredictable emotional stimuli.

According to the so-called dimensional model, emotion can be defined in terms of three different attributes (or dimensions): valence, arousal and dominance. Valence measures the positivity of the emotion (ranging from unpleasant to pleasant), arousal measures the activation level (ranging from boredom to frantic excitement) and dominance measures the controllability (i.e. the sense of control)16.

Emotions can be estimated from various physiological signals17, such as skin conductance, the electrocardiogram (ECG) and the electroencephalogram (EEG). The latter has received considerable attention in the last decade, with the introduction of several machine learning and signal processing techniques originally developed in other contexts, such as brain-computer interfaces18. Emotion recognition has thus been recast as a machine learning problem, in which suitable EEG-related features are used as inputs to specific classifiers.

The most common features belong to the spectral domain, in the form of spectral powers in the delta, theta, alpha and gamma bands19, as well as power spectral density (PSD) bins20. The remaining features belong to the time domain, in the form of event-related de/synchronizations (ERD/ERS) and event-related potentials (ERPs)19, as well as shape-related indices such as the Hjorth parameters and the fractal dimension20.

The most commonly used classifier is the support vector machine (SVM) with the radial basis function (RBF) kernel, followed by the k-nearest neighbour (kNN) and the linear discriminant analysis (LDA)19,20. Finally, most of the classifiers are implemented as non-adaptive (i.e. static)19, in contrast to the dynamic versions that take into account the temporal variability of the features21.

The classification performances are very variable because of the different features and classifiers adopted. The following examples are taken from 19 – in particular, from the subset (17 out of 63) of reviewed papers that focused on arousal classification. Using an SVM (RBF kernel) and spectral features (e.g. short-time Fourier transform), Lin and colleagues obtained 94.4% accuracy (i.e. percentage of correct classifications)22, while using similar spectral features (e.g. PSD) and classifier (SVM with no kernel), Koelstra and colleagues obtained an accuracy of 55.7%23. Liu and Sourina obtained an accuracy of 76.5% using temporal features (e.g. fractal dimension) with an SVM (no kernel)24, while Murugappan and Murugappan obtained an accuracy of 63% using similar temporal features and an SVM with a polynomial kernel25. Finally, Thammasan and colleagues obtained an accuracy of 85.3% using spectral features (e.g. PSD), but with a kNN (with k=3)26. All the classifiers were static.

The purpose of the present work is to provide new methodological advancements on the machine learning classification of emotions, based on brain anticipatory activity. For this purpose, we compared the performances of three different classifiers (namely LDA, SVM and kNN) trained using two types of EEG features (namely, spectral and temporal). In addition, each classifier was dynamically trained, to take into account the temporal variability of the features. The results provide useful insights regarding the best classifier-feature configuration to better discriminate emotion-related brain anticipatory activity.

A more detailed description of the machine learning algorithms is provided as Extended data27.

Classification performances

In introducing pattern recognition, we underlined that classifiers are built using a set of previously annotated, class-prototypical features: the training set. It is common practice to extract from the annotated features a subset (the test set) and use it to evaluate the performance of the trained classifier – but not to train it.

Since the training set is limited, any specific train/test split introduces a bias in both the training and the performance evaluation. This can be avoided by following the so-called k-fold cross-validation scheme. The original training set D is partitioned into k disjoint, equally sized sets, $D = \bigcup_{i=1}^{k} D_i$. The classifier is then trained k times, each time using a different partition $D_j$ as the test set and the remaining $\bigcup_{i \neq j} D_i$ as the training set. Finally, the overall performance is computed as the average over the k single performances28 (pp. 483–485).
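As an illustration of the scheme, the following Python sketch (using scikit-learn; the analyses in this paper were actually run in MATLAB/Octave, see Software availability) computes the cross-validated accuracy of a classifier. The data and classifier here are placeholders:

```python
import numpy as np
from sklearn.model_selection import KFold
from sklearn.svm import SVC

def kfold_accuracy(X, y, classifier, k=10):
    """Average accuracy over k train/test splits of the training set D."""
    kf = KFold(n_splits=k, shuffle=True, random_state=0)
    accuracies = []
    for train_idx, test_idx in kf.split(X):
        # Train on the union of the k-1 remaining partitions...
        classifier.fit(X[train_idx], y[train_idx])
        # ...and evaluate on the held-out partition D_j
        accuracies.append(classifier.score(X[test_idx], y[test_idx]))
    return np.mean(accuracies)

# Example with random features (2 classes, 100 trials, 256-dimensional)
rng = np.random.default_rng(0)
X = rng.standard_normal((100, 256))
y = rng.integers(0, 2, 100)
print(kfold_accuracy(X, y, SVC(kernel="rbf", C=1.0)))
```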

By the general term performance, we mostly refer to the classification accuracy ACC, defined as the ratio between the number of correctly classified features and the total number of features. Introducing the chance-level accuracy ACC0, determined by the proportion of features in each class (i.e. how balanced the training set is), we can additionally define as performance the Kappa statistic: κ = (ACC − ACC0)/(1 − ACC0)29.

Compared to ACC, κ is a more robust performance measure, since it is normalized by the class imbalance. Another way to take class imbalance into account is to compare (using, for example, a t-test) the k cross-validated accuracies against k random accuracies obtained from a random classifier29,35.
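A minimal sketch of both performance measures, assuming hypothetical accuracy values and the κ definition above:

```python
import numpy as np
from scipy import stats

def kappa(acc, acc0):
    """Kappa statistic: accuracy normalized by the chance level ACC0."""
    return (acc - acc0) / (1.0 - acc0)

# k cross-validated accuracies of a trained classifier (hypothetical values)
acc_clf = np.array([0.62, 0.60, 0.65, 0.58, 0.63, 0.61, 0.64, 0.59, 0.62, 0.60])
# k accuracies of a classifier that guesses each class with probability 0.5
acc_rand = np.array([0.49, 0.52, 0.48, 0.51, 0.50, 0.47, 0.53, 0.50, 0.49, 0.51])

print(kappa(acc_clf.mean(), acc0=0.5))
# Right-tailed two-sample t-test: is the classifier better than chance?
t, p = stats.ttest_ind(acc_clf, acc_rand, alternative="greater")
print(t, p)
```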

Dynamic classifiers

To classify a time-varying signal (i.e. to perform a dynamic classification), an ordered sequence of features $\{x_i\}_{i=1}^{N}$ (i.e. temporal features), corresponding to N adjacent temporal windows, is extracted. The temporal features are fed into either a "dynamic" classifier, such as the Hidden Markov Model (HMM)21, or an ordered sequence of "static" classifiers $\{f_i\}_{i=1}^{N}$36–39. The former fully takes the signal's temporal variability into account, since it uses the entire sequence during the training phase. The latter trains each static classifier $f_i$ using only the corresponding features $x_i$, but provides an ordered sequence of accuracies $\{ACC_i\}_{i=1}^{N}$, where each $ACC_i$ corresponds to $f_i$.
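The following sketch illustrates the second approach (one static classifier per temporal window); the array shapes and classifier choice are illustrative assumptions, not the paper's exact configuration:

```python
import numpy as np
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.model_selection import cross_val_score

def dynamic_accuracy_sequence(X_seq, y, k=10):
    """Train one static classifier per temporal window.

    X_seq: array of shape (N_windows, n_trials, n_features) holding the
    ordered temporal features {x_i}; returns one accuracy per window.
    """
    acc_seq = []
    for X_i in X_seq:                      # one feature set per window
        clf = LinearDiscriminantAnalysis()
        scores = cross_val_score(clf, X_i, y, cv=k)
        acc_seq.append(scores.mean())      # ACC_i for classifier f_i
    return np.array(acc_seq)

rng = np.random.default_rng(1)
X_seq = rng.standard_normal((5, 60, 20))   # 5 windows, 60 trials, 20 features
y = rng.integers(0, 2, 60)
print(dynamic_accuracy_sequence(X_seq, y))
```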

Feature selection

As stated in the previous sections, the curse of dimensionality arises when the number of available training features is small compared to the feature dimension m. In such situations, the parameter estimation becomes problematic (see for example the problem of the singularity of the estimated covariance matrix described in the LDA sub-section) and the trained classifier usually underperforms.

As a rule of thumb, the number of training features N should grow with the dimensionality m (e.g. N = 10m), with the ratio N/m growing with the complexity of the classifier40. For a fixed feature dimension m, linear classifiers require, for example, a smaller training set. Additionally, even with an adequate training set, feature dimensionality impacts both the training and the classification speed. In fact, as stated in the SVM sub-section, linear classification requires O(m) multiplications and sums to compute each scalar product. By reducing the feature dimensionality by means of so-called feature selection algorithms, a classifier can be made more robust (i.e. less sensitive to the curse of dimensionality) and more efficient (in terms of computational speed).

Feature selection can be broadly described as a mapping function $s: \mathbb{R}^m \to \mathbb{R}^n$ such that:

$s(x) = (x_{s_1}, x_{s_2}, \ldots, x_{s_n})^T$    (13)

where n < m and {s1, s2, ..., sn} ⊂ {1, 2, ..., m}. In other words, a feature selection algorithm projects the original feature vector onto a lower-dimensional subspace defined by a subset of scalar features. The best subspace, selected among all $2^m$ possible subsets, should not significantly decrease the classification performance, either globally (i.e. how the features are classified overall) or locally (i.e. how a single feature is classified)41.

Feature selection algorithms can be broadly grouped according to the following criteria42:

1. Label information. Supervised algorithms take into account the class information (i.e. which training features belong to which class), while unsupervised algorithms do not.

2. Search strategy. Filter algorithms (also known as classifier-independent methods) are based on a two-step "ranking and selecting" criterion: scalar features are first ranked according to a proper criterion; then only the "best" ones are selected. Wrapper methods (also known as classifier-dependent methods) use the selected classifier itself, following an "ad hoc" approach: the selected scalar features are those that give the best classification performance.

An example of a supervised filter algorithm is the biserial correlation coefficient. Given a training set D composed of $N_+$ features belonging to class +1 and $N_-$ features belonging to class –1, the biserial correlation coefficient for the k-th scalar feature $x_k$ is given by43:

$r_k^2 = \left[ \frac{\sqrt{N_+ N_-}}{N_+ + N_-} \, \frac{m(x_k^+) - m(x_k^-)}{s(x_k)} \right]^2$    (14)

where m(·) and s(·) are the sample mean and sample standard deviation operators, respectively, and $x_k^+$, $x_k^-$ are the subsets of $x_k$ belonging to classes +1 and –1, respectively. The total feature score is obtained by summing the m coefficients of the scalar features. Once the scores $r_k^2$ are sorted in descending order, feature selection simply consists of keeping the first scalar features whose cumulative score reaches a given percentage (e.g. 95%) of the total feature score.
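A possible NumPy implementation of this "ranking and selecting" scheme, under the reconstruction of Equation (14) given above (the data are synthetic and the helper names are our own):

```python
import numpy as np

def biserial_r2(X, y):
    """Squared biserial correlation of each scalar feature with the labels.

    X: (n_trials, m) feature matrix; y: labels in {+1, -1}.
    """
    Xp, Xm = X[y == +1], X[y == -1]
    Np, Nm = len(Xp), len(Xm)
    r = (np.sqrt(Np * Nm) / (Np + Nm)
         * (Xp.mean(axis=0) - Xm.mean(axis=0)) / X.std(axis=0, ddof=1))
    return r ** 2

def select_by_score(X, y, fraction=0.90):
    """Keep the top-ranked features reaching `fraction` of the total score."""
    scores = biserial_r2(X, y)
    order = np.argsort(scores)[::-1]               # descending rank
    cumulative = np.cumsum(scores[order]) / scores.sum()
    n_keep = int(np.searchsorted(cumulative, fraction) + 1)
    return order[:n_keep]                          # indices s_1, ..., s_n

rng = np.random.default_rng(2)
X = rng.standard_normal((80, 256))
y = rng.choice([-1, 1], 80)
X[y == 1, :5] += 1.0          # make the first 5 features discriminative
print(select_by_score(X, y, 0.90)[:10])
```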

Methods

Ethical statement

The data of the present study were obtained in the experiment described in 37, which was approved by the Ethical Committee of the Department of General Psychology, University of Padova (No. 2278). Before taking part in the experiment, each subject gave his/her informed consent in writing after having read a description of the experiment. In line with department policies, this re-analysis of an original study approved by the ethics committee did not require new ethical approval.

Stimuli and experimental paradigm

In the present study, we reanalysed the EEG data27 from the experiment described in 37, applying an original static and dynamic feature selection and classification using the three different algorithms explained above.

A more detailed description of the experimental design is available in the original study. Here we describe only the main characteristics.

Two sensory categories of stimuli (visual and auditory) were selected according to their arousal value from two standardized international archives. Visual stimuli consisted of 28 faces (14 neutral and 14 fearful) extracted from the NIMSTIM archive44, whereas auditory stimuli consisted of 28 sounds (14 low- and 14 high-arousal) chosen from the International Affective Digitized Sounds (IADS) archive45.

All 28 healthy adult participants were presented with two different experimental tasks, delivered in separate blocks. The two tasks are described in Figure 1, which illustrates the sequence of events and the temporal trial structure of the passive (top) and active (bottom) tasks. Within each task, the stimuli were randomly presented and equally distributed according to both sensory category (faces or sounds) and arousal level (high or low). Full details of these tasks have been described previously in 37.


Figure 1. Experimental tasks.

EEG recording

During the entire experiment, the EEG signal was continuously recorded using a Geodesic high-density EEG system (EGI GES-300) through a pre-cabled 128-channel HydroCel Geodesic Sensor Net (HCGSN-128) referenced to the vertex (Cz), with a sampling rate of 500 Hz. The impedance was kept below 60 kΩ for each sensor. To reduce the presence of EOG artefacts, subjects were instructed to limit both eye blinks and eye movements as much as possible.

EEG preprocessing

The continuous EEG signal was off-line band-pass filtered (0.1–45 Hz) using a Hamming-windowed sinc finite impulse response (FIR) filter (order = 16500) and then downsampled to 250 Hz. The EEG was epoched from 200 ms before the cue onset to the stimulus onset. The initial epochs were thus 1300 ms long from the cue onset, including 300 ms of cue/fixation-cross presentation and 1000 ms of interstimulus interval (ISI).

All epochs were visually inspected to remove bad channels and rare artefacts. Artefact-reduced data were then subjected to independent component analysis (ICA)45. All independent components were visually inspected, and those that related to eye blinks, eye movements, and muscle artefacts, according to their morphology and scalp distribution, were discarded. The remaining components were back-projected to the original electrode space to obtain cleaner EEG epochs.

The remaining ICA-cleaned epochs that still contained excessive noise or drift (±100 μV at any electrode) were rejected, and the previously removed bad channels were reconstructed. Data were then re-referenced to the common average reference (CAR) and the epochs were baseline-corrected by subtracting the mean signal amplitude in the pre-stimulus interval. From the original 1300 ms epochs, the final epochs retained only the 1000 ms ISI.
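The rejection, re-referencing and baseline steps could be sketched as follows; this is an illustrative NumPy version, not the authors' pipeline, and the 200 ms (50-sample) baseline window is our assumption:

```python
import numpy as np

def clean_rereference_baseline(epochs, threshold_uv=100.0, n_baseline=50):
    """Reject epochs exceeding ±threshold, re-reference to CAR,
    and subtract the pre-stimulus baseline mean.

    epochs: (n_epochs, n_channels, n_samples) array in microvolts;
    n_baseline: number of leading samples used as the baseline window
    (assumed here to be the 200 ms before cue onset, i.e. 50 samples).
    """
    # Reject epochs with excessive amplitude at any electrode
    keep = np.all(np.abs(epochs) <= threshold_uv, axis=(1, 2))
    epochs = epochs[keep]
    # Common average reference: subtract the instantaneous channel mean
    epochs = epochs - epochs.mean(axis=1, keepdims=True)
    # Baseline correction: subtract the per-channel baseline mean
    baseline = epochs[:, :, :n_baseline].mean(axis=2, keepdims=True)
    return epochs - baseline, keep

rng = np.random.default_rng(3)
epochs = 20 * rng.standard_normal((56, 128, 325))  # 1300 ms at 250 Hz
cleaned, keep = clean_rereference_baseline(epochs)
print(cleaned.shape, keep.sum())
```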

Static spectral features

From each epoch and each channel k, the PSD was estimated by Welch's periodogram using 250-point Hamming windows with 50% overlap. The PSD was first log-transformed to compensate for the skewness of the power values46; then the spectral bins corresponding to the beta, alpha and theta bands – defined as 13–30 Hz, 6–13 Hz and 4–6 Hz, respectively47 – were summed together. Finally, the beta, alpha and theta total powers were computed as:

$\beta_{tot}^{k} = \sum_{i \in [13;30]} PSD_k(i), \qquad \alpha_{tot}^{k} = \sum_{i \in [6;13]} PSD_k(i), \qquad \theta_{tot}^{k} = \sum_{i \in [4;6]} PSD_k(i)$    (15)

As a measure of emotional arousal, we computed the ratio between the beta and alpha total powers, $\beta_{tot}^k/\alpha_{tot}^k$48, while as a measure of cognitive arousal we computed the ratio between the theta and beta total powers, $\theta_{tot}^k/\beta_{tot}^k$49.

For each epoch, the feature vector (with a dimensionality of 256) was obtained by concatenating the beta-over-alpha and theta-over-beta ratios of all the channels:

$v \equiv \left[ \frac{\beta_{tot}^{1}}{\alpha_{tot}^{1}}, \frac{\theta_{tot}^{1}}{\beta_{tot}^{1}}, \frac{\beta_{tot}^{2}}{\alpha_{tot}^{2}}, \frac{\theta_{tot}^{2}}{\beta_{tot}^{2}}, \ldots, \frac{\beta_{tot}^{128}}{\alpha_{tot}^{128}}, \frac{\theta_{tot}^{128}}{\beta_{tot}^{128}} \right]$    (16)
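A sketch of this feature extraction using SciPy's Welch estimator; the exact band-edge conventions at the boundary bins are an assumption:

```python
import numpy as np
from scipy.signal import welch

def spectral_feature(epoch, fs=250):
    """Beta/alpha and theta/beta ratios per channel, concatenated.

    epoch: (n_channels, n_samples) array -> feature of length 2*n_channels.
    """
    # Welch periodogram: 250-point Hamming windows, 50% overlap
    freqs, psd = welch(epoch, fs=fs, window="hamming",
                       nperseg=250, noverlap=125, axis=-1)
    psd = np.log(psd)  # log-transform to compensate the skewness
    theta = psd[:, (freqs >= 4) & (freqs < 6)].sum(axis=1)
    alpha = psd[:, (freqs >= 6) & (freqs < 13)].sum(axis=1)
    beta = psd[:, (freqs >= 13) & (freqs <= 30)].sum(axis=1)
    # Interleave beta/alpha (emotional arousal) and theta/beta (cognitive)
    return np.column_stack([beta / alpha, theta / beta]).ravel()

rng = np.random.default_rng(4)
epoch = rng.standard_normal((128, 250))    # 1 s ISI at 250 Hz
print(spectral_feature(epoch).shape)       # (256,)
```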

Static temporal features

It has previously been shown that the arousal level (high or low) can be estimated from contingent negative variation potentials37. The feature extraction procedure therefore follows the classical approach for event-related potentials50. Each epoch from each channel was first band-pass filtered (0.05–10 Hz) using a zero-phase 2nd-order Butterworth filter and decimated to a sampling frequency of 20 Hz. The EEG signal was then normalized (i.e. z-scored) according to its temporal mean and temporal standard deviation:

$x_i(t_k) = (\tilde{x}_i(t_k) - m_i)/s_i$

where $\tilde{x}_i(t_k)$ is the raw signal from the i-th channel at time point $t_k$, and $m_i$ and $s_i$ are, respectively, the temporal mean and temporal standard deviation of the i-th channel. For each epoch, the feature vector (with a dimensionality of 2560) was obtained by concatenating the normalized signals from all channels:

$v \equiv [x_1(t_1), x_1(t_2), \ldots, x_1(t_{20}), \ldots, x_{128}(t_1), x_{128}(t_2), \ldots, x_{128}(t_{20})]$    (17)
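An illustrative SciPy version of this procedure; since 250 Hz to 20 Hz is not an integer decimation factor, the sketch uses polyphase resampling (up=2, down=25) as one plausible reading of "decimated":

```python
import numpy as np
from scipy.signal import butter, filtfilt, resample_poly

def temporal_feature(epoch, fs=250):
    """Band-pass filtered, downsampled and z-scored epoch, concatenated.

    epoch: (n_channels, n_samples) at fs Hz; for a 1 s epoch of 128
    channels the output has 128 * 20 = 2560 elements.
    """
    # Zero-phase 2nd-order Butterworth band-pass, 0.05-10 Hz
    b, a = butter(2, [0.05, 10], btype="bandpass", fs=fs)
    x = filtfilt(b, a, epoch, axis=-1)
    # Resample 250 Hz -> 20 Hz (up=2, down=25)
    x = resample_poly(x, up=2, down=25, axis=-1)
    # z-score each channel by its temporal mean and standard deviation
    x = (x - x.mean(axis=-1, keepdims=True)) / x.std(axis=-1, keepdims=True)
    return x.ravel()   # concatenate x_1(t_1..t_20), ..., x_128(t_1..t_20)

rng = np.random.default_rng(5)
epoch = rng.standard_normal((128, 250))
print(temporal_feature(epoch).shape)   # (2560,)
```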

Dynamic features

Each epoch was partitioned into 125 temporal segments, each 500 ms long and shifted by 1/250 s (one sample). Within each temporal segment, we extracted the dynamic spectral and temporal features, following the same approaches described in the Static spectral features and Static temporal features sub-sections, respectively. Dynamic temporal features had a dimensionality of 1280, corresponding to 0.5 s × 20 Hz = 10 samples per channel. Dynamic spectral features had the same dimensionality as their static counterparts (256), but Welch's periodogram was computed using a 16-point Hamming window (zero-padded to 250 points) with 50% overlap.
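The segmentation could be sketched as follows; note that the exact number of windows depends on the boundary convention adopted for the last window:

```python
import numpy as np

def sliding_windows(epoch, fs=250, win_ms=500, step_samples=1):
    """Cut an epoch into 500 ms windows shifted by one sample.

    epoch: (n_channels, n_samples); returns (n_windows, n_channels, win).
    """
    win = int(win_ms * fs / 1000)          # 125 samples
    starts = range(0, epoch.shape[-1] - win + 1, step_samples)
    return np.stack([epoch[:, s:s + win] for s in starts])

epoch = np.random.default_rng(6).standard_normal((128, 250))
print(sliding_windows(epoch).shape)
```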

Feature reduction and classification

The extracted features (both static and dynamic) were grouped according to the stimulus type (sound or image) and the task (active or passive), in order to classify the group-related arousal level (high or low). A total of four binary classification problems (high arousal vs low arousal) were performed: active image (Ac_Im), active sound (Ac_So), passive image (Ps_Im) and passive sound (Ps_So).

Static features were reduced by means of the biserial correlation coefficient r², with the threshold set at 90% of the total feature score. In order to identify the discriminative power of each EEG channel, a series of scalp plots (one for each feature type and each group) of the coefficients was drawn. Since each channel is associated with more than one scalar feature (and hence more than one r² coefficient), a single per-channel coefficient was computed as a mean value: spectral and temporal features had 2 and 20 scalar features per EEG channel, respectively, so their scalp plots were obtained by averaging the 2 and 20 r² coefficients of each channel. To enhance the visualization, the coefficients were finally normalized to the total score and expressed as percentages.

Each classification problem was addressed by means of three classifiers: LDA with pseudo-inverse covariance matrix; soft-margin SVM with penalty parameter C = 1 and RBF kernel; and kNN with Euclidean distance and k = 1. Additionally, a random classifier, returning a uniformly pseudo-random class (Pr{HA} = Pr{LA} = 0.5), served as a benchmark29. The accuracy of the classifiers was measured using a 10-fold cross-validation scheme repeated 10 times. The feature selection was computed within each cross-validation step, to avoid overfitting and biased results43.
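Putting the pieces together, a minimal sketch of the evaluation loop (reusing the select_by_score helper from the feature-selection sketch above; scikit-learn's default SVD-based LDA stands in for the pseudo-inverse variant):

```python
import numpy as np
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.svm import SVC
from sklearn.neighbors import KNeighborsClassifier
from sklearn.dummy import DummyClassifier
from sklearn.model_selection import StratifiedKFold

def evaluate(X, y, n_repeats=10, n_folds=10, fraction=0.90):
    """Repeated k-fold CV with feature selection computed within each fold."""
    classifiers = {
        # The SVD solver tolerates singular covariance matrices, standing
        # in for the pseudo-inverse LDA used in the paper
        "LDA": LinearDiscriminantAnalysis(),
        "SVM": SVC(kernel="rbf", C=1.0),
        "kNN": KNeighborsClassifier(n_neighbors=1),
        "Random": DummyClassifier(strategy="uniform", random_state=0),
    }
    mean_acc = {name: [] for name in classifiers}
    for rep in range(n_repeats):
        cv = StratifiedKFold(n_folds, shuffle=True, random_state=rep)
        fold_acc = {name: [] for name in classifiers}
        for tr, te in cv.split(X, y):
            # Select features on the training fold only, to avoid bias
            sel = select_by_score(X[tr], y[tr], fraction)
            for name, clf in classifiers.items():
                clf.fit(X[tr][:, sel], y[tr])
                fold_acc[name].append(clf.score(X[te][:, sel], y[te]))
        for name in classifiers:
            mean_acc[name].append(np.mean(fold_acc[name]))
    return mean_acc  # 10 mean accuracies per classifier (one per repetition)

rng = np.random.default_rng(7)
X, y = rng.standard_normal((56, 256)), rng.choice([-1, 1], 56)
acc = evaluate(X, y)
print({name: np.mean(a) for name, a in acc.items()})
```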

For each group (Ac_Im, Ac_So, Ps_Im, Ps_So) and each feature type (static spectral, static temporal), the classification produced a 10 × 4 matrix containing the mean accuracies (one for each of the 10-fold cross-validation repetitions) of each classifier.

Dynamic features were reduced and classified similarly to the static ones. For each temporal segment, the associated features were reduced by means of the biserial correlation coefficient (threshold at 90%) and the classifiers (SVM, kNN, LDA and random) were evaluated using a 10-fold cross-validation scheme – repeated 10 times.

For each group, each feature type (dynamic spectral, dynamic temporal), each temporal segment and each classifier, the classification produced 10 sequences of mean accuracies $\{ACC_i\}_{i=1}^{125}$ – one for each repetition of the 10-fold cross-validation scheme.

Data analysis

The MATLAB syntax used for all analyses is available on GitHub along with instructions on how to use it (see Software availability)51. The software can also be used with the open-source program Octave.

Statistical analysis

The results of the static classifications were compared against the benchmark classifier by means of a two-sample t-test (right tail).

The results of the dynamic classifications (i.e. based on dynamic spectral or dynamic temporal features) were compared following a segment-by-segment approach. For each group, the accuracy sequences of the dynamic classifiers (SVM, kNN and LDA) were compared with the benchmark accuracy sequence. Each sample $ACC_i^k$, with k ∈ {SVM, kNN, LDA}, was tested against $ACC_i^{Random}$ by means of a two-sample t-test (right tail). The corresponding p-value sequences $\{p_i^k\}_{i=1}^{125}$ were Bonferroni-Holm corrected for multiple comparisons. Finally, the best accuracy point was detected as the left extreme of the temporal window corresponding to the highest significant accuracy.
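A sketch of this segment-by-segment procedure, assuming SciPy and statsmodels; the input accuracy sequences are synthetic:

```python
import numpy as np
from scipy import stats
from statsmodels.stats.multitest import multipletests

def segmentwise_test(acc_clf, acc_rand, alpha=0.05):
    """Right-tailed t-tests per temporal segment, Holm-Bonferroni corrected.

    acc_clf, acc_rand: (n_segments, n_repetitions) accuracy sequences for a
    dynamic classifier and the random benchmark. Returns the corrected
    p-values and the index of the highest significant accuracy (or None).
    """
    pvals = np.array([
        stats.ttest_ind(a, r, alternative="greater").pvalue
        for a, r in zip(acc_clf, acc_rand)
    ])
    reject, p_corr, _, _ = multipletests(pvals, alpha=alpha, method="holm")
    significant = np.flatnonzero(reject)
    if significant.size == 0:
        return p_corr, None
    best = significant[np.argmax(acc_clf.mean(axis=1)[significant])]
    return p_corr, best

rng = np.random.default_rng(8)
acc_clf = 0.55 + 0.05 * rng.random((125, 10))     # hypothetical sequences
acc_rand = 0.50 + 0.01 * rng.standard_normal((125, 10))
p_corr, best = segmentwise_test(acc_clf, acc_rand)
print(best, p_corr.min())
```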

Results

Static features

Figure 2 and Figure 3 show the scalp distributions of the r² coefficients for each binary static classification problem, grouped by feature type (spectral, temporal) and group (Ps_Im, Ps_So, Ac_Im, Ac_So).


Figure 2. Spectral features.

Scalp distribution of the r2 coefficients (normalized to the total score and expressed as percentage) grouped for tasks and stimulus type. (a) Active task: left Image, right Sound; (b) Passive task: left Image, right Sound.


Figure 3. Temporal features.

Scalp distribution of the r2 coefficients (normalized to the total score and expressed as percentage), grouped for tasks and stimulus type. (a) Active task: left Image, right Sound; (b) Passive task: left Image, right Sound.

The temporal features gave the most consistent topographical pattern, showing that the regions that best differentiated between high and low arousal stimuli (auditory and visual) were located over the centro-parietal electrodes, whereas a more diffuse scalp topography emerged for the spectral features.

In Figure 4 and Figure 5, box plots of the accuracies of the static temporal and spectral classifications, grouped by condition, are shown. Note that the SVM accuracies (the 2nd box plot from the left) always appear as lines because the accuracies were constant within each cross-validation step (see also Table 1, Table 2 and Table 3).


Figure 4. Box-plots of the accuracies of the static spectral classifications.

From left: Active Image (Ac_Im), Active Sound (Ac_So), Passive Image (Ps_Im) and Passive Sound (Ps_So).


Figure 5. Box-plots of the accuracies of the static temporal classifications.

From left: Active Image (Ac_Im), Active Sound (Ac_So), Passive Image (Ps_Im) and Passive Sound (Ps_So).

Table 1. Static features.

Ordered accuracies grouped for classifier, feature and group.

Classifier   Accuracy   Feature    Group
SVM          51.80%     Spectral   Ps_Im
LDA          51.40%     Spectral   Ps_Im
kNN          51%        Temporal   Ac_So
kNN          50.90%     Spectral   Ac_So
SVM          50.90%     Spectral   Ac_So
SVM          50.90%     Temporal   Ac_So
SVM          50.40%     Temporal   Ps_So

SVM, support vector machine; LDA, linear discriminant analysis; kNN, k-nearest neighbour.

Table 2. Mean (M) and standard deviations (SD) of the accuracies of the static spectral classifications.

Active Image (Ac_Im), Active Sound (Ac_So), Passive Image (Ps_Im) and Passive Sound (Ps_So).

Group   LDA                  SVM                  kNN                  Random
Ac_Im   M=0.496, SD=0.007    M=0.510, SD=0.000    M=0.500, SD=0.010    M=0.505, SD=0.011
Ac_So   M=0.492, SD=0.004    M=0.509, SD=0.000    M=0.509, SD=0.007    M=0.503, SD=0.009
Ps_Im   M=0.514, SD=0.010    M=0.518, SD=0.000    M=0.496, SD=0.010    M=0.495, SD=0.008
Ps_So   M=0.488, SD=0.005    M=0.504, SD=0.000    M=0.493, SD=0.007    M=0.503, SD=0.013

SVM, support vector machine; LDA, linear discriminant analysis; kNN, k-nearest neighbour.

Table 3. Mean (M) and standard deviations (SD) of the accuracies of the static temporal classifications.

Active Image (Ac_Im), Active Sound (Ac_So), Passive Image (Ps_Im) and Passive Sound (Ps_So).

Group   LDA                  SVM                  kNN                  Random
Ac_Im   M=0.492, SD=0.010    M=0.510, SD=0.000    M=0.500, SD=0.008    M=0.498, SD=0.007
Ac_So   M=0.501, SD=0.007    M=0.509, SD=0.000    M=0.510, SD=0.006    M=0.498, SD=0.012
Ps_Im   M=0.500, SD=0.012    M=0.518, SD=0.000    M=0.492, SD=0.005    M=0.499, SD=0.006
Ps_So   M=0.499, SD=0.008    M=0.504, SD=0.000    M=0.492, SD=0.006    M=0.498, SD=0.008

SVM, support vector machine; LDA, linear discriminant analysis; kNN, k-nearest neighbour.

Note that all the accuracies refer to the same static classification problem (high arousal vs low arousal), performed using different classifiers (SVM, LDA, kNN) and features (spectral, temporal), on different groups (Ps_Im, Ps_So, Ac_Im, Ac_So).

Using spectral features, only in two groups did some classifiers show an accuracy greater than the benchmark. In the Ac_So group, ACC_SVM = 50.9% (t(18)=2.371, p=0.015) and ACC_kNN = 50.9% (t(18)=1.828, p=0.042), while in the Ps_Im group, ACC_LDA = 51.4% (t(18)=4.667, p<0.001) and ACC_SVM = 51.8% (t(18)=9.513, p<0.001).

Using temporal features, some classifiers exceeded the benchmark in two groups. In the Ac_So group, ACC_SVM = 50.9% (t(18)=2.907, p=0.005) and ACC_kNN = 51% (t(18)=2.793, p=0.006), and in the Ps_So group, ACC_SVM = 50.4% (t(18)=9.493, p<0.001).

Dynamic features

Figures 6 to 12 show the results of the significant dynamic classifications. In the upper section of each plot, the mean (bold line) and standard deviation (shaded area) of the accuracy sequence are shown. In the lower section (black dashed line), the Bonferroni-Holm corrected p-value sequence is shown, discretized (as a stair graph) into significant (p<0.05) and non-significant (p≥0.05).


Figure 6. Spectral dynamic features.

Accuracy (mean value, coloured line; standard deviation, shaded line) and p-values (black dotted line) in Ac_Im group for LDA (a) and SVM (b) classifiers.


Figure 7. Spectral dynamic features.

Accuracy (mean value, coloured line; standard deviation, shaded line) and p-values (black dotted line) in Ac_So group for LDA (a) and SVM (b) classifiers.


Figure 8. Spectral dynamic features.

Accuracy (mean value, coloured line; standard deviation, shaded line) and p-values (black dotted line) in Ps_Im group for LDA (a) and SVM (b) classifiers.


Figure 9. Spectral dynamic features.

Accuracy (mean value, coloured line; standard deviation, shaded line) and p-values (black dotted line) in Ac_So group for SVM (a) and kNN (b) classifiers.


Figure 10. Temporal dynamic features.

Accuracy (mean value, coloured line; standard deviation, shaded line) and p-values (black dotted line) in Ac_So group for LDA classifier.


Figure 11. Temporal dynamic features.

Accuracy (mean value, coloured line; standard deviation, shaded line) and p-values (black dotted line) in Ps_Im group for LDA (a), SVM (b) and kNN (c) classifiers.


Figure 12. Temporal dynamic features.

Accuracy (mean value, coloured line; standard deviation, shaded line) and p-values (black dotted line) in Ps_So group for LDA (a) and kNN (b) classifiers.

Note that all the accuracy plots refer to the same dynamic classification problem (high arousal vs low arousal), performed using different classifiers (SVM, LDA, kNN) and features on different groups. Spectral: Ac_Im (Figure 6), Ac_So (Figure 7), Ps_Im (Figure 8) and Ps_So (Figure 9); temporal: Ac_So (Figure 10), Ps_Im (Figure 11) and Ps_So (Figure 12).

Using spectral features, in all the groups some classifiers showed an accuracy greater than the benchmark. In the Ac_Im group, ACC_LDA = 51.97% at t = 0.080 s (t(18)=6.291, p<0.001) and ACC_SVM = 51.07% at t = 0.416 s (t(18)=6.531, p<0.001). In the Ac_So group, ACC_LDA = 53.04% at t = 0.332 s (t(18)=8.583, p<0.001) and ACC_SVM = 51.16% at t = 0.146 s (t(18)=8.612, p<0.001). In the Ps_Im group, ACC_LDA = 53.12% at t = 0.156 s (t(18)=6.372, p<0.001) and ACC_SVM = 51.83% at t = 0.140 s (t(18)=6.668, p<0.001). In the Ps_So group, ACC_SVM = 50.62% at t = 0.024 s (t(18)=5.236, p=0.003) and ACC_kNN = 51.41% at t = 0.476 s (t(18)=4.307, p=0.026).

Using temporal features, only in three groups did some classifiers show an accuracy greater than the benchmark. In the Ac_So group, ACC_SVM = 63.80% at t = 0.100 s (t(18)=6.113, p=0.001). In the Ps_Im group, ACC_LDA = 63.68% at t = 0.024 s (t(18)=12.108, p<0.001) and ACC_SVM = 51.43% at t = 0.084 s (t(18)=4.881, p=0.008). In the Ps_So group, ACC_LDA = 64.30% at t = 0.0276 s (t(18)=11.092, p<0.001) and ACC_kNN = 63.70% at t = 0.480 s (t(18)=16.621, p<0.001).

Table 4 reports the accuracies for the dynamic features, in descending order, grouped by classifier, feature, group and time.

Table 4. Dynamic features.

Ordered accuracies grouped for classifier, feature and group.

Classifier   Accuracy   Time [s]   Group   Feature
SVM          63.80%     0.1        Ac_So   Temporal
kNN          63.70%     0.048      Ps_So   Temporal
LDA          63.68%     0.024      Ps_Im   Temporal
LDA          63.30%     0.0276     Ps_So   Temporal
LDA          53.12%     0.156      Ps_Im   Spectral
LDA          53.04%     0.3332     Ac_So   Spectral
LDA          51.97%     0.08       Ac_Im   Spectral
SVM          51.83%     0.14       Ps_Im   Spectral
SVM          51.43%     0.084      Ps_Im   Temporal
kNN          51.41%     0.476      Ps_So   Spectral
SVM          51.16%     0.146      Ac_So   Spectral
SVM          51.07%     0.416      Ac_Im   Spectral
SVM          50.62%     0.024      Ps_So   Spectral

SVM, support vector machine; LDA, linear discriminant analysis; kNN, k-nearest neighbour.

Discussion

The aim of the study was to provide new methodological insights regarding machine learning approaches for the classification of anticipatory emotion-related EEG signals, by testing the performance of different classifiers on different features.

From the ISIs (i.e. the 1000 ms window preceding each stimulus onset), we extracted two kinds of "static" features, spectral and temporal, which are the most commonly used features in the field of emotion recognition19,20. As spectral features, we used the beta-over-alpha and theta-over-beta ratios, whereas for the temporal features we concatenated the decimated EEG values.

Additionally, we extracted the temporal sequences of both the static spectral and temporal features, using a 500 ms window moving along the ISI to build dynamic spectral and temporal features, respectively. This step is crucial for our work since, considering the temporal resolution of the EEG, an efficient classification should take the temporal dimension into account, to provide information about when the differences between two conditions are maximally expressed and therefore best classified.

We trained and tested three different classifiers (LDA, SVM and kNN, the most commonly used in the field of emotion recognition19,20) using both static and dynamic features, comparing their accuracies against a random classifier that served as a benchmark.

Our goal was to identify the best classification scheme (static vs dynamic) and the best feature type (spectral vs temporal) for classifying the arousal level (high vs low) of 56 auditory/visual stimuli. The stimuli, extracted from two standardized datasets (NIMSTIM52 and IADS44, for visual and auditory stimuli, respectively), were presented in a randomized order, triggered by a TrueRNG™ hardware random number generator.

Considering the number of groups (four), the number of classifiers (three) and the number of feature types (two), each classification (static or dynamic) produced a total of 24 accuracies, whose significance was statistically tested (using two-sample t-tests against the benchmark's accuracies).

Within the nine significant accuracies obtained using static features, the classifier that obtained the highest number of accuracies was the SVM (six significant accuracies), followed by kNN (two significant accuracies) and LDA (one significant accuracy). The most frequent feature was the temporal (five significant accuracies). Finally, the best (static) feature-classifier combination was the SVM with spectral features (51.8%), followed by LDA with spectral features (51.4%) and kNN with temporal features (51%).

Within the 13 significant accuracies obtained using dynamic features, the classifier that obtained the highest number of accuracies was the SVM (six significant accuracies), followed by LDA (four significant accuracies) and kNN (three significant accuracies). The most frequent feature was the spectral (eight significant accuracies). Finally, the best (dynamic) feature-classifier combination was the SVM with temporal features (63.8%), followed by kNN with temporal features (63.70%) and LDA with temporal features (63.68%). Spectral features produced only the 5th highest accuracy (53.12% with LDA). The three best accuracies all fell within the first 100 ms of the ISI, although a non-significant Spearman's correlation between accuracy and time was observed (r=-0.308, p=0.306).

Our results show that, globally, the SVM presents the best accuracy independently of feature type (temporal or spectral); more importantly, the combination of the SVM with dynamic temporal features produced the best classification performance. This finding is particularly relevant considering the application of EEG in cognitive science: owing to its high temporal resolution, EEG is often used to investigate the timing of neural processes in relation to behavioural performance.

Our results therefore suggest that, in order to best classify emotions based on electrophysiological brain activity, the temporal dynamics of the EEG signal should be taken into account using dynamic features and, consequently, a dynamic classifier. In fact, by also including the time evolution of the features in the machine learning model, it is possible to infer when two conditions maximally diverge, allowing interpretation of the timing of the cognitive processes and of the behaviour of the underlying neural substrate.

Finally, the main contribution of our results to the scientific community is a methodological advancement that is valid both for the machine learning investigation of emotion from EEG signals and for the investigation of preparatory brain activity.

Data availability

Underlying data

Figshare: EEG anticipation of random high and low arousal faces and sounds. https://doi.org/10.6084/m9.figshare.6874871.v8 (ref. 27)

This project contains the following underlying data:

  • EEG metafile (DOCX)

  • EEG data related to the Passive, Active and Predictive conditions (CSV)

  • Video clips of the EEG activity before stimulus presentation (MPG)

Extended data

Figshare: EEG anticipation of random high and low arousal faces and sounds. https://doi.org/10.6084/m9.figshare.6874871.v8 (ref. 27)

This project contains the following extended data:

  • Detailed description of LDA, SVM and kNN machine learning algorithms (DOCX)

Data are available under the terms of the Creative Commons Attribution 4.0 International license (CC-BY 4.0).

Software availability

Source code available from: https://github.com/mbilucaglia/ML_BAA

Archived source code at time of publication: https://doi.org/10.5281/zenodo.3666045 (ref. 51)

License: GPL-3.0
