Keywords
Classification; Land Cover; Deep Learning; CNN
The task of recognizing the physical action currently being performed by one or more users from a series of observations is known as human activity recognition. These observations are made while the user performs actions in a defined environment. The rise of omnipresent, wearable, and pervasive computing has spawned a variety of new uses, and human activity recognition has evolved into a significant technology that is altering people's everyday routines. The notion of human activity recognition has recently gained popularity, resulting in a variety of applications such as assistive technology, health and fitness tracking, elder care, and automated surveillance. Furthermore, activity recognition research has progressed at such a quick pace that it is already being applied in related areas such as monitoring elderly people or prison inmates.
Mobile phone sensors play an important part in making smartphones more useful and aware of their surroundings. As a result, most smartphones have a variety of integrated sensors, and it is feasible to obtain a large quantity of information about a person's everyday life and activities from them. Among these sensors are the accelerometer and the gyroscope, which, when used correctly, can capture the motion of an object. Machine learning algorithms may therefore be used to predict human actions based on continuously monitored sensor data. Human activity recognition is capable of identifying a variety of actions, both basic and complex.
Some of the related literature on human activity recognition is reviewed in this section. The details are summarized in Table 1.
Authors | Description | Future perspective
---|---|---
Schuldt et al.,1 | A support vector machine (SVM) is used to recognize activities. | Neural network techniques must be utilized to increase the recognition rate.
Laptev et al.,2 | For activity recognition, an SVM with a multichannel Gaussian kernel is utilized. | The recognition rate needs to be enhanced here as well.
Yamato et al.,3 | With the feature vector, a hidden Markov model (HMM) is employed for activity recognition. | An increase in the rate of identification is required.
Oliver et al.,4 | A coupled HMM (CHMM) is used for recognizing activities. | The recognition rate needs to be enhanced here as well.
Natarajan et al.,5 | Human action is detected in this article utilizing conditional random fields (CRF). | The activity recognition needs to be improved.
Ning et al.,6 | Human activity is detected using a typical approach in this article. | Activity identification needs to be enhanced here as well.
Vali M et al.,7 | The authors combined HMM and CRF in their study. | It is necessary to boost the rate of recognition.
R. Madharshahian et al.,8 | The authors utilized multinomial logistic regression in this study. | Activity identification needs to be enhanced here as well.
R. Kiros et al.,9 | The authors employed consecutive RNNs in their study. | To improve the rate of identification, more layers must be added.
A. Grushin et al.,10 | The authors utilized consecutive RNNs in this study as well. | More layers are required to improve the identification rate in this case as well.
Veeriah et al.,11 | The authors employed a differential RNN in this study. | More layers are required to boost the rate of activity detection in this case as well.
Du et al.,12 | The authors employed hierarchical RNNs in their study. | To improve the rate of activity recognition, a DNN, i.e. an LSTM, is necessary.
The methodology for human activity recognition comprises four phases: input, data cleaning, data splitting, and classification and validation. Human activity recognition structure is shown in Figure 1.
Input: The input of our study was a dataset downloaded from the UCI dataset repository. It contains triaxial acceleration readings from an accelerometer and triaxial angular velocity readings from a gyroscope, with a total of 10,299 data entries and 561 features.
Data cleaning: Data cleaning means removing unnecessary columns and removing or replacing inaccurate records in the dataset. During data cleaning, an encoding technique is normally applied to convert categorical data into numerical data; in our dataset, all feature values were already numerical, hence no encoding technique was needed. Careful data cleaning also prevents the same errors from reappearing later in the pipeline.
The first step in data cleaning is removing unwanted observations from the dataset, namely irrelevant and duplicate data. Duplicate data most frequently arise when we combine data from multiple sources; irrelevant data are data that do not relate to the problem.
The second step is to handle the missing data. This can be done in two ways:
Removing the record or filling in the missing value manually is not fully optimal, because information is lost; for example, the range of a feature might change when a value is replaced. Hence, for missing numerical data we took the following two steps:
- Flag the observation with an indicator value of missingness
- Fill the observation with 0 just to ensure that there are no missing values
By using the technique of flagging and filling, the algorithm can estimate the missing value optimally.
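A minimal sketch of this flag-and-fill step is shown below, assuming the data are loaded into a pandas DataFrame named df; the helper name and column handling are illustrative, not taken from the original code.

```python
# Flag-and-fill strategy for missing numerical values (sketch).
# Assumes the dataset is already loaded into a pandas DataFrame `df`.
import pandas as pd

def flag_and_fill(df: pd.DataFrame) -> pd.DataFrame:
    df = df.copy()
    for col in df.select_dtypes(include="number").columns:
        if df[col].isna().any():
            # 1) Flag the observation with an indicator value of missingness
            df[col + "_missing"] = df[col].isna().astype(int)
            # 2) Fill the observation with 0 so that no missing values remain
            df[col] = df[col].fillna(0)
    return df
```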
Data splitting: In this phase the data are segmented into two components:
Training data: The data used to train the model. We took 60% of the data as training data.
Testing data: The data used to test the model. We took 20% of the data as testing data.
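The sketch below illustrates this split with scikit-learn's train_test_split. The 60% training and 20% testing proportions follow the text; holding out the remaining 20% as a validation set, the stratification on the activity label and the random seed are assumptions.

```python
# 60% training / 20% testing split (sketch); X holds the 561 features,
# y the 'Activity' target column.
from sklearn.model_selection import train_test_split

X = df.drop(columns="Activity")
y = df["Activity"]

# First take 60% of the data for training.
X_train, X_rest, y_train, y_rest = train_test_split(
    X, y, train_size=0.6, stratify=y, random_state=42)

# Then take half of the remaining 40% (i.e. 20% of the data) for testing.
X_test, X_val, y_test, y_val = train_test_split(
    X_rest, y_rest, train_size=0.5, stratify=y_rest, random_state=42)
```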
Classification and validation: This stage comprised two phases:
Classification: Classification is the process of predicting the target class of a new observation based on the training data. The dataset we selected contains the target class, hence we implemented the model using classification algorithms. We used the following algorithms:
Decision tree: This classifier uses a tree representation to solve the problem: each internal node represents a feature, each branch represents an outcome of that feature, and each leaf node represents a target class label.
Attribute selection measures: We had to select the best attribute for the node at each level. This can be done with two measures:
Information gain: The entropy of a set $S$ with class proportions $p_i$ is

$$\mathrm{Entropy}(S) = -\sum_{i=1}^{c} p_i \log_2 p_i$$

Hence, from entropy, information gain is mathematically defined as Equation 1:

$$\mathrm{Gain}(S, A) = \mathrm{Entropy}(S) - \sum_{v \in \mathrm{Values}(A)} \frac{|S_v|}{|S|}\,\mathrm{Entropy}(S_v) \qquad (1)$$
Gini index: The Gini index measures how often a randomly chosen element would be incorrectly classified if it were labelled randomly according to the class distribution of the node.
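For completeness, the Gini index of a set $S$ whose classes occur with proportions $p_1, \ldots, p_c$ can be written with the standard textbook definition (supplied here for reference; it is not reproduced from the original article):

$$\mathrm{Gini}(S) = 1 - \sum_{i=1}^{c} p_i^{2}$$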
As our dataset was unbalanced and contained multiple classes, the decision tree made use of information gain to select the best attribute.
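A minimal scikit-learn sketch of this step is given below; setting criterion='entropy' makes the tree select attributes by information gain. The train/test variables are assumed to come from the split sketched earlier, and the random seed is an assumption.

```python
# Decision tree using information gain (entropy) for attribute selection (sketch).
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import accuracy_score

dt = DecisionTreeClassifier(criterion="entropy", random_state=42)
dt.fit(X_train, y_train)
print("Decision tree accuracy:", accuracy_score(y_test, dt.predict(X_test)))
```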
Logistic regression: Logistic regression is a statistical method used for classification which falls under supervised learning. It makes predictions based on probability: it analyses the dataset, takes the independent input values and gives a single output.
Logistic function: The logistic function, also known as the sigmoid function, takes a real-valued number as input and maps it to a value between 0 and 1, without ever reaching those limits exactly; in this way predictions are mapped to probabilities. The general logistic function is

$$f(x) = \frac{L}{1 + e^{-k(x - x_0)}}$$

By giving an input value on the x axis to the sigmoid function, the model predicts the target value on the y axis. Here $k = 1$, $x_0 = 0$ and $L = 1$, which yields the standard sigmoid $\sigma(x) = \dfrac{1}{1 + e^{-x}}$.
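A brief scikit-learn sketch of multiclass logistic regression is shown below; predict_proba exposes the class probabilities produced by the logistic mapping discussed above. The solver settings are assumptions, not taken from the paper.

```python
# Multiclass logistic regression on the feature matrix (sketch).
from sklearn.linear_model import LogisticRegression

lr = LogisticRegression(max_iter=1000)   # max_iter raised so the solver converges
lr.fit(X_train, y_train)

probs = lr.predict_proba(X_test.iloc[:1])   # probability of each activity class
pred = lr.predict(X_test.iloc[:1])          # class with the highest probability
```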
Naïve Bayes classifier: Naïve Bayes is a probabilistic classification algorithm based on Bayes' theorem. It assumes that the features are independent of one another given the class.
The conditional probability that an object with feature vector $(x_1, x_2, \ldots, x_n)$ belongs to a particular class $C_k$ is calculated with Equation 7:

$$P(C_k \mid x_1, \ldots, x_n) = \frac{P(C_k)\,\prod_{i=1}^{n} P(x_i \mid C_k)}{P(x_1, \ldots, x_n)} \qquad (7)$$
Gaussian naïve Bayes: As the values of each feature in our dataset are continuous, we use the Gaussian distribution, also called the normal distribution:

$$P(x_i \mid C_k) = \frac{1}{\sqrt{2\pi\sigma_k^{2}}}\exp\!\left(-\frac{(x_i-\mu_k)^{2}}{2\sigma_k^{2}}\right)$$

where $\mu_k$ is the mean of all values of a feature for class $C_k$ and $\sigma_k$ is the corresponding standard deviation.
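A corresponding scikit-learn sketch is given below; GaussianNB estimates the per-class mean and standard deviation of each feature and applies the normal likelihood above.

```python
# Gaussian naive Bayes classifier (sketch).
from sklearn.naive_bayes import GaussianNB
from sklearn.metrics import accuracy_score

gnb = GaussianNB()
gnb.fit(X_train, y_train)
print("Naive Bayes accuracy:", accuracy_score(y_test, gnb.predict(X_test)))
```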
K-nearest neighbours is one of the essential classification algorithms and falls under the category of supervised learning. It assumes that similar data points lie close to each other, so a new point is classified into a group according to its nearest neighbours under a distance measure. It is used in applications such as pattern recognition, intrusion detection and data mining.
Let N be the number of training samples and U the unknown point to be classified. The training data are stored in an array, with each element represented as a tuple (x, y).

For i = 0 to N-1:

- Calculate the Euclidean distance from the i-th training point to U.

Let S be the set of the m training points with the smallest distances (all of these points already have known class labels).

Return the mode of the m class labels in S.
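In practice this procedure is available off the shelf; the sketch below uses scikit-learn's KNeighborsClassifier with the Euclidean distance. The value k = 5 is an assumption, since the paper does not state the number of neighbours used.

```python
# K-nearest neighbours with Euclidean distance (sketch); k=5 is assumed.
from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import accuracy_score

knn = KNeighborsClassifier(n_neighbors=5, metric="euclidean")
knn.fit(X_train, y_train)
print("KNN accuracy:", accuracy_score(y_test, knn.predict(X_test)))
```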
Random forest: Random forests are ensembles, i.e. combinations of two or more models used to obtain better results. A random forest builds a collection of decision trees on samples of the data. Each decision tree outputs a target class, and the final target class for an input record is obtained by taking a majority vote over the outputs of the individual trees.
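A short scikit-learn sketch follows; each of the n_estimators trees votes for a class and the majority vote is returned. The number of trees is an assumption.

```python
# Random forest: majority vote over an ensemble of decision trees (sketch).
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score

rf = RandomForestClassifier(n_estimators=100, random_state=42)
rf.fit(X_train, y_train)
print("Random forest accuracy:", accuracy_score(y_test, rf.predict(X_test)))
```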
Artificial neural network (ANN): ANNs are made up of nodes that are connected to one another, and values are stored in these nodes. The input values of the ANN are taken from the training example's features. The weighted values are then delivered to the next set of nodes; this weighted sum is passed through a non-linear activation function, and the resulting value is transmitted to the next layer, where the process is repeated until the final output is reached. Figure 2 depicts this process.

By adjusting the weights in the ANN, the predicted output value gets closer to the observed value. The back-propagation algorithm is the most commonly used method for altering the weights. Because of the iterative nature of ANN learning, the complete training set is applied repeatedly, and each pass updates the weights differently.
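A minimal Keras sketch of such a network is given below. The layer sizes (561 input nodes, hidden layers of 1024, 512 and 64 units, six output nodes) follow the architecture described in the conclusion; the use of Keras/TensorFlow, the ReLU activations, the optimizer and the training settings are assumptions, since the original implementation is not detailed here.

```python
# Fully connected network: 561 -> 1024 -> 512 -> 64 -> 6 (sketch).
import tensorflow as tf
from sklearn.preprocessing import LabelEncoder

# The activity labels are strings, so encode them as integers first.
encoder = LabelEncoder()
y_train_enc = encoder.fit_transform(y_train)

model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(561,)),             # one node per feature
    tf.keras.layers.Dense(1024, activation="relu"),
    tf.keras.layers.Dense(512, activation="relu"),
    tf.keras.layers.Dense(64, activation="relu"),
    tf.keras.layers.Dense(6, activation="softmax"),  # one node per activity class
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])

# Illustrative training call; epochs and batch size are not taken from the paper.
# model.fit(X_train, y_train_enc, epochs=30, batch_size=64, validation_split=0.1)
```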
Validation: Evaluation metrics were used to verify how well the model fit our data. There are many metrics to evaluate a model; some of them are accuracy, precision, recall and F1 score.

Precision: Precision, also known as positive predictive value, is a measure of accuracy. It is mathematically defined as the number of true positives divided by the sum of true positives and false positives.

Recall: Recall, also referred to as sensitivity, is another measure of accuracy. It is mathematically defined as the number of true positives divided by the sum of true positives and false negatives.

F1 score: The F1 score is calculated with Equation 11:

$$F_1 = 2 \cdot \frac{\mathrm{Precision} \cdot \mathrm{Recall}}{\mathrm{Precision} + \mathrm{Recall}} \qquad (11)$$

Accuracy: Accuracy describes how well the model predicts the classes. It is mathematically defined as the number of correctly predicted instances divided by the total number of instances.
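These metrics can be computed with scikit-learn as sketched below; y_pred stands for the predictions of any of the fitted models (the random forest from the earlier sketch is used here as an example), and the weighted averaging over the six classes is an assumption.

```python
# Validation metrics for a multiclass model (sketch).
from sklearn.metrics import (accuracy_score, precision_score, recall_score,
                             f1_score, confusion_matrix)

y_pred = rf.predict(X_test)   # e.g. the random forest fitted earlier
print("Accuracy :", accuracy_score(y_test, y_pred))
print("Precision:", precision_score(y_test, y_pred, average="weighted"))
print("Recall   :", recall_score(y_test, y_pred, average="weighted"))
print("F1 score :", f1_score(y_test, y_pred, average="weighted"))
print(confusion_matrix(y_test, y_pred))
```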
The Results section comprises two parts: exploratory data analysis and the evaluation of the classification models.
Exploratory data analysis means gaining insights from the data before applying any model to it. It can be done using mathematical functions and visualizations. The total number of rows and columns in the dataset is identified using the 'shape' attribute, as shown in Figure 3.
From this we know that the dataset contains 10,299 instances and 562 attributes.
Out of the 562 attributes, 561 are independent attributes and one is the dependent variable.
We used graphs to explore the data. We used a bar graph, box plot and heatmap.
A bar graph shows the comparison between the different target classes. The bar graph showing how many instances belong to each class is shown in Figure 4, from which the following counts were observed: LAYING--1944, STANDING--1906, SITTING--1777, WALKING--1722, WALKING_UPSTAIRS--1544, WALKING_DOWNSTAIRS--1406.
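The sketch below reproduces these two checks with pandas: the shape attribute gives the number of rows and columns, and value_counts gives the per-class counts plotted in the bar graph. The file name used for loading is an assumption.

```python
# Basic exploratory data analysis (sketch); the CSV file name is illustrative.
import pandas as pd
import matplotlib.pyplot as plt

df = pd.read_csv("human_activity.csv")
print(df.shape)                          # (number of rows, number of columns)

counts = df["Activity"].value_counts()   # instances per target class
print(counts)
counts.plot(kind="bar")                  # bar graph comparing the classes
plt.tight_layout()
plt.show()
```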
Box plot: Box plots show the distribution of data by using five measures, namely minimum, first quartile (Q1), median, third quartile (Q3), and maximum. The box plot is drawn for the target class ‘Activity’ and the feature ‘tBodyAcc-max()-X’ in Figure 5.
From Figure 5, we made the following observations:
- If tBodyAcc-max()-X is less than -0.75 then activities are either Standing, Sitting or Laying.
- If tBodyAcc-max()-X is greater than -0.50 then activities are classified as Walking or Walking_Downstairs or Walking_Upstairs.
- If tBodyAcc-max()-X is greater than 0.00 then activity is Walking_Downstairs.
Confusion matrix for naïve Bayes classifier is shown in Figure 6. From Figure 6, the following observations were made:
- The actual number of Laying was 389 but the model correctly predicted 361 as Laying and incorrectly predicted 22 as Sitting and nine as Walking_Upstairs.
- The actual number of Sitting was 372 but the model correctly predicted 336 as Sitting and incorrectly predicted two as Laying, 31 as Standing and three as Walking_Upstairs.
- The actual number of Standing was 375 but the model correctly predicted 157 as Standing, and incorrectly predicted 214 as Sitting and four as Walking_Upstairs.
- The actual number of Walking was 345, but the model correctly predicted 256 as Walking and incorrectly predicted 31 as Walking_Downstairs and 58 as Walking_Upstairs.
- The actual number of Walking_Downstairs was 282 but the model correctly predicted 199 of them as Walking_Downstairs and incorrectly predicted 25 as Walking and 58 as Walking_Upstairs.
- The actual number of Walking_Upstairs was 297 but the model correctly predicted 275 of them as Walking_Upstairs and incorrectly predicted 17 as Walking_Downstairs and five as Walking.
Precision, recall and F1 score values of naïve Bayes classifier are depicted in the bar plot shown in Figure 7. From Figure 7, we can conclude that the class Laying was most correctly predicted among all other classes. Among all the classes, Standing had a low number of correctly predicted instances.
Confusion matrix for the decision tree classifier is shown in Figure 8. From Figure 8, the following observations were made:
- The actual number of Laying was 389 and the model correctly predicted 389 as Laying.
- The actual number of Sitting was 372, but the model correctly predicted 346 of them as Sitting and incorrectly predicted 26 as Standing.
- The actual number of Standing was 375 but the model correctly predicted 347 of them as Standing and incorrectly predicted 28 as Sitting.
- The actual number of Walking was 345, but the model correctly predicted 324 as Walking and incorrectly predicted six of them as Walking_Downstairs and 15 as Walking_Upstairs.
- The actual number of Walking_Downstairs was 282, but the model correctly predicted 257 of them as Walking_Downstairs and incorrectly predicted eight as Walking and 15 as Walking_Upstairs.
- The actual number of Walking_Upstairs was 297, but the model correctly predicted 266 as Walking_Upstairs and incorrectly predicted 13 as Walking and 18 as Walking_Downstairs.
Precision, recall and F1 score values of decision tree classifier are depicted in the bar plot shown in Figure 9. From Figure 9 we can see that the class Laying was predicted correctly without any incorrectly classified classes.
Confusion matrix for K-nearest neighbours classifier is shown in Figure 10. From Figure 10, the following observations were made:
- The actual number of Laying was 389 and the model correctly predicted 389 as Laying.
- The actual number of Sitting was 372, but the model correctly predicted 332 of them as Sitting and incorrectly predicted 239 as Standing and one as Sitting.
- The actual number of Standing was 375, but the model correctly predicted 354 as Standing and incorrectly predicted 21 as Sitting.
- The actual number of Walking was 345, but the model correctly predicted 344 of them as Walking and incorrectly predicted one as Walking_Upstairs.
- The actual number of Walking_Downstairs was 282, but the model correctly predicted 280 of them as Walking_Downstairs and incorrectly predicted two as Walking.
- The actual number of Walking_Upstairs was 297, and the model correctly predicted all 297 as Walking_Upstairs.
Precision, recall and F1 score values of the K-nearest neighbour classifier are depicted in the bar plot shown in Figure 11. From Figure 11, we can see that the activity Laying was predicted correctly without any incorrectly classified classes; hence, all metrics had a 100% score.
Confusion matrix for the random forest classifier is shown in Figure 12. From Figure 12, the following observations were made:
▪ The actual number of Laying was 389 and the model correctly predicted 389 as Laying.
▪ The actual number of Sitting was 372, but the model correctly predicted 355 as Sitting and incorrectly predicted 17 as Standing.
▪ The actual number of Standing was 375, but the model correctly predicted 355 of them as Standing and incorrectly predicted 20 as Sitting.
▪ The actual number of Walking was 345, but the model correctly predicted 336 as Walking and incorrectly predicted two as Walking_Downstairs and seven as Walking_Upstairs.
▪ The actual number of Walking_Downstairs was 282, but the model correctly predicted 265 of them as Walking_Downstairs and incorrectly predicted 12 as Walking and five as Walking_Upstairs.
▪ The actual number of Walking_Upstairs was 297, but the model correctly predicted 286 as Walking_Upstairs and incorrectly predicted seven as Walking and four as Walking_Downstairs.
Precision, recall and F1 score values of random forest are depicted as a bar plot shown in Figure 13. From Figure 13 we can see that all instances having Laying as the target were correctly predicted.
Confusion matrix for logistic regression is shown in Figure 14. From Figure 14, the following observations were made:
- The actual number of Laying was 389 and the model correctly predicted 389 as Laying.
- The actual number of Sitting was 372, but the model correctly predicted 351 of them as Sitting and incorrectly predicted 21 as Standing.
- The actual number of Standing was 375 but the model correctly predicted 360 of them as Standing and incorrectly predicted 15 as Sitting.
- The actual number of Walking was 345, but the model correctly predicted 344 of them as Walking and incorrectly predicted one as Walking_Upstairs.
- The actual number of Walking_Downstairs was 282, but the model correctly predicted 281 as Walking_Downstairs and incorrectly predicted one as Walking_Upstairs.
- The actual number of Walking_Upstairs was 297, but the model correctly predicted 295 of them as Walking_Upstairs and incorrectly predicted one as Walking and one as Walking_Downstairs.
Precision, recall and F1 score values of logistic regression are depicted as a bar plot shown in Figure 15. From Figure 15, we can see that Laying, Walking, Walking_Downstairs instances were correctly classified.
Neural network: Epoch versus training loss for the human activity dataset is represented in Figure 16. From Figure 16, the training loss decreased as the number of epochs increased, and hence the accuracy increased. Of all the algorithms we applied, the neural network predicted most accurately, with an accuracy of 98.93%.
Comparison of different algorithms: The different algorithms used to train the model were taken as the variable x and their corresponding accuracies as the variable y. The bar graph drawn for these x and y variables is shown in Figure 17, and the accuracy results are listed in Table 2. Comparing all the algorithms in Figure 17, the neural network predicted the data with the highest accuracy, i.e. 98.93%.
In this work, human activity recognition was used to detect the activity of a person. The dataset was collected from triaxial accelerometer and gyroscope sensors and consisted of 561 features, 10,299 records and six classes, namely Sitting, Standing, Laying, Walking, Walking_Downstairs and Walking_Upstairs. In this paper we used six algorithms, namely naïve Bayes, decision tree, random forest, K-nearest neighbours, logistic regression and neural network. For the neural network, the input layer consisted of 561 nodes, followed by three hidden layers (1024, 512, 64); the output layer consisted of six nodes, each representing a target class. From the experimental results, the naïve Bayes classifier achieved 76.89% accuracy, the decision tree classifier 93.39%, the random forest classifier 96.89%, K-nearest neighbours 96.40% and the logistic regression classifier 98.05%. Among all these models, the neural network predicted the target class with the highest accuracy, 98.93%.
Kaggle: Human Activity Recognition with Smartphones, https://www.kaggle.com/datasets/uciml/human-activity-recognition-with-smartphones
Data are available under the terms of the Creative Commons Zero “No rights reserved” data waiver (CC0 1.0 Public domain dedication).
Analysis code
Source code available from: https://github.com/someshchinta/Human_Actiity_recognition
Archived source code at time of publication: https://doi.org/10.5281/zenodo.7108706
License: Apache-2.0