Machine Learning for Heavy Metal Pollution Status Classification Using Spectroscopic and Electrochemical Data: A Systematic Review

Widuri Rosman; Sabrina Ocha Felinda; Untung Hartono; Pricilia Amfotis; Andreana Muhammad; Antonius Antonius; Heriyansyah Heriyansyah; Dina Marlina Oktavia; Mirnawati Dewinda Rumalean Ansanai; Yuni Marhayuni

doi:10.12688/f1000research.182642.1

Home Browse Machine Learning for Heavy Metal Pollution Status Classification Using...

ALL Metrics

-

Views

-

Downloads

Get PDF

Get XML

Export

▬

✚

Systematic Review

Machine Learning for Heavy Metal Pollution Status Classification Using Spectroscopic and Electrochemical Data: A Systematic Review

[version 1; peer review: awaiting peer review]

Widuri Rosman ¹, Sabrina Ocha Felinda¹, Untung Hartono¹, [...] Pricilia Amfotis¹, Andreana Muhammad¹, Antonius Antonius¹, Heriyansyah Heriyansyah¹, Dina Marlina Oktavia¹, Mirnawati Dewinda Rumalean Ansanai¹, Yuni Marhayuni¹

Widuri Rosman ¹, Sabrina Ocha Felinda¹, [...] Untung Hartono¹, Pricilia Amfotis¹, Andreana Muhammad¹, Antonius Antonius¹, Heriyansyah Heriyansyah¹, Dina Marlina Oktavia¹, Mirnawati Dewinda Rumalean Ansanai¹, Yuni Marhayuni¹

PUBLISHED 06 Jun 2026

Author details Author details

¹ Chemistry, Institut Teknologi Bandung, Bandung, West Java, 40132, Indonesia

Widuri Rosman
Roles: Conceptualization, Funding Acquisition, Project Administration, Writing – Original Draft Preparation

Sabrina Ocha Felinda
Roles: Data Curation, Funding Acquisition, Writing – Original Draft Preparation

Untung Hartono
Roles: Funding Acquisition, Methodology, Supervision

Pricilia Amfotis
Roles: Funding Acquisition, Investigation, Resources

Andreana Muhammad
Roles: Funding Acquisition, Software, Validation

Antonius Antonius
Roles: Funding Acquisition, Supervision, Writing – Review & Editing

Heriyansyah Heriyansyah
Roles: Funding Acquisition, Software, Validation

Dina Marlina Oktavia
Roles: Data Curation, Funding Acquisition, Visualization

Mirnawati Dewinda Rumalean Ansanai
Roles: Data Curation, Funding Acquisition, Writing – Review & Editing

Yuni Marhayuni
Roles: Funding Acquisition, Resources, Visualization

OPEN PEER REVIEW

REVIEWER STATUS AWAITING PEER REVIEW

This article is included in the Cheminformatics gateway.

Abstract

Heavy metal pollution threatens environmental and human health. Machine learning (ML) offers a powerful approach to process complex spectroscopic and electrochemical data for pollution status classification. This systematic review synthesises studies using ML to classify heavy metal pollution status in environmental matrices. Following PRISMA 2020 guidelines, we searched Scopus, SpringerLink, and ScienceDirect (2015–2026), identifying 825 records. After title/abstract screening and full-text assessment, 11 studies met inclusion criteria: target heavy metal, environmental matrix, ML/chemometrics, and pollution status classification outcome. Lead (Pb, 54.5%), cadmium (Cd, 45.5%), and mercury (Hg, 45.5%) were most frequently studied. Water dominated matrices (72.7%), followed by soil (18.2%); no sediment studies met criteria. Electrochemical techniques (72.7%) were more common than spectroscopy (36.4%). Support Vector Machine (45.5%), Random Forest (36.4%), and Artificial Neural Networks (36.4%) were most used, while deep learning (CNN, LSTM) achieved highest performance (accuracy up to 100%, AUC up to 0.999). Critically, no study integrated spectroscopic and electrochemical data; all used only a single modality. ML achieves excellent classification performance using either spectroscopy or electrochemistry alone. However, the complete absence of integrated approaches represents a significant research gap. Future research should prioritise data fusion, external validation, class imbalance handling, and expansion to soil and sediment matrices.

Keywords

heavy metal; machine learning; spectroscopy; electrochemistry; classification; systematic review

Corresponding author: Widuri Rosman

Competing interests: No competing interests were disclosed.

Grant information: This research was supported by the Indonesia Endowment Fund for Education (Lembaga Pengelola Dana Pendidikan – LPDP), Ministry of Finance, Republic of Indonesia.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Copyright: © 2026 Rosman W et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite: Rosman W, Felinda SO, Hartono U et al. Machine Learning for Heavy Metal Pollution Status Classification Using Spectroscopic and Electrochemical Data: A Systematic Review [version 1; peer review: awaiting peer review]. F1000Research 2026, 15(Chem Inf Sci):880 (https://doi.org/10.12688/f1000research.182642.1) First published: 06 Jun 2026, 15(Chem Inf Sci):880 (https://doi.org/10.12688/f1000research.182642.1) Latest published: 06 Jun 2026, 15(Chem Inf Sci):880 (https://doi.org/10.12688/f1000research.182642.1)

1. Introduction

Heavy metal pollution is one of the most serious environmental challenges of the 21st century. Heavy metals such as lead (Pb), cadmium (Cd), mercury (Hg), chromium (Cr), arsenic (As), copper (Cu), nickel (Ni), and zinc (Zn) enter the environment through various anthropogenic activities, including mining, metal smelting, industrial waste, intensive agriculture, and domestic wastewater discharge (Jaishankar et al., 2014; Tchounwou et al., 2012). Unlike organic pollutants that can degrade, heavy metals are persistent, non-biodegradable, and accumulate in food chains (Ali & Khan, 2019; Rehman et al., 2018).

The health impacts of heavy metals are severe: Pb causes neurological disorders (Flora et al., 2012), Cd is carcinogenic (Godt et al., 2006), Hg damages the central nervous system (Rice et al., 2014), and As is associated with various cancers (Hughes et al., 2011). The World Health Organization (WHO) and regulatory bodies such as the US EPA have established water quality and soil quality standards for heavy metals, which serve as references for determining pollution status (US EPA, 2026; WHO, 2022). For example, WHO sets the Pb limit in drinking water at 10 μg/L, while the European Union sets As limit in soil at 5 mg/kg (Hu et al., 2024) and Cu limit at 20 mg/kg (Qi et al., 2025).

Standard analytical methods for heavy metals such as AAS, ICP-OES, and ICP-MS offer high accuracy and sensitivity, but have limitations: expensive instrumentation, need for trained personnel, complex sample preparation, and lack of in-situ detection capabilities (Bansod et al., 2017; Taylor et al., 2017). As alternatives, spectroscopic techniques (LIBS, SERS, vis-NIR, 3D fluorescence) and electrochemical techniques (voltammetry, EIS) have been developed due to their lower cost, portability, and potential for real-time detection (Gumpu et al., 2015; Sawan et al., 2020). However, both approaches generate complex data that are difficult to interpret manually, thus requiring machine learning (ML) and chemometrics.

Machine learning has proven effective for processing multi-dimensional, non-linear, noisy spectroscopic and electrochemical data (Lussier et al., 2020; Puthongkham et al., 2021). Various algorithms have been applied, including Support Vector Machines (SVM), Random Forest (RF), Convolutional Neural Networks (CNN), Long Short-Term Memory (LSTM), and ensemble learning (Dean et al., 2019; Hu et al., 2024). These algorithms can extract relevant features, improve classification accuracy, and automate decision-making.

In environmental monitoring, pollution status classification refers to determining whether a sample is “contaminated” or “not contaminated” by comparing heavy metal concentrations with regulatory standards or thresholds (Vareda et al., 2019). Classification can be binary (contaminated vs. non-contaminated) or multi-level (low, medium, high). This approach differs from regression (predicting numerical values) or ion type discrimination. Pollution status classification has higher practical value for remediation decisions and regulatory compliance (Hu et al., 2024; Qi et al., 2025).

Several reviews have discussed ML applications in heavy metal detection (Borrill et al., 2019; Huang et al., 2023; Lussier et al., 2020). However, existing reviews generally focus on technical aspects of algorithms without systematically considering: (1) performance comparison between spectroscopic and electrochemical data, (2) pollution status classification outcomes (not merely identification or quantification), and (3) evaluation on real environmental matrices with class imbalance.

More importantly, no systematic review has specifically examined the integration of spectroscopic and electrochemical data for heavy metal pollution status classification in environmental matrices (water, soil, sediment, wastewater). Such integration has the potential to improve accuracy through complementary information. Of 154 articles screened in this study, only 11 met the inclusion criteria (heavy metal target, environmental matrix, ML/chemometrics, and pollution status classification outcome). However, none of these 11 articles integrated both data types; all used only one modality (spectroscopy alone or electrochemistry alone). This indicates a significant research gap.

This systematic review aims to identify, evaluate, and synthesize studies that use ML/chemometrics algorithms for heavy metal pollution status classification based on spectroscopic and/or electrochemical data in environmental matrices. The research questions are formulated using the P-E-O (Population, Exposure, Outcome) framework. Specifically, this review addresses the following questions: (1) Which heavy metals and sample matrices are most frequently reported in studies using ML for pollution status classification? (2) Which ML algorithms are most commonly used, and what data features are extracted from spectra or voltammetric signals? (3) What are the performance metrics (accuracy, precision, recall, AUC, F1-score) of pollution status classification models? (4) Does the integration of spectroscopic and electrochemical data significantly improve classification accuracy compared to using a single data type?

2. Methodology

This systematic review was conducted in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) 2020 statement (Page et al., 2021). The review protocol was not registered but followed established guidelines for systematic reviews in environmental science and analytical chemistry (Chandler et al., 2019; Liberati et al., 2009). The study selection process is summarized in Figure 1 (see Section 3.1).

Figure 1. PRISMA flow diagram.

2.1. Research design

This study employed a systematic literature review design to identify, evaluate, and synthesize peer-reviewed research articles that apply machine learning (ML) or chemometric algorithms for heavy metal pollution status classification based on spectroscopic and/or electrochemical data in environmental matrices. The review was structured around the P-E-O (Population, Exposure, Outcome) framework (Methley et al., 2014; Schardt et al., 2007), as illustrated in Figure 2.

a) Population (P): Spectroscopic (absorbance/intensity) or electrochemical (potential/current) data from heavy metals (Pb, Cd, Hg, Cr, As, Cu, Ni, Zn, etc.) in environmental matrices (water, soil, sediment, wastewater).
b) Exposure (E): Machine learning or chemometric algorithms used to process analytical data.
c) Outcome (O): Binary or multi-level classification of heavy metal pollution status based on regulatory thresholds or established standards.

Figure 2. P-E-O conceptual framework.

Figure 2 shows the schematic of the P-E-O framework applied in this review.

2.2. Eligibility criteria

Studies were included or excluded based on the criteria summarized in Table 1.

Table 1. Criteria for inclusion and exclusion of studies.

Criterion	Inclusion	Exclusion
Article type	Peer-reviewed original research, English language	Reviews, conference proceedings, editorials, theses, non-English
Target analyte	Heavy metals (Pb, Cd, Hg, Cr, As, Cu, Ni, Zn, etc.)	Non-heavy metals (e.g., organic pollutants, microplastics, nutrients, algal blooms)
Sample matrix	Environmental matrices (water, soil, sediment, wastewater)	Food matrices, plants, laboratory standard solutions only
Data type	Spectroscopic (LIBS, SERS, FTIR, vis-NIR, fluorescence, Raman, hyperspectral) OR electrochemical (voltammetry, EIS, potentiometry)	Non-optical/non-electrochemical methods without ML
Algorithm	ML, deep learning, or chemometric algorithms (SVM, RF, ANN, CNN, LSTM, PLS, PCA, ensemble, etc.)	Traditional statistical methods without ML
Outcome	Classification of pollution status (binary or multi-level) based on regulatory thresholds (WHO, EPA, EU)	Regression (concentration prediction), ion type discrimination, clustering without threshold
Publication period	2015 – April 2026	Before 2015 or after April 2026

2.3. Information sources

Searches were conducted in the following electronic databases:

• Scopus (Elsevier)
• SpringerLink (Springer Nature)
• ScienceDirect (Elsevier)

These databases were selected due to their comprehensive coverage of peer-reviewed literature in environmental science, analytical chemistry, materials science, and engineering disciplines relevant to heavy metal detection and machine learning applications. No additional databases (e.g., Web of Science, PubMed, IEEE Xplore) were used. The final search was performed on 6 May 2026. No filters for publication status were applied beyond the date range (2015–2026). Reference lists of included studies and relevant review articles were manually screened to identify additional eligible studies (snowballing). Table 2 presents the number of records initially identified from each database.

Table 2. Databases and last search dates.

No.	Database	Initial records	Last search date
1	Scopus	133	6 May 2026
2	ScienceDirect	681	6 May 2026
3	Springer	11	6 May 2026
Total		825

2.4. Search strategy

A systematic search was conducted on 6 May 2026 using three electronic databases: Scopus, SpringerLink, and ScienceDirect. The same core search string was applied across all databases, with minor syntactic adjustments to accommodate each platform. The search combined keywords related to three main concepts: (1) heavy metals, (2) spectroscopic/electrochemical techniques, and (3) machine learning/chemometrics, using Boolean operators (AND, OR). The base search string was:

("heavy metal " OR "toxic metal " OR lead OR cadmium OR mercury OR chromium OR arsenic OR copper OR nickel OR zinc OR Pb OR Cd OR Hg OR Cr OR As OR Cu OR Ni OR Zn) AND ("spectroscopy" OR "spectromet" OR "LIBS" OR "SERS" OR "Raman" OR "FTIR" OR "vis-NIR" OR "fluorescence" OR "hyperspectral" OR "voltammetry" OR "SWASV" OR "DPV" OR "CV" OR "electrochemical" OR "EIS" OR "potentiometry" OR "stripping voltammetry") AND ("machine learning" OR "deep learning" OR "chemometric" OR "artificial neural network" OR "support vector machine" OR "random forest" OR "convolutional neural network" OR "LSTM" OR "ensemble learning" OR "PLS" OR "PCA" OR "SVM" OR "ANN" OR "CNN") AND ("classification" OR "contamination" OR "pollution status" OR "threshold" OR "binary classification" OR "multi-class classification")

Additional filters were applied uniformly: peer-reviewed research articles, English language, and publication year >2014 (or 2015–2026 for ScienceDirect). The search was performed by two independent reviewers; discrepancies were resolved by consensus. Results were exported to reference management software for duplicate removal. The number of records retrieved per database is reported in Table 2 (Section 2.3).

2.5. Study selection process

Study selection followed a four-stage process, as illustrated in Figure 1 (Section 3.1). The process was conducted independently by two reviewers, with disagreements resolved through consensus or consultation with a third reviewer.

Stage 1: Identification. A total of 825 records were identified from the three databases (Scopus: 133; SpringerLink: 681; ScienceDirect: 11) on 6 May 2026. All records were exported to reference management software, where duplicates were removed. After duplicate removal, the remaining records proceeded to the screening stage.

Stage 2: Screening (title and abstract). The titles and abstracts of all unique records were screened against the eligibility criteria ( Table 1). Records that clearly did not meet the inclusion criteria were excluded. When eligibility could not be determined from the title and abstract alone, the full text was retained for the next stage. This stage yielded 154 records considered potentially relevant and thus included for full-text review.

Stage 3: Eligibility (full-text screening). Full texts of the 154 potentially eligible records were retrieved and assessed independently by two reviewers against the full eligibility criteria. A total of 143 records were excluded at this stage, primarily because the study area was not relevant to the review’s focus (e.g., target analyte not a heavy metal, matrix not environmental, outcome not classification of pollution status, or no machine learning algorithm applied). Reasons for exclusion were documented for each excluded full text.

Stage 4: Included. After full-text screening, only 11 studies met all inclusion criteria and were included in the final qualitative synthesis.

The PRISMA flow diagram ( Figure 1, Section 3.1) provides a complete visual summary of the selection process, including the number of records at each stage and reasons for exclusion.

2.6. Data extraction

Data were extracted from each included study using a standardized data extraction form developed in Microsoft Excel. The form was piloted on three randomly selected studies and refined accordingly. Data extraction was performed by one reviewer and independently verified by a second reviewer. Discrepancies were resolved through discussion.

The extracted information included bibliographic details, study characteristics, target heavy metal(s), matrix type, concentration range, regulatory threshold, spectroscopic or electrochemical technique, preprocessing methods, machine learning algorithms, validation approach, classification performance metrics (accuracy, precision, recall, F1-score, AUC), and any integration of multiple data types.

Due to space limitations, the full data extraction results are not presented within this article. However, the complete dataset, including all extracted variables and intermediate calculations, is available in the supplementary repository (see Data Availability Statement). A summary of the extracted data is presented in Tables 4–12 (Section 3).

2.7. Data items

Data were extracted from each included study covering the following items:

• Bibliographic information: Authors, year, journal, country.
• Population (P): Target heavy metal(s), matrix type (water, soil, sediment), concentration range, regulatory threshold used for classification (e.g., WHO, EPA, EU).
• Exposure (E): Data type (spectroscopy or electrochemistry), specific technique, preprocessing methods, ML/chemometric algorithm(s), feature extraction, validation approach.
• Outcome (O): Classification task (binary or multi-level), performance metrics reported (accuracy, precision, recall, F1-score, AUC, etc.), best-performing model.
• Data integration: Whether spectroscopic and electrochemical data were combined (yes/no); if yes, fusion level (data, feature, or decision).

No assumptions were made for missing or unclear data; items not reported were recorded as “not reported”. A summary of the extracted data is provided in Tables 3–12 (Section 3).

Table 3. Summary characteristics of included studies.

No.	Author (Year)	Matrix	Target Metal(s)	Data Type	ML Algorithm(s)
1	Hu et al. (2024)	Soil	As	Spectroscopy (vis-NIR)	SVM, RF, GBDT, Ridge, ensemble
2	Wei et al. (2023)	Water	As³⁺, Cr⁶⁺	Spectroscopy (SERS)	SVM, CNN, tSNE
3	Lahari et al. (2025)	Water	Cd²⁺, Pb²⁺, Cu²⁺, Hg²⁺	Electrochemical (DPV)	CNN, ANN
4	Kailasam et al. (2024)	Water	Ni²⁺, Cu²⁺	Electrochemical (CV, DPV)	Naïve Bayes, ANN, SVM, DT
5	Dean et al. (2019)	Seawater	Pb, Cd, Cu, Hg	Electrochemical (CSWV)	LSTM, FCN, ALSTM-FCN, LDA, PCA-SVM
6	Chen et al. (2025)	Water	9 metals	Spectroscopy (3D fluorescence)	RF, SVM, ANN, DT
7	Park et al. (2022)	Water	Pb (NO₃)₂	Spectroscopy (SERS)	RBFSVM, LR, NB, DT, RF, MLP
8	Hajzus et al. (2022))	Seawater	Cd, Cu, Hg, Pb	Electrochemical (CSWV)	LSTM, FCN, ALSTM-FCN
9	Valle et al. (2025)	Water	Hg²⁺, Ag⁺, Fe³⁺	Electrochemical (EIS)	Decision Tree (MCS), IDMAP
10	Qi et al. (2025)	Soil	Cu	Spectroscopy (vis-NIR)	BalancedRandomForest, EasyEnsemble, RUSBoost
11	Maity et al. (2023)	Water	Pb²⁺, Hg²⁺ + E. coli	Electrochemical (GFET)	PCA, ANN

Table 4. Frequency of target heavy metals across included studies.

Heavy Metal	No. of Studies (%)	Source (representative studies)
Lead (Pb)	6 (54.5%)	(Dean et al., 2019; Hajzus et al., 2022; Hu et al., 2024; Lahari et al., 2025; Maity et al., 2023; Park et al., 2022)
Cadmium (Cd)	5 (45.5%)	(Chen et al., 2025; Dean et al., 2019; Hajzus et al., 2022; Kailasam et al., 2024; Lahari et al., 2025)
Mercury (Hg)	5 (45.5%)	(Dean et al., 2019; Hajzus et al., 2022; Lahari et al., 2025; Maity et al., 2023; Valle et al., 2025)
Copper (Cu)	4 (36.4%)	(Dean et al., 2019; Hajzus et al., 2022; Lahari et al., 2025; Qi et al., 2025)
Arsenic (As)	3 (27.3%)	(Chen et al., 2025; Hu et al., 2024; Wei et al., 2023)
Chromium (Cr)	2 (18.2%)	(Chen et al., 2025; Wei et al., 2023)
Nickel (Ni)	2 (18.2%)	(Chen et al., 2025; Kailasam et al., 2024)
Others (Ag, Fe, Mn, Co, Zn)	1 each (9.1%)	(Chen et al., 2025; Valle et al., 2025)

Table 5. Distribution of sample matrices.

Matrix Type	No. of Studies (%)	Source
Water (tap, lake, seawater, wastewater)	8 (72.7%)	(Chen et al., 2025; Dean et al., 2019; Hajzus et al., 2022; Lahari et al., 2025; Maity et al., 2023; Park et al., 2022; Valle et al., 2025; Wei et al., 2023)
Soil	2 (18.2%)	(Hu et al., 2024; Qi et al., 2025)
Sediment	0 (0%)	–

Table 6. Spectroscopic techniques used in included studies.

Technique	No. of Studies	Source
vis-NIR spectroscopy	2	(Hu et al., 2024; Qi et al., 2025)
Surface-Enhanced Raman Spectroscopy (SERS)	2	(Park et al., 2022; Wei et al., 2023)
3D fluorescence spectroscopy	1	(Chen et al., 2025)
Total spectroscopic	4 (36.4%)	–

Table 7. Electrochemical techniques used in included studies.

Technique	No. of Studies	Source
Differential Pulse Voltammetry (DPV)	2	(Kailasam et al., 2024; Lahari et al., 2025)
Cyclic Square Wave Voltammetry (CSWV)	2	(Dean et al., 2019; Hajzus et al., 2022)
Electrochemical Impedance Spectroscopy (EIS)	2	(Maity et al., 2023; Valle et al., 2025)
Cyclic Voltammetry (CV)	1	(Kailasam et al., 2024)
Graphene Field-Effect Transistor (GFET)	1	(Maity et al., 2023)
Total electrochemical	8 (72.7%)	–

Table 8. Frequency of ML/chemometric algorithms.

Algorithm	No. of Studies (%)	Source
Support Vector Machine (SVM)	5 (45.5%)	(Dean et al., 2019; Hu et al., 2024; Kailasam et al., 2024; Park et al., 2022; Wei et al., 2023)
Random Forest (RF)	4 (36.4%)	(Chen et al., 2025; Hu et al., 2024; Park et al., 2022; Qi et al., 2025)
Artificial Neural Network (ANN/MLP)	4 (36.4%)	(Chen et al., 2025; Kailasam et al., 2024; Lahari et al., 2025; Park et al., 2022)
Convolutional Neural Network (CNN)	3 (27.3%)	(Chen et al., 2025; Lahari et al., 2025; Wei et al., 2023)
Long Short-Term Memory (LSTM)	2 (18.2%)	(Dean et al., 2019; Hajzus et al., 2022)
Decision Tree (DT)	2 (18.2%)	(Kailasam et al., 2024; Valle et al., 2025)
Fully Convolutional Network (FCN)	2 (18.2%)	(Dean et al., 2019; Hajzus et al., 2022)
Others (NB, LDA, kNN, LR, BalancedRF, etc.)	1 each (9.1%)	Various

Table 9. Common preprocessing and feature extraction methods.

Method Category	Specific Technique	No. of Studies	Source
Baseline correction	Asymmetric LS, IRLS	3	(Dean et al., 2019; Park et al., 2022; Wei et al., 2023)
Normalization	Min-Max, Z-score, PSD	4	(Lahari et al., 2025; Park et al., 2022; Qi et al., 2025)
Smoothing	Savitzky-Golay, moving average	3	(Chen et al., 2025; Hu et al., 2024; Qi et al., 2025)
Derivatives	First derivative (FD)	2	(Hu et al., 2024; Qi et al., 2025)
Scatter correction	MSC, SNV	2	(Hu et al., 2024; Qi et al., 2025)
Dimensionality reduction	PCA	4	(Chen et al., 2025; Dean et al., 2019; Lahari et al., 2025; Wei et al., 2023)
Data augmentation	SMOTE	3	(Lahari et al., 2025; Qi et al., 2025; Wei et al., 2023)

Table 10. Summary of classification performance metrics.

Study	Best Model	Accuracy (%)	AUC	Recall	F1-Score
Hu et al. (2024)	SVC	83	0.89	0.86	NR
Wei et al. (2023)	SVM	>97	NR	NR	NR
Lahari et al. (2025)	CNN	99–100	NR	0.99	0.99
Kailasam et al. (2024)	Naïve Bayes	93.2	NR	NR	NR
Dean et al. (2019)	ALSTM-FCN	NR	0.999	NR	NR
Chen et al. (2025)	RF	97.8–100	NR	NR	NR
Park et al. (2022)	RBFSVM	–	0.846	NR	NR
Hajzus et al. (2022)	ALSTM-FCN	–	0.998	NR	NR
Valle et al. (2025)	Decision Tree	99	NR	NR	NR
Qi et al. (2025)	BalancedRandomForest	–	0.870	0.816	NR
Maity et al. (2023)	ANN	>99	NR	NR	NR

Table 11. Integration of multiple data types across included studies.

Integration Type	No. of Studies (%)	Source
Spectroscopy only	4 (36.4%)	(Chen et al., 2025; Hu et al., 2024; Park et al., 2022; Qi et al., 2025; Wei et al., 2023)
Electrochemistry only	7 (63.6%)	(Dean et al., 2019; Hajzus et al., 2022; Kailasam et al., 2024; Lahari et al., 2025; Maity et al., 2023; Valle et al., 2025)
Spectroscopy + Electrochemistry integrated	0 (0%)	–

Table 12. Risk of bias summary for included studies.

Domain	Low Risk	Unclear Risk	High Risk	Source
Sample representativeness	9	2	0	Author assessment (adapted from JBI and QUADAS-2)
Reference standard (ICP-MS/AAS)	11	0	0	Author assessment
ML model validation	8	3	0	Author assessment
Reporting of performance metrics	6	5	0	Author assessment
Justification of classification threshold	5	4	2	Author assessment (Valle et al., 2025; Wei et al., 2023)
Handling of confounding factors (pH, temp, matrix)	4	5	2	Author assessment

2.8. Study risk of bias assessment

Risk of bias was assessed independently by two reviewers using a tailored tool adapted from the Joanna Briggs Institute (JBI) checklist and QUADAS-2 (Moola et al., 2020; Whiting et al., 2011). Six domains were evaluated:

1) Sample representativeness
2) Reference standard (e.g., ICP-MS, AAS)
3) ML model validation (cross-validation, independent test set)
4) Reporting of performance metrics
5) Justification of classification threshold
6) Handling of confounding factors (pH, temperature, matrix effects)

Each domain was rated as “low risk”, “high risk”, or “unclear risk”. Studies were not excluded based on bias scores; the assessment was used to inform the narrative synthesis and sensitivity analysis. Results are summarized in Table 12 and Figure 7 (Section 3.8).

2.9. Effect measures

Due to anticipated heterogeneity in study designs, target analytes, matrices, and classification tasks, a meta-analysis was not performed. Effect measures were summarized narratively. For classification performance, the following metrics were extracted when reported:

• Accuracy: proportion of correctly classified samples.
• Sensitivity (recall): true positive rate.
• Specificity: true negative rate.
• Precision: positive predictive value.
• F1-score: harmonic mean of precision and recall.
• AUC: area under the ROC curve.

When multiple models were reported, the best-performing model based on accuracy or AUC was prioritized. For studies that compared integrated vs. single-modality approaches, the improvement in accuracy (percentage points) was calculated. A summary of performance metrics is presented in Table 10 and Figures 5–6 (Section 3.6).

2.10. Synthesis methods

A narrative synthesis approach was adopted (Popay et al., 2006). The synthesis was structured around the four research questions and organized thematically:

• Tabulation: Extracted data were organized into summary tables ( Tables 3–12).
• Thematic analysis: Findings were grouped by:
- 1) Target heavy metal and matrix (RQ1) → Tables 4–5
- 2) ML algorithm and feature extraction (RQ2) → Tables 6–9
- 3) Performance metrics (RQ3) → Table 10, Figures 5–6
- 4) Data integration (RQ4) → Table 11
• Visualization: A PRISMA flow diagram ( Figure 1), bar charts ( Figures 2–5), ROC curves ( Figure 6), and a traffic light plot for risk of bias ( Figure 7) were generated.

Subgroup analyses were planned by matrix type (water vs. soil), data type (spectroscopy vs. electrochemistry), and algorithm family (traditional ML vs. deep learning vs. ensemble). A sensitivity analysis was conducted by excluding studies with high risk of bias to assess the robustness of the findings. Publication bias was not formally assessed because of the narrative nature of the synthesis and heterogeneity of outcome measures.

3. Results

3.1. Study selection

The systematic search yielded a total of 825 records from three databases: Scopus (n = 133), SpringerLink (n = 681), and ScienceDirect (n = 11). After duplicate removal, 567 unique records remained. Title and abstract screening excluded 413 records, leaving 154 records for full-text assessment. Full-text screening resulted in the exclusion of 143 records, primarily because the study area was not relevant to the review’s focus (e.g., target analyte not a heavy metal, matrix not environmental, outcome not pollution status classification, or no machine learning algorithm applied). A total of 11 studies met all eligibility criteria and were included in the final qualitative synthesis. The PRISMA flow diagram is presented in Figure 1.

3.2. Characteristics of included studies

A summary of the 11 included studies is presented in Table 3. The studies were published between 2019 and 2025, with the majority appearing after 2022. The corresponding authors were primarily from China (n = 5), the United States (n = 3), South Korea (n = 2), and India (n = 1). All studies were peer-reviewed original research articles published in English.

3.3. Distribution by year and geographic origin

The number of included studies increased over time, as shown in Figure 2. Only one study was published before 2020 (Dean et al., 2019), followed by two studies in 2022 (Hajzus et al., 2022; Park et al., 2022), three in 2023 (Chen et al., 2025; Maity et al., 2023; Wei et al., 2023), two in 2024 (Hu et al., 2024; Kailasam et al., 2024), and three in 2025 (Lahari et al., 2025; Qi et al., 2025; Valle et al., 2025). The annual growth rate reflects increasing research interest in applying machine learning for heavy metal pollution classification.

Geographically, most studies originated from Asia (China, South Korea, India) and North America (United States). The geographic distribution is presented in Figure 3.

Figure 3. Geographic distribution of included studies by corresponding author affiliation.

3.4. Population characteristics: heavy metals and sample matrices

The most frequently targeted heavy metals were lead (Pb, 54.5%, n = 6), cadmium (Cd, 45.5%, n = 5), and mercury (Hg, 45.5%, n = 5), followed by copper (Cu, 36.4%, n = 4), arsenic (As, 27.3%, n = 3), chromium (Cr, 18.2%, n = 2), and nickel (Ni, 18.2%, n = 2). The frequency distribution is presented in Table 4.

Regarding sample matrices, water dominated the included studies (72.7%, n = 8), including surface water, tap water, lake water, seawater, and wastewater. Soil was the second most common matrix (18.2%, n = 2). No studies on sediment matrices met the inclusion criteria. The distribution of sample matrices is presented in Table 5.

3.5. Exposure characteristics: data types and algorithms

Among the 11 included studies, 8 (72.7%) used electrochemical techniques, while 4 (36.4%) used spectroscopic techniques (note: one study, Chen et al., used fluorescence spectroscopy). No study integrated both data types. The distribution of data types is presented in Figure 4.

Figure 4. Distribution of data types across included studies.

Spectroscopic techniques used included vis-NIR spectroscopy (Hu et al., 2024; Qi et al., 2025), SERS (Park et al., 2022; Wei et al., 2023), and 3D fluorescence spectroscopy (Chen et al., 2025). Details are provided in Table 6.

Electrochemical techniques included cyclic square wave voltammetry (CSWV), differential pulse voltammetry (DPV), electrochemical impedance spectroscopy (EIS), and graphene field-effect transistor (GFET). Details are provided in Table 7.

Machine learning algorithms: The most frequently used algorithms were Support Vector Machine (SVM, 45.5%, n = 5), Random Forest (RF, 36.4%, n = 4), and Artificial Neural Network (ANN/MLP, 36.4%, n = 4). Deep learning algorithms (CNN, LSTM) appeared in 5 studies and demonstrated the highest performance (AUC > 0.99). The frequency distribution is presented in Table 8.

Preprocessing and feature extraction methods commonly included baseline correction, normalization (Z-score, Min-Max, area), PCA for dimensionality reduction, and Savitzky-Golay smoothing. A summary is provided in Table 9.

3.6. Outcome characteristics: classification performance

All 11 studies reported classification outcomes. Seven studies performed binary classification (contaminated vs. non-contaminated or above vs. below threshold), while four studies performed multi-class classification (metal type identification or concentration level classification). The performance metrics are summarized in Table 10.

The best-performing models were deep learning algorithms (CNN, LSTM, ALSTM-FCN), achieving AUC values up to 0.999 and accuracy up to 100%. Ensemble methods (BalancedRandomForest) also demonstrated robust performance for imbalanced soil datasets. A comparison of accuracy across studies is presented in Figure 5, and representative ROC curves are shown in Figure 6.

Figure 5. Best classification accuracy reported by each included study.

Figure 6. Representative micro-average ROC curves for the 11-class seawater dataset (11-SW).

The ALSTM-FCN model achieved an AUC of 0.999.

3.7. Integration of spectroscopic and electrochemical data

None of the 11 included studies (0%) integrated spectroscopic and electrochemical data. All studies used only a single data modality: spectroscopy alone (n = 4, 36.4%) or electrochemistry alone (n = 7, 63.6%). Therefore, no evidence was available to assess whether data integration improves classification accuracy compared to single-modality approaches. The findings are summarized in Table 11.

3.8. Risk of bias assessment

The risk of bias assessment for the 11 included studies is summarized in Table 12 and visualized in Figure 7. Most studies showed low risk of bias across the six domains. Sample representativeness, reference standard, ML model validation, and reporting of performance metrics were generally adequate. Issues were identified in some studies regarding the justification of classification thresholds (e.g., Valle et al. (2025) did not specify a threshold) and handling of confounding factors (e.g., pH, temperature, matrix effects were sometimes not addressed). No study was excluded based on risk of bias.

Figure 7. Risk of bias summary for 11 included studies.

3.9. Summary key findings

From 825 initial records ( Table 2), only 11 studies ( Table 3) met all inclusion criteria. Key findings include:

1) Target heavy metals: Pb (54.5%) and Cd (45.5%) were the most frequently studied ( Table 4).
2) Sample matrices: Water dominated (72.7%); soil accounted for 18.2%; no sediment studies met inclusion criteria ( Table 5).
3) Data types: Electrochemistry (72.7%) was more common than spectroscopy (36.4%). No study integrated both data types ( Table 11; Figure 4).
4) Algorithms: SVM (45.5%), RF (36.4%), and ANN (36.4%) were most common ( Table 8). Deep learning (CNN, LSTM) achieved the highest performance (AUC up to 0.999) ( Table 10; Figure 6).
5) Classification performance: Accuracy ranged from 79.3% to 100%; AUC ranged from 0.846 to 0.999 ( Table 10; Figure 5).
6) Data integration: Zero studies (0%) integrated spectroscopic and electrochemical data ( Table 11), representing a significant research gap.

4. Discussion

This systematic review identified only 11 studies that met the eligibility criteria for using machine learning (ML) to classify heavy metal pollution status based on spectroscopic or electrochemical data in environmental matrices. The most striking finding is that none of the included studies integrated spectroscopic and electrochemical data, despite the complementary nature of these two analytical modalities. This result is consistent with the observation that most ML applications in environmental chemistry still rely on single-source data (Lussier et al., 2020; Puthongkham et al., 2021). However, it also highlights a previously underexplored gap: the potential for data fusion to improve pollution status classification has not been systematically tested.

The predominance of lead (Pb, 54.5%), cadmium (Cd, 45.5%), and mercury (Hg, 45.5%) as target analytes reflects global regulatory priorities (US EPA, 2026; WHO, 2022) and the well-documented toxicity of these metals (Jaishankar et al., 2014; Tchounwou et al., 2012). However, the underrepresentation of chromium (Cr, 18.2%) and nickel (Ni, 18.2%) both common contaminants in industrial wastewater suggests that researchers may be prioritising metals for which spectroscopic or electrochemical signals are easier to obtain, rather than those most needed for environmental monitoring.

Water matrices accounted for 72.7% of included studies, while soil represented only 18.2%, and sediment none. This distribution is not surprising given that liquid samples are easier to analyse with portable sensors and require less complex preprocessing (Bansod et al., 2017). Nevertheless, soil and sediment are major sinks for heavy metals and pose long-term risks to food safety and groundwater (Ali & Khan, 2019). The scarcity of ML-based classification studies in solid matrices indicates a methodological gap that warrants further investigation, particularly for real-world contaminated sites.

Regarding ML algorithms, traditional models such as SVM (45.5%), RF (36.4%), and ANN (36.4%) remain the most commonly used. However, deep learning architectures (CNN, LSTM, ALSTM-FCN, FCN) achieved the highest reported performance, with accuracy up to 100% and AUC up to 0.999 (Dean et al., 2019; Hajzus et al., 2022; Lahari et al., 2025). This superiority is consistent with findings in chemometrics and sensor signal processing, where deep learning automatically extracts hierarchical features from raw or minimally preprocessed data. Yet, deep learning models typically require larger datasets. Among the included studies, only Qi et al. (2025) explicitly addressed class imbalance using SMOTE or resampling techniques, and none reported external validation on independent datasets from different sites or instruments. Thus, the reported high performance may be optimistic and not generalisable to smaller or more imbalanced real-world datasets.

Comparison with previous systematic reviews is instructive. Earlier reviews focused on heavy metal quantification (regression) or metal species identification (Borrill et al., 2019; Huang et al., 2023; Lussier et al., 2020). This review is the first, to our knowledge, to specifically target pollution status classification – a decision-oriented task that directly informs regulatory compliance and remediation actions. The fact that only 11 out of 154 screened studies met this criterion underscores that the field is still nascent. Moreover, none of the existing reviews have highlighted the complete absence of spectroscopic-electrochemical integration, which we report here as a major evidence gap.

4.1. Limitations of the evidence included in the review

The body of evidence has several intrinsic limitations that affect the strength and generalisability of our conclusions.

First, heterogeneity in classification tasks and thresholds. Some studies defined binary contamination based on WHO drinking water guidelines (e.g., Pb > 10 μg/L), while others used national or regional standards (e.g., Chinese soil quality standards, EU guidelines). Multi-class classification tasks varied widely: some distinguished metal types (Pb vs. Cd vs. Hg), others classified concentration levels (low, medium, high). This heterogeneity prevented meta-analysis and makes direct comparisons of performance metrics (e.g., accuracy) tenuous.

Second, incomplete reporting of performance metrics. While most studies reported accuracy and AUC, many omitted precision, recall, F1-score, or specificity. This is problematic for pollution status classification, where class imbalance is common (e.g., contaminated samples are often rare). Without recall or F1-score, it is impossible to assess whether a model simply predicts the majority class (non-contaminated) and still achieves high accuracy. Only Qi et al. (2025) and Hu et al. (2024) explicitly addressed class imbalance using SMOTE or balanced random forest.

Third, lack of external validation. All 11 studies used internal cross-validation or a single held-out test set from the same source. None reported external validation on an independent dataset collected from a different site, instrument, or temporal period. This raises concerns about overfitting and limits the real-world applicability of the models, especially for spectroscopic and electrochemical sensors that are sensitive to matrix effects (pH, temperature, ionic strength, organic matter).

Fourth, limited matrix diversity and geographical bias. Water matrices dominate (72.7%), but even within water, most studies used spiked laboratory samples or controlled natural water (e.g., seawater, tap water) rather than naturally contaminated environmental waters with complex backgrounds. Soil studies were few (Hu et al., 2024; Qi et al., 2025), and sediment studies were absent. Geographically, 8 of 11 studies originated from China, the United States, and South Korea, raising questions about generalisability to other regions with different soil types, water chemistry, and regulatory frameworks.

Fifth, risk of bias in classification threshold justification. Our risk of bias assessment ( Figure 7, Table 12) revealed that several studies did not clearly justify the choice of regulatory threshold for defining “contaminated” vs. “non-contaminated” samples. For example, Valle et al. (2025) tested concentrations down to 1 nM for Hg but did not explicitly classify based on a regulatory limit, making it difficult to translate their findings into a binary pollution status decision.

4.2. Limitations of the review processes

This systematic review has several methodological limitations that should be acknowledged.

Search strategy and database coverage. The search was limited to three databases (Scopus, SpringerLink, ScienceDirect) and did not include Web of Science, PubMed, or IEEE Xplore. Although the combination of Scopus and SpringerLink provides broad coverage of environmental science and analytical chemistry literature, relevant studies in engineering, materials science, or biomedical sensors may have been missed. The final search date was 6 May 2026, so studies published after this date are not included.

Language and publication bias. Only peer-reviewed articles in English were included. This may exclude relevant studies published in other languages or non-peer-reviewed sources (e.g., conference proceedings, preprints). The exclusion of non-English studies may introduce language bias, particularly since heavy metal pollution is a global issue with substantial research output from non-English speaking countries.

Risk of bias assessment. Although we used a tailored tool adapted from JBI and QUADAS-2, the assessment was qualitative and subjective for some domains (e.g., handling of confounding factors). No meta-analysis was performed due to heterogeneity, so we could not statistically assess publication bias using funnel plots or Egger’s test.

Data extraction limitations. Some studies did not report complete performance metrics, and we recorded these as “not reported”. It is possible that contacting authors could have retrieved missing data, but this was beyond the scope of this review.

4.3. Implications for practice, policy, and future research

For environmental monitoring practice and regulatory agencies, current ML-based classification models have demonstrated high accuracy (up to 100%) in controlled settings, but practitioners should exercise caution when applying these models to new environmental matrices without revalidation. The absence of integrated spectroscopic-electrochemical approaches means that field-deployable sensors still rely on a single modality, potentially missing complementary information. Regulatory acceptance of ML-based methods will require standardised protocols for model training, validation, and reporting. Agencies such as the WHO and EPA would benefit from developing guidelines for ML-based pollution status classification, including minimum requirements for external validation and handling of class imbalance.

For future research, six priorities emerge from this review: (1) integration of spectroscopic and electrochemical data through data fusion strategies, which remains untested for pollution status classification; (2) expansion to underrepresented matrices (soil, sediment) and metals (Cr, Ni, Zn) using real-world contaminated samples; (3) external validation on independent datasets from different geographical locations, instruments, and time periods; (4) standardised reporting of accuracy, precision, recall, specificity, F1-score, and AUC, with balanced accuracy for imbalanced data; (5) explicit handling of class imbalance using SMOTE, RUSBoost, or BalancedRandomForest; and (6) incorporation of explainable AI (XAI) methods to enhance trust and understanding of ML decisions. Addressing these gaps will accelerate the translation of ML-based sensors from laboratory prototypes to field-deployable, regulatory-accepted monitoring systems.

5. Conclusion

This systematic review identified only 11 studies that used machine learning for heavy metal pollution status classification based on spectroscopic or electrochemical data. Lead, cadmium, and mercury were the most frequently studied metals, and water was the dominant matrix (72.7%). Support Vector Machine, Random Forest, and deep learning architectures (CNN, LSTM) achieved high classification performance, with accuracy up to 100% and AUC up to 0.999. Critically, no study integrated spectroscopic and electrochemical data; all used only a single modality. This evidence gap precludes any conclusion on whether data fusion improves classification accuracy. Future research should prioritise multi-modal integration, external validation, and expansion to soil and sediment matrices.

5.1. Recommendation

Future research should prioritise the integration of spectroscopic and electrochemical data through data fusion strategies, as this remains the most critical gap identified in this review. No study to date has combined both modalities for pollution status classification, despite their complementary information. Additionally, researchers must expand to underrepresented matrices (soil and sediment) and heavy metals (Cr, Ni, Zn), using real-world contaminated samples rather than spiked laboratory solutions.

Methodologically, future studies should implement external validation on independent datasets from different geographical locations, standardise reporting of performance metrics (accuracy, recall, precision, F1-score, AUC), and explicitly address class imbalance using techniques such as SMOTE or BalancedRandomForest. Incorporating explainable AI methods will also enhance model interpretability and regulatory acceptance. Addressing these priorities will accelerate the translation of ML-based sensors from laboratory prototypes to field-deployable monitoring systems.

Ethical approval and consent to participate

Not applicable. This systematic review did not involve any direct human or animal subjects, nor did it collect primary data requiring ethical approval. All analyses were based on previously published peer-reviewed articles.

Data availability

The PRISMA 2020 checklist, PRISMA flow diagram, conceptual framework figure, and the dataset underlying this systematic literature review (including the data extraction form) have been deposited in the Zenodo repository and are publicly accessible at: https://doi.org/10.5281/zenodo.20162818 (Rosman et al., 2026).

The repository includes the following supplementary files:

Supplementary Document 1: PRISMA 2020 checklist.

Supplementary Document 2: Data extraction form.

All data are available under the terms of the Creative Commons Zero v1.0 Universal

Acknowledgments

The authors gratefully acknowledge the Indonesia Endowment Fund for Education (LPDP) for providing financial support. We also thank the Institut Teknologi Bandung for facilitating this research. We appreciate the authors of the 11 primary studies included in this review for their open data and transparent reporting.

References

Ali H, Khan E: Trophic transfer, bioaccumulation, and biomagnification of non-essential hazardous heavy metals and metalloids in food chains/webs—Concepts and implications for wildlife and human health. Hum. Ecol. Risk Assess. Int. J. 2019; 25(6): 1353–1376. Publisher Full Text
Bansod B, Kumar T, Thakur R, et al.: A review on various electrochemical techniques for heavy metal ions detection with different sensing platforms. Biosens. Bioelectron. 2017; 94: 443–455. PubMed Abstract | Publisher Full Text
Borrill AJ, Reily NE, Macpherson JV: Addressing the practicalities of anodic stripping voltammetry for heavy metal detection: a tutorial review. Analyst. 2019; 144(23): 6834–6849. PubMed Abstract | Publisher Full Text
Chandler J, Cumpston M, Li T, et al.: Cochrane handbook for systematic reviews of interventions. Hoboken: Wiley; 2019; 4(1002): 14651858.
Chen J, Xiong X, Ye J, et al.: Machine learning-assisted three-dimensional fluorescence for heavy metal multi-sensing. Sensors Actuators B Chem. 2025; 431: 137385. Publisher Full Text
Dean SN, Shriver-Lake LC, Stenger DA, et al.: Machine learning techniques for chemical identification using cyclic square wave voltammetry. Sensors. 2019; 19(10): 2392. PubMed Abstract | Publisher Full Text | Free Full Text
Flora G, Gupta D, Tiwari A: Toxicity of lead: a review with recent updates. Interdiscip. Toxicol. 2012; 5(2): 47–58. PubMed Abstract | Publisher Full Text | Free Full Text
Godt J, Scheidig F, Grosse-Siestrup C, et al.: The toxicity of cadmium and resulting hazards for human health. Journal of Occupational Medicine and Toxicology. 2006; 1(1): 22. PubMed Abstract | Publisher Full Text | Free Full Text
Gumpu MB, Sethuraman S, Krishnan UM, et al.: A review on detection of heavy metal ions in water–an electrochemical approach. Sensors Actuators B Chem. 2015; 213: 515–533. Publisher Full Text
Hajzus JR, Shriver-Lake LC, Dean SN, et al.: Modifications of epitaxial graphene on SiC for the electrochemical detection and identification of heavy metal salts in seawater. Sensors. 2022; 22(14): 5367. PubMed Abstract | Publisher Full Text | Free Full Text
Hu T, Qi C, Wu M, et al.: Classification of arsenic contamination in soil across the EU by vis-NIR spectroscopy and machine learning. Int. J. Appl. Earth Obs. Geoinf. 2024; 134: 104158. Publisher Full Text
Huang Y, Harilal SS, Bais A, et al.: Progress toward machine learning methodologies for laser-induced breakdown spectroscopy with an emphasis on soil analysis. IEEE Transactions on Plasma Science. 2023; 51(7): 1729–1749. Publisher Full Text
Hughes MF, Beck BD, Chen Y, et al.: Arsenic exposure and toxicology: a historical perspective. Toxicol. Sci. 2011; 123(2): 305–332. PubMed Abstract | Publisher Full Text | Free Full Text
Jaishankar M, Tseten T, Anbalagan N, et al.: Toxicity, mechanism and health effects of some heavy metals. Interdiscip. Toxicol. 2014; 7(2): 60–72. PubMed Abstract | Publisher Full Text | Free Full Text
Kailasam V, Sankararajan R, Kailasam M, et al.: Machine Learning Assisted Metal Oxide-Bismuth Oxy Halide Nanocomposite for Electrochemical Sensing of Heavy Metals in Aqueous Media. Cryst. Res. Technol. 2024; 59(5): 2300173. Publisher Full Text
Lahari SA, Kumawat N, Amreen K, et al.: IoT integrated and deep learning assisted electrochemical sensor for multiplexed heavy metal sensing in water samples. NPJ Clean Water. 2025; 8(1): 10. Publisher Full Text
Liberati A, Altman DG, Tetzlaff J, et al.: The PRISMA statement for reporting systematic reviews and meta-analyses of studies that evaluate healthcare interventions: explanation and elaboration. BMJ. 2009; 339: b2700. Publisher Full Text
Lussier F, Thibault V, Charron B, et al.: Deep learning and artificial intelligence methods for Raman and surface-enhanced Raman scattering. TrAC Trends Anal. Chem. 2020; 124: 115796. Publisher Full Text
Maity A, Pu H, Sui X, et al.: Scalable graphene sensor array for real-time toxins monitoring in flowing water. Nat. Commun. 2023; 14(1): 4184. PubMed Abstract | Publisher Full Text | Free Full Text
Methley AM, Campbell S, Chew-Graham C, et al.: PICO, PICOS and SPIDER: a comparison study of specificity and sensitivity in three search tools for qualitative systematic reviews. BMC Health Serv. Res. 2014; 14(1): 579. PubMed Abstract | Publisher Full Text | Free Full Text
Moola S, Munn Z, Tufanaru C, et al.: Systematic reviews of economic evidence. JBI manual for evidence synthesis. JBI; 2020.
Page MJ, McKenzie JE, Bossuyt PM, et al.: The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ. 2021; 372.
Park S, Lee J, Khan S, et al.: Machine learning-based heavy metal ion detection using surface-enhanced Raman spectroscopy. Sensors. 2022; 22(2): 596. Publisher Full Text
Popay J, Roberts H, Sowden A, et al.: Guidance on the conduct of narrative synthesis in systematic reviews. A Product from the ESRC Methods Programme Version. 2006; 1(1): b92.
Puthongkham P, Wirojsaengthong S, Suea-Ngam A: Machine learning and chemometrics for electrochemical sensors: moving forward to the future of analytical chemistry. Analyst. 2021; 146(21): 6351–6364. Publisher Full Text
Qi C, Zhou N, Hu T, et al.: Prediction of copper contamination in soil across EU using spectroscopy and machine learning: Handling class imbalance problem. Smart Agricultural Technology. 2025; 10: 100728. Publisher Full Text
Rehman K, Fatima F, Waheed I, et al.: Prevalence of exposure of heavy metals and their impact on health consequences. J. Cell. Biochem. 2018; 119(1): 157–184. Publisher Full Text
Rice KM, Walker EM Jr, Wu M, et al.: Environmental mercury and its toxic effects. J. Prev. Med. Public Health. 2014; 47(2): 74–83. PubMed Abstract | Publisher Full Text | Free Full Text
Rosman W, Felinda SO, Hartono U, et al.: Machine Learning for Heavy Metal Pollution Status Classification Using Spectroscopic and Electrochemical Data: A Systematic Review. Zenodo. 2026. Publisher Full Text
Sawan S, Maalouf R, Errachid A, et al.: Metal and metal oxide nanoparticles in the voltammetric detection of heavy metals: A review. TrAC Trends Anal. Chem. 2020; 131: 116014. Publisher Full Text
Schardt C, Adams MB, Owens T, et al.: Utilization of the PICO framework to improve searching PubMed for clinical questions. BMC Med. Inform. Decis. Mak. 2007; 7(1): 16. PubMed Abstract | Publisher Full Text | Free Full Text
Taylor A, Barlow N, Day MP, et al.: Atomic spectrometry update: review of advances in the analysis of clinical and biological materials, foods and beverages. J. Anal. At. Spectrom. 2017; 32(3): 432–476. Publisher Full Text
Tchounwou PB, Yedjou CG, Patlolla AK, et al.: Heavy metal toxicity and the environment. Molecular, Clinical and Environmental Toxicology: Volume 3: Environmental Toxicology. 2012; 133–164.
US EPA: National recommended water quality criteria – Aquatic life criteria table. DC: 2026.
Valle SF, Soares AC, Neto MP, et al.: Polysulfides From Inverse Vulcanization Used in Electronic Tongues for Heavy Metal Sensing. J. Appl. Polym. Sci. 2025; 142(41): e57582. Publisher Full Text
Vareda JP, Valente AJM, Durães L: Assessment of heavy metal pollution from anthropogenic activities and remediation strategies: A review. J. Environ. Manag. 2019; 246: 101–118. PubMed Abstract | Publisher Full Text
Wei H, Huang Y, Santiago PJ, et al.: Decoding the metabolic response of Escherichia coli for sensing trace heavy metals in water. Proc. Natl. Acad. Sci. USA. 2023; 120(7): e2210061120. Publisher Full Text
Whiting PF, Rutjes AWS, Westwood ME, et al.: QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann. Intern. Med. 2011; 155(8): 529–536. Publisher Full Text
WHO: Guidelines for drinking-water quality: Fourth edition incorporating the first and second addenda. Geneva: 2022.

Comments on this article Comments (0)

Version 1

VERSION 1 PUBLISHED 06 Jun 2026