ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Research Article

An interpretable machine learning model of biological age

[version 1; peer review: 2 approved with reservations]
PUBLISHED 04 Jan 2019
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS

This article is included in the Artificial Intelligence and Machine Learning gateway.

Abstract

Background: Assessments of biological (rather than chronological) age derived from patient biochemical data have been shown to strongly predict both all-cause and disease-specific mortality. However, these population-based approaches have yet to be translated to the individual. As well as using biological age as a research tool, by being able to better answer the question “why did we get this result?”, clinicians may be able to apply personalised interventions that could improve the long-term health of individual patients.
Methods: Here, the boosted decision tree algorithm XGBoost was used to predict biological age using 39 commonly-available blood test results from the US National Health and Nutrition Examination Survey (NHANES) database.
Results: Interrogation of the algorithm produced a description of how each marker contributed to the final output in a single individual. Additive explanation plots were then used to determine biomarker ranges associated with a lower biological age. Importantly, a number of markers that are modifiable with lifestyle changes were found to have a significant effect on biological age, including fasting blood glucose, lipids, and markers of red blood cell production.
Conclusions: The combination of individualised outputs with target ranges could provide the ability to personalise interventions or recommendations based on an individual’s biochemistry and resulting predicted age. This would allow for the investigation of interventions designed to improve health and longevity in a targeted manner, many of which could be rooted in targeted lifestyle modifications.

Keywords

Aging, Machine Learning, Age

Introduction

One of the fastest-growing areas at the intersection of clinical medicine and data science is the investigation of human aging1, with multiple avenues being explored to find biomarkers of aging that could be used to inform efforts to enhance human longevity24. If robust and easily-accessible biomarkers of aging are identified, they could assist in the rapid assessment of promising interventions aimed at increasing longevity, without the need to perform clinical trials that last decades. For instance, epigenetic modifications on DNA are increasingly being used to determine biological (rather than chronological) age, including how environmental determinants may affect an epigenetic signal for longevity4.

An individual’s biological age can be described based on the assumption that cellular aging processes, which are highly-influenced by the environment5, occur at different rates in different people with the same chronological age. As these ageing processes are associated with changes in routine biochemical measures6, algorithmic determination of biological or phenotypic age using widely-available indices such as those from blood test results is therefore becoming increasingly common. This has previously been done using both machine learning (ML) and statistical techniques3,6.

One important aspect for the utility of biological age measures is that a given output can be interpreted in order to guide individualized interventions. ML-based predictions of biological age have the potential to elucidate and describe complex, non-linear, and unintuitive patterns in biochemical data, which may provide greater predictive power compared to other statistical techniques. To date, published approaches to generate predicted biological age from biochemical data have used deep neural networks (DNNs), with the output being directly associated with mortality risk3. However, while individual outputs from DNNs are interpretable7, it is currently not possible to interrogate the effects of the entire training dataset on the model output, which may be important for determining how one may intervene given an individual’s output.

As a result of the issues with interpreting certain ML algorithms, the field of explainable artificial intelligence is developing rapidly8. If such approaches can be successfully applied to determining biological age from commonly available data, biological signatures of aging could be more rapidly discovered and tracked, including the ability to personalise interventions based on the outputs of the model. Here, we describe the development of an explainable ML model using blood marker data from the National Health and Nutrition Examination Survey (NHANES) database to predict biological age, as well as provide individual weighting for how each biomarker affected the final output. By determining how markers affect the model globally, potential target reference ranges associated with lower biological age can also be determined.

Methods

Input data

Data from a total of 46,739 participants (n=22,545 males and n=24,194 females) in the NHANES database were included, with a mean (range) age of 48.5 (19.0–85.0) years. A total of 39 common blood markers were used: complete blood count (CBC) with differential, lipids, fasting glucose, iron panel, and a comprehensive metabolic panel (including electrolytes, and liver and kidney function). Descriptive data for the dataset is listed in Table 1.

Table 1. Demographic data from the entire NHANES dataset.

VariableFemales
(n=24,194)
Males
(n=22,545)
MeanSDMeanSD
Chronological Age (years)48.118.848.818.8
Red blood cells (×103/µl)4.40.44.90.5
Red blood cell distribution width (%)13.21.513.01.1
Hematocrit (%)39.23.644.23.8
Hemoglobin (g/dl)13.31.315.01.3
Mean corpuscular hemoglobin (pg)30.12.530.62.2
Mean corpuscular hemoglobin concentration (g/dl)33.91.034.01.0
Mean corpuscular volume (fl)88.96.190.15.3
Platelets (×103/µl)267.970.7238.061.1
Mean platelet volume (fl)8.20.98.20.9
Neutrophils (×103/µl)4.41.84.21.8
Lymphocytes (×103/µl)2.21.02.11.5
Monocytes (×103/µl)0.50.20.60.2
Eosinophils (×103/µl)0.20.20.20.2
Basophils (×103/µl)0.00.10.00.1
Total Cholesterol (mg/dl)199.042.5192.442.9
Low-density lipoprotein cholesterol (mg/dl)115.735.8115.736.1
High-density lipoprotein cholesterol (mg/dl)57.516.447.814.2
Triglycerides (mg/dl)129.0101.6146.9136.6
Glucose (mg/dl)98.837.1103.439.5
Iron (µg/dl)77.334.492.935.8
Total Iron Binding Capacity (µg/dl)380.971.7351.955.0
Ferritin (ng/ml)78.1103.6183.5180.3
Sodium (mmol/l)138.92.4139.32.3
Potassium (mmol/l)3.90.34.10.3
Chloride (mmol/l)103.83.0103.32.9
Carbon Dioxide (mmol/l)24.32.425.12.2
Calcium (mg/dl)9.40.49.50.4
Phosphorus (mg/dl)3.80.53.70.6
Creatinine (mg/dl)0.80.41.00.5
Urea nitrogen (mg/dl)12.66.014.46.1
Albumin (g/dl)4.10.44.40.3
Globulins (g/dl)3.00.52.90.5
Alanine transaminase (IU/l)21.520.529.527.8
Aspartate transaminase (IU/l)23.615.127.822.2
Alkaline Phosphatase (IU/l)71.127.871.826.6
Bilirubin (mg/dl)0.60.30.80.3
Gamma glutamyl-transferase (IU/l)24.335.734.852.1
Lactate dehydrogenase (IU/l)131.629.7133.135.8
Uric Acid (mg/dl)4.81.36.11.3

Model generation

NHANES data (all available individuals with the 39 markers listed in Table 1 from years 1999–2015) was downloaded as .xpt files from the NHANES website using their in-built web search engine. The data was then concatenated, cross-tabulated, and stratified by gender. A random split in the data set was created to withhold 20% of participants (n=4,509 males and n=4,839) for model validation. The remaining 80% of the dataset was used to train an XGBRegressor model (XGBoost version 0.81) using chronological age and the 39 biochemical input markers. For the remaining 20% of the data, the 39 markers were provided to the algorithm9 with the chronological age withheld, and the resulting dependent variable “predicted age” defined as a measure of biological age. Age predictions for the withheld data were plotted against actual age using jointplot from the seaborn Python library (version 0.9.0).

Model interrogation

For individual predictions, the weight of each marker was extracted using ELI5 (version 0.8.1), and graphed using a waterfall chart (version 3.8). For a given age prediction, each marker was individually weighted with regard to how it contributed to the final output. Shapley additive explanations plots (SHAP, version 0.26.0) were constructed to describe how each individual marker affects the predicted age output within the laboratory normal range.

Worked example

To provide an individual output example based on data not seen by the algorithm9 previously, author C.K. had the necessary input markers measured by Quest Laboratories (Santa Cruz, CA). As C.K. is an author who ran his own data through the algorithm9 he trained during development of the manuscript, institutional ethical approval was not sought for publication of this data. C.K. approved the publication of his data in this manner.

Results

Differences between predicted age and biological age

Linear regression analysis (Figure 1) showed a significant correlation between predicted (biological) and actual (chronological) age (r=0.77 and 0.75 in females and males, respectively; p<0.0001 for both). However, discrepancies between the biological and chronological age could be considered clinically relevant, as they would allow for the generation of a signature of premature biological aging.

6907fdc4-e604-4a5c-aab6-288fc8b45cf1_Figure1.gif

Figure 1. Linear regression analysis comparing actual (chronological) and predicted (biological) age.

Data shown for women (A) and men (B) using the 20% withheld data (n=4,509 males and n=4,839). A significant correlation between predicted and actual age (r=0.77 and 0.75 in females and males, respectively) was seen in both sexes (p<0.0001).

SHAP plots of input markers

SHAP summary plots (Figure 2) were used to determine which markers have the greatest influence on predicted biological age. The top 20 markers in terms of importance are shown. In females, blood urea nitrogen (BUN) had the greatest influence on biological age, with albumin the most influential marker in men. Fasting glucose was the second most influential marker in both sexes (Figure 2). SHAP plots for each of the 20 most influential markers are available on GitHub and Zenodo9. Based on each of these 20 markers, the level at which an inflection point was seen in the SHAP plot (i.e. when a further change in a marker would result in a net increase in predicted biological age) was determined, as well as the estimated range over which each marker would be associated with the lowest biological age (Table 2 and Table 3). Using the five most influential markers as an example, the lowest predicted age in women would be associated with a BUN 6–11 mg/dl, fasting glucose 71–86 mg/dl, bicarbonate (carbon dioxide) 19–22 mmol/l, total cholesterol 130–150 mg/dl, and mean corpuscular volume (MCV) 80–85 fl. In men, the lowest predicted age would be associated with albumin 4.6–4.8 g/dl, fasting glucose 70–88 mg/dl, BUN 6–12 mg/dl, red blood cell (RBC) 5.0–5.7 ×103/µl, and RBC distribution width (RDW) 11.0–12.5%.

6907fdc4-e604-4a5c-aab6-288fc8b45cf1_Figure2.gif

Figure 2. SHAP summary plots showing the adjustment to predicted age (x-axis) for each of the top 20 markers.

Data shown for women (A) and men (B). Each plot is made up of thousands of individual points from the training dataset such with a higher value being more red, and a lower value being more blue. This is depicted by the “feature value” bar on the right of each plot. Therefore, if the dots on one side of the central line are increasingly red or blue, that suggests that increasing values or decreasing values, respectively, move the predicated age in that direction. For instance, lower BUN values (blue dots) are associated with lower predicted age in both men and women.

Table 2. Top 20 markers affecting predicted age in women.

Ranking of markers affecting predicted age in women, in order of importance, as determined by the SHAP summary outputs. Visual examination of the individual SHAP plots for each marker was used to estimate the range over which each marker would result in the lowest predicted age, and the magnitude of the adjustment in years. The final column is the value at which a marker changes from a net negative to net positive effect on biological age.

Marker RankMarkerEstimated range for
lowest predicted age
Magnitude of
effect (years)
Inflection
point
1BUN6–11 mg/dl-9 to -212 mg/dl
2Glucose71–86 mg/dl-7.5 to -386 mg/dl
3Carbon Dioxide18–22 mmol/l-6 to -225 mmol/l
4Total Cholesterol130–150 mg/dl-5 to -1.5195 mg/dl
5MCV80–85 fl-3.5 to -1.090 fl
6LDH120–130 IU-1 to 0130 IU
7Creatinine0.62–0.78 mg/dl-3 to 00.82 mg/dl
8RDW10–12 %-4 to -0.50
9Lymphocytes2.3–3.0 ×10E3/µl-1.2 to 01.9 ×10E3/µl
10Sodium137–139 mmol/l-1.2 to 0140 mmol/l
11AST13–17 IU-2 to -0.522 IU
12Chloride103–106 mmol/l-1.5 to 0103 mmol/l
13GGT5–10 IU-3 to -0.515 IU
14ALT42–44 IU-4 to -121 IU
15ALP40–56 IU-2 to -0.265 IU
16Albumin4.6–4.8 g/dl-4 to -0.54.3 g/dl
17Neutrophils6.6–7 ×10E3/µl-2.5 to -0.55 ×10E3/µl
18Ferritin30–50 ng/ml-3 to 050 ng/ml
19Phosphorus5.2–7.1 mg/dl-4.8 to -14.1 mg/dl
20Potassium3.5–3.9 mmol/l-1 to 04.1 mmol/l

Table 3. Top 20 markers affecting predicted age in men.

Ranking of markers affecting predicted age in women, in order of importance, as determined by the SHAP summary outputs. Visual examination of the individual SHAP plots for each marker was used to estimate the range over which each marker would result in the lowest predicted age, and the magnitude of the adjustment in years. The final column is the value at which a marker changes from a net negative to net positive effect on biological age.

Marker RankMarkerEstimated range for
lowest predicted age
Magnitude of
effect (years)
Inflection
point
1Albumin4.6–4.8 g/dl-10 to -14.4 g/dl
2Glucose70–88 mg/dl-7.0 to -196 mg/dl
3BUN6–12 mg/dl-7.5 to -114 mg/dl
4RBC5.0–5.7 ×10E3/µl-4.0 to -0.54.8 ×10E3/µl
5RDW11.0–12.5 %-7.5 to -113%
6MCV79–87 fl-4.0 to -1.590 fl
7ALT33–45 IU-4.5 to -0.528 IU
8Phosphorus4.1–4.5 mg/dl-4.0 to -0.53.8 mg/dl
9Lymphocytes1.9–3.0 mg/dl-2.0 to -0.51.8 ×10E3/µl
10Total Cholesterol100–160 mg/dl-5.5 to -0.5190 mg/dl
11Platelets250–400 ×10E3/µl-3.0 to -0.2210 ×10E3/µl
12Potassium3.5–4.1 mmol/l-1.8 to -0.24.2 mmol/l
13Creatinine0.5–1.0 mg/dl-1.5 to 01.0 mg/dl
14LDH80–120 IU-3 to -0.5130 IU
15Triglycerides40–60 mg/dl-6 to -2100 mg/dl
16Monocytes0.1–0.5 ×10E3/µl-1.5 to 00.6 ×10E3/µl
17Neutrophils1.5–2.8 ×10E3/µl-2.2 to -0.23.4 ×10E3/µl
18MCHC34.3–35.7 g/dl-1 to 033.9 g/dl
19GGT6–15 IU-2.0 to 021 IU
20Total Bilirubin0.1–0.6 mg/dl-0.75 to 00.8 mg/dl

Fully interpretable personalised predictions

For a given individual, the model output allows for each marker to be individually weighted with regard to how it contributed to the final output (Figure 3). The average age in the training dataset (BIAS) is given as a starting point, with each marker subsequently increasing or decreasing predicted age by a number of years. This allows for the most influential markers for the individual to be determined. The example shown is for one of the study authors (C.K.), the data for whom is available on Zenodo9. Bias (48.3 years) is sequentially adjusted, with the five markers contributing most to an increase in biological age were BUN (+3.5 years), total cholesterol (+2.8 years), potassium (+1.7 years), phosphorus (+1.2 years), and LDH (+0.9 years). The five markers contributing most to a decrease in biological age were lymphocytes (-1.2 years), RBCs (-2.3 years), albumin (-2.7 years) fasting glucose (-3.1 years), and triglycerides (-3.9 years). The final predicted biological age was 43.0 years.

6907fdc4-e604-4a5c-aab6-288fc8b45cf1_Figure3.gif

Figure 3. Waterfall chart depicting how individual input markers contribute to a given predicted biological age (y-axis) for author C.K.

Bias (first column, 48.3 years) is the mean age in the input population. The five markers contributing most to an increase in biological age (columns 2–6 from the left) were BUN, total cholesterol, potassium, phosphorus, and LDH. The five markers contributing most to a decrease in biological age (columns 2–6 from the right) were lymphocytes, RBCs, albumin, glucose and triglycerides. The final predicted biological age (43.0 years) is in the last column.

Discussion

Biomarkers of aging are increasingly important in the development and investigation of interventions with which to slow aging processes, which may also have the ability to aid in the treatment or prevention of aging-associated chronic disease. One such marker is the individual’s biological or phenotypic age, as reflected by patterns of biochemical markers in the blood, which have previously been shown to be associated with risk of mortality2,3,6. While there are a number of approaches to this problem in the published literature, we provide an alternative using a tree-based ML model that a) is fully interpretable, b) can be completely individualized for a given patient, and c) allows the development of target ranges associated with a potential signature for slowed biological aging.

One issue surrounding the utility of algorithmically-derived biological age is the response to any associated interventions or therapeutics. As this field is relatively new, it is uncertain how much an improvement in predicted biological age resulting from a given therapeutic approach will translate into improvements in longevity. Even if a given marker decreases predicted biological age, this also does not guarantee that manipulating the value will increase longevity. For instance, in our models, increasing ALT and decreasing total cholesterol were associated with lower predicted biological age; however, there are a number of scenarios where lower total cholesterol and higher ALT may be associated with increased mortality despite a lower predicted biological age10,11. Despite this, these models are at least able to generate hypotheses that can be tested in both the preclinical and clinical setting. Our approach also provides an example that other groups may use to produce fully-interpretable and personalisable outputs.

Though the current analysis does not include confirmation of the ability to predict mortality risk, certain outputs from the algorithm9 do provide some confidence that the output is likely to be associated with individual health outcomes. For instance, the greatest increase in predicted age associated with fasting glucose level occurs in the range 90–100 mg/dl, which is strikingly similar to the blood glucose level associated with the largest increase in mortality risk in multiple population studies12,13. Similar associations are seen with many of the target ranges derived from the algorithm9, such as for albumin, RDW, and ferritin (especially in men)1416.

If modulation of certain markers does indeed contribute to the reversal of cellular aging processes, the combination of an individual output with the population SHAP plots for a given marker could therefore allow for targeted therapeutic interventions aimed at improving biological age based on an individual’s specific output. For instance, elevated fasting blood glucose could be decreased by addressing diet, exercise, micronutrient deficiencies, and reducing inflammation or psychosocial stress17. Similar approaches are also likely to improve cholesterol, RDW, and MCV, confirming that lifestyle factors should play a key role in the pursuit of health and longevity15,18,19. A personalised approach is important, because the markers contributing most strongly to biological age in the whole dataset are not necessarily the same markers that most strongly contribute to a prediction in a single individual (see example in Figure 3).

The current approach does have some limitations. The dataset may only be applicable in the United States, as different countries and ethnic backgrounds might display variations in both baseline biochemistry and predicted longevity3. Expanding available input data and allowing for stratification based on nationality and ethnic background will be the focus of future work. Larger and more expanded datasets will also allow for the analysis of biological aging in association with other potentially important factors such as genetics and the microbiota20,21. It is also worth mentioning that NHANES is designed to capture data that is representative of the US population. Therefore, this data comes from participants that represent a population that has some of the highest metabolic and cardiovascular disease prevalence in the Western world22,23, which may distort the results. Additionally, the current outputs would benefit from being correlated with disease outcomes or mortality in order to determine how well predicted biological age acts as an accurate biomarker of health and longevity.

By using well-understood and robust biomarkers that are available to almost any clinician, methods such as those described in this study can be used immediately as adjuncts to research investigating the outcomes of interventions designed to increase human longevity. As multiple methods are currently available with which to predict biological or phenotypic age, the field should also collaborate in an attempt to compare methods such that we can find the approach that results in an accurate output that can most easily be used in both the research and clinical settings.

Data availability

All NHANES data used to produce the models is accessible through the CDC website (listed by NHANES study year): https://wwwn.cdc.gov/nchs/nhanes/search/default.aspx.

Data access, tabulation, and concatenation is automated by the “01-download-preprocess” Jupyter notebook file within our Zenodo repository; DOI: https://doi.org/10.5281/zenodo.24402039. This repository also includes the original Quest laboratory test results from author C.K., which were used to provide the worked example (Figure 3).

Software availability

The algorithm developed here, including the associated libraries and the necessary versions, are available on Zenodo: https://doi.org/10.5281/zenodo.24402039.

License: GNU General Public License version 3

Notes: The algorithm itself can be trained and tested by running the “02-train-test-explain” Jupyter notebook. Note that each time the algorithm runs, a new random split in the dataset is generated in order to train and test the algorithm. Therefore, the resulting outputs might be slightly different.

Comments on this article Comments (1)

Version 1
VERSION 1 PUBLISHED 04 Jan 2019
  • Reader Comment 15 Jan 2019
    Evgeny Izumchenko, Johns Hopkins University, USA
    15 Jan 2019
    Reader Comment
    Similar work was performed and published in 2016: 
    https://www.ncbi.nlm.nih.gov/pubmed/27191382
    So here it is the same study performed on a different dataset with just one method. And in that study, the comparison ... Continue reading
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Wood TR, Kelly C, Roberts M and Walsh B. An interpretable machine learning model of biological age [version 1; peer review: 2 approved with reservations]. F1000Research 2019, 8:17 (https://doi.org/10.12688/f1000research.17555.1)
NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions
Version 1
VERSION 1
PUBLISHED 04 Jan 2019
Views
33
Cite
Reviewer Report 12 Feb 2019
Peter O. Fedichev, Moscow Institute of Physics and Technology, Moscow Region, Russian Federation;  Gero LLC, Singapore, Singapore 
Approved with Reservations
VIEWS 33
The manuscript concerns quantification of aging by means of biological age (BA) model trained as a predictor of chronological age from the widely available blood markers (complete blood cell counts and biochemistry). Understanding capabilities and the biology behind such biomarkers ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Fedichev PO. Reviewer Report For: An interpretable machine learning model of biological age [version 1; peer review: 2 approved with reservations]. F1000Research 2019, 8:17 (https://doi.org/10.5256/f1000research.19198.r43612)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
Views
56
Cite
Reviewer Report 15 Jan 2019
Alex Zhavoronkov,  Insilico Medicine, Inc.,, Baltimore, MD, USA 
Approved with Reservations
VIEWS 56
While the study is not novel and the technical sophistication is considerably low, the study addresses one of the most important challenges in biomedicine and I recommend accepting it if the authors agree to make substantial improvements to the manuscript, ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Zhavoronkov A. Reviewer Report For: An interpretable machine learning model of biological age [version 1; peer review: 2 approved with reservations]. F1000Research 2019, 8:17 (https://doi.org/10.5256/f1000research.19198.r42590)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.

Comments on this article Comments (1)

Version 1
VERSION 1 PUBLISHED 04 Jan 2019
  • Reader Comment 15 Jan 2019
    Evgeny Izumchenko, Johns Hopkins University, USA
    15 Jan 2019
    Reader Comment
    Similar work was performed and published in 2016: 
    https://www.ncbi.nlm.nih.gov/pubmed/27191382
    So here it is the same study performed on a different dataset with just one method. And in that study, the comparison ... Continue reading
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.