Predicting human health from biofluid-based metabolomics using machine learning

Nature

Predicting human health from biofluid-based metabolomics using machine learning"


Play all audios:

Loading...

Biofluid-based metabolomics has the potential to provide highly accurate, minimally invasive diagnostics. Metabolomics studies using mass spectrometry typically reduce the high-dimensional


data to only a small number of statistically significant features, that are often chemically identified—where each feature corresponds to a mass-to-charge ratio, retention time, and


intensity. This practice may remove a substantial amount of predictive signal. To test the utility of the complete feature set, we train machine learning models for health state-prediction


in 35 human metabolomics studies, representing 148 individual data sets. Models trained with all features outperform those using only significant features and frequently provide high


predictive performance across nine health state categories, despite disparate experimental and disease contexts. Using only non-significant features it is still often possible to train


models and achieve high predictive performance, suggesting useful predictive signal. This work highlights the potential for health state diagnostics using all metabolomics features with


data-driven analysis.


While fundamental to personalized healthcare, it is often challenging to diagnose an individual’s health state (a general term encompassing disease and non-disease phenotypes like age) due


to low test sensitivity, specificity or the requirement of invasive procedures. Body-fluid sampling (e.g. blood or urine) offers a minimally invasive approach to identify health conditions


throughout the body. The traditional concept of biofluid-based diagnostics relies on health-state biomarkers. Biomarkers cover a broad spectrum of measurements1, but typically refer to a


small number of select and specific molecules or biopolymers, capable of differentiating healthy from diseased states. Currently, many biomarker-containing tests are used in routine lab


monitoring (e.g. complete blood count, ‘basic’ and ‘comprehensive’ metabolic panels, lipid panels, etc.) providing coarse health-state categorization. Tests for many diseases exist and


display a range of sensitivity and specificity, examples include: apolipoprotein E along with other measurements for Alzheimer’s disease2, the prostate-specific antigen test for prostate


cancer3, alpha fetoprotein (AFP) for liver cancer4, as well as a recent use of the SOMAscan5 for diagnosing tuberculosis6.


Metabolomics rapidly supplies information on thousands of molecules, and provides a method for biofluid-based diagnostics7,8,9. To date, serum, plasma, urine and cerebrospinal fluid (CSF)


metabolomics has been applied to many health states, ranging from cancers10,11,12,13 and infectious diseases14,15 to chronic obstructive pulmonary disease (COPD)16, smoking17 and Alzheimer’s


disease18,19. Metabolomics studies are regularly performed using analytical instrumentation like liquid or gas chromatography mass spectrometry (LC–MS and GC–MS respectively) as well as


nuclear magnetic resonance (NMR)20. Frequently, the goal is to determine the chemical identity of the features that are significantly altered between health states. For LC- and GC–MS


studies, a feature is defined by a mass-to-charge ratio (mz), retention time (rt) and intensity. While a chemical name cannot be assigned to the majority of features, analysis of those that


are identified allows for biological interpretation by differential analysis and biochemical pathway mapping18,21. Select chemically identified features are often used for differential


diagnostics or health state association. For instance, certain amino acids have been associated with diabetes22 as well as urinary formate, alanine, and hippurate with blood pressure23.


For diagnostic modeling purposes, full metabolomics data sets are generally not used for training, validation, and testing. To deal with the large number of features, a host of feature


selection methods and classification techniques are employed. Univariate statistical tests (Student’s t-test or Mann–Whitney U-test, MW-U) are routinely used to isolate statistically


significant features—usually identified using false discovery rate (FDR) adjusted P-values 


Trending News

'moana' star auli'i cravalho shames 'reopen hawaii' protesters: 'protect the old'

“Moana” star Auli’i Cravalho is a girl who loves her island. Cravalho had a message Monday for hundreds of “Reopen Hawai...

Novel activating STAT5B mutations as putative drivers of T-cell acute lymphoblastic leukemia

STAT5 has an important role in many hematologic malignancies, but constitutive activation is often a secondary event.1 M...

Birmingham va: cavc & cumberland law celebrate legal support for alabama veterans | va birmingham health care | veterans affairs

For the second year, the Central Alabama Veterans Collaborative (CAVC) recognized law students from Samford University’s...

Stocks Given Lift by GM Investment

The Dow index surged 127 points Wednesday in a broad Wall Street rally touched off by billionaire Kirk Kerkorian’s bold ...

Abc news – breaking news, latest news and videos

1 ‘Jane' breaks down during testimony about 'hotel nights' in Sean Combs trial 4:41Playing 2 Hiring slowe...

Latests News

Predicting human health from biofluid-based metabolomics using machine learning

Biofluid-based metabolomics has the potential to provide highly accurate, minimally invasive diagnostics. Metabolomics s...

A Combined Microphoresis-Autoradiography Technique

BASE ratios can be determined with microelectrophoresis (microphoresis) in as little as 5 × 10−10 g RNA1, a sensitivity ...

British airways: cabin crew say why people get drunk quicker in air

An air stewardess on the Channel 5 documentary ‘First Class vs Economy: Is it worth it?’ claimed altitude can mean alcoh...

Full attention | Nature

Book Review Published: 19 April 1974 Full attention DONALD E. BROADBENT  Nature volume 248, page 715 (1974)Cite this art...

Research heads seek European platform

The science ministers of Austria and Germany are supporting a bid by the European Science Foundation (ESF) to become a f...

Top