Multi-modality machine learning predicting Parkinson’s disease

Output Details

Published May 20, 2021

Personalized medicine promises individualized disease prediction and treatment. The convergence of machine learning (ML) and available multimodal data is key moving forward. We build upon previous work to deliver multimodal predictions of Parkinson’s disease (PD) risk and systematically develop a model using GenoML, an automated ML package, to make improved multi-omic predictions of PD, validated in an external cohort. We investigated top features, constructed hypothesis-free disease-relevant networks, and investigated drug–gene interactions. We performed automated ML on multimodal data from the Parkinson’s progression marker initiative (PPMI). After selecting the best performing algorithm, all PPMI data was used to tune the selected model. The model was validated in the Parkinson’s Disease Biomarker Program (PDBP) dataset. Our initial model showed an area under the curve (AUC) of 89.72% for the diagnosis of PD. The tuned model was then tested for validation on external data (PDBP, AUC 85.03%). Optimizing thresholds for classification increased the diagnosis prediction accuracy and other metrics. Finally, networks were built to identify gene communities specific to PD. Combining data modalities outperforms the single biomarker paradigm. UPSIT and PRS contributed most to the predictive power of the model, but the accuracy of these are supplemented by many smaller effect transcripts and risk SNPs. Our model is best suited to identifying large groups of individuals to monitor within a health registry or biobank to prioritize for further testing. This approach allows complex predictive models to be reproducible and accessible to the community, with the package, code, and results publicly available.
Tags
  • Genomics
  • Machine learning
  • Multi-omics
  • Original Research
  • PD risks
  • Predictive models

Meet the Authors

  • User avatar fallback logo

    Mary B Makarious

  • User avatar fallback logo

    Hampton Leonard

  • User avatar fallback logo

    Dan Vitale

  • User avatar fallback logo

    Hirotaka Iwaki

  • User avatar fallback logo

    Lana Sargent

  • User avatar fallback logo

    Anant Dadu

  • User avatar fallback logo

    Ivo Violich

  • User avatar fallback logo

    Elizabeth Hutchins

  • User avatar fallback logo

    David Saffo

  • User avatar fallback logo

    Sara Bandres-Ciga

  • User avatar fallback logo

    Jonggeol Jeff Kim

  • User avatar fallback logo

    Yeajin Song

  • User avatar fallback logo

    Melina Maleknia

  • User avatar fallback logo

    Matt Bookman

  • User avatar fallback logo

    Willy Nojopranoto

  • User avatar fallback logo

    Roy H Campbell

  • User avatar fallback logo

    Sayed Hadi Hashemi

  • Juan A. Botía, PhD

    Key Personnel: Team Hardy Team Wood

    University of Murcia

  • User avatar fallback logo

    John F. Carter

  • User avatar fallback logo

    David W. Craig

  • User avatar fallback logo

    Kendall Van Keuren-Jensen

  • Huw Morris

    Collaborating PI: Team Hardy

    University College London

  • John Hardy, PhD

    Lead PI (Core Leadership): Team Hardy

    University College London

  • Cornelis Blauwendraat, PhD

    Coalition for Aligning Science

  • User avatar fallback logo

    Mike Nalls