Global ETD Search

Return to search

Optimizing Protein Characterization using Machine Learning-Guided Mass Spectrometry

Mass spectrometry-based proteomics excels at high-throughput identification of proteins expressed in complex biological samples. However, the technology struggles to identify low abundance proteins due to large amounts of redundant data acquired for high abundance proteins with little collected for low abundance proteins. To improve the identification sensitivity of these proteins, I designed a machine learning classifier that assesses protein identification confidence on-the-fly, during mass spectrometry analysis. Proteins deemed confidently identified are excluded from further analysis, saving mass spectrometry resources for lower abundance proteins. Simulating data from a HEK293 cell lysate mass spectrometry analysis, our algorithm uses 16.2% - 66.2% fewer mass spectrometry resources with a 2.6% - 39.5% drop in protein identifications. When applied to live mass spectrometry experiments, these saved resources will likely improve the overall protein identification sensitivity of the experiment, particularly for lower abundance proteins, and will therefore provide a better understanding of the cell’s biology.

bioinformatics

proteomics

machine learning

protein identification

real-time mass spectrometry analaysis

Identifer	oai:union.ndltd.org:uottawa.ca/oai:ruor.uottawa.ca:10393/40865
Date	21 August 2020
Creators	Pelletier, Alexander
Contributors	Lavallée-Adam, Mathieu
Publisher	Université d'Ottawa / University of Ottawa
Source Sets	Université d’Ottawa
Language	English
Detected Language	English
Type	Thesis
Format	application/pdf

Page generated in 0.0021 seconds

Optimizing Protein Characterization using Machine Learning-Guided Mass Spectrometry

Description

Links & Downloads

Tags

Additional Fields