Return to search

Detection and Classification of Cancer and Other Noncommunicable Diseases Using Neural Network Models

Here, we show that training with multiple noncommunicable diseases (NCDs) is both feasible and beneficial to modeling this class of diseases. We first use data from the Cancer Genome Atlas (TCGA) to train a pan cancer model, and then characterize the information the model has learned about the cancers. In doing this we show that the model has learned concepts that are relevant to the task of cancer classification. We also test the model on datasets derived independently of the TCGA cohort and show that the model is robust to data outside of its training distribution such as precancerous legions and metastatic samples. We then utilize the cancer model as the basis of a transfer learning study where we retrain it on other, non-cancer NCDs. In doing so we show that NCDs with very differing underlying biology contain extractible information relevant to each other allowing for a broader model of NCDs to be developed with existing datasets. We then test the importance of the samples source tissue in the model and find that the NCD class and tissue source may not be independent in our model. To address this, we use the tissue encodings to create augmented samples. We test how successfully we can use these augmented samples to remove or diminish tissue source importance to NCD class through retraining the model. In doing this we make key observations about the nature of concept importance and its usefulness in future neural network explainability efforts.

Identiferoai:union.ndltd.org:unt.edu/info:ark/67531/metadc2179319
Date07 1900
CreatorsGore, Steven Lee
ContributorsAzad, Rajeev K., Padilla, Pamela, Mikler, Armin, Shulaev, Vladimir, Mittler, Ron
PublisherUniversity of North Texas
Source SetsUniversity of North Texas
LanguageEnglish
Detected LanguageEnglish
TypeThesis or Dissertation
FormatText
RightsPublic, Gore, Steven Lee, Copyright, Copyright is held by the author, unless otherwise noted. All rights Reserved.

Page generated in 0.002 seconds