Background
Current studies indicate that fluorine-18-fluorodeoxyglucose positron emission tomography/computed tomography ([18F]FDG PET/CT) is the most accurate imaging modality for the detection of relapsed locally advanced non-small cell lung cancer (NSCLC) after curatively intended chemoradiotherapy. To date, there is no objective and reproducible definition for the diagnosis of disease recurrence in PET/CT, the reading of which is substantially influenced by post-radiation inflammatory processes. The aim of this study was to evaluate and compare visual and threshold-based semi-automated evaluation criteria for the assessment of suspected tumor recurrence in a well-defined study population investigated during the randomized clinical PET-Plan trial.
Methods
This retrospective analysis comprises 114 PET/CT data sets of 82 patients from the PET-Plan multi-center study cohort who underwent [18F]FDG PET/CT imaging at different time points because relapse was suspected on CT. Scans were first analyzed visually by four blinded readers, who assigned a binary score to each possible localization and recorded their certainty in each evaluation. Visual evaluations were conducted twice, first without and then with additional knowledge of the initial staging PET and radiotherapy delineation volumes. In a second step, uptake was measured quantitatively using maximum standardized uptake value (SUVmax), peak standardized uptake value corrected for lean body mass (SULpeak), and a liver threshold-based quantitative assessment model. The resulting sensitivity and specificity for relapse detection were compared to the findings of the visual assessment. The gold standard for recurrence was defined independently by the prospective study routine, including external reviewers, using CT, PET, biopsies, and the clinical course of the disease.
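As an illustration of how a liver threshold-based reading could be scored against the reference standard, the following is a minimal sketch and not the study's implementation. It assumes the PERCIST 1.0 style threshold of 1.5 × mean liver SUL + 2 × its standard deviation (the abstract only states that the definition is similar to PERCIST) and that lesion SULpeak is the quantity compared with it. The `Lesion` class, the function names, and all field names are hypothetical.

```python
from dataclasses import dataclass
from typing import Iterable, Tuple


@dataclass
class Lesion:
    """Hypothetical record for one evaluated localization."""
    sul_peak: float    # lesion SULpeak
    liver_mean: float  # mean SUL in a liver reference region of the same scan
    liver_sd: float    # standard deviation of SUL in that region
    relapse: bool      # reference-standard label (True = recurrence)


def liver_threshold(liver_mean: float, liver_sd: float) -> float:
    # Assumed PERCIST-style definition: 1.5 x mean + 2 x SD of liver SUL.
    return 1.5 * liver_mean + 2.0 * liver_sd


def is_positive(lesion: Lesion) -> bool:
    # A localization is read as positive if its uptake exceeds the
    # patient-individual liver threshold.
    return lesion.sul_peak > liver_threshold(lesion.liver_mean, lesion.liver_sd)


def sensitivity_specificity(lesions: Iterable[Lesion]) -> Tuple[float, float]:
    tp = fn = tn = fp = 0
    for lesion in lesions:
        positive = is_positive(lesion)
        if lesion.relapse:
            tp += positive
            fn += not positive
        else:
            fp += positive
            tn += not positive
    # Return NaN when a class is absent instead of dividing by zero.
    sens = tp / (tp + fn) if (tp + fn) else float("nan")
    spec = tn / (tn + fp) if (tn + fp) else float("nan")
    return sens, spec
```

Calling `sensitivity_specificity` on a list of `Lesion` records returns the pair (sensitivity, specificity) of the threshold-based reading, which can then be set against the visual assessment.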
Results
Overall interobserver agreement (IOA) of the visual assessment was moderate, with a large difference between evaluations rated as certain (κ = 0.66) and uncertain (κ = 0.24). Additional knowledge of the initial staging PET and radiotherapy delineation volumes improved the sensitivity (0.85 vs 0.92) but did not show a significant impact on the specificity (0.86 vs 0.89). The PET parameters SUVmax and SULpeak showed lower accuracy than the visual assessment, whereas threshold-based reading showed similar sensitivity (0.86) and higher specificity (0.97).
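For orientation, agreement among several readers giving binary scores can be summarized with a multi-rater kappa. The sketch below computes Fleiss' kappa; the abstract does not state which kappa statistic was used, so the choice of Fleiss' kappa is an assumption, and the function name and input layout are hypothetical.

```python
from typing import Sequence


def fleiss_kappa(counts: Sequence[Sequence[int]]) -> float:
    """Fleiss' kappa for a table where counts[i][j] is the number of
    readers who assigned item i to category j. Every item must be
    rated by the same number of readers (at least two)."""
    n_items = len(counts)
    n_raters = sum(counts[0])
    n_categories = len(counts[0])

    # Observed agreement: mean over items of the proportion of
    # agreeing reader pairs.
    p_bar = sum(
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ) / n_items

    # Chance agreement from the overall category proportions.
    p_e = sum(
        (sum(row[j] for row in counts) / (n_items * n_raters)) ** 2
        for j in range(n_categories)
    )

    if p_e == 1.0:
        # All ratings fall into one category; kappa is undefined.
        return float("nan")
    return (p_bar - p_e) / (1.0 - p_e)
```

With four readers and a binary score, each row would hold two counts summing to four, for example `[3, 1]` when three readers scored a localization as positive and one as negative.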
Conclusion
Visual assessment, especially if associated with high reader certainty, shows very high interobserver agreement and high accuracy that can be further increased by baseline PET/CT information. The implementation of a patient-individual liver threshold value definition, similar to the threshold definition in PERCIST, offers a more standardized method that matches the accuracy of experienced readers, albeit without providing further improvement in accuracy.
Identifier | oai:union.ndltd.org:DRESDEN/oai:qucosa:de:qucosa:93523 |
Date | 06 November 2024 |
Creators | Brose, Alexander, Michalski, Kerstin, Ruf, Juri, Tosch, Marco, Eschmann, Susanne M., Schreckenberger, Mathias, König, Jochem, Nestle, Ursula, Miederer, Matthias |
Publisher | BMC, part of Springer Nature |
Source Sets | Hochschulschriftenserver (HSSS) der SLUB Dresden |
Language | English |
Detected Language | English |
Type | info:eu-repo/semantics/publishedVersion, doc-type:article, info:eu-repo/semantics/article, doc-type:Text |
Rights | info:eu-repo/semantics/openAccess |
Relation | 1470-7330, 45, 10.1186/s40644-023-00567-6 |