Background Differentiated thyroid cancer (DTC) is the most prevalent endocrine malignancy with a recurrence rate of about 20%, necessitating better predictive methods for patient management. This study aims to create a relational classification model to predict DTC recurrence by integrating clinical, pathological, and follow-up data.Methods The balanced dataset comprises 550 DTC samples collected over 15 years, featuring 13 clinicopathological variables. To address the class imbalance in recurrence status, the Synthetic Minority Over-sampling Technique for Nominal and Continuous (SMOTE-NC) was utilized. A hybrid model combining classification algorithms with association rule mining was developed. Two relational classification approaches, regularized class association rules (RCAR) and classification based on association rules (CBAR), were implemented. Binomial logistic regression analyzed independent predictors of recurrence. Model performance was assessed through accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and F1 score.Results The RCAR model demonstrated superior performance over the CBAR model, achieving accuracy, sensitivity, and F1 score of 96.7%, 93.1%, and 96.7%, respectively. Association rules highlighted that papillary pathology with an incomplete response strongly predicted recurrence. The combination of incomplete response and lymphadenopathy was also a significant predictor. Conversely, the absence of adenopathy and complete response to treatment were linked to freedom from recurrence. Incomplete structural response was identified as a critical predictor of recurrence risk, even with other low-recurrence conditions.Conclusion This study introduces a robust and interpretable predictive model that enhances personalized medicine in thyroid cancer care. The model effectively identifies high-risk individuals, allowing for tailored follow-up strategies that could improve patient outcomes and optimize resource allocation in DTC management.

A hybrid machine learning model combining association rule mining and classification algorithms to predict differentiated thyroid cancer recurrence

Ardigò, Luca Paolo
2024-01-01

Abstract

Background Differentiated thyroid cancer (DTC) is the most prevalent endocrine malignancy with a recurrence rate of about 20%, necessitating better predictive methods for patient management. This study aims to create a relational classification model to predict DTC recurrence by integrating clinical, pathological, and follow-up data.Methods The balanced dataset comprises 550 DTC samples collected over 15 years, featuring 13 clinicopathological variables. To address the class imbalance in recurrence status, the Synthetic Minority Over-sampling Technique for Nominal and Continuous (SMOTE-NC) was utilized. A hybrid model combining classification algorithms with association rule mining was developed. Two relational classification approaches, regularized class association rules (RCAR) and classification based on association rules (CBAR), were implemented. Binomial logistic regression analyzed independent predictors of recurrence. Model performance was assessed through accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and F1 score.Results The RCAR model demonstrated superior performance over the CBAR model, achieving accuracy, sensitivity, and F1 score of 96.7%, 93.1%, and 96.7%, respectively. Association rules highlighted that papillary pathology with an incomplete response strongly predicted recurrence. The combination of incomplete response and lymphadenopathy was also a significant predictor. Conversely, the absence of adenopathy and complete response to treatment were linked to freedom from recurrence. Incomplete structural response was identified as a critical predictor of recurrence risk, even with other low-recurrence conditions.Conclusion This study introduces a robust and interpretable predictive model that enhances personalized medicine in thyroid cancer care. The model effectively identifies high-risk individuals, allowing for tailored follow-up strategies that could improve patient outcomes and optimize resource allocation in DTC management.
2024
associative classification
differentiated thyroid cancer
machine learning
personalized medicine
recurrence prediction
File in questo prodotto:
File Dimensione Formato  
fmed-11-1461372.pdf

accesso aperto

Descrizione: CC BY 4.0 publisher version
Tipologia: Versione dell'editore
Licenza: Creative commons
Dimensione 423.15 kB
Formato Adobe PDF
423.15 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11562/1145252
Citazioni
  • ???jsp.display-item.citation.pmc??? 0
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact