Skip to main content
Log in

Machine learning algorithm ensembles for early oral cancer risk assessment using Raman cyto-spectroscopy

  • Data analytics and machine learning
  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract

Early detection of oral cancer is essential for improving survival rates and diagnostic precision. Raman cyto-spectroscopy is a useful method, but its interpretive complexity prevents frequent clinical use. Prediction algorithms have been used to evaluate RS data and create CAD systems. An ensemble of MLAs and dimensionality reduction techniques has been demonstrated to effectively analyse the complex RS. In present study we utilised two MLA ensembles—(1) Linear Discriminant Analysis with Decision Tree Classifier (2) LDA with Support Vector Machine. Results show that RS of oral epithelial cells, collected from different study groups (viz. Normal-healthy volunteers; histo-pathologically confirmed- potentially cancerable- leucoplakia and cancer patients-OSCC) when classified using the ensembles had better overall accuracy of (88 ± 8%) as compared to either DTC (48 ± 19%) or SVM (58 ± 15%) when applied individually. Class-wise evaluation showed improved performance of LDA-SVM compared to LDA-DTC, and blind test was performed with RS of habitual cigarette smokers (CS) as susceptible group and healthy volunteers as control group to assess the classifier ensembles' performance in cancer risk prediction. The LDA-SVM ensemble showed the highest classification accuracy, indicating that it can be used to accurately detect oral cancer risk and screen susceptible patients.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Data availability

All data generated or analysed during this study are included in this published article.

References

Download references

Funding

The authors would like to acknowledge Indian Council of Medical Research (ICMR, India) for providing an ICMR-Senior Research Fellowship [IRIS ID -2017-4029] to AG.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ananya Barui.

Ethics declarations

Conflict of interest

The authors declare no financial or commercial conflicts of interests regarding the publication of this paper.

Ethical approval

All experiments were performed in compliance with the Indian Institute of Engineering, Science and Technology’s ethical committee policy.

Informed consent

All samples were collected from patients under their informed written consent.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chaudhuri, D., Ghosh, A., Raha, S. et al. Machine learning algorithm ensembles for early oral cancer risk assessment using Raman cyto-spectroscopy. Soft Comput 27, 13861–13875 (2023). https://doi.org/10.1007/s00500-023-08995-z

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00500-023-08995-z

Keywords

Navigation