Symptom-based diagnosis, Disease prediction, Machine learning, Healthcare analytics, Predictive modeling, Medical decision support system, Classification algorithms, Data mining in healthcare, Feature selection, Symptom–disease correlation, Clinical data analysis, Patient symptom dataset, Artificial intelligence in healthcare, Supervised learning, Diagnosiautomation, Machine learning models (SVM, Random Forest, Naïve Bayes),Healthcare prediction system, Data preprocessing, Early disease detection, Health informatics
Introduction
The text discusses the use of machine learning (ML) for symptom-based disease prediction to address the challenges of timely and accurate diagnosis in healthcare. Traditional diagnosis can be slow and error-prone, especially in high-demand settings, impacting vulnerable populations like the elderly, children, and those with limited access to care. ML models can analyze large datasets of symptoms to identify patterns and predict diseases quickly, improving diagnosis accuracy, enabling early treatment, and reducing healthcare costs.
The study reviews prior work on disease prediction using algorithms such as Naive Bayes, SVM, Decision Trees, Random Forest, and ensemble methods across illnesses including diabetes, heart disease, malaria, dengue, and tuberculosis. Data sources include structured and unstructured medical records, and preprocessing enhances model performance.
In the methodology, a dataset with 132 symptom features and training/testing samples was used. Various ML models—KNN, Gaussian Naive Bayes, SVM, Decision Tree, Random Forest, and XGBoost—were trained to predict diseases from symptoms. All models achieved perfect performance metrics (accuracy, precision, recall, F1-score = 1.0) on the available data, demonstrating strong potential for disease prediction.
The study highlights that symptom-based ML models can be integrated into web applications for public use, enabling early self-assessment, awareness, and timely medical intervention. While results are promising, models require testing on more diverse and real-world data, along with updates to incorporate new healthcare information. Overall, ML-based disease prediction offers a transformative approach to healthcare, improving diagnostic efficiency and outcomes globally.
References
[1] Gomathy, C. K., and Mr. A. Rohith Naidu. \"The prediction of disease using machine learning.\" International Journal of Scientific Research in Engineering and Management ( IJSREM) 5.10 (2021).
[2] Singh, Smriti Mukesh, and Dinesh B. Hanchate. \"Improving disease prediction by machine learning.\" Int. J. Res. Eng. Technol 5 (2018): 1542-1548.
[3] Arumugam, K., et al. \"Multiple disease prediction using Machine learning algorithms.\" Materials Today: Proceedings 80 (2023): 3682-3685.
[4] Sujatha, K., et al. \"Diabetes Disease Prediction Based on Symptoms Using Machine Learning Algorithms.\" Annals of the Romanian Society for Cell Biology 25.6 (2021): 3805-3817.
[5] Bhanuteja, Talasila, et al. \"Symptoms Based Multiple Disease Prediction Model using Machine Learning Approach.\" International Journal of Innovative Technology and Exploring Engineering (IJITEE) ISSN: 2278-3075.
[6] Pingale, Kedar, et al. \"Disease prediction using machine learning.\" International Research Journal of Engineering and Technology (IRJET) 6.12 (2019): 831-833.
[7] Sah, Rahul Deo, and Jitendra Sheetalani. \"Review of medical disease symptoms prediction using data mining technique.\" IOSR Journal of Computer Engineering 19.3 (2017): 59-70.
[8] Gupta, Jay Prakash, Ashutosh Singh, and Ravi Kant Kumar. \"A computer-based disease prediction and medicine recommendation system using machine learning approach.\" Int J Adv Res Eng Technol (IJARET) 12.3 (2021): 673-683.
[9] Kanakaraddi, Suvarna G., et al. \"Disease prediction using data mining and machine learning techniques.\" Advanced Prognostic Predictive Modelling in Healthcare Data Analytics (2021): 71-92.
[10] Jadhav, Saiesh, et al. \"Disease prediction by machine learning from healthcare communities.\" International Journal of Scientific Research in Science and Technology 5 (2019): 8869-8869.
[11] M. Sirigineedi, T. Kumaravel, P. Natesan, V. K. Shruthi, M. Kowsalya, and M. S. Malarkodi: “Deep Learning Approaches for Autonomous Driving to Detect Traffic Signs”, 2023 International Conference on Sustainable Communication Networks and Application (ICSCNA), Theni, India, 2023.
[12] M. Sirigineedi, R. N. V. J. Mohan and B. Sahu: “Improving Fisheries Management through Deep learning based Automated fish counting”, 2023 14th International Conference on Computing Communication and Networking Technologies (ICCCNT), Delhi, India, 2023.
[13] M. Srikanth, Bhanurangarao M, Manikanta Sirigineedi, Padma Bellapukonda: “Integrated Technologies for Proactive Bridge-Related Suicide Prevention”, Journal of Namibian Studies, Volume 1, Issue 33, Pages 2117-2136, Sep 2023.
[14] Srikanth Mandela, Padma Bellapukonda, Manikanta Sirigineedi: “Using Machine Learning and Neural Networks Technologies, a Bottom-Up Water Process Is Being Used To Reduce All Water Pollution Diseases”, Journal of Artificial Intelligence, Machine Learning and Neural Network (JAIMLNN), vol. 2, Oct. 2022.
[15] M. Srikanth, Padma Bellapukonda, Manikanta Sirigineedi: Protecting tribal peoples nearby patient care centers use hybrid techniques based on a distribution network, International Journal of Health Sciences, 2022.
[16] John, A., McGregor, J., Fone, D., Dunstan, F., Cornish, R., Lyons, R. A., & Lloyd, K. R. (2016). Case-finding for common mental disorders of anxiety and depression in primary care: externalvalidation of routinely collected data. BMC Medical Informatics and Decision Making, 16(1), 35
[17] Essau, C. A., Lewinsohn, P. M., Lim, J. X., Ho, M.-H. R., & Rohde, P. (2018). Incidence,recurrence, and comorbidity of anxiety disorders in four major developmental stages. Journal of Affective Disorders, 228, 248–253
[18] Vos, T., Abajobir, A. A., Abate, K. H., Abbafati, C., Abbas, K. M., Abd-Allah, F., Abdulkader, R. S., Abdulle, A. M., Abebo, T. A., Abera, S. F., Aboyans, V., Abu-Raddad, L. J., Ackerman, I. N., Adamu, A. A., Adetokunboh, O., Afarideh, M., Afshin, A., Agarwal, S. K., Aggarwal, R., … Murray,C. J. L. (2017). Global, regional, and national incidence, prevalence, and years lived withdisability for 328 diseases and injuries for 195 countries, 1990–2016: a systematic analysis forthe Global Burden of Disease Study 2016. Lancet, 390(10100), 1211–120
[19] Psychiatric Association - Apa, A. (Ed.). (2014). Manual diagnóstico y estadístico de los trastornosmentales DSM-5 (5a). En Madrid: Editorial Médica Panamericana.
[20] Somers, J. M., Goldner, E. M., & Waraich, P. (2006). Prevalence and incidence studies of anxiety disorders. A systematic review of the literature. Can J Psychiatry, 51, 100–113.
[21] Costa e Silva, J. A. (1998). The public health impact of anxiety disorders: a WHO perspective.Acta Psychiatrica Scandinavica. Supplementum, 393, 2–5.
[22] National Collaborating Centre for Mental Health (UK). (2013). Social anxiety disorder. British Psychological Society