Multi-Disease Prediction Using Machine Learning Algorithm
Multi-Disease Prediction Using Machine Learning Algorithm
https://doi.org/10.22214/ijraset.2023.50128
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 11 Issue IV Apr 2023- Available at www.ijraset.com
Abstract: In the medical sector, disease diagnosis is an essential duty, and prompt and accurate diagnosis is crucial to effective
management and therapy. Machine learning techniques, including Naive Bayesian networks, have shown promise in disease
prediction and diagnosis. In this study, we present a machine learning-based multi-disease prediction system that uses Naive
Bayesian networks.
The proposed methodology seeks to deliver precise illness prediction for several diseases instantaneously. In addition to
describing the methods adopted, which included dataset selection, preprocessing, feature selection, and the Naive Bayesian
network algorithm, we also discuss the social relevance of this work, emphasizing the potential impact of accurate disease
prediction in improving patient outcomes and bringing down healthcare costs.
To evaluate the performance of the proposed model, we conducted experiments using a publicly available disease dataset. The
results demonstrated that the proposed model achieved high accuracy of 91.2% and outperformed other state-of-the-art models
for multi-disease prediction some of them are, Random Forest obtained 85.7% and Decision Tree obtained 81.3% respectively.
In summary, the proposed system demonstrates the effectiveness of Naive Bayesian networks for multi-disease prediction and
has the potential to improve disease diagnosis and management in the medical domain.
Keywords: Disease Prediction, Naïve BayesianNetworks, Random Forest, Decision Tree,Feature Selection.
I. INTRODUCTION
The adoption of modern computational techniques, such as machine learning, has drawn a lot of fascination in recent years since
illness detection is a challenging process with many facets in the healthcare industry. Healthcare practitioners may now make
educated decisions and deliver improved patient care courtesy of machine learning algorithms that have demonstrated enormous
opportunities in illness diagnosis and prediction. In this respect, we suggest a novel method for multi-disease prediction based on
Naive Bayesian networks, a probabilistic modeling approach that is extensively employed in many industries, including the
healthcare industry.
Using a sizable dataset of patient data, our suggested technique intends to deliver an extensive and precise illness prediction model
for several diseases at once. To improve patient outcomes, lower healthcare expenditures, and raise the general standard of
healthcare services, such a system must be developed. Accurate illness prediction is made possible by the system's use of cutting-
edge algorithms for data preprocessing, feature selection, and model training.
The approach utilized to create the suggested system is described in this study, along with the necessary dataset selection
procedures, preprocessing techniques, feature selection methods, and application of the Naive Bayesian network algorithm. We also
provide the findings from our research, emphasizing how precisely and successfully the suggested approach can diagnose numerous
illnesses at once. The suggested approach, which makes use of machine learning algorithms to enable precise and prompt illness
detection, also represents an important leap in disease prediction and diagnosis overall. The system has the potential to dramatically
enhance patient care and treatment results, hence enhancing the healthcare sector as a whole.
A. Objectives
Using Naive Bayesian networks, a well-liked probabilistic modeling approach in machine learning, the objective of this work is to
build and create a multi-disease prediction system. The goal is to give medical practitioners an effective tool that enables them to
correctly forecast the onset of several illnesses in patients based on their symptoms and medical background. The intention of this
research is to train the Naive Bayesian network algorithm to concurrently forecast various illnesses using a big dataset of patient
information and sophisticated data preparation, data cleaning, normalization, and feature selection techniques, after all the
processing the results are produced.
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 447
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 11 Issue IV Apr 2023- Available at www.ijraset.com
2) Scope
The importance of this research rests in its potential to considerably increase the precision and effectiveness of medical diagnosis in
contemporary culture. Healthcare workers may choose better patient care and treatment choices by employing machine learning
algorithms like Naive Bayes networks to forecast different illnesses based on patient symptoms. In the end, this may result in
improved patient outcomes and lower medical expenses. The method might also be used to deliver a prompt and precise diagnosis
in disadvantaged regions with limited access to healthcare. The prospective outcomes of this effort are broad-ranging and might
have a favorable effect on the healthcare system as well as society at large.
3) Limitations
The Smart Healthcare System has the following limitations:
a) The accuracy of the system is dependent on the quality and completeness of the dataset used for training. Incomplete or
inconsistent data may lead to inaccurate predictions.
b) The system may not be effective in predicting rare or emerging diseases that have not been included in the training dataset.
c) The system may not be able to handle complex cases, where patients have multiple diseases or conditions.
d) The system may not be able to replace the expertise and judgment of healthcare professionals entirely, as the interpretation of
the results requires clinical knowledge and experience.
e) The system may not take into account external factors, such as environmental or social determinants of health, that can affect
disease occurrence and progression.
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 448
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 11 Issue IV Apr 2023- Available at www.ijraset.com
Selvaraj, et al. 2021 [5], In this paper, the author extracted personal data such as user health conditions from day-to-day life. The
lifestyle data are gathered and stored at the data repository by using web technology and mobile applications. The user enters their
daily health conditions in textual format.
Natural Language Processing is used to understand the given input and further forecast the user’s illness.
The architecture diagram for a multi-disease prediction using Naive Bayes networks typically consists of several components.
The first component is the data source, which can be electronic health records, medical imaging, or other medical data. This data is
used to train the Naive Bayes networks for each disease.
The second component is the Naive Bayes network itself, which is composed of nodes representing various risk factors or
symptoms for each disease. The model calculates the probability of each disease based on the presence or absence of these risk
factors or symptoms.
The third component is the prediction engine, which uses the trained Naive Bayes networks to predict the probability of each
disease for a given patient. This prediction engine can be integrated into a larger clinical decision support system to assist physicians
in making accurate diagnoses and treatment plans. Finally, there is a feedback loop that continuously updates the Naive Bayes
networks based on new data from patient outcomes, ensuring that the model remains accurate over time.
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 449
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 11 Issue IV Apr 2023- Available at www.ijraset.com
To adopt the suggested system, a few adjustments must be made to the current one. To prepare a collection of patient data that can
be used to train the Naive Bayesian network technique, a sizable and high-quality dataset must first be gathered.
The system's correctness and dependability must also be guaranteed by the application of sophisticated feature selection and data
preparation procedures.
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 450
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 11 Issue IV Apr 2023- Available at www.ijraset.com
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 451
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 11 Issue IV Apr 2023- Available at www.ijraset.com
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 452
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.538
Volume 11 Issue IV Apr 2023- Available at www.ijraset.com
VI. CONCLUSION
In conclusion, the suggested method for multi-disease prediction utilizing Naive Bayes networks has demonstrated promising results
in effectively predicting several illnesses using patient symptoms as input data. To determine the likelihood of a disease given a
collection of symptoms, the system uses the Naive Bayes method, a probabilistic model that assumes the independence of features.
The system demonstrated a high degree of accuracy in predicting different diseases when it was tested using a dataset of patient
symptoms and associated diagnoses.
A. Future Enhancement
In the future, this work might be improved by using more modern machine-learning algorithms and techniques, such as deep
learning, to increase the prediction models' accuracy and efficiency. Also, expanding the dataset to incorporate more varied patient
groups and a larger spectrum of illnesses and symptoms may help shed additional light on the models' efficiency. The technology
might be made easier to use in clinical settings by being integrated into an intuitive interface that both patients and healthcare
professionals can access. This would eventually improve patient outcomes and increase the effectiveness of healthcare delivery.
REFERENCES
[1] Sameer Meshram1, Shital Dongre, Triveni Fole. “Disease Prediction System using naïve bayes”. International Journal for Research in Applied Science &
Engineering Technology Volume 10 Issue XII Dec 2022.
[2] Abid ishaq, saima sadiq, muhammad umer, saleem ullah, seyedali mirjalili, vaibhav rupapara , and michele nappi. “Improving the Prediction of Heart Failure
Patients’ Survival Using SMOTE and Effective Data Mining Techniques”. March 16, 2021.
[3] Bilal khan, Rashid naseem, Fazal muhammad, ghulam abbas, and sung hwan kim. “An Empirical Evaluation of Machine Learning Techniques for Chronic
Kidney Disease Prophecy”. March 30, 2020.
[4] Chandrasekhar Rao Jetti, Rehamatulla Shaik, Sadhik Shaik, Sowmya Sanagapalli “Disease Prediction using Naïve Bayes - Machine Learning Algorithm”,
December 2021.
[5] Prediction Support System for Multiple Disease Prediction Using Naive Bayes Classifier”. Selvaraj A, Mithra MK, Keerthana S, Deepika M. International
Journal of Engineering and Techniques - Volume 4 Issue 2, Mar-Apr 2021.
[6] Akkem Yaganteeswarudu,” Multi Disease Prediction Model by using Machine Learning and Flask API”, IEEE, July 2022.
[7] Yashaswi G Sagar, Sahana Gajanana, Riyal Vivek, Swetha P,” MediInsight: A Smart Health Prediction System”, (IRJET), June 2021.
[8] Hsiu-Sen Chiang, Mu-Yen Chen, “Cognitive Depression Detection Cyber-Medical System Based on EEG Analysis and Deep Learning Approaches” IEEE,
February 2023.
[9] Wei Shao, Zhiyang You, Lesheng Liang, Xiping Hu, “A Multi-Modal Gait Analysis-Based Detection System of the Risk of Depression” IEEE, October 2022.
[10] Akash C. Jamgade, Prof. S. D. Zade, International Research Journal of Engineering and Technology, May 2019.
[11] Anjan Nikhil Repaka, Sai Deepak Ravikanti, Ramya G Franklin,” Design and Implementing Heart Disease Prediction Using Naïve Bayesian”, IEEE, June
2019.
[12] N. P. Tigga and S. Garg, ‘‘Prediction of type 2 diabetes using machine learning classification methods,’’ Proc. Computer Sci., Jan. 2020.
©IJRASET: All Rights are Reserved | SJ Impact Factor 7.538 | ISRA Journal Impact Factor 7.894 | 453