• P-ISSN 0974-6846 E-ISSN 0974-5645

Indian Journal of Science and Technology


Indian Journal of Science and Technology

Year: 2024, Volume: 17, Issue: 11, Pages: 1043-1050

Original Article

Classification of Parkinson’s Disease Data Using Traditional and Advanced Data Mining Techniques

Received Date:02 December 2023, Accepted Date:17 January 2024, Published Date:05 March 2024


Objectives: (1) To apply various traditional classification tools, (2) To check effectiveness of the classifiers to the Parkinson Dataset (3) To use boosting classification tools and (4) Compare performance of all used classification tools and find the best accuracy classifier algorithm. Thus, the main aim of the study is to discriminate healthy people from those with PD. Methods: The methodology of this study is categorised into three stages:(1) Preprocessing and feature selection; (2) Application of classifiers; (3) Comparative study. We have used secondary dataset of voice recordings originally collected by University of Oxford by Max Little. In first step, the voice data of PD patients is collected for analysis. Then the collected data is normalized using min-max normalization followed by feature extraction. Thus, uses classification Data Mining Techniques viz., KNN, Logistic Regression, Decision Tree, SVM, Random Forest and boosting algorithm etc. to predict whether the person is healthy or has Parkinson’s disease. Finally, comparative analysis is made based on the accuracy provided by different data mining models. Findings: Results of our study reveals that GB algorithm is more accurate as compared with other models. It gives the highest accuracy, so that we recommend this algorithm to deal similar kind of studies in the future. These models are very useful in better and exact medical diagnosis and decision making. It is also found that, proposed methods are fully computerized and produce enhanced performance hence can be recommended for similar studies. Here, it is observed that Gradient Boost algorithm provide the best accuracy (100% for training and 92.02% for testing, 98.46% overall). Novelty: We have used boosting classification model for the classification of Parkinson’s disease. Our proposed method is one such good example giving faster and more accurate results for the classification of Parkinson’s disease patients with excellent accuracy. We have also compared the results with other existing approaches like linear discriminant analysis, support vector machine, K-nearest neighbour, decision tree, classification and regression trees, random forest, linear regression, logistic regression and Naive Bayes, but our proposed techniques were superior to existing studies in which gradient boost algorithm yielded an accuracy of 98.46%, so our method can be used as an effective means of computer-aided diagnosis of PD, and has important practical value.

Keywords: Data Mining, Parkinson's Disease, Classification, Boosting Algorithms, Feature Selection


  1. Raza C, Anjum R, Shakeel NUA. Parkinson's disease: Mechanisms, translational models and management strategies. Life Sciences. 2019;226:77–90. Available from: https://doi.org/10.1016/j.lfs.2019.03.057
  2. Bloem BR, Okun MS, Klein C. Parkinson's disease. The Lancet. 2021;397(10291):2284–2303. Available from: https://doi.org/10.1016/S0140-6736(21)00218-X
  3. Ghorbani R, Ghousi R. Predictive data mining approaches in medical diagnosis: A review of some diseases prediction. International Journal of Data and Network Science. 2019;3(2):47–70. Available from: http://dx.doi.org/10.5267/j.ijdns.2019.1.003
  4. Baez S, Herrera E, Trujillo C, Cardona JF, Diazgranados JA, Pino M, et al. Classifying Parkinson’s Disease Patients With Syntactic and Socio-emotional Verbal Measures. Frontiers in Aging Neuroscience. 2020;12:1–11. Available from: https://doi.org/10.3389/fnagi.2020.586233
  5. Ricciardi C, Amboni M, Santis CD, Improta G, Volpe G, Iuppariello L, et al. Using gait analysis’ parameters to classify Parkinsonism: A data mining approach. Computer Methods and Programs in Biomedicine. 2019;180:105033. Available from: https://doi.org/10.1016/j.cmpb.2019.105033
  6. Wingate J, Kollia I, Bidaut L, Kollias S. Unified deep learning approach for prediction of Parkinson's disease. IET Image Processing. 2020;14(10):1980–1989. Available from: https://doi.org/10.1049/iet-ipr.2019.1526
  7. Salmanpour MR, Shamsaei M, Saberi A, Setayeshi S, Klyuzhin IS, Sossi V, et al. Optimized machine learning methods for prediction of cognitive outcome in Parkinson's disease. Computers in Biology and Medicine. 2019;111:103347. Available from: https://doi.org/10.1016/j.compbiomed.2019.103347
  8. Haq AU, Li JP, Agbley BLY, Mawuli CB, Ali Z, Nazir S, et al. A survey of deep learning techniques based Parkinson’s disease recognition methods employing clinical data. Expert Systems with Applications. 2022;208:118045. Available from: https://doi.org/10.1016/j.eswa.2022.118045
  9. Templeton JM, Poellabauer C, Schneider S. Classification of Parkinson’s disease and its stages using machine learning. Scientific Reports. 2022;12(1):1–11. Available from: https://doi.org/10.1038/s41598-022-18015-z
  10. Lee S, Hussein R, Ward R, Wang ZJ, Mckeown MJ. A convolutional-recurrent neural network approach to resting-state EEG classification in Parkinson’s disease. Journal of Neuroscience Methods. 2021;361:109282. Available from: https://doi.org/10.1016/j.jneumeth.2021.109282
  11. Mittal V, Sharma RK. Machine learning approach for classification of Parkinson disease using acoustic features. Journal of Reliable Intelligent Environments. 2021;7(3):233–239. Available from: https://doi.org/10.1007/s40860-021-00141-6
  12. Ouhmida A, Raihani A, Cherradi B, Terrada O. A Novel Approach for Parkinson’s Disease Detection Based on Voice Classification and Features Selection Techniques. International Journal of Online and Biomedical Engineering (iJOE). 2021;17(10):111–130. Available from: https://doi.org/10.3991/ijoe.v17i10.24499
  13. Priya S, Priyatharshini R, Shruthi R, Pooja V, Swarna RS. Early detection of Parkinson's disease using data mining techniques from multimodal clinical data. In: Advanced Machine Vision Paradigms for Medical Image Analysis. (pp. 213-228) Academic Press. 2021.
  14. Ahmed I, Aljahdali S, Khan MS, Kaddoura S. Classification of Parkinson Disease Based on Patient’s Voice Signal Using Machine Learning. Intelligent Automation & Soft Computing. 2022;32(2):705–722. Available from: https://doi.org/10.32604/iasc.2022.022037
  15. Jyotiyana M, Kesswani N, Kumar M. A deep learning approach for classification and diagnosis of Parkinson’s disease. Soft Computing. 2022;26(18):9155–9165. Available from: https://doi.org/10.1007/s00500-022-07275-6


© 2024 Tangawade & Muley. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Published By Indian Society for Education and Environment (iSee)


Subscribe now for latest articles and news.