Classification of Parkinson’s Disease Data Using Traditional and Advanced Data Mining Techniques

Atish S Tangawade; Aniket A Muley

doi:10.17485/IJST/v17i11.3059

Indian Journal of Science and Technology

Year: 2024, Volume: 17, Issue: 11, Pages: 1043-1050

Original Article

Atish S Tangawade¹*, Aniket A Muley²

¹Research Fellow, School of Mathematical Sciences, SRTM University, Nanded, Maharashtra, India
²Associate Professor, School of Mathematical Sciences, SRTM University, Nanded, Maharashtra, India

*Corresponding Author
Email: [email protected]

Received Date: 02 December 2023, Accepted Date: 17 January 2024, Published Date: 05 March 2024

This work is licensed under a Creative Commons Attribution 4.0 International License.

Abstract

Objectives: (1) To apply several traditional classification tools; (2) to check the effectiveness of these classifiers on the Parkinson’s dataset; (3) to apply boosting classification tools; and (4) to compare the performance of all the classification tools used and identify the algorithm with the best accuracy. The main aim of the study is thus to discriminate healthy people from those with Parkinson’s disease (PD).

Methods: The methodology of this study is organised into three stages: (1) preprocessing and feature selection; (2) application of classifiers; and (3) comparative study. We used a secondary dataset of voice recordings originally collected by Max Little at the University of Oxford. In the first step, the voice data of PD patients is collected for analysis. The collected data is then normalized using min-max normalization, followed by feature extraction. Classification data mining techniques, viz. KNN, Logistic Regression, Decision Tree, SVM, Random Forest and boosting algorithms, are then used to predict whether a person is healthy or has Parkinson’s disease. Finally, a comparative analysis is made based on the accuracy of the different data mining models.

Findings: The results of our study reveal that the Gradient Boost (GB) algorithm is more accurate than the other models considered. It provides the best accuracy (100% for training and 92.02% for testing, 98.46% overall), so we recommend this algorithm for similar studies in the future. Such models are useful for better and more exact medical diagnosis and decision making. The proposed methods are also fully computerized and produce enhanced performance.

Novelty: We used boosting classification models for the classification of Parkinson’s disease. The proposed method gives faster and more accurate results for the classification of Parkinson’s disease patients. We also compared the results with other existing approaches, namely linear discriminant analysis, support vector machine, K-nearest neighbour, decision tree, classification and regression trees, random forest, linear regression, logistic regression and Naive Bayes. The proposed techniques were superior to those of existing studies, with the gradient boost algorithm yielding an accuracy of 98.46%, so the method can be used as an effective means of computer-aided diagnosis of PD and has important practical value.
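
The preprocessing and evaluation steps described in the Methods can be illustrated with a minimal sketch. It is not the authors' implementation: it uses only the Python standard library, shows a single one of the listed classifiers (KNN) rather than the Gradient Boost model, and runs on a handful of hypothetical two-feature rows instead of the Oxford voice dataset. All function names, the feature values and the choice k = 3 are assumptions made for illustration. Min-max normalization rescales each feature as x' = (x - min) / (max - min), with min and max taken from the training rows only.

```python
import math
from collections import Counter


def fit_min_max(rows):
    """Return per-feature (min, max) pairs learned from the training rows."""
    columns = list(zip(*rows))
    return [(min(col), max(col)) for col in columns]


def apply_min_max(rows, bounds):
    """Scale each feature via (x - min) / (max - min) using the given bounds."""
    scaled = []
    for row in rows:
        scaled.append([
            (x - lo) / (hi - lo) if hi > lo else 0.0  # constant feature maps to 0
            for x, (lo, hi) in zip(row, bounds)
        ])
    return scaled


def knn_predict(train_x, train_y, query, k=3):
    """Majority vote among the k nearest training rows (Euclidean distance)."""
    nearest = sorted(
        zip(train_x, train_y),
        key=lambda pair: math.dist(pair[0], query),
    )[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


def accuracy(true_labels, predicted_labels):
    """Fraction of predictions that match the true labels."""
    hits = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return hits / len(true_labels)


# Hypothetical rows with two made-up voice features (illustrative values only).
# Label 1 = PD, label 0 = healthy.
train_x = [
    [120.0, 0.002], [115.0, 0.003], [118.0, 0.0025],
    [200.0, 0.010], [210.0, 0.012], [205.0, 0.011],
]
train_y = [0, 0, 0, 1, 1, 1]
test_x = [[119.0, 0.0022], [208.0, 0.0115]]
test_y = [0, 1]

# Learn the scaling on the training rows, then reuse it for the test rows.
bounds = fit_min_max(train_x)
train_scaled = apply_min_max(train_x, bounds)
test_scaled = apply_min_max(test_x, bounds)

test_pred = [knn_predict(train_scaled, train_y, q, k=3) for q in test_scaled]
test_accuracy = accuracy(test_y, test_pred)
print(f"KNN test accuracy: {test_accuracy:.2%}")
```

Fitting the bounds on the training rows alone keeps information from the test rows out of the preprocessing step. A comparative study of the kind described would repeat the final two steps for each classifier and tabulate the training and testing accuracies side by side.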

Keywords: Data Mining, Parkinson's Disease, Classification, Boosting Algorithms, Feature Selection

Copyright

© 2024 Tangawade & Muley. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Published By Indian Society for Education and Environment (iSee)