Enco – Standardization Data Pre-Processing Technique in Autism Spectrum Disorder Detection

M Kavitha; M Kasthuri

doi:10.17485/IJST/v16i45.1781

Indian Journal of Science and Technology

Year: 2023, Volume: 16, Issue: 45, Pages: 4156-4163

Original Article

M Kavitha¹*, M Kasthuri²

¹Research Scholar/Assistant Professor, Department of Computer Applications, Bishop Heber College, Affiliated to Bharathidasan University, Trichirappalli, 620024, Tamil Nadu, India
²Associate Professor, Department of Computer Applications, Bishop Heber College, Affiliated to Bharathidasan University, Trichirappalli, 620024, Tamil Nadu, India

*Corresponding Author
Email: [email protected]

Received Date: 15 July 2023, Accepted Date: 08 October 2023, Published Date: 05 December 2023

This work is licensed under a Creative Commons Attribution 4.0 International License.

Abstract

Background/Objectives: The goal of this study was to create an Enco-Standardization technique that produces accurate data and improves the diagnosis of Autism Spectrum Disorder (ASD). The method replaces missing values in a dataset with mean values and then refines the data by combining label encoding with conventional scaling techniques.

Methods: This study uses the ASD dataset, which has 704 instances and 21 attributes. The dataset is divided into training and testing sets (80%-20%). As the imputation strategy, missing values are located and replaced with the mean value. The Enco-Standardization methodology then encodes the attributes with a label encoding technique that converts non-numeric variables into numeric ones, and scales the encoded data to standardise it. The hybrid strategy of encoding and scaling is compared across different machine learning classifier models, and the dataset produced by the Enco-Standardization technique is assessed by the accuracy those models achieve.

Findings: The dataset needs to be accurate and relevant in order to increase accuracy and decrease computing time. With Enco-Standardization pre-processing, the classifiers reached accuracy values of 98% for Naive Bayes (NB), 71% for K Nearest Neighbour (KNN), 74% for Support Vector Machine (SVM), 97% for Linear Regression (LR), 100% for Decision Tree (DT), and 100% for Random Forest (RF). Deleting missing values instead improves performance for KNN (94%), SVM (95.9%), and LR, DT, and RF (100%), but it decreases the number of instances in the dataset, rendering the model ineffective.

Novelty: The proposed Enco-Standardization pre-processing technique transforms and encodes the data in a dataset, which increases the precision of the data analysis process in ASD prediction. Using this Enco-Standardization technique avoids data discrepancies.

Keywords: Autism Spectrum Disorder, Preprocessing, Scaling, Enco-Standardization, Machine Learning
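
The three pre-processing steps named in the abstract (mean imputation, label encoding, scaling) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy two-column table, and the choices of population standard deviation and sorted label order are all assumptions made for illustration, and the sketch assumes missing values occur only in numeric columns.

```python
from statistics import mean, pstdev


def impute_mean(column):
    """Replace None entries in a numeric column with the mean of the observed values."""
    observed = [v for v in column if v is not None]
    fill = mean(observed)
    return [fill if v is None else v for v in column]


def label_encode(column):
    """Map each distinct category to an integer code (codes follow sorted order)."""
    codes = {label: i for i, label in enumerate(sorted(set(column)))}
    return [codes[v] for v in column]


def standardize(column):
    """Scale a numeric column to zero mean and unit variance (population std)."""
    mu = mean(column)
    sigma = pstdev(column)
    if sigma == 0:
        # A constant column carries no spread; map it to zeros.
        return [0.0 for _ in column]
    return [(v - mu) / sigma for v in column]


def enco_standardize(table, categorical):
    """Impute or encode each column, then standardize it (hypothetical helper)."""
    out = {}
    for name, column in table.items():
        if name in categorical:
            numeric = label_encode(column)   # non-numeric -> integer codes
        else:
            numeric = impute_mean(column)    # fill gaps with the column mean
        out[name] = standardize(numeric)
    return out


# Toy data invented for this example; it is not taken from the ASD dataset.
data = {
    "age": [24, None, 30, 36],
    "gender": ["f", "m", "m", "f"],
}

result = enco_standardize(data, categorical={"gender"})
for name, column in result.items():
    print(name, [round(v, 3) for v in column])
```

In a workflow like the one the abstract describes, the scaled columns would then be split 80%-20% into training and testing sets and passed to the classifiers being compared.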

Copyright

© 2023 Kavitha & Kasthuri. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Published by Indian Society for Education and Environment (iSee).