• P-ISSN 0974-6846 E-ISSN 0974-5645

Indian Journal of Science and Technology

Article

Indian Journal of Science and Technology

Year: 2023, Volume: 16, Issue: 45, Pages: 4156-4163

Original Article

Enco – Standardization Data Pre-Processing Technique in Autism Spectrum Disorder Detection

Received Date:15 July 2023, Accepted Date:08 October 2023, Published Date:05 December 2023

Abstract

Background/Objectives: The goal of this study was to create an Enco-Standardization technique that would produce accurate data and improve the diagnosis of Autism Spectrum Disorder (ASD).This method uses mean values to replace missing values in a dataset and improves them by combining label encoding and conventional scaling techniques. Methods: The ASD dataset, which has 704 instances and 21 attributes, is used in this study. Training and testing are divided by the dataset (80%-20%). As an imputation strategy in this dataset, missing values are located and replaced with the mean value. Attributes are encoded using the Enco-Standardization methodology using a label encoding technique that changes non-numeric variables into numeric ones. After that, the data were scaled into a machine-readable format to standardise it. Different machine learning classifier models are compared to the hybrid strategy of encoding and scaling techniques. Based on the accuracy found using machine learning classifier models, the dataset acquired using the Enco-Standardization technique is assessed. Findings: The dataset needs to be accurate and relevant in order to increase accuracy and decrease computing time. The findings of the Enco-Standardization methodology showed a good pre-processing method with accuracy values of 98% for Naive Bayes (NB), 71% for K Nearest Neighbour (KNN), 74% for Support Vector Machine (SVM), 97% for Linear Regression (LR), 100% for Decision Tree (DT), and 100% for Random Forest (RF). The deletion of missing values improves performance in KNN (94%), SVM (95.9%), LR, DT, and RF (100%) but decreases the number of instances in the dataset, rendering the model ineffective. Novelty: The data in a dataset are transformed and encoded using the proposed Enco-Standardization pre-processing technique, which increases the precision of the data analysis process in ASD prediction. Data discrepancies are avoided by using this eco-standardization technique.

Keywords: Autism Spectrum Disorder, Pre­processing, Scaling, Enco­Standardization, Machine Learning

References

  1. Tiwari R, Purkayastha K, Gulati S. Public Health Dimensions of Autism Spectrum Disorder in India: An Overview. Journal of Comprehensive Health. 2021;9(2):57–62. Available from: http://dx.doi.org/10.53553/jch.v09i02.002
  2. Farooq MS, Tehseen R, Sabir M, Atal Z. Detection of autism spectrum disorder (ASD) in children and adults using machine learning. Scientific Reports. 13(1). Available from: http://dx.doi.org/10.1038/s41598-023-35910-1
  3. Qureshi MS, Qureshi MB, Asghar J, Alam F, Aljarbouh A. Prediction and Analysis of Autism Spectrum Disorder Using Machine Learning Techniques. Journal of Healthcare Engineering. 2023;2023:1–10. Available from: http://dx.doi.org/10.1155/2023/4853800
  4. Chen J, Engelhard M, Henao R, Berchuck S, Eichner B, Perrin EM, et al. Enhancing early autism prediction based on electronic records using clinical narratives. Journal of Biomedical Informatics. 2023;144:104390. Available from: http://dx.doi.org/10.1016/j.jbi.2023.104390
  5. Olisah CC, Smith L, Smith M. Diabetes mellitus prediction and diagnosis from a data preprocessing and machine learning perspective. Computer Methods and Programs in Biomedicine. 2022;220:106773. Available from: http://dx.doi.org/10.1016/j.cmpb.2022.106773
  6. Vakadkar K, Purkayastha D, Krishnan D. Detection of Autism Spectrum Disorder in Children Using Machine Learning Techniques. SN Computer Science. 2021;2(5). Available from: http://dx.doi.org/10.1007/s42979-021-00776-5
  7. Shihab AI, Dawood FA, Kashmar AH. Data Analysis and Classification of Autism Spectrum Disorder Using Principal Component Analysis. Advances in Bioinformatics. 2020;2020:1–8. Available from: http://dx.doi.org/10.1155/2020/3407907
  8. Peral J, Gil D, Rotbei S, Amador S, Guerrero M, Moradi H. A Machine Learning and Integration Based Architecture for Cognitive Disorder Detection Used for Early Autism Screening. Electronics. 2020;9(3):516. Available from: http://dx.doi.org/10.3390/electronics9030516
  9. Oh SL, Jahmunah V, Arunkumar N, Abdulhay EW, Gururajan R, Adib N. A novel automated autism spectrum disorder detection system. Complex Intell Syst. 2021;7:2399–413. Available from: https://doi.org/10.1007/s40747-021-00408-8
  10. Sherkatghanad Z, Akhondzadeh M, Salari S, Zomorodi-Moghadam M, Abdar M, Acharya UR. Automated detection of autism spectrum disorder using a convolutional neural network. 2020. Available from: https://doi.org/10.3389/fnins.2019.01325
  11. Din QMU, Jayanthy AK. Automated classification of Autism Spectrum Disorder using EEG signals and Convolutional Neural Networks. Biomedical Engineering: Applications, Basis and Communications. 2022;34(02). Available from: https://doi.org/10.4015/S101623722250020X
  12. Sivaranjani S, Ananya S, Aravinth J, Karthika R. Diabetes Prediction using Machine Learning Algorithms with Feature Selection and Dimensionality Reduction. 7th International Conference on Advanced Computing and Communication Systems (ICACCS). 2021. Available from: https://doi.org/10.1109/ICACCS51430.2021.9441935
  13. Raj S, Masood S. Analysis and Detection of Autism Spectrum Disorder Using Machine Learning Techniques. Procedia Computer Science. 2020;167:994–1004. Available from: https://doi.org/10.1016/j.procs.2020.03.399
  14. Priya N, Radhika C. Effective Implementation of Pre-Processing Techniques in Machine Learning for Autism Spectrum Disorder. International Journal of Innovative Technology and Exploring Engineering. 2020;9(5):2253–2257. Available from: https://doi.org/10.35940/ijitee.E2676.039520
  15. Erkan U, Thanh DNH. Autism Spectrum Disorder Detection with Machine Learning Methods. Current Psychiatry Research and Reviews. 2020;15(4):297–308. Available from: http://dx.doi.org/10.2174/2666082215666191111121115
  16. Kaur H, Kumari V. Predictive modelling and analytics for diabetes using a machine learning approach. Applied Computing and Informatics. 2022;18(1/2):90–100. Available from: https://doi.org/10.1016/j.aci.2018.12.004
  17. Jacob SG, Sulaiman MMBA, Bennet B. Feature Signature Discovery for Autism Detection: An Automated Machine Learning Based Feature Ranking Framework. Computational Intelligence and Neuroscience. 2023;2023:1–14. Available from: https://doi.org/10.1155/2023/6330002
  18. Alkahtani H, Aldhyani THH, Alzahrani MY. Early Screening of Autism Spectrum Disorder Diagnoses of Children Using Artificial Intelligence. Journal of Disability Research. 2023;2(1). Available from: https://doi.org/10.57197/JDR-2023-0004
  19. Chen YH, Chen Q, Kong L, Liu G. Early detection of autism spectrum disorder in young children with machine learning using medical claims data. BMJ Health Care Inform. 2022;29(1):e100544. Available from: https://doi.org/10.1136/bmjhci-2022-100544

Copyright

© 2023 Kavitha & Kasthuri.  This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Published By Indian Society for Education and Environment (iSee)

DON'T MISS OUT!

Subscribe now for latest articles and news.