Hadoop based Feature Selection and Decision Making Models on Big Data

Thulasi Bikku; N  Sambasiva Rao  and Ananda Rao Akepogu

doi:10.17485/ijst/2016/v9i10/88905

Article

Hadoop based Feature Selection and Decision Making Models on Big Data

VIEWS 2449
PDF 248

Abstract
Full-Text HTML
Full-Text PDF
How to Cite

Indian Journal of Science and Technology

DOI: 10.17485/ijst/2016/v9i10/88905

Year: 2016, Volume: 9, Issue: 10, Pages: 1-6

Original Article

Hadoop based Feature Selection and Decision Making Models on Big Data

Thulasi Bikku¹ , N. Sambasiva Rao² and Ananda Rao Akepogu³

¹Department of CSE, JNTUA, Anantapur – 515002, Andhra Pradesh, India;[email protected] ²SRITW, Warangal – 506371, Telangana, India; [email protected] ³Director of Academics and Planning, JNTUCEA, Anantapur – 515002, Andhra Pradesh, India; [email protected]

This work is licensed under a Creative Commons Attribution 4.0 International License.

Abstract

Objectives: A large amount of informative data is being captured and processed by today’s organizations and is continuing to increase exponentially. It becomes computationally inaccurate to analyze such big data for decision making systems. Methods/Analysis: Hadoop, which is a working model based on the Map-Reduce framework with efficient computation and processing of Big Data. Findings: Most of the traditional classification algorithms have issues such as class imbalance and dimension reduction on Big Data. However, a large part of the data produced today are incomplete and inaccurate, so large organizations prefer relational databases to store their information, but the user query processing speed is very low. Unlike existing solutions that require a prior knowledge of classification accuracy for various types of data characteristics, which is impossible to obtain in practice. Applications/Improvement: In this paper, we have given a compared proposed model to different big data feature selection and classification models along with advantages and limitations.

Keywords: Big Data, Decision Tree, Feature Selection, Hadoop, MapReduce