Content Extraction Studies using Neural Network and Attribute Generation

Kolla Bhanu Prakash; and M  A  Dorai Rangaswamy

doi:10.17485/ijst/2016/v9i22/95165

Article

Content Extraction Studies using Neural Network and Attribute Generation

VIEWS 965
PDF 209

Abstract
Full-Text HTML
Full-Text PDF
How to Cite

Indian Journal of Science and Technology

DOI: 10.17485/ijst/2016/v9i22/95165

Year: 2016, Volume: 9, Issue: 22, Pages: 1-10

Original Article

Content Extraction Studies using Neural Network and Attribute Generation

Kolla Bhanu Prakash^1,2 and M. A. Dorai Rangaswamy¹

¹Faculty of Computer Science Engineering, [email protected]
[email protected]
²Faculty of Computing, Chirala Engineering College, [email protected]

This work is licensed under a Creative Commons Attribution 4.0 International License.

Abstract

Objectives: The amount of information available on web today is more than at any point in history, and greater challenges arouse due to this huge wealth of information available. Also to deal with this information overload, challenging tools are required. Method of Analysis: Internet in the present day especially in India is spreading both in rural and urban areas. Bilingual and Multilingual websites are increasing to a larger extent. Even websites are becoming multitasking. Our main problem is to deal with multilingual web documents and ancient documents. Because, content extraction becomes difficult when such documents are considered. The present paper proposes a neural network approach and attribute generation to justify the content extraction studies for multilingual web documents. Findings: Results obtained are well defined and a thorough analysis is done. Novelty/Improvement: The method is versatile in using pixel-maps, analytically stable in that the matrix input is used and is demonstrated for adoption to different models.
Keywords: Attribute, Content Extraction, Mining, Multi-Lingual, Neural Network, Pattern, Pixel