Indian Journal of Science and Technology
Year: 2020, Volume: 13, Issue: 27, Pages: 2711-2719
Mounika Kandukuri1, V V Haragopal2*
1INSPIRE SRF, Department of Statistics, University College of Science, Osmania University, Hyderabad, Telangana, India
2Professor of Statistics, Department of Mathematics, BITS-Pilani, Hyderabad Campus, Hyderabad, Telangana, India
Email: [email protected]
Received Date: 09 June 2020, Accepted Date: 14 July 2020, Published Date: 31 July 2020
Background and Objective: As computers and the Internet are now used in nearly every domain, large volumes of digital text are produced each day, and exploring and searching such massive data effectively has become a fundamental task. The main aim of the present study is to highlight recurring topics and identify the main ideas in Mann Ki Baat, a popular monthly radio address, using topic modelling. Data and Method: The study uses the unstructured text of Mann Ki Baat from January 2020 to March 2020, collected from the PMINDIA website. The program was initiated by the Honorable Prime Minister of India, Mr. Narendra Modi. The analysis applies topic modelling based on Latent Dirichlet Allocation (LDA). Findings: The results show that the method automatically extracts the main ideas and issues discussed. It also identifies the most likely topics and themes discussed in each month that left an impact on people and helped raise awareness. Novelty: This is the first study to apply topic modelling to Mann Ki Baat. Further, it is the first attempt to extract the ideas discussed in a social campaign using a statistical model.
Keywords: Unstructured data; preprocessing; topic modelling; Latent Dirichlet Allocation (LDA); Mann Ki Baat
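To make the workflow named in the abstract concrete (preprocess unstructured text, then fit LDA to extract topics), the following is a minimal sketch in Python. It is not the authors' implementation: the stop-word list, the hyperparameters `alpha` and `beta`, the number of topics, the collapsed Gibbs sampler, and the toy sentences are all assumptions chosen for illustration, and none of the text comes from Mann Ki Baat.

```python
import random
from collections import Counter

# Hypothetical, deliberately tiny stop-word list (an assumption for this sketch).
STOPWORDS = {"the", "a", "of", "and", "to", "in", "is", "for", "on", "we", "our"}

def preprocess(text):
    """Lowercase, keep alphabetic tokens only, and drop stop words."""
    tokens = "".join(c.lower() if c.isalpha() else " " for c in text).split()
    return [t for t in tokens if t not in STOPWORDS]

def lda_gibbs(docs, n_topics, n_iter=200, alpha=0.1, beta=0.01, seed=0):
    """Fit LDA by collapsed Gibbs sampling on tokenized documents.

    Returns (doc_topic counts, topic_word counts, sorted vocabulary).
    """
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    vocab_size = len(vocab)
    doc_topic = [[0] * n_topics for _ in docs]         # tokens per (doc, topic)
    topic_word = [Counter() for _ in range(n_topics)]  # tokens per (topic, word)
    topic_total = [0] * n_topics                       # tokens per topic
    assign = []                                        # topic of every token

    # Random initial topic assignment for every token.
    for d, doc in enumerate(docs):
        zs = []
        for w in doc:
            z = rng.randrange(n_topics)
            zs.append(z)
            doc_topic[d][z] += 1
            topic_word[z][w] += 1
            topic_total[z] += 1
        assign.append(zs)

    for _ in range(n_iter):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the token's current assignment from the counts.
                z = assign[d][i]
                doc_topic[d][z] -= 1
                topic_word[z][w] -= 1
                topic_total[z] -= 1
                # Conditional probability of each topic, up to a constant.
                weights = [
                    (doc_topic[d][k] + alpha)
                    * (topic_word[k][w] + beta)
                    / (topic_total[k] + vocab_size * beta)
                    for k in range(n_topics)
                ]
                z = rng.choices(range(n_topics), weights=weights)[0]
                assign[d][i] = z
                doc_topic[d][z] += 1
                topic_word[z][w] += 1
                topic_total[z] += 1
    return doc_topic, topic_word, vocab

def top_words(topic_word, n=3):
    """Most frequent words per topic, ignoring words with zero count."""
    return [[w for w, c in tw.most_common(n) if c > 0] for tw in topic_word]

# Toy corpus: made-up sentences, NOT text from the broadcast.
raw_docs = [
    "Water conservation and saving water in the village",
    "Students prepare for exams and students discuss exams",
    "Saving water is a duty of the village",
    "Exams and students: advice for students",
]
docs = [preprocess(t) for t in raw_docs]
doc_topic, topic_word, vocab = lda_gibbs(docs, n_topics=2)
print(top_words(topic_word))
```

In practice a library implementation would normally replace the hand-written sampler; it is spelled out here only to show what the model estimates: per-document topic counts and per-topic word counts, from which the most likely topics and their top words are read off.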
© 2020 Kandukuri, Haragopal. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Published By Indian Society for Education and Environment (iSee)