• P-ISSN 0974-6846 E-ISSN 0974-5645

Indian Journal of Science and Technology

Article

Indian Journal of Science and Technology

Year: 2023, Volume: 16, Issue: Special Issue 2, Pages: 22-29

Original Article

Text Summarization in Assamese Language using Sequence to Sequence RNNs

Received Date:23 March 2023, Accepted Date:26 June 2023, Published Date:02 November 2023

Abstract

This study discusses an approach for abstractive text summarization in the Assamese language. Objective: The main objective of this paper is to develop a novel approach for abstractive text summarization in Assamese that efficiently condenses large information while keeping its core meaning. Methods: We utilise a sequence-to-sequence Recurrent Neural Network [RNN] model with an encoder-decoder architecture in this paper. In this study, we use a Bi-LSTM on the encoder side, an attention mechanism, a softmax layer on the decoder side, and Assamese news items obtained from renowned daily newspapers in Assam. Findings: The results show that the suggested model is effective, with a train loss of 0.008 and assessment scores based on ROUGE-1, ROUGE-2, and ROUGE-L criteria. Novelty: The novelty lies in filling the aforementioned gap by proposing and implementing an abstractive text summarization approach for the Assamese language. Improve Assamese text summarization approaches by applying the presented strategy to news items.

Keywords: Abstractive, NLP, RNN, Seq2Seq, Text summarization

References

  1. Baruah R, Mundotiya RK, Singh AK. Low Resource Neural Machine Translation: Assamese to/from Other Indo-Aryan (Indic) Languages. ACM Transactions on Asian and Low-Resource Language Information Processing. 2021;21(1):1–32. Available from: https://doi.org/10.1145/3469721
  2. Varade S, Sayyed E, Nagtode V, Shinde S. Text Summarization using Extractive and Abstractive Methods. ITM Web of Conferences. 2021;40:1–5. Available from: https://doi.org/10.1051/itmconf/20214003023
  3. Bahare RS, Ganguly S, Choowongkomon K, Seetaha S, . Synthesis, HIV-1 RT inhibitory, antibacterial, antifungal and binding mode studies of some novel N-substituted 5-benzylidine-2, 4-thiazolidinediones. DARU Journal of Pharmaceutical Sciences. 2015;23(6):1–15. Available from: https://doi.org/10.1186/s40199-014-0086-1
  4. Babu GLA, Badugu S. Deep learning based sequence to sequence model for abstractive telugu text summarization. Multimedia Tools and Applications. 2023;82:17075–17096. Available from: https://doi.org/10.1007/s11042-022-14099-x
  5. Liu PJ, Saleh M, Pot E, Goodrich B, Sepassi R, Kaiser L, et al. Generating Wikipedia by Summarizing Long Sequences. 2018. Available from: https://arxiv.org/abs/1801.10198
  6. Nath P, Adhikary PK, Dadure P, Pakray P, Manna R, Bandyopadhyay S. Image Caption Generation for Low-Resource Assamese Language. In: The 34th Conference on Computational Linguistics and Speech Processing (ROCLING 2022). Taipei, Taiwan, November 21-22, 2022. p. 263–272.
  7. Baruah R, Mundotiya RK, Singh AK. Low Resource Neural Machine Translation: Assamese to/from Other Indo-Aryan (Indic) Languages. ACM Transactions on Asian and Low-Resource Language Information Processing. 2021;21(1):1–32. Available from: https://doi.org/10.1145/3469721
  8. Talukder MAI, Abujar S, Masum AKM, Faisal F, Hossain SA. Bengali abstractive text summarization using sequence to sequence RNNs. In: 2019 10th International Conference on Computing, Communication and Networking Technologies (ICCCNT). Kanpur, India, 06-08 July 2019. IEEE. .
  9. Yang J, You J, Wan X. Graph Embedding via Graph Summarization. IEEE Access. 2021;9:45163–45174. Available from: https://doi.org/10.1109/access.2021.3067901
  10. Cho K, Merrienboer BV, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation. Association for Computational Linguistics. 2014. Available from: https://arxiv.org/pdf/1406.1078.pdf
  11. Goutom PJ, Baruah N, Sonowal P. An abstractive text summarization using deep learning in Assamese. International Journal of Information Technology. 2023;15:2365–2372. Available from: https://link.springer.com/article/10.1007/s41870-023-01279-7
  12. Chowdhury RR, Nayeem MT, Mim TT, Chowdhury MSR, Jannat T. Unsupervised Abstractive Summarization of Bengali Text Documents. 2021. Available from: https://arxiv.org/pdf/2102.04490.pdf

Copyright

© 2023 Goutom & Baruah. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Published By Indian Society for Education and Environment (iSee)

DON'T MISS OUT!

Subscribe now for latest articles and news.