Urdu and Hindi Poetry Generation Using Neural Networks

  • Conference paper
Data Management, Analytics and Innovation (ICDMAI 2022)

Part of the book series: Lecture Notes on Data Engineering and Communications Technologies (LNDECT, volume 137)

Abstract

One major problem writers and poets face is writer’s block, a condition in which an author loses the ability to produce new work or experiences a creative slowdown. The problem is harder for poetry than for prose: prose authors need not be especially concise when expressing their ideas, and aspects such as rhyme and poetic meter are not relevant to prose. One of the most effective ways for poets to overcome writer’s block is a prompt system that stimulates their imagination and opens their minds to new ideas. Such a system can generate one-line or two-line prompts, or full ghazals. The purpose of this work is to pay tribute to Urdu/Hindi poets and to help them begin their next line of poetry, a couplet, or a complete ghazal, taking into account factors such as rhyme, refrain, and meter. The resulting system helps aspiring poets find new ideas and overcome writer’s block by auto-generating pieces of poetry using deep learning techniques. A concern with creative work of this kind, especially in a literary context, is ensuring that the output is not plagiarized. This work addresses that concern and ensures that the generated poems are not exact matches of the input data, by using parameters such as temperature and by manually checking for plagiarism against the input corpus. To the best of our knowledge, although automatic text generation has been studied extensively in the literature, the specific problem of Urdu/Hindi poetry generation has received little attention. Apart from developing a system to auto-generate Urdu/Hindi poetry, another key contribution of our work is a cleaned and preprocessed corpus of Urdu/Hindi poetry (derived from authentic resources), which we make freely available to researchers.
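
The two safeguards named in the abstract, a temperature parameter and a check against the input corpus, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names `sample_with_temperature` and `exact_matches` are hypothetical, the scores are assumed to be unnormalised model outputs for the next token, and the corpus check here is an automated stand-in for the manual check the abstract describes.

```python
import math
import random


def sample_with_temperature(logits, temperature=1.0, rng=random):
    """Sample an index from unnormalised scores after temperature scaling.

    Hypothetical helper: higher temperature flattens the distribution
    (more varied output), lower temperature sharpens it.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    weights = [math.exp(x - peak) for x in scaled]
    return rng.choices(range(len(logits)), weights=weights, k=1)[0]


def exact_matches(generated_lines, corpus_lines):
    """Return generated lines that appear verbatim in the corpus.

    Hypothetical helper: lines are compared after collapsing whitespace,
    so only exact copies are flagged, not paraphrases.
    """
    def normalise(text):
        return " ".join(text.split())

    seen = {normalise(line) for line in corpus_lines}
    return [line for line in generated_lines if normalise(line) in seen]
```

In such a setup, any line returned by `exact_matches` would be discarded or regenerated, possibly at a higher temperature, before being shown to the poet.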

Notes

  1. https://www.shakeeb.in and https://ur.shakeeb.in.

Author information

Corresponding author

Correspondence to Pushkar Joglekar.

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Ahmad, S., Joglekar, P. (2023). Urdu and Hindi Poetry Generation Using Neural Networks. In: Goswami, S., Barara, I.S., Goje, A., Mohan, C., Bruckstein, A.M. (eds) Data Management, Analytics and Innovation. ICDMAI 2022. Lecture Notes on Data Engineering and Communications Technologies, vol 137. Springer, Singapore. https://doi.org/10.1007/978-981-19-2600-6_34
