Recommending Domain Specific Keywords for Twitter

  • Muhammad Adeel Abid
  • Muhammad Faheem Mushtaq
  • Urooj Akram
  • Bushra Mughal
  • Maqsood Ahmad
  • Muhammad Imran
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 978)


Twitter has become the most popular social media platform in today’s world. More than 284 million users are online monthly, and 80% of users access their Twitter accounts through mobile devices. A tweet is limited to 140 characters, so it carries concise information about a topic. Owing to this popularity and usage, about 500 million tweets relating to different domains are sent per day. This work focuses on recommending domain-specific keywords for Twitter. For this purpose, 10 domains are chosen as a sample. We then apply the Term Frequency-Inverse Document Frequency (TF-IDF) and log-likelihood methods and compare the keywords that each method extracts for every domain, to make the results more valuable. Furthermore, the keywords are categorized as nouns and verbs, and sentiment words are identified. Finally, a relevancy test is performed with five users. These keywords can be of great value in clustering tweet data and can be used to identify a user’s interest in a specific domain. They are also a valuable asset for advertising.
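As a rough illustration of the two scoring methods named in the abstract, the sketch below ranks terms per domain by TF-IDF and by a common two-term form of the log-likelihood statistic. It is not the authors' implementation: the function names, the whitespace tokenizer, the choice to treat each domain as a single document, the use of the remaining domains as the reference corpus, and the sample tweets are all assumptions made for the example.

```python
import math
from collections import Counter


def domain_counts(domain_docs):
    """Count lowercase whitespace tokens per domain (a deliberately naive tokenizer)."""
    return {d: Counter(w for tweet in tweets for w in tweet.lower().split())
            for d, tweets in domain_docs.items()}


def tfidf_keywords(domain_docs, top_k=3):
    """Rank terms per domain by TF-IDF, treating each domain as one document."""
    counts = domain_counts(domain_docs)
    n_domains = len(counts)
    # Number of domains in which each term occurs at least once.
    doc_freq = Counter(w for c in counts.values() for w in c)
    result = {}
    for d, c in counts.items():
        total = sum(c.values())
        scores = {w: (n / total) * math.log(n_domains / doc_freq[w])
                  for w, n in c.items()}
        # Sort by descending score, breaking ties alphabetically.
        result[d] = sorted(scores, key=lambda w: (-scores[w], w))[:top_k]
    return result


def log_likelihood(a, b, c, d):
    """Log-likelihood of a term seen a times in c tokens versus b times in d tokens."""
    e1 = c * (a + b) / (c + d)  # expected count in the first corpus
    e2 = d * (a + b) / (c + d)  # expected count in the second corpus
    g2 = 0.0
    if a > 0:
        g2 += a * math.log(a / e1)
    if b > 0:
        g2 += b * math.log(b / e2)
    return 2 * g2


def ll_keywords(domain_docs, top_k=3):
    """Rank terms per domain by log-likelihood against all other domains pooled."""
    counts = domain_counts(domain_docs)
    result = {}
    for d, c in counts.items():
        rest = Counter()
        for other, oc in counts.items():
            if other != d:
                rest.update(oc)
        c_total, r_total = sum(c.values()), sum(rest.values())
        scores = {}
        for w, a in c.items():
            b = rest[w]
            # Keep only terms that are relatively more frequent in this domain.
            if r_total == 0 or a / c_total > b / r_total:
                scores[w] = log_likelihood(a, b, c_total, r_total)
        result[d] = sorted(scores, key=lambda w: (-scores[w], w))[:top_k]
    return result


# Hypothetical sample data, not taken from the paper.
sample = {
    "sports": ["great match today", "the team won the match"],
    "politics": ["the election results today", "vote in the election"],
}
print(tfidf_keywords(sample, top_k=1))
print(ll_keywords(sample, top_k=1))
```

Terms shared evenly across domains (here "the" and "today") score zero under TF-IDF and are filtered out of the log-likelihood ranking, which is why comparing the two keyword lists per domain is a sensible cross-check. A real pipeline would also need proper tokenization, stop-word handling, and part-of-speech tagging for the noun and verb categorization mentioned above.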


Keywords: Twitter; Social media; Domain-specific keywords; Clustering tweets data; Advertisement



Copyright information

© Springer Nature Switzerland AG 2020

Authors and Affiliations

  • Muhammad Adeel Abid
  • Muhammad Faheem Mushtaq (1, corresponding author)
  • Urooj Akram (1)
  • Bushra Mughal
  • Maqsood Ahmad
  • Muhammad Imran (1)

  1. Faculty of Computer Science and Information Technology, Khwaja Fareed University of Engineering and Information Technology, Rahim Yar Khan, Pakistan
