Skip to main content
Log in

Research and implementation of event extraction from twitter using LDA and scoring function

  • Original Research
  • Published:
International Journal of Information Technology Aims and scope Submit manuscript

Abstract

With the fast growth of social media, interest is increasing in detecting popular events from tweets. Event extraction is a work which identifies events from tweets or database of tweets. Each and every day, hundreds of megabytes of current stories are being added into the news archives of the major news agencies, containing much important and interesting news. Aim of this event extraction strategy is to extract and retrieve major life events from twitter data. Example of events extraction include seminar presentation, Job opening, Admission in Top universities, new technology etc. The role of this extraction is to collect major life events in the form of retrievable entries that include structured data about major life event name, location and time. Most of previous research on event extraction was mainly on textual level extraction such as News, medical systems, text summarization, whereas less work has been done on event extraction from noisy text such as tweets. For instance, tweets are short and self-contained which make them lack useful information. The target of this research is to develop algorithm and methodology that extract and efficiently conclude major life events extracted from social media.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Mohammad A-S, Qawasmeh O (2016) Knowledge-based approach for event extraction from Arabic tweets. IJACSA 7(6):483–490

    Google Scholar 

  2. Foley J, Benderky M, Josifovski V (2015) Learning to extract local events from the web. SIGIR. https://doi.org/10.1145/2766462.2767739

    Google Scholar 

  3. Li J, Ritter A, Cardie C, Hovy E (2014) Major life event extraction from twitter based on congratulations/condolences speech Acts. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), ACM, Doha, 25–29 Oct 2014, pp 1997–2007

  4. Katsios G, Vakulenko S, Krithara A, Paliouras G (2012) Open domain event extraction from twitter. ACM 978-1-4503-1462

  5. Liu F, Chen J, Jagannathha A, Yu H (2016) Learning from biomedical information extraction: methodology review of recent advances. ACM

  6. Basher ARMA, Purdy AS, Birol I (2015) Event extraction from biomedical literature. https://doi.org/10.1101/034397

    Google Scholar 

  7. Hogenboom F, Frasincar F, Kaymak U, de Jong F, Caron E (2016) A survey of event extraction methods from text for decision support system. ACM  

  8. Ramesh D, Suresh Kumar S (2016) Event extraction from natural language text. IJESRT. https://doi.org/10.5281/zenodo.58638

    Google Scholar 

  9. Ramakrishna K, Guda V, Padmaja Rani B, Chakthi V (2011) Novel model for timed event extraction and temporal reasoning in legal text documents. IJCSES 2(1):39–48

    Article  Google Scholar 

  10. Aliane H, Guendouzi W, Mokrani A (2013) Annotating events, time and place expressions in arabic texts. In: Proceedings of recent advances in natural language processing, Hissar, Bulgaria, 7–13 Sept 2013, pp 25–31

  11. Pawlowski JM, Bick M, Peinl R, Thalmann S, Maier R, Hetmank DWIL, Kruse D-W-IP, Martensen M, Pirkkalainen H (2014) Social knowledge environments. Bus Inf Syst Eng 6(2):81–88

    Article  Google Scholar 

  12. Lehmann J, Isele R, Jakob M, Jentzsch A, Kontokostas D, Mendes PN, Hellmann S, Morsey M, van Kleef P, Auer S et al (2015) Dbpedia—a large-scale, multilingual knowledge base extracted from wikipedia. Semantic Web 6(2):167–195

    Google Scholar 

  13. Thamviset W, Wongthanavasu S (2014) Bottom-up region extractor for semi-structured web pages. ICSEC, IEEE, pp 284–289

  14. Kuzey E, Vreeken J, Weikum G (2014) A fresh look on knowledge bases: distilling named events from news. CIKM’14, ACM, pp 1689–1698

  15. Manjula R, Chilambuchelvan A (2013) Extracting templates from web pages. In Green computing, communication and conservation of energy (ICGCE), 2013 international conference on, IEEE, pp 788–791

  16. Tanev H, Ehrmann M, Piskorski J, Zavarella V (2014) Enhancing event descriptions through twitter mining. Sixth international AAAI conference on weblogs and social media pages

  17. Piskorski J, Yangarbe R (2013) Information extraction: past, future, present. Springer-Verlag, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28569-1__2

    Google Scholar 

  18. Sleiman HA, Corchuelo R (2013) A survey on region extractors from web documents. Knowl Data Eng IEEE 25(9):1960–1981

    Article  Google Scholar 

  19. Lingad J, Karimi S, Yin J (2013) Location extraction from disasterrelated microblogs in WWW (companion volume). International World Wide Web conferences steering committee/ACM, pp 1017–1020

  20. Li C, Sun A (2014) Fine-grained location extraction from tweets with temporal awareness. In the 37th international ACM SIGIR conference on research and development in information retrieval, SIGIR’14, Gold Coast, QLD, Australia—July 06–11, 2014, pp 43–52. http://doi.org/10.1145/2600428.2609582

  21. McClosky D, Surdeanu M, Manning CD (2013) Event extraction as dependency parsing. In: Proceedings of the 2013 meeting of the association for computational linguistics (ACL), pp.178–186

  22. Radinsky K, Horvitz E (2013) Mining the web to predict future events. ACM 978-1-4503-1869

Download references

Acknowledgements

A special thank to Dr. Parul gupta for her technical support to implement this model and also for useful comments, discussions, and suggestions regarding this approach. All data is extracted from twitterdata warehouse.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Monika Gupta.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Gupta, M., Gupta, P. Research and implementation of event extraction from twitter using LDA and scoring function. Int. j. inf. tecnol. 11, 365–371 (2019). https://doi.org/10.1007/s41870-018-0206-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s41870-018-0206-0

Keywords

Navigation