Skip to main content

Towards Better Understanding Researcher Strategies in Cross-Lingual Event Analytics

  • Conference paper
  • First Online:
Digital Libraries for Open Knowledge (TPDL 2018)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11057))

Included in the following conference series:

Abstract

With an increasing amount of information on globally important events, there is a growing demand for efficient analytics of multilingual event-centric information. Such analytics is particularly challenging due to the large amount of content, the event dynamics and the language barrier. Although memory institutions increasingly collect event-centric Web content in different languages, very little is known about the strategies of researchers who conduct analytics of such content. In this paper we present researchers’ strategies for the content, method and feature selection in the context of cross-lingual event-centric analytics observed in two case studies on multilingual Wikipedia. We discuss the influence factors for these strategies, the findings enabled by the adopted methods along with the current limitations and provide recommendations for services supporting researchers in cross-lingual event-centric analytics.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 64.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 84.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    archive-it.org.

  2. 2.

    wiki.digitalmethods.net/Dmi/ToolWikipediaCrosslingualImageAnalysis.

  3. 3.

    wiki.digitalmethods.net/Dmi/ToolWikipediaEditsScraperAndIPLocalizer.

  4. 4.

    http://tools.medialab.sciences-po.fr/table2net/.

References

  1. Al Khatib, K., Schütze, H., Kantner, C.: Automatic detection of point of view differences in wikipedia. In: COLING 2012, pp. 33–50 (2012)

    Google Scholar 

  2. Bao, P., Hecht, B., Carton, S., Quaderi, M., Horn, M., Gergle, D.: Omnipedia: bridging the wikipedia language gap. In: CHI 2012, pp. 1075–1084 (2012)

    Google Scholar 

  3. Borra, E., Laniado, D., et al.: A platform for visually exploring the development of wikipedia articles. In: ICWSM 2015 (2015)

    Google Scholar 

  4. Demidova, E., Zhou, X., Nejdl, W.: Efficient query construction for large scale data. In: Proceedings of SIGIR 2013, pp. 573–582 (2013)

    Google Scholar 

  5. Fernando, Z.T., Marenzi, I., Nejdl, W.: ArchiveWeb: collaboratively extending and exploring web archive collections - how would you like to work with your collections? Int. J. Digit. Libr. 19(1), 39–55 (2018)

    Article  Google Scholar 

  6. Gossen, G., Demidova, E., Risse, T.: iCrawl: improving the freshness of web collections by integrating social web and focused web crawling. In: JCDL (2015)

    Google Scholar 

  7. Gossen, G., Demidova, E., Risse, T.: Extracting event-centric document collections from large-scale web archives. In: Kamps, J., Tsakonas, G., Manolopoulos, Y., Iliadis, L., Karydis, I. (eds.) TPDL 2017. LNCS, vol. 10450, pp. 116–127. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-67008-9_10

    Chapter  Google Scholar 

  8. Gottschalk, S., Demidova, E.: Analysing temporal evolution of interlingual wikipedia article pairs. In: SIGIR 2016, pp. 1089–1092 (2016)

    Google Scholar 

  9. Gottschalk, S., Demidova, E.: MultiWiki: interlingual text passage alignment in wikipedia. TWEB 11(1), 6:1–6:30 (2017)

    Article  Google Scholar 

  10. Gottschalk, S., Demidova, E.: EventKG: a multilingual event-centric temporal knowledge graph. In: Gangemi, A., et al. (eds.) ESWC 2018. LNCS, vol. 10843, pp. 272–287. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-93417-4_18

    Chapter  Google Scholar 

  11. Govind, Spaniol, M.: ELEVATE: a framework for entity-level event diffusion prediction into foreign language communities. In: WebSci 2017 (2017)

    Google Scholar 

  12. Jacomy, M., et al.: ForceAtlas2, a continuous graph layout algorithm for handy network visualization designed for the Gephi software. PloS One 9(6), e98679 (2014)

    Article  Google Scholar 

  13. Kim, S., Park, S., Hale, S.A., Kim, S., Byun, J., Oh, A.H.: Understanding editing behaviors in multilingual wikipedia. PloS One 11(5), e0155305 (2016)

    Article  Google Scholar 

  14. Massa, P., Scrinzi, F.: Manypedia: comparing language points of view of wikipedia communities. In: WikiSym 2012, p. 21 (2012)

    Google Scholar 

  15. Odijk, D., et al.: Supporting exploration of historical perspectives across collections. In: Kapidakis, S., Mazurek, C., Werla, M. (eds.) TPDL 2015. LNCS, vol. 9316, pp. 238–251. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24592-8_18

    Chapter  Google Scholar 

  16. Pentzold, C., et al.: Digging wikipedia: the online encyclopedia as a digital cultural heritage gateway and site. J. Comput. Cult. Herit. 10(1), 5:1–5:19 (2017)

    Article  Google Scholar 

  17. Risse, T., Demidova, E., Gossen, G.: What do you want to collect from the web? In: Proceedings of the Building Web Observatories Workshop (BWOW) 2014 (2014)

    Google Scholar 

  18. Rogers, R.: Digital Methods. MIT Press, Cambridge (2013)

    Google Scholar 

Download references

Acknowledgements

This work was partially funded by the ERC (“ALEXANDRIA”, 339233) and H2020-MSCA-ITN-2018-812997 “Cleopatra”.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Simon Gottschalk .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Gottschalk, S., Bernacchi, V., Rogers, R., Demidova, E. (2018). Towards Better Understanding Researcher Strategies in Cross-Lingual Event Analytics. In: Méndez, E., Crestani, F., Ribeiro, C., David, G., Lopes, J. (eds) Digital Libraries for Open Knowledge. TPDL 2018. Lecture Notes in Computer Science(), vol 11057. Springer, Cham. https://doi.org/10.1007/978-3-030-00066-0_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-00066-0_12

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-00065-3

  • Online ISBN: 978-3-030-00066-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics