Marky: A Lightweight Web Tracking Tool for Document Annotation

  • Martín Pérez-Pérez
  • Daniel Glez-Peña
  • Florentino Fdez-Riverola
  • Anália Lourenço
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 294)

Abstract

Document annotation is an elementary task in the development of Text Mining applications, notably in defining the entities and relationships that are relevant to a given domain. Many annotation software tools have been implemented. Some are specific to a Text Mining framework, while others are stand-alone tools. Regardless, most development efforts have focused on basic functionality, i.e. performing the annotation, and on the interface, making sure operation is intuitive and visually appealing. The deployment of large-scale annotation jamborees and projects has shown the need for additional features regarding inter- and intra-annotation management. Therefore, this paper presents Marky, a new Web-based document annotation tool that integrates a highly customisable annotation environment with a robust project management system. The novelty lies in the annotation tracking system, which supports per-user and per-round annotation change tracking and thus enables automatic annotation correction and agreement analysis.
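
To make the idea of per-user, per-round agreement analysis concrete, the following is a minimal sketch and not Marky's implementation. It assumes a hypothetical record layout of (user, round, document, start, end, entity type) and uses an exact-match F-score as the agreement measure, which is one common choice among several; the function names are illustrative only.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical record layout (an assumption for illustration, not Marky's schema):
# (user, round_no, doc_id, start, end, entity_type)


def group_annotations(records):
    """Index annotation spans by (round_no, user)."""
    grouped = defaultdict(set)
    for user, round_no, doc_id, start, end, entity_type in records:
        grouped[(round_no, user)].add((doc_id, start, end, entity_type))
    return grouped


def f_score(reference, candidate):
    """Exact-match F-score between two sets of annotations."""
    if not reference and not candidate:
        return 1.0
    matches = len(reference & candidate)
    if matches == 0:
        return 0.0
    precision = matches / len(candidate)
    recall = matches / len(reference)
    return 2 * precision * recall / (precision + recall)


def pairwise_agreement(records):
    """Return {round_no: {(user_a, user_b): f_score}} for each pair of users."""
    grouped = group_annotations(records)
    users_by_round = defaultdict(set)
    for round_no, user in grouped:
        users_by_round[round_no].add(user)
    result = {}
    for round_no, users in users_by_round.items():
        # Users are sorted so that each pair key has a stable order.
        result[round_no] = {
            (a, b): f_score(grouped[(round_no, a)], grouped[(round_no, b)])
            for a, b in combinations(sorted(users), 2)
        }
    return result
```

Under the same assumptions, intra-annotator consistency could be estimated by calling `f_score` on one user's annotation sets from two different rounds.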

Keywords

Text mining; document annotation; annotation guidelines; interannotator agreement; Web application

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Martín Pérez-Pérez (1)
  • Daniel Glez-Peña (1)
  • Florentino Fdez-Riverola (1)
  • Anália Lourenço (1, 2)
  1. ESEI - Escuela Superior de Ingeniería Informática, Edificio Politécnico, Universidad de Vigo, Ourense, Spain
  2. IBB - Institute for Biotechnology and Bioengineering, Centre of Biological Engineering, University of Minho, Braga, Portugal
