Abstract
Semantic enhancement of texts aids their use by researchers. However, mark-up of large bodies of text is slow and requires precious expert resources. The task could be automated if there were marked-up texts to train and test mark-up tools. This paper looks at the re-purposing of texts originally marked-up to support taxonomists to provide computer scientists with training and test data for their mark-up tools. The re-purposing highlighted some key differences in the requirements of taxonomists and computer scientists and their approaches to mark-up.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Biodiversity Heritage Library, http://www.biodiversitylibrary.org/
BHL Book of the Week: BiologiaCentrali-Americana, http://blog.biodiversitylibrary.org/2012/09/biologia-centrali-americana-hispanic.html
INOTAXA, INtegrated Open TAXonomic Access, http://www.inotaxa.org/
Weitzman, A.L., Lyal, C.H.C.: INOTAXA — INtegrated Open TAXonomic Access and the “ BiologiaCentrali-Americana”. In: Proceedings Of The Contributed Papers Sessions Biomedical And Life Sciences Division, SLA, p. 8 (2006), http://units.sla.org/division/dbio/Baltimore/index.html
ViBRANT, Virtual Biodiversity Research and Access Network for Taxonomy, http://vbrant.eu/
Murray-Rust, P., Rzepa, H.S.: Scientific publications in XML - towards a global knowledge base. Data Science 1, 84–98 (2002)
Cui, H.: Approaches to Semantic Mark-up for Natural Heritage Literature. In: Proceedings of the iConference 2008 (2008), http://ischools.org/conference08/pc/PA5-2_iconf08.doc
Parr, C.S., Lyal, C.H.C.: Use cases for online taxonomic literature from taxonomists, conservationists, and others. In: Proceedings of TDWG Annual Conference (2007), http://www.tdwg.org/proceedings/article/view/269
Penev, L., Lyal, C.H.C., Weitzman, A., Morse, D., King, D., Sautter, G., Georgiev, T., Morris, R.A., Catapano, T., Agosti, D.: XML schemas and mark-up practices of taxonomic literature. In: Smith, V., Penev, L. (eds.) e-Infrastructures for Data Publishing in Biodiversity Science, vol. 150, pp. 89–116. ZooKeys (2011)
TaxonX, http://www.taxonx.org/
PLAZI, http://www.plazi.org/
Weitzman, A.L., Lyal, C.H.C.: An XML schema for taxonomic literature – taXMLit - (2004), http://www.sil.si.edu/digitalcollections/bca/documentation/taXMLitv1-3Intro.pdf
TEI, Text Encoding Initiative, http://www.tei-c.org/index.xml
TaxPub, http://sourceforge.net/projects/
Catapano, T.: TaxPub: An extension of the NLM/NCBI Journal Publishing DTD for taxonomic descriptions. Proceedings of the Journal Article Tag Suite Conference (2010), http://www.ncbi.nlm.nih.gov/books/NBK47081/#ref2
US National Center for Biotechnology Information, http://www.ncbi.nlm.nih.gov/
Penev, L., Agosti, D., Georgiev, T., Catapano, T., Miller, J., Blagoderov, V., Roberts, D., Smith, V., Brake, I., Ryrcroft, S., Scott, B., Johnson, N., Morris, R., Sautter, G., Chavan, V., Robertson, T., Remsen, D., Stoev, P., Parr, C., Knapp, S., Kress, W., Thompson, C., Erwin, T.: Semantic tagging of and semantic enhancements to systematics papers: ZooKeys working examples. ZooKeys 50, 1–16 (2010), doi:10.3897/zookeys.50.538
PubMedCentral, http://www.ncbi.nlm.nih.gov/pmc/
Willis, A., King, D., Morse, D., Dil, A., Lyal, C., Roberts, D.: From XML to XML: The Why and How of Making the Biodiversity Literature Accessible to Researchers. In: Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC 2010), European Language Resources Association (ELRA), Valletta (2010), http://www.lrec-conf.org/proceedings/lrec2010/pdf/787_Paper.pdf
Ide, N., Romary, L.: International standard for a linguistic annotation framework. Journal of Natural Language Engineering 10(3-4), 211–225 (2004)
brat standoff format, http://brat.nlplab.org/standoff.html
brat rapid annotation tool, http://brat.nlplab.org/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
King, D., Morse, D.R. (2013). Document Mark-Up for Different Users and Purposes. In: Garoufallou, E., Greenberg, J. (eds) Metadata and Semantics Research. MTSR 2013. Communications in Computer and Information Science, vol 390. Springer, Cham. https://doi.org/10.1007/978-3-319-03437-9_34
Download citation
DOI: https://doi.org/10.1007/978-3-319-03437-9_34
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-03436-2
Online ISBN: 978-3-319-03437-9
eBook Packages: Computer ScienceComputer Science (R0)