Skip to main content

Developing Morpho-SLaWS: An API for the Morphosyntactic Annotation of the Serbian Language

  • 265 Accesses

Part of the Communications in Computer and Information Science book series (CCIS,volume 537)

Abstract

Serbian Lexical Web Service (SLaWS) is a resource-oriented web service designed to offer multiple functionalities—including morphosyntactic, lexicographic, and canonical text services—to create the backbone of a digital humanities infrastructure for the Serbian language. In this paper, we describe a key component of this service called Morpho-SLaWS, the atomic morphosyntactic component of the service infrastructure. The goal of Morpho-SLaWs is to offer a reliable, programmatic way of extracting morphosyntactic information about word forms using a revised version of the MULTEXT-East specification. As a service-oriented lexical tool, Morpho-SLaWS can be deployed in a variety of contexts and combined with other linguistic and DH tools.

Keywords

  • API design
  • Service architecture
  • Morphological lexicon
  • Serbian language
  • Digital humanities

This is a preview of subscription content, access via your institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • DOI: 10.1007/978-3-319-23980-4_9
  • Chapter length: 11 pages
  • Instant PDF download
  • Readable on all devices
  • Own it forever
  • Exclusive offer for individuals only
  • Tax calculation will be finalised during checkout
eBook
USD   44.99
Price excludes VAT (USA)
  • ISBN: 978-3-319-23980-4
  • Instant PDF download
  • Readable on all devices
  • Own it forever
  • Exclusive offer for individuals only
  • Tax calculation will be finalised during checkout
Softcover Book
USD   59.99
Price excludes VAT (USA)
Fig. 1.
Fig. 2.

Notes

  1. 1.

    At the same time, it is important to keep in mind that Serbian is not the only language suffering from the predicament of underdeveloped resources. “It is estimated that for most European languages, even the basic resources are not yet available.” (Váradi et al. 2008).

  2. 2.

    http://www.folgerdigitaltexts.org/api.

  3. 3.

    http://correspsearch.bbaw.de/index.xql?id=api.

  4. 4.

    http://litterra.info/.

  5. 5.

    http://bukvik.litterra.info/.

References

  • Berry, D.M.: Understanding Digital Humanities. Palgrave Macmillan, Houndmills (2012)

    CrossRef  Google Scholar 

  • Calzolari, N: Approaches towards a lexical web: the role of interoperability. In: Proceedings of the First International Conference on Global Interoperability for Language Resources, pp. 34–42 (2008)

    Google Scholar 

  • Cooper, D.: When nice people won’t share: shy data, web APIs, and beyond. In: Proceedings of the Second International Conference on Global Interoperability for Language Resources (ICGL 2010) (2010)

    Google Scholar 

  • Erjavec, T.: MULTEXT-east version 4: multilingual morphosyntactic specifications, lexicons and corpora. In: Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC 2010), European Language Resources Association (ELRA) (2010)

    Google Scholar 

  • Gold, M.K. (ed.): Debates in the Digital Humanities. University of Minnesota Press, Minneapolis (2012)

    Google Scholar 

  • Haslhofer, B., Simon, R., Sanderson, R., Van de Sompel, H.: The open annotation collaboration (OAC) model. In: Cyberpsychology and Behavior: The Impact of the Internet, Multimedia and Virtual Reality on Behavior and Society, pp. 5–9 (2011)

    Google Scholar 

  • Holmes, M.: CodeSharing: a simple API for disseminating our TEI encoding. In: Jenstad, J. (ed.): The Map of Early Modern London (2014). http://mapoflondon.uvic.ca/BLOG10.htm

  • Hunter, J., Cole, T., Sanderson, R., Van de Sompel, H.: The open annotation collaboration: a data model to support sharing and interoperability of scholarly annotations. In: Digital Humanities 2010, pp. 175–78 (2010), espace.library.uq.edu.au

  • Ishida, T.: Language grid: an infrastructure for intercultural collaboration. In: International Symposium on Applications and the Internet, SAINT 2006 (2006)

    Google Scholar 

  • Ivković, D.: Pragmatics meets ideology: digraphia and non-standard orthographic practices in serbian online news forums. J. Lang. Politics 12(3), 335–356 (2013)

    CrossRef  Google Scholar 

  • Krstev, C.: Processing of Serbian: Automata, Texts and Electronic Dictionaries. Faculty of Philology, Belgrade (2008)

    Google Scholar 

  • Krstev, C., Vitas, D., Obradović, I., Utvić, M.: E-Dictionaries and finite-state automata for the recognition of named entities. In: Proceedings of the 9th International Workshop on Finite State Methods and Natural Language Processing, FSMNLP 2011, pp. 48–56. ACL, Stroudsburgh (2011)

    Google Scholar 

  • Liu, A.: The state of the digital humanities: a report and a critique. Arts Humanit. High. Educ. 11(1–2), 8–41 (2012)

    CrossRef  Google Scholar 

  • Magner, T.F.: Digraphia in the territories of the croats and serbs. Int. J. Sociol. Lang. 2001(150), 11–26 (2001)

    CrossRef  Google Scholar 

  • Muehlen, M., Nickerson, J.V., Swenson, K.D.: Developing web services choreography standards—the case of REST vs. SOAP. Decis. Support Syst. 40(1), 9–29 (2005)

    CrossRef  Google Scholar 

  • Murakami, Y., Lin, D., Tanaka, M., Nakaguchi, T., Ishida, T.: Language service management with the language grid. In: Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC 2010), European Language Resources Association (ELRA) (2010)

    Google Scholar 

  • Nosek, J.D.: Open access liturgical resources for judaism. Theological Librarianship: Online J. Am. Theological Library Assoc. 6(2), 63–66 (2013)

    Google Scholar 

  • Ramsay, S.: Reading Machines: Toward an Algorithmic Criticism. University of Illinois Press, Urbana (2011)

    Google Scholar 

  • Richardson, L., Ruby, S.: RESTful Web Services. O’Reilly Media Inc., Sebastopol (2007)

    Google Scholar 

  • Sanderson, R., Ciccarese, P., Van de Sompel, H.: Open Annotation Data Model. W3C Community Draft 8 (2013)

    Google Scholar 

  • Smith, N.: Citation in Classical Studies. Digital Humanities Quarterly (2009). http://www.digitalhumanities.org/dhq/vol/3/1/000028/000028.html

  • Tasovac, T.: More or less than a dictionary? wordnet as a model for Serbian L2 lexicography. Infotheca: J. Inf. Librarinaship 10(1–2), 13–22 (2009)

    Google Scholar 

  • Tasovac, T.: Potentials and challenges of WordNet-based pedagogical lexicography: the transpoetika dictionary. In: Granger, S. (ed.) Potentials and Challenges of WordNet-Based Pedagogical Lexicography: The Transpoetika Dictionary. Electronic Lexicography, pp. 237–58. Oxford University Press (2012)

    Google Scholar 

  • Tasovac, T., Ermolaev, N.: Encoding diachrony: digital editions of serbian 18th-century texts. In: Gradmann, S., Borri, F., Meghini, C., Schuldt, H. (eds.) TPDL 2011. LNCS, vol. 6966, pp. 497–500. Springer, Heidelberg (2011a)

    Google Scholar 

  • Tasovac, T., Ermolaev, N.: A User-Centered Digital Edition of Vuk Stefanović Karadžić’s Lexicon Serbico-Germanico-Latinum. Digital Humanities 2011 (2011b). http://xtf-prod.stanford.edu/xtf/view?docId=tei/ab-297.xml;query=;brand=default

  • Tasovac, T., Petrović, S.: MULTEXT-East Revisited: Serbian Morphosyntactic Tags in Action (forthcoming)

    Google Scholar 

  • Tiepmar, J., Teichmann, C., Heyer, G., Berti, M., Crane, G.: A new implementation for canonical text services. In: Proceedings of the 8th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTeCH). ACL, Stroudsburgh (2014)

    Google Scholar 

  • Zhu, W.-P., Li M.-X., Huan,C.: Using MongoDB to implement textbook management system instead of MySQL. In: IEEE 3rd International Conference on Communication Software and Networks (ICCSN) (2011)

    Google Scholar 

  • Vitas, D., Popović, L., Krstev, C., Obradović, I., Pavlović-Lažetić, G., Stanojević, M.: The serbian language in the digital age. In: Rehm, G., Uszkoreit, H. (eds.) META-NET White Paper Series. Springer, Heidelberg (2012)

    Google Scholar 

  • Váradi, T., Wittenburg, P., Krauwer, S., Wynne, M., Koskenniemi, K.: CLARIN: common language resources and technology infrastructure. In: Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC 2008), European Language Resources Association (ELRA) (2008)

    Google Scholar 

  • Бpбopић, B.: O Jeзичкoм pacкoлy. Coциoлингвиcтички oглeди I. Бeoгpaд и Hoви Caд: ЦПЛ-Пpoмeтej (2000)

    Google Scholar 

  • Tacoвaц, T., Jepмoлaeв, H.: Диjaxpoниjcки пpиcтyп дигитaлним издaњимa cpпcкиx тeкcтoвa 18. вeкa. In: Bpaнeш, A. (ed.) Дигитaлизaциja кyлтypнe и нayчнe бaштинe 71–88. Филoлoшки фaкyлтeт, Бeoгpaд (2012)

    Google Scholar 

  • Чeмepикић, Д.: Збиpкa peчи из Пpизpeнa ДимитpиjaЧeмepикићa. Цeнтap зa дигитaлнe xyмaниcтичкe нayкe, Пpeпиc.opг. Бeoгpaд (2013)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Toma Tasovac .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and Permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Tasovac, T., Rudan, S., Rudan, S. (2015). Developing Morpho-SLaWS: An API for the Morphosyntactic Annotation of the Serbian Language. In: Mahlow, C., Piotrowski, M. (eds) Systems and Frameworks for Computational Morphology. SFCM 2015. Communications in Computer and Information Science, vol 537. Springer, Cham. https://doi.org/10.1007/978-3-319-23980-4_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-23980-4_9

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-23978-1

  • Online ISBN: 978-3-319-23980-4

  • eBook Packages: Computer ScienceComputer Science (R0)