Markup Languages and Internationalization

Linguistic Modeling of Information and Markup Languages

Part of the book series: Text, Speech and Language Technology (TLTB, volume 41)

Abstract

The success of the markup language XML is partly due to its internationalization capabilities. “Internationalization” means the readiness of a product or technology for an international market and for users with different languages, cultures and cultural preferences. The aim of the paper is threefold. First, it introduces aspects of internationalization found within the XML standard itself, such as support for the Unicode character encoding. Second, it demonstrates specific benefits of XML-related technologies for users with internationalization and localization needs, including technologies such as ITS and XLIFF, which are used within the localization industry. Third, the paper discusses current and future topics of Web internationalization, with XML as the backbone of a truly multilingual Web that is yet to come. These three aspects are presented in relation to other areas of information modeling that are part of this volume.

Author information

Corresponding author

Correspondence to Felix Sasaki.

Copyright information

© 2010 Springer Science+Business Media B.V.

About this chapter

Cite this chapter

Sasaki, F. (2010). Markup Languages and Internationalization. In: Witt, A., Metzing, D. (eds) Linguistic Modeling of Information and Markup Languages. Text, Speech and Language Technology, vol 41. Springer, Dordrecht. https://doi.org/10.1007/978-90-481-3331-4_4

  • DOI: https://doi.org/10.1007/978-90-481-3331-4_4

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-90-481-3330-7

  • Online ISBN: 978-90-481-3331-4

  • eBook Packages: Computer Science, Computer Science (R0)
