Abstract
The success of the markup language XML is partly due to its internationalization capabilities. “internationalization” means the readiness of a product or a technology for an international market and users with different languages, cultures and cultural preferences. The aim of the paper is threefold. First, it introduces aspects of internationalization found within the XML standard itself, like the support of the Unicode character encoding. Second, it demonstrates specific benefits of XML-related technologies for users with internationalization and localization needs. That includes technologies like ITS and XLIFF which are used within the localization industry. Third, the paper discusses current and future topics of Web internationalization, with XML as the backbone of a yet to come truly multilingual web. These three aspects are presented in relation to other areas of information modeling which are part of this volume.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Aliprand, J., Allen, J., et al., editors (2005). The Unicode Standard. Version 4.0. Addison-Wesley, Boston.
Alvestrand, H. (2001). Tags for the Identification of Languages. Technical report, IETF. http://www.ietf.org/rfc/rfc3066.txt.
Berglund, A., Boag, S., et al. (2005). XML Path Language (XPath) 2.0. W3C Canidate Recommendation. Technical report, W3C. http://www.w3.org/TR/xpath20/.
Biron, P. V. and Malhoctra, A. (2004). XML Schema Part 2: Datatypes Second Edition. W3C Recommendation. Technical report, W3C. http://www.w3.org/TR/xmlschema-2/.
Boag, S., Chamberlin, D., et al. (2005). XQuery 1.0: An XML Query Language. W3C Candidate Recommendation. Technical report, W3C. http://www.w3.org/TR/xquery/.
Davis, M. (2005). The Bidirectional Algorithm. Unicode Standard Annex #9. Technical report, Unicode Consortium. http://www.unicode.org/reports/tr9/.
DĂĽrst, M. and Freytag, A. (2003). Unicode in XML and other Markup Languages. Technical report, W3C and Unicode Consortium. http://www.w3.org/TR/unicode-xml/.
Emerson, T. (2006). Large Corpus Construction for Chinese Lexicon Development. In Proceedings of the 29th Internationalization and Unicode Conference, San Francisco.
Ishida, R. and Miller, S. (2006). Localization vs. Internationalization. Article of the W3C i18n Activity. http://www.w3.org/International/questions/qa-i18n.
Lieske, C. and F. Sasaki (2007). Internationalization Tag Set (ITS) 1.0. W3C Recommendation. Technical report, W3C. http://www.w3.org/TR/its/.
Phillips, A. (2006). Internationalization: An Introduction. In Proceedings of the 29th Internationalization and Unicode Conference, San Francisco.
Phillips, A. and Davis, M. (2006a). Matching of Language Tags. Technical report, IETF. http://www.ietf.org/rfc/rfc4647.txt.
Phillips, A. and Davis, M. (2006b). Tags for Identifying Languages. Technical report, IETF. http://www.ietf.org/rfc/rfc4647.txt.
Phillips, A. and Davis, M. (2006c). Tags for Identifying Languages. Technical report, IETF. http://www.ietf.org/rfc/rfc4646.txt.
Raggett, D., Hors, A. Le, and Jacobs, I. (1999). HTML 4.01 Specification. W3C Recommendation. Technical report, W3C. http://www.w3.org/TR/html401.
Sasaki, F. (2005). Internationalization is everywhere. ACM Ubiquity, 6. http://www.acm.org/ubiquity/issues6.html.
Savourel, Y. (2005). TMX 1.4b Specification. Technical report, Localisation Industry Standards Association (LISA). TMX 1.4b Specification.
Savourel, Y. and Reid, J. (2003). XLIFF 1.1 Specification. Technical report, OASIS. http://www.oasis-open.org/committees/xliff/documents/ xliff-specification.htm.
Sawicki, M., Suignard, M., et al. (2001). Ruby Annotation. W3C Recommendation. Technical report, W3C. http://www.w3.org/TR/ruby/.
Senellart, P. and Senellart, J. (2005). SYSTRAN Translation Stylesheets: Machine Translation driven by XSLT. In Proceedings of XML Conference 2005, Atlanta.
Simons, G. and Bird, S. (2003). OLAC Metadata. Technical report, SIL International and others. http://www.language-archives.org/OLAC/metadata.html.
Simons, G. F., Lewis, W. D., et al. (2004). The Semantics of Markup: Mapping Legacy Markup Schemas to a Common Semantics. In Proceedings of Coling 2004, Geneva.
Wahlster, W., editor (2000). Verbmobil: Foundations of Speech-to-Speech Translation. Springer, Berlin.
Wittenburg, P., Broeder, D., and Sloman, B. (2000). EAGLES/ISLE: A Proposal for a Meta Description Standard for Language Resources. In Proceedings of LREC 2000, Athens.
Yergeau, F., DĂĽrst, M., et al. (2005). Character Model for the World Wide Web 1.0: Normalization. W3C Working Draft. Technical report, W3C. http://www.w3.org/TR/charmod-norm/.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer Science+Business Media B.V.
About this chapter
Cite this chapter
Sasaki, F. (2010). Markup Languages and Internationalization. In: Witt, A., Metzing, D. (eds) Linguistic Modeling of Information and Markup Languages. Text, Speech and Language Technology, vol 41. Springer, Dordrecht. https://doi.org/10.1007/978-90-481-3331-4_4
Download citation
DOI: https://doi.org/10.1007/978-90-481-3331-4_4
Published:
Publisher Name: Springer, Dordrecht
Print ISBN: 978-90-481-3330-7
Online ISBN: 978-90-481-3331-4
eBook Packages: Computer ScienceComputer Science (R0)