Character Representation

Abstract

This paper describes ISO standard character sets currently in use, the use of SGML entity sets, and the TEI writing system declaration.

Harry Gaylord is a senior lecturer in the Department of Humanities Computing at Groningen University, Groningen, The Netherlands. He is Chair of the SC 18 committee of the Dutch Standards Institute (NNI), and a member of ISO SC 2 (Coded Character Sets) and ISO SC 18/WG8 (Document Processing and Related Communication). He is the author of TEI TR1 W4, Character Entities and Public Entity Sets, and co-author with John Esling of “Computer Codes for Phonetic Symbols”, Journal of the International Phonetic Association (1993), 23, 2, 85–97.

The Chair of our technical committee submitted a draft of this article. Comments were received from David Birnbaum, Bert Bos, Steve De Rose, Berend Dijk, and Michael Sperberg-McQueen, for which we thank them. The committee made revisions before submitting the final version to the editors of this issue of CHUM. In recognition of their longstanding contributions to our work, the committee has elected the above-mentioned contributors as honourable members, which increases our numbers five-fold. We also wish to thank the secretary general of ISO, L.D. Eicher, for permission to publish portions of the ISO standards, and Jan van den Beld, secretary general of ECMA, for furnishing a complete copy of its official register of all known character sets and current versions of the ECMA standards. We also thank Edwin Smura, registrar of AFII, for supplying a copy of much of their font registry.

References

  • Birnbaum, David J. “Issues in Developing International Standards for Encoding Non-Latin Alphabets”. Proceedings of the Fourth International Conference on Symbolic and Logical Computing. 1989, pp. 41–54.

  • Esling, John and Harry Gaylord. “IPA and TEI encoding for Phonetic Symbols”. Journal of the International Phonetic Association. (1993), 23, 2, 85–97.

  • Exoterica. Record Boundary Processing in SGML. Exoterica Corporation, 1992a. (1545 Carling Avenue, Suite 404, Ottawa, Ont., CA K1Z 8P9.) ECM13-1092.

  • Exoterica. Understanding The SGML Declaration. Exoterica Corporation, 1992b. (1545 Carling Avenue, Suite 404, Ottawa, Ont., CA K1Z 8P9.) ECM03-1092.

  • Gaylord, Harry E. “Character Entities and Public Entity Sets”. Technical Report TEI TR1 W4, Text Encoding Initiative, 1992. Available on Internet SGML servers. On sgml1.ex.ac.uk it is in tei/tech/trlw4.teil.

  • Goldfarb, Charles F. The SGML Handbook. Clarendon Press, Oxford, 1990. Edited and with a foreword by Yuri Rubinsky.

  • Hart, Edwin, ed. ASCII and EBCDIC: Character Set and Code Issues in Systems Application Architecture. SHARE Inc., 1989. (111 East Wacker Drive, Chicago, IL 60601, U.S.A.)

  • ISO. ISO 8879:1986, Standard Generalized Markup Language. Geneva: ISO, 1986.

  • ISO. ISO 10646:1992 Universal Coded Character Set (UCS). Geneva: ISO, 1992.

  • ISO. ISO 10744:1992 Hypermedia/Time-based Structuring Language (HyTime). Geneva: ISO, 1992.

  • Mackenzie, Charles E. Coded Character Sets: History and Development. The System Programming Series. Reading, MA: Addison-Wesley, 1980. ISBN 0-201-14460-3.

  • The Unicode Consortium. The Unicode Standard: Worldwide Character Encoding. Vol. 1. Addison-Wesley, 1.0 edition, 1991. ISBN 0-201-56788-1.

  • Wingen, Johan W. van. Manual: standards for the electronic exchange of personal data. Part 5: character sets. The Hague: Ministry of the Interior, 1995. ISBN: 90-5414-019-4.

Copyright information

© 1995 Springer Science+Business Media Dordrecht

About this chapter

Cite this chapter

Gaylord, H.E. (1995). Character Representation. In: Ide, N., Véronis, J. (eds) Text Encoding Initiative. Springer, Dordrecht. https://doi.org/10.1007/978-94-011-0325-1_5

  • DOI: https://doi.org/10.1007/978-94-011-0325-1_5

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-0-7923-3704-1

  • Online ISBN: 978-94-011-0325-1

  • eBook Packages: Springer Book Archive
