SIST ISO/IEC 6937:2010
Information technology - Coded graphic character set for text communication - Latin alphabet
This International Standard
a) specifies the coded representation of the characters;
b) specifies a repertoire of the Latin alphabetic and non-alphabetic characters for the communication of text in
many European languages using the Latin script;
c) specifies rules for the definitions and use of graphic character subrepertoires, i.e. subsets of the specified
character repertoire.
French title: Technologies de l'information - Jeu de caractères graphiques codés pour la transmission de texte - Alphabet latin
Slovenian title: Informacijska tehnologija - Nabor grafičnih znakov za komunikacijo z besedili - Latinična abeceda
SLOVENIAN STANDARD (SLOVENSKI STANDARD)
1 September 2010
Replaces: SIST ISO/IEC 6937:1995
Information technology - Coded graphic character set for text communication - Latin alphabet
This Slovenian standard is identical to: ISO/IEC 6937:2001
ICS: 35.040 Character sets and information coding
2003-01. Slovenian Institute for Standardization (Slovenski inštitut za standardizacijo). Reproduction of all or part of this standard is not permitted.
INTERNATIONAL STANDARD ISO/IEC 6937
Third edition
2001-12-15
Information technology - Coded graphic character set for text communication - Latin alphabet
Technologies de l'information - Jeu de caractères graphiques codés pour la transmission de texte - Alphabet latin
© ISO/IEC 2001
Contents
Foreword
Introduction
1 Scope
2 Conformance and implementation
2.1 Conformance
2.2 Implementation
3 Normative references
4 Terms and definitions
5 Notation, code table and names
5.1 Notation
5.2 Code table
5.3 Names
6 Specifications of SPACE, NO-BREAK SPACE and SOFT HYPHEN
7 Composition of the character repertoire
8 Specification of the coded character set
8.1 Character sets
8.2 Explanations concerning the code table
8.3 Coded representations of the graphic characters of the repertoire
9 Graphic character subrepertoires
10 Identification of options
10.1 Purpose and context of identification
10.2 Identification of coding method
10.3 Identification of primary and supplementary sets
10.4 Identification of subrepertoire
Annex A (normative) 7-bit code
Annex B (informative) Method of definition of short identifiers of this International Standard
Annex C (informative) Use of non-spacing diacritical marks
Annex D (informative) Use of Latin alphabetic characters in various languages
Annex E (informative) Alternative coded representation of the repertoire with no non-spacing diacritical marks
Annex F (informative) Main differences between the 1994 (second) edition of ISO/IEC 6937 and the present (third) edition of this International Standard
Bibliography
Foreword
ISO (the International Organization for Standardization) and IEC (the International Electrotechnical Commission)
form the specialized system for worldwide standardization. National bodies that are members of ISO or IEC
participate in the development of International Standards through technical committees established by the
respective organization to deal with particular fields of technical activity. ISO and IEC technical committees
collaborate in fields of mutual interest. Other international organizations, governmental and non-governmental, in
liaison with ISO and IEC, also take part in the work.
International Standards are drafted in accordance with the rules given in the ISO/IEC Directives, Part 3.
In the field of information technology, ISO and IEC have established a joint technical committee, ISO/IEC JTC 1.
Draft International Standards adopted by the joint technical committee are circulated to national bodies for voting.
Publication as an International Standard requires approval by at least 75 % of the national bodies casting a vote.
Attention is drawn to the possibility that some of the elements of this International Standard may be the subject of
patent rights. ISO and IEC shall not be held responsible for identifying any or all such patent rights.
International Standard ISO/IEC 6937 was prepared by Joint Technical Committee ISO/IEC JTC 1, Information
technology, Subcommittee SC 2, Coded character sets.
This third edition cancels and replaces the second edition (ISO/IEC 6937:1994), which has been technically
revised.
Annex A forms a normative part of this International Standard. Annexes B, C, D, E and F are for information only.
Introduction
This International Standard specifies a repertoire of graphic characters and their coded representations, for use
in text communication.
Although, in general, text (see 4.16) consists of characters and pictures, this International Standard applies only
to text made up of characters.
The specifications are based on 8-bit coding; Annex A specifies the 7-bit code for the character set of this
International Standard.
Other annexes include:
a) a description of the method used to define a short identifier for each character specified in this International
Standard (Annex B);
b) a summary of the use of non-spacing diacritical marks in combination with letters of the basic Latin alphabetic
characters (Annex C);
c) a summary of the use of Latin alphabetic characters in various languages (Annex D);
d) an alternative coded representation of the repertoire with no non-spacing diacritical marks (Annex E);
e) a summary of differences between the 1994 (second) edition of ISO/IEC 6937, and the present (third) edition
of this International Standard (Annex F);
f) a bibliography.
INTERNATIONAL STANDARD ISO/IEC 6937:2001(E)
Information technology — Coded graphic character set for text
communication — Latin alphabet
1 Scope
This International Standard
a) specifies the coded representation of the characters;
b) specifies a repertoire of the Latin alphabetic and non-alphabetic characters for the communication of text in
many European languages using the Latin script;
c) specifies rules for the definitions and use of graphic character subrepertoires, i.e. subsets of the specified
character repertoire.
2 Conformance and implementation
2.1 Conformance
2.1.1 Conformance of information interchange
A coded-character-data-element (CC-data-element) within coded information for interchange is in conformance with
this International Standard if all coded representations of characters within that CC-data-element conform to the
mandatory requirements of this International Standard.
A claim of conformance shall identify:
- the subrepertoire in accordance with clause 9, if one has been adopted,
- the 7-bit coding in accordance with Annex A, if it has been adopted.
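As an illustration only, a claim of conformance can be modelled as a small record whose entries are present only when the corresponding option has been adopted. The class name `ConformanceClaim`, its field names and the sample subrepertoire label are hypothetical; this International Standard does not define any such data structure.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class ConformanceClaim:
    """Hypothetical record of a claim of conformance (see 2.1.1)."""

    # Identifier of the adopted subrepertoire (clause 9); None if none was adopted.
    subrepertoire: Optional[str] = None
    # True only if the 7-bit coding of Annex A has been adopted.
    seven_bit_coding: bool = False

    def items_to_identify(self) -> List[str]:
        """List what the claim has to identify, given the adopted options."""
        items = []
        if self.subrepertoire is not None:
            items.append(f"subrepertoire: {self.subrepertoire}")
        if self.seven_bit_coding:
            items.append("7-bit coding (Annex A)")
        return items


# "example-subset" is a made-up label, not a registered subrepertoire.
claim = ConformanceClaim(subrepertoire="example-subset", seven_bit_coding=True)
print(claim.items_to_identify())
```

A claim with neither option adopted yields an empty list, which reflects the "if ... has been adopted" wording of both items.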
2.1.2 Conformance of devices
A device is in conformance with this International Standard if it conforms to the requirements of 2.1.2.1 and either
or both 2.1.2.2 and 2.1.2.3 below.
2.1.2.1 Device description
A device that conforms to this International Standard shall be the subject of a description that identifies the means
by which the user may supply characters to the device, or may recognize them when they are made available to
the user, as specified respectively in 2.1.2.2 and 2.1.2.3 below.
2.1.2.2 Originating devices
An originating device shall allow its user to supply any sequence of characters of the character repertoire, and shall
be capable of transmitting their coded representations within a CC-data-element.
2.1.2.3 Receiving devices
A receiving device shall be capable of receiving and interpreting any coded representation of characters that are
within a CC-data-element, and that conform to 2.1.1 of this International Standard, and shall make the
corresponding characters available to its user in such a way that the user can identify them among those of the
repertoire, and can distinguish them from each other.
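The device conformance rule of 2.1.2 (2.1.2.1 always, plus 2.1.2.2, 2.1.2.3 or both) reduces to a simple boolean condition. The following is a minimal sketch; the function and parameter names are hypothetical and are not part of this International Standard.

```python
def device_conforms(has_description: bool,
                    meets_originating: bool,
                    meets_receiving: bool) -> bool:
    """Sketch of 2.1.2: 2.1.2.1 is required, plus 2.1.2.2, 2.1.2.3 or both."""
    return has_description and (meets_originating or meets_receiving)


# A receive-only device with a proper device description conforms.
print(device_conforms(True, False, True))   # True
# A device without the description of 2.1.2.1 does not, whatever else it does.
print(device_conforms(False, True, True))   # False
```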
2.2 Implementation
The use of this character set requires definitions of its implementation in various media. For example, these could
include magnetic and optical interchangeable media and transmission channels, thus permitting interchange of data
to take place either indirectly by means of an intermediate recording on a physical medium, or by local connection
of various units (such as input and output devices and computers) or by means of data transmission equipment.
The implementation of this coded character set in physical media and for transmission, taking into account the need
for error checking, may be the subject of other International Standards.
3 Normative references
The following normative documents contain provisions which, through reference in this text, constitute provisions of
this International Standard. For dated references, subsequent amendments to, or revisions of, any of these
publications do not apply. However, parties to agreements based on this International Standard are encouraged to
investigate the possibility of applying the most recent editions of the normative documents indicated below. For
undated references, the latest edition of the normative document referred to applies. Members of ISO and IEC
maintain registers of currently valid International Standards.
ISO/IEC 2022:1994, Information technology - Character code structure and extension techniques
ISO 2375:1985, Data processing - Procedure for registration of escape sequences
ISO/IEC 7350:1991, Information technology - Registration of repertoires of graphic characters from
ISO/IEC 10367
ISO/IEC 10367:1991, Information technology - Standardized coded graphic character sets for use in 8-bit
codes
ISO/IEC 10538:1991, Information technology - Control functions for text communication
ISO/IEC 10646-1:2000, Information technology - Universal Multiple-Octet Coded Character Set (UCS) - Part 1:
Architecture and Basic Multilingual Plane
4 Terms and definitions
For the purposes of this International Standard, the following terms and definitions apply:
4.1
active position
the character position which is to image the graphic symbol representing the next graphic character or relative
to which the next control function is to be executed
4.2
bit combination
an ordered set of bits used for the representation of characters
4.3
character
a member of a set of elements used for the organization, control or representation of data
4.4
character position
the portion of a display that is imaging or is capable of imaging a graphic symbol
4.5
coded-character-data-element (CC-data-element)
an element of interchanged information that is specified to consist of a sequence of coded representations of
characters, in accordance with one or more identified standards for coded character sets
NOTE 1: In a communication environment in accordance with the Reference Model for Open Systems Interconnection of ISO 7498, a
CC-data-element will form all or part of the information that corresponds to the Presentation-Protocol-Data-Unit (PPDU) defined in that
International Standard.
NOTE 2: When information interchange is accomplished by means of interchangeable media, a CC-data-element will form all or part of the
information that corresponds to the user data, and not that recorded during formatting and initialization.
4.6
coded character set; code
a set of unambiguous rules that establishes a character set and the one-to-one relationship between the characters
of the set and their bit combinations
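The one-to-one relationship in definition 4.6 can be checked mechanically for a table held in memory. The sketch below assumes a toy table that maps single bit combinations (as integers) to character names; the table contents and the function name `is_one_to_one` are illustrative and are not taken from the code table of this International Standard.

```python
from typing import Dict


def is_one_to_one(code_table: Dict[int, str]) -> bool:
    """Check that no character name appears under two bit combinations.

    A dict already guarantees one character per bit combination (key), so the
    only thing left to verify is that the characters (values) are distinct.
    """
    names = list(code_table.values())
    return len(names) == len(set(names))


# Toy tables for illustration only.
toy_code = {0x41: "LATIN CAPITAL LETTER A", 0x42: "LATIN CAPITAL LETTER B"}
ambiguous = {0x41: "LATIN CAPITAL LETTER A", 0x61: "LATIN CAPITAL LETTER A"}

print(is_one_to_one(toy_code))   # True
print(is_one_to_one(ambiguous))  # False
```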
4.7
code extension
the techniques for the encoding of characters that are not included in the character set of a given code
4.8
code table
a table showing the characters allocated to each bit combination in a code
4.9
control character
a control function the coded representation of which consists of a single bit combination
4.10
control function
an element of a character set that affects the recording, processing, transmission or interpretation of data, and that
has a coded representation consisting of one or more bit combinations
4.11
device
a component of information processing equipment which can transmit, and/or receive, coded
information within CC-data-elements
NOTE: It may be an input/output device in the conventional sense, or a process such as an application program or gateway function.
4.12
escape sequence
a string of bit combinations that are used for control purposes in code extension procedures. The first of these bit
combinations represents the control function ESCAPE
NOTE: Formats and rules regarding the use of escape sequences are specified in ISO/IEC 2022.
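As a hedged illustration of definition 4.12, the sketch below recognizes an escape sequence at the start of a byte string. It assumes the structure usually attributed to ISO/IEC 2022, which is not reproduced in this text: the ESCAPE bit combination 01/11 (0x1B), then zero or more intermediate bytes in the range 02/00 to 02/15 (0x20 to 0x2F), then one final byte in the range 03/00 to 07/14 (0x30 to 0x7E). The function name `read_escape_sequence` is hypothetical.

```python
from typing import Optional

ESC = 0x1B  # bit combination 01/11, the ESCAPE control function


def read_escape_sequence(data: bytes) -> Optional[bytes]:
    """Return the escape sequence at the start of data, or None if there is none.

    Assumed layout (per ISO/IEC 2022, not stated in this text):
    ESC, intermediate bytes 0x20-0x2F (optional), one final byte 0x30-0x7E.
    """
    if not data or data[0] != ESC:
        return None
    i = 1
    # Skip any intermediate bytes.
    while i < len(data) and 0x20 <= data[i] <= 0x2F:
        i += 1
    # A complete sequence must end with a final byte.
    if i < len(data) and 0x30 <= data[i] <= 0x7E:
        return data[: i + 1]
    return None


print(read_escape_sequence(b"\x1b\x28\x42abc"))  # b'\x1b(B'
print(read_escape_sequence(b"abc"))              # None
```

Returning `None` for a lone ESC treats an incomplete sequence as "not yet an escape sequence"; a streaming decoder would more likely buffer the bytes and wait for more input.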
4.13
graphic character
a character, other than a control function, that has a visual representation normally handwritten, printed or
displayed, and that has a coded representation consisting of one or more bit combinations
4.14
graphic symbol
a visual representation of a graphic character or of a control function
4.15
repertoire
a specified set of characters that are represented by one or more bit combinations of a coded character set
4.16
text
a representation of information for human comprehension that is intended for presentation in a two-dimensional
form, for example printed on paper or displayed on a screen.
Text consists of symbols, phrases or sentences in natural or artificial languages, pictures, diagrams and tables.
NOTE: This International Standard applies only to text made up of characters.
4.17
text communication; communication of text
the transfer of text by means of telecommunications
NOTE: In the context of this International Standard, text communication is by means of binary-coded representations of character
...