SIST ISO 24617-2:2013
(Main)Language resource management -- Semantic annotation framework (SemAF) -- Part 2: Dialogue acts
Language resource management -- Semantic annotation framework (SemAF) -- Part 2: Dialogue acts
ISO 24617-2:2012 provides a set of empirically and theoretically well-motivated concepts for dialogue annotation, a formal language for expressing dialogue annotations -- the dialogue act markup language (DiAML) -- and a method for segmenting a dialogue into semantic units. This allows the manual or automatic annotation of dialogue segments with information about the communicative actions which the participants perform by their contributions to the dialogue. It supports multidimensional annotation, in which units in dialogue are viewed as having multiple communicative functions. The DiAML language has an XML-based representation format and a formal semantics which makes it possible to apply inference to DiAML representations.
ISO 24617-2:2012 specifies data categories for reference sets of communicative functions and dimensions of dialogue analysis and provides principles and guidelines for extending these sets or selecting coherent subsets of them. Additionally, it provides guidelines for annotators and annotated examples. It is applicable to spoken, written and multimodal dialogues involving two or more participants.
Gestion des ressources langagières -- Cadre d'annotation sémantique (SemAF) -- Partie 2: Actes de dialogue
Upravljanje z jezikovnimi viri - Ogrodje za semantično označevanje (SemAF) - 2. del: Dialogi
Ta del standarda ISO 24617 podaja sklop empirično in teoretično utemeljenih pojmov za označevanje dialoga, uradni jezik za izražanje označevanja dialoga – jezik za označevanje dialoga (DiAML) – in metodo za segmentiranje dialoga v semantične enote. To omogoča ročno ali samodejno označevanje segmentov dialoga z informacijami o komunikacijskih dejanjih, ki jih izvedejo udeleženci, ki sodelujejo v dialogu. Podpira multidimenzionalno označevanje, pri katerem se enote dialoga obravnavajo, kot da imajo več komunikacijskih funkcij. Jezik DiAML ima format predstavitve, ki temelji na XML in formalni semantiki, kar omogoča sklepanje na podlagi predstavitev DiAML. Ta del standarda ISO 24617 določa podatkovne kategorije za referenčne sklope komunikacijskih funkcij in dimenzij analize dialoga ter zagotavlja načela in smernice za razširitev teh sklopov ali izbiro skladnih podsklopov. Zagotavlja tudi smernice za označevalce in primere označevanja. Uporablja se lahko za govorjene, zapisane in multimodalne dialoge, ki vključujejo dva ali več udeležencev.
General Information
Relations
Standards Content (Sample)
2003-01.Slovenski inštitut za standardizacijo. Razmnoževanje celote ali delov tega standarda ni dovoljeno.Gestion des ressources langagières -- Cadre d'annotation sémantique (SemAF) -- Partie 2: Actes de dialogueLanguage resource management -- Semantic annotation framework (SemAF) -- Part 2: Dialogue acts01.020NRRUGLQDFLMDTerminology (principles and coordination)ICS:Ta slovenski standard je istoveten z:ISO 24617-2:2012SIST ISO 24617-2:2013en,fr,de01-julij-2013SIST ISO 24617-2:2013SLOVENSKI
STANDARD
Reference numberISO 24617-2:2012(E)© ISO 2012
INTERNATIONAL STANDARD ISO24617-2First edition2012-09-01Language resource management — Semantic annotation framework (SemAF) — Part 2: Dialogue acts Gestion des ressources langagières — Cadre d'annotation sémantique (SemAF) — Partie 2: Actes de dialogue
©
ISO 2012 All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm, without permission in writing from either ISO at the address below or ISO's member body in the country of the requester. ISO copyright office Case postale 56 CH-1211 Geneva 20 Tel.
+ 41 22 749 01 11 Fax
+ 41 22 749 09 47 E-mail
copyright@iso.org Web
www.iso.org Published in Switzerland
ii
iii Contents Page Foreword . iv 1 Scope . 1 2 Normative references . 1 3 Terms and definitions . 1 4 Purpose and justification . 5 5 Basic concepts and metamodel . 6 6 Definition of communicative functions . 8 7 Annotation schemes . 9 7.1 Structure of annotation schemes . 9 7.2 Multidimensionality and multifunctionality . 10 7.3 Multidimensionality, clustering and dimensions . 11 7.4 Dimension- specific and general-purpose functions . 11 8 Dialogue segmentation . 13 9 Dimensions . 14 9.1 Task. 15 9.2 Auto-Feedback . 15 9.3 Allo-Feedback . 15 9.4 Turn Management . 15 9.5 Time Management . 16 9.6 Discourse Structuring . 16 9.7 Social Obligations Management . 16 9.8 Own Communication Management . 16 9.9 Partner Communication Management . 16 10 Core dialogue acts . 17 10.1 General-purpose functions . 19 10.2 Dimension-specific functions . 20 10.3 Function qualifiers . 22 11 Dialogue act markup language (DiAML) . 23 11.1 Abstract syntax . 23 11.2 Concrete syntax . 24 12 Principles for extending and restricting the standard . 25 12.1 Main design principles . 25 12.2 Schema extension . 27 12.3 Scheme restriction . 27 Annex A (informative)
Annotation guidelines . 29 Annex B (informative)
Annotated dialogue examples . 43 Annex C (normative)
Formal definition of DiAML . 56 Annex D (normative)
DiAML technical schema . 63 Annex E (normative)
Data categories for core concepts . 68 Annex F (informative)
Examples of possible additional data categories . 88 Annex G (informative)
Concepts in existing schemes . 90 Bibliography . 100 SIST ISO 24617-2:2013
INTERNATIONAL STANDARD ISO 24617-2:2012(E) © ISO 2012 – All rights reserved 1
Language resource management — Semantic annotation framework (SemAF) — Part 2: Dialogue acts 1 Scope This part of ISO 24617 provides a set of empirically and theoretically well-motivated concepts for dialogue annotation, a formal language for expressing dialogue annotations — the dialogue act markup language (DiAML) — and a method for segmenting a dialogue into semantic units. This allows the manual or automatic annotation of dialogue segments with information about the communicative actions which the participants perform by their contributions to the dialogue. It supports multidimensional annotation, in which units in dialogue are viewed as having multiple communicative functions. The DiAML language has an XML-based representation format and a formal semantics which makes it possible to apply inference to DiAML representations.
This part of ISO 24617 specifies data categories for reference sets of communicative functions and dimensions of dialogue analysis and provides principles and guidelines for extending these sets or selecting coherent subsets of them. Additionally, it provides guidelines for annotators and annotated examples. It is applicable to spoken, written and multimodal dialogues involving two or more participants. 2 Normative references The following referenced documents are indispensable for the application of this document. For dated references, only the edition cited applies. For undated references, the latest edition of the referenced document (including any amendments) applies. ISO 12620:2009, Terminology and other language resources — Specification of data categories and management of a Data Category Registry for language resources ISO 24610-1:2006, Language resource management — Feature structures — Part 1: Feature structure representation ISO 24612:2011, Language resource management — Linguistic annotation framework 3 Terms and definitions For the purposes of this document, the following terms and definitions apply.1) 1) In this document, “he”, “him” and “his” are used in a generic sense, without implying any gender-related distinctions. SIST ISO 24617-2:2013
3.2 allo-feedback act feedback act (3.8) where the sender (3.18) elicits information about the addressee's (3.1) processing of an utterance (3.22) that the sender contributed to the dialogue (3.5) or where the sender provides information about his perceived processing by the addressee of an utterance that the sender contributed to the dialogue before EXAMPLE A: Now move up.
B: Slightly northeast you mean?
A: Slightly yeah.
A performs an allo-feedback act signalling that he thinks B understood his first utterance correctly. 3.3 auto-feedback act feedback act (3.8) where the sender (3.18) provides information about his own processing of an utterance (3.22) contributed to the dialogue (3.5) by another participant (3.13) EXAMPLE B's utterance in the example dialogue fragment in (3.2) signals that he is uncertain whether he understood the previous utterance correctly. 3.4 communicative function property of certain stretches of communicative behaviour, describing how the behaviour changes the information state (3.12) of an understander of the behaviour Note to entry: A communicative function may be “qualified”, i.e. one or more qualifiers (3.14) may be associated with it. For example, an answer may be qualified as “uncertain” and the acceptance of a request may be “conditional”. See 10.3 for explanation and examples. 3.5 dialogue exchange of utterances (3.22) between two or more persons or artificial conversational systems 3.6 dialogue act communicative activity of a dialogue (3.5) participant (3.13), interpreted as having a certain communicative function (3.4) and semantic content (3.16) Note to entry: A dialogue act may also have certain functional dependence relations (3.10), rhetorical relations (3.15) and feedback dependence relations (3.9) with other units in a dialogue (3.5). 3.7 dimension class of dialogue acts (3.6) that are concerned with a particular aspect of communication, corresponding to a particular category of semantic content SIST ISO 24617-2:2013
...
SLOVENSKI STANDARD
01-julij-2013
Upravljanje z jezikovnimi viri - Ogrodje za semantično označevanje (SemAF) - 2.
del: Dialogi
Language resource management -- Semantic annotation framework (SemAF) -- Part 2:
Dialogue acts
Gestion des ressources langagières -- Cadre d'annotation sémantique (SemAF) -- Partie
2: Actes de dialogue
Ta slovenski standard je istoveten z: ISO 24617-2:2012
ICS:
01.020 Terminologija (načela in Terminology (principles and
koordinacija) coordination)
35.240.30 Uporabniške rešitve IT v IT applications in information,
informatiki, dokumentiranju in documentation and
založništvu publishing
2003-01.Slovenski inštitut za standardizacijo. Razmnoževanje celote ali delov tega standarda ni dovoljeno.
INTERNATIONAL ISO
STANDARD 24617-2
First edition
2012-09-01
Language resource management —
Semantic annotation framework
(SemAF) —
Part 2:
Dialogue acts
Gestion des ressources langagières — Cadre d'annotation sémantique
(SemAF) —
Partie 2: Actes de dialogue
Reference number
©
ISO 2012
© ISO 2012
All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized in any form or by any means,
electronic or mechanical, including photocopying and microfilm, without permission in writing from either ISO at the address below or
ISO's member body in the country of the requester.
ISO copyright office
Case postale 56 CH-1211 Geneva 20
Tel. + 41 22 749 01 11
Fax + 41 22 749 09 47
E-mail copyright@iso.org
Web www.iso.org
Published in Switzerland
ii © ISO 2012 – All rights reserved
Contents Page
Foreword . iv
1 Scope . 1
2 Normative references . 1
3 Terms and definitions . 1
4 Purpose and justification . 5
5 Basic concepts and metamodel . 6
6 Definition of communicative functions . 8
7 Annotation schemes . 9
7.1 Structure of annotation schemes . 9
7.2 Multidimensionality and multifunctionality . 10
7.3 Multidimensionality, clustering and dimensions . 11
7.4 Dimension- specific and general-purpose functions . 11
8 Dialogue segmentation . 13
9 Dimensions . 14
9.1 Task. 15
9.2 Auto-Feedback . 15
9.3 Allo-Feedback . 15
9.4 Turn Management . 15
9.5 Time Management . 16
9.6 Discourse Structuring . 16
9.7 Social Obligations Management . 16
9.8 Own Communication Management . 16
9.9 Partner Communication Management . 16
10 Core dialogue acts . 17
10.1 General-purpose functions . 19
10.2 Dimension-specific functions . 20
10.3 Function qualifiers . 22
11 Dialogue act markup language (DiAML) . 23
11.1 Abstract syntax . 23
11.2 Concrete syntax . 24
12 Principles for extending and restricting the standard . 25
12.1 Main design principles . 25
12.2 Schema extension . 27
12.3 Scheme restriction . 27
Annex A (informative) Annotation guidelines . 29
Annex B (informative) Annotated dialogue examples . 43
Annex C (normative) Formal definition of DiAML . 56
Annex D (normative) DiAML technical schema . 63
Annex E (normative) Data categories for core concepts . 68
Annex F (informative) Examples of possible additional data categories . 88
Annex G (informative) Concepts in existing schemes . 90
Bibliography . 100
Foreword
ISO (the International Organization for Standardization) is a worldwide federation of national standards bodies
(ISO member bodies). The work of preparing International Standards is normally carried out through ISO
technical committees. Each member body interested in a subject for which a technical committee has been
established has the right to be represented on that committee. International organizations, governmental and
non-governmental, in liaison with ISO, also take part in the work. ISO collaborates closely with the
International Electrotechnical Commission (IEC) on all matters of electrotechnical standardization.
International Standards are drafted in accordance with the rules given in the ISO/IEC Directives, Part 2.
The main task of technical committees is to prepare International Standards. Draft International Standards
adopted by the technical committees are circulated to the member bodies for voting. Publication as an
International Standard requires approval by at least 75 % of the member bodies casting a vote.
Attention is drawn to the possibility that some of the elements of this document may be the subject of patent
rights. ISO shall not be held responsible for identifying any or all such patent rights.
ISO 24617-2 was prepared by Technical Committee ISO/TC 37, Terminology and other language and content
resources, Subcommittee SC 4, Language resource management.
ISO 24617 consists of the following parts, under the general title: Language resource management —
Semantic annotation framework:
Part 1: Time and events (SemAF-Time, ISO-TimeML)
Part 2: Dialogue acts
The following parts are under preparation:
Part 3: Named entities (SemAF-NE)
Part 4: Semantic roles (SemAF-SRL)
Part 5: Discourse structure (SemAF-DS)
Part 6: Principles of semantic annotation (SemAF-Basics)
Part 7: Spatial information (ISO-Space)
Part 8: Semantic relations in discourse (SemAF-DRel)
iv © ISO 2012 – All rights reserved
INTERNATIONAL STANDARD ISO 24617-2:2012(E)
Language resource management — Semantic annotation
framework (SemAF) —
Part 2:
Dialogue acts
1 Scope
This part of ISO 24617 provides a set of empirically and theoretically well-motivated concepts for dialogue
annotation, a formal language for expressing dialogue annotations — the dialogue act markup language
(DiAML) — and a method for segmenting a dialogue into semantic units. This allows the manual or automatic
annotation of dialogue segments with information about the communicative actions which the participants
perform by their contributions to the dialogue. It supports multidimensional annotation, in which units in
dialogue are viewed as having multiple communicative functions. The DiAML language has an XML-based
representation format and a formal semantics which makes it possible to apply inference to DiAML
representations.
This part of ISO 24617 specifies data categories for reference sets of communicative functions and
dimensions of dialogue analysis and provides principles and guidelines for extending these sets or selecting
coherent subsets of them. Additionally, it provides guidelines for annotators and annotated examples. It is
applicable to spoken, written and multimodal dialogues involving two or more participants.
2 Normative references
The following referenced documents are indispensable for the application of this document. For dated
references, only the edition cited applies. For undated references, the latest edition of the referenced
document (including any amendments) applies.
ISO 12620:2009, Terminology and other language resources — Specification of data categories and
management of a Data Category Registry for language resources
ISO 24610-1:2006, Language resource management — Feature structures — Part 1: Feature structure
representation
ISO 24612:2011, Language resource management — Linguistic annotation framework
3 Terms and definitions
1)
For the purposes of this document, the following terms and definitions apply.
1) In this document, “he”, “him” and “his” are used in a generic sense, without implying any gender-related distinctions.
3.1
addressee
dialogue (3.5) participant (3.13) oriented to by the sender (3.18) in a manner to suggest that his utterances
(3.22) are particularly intended for this participant and that some response is therefore anticipated from this
participant, more so than from the other participants
Note to entry: This definition is a de facto standard in the linguistics literature. It has been slightly modified here, in
replacing “speaker” by “sender” and avoiding the use of ambiguous pronouns. Goffman's original definition says: “dialogue
participant oriented to by the speaker in a manner to suggest that his utterances are particularly intended for him and that
some response is therefore anticipated from him/her, more so than from the other participants”.
[SOURCE: Goffman (1981).]
3.2
allo-feedback act
feedback act (3.8) where the sender (3.18) elicits information about the addressee's (3.1) processing of an
utterance (3.22) that the sender contributed to the dialogue (3.5) or where the sender provides information
about his perceived processing by the addressee of an utterance that the sender contributed to the dialogue
before
EXAMPLE A: Now move up.
B: Slightly northeast you mean?
A: Slightly yeah.
A performs an allo-feedback act signalling that he thinks B understood his first utterance correctly.
3.3
auto-feedback act
feedback act (3.8) where the sender (3.18) provides information about his own processing of an utterance
(3.22) contributed to the dialogue (3.5) by another participant (3.13)
EXAMPLE B's utterance in the example dialogue fragment in (3.2) signals that he is uncertain whether he
understood the previous utterance correctly.
3.4
communicative function
property of certain stretches of communicative behaviour, describing how the behaviour changes the
information state (3.12) of an understander of the behaviour
Note to entry: A communicative function may be “qualified”, i.e. one or more qualifiers (3.14) may be associated with it.
For example, an answer may be qualified as “uncertain” and the acceptance of a request may be “conditional”. See 10.3
...
SLOVENSKI STANDARD
01-julij-2013
Upravljanje z jezikovnimi viri - Ogrodje za semantično označevanje (SemAF) - 2.
del: Dialogi
Language resource management -- Semantic annotation framework (SemAF) -- Part 2:
Dialogue acts
Gestion des ressources langagières -- Cadre d'annotation sémantique (SemAF) -- Partie
2: Actes de dialogue
Ta slovenski standard je istoveten z: ISO 24617-2:2012
ICS:
01.020 Terminologija (načela in Terminology (principles and
koordinacija) coordination)
01.140.20 Informacijske vede Information sciences
35.240.30 Uporabniške rešitve IT v IT applications in information,
informatiki, dokumentiranju in documentation and
založništvu publishing
2003-01.Slovenski inštitut za standardizacijo. Razmnoževanje celote ali delov tega standarda ni dovoljeno.
INTERNATIONAL ISO
STANDARD 24617-2
First edition
2012-09-01
Language resource management —
Semantic annotation framework
(SemAF) —
Part 2:
Dialogue acts
Gestion des ressources langagières — Cadre d'annotation sémantique
(SemAF) —
Partie 2: Actes de dialogue
Reference number
©
ISO 2012
© ISO 2012
All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized in any form or by any means,
electronic or mechanical, including photocopying and microfilm, without permission in writing from either ISO at the address below or
ISO's member body in the country of the requester.
ISO copyright office
Case postale 56 CH-1211 Geneva 20
Tel. + 41 22 749 01 11
Fax + 41 22 749 09 47
E-mail copyright@iso.org
Web www.iso.org
Published in Switzerland
ii © ISO 2012 – All rights reserved
Contents Page
Foreword . iv
1 Scope . 1
2 Normative references . 1
3 Terms and definitions . 1
4 Purpose and justification . 5
5 Basic concepts and metamodel . 6
6 Definition of communicative functions . 8
7 Annotation schemes . 9
7.1 Structure of annotation schemes . 9
7.2 Multidimensionality and multifunctionality . 10
7.3 Multidimensionality, clustering and dimensions . 11
7.4 Dimension- specific and general-purpose functions . 11
8 Dialogue segmentation . 13
9 Dimensions . 14
9.1 Task. 15
9.2 Auto-Feedback . 15
9.3 Allo-Feedback . 15
9.4 Turn Management . 15
9.5 Time Management . 16
9.6 Discourse Structuring . 16
9.7 Social Obligations Management . 16
9.8 Own Communication Management . 16
9.9 Partner Communication Management . 16
10 Core dialogue acts . 17
10.1 General-purpose functions . 19
10.2 Dimension-specific functions . 20
10.3 Function qualifiers . 22
11 Dialogue act markup language (DiAML) . 23
11.1 Abstract syntax . 23
11.2 Concrete syntax . 24
12 Principles for extending and restricting the standard . 25
12.1 Main design principles . 25
12.2 Schema extension . 27
12.3 Scheme restriction . 27
Annex A (informative) Annotation guidelines . 29
Annex B (informative) Annotated dialogue examples . 43
Annex C (normative) Formal definition of DiAML . 56
Annex D (normative) DiAML technical schema . 63
Annex E (normative) Data categories for core concepts . 68
Annex F (informative) Examples of possible additional data categories . 88
Annex G (informative) Concepts in existing schemes . 90
Bibliography . 100
Foreword
ISO (the International Organization for Standardization) is a worldwide federation of national standards bodies
(ISO member bodies). The work of preparing International Standards is normally carried out through ISO
technical committees. Each member body interested in a subject for which a technical committee has been
established has the right to be represented on that committee. International organizations, governmental and
non-governmental, in liaison with ISO, also take part in the work. ISO collaborates closely with the
International Electrotechnical Commission (IEC) on all matters of electrotechnical standardization.
International Standards are drafted in accordance with the rules given in the ISO/IEC Directives, Part 2.
The main task of technical committees is to prepare International Standards. Draft International Standards
adopted by the technical committees are circulated to the member bodies for voting. Publication as an
International Standard requires approval by at least 75 % of the member bodies casting a vote.
Attention is drawn to the possibility that some of the elements of this document may be the subject of patent
rights. ISO shall not be held responsible for identifying any or all such patent rights.
ISO 24617-2 was prepared by Technical Committee ISO/TC 37, Terminology and other language and content
resources, Subcommittee SC 4, Language resource management.
ISO 24617 consists of the following parts, under the general title: Language resource management —
Semantic annotation framework:
Part 1: Time and events (SemAF-Time, ISO-TimeML)
Part 2: Dialogue acts
The following parts are under preparation:
Part 3: Named entities (SemAF-NE)
Part 4: Semantic roles (SemAF-SRL)
Part 5: Discourse structure (SemAF-DS)
Part 6: Principles of semantic annotation (SemAF-Basics)
Part 7: Spatial information (ISO-Space)
Part 8: Semantic relations in discourse (SemAF-DRel)
iv © ISO 2012 – All rights reserved
INTERNATIONAL STANDARD ISO 24617-2:2012(E)
Language resource management — Semantic annotation
framework (SemAF) —
Part 2:
Dialogue acts
1 Scope
This part of ISO 24617 provides a set of empirically and theoretically well-motivated concepts for dialogue
annotation, a formal language for expressing dialogue annotations — the dialogue act markup language
(DiAML) — and a method for segmenting a dialogue into semantic units. This allows the manual or automatic
annotation of dialogue segments with information about the communicative actions which the participants
perform by their contributions to the dialogue. It supports multidimensional annotation, in which units in
dialogue are viewed as having multiple communicative functions. The DiAML language has an XML-based
representation format and a formal semantics which makes it possible to apply inference to DiAML
representations.
This part of ISO 24617 specifies data categories for reference sets of communicative functions and
dimensions of dialogue analysis and provides principles and guidelines for extending these sets or selecting
coherent subsets of them. Additionally, it provides guidelines for annotators and annotated examples. It is
applicable to spoken, written and multimodal dialogues involving two or more participants.
2 Normative references
The following referenced documents are indispensable for the application of this document. For dated
references, only the edition cited applies. For undated references, the latest edition of the referenced
document (including any amendments) applies.
ISO 12620:2009, Terminology and other language resources — Specification of data categories and
management of a Data Category Registry for language resources
ISO 24610-1:2006, Language resource management — Feature structures — Part 1: Feature structure
representation
ISO 24612:2011, Language resource management — Linguistic annotation framework
3 Terms and definitions
1)
For the purposes of this document, the following terms and definitions apply.
1) In this document, “he”, “him” and “his” are used in a generic sense, without implying any gender-related distinctions.
3.1
addressee
dialogue (3.5) participant (3.13) oriented to by the sender (3.18) in a manner to suggest that his utterances
(3.22) are particularly intended for this participant and that some response is therefore anticipated from this
participant, more so than from the other participants
Note to entry: This definition is a de facto standard in the linguistics literature. It has been slightly modified here, in
replacing “speaker” by “sender” and avoiding the use of ambiguous pronouns. Goffman's original definition says: “dialogue
participant oriented to by the speaker in a manner to suggest that his utterances are particularly intended for him and that
some response is therefore anticipated from him/her, more so than from the other participants”.
[SOURCE: Goffman (1981).]
3.2
allo-feedback act
feedback act (3.8) where the sender (3.18) elicits information about the addressee's (3.1) processing of an
utterance (3.22) that the sender contributed to the dialogue (3.5) or where the sender provides information
about his perceived processing by the addressee of an utterance that the sender contributed to the dialogue
before
EXAMPLE A: Now move up.
B: Slightly northeast you mean?
A: Slightly yeah.
A performs an allo-feedback act signalling that he thinks B understood his first utterance correctly.
3.3
auto-feedback act
feedback act (3.8) where the sender (3.18) provides information about his own processing of an utterance
(3.22) contributed to the dialogue (3.5) by another participant (3.13)
EXAMPLE B's utterance in the example dialogue fragment in (3.2) signals that he is uncertain whether he
understood the previous utterance correctly.
3.4
communicative function
property of certain stretches of communicative behaviour, describing how the behaviour changes the
information state (3.12) of an understander of the behaviour
Note to entry: A communicative function may be “qualified”, i.e. one or more qualifiers (3.14) may be associated with it.
For example, an answer may be qualified as “uncertain” and the accept
...
INTERNATIONAL ISO
STANDARD 24617-2
First edition
2012-09-01
Language resource management —
Semantic annotation framework
(SemAF) —
Part 2:
Dialogue acts
Gestion des ressources langagières — Cadre d'annotation sémantique
(SemAF) —
Partie 2: Actes de dialogue
Reference number
©
ISO 2012
© ISO 2012
All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized in any form or by any means,
electronic or mechanical, including photocopying and microfilm, without permission in writing from either ISO at the address below or
ISO's member body in the country of the requester.
ISO copyright office
Case postale 56 CH-1211 Geneva 20
Tel. + 41 22 749 01 11
Fax + 41 22 749 09 47
E-mail copyright@iso.org
Web www.iso.org
Published in Switzerland
ii © ISO 2012 – All rights reserved
Contents Page
Foreword . iv
1 Scope . 1
2 Normative references . 1
3 Terms and definitions . 1
4 Purpose and justification . 5
5 Basic concepts and metamodel . 6
6 Definition of communicative functions . 8
7 Annotation schemes . 9
7.1 Structure of annotation schemes . 9
7.2 Multidimensionality and multifunctionality . 10
7.3 Multidimensionality, clustering and dimensions . 11
7.4 Dimension- specific and general-purpose functions . 11
8 Dialogue segmentation . 13
9 Dimensions . 14
9.1 Task. 15
9.2 Auto-Feedback . 15
9.3 Allo-Feedback . 15
9.4 Turn Management . 15
9.5 Time Management . 16
9.6 Discourse Structuring . 16
9.7 Social Obligations Management . 16
9.8 Own Communication Management . 16
9.9 Partner Communication Management . 16
10 Core dialogue acts . 17
10.1 General-purpose functions . 19
10.2 Dimension-specific functions . 20
10.3 Function qualifiers . 22
11 Dialogue act markup language (DiAML) . 23
11.1 Abstract syntax . 23
11.2 Concrete syntax . 24
12 Principles for extending and restricting the standard . 25
12.1 Main design principles . 25
12.2 Schema extension . 27
12.3 Scheme restriction . 27
Annex A (informative) Annotation guidelines . 29
Annex B (informative) Annotated dialogue examples . 43
Annex C (normative) Formal definition of DiAML . 56
Annex D (normative) DiAML technical schema . 63
Annex E (normative) Data categories for core concepts . 68
Annex F (informative) Examples of possible additional data categories . 88
Annex G (informative) Concepts in existing schemes . 90
Bibliography . 100
Foreword
ISO (the International Organization for Standardization) is a worldwide federation of national standards bodies
(ISO member bodies). The work of preparing International Standards is normally carried out through ISO
technical committees. Each member body interested in a subject for which a technical committee has been
established has the right to be represented on that committee. International organizations, governmental and
non-governmental, in liaison with ISO, also take part in the work. ISO collaborates closely with the
International Electrotechnical Commission (IEC) on all matters of electrotechnical standardization.
International Standards are drafted in accordance with the rules given in the ISO/IEC Directives, Part 2.
The main task of technical committees is to prepare International Standards. Draft International Standards
adopted by the technical committees are circulated to the member bodies for voting. Publication as an
International Standard requires approval by at least 75 % of the member bodies casting a vote.
Attention is drawn to the possibility that some of the elements of this document may be the subject of patent
rights. ISO shall not be held responsible for identifying any or all such patent rights.
ISO 24617-2 was prepared by Technical Committee ISO/TC 37, Terminology and other language and content
resources, Subcommittee SC 4, Language resource management.
ISO 24617 consists of the following parts, under the general title: Language resource management —
Semantic annotation framework:
Part 1: Time and events (SemAF-Time, ISO-TimeML)
Part 2: Dialogue acts
The following parts are under preparation:
Part 3: Named entities (SemAF-NE)
Part 4: Semantic roles (SemAF-SRL)
Part 5: Discourse structure (SemAF-DS)
Part 6: Principles of semantic annotation (SemAF-Basics)
Part 7: Spatial information (ISO-Space)
Part 8: Semantic relations in discourse (SemAF-DRel)
iv © ISO 2012 – All rights reserved
INTERNATIONAL STANDARD ISO 24617-2:2012(E)
Language resource management — Semantic annotation
framework (SemAF) —
Part 2:
Dialogue acts
1 Scope
This part of ISO 24617 provides a set of empirically and theoretically well-motivated concepts for dialogue
annotation, a formal language for expressing dialogue annotations — the dialogue act markup language
(DiAML) — and a method for segmenting a dialogue into semantic units. This allows the manual or automatic
annotation of dialogue segments with information about the communicative actions which the participants
perform by their contributions to the dialogue. It supports multidimensional annotation, in which units in
dialogue are viewed as having multiple communicative functions. The DiAML language has an XML-based
representation format and a formal semantics which makes it possible to apply inference to DiAML
representations.
This part of ISO 24617 specifies data categories for reference sets of communicative functions and
dimensions of dialogue analysis and provides principles and guidelines for extending these sets or selecting
coherent subsets of them. Additionally, it provides guidelines for annotators and annotated examples. It is
applicable to spoken, written and multimodal dialogues involving two or more participants.
2 Normative references
The following referenced documents are indispensable for the application of this document. For dated
references, only the edition cited applies. For undated references, the latest edition of the referenced
document (including any amendments) applies.
ISO 12620:2009, Terminology and other language resources — Specification of data categories and
management of a Data Category Registry for language resources
ISO 24610-1:2006, Language resource management — Feature structures — Part 1: Feature structure
representation
ISO 24612:2011, Language resource management — Linguistic annotation framework
3 Terms and definitions
1)
For the purposes of this document, the following terms and definitions apply.
1) In this document, “he”, “him” and “his” are used in a generic sense, without implying any gender-related distinctions.
3.1
addressee
dialogue (3.5) participant (3.13) oriented to by the sender (3.18) in a manner to suggest that his utterances
(3.22) are particularly intended for this participant and that some response is therefore anticipated from this
participant, more so than from the other participants
Note to entry: This definition is a de facto standard in the linguistics literature. It has been slightly modified here, in
replacing “speaker” by “sender” and avoiding the use of ambiguous pronouns. Goffman's original definition says: “dialogue
participant oriented to by the speaker in a manner to suggest that his utterances are particularly intended for him and that
some response is therefore anticipated from him/her, more so than from the other participants”.
[SOURCE: Goffman (1981).]
3.2
allo-feedback act
feedback act (3.8) where the sender (3.18) elicits information about the addressee's (3.1) processing of an
utterance (3.22) that the sender contributed to the dialogue (3.5) or where the sender provides information
about his perceived processing by the addressee of an utterance that the sender contributed to the dialogue
before
EXAMPLE A: Now move up.
B: Slightly northeast you mean?
A: Slightly yeah.
A performs an allo-feedback act signalling that he thinks B understood his first utterance correctly.
3.3
auto-feedback act
feedback act (3.8) where the sender (3.18) provides information about his own processing of an utterance
(3.22) contributed to the dialogue (3.5) by another participant (3.13)
EXAMPLE B's utterance in the example dialogue fragment in (3.2) signals that he is uncertain whether he
understood the previous utterance correctly.
3.4
communicative function
property of certain stretches of communicative behaviour, describing how the behaviour changes the
information state (3.12) of an understander of the behaviour
Note to entry: A communicative function may be “qualified”, i.e. one or more qualifiers (3.14) may be associated with it.
For example, an answer may be qualified as “uncertain” and the acceptance of a request may be “conditional”. See 10.3
for explanation and examples.
3.5
dialogue
exchange of utterances (3.22) between two or more persons or artificial conversational systems
3.6
dialogue act
communicative activity of a dialogue (3.5) participant (3.13), interpreted as having a certain communicative
function (3.4) and semantic content (3.16)
Note to entry: A dialogue act may also have certain functional dependence relations (3.10), rhetorical relations (3.15) and
feedback dependence relations (3.9) with other units in a dialogue (3.5).
3.7
dimension
class of dialogue acts (3.6) that are concerned with a particular aspect of communication, corresponding to a
particular category of semantic content
2 © ISO 2012 – All rights reserved
EXAMPLE Dialogue acts advancing the task or activity that motivates the dialogue (the Task dimension), dialogue
acts providing and eliciting feedback (the Auto- and Allo-Feedback dimensions) and dialogue acts for allocating the
speaker role (the Turn Manag
...
Questions, Comments and Discussion
Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.