Serveur d'exploration sur la recherche en informatique en Lorraine

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

MEDIA: a semantically annotated corpus of task oriented dialogs in French: Results of the French MEDIA evaluation campaign

Identifieur interne : 003C72 ( Main/Merge ); précédent : 003C71; suivant : 003C73

MEDIA: a semantically annotated corpus of task oriented dialogs in French: Results of the French MEDIA evaluation campaign

Auteurs : Hélène Bonneau-Maynard [France] ; Matthieu Quignard [France] ; Alexandre Denis [France]

Source :

RBID : Francis:10-0023530

Descripteurs français

English descriptors

Abstract

The aim of the French MEDIA project was to define a protocol for the evaluation of speech understanding modules for dialog systems. Accordingly, a corpus of 1,257 real spoken dialogs related to hotel reservation and tourist information was recorded, transcribed and semantically annotated, and a semantic attribute-value representation was defined in which each conceptual relationship was represented by the names of the attributes. Two semantic annotation levels are distinguished in this approach. At the first level, each utterance is considered separately and the annotation represents the meaning of the statement without taking into account the dialog context. The second level of annotation then corresponds to the interpretation of the meaning of the statement by taking into account the dialog context; in this way a semantic representation of the dialog context is defined. This paper discusses the data collection, the detailed definition of both annotation levels, and the annotation scheme. Then the paper comments on both evaluation campaigns which were carried out during the project and discusses some results.

Links toward previous steps (curation, corpus...)


Links to Exploration step

Francis:10-0023530

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">MEDIA: a semantically annotated corpus of task oriented dialogs in French: Results of the French MEDIA evaluation campaign</title>
<author>
<name sortKey="Bonneau Maynard, Helene" sort="Bonneau Maynard, Helene" uniqKey="Bonneau Maynard H" first="Hélène" last="Bonneau-Maynard">Hélène Bonneau-Maynard</name>
<affiliation wicri:level="3">
<inist:fA14 i1="01">
<s1>LIMSI-CNRS, Université Paris-Sud 11, Bât. 508, BP 133</s1>
<s2>91403 Orsay</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>France</country>
<placeName>
<region type="region" nuts="2">Île-de-France</region>
<settlement type="city">Orsay</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Quignard, Matthieu" sort="Quignard, Matthieu" uniqKey="Quignard M" first="Matthieu" last="Quignard">Matthieu Quignard</name>
<affiliation wicri:level="3">
<inist:fA14 i1="02">
<s1>LORIA, Campus Scientifique, BP 239</s1>
<s2>54506 Vandoeuvre-les-Nancy</s2>
<s3>FRA</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>France</country>
<placeName>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Vandœuvre-lès-Nancy</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Denis, Alexandre" sort="Denis, Alexandre" uniqKey="Denis A" first="Alexandre" last="Denis">Alexandre Denis</name>
<affiliation wicri:level="3">
<inist:fA14 i1="02">
<s1>LORIA, Campus Scientifique, BP 239</s1>
<s2>54506 Vandoeuvre-les-Nancy</s2>
<s3>FRA</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>France</country>
<placeName>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Vandœuvre-lès-Nancy</settlement>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">10-0023530</idno>
<date when="2009">2009</date>
<idno type="stanalyst">FRANCIS 10-0023530 INIST</idno>
<idno type="RBID">Francis:10-0023530</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000255</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000772</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000238</idno>
<idno type="wicri:explorRef" wicri:stream="PascalFrancis" wicri:step="Checkpoint">000238</idno>
<idno type="wicri:doubleKey">1574-020X:2009:Bonneau Maynard H:media:a:semantically</idno>
<idno type="wicri:Area/Main/Merge">003C72</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">MEDIA: a semantically annotated corpus of task oriented dialogs in French: Results of the French MEDIA evaluation campaign</title>
<author>
<name sortKey="Bonneau Maynard, Helene" sort="Bonneau Maynard, Helene" uniqKey="Bonneau Maynard H" first="Hélène" last="Bonneau-Maynard">Hélène Bonneau-Maynard</name>
<affiliation wicri:level="3">
<inist:fA14 i1="01">
<s1>LIMSI-CNRS, Université Paris-Sud 11, Bât. 508, BP 133</s1>
<s2>91403 Orsay</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>France</country>
<placeName>
<region type="region" nuts="2">Île-de-France</region>
<settlement type="city">Orsay</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Quignard, Matthieu" sort="Quignard, Matthieu" uniqKey="Quignard M" first="Matthieu" last="Quignard">Matthieu Quignard</name>
<affiliation wicri:level="3">
<inist:fA14 i1="02">
<s1>LORIA, Campus Scientifique, BP 239</s1>
<s2>54506 Vandoeuvre-les-Nancy</s2>
<s3>FRA</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>France</country>
<placeName>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Vandœuvre-lès-Nancy</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Denis, Alexandre" sort="Denis, Alexandre" uniqKey="Denis A" first="Alexandre" last="Denis">Alexandre Denis</name>
<affiliation wicri:level="3">
<inist:fA14 i1="02">
<s1>LORIA, Campus Scientifique, BP 239</s1>
<s2>54506 Vandoeuvre-les-Nancy</s2>
<s3>FRA</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>France</country>
<placeName>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Vandœuvre-lès-Nancy</settlement>
</placeName>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">Language resources and evaluation </title>
<idno type="ISSN">1574-020X</idno>
<imprint>
<date when="2009">2009</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Language resources and evaluation </title>
<idno type="ISSN">1574-020X</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Assessment</term>
<term>Computational linguistics</term>
<term>Corpus annotation</term>
<term>French</term>
<term>Speech processing</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Evaluation</term>
<term>Annotation de corpus</term>
<term>Traitement automatique de la parole</term>
<term>Linguistique informatique</term>
<term>Français</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">The aim of the French MEDIA project was to define a protocol for the evaluation of speech understanding modules for dialog systems. Accordingly, a corpus of 1,257 real spoken dialogs related to hotel reservation and tourist information was recorded, transcribed and semantically annotated, and a semantic attribute-value representation was defined in which each conceptual relationship was represented by the names of the attributes. Two semantic annotation levels are distinguished in this approach. At the first level, each utterance is considered separately and the annotation represents the meaning of the statement without taking into account the dialog context. The second level of annotation then corresponds to the interpretation of the meaning of the statement by taking into account the dialog context; in this way a semantic representation of the dialog context is defined. This paper discusses the data collection, the detailed definition of both annotation levels, and the annotation scheme. Then the paper comments on both evaluation campaigns which were carried out during the project and discusses some results.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>France</li>
</country>
<region>
<li>Grand Est</li>
<li>Lorraine (région)</li>
<li>Île-de-France</li>
</region>
<settlement>
<li>Orsay</li>
<li>Vandœuvre-lès-Nancy</li>
</settlement>
</list>
<tree>
<country name="France">
<region name="Île-de-France">
<name sortKey="Bonneau Maynard, Helene" sort="Bonneau Maynard, Helene" uniqKey="Bonneau Maynard H" first="Hélène" last="Bonneau-Maynard">Hélène Bonneau-Maynard</name>
</region>
<name sortKey="Denis, Alexandre" sort="Denis, Alexandre" uniqKey="Denis A" first="Alexandre" last="Denis">Alexandre Denis</name>
<name sortKey="Quignard, Matthieu" sort="Quignard, Matthieu" uniqKey="Quignard M" first="Matthieu" last="Quignard">Matthieu Quignard</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 003C72 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Merge/biblio.hfd -nk 003C72 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    Main
   |étape=   Merge
   |type=    RBID
   |clé=     Francis:10-0023530
   |texte=   MEDIA: a semantically annotated corpus of task oriented dialogs in French: Results of the French MEDIA evaluation campaign
}}

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022