TeiVM2, Istex, Corpus, bibRecord, 000477

Transcriber: Development and use of a tool for assisting speech corpora production

Identifieur interne : 000477 ( Istex/Corpus ); précédent : 000476; suivant : 000478

Transcriber: Development and use of a tool for assisting speech corpora production

Auteurs : Claude Barras ; Edouard Geoffrois ; Zhibiao Wu ; Mark Liberman

Source :

Speech Communication [ 0167-6393 ] ; 2000.

RBID : ISTEX:451C86626179A17CCC13ACCCE699A1A2EBDFA996

Abstract

We present “Transcriber”, a tool for assisting in the creation of speech corpora, and describe some aspects of its development and use. Transcriber was designed for the manual segmentation and transcription of long duration broadcast news recordings, including annotation of speech turns, topics and acoustic conditions. It is highly portable, relying on the scripting language Tcl/Tk with extensions such as Snack for advanced audio functions and tcLex for lexical analysis, and has been tested on various Unix systems and Windows. The data format follows the XML standard with Unicode support for multilingual transcriptions. Distributed as free software in order to encourage the production of corpora, ease their sharing, increase user feedback and motivate software contributions, Transcriber has been in use for over a year in several countries. As a result of this collective experience, new requirements arose to support additional data formats, video control, and a better management of conversational speech. Using the annotation graphs framework recently formalized, adaptation of the tool towards new tasks and support of different data formats will become easier.

Url:

https://api.istex.fr/document/451C86626179A17CCC13ACCCE699A1A2EBDFA996/fulltext/pdf

DOI: 10.1016/S0167-6393(00)00067-4

Links to Exploration step

ISTEX:451C86626179A17CCC13ACCCE699A1A2EBDFA996

Le document en format XML

<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title>Transcriber: Development and use of a tool for assisting speech corpora production</title>
<author><name sortKey="Barras, Claude" sort="Barras, Claude" uniqKey="Barras C" first="Claude" last="Barras">Claude Barras</name>
<affiliation><mods:affiliation>Spoken Language Processing Group, LIMSI-CNRS, BP 133, 91403 Orsay Cedex, France</mods:affiliation>
</affiliation>
<affiliation><mods:affiliation>Corresponding author. Tel.: +33-1-69-85-80-61; fax: +33-1-69-85-80-88</mods:affiliation>
</affiliation>
<affiliation><mods:affiliation>E-mail: claude.barras@limsi.fr</mods:affiliation>
</affiliation>
</author>
<author><name sortKey="Geoffrois, Edouard" sort="Geoffrois, Edouard" uniqKey="Geoffrois E" first="Edouard" last="Geoffrois">Edouard Geoffrois</name>
<affiliation><mods:affiliation>DGA/CTA/GIP, 16 bis av. Prieur de la Côte d'Or, 94114 Arcueil Cedex, France</mods:affiliation>
</affiliation>
</author>
<author><name sortKey="Wu, Zhibiao" sort="Wu, Zhibiao" uniqKey="Wu Z" first="Zhibiao" last="Wu">Zhibiao Wu</name>
<affiliation><mods:affiliation>LDC, 3615 Market Street, Suite 200, Philadelphia, PA 19104-2608, USA</mods:affiliation>
</affiliation>
</author>
<author><name sortKey="Liberman, Mark" sort="Liberman, Mark" uniqKey="Liberman M" first="Mark" last="Liberman">Mark Liberman</name>
<affiliation><mods:affiliation>LDC, 3615 Market Street, Suite 200, Philadelphia, PA 19104-2608, USA</mods:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:451C86626179A17CCC13ACCCE699A1A2EBDFA996</idno>
<date when="2001" year="2001">2001</date>
<idno type="doi">10.1016/S0167-6393(00)00067-4</idno>
<idno type="url">https://api.istex.fr/document/451C86626179A17CCC13ACCCE699A1A2EBDFA996/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000477</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a">Transcriber: Development and use of a tool for assisting speech corpora production</title>
<author><name sortKey="Barras, Claude" sort="Barras, Claude" uniqKey="Barras C" first="Claude" last="Barras">Claude Barras</name>
<affiliation><mods:affiliation>Spoken Language Processing Group, LIMSI-CNRS, BP 133, 91403 Orsay Cedex, France</mods:affiliation>
</affiliation>
<affiliation><mods:affiliation>Corresponding author. Tel.: +33-1-69-85-80-61; fax: +33-1-69-85-80-88</mods:affiliation>
</affiliation>
<affiliation><mods:affiliation>E-mail: claude.barras@limsi.fr</mods:affiliation>
</affiliation>
</author>
<author><name sortKey="Geoffrois, Edouard" sort="Geoffrois, Edouard" uniqKey="Geoffrois E" first="Edouard" last="Geoffrois">Edouard Geoffrois</name>
<affiliation><mods:affiliation>DGA/CTA/GIP, 16 bis av. Prieur de la Côte d'Or, 94114 Arcueil Cedex, France</mods:affiliation>
</affiliation>
</author>
<author><name sortKey="Wu, Zhibiao" sort="Wu, Zhibiao" uniqKey="Wu Z" first="Zhibiao" last="Wu">Zhibiao Wu</name>
<affiliation><mods:affiliation>LDC, 3615 Market Street, Suite 200, Philadelphia, PA 19104-2608, USA</mods:affiliation>
</affiliation>
</author>
<author><name sortKey="Liberman, Mark" sort="Liberman, Mark" uniqKey="Liberman M" first="Mark" last="Liberman">Mark Liberman</name>
<affiliation><mods:affiliation>LDC, 3615 Market Street, Suite 200, Philadelphia, PA 19104-2608, USA</mods:affiliation>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="j">Speech Communication</title>
<title level="j" type="abbrev">SPECOM</title>
<idno type="ISSN">0167-6393</idno>
<imprint><publisher>ELSEVIER</publisher>
<date type="published" when="2000">2000</date>
<biblScope unit="volume">33</biblScope>
<biblScope unit="issue">1–2</biblScope>
<biblScope unit="page" from="5">5</biblScope>
<biblScope unit="page" to="22">22</biblScope>
</imprint>
<idno type="ISSN">0167-6393</idno>
</series>
<idno type="istex">451C86626179A17CCC13ACCCE699A1A2EBDFA996</idno>
<idno type="DOI">10.1016/S0167-6393(00)00067-4</idno>
<idno type="PII">S0167-6393(00)00067-4</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0167-6393</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">We present “Transcriber”, a tool for assisting in the creation of speech corpora, and describe some aspects of its development and use. Transcriber was designed for the manual segmentation and transcription of long duration broadcast news recordings, including annotation of speech turns, topics and acoustic conditions. It is highly portable, relying on the scripting language Tcl/Tk with extensions such as Snack for advanced audio functions and tcLex for lexical analysis, and has been tested on various Unix systems and Windows. The data format follows the XML standard with Unicode support for multilingual transcriptions. Distributed as free software in order to encourage the production of corpora, ease their sharing, increase user feedback and motivate software contributions, Transcriber has been in use for over a year in several countries. As a result of this collective experience, new requirements arose to support additional data formats, video control, and a better management of conversational speech. Using the annotation graphs framework recently formalized, adaptation of the tool towards new tasks and support of different data formats will become easier.</div>
</front>
</TEI>
<istex><corpusName>elsevier</corpusName>
<author><json:item><name>Claude Barras</name>
<affiliations><json:string>Spoken Language Processing Group, LIMSI-CNRS, BP 133, 91403 Orsay Cedex, France</json:string>
<json:string>Corresponding author. Tel.: +33-1-69-85-80-61; fax: +33-1-69-85-80-88</json:string>
<json:string>E-mail: claude.barras@limsi.fr</json:string>
</affiliations>
</json:item>
<json:item><name>Edouard Geoffrois</name>
<affiliations><json:string>DGA/CTA/GIP, 16 bis av. Prieur de la Côte d'Or, 94114 Arcueil Cedex, France</json:string>
</affiliations>
</json:item>
<json:item><name>Zhibiao Wu</name>
<affiliations><json:string>LDC, 3615 Market Street, Suite 200, Philadelphia, PA 19104-2608, USA</json:string>
</affiliations>
</json:item>
<json:item><name>Mark Liberman</name>
<affiliations><json:string>LDC, 3615 Market Street, Suite 200, Philadelphia, PA 19104-2608, USA</json:string>
</affiliations>
</json:item>
</author>
<subject><json:item><lang><json:string>eng</json:string>
</lang>
<value>Transcription tool</value>
</json:item>
<json:item><lang><json:string>eng</json:string>
</lang>
<value>Speech corpora</value>
</json:item>
<json:item><lang><json:string>eng</json:string>
</lang>
<value>Broadcast news</value>
</json:item>
<json:item><lang><json:string>eng</json:string>
</lang>
<value>Linguistic annotation formats</value>
</json:item>
</subject>
<language><json:string>eng</json:string>
</language>
<originalGenre><json:string>Full-length article</json:string>
</originalGenre>
<abstract>We present “Transcriber”, a tool for assisting in the creation of speech corpora, and describe some aspects of its development and use. Transcriber was designed for the manual segmentation and transcription of long duration broadcast news recordings, including annotation of speech turns, topics and acoustic conditions. It is highly portable, relying on the scripting language Tcl/Tk with extensions such as Snack for advanced audio functions and tcLex for lexical analysis, and has been tested on various Unix systems and Windows. The data format follows the XML standard with Unicode support for multilingual transcriptions. Distributed as free software in order to encourage the production of corpora, ease their sharing, increase user feedback and motivate software contributions, Transcriber has been in use for over a year in several countries. As a result of this collective experience, new requirements arose to support additional data formats, video control, and a better management of conversational speech. Using the annotation graphs framework recently formalized, adaptation of the tool towards new tasks and support of different data formats will become easier.</abstract>
<qualityIndicators><score>7.088</score>
<pdfVersion>1.2</pdfVersion>
<pdfPageSize>544 x 743 pts</pdfPageSize>
<refBibsNative>true</refBibsNative>
<keywordCount>4</keywordCount>
<abstractCharCount>1176</abstractCharCount>
<pdfWordCount>8772</pdfWordCount>
<pdfCharCount>55141</pdfCharCount>
<pdfPageCount>18</pdfPageCount>
<abstractWordCount>174</abstractWordCount>
</qualityIndicators>
<title>Transcriber: Development and use of a tool for assisting speech corpora production</title>
<pii><json:string>S0167-6393(00)00067-4</json:string>
</pii>
<genre><json:string>research-article</json:string>
</genre>
<host><volume>33</volume>
<pii><json:string>S0167-6393(00)X0050-7</json:string>
</pii>
<editor><json:item><name>S. Bird and J. Harrington</name>
</json:item>
</editor>
<pages><last>22</last>
<first>5</first>
</pages>
<conference><json:item><name>Speech Annotation and Corpus Tools Speech Annotation</name>
</json:item>
</conference>
<issn><json:string>0167-6393</json:string>
</issn>
<issue>1–2</issue>
<genre><json:string>journal</json:string>
</genre>
<language><json:string>unknown</json:string>
</language>
<title>Speech Communication</title>
<publicationDate>2001</publicationDate>
</host>
<categories><wos><json:string>COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS</json:string>
<json:string>ACOUSTICS</json:string>
</wos>
</categories>
<publicationDate>2001</publicationDate>
<copyrightDate>2001</copyrightDate>
<doi><json:string>10.1016/S0167-6393(00)00067-4</json:string>
</doi>
<id>451C86626179A17CCC13ACCCE699A1A2EBDFA996</id>
<score>0.10527899</score>
<fulltext><json:item><original>true</original>
<mimetype>application/pdf</mimetype>
<extension>pdf</extension>
<uri>https://api.istex.fr/document/451C86626179A17CCC13ACCCE699A1A2EBDFA996/fulltext/pdf</uri>
</json:item>
<json:item><original>false</original>
<mimetype>application/zip</mimetype>
<extension>zip</extension>
<uri>https://api.istex.fr/document/451C86626179A17CCC13ACCCE699A1A2EBDFA996/fulltext/zip</uri>
</json:item>
<istex:fulltextTEI uri="https://api.istex.fr/document/451C86626179A17CCC13ACCCE699A1A2EBDFA996/fulltext/tei"><teiHeader><fileDesc><titleStmt><title level="a">Transcriber: Development and use of a tool for assisting speech corpora production</title>
</titleStmt>
<publicationStmt><authority>ISTEX</authority>
<publisher>ELSEVIER</publisher>
<availability><p>ELSEVIER</p>
</availability>
<date>2001</date>
</publicationStmt>
<notesStmt><note>This work was done while the first author was working with DGA.</note>
<note type="content">Fig. 1: Screen shot of the user interface.</note>
<note type="content">Fig. 2: The four segmentation levels of a transcription.</note>
<note type="content">Fig. 3: Sample of a transcription file.</note>
<note type="content">Fig. 4: Two solutions tested for the representation of overlapping speech.</note>
</notesStmt>
<sourceDesc><biblStruct type="inbook"><analytic><title level="a">Transcriber: Development and use of a tool for assisting speech corpora production</title>
<author><persName><forename type="first">Claude</forename>
<surname>Barras</surname>
</persName>
<email>claude.barras@limsi.fr</email>
<affiliation>Spoken Language Processing Group, LIMSI-CNRS, BP 133, 91403 Orsay Cedex, France</affiliation>
<affiliation>Corresponding author. Tel.: +33-1-69-85-80-61; fax: +33-1-69-85-80-88</affiliation>
</author>
<author><persName><forename type="first">Edouard</forename>
<surname>Geoffrois</surname>
</persName>
<affiliation>DGA/CTA/GIP, 16 bis av. Prieur de la Côte d'Or, 94114 Arcueil Cedex, France</affiliation>
</author>
<author><persName><forename type="first">Zhibiao</forename>
<surname>Wu</surname>
</persName>
<affiliation>LDC, 3615 Market Street, Suite 200, Philadelphia, PA 19104-2608, USA</affiliation>
</author>
<author><persName><forename type="first">Mark</forename>
<surname>Liberman</surname>
</persName>
<affiliation>LDC, 3615 Market Street, Suite 200, Philadelphia, PA 19104-2608, USA</affiliation>
</author>
</analytic>
<monogr><title level="j">Speech Communication</title>
<title level="j" type="abbrev">SPECOM</title>
<idno type="pISSN">0167-6393</idno>
<idno type="PII">S0167-6393(00)X0050-7</idno>
<meeting><addName>Speech Annotation and Corpus Tools</addName>
<addName>Speech Annotation</addName>
</meeting>
<editor><persName>S. Bird and J. Harrington</persName>
</editor>
<imprint><publisher>ELSEVIER</publisher>
<date type="published" when="2000"></date>
<biblScope unit="volume">33</biblScope>
<biblScope unit="issue">1–2</biblScope>
<biblScope unit="page" from="5">5</biblScope>
<biblScope unit="page" to="22">22</biblScope>
</imprint>
</monogr>
<idno type="istex">451C86626179A17CCC13ACCCE699A1A2EBDFA996</idno>
<idno type="DOI">10.1016/S0167-6393(00)00067-4</idno>
<idno type="PII">S0167-6393(00)00067-4</idno>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><creation><date>2001</date>
</creation>
<langUsage><language ident="en">en</language>
</langUsage>
<abstract xml:lang="en"><p>We present “Transcriber”, a tool for assisting in the creation of speech corpora, and describe some aspects of its development and use. Transcriber was designed for the manual segmentation and transcription of long duration broadcast news recordings, including annotation of speech turns, topics and acoustic conditions. It is highly portable, relying on the scripting language Tcl/Tk with extensions such as Snack for advanced audio functions and tcLex for lexical analysis, and has been tested on various Unix systems and Windows. The data format follows the XML standard with Unicode support for multilingual transcriptions. Distributed as free software in order to encourage the production of corpora, ease their sharing, increase user feedback and motivate software contributions, Transcriber has been in use for over a year in several countries. As a result of this collective experience, new requirements arose to support additional data formats, video control, and a better management of conversational speech. Using the annotation graphs framework recently formalized, adaptation of the tool towards new tasks and support of different data formats will become easier.</p>
</abstract>
<abstract xml:lang="fr"><p>Nous présentons “Transcriber”, un outil d'aide à la création de corpus de parole, et nous décrivons des éléments de son développement et de son utilisation. Transcriber a été conçu pour permettre la segmentation manuelle et la transcription d'enregistrements de nouvelles radio-diffusées de longue durée, ainsi que l'annotation des tours de parole, des thèmes et des conditions acoustiques. Cet outil très portable, reposant sur le langage de script Tcl/Tk et des extensions telles que Snack pour les fonctionnalités audio et tcLex pour l'analyse lexicale, a été testé sur différents systèmes Unix et sous Windows. Le format de données respecte le standard XML avec un support d'Unicode pour les transcriptions multilingues. Distribué sous license libre pour encourager la production de corpus, faciliter leur échange, augmenter le retour d'expérience des utilisateurs et motiver les contributions logicielles extérieures, Transcriber est utilisé depuis plus d'un an dans plusieurs pays. Suite à cette utilisation, de nouveaux besoins sont apparus comme le support de formats de données supplémentaires, de la vidéo, et un meilleur traitement de la parole conversationnelle. En utilisant le modèle des graphes d'annotation formalisé récemment, l'adaptation de l'outil vers de nouvelles tâches et le support de différents formats de données sera facilité.</p>
</abstract>
<textClass><keywords scheme="keyword"><list><head>Keywords</head>
<item><term>Transcription tool</term>
</item>
<item><term>Speech corpora</term>
</item>
<item><term>Broadcast news</term>
</item>
<item><term>Linguistic annotation formats</term>
</item>
</list>
</keywords>
</textClass>
</profileDesc>
<revisionDesc><change when="2000-08-02">Registration</change>
<change when="2000">Published</change>
</revisionDesc>
</teiHeader>
</istex:fulltextTEI>
<json:item><original>false</original>
<mimetype>text/plain</mimetype>
<extension>txt</extension>
<uri>https://api.istex.fr/document/451C86626179A17CCC13ACCCE699A1A2EBDFA996/fulltext/txt</uri>
</json:item>
</fulltext>
<metadata><istex:metadataXml wicri:clean="Elsevier, elements deleted: ce:floats; body; tail"><istex:xmlDeclaration>version="1.0" encoding="utf-8"</istex:xmlDeclaration>
<istex:docType PUBLIC="-//ES//DTD journal article DTD version 4.5.2//EN//XML" URI="art452.dtd" name="istex:docType"><istex:entity SYSTEM="gr1" NDATA="IMAGE" name="gr1"></istex:entity>
<istex:entity SYSTEM="gr2" NDATA="IMAGE" name="gr2"></istex:entity>
<istex:entity SYSTEM="gr3" NDATA="IMAGE" name="gr3"></istex:entity>
<istex:entity SYSTEM="gr4" NDATA="IMAGE" name="gr4"></istex:entity>
</istex:docType>
<istex:document><converted-article version="4.5.2" docsubtype="fla"><item-info><jid>SPECOM</jid>
<aid>1105</aid>
<ce:pii>S0167-6393(00)00067-4</ce:pii>
<ce:doi>10.1016/S0167-6393(00)00067-4</ce:doi>
<ce:copyright type="full-transfer" year="2001">Elsevier Science B.V.</ce:copyright>
</item-info>
<head><ce:article-footnote><ce:label>☆</ce:label>
<ce:note-para>This work was done while the first author was working with DGA.</ce:note-para>
</ce:article-footnote>
<ce:title>Transcriber: Development and use of a tool for assisting speech corpora production</ce:title>
<ce:author-group><ce:author><ce:given-name>Claude</ce:given-name>
<ce:surname>Barras</ce:surname>
<ce:cross-ref refid="AFF1"><ce:sup>a</ce:sup>
</ce:cross-ref>
<ce:cross-ref refid="CORR1">*</ce:cross-ref>
<ce:e-address>claude.barras@limsi.fr</ce:e-address>
</ce:author>
<ce:author><ce:given-name>Edouard</ce:given-name>
<ce:surname>Geoffrois</ce:surname>
<ce:cross-ref refid="AFF2"><ce:sup>b</ce:sup>
</ce:cross-ref>
</ce:author>
<ce:author><ce:given-name>Zhibiao</ce:given-name>
<ce:surname>Wu</ce:surname>
<ce:cross-ref refid="AFF3"><ce:sup>c</ce:sup>
</ce:cross-ref>
</ce:author>
<ce:author><ce:given-name>Mark</ce:given-name>
<ce:surname>Liberman</ce:surname>
<ce:cross-ref refid="AFF3"><ce:sup>c</ce:sup>
</ce:cross-ref>
</ce:author>
<ce:affiliation id="AFF1"><ce:label>a</ce:label>
<ce:textfn>Spoken Language Processing Group, LIMSI-CNRS, BP 133, 91403 Orsay Cedex, France</ce:textfn>
</ce:affiliation>
<ce:affiliation id="AFF2"><ce:label>b</ce:label>
<ce:textfn>DGA/CTA/GIP, 16 bis av. Prieur de la Côte d'Or, 94114 Arcueil Cedex, France</ce:textfn>
</ce:affiliation>
<ce:affiliation id="AFF3"><ce:label>c</ce:label>
<ce:textfn>LDC, 3615 Market Street, Suite 200, Philadelphia, PA 19104-2608, USA</ce:textfn>
</ce:affiliation>
<ce:correspondence id="CORR1"><ce:label>*</ce:label>
<ce:text>Corresponding author. Tel.: +33-1-69-85-80-61; fax: +33-1-69-85-80-88</ce:text>
</ce:correspondence>
</ce:author-group>
<ce:date-accepted day="2" month="8" year="2000"></ce:date-accepted>
<ce:abstract><ce:section-title>Abstract</ce:section-title>
<ce:abstract-sec><ce:simple-para>We present “Transcriber”, a tool for assisting in the creation of speech corpora, and describe some aspects of its development and use. Transcriber was designed for the manual segmentation and transcription of long duration broadcast news recordings, including annotation of speech turns, topics and acoustic conditions. It is highly portable, relying on the scripting language Tcl/Tk with extensions such as Snack for advanced audio functions and tcLex for lexical analysis, and has been tested on various Unix systems and Windows. The data format follows the XML standard with Unicode support for multilingual transcriptions. Distributed as free software in order to encourage the production of corpora, ease their sharing, increase user feedback and motivate software contributions, Transcriber has been in use for over a year in several countries. As a result of this collective experience, new requirements arose to support additional data formats, video control, and a better management of conversational speech. Using the annotation graphs framework recently formalized, adaptation of the tool towards new tasks and support of different data formats will become easier.</ce:simple-para>
</ce:abstract-sec>
</ce:abstract>
<ce:abstract xml:lang="fr"><ce:section-title>Résumé</ce:section-title>
<ce:abstract-sec><ce:simple-para>Nous présentons “Transcriber”, un outil d'aide à la création de corpus de parole, et nous décrivons des éléments de son développement et de son utilisation. Transcriber a été conçu pour permettre la segmentation manuelle et la transcription d'enregistrements de nouvelles radio-diffusées de longue durée, ainsi que l'annotation des tours de parole, des thèmes et des conditions acoustiques. Cet outil très portable, reposant sur le langage de script Tcl/Tk et des extensions telles que Snack pour les fonctionnalités audio et tcLex pour l'analyse lexicale, a été testé sur différents systèmes Unix et sous Windows. Le format de données respecte le standard XML avec un support d'Unicode pour les transcriptions multilingues. Distribué sous license libre pour encourager la production de corpus, faciliter leur échange, augmenter le retour d'expérience des utilisateurs et motiver les contributions logicielles extérieures, Transcriber est utilisé depuis plus d'un an dans plusieurs pays. Suite à cette utilisation, de nouveaux besoins sont apparus comme le support de formats de données supplémentaires, de la vidéo, et un meilleur traitement de la parole conversationnelle. En utilisant le modèle des graphes d'annotation formalisé récemment, l'adaptation de l'outil vers de nouvelles tâches et le support de différents formats de données sera facilité.</ce:simple-para>
</ce:abstract-sec>
</ce:abstract>
<ce:keywords class="keyword"><ce:section-title>Keywords</ce:section-title>
<ce:keyword><ce:text>Transcription tool</ce:text>
</ce:keyword>
<ce:keyword><ce:text>Speech corpora</ce:text>
</ce:keyword>
<ce:keyword><ce:text>Broadcast news</ce:text>
</ce:keyword>
<ce:keyword><ce:text>Linguistic annotation formats</ce:text>
</ce:keyword>
</ce:keywords>
</head>
</converted-article>
</istex:document>
</istex:metadataXml>
<mods version="3.6"><titleInfo><title>Transcriber: Development and use of a tool for assisting speech corpora production</title>
</titleInfo>
<titleInfo type="alternative" contentType="CDATA"><title>Transcriber: Development and use of a tool for assisting speech corpora production</title>
</titleInfo>
<name type="personal"><namePart type="given">Claude</namePart>
<namePart type="family">Barras</namePart>
<affiliation>Spoken Language Processing Group, LIMSI-CNRS, BP 133, 91403 Orsay Cedex, France</affiliation>
<affiliation>Corresponding author. Tel.: +33-1-69-85-80-61; fax: +33-1-69-85-80-88</affiliation>
<affiliation>E-mail: claude.barras@limsi.fr</affiliation>
<role><roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Edouard</namePart>
<namePart type="family">Geoffrois</namePart>
<affiliation>DGA/CTA/GIP, 16 bis av. Prieur de la Côte d'Or, 94114 Arcueil Cedex, France</affiliation>
<role><roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Zhibiao</namePart>
<namePart type="family">Wu</namePart>
<affiliation>LDC, 3615 Market Street, Suite 200, Philadelphia, PA 19104-2608, USA</affiliation>
<role><roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Mark</namePart>
<namePart type="family">Liberman</namePart>
<affiliation>LDC, 3615 Market Street, Suite 200, Philadelphia, PA 19104-2608, USA</affiliation>
<role><roleTerm type="text">author</roleTerm>
</role>
</name>
<typeOfResource>text</typeOfResource>
<genre type="research-article" displayLabel="Full-length article"></genre>
<originInfo><publisher>ELSEVIER</publisher>
<dateIssued encoding="w3cdtf">2001</dateIssued>
<copyrightDate encoding="w3cdtf">2001</copyrightDate>
</originInfo>
<language><languageTerm type="code" authority="iso639-2b">eng</languageTerm>
<languageTerm type="code" authority="rfc3066">en</languageTerm>
</language>
<physicalDescription><internetMediaType>text/html</internetMediaType>
</physicalDescription>
<abstract lang="en">We present “Transcriber”, a tool for assisting in the creation of speech corpora, and describe some aspects of its development and use. Transcriber was designed for the manual segmentation and transcription of long duration broadcast news recordings, including annotation of speech turns, topics and acoustic conditions. It is highly portable, relying on the scripting language Tcl/Tk with extensions such as Snack for advanced audio functions and tcLex for lexical analysis, and has been tested on various Unix systems and Windows. The data format follows the XML standard with Unicode support for multilingual transcriptions. Distributed as free software in order to encourage the production of corpora, ease their sharing, increase user feedback and motivate software contributions, Transcriber has been in use for over a year in several countries. As a result of this collective experience, new requirements arose to support additional data formats, video control, and a better management of conversational speech. Using the annotation graphs framework recently formalized, adaptation of the tool towards new tasks and support of different data formats will become easier.</abstract>
<abstract lang="fr">Nous présentons “Transcriber”, un outil d'aide à la création de corpus de parole, et nous décrivons des éléments de son développement et de son utilisation. Transcriber a été conçu pour permettre la segmentation manuelle et la transcription d'enregistrements de nouvelles radio-diffusées de longue durée, ainsi que l'annotation des tours de parole, des thèmes et des conditions acoustiques. Cet outil très portable, reposant sur le langage de script Tcl/Tk et des extensions telles que Snack pour les fonctionnalités audio et tcLex pour l'analyse lexicale, a été testé sur différents systèmes Unix et sous Windows. Le format de données respecte le standard XML avec un support d'Unicode pour les transcriptions multilingues. Distribué sous license libre pour encourager la production de corpus, faciliter leur échange, augmenter le retour d'expérience des utilisateurs et motiver les contributions logicielles extérieures, Transcriber est utilisé depuis plus d'un an dans plusieurs pays. Suite à cette utilisation, de nouveaux besoins sont apparus comme le support de formats de données supplémentaires, de la vidéo, et un meilleur traitement de la parole conversationnelle. En utilisant le modèle des graphes d'annotation formalisé récemment, l'adaptation de l'outil vers de nouvelles tâches et le support de différents formats de données sera facilité.</abstract>
<note>This work was done while the first author was working with DGA.</note>
<note type="content">Fig. 1: Screen shot of the user interface.</note>
<note type="content">Fig. 2: The four segmentation levels of a transcription.</note>
<note type="content">Fig. 3: Sample of a transcription file.</note>
<note type="content">Fig. 4: Two solutions tested for the representation of overlapping speech.</note>
<subject><genre>Keywords</genre>
<topic>Transcription tool</topic>
<topic>Speech corpora</topic>
<topic>Broadcast news</topic>
<topic>Linguistic annotation formats</topic>
</subject>
<relatedItem type="host"><titleInfo><title>Speech Communication</title>
</titleInfo>
<titleInfo type="abbreviated"><title>SPECOM</title>
</titleInfo>
<name type="conference"><namePart>Speech Annotation and Corpus Tools</namePart>
<namePart>Speech Annotation</namePart>
</name>
<name type="personal"><namePart>S. Bird and J. Harrington</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<genre type="journal">journal</genre>
<originInfo><dateIssued encoding="w3cdtf">200101</dateIssued>
</originInfo>
<identifier type="ISSN">0167-6393</identifier>
<identifier type="PII">S0167-6393(00)X0050-7</identifier>
<part><date>200101</date>
<detail type="issue"><title>Speech Annotation and Corpus Tools</title>
</detail>
<detail type="volume"><number>33</number>
<caption>vol.</caption>
</detail>
<detail type="issue"><number>1–2</number>
<caption>no.</caption>
</detail>
<extent unit="issue pages"><start>1</start>
<end>176</end>
</extent>
<extent unit="pages"><start>5</start>
<end>22</end>
</extent>
</part>
</relatedItem>
<identifier type="istex">451C86626179A17CCC13ACCCE699A1A2EBDFA996</identifier>
<identifier type="DOI">10.1016/S0167-6393(00)00067-4</identifier>
<identifier type="PII">S0167-6393(00)00067-4</identifier>
<accessCondition type="use and reproduction" contentType="">© 2001Elsevier Science B.V.</accessCondition>
<recordInfo><recordContentSource>ELSEVIER</recordContentSource>
<recordOrigin>Elsevier Science B.V., ©2001</recordOrigin>
</recordInfo>
</mods>
</metadata>
<enrichments><istex:catWosTEI uri="https://api.istex.fr/document/451C86626179A17CCC13ACCCE699A1A2EBDFA996/enrichments/catWos"><teiHeader><profileDesc><textClass><classCode scheme="WOS">COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS</classCode>
<classCode scheme="WOS">ACOUSTICS</classCode>
</textClass>
</profileDesc>
</teiHeader>
</istex:catWosTEI>
</enrichments>
<serie></serie>
</istex>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Ticri/explor/TeiVM2/Data/Istex/Corpus

HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000477 | SxmlIndent | more

HfdSelect -h $EXPLOR_AREA/Data/Istex/Corpus/biblio.hfd -nk 000477 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Ticri
   |area=    TeiVM2
   |flux=    Istex
   |étape=   Corpus
   |type=    RBID
   |clé=     ISTEX:451C86626179A17CCC13ACCCE699A1A2EBDFA996
   |texte=   Transcriber: Development and use of a tool for assisting speech corpora production
}}

This area was generated with Dilib version V0.6.31.
Data generation: Mon Oct 30 21:59:18 2017. Site generation: Sun Feb 11 23:16:06 2024

	Serveur d'exploration sur la TEI
	Attention, ce site est en cours de développement ! Attention, site généré par des moyens informatiques à partir de corpus bruts. Les informations ne sont donc pas validées.

Serveur d'exploration sur la TEI

Transcriber: Development and use of a tool for assisting speech corpora production

Transcriber: Development and use of a tool for assisting speech corpora production

Source :

Abstract

Links to Exploration step

Le document en format XML

Pour manipuler ce document sous Unix (Dilib)

Pour mettre un lien sur cette page dans le réseau Wicri