Duration of Phones as Function of Utterance Length and its Use in Automatic Speech
Internal identifier: 001187 (Crin/Corpus); previous: 001186; next: 001188
Authors: Y. Gong; W. C. Treurniet
English descriptors
Abstract
Duration probability of phonemes is widely used as a constraint in phoneme-based continuous speech recognizers, and is known to improve recognition accuracy. Usually, models of phoneme duration are extracted from continuous utterances of sentences in a training database. However, tokens obtained from shorter utterances may not be well represented by models created from longer utterances. We designed an experiment to compute observed average phoneme duration as a function of the number of phonemes per utterance. We observed that the average duration consistently increases as the number of phonemes per utterance increases. The experiment showed that the average duration of phonemes in words spoken in isolation may be as much as 50% longer than the average duration of phonemes in continuously spoken sentences. The variation of phoneme duration as a function of utterance duration was modeled in both the phoneme probability estimation stage and the utterance search stage of a recognition system. As a result, a 47% reduction in word recognition errors was obtained.
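The measurement described in the abstract (average phoneme duration as a function of the number of phonemes per utterance) can be sketched as follows. This is a minimal illustration, not the authors' procedure: the function name `mean_duration_by_length`, the input layout (one list of per-phoneme durations in milliseconds per utterance), and the toy values are all assumptions made for the example and do not come from the paper.

```python
from collections import defaultdict


def mean_duration_by_length(utterances):
    """Map utterance length (phoneme count) to mean phoneme duration.

    `utterances` is a hypothetical input format: an iterable of lists,
    each holding the durations (in ms) of the phonemes of one utterance.
    """
    # For each utterance length, accumulate [total duration, phoneme count].
    totals = defaultdict(lambda: [0.0, 0])
    for durations in utterances:
        n = len(durations)
        if n == 0:
            continue  # skip empty utterances
        totals[n][0] += sum(durations)
        totals[n][1] += n
    return {n: total / count for n, (total, count) in sorted(totals.items())}


# Invented toy data, for illustration only (not results from the paper).
utterances = [
    [120.0, 150.0, 90.0],
    [110.0, 130.0, 120.0],
    [90.0, 80.0, 100.0, 70.0, 110.0, 90.0],
]
profile = mean_duration_by_length(utterances)
```

A profile of this kind could then be used to condition a duration model on utterance length, which is the idea the abstract describes; how the authors integrated it into their recognizer is not specified here.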
Links to Exploration step
CRIN:gong93b
The document in XML format
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" wicri:score="464">Duration of Phones as Function of Utterance Length and its Use in Automatic Speech</title>
</titleStmt>
<publicationStmt><idno type="RBID">CRIN:gong93b</idno>
<date when="1993" year="1993">1993</date>
<idno type="wicri:Area/Crin/Corpus">001187</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en">Duration of Phones as Function of Utterance Length and its Use in Automatic Speech</title>
<author><name sortKey="Gong, Y" sort="Gong, Y" uniqKey="Gong Y" first="Y." last="Gong">Y. Gong</name>
</author>
<author><name sortKey="Treurniet, W C" sort="Treurniet, W C" uniqKey="Treurniet W" first="W. C." last="Treurniet">W. C. Treurniet</name>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>duration probability</term>
<term>phoneme duration</term>
<term>phoneme-based speech recognition</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en" wicri:score="3491">Duration probability of phonemes is widely used as a constraint in phoneme-based continuous speech recognizers, and is known to improve recognition accuracy. Usually, models of phoneme duration are extracted from continuous utterances of sentences in a training database. However, tokens obtained from shorter utterances may not be well represented by models created from longer utterances. We designed an experiment to compute observed average phoneme duration as a function of the number of phonemes per utterance. We observed that the average duration consistently increases as the number of phonemes per utterance increases. The experiment showed that the average duration of phonemes in words spoken in isolation may be as much as 50% longer than the average duration of phonemes in continuously spoken sentences. The variation of phoneme duration as a function of utterance duration was modeled in both the phoneme probability estimation stage and the utterance search stage of a recognition system. As a result, a 47% reduction in word recognition errors was obtained.</div>
</front>
</TEI>
<BibTex type="inproceedings"><ref>gong93b</ref>
<crinnumber>93-R-172</crinnumber>
<category>3</category>
<equipe>RFIA</equipe>
<author><e>Gong, Y.</e>
<e>Treurniet, W.C.</e>
</author>
<title>Duration of Phones as Function of Utterance Length and its Use in Automatic Speech</title>
<booktitle>{Proceedings 3rd European Conference on Speech Communication and Technology, Berlin (Germany)}</booktitle>
<year>1993</year>
<volume>1</volume>
<pages>315-318</pages>
<month>sep</month>
<keywords><e>phoneme duration</e>
<e>duration probability</e>
<e>phoneme-based speech recognition</e>
</keywords>
<abstract>Duration probability of phonemes is widely used as a constraint in phoneme-based continuous speech recognizers, and is known to improve recognition accuracy. Usually, models of phoneme duration are extracted from continuous utterances of sentences in a training database. However, tokens obtained from shorter utterances may not be well represented by models created from longer utterances. We designed an experiment to compute observed average phoneme duration as a function of the number of phonemes per utterance. We observed that the average duration consistently increases as the number of phonemes per utterance increases. The experiment showed that the average duration of phonemes in words spoken in isolation may be as much as 50% longer than the average duration of phonemes in continuously spoken sentences. The variation of phoneme duration as a function of utterance duration was modeled in both the phoneme probability estimation stage and the utterance search stage of a recognition system. As a result, a 47% reduction in word recognition errors was obtained.</abstract>
</BibTex>
</record>
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Crin/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001187 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/Crin/Corpus/biblio.hfd -nk 001187 | SxmlIndent | more
To link to this page within the Wicri network
{{Explor lien |wiki= Wicri/Lorraine |area= InforLorV4 |flux= Crin |étape= Corpus |type= RBID |clé= CRIN:gong93b |texte= Duration of Phones as Function of Utterance Length and its Use in Automatic Speech }}
This area was generated with Dilib version V0.6.33.