Optimization of Perceptually-based ASR Front-end
Internal identifier: 00EB32 (Main/Merge); previous: 00EB31; next: 00EB33
Authors: H. Hermansky; J.-C. Junqua
English descriptors
- KwdEn: distance measure, front-end, speech recognition
Abstract
Several recently proposed automatic speech recognition (ASR) front-ends are experimentally compared for speaker-dependent and cross-speaker ASR. The perceptually based linear predictive (PLP) front-end yields the highest accuracies. By modifying its sensitivity to spectral peaks and to spectral tilt, and by utilizing the speech dynamics, we further improve its error rate in speaker-independent ASR by about 10%.
Links to previous steps (curation, corpus...)
- to stream Crin, to step Corpus: 000668
- to stream Crin, to step Curation: 000668
- to stream Crin, to step Checkpoint: 003F70
Links to Exploration step
CRIN:hermansky88a
The document in XML format
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" wicri:score="185">Optimization of Perceptually-based ASR Front-end</title>
</titleStmt>
<publicationStmt><idno type="RBID">CRIN:hermansky88a</idno>
<date when="1988" year="1988">1988</date>
<idno type="wicri:Area/Crin/Corpus">000668</idno>
<idno type="wicri:Area/Crin/Curation">000668</idno>
<idno type="wicri:explorRef" wicri:stream="Crin" wicri:step="Curation">000668</idno>
<idno type="wicri:Area/Crin/Checkpoint">003F70</idno>
<idno type="wicri:explorRef" wicri:stream="Crin" wicri:step="Checkpoint">003F70</idno>
<idno type="wicri:Area/Main/Merge">00EB32</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en">Optimization of Perceptually-based ASR Front-end</title>
<author><name sortKey="Hermansky, H" sort="Hermansky, H" uniqKey="Hermansky H" first="H." last="Hermansky">H. Hermansky</name>
</author>
<author><name sortKey="Junqua, J C" sort="Junqua, J C" uniqKey="Junqua J" first="J.-C." last="Junqua">J.-C. Junqua</name>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>distance measure</term>
<term>front-end</term>
<term>speech recognition</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en" wicri:score="484">Several recently proposed automatic speech recognition (ASR) front-ends are experimentally compared for speaker-dependent and cross-speaker ASR. The perceptually based linear predictive (PLP) front-end yields the highest accuracies. By modifying its sensitivity to spectral peaks and to spectral tilt, and by utilizing the speech dynamics, we further improve its error rate in speaker-independent ASR by about 10%.</div>
</front>
</TEI>
</record>
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 00EB32 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/Main/Merge/biblio.hfd -nk 00EB32 | SxmlIndent | more
To add a link to this page within the Wicri network
{{Explor lien |wiki= Wicri/Lorraine |area= InforLorV4 |flux= Main |étape= Merge |type= RBID |clé= CRIN:hermansky88a |texte= Optimization of Perceptually-based ASR Front-end }}
This area was generated with Dilib version V0.6.33.