Serveur d'exploration sur la recherche en informatique en Lorraine

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Reconnaissance de la parole pour les locuteurs non natifs en présence de bruit

Identifieur interne : 003285 ( Crin/Corpus ); précédent : 003284; suivant : 003286

Reconnaissance de la parole pour les locuteurs non natifs en présence de bruit

Auteurs : Dominique Fohr ; Odile Mella ; Irina Illina ; Fabrice Lauri ; Christophe Cerisara ; Christophe Antoine

Source :

RBID : CRIN:fohr02a

English descriptors

Abstract

In real world applications, speech recognition is confronted with two main difficulties : the non native speakers and the background noise. The aim of this paper is to compare on the same noisy database different methods in order to increase the robustness of our HMM-based automatic speech recognition system. To deal with the non native speakers, we have tested two solutions : multi-models and adaptation techniques. For noisy speech, we have evaluated two types of methods : the first one (PMC and MLLR) adapts the initial models, trained in clean speech, with a few noisy sentences. The second one (RATZ and MCR) tries to remove the noise from the signal without modifying the HMM models.

Links to Exploration step

CRIN:fohr02a

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="fr" wicri:score="-38">Reconnaissance de la parole pour les locuteurs non natifs en présence de bruit</title>
</titleStmt>
<publicationStmt>
<idno type="RBID">CRIN:fohr02a</idno>
<date when="2002" year="2002">2002</date>
<idno type="wicri:Area/Crin/Corpus">003285</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="fr">Reconnaissance de la parole pour les locuteurs non natifs en présence de bruit</title>
<author>
<name sortKey="Fohr, Dominique" sort="Fohr, Dominique" uniqKey="Fohr D" first="Dominique" last="Fohr">Dominique Fohr</name>
</author>
<author>
<name sortKey="Mella, Odile" sort="Mella, Odile" uniqKey="Mella O" first="Odile" last="Mella">Odile Mella</name>
</author>
<author>
<name sortKey="Illina, Irina" sort="Illina, Irina" uniqKey="Illina I" first="Irina" last="Illina">Irina Illina</name>
</author>
<author>
<name sortKey="Lauri, Fabrice" sort="Lauri, Fabrice" uniqKey="Lauri F" first="Fabrice" last="Lauri">Fabrice Lauri</name>
</author>
<author>
<name sortKey="Cerisara, Christophe" sort="Cerisara, Christophe" uniqKey="Cerisara C" first="Christophe" last="Cerisara">Christophe Cerisara</name>
</author>
<author>
<name sortKey="Antoine, Christophe" sort="Antoine, Christophe" uniqKey="Antoine C" first="Christophe" last="Antoine">Christophe Antoine</name>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>automatic speech recognition</term>
<term>background noise</term>
<term>non native speakers</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en" wicri:score="2031">In real world applications, speech recognition is confronted with two main difficulties : the non native speakers and the background noise. The aim of this paper is to compare on the same noisy database different methods in order to increase the robustness of our HMM-based automatic speech recognition system. To deal with the non native speakers, we have tested two solutions : multi-models and adaptation techniques. For noisy speech, we have evaluated two types of methods : the first one (PMC and MLLR) adapts the initial models, trained in clean speech, with a few noisy sentences. The second one (RATZ and MCR) tries to remove the noise from the signal without modifying the HMM models.</div>
</front>
</TEI>
<BibTex type="inproceedings">
<ref>fohr02a</ref>
<crinnumber>A02-R-112</crinnumber>
<category>3</category>
<equipe>PAROLE</equipe>
<author>
<e>Fohr, Dominique</e>
<e>Mella, Odile</e>
<e>Illina, Irina</e>
<e>Lauri, Fabrice</e>
<e>Cerisara, Christophe</e>
<e>Antoine, Christophe</e>
</author>
<title>Reconnaissance de la parole pour les locuteurs non natifs en présence de bruit</title>
<booktitle>{XXIVèmes Journées d'Etude sur la Parole - JEP'02, Nancy, France}</booktitle>
<year>2002</year>
<pages>297-301</pages>
<month>Jun</month>
<keywords>
<e>automatic speech recognition</e>
<e>non native speakers</e>
<e>background noise</e>
</keywords>
<abstract>In real world applications, speech recognition is confronted with two main difficulties : the non native speakers and the background noise. The aim of this paper is to compare on the same noisy database different methods in order to increase the robustness of our HMM-based automatic speech recognition system. To deal with the non native speakers, we have tested two solutions : multi-models and adaptation techniques. For noisy speech, we have evaluated two types of methods : the first one (PMC and MLLR) adapts the initial models, trained in clean speech, with a few noisy sentences. The second one (RATZ and MCR) tries to remove the noise from the signal without modifying the HMM models.</abstract>
</BibTex>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Crin/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 003285 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Crin/Corpus/biblio.hfd -nk 003285 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    Crin
   |étape=   Corpus
   |type=    RBID
   |clé=     CRIN:fohr02a
   |texte=   Reconnaissance de la parole pour les locuteurs non natifs en présence de bruit
}}

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022