InforLorV4, PascalFrancis, Corpus, bibRecord, 000298

On noise masking for automatic missing data speech recognition : A survey and discussion

Internal identifier : 000298 ( PascalFrancis/Corpus ); previous : 000297; next : 000299

Authors : Christophe Cerisara ; Sébastien Demange ; Jean-Paul Haton

Source :

Computer speech & language : (Print) [ 0885-2308 ] ; 2007.

RBID : Francis:09-0009259

French descriptors

Pascal (Inist)
- Linguistique informatique.

English descriptors

KwdEn :
- Computational linguistics.

Abstract

Automatic speech recognition (ASR) has reached very high levels of performance in controlled situations. However, the performance degrades significantly when environmental noise occurs during the recognition process. Nowadays, the major challenge is to reach a good robustness to adverse conditions, so that automatic speech recognizers can be used in real situations. Missing data theory is a very attractive and promising approach. Unlike other denoising methods, missing data recognition does not match the whole data with the acoustic models, but instead considers part of the signal as missing, i.e. corrupted by noise. While speech recognition with missing data can be handled efficiently by methods such as data imputation or marginalization, accurately identifying missing parts (also called masks) remains a very challenging task. This paper reviews the main approaches that have been proposed to address this problem. The objective of this study is to identify the mask estimation methods that have been proposed so far, and to open this domain up to other related research, which could be adapted to overcome this difficult challenge. In order to restrict the range of methods, only the techniques using a single microphone are considered.
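
As an illustration of the marginalization idea mentioned in the abstract, here is a minimal Python sketch. It is not taken from the paper: it assumes a diagonal-covariance Gaussian acoustic model and a binary mask, and the function name `masked_log_likelihood` and the `missing` flag convention (True means the component is treated as noise-corrupted) are hypothetical. Under these assumptions, integrating out a missing component contributes a factor of 1, so that component simply drops out of the log-likelihood.

```python
import math


def masked_log_likelihood(x, mean, var, missing):
    """Log-likelihood of x under a diagonal Gaussian, with missing components marginalized.

    x, mean, var: sequences of floats of equal length (var holds per-component variances).
    missing: sequence of bools; True marks a component treated as missing (assumed convention).
    """
    total = 0.0
    for xi, mi, vi, is_missing in zip(x, mean, var, missing):
        if is_missing:
            # The marginal over a missing component integrates to 1,
            # so it adds 0 in the log domain.
            continue
        total += -0.5 * (math.log(2.0 * math.pi * vi) + (xi - mi) ** 2 / vi)
    return total


# A heavily corrupted second component is ignored once it is flagged as missing.
score = masked_log_likelihood([1.0, 50.0], [1.0, 0.0], [1.0, 1.0], [False, True])
```

With the second component flagged as missing, its value has no effect on the score, however large it is. The difficult part, as the abstract stresses, is estimating the `missing` flags in the first place.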

Record in standard format (ISO 2709)

See the documentation on the Inist Standard format.

A01	`01`	`1`		`@0 0885-2308`
A03		`1`		`@0 Comput. speech lang. : (Print)`
A05				`@2 21`
A06				`@2 3`
A08	`01`	`1`	`ENG`	`@1 On noise masking for automatic missing data speech recognition : A survey and discussion`
A11	`01`	`1`		`@1 CERISARA (Christophe)`
A11	`02`	`1`		`@1 DEMANGE (Sébastien)`
A11	`03`	`1`		`@1 HATON (Jean-Paul)`
A14	`01`			`@1 LORIA, UMR 7503 @2 Nancy @3 FRA @Z 1 aut. @Z 2 aut. @Z 3 aut.`
A20				`@1 443-457`
A21				`@1 2007`
A23	`01`			`@0 ENG`
A43	`01`			`@1 INIST @2 21332 @5 354000145646040020`
A44				`@0 0000 @1 © 2009 INIST-CNRS. All rights reserved.`
A45				`@0 1 p.3/4`
A47	`01`	`1`		`@0 09-0009259`
A60				`@1 P`
A61				`@0 A`
A64	`01`	`1`		`@0 Computer speech & language : (Print)`
A66	`01`			`@0 GBR`
C01	`01`		`ENG`	@0 Automatic speech recognition (ASR) has reached very high levels of performance in controlled situations. However, the performance degrades significantly when environmental noise occurs during the recognition process. Nowadays, the major challenge is to reach a good robustness to adverse conditions, so that automatic speech recognizers can be used in real situations. Missing data theory is a very attractive and promising approach. Unlike other denoising methods, missing data recognition does not match the whole data with the acoustic models, but instead considers part of the signal as missing, i.e. corrupted by noise. While speech recognition with missing data can be handled efficiently by methods such as data imputation or marginalization, accurately identifying missing parts (also called masks) remains a very challenging task. This paper reviews the main approaches that have been proposed to address this problem. The objective of this study is to identify the mask estimation methods that have been proposed so far, and to open this domain up to other related research, which could be adapted to overcome this difficult challenge. In order to restrict the range of methods, only the techniques using a single microphone are considered.
C02	`01`	`L`		`@0 52478 @1 XV`
C02	`02`	`L`		`@0 524`
C03	`01`	`L`	`FRE`	`@0 Linguistique informatique @2 NI @5 01`
C03	`01`	`L`	`ENG`	`@0 Computational linguistics @2 NI @5 01`
N21				`@1 004`
N44	`01`			`@1 OTO`
N82				`@1 OTO`

Inist format (server)

NO :	FRANCIS 09-0009259 INIST
ET :	On noise masking for automatic missing data speech recognition : A survey and discussion
AU :	CERISARA (Christophe); DEMANGE (Sébastien); HATON (Jean-Paul)
AF :	LORIA, UMR 7503/Nancy/France (1 aut., 2 aut., 3 aut.)
DT :	Publication en série; Niveau analytique
SO :	Computer speech & language : (Print); ISSN 0885-2308; Royaume-Uni; Da. 2007; Vol. 21; No. 3; Pp. 443-457; Bibl. 1 p.3/4
LA :	Anglais
EA :	Automatic speech recognition (ASR) has reached very high levels of performance in controlled situations. However, the performance degrades significantly when environmental noise occurs during the recognition process. Nowadays, the major challenge is to reach a good robustness to adverse conditions, so that automatic speech recognizers can be used in real situations. Missing data theory is a very attractive and promising approach. Unlike other denoising methods, missing data recognition does not match the whole data with the acoustic models, but instead considers part of the signal as missing, i.e. corrupted by noise. While speech recognition with missing data can be handled efficiently by methods such as data imputation or marginalization, accurately identifying missing parts (also called masks) remains a very challenging task. This paper reviews the main approaches that have been proposed to address this problem. The objective of this study is to identify the mask estimation methods that have been proposed so far, and to open this domain up to other related research, which could be adapted to overcome this difficult challenge. In order to restrict the range of methods, only the techniques using a single microphone are considered.
CC :	52478; 524
FD :	Linguistique informatique
ED :	Computational linguistics
LO :	INIST-21332.354000145646040020
ID :	09-0009259
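
The tagged lines above can be read into a dictionary with a few lines of Python. This is a minimal sketch, assuming each line is an uppercase tag, then ` :`, then the value; that layout is inferred only from this listing, not from Inist documentation, and the function name `parse_tagged_record` is hypothetical.

```python
def parse_tagged_record(text):
    """Parse 'TAG :<whitespace>value' lines into a dict mapping tag -> value."""
    record = {}
    for line in text.splitlines():
        # Split on the first ' :' only, so colons inside the value are preserved.
        tag, sep, value = line.partition(" :")
        if not sep or not (tag.isalpha() and tag.isupper()):
            continue  # skip lines that do not look like a tagged field
        record[tag] = value.strip()
    return record


sample = (
    "NO :\tFRANCIS 09-0009259 INIST\n"
    "ET :\tOn noise masking for automatic missing data speech recognition : A survey and discussion\n"
    "AU :\tCERISARA (Christophe); DEMANGE (Sébastien); HATON (Jean-Paul)\n"
    "CC :\t52478; 524\n"
)
record = parse_tagged_record(sample)

# Multi-valued fields in this listing are separated by semicolons.
authors = [name.strip() for name in record["AU"].split(";")]
```

Splitting on the first ` :` matters here because the title itself contains ` : `.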

Links to Exploration step

Francis:09-0009259

The document in XML format

<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" level="a">On noise masking for automatic missing data speech recognition : A survey and discussion</title>
<author><name sortKey="Cerisara, Christophe" sort="Cerisara, Christophe" uniqKey="Cerisara C" first="Christophe" last="Cerisara">Christophe Cerisara</name>
<affiliation><inist:fA14 i1="01"><s1>LORIA, UMR 7503</s1>
<s2>Nancy</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Demange, Sebastien" sort="Demange, Sebastien" uniqKey="Demange S" first="Sébastien" last="Demange">Sébastien Demange</name>
<affiliation><inist:fA14 i1="01"><s1>LORIA, UMR 7503</s1>
<s2>Nancy</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Haton, Jean Paul" sort="Haton, Jean Paul" uniqKey="Haton J" first="Jean-Paul" last="Haton">Jean-Paul Haton</name>
<affiliation><inist:fA14 i1="01"><s1>LORIA, UMR 7503</s1>
<s2>Nancy</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">INIST</idno>
<idno type="inist">09-0009259</idno>
<date when="2007">2007</date>
<idno type="stanalyst">FRANCIS 09-0009259 INIST</idno>
<idno type="RBID">Francis:09-0009259</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000298</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a">On noise masking for automatic missing data speech recognition : A survey and discussion</title>
<author><name sortKey="Cerisara, Christophe" sort="Cerisara, Christophe" uniqKey="Cerisara C" first="Christophe" last="Cerisara">Christophe Cerisara</name>
<affiliation><inist:fA14 i1="01"><s1>LORIA, UMR 7503</s1>
<s2>Nancy</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Demange, Sebastien" sort="Demange, Sebastien" uniqKey="Demange S" first="Sébastien" last="Demange">Sébastien Demange</name>
<affiliation><inist:fA14 i1="01"><s1>LORIA, UMR 7503</s1>
<s2>Nancy</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Haton, Jean Paul" sort="Haton, Jean Paul" uniqKey="Haton J" first="Jean-Paul" last="Haton">Jean-Paul Haton</name>
<affiliation><inist:fA14 i1="01"><s1>LORIA, UMR 7503</s1>
<s2>Nancy</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
</analytic>
<series><title level="j" type="main">Computer speech &amp; language : (Print)</title>
<title level="j" type="abbreviated">Comput. speech lang. : (Print)</title>
<idno type="ISSN">0885-2308</idno>
<imprint><date when="2007">2007</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><title level="j" type="main">Computer speech &amp; language : (Print)</title>
<title level="j" type="abbreviated">Comput. speech lang. : (Print)</title>
<idno type="ISSN">0885-2308</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Computational linguistics</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Linguistique informatique</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Automatic speech recognition (ASR) has reached very high levels of performance in controlled situations. However, the performance degrades significantly when environmental noise occurs during the recognition process. Nowadays, the major challenge is to reach a good robustness to adverse conditions, so that automatic speech recognizers can be used in real situations. Missing data theory is a very attractive and promising approach. Unlike other denoising methods, missing data recognition does not match the whole data with the acoustic models, but instead considers part of the signal as missing, i.e. corrupted by noise. While speech recognition with missing data can be handled efficiently by methods such as data imputation or marginalization, accurately identifying missing parts (also called masks) remains a very challenging task. This paper reviews the main approaches that have been proposed to address this problem. The objective of this study is to identify the mask estimation methods that have been proposed so far, and to open this domain up to other related research, which could be adapted to overcome this difficult challenge. In order to restrict the range of methods, only the techniques using a single microphone are considered.</div>
</front>
</TEI>
<inist><standard h6="B"><pA><fA01 i1="01" i2="1"><s0>0885-2308</s0>
</fA01>
<fA03 i2="1"><s0>Comput. speech lang. : (Print)</s0>
</fA03>
<fA05><s2>21</s2>
</fA05>
<fA06><s2>3</s2>
</fA06>
<fA08 i1="01" i2="1" l="ENG"><s1>On noise masking for automatic missing data speech recognition : A survey and discussion</s1>
</fA08>
<fA11 i1="01" i2="1"><s1>CERISARA (Christophe)</s1>
</fA11>
<fA11 i1="02" i2="1"><s1>DEMANGE (Sébastien)</s1>
</fA11>
<fA11 i1="03" i2="1"><s1>HATON (Jean-Paul)</s1>
</fA11>
<fA14 i1="01"><s1>LORIA, UMR 7503</s1>
<s2>Nancy</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</fA14>
<fA20><s1>443-457</s1>
</fA20>
<fA21><s1>2007</s1>
</fA21>
<fA23 i1="01"><s0>ENG</s0>
</fA23>
<fA43 i1="01"><s1>INIST</s1>
<s2>21332</s2>
<s5>354000145646040020</s5>
</fA43>
<fA44><s0>0000</s0>
<s1>© 2009 INIST-CNRS. All rights reserved.</s1>
</fA44>
<fA45><s0>1 p.3/4</s0>
</fA45>
<fA47 i1="01" i2="1"><s0>09-0009259</s0>
</fA47>
<fA60><s1>P</s1>
</fA60>
<fA61><s0>A</s0>
</fA61>
<fA64 i1="01" i2="1"><s0>Computer speech &amp; language : (Print)</s0>
</fA64>
<fA66 i1="01"><s0>GBR</s0>
</fA66>
<fC01 i1="01" l="ENG"><s0>Automatic speech recognition (ASR) has reached very high levels of performance in controlled situations. However, the performance degrades significantly when environmental noise occurs during the recognition process. Nowadays, the major challenge is to reach a good robustness to adverse conditions, so that automatic speech recognizers can be used in real situations. Missing data theory is a very attractive and promising approach. Unlike other denoising methods, missing data recognition does not match the whole data with the acoustic models, but instead considers part of the signal as missing, i.e. corrupted by noise. While speech recognition with missing data can be handled efficiently by methods such as data imputation or marginalization, accurately identifying missing parts (also called masks) remains a very challenging task. This paper reviews the main approaches that have been proposed to address this problem. The objective of this study is to identify the mask estimation methods that have been proposed so far, and to open this domain up to other related research, which could be adapted to overcome this difficult challenge. In order to restrict the range of methods, only the techniques using a single microphone are considered.</s0>
</fC01>
<fC02 i1="01" i2="L"><s0>52478</s0>
<s1>XV</s1>
</fC02>
<fC02 i1="02" i2="L"><s0>524</s0>
</fC02>
<fC03 i1="01" i2="L" l="FRE"><s0>Linguistique informatique</s0>
<s2>NI</s2>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="L" l="ENG"><s0>Computational linguistics</s0>
<s2>NI</s2>
<s5>01</s5>
</fC03>
<fN21><s1>004</s1>
</fN21>
<fN44 i1="01"><s1>OTO</s1>
</fN44>
<fN82><s1>OTO</s1>
</fN82>
</pA>
</standard>
<server><NO>FRANCIS 09-0009259 INIST</NO>
<ET>On noise masking for automatic missing data speech recognition : A survey and discussion</ET>
<AU>CERISARA (Christophe); DEMANGE (Sébastien); HATON (Jean-Paul)</AU>
<AF>LORIA, UMR 7503/Nancy/France (1 aut., 2 aut., 3 aut.)</AF>
<DT>Publication en série; Niveau analytique</DT>
<SO>Computer speech &amp; language : (Print); ISSN 0885-2308; Royaume-Uni; Da. 2007; Vol. 21; No. 3; Pp. 443-457; Bibl. 1 p.3/4</SO>
<LA>Anglais</LA>
<EA>Automatic speech recognition (ASR) has reached very high levels of performance in controlled situations. However, the performance degrades significantly when environmental noise occurs during the recognition process. Nowadays, the major challenge is to reach a good robustness to adverse conditions, so that automatic speech recognizers can be used in real situations. Missing data theory is a very attractive and promising approach. Unlike other denoising methods, missing data recognition does not match the whole data with the acoustic models, but instead considers part of the signal as missing, i.e. corrupted by noise. While speech recognition with missing data can be handled efficiently by methods such as data imputation or marginalization, accurately identifying missing parts (also called masks) remains a very challenging task. This paper reviews the main approaches that have been proposed to address this problem. The objective of this study is to identify the mask estimation methods that have been proposed so far, and to open this domain up to other related research, which could be adapted to overcome this difficult challenge. In order to restrict the range of methods, only the techniques using a single microphone are considered.</EA>
<CC>52478; 524</CC>
<FD>Linguistique informatique</FD>
<ED>Computational linguistics</ED>
<LO>INIST-21332.354000145646040020</LO>
<ID>09-0009259</ID>
</server>
</inist>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/PascalFrancis/Corpus

HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000298 | SxmlIndent | more

HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Corpus/biblio.hfd -nk 000298 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    PascalFrancis
   |étape=   Corpus
   |type=    RBID
   |clé=     Francis:09-0009259
   |texte=   On noise masking for automatic missing data speech recognition : A survey and discussion
}}

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022

	Exploration server on computer science research in Lorraine
	Warning: this site is under development. It was generated automatically from raw corpora, so the information has not been validated.
