Serveur d'exploration sur la recherche en informatique en Lorraine


Spatial location priors for Gaussian model based reverberant audio source separation

Internal identifier: 004676 (Hal/Curation); previous: 004675; next: 004677

Authors: Ngoc Q. K. Duong [France]; Emmanuel Vincent [France]; Rémi Gribonval [France]

RBID : Hal:hal-01289060

Abstract

We consider the Gaussian framework for reverberant audio source separation, where the sources are modeled in the time-frequency domain by their short-term power spectra and their spatial covariance matrices. We propose two alternative probabilistic priors over the spatial covariance matrices which are consistent with the theory of statistical room acoustics and we derive expectation-maximization algorithms for maximum a posteriori (MAP) estimation. We argue that these algorithms provide a statistically principled solution to the permutation problem and to the risk of overfitting resulting from conventional maximum likelihood (ML) estimation. We show experimentally that in a semi-informed scenario where the source positions and certain room characteristics are known, the MAP algorithms outperform their ML counterparts. This opens the way to rigorous statistical treatment of this family of models in other scenarios in the future.
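The contrast between ML and MAP estimation of a covariance matrix can be illustrated with a textbook conjugate update. The sketch below is not the authors' EM algorithm: it assumes real-valued, zero-mean Gaussian samples and an inverse-Wishart prior (the family named in the record's keywords). The scale matrix `psi`, the degrees of freedom `m`, and all function names are hypothetical choices made for illustration only.

```python
from typing import List

Matrix = List[List[float]]


def scatter_matrix(samples: List[List[float]]) -> Matrix:
    """Sum of outer products x x^T over zero-mean samples."""
    p = len(samples[0])
    s = [[0.0] * p for _ in range(p)]
    for x in samples:
        for i in range(p):
            for j in range(p):
                s[i][j] += x[i] * x[j]
    return s


def ml_covariance(samples: List[List[float]]) -> Matrix:
    """Maximum likelihood estimate: scatter matrix divided by sample count."""
    n = len(samples)
    s = scatter_matrix(samples)
    return [[v / n for v in row] for row in s]


def map_covariance(samples: List[List[float]], psi: Matrix, m: float) -> Matrix:
    """MAP estimate under an inverse-Wishart prior IW(psi, m).

    The posterior is IW(psi + S, m + n); its mode is
    (psi + S) / (m + n + p + 1), with S the scatter matrix.
    """
    n = len(samples)
    p = len(psi)
    s = scatter_matrix(samples)
    denom = m + n + p + 1
    return [[(psi[i][j] + s[i][j]) / denom for j in range(p)] for i in range(p)]


if __name__ == "__main__":
    # Two collinear samples: the ML estimate is rank deficient.
    samples = [[1.0, 1.0], [2.0, 2.0]]
    psi = [[1.0, 0.0], [0.0, 1.0]]  # assumed prior scale matrix
    print("ML :", ml_covariance(samples))
    print("MAP:", map_covariance(samples, psi, m=3.0))
```

With only two collinear samples the ML estimate is singular, whereas the prior keeps the MAP estimate positive definite. This is one simple way to see how a prior can limit overfitting when data are scarce.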

URL: https://hal.inria.fr/hal-01289060
DOI: 10.1016/j.sigpro.2011.10.007

The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Spatial location priors for Gaussian model based reverberant audio source separation</title>
<author>
<name sortKey="Duong, Ngoc Q K" sort="Duong, Ngoc Q K" uniqKey="Duong N" first="Ngoc Q. K." last="Duong">Ngoc Q. K. Duong</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-118587" status="VALID">
<orgName>Technicolor R &amp; I [Cesson Sévigné]</orgName>
<desc>
<address>
<addrLine>975 avenue des champs blancs CS 17616 35576 Cesson Sévigné</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.technicolor.com/en/innovation/research-innovation/rennes</ref>
</desc>
<listRelation>
<relation active="#struct-367121" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-367121" type="direct">
<org type="institution" xml:id="struct-367121" status="INCOMING">
<orgName>Technicolor</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
<author>
<name sortKey="Vincent, Emmanuel" sort="Vincent, Emmanuel" uniqKey="Vincent E" first="Emmanuel" last="Vincent">Emmanuel Vincent</name>
<affiliation wicri:level="1">
<hal:affiliation type="researchteam" xml:id="struct-420403" status="VALID">
<idno type="RNSR">201421147E</idno>
<orgName>Speech Modeling for Facilitating Oral-Based Communication</orgName>
<orgName type="acronym">MULTISPEECH</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/equipes/multispeech</ref>
</desc>
<listRelation>
<relation active="#struct-129671" type="direct"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-423086" type="direct"></relation>
<relation active="#struct-206040" type="indirect"></relation>
<relation active="#struct-413289" type="indirect"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-129671" type="direct">
<org type="laboratory" xml:id="struct-129671" status="VALID">
<idno type="RNSR">198618246Y</idno>
<orgName>INRIA Nancy - Grand Est</orgName>
<desc>
<address>
<addrLine>615 rue du Jardin Botanique 54600 Villers-lès-Nancy</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/nancy</ref>
</desc>
<listRelation>
<relation active="#struct-300009" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-300009" type="indirect">
<org type="institution" xml:id="struct-300009" status="VALID">
<orgName>Institut National de Recherche en Informatique et en Automatique</orgName>
<orgName type="acronym">Inria</orgName>
<desc>
<address>
<addrLine>Domaine de Voluceau, Rocquencourt - BP 105, 78153 Le Chesnay Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/en/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-423086" type="direct">
<org type="department" xml:id="struct-423086" status="VALID">
<orgName>Department of Natural Language Processing &amp; Knowledge Discovery</orgName>
<orgName type="acronym">LORIA - NLPKD</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr/la-recherche-en/departements/Knowledge-and-Language-Management</ref>
</desc>
<listRelation>
<relation active="#struct-206040" type="direct"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-413289" type="indirect"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-206040" type="indirect">
<org type="laboratory" xml:id="struct-206040" status="VALID">
<idno type="IdRef">067077927</idno>
<idno type="RNSR">198912571S</idno>
<idno type="IdUnivLorraine">[UL]RSI--</idno>
<orgName>Laboratoire Lorrain de Recherche en Informatique et ses Applications</orgName>
<orgName type="acronym">LORIA</orgName>
<date type="start">2012-01-01</date>
<desc>
<address>
<addrLine>Campus Scientifique BP 239 54506 Vandoeuvre-lès-Nancy Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr</ref>
</desc>
<listRelation>
<relation active="#struct-300009" type="direct"></relation>
<relation active="#struct-413289" type="direct"></relation>
<relation name="UMR7503" active="#struct-441569" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-413289" type="indirect">
<org type="institution" xml:id="struct-413289" status="VALID">
<idno type="IdRef">157040569</idno>
<idno type="IdUnivLorraine">[UL]100--</idno>
<orgName>Université de Lorraine</orgName>
<orgName type="acronym">UL</orgName>
<date type="start">2012-01-01</date>
<desc>
<address>
<addrLine>34 cours Léopold - CS 25233 - 54052 Nancy cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lorraine.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle name="UMR7503" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="ISNI">0000000122597504</idno>
<idno type="IdRef">02636817X</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Nancy</settlement>
<settlement type="city">Metz</settlement>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
</placeName>
<orgName type="university">Université de Lorraine</orgName>
</affiliation>
</author>
<author>
<name sortKey="Gribonval, Remi" sort="Gribonval, Remi" uniqKey="Gribonval R" first="Rémi" last="Gribonval">Rémi Gribonval</name>
<affiliation wicri:level="1">
<hal:affiliation type="researchteam" xml:id="struct-210613" status="VALID">
<idno type="RNSR">201321091Y</idno>
<orgName>Parcimonie et Nouveaux Algorithmes pour le Signal et la Modélisation Audio</orgName>
<orgName type="acronym">PANAMA</orgName>
<desc>
<address>
<addrLine>Campus de Beaulieu 35042 Rennes cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/equipes/panama</ref>
</desc>
<listRelation>
<relation active="#struct-419153" type="direct"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-419366" type="direct"></relation>
<relation active="#struct-105128" type="indirect"></relation>
<relation name="UMR6074" active="#struct-441569" type="indirect"></relation>
<relation active="#struct-172265" type="indirect"></relation>
<relation active="#struct-105160" type="indirect"></relation>
<relation active="#struct-247362" type="indirect"></relation>
<relation active="#struct-300812" type="indirect"></relation>
<relation name="- RENNES" active="#struct-301232" type="indirect"></relation>
<relation active="#struct-301262" type="indirect"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-419153" type="direct">
<org type="laboratory" xml:id="struct-419153" status="VALID">
<idno type="RNSR">198018249C</idno>
<orgName>Inria Rennes – Bretagne Atlantique</orgName>
<desc>
<address>
<addrLine>Campus de Beaulieu 35042 Rennes cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/centre/rennes</ref>
</desc>
<listRelation>
<relation active="#struct-300009" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-300009" type="indirect">
<org type="institution" xml:id="struct-300009" status="VALID">
<orgName>Institut National de Recherche en Informatique et en Automatique</orgName>
<orgName type="acronym">Inria</orgName>
<desc>
<address>
<addrLine>Domaine de Voluceau, Rocquencourt - BP 105, 78153 Le Chesnay Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/en/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-419366" type="direct">
<org type="department" xml:id="struct-419366" status="VALID">
<orgName>SIGNAUX ET IMAGES NUMÉRIQUES, ROBOTIQUE</orgName>
<orgName type="acronym">IRISA-D5</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">https://www.irisa.fr/fr/departements/d5-signaux-images-numeriques-robotique</ref>
</desc>
<listRelation>
<relation active="#struct-105128" type="direct"></relation>
<relation name="UMR6074" active="#struct-441569" type="indirect"></relation>
<relation active="#struct-172265" type="indirect"></relation>
<relation active="#struct-105160" type="indirect"></relation>
<relation active="#struct-247362" type="indirect"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-300812" type="indirect"></relation>
<relation name="- RENNES" active="#struct-301232" type="indirect"></relation>
<relation active="#struct-301262" type="indirect"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-105128" type="indirect">
<org type="laboratory" xml:id="struct-105128" status="VALID">
<idno type="IdRef">026386909</idno>
<idno type="RNSR">200012163A</idno>
<orgName>Institut de Recherche en Informatique et Systèmes Aléatoires</orgName>
<orgName type="acronym">IRISA</orgName>
<date type="start">2000</date>
<desc>
<address>
<addrLine>Avenue du général Leclerc, Campus de Beaulieu 35042 RENNES CEDEX</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.irisa.fr</ref>
</desc>
<listRelation>
<relation name="UMR6074" active="#struct-441569" type="direct"></relation>
<relation active="#struct-172265" type="direct"></relation>
<relation active="#struct-105160" type="direct"></relation>
<relation active="#struct-247362" type="direct"></relation>
<relation active="#struct-300009" type="direct"></relation>
<relation active="#struct-300812" type="direct"></relation>
<relation name="- RENNES" active="#struct-301232" type="direct"></relation>
<relation active="#struct-301262" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle name="UMR6074" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="ISNI">0000000122597504</idno>
<idno type="IdRef">02636817X</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-172265" type="indirect">
<org type="institution" xml:id="struct-172265" status="VALID">
<orgName>Université de Bretagne Sud</orgName>
<orgName type="acronym">UBS</orgName>
<desc>
<address>
<addrLine>BP 92116 - 56321 Lorient cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-ubs.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-105160" type="indirect">
<org type="institution" xml:id="struct-105160" status="VALID">
<orgName>Université de Rennes 1</orgName>
<orgName type="acronym">UR1</orgName>
<desc>
<address>
<addrLine>2 rue du Thabor - CS 46510 - 35065 Rennes cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-rennes1.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-247362" type="indirect">
<org type="institution" xml:id="struct-247362" status="VALID">
<orgName>École normale supérieure - Rennes</orgName>
<orgName type="acronym">ENS Rennes</orgName>
<desc>
<address>
<addrLine>Campus de Ker Lann - avenue Robert Schuman - 35170 Bruz</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.ens-rennes.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300812" type="indirect">
<org type="institution" xml:id="struct-300812" status="VALID">
<orgName>SUPELEC</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="- RENNES" active="#struct-301232" type="indirect">
<org type="institution" xml:id="struct-301232" status="VALID">
<orgName>Institut National des Sciences Appliquées</orgName>
<orgName type="acronym">INSA</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-301262" type="indirect">
<org type="institution" xml:id="struct-301262" status="VALID">
<orgName>Télécom Bretagne</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Lorient</settlement>
<region type="region" nuts="2">Région Bretagne</region>
</placeName>
<orgName type="university">Université de Bretagne-Sud</orgName>
<orgName type="institution" wicri:auto="newGroup">Université européenne de Bretagne</orgName>
<placeName>
<settlement type="city">Rennes</settlement>
<region type="region" nuts="2">Région Bretagne</region>
</placeName>
<orgName type="university">Université de Rennes 1</orgName>
<orgName type="institution" wicri:auto="newGroup">Université européenne de Bretagne</orgName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">HAL</idno>
<idno type="RBID">Hal:hal-01289060</idno>
<idno type="halId">hal-01289060</idno>
<idno type="halUri">https://hal.inria.fr/hal-01289060</idno>
<idno type="url">https://hal.inria.fr/hal-01289060</idno>
<idno type="doi">10.1016/j.sigpro.2011.10.007</idno>
<date when="2013">2013</date>
<idno type="wicri:Area/Hal/Corpus">004676</idno>
<idno type="wicri:Area/Hal/Curation">004676</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Spatial location priors for Gaussian model based reverberant audio source separation</title>
<author>
<name sortKey="Duong, Ngoc Q K" sort="Duong, Ngoc Q K" uniqKey="Duong N" first="Ngoc Q. K." last="Duong">Ngoc Q. K. Duong</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-118587" status="VALID">
<orgName>Technicolor R &amp; I [Cesson Sévigné]</orgName>
<desc>
<address>
<addrLine>975 avenue des champs blancs CS 17616 35576 Cesson Sévigné</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.technicolor.com/en/innovation/research-innovation/rennes</ref>
</desc>
<listRelation>
<relation active="#struct-367121" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-367121" type="direct">
<org type="institution" xml:id="struct-367121" status="INCOMING">
<orgName>Technicolor</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
<author>
<name sortKey="Vincent, Emmanuel" sort="Vincent, Emmanuel" uniqKey="Vincent E" first="Emmanuel" last="Vincent">Emmanuel Vincent</name>
<affiliation wicri:level="1">
<hal:affiliation type="researchteam" xml:id="struct-420403" status="VALID">
<idno type="RNSR">201421147E</idno>
<orgName>Speech Modeling for Facilitating Oral-Based Communication</orgName>
<orgName type="acronym">MULTISPEECH</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/equipes/multispeech</ref>
</desc>
<listRelation>
<relation active="#struct-129671" type="direct"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-423086" type="direct"></relation>
<relation active="#struct-206040" type="indirect"></relation>
<relation active="#struct-413289" type="indirect"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-129671" type="direct">
<org type="laboratory" xml:id="struct-129671" status="VALID">
<idno type="RNSR">198618246Y</idno>
<orgName>INRIA Nancy - Grand Est</orgName>
<desc>
<address>
<addrLine>615 rue du Jardin Botanique 54600 Villers-lès-Nancy</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/nancy</ref>
</desc>
<listRelation>
<relation active="#struct-300009" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-300009" type="indirect">
<org type="institution" xml:id="struct-300009" status="VALID">
<orgName>Institut National de Recherche en Informatique et en Automatique</orgName>
<orgName type="acronym">Inria</orgName>
<desc>
<address>
<addrLine>Domaine de Voluceau, Rocquencourt - BP 105, 78153 Le Chesnay Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/en/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-423086" type="direct">
<org type="department" xml:id="struct-423086" status="VALID">
<orgName>Department of Natural Language Processing &amp; Knowledge Discovery</orgName>
<orgName type="acronym">LORIA - NLPKD</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr/la-recherche-en/departements/Knowledge-and-Language-Management</ref>
</desc>
<listRelation>
<relation active="#struct-206040" type="direct"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-413289" type="indirect"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-206040" type="indirect">
<org type="laboratory" xml:id="struct-206040" status="VALID">
<idno type="IdRef">067077927</idno>
<idno type="RNSR">198912571S</idno>
<idno type="IdUnivLorraine">[UL]RSI--</idno>
<orgName>Laboratoire Lorrain de Recherche en Informatique et ses Applications</orgName>
<orgName type="acronym">LORIA</orgName>
<date type="start">2012-01-01</date>
<desc>
<address>
<addrLine>Campus Scientifique BP 239 54506 Vandoeuvre-lès-Nancy Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr</ref>
</desc>
<listRelation>
<relation active="#struct-300009" type="direct"></relation>
<relation active="#struct-413289" type="direct"></relation>
<relation name="UMR7503" active="#struct-441569" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-413289" type="indirect">
<org type="institution" xml:id="struct-413289" status="VALID">
<idno type="IdRef">157040569</idno>
<idno type="IdUnivLorraine">[UL]100--</idno>
<orgName>Université de Lorraine</orgName>
<orgName type="acronym">UL</orgName>
<date type="start">2012-01-01</date>
<desc>
<address>
<addrLine>34 cours Léopold - CS 25233 - 54052 Nancy cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lorraine.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle name="UMR7503" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="ISNI">0000000122597504</idno>
<idno type="IdRef">02636817X</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Nancy</settlement>
<settlement type="city">Metz</settlement>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
</placeName>
<orgName type="university">Université de Lorraine</orgName>
</affiliation>
</author>
<author>
<name sortKey="Gribonval, Remi" sort="Gribonval, Remi" uniqKey="Gribonval R" first="Rémi" last="Gribonval">Rémi Gribonval</name>
<affiliation wicri:level="1">
<hal:affiliation type="researchteam" xml:id="struct-210613" status="VALID">
<idno type="RNSR">201321091Y</idno>
<orgName>Parcimonie et Nouveaux Algorithmes pour le Signal et la Modélisation Audio</orgName>
<orgName type="acronym">PANAMA</orgName>
<desc>
<address>
<addrLine>Campus de Beaulieu 35042 Rennes cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/equipes/panama</ref>
</desc>
<listRelation>
<relation active="#struct-419153" type="direct"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-419366" type="direct"></relation>
<relation active="#struct-105128" type="indirect"></relation>
<relation name="UMR6074" active="#struct-441569" type="indirect"></relation>
<relation active="#struct-172265" type="indirect"></relation>
<relation active="#struct-105160" type="indirect"></relation>
<relation active="#struct-247362" type="indirect"></relation>
<relation active="#struct-300812" type="indirect"></relation>
<relation name="- RENNES" active="#struct-301232" type="indirect"></relation>
<relation active="#struct-301262" type="indirect"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-419153" type="direct">
<org type="laboratory" xml:id="struct-419153" status="VALID">
<idno type="RNSR">198018249C</idno>
<orgName>Inria Rennes – Bretagne Atlantique</orgName>
<desc>
<address>
<addrLine>Campus de Beaulieu 35042 Rennes cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/centre/rennes</ref>
</desc>
<listRelation>
<relation active="#struct-300009" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-300009" type="indirect">
<org type="institution" xml:id="struct-300009" status="VALID">
<orgName>Institut National de Recherche en Informatique et en Automatique</orgName>
<orgName type="acronym">Inria</orgName>
<desc>
<address>
<addrLine>Domaine de Voluceau, Rocquencourt - BP 105, 78153 Le Chesnay Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/en/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-419366" type="direct">
<org type="department" xml:id="struct-419366" status="VALID">
<orgName>SIGNAUX ET IMAGES NUMÉRIQUES, ROBOTIQUE</orgName>
<orgName type="acronym">IRISA-D5</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">https://www.irisa.fr/fr/departements/d5-signaux-images-numeriques-robotique</ref>
</desc>
<listRelation>
<relation active="#struct-105128" type="direct"></relation>
<relation name="UMR6074" active="#struct-441569" type="indirect"></relation>
<relation active="#struct-172265" type="indirect"></relation>
<relation active="#struct-105160" type="indirect"></relation>
<relation active="#struct-247362" type="indirect"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-300812" type="indirect"></relation>
<relation name="- RENNES" active="#struct-301232" type="indirect"></relation>
<relation active="#struct-301262" type="indirect"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-105128" type="indirect">
<org type="laboratory" xml:id="struct-105128" status="VALID">
<idno type="IdRef">026386909</idno>
<idno type="RNSR">200012163A</idno>
<orgName>Institut de Recherche en Informatique et Systèmes Aléatoires</orgName>
<orgName type="acronym">IRISA</orgName>
<date type="start">2000</date>
<desc>
<address>
<addrLine>Avenue du général Leclerc, Campus de Beaulieu 35042 RENNES CEDEX</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.irisa.fr</ref>
</desc>
<listRelation>
<relation name="UMR6074" active="#struct-441569" type="direct"></relation>
<relation active="#struct-172265" type="direct"></relation>
<relation active="#struct-105160" type="direct"></relation>
<relation active="#struct-247362" type="direct"></relation>
<relation active="#struct-300009" type="direct"></relation>
<relation active="#struct-300812" type="direct"></relation>
<relation name="- RENNES" active="#struct-301232" type="direct"></relation>
<relation active="#struct-301262" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle name="UMR6074" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="ISNI">0000000122597504</idno>
<idno type="IdRef">02636817X</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-172265" type="indirect">
<org type="institution" xml:id="struct-172265" status="VALID">
<orgName>Université de Bretagne Sud</orgName>
<orgName type="acronym">UBS</orgName>
<desc>
<address>
<addrLine>BP 92116 - 56321 Lorient cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-ubs.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-105160" type="indirect">
<org type="institution" xml:id="struct-105160" status="VALID">
<orgName>Université de Rennes 1</orgName>
<orgName type="acronym">UR1</orgName>
<desc>
<address>
<addrLine>2 rue du Thabor - CS 46510 - 35065 Rennes cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-rennes1.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-247362" type="indirect">
<org type="institution" xml:id="struct-247362" status="VALID">
<orgName>École normale supérieure - Rennes</orgName>
<orgName type="acronym">ENS Rennes</orgName>
<desc>
<address>
<addrLine>Campus de Ker Lann - avenue Robert Schuman - 35170 Bruz</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.ens-rennes.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300812" type="indirect">
<org type="institution" xml:id="struct-300812" status="VALID">
<orgName>SUPELEC</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="- RENNES" active="#struct-301232" type="indirect">
<org type="institution" xml:id="struct-301232" status="VALID">
<orgName>Institut National des Sciences Appliquées</orgName>
<orgName type="acronym">INSA</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-301262" type="indirect">
<org type="institution" xml:id="struct-301262" status="VALID">
<orgName>Télécom Bretagne</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Lorient</settlement>
<region type="region" nuts="2">Région Bretagne</region>
</placeName>
<orgName type="university">Université de Bretagne-Sud</orgName>
<orgName type="institution" wicri:auto="newGroup">Université européenne de Bretagne</orgName>
<placeName>
<settlement type="city">Rennes</settlement>
<region type="region" nuts="2">Région Bretagne</region>
</placeName>
<orgName type="university">Université de Rennes 1</orgName>
<orgName type="institution" wicri:auto="newGroup">Université européenne de Bretagne</orgName>
</affiliation>
</author>
</analytic>
<idno type="DOI">10.1016/j.sigpro.2011.10.007</idno>
<series>
<title level="j">EURASIP Journal on Advances in Signal Processing</title>
<idno type="ISSN">1687-6172</idno>
<imprint>
<date type="datePub">2013</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="mix" xml:lang="en">
<term>Audio source separation</term>
<term>EM algorithm</term>
<term>Gaussian</term>
<term>Inverse-Wishart</term>
<term>Probabilistic priors</term>
<term>Spatial covariance</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">We consider the Gaussian framework for reverberant audio source separation, where the sources are modeled in the time-frequency domain by their short-term power spectra and their spatial covariance matrices. We propose two alternative probabilistic priors over the spatial covariance matrices which are consistent with the theory of statistical room acoustics and we derive expectation-maximization algorithms for maximum a posteriori (MAP) estimation. We argue that these algorithms provide a statistically principled solution to the permutation problem and to the risk of overfitting resulting from conventional maximum likelihood (ML) estimation. We show experimentally that in a semi-informed scenario where the source positions and certain room characteristics are known, the MAP algorithms outperform their ML counterparts. This opens the way to rigorous statistical treatment of this family of models in other scenarios in the future.</div>
</front>
</TEI>
<hal api="V3">
<titleStmt>
<title xml:lang="en">Spatial location priors for Gaussian model based reverberant audio source separation</title>
<author role="aut">
<persName>
<forename type="first">Ngoc Q. K.</forename>
<surname>Duong</surname>
</persName>
<email></email>
<idno type="halauthor">757380</idno>
<affiliation ref="#struct-118587"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Emmanuel</forename>
<surname>Vincent</surname>
</persName>
<email>emmanuel.vincent@inria.fr</email>
<idno type="idhal">emmanuelv</idno>
<idno type="halauthor">176768</idno>
<orgName ref="#struct-300009"></orgName>
<affiliation ref="#struct-420403"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Rémi</forename>
<surname>Gribonval</surname>
</persName>
<email>Remi.Gribonval@inria.fr</email>
<ptr type="url" target="http://people.irisa.fr/Remi.Gribonval/"></ptr>
<idno type="idhal">remi-gribonval</idno>
<idno type="halauthor">888631</idno>
<orgName ref="#struct-300009"></orgName>
<affiliation ref="#struct-210613"></affiliation>
</author>
<editor role="depositor">
<persName>
<forename>Ngoc</forename>
<surname>Duong</surname>
</persName>
<email>quang-khanh-ngoc.duong@technicolor.com</email>
</editor>
</titleStmt>
<editionStmt>
<edition n="v1" type="current">
<date type="whenSubmitted">2016-03-16 09:18:39</date>
<date type="whenModified">2016-04-01 10:50:55</date>
<date type="whenReleased">2016-03-25 11:17:40</date>
<date type="whenProduced">2013</date>
<date type="whenEndEmbargoed">2016-03-16</date>
<ref type="file" target="https://hal.inria.fr/hal-01289060/document">
<date notBefore="2016-03-16"></date>
</ref>
<ref type="file" subtype="author" n="1" target="https://hal.inria.fr/hal-01289060/file/published%20version.pdf">
<date notBefore="2016-03-16"></date>
</ref>
</edition>
<respStmt>
<resp>contributor</resp>
<name key="145601">
<persName>
<forename>Ngoc</forename>
<surname>Duong</surname>
</persName>
<email>quang-khanh-ngoc.duong@technicolor.com</email>
</name>
</respStmt>
</editionStmt>
<publicationStmt>
<distributor>CCSD</distributor>
<idno type="halId">hal-01289060</idno>
<idno type="halUri">https://hal.inria.fr/hal-01289060</idno>
<idno type="halBibtex">duong:hal-01289060</idno>
<idno type="halRefHtml">EURASIP Journal on Advances in Signal Processing, SpringerOpen, 2013, &lt;10.1016/j.sigpro.2011.10.007&gt;</idno>
<idno type="halRef">EURASIP Journal on Advances in Signal Processing, SpringerOpen, 2013, &lt;10.1016/j.sigpro.2011.10.007&gt;</idno>
</publicationStmt>
<seriesStmt>
<idno type="stamp" n="CNRS">CNRS - Centre national de la recherche scientifique</idno>
<idno type="stamp" n="UNIV-UBS">Université de Bretagne Sud</idno>
<idno type="stamp" n="INSTITUT-TELECOM">Institut Télécom</idno>
<idno type="stamp" n="IRISA_SET">IRISA_SET</idno>
<idno type="stamp" n="INRIA-LORRAINE">INRIA Nancy - Grand Est</idno>
<idno type="stamp" n="INRIA-NANCY-GRAND-EST">INRIA Nancy - Grand Est</idno>
<idno type="stamp" n="UNIV-RENNES1">Université de Rennes 1</idno>
<idno type="stamp" n="IRISA">Irisa</idno>
<idno type="stamp" n="INRIA_TEST">INRIA - Institut National de Recherche en Informatique et en Automatique</idno>
<idno type="stamp" n="UNIV-LORRAINE">Université de Lorraine</idno>
<idno type="stamp" n="INRIA2">INRIA 2</idno>
<idno type="stamp" n="LORIA2">Publications du LORIA</idno>
<idno type="stamp" n="INRIA">INRIA - Institut National de Recherche en Informatique et en Automatique</idno>
</seriesStmt>
<notesStmt>
<note type="audience" n="2">International</note>
<note type="popular" n="0">No</note>
<note type="peer" n="1">Yes</note>
</notesStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Spatial location priors for Gaussian model based reverberant audio source separation</title>
<author role="aut">
<persName>
<forename type="first">Ngoc Q. K.</forename>
<surname>Duong</surname>
</persName>
<idno type="halAuthorId">757380</idno>
<affiliation ref="#struct-118587"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Emmanuel</forename>
<surname>Vincent</surname>
</persName>
<email>emmanuel.vincent@inria.fr</email>
<idno type="idHal">emmanuelv</idno>
<idno type="halAuthorId">176768</idno>
<orgName ref="#struct-300009"></orgName>
<affiliation ref="#struct-420403"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Rémi</forename>
<surname>Gribonval</surname>
</persName>
<email>Remi.Gribonval@inria.fr</email>
<ptr type="url" target="http://people.irisa.fr/Remi.Gribonval/"></ptr>
<idno type="idHal">remi-gribonval</idno>
<idno type="halAuthorId">888631</idno>
<orgName ref="#struct-300009"></orgName>
<affiliation ref="#struct-210613"></affiliation>
</author>
</analytic>
<monogr>
<idno type="halJournalId" status="VALID">28591</idno>
<idno type="issn">1687-6172</idno>
<idno type="eissn">1687-6180</idno>
<title level="j">EURASIP Journal on Advances in Signal Processing</title>
<imprint>
<publisher>SpringerOpen</publisher>
<date type="datePub">2013</date>
</imprint>
</monogr>
<idno type="doi">10.1016/j.sigpro.2011.10.007</idno>
</biblStruct>
</sourceDesc>
<profileDesc>
<langUsage>
<language ident="en">English</language>
</langUsage>
<textClass>
<keywords scheme="author">
<term xml:lang="en">Audio source separation</term>
<term xml:lang="en">Spatial covariance</term>
<term xml:lang="en">EM algorithm</term>
<term xml:lang="en">Probabilistic priors</term>
<term xml:lang="en">Inverse-Wishart</term>
<term xml:lang="en">Gaussian</term>
</keywords>
<classCode scheme="halDomain" n="stat.ml">Statistics [stat]/Machine Learning [stat.ML]</classCode>
<classCode scheme="halDomain" n="info.info-hc">Computer Science [cs]/Human-Computer Interaction [cs.HC]</classCode>
<classCode scheme="halDomain" n="info.info-ts">Computer Science [cs]/Signal and Image Processing</classCode>
<classCode scheme="halTypology" n="ART">Journal articles</classCode>
</textClass>
<abstract xml:lang="en">We consider the Gaussian framework for reverberant audio source separation, where the sources are modeled in the time-frequency domain by their short-term power spectra and their spatial covariance matrices. We propose two alternative probabilistic priors over the spatial covariance matrices which are consistent with the theory of statistical room acoustics and we derive expectation-maximization algorithms for maximum a posteriori (MAP) estimation. We argue that these algorithms provide a statistically principled solution to the permutation problem and to the risk of overfitting resulting from conventional maximum likelihood (ML) estimation. We show experimentally that in a semi-informed scenario where the source positions and certain room characteristics are known, the MAP algorithms outperform their ML counterparts. This opens the way to rigorous statistical treatment of this family of models in other scenarios in the future.</abstract>
</profileDesc>
</hal>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Hal/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 004676 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Hal/Curation/biblio.hfd -nk 004676 | SxmlIndent | more
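
Once the record has been extracted with one of the commands above, it can be read with any XML parser. The following is only a minimal sketch in Python, not part of Dilib: the `SAMPLE` string is a hypothetical, shortened excerpt that mirrors the `<hal>` block of the record shown earlier, and the helper name `summarize` is an assumption made for illustration. In practice the string would be the saved output of `HfdSelect`. A strict parser will reject the input if any field contains unescaped angle brackets.

```python
import xml.etree.ElementTree as ET

# Hypothetical, shortened excerpt mirroring the <hal> block of the record;
# in practice this would be the saved output of the HfdSelect command.
SAMPLE = """<record>
<hal api="V3">
<titleStmt>
<title xml:lang="en">Spatial location priors for Gaussian model based reverberant audio source separation</title>
<author role="aut">
<persName>
<forename type="first">Ngoc Q. K.</forename>
<surname>Duong</surname>
</persName>
</author>
<author role="aut">
<persName>
<forename type="first">Emmanuel</forename>
<surname>Vincent</surname>
</persName>
</author>
</titleStmt>
</hal>
</record>"""


def summarize(record_xml):
    """Return (title, [author names]) read from the <hal>/<titleStmt> block."""
    root = ET.fromstring(record_xml)      # root element is <record>
    stmt = root.find("./hal/titleStmt")   # the title statement of the HAL part
    title = stmt.findtext("title")
    # Only <author> children are listed; <editor> entries are ignored.
    authors = [
        f"{a.findtext('persName/forename')} {a.findtext('persName/surname')}"
        for a in stmt.findall("author")
    ]
    return title, authors


if __name__ == "__main__":
    title, authors = summarize(SAMPLE)
    print(title)
    print("; ".join(authors))
```

The same path expressions should apply to the full record, since its `<hal>` element sits directly under `<record>`.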

To link to this page from within the Wicri network

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    Hal
   |étape=   Curation
   |type=    RBID
   |clé=     Hal:hal-01289060
   |texte=   Spatial location priors for Gaussian model based reverberant audio source separation
}}

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022