Exploration server for computer science research in Lorraine

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

A simple user interface system for recovering patterns repeating in time and frequency in mixtures of sounds

Internal identifier: 000964 (Hal/Corpus); previous: 000963; next: 000965

Authors: Zafar Rafii; Antoine Liutkus; Bryan Pardo

Source :

RBID : Hal:hal-01116689

English descriptors

Abstract

Repetition is a fundamental element in generating and perceiving structure in audio. Especially in music, structures tend to be composed of patterns that repeat through time (e.g., rhythmic elements in a musical accompaniment), and also frequency (e.g., different notes of the same instrument). The auditory system has the remarkable ability to parse such patterns by identifying repetitions within the audio mixture. On this basis, we propose a simple user interface system for recovering patterns repeating in time and frequency in mixtures of sounds. A user selects a region in the log-frequency spectrogram of an audio recording from which she/he wishes to recover a repeating pattern masked by an undesired element (e.g., a note masked by a cough). The selected region is then cross-correlated with the spectrogram to identify similar regions where the underlying pattern repeats. The identified regions are finally averaged over their repetitions and the repeating pattern is recovered.
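
The matching and averaging steps described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes the log-frequency spectrogram is already available as a 2-D NumPy magnitude array (frequency by time), uses a brute-force normalized 2-D cross-correlation, and takes the median over the best non-overlapping matches (the median is suggested by the "median filter" keyword; the abstract itself only says "averaged"). The function names and the `n_matches` parameter are hypothetical.

```python
import numpy as np


def normalized_xcorr2d(image, template):
    """Normalized 2-D cross-correlation of `template` at every valid offset."""
    th, tw = template.shape
    ih, iw = image.shape
    t = template - template.mean()
    t_norm = np.sqrt((t ** 2).sum())
    out = np.zeros((ih - th + 1, iw - tw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            patch = image[i:i + th, j:j + tw]
            p = patch - patch.mean()
            denom = t_norm * np.sqrt((p ** 2).sum())
            # Flat patches have no defined correlation; score them as 0.
            out[i, j] = (t * p).sum() / denom if denom > 0 else 0.0
    return out


def recover_pattern(spectrogram, top, left, height, width, n_matches=5):
    """Estimate the repeating pattern hidden in a user-selected region.

    The selected rectangle is correlated with the whole spectrogram, the
    best-scoring non-overlapping rectangles are kept, and their element-wise
    median is returned together with the chosen offsets.
    """
    region = spectrogram[top:top + height, left:left + width]
    scores = normalized_xcorr2d(spectrogram, region)
    order = np.argsort(scores, axis=None)[::-1]  # best scores first
    picked = []
    for flat in order:
        i, j = (int(v) for v in np.unravel_index(flat, scores.shape))
        # Keep an offset only if its rectangle overlaps none already kept.
        if all(abs(i - pi) >= height or abs(j - pj) >= width
               for pi, pj in picked):
            picked.append((i, j))
        if len(picked) == n_matches:
            break
    stack = np.stack([spectrogram[i:i + height, j:j + width]
                      for i, j in picked])
    return np.median(stack, axis=0), picked
```

Because the selected region always matches itself best, it is the first rectangle kept; the median then suppresses whatever appears in only a minority of the matched rectangles, which is how an isolated masking event could be attenuated. Turning the recovered magnitudes back into audio (inverting the transform) is outside this sketch.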

Url:

Links to Exploration step

Hal:hal-01116689

The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">A simple user interface system for recovering patterns repeating in time and frequency in mixtures of sounds</title>
<author>
<name sortKey="Rafii, Zafar" sort="Rafii, Zafar" uniqKey="Rafii Z" first="Zafar" last="Rafii">Zafar Rafii</name>
<affiliation>
<hal:affiliation type="laboratory" xml:id="struct-412268" status="INCOMING">
<orgName>Gracenote, Media Technology Lab</orgName>
<desc>
<address>
<country key="US"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-412267" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-412267" type="direct">
<org type="institution" xml:id="struct-412267" status="INCOMING">
<orgName>Gracenote</orgName>
<desc>
<address>
<country key="US"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Liutkus, Antoine" sort="Liutkus, Antoine" uniqKey="Liutkus A" first="Antoine" last="Liutkus">Antoine Liutkus</name>
<affiliation>
<hal:affiliation type="researchteam" xml:id="struct-2359" status="OLD">
<idno type="RNSR">200118295L</idno>
<orgName>Analysis, perception and recognition of speech</orgName>
<orgName type="acronym">PAROLE</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/equipes/parole</ref>
</desc>
<listRelation>
<relation active="#struct-160" type="direct"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-300291" type="indirect"></relation>
<relation active="#struct-300292" type="indirect"></relation>
<relation active="#struct-300293" type="indirect"></relation>
<relation active="#struct-2496" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-160" type="direct">
<org type="laboratory" xml:id="struct-160" status="OLD">
<orgName>Laboratoire Lorrain de Recherche en Informatique et ses Applications</orgName>
<orgName type="acronym">LORIA</orgName>
<desc>
<address>
<addrLine>Campus Scientifique BP 239 54506 Vandoeuvre-lès-Nancy Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr</ref>
</desc>
<listRelation>
<relation name="UMR7503" active="#struct-441569" type="direct"></relation>
<relation active="#struct-300009" type="direct"></relation>
<relation active="#struct-300291" type="direct"></relation>
<relation active="#struct-300292" type="direct"></relation>
<relation active="#struct-300293" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle name="UMR7503" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="ISNI">0000000122597504</idno>
<idno type="IdRef">02636817X</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300009" type="indirect">
<org type="institution" xml:id="struct-300009" status="VALID">
<orgName>Institut National de Recherche en Informatique et en Automatique</orgName>
<orgName type="acronym">Inria</orgName>
<desc>
<address>
<addrLine>Domaine de Voluceau, Rocquencourt - BP 105, 78153 Le Chesnay Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/en/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300291" type="indirect">
<org type="institution" xml:id="struct-300291" status="OLD">
<orgName>Université Henri Poincaré - Nancy 1</orgName>
<orgName type="acronym">UHP</orgName>
<date type="end">2011-12-31</date>
<desc>
<address>
<addrLine>24-30 rue Lionnois, BP 60120, 54 003 NANCY cedex, France</addrLine>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300292" type="indirect">
<org type="institution" xml:id="struct-300292" status="OLD">
<orgName>Université Nancy 2</orgName>
<date type="end">2011-12-31</date>
<desc>
<address>
<addrLine>91 avenue de la Libération, BP 454, 54001 Nancy cedex</addrLine>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300293" type="indirect">
<org type="institution" xml:id="struct-300293" status="OLD">
<orgName>Institut National Polytechnique de Lorraine</orgName>
<orgName type="acronym">INPL</orgName>
<date type="end">2011-12-31</date>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-2496" type="direct">
<org type="laboratory" xml:id="struct-2496" status="OLD">
<orgName>INRIA Lorraine</orgName>
<desc>
<address>
<addrLine>615 rue du Jardin Botanique 54600 Villers-lès-Nancy</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/centre-de-recherche-inria/nancy-grand-est</ref>
</desc>
<listRelation>
<relation active="#struct-300009" type="direct"></relation>
</listRelation>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Pardo, Bryan" sort="Pardo, Bryan" uniqKey="Pardo B" first="Bryan" last="Pardo">Bryan Pardo</name>
<affiliation>
<hal:affiliation type="institution" xml:id="struct-133711" status="VALID">
<orgName>Northwestern University [Evanston]</orgName>
<desc>
<address>
<addrLine>633 Clark Street Evanston, IL 60208 Evanston</addrLine>
<country key="US"></country>
</address>
<ref type="url">http://www.northwestern.edu/</ref>
</desc>
</hal:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">HAL</idno>
<idno type="RBID">Hal:hal-01116689</idno>
<idno type="halId">hal-01116689</idno>
<idno type="halUri">https://hal.inria.fr/hal-01116689</idno>
<idno type="url">https://hal.inria.fr/hal-01116689</idno>
<date when="2015-04-19">2015-04-19</date>
<idno type="wicri:Area/Hal/Corpus">000964</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">A simple user interface system for recovering patterns repeating in time and frequency in mixtures of sounds</title>
<author>
<name sortKey="Rafii, Zafar" sort="Rafii, Zafar" uniqKey="Rafii Z" first="Zafar" last="Rafii">Zafar Rafii</name>
<affiliation>
<hal:affiliation type="laboratory" xml:id="struct-412268" status="INCOMING">
<orgName>Gracenote, Media Technology Lab</orgName>
<desc>
<address>
<country key="US"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-412267" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-412267" type="direct">
<org type="institution" xml:id="struct-412267" status="INCOMING">
<orgName>Gracenote</orgName>
<desc>
<address>
<country key="US"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Liutkus, Antoine" sort="Liutkus, Antoine" uniqKey="Liutkus A" first="Antoine" last="Liutkus">Antoine Liutkus</name>
<affiliation>
<hal:affiliation type="researchteam" xml:id="struct-2359" status="OLD">
<idno type="RNSR">200118295L</idno>
<orgName>Analysis, perception and recognition of speech</orgName>
<orgName type="acronym">PAROLE</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/equipes/parole</ref>
</desc>
<listRelation>
<relation active="#struct-160" type="direct"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-300291" type="indirect"></relation>
<relation active="#struct-300292" type="indirect"></relation>
<relation active="#struct-300293" type="indirect"></relation>
<relation active="#struct-2496" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-160" type="direct">
<org type="laboratory" xml:id="struct-160" status="OLD">
<orgName>Laboratoire Lorrain de Recherche en Informatique et ses Applications</orgName>
<orgName type="acronym">LORIA</orgName>
<desc>
<address>
<addrLine>Campus Scientifique BP 239 54506 Vandoeuvre-lès-Nancy Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr</ref>
</desc>
<listRelation>
<relation name="UMR7503" active="#struct-441569" type="direct"></relation>
<relation active="#struct-300009" type="direct"></relation>
<relation active="#struct-300291" type="direct"></relation>
<relation active="#struct-300292" type="direct"></relation>
<relation active="#struct-300293" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle name="UMR7503" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="ISNI">0000000122597504</idno>
<idno type="IdRef">02636817X</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300009" type="indirect">
<org type="institution" xml:id="struct-300009" status="VALID">
<orgName>Institut National de Recherche en Informatique et en Automatique</orgName>
<orgName type="acronym">Inria</orgName>
<desc>
<address>
<addrLine>Domaine de Voluceau, Rocquencourt - BP 105, 78153 Le Chesnay Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/en/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300291" type="indirect">
<org type="institution" xml:id="struct-300291" status="OLD">
<orgName>Université Henri Poincaré - Nancy 1</orgName>
<orgName type="acronym">UHP</orgName>
<date type="end">2011-12-31</date>
<desc>
<address>
<addrLine>24-30 rue Lionnois, BP 60120, 54 003 NANCY cedex, France</addrLine>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300292" type="indirect">
<org type="institution" xml:id="struct-300292" status="OLD">
<orgName>Université Nancy 2</orgName>
<date type="end">2011-12-31</date>
<desc>
<address>
<addrLine>91 avenue de la Libération, BP 454, 54001 Nancy cedex</addrLine>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300293" type="indirect">
<org type="institution" xml:id="struct-300293" status="OLD">
<orgName>Institut National Polytechnique de Lorraine</orgName>
<orgName type="acronym">INPL</orgName>
<date type="end">2011-12-31</date>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-2496" type="direct">
<org type="laboratory" xml:id="struct-2496" status="OLD">
<orgName>INRIA Lorraine</orgName>
<desc>
<address>
<addrLine>615 rue du Jardin Botanique 54600 Villers-lès-Nancy</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/centre-de-recherche-inria/nancy-grand-est</ref>
</desc>
<listRelation>
<relation active="#struct-300009" type="direct"></relation>
</listRelation>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Pardo, Bryan" sort="Pardo, Bryan" uniqKey="Pardo B" first="Bryan" last="Pardo">Bryan Pardo</name>
<affiliation>
<hal:affiliation type="institution" xml:id="struct-133711" status="VALID">
<orgName>Northwestern University [Evanston]</orgName>
<desc>
<address>
<addrLine>633 Clark Street Evanston, IL 60208 Evanston</addrLine>
<country key="US"></country>
</address>
<ref type="url">http://www.northwestern.edu/</ref>
</desc>
</hal:affiliation>
</affiliation>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="mix" xml:lang="en">
<term>Constant Q Transform</term>
<term>audio source separation</term>
<term>median filter</term>
<term>normalized 2-d cross-correlation</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Repetition is a fundamental element in generating and perceiving structure in audio. Especially in music, structures tend to be composed of patterns that repeat through time (e.g., rhythmic elements in a musical accompaniment), and also frequency (e.g., different notes of the same instrument). The auditory system has the remarkable ability to parse such patterns by identifying repetitions within the audio mixture. On this basis, we propose a simple user interface system for recovering patterns repeating in time and frequency in mixtures of sounds. A user selects a region in the log-frequency spectrogram of an audio recording from which she/he wishes to recover a repeating pattern masked by an undesired element (e.g., a note masked by a cough). The selected region is then cross-correlated with the spectrogram to identify similar regions where the underlying pattern repeats. The identified regions are finally averaged over their repetitions and the repeating pattern is recovered.</div>
</front>
</TEI>
<hal api="V3">
<titleStmt>
<title xml:lang="en">A simple user interface system for recovering patterns repeating in time and frequency in mixtures of sounds</title>
<author role="aut">
<persName>
<forename type="first">Zafar</forename>
<surname>Rafii</surname>
</persName>
<email></email>
<idno type="halauthor">983509</idno>
<affiliation ref="#struct-412268"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Antoine</forename>
<surname>Liutkus</surname>
</persName>
<email></email>
<idno type="idhal">antoine-liutkus</idno>
<idno type="halauthor">628142</idno>
<affiliation ref="#struct-2359"></affiliation>
<affiliation ref="#struct-420403"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Bryan</forename>
<surname>Pardo</surname>
</persName>
<email></email>
<idno type="halauthor">983510</idno>
<affiliation ref="#struct-133711"></affiliation>
</author>
<editor role="depositor">
<persName>
<forename>Antoine</forename>
<surname>Liutkus</surname>
</persName>
<email>antoine@liutkus.fr</email>
</editor>
</titleStmt>
<editionStmt>
<edition n="v1" type="current">
<date type="whenSubmitted">2015-02-13 23:46:51</date>
<date type="whenWritten">2015-01-14</date>
<date type="whenModified">2015-11-13 01:08:39</date>
<date type="whenReleased">2015-02-17 13:43:11</date>
<date type="whenProduced">2015-04-19</date>
<date type="whenEndEmbargoed">2015-02-13</date>
<ref type="file" target="https://hal.inria.fr/hal-01116689/document">
<date notBefore="2015-02-13"></date>
</ref>
<ref type="file" subtype="author" n="1" target="https://hal.inria.fr/hal-01116689/file/Rafii-Liutkus-Pardo%20-%20A%20Simple%20User%20Interface%20System%20for%20Recovering%20Patterns%20Repeating%20in%20Time%20and%20Frequency%20in%20Mixtures%20of%20Sounds%20-%20ICASSP%202015.pdf">
<date notBefore="2015-02-13"></date>
</ref>
</edition>
<respStmt>
<resp>contributor</resp>
<name key="153135">
<persName>
<forename>Antoine</forename>
<surname>Liutkus</surname>
</persName>
<email>antoine@liutkus.fr</email>
</name>
</respStmt>
</editionStmt>
<publicationStmt>
<distributor>CCSD</distributor>
<idno type="halId">hal-01116689</idno>
<idno type="halUri">https://hal.inria.fr/hal-01116689</idno>
<idno type="halBibtex">rafii:hal-01116689</idno>
<idno type="halRefHtml">IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2015, Brisbane, Australia. 2015</idno>
<idno type="halRef">IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2015, Brisbane, Australia. 2015</idno>
</publicationStmt>
<seriesStmt>
<idno type="stamp" n="CNRS">CNRS - Centre national de la recherche scientifique</idno>
<idno type="stamp" n="INRIA">INRIA - Institut National de Recherche en Informatique et en Automatique</idno>
<idno type="stamp" n="INPL">Institut National Polytechnique de Lorraine</idno>
<idno type="stamp" n="INRIA-LORRAINE">INRIA Nancy - Grand Est</idno>
<idno type="stamp" n="LORIA2">Publications du LORIA</idno>
<idno type="stamp" n="INRIA-NANCY-GRAND-EST">INRIA Nancy - Grand Est</idno>
<idno type="stamp" n="LORIA-TALC" p="LORIA">Traitement automatique des langues et des connaissances</idno>
<idno type="stamp" n="INRIA2">INRIA 2</idno>
<idno type="stamp" n="LABO-LORIA-SET" p="LORIA">LABO-LORIA-SET</idno>
<idno type="stamp" n="LORIA">LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications</idno>
<idno type="stamp" n="INRIA_TEST">INRIA - Institut National de Recherche en Informatique et en Automatique</idno>
<idno type="stamp" n="UNIV-LORRAINE">Université de Lorraine</idno>
</seriesStmt>
<notesStmt>
<note type="audience" n="2">International</note>
<note type="invited" n="0">No</note>
<note type="popular" n="0">No</note>
<note type="peer" n="1">Yes</note>
<note type="proceedings" n="1">Yes</note>
</notesStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">A simple user interface system for recovering patterns repeating in time and frequency in mixtures of sounds</title>
<author role="aut">
<persName>
<forename type="first">Zafar</forename>
<surname>Rafii</surname>
</persName>
<idno type="halAuthorId">983509</idno>
<affiliation ref="#struct-412268"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Antoine</forename>
<surname>Liutkus</surname>
</persName>
<idno type="idHal">antoine-liutkus</idno>
<idno type="halAuthorId">628142</idno>
<affiliation ref="#struct-2359"></affiliation>
<affiliation ref="#struct-420403"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Bryan</forename>
<surname>Pardo</surname>
</persName>
<idno type="halAuthorId">983510</idno>
<affiliation ref="#struct-133711"></affiliation>
</author>
</analytic>
<monogr>
<meeting>
<title>IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)</title>
<date type="start">2015-04-19</date>
<date type="end">2015-04-24</date>
<settlement>Brisbane</settlement>
<country key="AU">Australia</country>
</meeting>
<imprint>
<date type="datePub">2015-02-13</date>
</imprint>
</monogr>
</biblStruct>
</sourceDesc>
<profileDesc>
<langUsage>
<language ident="en">English</language>
</langUsage>
<textClass>
<keywords scheme="author">
<term xml:lang="en">audio source separation</term>
<term xml:lang="en">median filter</term>
<term xml:lang="en">Constant Q Transform</term>
<term xml:lang="en">normalized 2-d cross-correlation</term>
</keywords>
<classCode scheme="halDomain" n="info.info-ts">Computer Science [cs]/Signal and Image Processing</classCode>
<classCode scheme="halDomain" n="info.info-ir">Computer Science [cs]/Information Retrieval [cs.IR]</classCode>
<classCode scheme="halTypology" n="COMM">Conference papers</classCode>
</textClass>
<abstract xml:lang="en">Repetition is a fundamental element in generating and perceiving structure in audio. Especially in music, structures tend to be composed of patterns that repeat through time (e.g., rhythmic elements in a musical accompaniment), and also frequency (e.g., different notes of the same instrument). The auditory system has the remarkable ability to parse such patterns by identifying repetitions within the audio mixture. On this basis, we propose a simple user interface system for recovering patterns repeating in time and frequency in mixtures of sounds. A user selects a region in the log-frequency spectrogram of an audio recording from which she/he wishes to recover a repeating pattern masked by an undesired element (e.g., a note masked by a cough). The selected region is then cross-correlated with the spectrogram to identify similar regions where the underlying pattern repeats. The identified regions are finally averaged over their repetitions and the repeating pattern is recovered.</abstract>
</profileDesc>
</hal>
</record>
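
As an alternative to the Dilib tools, the bibliographic fields of such a record can be read with a general-purpose XML parser. The sketch below is a hedged illustration only: it parses a trimmed fragment of the record above rather than the full listing, because the full record uses the `hal:` prefix without a namespace declaration visible here, which a namespace-aware parser such as Python's `xml.etree.ElementTree` would be expected to reject. The `summarize` helper and the `FRAGMENT` constant are hypothetical names.

```python
import xml.etree.ElementTree as ET

# Trimmed copy of the record above (authors' affiliations omitted).
FRAGMENT = """
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">A simple user interface system for recovering patterns repeating in time and frequency in mixtures of sounds</title>
<author><name sortKey="Rafii, Zafar" first="Zafar" last="Rafii">Zafar Rafii</name></author>
<author><name sortKey="Liutkus, Antoine" first="Antoine" last="Liutkus">Antoine Liutkus</name></author>
<author><name sortKey="Pardo, Bryan" first="Bryan" last="Pardo">Bryan Pardo</name></author>
</titleStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="mix" xml:lang="en">
<term>Constant Q Transform</term>
<term>audio source separation</term>
<term>median filter</term>
<term>normalized 2-d cross-correlation</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
</TEI>
"""


def summarize(xml_text):
    """Return the title, author names and keywords found in a TEI fragment."""
    root = ET.fromstring(xml_text.strip())
    return {
        "title": root.findtext(".//titleStmt/title"),
        "authors": [n.text for n in root.findall(".//titleStmt/author/name")],
        "keywords": [t.text for t in root.findall(".//keywords/term")],
    }


if __name__ == "__main__":
    for field, value in summarize(FRAGMENT).items():
        print(field, ":", value)
```

To apply the same helper to the complete record, the `hal:` prefix would first have to be bound to a namespace (or stripped), a step not shown here.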

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Hal/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000964 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Hal/Corpus/biblio.hfd -nk 000964 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    Hal
   |étape=   Corpus
   |type=    RBID
   |clé=     Hal:hal-01116689
   |texte=   A simple user interface system for recovering patterns repeating in time and frequency in mixtures of sounds
}}

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022