Extracting Predominant Local Pulse Information From Music Recordings
Internal identifier: 000007 (PascalFrancis/Corpus)
Authors: Peter Grosche; Meinard Müller
Source:
- IEEE transactions on audio, speech, and language processing [ 1558-7916 ] ; 2011.
French descriptors
- Pascal (Inist)
English descriptors
- KwdEn :
Abstract
The extraction of tempo and beat information from music recordings constitutes a challenging task, in particular for non-percussive music with soft note onsets and time-varying tempo. In this paper, we introduce a novel mid-level representation that captures musically meaningful local pulse information even for the case of complex music. Our main idea is to derive for each time position a sinusoidal kernel that best explains the local periodic nature of a previously extracted note onset representation. We then employ an overlap-add technique that accumulates all these kernels over time to obtain a single function revealing the predominant local pulse (PLP). Our concept introduces a high degree of robustness to noise and distortions resulting from weak and blurry onsets. Furthermore, the resulting PLP curve reveals the local pulse information even in the presence of continuous tempo changes and indicates a kind of confidence in the periodicity estimation. As a further contribution, we show how our PLP concept can be used as a flexible tool for enhancing tempo estimation and beat tracking. The practical relevance of our approach is demonstrated by extensive experiments based on music recordings of various genres.
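The two steps named in the abstract (one sinusoidal kernel per time position, then overlap-add accumulation) can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation. The function names `hann` and `plp_curve`, the Hann window, the tempo grid, the choice of the kernel by the largest windowed Fourier coefficient, and the final half-wave rectification are all assumptions made for this illustration. The input is a synthetic onset curve, not data from the paper.

```python
import cmath
import math


def hann(length):
    """Symmetric Hann window (assumed window shape for this sketch)."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * i / (length - 1))
            for i in range(length)]


def plp_curve(novelty, feature_rate, tempi_bpm, window_len):
    """Hypothetical PLP sketch: one windowed sinusoid per frame, overlap-added."""
    half = window_len // 2
    window = hann(window_len)
    n_frames = len(novelty)
    accum = [0.0] * n_frames
    for t in range(n_frames):
        # Pick the tempo whose windowed Fourier coefficient is largest in magnitude.
        best_coef, best_freq = 0j, 0.0
        for bpm in tempi_bpm:
            freq = bpm / 60.0 / feature_rate  # cycles per frame
            coef = 0j
            for k in range(window_len):
                n = t - half + k
                if 0 <= n < n_frames:
                    coef += novelty[n] * window[k] * cmath.exp(-2j * math.pi * freq * n)
            if abs(coef) > abs(best_coef):
                best_coef, best_freq = coef, freq
        phase = cmath.phase(best_coef)
        # Overlap-add the selected kernel, centred at frame t.
        for k in range(window_len):
            n = t - half + k
            if 0 <= n < n_frames:
                accum[n] += window[k] * math.cos(2.0 * math.pi * best_freq * n + phase)
    # Half-wave rectification (an assumption of this sketch): keep the positive part.
    return [max(0.0, v) for v in accum]


# Synthetic onset curve: an impulse every 5 frames at 10 frames per second (120 BPM).
novelty = [1.0 if n % 5 == 0 else 0.0 for n in range(100)]
plp = plp_curve(novelty, 10.0, list(range(60, 181, 5)), 41)
```

On this synthetic input the curve should peak at the impulse positions away from the boundaries. Real recordings would first need an onset (novelty) representation, which this sketch does not compute.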
Inist format (server)
NO : | PASCAL 11-0363501 INIST |
---|---|
ET : | Extracting Predominant Local Pulse Information From Music Recordings |
AU : | GROSCHE (Peter); MÜLLER (Meinard) |
AF : | Saarland University and the Max-Planck Institut für Informatik/66123 Saarbrücken/Germany (1 aut., 2 aut.) |
DT : | Serial publication; Analytic level |
SO : | IEEE transactions on audio, speech, and language processing; ISSN 1558-7916; United States; Da. 2011; Vol. 19; No. 6; Pp. 1688-1701; Bibl. 45 ref. |
LA : | English |
EA : | The extraction of tempo and beat information from music recordings constitutes a challenging task in particular for non-percussive music with soft note onsets and time-varying tempo. In this paper, we introduce a novel mid-level representation that captures musically meaningful local pulse information even for the case of complex music. Our main idea is to derive for each time position a sinusoidal kernel that best explains the local periodic nature of a previously extracted note onset representation. Then we employ an overlap-add technique accumulating all these kernels over time to obtain a single function that reveals the predominant local pulse (PLP). Our concept introduces a high degree of robustness to noise and distortions resulting from weak and blurry onsets. Furthermore, the resulting PLP curve reveals the local pulse information even in the presence of continuous tempo changes and indicates a kind of confidence in the periodicity estimation. As further contribution, we show how our PLP concept can be used as a flexible tool for enhancing tempo estimation and beat tracking. The practical relevance of our approach is demonstrated by extensive experiments based on music recordings of various genres. |
CC : | 001D04A05D; 001D04A04A2 |
FD : | Rythme; Temps établissement; Variation temporelle; Méthode noyau; Prédiction linéaire; Immunité bruit; Poursuite cible; Son musical; Traitement signal audio; Traitement signal; Traitement signal acoustique |
ED : | Rhythm; Onset time; Time variation; Kernel method; Linear prediction; Noise immunity; Target tracking; Musical sound; Audio signal processing; Signal processing; Acoustic signal processing |
SD : | Ritmo; Tiempo establecimiento; Variación temporal; Método núcleo; Predicción lineal; Inmunidad ruido; Sonido musical; Procesamiento señal |
LO : | INIST-26266.354000191110700190 |
ID : | 11-0363501 |
Links to Exploration step
Pascal:11-0363501
The document in XML format
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" level="a">Extracting Predominant Local Pulse Information From Music Recordings</title>
<author><name sortKey="Grosche, Peter" sort="Grosche, Peter" uniqKey="Grosche P" first="Peter" last="Grosche">Peter Grosche</name>
<affiliation><inist:fA14 i1="01"><s1>Saarland University and the Max-Planck Institut für Informatik</s1>
<s2>66123 Saarbrücken</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Miiller, Meinard" sort="Miiller, Meinard" uniqKey="Miiller M" first="Meinard" last="Miiller">Meinard Müller</name>
<affiliation><inist:fA14 i1="01"><s1>Saarland University and the Max-Planck Institut für Informatik</s1>
<s2>66123 Saarbrücken</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">INIST</idno>
<idno type="inist">11-0363501</idno>
<date when="2011">2011</date>
<idno type="stanalyst">PASCAL 11-0363501 INIST</idno>
<idno type="RBID">Pascal:11-0363501</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000007</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a">Extracting Predominant Local Pulse Information From Music Recordings</title>
<author><name sortKey="Grosche, Peter" sort="Grosche, Peter" uniqKey="Grosche P" first="Peter" last="Grosche">Peter Grosche</name>
<affiliation><inist:fA14 i1="01"><s1>Saarland University and the Max-Planck Institut für Informatik</s1>
<s2>66123 Saarbrücken</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Miiller, Meinard" sort="Miiller, Meinard" uniqKey="Miiller M" first="Meinard" last="Miiller">Meinard Müller</name>
<affiliation><inist:fA14 i1="01"><s1>Saarland University and the Max-Planck Institut für Informatik</s1>
<s2>66123 Saarbrücken</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
</analytic>
<series><title level="j" type="main">IEEE transactions on audio, speech, and language processing</title>
<title level="j" type="abbreviated">IEEE trans. audio speech lang. process.</title>
<idno type="ISSN">1558-7916</idno>
<imprint><date when="2011">2011</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><title level="j" type="main">IEEE transactions on audio, speech, and language processing</title>
<title level="j" type="abbreviated">IEEE trans. audio speech lang. process.</title>
<idno type="ISSN">1558-7916</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Acoustic signal processing</term>
<term>Audio signal processing</term>
<term>Kernel method</term>
<term>Linear prediction</term>
<term>Musical sound</term>
<term>Noise immunity</term>
<term>Onset time</term>
<term>Rhythm</term>
<term>Signal processing</term>
<term>Target tracking</term>
<term>Time variation</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Rythme</term>
<term>Temps établissement</term>
<term>Variation temporelle</term>
<term>Méthode noyau</term>
<term>Prédiction linéaire</term>
<term>Immunité bruit</term>
<term>Poursuite cible</term>
<term>Son musical</term>
<term>Traitement signal audio</term>
<term>Traitement signal</term>
<term>Traitement signal acoustique</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">The extraction of tempo and beat information from music recordings constitutes a challenging task in particular for non-percussive music with soft note onsets and time-varying tempo. In this paper, we introduce a novel mid-level representation that captures musically meaningful local pulse information even for the case of complex music. Our main idea is to derive for each time position a sinusoidal kernel that best explains the local periodic nature of a previously extracted note onset representation. Then we employ an overlap-add technique accumulating all these kernels over time to obtain a single function that reveals the predominant local pulse (PLP). Our concept introduces a high degree of robustness to noise and distortions resulting from weak and blurry onsets. Furthermore, the resulting PLP curve reveals the local pulse information even in the presence of continuous tempo changes and indicates a kind of confidence in the periodicity estimation. As further contribution, we show how our PLP concept can be used as a flexible tool for enhancing tempo estimation and beat tracking. The practical relevance of our approach is demonstrated by extensive experiments based on music recordings of various genres.</div>
</front>
</TEI>
<inist><standard h6="B"><pA><fA01 i1="01" i2="1"><s0>1558-7916</s0>
</fA01>
<fA03 i2="1"><s0>IEEE trans. audio speech lang. process.</s0>
</fA03>
<fA05><s2>19</s2>
</fA05>
<fA06><s2>6</s2>
</fA06>
<fA08 i1="01" i2="1" l="ENG"><s1>Extracting Predominant Local Pulse Information From Music Recordings</s1>
</fA08>
<fA11 i1="01" i2="1"><s1>GROSCHE (Peter)</s1>
</fA11>
<fA11 i1="02" i2="1"><s1>MÜLLER (Meinard)</s1>
</fA11>
<fA14 i1="01"><s1>Saarland University and the Max-Planck Institut für Informatik</s1>
<s2>66123 Saarbrücken</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</fA14>
<fA20><s1>1688-1701</s1>
</fA20>
<fA21><s1>2011</s1>
</fA21>
<fA23 i1="01"><s0>ENG</s0>
</fA23>
<fA43 i1="01"><s1>INIST</s1>
<s2>26266</s2>
<s5>354000191110700190</s5>
</fA43>
<fA44><s0>0000</s0>
<s1>© 2011 INIST-CNRS. All rights reserved.</s1>
</fA44>
<fA45><s0>45 ref.</s0>
</fA45>
<fA47 i1="01" i2="1"><s0>11-0363501</s0>
</fA47>
<fA60><s1>P</s1>
</fA60>
<fA61><s0>A</s0>
</fA61>
<fA64 i1="01" i2="1"><s0>IEEE transactions on audio, speech, and language processing</s0>
</fA64>
<fA66 i1="01"><s0>USA</s0>
</fA66>
<fC01 i1="01" l="ENG"><s0>The extraction of tempo and beat information from music recordings constitutes a challenging task in particular for non-percussive music with soft note onsets and time-varying tempo. In this paper, we introduce a novel mid-level representation that captures musically meaningful local pulse information even for the case of complex music. Our main idea is to derive for each time position a sinusoidal kernel that best explains the local periodic nature of a previously extracted note onset representation. Then we employ an overlap-add technique accumulating all these kernels over time to obtain a single function that reveals the predominant local pulse (PLP). Our concept introduces a high degree of robustness to noise and distortions resulting from weak and blurry onsets. Furthermore, the resulting PLP curve reveals the local pulse information even in the presence of continuous tempo changes and indicates a kind of confidence in the periodicity estimation. As further contribution, we show how our PLP concept can be used as a flexible tool for enhancing tempo estimation and beat tracking. The practical relevance of our approach is demonstrated by extensive experiments based on music recordings of various genres.</s0>
</fC01>
<fC02 i1="01" i2="X"><s0>001D04A05D</s0>
</fC02>
<fC02 i1="02" i2="X"><s0>001D04A04A2</s0>
</fC02>
<fC03 i1="01" i2="X" l="FRE"><s0>Rythme</s0>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="X" l="ENG"><s0>Rhythm</s0>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="X" l="SPA"><s0>Ritmo</s0>
<s5>01</s5>
</fC03>
<fC03 i1="02" i2="X" l="FRE"><s0>Temps établissement</s0>
<s5>02</s5>
</fC03>
<fC03 i1="02" i2="X" l="ENG"><s0>Onset time</s0>
<s5>02</s5>
</fC03>
<fC03 i1="02" i2="X" l="SPA"><s0>Tiempo establecimiento</s0>
<s5>02</s5>
</fC03>
<fC03 i1="03" i2="X" l="FRE"><s0>Variation temporelle</s0>
<s5>03</s5>
</fC03>
<fC03 i1="03" i2="X" l="ENG"><s0>Time variation</s0>
<s5>03</s5>
</fC03>
<fC03 i1="03" i2="X" l="SPA"><s0>Variación temporal</s0>
<s5>03</s5>
</fC03>
<fC03 i1="04" i2="X" l="FRE"><s0>Méthode noyau</s0>
<s5>04</s5>
</fC03>
<fC03 i1="04" i2="X" l="ENG"><s0>Kernel method</s0>
<s5>04</s5>
</fC03>
<fC03 i1="04" i2="X" l="SPA"><s0>Método núcleo</s0>
<s5>04</s5>
</fC03>
<fC03 i1="05" i2="X" l="FRE"><s0>Prédiction linéaire</s0>
<s5>05</s5>
</fC03>
<fC03 i1="05" i2="X" l="ENG"><s0>Linear prediction</s0>
<s5>05</s5>
</fC03>
<fC03 i1="05" i2="X" l="SPA"><s0>Predicción lineal</s0>
<s5>05</s5>
</fC03>
<fC03 i1="06" i2="X" l="FRE"><s0>Immunité bruit</s0>
<s5>06</s5>
</fC03>
<fC03 i1="06" i2="X" l="ENG"><s0>Noise immunity</s0>
<s5>06</s5>
</fC03>
<fC03 i1="06" i2="X" l="SPA"><s0>Inmunidad ruido</s0>
<s5>06</s5>
</fC03>
<fC03 i1="07" i2="3" l="FRE"><s0>Poursuite cible</s0>
<s5>07</s5>
</fC03>
<fC03 i1="07" i2="3" l="ENG"><s0>Target tracking</s0>
<s5>07</s5>
</fC03>
<fC03 i1="08" i2="X" l="FRE"><s0>Son musical</s0>
<s5>08</s5>
</fC03>
<fC03 i1="08" i2="X" l="ENG"><s0>Musical sound</s0>
<s5>08</s5>
</fC03>
<fC03 i1="08" i2="X" l="SPA"><s0>Sonido musical</s0>
<s5>08</s5>
</fC03>
<fC03 i1="09" i2="3" l="FRE"><s0>Traitement signal audio</s0>
<s5>09</s5>
</fC03>
<fC03 i1="09" i2="3" l="ENG"><s0>Audio signal processing</s0>
<s5>09</s5>
</fC03>
<fC03 i1="10" i2="X" l="FRE"><s0>Traitement signal</s0>
<s5>10</s5>
</fC03>
<fC03 i1="10" i2="X" l="ENG"><s0>Signal processing</s0>
<s5>10</s5>
</fC03>
<fC03 i1="10" i2="X" l="SPA"><s0>Procesamiento señal</s0>
<s5>10</s5>
</fC03>
<fC03 i1="11" i2="3" l="FRE"><s0>Traitement signal acoustique</s0>
<s5>46</s5>
</fC03>
<fC03 i1="11" i2="3" l="ENG"><s0>Acoustic signal processing</s0>
<s5>46</s5>
</fC03>
<fN21><s1>249</s1>
</fN21>
<fN44 i1="01"><s1>OTO</s1>
</fN44>
<fN82><s1>OTO</s1>
</fN82>
</pA>
</standard>
<server><NO>PASCAL 11-0363501 INIST</NO>
<ET>Extracting Predominant Local Pulse Information From Music Recordings</ET>
<AU>GROSCHE (Peter); MÜLLER (Meinard)</AU>
<AF>Saarland University and the Max-Planck Institut für Informatik/66123 Saarbrücken/Germany (1 aut., 2 aut.)</AF>
<DT>Serial publication; Analytic level</DT>
<SO>IEEE transactions on audio, speech, and language processing; ISSN 1558-7916; United States; Da. 2011; Vol. 19; No. 6; Pp. 1688-1701; Bibl. 45 ref.</SO>
<LA>English</LA>
<EA>The extraction of tempo and beat information from music recordings constitutes a challenging task in particular for non-percussive music with soft note onsets and time-varying tempo. In this paper, we introduce a novel mid-level representation that captures musically meaningful local pulse information even for the case of complex music. Our main idea is to derive for each time position a sinusoidal kernel that best explains the local periodic nature of a previously extracted note onset representation. Then we employ an overlap-add technique accumulating all these kernels over time to obtain a single function that reveals the predominant local pulse (PLP). Our concept introduces a high degree of robustness to noise and distortions resulting from weak and blurry onsets. Furthermore, the resulting PLP curve reveals the local pulse information even in the presence of continuous tempo changes and indicates a kind of confidence in the periodicity estimation. As further contribution, we show how our PLP concept can be used as a flexible tool for enhancing tempo estimation and beat tracking. The practical relevance of our approach is demonstrated by extensive experiments based on music recordings of various genres.</EA>
<CC>001D04A05D; 001D04A04A2</CC>
<FD>Rythme; Temps établissement; Variation temporelle; Méthode noyau; Prédiction linéaire; Immunité bruit; Poursuite cible; Son musical; Traitement signal audio; Traitement signal; Traitement signal acoustique</FD>
<ED>Rhythm; Onset time; Time variation; Kernel method; Linear prediction; Noise immunity; Target tracking; Musical sound; Audio signal processing; Signal processing; Acoustic signal processing</ED>
<SD>Ritmo; Tiempo establecimiento; Variación temporal; Método núcleo; Predicción lineal; Inmunidad ruido; Sonido musical; Procesamiento señal</SD>
<LO>INIST-26266.354000191110700190</LO>
<ID>11-0363501</ID>
</server>
</inist>
</record>
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Wicri/Sarre/explor/MusicSarreV3/Data/PascalFrancis/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000007 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Corpus/biblio.hfd -nk 000007 | SxmlIndent | more
To link to this page within the Wicri network
{{Explor lien |wiki= Wicri/Sarre |area= MusicSarreV3 |flux= PascalFrancis |étape= Corpus |type= RBID |clé= Pascal:11-0363501 |texte= Extracting Predominant Local Pulse Information From Music Recordings }}
This area was generated with Dilib version V0.6.33.