Aligning music audio with symbolic scores using a hybrid graphical model
Identifieur interne : 000421 ( Istex/Checkpoint ); précédent : 000420; suivant : 000422Aligning music audio with symbolic scores using a hybrid graphical model
Auteurs : Christopher Raphael [États-Unis]Source :
- Machine Learning [ 0885-6125 ] ; 2006-12-01.
Descripteurs français
- Wicri :
- topic : Musique.
English descriptors
- KwdEn :
Abstract
Abstract: We present a new method for establishing an alignment between a polyphonic musical score and a corresponding sampled audio performance. The method uses a graphical model containing both latent discrete variables, corresponding to score position, as well as a latent continuous tempo process. We use a simple data model based only on the pitch content of the audio signal. The data interpretation is defined to be the most likely configuration of the hidden variables, given the data, and we develop computational methodology to identify or approximate this configuration using a variant of dynamic programming involving parametrically represented continuous variables. Experiments are presented on a 55-minute hand-marked orchestral test set.
Url:
DOI: 10.1007/s10994-006-8415-3
Affiliations:
Links toward previous steps (curation, corpus...)
Links to Exploration step
ISTEX:698850BDEB0B2344798C55B050A2C90ACF55CFEALe document en format XML
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title xml:lang="en">Aligning music audio with symbolic scores using a hybrid graphical model</title>
<author><name sortKey="Raphael, Christopher" sort="Raphael, Christopher" uniqKey="Raphael C" first="Christopher" last="Raphael">Christopher Raphael</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:698850BDEB0B2344798C55B050A2C90ACF55CFEA</idno>
<date when="2006" year="2006">2006</date>
<idno type="doi">10.1007/s10994-006-8415-3</idno>
<idno type="url">https://api.istex.fr/document/698850BDEB0B2344798C55B050A2C90ACF55CFEA/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000604</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Corpus" wicri:corpus="ISTEX">000604</idno>
<idno type="wicri:Area/Istex/Curation">000604</idno>
<idno type="wicri:Area/Istex/Checkpoint">000421</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">000421</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">Aligning music audio with symbolic scores using a hybrid graphical model</title>
<author><name sortKey="Raphael, Christopher" sort="Raphael, Christopher" uniqKey="Raphael C" first="Christopher" last="Raphael">Christopher Raphael</name>
<affiliation></affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">États-Unis</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="j">Machine Learning</title>
<title level="j" type="abbrev">Mach Learn</title>
<idno type="ISSN">0885-6125</idno>
<idno type="eISSN">1573-0565</idno>
<imprint><publisher>Kluwer Academic Publishers</publisher>
<pubPlace>Boston</pubPlace>
<date type="published" when="2006-12-01">2006-12-01</date>
<biblScope unit="volume">65</biblScope>
<biblScope unit="issue">2-3</biblScope>
<biblScope unit="page" from="389">389</biblScope>
<biblScope unit="page" to="409">409</biblScope>
</imprint>
<idno type="ISSN">0885-6125</idno>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0885-6125</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Graphical models</term>
<term>Music</term>
<term>Score following</term>
<term>Score matching</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr"><term>Musique</term>
</keywords>
</textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Abstract: We present a new method for establishing an alignment between a polyphonic musical score and a corresponding sampled audio performance. The method uses a graphical model containing both latent discrete variables, corresponding to score position, as well as a latent continuous tempo process. We use a simple data model based only on the pitch content of the audio signal. The data interpretation is defined to be the most likely configuration of the hidden variables, given the data, and we develop computational methodology to identify or approximate this configuration using a variant of dynamic programming involving parametrically represented continuous variables. Experiments are presented on a 55-minute hand-marked orchestral test set.</div>
</front>
</TEI>
<affiliations><list><country><li>États-Unis</li>
</country>
</list>
<tree><country name="États-Unis"><noRegion><name sortKey="Raphael, Christopher" sort="Raphael, Christopher" uniqKey="Raphael C" first="Christopher" last="Raphael">Christopher Raphael</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Wicri/Musique/explor/DebussyV1/Data/Istex/Checkpoint
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000421 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Istex/Checkpoint/biblio.hfd -nk 000421 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Wicri/Musique |area= DebussyV1 |flux= Istex |étape= Checkpoint |type= RBID |clé= ISTEX:698850BDEB0B2344798C55B050A2C90ACF55CFEA |texte= Aligning music audio with symbolic scores using a hybrid graphical model }}
This area was generated with Dilib version V0.6.33. |