Opera exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Learning to Tag from Open Vocabulary Labels

Internal identifier: 000765 (Istex/Corpus); previous: 000764; next: 000766

Authors: Edith Law; Burr Settles; Tom Mitchell

RBID : ISTEX:0A645893CDD5762C7749FF9B0CDC78E44DBDB935

Abstract

Most approaches to classifying media content assume a fixed, closed vocabulary of labels. In contrast, we advocate machine learning approaches which take advantage of the millions of free-form tags obtainable via online crowd-sourcing platforms and social tagging websites. The use of such open vocabularies presents learning challenges due to typographical errors, synonymy, and a potentially unbounded set of tag labels. In this work, we present a new approach that organizes these noisy tags into well-behaved semantic classes using topic modeling, and learns to predict tags accurately using a mixture of topic classes. This method can utilize an arbitrary open vocabulary of tags, reduces training time by 94% compared to learning from these tags directly, and achieves comparable performance for classification and superior performance for retrieval. We also demonstrate that on open vocabulary tasks, human evaluations are essential for measuring the true performance of tag classifiers, which traditional evaluation methods will consistently underestimate. We focus on the domain of tagging music clips, and demonstrate our results using data collected with a human computation game called TagATune.
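
The prediction step mentioned in the abstract, scoring tags through a mixture of topic classes, can be illustrated with a minimal sketch. This is one plausible reading of the idea and not the authors' implementation. It assumes a topic model has already produced per-topic tag distributions p(tag | topic) and that some classifier has produced a topic mixture p(topic | clip) for a clip. The names `TOPIC_TAGS`, `rank_tags` and `clip_mixture`, and all the numbers, are hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical p(tag | topic) distributions. In practice these would come
# from a topic model fitted on the noisy tags; the values here are invented.
TOPIC_TAGS: Dict[str, Dict[str, float]] = {
    "classical": {"violin": 0.5, "strings": 0.3, "violins": 0.2},
    "electronic": {"synth": 0.6, "techno": 0.3, "beat": 0.1},
}


def rank_tags(
    topic_mixture: Dict[str, float],
    topic_tags: Dict[str, Dict[str, float]],
) -> List[Tuple[str, float]]:
    """Score each tag as sum over topics of p(topic | clip) * p(tag | topic)."""
    scores: Dict[str, float] = {}
    for topic, weight in topic_mixture.items():
        for tag, prob in topic_tags[topic].items():
            scores[tag] = scores.get(tag, 0.0) + weight * prob
    # Highest score first; ties are broken alphabetically for determinism.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


# Hypothetical predicted topic mixture p(topic | clip) for one music clip.
clip_mixture = {"classical": 0.8, "electronic": 0.2}
ranked = rank_tags(clip_mixture, TOPIC_TAGS)

for tag, score in ranked:
    print(f"{tag}: {score:.2f}")
```

Because near-synonyms and misspellings such as "violin" and "violins" tend to fall into the same topic, a classifier in this setup only has to predict a handful of topic weights rather than one output per raw tag, which is consistent with the reduced training time the abstract reports.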

DOI: 10.1007/978-3-642-15883-4_14

Links to Exploration step

ISTEX:0A645893CDD5762C7749FF9B0CDC78E44DBDB935

The document in XML format

<record>
<TEI wicri:istexFullTextTei="biblStruct:series">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Learning to Tag from Open Vocabulary Labels</title>
<author>
<name sortKey="Law, Edith" sort="Law, Edith" uniqKey="Law E" first="Edith" last="Law">Edith Law</name>
<affiliation>
<mods:affiliation>Machine Learning Department, Carnegie Mellon University</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: elaw@cs.cmu.edu</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Settles, Burr" sort="Settles, Burr" uniqKey="Settles B" first="Burr" last="Settles">Burr Settles</name>
<affiliation>
<mods:affiliation>Machine Learning Department, Carnegie Mellon University</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: bsettles@cs.cmu.edu</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Mitchell, Tom" sort="Mitchell, Tom" uniqKey="Mitchell T" first="Tom" last="Mitchell">Tom Mitchell</name>
<affiliation>
<mods:affiliation>Machine Learning Department, Carnegie Mellon University</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: tom.mitchell@cs.cmu.edu</mods:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:0A645893CDD5762C7749FF9B0CDC78E44DBDB935</idno>
<date when="2010" year="2010">2010</date>
<idno type="doi">10.1007/978-3-642-15883-4_14</idno>
<idno type="url">https://api.istex.fr/document/0A645893CDD5762C7749FF9B0CDC78E44DBDB935/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000765</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Learning to Tag from Open Vocabulary Labels</title>
<author>
<name sortKey="Law, Edith" sort="Law, Edith" uniqKey="Law E" first="Edith" last="Law">Edith Law</name>
<affiliation>
<mods:affiliation>Machine Learning Department, Carnegie Mellon University</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: elaw@cs.cmu.edu</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Settles, Burr" sort="Settles, Burr" uniqKey="Settles B" first="Burr" last="Settles">Burr Settles</name>
<affiliation>
<mods:affiliation>Machine Learning Department, Carnegie Mellon University</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: bsettles@cs.cmu.edu</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Mitchell, Tom" sort="Mitchell, Tom" uniqKey="Mitchell T" first="Tom" last="Mitchell">Tom Mitchell</name>
<affiliation>
<mods:affiliation>Machine Learning Department, Carnegie Mellon University</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: tom.mitchell@cs.cmu.edu</mods:affiliation>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2010</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
</series>
<idno type="istex">0A645893CDD5762C7749FF9B0CDC78E44DBDB935</idno>
<idno type="DOI">10.1007/978-3-642-15883-4_14</idno>
<idno type="ChapterID">Chap14</idno>
<idno type="ChapterID">14</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: Most approaches to classifying media content assume a fixed, closed vocabulary of labels. In contrast, we advocate machine learning approaches which take advantage of the millions of free-form tags obtainable via online crowd-sourcing platforms and social tagging websites. The use of such open vocabularies presents learning challenges due to typographical errors, synonymy, and a potentially unbounded set of tag labels. In this work, we present a new approach that organizes these noisy tags into well-behaved semantic classes using topic modeling, and learn to predict tags accurately using a mixture of topic classes. This method can utilize an arbitrary open vocabulary of tags, reduces training time by 94% compared to learning from these tags directly, and achieves comparable performance for classification and superior performance for retrieval. We also demonstrate that on open vocabulary tasks, human evaluations are essential for measuring the true performance of tag classifiers, which traditional evaluation methods will consistently underestimate. We focus on the domain of tagging music clips, and demonstrate our results using data collected with a human computation game called TagATune.</div>
</front>
</TEI>
<istex>
<corpusName>springer</corpusName>
<author>
<json:item>
<name>Edith Law</name>
<affiliations>
<json:string>Machine Learning Department, Carnegie Mellon University</json:string>
<json:string>E-mail: elaw@cs.cmu.edu</json:string>
</affiliations>
</json:item>
<json:item>
<name>Burr Settles</name>
<affiliations>
<json:string>Machine Learning Department, Carnegie Mellon University</json:string>
<json:string>E-mail: bsettles@cs.cmu.edu</json:string>
</affiliations>
</json:item>
<json:item>
<name>Tom Mitchell</name>
<affiliations>
<json:string>Machine Learning Department, Carnegie Mellon University</json:string>
<json:string>E-mail: tom.mitchell@cs.cmu.edu</json:string>
</affiliations>
</json:item>
</author>
<language>
<json:string>eng</json:string>
</language>
<abstract>Abstract: Most approaches to classifying media content assume a fixed, closed vocabulary of labels. In contrast, we advocate machine learning approaches which take advantage of the millions of free-form tags obtainable via online crowd-sourcing platforms and social tagging websites. The use of such open vocabularies presents learning challenges due to typographical errors, synonymy, and a potentially unbounded set of tag labels. In this work, we present a new approach that organizes these noisy tags into well-behaved semantic classes using topic modeling, and learn to predict tags accurately using a mixture of topic classes. This method can utilize an arbitrary open vocabulary of tags, reduces training time by 94% compared to learning from these tags directly, and achieves comparable performance for classification and superior performance for retrieval. We also demonstrate that on open vocabulary tasks, human evaluations are essential for measuring the true performance of tag classifiers, which traditional evaluation methods will consistently underestimate. We focus on the domain of tagging music clips, and demonstrate our results using data collected with a human computation game called TagATune.</abstract>
<qualityIndicators>
<score>8.612</score>
<pdfVersion>1.6</pdfVersion>
<pdfPageSize>430 x 660 pts</pdfPageSize>
<refBibsNative>false</refBibsNative>
<keywordCount>0</keywordCount>
<abstractCharCount>1216</abstractCharCount>
<pdfWordCount>6374</pdfWordCount>
<pdfCharCount>37315</pdfCharCount>
<pdfPageCount>16</pdfPageCount>
<abstractWordCount>176</abstractWordCount>
</qualityIndicators>
<title>Learning to Tag from Open Vocabulary Labels</title>
<chapterId>
<json:string>Chap14</json:string>
<json:string>14</json:string>
</chapterId>
<genre>
<json:string>conference [research-article]</json:string>
</genre>
<serie>
<editor>
<json:item>
<name>David Hutchison</name>
<affiliations>
<json:string>Lancaster University, Lancaster, UK</json:string>
</affiliations>
</json:item>
<json:item>
<name>Takeo Kanade</name>
<affiliations>
<json:string>Carnegie Mellon University, Pittsburgh, PA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Josef Kittler</name>
<affiliations>
<json:string>University of Surrey, Guildford, UK</json:string>
</affiliations>
</json:item>
<json:item>
<name>Jon M. Kleinberg</name>
<affiliations>
<json:string>Cornell University, Ithaca, NY, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Friedemann Mattern</name>
<affiliations>
<json:string>ETH Zurich, Zurich, Switzerland</json:string>
</affiliations>
</json:item>
<json:item>
<name>John C. Mitchell</name>
<affiliations>
<json:string>Stanford University, Stanford, CA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Moni Naor</name>
<affiliations>
<json:string>Weizmann Institute of Science, Rehovot, Israel</json:string>
</affiliations>
</json:item>
<json:item>
<name>Oscar Nierstrasz</name>
<affiliations>
<json:string>University of Bern, Bern, Switzerland</json:string>
</affiliations>
</json:item>
<json:item>
<name>C. Pandu Rangan</name>
<affiliations>
<json:string>Indian Institute of Technology, Madras, India</json:string>
</affiliations>
</json:item>
<json:item>
<name>Bernhard Steffen</name>
<affiliations>
<json:string>University of Dortmund, Dortmund, Germany</json:string>
</affiliations>
</json:item>
<json:item>
<name>Madhu Sudan</name>
<affiliations>
<json:string>Massachusetts Institute of Technology, MA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Demetri Terzopoulos</name>
<affiliations>
<json:string>University of California, Los Angeles, CA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Doug Tygar</name>
<affiliations>
<json:string>University of California, Berkeley, CA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Moshe Y. Vardi</name>
<affiliations>
<json:string>Rice University, Houston, TX, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Gerhard Weikum</name>
<affiliations>
<json:string>Max-Planck Institute of Computer Science, Saarbrücken, Germany</json:string>
</affiliations>
</json:item>
</editor>
<issn>
<json:string>0302-9743</json:string>
</issn>
<genre></genre>
<language>
<json:string>unknown</json:string>
</language>
<eissn>
<json:string>1611-3349</json:string>
</eissn>
<title>Lecture Notes in Computer Science</title>
<copyrightDate>2010</copyrightDate>
</serie>
<host>
<editor>
<json:item>
<name>José Luis Balcázar</name>
<affiliations>
<json:string>Departamento de Matemáticas, Estadística y Computación, Universidad de Cantabria, Avenida de los Castros, s/n, 39071, Santander, Spain</json:string>
<json:string>E-mail: joseluis.balcazar@unican.es</json:string>
</affiliations>
</json:item>
<json:item>
<name>Francesco Bonchi</name>
<affiliations>
<json:string>Yahoo! Research Barcelona, Avinguda Diagonal 177, 08018, Barcelona, Spain</json:string>
<json:string>E-mail: bonchi@yahoo-inc.corp</json:string>
</affiliations>
</json:item>
<json:item>
<name>Aristides Gionis</name>
<affiliations>
<json:string>Yahoo! Research Barcelona, Avinguda Diagonal 177, 08018, Barcelona, Spain</json:string>
<json:string>E-mail: gionis@yahoo-inc.corp</json:string>
</affiliations>
</json:item>
<json:item>
<name>Michèle Sebag</name>
<affiliations>
<json:string>TAO, CNRS-INRIA-LRI, Université Paris-Sud, 91405, Orsay, France</json:string>
<json:string>E-mail: sebag@lri.fr</json:string>
</affiliations>
</json:item>
</editor>
<subject>
<json:item>
<value>Computer Science</value>
</json:item>
<json:item>
<value>Computer Science</value>
</json:item>
<json:item>
<value>Artificial Intelligence (incl. Robotics)</value>
</json:item>
<json:item>
<value>Information Systems Applications (incl. Internet)</value>
</json:item>
<json:item>
<value>Information Storage and Retrieval</value>
</json:item>
<json:item>
<value>Database Management</value>
</json:item>
<json:item>
<value>Data Mining and Knowledge Discovery</value>
</json:item>
<json:item>
<value>Information Systems and Communication Service</value>
</json:item>
</subject>
<isbn>
<json:string>978-3-642-15882-7</json:string>
</isbn>
<language>
<json:string>unknown</json:string>
</language>
<eissn>
<json:string>1611-3349</json:string>
</eissn>
<title>Machine Learning and Knowledge Discovery in Databases</title>
<bookId>
<json:string>978-3-642-15883-4</json:string>
</bookId>
<volume>6322</volume>
<pages>
<last>226</last>
<first>211</first>
</pages>
<issn>
<json:string>0302-9743</json:string>
</issn>
<genre>
<json:string>Book Series</json:string>
</genre>
<eisbn>
<json:string>978-3-642-15883-4</json:string>
</eisbn>
<copyrightDate>2010</copyrightDate>
<doi>
<json:string>10.1007/978-3-642-15883-4</json:string>
</doi>
</host>
<publicationDate>2010</publicationDate>
<copyrightDate>2010</copyrightDate>
<doi>
<json:string>10.1007/978-3-642-15883-4_14</json:string>
</doi>
<id>0A645893CDD5762C7749FF9B0CDC78E44DBDB935</id>
<fulltext>
<json:item>
<original>true</original>
<mimetype>application/pdf</mimetype>
<extension>pdf</extension>
<uri>https://api.istex.fr/document/0A645893CDD5762C7749FF9B0CDC78E44DBDB935/fulltext/pdf</uri>
</json:item>
<json:item>
<original>false</original>
<mimetype>application/zip</mimetype>
<extension>zip</extension>
<uri>https://api.istex.fr/document/0A645893CDD5762C7749FF9B0CDC78E44DBDB935/fulltext/zip</uri>
</json:item>
<istex:fulltextTEI uri="https://api.istex.fr/document/0A645893CDD5762C7749FF9B0CDC78E44DBDB935/fulltext/tei">
<teiHeader>
<fileDesc>
<titleStmt>
<title level="a" type="main" xml:lang="en">Learning to Tag from Open Vocabulary Labels</title>
<respStmt xml:id="ISTEX-API" resp="Références bibliographiques récupérées via GROBID" name="ISTEX-API (INIST-CNRS)"></respStmt>
</titleStmt>
<publicationStmt>
<authority>ISTEX</authority>
<publisher>Springer Berlin Heidelberg</publisher>
<pubPlace>Berlin, Heidelberg</pubPlace>
<availability>
<p>SPRINGER</p>
</availability>
<date>2010</date>
</publicationStmt>
<sourceDesc>
<biblStruct type="inbook">
<analytic>
<title level="a" type="main" xml:lang="en">Learning to Tag from Open Vocabulary Labels</title>
<author>
<persName>
<forename type="first">Edith</forename>
<surname>Law</surname>
</persName>
<email>elaw@cs.cmu.edu</email>
<affiliation>Machine Learning Department, Carnegie Mellon University</affiliation>
</author>
<author>
<persName>
<forename type="first">Burr</forename>
<surname>Settles</surname>
</persName>
<email>bsettles@cs.cmu.edu</email>
<affiliation>Machine Learning Department, Carnegie Mellon University</affiliation>
</author>
<author>
<persName>
<forename type="first">Tom</forename>
<surname>Mitchell</surname>
</persName>
<email>tom.mitchell@cs.cmu.edu</email>
<affiliation>Machine Learning Department, Carnegie Mellon University</affiliation>
</author>
</analytic>
<monogr>
<title level="m">Machine Learning and Knowledge Discovery in Databases</title>
<title level="m" type="sub">European Conference, ECML PKDD 2010, Barcelona, Spain, September 20-24, 2010, Proceedings, Part II</title>
<idno type="pISBN">978-3-642-15882-7</idno>
<idno type="eISBN">978-3-642-15883-4</idno>
<idno type="pISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="DOI">10.1007/978-3-642-15883-4</idno>
<idno type="BookID">978-3-642-15883-4</idno>
<idno type="BookTitleID">213574</idno>
<idno type="BookSequenceNumber">6322</idno>
<idno type="BookVolumeNumber">6322</idno>
<idno type="BookChapterCount">32</idno>
<editor>
<persName>
<forename type="first">José</forename>
<forename type="first">Luis</forename>
<surname>Balcázar</surname>
</persName>
<email>joseluis.balcazar@unican.es</email>
<affiliation>Departamento de Matemáticas, Estadística y Computación, Universidad de Cantabria, Avenida de los Castros, s/n, 39071, Santander, Spain</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Francesco</forename>
<surname>Bonchi</surname>
</persName>
<email>bonchi@yahoo-inc.corp</email>
<affiliation>Yahoo! Research Barcelona, Avinguda Diagonal 177, 08018, Barcelona, Spain</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Aristides</forename>
<surname>Gionis</surname>
</persName>
<email>gionis@yahoo-inc.corp</email>
<affiliation>Yahoo! Research Barcelona, Avinguda Diagonal 177, 08018, Barcelona, Spain</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Michèle</forename>
<surname>Sebag</surname>
</persName>
<email>sebag@lri.fr</email>
<affiliation>TAO, CNRS-INRIA-LRI, Université Paris-Sud, 91405, Orsay, France</affiliation>
</editor>
<imprint>
<publisher>Springer Berlin Heidelberg</publisher>
<pubPlace>Berlin, Heidelberg</pubPlace>
<date type="published" when="2010"></date>
<biblScope unit="volume">6322</biblScope>
<biblScope unit="page" from="211">211</biblScope>
<biblScope unit="page" to="226">226</biblScope>
</imprint>
</monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<editor>
<persName>
<forename type="first">David</forename>
<surname>Hutchison</surname>
</persName>
<affiliation>Lancaster University, Lancaster, UK</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Takeo</forename>
<surname>Kanade</surname>
</persName>
<affiliation>Carnegie Mellon University, Pittsburgh, PA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Josef</forename>
<surname>Kittler</surname>
</persName>
<affiliation>University of Surrey, Guildford, UK</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Jon</forename>
<forename type="first">M.</forename>
<surname>Kleinberg</surname>
</persName>
<affiliation>Cornell University, Ithaca, NY, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Friedemann</forename>
<surname>Mattern</surname>
</persName>
<affiliation>ETH Zurich, Zurich, Switzerland</affiliation>
</editor>
<editor>
<persName>
<forename type="first">John</forename>
<forename type="first">C.</forename>
<surname>Mitchell</surname>
</persName>
<affiliation>Stanford University, Stanford, CA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Moni</forename>
<surname>Naor</surname>
</persName>
<affiliation>Weizmann Institute of Science, Rehovot, Israel</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Oscar</forename>
<surname>Nierstrasz</surname>
</persName>
<affiliation>University of Bern, Bern, Switzerland</affiliation>
</editor>
<editor>
<persName>
<forename type="first">C.</forename>
<surname>Pandu Rangan</surname>
</persName>
<affiliation>Indian Institute of Technology, Madras, India</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Bernhard</forename>
<surname>Steffen</surname>
</persName>
<affiliation>University of Dortmund, Dortmund, Germany</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Madhu</forename>
<surname>Sudan</surname>
</persName>
<affiliation>Massachusetts Institute of Technology, MA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Demetri</forename>
<surname>Terzopoulos</surname>
</persName>
<affiliation>University of California, Los Angeles, CA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Doug</forename>
<surname>Tygar</surname>
</persName>
<affiliation>University of California, Berkeley, CA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Moshe</forename>
<forename type="first">Y.</forename>
<surname>Vardi</surname>
</persName>
<affiliation>Rice University, Houston, TX, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Gerhard</forename>
<surname>Weikum</surname>
</persName>
<affiliation>Max-Planck Institute of Computer Science, Saarbrücken, Germany</affiliation>
</editor>
<biblScope>
<date>2010</date>
</biblScope>
<idno type="pISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="seriesId">558</idno>
</series>
<series>
<title level="s">Lecture Notes in Artificial Intelligence</title>
<editor>
<persName>
<forename type="first">David</forename>
<surname>Hutchison</surname>
</persName>
<affiliation>Lancaster University, Lancaster, UK</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Takeo</forename>
<surname>Kanade</surname>
</persName>
<affiliation>Carnegie Mellon University, Pittsburgh, PA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Josef</forename>
<surname>Kittler</surname>
</persName>
<affiliation>University of Surrey, Guildford, UK</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Jon</forename>
<forename type="first">M.</forename>
<surname>Kleinberg</surname>
</persName>
<affiliation>Cornell University, Ithaca, NY, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Friedemann</forename>
<surname>Mattern</surname>
</persName>
<affiliation>ETH Zurich, Zurich, Switzerland</affiliation>
</editor>
<editor>
<persName>
<forename type="first">John</forename>
<forename type="first">C.</forename>
<surname>Mitchell</surname>
</persName>
<affiliation>Stanford University, Stanford, CA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Moni</forename>
<surname>Naor</surname>
</persName>
<affiliation>Weizmann Institute of Science, Rehovot, Israel</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Oscar</forename>
<surname>Nierstrasz</surname>
</persName>
<affiliation>University of Bern, Bern, Switzerland</affiliation>
</editor>
<editor>
<persName>
<forename type="first">C.</forename>
<surname>Pandu Rangan</surname>
</persName>
<affiliation>Indian Institute of Technology, Madras, India</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Bernhard</forename>
<surname>Steffen</surname>
</persName>
<affiliation>University of Dortmund, Dortmund, Germany</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Madhu</forename>
<surname>Sudan</surname>
</persName>
<affiliation>Massachusetts Institute of Technology, MA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Demetri</forename>
<surname>Terzopoulos</surname>
</persName>
<affiliation>University of California, Los Angeles, CA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Doug</forename>
<surname>Tygar</surname>
</persName>
<affiliation>University of California, Berkeley, CA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Moshe</forename>
<forename type="first">Y.</forename>
<surname>Vardi</surname>
</persName>
<affiliation>Rice University, Houston, TX, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Gerhard</forename>
<surname>Weikum</surname>
</persName>
<affiliation>Max-Planck Institute of Computer Science, Saarbrücken, Germany</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Randy</forename>
<surname>Goebel</surname>
</persName>
<affiliation>University of Alberta, Edmonton, Canada</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Jörg</forename>
<surname>Siekmann</surname>
</persName>
<affiliation>University of Saarland, Saarbrücken, Germany</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Wolfgang</forename>
<surname>Wahlster</surname>
</persName>
<affiliation>DFKI and University of Saarland, Saarbrücken, Germany</affiliation>
</editor>
<editor>
<persName>
<forename type="first">José</forename>
<forename type="first">Luis</forename>
<surname>Balcázar</surname>
</persName>
<email>joseluis.balcazar@unican.es</email>
<affiliation>Departamento de Matemáticas, Estadística y Computación, Universidad de Cantabria, Avenida de los Castros, s/n, 39071, Santander, Spain</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Francesco</forename>
<surname>Bonchi</surname>
</persName>
<email>bonchi@yahoo-inc.corp</email>
<affiliation>Yahoo! Research Barcelona, Avinguda Diagonal 177, 08018, Barcelona, Spain</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Aristides</forename>
<surname>Gionis</surname>
</persName>
<email>gionis@yahoo-inc.corp</email>
<affiliation>Yahoo! Research Barcelona, Avinguda Diagonal 177, 08018, Barcelona, Spain</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Michèle</forename>
<surname>Sebag</surname>
</persName>
<email>sebag@lri.fr</email>
<affiliation>TAO, CNRS-INRIA-LRI, Université Paris-Sud, 91405, Orsay, France</affiliation>
</editor>
<idno type="pISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<biblScope type="seriesId">1244</biblScope>
</series>
<idno type="istex">0A645893CDD5762C7749FF9B0CDC78E44DBDB935</idno>
<idno type="DOI">10.1007/978-3-642-15883-4_14</idno>
<idno type="ChapterID">Chap14</idno>
<idno type="ChapterID">14</idno>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<creation>
<date>2010</date>
</creation>
<langUsage>
<language ident="en">en</language>
</langUsage>
<abstract xml:lang="en">
<p>Abstract: Most approaches to classifying media content assume a fixed, closed vocabulary of labels. In contrast, we advocate machine learning approaches which take advantage of the millions of free-form tags obtainable via online crowd-sourcing platforms and social tagging websites. The use of such open vocabularies presents learning challenges due to typographical errors, synonymy, and a potentially unbounded set of tag labels. In this work, we present a new approach that organizes these noisy tags into well-behaved semantic classes using topic modeling, and learn to predict tags accurately using a mixture of topic classes. This method can utilize an arbitrary open vocabulary of tags, reduces training time by 94% compared to learning from these tags directly, and achieves comparable performance for classification and superior performance for retrieval. We also demonstrate that on open vocabulary tasks, human evaluations are essential for measuring the true performance of tag classifiers, which traditional evaluation methods will consistently underestimate. We focus on the domain of tagging music clips, and demonstrate our results using data collected with a human computation game called TagATune.</p>
</abstract>
<textClass>
<keywords scheme="Book Subject Collection">
<list>
<label>SUCO11645</label>
<item>
<term>Computer Science</term>
</item>
</list>
</keywords>
</textClass>
<textClass>
<keywords scheme="Book Subject Group">
<list>
<label>I</label>
<label>I21017</label>
<label>I18040</label>
<label>I18032</label>
<label>I18024</label>
<label>I18030</label>
<label>I18008</label>
<item>
<term>Computer Science</term>
</item>
<item>
<term>Artificial Intelligence (incl. Robotics)</term>
</item>
<item>
<term>Information Systems Applications (incl. Internet)</term>
</item>
<item>
<term>Information Storage and Retrieval</term>
</item>
<item>
<term>Database Management</term>
</item>
<item>
<term>Data Mining and Knowledge Discovery</term>
</item>
<item>
<term>Information Systems and Communication Service</term>
</item>
</list>
</keywords>
</textClass>
</profileDesc>
<revisionDesc>
<change when="2010">Published</change>
<change xml:id="refBibs-istex" who="#ISTEX-API" when="2016-3-2">References added</change>
</revisionDesc>
</teiHeader>
</istex:fulltextTEI>
<json:item>
<original>false</original>
<mimetype>text/plain</mimetype>
<extension>txt</extension>
<uri>https://api.istex.fr/document/0A645893CDD5762C7749FF9B0CDC78E44DBDB935/fulltext/txt</uri>
</json:item>
</fulltext>
<metadata>
<istex:metadataXml wicri:clean="Springer, Publisher found" wicri:toSee="no header">
<istex:xmlDeclaration>version="1.0" encoding="UTF-8"</istex:xmlDeclaration>
<istex:docType PUBLIC="-//Springer-Verlag//DTD A++ V2.4//EN" URI="http://devel.springer.de/A++/V2.4/DTD/A++V2.4.dtd" name="istex:docType"></istex:docType>
<istex:document>
<Publisher>
<PublisherInfo>
<PublisherName>Springer Berlin Heidelberg</PublisherName>
<PublisherLocation>Berlin, Heidelberg</PublisherLocation>
</PublisherInfo>
<Series>
<SeriesInfo SeriesType="Series" TocLevels="0">
<SeriesID>558</SeriesID>
<SeriesPrintISSN>0302-9743</SeriesPrintISSN>
<SeriesElectronicISSN>1611-3349</SeriesElectronicISSN>
<SeriesTitle Language="En">Lecture Notes in Computer Science</SeriesTitle>
</SeriesInfo>
<SeriesHeader>
<EditorGroup>
<Editor AffiliationIDS="Aff1">
<EditorName DisplayOrder="Western">
<GivenName>David</GivenName>
<FamilyName>Hutchison</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff2">
<EditorName DisplayOrder="Western">
<GivenName>Takeo</GivenName>
<FamilyName>Kanade</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff3">
<EditorName DisplayOrder="Western">
<GivenName>Josef</GivenName>
<FamilyName>Kittler</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff4">
<EditorName DisplayOrder="Western">
<GivenName>Jon</GivenName>
<GivenName>M.</GivenName>
<FamilyName>Kleinberg</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff5">
<EditorName DisplayOrder="Western">
<GivenName>Friedemann</GivenName>
<FamilyName>Mattern</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff6">
<EditorName DisplayOrder="Western">
<GivenName>John</GivenName>
<GivenName>C.</GivenName>
<FamilyName>Mitchell</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff7">
<EditorName DisplayOrder="Western">
<GivenName>Moni</GivenName>
<FamilyName>Naor</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff8">
<EditorName DisplayOrder="Western">
<GivenName>Oscar</GivenName>
<FamilyName>Nierstrasz</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff9">
<EditorName DisplayOrder="Western">
<GivenName>C.</GivenName>
<FamilyName>Pandu Rangan</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff10">
<EditorName DisplayOrder="Western">
<GivenName>Bernhard</GivenName>
<FamilyName>Steffen</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff11">
<EditorName DisplayOrder="Western">
<GivenName>Madhu</GivenName>
<FamilyName>Sudan</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff12">
<EditorName DisplayOrder="Western">
<GivenName>Demetri</GivenName>
<FamilyName>Terzopoulos</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff13">
<EditorName DisplayOrder="Western">
<GivenName>Doug</GivenName>
<FamilyName>Tygar</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff14">
<EditorName DisplayOrder="Western">
<GivenName>Moshe</GivenName>
<GivenName>Y.</GivenName>
<FamilyName>Vardi</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff15">
<EditorName DisplayOrder="Western">
<GivenName>Gerhard</GivenName>
<FamilyName>Weikum</FamilyName>
</EditorName>
</Editor>
<Affiliation ID="Aff1">
<OrgName>Lancaster University</OrgName>
<OrgAddress>
<City>Lancaster</City>
<Country>UK</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff2">
<OrgName>Carnegie Mellon University</OrgName>
<OrgAddress>
<City>Pittsburgh</City>
<State>PA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff3">
<OrgName>University of Surrey</OrgName>
<OrgAddress>
<City>Guildford</City>
<Country>UK</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff4">
<OrgName>Cornell University</OrgName>
<OrgAddress>
<City>Ithaca</City>
<State>NY</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff5">
<OrgName>ETH Zurich</OrgName>
<OrgAddress>
<City>Zurich</City>
<Country>Switzerland</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff6">
<OrgName>Stanford University</OrgName>
<OrgAddress>
<City>Stanford</City>
<State>CA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff7">
<OrgName>Weizmann Institute of Science</OrgName>
<OrgAddress>
<City>Rehovot</City>
<Country>Israel</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff8">
<OrgName>University of Bern</OrgName>
<OrgAddress>
<City>Bern</City>
<Country>Switzerland</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff9">
<OrgName>Indian Institute of Technology</OrgName>
<OrgAddress>
<City>Madras</City>
<Country>India</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff10">
<OrgName>University of Dortmund</OrgName>
<OrgAddress>
<City>Dortmund</City>
<Country>Germany</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff11">
<OrgName>Massachusetts Institute of Technology</OrgName>
<OrgAddress>
<State>MA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff12">
<OrgName>University of California</OrgName>
<OrgAddress>
<City>Los Angeles</City>
<State>CA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff13">
<OrgName>University of California</OrgName>
<OrgAddress>
<City>Berkeley</City>
<State>CA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff14">
<OrgName>Rice University</OrgName>
<OrgAddress>
<City>Houston</City>
<State>TX</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff15">
<OrgName>Max-Planck Institute of Computer Science</OrgName>
<OrgAddress>
<City>Saarbrücken</City>
<Country>Germany</Country>
</OrgAddress>
</Affiliation>
</EditorGroup>
</SeriesHeader>
<SubSeries>
<SubSeriesInfo>
<SubSeriesID>1244</SubSeriesID>
<SubSeriesPrintISSN>0302-9743</SubSeriesPrintISSN>
<SubSeriesElectronicISSN>1611-3349</SubSeriesElectronicISSN>
<SubSeriesTitle Language="En">Lecture Notes in Artificial Intelligence</SubSeriesTitle>
</SubSeriesInfo>
<SubSeriesHeader>
<EditorGroup>
<Editor AffiliationIDS="Aff16">
<EditorName DisplayOrder="Western">
<GivenName>Randy</GivenName>
<FamilyName>Goebel</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff17">
<EditorName DisplayOrder="Western">
<GivenName>Jörg</GivenName>
<FamilyName>Siekmann</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff18">
<EditorName DisplayOrder="Western">
<GivenName>Wolfgang</GivenName>
<FamilyName>Wahlster</FamilyName>
</EditorName>
</Editor>
<Affiliation ID="Aff16">
<OrgName>University of Alberta</OrgName>
<OrgAddress>
<City>Edmonton</City>
<Country>Canada</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff17">
<OrgName>University of Saarland</OrgName>
<OrgAddress>
<City>Saarbrücken</City>
<Country>Germany</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff18">
<OrgName>DFKI and University of Saarland</OrgName>
<OrgAddress>
<City>Saarbrücken</City>
<Country>Germany</Country>
</OrgAddress>
</Affiliation>
</EditorGroup>
</SubSeriesHeader>
</SubSeries>
<Book Language="En">
<BookInfo BookProductType="Proceedings" ContainsESM="No" Language="En" MediaType="eBook" NumberingDepth="2" NumberingStyle="ContentOnly" OutputMedium="All" TocLevels="0">
<BookID>978-3-642-15883-4</BookID>
<BookTitle>Machine Learning and Knowledge Discovery in Databases</BookTitle>
<BookSubTitle>European Conference, ECML PKDD 2010, Barcelona, Spain, September 20-24, 2010, Proceedings, Part II</BookSubTitle>
<BookVolumeNumber>6322</BookVolumeNumber>
<BookSequenceNumber>6322</BookSequenceNumber>
<BookDOI>10.1007/978-3-642-15883-4</BookDOI>
<BookTitleID>213574</BookTitleID>
<BookPrintISBN>978-3-642-15882-7</BookPrintISBN>
<BookElectronicISBN>978-3-642-15883-4</BookElectronicISBN>
<BookChapterCount>32</BookChapterCount>
<BookCopyright>
<CopyrightHolderName>Springer-Verlag Berlin Heidelberg</CopyrightHolderName>
<CopyrightYear>2010</CopyrightYear>
</BookCopyright>
<BookSubjectGroup>
<BookSubject Code="I" Type="Primary">Computer Science</BookSubject>
<BookSubject Code="I21017" Priority="1" Type="Secondary">Artificial Intelligence (incl. Robotics)</BookSubject>
<BookSubject Code="I18040" Priority="2" Type="Secondary">Information Systems Applications (incl.Internet)</BookSubject>
<BookSubject Code="I18032" Priority="3" Type="Secondary">Information Storage and Retrieval</BookSubject>
<BookSubject Code="I18024" Priority="4" Type="Secondary">Database Management</BookSubject>
<BookSubject Code="I18030" Priority="5" Type="Secondary">Data Mining and Knowledge Discovery</BookSubject>
<BookSubject Code="I18008" Priority="6" Type="Secondary">Information Systems and Communication Service</BookSubject>
<SubjectCollection Code="SUCO11645">Computer Science</SubjectCollection>
</BookSubjectGroup>
<BookContext>
<SeriesID>558</SeriesID>
<SubSeriesID>1244</SubSeriesID>
</BookContext>
</BookInfo>
<BookHeader>
<EditorGroup>
<Editor AffiliationIDS="Aff19">
<EditorName DisplayOrder="Western">
<GivenName>José</GivenName>
<GivenName>Luis</GivenName>
<FamilyName>Balcázar</FamilyName>
</EditorName>
<Contact>
<Email>joseluis.balcazar@unican.es</Email>
</Contact>
</Editor>
<Editor AffiliationIDS="Aff20">
<EditorName DisplayOrder="Western">
<GivenName>Francesco</GivenName>
<FamilyName>Bonchi</FamilyName>
</EditorName>
<Contact>
<Email>bonchi@yahoo-inc.corp</Email>
</Contact>
</Editor>
<Editor AffiliationIDS="Aff21">
<EditorName DisplayOrder="Western">
<GivenName>Aristides</GivenName>
<FamilyName>Gionis</FamilyName>
</EditorName>
<Contact>
<Email>gionis@yahoo-inc.corp</Email>
</Contact>
</Editor>
<Editor AffiliationIDS="Aff22">
<EditorName DisplayOrder="Western">
<GivenName>Michèle</GivenName>
<FamilyName>Sebag</FamilyName>
</EditorName>
<Contact>
<Email>sebag@lri.fr</Email>
</Contact>
</Editor>
<Affiliation ID="Aff19">
<OrgDivision>Departamento de Matemáticas, Estadística y Computación</OrgDivision>
<OrgName>Universidad de Cantabria</OrgName>
<OrgAddress>
<Street>Avenida de los Castros, s/n</Street>
<Postcode>39071</Postcode>
<City>Santander</City>
<Country>Spain</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff20">
<OrgName>Yahoo! Research Barcelona</OrgName>
<OrgAddress>
<Street>Avinguda Diagonal 177</Street>
<Postcode>08018</Postcode>
<City>Barcelona</City>
<Country>Spain</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff21">
<OrgName>Yahoo! Research Barcelona</OrgName>
<OrgAddress>
<Street>Avinguda Diagonal 177</Street>
<Postcode>08018</Postcode>
<City>Barcelona</City>
<Country>Spain</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff22">
<OrgName>TAO, CNRS-INRIA-LRI, Université Paris-Sud</OrgName>
<OrgAddress>
<Postcode>91405</Postcode>
<City>Orsay</City>
<Country>France</Country>
</OrgAddress>
</Affiliation>
</EditorGroup>
</BookHeader>
<Part ID="Part1">
<PartInfo TocLevels="0">
<PartID>1</PartID>
<PartSequenceNumber>1</PartSequenceNumber>
<PartTitle>Regular Papers</PartTitle>
<PartChapterCount>32</PartChapterCount>
<PartContext>
<SeriesID>558</SeriesID>
<BookTitle>Machine Learning and Knowledge Discovery in Databases</BookTitle>
</PartContext>
</PartInfo>
<Chapter ID="Chap14" Language="En">
<ChapterInfo ChapterType="OriginalPaper" ContainsESM="No" NumberingDepth="2" NumberingStyle="ContentOnly" TocLevels="0">
<ChapterID>14</ChapterID>
<ChapterDOI>10.1007/978-3-642-15883-4_14</ChapterDOI>
<ChapterSequenceNumber>14</ChapterSequenceNumber>
<ChapterTitle Language="En">Learning to Tag from Open Vocabulary Labels</ChapterTitle>
<ChapterFirstPage>211</ChapterFirstPage>
<ChapterLastPage>226</ChapterLastPage>
<ChapterCopyright>
<CopyrightHolderName>Springer-Verlag Berlin Heidelberg</CopyrightHolderName>
<CopyrightYear>2010</CopyrightYear>
</ChapterCopyright>
<ChapterGrants Type="Regular">
<MetadataGrant Grant="OpenAccess"></MetadataGrant>
<AbstractGrant Grant="OpenAccess"></AbstractGrant>
<BodyPDFGrant Grant="Restricted"></BodyPDFGrant>
<BodyHTMLGrant Grant="Restricted"></BodyHTMLGrant>
<BibliographyGrant Grant="Restricted"></BibliographyGrant>
<ESMGrant Grant="Restricted"></ESMGrant>
</ChapterGrants>
<ChapterContext>
<SeriesID>558</SeriesID>
<PartID>1</PartID>
<BookID>978-3-642-15883-4</BookID>
<BookTitle>Machine Learning and Knowledge Discovery in Databases</BookTitle>
</ChapterContext>
</ChapterInfo>
<ChapterHeader>
<AuthorGroup>
<Author AffiliationIDS="Aff23">
<AuthorName DisplayOrder="Western">
<GivenName>Edith</GivenName>
<FamilyName>Law</FamilyName>
</AuthorName>
<Contact>
<Email>elaw@cs.cmu.edu</Email>
</Contact>
</Author>
<Author AffiliationIDS="Aff23">
<AuthorName DisplayOrder="Western">
<GivenName>Burr</GivenName>
<FamilyName>Settles</FamilyName>
</AuthorName>
<Contact>
<Email>bsettles@cs.cmu.edu</Email>
</Contact>
</Author>
<Author AffiliationIDS="Aff23">
<AuthorName DisplayOrder="Western">
<GivenName>Tom</GivenName>
<FamilyName>Mitchell</FamilyName>
</AuthorName>
<Contact>
<Email>tom.mitchell@cs.cmu.edu</Email>
</Contact>
</Author>
<Affiliation ID="Aff23">
<OrgDivision>Machine Learning Department</OrgDivision>
<OrgName>Carnegie Mellon University</OrgName>
<OrgAddress>
<Country> </Country>
</OrgAddress>
</Affiliation>
</AuthorGroup>
<Abstract ID="Abs1" Language="En">
<Heading>Abstract</Heading>
<Para>Most approaches to classifying media content assume a fixed, closed vocabulary of labels. In contrast, we advocate machine learning approaches which take advantage of the millions of free-form tags obtainable via online crowd-sourcing platforms and social tagging websites. The use of such open vocabularies presents learning challenges due to typographical errors, synonymy, and a potentially unbounded set of tag labels. In this work, we present a new approach that organizes these noisy tags into well-behaved semantic classes using topic modeling, and learn to predict tags accurately using a mixture of topic classes. This method can utilize an arbitrary open vocabulary of tags, reduces training time by 94% compared to learning from these tags directly, and achieves comparable performance for classification and superior performance for retrieval. We also demonstrate that on open vocabulary tasks, human evaluations are essential for measuring the true performance of tag classifiers, which traditional evaluation methods will consistently underestimate. We focus on the domain of tagging music clips, and demonstrate our results using data collected with a human computation game called TagATune.</Para>
</Abstract>
<KeywordGroup Language="En">
<Heading>Keywords</Heading>
<Keyword>Human Computation</Keyword>
<Keyword>Music Information Retrieval</Keyword>
<Keyword>Tagging Algorithms</Keyword>
<Keyword>Topic Modeling</Keyword>
</KeywordGroup>
</ChapterHeader>
<NoBody></NoBody>
</Chapter>
</Part>
</Book>
</Series>
</Publisher>
</istex:document>
</istex:metadataXml>
<mods version="3.6">
<titleInfo lang="en">
<title>Learning to Tag from Open Vocabulary Labels</title>
</titleInfo>
<titleInfo type="alternative" contentType="CDATA" lang="en">
<title>Learning to Tag from Open Vocabulary Labels</title>
</titleInfo>
<name type="personal">
<namePart type="given">Edith</namePart>
<namePart type="family">Law</namePart>
<affiliation>Machine Learning Department, Carnegie Mellon University</affiliation>
<affiliation>E-mail: elaw@cs.cmu.edu</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Burr</namePart>
<namePart type="family">Settles</namePart>
<affiliation>Machine Learning Department, Carnegie Mellon University</affiliation>
<affiliation>E-mail: bsettles@cs.cmu.edu</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Tom</namePart>
<namePart type="family">Mitchell</namePart>
<affiliation>Machine Learning Department, Carnegie Mellon University</affiliation>
<affiliation>E-mail: tom.mitchell@cs.cmu.edu</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<typeOfResource>text</typeOfResource>
<genre type="conference [research-article]" displayLabel="OriginalPaper"></genre>
<originInfo>
<publisher>Springer Berlin Heidelberg</publisher>
<place>
<placeTerm type="text">Berlin, Heidelberg</placeTerm>
</place>
<dateIssued encoding="w3cdtf">2010</dateIssued>
<copyrightDate encoding="w3cdtf">2010</copyrightDate>
</originInfo>
<language>
<languageTerm type="code" authority="rfc3066">en</languageTerm>
<languageTerm type="code" authority="iso639-2b">eng</languageTerm>
</language>
<physicalDescription>
<internetMediaType>text/html</internetMediaType>
</physicalDescription>
<abstract lang="en">Abstract: Most approaches to classifying media content assume a fixed, closed vocabulary of labels. In contrast, we advocate machine learning approaches which take advantage of the millions of free-form tags obtainable via online crowd-sourcing platforms and social tagging websites. The use of such open vocabularies presents learning challenges due to typographical errors, synonymy, and a potentially unbounded set of tag labels. In this work, we present a new approach that organizes these noisy tags into well-behaved semantic classes using topic modeling, and learn to predict tags accurately using a mixture of topic classes. This method can utilize an arbitrary open vocabulary of tags, reduces training time by 94% compared to learning from these tags directly, and achieves comparable performance for classification and superior performance for retrieval. We also demonstrate that on open vocabulary tasks, human evaluations are essential for measuring the true performance of tag classifiers, which traditional evaluation methods will consistently underestimate. We focus on the domain of tagging music clips, and demonstrate our results using data collected with a human computation game called TagATune.</abstract>
<relatedItem type="host">
<titleInfo>
<title>Machine Learning and Knowledge Discovery in Databases</title>
<subTitle>European Conference, ECML PKDD 2010, Barcelona, Spain, September 20-24, 2010, Proceedings, Part II</subTitle>
</titleInfo>
<name type="personal">
<namePart type="given">José</namePart>
<namePart type="given">Luis</namePart>
<namePart type="family">Balcázar</namePart>
<affiliation>Departamento de Matemáticas, Estadística y Computación, Universidad de Cantabria, Avenida de los Castros, s/n, 39071, Santander, Spain</affiliation>
<affiliation>E-mail: joseluis.balcazar@unican.es</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Francesco</namePart>
<namePart type="family">Bonchi</namePart>
<affiliation>Yahoo! Research Barcelona, Avinguda Diagonal 177, 08018, Barcelona, Spain</affiliation>
<affiliation>E-mail: bonchi@yahoo-inc.corp</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Aristides</namePart>
<namePart type="family">Gionis</namePart>
<affiliation>Yahoo! Research Barcelona, Avinguda Diagonal 177, 08018, Barcelona, Spain</affiliation>
<affiliation>E-mail: gionis@yahoo-inc.corp</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Michèle</namePart>
<namePart type="family">Sebag</namePart>
<affiliation>TAO, CNRS-INRIA-LRI, Université Paris-Sud, 91405, Orsay, France</affiliation>
<affiliation>E-mail: sebag@lri.fr</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<genre type="Book Series" displayLabel="Proceedings"></genre>
<originInfo>
<copyrightDate encoding="w3cdtf">2010</copyrightDate>
<issuance>monographic</issuance>
</originInfo>
<subject>
<genre>Book Subject Collection</genre>
<topic authority="SpringerSubjectCodes" authorityURI="SUCO11645">Computer Science</topic>
</subject>
<subject>
<genre>Book Subject Group</genre>
<topic authority="SpringerSubjectCodes" authorityURI="I">Computer Science</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I21017">Artificial Intelligence (incl. Robotics)</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I18040">Information Systems Applications (incl.Internet)</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I18032">Information Storage and Retrieval</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I18024">Database Management</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I18030">Data Mining and Knowledge Discovery</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I18008">Information Systems and Communication Service</topic>
</subject>
<identifier type="DOI">10.1007/978-3-642-15883-4</identifier>
<identifier type="ISBN">978-3-642-15882-7</identifier>
<identifier type="eISBN">978-3-642-15883-4</identifier>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="eISSN">1611-3349</identifier>
<identifier type="BookTitleID">213574</identifier>
<identifier type="BookID">978-3-642-15883-4</identifier>
<identifier type="BookChapterCount">32</identifier>
<identifier type="BookVolumeNumber">6322</identifier>
<identifier type="BookSequenceNumber">6322</identifier>
<identifier type="PartChapterCount">32</identifier>
<part>
<date>2010</date>
<detail type="part">
<title>Regular Papers</title>
</detail>
<detail type="volume">
<number>6322</number>
<caption>vol.</caption>
</detail>
<extent unit="pages">
<start>211</start>
<end>226</end>
</extent>
</part>
<recordInfo>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2010</recordOrigin>
</recordInfo>
</relatedItem>
<relatedItem type="series">
<titleInfo>
<title>Lecture Notes in Computer Science</title>
</titleInfo>
<name type="personal">
<namePart type="given">David</namePart>
<namePart type="family">Hutchison</namePart>
<affiliation>Lancaster University, Lancaster, UK</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Takeo</namePart>
<namePart type="family">Kanade</namePart>
<affiliation>Carnegie Mellon University, Pittsburgh, PA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Josef</namePart>
<namePart type="family">Kittler</namePart>
<affiliation>University of Surrey, Guildford, UK</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Jon</namePart>
<namePart type="given">M.</namePart>
<namePart type="family">Kleinberg</namePart>
<affiliation>Cornell University, Ithaca, NY, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Friedemann</namePart>
<namePart type="family">Mattern</namePart>
<affiliation>ETH Zurich, Zurich, Switzerland</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">John</namePart>
<namePart type="given">C.</namePart>
<namePart type="family">Mitchell</namePart>
<affiliation>Stanford University, Stanford, CA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Moni</namePart>
<namePart type="family">Naor</namePart>
<affiliation>Weizmann Institute of Science, Rehovot, Israel</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Oscar</namePart>
<namePart type="family">Nierstrasz</namePart>
<affiliation>University of Bern, Bern, Switzerland</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">C.</namePart>
<namePart type="family">Pandu Rangan</namePart>
<affiliation>Indian Institute of Technology, Madras, India</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Bernhard</namePart>
<namePart type="family">Steffen</namePart>
<affiliation>University of Dortmund, Dortmund, Germany</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Madhu</namePart>
<namePart type="family">Sudan</namePart>
<affiliation>Massachusetts Institute of Technology, MA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Demetri</namePart>
<namePart type="family">Terzopoulos</namePart>
<affiliation>University of California, Los Angeles, CA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Doug</namePart>
<namePart type="family">Tygar</namePart>
<affiliation>University of California, Berkeley, CA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Moshe</namePart>
<namePart type="given">Y.</namePart>
<namePart type="family">Vardi</namePart>
<affiliation>Rice University, Houston, TX, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Gerhard</namePart>
<namePart type="family">Weikum</namePart>
<affiliation>Max-Planck Institute of Computer Science, Saarbrücken, Germany</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<copyrightDate encoding="w3cdtf">2010</copyrightDate>
<issuance>serial</issuance>
</originInfo>
<relatedItem type="constituent">
<titleInfo>
<title>Lecture Notes in Artificial Intelligence</title>
</titleInfo>
<name type="personal">
<namePart type="given">David</namePart>
<namePart type="family">Hutchison</namePart>
<affiliation>Lancaster University, Lancaster, UK</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Takeo</namePart>
<namePart type="family">Kanade</namePart>
<affiliation>Carnegie Mellon University, Pittsburgh, PA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Josef</namePart>
<namePart type="family">Kittler</namePart>
<affiliation>University of Surrey, Guildford, UK</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Jon</namePart>
<namePart type="given">M.</namePart>
<namePart type="family">Kleinberg</namePart>
<affiliation>Cornell University, Ithaca, NY, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Friedemann</namePart>
<namePart type="family">Mattern</namePart>
<affiliation>ETH Zurich, Zurich, Switzerland</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">John</namePart>
<namePart type="given">C.</namePart>
<namePart type="family">Mitchell</namePart>
<affiliation>Stanford University, Stanford, CA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Moni</namePart>
<namePart type="family">Naor</namePart>
<affiliation>Weizmann Institute of Science, Rehovot, Israel</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Oscar</namePart>
<namePart type="family">Nierstrasz</namePart>
<affiliation>University of Bern, Bern, Switzerland</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">C.</namePart>
<namePart type="family">Pandu Rangan</namePart>
<affiliation>Indian Institute of Technology, Madras, India</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Bernhard</namePart>
<namePart type="family">Steffen</namePart>
<affiliation>University of Dortmund, Dortmund, Germany</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Madhu</namePart>
<namePart type="family">Sudan</namePart>
<affiliation>Massachusetts Institute of Technology, MA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Demetri</namePart>
<namePart type="family">Terzopoulos</namePart>
<affiliation>University of California, Los Angeles, CA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Doug</namePart>
<namePart type="family">Tygar</namePart>
<affiliation>University of California, Berkeley, CA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Moshe</namePart>
<namePart type="given">Y.</namePart>
<namePart type="family">Vardi</namePart>
<affiliation>Rice University, Houston, TX, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Gerhard</namePart>
<namePart type="family">Weikum</namePart>
<affiliation>Max-Planck Institute of Computer Science, Saarbrücken, Germany</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Randy</namePart>
<namePart type="family">Goebel</namePart>
<affiliation>University of Alberta, Edmonton, Canada</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Jörg</namePart>
<namePart type="family">Siekmann</namePart>
<affiliation>University of Saarland, Saarbrücken, Germany</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Wolfgang</namePart>
<namePart type="family">Wahlster</namePart>
<affiliation>DFKI and University of Saarland, Saarbrücken, Germany</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">José</namePart>
<namePart type="given">Luis</namePart>
<namePart type="family">Balcázar</namePart>
<affiliation>Departamento de Matemáticas, Estadística y Computación, Universidad de Cantabria, Avenida de los Castros, s/n, 39071, Santander, Spain</affiliation>
<affiliation>E-mail: joseluis.balcazar@unican.es</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Francesco</namePart>
<namePart type="family">Bonchi</namePart>
<affiliation>Yahoo! Research Barcelona, Avinguda Diagonal 177, 08018, Barcelona, Spain</affiliation>
<affiliation>E-mail: bonchi@yahoo-inc.corp</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Aristides</namePart>
<namePart type="family">Gionis</namePart>
<affiliation>Yahoo! Research Barcelona, Avinguda Diagonal 177, 08018, Barcelona, Spain</affiliation>
<affiliation>E-mail: gionis@yahoo-inc.corp</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Michèle</namePart>
<namePart type="family">Sebag</namePart>
<affiliation>TAO, CNRS-INRIA-LRI, Université Paris-Sud, 91405, Orsay, France</affiliation>
<affiliation>E-mail: sebag@lri.fr</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<genre type="Sub-Series"></genre>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="eISSN">1611-3349</identifier>
<identifier type="SubSeriesID">1244</identifier>
</relatedItem>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="eISSN">1611-3349</identifier>
<identifier type="SeriesID">558</identifier>
<recordInfo>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2010</recordOrigin>
</recordInfo>
</relatedItem>
<identifier type="istex">0A645893CDD5762C7749FF9B0CDC78E44DBDB935</identifier>
<identifier type="DOI">10.1007/978-3-642-15883-4_14</identifier>
<identifier type="ChapterID">Chap14</identifier>
<identifier type="ChapterID">14</identifier>
<accessCondition type="use and reproduction" contentType="copyright">Springer-Verlag Berlin Heidelberg</accessCondition>
<recordInfo>
<recordContentSource>SPRINGER</recordContentSource>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2010</recordOrigin>
</recordInfo>
</mods>
</metadata>
<enrichments>
<istex:refBibTEI uri="https://api.istex.fr/document/0A645893CDD5762C7749FF9B0CDC78E44DBDB935/enrichments/refBib">
<teiHeader></teiHeader>
<text>
<front></front>
<body></body>
<back>
<listBibl>
<biblStruct xml:id="b0">
<monogr>
<title level="m" type="main">Predicting genre labels for artists using freedb</title>
<author>
<persName>
<forename type="first">J</forename>
<surname>Bergstra</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">A</forename>
<surname>Lacoste</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<surname>Eck</surname>
</persName>
</author>
<imprint>
<date type="published" when="2006"></date>
<biblScope unit="page" from="85" to="88"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b1">
<analytic>
<title level="a" type="main">Autotagger: a model for predicting social tags from acoustic features on large music databases</title>
<author>
<persName>
<forename type="first">T</forename>
<surname>Bertin-Mahieux</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<surname>Eck</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">F</forename>
<surname>Maillet</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">P</forename>
<surname>Lamere</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">TASLP</title>
<imprint>
<biblScope unit="volume">37</biblScope>
<biblScope unit="issue">2</biblScope>
<biblScope unit="page" from="115" to="135"></biblScope>
<date type="published" when="2008"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b2">
<monogr>
<title level="m" type="main">Supervised topic models</title>
<author>
<persName>
<forename type="first">D</forename>
<surname>Blei</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">J</forename>
<forename type="middle">D</forename>
<surname>McAuliffe</surname>
</persName>
</author>
<imprint>
<date type="published" when="2007"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b3">
<analytic>
<title level="a" type="main">Latent Dirichlet allocation</title>
<author>
<persName>
<forename type="first">D</forename>
<surname>Blei</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">A</forename>
<surname>Ng</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">M</forename>
<surname>Jordan</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">Journal of Machine Learning Research</title>
<imprint>
<biblScope unit="volume">3</biblScope>
<biblScope unit="page" from="993" to="1022"></biblScope>
<date type="published" when="2003"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b4">
<analytic>
<title level="a" type="main">Maxent, mathematics, and information theory</title>
<author>
<persName>
<forename type="first">I</forename>
<surname>Csiszar</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Maximum Entropy and Bayesian Methods</title>
<editor>Hanson, K., Silver, R.</editor>
<meeting>
<address>
<addrLine>Dordrecht</addrLine>
</address>
</meeting>
<imprint>
<publisher>Kluwer Academic Publishers</publisher>
<date type="published" when="1996"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b5">
<monogr>
<title level="m" type="main">Understanding search performance in query-by-humming systems</title>
<author>
<persName>
<forename type="first">R</forename>
<forename type="middle">B</forename>
<surname>Dannenberg</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">N</forename>
<surname>Hu</surname>
</persName>
</author>
<imprint>
<date type="published" when="2004"></date>
<biblScope unit="page" from="41" to="50"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b6">
<analytic>
<title level="a" type="main">Beatbank – an MPEG-7 compliant query by tapping system</title>
<author>
<persName>
<forename type="first">G</forename>
<surname>Eisenberg</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">J</forename>
<forename type="middle">M</forename>
<surname>Batke</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">T</forename>
<surname>Sikora</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">Audio Engineering Society Convention</title>
<imprint>
<biblScope unit="page">6136</biblScope>
<date type="published" when="2004"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b7">
<analytic>
<title level="a" type="main">Recent studies on music information processing</title>
<author>
<persName>
<forename type="first">M</forename>
<surname>Goto</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">K</forename>
<surname>Hirata</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">Acoustical Science and Technology</title>
<imprint>
<biblScope unit="page" from="419" to="425"></biblScope>
<date type="published" when="2004"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b8">
<analytic>
<title level="a" type="main">Automatic classification of music instrument sounds</title>
<author>
<persName>
<forename type="first">P</forename>
<surname>Herrera</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">G</forename>
<surname>Peeters</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">S</forename>
<surname>Dubnov</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">Journal of New Music Research</title>
<imprint>
<biblScope unit="page" from="3" to="21"></biblScope>
<date type="published" when="2003"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b9">
<monogr>
<title level="m" type="main">Easy as CBA: A simple probabilistic model for tagging music</title>
<author>
<persName>
<forename type="first">M</forename>
<surname>Hoffman</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<surname>Blei</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">P</forename>
<surname>Cook</surname>
</persName>
</author>
<imprint>
<date type="published" when="2009"></date>
<publisher>ISMIR</publisher>
<biblScope unit="page" from="369" to="374"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b10">
<monogr>
<title level="m" type="main">Modeling social annotation data with content relevance using a topic model</title>
<author>
<persName>
<forename type="first">T</forename>
<surname>Iwata</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">T</forename>
<surname>Yamada</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">N</forename>
<surname>Ueda</surname>
</persName>
</author>
<imprint>
<date type="published" when="2009"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b11">
<analytic>
<title level="a" type="main">Social tagging and music information retrieval</title>
<author>
<persName>
<forename type="first">P</forename>
<surname>Lamere</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">Journal of New Music Research</title>
<imprint>
<biblScope unit="volume">37</biblScope>
<biblScope unit="issue">2</biblScope>
<biblScope unit="page" from="101" to="114"></biblScope>
<date type="published" when="2008"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b12">
<monogr>
<title level="m" type="main">Music mood representations from social tags</title>
<author>
<persName>
<forename type="first">C</forename>
<surname>Laurier</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">M</forename>
<surname>Sordo</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">J</forename>
<surname>Serra</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">P</forename>
<surname>Herrera</surname>
</persName>
</author>
<imprint>
<date type="published" when="2009"></date>
<publisher>ISMIR</publisher>
<biblScope unit="page" from="381" to="386"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b13">
<monogr>
<title level="m" type="main">Input-agreement: A new mechanism for collecting data using human computation games</title>
<author>
<persName>
<forename type="first">E</forename>
<surname>Law</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">L</forename>
<surname>von Ahn</surname>
</persName>
</author>
<imprint>
<date type="published" when="2009"></date>
<publisher>CHI</publisher>
<biblScope unit="page" from="1197" to="1206"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b14">
<monogr>
<title level="m" type="main">Evaluation of algorithms using games: The case of music tagging</title>
<author>
<persName>
<forename type="first">E</forename>
<surname>Law</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">K</forename>
<surname>West</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">M</forename>
<surname>Mandel</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">M</forename>
<surname>Bay</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">S</forename>
<surname>Downie</surname>
</persName>
</author>
<imprint>
<date type="published" when="2009"></date>
<publisher>ISMIR</publisher>
<biblScope unit="page" from="387" to="392"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b15">
<monogr>
<title level="m" type="main">A semantic space for music derived from social tags</title>
<author>
<persName>
<forename type="first">M</forename>
<surname>Levy</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">M</forename>
<surname>Sandler</surname>
</persName>
</author>
<imprint>
<date type="published" when="2007"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b16">
<monogr>
<title level="m" type="main">A comparative study on content-based music genre classification</title>
<author>
<persName>
<forename type="first">T</forename>
<surname>Li</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">M</forename>
<surname>Ogihara</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">Q</forename>
<surname>Li</surname>
</persName>
</author>
<imprint>
<date type="published" when="2003"></date>
<publisher>SIGIR</publisher>
<biblScope unit="page" from="282" to="289"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b17">
<monogr>
<title level="m" type="main">Song-level features and support vector machines for music classification</title>
<author>
<persName>
<forename type="first">M</forename>
<surname>Mandel</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<surname>Ellis</surname>
</persName>
</author>
<imprint>
<date type="published" when="2005"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b18">
<monogr>
<title level="m" type="main">Labrosa's audio classification submissions</title>
<author>
<persName>
<forename type="first">M</forename>
<surname>Mandel</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<surname>Ellis</surname>
</persName>
</author>
<imprint>
<date type="published" when="2009"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b19">
<analytic>
<title level="a" type="main">A web-based game for collecting music metadata</title>
<author>
<persName>
<forename type="first">M</forename>
<surname>Mandel</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<surname>Ellis</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">Journal of New Music Research</title>
<imprint>
<biblScope unit="volume">37</biblScope>
<biblScope unit="issue">2</biblScope>
<biblScope unit="page" from="151" to="165"></biblScope>
<date type="published" when="2008"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b20">
<monogr>
<title level="m" type="main">Topic models conditioned on arbitrary features with Dirichlet-multinomial regression</title>
<author>
<persName>
<forename type="first">D</forename>
<surname>Mimno</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">A</forename>
<surname>McCallum</surname>
</persName>
</author>
<imprint>
<date type="published" when="2008"></date>
<publisher>UAI</publisher>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b21">
<monogr>
<title level="m" type="main">Probabilistic topic models. In: Handbook of Latent Semantic Analysis</title>
<author>
<persName>
<forename type="first">M</forename>
<surname>Steyvers</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">T</forename>
<surname>Griffiths</surname>
</persName>
</author>
<editor>Landauer, T., McNamara, D.S., Dennis, S., Kintsch, W.</editor>
<imprint>
<date type="published" when="2007"></date>
<publisher>Erlbaum</publisher>
<pubPlace>Hillsdale</pubPlace>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b22">
<monogr>
<title level="m" type="main">Multi-label classification of music emotions</title>
<author>
<persName>
<forename type="first">K</forename>
<surname>Trohidis</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">G</forename>
<surname>Tsoumakas</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">G</forename>
<surname>Kalliris</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">I</forename>
<surname>Vlahavas</surname>
</persName>
</author>
<imprint>
<date type="published" when="2008"></date>
<publisher>ISMIR</publisher>
<biblScope unit="page" from="325" to="330"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b23">
<analytic>
<title level="a" type="main">Semantic annotation and retrieval of music and sound effects</title>
<author>
<persName>
<forename type="first">D</forename>
<surname>Turnbull</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">L</forename>
<surname>Barrington</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<surname>Torres</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">G</forename>
<surname>Lanckriet</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">TASLP</title>
<imprint>
<biblScope unit="volume">16</biblScope>
<biblScope unit="issue">2</biblScope>
<biblScope unit="page" from="467" to="476"></biblScope>
<date type="published" when="2008"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b24">
<monogr>
<title level="m" type="main">A game-based approach for collecting semantic annotations of music</title>
<author>
<persName>
<forename type="first">D</forename>
<surname>Turnbull</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">R</forename>
<surname>Liu</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">L</forename>
<surname>Barrington</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">G</forename>
<surname>Lanckriet</surname>
</persName>
</author>
<imprint>
<date type="published" when="2007"></date>
<publisher>ISMIR</publisher>
<biblScope unit="page" from="535" to="538"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b25">
<analytic>
<title level="a" type="main">Musical genre classification of audio signals</title>
<author>
<persName>
<forename type="first">G</forename>
<surname>Tzanetakis</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">P</forename>
<surname>Cook</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">IEEE Transactions on Speech and Audio Processing</title>
<imprint>
<biblScope unit="volume">10</biblScope>
<biblScope unit="issue">5</biblScope>
<biblScope unit="page" from="293" to="302"></biblScope>
<date type="published" when="2002"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b26">
<monogr>
<title level="m" type="main">Labeling images with a computer game Combining musical and cultural features for intelligent style detection</title>
<author>
<persName>
<forename type="first">L</forename>
<surname>von Ahn</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">L</forename>
<surname>Dabbish</surname>
</persName>
</author>
<imprint>
<date type="published" when="2002"></date>
<publisher>CHI</publisher>
<biblScope unit="page" from="319" to="326"></biblScope>
<pubPlace>Whitman, B., Smaragdis, P.</pubPlace>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b27">
<monogr>
<title level="m" type="main">Efficient methods for topic model inference on streaming document collections</title>
<author>
<persName>
<forename type="first">L</forename>
<surname>Yao</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<surname>Mimno</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">A</forename>
<surname>McCallum</surname>
</persName>
</author>
<imprint>
<date type="published" when="2009"></date>
<publisher>KDD</publisher>
<biblScope unit="page" from="937" to="946"></biblScope>
</imprint>
</monogr>
</biblStruct>
</listBibl>
</back>
</text>
</istex:refBibTEI>
</enrichments>
</istex>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Musique/explor/OperaV1/Data/Istex/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000765 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Istex/Corpus/biblio.hfd -nk 000765 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Wicri/Musique
   |area=    OperaV1
   |flux=    Istex
   |étape=   Corpus
   |type=    RBID
   |clé=     ISTEX:0A645893CDD5762C7749FF9B0CDC78E44DBDB935
   |texte=   Learning to Tag from Open Vocabulary Labels
}}

Wicri

This area was generated with Dilib version V0.6.21.
Data generation: Thu Apr 14 14:59:05 2016. Site generation: Thu Jan 4 23:09:23 2024