Strategic Information System and Agriculture (exploration server)

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Automatic Multi-label Subject Indexing in a Multilingual Environment

Internal identifier: 000F36 (Main/Merge); previous: 000F35; next: 000F37

Authors: Boris Lauser [Italy]; Andreas Hotho [Germany]

RBID : ISTEX:1A40FC7CB8FF8D6ADF6FF04BDB64226A76B64CA5

Abstract

This paper presents an approach to automatically subject index full-text documents with multiple labels based on binary support vector machines (SVM). The aim was to test the applicability of SVMs with a real-world dataset. We have also explored the feasibility of incorporating multilingual background knowledge, as represented in thesauri or ontologies, into our text document representation for indexing purposes. The test set for our evaluations has been compiled from an extensive document base maintained by the Food and Agriculture Organization (FAO) of the United Nations (UN). Empirical results show that SVMs are a good method for automatic multi-label classification of documents in multiple languages.

DOI: 10.1007/978-3-540-45175-4_14

Links to Exploration step

ISTEX:1A40FC7CB8FF8D6ADF6FF04BDB64226A76B64CA5

The document in XML format

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Automatic Multi-label Subject Indexing in a Multilingual Environment</title>
<author>
<name sortKey="Lauser, Boris" sort="Lauser, Boris" uniqKey="Lauser B" first="Boris" last="Lauser">Boris Lauser</name>
</author>
<author>
<name sortKey="Hotho, Andreas" sort="Hotho, Andreas" uniqKey="Hotho A" first="Andreas" last="Hotho">Andreas Hotho</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:1A40FC7CB8FF8D6ADF6FF04BDB64226A76B64CA5</idno>
<date when="2003" year="2003">2003</date>
<idno type="doi">10.1007/978-3-540-45175-4_14</idno>
<idno type="url">https://api.istex.fr/document/1A40FC7CB8FF8D6ADF6FF04BDB64226A76B64CA5/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000335</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Corpus" wicri:corpus="ISTEX">000335</idno>
<idno type="wicri:Area/Istex/Curation">000323</idno>
<idno type="wicri:Area/Istex/Checkpoint">000815</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">000815</idno>
<idno type="wicri:doubleKey">0302-9743:2003:Lauser B:automatic:multi:label</idno>
<idno type="wicri:Area/Main/Merge">000F36</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Automatic Multi-label Subject Indexing in a Multilingual Environment</title>
<author>
<name sortKey="Lauser, Boris" sort="Lauser, Boris" uniqKey="Lauser B" first="Boris" last="Lauser">Boris Lauser</name>
<affiliation wicri:level="3">
<country xml:lang="fr">Italie</country>
<wicri:regionArea>FAO of the UN, Library &amp; Documentation Systems Division, 00100, Rome</wicri:regionArea>
<placeName>
<settlement type="city">Rome</settlement>
<region nuts="2">Latium</region>
</placeName>
</affiliation>
<affiliation></affiliation>
</author>
<author>
<name sortKey="Hotho, Andreas" sort="Hotho, Andreas" uniqKey="Hotho A" first="Andreas" last="Hotho">Andreas Hotho</name>
<affiliation wicri:level="3">
<country xml:lang="fr">Allemagne</country>
<wicri:regionArea>Institute AIFB, University of Karlsruhe, 76131, Karlsruhe</wicri:regionArea>
<placeName>
<region type="land" nuts="1">Bade-Wurtemberg</region>
<region type="district" nuts="2">District de Karlsruhe</region>
<settlement type="city">Karlsruhe</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Allemagne</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2003</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">1A40FC7CB8FF8D6ADF6FF04BDB64226A76B64CA5</idno>
<idno type="DOI">10.1007/978-3-540-45175-4_14</idno>
<idno type="ChapterID">14</idno>
<idno type="ChapterID">Chap14</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: This paper presents an approach to automatically subject index full-text documents with multiple labels based on binary support vector machines (SVM). The aim was to test the applicability of SVMs with a real world dataset. We have also explored the feasibility of incorporating multilingual background knowledge, as represented in thesauri or ontologies, into our text document representation for indexing purposes. The test set for our evaluations has been compiled from an extensive document base maintained by the Food and Agriculture Organization (FAO) of the United Nations (UN). Empirical results show that SVMs are a good method for automatic multi-label classification of documents in multiple languages.</div>
</front>
</TEI>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Agronomie/explor/SisAgriV1/Data/Main/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000F36 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Merge/biblio.hfd -nk 000F36 | SxmlIndent | more
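
Outside Dilib, a record of this shape can presumably also be read with a general-purpose XML parser. The following is only a minimal sketch in Python, not part of the Dilib toolchain. It works on a trimmed copy of the record and assumes a namespace declaration for the `wicri:` prefix, which the record uses without declaring. The URI in `WICRI_NS` is a placeholder and the function name `summarize` is hypothetical.

```python
import xml.etree.ElementTree as ET

# Placeholder namespace URI (an assumption): the record uses the wicri: prefix
# without declaring it, so a declaration is added to satisfy a strict parser.
WICRI_NS = "http://example.org/wicri"

# Trimmed copy of the record, keeping only the fields read below.
RECORD = f"""<record xmlns:wicri="{WICRI_NS}">
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Automatic Multi-label Subject Indexing in a Multilingual Environment</title>
<author>
<name sortKey="Lauser, Boris" first="Boris" last="Lauser">Boris Lauser</name>
</author>
<author>
<name sortKey="Hotho, Andreas" first="Andreas" last="Hotho">Andreas Hotho</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="RBID">ISTEX:1A40FC7CB8FF8D6ADF6FF04BDB64226A76B64CA5</idno>
<date when="2003" year="2003">2003</date>
<idno type="doi">10.1007/978-3-540-45175-4_14</idno>
<idno type="wicri:Area/Main/Merge">000F36</idno>
</publicationStmt>
</fileDesc>
</teiHeader>
</TEI>
</record>"""


def summarize(xml_text):
    """Return title, authors, year, DOI and internal identifier of a record."""
    root = ET.fromstring(xml_text)
    title_stmt = root.find(".//titleStmt")
    publication = root.find(".//publicationStmt")
    # Index the <idno> elements by their type attribute.
    ids = {idno.get("type"): idno.text for idno in publication.findall("idno")}
    return {
        "title": title_stmt.findtext("title"),
        "authors": [name.text for name in title_stmt.findall("author/name")],
        "year": publication.find("date").get("when"),
        "doi": ids.get("doi"),
        "internal_id": ids.get("wicri:Area/Main/Merge"),
    }


summary = summarize(RECORD)
print(summary)
```

The elements carry no default namespace, so plain tag names are enough for the lookups. Only the `wicri:`-prefixed attribute needs the added declaration.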

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Wicri/Agronomie
   |area=    SisAgriV1
   |flux=    Main
   |étape=   Merge
   |type=    RBID
   |clé=     ISTEX:1A40FC7CB8FF8D6ADF6FF04BDB64226A76B64CA5
   |texte=   Automatic Multi-label Subject Indexing in a Multilingual Environment
}}

This area was generated with Dilib version V0.6.28.
Data generation: Wed Mar 29 00:06:34 2017. Site generation: Tue Mar 12 12:44:16 2024