InforLorV4, PascalFrancis, Corpus, bibRecord, 000348

A proposal for annotation, semantic similarity and classification of textual documents

Identifieur interne : 000348 ( PascalFrancis/Corpus ); précédent : 000347; suivant : 000349

A proposal for annotation, semantic similarity and classification of textual documents

Auteurs : Emmanuel Nauer ; Amedeo Napoli

Source :

Lecture notes in computer science

RBID : Pascal:08-0032186

Descripteurs français

Pascal (Inist)
- Intelligence artificielle, Similitude, Classification, Texte, Analyse contenu, Ontologie, Recherche information, Annotation, Sémantique, Mensonge.

English descriptors

KwdEn :
- Annotation, Artificial intelligence, Classification, Content analysis, Information retrieval, Lying, Ontology, Semantics, Similarity, Text.

Abstract

In this paper, we present an approach for classifying documents based on the notion of a semantic similarity and the effective representation of the content of the documents. The content of a document is annotated and the resulting annotation is represented by a labeled tree whose nodes and edges are represented by concepts lying within a domain ontology. A reasoning process may be carried out on annotation trees, allowing the comparison of documents between each others, for classification or information retrieval purposes. An algorithm for classifying documents with respect to semantic similarity and a discussion conclude the paper.

Notice en format standard (ISO 2709)

Pour connaître la documentation sur le format Inist Standard.

A05				`@2 4183`
A08	`01`	`1`	`ENG`	`@1 A proposal for annotation, semantic similarity and classification of textual documents`
A09	`01`	`1`	`ENG`	`@1 Artificial intelligence : methodology, systems, and applications : 12th international conference, AIMSA 2006, Varna, Bulgaria, September 12-15, 2006 : proceedings`
A11	`01`	`1`		`@1 NAUER (Emmanuel)`
A11	`02`	`1`		`@1 NAPOLI (Amedeo)`
A12	`01`	`1`		`@1 EUZENAT (Jérôme) @9 ed.`
A12	`02`	`1`		`@1 DOMINGUE (John) @9 ed.`
A14	`01`			`@1 LORIA -UMR 7503 Bâtiment B, B.P. 239 @2 54506 Vandœuvre-lès-Nancy @3 FRA @Z 1 aut. @Z 2 aut.`
A20				`@1 201-212`
A21				`@1 2006`
A23	`01`			`@0 ENG`
A26	`01`			`@0 3-540-40930-0`
A43	`01`			`@1 INIST @2 16343 @5 354000153641600200`
A44				`@0 0000 @1 © 2008 INIST-CNRS. All rights reserved.`
A45				`@0 18 ref.`
A47	`01`	`1`		`@0 08-0032186`
A60				`@1 P @2 C`
A61				`@0 A`
A64	`01`	`2`		`@0 Lecture notes in computer science`
A66	`01`			`@0 DEU`
C01	`01`		`ENG`	@0 In this paper, we present an approach for classifying documents based on the notion of a semantic similarity and the effective representation of the content of the documents. The content of a document is annotated and the resulting annotation is represented by a labeled tree whose nodes and edges are represented by concepts lying within a domain ontology. A reasoning process may be carried out on annotation trees, allowing the comparison of documents between each others, for classification or information retrieval purposes. An algorithm for classifying documents with respect to semantic similarity and a discussion conclude the paper.
C02	`01`	`X`		`@0 001D02B07D`
C02	`02`	`X`		`@0 001D02C`
C03	`01`	`X`	`FRE`	`@0 Intelligence artificielle @5 01`
C03	`01`	`X`	`ENG`	`@0 Artificial intelligence @5 01`
C03	`01`	`X`	`SPA`	`@0 Inteligencia artificial @5 01`
C03	`02`	`X`	`FRE`	`@0 Similitude @5 06`
C03	`02`	`X`	`ENG`	`@0 Similarity @5 06`
C03	`02`	`X`	`SPA`	`@0 Similitud @5 06`
C03	`03`	`X`	`FRE`	`@0 Classification @5 07`
C03	`03`	`X`	`ENG`	`@0 Classification @5 07`
C03	`03`	`X`	`SPA`	`@0 Clasificación @5 07`
C03	`04`	`X`	`FRE`	`@0 Texte @5 08`
C03	`04`	`X`	`ENG`	`@0 Text @5 08`
C03	`04`	`X`	`SPA`	`@0 Texto @5 08`
C03	`05`	`X`	`FRE`	`@0 Analyse contenu @5 09`
C03	`05`	`X`	`ENG`	`@0 Content analysis @5 09`
C03	`05`	`X`	`SPA`	`@0 Análisis contenido @5 09`
C03	`06`	`X`	`FRE`	`@0 Ontologie @5 10`
C03	`06`	`X`	`ENG`	`@0 Ontology @5 10`
C03	`06`	`X`	`SPA`	`@0 Ontología @5 10`
C03	`07`	`X`	`FRE`	`@0 Recherche information @5 11`
C03	`07`	`X`	`ENG`	`@0 Information retrieval @5 11`
C03	`07`	`X`	`SPA`	`@0 Búsqueda información @5 11`
C03	`08`	`X`	`FRE`	`@0 Annotation @5 18`
C03	`08`	`X`	`ENG`	`@0 Annotation @5 18`
C03	`08`	`X`	`SPA`	`@0 Anotación @5 18`
C03	`09`	`X`	`FRE`	`@0 Sémantique @5 19`
C03	`09`	`X`	`ENG`	`@0 Semantics @5 19`
C03	`09`	`X`	`SPA`	`@0 Semántica @5 19`
C03	`10`	`X`	`FRE`	`@0 Mensonge @5 20`
C03	`10`	`X`	`ENG`	`@0 Lying @5 20`
C03	`10`	`X`	`SPA`	`@0 Mentira @5 20`
N21				`@1 052`
N44	`01`			`@1 OTO`
N82				`@1 OTO`

A30	`01`	`1`	`ENG`	`@1 International Conference on Artificial Intelligence : Methodology, Systems, and Applications @2 12 @3 Varna BGR @4 2006`

Format Inist (serveur)

NO :	PASCAL 08-0032186 INIST
ET :	A proposal for annotation, semantic similarity and classification of textual documents
AU :	NAUER (Emmanuel); NAPOLI (Amedeo); EUZENAT (Jérôme); DOMINGUE (John)
AF :	LORIA -UMR 7503 Bâtiment B, B.P. 239/54506 Vandœuvre-lès-Nancy/France (1 aut., 2 aut.)
DT :	Publication en série; Congrès; Niveau analytique
SO :	Lecture notes in computer science; Allemagne; Da. 2006; Vol. 4183; Pp. 201-212; Bibl. 18 ref.
LA :	Anglais
EA :	In this paper, we present an approach for classifying documents based on the notion of a semantic similarity and the effective representation of the content of the documents. The content of a document is annotated and the resulting annotation is represented by a labeled tree whose nodes and edges are represented by concepts lying within a domain ontology. A reasoning process may be carried out on annotation trees, allowing the comparison of documents between each others, for classification or information retrieval purposes. An algorithm for classifying documents with respect to semantic similarity and a discussion conclude the paper.
CC :	001D02B07D; 001D02C
FD :	Intelligence artificielle; Similitude; Classification; Texte; Analyse contenu; Ontologie; Recherche information; Annotation; Sémantique; Mensonge
ED :	Artificial intelligence; Similarity; Classification; Text; Content analysis; Ontology; Information retrieval; Annotation; Semantics; Lying
SD :	Inteligencia artificial; Similitud; Clasificación; Texto; Análisis contenido; Ontología; Búsqueda información; Anotación; Semántica; Mentira
LO :	INIST-16343.354000153641600200
ID :	08-0032186

Links to Exploration step

Pascal:08-0032186

Le document en format XML

<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" level="a">A proposal for annotation, semantic similarity and classification of textual documents</title>
<author><name sortKey="Nauer, Emmanuel" sort="Nauer, Emmanuel" uniqKey="Nauer E" first="Emmanuel" last="Nauer">Emmanuel Nauer</name>
<affiliation><inist:fA14 i1="01"><s1>LORIA -UMR 7503 Bâtiment B, B.P. 239</s1>
<s2>54506 Vandœuvre-lès-Nancy</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Napoli, Amedeo" sort="Napoli, Amedeo" uniqKey="Napoli A" first="Amedeo" last="Napoli">Amedeo Napoli</name>
<affiliation><inist:fA14 i1="01"><s1>LORIA -UMR 7503 Bâtiment B, B.P. 239</s1>
<s2>54506 Vandœuvre-lès-Nancy</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">INIST</idno>
<idno type="inist">08-0032186</idno>
<date when="2006">2006</date>
<idno type="stanalyst">PASCAL 08-0032186 INIST</idno>
<idno type="RBID">Pascal:08-0032186</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000348</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a">A proposal for annotation, semantic similarity and classification of textual documents</title>
<author><name sortKey="Nauer, Emmanuel" sort="Nauer, Emmanuel" uniqKey="Nauer E" first="Emmanuel" last="Nauer">Emmanuel Nauer</name>
<affiliation><inist:fA14 i1="01"><s1>LORIA -UMR 7503 Bâtiment B, B.P. 239</s1>
<s2>54506 Vandœuvre-lès-Nancy</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Napoli, Amedeo" sort="Napoli, Amedeo" uniqKey="Napoli A" first="Amedeo" last="Napoli">Amedeo Napoli</name>
<affiliation><inist:fA14 i1="01"><s1>LORIA -UMR 7503 Bâtiment B, B.P. 239</s1>
<s2>54506 Vandœuvre-lès-Nancy</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
</analytic>
<series><title level="j" type="main">Lecture notes in computer science</title>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><title level="j" type="main">Lecture notes in computer science</title>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Annotation</term>
<term>Artificial intelligence</term>
<term>Classification</term>
<term>Content analysis</term>
<term>Information retrieval</term>
<term>Lying</term>
<term>Ontology</term>
<term>Semantics</term>
<term>Similarity</term>
<term>Text</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Intelligence artificielle</term>
<term>Similitude</term>
<term>Classification</term>
<term>Texte</term>
<term>Analyse contenu</term>
<term>Ontologie</term>
<term>Recherche information</term>
<term>Annotation</term>
<term>Sémantique</term>
<term>Mensonge</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">In this paper, we present an approach for classifying documents based on the notion of a semantic similarity and the effective representation of the content of the documents. The content of a document is annotated and the resulting annotation is represented by a labeled tree whose nodes and edges are represented by concepts lying within a domain ontology. A reasoning process may be carried out on annotation trees, allowing the comparison of documents between each others, for classification or information retrieval purposes. An algorithm for classifying documents with respect to semantic similarity and a discussion conclude the paper.</div>
</front>
</TEI>
<inist><standard h6="B"><pA><fA05><s2>4183</s2>
</fA05>
<fA08 i1="01" i2="1" l="ENG"><s1>A proposal for annotation, semantic similarity and classification of textual documents</s1>
</fA08>
<fA09 i1="01" i2="1" l="ENG"><s1>Artificial intelligence : methodology, systems, and applications : 12th international conference, AIMSA 2006, Varna, Bulgaria, September 12-15, 2006 : proceedings</s1>
</fA09>
<fA11 i1="01" i2="1"><s1>NAUER (Emmanuel)</s1>
</fA11>
<fA11 i1="02" i2="1"><s1>NAPOLI (Amedeo)</s1>
</fA11>
<fA12 i1="01" i2="1"><s1>EUZENAT (Jérôme)</s1>
<s9>ed.</s9>
</fA12>
<fA12 i1="02" i2="1"><s1>DOMINGUE (John)</s1>
<s9>ed.</s9>
</fA12>
<fA14 i1="01"><s1>LORIA -UMR 7503 Bâtiment B, B.P. 239</s1>
<s2>54506 Vandœuvre-lès-Nancy</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</fA14>
<fA20><s1>201-212</s1>
</fA20>
<fA21><s1>2006</s1>
</fA21>
<fA23 i1="01"><s0>ENG</s0>
</fA23>
<fA26 i1="01"><s0>3-540-40930-0</s0>
</fA26>
<fA43 i1="01"><s1>INIST</s1>
<s2>16343</s2>
<s5>354000153641600200</s5>
</fA43>
<fA44><s0>0000</s0>
<s1>© 2008 INIST-CNRS. All rights reserved.</s1>
</fA44>
<fA45><s0>18 ref.</s0>
</fA45>
<fA47 i1="01" i2="1"><s0>08-0032186</s0>
</fA47>
<fA60><s1>P</s1>
<s2>C</s2>
</fA60>
<fA61><s0>A</s0>
</fA61>
<fA64 i1="01" i2="2"><s0>Lecture notes in computer science</s0>
</fA64>
<fA66 i1="01"><s0>DEU</s0>
</fA66>
<fC01 i1="01" l="ENG"><s0>In this paper, we present an approach for classifying documents based on the notion of a semantic similarity and the effective representation of the content of the documents. The content of a document is annotated and the resulting annotation is represented by a labeled tree whose nodes and edges are represented by concepts lying within a domain ontology. A reasoning process may be carried out on annotation trees, allowing the comparison of documents between each others, for classification or information retrieval purposes. An algorithm for classifying documents with respect to semantic similarity and a discussion conclude the paper.</s0>
</fC01>
<fC02 i1="01" i2="X"><s0>001D02B07D</s0>
</fC02>
<fC02 i1="02" i2="X"><s0>001D02C</s0>
</fC02>
<fC03 i1="01" i2="X" l="FRE"><s0>Intelligence artificielle</s0>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="X" l="ENG"><s0>Artificial intelligence</s0>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="X" l="SPA"><s0>Inteligencia artificial</s0>
<s5>01</s5>
</fC03>
<fC03 i1="02" i2="X" l="FRE"><s0>Similitude</s0>
<s5>06</s5>
</fC03>
<fC03 i1="02" i2="X" l="ENG"><s0>Similarity</s0>
<s5>06</s5>
</fC03>
<fC03 i1="02" i2="X" l="SPA"><s0>Similitud</s0>
<s5>06</s5>
</fC03>
<fC03 i1="03" i2="X" l="FRE"><s0>Classification</s0>
<s5>07</s5>
</fC03>
<fC03 i1="03" i2="X" l="ENG"><s0>Classification</s0>
<s5>07</s5>
</fC03>
<fC03 i1="03" i2="X" l="SPA"><s0>Clasificación</s0>
<s5>07</s5>
</fC03>
<fC03 i1="04" i2="X" l="FRE"><s0>Texte</s0>
<s5>08</s5>
</fC03>
<fC03 i1="04" i2="X" l="ENG"><s0>Text</s0>
<s5>08</s5>
</fC03>
<fC03 i1="04" i2="X" l="SPA"><s0>Texto</s0>
<s5>08</s5>
</fC03>
<fC03 i1="05" i2="X" l="FRE"><s0>Analyse contenu</s0>
<s5>09</s5>
</fC03>
<fC03 i1="05" i2="X" l="ENG"><s0>Content analysis</s0>
<s5>09</s5>
</fC03>
<fC03 i1="05" i2="X" l="SPA"><s0>Análisis contenido</s0>
<s5>09</s5>
</fC03>
<fC03 i1="06" i2="X" l="FRE"><s0>Ontologie</s0>
<s5>10</s5>
</fC03>
<fC03 i1="06" i2="X" l="ENG"><s0>Ontology</s0>
<s5>10</s5>
</fC03>
<fC03 i1="06" i2="X" l="SPA"><s0>Ontología</s0>
<s5>10</s5>
</fC03>
<fC03 i1="07" i2="X" l="FRE"><s0>Recherche information</s0>
<s5>11</s5>
</fC03>
<fC03 i1="07" i2="X" l="ENG"><s0>Information retrieval</s0>
<s5>11</s5>
</fC03>
<fC03 i1="07" i2="X" l="SPA"><s0>Búsqueda información</s0>
<s5>11</s5>
</fC03>
<fC03 i1="08" i2="X" l="FRE"><s0>Annotation</s0>
<s5>18</s5>
</fC03>
<fC03 i1="08" i2="X" l="ENG"><s0>Annotation</s0>
<s5>18</s5>
</fC03>
<fC03 i1="08" i2="X" l="SPA"><s0>Anotación</s0>
<s5>18</s5>
</fC03>
<fC03 i1="09" i2="X" l="FRE"><s0>Sémantique</s0>
<s5>19</s5>
</fC03>
<fC03 i1="09" i2="X" l="ENG"><s0>Semantics</s0>
<s5>19</s5>
</fC03>
<fC03 i1="09" i2="X" l="SPA"><s0>Semántica</s0>
<s5>19</s5>
</fC03>
<fC03 i1="10" i2="X" l="FRE"><s0>Mensonge</s0>
<s5>20</s5>
</fC03>
<fC03 i1="10" i2="X" l="ENG"><s0>Lying</s0>
<s5>20</s5>
</fC03>
<fC03 i1="10" i2="X" l="SPA"><s0>Mentira</s0>
<s5>20</s5>
</fC03>
<fN21><s1>052</s1>
</fN21>
<fN44 i1="01"><s1>OTO</s1>
</fN44>
<fN82><s1>OTO</s1>
</fN82>
</pA>
<pR><fA30 i1="01" i2="1" l="ENG"><s1>International Conference on Artificial Intelligence : Methodology, Systems, and Applications</s1>
<s2>12</s2>
<s3>Varna BGR</s3>
<s4>2006</s4>
</fA30>
</pR>
</standard>
<server><NO>PASCAL 08-0032186 INIST</NO>
<ET>A proposal for annotation, semantic similarity and classification of textual documents</ET>
<AU>NAUER (Emmanuel); NAPOLI (Amedeo); EUZENAT (Jérôme); DOMINGUE (John)</AU>
<AF>LORIA -UMR 7503 Bâtiment B, B.P. 239/54506 Vandœuvre-lès-Nancy/France (1 aut., 2 aut.)</AF>
<DT>Publication en série; Congrès; Niveau analytique</DT>
<SO>Lecture notes in computer science; Allemagne; Da. 2006; Vol. 4183; Pp. 201-212; Bibl. 18 ref.</SO>
<LA>Anglais</LA>
<EA>In this paper, we present an approach for classifying documents based on the notion of a semantic similarity and the effective representation of the content of the documents. The content of a document is annotated and the resulting annotation is represented by a labeled tree whose nodes and edges are represented by concepts lying within a domain ontology. A reasoning process may be carried out on annotation trees, allowing the comparison of documents between each others, for classification or information retrieval purposes. An algorithm for classifying documents with respect to semantic similarity and a discussion conclude the paper.</EA>
<CC>001D02B07D; 001D02C</CC>
<FD>Intelligence artificielle; Similitude; Classification; Texte; Analyse contenu; Ontologie; Recherche information; Annotation; Sémantique; Mensonge</FD>
<ED>Artificial intelligence; Similarity; Classification; Text; Content analysis; Ontology; Information retrieval; Annotation; Semantics; Lying</ED>
<SD>Inteligencia artificial; Similitud; Clasificación; Texto; Análisis contenido; Ontología; Búsqueda información; Anotación; Semántica; Mentira</SD>
<LO>INIST-16343.354000153641600200</LO>
<ID>08-0032186</ID>
</server>
</inist>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/PascalFrancis/Corpus

HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000348 | SxmlIndent | more

HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Corpus/biblio.hfd -nk 000348 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    PascalFrancis
   |étape=   Corpus
   |type=    RBID
   |clé=     Pascal:08-0032186
   |texte=   A proposal for annotation, semantic similarity and classification of textual documents
}}

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022

	Serveur d'exploration sur la recherche en informatique en Lorraine
	Attention, ce site est en cours de développement ! Attention, site généré par des moyens informatiques à partir de corpus bruts. Les informations ne sont donc pas validées.

Serveur d'exploration sur la recherche en informatique en Lorraine

A proposal for annotation, semantic similarity and classification of textual documents

A proposal for annotation, semantic similarity and classification of textual documents

Source :

Descripteurs français

English descriptors

Abstract

Notice en format standard (ISO 2709)

Format Inist (serveur)

Links to Exploration step

Le document en format XML

Pour manipuler ce document sous Unix (Dilib)

Pour mettre un lien sur cette page dans le réseau Wicri