Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Automatic Web Page Annotation with Google Rich Snippets

Identifieur interne : 002D97 ( Istex/Corpus ); précédent : 002D96; suivant : 002D98

Automatic Web Page Annotation with Google Rich Snippets

Auteurs : Walter Hop ; Stephan Lachner ; Flavius Frasincar ; Roberto De Virgilio

Source :

RBID : ISTEX:70B04F706C4028A34068DCBC9EC4803A88DF54B5

Abstract

Abstract: Web pages are designed to be read by people, not machines. Consequently, searching and reusing information on the Web is a difficult task without human participation. Adding semantics (i.e meaning) to a Web page would help machines to understand Web contents and better support the Web search process. One of the latest developments in this field is Google’s Rich Snippets, a service for Web site owners to add semantics to their Web pages. In this paper we provide an approach to automatically annotate a Web page with Rich Snippets RDFa tags. Exploiting several heuristics and a named entity recognition technique, our method is capable of recognizing and annotating a subset of Rich Snippets’ vocabulary, i.e., all attributes of its Review concept, and the names of Person and Organization concepts. We implemented an on-line service and evaluated the accuracy of the approach on real E-commerce Web sites.

Url:
DOI: 10.1007/978-3-642-16949-6_21

Links to Exploration step

ISTEX:70B04F706C4028A34068DCBC9EC4803A88DF54B5

Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Automatic Web Page Annotation with Google Rich Snippets</title>
<author>
<name sortKey="Hop, Walter" sort="Hop, Walter" uniqKey="Hop W" first="Walter" last="Hop">Walter Hop</name>
<affiliation>
<mods:affiliation>Erasmus School of Economics, Erasmus University Rotterdam, PO Box 1738, NL-3000, Rotterdam, DR, The Netherlands</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: w.w.hop@student.eur.nl</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Lachner, Stephan" sort="Lachner, Stephan" uniqKey="Lachner S" first="Stephan" last="Lachner">Stephan Lachner</name>
<affiliation>
<mods:affiliation>Erasmus School of Economics, Erasmus University Rotterdam, PO Box 1738, NL-3000, Rotterdam, DR, The Netherlands</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: s.lachner@student.eur.nl</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Frasincar, Flavius" sort="Frasincar, Flavius" uniqKey="Frasincar F" first="Flavius" last="Frasincar">Flavius Frasincar</name>
<affiliation>
<mods:affiliation>Erasmus School of Economics, Erasmus University Rotterdam, PO Box 1738, NL-3000, Rotterdam, DR, The Netherlands</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: frasincar@ese.eur.nl</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="De Virgilio, Roberto" sort="De Virgilio, Roberto" uniqKey="De Virgilio R" first="Roberto" last="De Virgilio">Roberto De Virgilio</name>
<affiliation>
<mods:affiliation>Dipartimento di Informatica e Automazione, Universitá Roma Tre, Rome, Italy</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: devirgilio@dia.uniroma3.it</mods:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:70B04F706C4028A34068DCBC9EC4803A88DF54B5</idno>
<date when="2010" year="2010">2010</date>
<idno type="doi">10.1007/978-3-642-16949-6_21</idno>
<idno type="url">https://api.istex.fr/document/70B04F706C4028A34068DCBC9EC4803A88DF54B5/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">002D97</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Automatic Web Page Annotation with Google Rich Snippets</title>
<author>
<name sortKey="Hop, Walter" sort="Hop, Walter" uniqKey="Hop W" first="Walter" last="Hop">Walter Hop</name>
<affiliation>
<mods:affiliation>Erasmus School of Economics, Erasmus University Rotterdam, PO Box 1738, NL-3000, Rotterdam, DR, The Netherlands</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: w.w.hop@student.eur.nl</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Lachner, Stephan" sort="Lachner, Stephan" uniqKey="Lachner S" first="Stephan" last="Lachner">Stephan Lachner</name>
<affiliation>
<mods:affiliation>Erasmus School of Economics, Erasmus University Rotterdam, PO Box 1738, NL-3000, Rotterdam, DR, The Netherlands</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: s.lachner@student.eur.nl</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Frasincar, Flavius" sort="Frasincar, Flavius" uniqKey="Frasincar F" first="Flavius" last="Frasincar">Flavius Frasincar</name>
<affiliation>
<mods:affiliation>Erasmus School of Economics, Erasmus University Rotterdam, PO Box 1738, NL-3000, Rotterdam, DR, The Netherlands</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: frasincar@ese.eur.nl</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="De Virgilio, Roberto" sort="De Virgilio, Roberto" uniqKey="De Virgilio R" first="Roberto" last="De Virgilio">Roberto De Virgilio</name>
<affiliation>
<mods:affiliation>Dipartimento di Informatica e Automazione, Universitá Roma Tre, Rome, Italy</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: devirgilio@dia.uniroma3.it</mods:affiliation>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2010</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">70B04F706C4028A34068DCBC9EC4803A88DF54B5</idno>
<idno type="DOI">10.1007/978-3-642-16949-6_21</idno>
<idno type="ChapterID">21</idno>
<idno type="ChapterID">Chap21</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: Web pages are designed to be read by people, not machines. Consequently, searching and reusing information on the Web is a difficult task without human participation. Adding semantics (i.e meaning) to a Web page would help machines to understand Web contents and better support the Web search process. One of the latest developments in this field is Google’s Rich Snippets, a service for Web site owners to add semantics to their Web pages. In this paper we provide an approach to automatically annotate a Web page with Rich Snippets RDFa tags. Exploiting several heuristics and a named entity recognition technique, our method is capable of recognizing and annotating a subset of Rich Snippets’ vocabulary, i.e., all attributes of its Review concept, and the names of Person and Organization concepts. We implemented an on-line service and evaluated the accuracy of the approach on real E-commerce Web sites.</div>
</front>
</TEI>
<istex>
<corpusName>springer</corpusName>
<author>
<json:item>
<name>Walter Hop</name>
<affiliations>
<json:string>Erasmus School of Economics, Erasmus University Rotterdam, PO Box 1738, NL-3000, Rotterdam, DR, The Netherlands</json:string>
<json:string>E-mail: w.w.hop@student.eur.nl</json:string>
</affiliations>
</json:item>
<json:item>
<name>Stephan Lachner</name>
<affiliations>
<json:string>Erasmus School of Economics, Erasmus University Rotterdam, PO Box 1738, NL-3000, Rotterdam, DR, The Netherlands</json:string>
<json:string>E-mail: s.lachner@student.eur.nl</json:string>
</affiliations>
</json:item>
<json:item>
<name>Flavius Frasincar</name>
<affiliations>
<json:string>Erasmus School of Economics, Erasmus University Rotterdam, PO Box 1738, NL-3000, Rotterdam, DR, The Netherlands</json:string>
<json:string>E-mail: frasincar@ese.eur.nl</json:string>
</affiliations>
</json:item>
<json:item>
<name>Roberto De Virgilio</name>
<affiliations>
<json:string>Dipartimento di Informatica e Automazione, Universitá Roma Tre, Rome, Italy</json:string>
<json:string>E-mail: devirgilio@dia.uniroma3.it</json:string>
</affiliations>
</json:item>
</author>
<language>
<json:string>eng</json:string>
</language>
<abstract>Abstract: Web pages are designed to be read by people, not machines. Consequently, searching and reusing information on the Web is a difficult task without human participation. Adding semantics (i.e meaning) to a Web page would help machines to understand Web contents and better support the Web search process. One of the latest developments in this field is Google’s Rich Snippets, a service for Web site owners to add semantics to their Web pages. In this paper we provide an approach to automatically annotate a Web page with Rich Snippets RDFa tags. Exploiting several heuristics and a named entity recognition technique, our method is capable of recognizing and annotating a subset of Rich Snippets’ vocabulary, i.e., all attributes of its Review concept, and the names of Person and Organization concepts. We implemented an on-line service and evaluated the accuracy of the approach on real E-commerce Web sites.</abstract>
<qualityIndicators>
<score>6.764</score>
<pdfVersion>1.3</pdfVersion>
<pdfPageSize>430 x 660 pts</pdfPageSize>
<refBibsNative>false</refBibsNative>
<keywordCount>0</keywordCount>
<abstractCharCount>919</abstractCharCount>
<pdfWordCount>6602</pdfWordCount>
<pdfCharCount>38304</pdfCharCount>
<pdfPageCount>18</pdfPageCount>
<abstractWordCount>147</abstractWordCount>
</qualityIndicators>
<title>Automatic Web Page Annotation with Google Rich Snippets</title>
<genre.original>
<json:string>OriginalPaper</json:string>
</genre.original>
<chapterId>
<json:string>21</json:string>
<json:string>Chap21</json:string>
</chapterId>
<genre>
<json:string>conference [eBooks]</json:string>
</genre>
<serie>
<editor>
<json:item>
<name>David Hutchison</name>
<affiliations>
<json:string>Lancaster University, Lancaster, UK</json:string>
</affiliations>
</json:item>
<json:item>
<name>Takeo Kanade</name>
<affiliations>
<json:string>Carnegie Mellon University, Pittsburgh, PA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Josef Kittler</name>
<affiliations>
<json:string>University of Surrey, Guildford, UK</json:string>
</affiliations>
</json:item>
<json:item>
<name>Jon M. Kleinberg</name>
<affiliations>
<json:string>Cornell University, Ithaca, NY, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Friedemann Mattern</name>
<affiliations>
<json:string>ETH Zurich, Zurich, Switzerland</json:string>
</affiliations>
</json:item>
<json:item>
<name>John C. Mitchell</name>
<affiliations>
<json:string>Stanford University, Stanford, CA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Moni Naor</name>
<affiliations>
<json:string>Weizmann Institute of Science, Rehovot, Israel</json:string>
</affiliations>
</json:item>
<json:item>
<name>Oscar Nierstrasz</name>
<affiliations>
<json:string>University of Bern, Bern, Switzerland</json:string>
</affiliations>
</json:item>
<json:item>
<name>C. Pandu Rangan</name>
<affiliations>
<json:string>Indian Institute of Technology, Madras, India</json:string>
</affiliations>
</json:item>
<json:item>
<name>Bernhard Steffen</name>
<affiliations>
<json:string>University of Dortmund, Dortmund, Germany</json:string>
</affiliations>
</json:item>
<json:item>
<name>Madhu Sudan</name>
<affiliations>
<json:string>Massachusetts Institute of Technology, MA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Demetri Terzopoulos</name>
<affiliations>
<json:string>University of California, Los Angeles, CA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Doug Tygar</name>
<affiliations>
<json:string>University of California, Berkeley, CA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Moshe Y. Vardi</name>
<affiliations>
<json:string>Rice University, Houston, TX, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Gerhard Weikum</name>
<affiliations>
<json:string>Max-Planck Institute of Computer Science, Saarbrücken, Germany</json:string>
</affiliations>
</json:item>
</editor>
<issn>
<json:string>0302-9743</json:string>
</issn>
<language>
<json:string>unknown</json:string>
</language>
<eissn>
<json:string>1611-3349</json:string>
</eissn>
<title>Lecture Notes in Computer Science</title>
<copyrightDate>2010</copyrightDate>
</serie>
<host>
<editor>
<json:item>
<name>Robert Meersman</name>
<affiliations>
<json:string>STAR Lab, Vrije Universiteit Brussel (VUB), Bldg G/10, Pleinlaan 2, 1050, Brussels, Belgium</json:string>
<json:string>E-mail: meersman@vub.ac.be</json:string>
</affiliations>
</json:item>
<json:item>
<name>Tharam Dillon</name>
<affiliations>
<json:string>DEBII - CBS, Curtin University of Technology, De Laeter Way, 6102, Bentley, WA, Australia</json:string>
<json:string>E-mail: t.dillon@curtin.edu.au</json:string>
</affiliations>
</json:item>
<json:item>
<name>Pilar Herrero</name>
<affiliations>
<json:string>Facultad de Informática, Universidad Politécnica de Madrid, Campus de Montegancedo S/N, 28660, Boadilla del Monte, Madrid, Spain</json:string>
<json:string>E-mail: pherrero@fi.upm.es</json:string>
</affiliations>
</json:item>
</editor>
<subject>
<json:item>
<value>Computer Science</value>
</json:item>
<json:item>
<value>Computer Science</value>
</json:item>
<json:item>
<value>Computer Communication Networks</value>
</json:item>
<json:item>
<value>Software Engineering</value>
</json:item>
<json:item>
<value>Artificial Intelligence (incl. Robotics)</value>
</json:item>
<json:item>
<value>Information Systems Applications (incl.Internet)</value>
</json:item>
<json:item>
<value>Algorithm Analysis and Problem Complexity</value>
</json:item>
<json:item>
<value>Management of Computing and Information Systems</value>
</json:item>
</subject>
<isbn>
<json:string>978-3-642-16948-9</json:string>
</isbn>
<language>
<json:string>unknown</json:string>
</language>
<eissn>
<json:string>1611-3349</json:string>
</eissn>
<title>On the Move to Meaningful Internet Systems, OTM 2010</title>
<genre.original>
<json:string>Proceedings</json:string>
</genre.original>
<bookId>
<json:string>978-3-642-16949-6</json:string>
</bookId>
<volume>6427</volume>
<pages>
<last>974</last>
<first>957</first>
</pages>
<issn>
<json:string>0302-9743</json:string>
</issn>
<genre>
<json:string>Book Series</json:string>
</genre>
<eisbn>
<json:string>978-3-642-16949-6</json:string>
</eisbn>
<copyrightDate>2010</copyrightDate>
<doi>
<json:string>10.1007/978-3-642-16949-6</json:string>
</doi>
</host>
<publicationDate>2010</publicationDate>
<copyrightDate>2010</copyrightDate>
<doi>
<json:string>10.1007/978-3-642-16949-6_21</json:string>
</doi>
<id>70B04F706C4028A34068DCBC9EC4803A88DF54B5</id>
<fulltext>
<json:item>
<original>true</original>
<mimetype>application/pdf</mimetype>
<extension>pdf</extension>
<uri>https://api.istex.fr/document/70B04F706C4028A34068DCBC9EC4803A88DF54B5/fulltext/pdf</uri>
</json:item>
<json:item>
<original>false</original>
<mimetype>application/zip</mimetype>
<extension>zip</extension>
<uri>https://api.istex.fr/document/70B04F706C4028A34068DCBC9EC4803A88DF54B5/fulltext/zip</uri>
</json:item>
<istex:fulltextTEI uri="https://api.istex.fr/document/70B04F706C4028A34068DCBC9EC4803A88DF54B5/fulltext/tei">
<teiHeader>
<fileDesc>
<titleStmt>
<title level="a" type="main" xml:lang="en">Automatic Web Page Annotation with Google Rich Snippets</title>
<respStmt xml:id="ISTEX-API" resp="Références bibliographiques récupérées via GROBID" name="ISTEX-API (INIST-CNRS)"></respStmt>
</titleStmt>
<publicationStmt>
<authority>ISTEX</authority>
<publisher>Springer Berlin Heidelberg</publisher>
<pubPlace>Berlin, Heidelberg</pubPlace>
<availability>
<p>SPRINGER</p>
</availability>
<date>2010</date>
</publicationStmt>
<sourceDesc>
<biblStruct type="inbook">
<analytic>
<title level="a" type="main" xml:lang="en">Automatic Web Page Annotation with Google Rich Snippets</title>
<author>
<persName>
<forename type="first">Walter</forename>
<surname>Hop</surname>
</persName>
<email>w.w.hop@student.eur.nl</email>
<affiliation>Erasmus School of Economics, Erasmus University Rotterdam, PO Box 1738, NL-3000, Rotterdam, DR, The Netherlands</affiliation>
</author>
<author>
<persName>
<forename type="first">Stephan</forename>
<surname>Lachner</surname>
</persName>
<email>s.lachner@student.eur.nl</email>
<affiliation>Erasmus School of Economics, Erasmus University Rotterdam, PO Box 1738, NL-3000, Rotterdam, DR, The Netherlands</affiliation>
</author>
<author>
<persName>
<forename type="first">Flavius</forename>
<surname>Frasincar</surname>
</persName>
<email>frasincar@ese.eur.nl</email>
<affiliation>Erasmus School of Economics, Erasmus University Rotterdam, PO Box 1738, NL-3000, Rotterdam, DR, The Netherlands</affiliation>
</author>
<author>
<persName>
<forename type="first">Roberto</forename>
<surname>De Virgilio</surname>
</persName>
<email>devirgilio@dia.uniroma3.it</email>
<affiliation>Dipartimento di Informatica e Automazione, Universitá Roma Tre, Rome, Italy</affiliation>
</author>
</analytic>
<monogr>
<title level="m">On the Move to Meaningful Internet Systems, OTM 2010</title>
<title level="m" type="sub">Confederated International Conferences: CoopIS, IS, DOA and ODBASE, Hersonissos, Crete, Greece, October 25-29, 2010, Proceedings, Part II</title>
<idno type="pISBN">978-3-642-16948-9</idno>
<idno type="eISBN">978-3-642-16949-6</idno>
<idno type="pISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="DOI">10.1007/978-3-642-16949-6</idno>
<idno type="BookID">978-3-642-16949-6</idno>
<idno type="BookTitleID">215030</idno>
<idno type="BookSequenceNumber">6427</idno>
<idno type="BookVolumeNumber">6427</idno>
<idno type="BookChapterCount">36</idno>
<editor>
<persName>
<forename type="first">Robert</forename>
<surname>Meersman</surname>
</persName>
<email>meersman@vub.ac.be</email>
<affiliation>STAR Lab, Vrije Universiteit Brussel (VUB), Bldg G/10, Pleinlaan 2, 1050, Brussels, Belgium</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Tharam</forename>
<surname>Dillon</surname>
</persName>
<email>t.dillon@curtin.edu.au</email>
<affiliation>DEBII - CBS, Curtin University of Technology, De Laeter Way, 6102, Bentley, WA, Australia</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Pilar</forename>
<surname>Herrero</surname>
</persName>
<email>pherrero@fi.upm.es</email>
<affiliation>Facultad de Informática, Universidad Politécnica de Madrid, Campus de Montegancedo S/N, 28660, Boadilla del Monte, Madrid, Spain</affiliation>
</editor>
<imprint>
<publisher>Springer Berlin Heidelberg</publisher>
<pubPlace>Berlin, Heidelberg</pubPlace>
<date type="published" when="2010"></date>
<biblScope unit="volume">6427</biblScope>
<biblScope unit="page" from="957">957</biblScope>
<biblScope unit="page" to="974">974</biblScope>
</imprint>
</monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<editor>
<persName>
<forename type="first">David</forename>
<surname>Hutchison</surname>
</persName>
<affiliation>Lancaster University, Lancaster, UK</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Takeo</forename>
<surname>Kanade</surname>
</persName>
<affiliation>Carnegie Mellon University, Pittsburgh, PA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Josef</forename>
<surname>Kittler</surname>
</persName>
<affiliation>University of Surrey, Guildford, UK</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Jon</forename>
<forename type="first">M.</forename>
<surname>Kleinberg</surname>
</persName>
<affiliation>Cornell University, Ithaca, NY, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Friedemann</forename>
<surname>Mattern</surname>
</persName>
<affiliation>ETH Zurich, Zurich, Switzerland</affiliation>
</editor>
<editor>
<persName>
<forename type="first">John</forename>
<forename type="first">C.</forename>
<surname>Mitchell</surname>
</persName>
<affiliation>Stanford University, Stanford, CA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Moni</forename>
<surname>Naor</surname>
</persName>
<affiliation>Weizmann Institute of Science, Rehovot, Israel</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Oscar</forename>
<surname>Nierstrasz</surname>
</persName>
<affiliation>University of Bern, Bern, Switzerland</affiliation>
</editor>
<editor>
<persName>
<forename type="first">C.</forename>
<surname>Pandu Rangan</surname>
</persName>
<affiliation>Indian Institute of Technology, Madras, India</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Bernhard</forename>
<surname>Steffen</surname>
</persName>
<affiliation>University of Dortmund, Dortmund, Germany</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Madhu</forename>
<surname>Sudan</surname>
</persName>
<affiliation>Massachusetts Institute of Technology, MA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Demetri</forename>
<surname>Terzopoulos</surname>
</persName>
<affiliation>University of California, Los Angeles, CA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Doug</forename>
<surname>Tygar</surname>
</persName>
<affiliation>University of California, Berkeley, CA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Moshe</forename>
<forename type="first">Y.</forename>
<surname>Vardi</surname>
</persName>
<affiliation>Rice University, Houston, TX, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Gerhard</forename>
<surname>Weikum</surname>
</persName>
<affiliation>Max-Planck Institute of Computer Science, Saarbrücken, Germany</affiliation>
</editor>
<biblScope>
<date>2010</date>
</biblScope>
<idno type="pISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="seriesId">558</idno>
</series>
<idno type="istex">70B04F706C4028A34068DCBC9EC4803A88DF54B5</idno>
<idno type="DOI">10.1007/978-3-642-16949-6_21</idno>
<idno type="ChapterID">21</idno>
<idno type="ChapterID">Chap21</idno>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<creation>
<date>2010</date>
</creation>
<langUsage>
<language ident="en">en</language>
</langUsage>
<abstract xml:lang="en">
<p>Abstract: Web pages are designed to be read by people, not machines. Consequently, searching and reusing information on the Web is a difficult task without human participation. Adding semantics (i.e meaning) to a Web page would help machines to understand Web contents and better support the Web search process. One of the latest developments in this field is Google’s Rich Snippets, a service for Web site owners to add semantics to their Web pages. In this paper we provide an approach to automatically annotate a Web page with Rich Snippets RDFa tags. Exploiting several heuristics and a named entity recognition technique, our method is capable of recognizing and annotating a subset of Rich Snippets’ vocabulary, i.e., all attributes of its Review concept, and the names of Person and Organization concepts. We implemented an on-line service and evaluated the accuracy of the approach on real E-commerce Web sites.</p>
</abstract>
<textClass>
<keywords scheme="Book Subject Collection">
<list>
<label>SUCO11645</label>
<item>
<term>Computer Science</term>
</item>
</list>
</keywords>
</textClass>
<textClass>
<keywords scheme="Book Subject Group">
<list>
<label>I</label>
<label>I13022</label>
<label>I14029</label>
<label>I21017</label>
<label>I18040</label>
<label>I16021</label>
<label>I24067</label>
<item>
<term>Computer Science</term>
</item>
<item>
<term>Computer Communication Networks</term>
</item>
<item>
<term>Software Engineering</term>
</item>
<item>
<term>Artificial Intelligence (incl. Robotics)</term>
</item>
<item>
<term>Information Systems Applications (incl.Internet)</term>
</item>
<item>
<term>Algorithm Analysis and Problem Complexity</term>
</item>
<item>
<term>Management of Computing and Information Systems</term>
</item>
</list>
</keywords>
</textClass>
</profileDesc>
<revisionDesc>
<change when="2010">Published</change>
<change xml:id="refBibs-istex" who="#ISTEX-API" when="2016-3-19">References added</change>
</revisionDesc>
</teiHeader>
</istex:fulltextTEI>
<json:item>
<original>false</original>
<mimetype>text/plain</mimetype>
<extension>txt</extension>
<uri>https://api.istex.fr/document/70B04F706C4028A34068DCBC9EC4803A88DF54B5/fulltext/txt</uri>
</json:item>
</fulltext>
<metadata>
<istex:metadataXml wicri:clean="Springer, Publisher found" wicri:toSee="no header">
<istex:xmlDeclaration>version="1.0" encoding="UTF-8"</istex:xmlDeclaration>
<istex:docType PUBLIC="-//Springer-Verlag//DTD A++ V2.4//EN" URI="http://devel.springer.de/A++/V2.4/DTD/A++V2.4.dtd" name="istex:docType"></istex:docType>
<istex:document>
<Publisher>
<PublisherInfo>
<PublisherName>Springer Berlin Heidelberg</PublisherName>
<PublisherLocation>Berlin, Heidelberg</PublisherLocation>
</PublisherInfo>
<Series>
<SeriesInfo SeriesType="Series" TocLevels="0">
<SeriesID>558</SeriesID>
<SeriesPrintISSN>0302-9743</SeriesPrintISSN>
<SeriesElectronicISSN>1611-3349</SeriesElectronicISSN>
<SeriesTitle Language="En">Lecture Notes in Computer Science</SeriesTitle>
</SeriesInfo>
<SeriesHeader>
<EditorGroup>
<Editor AffiliationIDS="Aff1">
<EditorName DisplayOrder="Western">
<GivenName>David</GivenName>
<FamilyName>Hutchison</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff2">
<EditorName DisplayOrder="Western">
<GivenName>Takeo</GivenName>
<FamilyName>Kanade</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff3">
<EditorName DisplayOrder="Western">
<GivenName>Josef</GivenName>
<FamilyName>Kittler</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff4">
<EditorName DisplayOrder="Western">
<GivenName>Jon</GivenName>
<GivenName>M.</GivenName>
<FamilyName>Kleinberg</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff5">
<EditorName DisplayOrder="Western">
<GivenName>Friedemann</GivenName>
<FamilyName>Mattern</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff6">
<EditorName DisplayOrder="Western">
<GivenName>John</GivenName>
<GivenName>C.</GivenName>
<FamilyName>Mitchell</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff7">
<EditorName DisplayOrder="Western">
<GivenName>Moni</GivenName>
<FamilyName>Naor</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff8">
<EditorName DisplayOrder="Western">
<GivenName>Oscar</GivenName>
<FamilyName>Nierstrasz</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff9">
<EditorName DisplayOrder="Western">
<GivenName>C.</GivenName>
<FamilyName>Pandu Rangan</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff10">
<EditorName DisplayOrder="Western">
<GivenName>Bernhard</GivenName>
<FamilyName>Steffen</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff11">
<EditorName DisplayOrder="Western">
<GivenName>Madhu</GivenName>
<FamilyName>Sudan</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff12">
<EditorName DisplayOrder="Western">
<GivenName>Demetri</GivenName>
<FamilyName>Terzopoulos</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff13">
<EditorName DisplayOrder="Western">
<GivenName>Doug</GivenName>
<FamilyName>Tygar</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff14">
<EditorName DisplayOrder="Western">
<GivenName>Moshe</GivenName>
<GivenName>Y.</GivenName>
<FamilyName>Vardi</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff15">
<EditorName DisplayOrder="Western">
<GivenName>Gerhard</GivenName>
<FamilyName>Weikum</FamilyName>
</EditorName>
</Editor>
<Affiliation ID="Aff1">
<OrgName>Lancaster University</OrgName>
<OrgAddress>
<City>Lancaster</City>
<Country>UK</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff2">
<OrgName>Carnegie Mellon University</OrgName>
<OrgAddress>
<City>Pittsburgh</City>
<State>PA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff3">
<OrgName>University of Surrey</OrgName>
<OrgAddress>
<City>Guildford</City>
<Country>UK</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff4">
<OrgName>Cornell University</OrgName>
<OrgAddress>
<City>Ithaca</City>
<State>NY</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff5">
<OrgName>ETH Zurich</OrgName>
<OrgAddress>
<City>Zurich</City>
<Country>Switzerland</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff6">
<OrgName>Stanford University</OrgName>
<OrgAddress>
<City>Stanford</City>
<State>CA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff7">
<OrgName>Weizmann Institute of Science</OrgName>
<OrgAddress>
<City>Rehovot</City>
<Country>Israel</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff8">
<OrgName>University of Bern</OrgName>
<OrgAddress>
<City>Bern</City>
<Country>Switzerland</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff9">
<OrgName>Indian Institute of Technology</OrgName>
<OrgAddress>
<City>Madras</City>
<Country>India</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff10">
<OrgName>University of Dortmund</OrgName>
<OrgAddress>
<City>Dortmund</City>
<Country>Germany</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff11">
<OrgName>Massachusetts Institute of Technology</OrgName>
<OrgAddress>
<State>MA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff12">
<OrgName>University of California</OrgName>
<OrgAddress>
<City>Los Angeles</City>
<State>CA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff13">
<OrgName>University of California</OrgName>
<OrgAddress>
<City>Berkeley</City>
<State>CA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff14">
<OrgName>Rice University</OrgName>
<OrgAddress>
<City>Houston</City>
<State>TX</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff15">
<OrgName>Max-Planck Institute of Computer Science</OrgName>
<OrgAddress>
<City>Saarbrücken</City>
<Country>Germany</Country>
</OrgAddress>
</Affiliation>
</EditorGroup>
</SeriesHeader>
<Book Language="En">
<BookInfo BookProductType="Proceedings" ContainsESM="No" Language="En" MediaType="eBook" NumberingDepth="2" NumberingStyle="ContentOnly" OutputMedium="All" TocLevels="0">
<BookID>978-3-642-16949-6</BookID>
<BookTitle>On the Move to Meaningful Internet Systems, OTM 2010</BookTitle>
<BookSubTitle>Confederated International Conferences: CoopIS, IS, DOA and ODBASE, Hersonissos, Crete, Greece, October 25-29, 2010, Proceedings, Part II</BookSubTitle>
<BookVolumeNumber>6427</BookVolumeNumber>
<BookSequenceNumber>6427</BookSequenceNumber>
<BookDOI>10.1007/978-3-642-16949-6</BookDOI>
<BookTitleID>215030</BookTitleID>
<BookPrintISBN>978-3-642-16948-9</BookPrintISBN>
<BookElectronicISBN>978-3-642-16949-6</BookElectronicISBN>
<BookChapterCount>36</BookChapterCount>
<BookCopyright>
<CopyrightHolderName>Springer Berlin Heidelberg</CopyrightHolderName>
<CopyrightYear>2010</CopyrightYear>
</BookCopyright>
<BookSubjectGroup>
<BookSubject Code="I" Type="Primary">Computer Science</BookSubject>
<BookSubject Code="I13022" Priority="1" Type="Secondary">Computer Communication Networks</BookSubject>
<BookSubject Code="I14029" Priority="2" Type="Secondary">Software Engineering</BookSubject>
<BookSubject Code="I21017" Priority="3" Type="Secondary">Artificial Intelligence (incl. Robotics)</BookSubject>
<BookSubject Code="I18040" Priority="4" Type="Secondary">Information Systems Applications (incl.Internet)</BookSubject>
<BookSubject Code="I16021" Priority="5" Type="Secondary">Algorithm Analysis and Problem Complexity</BookSubject>
<BookSubject Code="I24067" Priority="6" Type="Secondary">Management of Computing and Information Systems</BookSubject>
<SubjectCollection Code="SUCO11645">Computer Science</SubjectCollection>
</BookSubjectGroup>
<BookContext>
<SeriesID>558</SeriesID>
</BookContext>
</BookInfo>
<BookHeader>
<EditorGroup>
<Editor AffiliationIDS="Aff16">
<EditorName DisplayOrder="Western">
<GivenName>Robert</GivenName>
<FamilyName>Meersman</FamilyName>
</EditorName>
<Contact>
<Email>meersman@vub.ac.be</Email>
</Contact>
</Editor>
<Editor AffiliationIDS="Aff17">
<EditorName DisplayOrder="Western">
<GivenName>Tharam</GivenName>
<FamilyName>Dillon</FamilyName>
</EditorName>
<Contact>
<Email>t.dillon@curtin.edu.au</Email>
</Contact>
</Editor>
<Editor AffiliationIDS="Aff18">
<EditorName DisplayOrder="Western">
<GivenName>Pilar</GivenName>
<FamilyName>Herrero</FamilyName>
</EditorName>
<Contact>
<Email>pherrero@fi.upm.es</Email>
</Contact>
</Editor>
<Affiliation ID="Aff16">
<OrgDivision>STAR Lab</OrgDivision>
<OrgName>Vrije Universiteit Brussel (VUB)</OrgName>
<OrgAddress>
<Street>Bldg G/10, Pleinlaan 2</Street>
<Postcode>1050</Postcode>
<City>Brussels</City>
<Country>Belgium</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff17">
<OrgDivision>DEBII - CBS</OrgDivision>
<OrgName>Curtin University of Technology</OrgName>
<OrgAddress>
<Street>De Laeter Way</Street>
<Postcode>6102</Postcode>
<City>Bentley</City>
<State>WA</State>
<Country>Australia</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff18">
<OrgDivision>Facultad de Informática</OrgDivision>
<OrgName>Universidad Politécnica de Madrid</OrgName>
<OrgAddress>
<Street>Campus de Montegancedo S/N</Street>
<Postcode>28660</Postcode>
<City>Boadilla del Monte</City>
<State>Madrid</State>
<Country>Spain</Country>
</OrgAddress>
</Affiliation>
</EditorGroup>
</BookHeader>
<Part ID="Part9">
<PartInfo TocLevels="0">
<PartID>9</PartID>
<PartSequenceNumber>9</PartSequenceNumber>
<PartTitle>Annotations</PartTitle>
<PartChapterCount>4</PartChapterCount>
<PartContext>
<SeriesID>558</SeriesID>
<BookTitle>On the Move to Meaningful Internet Systems, OTM 2010</BookTitle>
</PartContext>
</PartInfo>
<Chapter ID="Chap21" Language="En">
<ChapterInfo ChapterType="OriginalPaper" ContainsESM="No" NumberingDepth="2" NumberingStyle="ContentOnly" TocLevels="0">
<ChapterID>21</ChapterID>
<ChapterDOI>10.1007/978-3-642-16949-6_21</ChapterDOI>
<ChapterSequenceNumber>21</ChapterSequenceNumber>
<ChapterTitle Language="En">Automatic Web Page Annotation with Google
<Emphasis Type="Italic">Rich Snippets</Emphasis>
</ChapterTitle>
<ChapterFirstPage>957</ChapterFirstPage>
<ChapterLastPage>974</ChapterLastPage>
<ChapterCopyright>
<CopyrightHolderName>Springer-Verlag Berlin Heidelberg</CopyrightHolderName>
<CopyrightYear>2010</CopyrightYear>
</ChapterCopyright>
<ChapterGrants Type="Regular">
<MetadataGrant Grant="OpenAccess"></MetadataGrant>
<AbstractGrant Grant="OpenAccess"></AbstractGrant>
<BodyPDFGrant Grant="Restricted"></BodyPDFGrant>
<BodyHTMLGrant Grant="Restricted"></BodyHTMLGrant>
<BibliographyGrant Grant="Restricted"></BibliographyGrant>
<ESMGrant Grant="Restricted"></ESMGrant>
</ChapterGrants>
<ChapterContext>
<SeriesID>558</SeriesID>
<PartID>9</PartID>
<BookID>978-3-642-16949-6</BookID>
<BookTitle>On the Move to Meaningful Internet Systems, OTM 2010</BookTitle>
</ChapterContext>
</ChapterInfo>
<ChapterHeader>
<AuthorGroup>
<Author AffiliationIDS="Aff19">
<AuthorName DisplayOrder="Western">
<GivenName>Walter</GivenName>
<FamilyName>Hop</FamilyName>
</AuthorName>
<Contact>
<Email>w.w.hop@student.eur.nl</Email>
</Contact>
</Author>
<Author AffiliationIDS="Aff19">
<AuthorName DisplayOrder="Western">
<GivenName>Stephan</GivenName>
<FamilyName>Lachner</FamilyName>
</AuthorName>
<Contact>
<Email>s.lachner@student.eur.nl</Email>
</Contact>
</Author>
<Author AffiliationIDS="Aff19">
<AuthorName DisplayOrder="Western">
<GivenName>Flavius</GivenName>
<FamilyName>Frasincar</FamilyName>
</AuthorName>
<Contact>
<Email>frasincar@ese.eur.nl</Email>
</Contact>
</Author>
<Author AffiliationIDS="Aff20">
<AuthorName DisplayOrder="Western">
<GivenName>Roberto</GivenName>
<Particle>De</Particle>
<FamilyName>Virgilio</FamilyName>
</AuthorName>
<Contact>
<Email>devirgilio@dia.uniroma3.it</Email>
</Contact>
</Author>
<Affiliation ID="Aff19">
<OrgDivision>Erasmus School of Economics</OrgDivision>
<OrgName>Erasmus University Rotterdam</OrgName>
<OrgAddress>
<Postbox>PO Box 1738</Postbox>
<Postcode>NL-3000</Postcode>
<State>DR</State>
<City>Rotterdam</City>
<Country>The Netherlands</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff20">
<OrgDivision>Dipartimento di Informatica e Automazione</OrgDivision>
<OrgName>Universitá Roma Tre</OrgName>
<OrgAddress>
<City>Rome</City>
<Country>Italy</Country>
</OrgAddress>
</Affiliation>
</AuthorGroup>
<Abstract ID="Abs1" Language="En">
<Heading>Abstract</Heading>
<Para>Web pages are designed to be read by people, not machines. Consequently, searching and reusing information on the Web is a difficult task without human participation. Adding semantics (i.e meaning) to a Web page would help machines to understand Web contents and better support the Web search process. One of the latest developments in this field is Google’s
<Emphasis Type="Italic">Rich Snippets</Emphasis>
, a service for Web site owners to add semantics to their Web pages. In this paper we provide an approach to automatically annotate a Web page with Rich Snippets RDFa tags. Exploiting several heuristics and a named entity recognition technique, our method is capable of recognizing and annotating a subset of Rich Snippets’ vocabulary, i.e., all attributes of its
<Emphasis Type="Italic">Review</Emphasis>
concept, and the names of
<Emphasis Type="Italic">Person</Emphasis>
and
<Emphasis Type="Italic">Organization</Emphasis>
concepts. We implemented an on-line service and evaluated the accuracy of the approach on real E-commerce Web sites.</Para>
</Abstract>
</ChapterHeader>
<NoBody></NoBody>
</Chapter>
</Part>
</Book>
</Series>
</Publisher>
</istex:document>
</istex:metadataXml>
<mods version="3.6">
<titleInfo lang="en">
<title>Automatic Web Page Annotation with Google Rich Snippets</title>
</titleInfo>
<titleInfo type="alternative" contentType="CDATA" lang="en">
<title>Automatic Web Page Annotation with Google Rich Snippets</title>
</titleInfo>
<name type="personal">
<namePart type="given">Walter</namePart>
<namePart type="family">Hop</namePart>
<affiliation>Erasmus School of Economics, Erasmus University Rotterdam, PO Box 1738, NL-3000, Rotterdam, DR, The Netherlands</affiliation>
<affiliation>E-mail: w.w.hop@student.eur.nl</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Stephan</namePart>
<namePart type="family">Lachner</namePart>
<affiliation>Erasmus School of Economics, Erasmus University Rotterdam, PO Box 1738, NL-3000, Rotterdam, DR, The Netherlands</affiliation>
<affiliation>E-mail: s.lachner@student.eur.nl</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Flavius</namePart>
<namePart type="family">Frasincar</namePart>
<affiliation>Erasmus School of Economics, Erasmus University Rotterdam, PO Box 1738, NL-3000, Rotterdam, DR, The Netherlands</affiliation>
<affiliation>E-mail: frasincar@ese.eur.nl</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Roberto</namePart>
<namePart type="family">De Virgilio</namePart>
<affiliation>Dipartimento di Informatica e Automazione, Universitá Roma Tre, Rome, Italy</affiliation>
<affiliation>E-mail: devirgilio@dia.uniroma3.it</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<typeOfResource>text</typeOfResource>
<genre type="conference [eBooks]" displayLabel="OriginalPaper"></genre>
<originInfo>
<publisher>Springer Berlin Heidelberg</publisher>
<place>
<placeTerm type="text">Berlin, Heidelberg</placeTerm>
</place>
<dateIssued encoding="w3cdtf">2010</dateIssued>
<copyrightDate encoding="w3cdtf">2010</copyrightDate>
</originInfo>
<language>
<languageTerm type="code" authority="rfc3066">en</languageTerm>
<languageTerm type="code" authority="iso639-2b">eng</languageTerm>
</language>
<physicalDescription>
<internetMediaType>text/html</internetMediaType>
</physicalDescription>
<abstract lang="en">Abstract: Web pages are designed to be read by people, not machines. Consequently, searching and reusing information on the Web is a difficult task without human participation. Adding semantics (i.e meaning) to a Web page would help machines to understand Web contents and better support the Web search process. One of the latest developments in this field is Google’s Rich Snippets, a service for Web site owners to add semantics to their Web pages. In this paper we provide an approach to automatically annotate a Web page with Rich Snippets RDFa tags. Exploiting several heuristics and a named entity recognition technique, our method is capable of recognizing and annotating a subset of Rich Snippets’ vocabulary, i.e., all attributes of its Review concept, and the names of Person and Organization concepts. We implemented an on-line service and evaluated the accuracy of the approach on real E-commerce Web sites.</abstract>
<relatedItem type="host">
<titleInfo>
<title>On the Move to Meaningful Internet Systems, OTM 2010</title>
<subTitle>Confederated International Conferences: CoopIS, IS, DOA and ODBASE, Hersonissos, Crete, Greece, October 25-29, 2010, Proceedings, Part II</subTitle>
</titleInfo>
<name type="personal">
<namePart type="given">Robert</namePart>
<namePart type="family">Meersman</namePart>
<affiliation>STAR Lab, Vrije Universiteit Brussel (VUB), Bldg G/10, Pleinlaan 2, 1050, Brussels, Belgium</affiliation>
<affiliation>E-mail: meersman@vub.ac.be</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Tharam</namePart>
<namePart type="family">Dillon</namePart>
<affiliation>DEBII - CBS, Curtin University of Technology, De Laeter Way, 6102, Bentley, WA, Australia</affiliation>
<affiliation>E-mail: t.dillon@curtin.edu.au</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Pilar</namePart>
<namePart type="family">Herrero</namePart>
<affiliation>Facultad de Informática, Universidad Politécnica de Madrid, Campus de Montegancedo S/N, 28660, Boadilla del Monte, Madrid, Spain</affiliation>
<affiliation>E-mail: pherrero@fi.upm.es</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<genre type="Book Series" displayLabel="Proceedings"></genre>
<originInfo>
<copyrightDate encoding="w3cdtf">2010</copyrightDate>
<issuance>monographic</issuance>
</originInfo>
<subject>
<genre>Book Subject Collection</genre>
<topic authority="SpringerSubjectCodes" authorityURI="SUCO11645">Computer Science</topic>
</subject>
<subject>
<genre>Book Subject Group</genre>
<topic authority="SpringerSubjectCodes" authorityURI="I">Computer Science</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I13022">Computer Communication Networks</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I14029">Software Engineering</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I21017">Artificial Intelligence (incl. Robotics)</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I18040">Information Systems Applications (incl.Internet)</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I16021">Algorithm Analysis and Problem Complexity</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I24067">Management of Computing and Information Systems</topic>
</subject>
<identifier type="DOI">10.1007/978-3-642-16949-6</identifier>
<identifier type="ISBN">978-3-642-16948-9</identifier>
<identifier type="eISBN">978-3-642-16949-6</identifier>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="eISSN">1611-3349</identifier>
<identifier type="BookTitleID">215030</identifier>
<identifier type="BookID">978-3-642-16949-6</identifier>
<identifier type="BookChapterCount">36</identifier>
<identifier type="BookVolumeNumber">6427</identifier>
<identifier type="BookSequenceNumber">6427</identifier>
<identifier type="PartChapterCount">4</identifier>
<part>
<date>2010</date>
<detail type="part">
<title>Annotations</title>
</detail>
<detail type="volume">
<number>6427</number>
<caption>vol.</caption>
</detail>
<extent unit="pages">
<start>957</start>
<end>974</end>
</extent>
</part>
<recordInfo>
<recordOrigin>Springer Berlin Heidelberg, 2010</recordOrigin>
</recordInfo>
</relatedItem>
<relatedItem type="series">
<titleInfo>
<title>Lecture Notes in Computer Science</title>
</titleInfo>
<name type="personal">
<namePart type="given">David</namePart>
<namePart type="family">Hutchison</namePart>
<affiliation>Lancaster University, Lancaster, UK</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Takeo</namePart>
<namePart type="family">Kanade</namePart>
<affiliation>Carnegie Mellon University, Pittsburgh, PA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Josef</namePart>
<namePart type="family">Kittler</namePart>
<affiliation>University of Surrey, Guildford, UK</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Jon</namePart>
<namePart type="given">M.</namePart>
<namePart type="family">Kleinberg</namePart>
<affiliation>Cornell University, Ithaca, NY, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Friedemann</namePart>
<namePart type="family">Mattern</namePart>
<affiliation>ETH Zurich, Zurich, Switzerland</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">John</namePart>
<namePart type="given">C.</namePart>
<namePart type="family">Mitchell</namePart>
<affiliation>Stanford University, Stanford, CA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Moni</namePart>
<namePart type="family">Naor</namePart>
<affiliation>Weizmann Institute of Science, Rehovot, Israel</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Oscar</namePart>
<namePart type="family">Nierstrasz</namePart>
<affiliation>University of Bern, Bern, Switzerland</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">C.</namePart>
<namePart type="family">Pandu Rangan</namePart>
<affiliation>Indian Institute of Technology, Madras, India</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Bernhard</namePart>
<namePart type="family">Steffen</namePart>
<affiliation>University of Dortmund, Dortmund, Germany</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Madhu</namePart>
<namePart type="family">Sudan</namePart>
<affiliation>Massachusetts Institute of Technology, MA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Demetri</namePart>
<namePart type="family">Terzopoulos</namePart>
<affiliation>University of California, Los Angeles, CA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Doug</namePart>
<namePart type="family">Tygar</namePart>
<affiliation>University of California, Berkeley, CA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Moshe</namePart>
<namePart type="given">Y.</namePart>
<namePart type="family">Vardi</namePart>
<affiliation>Rice University, Houston, TX, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Gerhard</namePart>
<namePart type="family">Weikum</namePart>
<affiliation>Max-Planck Institute of Computer Science, Saarbrücken, Germany</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<copyrightDate encoding="w3cdtf">2010</copyrightDate>
<issuance>serial</issuance>
</originInfo>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="eISSN">1611-3349</identifier>
<identifier type="SeriesID">558</identifier>
<recordInfo>
<recordOrigin>Springer Berlin Heidelberg, 2010</recordOrigin>
</recordInfo>
</relatedItem>
<identifier type="istex">70B04F706C4028A34068DCBC9EC4803A88DF54B5</identifier>
<identifier type="DOI">10.1007/978-3-642-16949-6_21</identifier>
<identifier type="ChapterID">21</identifier>
<identifier type="ChapterID">Chap21</identifier>
<accessCondition type="use and reproduction" contentType="copyright">Springer Berlin Heidelberg, 2010</accessCondition>
<recordInfo>
<recordContentSource>SPRINGER</recordContentSource>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2010</recordOrigin>
</recordInfo>
</mods>
</metadata>
<enrichments>
<istex:refBibTEI uri="https://api.istex.fr/document/70B04F706C4028A34068DCBC9EC4803A88DF54B5/enrichments/refBib">
<teiHeader></teiHeader>
<text>
<front></front>
<body></body>
<back>
<listBibl>
<biblStruct xml:id="b0">
<analytic>
<title level="a" type="main">The Semantic Web</title>
<author>
<persName>
<forename type="first">T</forename>
<surname>Berners-Lee</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">J</forename>
<surname>Hendler</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">O</forename>
<surname>Lassila</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">Scientific American</title>
<imprint>
<biblScope unit="volume">284</biblScope>
<biblScope unit="page" from="34" to="43"></biblScope>
<date type="published" when="2001"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b1">
<monogr>
<title level="m" type="main">Introducing Rich Snippets</title>
<author>
<persName>
<forename type="first">K</forename>
<surname>Goel</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">R</forename>
<forename type="middle">V</forename>
<surname>Guha</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">O</forename>
<surname>Hansson</surname>
</persName>
</author>
<imprint></imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b2">
<analytic>
<title></title>
<author>
<persName>
<forename type="first">B</forename>
<surname>Adida</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">M</forename>
<surname>Birbeck</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Google: Google Webmaster Tools: About review data RDFa Primer: Bridging the Human and Data Webs</title>
<imprint>
<date type="published" when="2008"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b3">
<analytic>
<title level="a" type="main">Named Entity Recognition without gazetteers</title>
<author>
<persName>
<forename type="first">A</forename>
<surname>Mikheev</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">M</forename>
<surname>Moens</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">C</forename>
<surname>Grover</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Ninth Conference on European Chapter of the Association for Computational Linguistics</title>
<imprint>
<publisher>Association for Computational Linguistics</publisher>
<date type="published" when="1999"></date>
<biblScope unit="page" from="1" to="8"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b4">
<analytic>
<title level="a" type="main">University of Durham: Description of the LOLITA System as Used in MUC-6</title>
<author>
<persName>
<forename type="first">R</forename>
<surname>Morgan</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">R</forename>
<surname>Garigliano</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">P</forename>
<surname>Callaghan</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">S</forename>
<surname>Poria</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">M</forename>
<surname>Smith</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">A</forename>
<surname>Urbanowicz</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">R</forename>
<surname>Collingham</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">M</forename>
<surname>Costantino</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">C</forename>
<surname>Cooper</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">L</forename>
<surname>Group</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Sixth Message Understanding Conference</title>
<meeting>
<address>
<addrLine>San Francisco</addrLine>
</address>
</meeting>
<imprint>
<publisher>Morgan Kaufmann Publishers</publisher>
<date type="published" when="1995"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b5">
<analytic>
<title level="a" type="main">IsoQuest, Inc: Description of the NetOwl(TM) extractor system as used for MUC-7</title>
<author>
<persName>
<forename type="first">G</forename>
<forename type="middle">R</forename>
<surname>Krupka</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">K</forename>
<surname>Hausman</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Seventh Message Understanding Conference</title>
<imprint>
<date type="published" when="1998"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b6">
<monogr>
<title level="m" type="main">Search Engine Ranking Factors</title>
<author>
<persName>
<surname>Seomoz</surname>
</persName>
</author>
<imprint>
<date type="published" when="2009"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b7">
<analytic>
<title level="a" type="main">A Structured Approach to Data Reverse Engineering of Web Applications</title>
<author>
<persName>
<forename type="first">R</forename>
<forename type="middle">D</forename>
<surname>Virgilio</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">R</forename>
<surname>Torlone</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">9th International Conference on Web Engineering</title>
<meeting>
<address>
<addrLine>Heidelberg</addrLine>
</address>
</meeting>
<imprint>
<publisher>Springer</publisher>
<date type="published" when="2009"></date>
<biblScope unit="page" from="91" to="105"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b8">
<analytic>
<title level="a" type="main">Postal Address Detection from Web Documents</title>
<author>
<persName>
<forename type="first">L</forename>
<surname>Can</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">Z</forename>
<surname>Qian</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">M</forename>
<surname>Xiaofeng</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">L</forename>
<surname>Wenyin</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">International Workshop on Challenges in Web Information Retrieval and Integration</title>
<meeting>
<address>
<addrLine>Los Alamitos</addrLine>
</address>
</meeting>
<imprint>
<publisher>IEEE Computer Society</publisher>
<date type="published" when="2005"></date>
<biblScope unit="page" from="40" to="45"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b9">
<monogr>
<title level="m" type="main">Site Owner Overview</title>
<author>
<persName>
<forename type="first">!</forename>
<surname>Yahoo</surname>
</persName>
</author>
<author>
<persName>
<surname>Searchmonkey</surname>
</persName>
</author>
<imprint>
<date type="published" when="2009"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b10">
<analytic>
<title></title>
</analytic>
<monogr>
<title level="j">Electrum: Valid HTML Statistics</title>
<imprint>
<date type="published" when="2009"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b11">
<analytic>
<title level="a" type="main">RDFa versus Microformats: Exploring the Potential for Semantic Interoperability of Mash-up Personal Learning Environments</title>
<author>
<persName>
<forename type="first">V</forename>
<surname>Tomberg</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">M</forename>
<surname>Laanpere</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Second International Workshop on Mashup Personal Learning Environments</title>
<imprint>
<date type="published" when="2009"></date>
<biblScope unit="page" from="102" to="109"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b12">
<analytic>
<title level="a" type="main">Design Challenges and Misconceptions in Named Entity Recognition</title>
<author>
<persName>
<forename type="first">L</forename>
<surname>Ratinov</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<surname>Roth</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Thirteenth Conference on Computational Natural Language Learning</title>
<imprint>
<publisher>Association for Computational Linguistics</publisher>
<date type="published" when="2009"></date>
<biblScope unit="page" from="147" to="155"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b13">
<analytic>
<title level="a" type="main">Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews</title>
<author>
<persName>
<forename type="first">P</forename>
<surname>Turney</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">40th Annual Meeting of the Association for Computational Linguistics</title>
<imprint>
<date type="published" when="2002"></date>
<biblScope unit="page" from="417" to="424"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b14">
<analytic>
<title level="a" type="main">Thumbs up? Sentiment Classification using Machine Learning Techniques</title>
<author>
<persName>
<forename type="first">B</forename>
<surname>Pang</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">L</forename>
<surname>Lee</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">S</forename>
<surname>Vaithyanathan</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Conference on Emprirical Methods in Natural Language Processing</title>
<imprint>
<date type="published" when="2002"></date>
<biblScope unit="page" from="79" to="86"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b15">
<analytic>
<title level="a" type="main">Sentiment Classification of Online Reviews to Travel Destinations by Supervised Machine Learning Approaches</title>
<author>
<persName>
<forename type="first">Q</forename>
<surname>Ye</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">Z</forename>
<surname>Zhang</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">R</forename>
<surname>Law</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">Expert Systems with Applications</title>
<imprint>
<biblScope unit="volume">36</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="6527" to="6535"></biblScope>
<date type="published" when="2009"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b16">
<analytic>
<title level="a" type="main">Sentiment Classification of Movie Reviews Using Contextual Valence Shifters</title>
<author>
<persName>
<forename type="first">A</forename>
<surname>Kennedy</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<surname>Inkpen</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">Computational Intelligence</title>
<imprint>
<biblScope unit="volume">22</biblScope>
<biblScope unit="issue">2</biblScope>
<biblScope unit="page" from="110" to="225"></biblScope>
<date type="published" when="2006"></date>
</imprint>
</monogr>
</biblStruct>
</listBibl>
</back>
</text>
</istex:refBibTEI>
</enrichments>
</istex>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Istex/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002D97 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Istex/Corpus/biblio.hfd -nk 002D97 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Istex
   |étape=   Corpus
   |type=    RBID
   |clé=     ISTEX:70B04F706C4028A34068DCBC9EC4803A88DF54B5
   |texte=   Automatic Web Page Annotation with Google Rich Snippets
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024