Serveur d'exploration sur la musique en Sarre

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Harvesting and Organizing Knowledge from the Web

Identifieur interne : 000A93 ( Istex/Corpus ); précédent : 000A92; suivant : 000A94

Harvesting and Organizing Knowledge from the Web

Auteurs : Gerhard Weikum

Source :

RBID : ISTEX:674E11A735F9A5B51D1E8967050168105885CA1B

English descriptors

Abstract

Abstract: Information organization and search on the Web is gaining structure and context awareness and more semantic flavor, for example, in the forms of faceted search, vertical search, entity search, and Deep-Web search. I envision another big leap forward by automatically harvesting and organizing knowledge from the Web, represented in terms of explicit entities and relations as well as ontological concepts. This will be made possible by the confluence of three strong trends: 1) rich Semantic-Web-style knowledge repositories like ontologies and taxonomies, 2) large-scale information extraction from high-quality text sources such as Wikipedia, and 3) social tagging in the spirit of Web 2.0. I refer to the three directions as Semantic Web, Statistical Web, and Social Web (at the risk of some oversimplification), and I briefly characterize each of them.

Url:
DOI: 10.1007/978-3-540-75185-4_2

Links to Exploration step

ISTEX:674E11A735F9A5B51D1E8967050168105885CA1B

Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Harvesting and Organizing Knowledge from the Web</title>
<author>
<name sortKey="Weikum, Gerhard" sort="Weikum, Gerhard" uniqKey="Weikum G" first="Gerhard" last="Weikum">Gerhard Weikum</name>
<affiliation>
<mods:affiliation>Max-Planck Institute for Informatics, Saarbruecken, Germany</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: weikum@mpi-inf.mpg.de</mods:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:674E11A735F9A5B51D1E8967050168105885CA1B</idno>
<date when="2007" year="2007">2007</date>
<idno type="doi">10.1007/978-3-540-75185-4_2</idno>
<idno type="url">https://api.istex.fr/document/674E11A735F9A5B51D1E8967050168105885CA1B/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000A93</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Corpus" wicri:corpus="ISTEX">000A93</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Harvesting and Organizing Knowledge from the Web</title>
<author>
<name sortKey="Weikum, Gerhard" sort="Weikum, Gerhard" uniqKey="Weikum G" first="Gerhard" last="Weikum">Gerhard Weikum</name>
<affiliation>
<mods:affiliation>Max-Planck Institute for Informatics, Saarbruecken, Germany</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: weikum@mpi-inf.mpg.de</mods:affiliation>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2007</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="Teeft" xml:lang="en">
<term>Abstract information organization</term>
<term>Adbis</term>
<term>Agichtein</term>
<term>Algorithmic</term>
<term>Annotating</term>
<term>Artif</term>
<term>Auer</term>
<term>Avor</term>
<term>Banko</term>
<term>Berlin heidelberg</term>
<term>Best example</term>
<term>Broadhead</term>
<term>Cafarella</term>
<term>Certain types</term>
<term>Computationally</term>
<term>Conceptnet</term>
<term>Context awareness</term>
<term>Data management issues</term>
<term>Database</term>
<term>December</term>
<term>Downey</term>
<term>Eld</term>
<term>Elusive goal</term>
<term>Enormous progress</term>
<term>Entity search</term>
<term>Eswc</term>
<term>Etzioni</term>
<term>Experimental study</term>
<term>Explicit entities</term>
<term>Explicit knowledge sources</term>
<term>Explicit structure</term>
<term>Faceted</term>
<term>Faceted search</term>
<term>Fellbaum</term>
<term>Folksonomies</term>
<term>Geneontology</term>
<term>Gerhard weikum institute</term>
<term>Gloomy picture</term>
<term>Glorious form</term>
<term>Great opportunties</term>
<term>Harvest knowledge</term>
<term>Heidelberg</term>
<term>Human contributions</term>
<term>Human supervision</term>
<term>Hyperlinked</term>
<term>Hyperlinked text</term>
<term>Ieee</term>
<term>Ieee data engineering bulletin</term>
<term>Ifrim</term>
<term>Ijcai</term>
<term>Informatics</term>
<term>Informatics saarbruecken</term>
<term>Information extraction</term>
<term>Innsbruck</term>
<term>Intell</term>
<term>Interesting asset</term>
<term>Interesting research themes</term>
<term>Ioannidis</term>
<term>June</term>
<term>Kasneci</term>
<term>Knowledge bases</term>
<term>Knowledge management</term>
<term>Knowledge repositories</term>
<term>Knowledge sources</term>
<term>Koudas</term>
<term>Large extent</term>
<term>Leipzig</term>
<term>Link structure</term>
<term>Lncs</term>
<term>Major advances</term>
<term>More knowledge</term>
<term>Multilingual</term>
<term>Multilingual thesauri</term>
<term>Music bands</term>
<term>Natural language processing</term>
<term>Novikov</term>
<term>Ontological concepts</term>
<term>Ontology</term>
<term>Open information extraction</term>
<term>Opencyc</term>
<term>Opportunties</term>
<term>Other sources</term>
<term>Popescu</term>
<term>Rachev</term>
<term>Recent years</term>
<term>Relation patterns</term>
<term>Rich knowledge repositories</term>
<term>Rigorous representations</term>
<term>Rocket science</term>
<term>Saarbruecken</term>
<term>Sarawagi</term>
<term>Scalable</term>
<term>Scalable information extraction</term>
<term>Semantic knowledge</term>
<term>Semantics</term>
<term>Shaked</term>
<term>Similar sources</term>
<term>Snomed</term>
<term>Social networks</term>
<term>Soderland</term>
<term>Special issue</term>
<term>Springer</term>
<term>Staab</term>
<term>Statistical analysis</term>
<term>Strong proliferation</term>
<term>Strong trends</term>
<term>Studer</term>
<term>Such issues</term>
<term>Suchanek</term>
<term>Suciu</term>
<term>Sumo</term>
<term>Synergy</term>
<term>Taxonomy</term>
<term>Technology entity recognition</term>
<term>Terminological</term>
<term>Terminological taxonomies</term>
<term>Text sources</term>
<term>Thematic categories</term>
<term>Thesaurus</term>
<term>Topic recognition</term>
<term>Tutorial</term>
<term>Tutorial slides</term>
<term>Umls</term>
<term>Unsupervised</term>
<term>Unsupervised extraction</term>
<term>Vertical search</term>
<term>Weikum</term>
<term>Wiki</term>
<term>Wiki content</term>
<term>Wikipedia</term>
<term>Wordnet</term>
<term>Yago</term>
<term>Yates</term>
</keywords>
</textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: Information organization and search on the Web is gaining structure and context awareness and more semantic flavor, for example, in the forms of faceted search, vertical search, entity search, and Deep-Web search. I envision another big leap forward by automatically harvesting and organizing knowledge from the Web, represented in terms of explicit entities and relations as well as ontological concepts. This will be made possible by the confluence of three strong trends: 1) rich Semantic-Web-style knowledge repositories like ontologies and taxonomies, 2) large-scale information extraction from high-quality text sources such as Wikipedia, and 3) social tagging in the spirit of Web 2.0. I refer to the three directions as Semantic Web, Statistical Web, and Social Web (at the risk of some oversimplification), and I briefly characterize each of them.</div>
</front>
</TEI>
<istex>
<corpusName>springer</corpusName>
<keywords>
<teeft>
<json:string>wikipedia</json:string>
<json:string>ontology</json:string>
<json:string>weikum</json:string>
<json:string>wordnet</json:string>
<json:string>taxonomy</json:string>
<json:string>scalable</json:string>
<json:string>semantics</json:string>
<json:string>suchanek</json:string>
<json:string>soderland</json:string>
<json:string>etzioni</json:string>
<json:string>ieee</json:string>
<json:string>ieee data engineering bulletin</json:string>
<json:string>special issue</json:string>
<json:string>cafarella</json:string>
<json:string>strong trends</json:string>
<json:string>rich knowledge repositories</json:string>
<json:string>december</json:string>
<json:string>informatics</json:string>
<json:string>information extraction</json:string>
<json:string>text sources</json:string>
<json:string>saarbruecken</json:string>
<json:string>glorious form</json:string>
<json:string>elusive goal</json:string>
<json:string>suciu</json:string>
<json:string>sumo</json:string>
<json:string>opencyc</json:string>
<json:string>conceptnet</json:string>
<json:string>informatics saarbruecken</json:string>
<json:string>terminological</json:string>
<json:string>terminological taxonomies</json:string>
<json:string>geneontology</json:string>
<json:string>snomed</json:string>
<json:string>umls</json:string>
<json:string>knowledge sources</json:string>
<json:string>more knowledge</json:string>
<json:string>rigorous representations</json:string>
<json:string>multilingual</json:string>
<json:string>multilingual thesauri</json:string>
<json:string>interesting asset</json:string>
<json:string>technology entity recognition</json:string>
<json:string>relation patterns</json:string>
<json:string>enormous progress</json:string>
<json:string>abstract information organization</json:string>
<json:string>recent years</json:string>
<json:string>human supervision</json:string>
<json:string>major advances</json:string>
<json:string>eld</json:string>
<json:string>natural language processing</json:string>
<json:string>algorithmic</json:string>
<json:string>computationally</json:string>
<json:string>ioannidis</json:string>
<json:string>novikov</json:string>
<json:string>rachev</json:string>
<json:string>adbis</json:string>
<json:string>lncs</json:string>
<json:string>semantic knowledge</json:string>
<json:string>berlin heidelberg</json:string>
<json:string>gloomy picture</json:string>
<json:string>such issues</json:string>
<json:string>harvest knowledge</json:string>
<json:string>rocket science</json:string>
<json:string>large extent</json:string>
<json:string>human contributions</json:string>
<json:string>annotating</json:string>
<json:string>folksonomies</json:string>
<json:string>strong proliferation</json:string>
<json:string>gerhard weikum institute</json:string>
<json:string>explicit structure</json:string>
<json:string>topic recognition</json:string>
<json:string>best example</json:string>
<json:string>hyperlinked</json:string>
<json:string>hyperlinked text</json:string>
<json:string>link structure</json:string>
<json:string>thematic categories</json:string>
<json:string>certain types</json:string>
<json:string>music bands</json:string>
<json:string>similar sources</json:string>
<json:string>knowledge bases</json:string>
<json:string>other sources</json:string>
<json:string>interesting research themes</json:string>
<json:string>explicit knowledge sources</json:string>
<json:string>synergy</json:string>
<json:string>opportunties</json:string>
<json:string>great opportunties</json:string>
<json:string>knowledge management</json:string>
<json:string>agichtein</json:string>
<json:string>sarawagi</json:string>
<json:string>scalable information extraction</json:string>
<json:string>tutorial</json:string>
<json:string>tutorial slides</json:string>
<json:string>auer</json:string>
<json:string>innsbruck</json:string>
<json:string>leipzig</json:string>
<json:string>context awareness</json:string>
<json:string>wiki</json:string>
<json:string>wiki content</json:string>
<json:string>eswc</json:string>
<json:string>banko</json:string>
<json:string>avor</json:string>
<json:string>faceted</json:string>
<json:string>broadhead</json:string>
<json:string>faceted search</json:string>
<json:string>open information extraction</json:string>
<json:string>ijcai</json:string>
<json:string>downey</json:string>
<json:string>popescu</json:string>
<json:string>shaked</json:string>
<json:string>yates</json:string>
<json:string>unsupervised</json:string>
<json:string>unsupervised extraction</json:string>
<json:string>experimental study</json:string>
<json:string>artif</json:string>
<json:string>intell</json:string>
<json:string>fellbaum</json:string>
<json:string>database</json:string>
<json:string>koudas</json:string>
<json:string>vertical search</json:string>
<json:string>entity search</json:string>
<json:string>explicit entities</json:string>
<json:string>data management issues</json:string>
<json:string>social networks</json:string>
<json:string>june</json:string>
<json:string>staab</json:string>
<json:string>studer</json:string>
<json:string>springer</json:string>
<json:string>ontological concepts</json:string>
<json:string>ifrim</json:string>
<json:string>statistical analysis</json:string>
<json:string>kasneci</json:string>
<json:string>yago</json:string>
<json:string>knowledge repositories</json:string>
<json:string>heidelberg</json:string>
<json:string>thesaurus</json:string>
</teeft>
</keywords>
<author>
<json:item>
<name>Gerhard Weikum</name>
<affiliations>
<json:string>Max-Planck Institute for Informatics, Saarbruecken, Germany</json:string>
<json:string>E-mail: weikum@mpi-inf.mpg.de</json:string>
</affiliations>
</json:item>
</author>
<language>
<json:string>eng</json:string>
</language>
<originalGenre>
<json:string>OriginalPaper</json:string>
</originalGenre>
<abstract>Abstract: Information organization and search on the Web is gaining structure and context awareness and more semantic flavor, for example, in the forms of faceted search, vertical search, entity search, and Deep-Web search. I envision another big leap forward by automatically harvesting and organizing knowledge from the Web, represented in terms of explicit entities and relations as well as ontological concepts. This will be made possible by the confluence of three strong trends: 1) rich Semantic-Web-style knowledge repositories like ontologies and taxonomies, 2) large-scale information extraction from high-quality text sources such as Wikipedia, and 3) social tagging in the spirit of Web 2.0. I refer to the three directions as Semantic Web, Statistical Web, and Social Web (at the risk of some oversimplification), and I briefly characterize each of them.</abstract>
<qualityIndicators>
<score>2.334</score>
<pdfWordCount>774</pdfWordCount>
<pdfCharCount>4950</pdfCharCount>
<pdfVersion>1.3</pdfVersion>
<pdfPageCount>2</pdfPageCount>
<pdfPageSize>430 x 660 pts</pdfPageSize>
<refBibsNative>false</refBibsNative>
<abstractWordCount>130</abstractWordCount>
<abstractCharCount>866</abstractCharCount>
<keywordCount>0</keywordCount>
</qualityIndicators>
<title>Harvesting and Organizing Knowledge from the Web</title>
<chapterId>
<json:string>2</json:string>
<json:string>Chap2</json:string>
</chapterId>
<genre>
<json:string>conference</json:string>
</genre>
<serie>
<title>Lecture Notes in Computer Science</title>
<language>
<json:string>unknown</json:string>
</language>
<copyrightDate>2007</copyrightDate>
<issn>
<json:string>0302-9743</json:string>
</issn>
<eissn>
<json:string>1611-3349</json:string>
</eissn>
<editor>
<json:item>
<name>David Hutchison</name>
</json:item>
<json:item>
<name>Takeo Kanade</name>
</json:item>
<json:item>
<name>Josef Kittler</name>
</json:item>
<json:item>
<name>Jon M. Kleinberg</name>
</json:item>
<json:item>
<name>Friedemann Mattern</name>
</json:item>
<json:item>
<name>John C. Mitchell</name>
</json:item>
<json:item>
<name>Moni Naor</name>
</json:item>
<json:item>
<name>Oscar Nierstrasz</name>
</json:item>
<json:item>
<name>C. Pandu Rangan</name>
</json:item>
<json:item>
<name>Bernhard Steffen</name>
</json:item>
<json:item>
<name>Madhu Sudan</name>
</json:item>
<json:item>
<name>Demetri Terzopoulos</name>
</json:item>
<json:item>
<name>Doug Tygar</name>
</json:item>
<json:item>
<name>Moshe Y. Vardi</name>
</json:item>
<json:item>
<name>Gerhard Weikum</name>
</json:item>
</editor>
</serie>
<host>
<title>Advances in Databases and Information Systems</title>
<language>
<json:string>unknown</json:string>
</language>
<copyrightDate>2007</copyrightDate>
<doi>
<json:string>10.1007/978-3-540-75185-4</json:string>
</doi>
<issn>
<json:string>0302-9743</json:string>
</issn>
<eissn>
<json:string>1611-3349</json:string>
</eissn>
<eisbn>
<json:string>978-3-540-75185-4</json:string>
</eisbn>
<bookId>
<json:string>978-3-540-75185-4</json:string>
</bookId>
<isbn>
<json:string>978-3-540-75184-7</json:string>
</isbn>
<volume>4690</volume>
<pages>
<first>12</first>
<last>13</last>
</pages>
<genre>
<json:string>book-series</json:string>
</genre>
<editor>
<json:item>
<name>Yannis Ioannidis</name>
</json:item>
<json:item>
<name>Boris Novikov</name>
</json:item>
<json:item>
<name>Boris Rachev</name>
</json:item>
</editor>
<subject>
<json:item>
<value>Computer Science</value>
</json:item>
<json:item>
<value>Computer Science</value>
</json:item>
<json:item>
<value>Database Management</value>
</json:item>
<json:item>
<value>Information Storage and Retrieval</value>
</json:item>
<json:item>
<value>Data Mining and Knowledge Discovery</value>
</json:item>
<json:item>
<value>Information Systems Applications (incl.Internet)</value>
</json:item>
<json:item>
<value>Multimedia Information Systems</value>
</json:item>
<json:item>
<value>Business Information Systems</value>
</json:item>
</subject>
</host>
<categories>
<inist>
<json:string>sciences humaines et sociales</json:string>
</inist>
</categories>
<publicationDate>2007</publicationDate>
<copyrightDate>2007</copyrightDate>
<doi>
<json:string>10.1007/978-3-540-75185-4_2</json:string>
</doi>
<id>674E11A735F9A5B51D1E8967050168105885CA1B</id>
<score>1</score>
<fulltext>
<json:item>
<extension>pdf</extension>
<original>true</original>
<mimetype>application/pdf</mimetype>
<uri>https://api.istex.fr/document/674E11A735F9A5B51D1E8967050168105885CA1B/fulltext/pdf</uri>
</json:item>
<json:item>
<extension>zip</extension>
<original>false</original>
<mimetype>application/zip</mimetype>
<uri>https://api.istex.fr/document/674E11A735F9A5B51D1E8967050168105885CA1B/fulltext/zip</uri>
</json:item>
<istex:fulltextTEI uri="https://api.istex.fr/document/674E11A735F9A5B51D1E8967050168105885CA1B/fulltext/tei">
<teiHeader>
<fileDesc>
<titleStmt>
<title level="a" type="main" xml:lang="en">Harvesting and Organizing Knowledge from the Web</title>
</titleStmt>
<publicationStmt>
<authority>ISTEX</authority>
<publisher>Springer Berlin Heidelberg</publisher>
<pubPlace>Berlin, Heidelberg</pubPlace>
<availability>
<p>Springer-Verlag Berlin Heidelberg, 2007</p>
</availability>
<date>2007</date>
</publicationStmt>
<sourceDesc>
<biblStruct type="inbook">
<analytic>
<title level="a" type="main" xml:lang="en">Harvesting and Organizing Knowledge from the Web</title>
<author xml:id="author-0000">
<persName>
<forename type="first">Gerhard</forename>
<surname>Weikum</surname>
</persName>
<email>weikum@mpi-inf.mpg.de</email>
<affiliation>Max-Planck Institute for Informatics, Saarbruecken, Germany</affiliation>
</author>
<idno type="istex">674E11A735F9A5B51D1E8967050168105885CA1B</idno>
<idno type="DOI">10.1007/978-3-540-75185-4_2</idno>
<idno type="ChapterID">2</idno>
<idno type="ChapterID">Chap2</idno>
</analytic>
<monogr>
<title level="m">Advances in Databases and Information Systems</title>
<title level="m" type="sub">11th East European Conference, ADBIS 2007, Varna, Bulgaria, September 29-October 3, 2007. Proceedings</title>
<idno type="DOI">10.1007/978-3-540-75185-4</idno>
<idno type="pISBN">978-3-540-75184-7</idno>
<idno type="eISBN">978-3-540-75185-4</idno>
<idno type="pISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="book-title-ID">156678</idno>
<idno type="book-ID">978-3-540-75185-4</idno>
<idno type="book-chapter-count">26</idno>
<idno type="book-volume-number">4690</idno>
<idno type="book-sequence-number">4690</idno>
<idno type="PartChapterCount">3</idno>
<editor xml:id="book-author-0000">
<persName>
<forename type="first">Yannis</forename>
<surname>Ioannidis</surname>
</persName>
</editor>
<editor xml:id="book-author-0001">
<persName>
<forename type="first">Boris</forename>
<surname>Novikov</surname>
</persName>
</editor>
<editor xml:id="book-author-0002">
<persName>
<forename type="first">Boris</forename>
<surname>Rachev</surname>
</persName>
</editor>
<imprint>
<publisher>Springer Berlin Heidelberg</publisher>
<pubPlace>Berlin, Heidelberg</pubPlace>
<date type="published" when="2007"></date>
<biblScope unit="volume">4690</biblScope>
<biblScope unit="page" from="12">12</biblScope>
<biblScope unit="page" to="13">13</biblScope>
</imprint>
</monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<editor xml:id="serie-author-0000">
<persName>
<forename type="first">David</forename>
<surname>Hutchison</surname>
</persName>
</editor>
<editor xml:id="serie-author-0001">
<persName>
<forename type="first">Takeo</forename>
<surname>Kanade</surname>
</persName>
</editor>
<editor xml:id="serie-author-0002">
<persName>
<forename type="first">Josef</forename>
<surname>Kittler</surname>
</persName>
</editor>
<editor xml:id="serie-author-0003">
<persName>
<forename type="first">Jon</forename>
<forename type="first">M.</forename>
<surname>Kleinberg</surname>
</persName>
</editor>
<editor xml:id="serie-author-0004">
<persName>
<forename type="first">Friedemann</forename>
<surname>Mattern</surname>
</persName>
</editor>
<editor xml:id="serie-author-0005">
<persName>
<forename type="first">John</forename>
<forename type="first">C.</forename>
<surname>Mitchell</surname>
</persName>
</editor>
<editor xml:id="serie-author-0006">
<persName>
<forename type="first">Moni</forename>
<surname>Naor</surname>
</persName>
</editor>
<editor xml:id="serie-author-0007">
<persName>
<forename type="first">Oscar</forename>
<surname>Nierstrasz</surname>
</persName>
</editor>
<editor xml:id="serie-author-0008">
<persName>
<forename type="first">C.</forename>
<surname>Pandu Rangan</surname>
</persName>
</editor>
<editor xml:id="serie-author-0009">
<persName>
<forename type="first">Bernhard</forename>
<surname>Steffen</surname>
</persName>
</editor>
<editor xml:id="serie-author-0010">
<persName>
<forename type="first">Madhu</forename>
<surname>Sudan</surname>
</persName>
</editor>
<editor xml:id="serie-author-0011">
<persName>
<forename type="first">Demetri</forename>
<surname>Terzopoulos</surname>
</persName>
</editor>
<editor xml:id="serie-author-0012">
<persName>
<forename type="first">Doug</forename>
<surname>Tygar</surname>
</persName>
</editor>
<editor xml:id="serie-author-0013">
<persName>
<forename type="first">Moshe</forename>
<forename type="first">Y.</forename>
<surname>Vardi</surname>
</persName>
</editor>
<editor xml:id="serie-author-0014">
<persName>
<forename type="first">Gerhard</forename>
<surname>Weikum</surname>
</persName>
</editor>
<biblScope>
<date>2007</date>
</biblScope>
<idno type="pISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="series-Id">558</idno>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<creation>
<date>2007</date>
</creation>
<langUsage>
<language ident="en">en</language>
</langUsage>
<abstract xml:lang="en">
<p>Abstract: Information organization and search on the Web is gaining structure and context awareness and more semantic flavor, for example, in the forms of faceted search, vertical search, entity search, and Deep-Web search. I envision another big leap forward by automatically harvesting and organizing knowledge from the Web, represented in terms of explicit entities and relations as well as ontological concepts. This will be made possible by the confluence of three strong trends: 1) rich Semantic-Web-style knowledge repositories like ontologies and taxonomies, 2) large-scale information extraction from high-quality text sources such as Wikipedia, and 3) social tagging in the spirit of Web 2.0. I refer to the three directions as Semantic Web, Statistical Web, and Social Web (at the risk of some oversimplification), and I briefly characterize each of them.</p>
</abstract>
<textClass>
<keywords scheme="Book-Subject-Collection">
<list>
<label>SUCO11645</label>
<item>
<term>Computer Science</term>
</item>
</list>
</keywords>
</textClass>
<textClass>
<keywords scheme="Book-Subject-Group">
<list>
<label>I</label>
<item>
<term>Computer Science</term>
</item>
<label>I18024</label>
<item>
<term>Database Management</term>
</item>
<label>I18032</label>
<item>
<term>Information Storage and Retrieval</term>
</item>
<label>I18030</label>
<item>
<term>Data Mining and Knowledge Discovery</term>
</item>
<label>I18040</label>
<item>
<term>Information Systems Applications (incl.Internet)</term>
</item>
<label>I18059</label>
<item>
<term>Multimedia Information Systems</term>
</item>
<label>W26007</label>
<item>
<term>Business Information Systems</term>
</item>
</list>
</keywords>
</textClass>
</profileDesc>
<revisionDesc>
<change when="2007">Published</change>
</revisionDesc>
</teiHeader>
</istex:fulltextTEI>
<json:item>
<extension>txt</extension>
<original>false</original>
<mimetype>text/plain</mimetype>
<uri>https://api.istex.fr/document/674E11A735F9A5B51D1E8967050168105885CA1B/fulltext/txt</uri>
</json:item>
</fulltext>
<metadata>
<istex:metadataXml wicri:clean="Springer, Publisher found" wicri:toSee="no header">
<istex:xmlDeclaration>version="1.0" encoding="UTF-8"</istex:xmlDeclaration>
<istex:docType PUBLIC="-//Springer-Verlag//DTD A++ V2.4//EN" URI="http://devel.springer.de/A++/V2.4/DTD/A++V2.4.dtd" name="istex:docType"></istex:docType>
<istex:document>
<Publisher>
<PublisherInfo>
<PublisherName>Springer Berlin Heidelberg</PublisherName>
<PublisherLocation>Berlin, Heidelberg</PublisherLocation>
</PublisherInfo>
<Series>
<SeriesInfo SeriesType="Series" TocLevels="0">
<SeriesID>558</SeriesID>
<SeriesPrintISSN>0302-9743</SeriesPrintISSN>
<SeriesElectronicISSN>1611-3349</SeriesElectronicISSN>
<SeriesTitle Language="En">Lecture Notes in Computer Science</SeriesTitle>
</SeriesInfo>
<SeriesHeader>
<EditorGroup>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>David</GivenName>
<FamilyName>Hutchison</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>Takeo</GivenName>
<FamilyName>Kanade</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>Josef</GivenName>
<FamilyName>Kittler</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>Jon</GivenName>
<GivenName>M.</GivenName>
<FamilyName>Kleinberg</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>Friedemann</GivenName>
<FamilyName>Mattern</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>John</GivenName>
<GivenName>C.</GivenName>
<FamilyName>Mitchell</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>Moni</GivenName>
<FamilyName>Naor</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>Oscar</GivenName>
<FamilyName>Nierstrasz</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>C.</GivenName>
<FamilyName>Pandu Rangan</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>Bernhard</GivenName>
<FamilyName>Steffen</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>Madhu</GivenName>
<FamilyName>Sudan</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>Demetri</GivenName>
<FamilyName>Terzopoulos</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>Doug</GivenName>
<FamilyName>Tygar</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>Moshe</GivenName>
<GivenName>Y.</GivenName>
<FamilyName>Vardi</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>Gerhard</GivenName>
<FamilyName>Weikum</FamilyName>
</EditorName>
</Editor>
</EditorGroup>
</SeriesHeader>
<Book Language="En">
<BookInfo BookProductType="Proceedings" ContainsESM="No" Language="En" MediaType="eBook" NumberingStyle="Unnumbered" OutputMedium="All" TocLevels="0">
<BookID>978-3-540-75185-4</BookID>
<BookTitle>Advances in Databases and Information Systems</BookTitle>
<BookSubTitle>11th East European Conference, ADBIS 2007, Varna, Bulgaria, September 29-October 3, 2007. Proceedings</BookSubTitle>
<BookVolumeNumber>4690</BookVolumeNumber>
<BookSequenceNumber>4690</BookSequenceNumber>
<BookDOI>10.1007/978-3-540-75185-4</BookDOI>
<BookTitleID>156678</BookTitleID>
<BookPrintISBN>978-3-540-75184-7</BookPrintISBN>
<BookElectronicISBN>978-3-540-75185-4</BookElectronicISBN>
<BookChapterCount>26</BookChapterCount>
<BookCopyright>
<CopyrightHolderName>Springer-Verlag Berlin Heidelberg</CopyrightHolderName>
<CopyrightYear>2007</CopyrightYear>
</BookCopyright>
<BookSubjectGroup>
<BookSubject Code="I" Type="Primary">Computer Science</BookSubject>
<BookSubject Code="I18024" Priority="1" Type="Secondary">Database Management</BookSubject>
<BookSubject Code="I18032" Priority="2" Type="Secondary">Information Storage and Retrieval</BookSubject>
<BookSubject Code="I18030" Priority="3" Type="Secondary">Data Mining and Knowledge Discovery</BookSubject>
<BookSubject Code="I18040" Priority="4" Type="Secondary">Information Systems Applications (incl.Internet)</BookSubject>
<BookSubject Code="I18059" Priority="5" Type="Secondary">Multimedia Information Systems</BookSubject>
<BookSubject Code="W26007" Priority="6" Type="Secondary">Business Information Systems</BookSubject>
<SubjectCollection Code="SUCO11645">Computer Science</SubjectCollection>
</BookSubjectGroup>
</BookInfo>
<BookHeader>
<EditorGroup>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>Yannis</GivenName>
<FamilyName>Ioannidis</FamilyName>
</EditorName>
<Contact>
<Email>yannis@di.uoa.gr</Email>
</Contact>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>Boris</GivenName>
<FamilyName>Novikov</FamilyName>
</EditorName>
<Contact>
<Email>borisnov@acm.org</Email>
</Contact>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>Boris</GivenName>
<FamilyName>Rachev</FamilyName>
</EditorName>
<Contact>
<Email>brachev@gmail.com</Email>
</Contact>
</Editor>
</EditorGroup>
</BookHeader>
<Part ID="Part1">
<PartInfo TocLevels="0">
<PartID>1</PartID>
<PartSequenceNumber>1</PartSequenceNumber>
<PartTitle>Invited Lectures</PartTitle>
<PartChapterCount>3</PartChapterCount>
<PartContext>
<SeriesID>558</SeriesID>
<BookTitle>Advances in Databases and Information Systems</BookTitle>
</PartContext>
</PartInfo>
<Chapter ID="Chap2" Language="En">
<ChapterInfo ChapterType="OriginalPaper" ContainsESM="No" NumberingStyle="Unnumbered" TocLevels="0">
<ChapterID>2</ChapterID>
<ChapterDOI>10.1007/978-3-540-75185-4_2</ChapterDOI>
<ChapterSequenceNumber>2</ChapterSequenceNumber>
<ChapterTitle Language="En">Harvesting and Organizing Knowledge from the Web</ChapterTitle>
<ChapterFirstPage>12</ChapterFirstPage>
<ChapterLastPage>13</ChapterLastPage>
<ChapterCopyright>
<CopyrightHolderName>Springer-Verlag Berlin Heidelberg</CopyrightHolderName>
<CopyrightYear>2007</CopyrightYear>
</ChapterCopyright>
<ChapterGrants Type="Regular">
<MetadataGrant Grant="OpenAccess"></MetadataGrant>
<AbstractGrant Grant="OpenAccess"></AbstractGrant>
<BodyPDFGrant Grant="Restricted"></BodyPDFGrant>
<BodyHTMLGrant Grant="Restricted"></BodyHTMLGrant>
<BibliographyGrant Grant="Restricted"></BibliographyGrant>
<ESMGrant Grant="Restricted"></ESMGrant>
</ChapterGrants>
<ChapterContext>
<SeriesID>558</SeriesID>
<PartID>1</PartID>
<BookID>978-3-540-75185-4</BookID>
<BookTitle>Advances in Databases and Information Systems</BookTitle>
</ChapterContext>
</ChapterInfo>
<ChapterHeader>
<AuthorGroup>
<Author AffiliationIDS="Aff1">
<AuthorName DisplayOrder="Western">
<GivenName>Gerhard</GivenName>
<FamilyName>Weikum</FamilyName>
</AuthorName>
<Contact>
<Email>weikum@mpi-inf.mpg.de</Email>
</Contact>
</Author>
<Affiliation ID="Aff1">
<OrgName>Max-Planck Institute for Informatics, Saarbruecken</OrgName>
<OrgAddress>
<Country>Germany</Country>
</OrgAddress>
</Affiliation>
</AuthorGroup>
<Abstract ID="Abs1" Language="En">
<Heading>Abstract</Heading>
<Para>Information organization and search on the Web is gaining structure and context awareness and more semantic flavor, for example, in the forms of faceted search, vertical search, entity search, and Deep-Web search. I envision another big leap forward by automatically harvesting and organizing knowledge from the Web, represented in terms of explicit entities and relations as well as ontological concepts. This will be made possible by the confluence of three strong trends: 1) rich Semantic-Web-style knowledge repositories like ontologies and taxonomies, 2) large-scale information extraction from high-quality text sources such as Wikipedia, and 3) social tagging in the spirit of Web 2.0. I refer to the three directions as Semantic Web, Statistical Web, and Social Web (at the risk of some oversimplification), and I briefly characterize each of them.</Para>
</Abstract>
</ChapterHeader>
<NoBody></NoBody>
</Chapter>
</Part>
</Book>
</Series>
</Publisher>
</istex:document>
</istex:metadataXml>
<mods version="3.6">
<titleInfo lang="en">
<title>Harvesting and Organizing Knowledge from the Web</title>
</titleInfo>
<titleInfo type="alternative" contentType="CDATA" lang="en">
<title>Harvesting and Organizing Knowledge from the Web</title>
</titleInfo>
<name type="personal">
<namePart type="given">Gerhard</namePart>
<namePart type="family">Weikum</namePart>
<affiliation>Max-Planck Institute for Informatics, Saarbruecken, Germany</affiliation>
<affiliation>E-mail: weikum@mpi-inf.mpg.de</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<typeOfResource>text</typeOfResource>
<genre type="conference" displayLabel="OriginalPaper"></genre>
<originInfo>
<publisher>Springer Berlin Heidelberg</publisher>
<place>
<placeTerm type="text">Berlin, Heidelberg</placeTerm>
</place>
<dateIssued encoding="w3cdtf">2007</dateIssued>
<copyrightDate encoding="w3cdtf">2007</copyrightDate>
</originInfo>
<language>
<languageTerm type="code" authority="rfc3066">en</languageTerm>
<languageTerm type="code" authority="iso639-2b">eng</languageTerm>
</language>
<physicalDescription>
<internetMediaType>text/html</internetMediaType>
</physicalDescription>
<abstract lang="en">Abstract: Information organization and search on the Web is gaining structure and context awareness and more semantic flavor, for example, in the forms of faceted search, vertical search, entity search, and Deep-Web search. I envision another big leap forward by automatically harvesting and organizing knowledge from the Web, represented in terms of explicit entities and relations as well as ontological concepts. This will be made possible by the confluence of three strong trends: 1) rich Semantic-Web-style knowledge repositories like ontologies and taxonomies, 2) large-scale information extraction from high-quality text sources such as Wikipedia, and 3) social tagging in the spirit of Web 2.0. I refer to the three directions as Semantic Web, Statistical Web, and Social Web (at the risk of some oversimplification), and I briefly characterize each of them.</abstract>
<relatedItem type="host">
<titleInfo>
<title>Advances in Databases and Information Systems</title>
<subTitle>11th East European Conference, ADBIS 2007, Varna, Bulgaria, September 29-October 3, 2007. Proceedings</subTitle>
</titleInfo>
<name type="personal">
<namePart type="given">Yannis</namePart>
<namePart type="family">Ioannidis</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Boris</namePart>
<namePart type="family">Novikov</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Boris</namePart>
<namePart type="family">Rachev</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<genre type="book-series" displayLabel="Proceedings"></genre>
<originInfo>
<copyrightDate encoding="w3cdtf">2007</copyrightDate>
<issuance>monographic</issuance>
</originInfo>
<subject>
<genre>Book-Subject-Collection</genre>
<topic authority="SpringerSubjectCodes" authorityURI="SUCO11645">Computer Science</topic>
</subject>
<subject>
<genre>Book-Subject-Group</genre>
<topic authority="SpringerSubjectCodes" authorityURI="I">Computer Science</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I18024">Database Management</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I18032">Information Storage and Retrieval</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I18030">Data Mining and Knowledge Discovery</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I18040">Information Systems Applications (incl.Internet)</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I18059">Multimedia Information Systems</topic>
<topic authority="SpringerSubjectCodes" authorityURI="W26007">Business Information Systems</topic>
</subject>
<identifier type="DOI">10.1007/978-3-540-75185-4</identifier>
<identifier type="ISBN">978-3-540-75184-7</identifier>
<identifier type="eISBN">978-3-540-75185-4</identifier>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="eISSN">1611-3349</identifier>
<identifier type="BookTitleID">156678</identifier>
<identifier type="BookID">978-3-540-75185-4</identifier>
<identifier type="BookChapterCount">26</identifier>
<identifier type="BookVolumeNumber">4690</identifier>
<identifier type="BookSequenceNumber">4690</identifier>
<identifier type="PartChapterCount">3</identifier>
<part>
<date>2007</date>
<detail type="part">
<title>Invited Lectures</title>
</detail>
<detail type="volume">
<number>4690</number>
<caption>vol.</caption>
</detail>
<extent unit="pages">
<start>12</start>
<end>13</end>
</extent>
</part>
<recordInfo>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2007</recordOrigin>
</recordInfo>
</relatedItem>
<relatedItem type="series">
<titleInfo>
<title>Lecture Notes in Computer Science</title>
</titleInfo>
<name type="personal">
<namePart type="given">David</namePart>
<namePart type="family">Hutchison</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Takeo</namePart>
<namePart type="family">Kanade</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Josef</namePart>
<namePart type="family">Kittler</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Jon</namePart>
<namePart type="given">M.</namePart>
<namePart type="family">Kleinberg</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Friedemann</namePart>
<namePart type="family">Mattern</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">John</namePart>
<namePart type="given">C.</namePart>
<namePart type="family">Mitchell</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Moni</namePart>
<namePart type="family">Naor</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Oscar</namePart>
<namePart type="family">Nierstrasz</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">C.</namePart>
<namePart type="family">Pandu Rangan</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Bernhard</namePart>
<namePart type="family">Steffen</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Madhu</namePart>
<namePart type="family">Sudan</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Demetri</namePart>
<namePart type="family">Terzopoulos</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Doug</namePart>
<namePart type="family">Tygar</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Moshe</namePart>
<namePart type="given">Y.</namePart>
<namePart type="family">Vardi</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Gerhard</namePart>
<namePart type="family">Weikum</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<copyrightDate encoding="w3cdtf">2007</copyrightDate>
<issuance>serial</issuance>
</originInfo>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="eISSN">1611-3349</identifier>
<identifier type="SeriesID">558</identifier>
<recordInfo>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2007</recordOrigin>
</recordInfo>
</relatedItem>
<identifier type="istex">674E11A735F9A5B51D1E8967050168105885CA1B</identifier>
<identifier type="DOI">10.1007/978-3-540-75185-4_2</identifier>
<identifier type="ChapterID">2</identifier>
<identifier type="ChapterID">Chap2</identifier>
<accessCondition type="use and reproduction" contentType="copyright">Springer-Verlag Berlin Heidelberg, 2007</accessCondition>
<recordInfo>
<recordContentSource>SPRINGER</recordContentSource>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2007</recordOrigin>
</recordInfo>
</mods>
</metadata>
</istex>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Sarre/explor/MusicSarreV3/Data/Istex/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000A93 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Istex/Corpus/biblio.hfd -nk 000A93 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Sarre
   |area=    MusicSarreV3
   |flux=    Istex
   |étape=   Corpus
   |type=    RBID
   |clé=     ISTEX:674E11A735F9A5B51D1E8967050168105885CA1B
   |texte=   Harvesting and Organizing Knowledge from the Web
}}

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Sun Jul 15 18:16:09 2018. Site generation: Tue Mar 5 19:21:25 2024