Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Google Book Search and Metadata

Identifieur interne : 000115 ( PascalFrancis/Checkpoint ); précédent : 000114; suivant : 000116

Google Book Search and Metadata

Auteurs : Julia T. Pope [États-Unis] ; Robert P. Holley [États-Unis]

Source :

RBID : Pascal:11-0227997

Descripteurs français

English descriptors

Abstract

This article summarizes published documents on metadata provided by Google for books scanned as part of the Google Book Search (GBS) project and provides suggestions for improvement. The faulty, misleading, and confusing metadata in current Google records can pose potentially serious problems for users of GBS. Google admits that it took data, which proved to be inaccurate, from many sources and is attempting to correct errors. Some argue that metadata is not needed with keyword searching; but optical character recognition (OCR) errors, synonym control, and materials in foreign languages make reliable metadata a requirement for academic researchers. The authors recommend that users should be able to submit error reports to Google to correct faulty metadata.


Affiliations:


Links toward previous steps (curation, corpus...)


Links to Exploration step

Pascal:11-0227997

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Google Book Search and Metadata</title>
<author>
<name sortKey="Pope, Julia T" sort="Pope, Julia T" uniqKey="Pope J" first="Julia T." last="Pope">Julia T. Pope</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>School of Library & Information Science, Wayne State University</s1>
<s2>Detroit, Michigan</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Michigan</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Holley, Robert P" sort="Holley, Robert P" uniqKey="Holley R" first="Robert P." last="Holley">Robert P. Holley</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>School of Library & Information Science, Wayne State University</s1>
<s2>Detroit, Michigan</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Michigan</region>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">11-0227997</idno>
<date when="2011">2011</date>
<idno type="stanalyst">PASCAL 11-0227997 INIST</idno>
<idno type="RBID">Pascal:11-0227997</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000139</idno>
<idno type="stanalyst">FRANCIS 11-0227997 INIST</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000155</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000634</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000115</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Google Book Search and Metadata</title>
<author>
<name sortKey="Pope, Julia T" sort="Pope, Julia T" uniqKey="Pope J" first="Julia T." last="Pope">Julia T. Pope</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>School of Library & Information Science, Wayne State University</s1>
<s2>Detroit, Michigan</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Michigan</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Holley, Robert P" sort="Holley, Robert P" uniqKey="Holley R" first="Robert P." last="Holley">Robert P. Holley</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>School of Library & Information Science, Wayne State University</s1>
<s2>Detroit, Michigan</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Michigan</region>
</placeName>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">Cataloging & classification quarterly</title>
<title level="j" type="abbreviated">Cat. classif. q.</title>
<idno type="ISSN">0163-9374</idno>
<imprint>
<date when="2011">2011</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Cataloging & classification quarterly</title>
<title level="j" type="abbreviated">Cat. classif. q.</title>
<idno type="ISSN">0163-9374</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Electronic book</term>
<term>Information retrieval system</term>
<term>Metadata</term>
<term>Search engine</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Système de recherche d'information</term>
<term>Moteur recherche</term>
<term>Livre électronique</term>
<term>Métadonnée</term>
<term>Google book search</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">This article summarizes published documents on metadata provided by Google for books scanned as part of the Google Book Search (GBS) project and provides suggestions for improvement. The faulty, misleading, and confusing metadata in current Google records can pose potentially serious problems for users of GBS. Google admits that it took data, which proved to be inaccurate, from many sources and is attempting to correct errors. Some argue that metadata is not needed with keyword searching; but optical character recognition (OCR) errors, synonym control, and materials in foreign languages make reliable metadata a requirement for academic researchers. The authors recommend that users should be able to submit error reports to Google to correct faulty metadata.</div>
</front>
</TEI>
<inist>
<standard h6="B">
<pA>
<fA01 i1="01" i2="1">
<s0>0163-9374</s0>
</fA01>
<fA02 i1="01">
<s0>CCQUDB</s0>
</fA02>
<fA03 i2="1">
<s0>Cat. classif. q.</s0>
</fA03>
<fA05>
<s2>49</s2>
</fA05>
<fA06>
<s2>1</s2>
</fA06>
<fA08 i1="01" i2="1" l="ENG">
<s1>Google Book Search and Metadata</s1>
</fA08>
<fA11 i1="01" i2="1">
<s1>POPE (Julia T.)</s1>
</fA11>
<fA11 i1="02" i2="1">
<s1>HOLLEY (Robert P.)</s1>
</fA11>
<fA14 i1="01">
<s1>School of Library & Information Science, Wayne State University</s1>
<s2>Detroit, Michigan</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</fA14>
<fA20>
<s1>1-13</s1>
</fA20>
<fA21>
<s1>2011</s1>
</fA21>
<fA23 i1="01">
<s0>ENG</s0>
</fA23>
<fA43 i1="01">
<s1>INIST</s1>
<s2>18351</s2>
<s5>354000194479460010</s5>
</fA43>
<fA44>
<s0>0000</s0>
<s1>© 2011 INIST-CNRS. All rights reserved.</s1>
</fA44>
<fA47 i1="01" i2="1">
<s0>11-0227997</s0>
</fA47>
<fA60>
<s1>P</s1>
</fA60>
<fA61>
<s0>A</s0>
</fA61>
<fA64 i1="01" i2="1">
<s0>Cataloging & classification quarterly</s0>
</fA64>
<fA66 i1="01">
<s0>USA</s0>
</fA66>
<fC01 i1="01" l="ENG">
<s0>This article summarizes published documents on metadata provided by Google for books scanned as part of the Google Book Search (GBS) project and provides suggestions for improvement. The faulty, misleading, and confusing metadata in current Google records can pose potentially serious problems for users of GBS. Google admits that it took data, which proved to be inaccurate, from many sources and is attempting to correct errors. Some argue that metadata is not needed with keyword searching; but optical character recognition (OCR) errors, synonym control, and materials in foreign languages make reliable metadata a requirement for academic researchers. The authors recommend that users should be able to submit error reports to Google to correct faulty metadata.</s0>
</fC01>
<fC02 i1="01" i2="X">
<s0>001A01E02A</s0>
</fC02>
<fC03 i1="01" i2="X" l="FRE">
<s0>Système de recherche d'information</s0>
<s2>563</s2>
<s5>04</s5>
</fC03>
<fC03 i1="01" i2="X" l="ENG">
<s0>Information retrieval system</s0>
<s2>563</s2>
<s5>04</s5>
</fC03>
<fC03 i1="01" i2="X" l="SPA">
<s0>Sistema de recuperación de información</s0>
<s2>563</s2>
<s5>04</s5>
</fC03>
<fC03 i1="02" i2="X" l="FRE">
<s0>Moteur recherche</s0>
<s5>05</s5>
</fC03>
<fC03 i1="02" i2="X" l="ENG">
<s0>Search engine</s0>
<s5>05</s5>
</fC03>
<fC03 i1="02" i2="X" l="SPA">
<s0>Buscador</s0>
<s5>05</s5>
</fC03>
<fC03 i1="03" i2="X" l="FRE">
<s0>Livre électronique</s0>
<s2>NI</s2>
<s5>06</s5>
</fC03>
<fC03 i1="03" i2="X" l="ENG">
<s0>Electronic book</s0>
<s2>NI</s2>
<s5>06</s5>
</fC03>
<fC03 i1="03" i2="X" l="SPA">
<s0>Libro electrónico</s0>
<s2>NI</s2>
<s5>06</s5>
</fC03>
<fC03 i1="04" i2="X" l="FRE">
<s0>Métadonnée</s0>
<s5>07</s5>
</fC03>
<fC03 i1="04" i2="X" l="ENG">
<s0>Metadata</s0>
<s5>07</s5>
</fC03>
<fC03 i1="04" i2="X" l="SPA">
<s0>Metadatos</s0>
<s5>07</s5>
</fC03>
<fC03 i1="05" i2="X" l="FRE">
<s0>Google book search</s0>
<s4>INC</s4>
<s5>27</s5>
</fC03>
<fN21>
<s1>150</s1>
</fN21>
</pA>
</standard>
</inist>
<affiliations>
<list>
<country>
<li>États-Unis</li>
</country>
<region>
<li>Michigan</li>
</region>
</list>
<tree>
<country name="États-Unis">
<region name="Michigan">
<name sortKey="Pope, Julia T" sort="Pope, Julia T" uniqKey="Pope J" first="Julia T." last="Pope">Julia T. Pope</name>
</region>
<name sortKey="Holley, Robert P" sort="Holley, Robert P" uniqKey="Holley R" first="Robert P." last="Holley">Robert P. Holley</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/PascalFrancis/Checkpoint
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000115 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Checkpoint/biblio.hfd -nk 000115 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    PascalFrancis
   |étape=   Checkpoint
   |type=    RBID
   |clé=     Pascal:11-0227997
   |texte=   Google Book Search and Metadata
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024