OCR Exploration Server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Cost analysis of a project to digitize classic articles in neurosurgery.

Internal identifier: 000069 (PubMed/Checkpoint); previous: 000068; next: 000070

Authors: Kathleen Bauer [United States]

Source: Journal of the Medical Library Association : JMLA [ISSN 1536-5050]; 2002

RBID : pubmed:11999182

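English descriptors

Automatic Data Processing (economics); Connecticut; Cost-Benefit Analysis; History, 19th Century; History, 20th Century; Library Automation (economics); Library Collection Development (economics); Neurosurgery (history); Periodicals as Topic (history)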

Abstract

In summer 2000, the Cushing/Whitney Medical Library at Yale University began a demonstration project to digitize classic articles in neurosurgery from the late 1800s and early 1900s. The objective of the first phase of the project was to measure the time and costs involved in digitization, and those results are reported here. In the second phase, metadata will be added to the digitized articles, and the project will be publicized. Thirteen articles were scanned using optical character recognition (OCR) software, and the resulting text files were carefully proofread. Time for photocopying, scanning, and proofreading were recorded. This project achieved an average cost per item (total pages plus images) of $4.12, a figure at the high end of average costs found in other studies. This project experienced high costs for two reasons. First, the articles contained many images, which required extra processing. Second, the older fonts and the poor condition of many of these articles complicated the OCR process. The average article cost $84.46 to digitize. Although costs were high, the selection of historically important articles maximized the benefit gained from the investment in digitization.
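As a rough cross-check of the figures quoted in the abstract, the following Python sketch derives two quantities that the abstract does not state: the total cost implied for the thirteen articles, and the average number of items (pages plus images) per article implied by the two averages. Both derived values are assumptions computed from rounded averages, and the ratio only holds if both averages were taken over the same total cost; they are not results reported by the study.

```python
# Figures quoted in the abstract.
ARTICLES = 13
COST_PER_ARTICLE = 84.46   # average cost per article, USD
COST_PER_ITEM = 4.12       # average cost per item (page or image), USD


def implied_total_cost(articles: int, cost_per_article: float) -> float:
    """Total spend implied by the per-article average (not stated in the abstract)."""
    return round(articles * cost_per_article, 2)


def implied_items_per_article(cost_per_article: float, cost_per_item: float) -> float:
    """Average items per article implied by dividing the two reported averages."""
    return round(cost_per_article / cost_per_item, 1)


total = implied_total_cost(ARTICLES, COST_PER_ARTICLE)
items = implied_items_per_article(COST_PER_ARTICLE, COST_PER_ITEM)
print(f"Implied total cost: ${total:.2f}")
print(f"Implied items per article: {items}")
```

Under these assumptions, each article averaged roughly twenty items, which is consistent with the abstract's remark that the articles contained many images.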

PubMed: 11999182


The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Cost analysis of a project to digitize classic articles in neurosurgery.</title>
<author>
<name sortKey="Bauer, Kathleen" sort="Bauer, Kathleen" uniqKey="Bauer K" first="Kathleen" last="Bauer">Kathleen Bauer</name>
<affiliation wicri:level="1">
<nlm:affiliation>Yale School of Nursing, USA. kathleen.bauer@yale.edu</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Yale School of Nursing</wicri:regionArea>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PubMed</idno>
<date when="2002">2002</date>
<idno type="RBID">pubmed:11999182</idno>
<idno type="pmid">11999182</idno>
<idno type="wicri:Area/PubMed/Corpus">000075</idno>
<idno type="wicri:Area/PubMed/Curation">000075</idno>
<idno type="wicri:Area/PubMed/Checkpoint">000075</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Cost analysis of a project to digitize classic articles in neurosurgery.</title>
<author>
<name sortKey="Bauer, Kathleen" sort="Bauer, Kathleen" uniqKey="Bauer K" first="Kathleen" last="Bauer">Kathleen Bauer</name>
<affiliation wicri:level="1">
<nlm:affiliation>Yale School of Nursing, USA. kathleen.bauer@yale.edu</nlm:affiliation>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Yale School of Nursing</wicri:regionArea>
</affiliation>
</author>
</analytic>
<series>
<title level="j">Journal of the Medical Library Association : JMLA</title>
<idno type="ISSN">1536-5050</idno>
<imprint>
<date when="2002" type="published">2002</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Automatic Data Processing (economics)</term>
<term>Connecticut</term>
<term>Cost-Benefit Analysis</term>
<term>History, 19th Century</term>
<term>History, 20th Century</term>
<term>Library Automation (economics)</term>
<term>Library Collection Development (economics)</term>
<term>Neurosurgery (history)</term>
<term>Periodicals as Topic (history)</term>
</keywords>
<keywords scheme="MESH" type="geographic" xml:lang="en">
<term>Connecticut</term>
</keywords>
<keywords scheme="MESH" qualifier="economics" xml:lang="en">
<term>Automatic Data Processing</term>
<term>Library Automation</term>
<term>Library Collection Development</term>
</keywords>
<keywords scheme="MESH" qualifier="history" xml:lang="en">
<term>Neurosurgery</term>
<term>Periodicals as Topic</term>
</keywords>
<keywords scheme="MESH" xml:lang="en">
<term>Cost-Benefit Analysis</term>
<term>History, 19th Century</term>
<term>History, 20th Century</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">In summer 2000, the Cushing/Whitney Medical Library at Yale University began a demonstration project to digitize classic articles in neurosurgery from the late 1800s and early 1900s. The objective of the first phase of the project was to measure the time and costs involved in digitization, and those results are reported here. In the second phase, metadata will be added to the digitized articles, and the project will be publicized. Thirteen articles were scanned using optical character recognition (OCR) software, and the resulting text files were carefully proofread. Time for photocopying, scanning, and proofreading were recorded. This project achieved an average cost per item (total pages plus images) of $4.12, a figure at the high end of average costs found in other studies. This project experienced high costs for two reasons. First, the articles contained many images, which required extra processing. Second, the older fonts and the poor condition of many of these articles complicated the OCR process. The average article cost $84.46 to digitize. Although costs were high, the selection of historically important articles maximized the benefit gained from the investment in digitization.</div>
</front>
</TEI>
<pubmed>
<MedlineCitation Owner="NLM" Status="MEDLINE">
<PMID Version="1">11999182</PMID>
<DateCreated>
<Year>2002</Year>
<Month>05</Month>
<Day>09</Day>
</DateCreated>
<DateCompleted>
<Year>2002</Year>
<Month>10</Month>
<Day>22</Day>
</DateCompleted>
<DateRevised>
<Year>2014</Year>
<Month>06</Month>
<Day>12</Day>
</DateRevised>
<Article PubModel="Print">
<Journal>
<ISSN IssnType="Print">1536-5050</ISSN>
<JournalIssue CitedMedium="Print">
<Volume>90</Volume>
<Issue>2</Issue>
<PubDate>
<Year>2002</Year>
<Month>Apr</Month>
</PubDate>
</JournalIssue>
<Title>Journal of the Medical Library Association : JMLA</Title>
<ISOAbbreviation>J Med Libr Assoc</ISOAbbreviation>
</Journal>
<ArticleTitle>Cost analysis of a project to digitize classic articles in neurosurgery.</ArticleTitle>
<Pagination>
<MedlinePgn>230-4</MedlinePgn>
</Pagination>
<Abstract>
<AbstractText>In summer 2000, the Cushing/Whitney Medical Library at Yale University began a demonstration project to digitize classic articles in neurosurgery from the late 1800s and early 1900s. The objective of the first phase of the project was to measure the time and costs involved in digitization, and those results are reported here. In the second phase, metadata will be added to the digitized articles, and the project will be publicized. Thirteen articles were scanned using optical character recognition (OCR) software, and the resulting text files were carefully proofread. Time for photocopying, scanning, and proofreading were recorded. This project achieved an average cost per item (total pages plus images) of $4.12, a figure at the high end of average costs found in other studies. This project experienced high costs for two reasons. First, the articles contained many images, which required extra processing. Second, the older fonts and the poor condition of many of these articles complicated the OCR process. The average article cost $84.46 to digitize. Although costs were high, the selection of historically important articles maximized the benefit gained from the investment in digitization.</AbstractText>
</Abstract>
<AuthorList CompleteYN="Y">
<Author ValidYN="Y">
<LastName>Bauer</LastName>
<ForeName>Kathleen</ForeName>
<Initials>K</Initials>
<AffiliationInfo>
<Affiliation>Yale School of Nursing, USA. kathleen.bauer@yale.edu</Affiliation>
</AffiliationInfo>
</Author>
</AuthorList>
<Language>eng</Language>
<PublicationTypeList>
<PublicationType UI="D016456">Historical Article</PublicationType>
<PublicationType UI="D016428">Journal Article</PublicationType>
<PublicationType UI="D013485">Research Support, Non-U.S. Gov't</PublicationType>
</PublicationTypeList>
</Article>
<MedlineJournalInfo>
<Country>United States</Country>
<MedlineTA>J Med Libr Assoc</MedlineTA>
<NlmUniqueID>101132728</NlmUniqueID>
<ISSNLinking>1536-5050</ISSNLinking>
</MedlineJournalInfo>
<CitationSubset>IM</CitationSubset>
<CommentsCorrectionsList>
<CommentsCorrections RefType="Cites">
<RefSource>Bull Med Libr Assoc. 2001 Jan;89(1):71-5</RefSource>
<PMID Version="1">11209804</PMID>
</CommentsCorrections>
<CommentsCorrections RefType="Cites">
<RefSource>Bull Med Libr Assoc. 1997 Oct;85(4):402-10</RefSource>
<PMID Version="1">9431430</PMID>
</CommentsCorrections>
</CommentsCorrectionsList>
<MeshHeadingList>
<MeshHeading>
<DescriptorName MajorTopicYN="N" UI="D001330">Automatic Data Processing</DescriptorName>
<QualifierName MajorTopicYN="Y" UI="Q000191">economics</QualifierName>
</MeshHeading>
<MeshHeading>
<DescriptorName MajorTopicYN="N" Type="Geographic" UI="D003237">Connecticut</DescriptorName>
</MeshHeading>
<MeshHeading>
<DescriptorName MajorTopicYN="N" UI="D003362">Cost-Benefit Analysis</DescriptorName>
</MeshHeading>
<MeshHeading>
<DescriptorName MajorTopicYN="N" UI="D049672">History, 19th Century</DescriptorName>
</MeshHeading>
<MeshHeading>
<DescriptorName MajorTopicYN="N" UI="D049673">History, 20th Century</DescriptorName>
</MeshHeading>
<MeshHeading>
<DescriptorName MajorTopicYN="N" UI="D016242">Library Automation</DescriptorName>
<QualifierName MajorTopicYN="Y" UI="Q000191">economics</QualifierName>
</MeshHeading>
<MeshHeading>
<DescriptorName MajorTopicYN="N" UI="D016243">Library Collection Development</DescriptorName>
<QualifierName MajorTopicYN="Y" UI="Q000191">economics</QualifierName>
</MeshHeading>
<MeshHeading>
<DescriptorName MajorTopicYN="N" UI="D009493">Neurosurgery</DescriptorName>
<QualifierName MajorTopicYN="N" UI="Q000266">history</QualifierName>
</MeshHeading>
<MeshHeading>
<DescriptorName MajorTopicYN="N" UI="D010506">Periodicals as Topic</DescriptorName>
<QualifierName MajorTopicYN="N" UI="Q000266">history</QualifierName>
</MeshHeading>
</MeshHeadingList>
<OtherID Source="NLM">PMC100769</OtherID>
</MedlineCitation>
<PubmedData>
<History>
<PubMedPubDate PubStatus="pubmed">
<Year>2002</Year>
<Month>5</Month>
<Day>10</Day>
<Hour>10</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="medline">
<Year>2002</Year>
<Month>10</Month>
<Day>31</Day>
<Hour>4</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="entrez">
<Year>2002</Year>
<Month>5</Month>
<Day>10</Day>
<Hour>10</Hour>
<Minute>0</Minute>
</PubMedPubDate>
</History>
<PublicationStatus>ppublish</PublicationStatus>
<ArticleIdList>
<ArticleId IdType="pubmed">11999182</ArticleId>
<ArticleId IdType="pmc">PMC100769</ArticleId>
</ArticleIdList>
</PubmedData>
</pubmed>
<affiliations>
<list>
<country>
<li>États-Unis</li>
</country>
</list>
<tree>
<country name="États-Unis">
<noRegion>
<name sortKey="Bauer, Kathleen" sort="Bauer, Kathleen" uniqKey="Bauer K" first="Kathleen" last="Bauer">Kathleen Bauer</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>
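For readers who want to pull a few fields out of the record above without Dilib, here is a minimal Python sketch using only the standard library. It works on a short excerpt of the `<pubmed>` part of the record; the function name `summarize` and the excerpt constant are illustrative and not part of Dilib or Wicri. The full record also uses `wicri:` and `nlm:` prefixes that are not declared in this dump, so a strict XML parser would likely reject it as-is.

```python
import xml.etree.ElementTree as ET

# Short excerpt of the <pubmed> part of the record (copied from the page).
PUBMED_EXCERPT = """
<pubmed>
  <MedlineCitation Owner="NLM" Status="MEDLINE">
    <PMID Version="1">11999182</PMID>
    <Article PubModel="Print">
      <ArticleTitle>Cost analysis of a project to digitize classic articles in neurosurgery.</ArticleTitle>
    </Article>
    <MeshHeadingList>
      <MeshHeading>
        <DescriptorName MajorTopicYN="N" UI="D001330">Automatic Data Processing</DescriptorName>
        <QualifierName MajorTopicYN="Y" UI="Q000191">economics</QualifierName>
      </MeshHeading>
      <MeshHeading>
        <DescriptorName MajorTopicYN="N" UI="D003362">Cost-Benefit Analysis</DescriptorName>
      </MeshHeading>
    </MeshHeadingList>
  </MedlineCitation>
</pubmed>
"""


def summarize(xml_text: str) -> dict:
    """Return the PMID, title, and MeSH headings found in a <pubmed> fragment."""
    root = ET.fromstring(xml_text.strip())
    citation = root.find("MedlineCitation")
    headings = []
    for heading in citation.iter("MeshHeading"):
        descriptor = heading.findtext("DescriptorName")
        qualifiers = [q.text for q in heading.findall("QualifierName")]
        # Render "Descriptor (qualifier)" in the same style as the KwdEn terms.
        label = f"{descriptor} ({', '.join(qualifiers)})" if qualifiers else descriptor
        headings.append(label)
    return {
        "pmid": citation.findtext("PMID"),
        "title": citation.findtext("Article/ArticleTitle"),
        "mesh": headings,
    }


summary = summarize(PUBMED_EXCERPT)
print(summary)
```

The "Descriptor (qualifier)" rendering mirrors how the `KwdEn` terms in the TEI header combine each MeSH descriptor with its qualifier.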

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/PubMed/Checkpoint
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000069 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/PubMed/Checkpoint/biblio.hfd -nk 000069 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    PubMed
   |étape=   Checkpoint
   |type=    RBID
   |clé=     pubmed:11999182
   |texte=   Cost analysis of a project to digitize classic articles in neurosurgery.
}}
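When many records need such a link, the template call can be assembled programmatically. The sketch below is one hedged way to do so in Python: the function name `explor_link` and its argument names are hypothetical, and only the template name and its parameter names (`wiki`, `area`, `flux`, `étape`, `type`, `clé`, `texte`) are taken from the example above.

```python
def explor_link(wiki: str, area: str, flux: str, step: str,
                key_type: str, key: str, text: str) -> str:
    """Build an {{Explor lien}} template call laid out like the example above."""
    fields = [
        ("wiki", wiki),
        ("area", area),
        ("flux", flux),
        ("étape", step),
        ("type", key_type),
        ("clé", key),
        ("texte", text),
    ]
    lines = ["{{Explor lien"]
    for name, value in fields:
        # Pad "name=" to nine characters so the values line up in one column.
        lines.append(f"   |{name + '=':<9}{value}")
    lines.append("}}")
    return "\n".join(lines)


link = explor_link(
    wiki="Ticri/CIDE",
    area="OcrV1",
    flux="PubMed",
    step="Checkpoint",
    key_type="RBID",
    key="pubmed:11999182",
    text="Cost analysis of a project to digitize classic articles in neurosurgery.",
)
print(link)
```

The column alignment is cosmetic and only reproduces the layout of the example.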

To generate wiki pages

HfdIndexSelect -h $EXPLOR_AREA/Data/PubMed/Checkpoint/RBID.i   -Sk "pubmed:11999182" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/PubMed/Checkpoint/biblio.hfd   \
       | NlmPubMed2Wicri -a OcrV1 

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024