OCR exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Historical author affiliations assist verification of automatically generated MEDLINE citations.

Internal identifier: 000062 (PubMed/Corpus); previous: 000061; next: 000063

Authors: Tehseen F. Sabir; Susan E. Hauser; George R. Thoma

Source: AMIA Annu Symp Proc, 2006, p. 1082

RBID : pubmed:17238701

English descriptors

Abstract

High OCR error rates encountered in author affiliations increase the manual labor needed to verify MEDLINE citations automatically created from scanned journal articles. This is due to poor OCR recognition of the small text and italics frequently used in printed affiliations. Using author-affiliation relationships found in existing MEDLINE records, the SeekAffiliation (SA) program automatically finds potentially correct and complete affiliations, thereby reducing manual effort and increasing the efficiency of creating the citations.
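
The lookup idea described in the abstract can be illustrated with a minimal sketch. This is not the SeekAffiliation implementation, whose details the abstract does not give; the `HISTORY` table, the `seek_affiliation` function, the similarity threshold, and the use of `difflib` string similarity are all assumptions made purely for illustration.

```python
from difflib import SequenceMatcher

# Hypothetical stand-in for author-affiliation pairs taken from existing
# MEDLINE records (assumption: keyed by "LastName Initials").
HISTORY = {
    "Sabir TF": [
        "National Library of Medicine, NIH, DHHS, Bethesda, MD, USA.",
    ],
}


def seek_affiliation(author, ocr_text, history=HISTORY, threshold=0.6):
    """Return historical affiliations for `author`, best match to `ocr_text` first.

    Candidates whose similarity ratio to the OCR output is below `threshold`
    are dropped, so an empty list means no plausible match was found.
    """
    scored = []
    for candidate in history.get(author, []):
        ratio = SequenceMatcher(None, ocr_text.lower(), candidate.lower()).ratio()
        if ratio >= threshold:
            scored.append((ratio, candidate))
    scored.sort(reverse=True)
    return [candidate for _, candidate in scored]


# An OCR result with typical character confusions (hypothetical example).
noisy = "Nationa1 Librarv of Medicine, N1H, DHHS, Bethesda, MD, USA."
suggestions = seek_affiliation("Sabir TF", noisy)
```

In this sketch a human verifier would be offered `suggestions` as candidate replacements for the noisy OCR text, rather than retyping the affiliation by hand.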

PubMed: 17238701

Links to Exploration step

pubmed:17238701

The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Historical author affiliations assist verification of automatically generated MEDLINE citations.</title>
<author>
<name sortKey="Sabir, Tehseen F" sort="Sabir, Tehseen F" uniqKey="Sabir T" first="Tehseen F" last="Sabir">Tehseen F. Sabir</name>
<affiliation>
<nlm:affiliation>National Library of Medicine, NIH, DHHS, Bethesda, MD, USA.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Hauser, Susan E" sort="Hauser, Susan E" uniqKey="Hauser S" first="Susan E" last="Hauser">Susan E. Hauser</name>
</author>
<author>
<name sortKey="Thoma, George R" sort="Thoma, George R" uniqKey="Thoma G" first="George R" last="Thoma">George R. Thoma</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PubMed</idno>
<date when="2006">2006</date>
<idno type="RBID">pubmed:17238701</idno>
<idno type="pmid">17238701</idno>
<idno type="wicri:Area/PubMed/Corpus">000062</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Historical author affiliations assist verification of automatically generated MEDLINE citations.</title>
<author>
<name sortKey="Sabir, Tehseen F" sort="Sabir, Tehseen F" uniqKey="Sabir T" first="Tehseen F" last="Sabir">Tehseen F. Sabir</name>
<affiliation>
<nlm:affiliation>National Library of Medicine, NIH, DHHS, Bethesda, MD, USA.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Hauser, Susan E" sort="Hauser, Susan E" uniqKey="Hauser S" first="Susan E" last="Hauser">Susan E. Hauser</name>
</author>
<author>
<name sortKey="Thoma, George R" sort="Thoma, George R" uniqKey="Thoma G" first="George R" last="Thoma">George R. Thoma</name>
</author>
</analytic>
<series>
<title level="j">AMIA ... Annual Symposium proceedings / AMIA Symposium. AMIA Symposium</title>
<idno type="eISSN">1942-597X</idno>
<imprint>
<date when="2006" type="published">2006</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Authorship</term>
<term>Automatic Data Processing</term>
<term>MEDLINE</term>
</keywords>
<keywords scheme="MESH" xml:lang="en">
<term>Authorship</term>
<term>Automatic Data Processing</term>
<term>MEDLINE</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">High OCR error rates encountered in author affiliations increase the manual labor needed to verify MEDLINE citations automatically created from scanned journal articles. This is due to poor OCR recognition of the small text and italics frequently used in printed affiliations. Using author-affiliation relationships found in existing MEDLINE records, the SeekAffiliation (SA) program automatically finds potentially correct and complete affiliations, thereby reducing manual effort and increasing the efficiency of creating the citations.</div>
</front>
</TEI>
<pubmed>
<MedlineCitation Owner="NLM" Status="MEDLINE">
<PMID Version="1">17238701</PMID>
<DateCreated>
<Year>2007</Year>
<Month>01</Month>
<Day>22</Day>
</DateCreated>
<DateCompleted>
<Year>2007</Year>
<Month>09</Month>
<Day>28</Day>
</DateCompleted>
<DateRevised>
<Year>2009</Year>
<Month>03</Month>
<Day>09</Day>
</DateRevised>
<Article PubModel="Print">
<Journal>
<ISSN IssnType="Electronic">1942-597X</ISSN>
<JournalIssue CitedMedium="Internet">
<PubDate>
<Year>2006</Year>
</PubDate>
</JournalIssue>
<Title>AMIA ... Annual Symposium proceedings / AMIA Symposium. AMIA Symposium</Title>
<ISOAbbreviation>AMIA Annu Symp Proc</ISOAbbreviation>
</Journal>
<ArticleTitle>Historical author affiliations assist verification of automatically generated MEDLINE citations.</ArticleTitle>
<Pagination>
<MedlinePgn>1082</MedlinePgn>
</Pagination>
<Abstract>
<AbstractText>High OCR error rates encountered in author affiliations increase the manual labor needed to verify MEDLINE citations automatically created from scanned journal articles. This is due to poor OCR recognition of the small text and italics frequently used in printed affiliations. Using author-affiliation relationships found in existing MEDLINE records, the SeekAffiliation (SA) program automatically finds potentially correct and complete affiliations, thereby reducing manual effort and increasing the efficiency of creating the citations.</AbstractText>
</Abstract>
<AuthorList CompleteYN="Y">
<Author ValidYN="Y">
<LastName>Sabir</LastName>
<ForeName>Tehseen F</ForeName>
<Initials>TF</Initials>
<AffiliationInfo>
<Affiliation>National Library of Medicine, NIH, DHHS, Bethesda, MD, USA.</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y">
<LastName>Hauser</LastName>
<ForeName>Susan E</ForeName>
<Initials>SE</Initials>
</Author>
<Author ValidYN="Y">
<LastName>Thoma</LastName>
<ForeName>George R</ForeName>
<Initials>GR</Initials>
</Author>
</AuthorList>
<Language>eng</Language>
<PublicationTypeList>
<PublicationType UI="D016428">Journal Article</PublicationType>
</PublicationTypeList>
</Article>
<MedlineJournalInfo>
<Country>United States</Country>
<MedlineTA>AMIA Annu Symp Proc</MedlineTA>
<NlmUniqueID>101209213</NlmUniqueID>
<ISSNLinking>1559-4076</ISSNLinking>
</MedlineJournalInfo>
<CitationSubset>IM</CitationSubset>
<MeshHeadingList>
<MeshHeading>
<DescriptorName MajorTopicYN="N" UI="D001319">Authorship</DescriptorName>
</MeshHeading>
<MeshHeading>
<DescriptorName MajorTopicYN="Y" UI="D001330">Automatic Data Processing</DescriptorName>
</MeshHeading>
<MeshHeading>
<DescriptorName MajorTopicYN="Y" UI="D016239">MEDLINE</DescriptorName>
</MeshHeading>
</MeshHeadingList>
<OtherID Source="NLM">PMC1839323</OtherID>
</MedlineCitation>
<PubmedData>
<History>
<PubMedPubDate PubStatus="pubmed">
<Year>2007</Year>
<Month>1</Month>
<Day>24</Day>
<Hour>9</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="medline">
<Year>2007</Year>
<Month>9</Month>
<Day>29</Day>
<Hour>9</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="entrez">
<Year>2007</Year>
<Month>1</Month>
<Day>24</Day>
<Hour>9</Hour>
<Minute>0</Minute>
</PubMedPubDate>
</History>
<PublicationStatus>ppublish</PublicationStatus>
<ArticleIdList>
<ArticleId IdType="pii">85991</ArticleId>
<ArticleId IdType="pubmed">17238701</ArticleId>
<ArticleId IdType="pmc">PMC1839323</ArticleId>
</ArticleIdList>
</PubmedData>
</pubmed>
</record>
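
The `<pubmed>` part of the record above can be read with a standard XML parser. The sketch below is not part of Dilib; it uses Python's `xml.etree.ElementTree` on a shortened copy of the `<MedlineCitation>` element, and the `summarize` helper is a name chosen for illustration. Note that the TEI part uses an `nlm:` prefix with no namespace declaration in the record as shown, so a namespace-aware parser would likely reject the full record unless that prefix is declared or stripped first.

```python
import xml.etree.ElementTree as ET

# Shortened copy of the <MedlineCitation> element from the record above.
FRAGMENT = """
<MedlineCitation Owner="NLM" Status="MEDLINE">
<PMID Version="1">17238701</PMID>
<Article PubModel="Print">
<ArticleTitle>Historical author affiliations assist verification of automatically generated MEDLINE citations.</ArticleTitle>
<AuthorList CompleteYN="Y">
<Author ValidYN="Y">
<LastName>Sabir</LastName>
<ForeName>Tehseen F</ForeName>
<Initials>TF</Initials>
<AffiliationInfo>
<Affiliation>National Library of Medicine, NIH, DHHS, Bethesda, MD, USA.</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y">
<LastName>Hauser</LastName>
<ForeName>Susan E</ForeName>
<Initials>SE</Initials>
</Author>
</AuthorList>
</Article>
<MeshHeadingList>
<MeshHeading>
<DescriptorName MajorTopicYN="Y" UI="D016239">MEDLINE</DescriptorName>
</MeshHeading>
</MeshHeadingList>
</MedlineCitation>
"""


def summarize(xml_text):
    """Extract the PMID, title, authors and major MeSH topics from a citation."""
    root = ET.fromstring(xml_text.strip())
    authors = []
    for author in root.iter("Author"):
        name = f"{author.findtext('LastName')} {author.findtext('Initials')}"
        # findtext returns None when an author has no <AffiliationInfo>.
        affiliation = author.findtext("AffiliationInfo/Affiliation")
        authors.append((name, affiliation))
    return {
        "pmid": root.findtext("PMID"),
        "title": root.findtext("Article/ArticleTitle"),
        "authors": authors,
        "major_topics": [
            d.text for d in root.iter("DescriptorName")
            if d.get("MajorTopicYN") == "Y"
        ],
    }


summary = summarize(FRAGMENT)
```

As in the full record, only the first author carries an affiliation, so the second author's affiliation comes back as `None`.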

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/PubMed/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000062 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/PubMed/Corpus/biblio.hfd -nk 000062 | SxmlIndent | more

To link to this page from within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    PubMed
   |étape=   Corpus
   |type=    RBID
   |clé=     pubmed:17238701
   |texte=   Historical author affiliations assist verification of automatically generated MEDLINE citations.
}}

To generate wiki pages

HfdIndexSelect -h $EXPLOR_AREA/Data/PubMed/Corpus/RBID.i   -Sk "pubmed:17238701" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/PubMed/Corpus/biblio.hfd   \
       | NlmPubMed2Wicri -a OcrV1 

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024