Towards Mobile OCR: How To Take a Good Picture of a Document Without Sight.

Internal identifier: 000002 (PubMed/Corpus); previous: 000001; next: 000003

Authors: Michael Cutter; Roberto Manduchi

Source:

Proceedings of the ACM Symposium on Document Engineering. ACM Symposium on Document Engineering

RBID : pubmed:26677461

Abstract

The advent of mobile OCR (optical character recognition) applications on regular smartphones holds great promise for enabling blind people to access printed information. Unfortunately, these systems suffer from a problem: in order for OCR output to be meaningful, a well-framed image of the document needs to be taken, something that is difficult to do without sight. This contribution presents an experimental investigation of how blind people position and orient a camera phone while acquiring document images. We developed experimental software to investigate if verbal guidance aids in the acquisition of OCR-readable images without sight. We report on our participants' feedback and performance before and after assistance from our software.

DOI: 10.1145/2682571.2797066
PubMed: 26677461

Links to Exploration step

pubmed:26677461

The document in XML format

<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">Towards Mobile OCR: How To Take a Good Picture of a Document Without Sight.</title>
<author><name sortKey="Cutter, Michael" sort="Cutter, Michael" uniqKey="Cutter M" first="Michael" last="Cutter">Michael Cutter</name>
<affiliation><nlm:affiliation>University of California, Santa Cruz, mcutter@soe.ucsc.edu.</nlm:affiliation>
</affiliation>
</author>
<author><name sortKey="Manduchi, Roberto" sort="Manduchi, Roberto" uniqKey="Manduchi R" first="Roberto" last="Manduchi">Roberto Manduchi</name>
<affiliation><nlm:affiliation>University of California, Santa Cruz, manduchi@soe.ucsc.edu.</nlm:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PubMed</idno>
<date when="????"><PubDate><MedlineDate>2015</MedlineDate>
</PubDate>
</date>
<idno type="doi">10.1145/2682571.2797066</idno>
<idno type="RBID">pubmed:26677461</idno>
<idno type="pmid">26677461</idno>
<idno type="wicri:Area/PubMed/Corpus">000002</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en">Towards Mobile OCR: How To Take a Good Picture of a Document Without Sight.</title>
<author><name sortKey="Cutter, Michael" sort="Cutter, Michael" uniqKey="Cutter M" first="Michael" last="Cutter">Michael Cutter</name>
<affiliation><nlm:affiliation>University of California, Santa Cruz, mcutter@soe.ucsc.edu.</nlm:affiliation>
</affiliation>
</author>
<author><name sortKey="Manduchi, Roberto" sort="Manduchi, Roberto" uniqKey="Manduchi R" first="Roberto" last="Manduchi">Roberto Manduchi</name>
<affiliation><nlm:affiliation>University of California, Santa Cruz, manduchi@soe.ucsc.edu.</nlm:affiliation>
</affiliation>
</author>
</analytic>
<series><title level="j">Proceedings of the ACM Symposium on Document Engineering. ACM Symposium on Document Engineering</title>
<idno type="ISSN"></idno>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass></textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">The advent of mobile OCR (optical character recognition) applications on regular smartphones holds great promise for enabling blind people to access printed information. Unfortunately, these systems suffer from a problem: in order for OCR output to be meaningful, a well-framed image of the document needs to be taken, something that is difficult to do without sight. This contribution presents an experimental investigation of how blind people position and orient a camera phone while acquiring document images. We developed experimental software to investigate if verbal guidance aids in the acquisition of OCR-readable images without sight. We report on our participant's feedback and performance before and after assistance from our software.</div>
</front>
</TEI>
<pubmed><MedlineCitation Status="Publisher" Owner="NLM"><PMID Version="1">26677461</PMID>
<DateCreated><Year>2015</Year>
<Month>12</Month>
<Day>17</Day>
</DateCreated>
<DateRevised><Year>2015</Year>
<Month>12</Month>
<Day>20</Day>
</DateRevised>
<Article PubModel="Print"><Journal><ISSN IssnType="Print"></ISSN>
<JournalIssue CitedMedium="Print"><Volume>2015</Volume>
<PubDate><MedlineDate>2015</MedlineDate>
</PubDate>
</JournalIssue>
<Title>Proceedings of the ACM Symposium on Document Engineering. ACM Symposium on Document Engineering</Title>
<ISOAbbreviation>Proc ACM Symp Doc Eng</ISOAbbreviation>
</Journal>
<ArticleTitle>Towards Mobile OCR: How To Take a Good Picture of a Document Without Sight.</ArticleTitle>
<Pagination><MedlinePgn>75-84</MedlinePgn>
</Pagination>
<Abstract><AbstractText NlmCategory="UNASSIGNED">The advent of mobile OCR (optical character recognition) applications on regular smartphones holds great promise for enabling blind people to access printed information. Unfortunately, these systems suffer from a problem: in order for OCR output to be meaningful, a well-framed image of the document needs to be taken, something that is difficult to do without sight. This contribution presents an experimental investigation of how blind people position and orient a camera phone while acquiring document images. We developed experimental software to investigate if verbal guidance aids in the acquisition of OCR-readable images without sight. We report on our participant's feedback and performance before and after assistance from our software.</AbstractText>
</Abstract>
<AuthorList><Author><LastName>Cutter</LastName>
<ForeName>Michael</ForeName>
<Initials>M</Initials>
<AffiliationInfo><Affiliation>University of California, Santa Cruz, mcutter@soe.ucsc.edu.</Affiliation>
</AffiliationInfo>
</Author>
<Author><LastName>Manduchi</LastName>
<ForeName>Roberto</ForeName>
<Initials>R</Initials>
<AffiliationInfo><Affiliation>University of California, Santa Cruz, manduchi@soe.ucsc.edu.</Affiliation>
</AffiliationInfo>
</Author>
</AuthorList>
<Language>ENG</Language>
<GrantList><Grant><GrantID>R21 EY025077</GrantID>
<Acronym>EY</Acronym>
<Agency>NEI NIH HHS</Agency>
<Country>United States</Country>
</Grant>
</GrantList>
<PublicationTypeList><PublicationType UI="">JOURNAL ARTICLE</PublicationType>
</PublicationTypeList>
</Article>
<MedlineJournalInfo><MedlineTA>Proc ACM Symp Doc Eng</MedlineTA>
<NlmUniqueID>101672450</NlmUniqueID>
</MedlineJournalInfo>
<KeywordList Owner="NOTNLM"><Keyword MajorTopicYN="N">Design</Keyword>
<Keyword MajorTopicYN="N">Document Processing</Keyword>
<Keyword MajorTopicYN="N">Experimentation</Keyword>
<Keyword MajorTopicYN="N">Human Factors</Keyword>
<Keyword MajorTopicYN="N">Optical Character Recognition</Keyword>
<Keyword MajorTopicYN="N">Visual Impairment</Keyword>
</KeywordList>
</MedlineCitation>
<PubmedData><History><PubMedPubDate PubStatus="entrez"><Year>2015</Year>
<Month>12</Month>
<Day>18</Day>
<Hour>6</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="pubmed"><Year>2015</Year>
<Month>12</Month>
<Day>18</Day>
<Hour>6</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="medline"><Year>2015</Year>
<Month>12</Month>
<Day>18</Day>
<Hour>6</Hour>
<Minute>0</Minute>
</PubMedPubDate>
</History>
<PublicationStatus>ppublish</PublicationStatus>
<ArticleIdList><ArticleId IdType="doi">10.1145/2682571.2797066</ArticleId>
<ArticleId IdType="pubmed">26677461</ArticleId>
<ArticleId IdType="pmc">PMC4677830</ArticleId>
<ArticleId IdType="mid">NIHMS738539</ArticleId>
</ArticleIdList>
<pmc-dir>nihms</pmc-dir>
    </PubmedData>
</pubmed>
</record>
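
As a minimal sketch of how the fields in this record could be read programmatically, the Python snippet below parses a trimmed excerpt of the `<pubmed>` part of the record and pulls out the PMID, title, authors, and DOI. The excerpt, the `PUBMED_XML` constant, and the `summarize` helper are illustrative assumptions, not part of Dilib. Only the `<pubmed>` subtree is parsed because the TEI part uses an `nlm:` prefix with no namespace declaration visible in the record, which a strict XML parser would likely reject.

```python
import xml.etree.ElementTree as ET

# Trimmed excerpt of the <pubmed> element shown above (illustration only).
PUBMED_XML = """<pubmed><MedlineCitation Status="Publisher" Owner="NLM"><PMID Version="1">26677461</PMID>
<Article PubModel="Print">
<ArticleTitle>Towards Mobile OCR: How To Take a Good Picture of a Document Without Sight.</ArticleTitle>
<AuthorList><Author><LastName>Cutter</LastName>
<ForeName>Michael</ForeName>
</Author>
<Author><LastName>Manduchi</LastName>
<ForeName>Roberto</ForeName>
</Author>
</AuthorList>
</Article>
</MedlineCitation>
<PubmedData><ArticleIdList><ArticleId IdType="doi">10.1145/2682571.2797066</ArticleId>
<ArticleId IdType="pubmed">26677461</ArticleId>
</ArticleIdList>
</PubmedData>
</pubmed>"""


def summarize(pubmed_xml: str) -> dict:
    """Return a small dict of bibliographic fields (hypothetical helper)."""
    root = ET.fromstring(pubmed_xml)
    # Build "ForeName LastName" for each <Author>, in document order.
    authors = [
        f"{a.findtext('ForeName')} {a.findtext('LastName')}"
        for a in root.iter("Author")
    ]
    # Map each ArticleId's IdType attribute (doi, pubmed, ...) to its value.
    ids = {a.get("IdType"): a.text for a in root.iter("ArticleId")}
    return {
        "pmid": root.findtext("MedlineCitation/PMID"),
        "title": root.findtext(".//ArticleTitle"),
        "authors": authors,
        "doi": ids.get("doi"),
    }


if __name__ == "__main__":
    for key, value in summarize(PUBMED_XML).items():
        print(f"{key}: {value}")
```

The same function should work on the full `<pubmed>` element once it is cut out of the record, since it relies only on element names that appear there.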

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/PubMed/Corpus

HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000002 | SxmlIndent | more

HfdSelect -h $EXPLOR_AREA/Data/PubMed/Corpus/biblio.hfd -nk 000002 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    PubMed
   |étape=   Corpus
   |type=    RBID
   |clé=     pubmed:26677461
   |texte=   Towards Mobile OCR: How To Take a Good Picture of a Document Without Sight.
}}

To generate wiki pages

HfdIndexSelect -h $EXPLOR_AREA/Data/PubMed/Corpus/RBID.i   -Sk "pubmed:26677461" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/PubMed/Corpus/biblio.hfd   \
       | NlmPubMed2Wicri -a OcrV1

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024

Exploration server on OCR
Warning: this site is under development. It was generated automatically from raw corpora, so the information has not been validated.
