
Towards Mobile OCR: How To Take a Good Picture of a Document Without Sight

Internal identifier: 000237 (Pmc/Checkpoint); previous: 000236; next: 000238

Authors: Michael Cutter; Roberto Manduchi

Source:

Proceedings of the ACM Symposium on Document Engineering. ACM Symposium on Document Engineering ; 2015.

RBID : PMC:4677830

Abstract

The advent of mobile OCR (optical character recognition) applications on regular smartphones holds great promise for enabling blind people to access printed information. Unfortunately, these systems suffer from a problem: in order for OCR output to be meaningful, a well-framed image of the document needs to be taken, something that is difficult to do without sight. This contribution presents an experimental investigation of how blind people position and orient a camera phone while acquiring document images. We developed experimental software to investigate if verbal guidance aids in the acquisition of OCR-readable images without sight. We report on our participant's feedback and performance before and after assistance from our software.

Url:

http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4677830

DOI: 10.1145/2682571.2797066
PubMed: 26677461
PubMed Central: 4677830

Affiliations:

Links toward previous steps (curation, corpus...)

to stream Pmc, to step Corpus: 001894
to stream Pmc, to step Curation: 001894

Links to Exploration step

PMC:4677830

The document in XML format

<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">Towards Mobile OCR: How To Take a Good Picture of a Document Without Sight</title>
<author><name sortKey="Cutter, Michael" sort="Cutter, Michael" uniqKey="Cutter M" first="Michael" last="Cutter">Michael Cutter</name>
</author>
<author><name sortKey="Manduchi, Roberto" sort="Manduchi, Roberto" uniqKey="Manduchi R" first="Roberto" last="Manduchi">Roberto Manduchi</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PMC</idno>
<idno type="pmid">26677461</idno>
<idno type="pmc">4677830</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4677830</idno>
<idno type="RBID">PMC:4677830</idno>
<idno type="doi">10.1145/2682571.2797066</idno>
<date when="2015">2015</date>
<idno type="wicri:Area/Pmc/Corpus">001894</idno>
<idno type="wicri:Area/Pmc/Curation">001894</idno>
<idno type="wicri:Area/Pmc/Checkpoint">000237</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a" type="main">Towards Mobile OCR: How To Take a Good Picture of a Document Without Sight</title>
<author><name sortKey="Cutter, Michael" sort="Cutter, Michael" uniqKey="Cutter M" first="Michael" last="Cutter">Michael Cutter</name>
</author>
<author><name sortKey="Manduchi, Roberto" sort="Manduchi, Roberto" uniqKey="Manduchi R" first="Roberto" last="Manduchi">Roberto Manduchi</name>
</author>
</analytic>
<series><title level="j">Proceedings of the ACM Symposium on Document Engineering. ACM Symposium on Document Engineering</title>
<imprint><date when="2015">2015</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass></textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en"><p id="P1">The advent of mobile OCR (optical character recognition) applications on regular smartphones holds great promise for enabling blind people to access printed information. Unfortunately, these systems suffer from a problem: in order for OCR output to be meaningful, a well-framed image of the document needs to be taken, something that is difficult to do without sight. This contribution presents an experimental investigation of how blind people position and orient a camera phone while acquiring document images. We developed experimental software to investigate if verbal guidance aids in the acquisition of OCR-readable images without sight. We report on our participant's feedback and performance before and after assistance from our software.</p>
</div>
</front>
</TEI>
<pmc article-type="research-article"><pmc-comment>The publisher of this article does not allow downloading of the full text in XML form.</pmc-comment>
  <pmc-dir>properties manuscript</pmc-dir>
  <front><journal-meta><journal-id journal-id-type="nlm-journal-id">101672450</journal-id>
<journal-id journal-id-type="pubmed-jr-id">44643</journal-id>
<journal-id journal-id-type="nlm-ta">Proc ACM Symp Doc Eng</journal-id>
<journal-title-group><journal-title>Proceedings of the ACM Symposium on Document Engineering. ACM Symposium on Document Engineering</journal-title>
</journal-title-group>
</journal-meta>
<article-meta><article-id pub-id-type="pmid">26677461</article-id>
<article-id pub-id-type="pmc">4677830</article-id>
<article-id pub-id-type="doi">10.1145/2682571.2797066</article-id>
<article-id pub-id-type="manuscript">NIHMS738539</article-id>
<article-categories><subj-group subj-group-type="heading"><subject>Article</subject>
</subj-group>
</article-categories>
<title-group><article-title>Towards Mobile OCR: How To Take a Good Picture of a Document Without Sight</article-title>
</title-group>
<contrib-group><contrib contrib-type="author"><name><surname>Cutter</surname>
<given-names>Michael</given-names>
</name>
<aff id="A1">University of California, Santa Cruz,<email>mcutter@soe.ucsc.edu</email>
</aff>
</contrib>
<contrib contrib-type="author"><name><surname>Manduchi</surname>
<given-names>Roberto</given-names>
</name>
<aff id="A2">University of California, Santa Cruz,<email>manduchi@soe.ucsc.edu</email>
</aff>
</contrib>
</contrib-group>
<pub-date pub-type="nihms-submitted"><day>28</day>
<month>11</month>
<year>2015</year>
</pub-date>
<pub-date pub-type="ppub"><year>2015</year>
</pub-date>
<pub-date pub-type="pmc-release"><day>14</day>
<month>12</month>
<year>2015</year>
</pub-date>
<volume>2015</volume>
<fpage>75</fpage>
<lpage>84</lpage>
<pmc-comment>elocation-id from pubmed: 10.1145/2682571.2797066</pmc-comment>
      <abstract><p id="P1">The advent of mobile OCR (optical character recognition) applications on regular smartphones holds great promise for enabling blind people to access printed information. Unfortunately, these systems suffer from a problem: in order for OCR output to be meaningful, a well-framed image of the document needs to be taken, something that is difficult to do without sight. This contribution presents an experimental investigation of how blind people position and orient a camera phone while acquiring document images. We developed experimental software to investigate if verbal guidance aids in the acquisition of OCR-readable images without sight. We report on our participant's feedback and performance before and after assistance from our software.</p>
</abstract>
<kwd-group><kwd>Visual Impairment</kwd>
<kwd>Optical Character Recognition</kwd>
<kwd>Document Processing</kwd>
</kwd-group>
<kwd-group><title>General Terms</title>
<kwd>Design</kwd>
<kwd>Experimentation</kwd>
<kwd>Human Factors</kwd>
</kwd-group>
</article-meta>
</front>
</pmc>
<affiliations><list></list>
<tree><noCountry><name sortKey="Cutter, Michael" sort="Cutter, Michael" uniqKey="Cutter M" first="Michael" last="Cutter">Michael Cutter</name>
<name sortKey="Manduchi, Roberto" sort="Manduchi, Roberto" uniqKey="Manduchi R" first="Roberto" last="Manduchi">Roberto Manduchi</name>
</noCountry>
</tree>
</affiliations>
</record>
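
The identifiers carried by the record above (PMID, PMC ID, DOI, Wicri checkpoint key) can also be read without Dilib. The following is only a minimal sketch in Python using the standard library's `xml.etree.ElementTree`; the `SAMPLE` string is an abridged copy of the TEI header shown above, and the helper names `extract_identifiers` and `extract_title` are hypothetical, not part of Dilib or Wicri.

```python
import xml.etree.ElementTree as ET

# Abridged copy of the TEI header from the record above (assumption: only
# the title and a few <idno> elements are kept, to keep the example short).
SAMPLE = """<record><TEI><teiHeader><fileDesc>
<titleStmt><title xml:lang="en">Towards Mobile OCR: How To Take a Good Picture of a Document Without Sight</title></titleStmt>
<publicationStmt><idno type="wicri:source">PMC</idno>
<idno type="pmid">26677461</idno>
<idno type="pmc">4677830</idno>
<idno type="doi">10.1145/2682571.2797066</idno>
<idno type="wicri:Area/Pmc/Checkpoint">000237</idno>
</publicationStmt>
</fileDesc></teiHeader></TEI></record>"""


def extract_identifiers(xml_text):
    """Return a dict mapping each <idno> 'type' attribute to its text."""
    root = ET.fromstring(xml_text)
    return {
        node.get("type"): (node.text or "").strip()
        for node in root.iter("idno")
    }


def extract_title(xml_text):
    """Return the text of the first <title> under <titleStmt>, or None."""
    root = ET.fromstring(xml_text)
    node = root.find(".//titleStmt/title")
    return node.text if node is not None else None


ids = extract_identifiers(SAMPLE)
print(ids["doi"])             # 10.1145/2682571.2797066
print(extract_title(SAMPLE))
```

Keying the result on the `type` attribute mirrors how the record itself distinguishes its identifiers, so the same lookup should work on the full record as long as each `type` value occurs once.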

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/HapticV1/Data/Pmc/Checkpoint

HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000237 | SxmlIndent | more

HfdSelect -h $EXPLOR_AREA/Data/Pmc/Checkpoint/biblio.hfd -nk 000237 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    HapticV1
   |flux=    Pmc
   |étape=   Checkpoint
   |type=    RBID
   |clé=     PMC:4677830
   |texte=   Towards Mobile OCR: How To Take a Good Picture of a Document Without Sight
}}

To generate wiki pages

HfdIndexSelect -h $EXPLOR_AREA/Data/Pmc/Checkpoint/RBID.i   -Sk "pubmed:26677461" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Pmc/Checkpoint/biblio.hfd   \
       | NlmPubMed2Wicri -a HapticV1

This area was generated with Dilib version V0.6.23.
Data generation: Mon Jun 13 01:09:46 2016. Site generation: Wed Mar 6 09:54:07 2024

	Exploration server on haptic devices
	Warning: this site is under development. It was generated automatically from raw corpora, so the information has not been validated.
