OCR exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Open Source OCR Framework Using Mobile Devices

Internal identifier: 000231 (PascalFrancis/Checkpoint); previous: 000230; next: 000232

Authors: Steven Zhiying Zhou [Singapore]; Syed Omer Gilani [Singapore]; Stefan Winkler [Singapore]

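Source: Proceedings electronic imaging science and technology, vol. 6821 (Multimedia on mobile devices 2008, 28-29 January 2008, San Jose, California, USA), pp. 682104.1-682104.6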

RBID : Pascal:08-0426718

Abstract

Mobile phones have evolved from passive one-to-one communication devices into powerful handheld computing devices. Today most new mobile phones can capture images, record video, browse the internet, and do much more. Exciting new social applications are emerging on the mobile landscape, such as business card readers, sign detectors, and translators. These applications help people quickly gather information in digital format and interpret it without needing to carry laptops or tablet PCs. However, despite all these advancements, we find very little open source software available for mobile phones. For instance, there are currently many open source OCR engines for desktop platforms but, to our knowledge, none are available on mobile platforms. Keeping this in perspective, we propose a complete text detection and recognition system with speech synthesis ability, using existing desktop technology. In this work we developed a complete OCR framework with subsystems from the open source desktop community. This includes a popular open source OCR engine named Tesseract for text detection and recognition, and the Flite speech synthesis module for adding text-to-speech ability.


The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Open Source OCR Framework Using Mobile Devices</title>
<author>
<name sortKey="Zhiying Zhou, Steven" sort="Zhiying Zhou, Steven" uniqKey="Zhiying Zhou S" first="Steven" last="Zhiying Zhou">Steven Zhiying Zhou</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Interactive Multimedia Lab, Department of Electrical and Computer Engineering National University of Singapore, 10 Kent Ridge Crescent</s1>
<s2>Singapore 117576</s2>
<s3>SGP</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Singapour</country>
<wicri:noRegion>Singapore 117576</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Syed Omer Gilani" sort="Syed Omer Gilani" uniqKey="Syed Omer Gilani" last="Syed Omer Gilani">SYED OMER GILANI</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Interactive Multimedia Lab, Department of Electrical and Computer Engineering National University of Singapore, 10 Kent Ridge Crescent</s1>
<s2>Singapore 117576</s2>
<s3>SGP</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Singapour</country>
<wicri:noRegion>Singapore 117576</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Winkler, Stefan" sort="Winkler, Stefan" uniqKey="Winkler S" first="Stefan" last="Winkler">Stefan Winkler</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Interactive Multimedia Lab, Department of Electrical and Computer Engineering National University of Singapore, 10 Kent Ridge Crescent</s1>
<s2>Singapore 117576</s2>
<s3>SGP</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Singapour</country>
<wicri:noRegion>Singapore 117576</wicri:noRegion>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">08-0426718</idno>
<date when="2008">2008</date>
<idno type="stanalyst">PASCAL 08-0426718 INIST</idno>
<idno type="RBID">Pascal:08-0426718</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000265</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000519</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000231</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Open Source OCR Framework Using Mobile Devices</title>
<author>
<name sortKey="Zhiying Zhou, Steven" sort="Zhiying Zhou, Steven" uniqKey="Zhiying Zhou S" first="Steven" last="Zhiying Zhou">Steven Zhiying Zhou</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Interactive Multimedia Lab, Department of Electrical and Computer Engineering National University of Singapore, 10 Kent Ridge Crescent</s1>
<s2>Singapore 117576</s2>
<s3>SGP</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Singapour</country>
<wicri:noRegion>Singapore 117576</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Syed Omer Gilani" sort="Syed Omer Gilani" uniqKey="Syed Omer Gilani" last="Syed Omer Gilani">SYED OMER GILANI</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Interactive Multimedia Lab, Department of Electrical and Computer Engineering National University of Singapore, 10 Kent Ridge Crescent</s1>
<s2>Singapore 117576</s2>
<s3>SGP</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Singapour</country>
<wicri:noRegion>Singapore 117576</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Winkler, Stefan" sort="Winkler, Stefan" uniqKey="Winkler S" first="Stefan" last="Winkler">Stefan Winkler</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Interactive Multimedia Lab, Department of Electrical and Computer Engineering National University of Singapore, 10 Kent Ridge Crescent</s1>
<s2>Singapore 117576</s2>
<s3>SGP</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Singapour</country>
<wicri:noRegion>Singapore 117576</wicri:noRegion>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">Proceedings electronic imaging science and technology</title>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Proceedings electronic imaging science and technology</title>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Character recognition</term>
<term>Electronic trade</term>
<term>Information browsing</term>
<term>Internet</term>
<term>Linguistic analysis</term>
<term>Mobile phone</term>
<term>Mobile platform</term>
<term>Mobile radiocommunication</term>
<term>Optical character recognition</term>
<term>Pattern recognition</term>
<term>Portable equipment</term>
<term>Reader</term>
<term>Speech synthesis</term>
<term>Subsystem</term>
<term>System synthesis</term>
<term>Wireless telecommunication</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Reconnaissance optique caractère</term>
<term>Radiocommunication service mobile</term>
<term>Téléphone portable</term>
<term>Appareil portatif</term>
<term>Navigation information</term>
<term>Internet</term>
<term>Commerce électronique</term>
<term>Lecteur</term>
<term>Plateforme mobile</term>
<term>Reconnaissance caractère</term>
<term>Synthèse système</term>
<term>Synthèse parole</term>
<term>Sous système</term>
<term>Analyse linguistique</term>
<term>Reconnaissance forme</term>
<term>Télécommunication sans fil</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr">
<term>Commerce électronique</term>
<term>Télécommunication sans fil</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Mobile phones have evolved from passive one-to-one communication device to powerful handheld computing device. Today most new mobile phones are capable of capturing images, recording video, and browsing internet and do much more. Exciting new social applications are emerging on mobile landscape, like, business card readers, sing detectors and translators. These applications help people quickly gather the information in digital format and interpret them without the need of carrying laptops or tablet PCs. However with all these advancements we find very few open source software available for mobile phones. For instance currently there are many open source OCR engines for desktop platform but, to our knowledge, none are available on mobile platform. Keeping this in perspective we propose a complete text detection and recognition system with speech synthesis ability, using existing desktop technology. In this work we developed a complete OCR framework with subsystems from open source desktop community. This includes a popular open source OCR engine named Tesseract for text detection &amp; recognition and Flite speech synthesis module, for adding text-to-speech ability.</div>
</front>
</TEI>
<inist>
<standard h6="B">
<pA>
<fA05>
<s2>6821</s2>
</fA05>
<fA08 i1="01" i2="1" l="ENG">
<s1>Open Source OCR Framework Using Mobile Devices</s1>
</fA08>
<fA09 i1="01" i2="1" l="ENG">
<s1>Multimedia on mobile devices 2008 : 28-29 January 2008, San Jose, California, USA</s1>
</fA09>
<fA11 i1="01" i2="1">
<s1>ZHIYING ZHOU (Steven)</s1>
</fA11>
<fA11 i1="02" i2="1">
<s1>SYED OMER GILANI</s1>
</fA11>
<fA11 i1="03" i2="1">
<s1>WINKLER (Stefan)</s1>
</fA11>
<fA12 i1="01" i2="1">
<s1>CREUTZBURG (Reiner)</s1>
<s9>ed.</s9>
</fA12>
<fA12 i1="02" i2="1">
<s1>TAKALA (Jarmo H.)</s1>
<s9>ed.</s9>
</fA12>
<fA14 i1="01">
<s1>Interactive Multimedia Lab, Department of Electrical and Computer Engineering National University of Singapore, 10 Kent Ridge Crescent</s1>
<s2>Singapore 117576</s2>
<s3>SGP</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</fA14>
<fA18 i1="01" i2="1">
<s1>IS &amp; T--the Society for Imaging Science and Technology</s1>
<s3>USA</s3>
<s9>org-cong.</s9>
</fA18>
<fA18 i1="02" i2="1">
<s1>Society of Photo-optical Instrumentation Engineers</s1>
<s3>USA</s3>
<s9>org-cong.</s9>
</fA18>
<fA20>
<s2>682104.1-682104.6</s2>
</fA20>
<fA21>
<s1>2008</s1>
</fA21>
<fA23 i1="01">
<s0>ENG</s0>
</fA23>
<fA26 i1="01">
<s0>978-0-8194-6993-9</s0>
</fA26>
<fA43 i1="01">
<s1>INIST</s1>
<s2>21760</s2>
<s5>354000172854880030</s5>
</fA43>
<fA44>
<s0>0000</s0>
<s1>© 2008 INIST-CNRS. All rights reserved.</s1>
</fA44>
<fA45>
<s0>10 ref.</s0>
</fA45>
<fA47 i1="01" i2="1">
<s0>08-0426718</s0>
</fA47>
<fA60>
<s1>P</s1>
<s2>C</s2>
</fA60>
<fA61>
<s0>A</s0>
</fA61>
<fA64 i1="01" i2="2">
<s0>Proceedings electronic imaging science and technology</s0>
</fA64>
<fA66 i1="01">
<s0>USA</s0>
</fA66>
<fC01 i1="01" l="ENG">
<s0>Mobile phones have evolved from passive one-to-one communication device to powerful handheld computing device. Today most new mobile phones are capable of capturing images, recording video, and browsing internet and do much more. Exciting new social applications are emerging on mobile landscape, like, business card readers, sing detectors and translators. These applications help people quickly gather the information in digital format and interpret them without the need of carrying laptops or tablet PCs. However with all these advancements we find very few open source software available for mobile phones. For instance currently there are many open source OCR engines for desktop platform but, to our knowledge, none are available on mobile platform. Keeping this in perspective we propose a complete text detection and recognition system with speech synthesis ability, using existing desktop technology. In this work we developed a complete OCR framework with subsystems from open source desktop community. This includes a popular open source OCR engine named Tesseract for text detection &amp; recognition and Flite speech synthesis module, for adding text-to-speech ability.</s0>
</fC01>
<fC02 i1="01" i2="X">
<s0>001D04A05A</s0>
</fC02>
<fC02 i1="02" i2="X">
<s0>001D04B04G2</s0>
</fC02>
<fC02 i1="03" i2="X">
<s0>001D04B02H1</s0>
</fC02>
<fC02 i1="04" i2="X">
<s0>001D04B03D4</s0>
</fC02>
<fC03 i1="01" i2="X" l="FRE">
<s0>Reconnaissance optique caractère</s0>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="X" l="ENG">
<s0>Optical character recognition</s0>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="X" l="SPA">
<s0>Reconocimiento óptico de caracteres</s0>
<s5>01</s5>
</fC03>
<fC03 i1="02" i2="X" l="FRE">
<s0>Radiocommunication service mobile</s0>
<s5>02</s5>
</fC03>
<fC03 i1="02" i2="X" l="ENG">
<s0>Mobile radiocommunication</s0>
<s5>02</s5>
</fC03>
<fC03 i1="02" i2="X" l="SPA">
<s0>Radiocomunicación servicio móvil</s0>
<s5>02</s5>
</fC03>
<fC03 i1="03" i2="X" l="FRE">
<s0>Téléphone portable</s0>
<s5>03</s5>
</fC03>
<fC03 i1="03" i2="X" l="ENG">
<s0>Mobile phone</s0>
<s5>03</s5>
</fC03>
<fC03 i1="03" i2="X" l="SPA">
<s0>Teléfono móvil</s0>
<s5>03</s5>
</fC03>
<fC03 i1="04" i2="X" l="FRE">
<s0>Appareil portatif</s0>
<s5>04</s5>
</fC03>
<fC03 i1="04" i2="X" l="ENG">
<s0>Portable equipment</s0>
<s5>04</s5>
</fC03>
<fC03 i1="04" i2="X" l="SPA">
<s0>Aparato portátil</s0>
<s5>04</s5>
</fC03>
<fC03 i1="05" i2="X" l="FRE">
<s0>Navigation information</s0>
<s5>05</s5>
</fC03>
<fC03 i1="05" i2="X" l="ENG">
<s0>Information browsing</s0>
<s5>05</s5>
</fC03>
<fC03 i1="05" i2="X" l="SPA">
<s0>Navegación información</s0>
<s5>05</s5>
</fC03>
<fC03 i1="06" i2="X" l="FRE">
<s0>Internet</s0>
<s5>06</s5>
</fC03>
<fC03 i1="06" i2="X" l="ENG">
<s0>Internet</s0>
<s5>06</s5>
</fC03>
<fC03 i1="06" i2="X" l="SPA">
<s0>Internet</s0>
<s5>06</s5>
</fC03>
<fC03 i1="07" i2="X" l="FRE">
<s0>Commerce électronique</s0>
<s5>07</s5>
</fC03>
<fC03 i1="07" i2="X" l="ENG">
<s0>Electronic trade</s0>
<s5>07</s5>
</fC03>
<fC03 i1="07" i2="X" l="SPA">
<s0>Comercio electrónico</s0>
<s5>07</s5>
</fC03>
<fC03 i1="08" i2="X" l="FRE">
<s0>Lecteur</s0>
<s5>08</s5>
</fC03>
<fC03 i1="08" i2="X" l="ENG">
<s0>Reader</s0>
<s5>08</s5>
</fC03>
<fC03 i1="08" i2="X" l="SPA">
<s0>Lector</s0>
<s5>08</s5>
</fC03>
<fC03 i1="09" i2="X" l="FRE">
<s0>Plateforme mobile</s0>
<s5>09</s5>
</fC03>
<fC03 i1="09" i2="X" l="ENG">
<s0>Mobile platform</s0>
<s5>09</s5>
</fC03>
<fC03 i1="09" i2="X" l="SPA">
<s0>Plataforma móvil</s0>
<s5>09</s5>
</fC03>
<fC03 i1="10" i2="X" l="FRE">
<s0>Reconnaissance caractère</s0>
<s5>10</s5>
</fC03>
<fC03 i1="10" i2="X" l="ENG">
<s0>Character recognition</s0>
<s5>10</s5>
</fC03>
<fC03 i1="10" i2="X" l="SPA">
<s0>Reconocimiento carácter</s0>
<s5>10</s5>
</fC03>
<fC03 i1="11" i2="X" l="FRE">
<s0>Synthèse système</s0>
<s5>11</s5>
</fC03>
<fC03 i1="11" i2="X" l="ENG">
<s0>System synthesis</s0>
<s5>11</s5>
</fC03>
<fC03 i1="11" i2="X" l="SPA">
<s0>Síntesis sistema</s0>
<s5>11</s5>
</fC03>
<fC03 i1="12" i2="X" l="FRE">
<s0>Synthèse parole</s0>
<s5>12</s5>
</fC03>
<fC03 i1="12" i2="X" l="ENG">
<s0>Speech synthesis</s0>
<s5>12</s5>
</fC03>
<fC03 i1="12" i2="X" l="SPA">
<s0>Síntesis palabra</s0>
<s5>12</s5>
</fC03>
<fC03 i1="13" i2="X" l="FRE">
<s0>Sous système</s0>
<s5>13</s5>
</fC03>
<fC03 i1="13" i2="X" l="ENG">
<s0>Subsystem</s0>
<s5>13</s5>
</fC03>
<fC03 i1="13" i2="X" l="SPA">
<s0>Subsistema</s0>
<s5>13</s5>
</fC03>
<fC03 i1="14" i2="X" l="FRE">
<s0>Analyse linguistique</s0>
<s5>14</s5>
</fC03>
<fC03 i1="14" i2="X" l="ENG">
<s0>Linguistic analysis</s0>
<s5>14</s5>
</fC03>
<fC03 i1="14" i2="X" l="SPA">
<s0>Análisis lingüístico</s0>
<s5>14</s5>
</fC03>
<fC03 i1="15" i2="X" l="FRE">
<s0>Reconnaissance forme</s0>
<s5>31</s5>
</fC03>
<fC03 i1="15" i2="X" l="ENG">
<s0>Pattern recognition</s0>
<s5>31</s5>
</fC03>
<fC03 i1="15" i2="X" l="SPA">
<s0>Reconocimiento patrón</s0>
<s5>31</s5>
</fC03>
<fC03 i1="16" i2="X" l="FRE">
<s0>Télécommunication sans fil</s0>
<s5>32</s5>
</fC03>
<fC03 i1="16" i2="X" l="ENG">
<s0>Wireless telecommunication</s0>
<s5>32</s5>
</fC03>
<fC03 i1="16" i2="X" l="SPA">
<s0>Telecomunicación sin hilo</s0>
<s5>32</s5>
</fC03>
<fN21>
<s1>280</s1>
</fN21>
<fN44 i1="01">
<s1>OTO</s1>
</fN44>
<fN82>
<s1>OTO</s1>
</fN82>
</pA>
<pR>
<fA30 i1="01" i2="1" l="ENG">
<s1>Multimedia on mobile devices</s1>
<s3>San Jose CA USA</s3>
<s4>2008</s4>
</fA30>
</pR>
</standard>
</inist>
<affiliations>
<list>
<country>
<li>Singapour</li>
</country>
</list>
<tree>
<country name="Singapour">
<noRegion>
<name sortKey="Zhiying Zhou, Steven" sort="Zhiying Zhou, Steven" uniqKey="Zhiying Zhou S" first="Steven" last="Zhiying Zhou">Steven Zhiying Zhou</name>
</noRegion>
<name sortKey="Syed Omer Gilani" sort="Syed Omer Gilani" uniqKey="Syed Omer Gilani" last="Syed Omer Gilani">SYED OMER GILANI</name>
<name sortKey="Winkler, Stefan" sort="Winkler, Stefan" uniqKey="Winkler S" first="Stefan" last="Winkler">Stefan Winkler</name>
</country>
</tree>
</affiliations>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/PascalFrancis/Checkpoint
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000231 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Checkpoint/biblio.hfd -nk 000231 | SxmlIndent | more
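
Once the record has been extracted with one of the commands above, it can be post-processed with ordinary XML tooling. The following is only a minimal Python sketch, not part of Dilib. It assumes the record has been saved as text. It also assumes that the `wicri:` and `inist:` prefixes are not declared in the output and that a bare `&` may appear in the text, both of which a strict XML parser rejects. The wrapper element, the placeholder namespace URIs, and the function names `parse_record` and `summarize` are hypothetical. The embedded excerpt is a shortened copy of the record shown on this page.

```python
import re
import xml.etree.ElementTree as ET

# Shortened, illustrative excerpt of the record shown on this page.
RECORD = """<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Open Source OCR Framework Using Mobile Devices</title>
<author>
<name sortKey="Zhiying Zhou, Steven">Steven Zhiying Zhou</name>
<affiliation wicri:level="1">
<country>Singapour</country>
</affiliation>
</author>
<author>
<name sortKey="Winkler, Stefan">Stefan Winkler</name>
<affiliation wicri:level="1">
<country>Singapour</country>
</affiliation>
</author>
</titleStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Optical character recognition</term>
<term>Speech synthesis</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Tesseract for text detection & recognition</div>
</front>
</TEI>
</record>"""


def parse_record(text):
    """Parse a record whose prefixes are undeclared and whose '&' may be bare."""
    # Escape ampersands that do not already start an entity reference.
    text = re.sub(r"&(?!#?\w+;)", "&amp;", text)
    # Bind the prefixes to placeholder URIs (assumed, not the real ones)
    # by wrapping the record in a hypothetical root element.
    wrapped = (
        '<wrapper xmlns:wicri="urn:example:wicri" '
        'xmlns:inist="urn:example:inist">' + text + "</wrapper>"
    )
    return ET.fromstring(wrapped).find("record")


def summarize(record):
    """Collect the title, author names, and English keywords of a record."""
    title_stmt = record.find("./TEI/teiHeader/fileDesc/titleStmt")
    return {
        "title": title_stmt.findtext("title"),
        "authors": [name.text for name in title_stmt.findall("author/name")],
        "keywords": [
            term.text
            for term in record.findall(".//keywords[@scheme='KwdEn']/term")
        ],
    }


if __name__ == "__main__":
    info = summarize(parse_record(RECORD))
    print(info["title"])
    print("; ".join(info["authors"]))
    print("; ".join(info["keywords"]))
```

Wrapping the record rather than stripping the prefixes keeps the original element and attribute names intact, so the same paths can be reused on the full record.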

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    PascalFrancis
   |étape=   Checkpoint
   |type=    RBID
   |clé=     Pascal:08-0426718
   |texte=   Open Source OCR Framework Using Mobile Devices
}}

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024