Text enhancement in digital video
Internal identifier: 002139 (Main/Merge); previous: 002138; next: 002140
Authors: Huiping Li [United States]; O. Kia [United States]; David Doermann [United States]
Source:
- SPIE proceedings series [ 1017-2653 ] ; 1999.
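French descriptors
- Pascal (Inist): Traitement image document, Restauration image, Reconnaissance forme, Reconnaissance optique caractère, Recherche documentaire, Analyse documentaire.
- Wicri:
- topic: Recherche documentaire.
English descriptors
- KwdEn: Document analysis, Document image processing, Document retrieval, Image restoration, Optical character recognition, Pattern recognition.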
Abstract
One difficulty with using text from digital video for indexing and retrieval is that video images are often of low resolution and poor quality; as a result, most commercial OCR software cannot recognize the text adequately. Text image enhancement is therefore necessary to achieve reasonable OCR accuracy. Our enhancement consists of two main procedures: resolution enhancement based on Shannon interpolation, and separation of text from complex image backgrounds. Experiments show that our enhancement approach improves OCR accuracy considerably.
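The abstract names Shannon interpolation as the basis of the resolution enhancement step but gives no detail. The following is a minimal sketch of the general technique only, not the authors' implementation: it assumes an integer upsampling factor, a grayscale image held in a NumPy array, and a finite (truncated) sinc sum applied separably along rows and then columns. The function names `sinc_interpolate` and `upsample_image` are hypothetical.

```python
import numpy as np


def sinc_interpolate(samples, factor):
    """Upsample a 1-D signal by an integer factor with a truncated Shannon (sinc) sum.

    Assumption: samples are uniformly spaced and the signal is treated as zero
    outside the given range, so values near the borders are only approximate.
    """
    samples = np.asarray(samples, dtype=float)
    n = np.arange(len(samples))                      # original sample positions
    t = np.arange(len(samples) * factor) / factor    # finer output positions
    # Each output value is a sinc-weighted sum of all input samples.
    kernel = np.sinc(t[:, None] - n[None, :])        # np.sinc(x) = sin(pi x) / (pi x)
    return kernel @ samples


def upsample_image(image, factor):
    """Apply the 1-D interpolation separably: along each row, then along each column."""
    image = np.asarray(image, dtype=float)
    rows = np.apply_along_axis(sinc_interpolate, 1, image, factor)
    return np.apply_along_axis(sinc_interpolate, 0, rows, factor)


if __name__ == "__main__":
    low_res = np.arange(20, dtype=float).reshape(4, 5)  # stand-in for a text block
    high_res = upsample_image(low_res, 2)
    print(high_res.shape)
```

Because the sinc kernel equals one at zero and vanishes at every other integer, the original pixel values are reproduced (up to floating-point error) at every `factor`-th output position; only the in-between positions are newly estimated.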
Links toward previous steps (curation, corpus...)
- to stream PascalFrancis, to step Corpus: 000823
- to stream PascalFrancis, to step Curation: 000B71
- to stream PascalFrancis, to step Checkpoint: 000758
Links to Exploration step
Pascal:99-0297282
The document in XML format
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" level="a">Text enhancement in digital video</title>
<author><name sortKey="Huiping Li" sort="Huiping Li" uniqKey="Huiping Li" last="Huiping Li">HUIPING LI</name>
<affiliation wicri:level="4"><inist:fA14 i1="01"><s1>Language and Media Processing Laboratory, Institute for Advanced Computer Studies, University of Maryland</s1>
<s2>College Park, MD 20742-3275</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName><region type="state">Maryland</region>
<settlement type="city">College Park (Maryland)</settlement>
</placeName>
<orgName type="university">Université du Maryland</orgName>
</affiliation>
</author>
<author><name sortKey="Kia, O" sort="Kia, O" uniqKey="Kia O" first="O." last="Kia">O. Kia</name>
<affiliation wicri:level="2"><inist:fA14 i1="02"><s1>Mathematical and Computational Sciences Division, National Institute of Standards and Technology</s1>
<s2>Gaithersburg, MD 20899</s2>
<s3>USA</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName><region type="state">Maryland</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Doermann, D" sort="Doermann, D" uniqKey="Doermann D" first="D." last="Doermann">David Doermann</name>
<affiliation wicri:level="2"><inist:fA14 i1="02"><s1>Mathematical and Computational Sciences Division, National Institute of Standards and Technology</s1>
<s2>Gaithersburg, MD 20899</s2>
<s3>USA</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName><region type="state">Maryland</region>
</placeName>
<placeName><settlement type="city">College Park (Maryland)</settlement>
<region type="state">Maryland</region>
</placeName>
<orgName type="university" n="3">Université du Maryland</orgName>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">INIST</idno>
<idno type="inist">99-0297282</idno>
<date when="1999">1999</date>
<idno type="stanalyst">PASCAL 99-0297282 INIST</idno>
<idno type="RBID">Pascal:99-0297282</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000823</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000B71</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000758</idno>
<idno type="wicri:doubleKey">1017-2653:1999:Huiping Li:text:enhancement:in</idno>
<idno type="wicri:Area/Main/Merge">002139</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a">Text enhancement in digital video</title>
<author><name sortKey="Huiping Li" sort="Huiping Li" uniqKey="Huiping Li" last="Huiping Li">HUIPING LI</name>
<affiliation wicri:level="4"><inist:fA14 i1="01"><s1>Language and Media Processing Laboratory, Institute for Advanced Computer Studies, University of Maryland</s1>
<s2>College Park, MD 20742-3275</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName><region type="state">Maryland</region>
<settlement type="city">College Park (Maryland)</settlement>
</placeName>
<orgName type="university">Université du Maryland</orgName>
</affiliation>
</author>
<author><name sortKey="Kia, O" sort="Kia, O" uniqKey="Kia O" first="O." last="Kia">O. Kia</name>
<affiliation wicri:level="2"><inist:fA14 i1="02"><s1>Mathematical and Computational Sciences Division, National Institute of Standards and Technology</s1>
<s2>Gaithersburg, MD 20899</s2>
<s3>USA</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName><region type="state">Maryland</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Doermann, D" sort="Doermann, D" uniqKey="Doermann D" first="D." last="Doermann">David Doermann</name>
<affiliation wicri:level="2"><inist:fA14 i1="02"><s1>Mathematical and Computational Sciences Division, National Institute of Standards and Technology</s1>
<s2>Gaithersburg, MD 20899</s2>
<s3>USA</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName><region type="state">Maryland</region>
</placeName>
<placeName><settlement type="city">College Park (Maryland)</settlement>
<region type="state">Maryland</region>
</placeName>
<orgName type="university" n="3">Université du Maryland</orgName>
</affiliation>
</author>
</analytic>
<series><title level="j" type="main">SPIE proceedings series</title>
<idno type="ISSN">1017-2653</idno>
<imprint><date when="1999">1999</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><title level="j" type="main">SPIE proceedings series</title>
<idno type="ISSN">1017-2653</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Document analysis</term>
<term>Document image processing</term>
<term>Document retrieval</term>
<term>Image restoration</term>
<term>Optical character recognition</term>
<term>Pattern recognition</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Traitement image document</term>
<term>Restauration image</term>
<term>Reconnaissance forme</term>
<term>Reconnaissance optique caractère</term>
<term>Recherche documentaire</term>
<term>Analyse documentaire</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr"><term>Recherche documentaire</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">One difficulty with using text from digital video for indexing and retrieval is that video images are often in low resolution and poor quality, and as a result, the text can not be recognized adequately by most commercial OCR software. Text image enhancement is necessary to achieve reasonable OCR accuracy. Our enhancement consists of two main procedures, resolution enhancement based on Shannon interpolation and text separation from complex image background. Experiments show our enhancement approach improves OCR accuracy considerably.</div>
</front>
</TEI>
<affiliations><list><country><li>États-Unis</li>
</country>
<region><li>Maryland</li>
</region>
<settlement><li>College Park (Maryland)</li>
</settlement>
<orgName><li>Université du Maryland</li>
</orgName>
</list>
<tree><country name="États-Unis"><region name="Maryland"><name sortKey="Huiping Li" sort="Huiping Li" uniqKey="Huiping Li" last="Huiping Li">HUIPING LI</name>
</region>
<name sortKey="Doermann, D" sort="Doermann, D" uniqKey="Doermann D" first="D." last="Doermann">David Doermann</name>
<name sortKey="Kia, O" sort="Kia, O" uniqKey="Kia O" first="O." last="Kia">O. Kia</name>
</country>
</tree>
</affiliations>
</record>
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002139 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/Main/Merge/biblio.hfd -nk 002139 | SxmlIndent | more
To link to this page within the Wicri network
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Merge |type= RBID |clé= Pascal:99-0297282 |texte= Text enhancement in digital video }}
This area was generated with Dilib version V0.6.32.