Context-based filtering of document images
Internal identifier: 000329 (Istex/Curation); previous: 000328; next: 000330
Authors: E. Ageenko [Finland]; P. Fränti [Finland]
Source:
- Pattern Recognition Letters [ 0167-8655 ] ; 1999.
Abstract
Two statistical context-based filters are introduced for the enhancement of binary document images for compression and recognition. The simple context filter unconditionally changes uncommon pixels in low information contexts, whereas the gain–loss filter (GLF) changes the pixels conditionally depending on whether the gain in compression outweighs the loss of information. The filtering methods alleviate the loss in compression performance caused by digitization noise while preserving the image quality measured as the optical character recognition (OCR) accuracy. The GLF reaches approximately the compression limit estimated by the compression of the noiseless digital original.
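The abstract gives only the idea behind the simple context filter, so the following Python sketch is an illustration under stated assumptions rather than the authors' method. The 8-pixel neighbourhood used as the context, the treatment of out-of-image pixels as white, the threshold `max_rare_prob`, and the names `context_of` and `simple_context_filter` are all hypothetical choices that do not come from the record. The sketch gathers per-context pixel statistics from the image and then flips any pixel whose value is uncommon in its context.

```python
from collections import defaultdict

def context_of(img, r, c):
    """Pack the 8 neighbours of (r, c) into an integer.
    Assumption: pixels outside the image count as 0 (white)."""
    h, w = len(img), len(img[0])
    ctx = 0
    for dr in (-1, 0, 1):
        for dc in (-1, 0, 1):
            if dr == 0 and dc == 0:
                continue  # the pixel itself is not part of its context
            rr, cc = r + dr, c + dc
            bit = img[rr][cc] if 0 <= rr < h and 0 <= cc < w else 0
            ctx = (ctx << 1) | bit
    return ctx

def simple_context_filter(img, max_rare_prob=0.05):
    """Flip pixels whose value is uncommon in their context.
    max_rare_prob is a hypothetical threshold, not a value from the paper."""
    h, w = len(img), len(img[0])
    # Pass 1: count white (index 0) and black (index 1) pixels per context.
    counts = defaultdict(lambda: [0, 0])
    for r in range(h):
        for c in range(w):
            counts[context_of(img, r, c)][img[r][c]] += 1
    # Pass 2: flip a pixel when its own value is rare in its context.
    # Contexts are always read from the unfiltered input image.
    out = [row[:] for row in img]
    for r in range(h):
        for c in range(w):
            n0, n1 = counts[context_of(img, r, c)]
            pixel = img[r][c]
            same = n1 if pixel else n0
            if same / (n0 + n1) < max_rare_prob:
                out[r][c] = 1 - pixel
    return out

# Usage: a white 12x12 page with one isolated black noise pixel.
noisy = [[0] * 12 for _ in range(12)]
noisy[5][5] = 1
filtered = simple_context_filter(noisy)
```

In this toy input the all-white context occurs 136 times and holds a black pixel only once, so the isolated pixel is removed. The gain-loss filter described in the abstract would additionally weigh the compression gain against the information lost before changing a pixel; that decision rule is not sketched here because the record does not specify it.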
Url:
DOI: 10.1016/S0167-8655(00)00011-8
Links toward previous steps (curation, corpus...)
- to stream Istex, to step Corpus: To go to this record in the Curation step: 000334
Links to Exploration step
ISTEX:B3D498D9F5AC28B24926C2F49667CCD0F3650151
The document in XML format
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title>Context-based filtering of document images</title>
<author><name sortKey="Ageenko, E" sort="Ageenko, E" uniqKey="Ageenko E" first="E." last="Ageenko">E. Ageenko</name>
<affiliation wicri:level="1"><mods:affiliation>Department of Computer Science, University of Joensuu, Box 111, FIN-80101 Joensuu, Finland</mods:affiliation>
<country xml:lang="fr">Finlande</country>
<wicri:regionArea>Department of Computer Science, University of Joensuu, Box 111, FIN-80101 Joensuu</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><mods:affiliation>E-mail: ageenko@cs.joensuu.fi</mods:affiliation>
<country wicri:rule="url">Finlande</country>
</affiliation>
</author>
<author><name sortKey="Fr Nti, P" sort="Fr Nti, P" uniqKey="Fr Nti P" first="P." last="Fr Nti">P. Fränti</name>
<affiliation wicri:level="1"><mods:affiliation>Department of Computer Science, University of Joensuu, Box 111, FIN-80101 Joensuu, Finland</mods:affiliation>
<country xml:lang="fr">Finlande</country>
<wicri:regionArea>Department of Computer Science, University of Joensuu, Box 111, FIN-80101 Joensuu</wicri:regionArea>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:B3D498D9F5AC28B24926C2F49667CCD0F3650151</idno>
<date when="2000" year="2000">2000</date>
<idno type="doi">10.1016/S0167-8655(00)00011-8</idno>
<idno type="url">https://api.istex.fr/document/B3D498D9F5AC28B24926C2F49667CCD0F3650151/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000334</idno>
<idno type="wicri:Area/Istex/Curation">000329</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a">Context-based filtering of document images</title>
<author><name sortKey="Ageenko, E" sort="Ageenko, E" uniqKey="Ageenko E" first="E." last="Ageenko">E. Ageenko</name>
<affiliation wicri:level="1"><mods:affiliation>Department of Computer Science, University of Joensuu, Box 111, FIN-80101 Joensuu, Finland</mods:affiliation>
<country xml:lang="fr">Finlande</country>
<wicri:regionArea>Department of Computer Science, University of Joensuu, Box 111, FIN-80101 Joensuu</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1"><mods:affiliation>E-mail: ageenko@cs.joensuu.fi</mods:affiliation>
<country wicri:rule="url">Finlande</country>
</affiliation>
</author>
<author><name sortKey="Fr Nti, P" sort="Fr Nti, P" uniqKey="Fr Nti P" first="P." last="Fr Nti">P. Fränti</name>
<affiliation wicri:level="1"><mods:affiliation>Department of Computer Science, University of Joensuu, Box 111, FIN-80101 Joensuu, Finland</mods:affiliation>
<country xml:lang="fr">Finlande</country>
<wicri:regionArea>Department of Computer Science, University of Joensuu, Box 111, FIN-80101 Joensuu</wicri:regionArea>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="j">Pattern Recognition Letters</title>
<title level="j" type="abbrev">PATREC</title>
<idno type="ISSN">0167-8655</idno>
<imprint><publisher>ELSEVIER</publisher>
<date type="published" when="1999">1999</date>
<biblScope unit="volume">21</biblScope>
<biblScope unit="issue">6–7</biblScope>
<biblScope unit="page" from="483">483</biblScope>
<biblScope unit="page" to="491">491</biblScope>
</imprint>
<idno type="ISSN">0167-8655</idno>
</series>
<idno type="istex">B3D498D9F5AC28B24926C2F49667CCD0F3650151</idno>
<idno type="DOI">10.1016/S0167-8655(00)00011-8</idno>
<idno type="PII">S0167-8655(00)00011-8</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0167-8655</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Two statistical context-based filters are introduced for the enhancement of binary document images for compression and recognition. The simple context filter unconditionally changes uncommon pixels in low information contexts, whereas the gain–loss filter (GLF) changes the pixels conditionally depending on whether the gain in compression outweighs the loss of information. The filtering methods alleviate the loss in compression performance caused by digitization noise while preserving the image quality measured as the optical character recognition (OCR) accuracy. The GLF reaches approximately the compression limit estimated by the compression of the noiseless digital original.</div>
</front>
</TEI>
</record>
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Istex/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000329 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/Istex/Curation/biblio.hfd -nk 000329 | SxmlIndent | more
To link to this page within the Wicri network
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Istex |étape= Curation |type= RBID |clé= ISTEX:B3D498D9F5AC28B24926C2F49667CCD0F3650151 |texte= Context-based filtering of document images }}
This area was generated with Dilib version V0.6.32.