Serveur d'exploration sur la recherche en informatique en Lorraine

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Text/Graphics Separation Revisited

Identifieur interne : 003286 ( Crin/Corpus ); précédent : 003285; suivant : 003287

Text/Graphics Separation Revisited

Auteurs : Karl Tombre ; Salvatore Tabbone ; Loïc Pélissier ; Bart Lamiroy ; Philippe Dosch

Source :

RBID : CRIN:tombre02a

English descriptors

Abstract

Text/graphics separation aims at segmenting the document into two layers : a layer assumed to contain text and a layer containing graphical objects. In this paper, we present a consolidation of a method proposed by Fletcher and Kasturi, with a number of improvements to make it more suitable for graphics-rich documents. We discuss the right choice of thresholds for this method, and their stability. We also propose a post-processing step for retrieving text components touching the graphics, through local segmentation of the distance skeleton.

Links to Exploration step

CRIN:tombre02a

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" wicri:score="86">Text/Graphics Separation Revisited</title>
</titleStmt>
<publicationStmt>
<idno type="RBID">CRIN:tombre02a</idno>
<date when="2002" year="2002">2002</date>
<idno type="wicri:Area/Crin/Corpus">003286</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Text/Graphics Separation Revisited</title>
<author>
<name sortKey="Tombre, Karl" sort="Tombre, Karl" uniqKey="Tombre K" first="Karl" last="Tombre">Karl Tombre</name>
</author>
<author>
<name sortKey="Tabbone, Salvatore" sort="Tabbone, Salvatore" uniqKey="Tabbone S" first="Salvatore" last="Tabbone">Salvatore Tabbone</name>
</author>
<author>
<name sortKey="Pelissier, Loic" sort="Pelissier, Loic" uniqKey="Pelissier L" first="Loïc" last="Pélissier">Loïc Pélissier</name>
</author>
<author>
<name sortKey="Lamiroy, Bart" sort="Lamiroy, Bart" uniqKey="Lamiroy B" first="Bart" last="Lamiroy">Bart Lamiroy</name>
</author>
<author>
<name sortKey="Dosch, Philippe" sort="Dosch, Philippe" uniqKey="Dosch P" first="Philippe" last="Dosch">Philippe Dosch</name>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>document analysis</term>
<term>segmentation</term>
<term>text/graphics separation</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en" wicri:score="1686">Text/graphics separation aims at segmenting the document into two layers : a layer assumed to contain text and a layer containing graphical objects. In this paper, we present a consolidation of a method proposed by Fletcher and Kasturi, with a number of improvements to make it more suitable for graphics-rich documents. We discuss the right choice of thresholds for this method, and their stability. We also propose a post-processing step for retrieving text components touching the graphics, through local segmentation of the distance skeleton.</div>
</front>
</TEI>
<BibTex type="inproceedings">
<ref>tombre02a</ref>
<crinnumber>A02-R-107</crinnumber>
<category>3</category>
<equipe>QGAR</equipe>
<author>
<e>Tombre, Karl</e>
<e>Tabbone, Salvatore</e>
<e>Pélissier, Loïc</e>
<e>Lamiroy, Bart</e>
<e>Dosch, Philippe</e>
</author>
<title>Text/Graphics Separation Revisited</title>
<booktitle>{5th International Workshop on Document Analysis - DAS'02, Princeton, NJ, USA}</booktitle>
<year>2002</year>
<editor>D. Lopresti, J. Hu and R. Kashi</editor>
<volume>2423</volume>
<series>Lecture Notes in Computer Science</series>
<pages>200-211</pages>
<month>Aug</month>
<publisher>Springer Verlag</publisher>
<keywords>
<e>document analysis</e>
<e>segmentation</e>
<e>text/graphics separation</e>
</keywords>
<abstract>Text/graphics separation aims at segmenting the document into two layers : a layer assumed to contain text and a layer containing graphical objects. In this paper, we present a consolidation of a method proposed by Fletcher and Kasturi, with a number of improvements to make it more suitable for graphics-rich documents. We discuss the right choice of thresholds for this method, and their stability. We also propose a post-processing step for retrieving text components touching the graphics, through local segmentation of the distance skeleton.</abstract>
</BibTex>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Crin/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 003286 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Crin/Corpus/biblio.hfd -nk 003286 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    Crin
   |étape=   Corpus
   |type=    RBID
   |clé=     CRIN:tombre02a
   |texte=   Text/Graphics Separation Revisited
}}

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022