OCR exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Page segmentation and identification for intelligent signal processing

Internal identifier: 002B01 (Main/Merge); previous: 002B00; next: 002B02

Authors: Kuo-Chin Fan [People's Republic of China, Taiwan]; Liang-Shen Wang [People's Republic of China]; Yuan-Kai Wang [People's Republic of China]

Source: Signal Processing [ISSN 0165-1684], vol. 45, no. 3, pp. 329-346

RBID : ISTEX:F54A35D32E2414C90B68141D4EDE9A2FBA0C02F1

Abstract

Document analysis plays an important role in office automation, especially in intelligent signal processing. In this paper, we propose an intelligent document analysis system for document segmentation and identification. The system consists of two modules: block segmentation and block identification. We first segment a document into several non-overlapping blocks using a novel recursive segmentation technique, then extract features from each segmented block. Two kinds of features are extracted: connectivity histograms and multiresolution features. Both are verified to be effective in characterizing document blocks. Finally, a two-layer perceptron in the identification module determines the identity of each block. Experiments with a wide variety of documents verify the feasibility of our approach.
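
The abstract does not say how the recursive segmentation works, so the following is only an illustrative sketch of the general idea of recursively splitting a page into non-overlapping blocks. It substitutes the classic recursive XY-cut (projection-profile cutting), which is not necessarily the authors' technique. The names `xy_cut` and `min_gap`, and the list-of-lists binary page representation, are assumptions made for this example.

```python
from typing import List, Optional, Tuple

# A block is (top, left, bottom, right); bottom and right are exclusive.
Block = Tuple[int, int, int, int]


def _trim(img, top, left, bottom, right) -> Optional[Block]:
    """Shrink a region to the bounding box of its ink pixels (None if blank)."""
    rows = [r for r in range(top, bottom) if any(img[r][left:right])]
    cols = [c for c in range(left, right)
            if any(img[r][c] for r in range(top, bottom))]
    if not rows or not cols:
        return None
    return rows[0], cols[0], rows[-1] + 1, cols[-1] + 1


def _widest_gap(profile: List[bool], min_gap: int) -> Optional[Tuple[int, int]]:
    """Return (start, end) of the longest blank run of at least min_gap cells."""
    best, start = None, None
    for i, filled in enumerate(profile + [True]):  # sentinel closes a trailing run
        if not filled and start is None:
            start = i
        elif filled and start is not None:
            if i - start >= min_gap and (best is None or i - start > best[1] - best[0]):
                best = (start, i)
            start = None
    return best


def xy_cut(img, min_gap: int = 2, region: Optional[Block] = None) -> List[Block]:
    """Recursively split a binary page (1 = ink) at blank rows, then blank columns."""
    if region is None:
        region = (0, 0, len(img), len(img[0]) if img else 0)
    box = _trim(img, *region)
    if box is None:
        return []
    top, left, bottom, right = box

    # Try a horizontal cut first: look for a run of blank rows.
    row_profile = [any(img[r][left:right]) for r in range(top, bottom)]
    gap = _widest_gap(row_profile, min_gap)
    if gap:
        cut = top + gap[0]
        return (xy_cut(img, min_gap, (top, left, cut, right))
                + xy_cut(img, min_gap, (cut, left, bottom, right)))

    # Otherwise try a vertical cut: look for a run of blank columns.
    col_profile = [any(img[r][c] for r in range(top, bottom))
                   for c in range(left, right)]
    gap = _widest_gap(col_profile, min_gap)
    if gap:
        cut = left + gap[0]
        return (xy_cut(img, min_gap, (top, left, bottom, cut))
                + xy_cut(img, min_gap, (top, cut, bottom, right)))

    return [box]  # no cut possible: this region is one block


# Toy page: two small blocks side by side above one full-width block.
page = [
    [1, 1, 0, 0, 1, 1],
    [1, 1, 0, 0, 1, 1],
    [0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0],
    [1, 1, 1, 1, 1, 1],
]
blocks = xy_cut(page, min_gap=2)
print(blocks)  # [(0, 0, 2, 2), (0, 4, 2, 6), (4, 0, 5, 6)]
```

In a pipeline like the one the abstract describes, each returned block would then go to feature extraction and classification; those stages are not sketched here because the abstract gives no detail on how the features are computed.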

URL: https://api.istex.fr/document/F54A35D32E2414C90B68141D4EDE9A2FBA0C02F1/fulltext/pdf
DOI: 10.1016/0165-1684(95)00061-H

Links to Exploration step

ISTEX:F54A35D32E2414C90B68141D4EDE9A2FBA0C02F1

The document in XML format

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title>Page segmentation and identification for intelligent signal processing</title>
<author>
<name sortKey="Fan, Kuo Chin" sort="Fan, Kuo Chin" uniqKey="Fan K" first="Kuo-Chin" last="Fan">Kuo-Chin Fan</name>
</author>
<author>
<name sortKey="Wang, Liang Shen" sort="Wang, Liang Shen" uniqKey="Wang L" first="Liang-Shen" last="Wang">Liang-Shen Wang</name>
</author>
<author>
<name sortKey="Wang, Yuan Kai" sort="Wang, Yuan Kai" uniqKey="Wang Y" first="Yuan-Kai" last="Wang">Yuan-Kai Wang</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:F54A35D32E2414C90B68141D4EDE9A2FBA0C02F1</idno>
<date when="1995" year="1995">1995</date>
<idno type="doi">10.1016/0165-1684(95)00061-H</idno>
<idno type="url">https://api.istex.fr/document/F54A35D32E2414C90B68141D4EDE9A2FBA0C02F1/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000910</idno>
<idno type="wicri:Area/Istex/Curation">000900</idno>
<idno type="wicri:Area/Istex/Checkpoint">001D01</idno>
<idno type="wicri:doubleKey">0165-1684:1995:Fan K:page:segmentation:and</idno>
<idno type="wicri:Area/Main/Merge">002B01</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a">Page segmentation and identification for intelligent signal processing</title>
<author>
<name sortKey="Fan, Kuo Chin" sort="Fan, Kuo Chin" uniqKey="Fan K" first="Kuo-Chin" last="Fan">Kuo-Chin Fan</name>
<affiliation wicri:level="1">
<country xml:lang="fr" wicri:curation="lc">République populaire de Chine</country>
<wicri:regionArea>Institute of Computer Science and Information Engineering, National Central University, Chung-Li, Taiwan 320</wicri:regionArea>
<wicri:noRegion>Taiwan 320</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Taïwan</country>
</affiliation>
</author>
<author>
<name sortKey="Wang, Liang Shen" sort="Wang, Liang Shen" uniqKey="Wang L" first="Liang-Shen" last="Wang">Liang-Shen Wang</name>
<affiliation wicri:level="1">
<country xml:lang="fr" wicri:curation="lc">République populaire de Chine</country>
<wicri:regionArea>Institute of Computer Science and Information Engineering, National Central University, Chung-Li, Taiwan 320</wicri:regionArea>
<wicri:noRegion>Taiwan 320</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Wang, Yuan Kai" sort="Wang, Yuan Kai" uniqKey="Wang Y" first="Yuan-Kai" last="Wang">Yuan-Kai Wang</name>
<affiliation wicri:level="1">
<country xml:lang="fr" wicri:curation="lc">République populaire de Chine</country>
<wicri:regionArea>Institute of Computer Science and Information Engineering, National Central University, Chung-Li, Taiwan 320</wicri:regionArea>
<wicri:noRegion>Taiwan 320</wicri:noRegion>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j">Signal Processing</title>
<title level="j" type="abbrev">SIGPRO</title>
<idno type="ISSN">0165-1684</idno>
<imprint>
<publisher>ELSEVIER</publisher>
<date type="published" when="1995">1995</date>
<biblScope unit="volume">45</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="329">329</biblScope>
<biblScope unit="page" to="346">346</biblScope>
</imprint>
<idno type="ISSN">0165-1684</idno>
</series>
<idno type="istex">F54A35D32E2414C90B68141D4EDE9A2FBA0C02F1</idno>
<idno type="DOI">10.1016/0165-1684(95)00061-H</idno>
<idno type="PII">0165-1684(95)00061-H</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0165-1684</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Document analysis plays an important role in office automation, especially in intelligent signal processing. In this paper, we propose an intelligent document analysis system for document segmentation and identification. The system consists of two modules: block segmentation and block identification. We first segment a document into several non-overlapping blocks using a novel recursive segmentation technique, then extract features from each segmented block. Two kinds of features are extracted: connectivity histograms and multiresolution features. Both are verified to be effective in characterizing document blocks. Finally, a two-layer perceptron in the identification module determines the identity of each block. Experiments with a wide variety of documents verify the feasibility of our approach.</div>
</front>
</TEI>
</record>

To work with this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002B01 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Merge/biblio.hfd -nk 002B01 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Merge
   |type=    RBID
   |clé=     ISTEX:F54A35D32E2414C90B68141D4EDE9A2FBA0C02F1
   |texte=   Page segmentation and identification for intelligent signal processing
}}

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024