Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Classification of business documents for real time application

Identifieur interne : 000029 ( Hal/Corpus ); précédent : 000028; suivant : 000030

Classification of business documents for real time application

Auteurs : Djamel Gaceb ; Véronique Eglin ; Frank Le Bourgeois

Source :

RBID : Hal:hal-01300863

Abstract

In this paper, we present a new document classification based on physical layout features and graph b-coloring modeling. In order to reduce the computed time and to increase the performance of our automatic reading system, we propose to pre-classify the business documents by introducing an Automatic Recognition of Documents stage (ARD) as a pre-analysis phase. This phase guides the others involved in the recognition process of the documents contents. Once the document type identified, the reading system will use its corresponding information source to improve the recognition of its logical layout, the selection and parameterization of the OCR, and the final decision of sorting. The graph coloring model is introduced for both layout analysis and document classification. The proposed method is reliable, robust to various constraints and guarantees a real-time answer to the sorting ofbusiness documents.

Url:
DOI: 10.1007/s11554-011-0227-4

Links to Exploration step

Hal:hal-01300863

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Classification of business documents for real time application</title>
<author>
<name sortKey="Gaceb, Djamel" sort="Gaceb, Djamel" uniqKey="Gaceb D" first="Djamel" last="Gaceb">Djamel Gaceb</name>
<affiliation>
<hal:affiliation type="researchteam" xml:id="struct-403930" status="VALID">
<orgName>Extraction de Caractéristiques et Identification</orgName>
<orgName type="acronym">imagine</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-2003" type="direct"></relation>
<relation active="#struct-33804" type="indirect"></relation>
<relation active="#struct-126765" type="indirect"></relation>
<relation active="#struct-194495" type="indirect"></relation>
<relation name="- LYON" active="#struct-301232" type="indirect"></relation>
<relation name="UMR5205" active="#struct-441569" type="indirect"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-2003" type="direct">
<org type="laboratory" xml:id="struct-2003" status="VALID">
<orgName>Laboratoire d'InfoRmatique en Image et Systèmes d'information</orgName>
<orgName type="acronym">LIRIS</orgName>
<desc>
<address>
<addrLine>Bâtiment Blaise Pascal - 20, avenue Albert Einstein - 69621 Villeurbanne cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://liris.cnrs.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-33804" type="direct"></relation>
<relation active="#struct-126765" type="direct"></relation>
<relation active="#struct-194495" type="direct"></relation>
<relation name="- LYON" active="#struct-301232" type="direct"></relation>
<relation name="UMR5205" active="#struct-441569" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-33804" type="indirect">
<org type="institution" xml:id="struct-33804" status="VALID">
<orgName>Université Lumière - Lyon 2</orgName>
<orgName type="acronym">UL2</orgName>
<desc>
<address>
<addrLine>86, rue Pasteur - 69007 Lyon</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon2.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-126765" type="indirect">
<org type="institution" xml:id="struct-126765" status="VALID">
<orgName>École Centrale de Lyon</orgName>
<orgName type="acronym">ECL</orgName>
<desc>
<address>
<addrLine>36 avenue Guy de Collongue - 69134 Ecully cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.ec-lyon.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-194495" type="indirect">
<org type="institution" xml:id="struct-194495" status="VALID">
<orgName>Université Claude Bernard Lyon 1</orgName>
<orgName type="acronym">UCBL</orgName>
<desc>
<address>
<addrLine>43, boulevard du 11 novembre 1918, 69622 Villeurbanne cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon1.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle name="- LYON" active="#struct-301232" type="indirect">
<org type="institution" xml:id="struct-301232" status="VALID">
<orgName>Institut National des Sciences Appliquées</orgName>
<orgName type="acronym">INSA</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="UMR5205" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="IdRef">02636817X</idno>
<idno type="ISNI">0000000122597504</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Eglin, Veronique" sort="Eglin, Veronique" uniqKey="Eglin V" first="Véronique" last="Eglin">Véronique Eglin</name>
<affiliation>
<hal:affiliation type="researchteam" xml:id="struct-403930" status="VALID">
<orgName>Extraction de Caractéristiques et Identification</orgName>
<orgName type="acronym">imagine</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-2003" type="direct"></relation>
<relation active="#struct-33804" type="indirect"></relation>
<relation active="#struct-126765" type="indirect"></relation>
<relation active="#struct-194495" type="indirect"></relation>
<relation name="- LYON" active="#struct-301232" type="indirect"></relation>
<relation name="UMR5205" active="#struct-441569" type="indirect"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-2003" type="direct">
<org type="laboratory" xml:id="struct-2003" status="VALID">
<orgName>Laboratoire d'InfoRmatique en Image et Systèmes d'information</orgName>
<orgName type="acronym">LIRIS</orgName>
<desc>
<address>
<addrLine>Bâtiment Blaise Pascal - 20, avenue Albert Einstein - 69621 Villeurbanne cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://liris.cnrs.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-33804" type="direct"></relation>
<relation active="#struct-126765" type="direct"></relation>
<relation active="#struct-194495" type="direct"></relation>
<relation name="- LYON" active="#struct-301232" type="direct"></relation>
<relation name="UMR5205" active="#struct-441569" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-33804" type="indirect">
<org type="institution" xml:id="struct-33804" status="VALID">
<orgName>Université Lumière - Lyon 2</orgName>
<orgName type="acronym">UL2</orgName>
<desc>
<address>
<addrLine>86, rue Pasteur - 69007 Lyon</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon2.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-126765" type="indirect">
<org type="institution" xml:id="struct-126765" status="VALID">
<orgName>École Centrale de Lyon</orgName>
<orgName type="acronym">ECL</orgName>
<desc>
<address>
<addrLine>36 avenue Guy de Collongue - 69134 Ecully cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.ec-lyon.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-194495" type="indirect">
<org type="institution" xml:id="struct-194495" status="VALID">
<orgName>Université Claude Bernard Lyon 1</orgName>
<orgName type="acronym">UCBL</orgName>
<desc>
<address>
<addrLine>43, boulevard du 11 novembre 1918, 69622 Villeurbanne cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon1.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle name="- LYON" active="#struct-301232" type="indirect">
<org type="institution" xml:id="struct-301232" status="VALID">
<orgName>Institut National des Sciences Appliquées</orgName>
<orgName type="acronym">INSA</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="UMR5205" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="IdRef">02636817X</idno>
<idno type="ISNI">0000000122597504</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Le Bourgeois, Frank" sort="Le Bourgeois, Frank" uniqKey="Le Bourgeois F" first="Frank" last="Le Bourgeois">Frank Le Bourgeois</name>
<affiliation>
<hal:affiliation type="researchteam" xml:id="struct-403930" status="VALID">
<orgName>Extraction de Caractéristiques et Identification</orgName>
<orgName type="acronym">imagine</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-2003" type="direct"></relation>
<relation active="#struct-33804" type="indirect"></relation>
<relation active="#struct-126765" type="indirect"></relation>
<relation active="#struct-194495" type="indirect"></relation>
<relation name="- LYON" active="#struct-301232" type="indirect"></relation>
<relation name="UMR5205" active="#struct-441569" type="indirect"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-2003" type="direct">
<org type="laboratory" xml:id="struct-2003" status="VALID">
<orgName>Laboratoire d'InfoRmatique en Image et Systèmes d'information</orgName>
<orgName type="acronym">LIRIS</orgName>
<desc>
<address>
<addrLine>Bâtiment Blaise Pascal - 20, avenue Albert Einstein - 69621 Villeurbanne cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://liris.cnrs.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-33804" type="direct"></relation>
<relation active="#struct-126765" type="direct"></relation>
<relation active="#struct-194495" type="direct"></relation>
<relation name="- LYON" active="#struct-301232" type="direct"></relation>
<relation name="UMR5205" active="#struct-441569" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-33804" type="indirect">
<org type="institution" xml:id="struct-33804" status="VALID">
<orgName>Université Lumière - Lyon 2</orgName>
<orgName type="acronym">UL2</orgName>
<desc>
<address>
<addrLine>86, rue Pasteur - 69007 Lyon</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon2.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-126765" type="indirect">
<org type="institution" xml:id="struct-126765" status="VALID">
<orgName>École Centrale de Lyon</orgName>
<orgName type="acronym">ECL</orgName>
<desc>
<address>
<addrLine>36 avenue Guy de Collongue - 69134 Ecully cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.ec-lyon.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-194495" type="indirect">
<org type="institution" xml:id="struct-194495" status="VALID">
<orgName>Université Claude Bernard Lyon 1</orgName>
<orgName type="acronym">UCBL</orgName>
<desc>
<address>
<addrLine>43, boulevard du 11 novembre 1918, 69622 Villeurbanne cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon1.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle name="- LYON" active="#struct-301232" type="indirect">
<org type="institution" xml:id="struct-301232" status="VALID">
<orgName>Institut National des Sciences Appliquées</orgName>
<orgName type="acronym">INSA</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="UMR5205" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="IdRef">02636817X</idno>
<idno type="ISNI">0000000122597504</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">HAL</idno>
<idno type="RBID">Hal:hal-01300863</idno>
<idno type="halId">hal-01300863</idno>
<idno type="halUri">https://hal.archives-ouvertes.fr/hal-01300863</idno>
<idno type="url">https://hal.archives-ouvertes.fr/hal-01300863</idno>
<idno type="doi">10.1007/s11554-011-0227-4</idno>
<date when="2014-06">2014-06</date>
<idno type="wicri:Area/Hal/Corpus">000029</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Classification of business documents for real time application</title>
<author>
<name sortKey="Gaceb, Djamel" sort="Gaceb, Djamel" uniqKey="Gaceb D" first="Djamel" last="Gaceb">Djamel Gaceb</name>
<affiliation>
<hal:affiliation type="researchteam" xml:id="struct-403930" status="VALID">
<orgName>Extraction de Caractéristiques et Identification</orgName>
<orgName type="acronym">imagine</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-2003" type="direct"></relation>
<relation active="#struct-33804" type="indirect"></relation>
<relation active="#struct-126765" type="indirect"></relation>
<relation active="#struct-194495" type="indirect"></relation>
<relation name="- LYON" active="#struct-301232" type="indirect"></relation>
<relation name="UMR5205" active="#struct-441569" type="indirect"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-2003" type="direct">
<org type="laboratory" xml:id="struct-2003" status="VALID">
<orgName>Laboratoire d'InfoRmatique en Image et Systèmes d'information</orgName>
<orgName type="acronym">LIRIS</orgName>
<desc>
<address>
<addrLine>Bâtiment Blaise Pascal - 20, avenue Albert Einstein - 69621 Villeurbanne cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://liris.cnrs.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-33804" type="direct"></relation>
<relation active="#struct-126765" type="direct"></relation>
<relation active="#struct-194495" type="direct"></relation>
<relation name="- LYON" active="#struct-301232" type="direct"></relation>
<relation name="UMR5205" active="#struct-441569" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-33804" type="indirect">
<org type="institution" xml:id="struct-33804" status="VALID">
<orgName>Université Lumière - Lyon 2</orgName>
<orgName type="acronym">UL2</orgName>
<desc>
<address>
<addrLine>86, rue Pasteur - 69007 Lyon</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon2.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-126765" type="indirect">
<org type="institution" xml:id="struct-126765" status="VALID">
<orgName>École Centrale de Lyon</orgName>
<orgName type="acronym">ECL</orgName>
<desc>
<address>
<addrLine>36 avenue Guy de Collongue - 69134 Ecully cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.ec-lyon.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-194495" type="indirect">
<org type="institution" xml:id="struct-194495" status="VALID">
<orgName>Université Claude Bernard Lyon 1</orgName>
<orgName type="acronym">UCBL</orgName>
<desc>
<address>
<addrLine>43, boulevard du 11 novembre 1918, 69622 Villeurbanne cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon1.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle name="- LYON" active="#struct-301232" type="indirect">
<org type="institution" xml:id="struct-301232" status="VALID">
<orgName>Institut National des Sciences Appliquées</orgName>
<orgName type="acronym">INSA</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="UMR5205" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="IdRef">02636817X</idno>
<idno type="ISNI">0000000122597504</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Eglin, Veronique" sort="Eglin, Veronique" uniqKey="Eglin V" first="Véronique" last="Eglin">Véronique Eglin</name>
<affiliation>
<hal:affiliation type="researchteam" xml:id="struct-403930" status="VALID">
<orgName>Extraction de Caractéristiques et Identification</orgName>
<orgName type="acronym">imagine</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-2003" type="direct"></relation>
<relation active="#struct-33804" type="indirect"></relation>
<relation active="#struct-126765" type="indirect"></relation>
<relation active="#struct-194495" type="indirect"></relation>
<relation name="- LYON" active="#struct-301232" type="indirect"></relation>
<relation name="UMR5205" active="#struct-441569" type="indirect"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-2003" type="direct">
<org type="laboratory" xml:id="struct-2003" status="VALID">
<orgName>Laboratoire d'InfoRmatique en Image et Systèmes d'information</orgName>
<orgName type="acronym">LIRIS</orgName>
<desc>
<address>
<addrLine>Bâtiment Blaise Pascal - 20, avenue Albert Einstein - 69621 Villeurbanne cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://liris.cnrs.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-33804" type="direct"></relation>
<relation active="#struct-126765" type="direct"></relation>
<relation active="#struct-194495" type="direct"></relation>
<relation name="- LYON" active="#struct-301232" type="direct"></relation>
<relation name="UMR5205" active="#struct-441569" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-33804" type="indirect">
<org type="institution" xml:id="struct-33804" status="VALID">
<orgName>Université Lumière - Lyon 2</orgName>
<orgName type="acronym">UL2</orgName>
<desc>
<address>
<addrLine>86, rue Pasteur - 69007 Lyon</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon2.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-126765" type="indirect">
<org type="institution" xml:id="struct-126765" status="VALID">
<orgName>École Centrale de Lyon</orgName>
<orgName type="acronym">ECL</orgName>
<desc>
<address>
<addrLine>36 avenue Guy de Collongue - 69134 Ecully cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.ec-lyon.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-194495" type="indirect">
<org type="institution" xml:id="struct-194495" status="VALID">
<orgName>Université Claude Bernard Lyon 1</orgName>
<orgName type="acronym">UCBL</orgName>
<desc>
<address>
<addrLine>43, boulevard du 11 novembre 1918, 69622 Villeurbanne cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon1.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle name="- LYON" active="#struct-301232" type="indirect">
<org type="institution" xml:id="struct-301232" status="VALID">
<orgName>Institut National des Sciences Appliquées</orgName>
<orgName type="acronym">INSA</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="UMR5205" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="IdRef">02636817X</idno>
<idno type="ISNI">0000000122597504</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Le Bourgeois, Frank" sort="Le Bourgeois, Frank" uniqKey="Le Bourgeois F" first="Frank" last="Le Bourgeois">Frank Le Bourgeois</name>
<affiliation>
<hal:affiliation type="researchteam" xml:id="struct-403930" status="VALID">
<orgName>Extraction de Caractéristiques et Identification</orgName>
<orgName type="acronym">imagine</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-2003" type="direct"></relation>
<relation active="#struct-33804" type="indirect"></relation>
<relation active="#struct-126765" type="indirect"></relation>
<relation active="#struct-194495" type="indirect"></relation>
<relation name="- LYON" active="#struct-301232" type="indirect"></relation>
<relation name="UMR5205" active="#struct-441569" type="indirect"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-2003" type="direct">
<org type="laboratory" xml:id="struct-2003" status="VALID">
<orgName>Laboratoire d'InfoRmatique en Image et Systèmes d'information</orgName>
<orgName type="acronym">LIRIS</orgName>
<desc>
<address>
<addrLine>Bâtiment Blaise Pascal - 20, avenue Albert Einstein - 69621 Villeurbanne cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://liris.cnrs.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-33804" type="direct"></relation>
<relation active="#struct-126765" type="direct"></relation>
<relation active="#struct-194495" type="direct"></relation>
<relation name="- LYON" active="#struct-301232" type="direct"></relation>
<relation name="UMR5205" active="#struct-441569" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-33804" type="indirect">
<org type="institution" xml:id="struct-33804" status="VALID">
<orgName>Université Lumière - Lyon 2</orgName>
<orgName type="acronym">UL2</orgName>
<desc>
<address>
<addrLine>86, rue Pasteur - 69007 Lyon</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon2.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-126765" type="indirect">
<org type="institution" xml:id="struct-126765" status="VALID">
<orgName>École Centrale de Lyon</orgName>
<orgName type="acronym">ECL</orgName>
<desc>
<address>
<addrLine>36 avenue Guy de Collongue - 69134 Ecully cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.ec-lyon.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-194495" type="indirect">
<org type="institution" xml:id="struct-194495" status="VALID">
<orgName>Université Claude Bernard Lyon 1</orgName>
<orgName type="acronym">UCBL</orgName>
<desc>
<address>
<addrLine>43, boulevard du 11 novembre 1918, 69622 Villeurbanne cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon1.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle name="- LYON" active="#struct-301232" type="indirect">
<org type="institution" xml:id="struct-301232" status="VALID">
<orgName>Institut National des Sciences Appliquées</orgName>
<orgName type="acronym">INSA</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="UMR5205" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="IdRef">02636817X</idno>
<idno type="ISNI">0000000122597504</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
</affiliation>
</author>
</analytic>
<idno type="DOI">10.1007/s11554-011-0227-4</idno>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">In this paper, we present a new document classification based on physical layout features and graph b-coloring modeling. In order to reduce the computed time and to increase the performance of our automatic reading system, we propose to pre-classify the business documents by introducing an Automatic Recognition of Documents stage (ARD) as a pre-analysis phase. This phase guides the others involved in the recognition process of the documents contents. Once the document type identified, the reading system will use its corresponding information source to improve the recognition of its logical layout, the selection and parameterization of the OCR, and the final decision of sorting. The graph coloring model is introduced for both layout analysis and document classification. The proposed method is reliable, robust to various constraints and guarantees a real-time answer to the sorting ofbusiness documents.</div>
</front>
</TEI>
<hal api="V3">
<titleStmt>
<title xml:lang="en">Classification of business documents for real time application</title>
<author role="aut">
<persName>
<forename type="first">Djamel</forename>
<surname>Gaceb</surname>
</persName>
<email></email>
<idno type="halauthor">150637</idno>
<affiliation ref="#struct-403930"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Véronique</forename>
<surname>Eglin</surname>
</persName>
<email>veronique.eglin@insa-lyon.fr</email>
<idno type="idhal">veronique-eglin</idno>
<idno type="halauthor">135464</idno>
<affiliation ref="#struct-403930"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Frank</forename>
<surname>Le BOURGEOIS</surname>
</persName>
<email></email>
<idno type="idhal">frank-le-bourgeois</idno>
<idno type="halauthor">86952</idno>
<affiliation ref="#struct-403930"></affiliation>
</author>
<editor role="depositor">
<persName>
<forename>Équipe gestionnaire des publications</forename>
<surname>SI LIRIS</surname>
</persName>
<email>webmaster-hal@liris.cnrs.fr</email>
</editor>
</titleStmt>
<editionStmt>
<edition n="v1" type="current">
<date type="whenSubmitted">2016-04-11 14:56:09</date>
<date type="whenWritten">2014-06</date>
<date type="whenModified">2016-04-12 01:07:10</date>
<date type="whenReleased">2016-04-11 14:56:09</date>
<date type="whenProduced">2014-06</date>
</edition>
<respStmt>
<resp>contributor</resp>
<name key="314372">
<persName>
<forename>Équipe gestionnaire des publications</forename>
<surname>SI LIRIS</surname>
</persName>
<email>webmaster-hal@liris.cnrs.fr</email>
</name>
</respStmt>
</editionStmt>
<publicationStmt>
<distributor>CCSD</distributor>
<idno type="halId">hal-01300863</idno>
<idno type="halUri">https://hal.archives-ouvertes.fr/hal-01300863</idno>
<idno type="halBibtex">gaceb:hal-01300863</idno>
<idno type="halRefHtml">Journal of Real-Time Image Processing, , 2014, 9, pp.329-345. <10.1007/s11554-011-0227-4></idno>
<idno type="halRef">Journal of Real-Time Image Processing, , 2014, 9, pp.329-345. <10.1007/s11554-011-0227-4></idno>
</publicationStmt>
<seriesStmt>
<idno type="stamp" n="CNRS">CNRS - Centre national de la recherche scientifique</idno>
<idno type="stamp" n="EC-LYON">Ecole Centrale de Lyon</idno>
<idno type="stamp" n="UNIV-LYON2">Université Lumière Lyon 2</idno>
<idno type="stamp" n="LIRIS">Laboratoire d'InfoRmatique en Image et Systèmes d'information</idno>
</seriesStmt>
<notesStmt>
<note type="audience" n="2">International</note>
<note type="popular" n="0">No</note>
<note type="peer" n="1">Yes</note>
</notesStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Classification of business documents for real time application</title>
<author role="aut">
<persName>
<forename type="first">Djamel</forename>
<surname>Gaceb</surname>
</persName>
<idno type="halAuthorId">150637</idno>
<affiliation ref="#struct-403930"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Véronique</forename>
<surname>Eglin</surname>
</persName>
<email>veronique.eglin@insa-lyon.fr</email>
<idno type="idHal">veronique-eglin</idno>
<idno type="halAuthorId">135464</idno>
<affiliation ref="#struct-403930"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Frank</forename>
<surname>Le BOURGEOIS</surname>
</persName>
<idno type="idHal">frank-le-bourgeois</idno>
<idno type="halAuthorId">86952</idno>
<affiliation ref="#struct-403930"></affiliation>
</author>
</analytic>
<monogr>
<idno type="localRef">5282</idno>
<idno type="halJournalId" status="INCOMING">108591</idno>
<title level="j">Journal of Real-Time Image Processing, </title>
<imprint>
<biblScope unit="volume">9</biblScope>
<biblScope unit="pp">329-345</biblScope>
<date type="datePub">2014-06</date>
</imprint>
</monogr>
<idno type="doi">10.1007/s11554-011-0227-4</idno>
</biblStruct>
</sourceDesc>
<profileDesc>
<langUsage>
<language ident="en">English</language>
</langUsage>
<textClass>
<classCode scheme="halDomain" n="info">Computer Science [cs]</classCode>
<classCode scheme="halTypology" n="ART">Journal articles</classCode>
</textClass>
<abstract xml:lang="en">In this paper, we present a new document classification based on physical layout features and graph b-coloring modeling. In order to reduce the computed time and to increase the performance of our automatic reading system, we propose to pre-classify the business documents by introducing an Automatic Recognition of Documents stage (ARD) as a pre-analysis phase. This phase guides the others involved in the recognition process of the documents contents. Once the document type identified, the reading system will use its corresponding information source to improve the recognition of its logical layout, the selection and parameterization of the OCR, and the final decision of sorting. The graph coloring model is introduced for both layout analysis and document classification. The proposed method is reliable, robust to various constraints and guarantees a real-time answer to the sorting ofbusiness documents.</abstract>
</profileDesc>
</hal>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Hal/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000029 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Hal/Corpus/biblio.hfd -nk 000029 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Hal
   |étape=   Corpus
   |type=    RBID
   |clé=     Hal:hal-01300863
   |texte=   Classification of business documents for real time application
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024