OCR exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Contribution to the Automatic Recognition of Business Documents

Internal identifier: 000036 (Hal/Curation); previous: 000035; next: 000037

Authors: Djamel Gaceb [France]; Frank Lebourgeois [France]; Véronique Eglin [France]; Hubert Emptoz [France]

RBID: Hal:inria-00104169

Abstract

The automatic processing of paper documents and mail is a major challenge for all companies. Current recognition systems use modular architectures in which each stage of the process is independent. To improve performance, it is necessary to reintroduce cooperation between the modules, for example by coupling the segmentation and recognition steps, or the zone-of-interest location and segmentation steps. In this context, we propose a mixed approach to text localization and image segmentation that respects real-time constraints. In the first part, we present the state of the art in text location and thresholding in images of postal addresses. In the second part, we describe our method, which simultaneously localizes and segments text zones. The location of text blocks is obtained from a multiresolution approach on cumulated gradients computed directly from grey-level images. Coupling the two processes (text zone location and thresholding) reduces computing time, because only the necessary parts of the image are processed, and yields a better character segmentation for OCR (Optical Character Recognition). We present the results obtained from the implementation of our approach on an industrial line that processes several tons of documents from large companies daily.
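
The coupling of text location and thresholding described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a single coarse resolution level instead of a full multiresolution pyramid, uses only horizontal gradients, and substitutes a simple local mid-range threshold for whatever thresholding the paper uses. The names `cumulated_gradients` and `locate_and_threshold` and the parameters `block` and `ratio` are hypothetical, chosen for illustration only.

```python
import numpy as np


def cumulated_gradients(gray, block=8):
    """Sum absolute horizontal gradients over block x block cells.

    Each cell of the result is one pixel of a coarser resolution level
    (an assumed stand-in for the multiresolution scheme in the abstract).
    """
    g = gray.astype(np.float64)                # avoid uint8 wrap-around
    grad = np.abs(np.diff(g, axis=1))          # shape (H, W-1)
    grad = np.pad(grad, ((0, 0), (0, 1)))      # zero-pad back to (H, W)
    hb, wb = g.shape[0] // block, g.shape[1] // block
    cells = grad[:hb * block, :wb * block].reshape(hb, block, wb, block)
    return cells.sum(axis=(1, 3))              # shape (hb, wb)


def locate_and_threshold(gray, block=8, ratio=0.25):
    """Locate high-gradient cells, then binarize only inside those cells."""
    g = gray.astype(np.float64)
    energy = cumulated_gradients(g, block)
    peak = energy.max() if energy.size else 0.0
    if peak > 0:
        text_cells = energy > ratio * peak     # assumed selection rule
    else:
        text_cells = np.zeros(energy.shape, dtype=bool)

    binary = np.zeros(g.shape, dtype=bool)     # True marks dark (ink) pixels
    for by, bx in np.argwhere(text_cells):
        ys, xs = by * block, bx * block
        patch = g[ys:ys + block, xs:xs + block]
        t = (patch.min() + patch.max()) / 2.0  # local mid-range threshold
        binary[ys:ys + block, xs:xs + block] = patch < t
    return text_cells, binary
```

Calling `locate_and_threshold(image)` on a 2-D grey-level array returns a coarse boolean map of candidate text cells and a full-resolution binary image. Pixels outside the located cells are never thresholded, which is the source of the time saving the abstract refers to.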

The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Contribution to the Automatic Recognition of Business Documents</title>
<author>
<name sortKey="Gaceb, Djamel" sort="Gaceb, Djamel" uniqKey="Gaceb D" first="Djamel" last="Gaceb">Djamel Gaceb</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-2003" status="VALID">
<orgName>Laboratoire d'InfoRmatique en Image et Systèmes d'information</orgName>
<orgName type="acronym">LIRIS</orgName>
<desc>
<address>
<addrLine>Bâtiment Blaise Pascal - 20, avenue Albert Einstein - 69621 Villeurbanne cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://liris.cnrs.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-33804" type="direct"></relation>
<relation active="#struct-126765" type="direct"></relation>
<relation active="#struct-194495" type="direct"></relation>
<relation name="- LYON" active="#struct-301232" type="direct"></relation>
<relation name="UMR5205" active="#struct-441569" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-33804" type="direct">
<org type="institution" xml:id="struct-33804" status="VALID">
<orgName>Université Lumière - Lyon 2</orgName>
<orgName type="acronym">UL2</orgName>
<desc>
<address>
<addrLine>86, rue Pasteur - 69007 Lyon</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon2.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-126765" type="direct">
<org type="institution" xml:id="struct-126765" status="VALID">
<orgName>École Centrale de Lyon</orgName>
<orgName type="acronym">ECL</orgName>
<desc>
<address>
<addrLine>36 avenue Guy de Collongue - 69134 Ecully cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.ec-lyon.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-194495" type="direct">
<org type="institution" xml:id="struct-194495" status="VALID">
<orgName>Université Claude Bernard Lyon 1</orgName>
<orgName type="acronym">UCBL</orgName>
<desc>
<address>
<addrLine>43, boulevard du 11 novembre 1918, 69622 Villeurbanne cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon1.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle name="- LYON" active="#struct-301232" type="direct">
<org type="institution" xml:id="struct-301232" status="VALID">
<orgName>Institut National des Sciences Appliquées</orgName>
<orgName type="acronym">INSA</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="UMR5205" active="#struct-441569" type="direct">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="IdRef">02636817X</idno>
<idno type="ISNI">0000000122597504</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Lyon</settlement>
<region type="region" nuts="2">Auvergne-Rhône-Alpes</region>
<region type="old region" nuts="2">Rhône-Alpes</region>
</placeName>
<orgName type="university">Université Claude Bernard Lyon 1</orgName>
<orgName type="institution" wicri:auto="newGroup">Université de Lyon</orgName>
</affiliation>
</author>
<author>
<name sortKey="Lebourgeois, Frank" sort="Lebourgeois, Frank" uniqKey="Lebourgeois F" first="Frank" last="Lebourgeois">Frank Lebourgeois</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-2003" status="VALID">
<orgName>Laboratoire d'InfoRmatique en Image et Systèmes d'information</orgName>
<orgName type="acronym">LIRIS</orgName>
<desc>
<address>
<addrLine>Bâtiment Blaise Pascal - 20, avenue Albert Einstein - 69621 Villeurbanne cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://liris.cnrs.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-33804" type="direct"></relation>
<relation active="#struct-126765" type="direct"></relation>
<relation active="#struct-194495" type="direct"></relation>
<relation name="- LYON" active="#struct-301232" type="direct"></relation>
<relation name="UMR5205" active="#struct-441569" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-33804" type="direct">
<org type="institution" xml:id="struct-33804" status="VALID">
<orgName>Université Lumière - Lyon 2</orgName>
<orgName type="acronym">UL2</orgName>
<desc>
<address>
<addrLine>86, rue Pasteur - 69007 Lyon</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon2.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-126765" type="direct">
<org type="institution" xml:id="struct-126765" status="VALID">
<orgName>École Centrale de Lyon</orgName>
<orgName type="acronym">ECL</orgName>
<desc>
<address>
<addrLine>36 avenue Guy de Collongue - 69134 Ecully cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.ec-lyon.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-194495" type="direct">
<org type="institution" xml:id="struct-194495" status="VALID">
<orgName>Université Claude Bernard Lyon 1</orgName>
<orgName type="acronym">UCBL</orgName>
<desc>
<address>
<addrLine>43, boulevard du 11 novembre 1918, 69622 Villeurbanne cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon1.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle name="- LYON" active="#struct-301232" type="direct">
<org type="institution" xml:id="struct-301232" status="VALID">
<orgName>Institut National des Sciences Appliquées</orgName>
<orgName type="acronym">INSA</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="UMR5205" active="#struct-441569" type="direct">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="IdRef">02636817X</idno>
<idno type="ISNI">0000000122597504</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Lyon</settlement>
<region type="region" nuts="2">Auvergne-Rhône-Alpes</region>
<region type="old region" nuts="2">Rhône-Alpes</region>
</placeName>
<orgName type="university">Université Claude Bernard Lyon 1</orgName>
<orgName type="institution" wicri:auto="newGroup">Université de Lyon</orgName>
</affiliation>
</author>
<author>
<name sortKey="Eglin, Veronique" sort="Eglin, Veronique" uniqKey="Eglin V" first="Véronique" last="Eglin">Véronique Eglin</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-2003" status="VALID">
<orgName>Laboratoire d'InfoRmatique en Image et Systèmes d'information</orgName>
<orgName type="acronym">LIRIS</orgName>
<desc>
<address>
<addrLine>Bâtiment Blaise Pascal - 20, avenue Albert Einstein - 69621 Villeurbanne cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://liris.cnrs.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-33804" type="direct"></relation>
<relation active="#struct-126765" type="direct"></relation>
<relation active="#struct-194495" type="direct"></relation>
<relation name="- LYON" active="#struct-301232" type="direct"></relation>
<relation name="UMR5205" active="#struct-441569" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-33804" type="direct">
<org type="institution" xml:id="struct-33804" status="VALID">
<orgName>Université Lumière - Lyon 2</orgName>
<orgName type="acronym">UL2</orgName>
<desc>
<address>
<addrLine>86, rue Pasteur - 69007 Lyon</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon2.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-126765" type="direct">
<org type="institution" xml:id="struct-126765" status="VALID">
<orgName>École Centrale de Lyon</orgName>
<orgName type="acronym">ECL</orgName>
<desc>
<address>
<addrLine>36 avenue Guy de Collongue - 69134 Ecully cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.ec-lyon.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-194495" type="direct">
<org type="institution" xml:id="struct-194495" status="VALID">
<orgName>Université Claude Bernard Lyon 1</orgName>
<orgName type="acronym">UCBL</orgName>
<desc>
<address>
<addrLine>43, boulevard du 11 novembre 1918, 69622 Villeurbanne cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon1.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle name="- LYON" active="#struct-301232" type="direct">
<org type="institution" xml:id="struct-301232" status="VALID">
<orgName>Institut National des Sciences Appliquées</orgName>
<orgName type="acronym">INSA</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="UMR5205" active="#struct-441569" type="direct">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="IdRef">02636817X</idno>
<idno type="ISNI">0000000122597504</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Lyon</settlement>
<region type="region" nuts="2">Auvergne-Rhône-Alpes</region>
<region type="old region" nuts="2">Rhône-Alpes</region>
</placeName>
<orgName type="university">Université Claude Bernard Lyon 1</orgName>
<orgName type="institution" wicri:auto="newGroup">Université de Lyon</orgName>
</affiliation>
</author>
<author>
<name sortKey="Emptoz, Hubert" sort="Emptoz, Hubert" uniqKey="Emptoz H" first="Hubert" last="Emptoz">Hubert Emptoz</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-2003" status="VALID">
<orgName>Laboratoire d'InfoRmatique en Image et Systèmes d'information</orgName>
<orgName type="acronym">LIRIS</orgName>
<desc>
<address>
<addrLine>Bâtiment Blaise Pascal - 20, avenue Albert Einstein - 69621 Villeurbanne cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://liris.cnrs.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-33804" type="direct"></relation>
<relation active="#struct-126765" type="direct"></relation>
<relation active="#struct-194495" type="direct"></relation>
<relation name="- LYON" active="#struct-301232" type="direct"></relation>
<relation name="UMR5205" active="#struct-441569" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-33804" type="direct">
<org type="institution" xml:id="struct-33804" status="VALID">
<orgName>Université Lumière - Lyon 2</orgName>
<orgName type="acronym">UL2</orgName>
<desc>
<address>
<addrLine>86, rue Pasteur - 69007 Lyon</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon2.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-126765" type="direct">
<org type="institution" xml:id="struct-126765" status="VALID">
<orgName>École Centrale de Lyon</orgName>
<orgName type="acronym">ECL</orgName>
<desc>
<address>
<addrLine>36 avenue Guy de Collongue - 69134 Ecully cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.ec-lyon.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-194495" type="direct">
<org type="institution" xml:id="struct-194495" status="VALID">
<orgName>Université Claude Bernard Lyon 1</orgName>
<orgName type="acronym">UCBL</orgName>
<desc>
<address>
<addrLine>43, boulevard du 11 novembre 1918, 69622 Villeurbanne cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon1.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle name="- LYON" active="#struct-301232" type="direct">
<org type="institution" xml:id="struct-301232" status="VALID">
<orgName>Institut National des Sciences Appliquées</orgName>
<orgName type="acronym">INSA</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="UMR5205" active="#struct-441569" type="direct">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="IdRef">02636817X</idno>
<idno type="ISNI">0000000122597504</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Lyon</settlement>
<region type="region" nuts="2">Auvergne-Rhône-Alpes</region>
<region type="old region" nuts="2">Rhône-Alpes</region>
</placeName>
<orgName type="university">Université Claude Bernard Lyon 1</orgName>
<orgName type="institution" wicri:auto="newGroup">Université de Lyon</orgName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">HAL</idno>
<idno type="RBID">Hal:inria-00104169</idno>
<idno type="halId">inria-00104169</idno>
<idno type="halUri">https://hal.inria.fr/inria-00104169</idno>
<idno type="url">https://hal.inria.fr/inria-00104169</idno>
<date when="2006-10-23">2006-10-23</date>
<idno type="wicri:Area/Hal/Corpus">000036</idno>
<idno type="wicri:Area/Hal/Curation">000036</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Contribution to the Automatic Recognition of Business Documents</title>
<author>
<name sortKey="Gaceb, Djamel" sort="Gaceb, Djamel" uniqKey="Gaceb D" first="Djamel" last="Gaceb">Djamel Gaceb</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-2003" status="VALID">
<orgName>Laboratoire d'InfoRmatique en Image et Systèmes d'information</orgName>
<orgName type="acronym">LIRIS</orgName>
<desc>
<address>
<addrLine>Bâtiment Blaise Pascal - 20, avenue Albert Einstein - 69621 Villeurbanne cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://liris.cnrs.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-33804" type="direct"></relation>
<relation active="#struct-126765" type="direct"></relation>
<relation active="#struct-194495" type="direct"></relation>
<relation name="- LYON" active="#struct-301232" type="direct"></relation>
<relation name="UMR5205" active="#struct-441569" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-33804" type="direct">
<org type="institution" xml:id="struct-33804" status="VALID">
<orgName>Université Lumière - Lyon 2</orgName>
<orgName type="acronym">UL2</orgName>
<desc>
<address>
<addrLine>86, rue Pasteur - 69007 Lyon</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon2.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-126765" type="direct">
<org type="institution" xml:id="struct-126765" status="VALID">
<orgName>École Centrale de Lyon</orgName>
<orgName type="acronym">ECL</orgName>
<desc>
<address>
<addrLine>36 avenue Guy de Collongue - 69134 Ecully cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.ec-lyon.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-194495" type="direct">
<org type="institution" xml:id="struct-194495" status="VALID">
<orgName>Université Claude Bernard Lyon 1</orgName>
<orgName type="acronym">UCBL</orgName>
<desc>
<address>
<addrLine>43, boulevard du 11 novembre 1918, 69622 Villeurbanne cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon1.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle name="- LYON" active="#struct-301232" type="direct">
<org type="institution" xml:id="struct-301232" status="VALID">
<orgName>Institut National des Sciences Appliquées</orgName>
<orgName type="acronym">INSA</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="UMR5205" active="#struct-441569" type="direct">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="IdRef">02636817X</idno>
<idno type="ISNI">0000000122597504</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Lyon</settlement>
<region type="region" nuts="2">Auvergne-Rhône-Alpes</region>
<region type="old region" nuts="2">Rhône-Alpes</region>
</placeName>
<orgName type="university">Université Claude Bernard Lyon 1</orgName>
<orgName type="institution" wicri:auto="newGroup">Université de Lyon</orgName>
</affiliation>
</author>
<author>
<name sortKey="Lebourgeois, Frank" sort="Lebourgeois, Frank" uniqKey="Lebourgeois F" first="Frank" last="Lebourgeois">Frank Lebourgeois</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-2003" status="VALID">
<orgName>Laboratoire d'InfoRmatique en Image et Systèmes d'information</orgName>
<orgName type="acronym">LIRIS</orgName>
<desc>
<address>
<addrLine>Bâtiment Blaise Pascal - 20, avenue Albert Einstein - 69621 Villeurbanne cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://liris.cnrs.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-33804" type="direct"></relation>
<relation active="#struct-126765" type="direct"></relation>
<relation active="#struct-194495" type="direct"></relation>
<relation name="- LYON" active="#struct-301232" type="direct"></relation>
<relation name="UMR5205" active="#struct-441569" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-33804" type="direct">
<org type="institution" xml:id="struct-33804" status="VALID">
<orgName>Université Lumière - Lyon 2</orgName>
<orgName type="acronym">UL2</orgName>
<desc>
<address>
<addrLine>86, rue Pasteur - 69007 Lyon</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon2.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-126765" type="direct">
<org type="institution" xml:id="struct-126765" status="VALID">
<orgName>École Centrale de Lyon</orgName>
<orgName type="acronym">ECL</orgName>
<desc>
<address>
<addrLine>36 avenue Guy de Collongue - 69134 Ecully cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.ec-lyon.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-194495" type="direct">
<org type="institution" xml:id="struct-194495" status="VALID">
<orgName>Université Claude Bernard Lyon 1</orgName>
<orgName type="acronym">UCBL</orgName>
<desc>
<address>
<addrLine>43, boulevard du 11 novembre 1918, 69622 Villeurbanne cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon1.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle name="- LYON" active="#struct-301232" type="direct">
<org type="institution" xml:id="struct-301232" status="VALID">
<orgName>Institut National des Sciences Appliquées</orgName>
<orgName type="acronym">INSA</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="UMR5205" active="#struct-441569" type="direct">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="IdRef">02636817X</idno>
<idno type="ISNI">0000000122597504</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Lyon</settlement>
<region type="region" nuts="2">Auvergne-Rhône-Alpes</region>
<region type="old region" nuts="2">Rhône-Alpes</region>
</placeName>
<orgName type="university">Université Claude Bernard Lyon 1</orgName>
<orgName type="institution" wicri:auto="newGroup">Université de Lyon</orgName>
</affiliation>
</author>
<author>
<name sortKey="Eglin, Veronique" sort="Eglin, Veronique" uniqKey="Eglin V" first="Véronique" last="Eglin">Véronique Eglin</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-2003" status="VALID">
<orgName>Laboratoire d'InfoRmatique en Image et Systèmes d'information</orgName>
<orgName type="acronym">LIRIS</orgName>
<desc>
<address>
<addrLine>Bâtiment Blaise Pascal - 20, avenue Albert Einstein - 69621 Villeurbanne cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://liris.cnrs.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-33804" type="direct"></relation>
<relation active="#struct-126765" type="direct"></relation>
<relation active="#struct-194495" type="direct"></relation>
<relation name="- LYON" active="#struct-301232" type="direct"></relation>
<relation name="UMR5205" active="#struct-441569" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-33804" type="direct">
<org type="institution" xml:id="struct-33804" status="VALID">
<orgName>Université Lumière - Lyon 2</orgName>
<orgName type="acronym">UL2</orgName>
<desc>
<address>
<addrLine>86, rue Pasteur - 69007 Lyon</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon2.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-126765" type="direct">
<org type="institution" xml:id="struct-126765" status="VALID">
<orgName>École Centrale de Lyon</orgName>
<orgName type="acronym">ECL</orgName>
<desc>
<address>
<addrLine>36 avenue Guy de Collongue - 69134 Ecully cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.ec-lyon.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-194495" type="direct">
<org type="institution" xml:id="struct-194495" status="VALID">
<orgName>Université Claude Bernard Lyon 1</orgName>
<orgName type="acronym">UCBL</orgName>
<desc>
<address>
<addrLine>43, boulevard du 11 novembre 1918, 69622 Villeurbanne cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon1.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle name="- LYON" active="#struct-301232" type="direct">
<org type="institution" xml:id="struct-301232" status="VALID">
<orgName>Institut National des Sciences Appliquées</orgName>
<orgName type="acronym">INSA</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="UMR5205" active="#struct-441569" type="direct">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="IdRef">02636817X</idno>
<idno type="ISNI">0000000122597504</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Lyon</settlement>
<region type="region" nuts="2">Auvergne-Rhône-Alpes</region>
<region type="old region" nuts="2">Rhône-Alpes</region>
</placeName>
<orgName type="university">Université Claude Bernard Lyon 1</orgName>
<orgName type="institution" wicri:auto="newGroup">Université de Lyon</orgName>
</affiliation>
</author>
<author>
<name sortKey="Emptoz, Hubert" sort="Emptoz, Hubert" uniqKey="Emptoz H" first="Hubert" last="Emptoz">Hubert Emptoz</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-2003" status="VALID">
<orgName>Laboratoire d'InfoRmatique en Image et Systèmes d'information</orgName>
<orgName type="acronym">LIRIS</orgName>
<desc>
<address>
<addrLine>Bâtiment Blaise Pascal - 20, avenue Albert Einstein - 69621 Villeurbanne cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://liris.cnrs.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-33804" type="direct"></relation>
<relation active="#struct-126765" type="direct"></relation>
<relation active="#struct-194495" type="direct"></relation>
<relation name="- LYON" active="#struct-301232" type="direct"></relation>
<relation name="UMR5205" active="#struct-441569" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-33804" type="direct">
<org type="institution" xml:id="struct-33804" status="VALID">
<orgName>Université Lumière - Lyon 2</orgName>
<orgName type="acronym">UL2</orgName>
<desc>
<address>
<addrLine>86, rue Pasteur - 69007 Lyon</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon2.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-126765" type="direct">
<org type="institution" xml:id="struct-126765" status="VALID">
<orgName>École Centrale de Lyon</orgName>
<orgName type="acronym">ECL</orgName>
<desc>
<address>
<addrLine>36 avenue Guy de Collongue - 69134 Ecully cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.ec-lyon.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-194495" type="direct">
<org type="institution" xml:id="struct-194495" status="VALID">
<orgName>Université Claude Bernard Lyon 1</orgName>
<orgName type="acronym">UCBL</orgName>
<desc>
<address>
<addrLine>43, boulevard du 11 novembre 1918, 69622 Villeurbanne cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lyon1.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle name="- LYON" active="#struct-301232" type="direct">
<org type="institution" xml:id="struct-301232" status="VALID">
<orgName>Institut National des Sciences Appliquées</orgName>
<orgName type="acronym">INSA</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="UMR5205" active="#struct-441569" type="direct">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="IdRef">02636817X</idno>
<idno type="ISNI">0000000122597504</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Lyon</settlement>
<region type="region" nuts="2">Auvergne-Rhône-Alpes</region>
<region type="old region" nuts="2">Rhône-Alpes</region>
</placeName>
<orgName type="university">Université Claude Bernard Lyon 1</orgName>
<orgName type="institution" wicri:auto="newGroup">Université de Lyon</orgName>
</affiliation>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="mix" xml:lang="en">
<term>Text location</term>
<term>business documents processing</term>
<term>image segmentation</term>
<term>real time processing</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">The automatic processing of paper documents and mail is a major challenge for all companies. Current recognition systems use modular architectures in which each stage of the process is independent. To improve performance, it is necessary to reintroduce cooperation between the modules, for example by coupling the segmentation and recognition steps, or the zone-of-interest location and segmentation steps. In this context, we propose a mixed approach to text localization and image segmentation that respects real-time constraints. In the first part, we present the state of the art in text location and thresholding in images of postal addresses. In the second part, we describe our method, which simultaneously localizes and segments text zones. The location of text blocks is obtained from a multiresolution approach on cumulated gradients computed directly from grey-level images. Coupling the two processes (text zone location and thresholding) reduces computing time, because only the necessary parts of the image are processed, and yields a better character segmentation for OCR (Optical Character Recognition). We present the results obtained from the implementation of our approach on an industrial line that processes several tons of documents from large companies daily.</div>
</front>
</TEI>
<hal api="V3">
<titleStmt>
<title xml:lang="en">Contribution to the Automatic Recognition of Business Documents</title>
<author role="aut">
<persName>
<forename type="first">Djamel</forename>
<surname>Gaceb</surname>
</persName>
<email>djamel.gaceb1@insa-lyon.fr</email>
<idno type="halauthor">135462</idno>
<affiliation ref="#struct-2003"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Frank</forename>
<surname>Lebourgeois</surname>
</persName>
<email>flebourg@rfv.insa-lyon.fr</email>
<idno type="halauthor">135463</idno>
<affiliation ref="#struct-2003"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Véronique</forename>
<surname>Eglin</surname>
</persName>
<email>veronique.eglin@insa-lyon.fr</email>
<idno type="idhal">veronique-eglin</idno>
<idno type="halauthor">135464</idno>
<affiliation ref="#struct-2003"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Hubert</forename>
<surname>Emptoz</surname>
</persName>
<email>hubert.emptoz@liris.cnrs.fr</email>
<idno type="halauthor">135465</idno>
<affiliation ref="#struct-2003"></affiliation>
</author>
<editor role="depositor">
<persName>
<forename>Anne</forename>
<surname>Jaigu</surname>
</persName>
<email>Anne.Jaigu@inria.fr</email>
</editor>
<funder>Université de Rennes 1</funder>
</titleStmt>
<editionStmt>
<edition n="v1" type="current">
<date type="whenSubmitted">2006-10-06 08:47:39</date>
<date type="whenModified">2006-10-06 09:22:02</date>
<date type="whenReleased">2006-10-06 09:22:02</date>
<date type="whenProduced">2006-10-23</date>
<date type="whenEndEmbargoed">2006-10-06</date>
<ref type="file" target="https://hal.inria.fr/inria-00104169/document">
<date notBefore="2006-10-06"></date>
</ref>
<ref type="file" n="1" target="https://hal.inria.fr/inria-00104169/file/cr1044184407776.pdf">
<date notBefore="2006-10-06"></date>
</ref>
</edition>
<respStmt>
<resp>contributor</resp>
<name key="102211">
<persName>
<forename>Anne</forename>
<surname>Jaigu</surname>
</persName>
<email>Anne.Jaigu@inria.fr</email>
</name>
</respStmt>
</editionStmt>
<publicationStmt>
<distributor>CCSD</distributor>
<idno type="halId">inria-00104169</idno>
<idno type="halUri">https://hal.inria.fr/inria-00104169</idno>
<idno type="halBibtex">gaceb:inria-00104169</idno>
<idno type="halRefHtml">Guy Lorette. Tenth International Workshop on Frontiers in Handwriting Recognition, Oct 2006, La Baule (France), Suvisoft, 2006</idno>
<idno type="halRef">Guy Lorette. Tenth International Workshop on Frontiers in Handwriting Recognition, Oct 2006, La Baule (France), Suvisoft, 2006</idno>
</publicationStmt>
<seriesStmt>
<idno type="stamp" n="IWFHR10">International Workshop on Frontiers in Handwriting Recognition</idno>
<idno type="stamp" n="CNRS">CNRS - Centre national de la recherche scientifique</idno>
<idno type="stamp" n="UNIV-LYON1">Université Claude Bernard - Lyon I</idno>
<idno type="stamp" n="UNIV-LYON2">Université Lumière Lyon 2</idno>
<idno type="stamp" n="INSA-LYON">Institut National des Sciences Appliquées de Lyon</idno>
<idno type="stamp" n="EC-LYON">Ecole Centrale de Lyon</idno>
<idno type="stamp" n="LIRIS">Laboratoire d'InfoRmatique en Image et Systèmes d'information</idno>
</seriesStmt>
<notesStmt>
<note type="commentary">http://www.suvisoft.com</note>
<note type="audience" n="1">Not set</note>
<note type="invited" n="0">No</note>
<note type="popular" n="0">No</note>
<note type="peer" n="1">Yes</note>
<note type="proceedings" n="1">Yes</note>
</notesStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Contribution to the Automatic Recognition of Business Documents</title>
<author role="aut">
<persName>
<forename type="first">Djamel</forename>
<surname>Gaceb</surname>
</persName>
<email>djamel.gaceb1@insa-lyon.fr</email>
<idno type="halAuthorId">135462</idno>
<affiliation ref="#struct-2003"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Frank</forename>
<surname>Lebourgeois</surname>
</persName>
<email>flebourg@rfv.insa-lyon.fr</email>
<idno type="halAuthorId">135463</idno>
<affiliation ref="#struct-2003"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Véronique</forename>
<surname>Eglin</surname>
</persName>
<email>veronique.eglin@insa-lyon.fr</email>
<idno type="idHal">veronique-eglin</idno>
<idno type="halAuthorId">135464</idno>
<affiliation ref="#struct-2003"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Hubert</forename>
<surname>Emptoz</surname>
</persName>
<email>hubert.emptoz@liris.cnrs.fr</email>
<idno type="halAuthorId">135465</idno>
<affiliation ref="#struct-2003"></affiliation>
</author>
</analytic>
<monogr>
<meeting>
<title>Tenth International Workshop on Frontiers in Handwriting Recognition</title>
<date type="start">2006-10-23</date>
<settlement>La Baule (France)</settlement>
</meeting>
<respStmt>
<resp>conferenceOrganizer</resp>
<name>Université de Rennes 1</name>
</respStmt>
<editor>Guy Lorette</editor>
<imprint>
<publisher>Suvisoft</publisher>
<date type="datePub">2006-10-23</date>
</imprint>
</monogr>
</biblStruct>
</sourceDesc>
<profileDesc>
<langUsage>
<language ident="en">English</language>
</langUsage>
<textClass>
<keywords scheme="author">
<term xml:lang="en">Text location</term>
<term xml:lang="en">image segmentation</term>
<term xml:lang="en">real time processing</term>
<term xml:lang="en">business documents processing</term>
</keywords>
<classCode scheme="acm" n="I.5"></classCode>
<classCode scheme="acm" n="I.7"></classCode>
<classCode scheme="halDomain" n="info.info-tt">Computer Science [cs]/Document and Text Processing</classCode>
<classCode scheme="halDomain" n="info.info-cv">Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]</classCode>
<classCode scheme="halTypology" n="COMM">Conference papers</classCode>
</textClass>
<abstract xml:lang="en">The automatic processing of paper documents and mail is a major challenge for all companies. Current recognition systems use modular architectures in which each stage of the process is independent. To improve performance, it is necessary to reintroduce cooperation between the different modules, for example by coupling the segmentation and recognition steps, or the zone-of-interest location and segmentation steps. In this context, we propose a mixed approach to text localization and image segmentation that respects real-time constraints. In the first part, we present the state of the art in text location and thresholding in images of postal addresses. In the second part, we describe our method, which simultaneously localizes and segments text zones. The location of text blocks is obtained from a multiresolution approach on cumulated gradients computed directly from grey-level images. Coupling the two processes (text zone location and thresholding) both reduces computing time, by processing only the necessary parts of the image, and yields a better character segmentation for OCR (Optical Character Recognition). We present the results obtained from implementing our approach on an industrial line that processes several tons of documents from large companies every day.</abstract>
</profileDesc>
</hal>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Hal/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000036 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Hal/Curation/biblio.hfd -nk 000036 | SxmlIndent | more
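
Once the record has been extracted, its fields can be read with any XML parser. The following is a minimal sketch in Python, not part of Dilib: it assumes the `<hal>` element of the record has been saved as a namespace-free XML string, and the function name `extract_title_and_authors` is hypothetical. The sample is abridged from this record (two of the four authors).

```python
import xml.etree.ElementTree as ET

# Abridged copy of the <hal> element of this record (assumption: no XML
# namespaces are declared, as in the listing on this page).
SAMPLE = """<hal api="V3">
<titleStmt>
<title xml:lang="en">Contribution to the Automatic Recognition of Business Documents</title>
<author role="aut">
<persName>
<forename type="first">Djamel</forename>
<surname>Gaceb</surname>
</persName>
</author>
<author role="aut">
<persName>
<forename type="first">Frank</forename>
<surname>Lebourgeois</surname>
</persName>
</author>
</titleStmt>
</hal>"""


def extract_title_and_authors(xml_text):
    """Return (title, [author names]) read from the record's <titleStmt>."""
    root = ET.fromstring(xml_text)
    stmt = root.find("titleStmt")
    title = stmt.findtext("title")
    # Build "Forename Surname" for each <author> element, in document order.
    authors = [
        f"{a.findtext('persName/forename')} {a.findtext('persName/surname')}"
        for a in stmt.findall("author")
    ]
    return title, authors


title, authors = extract_title_and_authors(SAMPLE)
print(title)
print(authors)
```

If the real output declares XML namespaces, the element paths above would need the corresponding namespace prefixes.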

To link to this page from within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Hal
   |étape=   Curation
   |type=    RBID
   |clé=     Hal:inria-00104169
   |texte=   Contribution to the Automatic Recognition of Business Documents
}}


This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024