Exploration server for computer science research in Lorraine

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Scan-to-XML for Vector Graphics: an experimental setup for intelligent browsable document generation

Internal identifier: 005D39 (Hal/Checkpoint); previous: 005D38; next: 005D40

Authors: Bart Lamiroy [France]; Laurent Najman; Romain Ehrhard [France]; Céline Louis [France]; Franck Quélain [France]; Nicolas Rouyer [France]; Nabil Zeghache [France]

RBID : Hal:inria-00100445

Abstract

This paper describes an experimental setup, conducted in collaboration with the ISA research group of the LORIA laboratory, Océ-PLT, and students from the École des Mines de Nancy. The main objective is to experiment with an approach to developing a high-level document analysis platform by composing existing building blocks from a comprehensive library of state-of-the-art algorithms. The test case for this methodology is a fully automated method for generating a browsable, hyperlinked document from a simple scanned image. We concentrated our work on cutaway diagrams. These documents have the advantage of simple browsing semantics: they consist of a clearly identifiable legend containing index references, plus a drawing containing one or more occurrences of the same indices. The setup described in this paper starts from a raw binary image of a cutaway diagram and delivers an XML description matching the references of the legend with the indices in the image, together with a browser for interpreting the generated XML map. The complete document processing pipeline is designed within a combined scripting and compiled-library environment.
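
As a rough illustration of the final step the abstract describes (matching legend references with index occurrences in the drawing and emitting an XML map), the following Python sketch may help. It is not the authors' implementation: the function name `build_index_map`, the element names `diagram`, `entry` and `occurrence`, and the sample data are all hypothetical, and the recognition stages that would produce the legend and the occurrence coordinates from the scanned image are assumed to have already run.

```python
import xml.etree.ElementTree as ET


def build_index_map(legend, occurrences):
    """Group index occurrences under their legend entry (hypothetical schema).

    legend:      dict mapping an index number to its legend label
    occurrences: list of (index, x, y) tuples located in the drawing
    """
    root = ET.Element("diagram")
    for index, label in sorted(legend.items()):
        entry = ET.SubElement(root, "entry", index=str(index), label=label)
        # Attach every occurrence in the drawing that carries this index.
        for occ_index, x, y in occurrences:
            if occ_index == index:
                ET.SubElement(entry, "occurrence", x=str(x), y=str(y))
    return root


# Made-up sample data standing in for the output of the recognition stages.
legend = {1: "piston", 2: "crankshaft"}
occurrences = [(1, 120, 45), (2, 200, 310), (1, 410, 52)]

xml_map = ET.tostring(build_index_map(legend, occurrences), encoding="unicode")
print(xml_map)
```

A browser for such a map would then only need to look up an `entry` by its index to highlight the corresponding positions in the image.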

Links to Exploration step

Hal:inria-00100445

The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Scan-to-XML for Vector Graphics: an experimental setup for intelligent browsable document generation</title>
<author>
<name sortKey="Lamiroy, Bart" sort="Lamiroy, Bart" uniqKey="Lamiroy B" first="Bart" last="Lamiroy">Bart Lamiroy</name>
<affiliation wicri:level="1">
<hal:affiliation type="researchteam" xml:id="struct-2350" status="OLD">
<idno type="RNSR">199521440F</idno>
<orgName>Models, algorithms and geometry for computer graphics and vision</orgName>
<orgName type="acronym">ISA</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/equipes/isa</ref>
</desc>
<listRelation>
<relation active="#struct-160" type="direct"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-300291" type="indirect"></relation>
<relation active="#struct-300292" type="indirect"></relation>
<relation active="#struct-300293" type="indirect"></relation>
<relation active="#struct-2496" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-160" type="direct">
<org type="laboratory" xml:id="struct-160" status="OLD">
<orgName>Laboratoire Lorrain de Recherche en Informatique et ses Applications</orgName>
<orgName type="acronym">LORIA</orgName>
<desc>
<address>
<addrLine>Campus Scientifique BP 239 54506 Vandoeuvre-lès-Nancy Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr</ref>
</desc>
<listRelation>
<relation name="UMR7503" active="#struct-441569" type="direct"></relation>
<relation active="#struct-300009" type="direct"></relation>
<relation active="#struct-300291" type="direct"></relation>
<relation active="#struct-300292" type="direct"></relation>
<relation active="#struct-300293" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle name="UMR7503" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="ISNI">0000000122597504</idno>
<idno type="IdRef">02636817X</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300009" type="indirect">
<org type="institution" xml:id="struct-300009" status="VALID">
<orgName>Institut National de Recherche en Informatique et en Automatique</orgName>
<orgName type="acronym">Inria</orgName>
<desc>
<address>
<addrLine>Domaine de Voluceau, Rocquencourt - BP 105, 78153 Le Chesnay Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/en/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300291" type="indirect">
<org type="institution" xml:id="struct-300291" status="OLD">
<orgName>Université Henri Poincaré - Nancy 1</orgName>
<orgName type="acronym">UHP</orgName>
<date type="end">2011-12-31</date>
<desc>
<address>
<addrLine>24-30 rue Lionnois, BP 60120, 54 003 NANCY cedex, France</addrLine>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300292" type="indirect">
<org type="institution" xml:id="struct-300292" status="OLD">
<orgName>Université Nancy 2</orgName>
<date type="end">2011-12-31</date>
<desc>
<address>
<addrLine>91 avenue de la Libération, BP 454, 54001 Nancy cedex</addrLine>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300293" type="indirect">
<org type="institution" xml:id="struct-300293" status="OLD">
<orgName>Institut National Polytechnique de Lorraine</orgName>
<orgName type="acronym">INPL</orgName>
<date type="end">2011-12-31</date>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-2496" type="direct">
<org type="laboratory" xml:id="struct-2496" status="OLD">
<orgName>INRIA Lorraine</orgName>
<desc>
<address>
<addrLine>615 rue du Jardin Botanique 54600 Villers-lès-Nancy</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/centre-de-recherche-inria/nancy-grand-est</ref>
</desc>
<listRelation>
<relation active="#struct-300009" type="direct"></relation>
</listRelation>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Nancy</settlement>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
</placeName>
<orgName type="university">Université Nancy 2</orgName>
<orgName type="institution" wicri:auto="newGroup">Université de Lorraine</orgName>
<placeName>
<settlement type="city">Nancy</settlement>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
</placeName>
<orgName type="university">Institut national polytechnique de Lorraine</orgName>
<orgName type="institution" wicri:auto="newGroup">Université de Lorraine</orgName>
</affiliation>
</author>
<author>
<name sortKey="Najman, Laurent" sort="Najman, Laurent" uniqKey="Najman L" first="Laurent" last="Najman">Laurent Najman</name>
</author>
<author>
<name sortKey="Ehrhard, Romain" sort="Ehrhard, Romain" uniqKey="Ehrhard R" first="Romain" last="Ehrhard">Romain Ehrhard</name>
<affiliation wicri:level="1">
<hal:affiliation type="department" xml:id="struct-14430" status="VALID">
<orgName>École nationale supérieure des Mines de Nancy</orgName>
<orgName type="acronym">Mines Nancy</orgName>
<desc>
<address>
<addrLine>Campus Artem - CS 14 234 - 54042 Nancy cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.mines.inpl-nancy.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-302102" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-302102" type="direct">
<org type="institution" xml:id="struct-302102" status="VALID">
<orgName>Institut Mines-Télécom</orgName>
<desc>
<address>
<addrLine>46 rue Barrault -75634 Paris Cedex 13</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.mines-telecom.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
<author>
<name sortKey="Louis, Celine" sort="Louis, Celine" uniqKey="Louis C" first="Céline" last="Louis">Céline Louis</name>
<affiliation wicri:level="1">
<hal:affiliation type="department" xml:id="struct-14430" status="VALID">
<orgName>École nationale supérieure des Mines de Nancy</orgName>
<orgName type="acronym">Mines Nancy</orgName>
<desc>
<address>
<addrLine>Campus Artem - CS 14 234 - 54042 Nancy cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.mines.inpl-nancy.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-302102" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-302102" type="direct">
<org type="institution" xml:id="struct-302102" status="VALID">
<orgName>Institut Mines-Télécom</orgName>
<desc>
<address>
<addrLine>46 rue Barrault -75634 Paris Cedex 13</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.mines-telecom.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
<author>
<name sortKey="Quelain, Franck" sort="Quelain, Franck" uniqKey="Quelain F" first="Franck" last="Quélain">Franck Quélain</name>
<affiliation wicri:level="1">
<hal:affiliation type="department" xml:id="struct-14430" status="VALID">
<orgName>École nationale supérieure des Mines de Nancy</orgName>
<orgName type="acronym">Mines Nancy</orgName>
<desc>
<address>
<addrLine>Campus Artem - CS 14 234 - 54042 Nancy cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.mines.inpl-nancy.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-302102" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-302102" type="direct">
<org type="institution" xml:id="struct-302102" status="VALID">
<orgName>Institut Mines-Télécom</orgName>
<desc>
<address>
<addrLine>46 rue Barrault -75634 Paris Cedex 13</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.mines-telecom.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
<author>
<name sortKey="Rouyer, Nicolas" sort="Rouyer, Nicolas" uniqKey="Rouyer N" first="Nicolas" last="Rouyer">Nicolas Rouyer</name>
<affiliation wicri:level="1">
<hal:affiliation type="department" xml:id="struct-14430" status="VALID">
<orgName>École nationale supérieure des Mines de Nancy</orgName>
<orgName type="acronym">Mines Nancy</orgName>
<desc>
<address>
<addrLine>Campus Artem - CS 14 234 - 54042 Nancy cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.mines.inpl-nancy.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-302102" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-302102" type="direct">
<org type="institution" xml:id="struct-302102" status="VALID">
<orgName>Institut Mines-Télécom</orgName>
<desc>
<address>
<addrLine>46 rue Barrault -75634 Paris Cedex 13</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.mines-telecom.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
<author>
<name sortKey="Zeghache, Nabil" sort="Zeghache, Nabil" uniqKey="Zeghache N" first="Nabil" last="Zeghache">Nabil Zeghache</name>
<affiliation wicri:level="1">
<hal:affiliation type="department" xml:id="struct-14430" status="VALID">
<orgName>École nationale supérieure des Mines de Nancy</orgName>
<orgName type="acronym">Mines Nancy</orgName>
<desc>
<address>
<addrLine>Campus Artem - CS 14 234 - 54042 Nancy cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.mines.inpl-nancy.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-302102" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-302102" type="direct">
<org type="institution" xml:id="struct-302102" status="VALID">
<orgName>Institut Mines-Télécom</orgName>
<desc>
<address>
<addrLine>46 rue Barrault -75634 Paris Cedex 13</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.mines-telecom.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">HAL</idno>
<idno type="RBID">Hal:inria-00100445</idno>
<idno type="halId">inria-00100445</idno>
<idno type="halUri">https://hal.inria.fr/inria-00100445</idno>
<idno type="url">https://hal.inria.fr/inria-00100445</idno>
<date when="2001">2001</date>
<idno type="wicri:Area/Hal/Corpus">004365</idno>
<idno type="wicri:Area/Hal/Curation">004365</idno>
<idno type="wicri:Area/Hal/Checkpoint">005D39</idno>
<idno type="wicri:explorRef" wicri:stream="Hal" wicri:step="Checkpoint">005D39</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Scan-to-XML for Vector Graphics: an experimental setup for intelligent browsable document generation</title>
<author>
<name sortKey="Lamiroy, Bart" sort="Lamiroy, Bart" uniqKey="Lamiroy B" first="Bart" last="Lamiroy">Bart Lamiroy</name>
<affiliation wicri:level="1">
<hal:affiliation type="researchteam" xml:id="struct-2350" status="OLD">
<idno type="RNSR">199521440F</idno>
<orgName>Models, algorithms and geometry for computer graphics and vision</orgName>
<orgName type="acronym">ISA</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/equipes/isa</ref>
</desc>
<listRelation>
<relation active="#struct-160" type="direct"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-300291" type="indirect"></relation>
<relation active="#struct-300292" type="indirect"></relation>
<relation active="#struct-300293" type="indirect"></relation>
<relation active="#struct-2496" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-160" type="direct">
<org type="laboratory" xml:id="struct-160" status="OLD">
<orgName>Laboratoire Lorrain de Recherche en Informatique et ses Applications</orgName>
<orgName type="acronym">LORIA</orgName>
<desc>
<address>
<addrLine>Campus Scientifique BP 239 54506 Vandoeuvre-lès-Nancy Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr</ref>
</desc>
<listRelation>
<relation name="UMR7503" active="#struct-441569" type="direct"></relation>
<relation active="#struct-300009" type="direct"></relation>
<relation active="#struct-300291" type="direct"></relation>
<relation active="#struct-300292" type="direct"></relation>
<relation active="#struct-300293" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle name="UMR7503" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="ISNI">0000000122597504</idno>
<idno type="IdRef">02636817X</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300009" type="indirect">
<org type="institution" xml:id="struct-300009" status="VALID">
<orgName>Institut National de Recherche en Informatique et en Automatique</orgName>
<orgName type="acronym">Inria</orgName>
<desc>
<address>
<addrLine>Domaine de Voluceau, Rocquencourt - BP 105, 78153 Le Chesnay Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/en/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300291" type="indirect">
<org type="institution" xml:id="struct-300291" status="OLD">
<orgName>Université Henri Poincaré - Nancy 1</orgName>
<orgName type="acronym">UHP</orgName>
<date type="end">2011-12-31</date>
<desc>
<address>
<addrLine>24-30 rue Lionnois, BP 60120, 54 003 NANCY cedex, France</addrLine>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300292" type="indirect">
<org type="institution" xml:id="struct-300292" status="OLD">
<orgName>Université Nancy 2</orgName>
<date type="end">2011-12-31</date>
<desc>
<address>
<addrLine>91 avenue de la Libération, BP 454, 54001 Nancy cedex</addrLine>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300293" type="indirect">
<org type="institution" xml:id="struct-300293" status="OLD">
<orgName>Institut National Polytechnique de Lorraine</orgName>
<orgName type="acronym">INPL</orgName>
<date type="end">2011-12-31</date>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-2496" type="direct">
<org type="laboratory" xml:id="struct-2496" status="OLD">
<orgName>INRIA Lorraine</orgName>
<desc>
<address>
<addrLine>615 rue du Jardin Botanique 54600 Villers-lès-Nancy</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/centre-de-recherche-inria/nancy-grand-est</ref>
</desc>
<listRelation>
<relation active="#struct-300009" type="direct"></relation>
</listRelation>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Nancy</settlement>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
</placeName>
<orgName type="university">Université Nancy 2</orgName>
<orgName type="institution" wicri:auto="newGroup">Université de Lorraine</orgName>
<placeName>
<settlement type="city">Nancy</settlement>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
</placeName>
<orgName type="university">Institut national polytechnique de Lorraine</orgName>
<orgName type="institution" wicri:auto="newGroup">Université de Lorraine</orgName>
</affiliation>
</author>
<author>
<name sortKey="Najman, Laurent" sort="Najman, Laurent" uniqKey="Najman L" first="Laurent" last="Najman">Laurent Najman</name>
</author>
<author>
<name sortKey="Ehrhard, Romain" sort="Ehrhard, Romain" uniqKey="Ehrhard R" first="Romain" last="Ehrhard">Romain Ehrhard</name>
<affiliation wicri:level="1">
<hal:affiliation type="department" xml:id="struct-14430" status="VALID">
<orgName>École nationale supérieure des Mines de Nancy</orgName>
<orgName type="acronym">Mines Nancy</orgName>
<desc>
<address>
<addrLine>Campus Artem - CS 14 234 - 54042 Nancy cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.mines.inpl-nancy.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-302102" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-302102" type="direct">
<org type="institution" xml:id="struct-302102" status="VALID">
<orgName>Institut Mines-Télécom</orgName>
<desc>
<address>
<addrLine>46 rue Barrault -75634 Paris Cedex 13</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.mines-telecom.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
<author>
<name sortKey="Louis, Celine" sort="Louis, Celine" uniqKey="Louis C" first="Céline" last="Louis">Céline Louis</name>
<affiliation wicri:level="1">
<hal:affiliation type="department" xml:id="struct-14430" status="VALID">
<orgName>École nationale supérieure des Mines de Nancy</orgName>
<orgName type="acronym">Mines Nancy</orgName>
<desc>
<address>
<addrLine>Campus Artem - CS 14 234 - 54042 Nancy cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.mines.inpl-nancy.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-302102" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-302102" type="direct">
<org type="institution" xml:id="struct-302102" status="VALID">
<orgName>Institut Mines-Télécom</orgName>
<desc>
<address>
<addrLine>46 rue Barrault -75634 Paris Cedex 13</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.mines-telecom.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
<author>
<name sortKey="Quelain, Franck" sort="Quelain, Franck" uniqKey="Quelain F" first="Franck" last="Quélain">Franck Quélain</name>
<affiliation wicri:level="1">
<hal:affiliation type="department" xml:id="struct-14430" status="VALID">
<orgName>École nationale supérieure des Mines de Nancy</orgName>
<orgName type="acronym">Mines Nancy</orgName>
<desc>
<address>
<addrLine>Campus Artem - CS 14 234 - 54042 Nancy cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.mines.inpl-nancy.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-302102" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-302102" type="direct">
<org type="institution" xml:id="struct-302102" status="VALID">
<orgName>Institut Mines-Télécom</orgName>
<desc>
<address>
<addrLine>46 rue Barrault -75634 Paris Cedex 13</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.mines-telecom.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
<author>
<name sortKey="Rouyer, Nicolas" sort="Rouyer, Nicolas" uniqKey="Rouyer N" first="Nicolas" last="Rouyer">Nicolas Rouyer</name>
<affiliation wicri:level="1">
<hal:affiliation type="department" xml:id="struct-14430" status="VALID">
<orgName>École nationale supérieure des Mines de Nancy</orgName>
<orgName type="acronym">Mines Nancy</orgName>
<desc>
<address>
<addrLine>Campus Artem - CS 14 234 - 54042 Nancy cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.mines.inpl-nancy.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-302102" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-302102" type="direct">
<org type="institution" xml:id="struct-302102" status="VALID">
<orgName>Institut Mines-Télécom</orgName>
<desc>
<address>
<addrLine>46 rue Barrault -75634 Paris Cedex 13</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.mines-telecom.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
<author>
<name sortKey="Zeghache, Nabil" sort="Zeghache, Nabil" uniqKey="Zeghache N" first="Nabil" last="Zeghache">Nabil Zeghache</name>
<affiliation wicri:level="1">
<hal:affiliation type="department" xml:id="struct-14430" status="VALID">
<orgName>École nationale supérieure des Mines de Nancy</orgName>
<orgName type="acronym">Mines Nancy</orgName>
<desc>
<address>
<addrLine>Campus Artem - CS 14 234 - 54042 Nancy cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.mines.inpl-nancy.fr/</ref>
</desc>
<listRelation>
<relation active="#struct-302102" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-302102" type="direct">
<org type="institution" xml:id="struct-302102" status="VALID">
<orgName>Institut Mines-Télécom</orgName>
<desc>
<address>
<addrLine>46 rue Barrault -75634 Paris Cedex 13</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.mines-telecom.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="mix" xml:lang="fr">
<term>algèbre de composantes</term>
<term>analyse de documents</term>
<term>automated generation</term>
<term>component algebra</term>
<term>document analysis</term>
<term>génération automatique</term>
<term>hyper-lien</term>
<term>hyperlink</term>
<term>xml</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">This paper describes an experimental setup, conducted in collaboration with the ISA research group of the LORIA laboratory, Océ-PLT, and students from the École des Mines de Nancy. The main objective is to experiment an approach to develop a high level document analysis platform by composing existing bricks from a comprehensive library of state-of-the art algorithms. The test-case of this methodology consists in the realization of a fully automated method of generating a browsable, hyper-linked document from a simple scanned image. We concentrated our work on cutaway diagrams. These documents present the advantage of containing simple browsing semantics, in the sense that they consist of a clearly identifiable legend containing index references, plus a drawing containing one or more occurrences of the same indices. The setup described in this paper starts from a raw binary image of a cutaway diagram, and delivers an XML description matching the references of the legend with the indices in the image, and a browser for interpreting the XML generated map. The complete document treatment pipeline is conceived within a combined scripting and compiled library environment.</div>
</front>
</TEI>
<hal api="V3">
<titleStmt>
<title xml:lang="en">Scan-to-XML for Vector Graphics: an experimental setup for intelligent browsable document generation</title>
<author role="aut">
<persName>
<forename type="first">Bart</forename>
<surname>Lamiroy</surname>
</persName>
<email>Bart.Lamiroy@loria.fr</email>
<idno type="idhal">bart-lamiroy</idno>
<idno type="halauthor">131730</idno>
<idno type="ResearcherId">http://www.researcherid.com/rid/A-7746-2010</idno>
<idno type="IdRef">http://www.idref.fr/111726980</idno>
<idno type="ORCID">http://orcid.org/0000-0003-0871-0149</idno>
<orgName ref="#struct-364480"></orgName>
<affiliation ref="#struct-2350"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Laurent</forename>
<surname>Najman</surname>
</persName>
<email>laurent@najman.org</email>
<idno type="idhal">laurent-najman</idno>
<idno type="halauthor">533748</idno>
<idno type="ORCID">http://orcid.org/0000-0002-6190-0235</idno>
<idno type="arXiv">http://arxiv.org/a/lnajman</idno>
</author>
<author role="aut">
<persName>
<forename type="first">Romain</forename>
<surname>Ehrhard</surname>
</persName>
<email></email>
<idno type="halauthor">131732</idno>
<orgName ref="#struct-14430"></orgName>
<affiliation ref="#struct-14430"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Céline</forename>
<surname>Louis</surname>
</persName>
<email></email>
<idno type="halauthor">131733</idno>
<orgName ref="#struct-14430"></orgName>
<affiliation ref="#struct-14430"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Franck</forename>
<surname>Quélain</surname>
</persName>
<email></email>
<idno type="halauthor">131734</idno>
<orgName ref="#struct-14430"></orgName>
<affiliation ref="#struct-14430"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Nicolas</forename>
<surname>Rouyer</surname>
</persName>
<email></email>
<idno type="halauthor">131735</idno>
<orgName ref="#struct-14430"></orgName>
<affiliation ref="#struct-14430"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Nabil</forename>
<surname>Zeghache</surname>
</persName>
<email></email>
<idno type="halauthor">131736</idno>
<orgName ref="#struct-14430"></orgName>
<affiliation ref="#struct-14430"></affiliation>
</author>
<editor role="depositor">
<persName>
<forename>Publications</forename>
<surname>Loria</surname>
</persName>
<email>publications@loria.fr</email>
</editor>
</titleStmt>
<editionStmt>
<edition n="v1" type="current">
<date type="whenSubmitted">2006-09-26 14:45:26</date>
<date type="whenModified">2016-05-19 01:09:06</date>
<date type="whenReleased">2006-09-28 15:22:47</date>
<date type="whenProduced">2001</date>
</edition>
<respStmt>
<resp>contributor</resp>
<name key="108626">
<persName>
<forename>Publications</forename>
<surname>Loria</surname>
</persName>
<email>publications@loria.fr</email>
</name>
</respStmt>
</editionStmt>
<publicationStmt>
<distributor>CCSD</distributor>
<idno type="halId">inria-00100445</idno>
<idno type="halUri">https://hal.inria.fr/inria-00100445</idno>
<idno type="halBibtex">lamiroy:inria-00100445</idno>
<idno type="halRefHtml">Fourth IAPR International Workshop on Graphics Recognition, 2001, Kingston, Ontario, Canada, 14 p, 2001</idno>
<idno type="halRef">Fourth IAPR International Workshop on Graphics Recognition, 2001, Kingston, Ontario, Canada, 14 p, 2001</idno>
</publicationStmt>
<seriesStmt>
<idno type="stamp" n="INRIA">INRIA - Institut National de Recherche en Informatique et en Automatique</idno>
<idno type="stamp" n="CNRS">CNRS - Centre national de la recherche scientifique</idno>
<idno type="stamp" n="INPL">Institut National Polytechnique de Lorraine</idno>
<idno type="stamp" n="LORIA2">Publications du LORIA</idno>
<idno type="stamp" n="LORIA">LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications</idno>
<idno type="stamp" n="INRIA-NANCY-GRAND-EST">INRIA Nancy - Grand Est</idno>
<idno type="stamp" n="UNIV-LORRAINE">Université de Lorraine</idno>
<idno type="stamp" n="EM-NANCY">Ecole Nationale Supérieure des Mines de Nancy</idno>
<idno type="stamp" n="INSTITUT-TELECOM">Institut Télécom</idno>
<idno type="stamp" n="INRIA-LORRAINE">INRIA Nancy - Grand Est</idno>
<idno type="stamp" n="LABO-LORIA-SET" p="LORIA">LABO-LORIA-SET</idno>
</seriesStmt>
<notesStmt>
<note type="commentary">Colloque avec actes et comité de lecture. internationale.</note>
<note type="audience" n="2">International</note>
<note type="invited" n="0">No</note>
<note type="popular" n="0">No</note>
<note type="peer" n="1">Yes</note>
<note type="proceedings" n="1">Yes</note>
</notesStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Scan-to-XML for Vector Graphics: an experimental setup for intelligent browsable document generation</title>
<author role="aut">
<persName>
<forename type="first">Bart</forename>
<surname>Lamiroy</surname>
</persName>
<email>Bart.Lamiroy@loria.fr</email>
<idno type="idHal">bart-lamiroy</idno>
<idno type="halAuthorId">131730</idno>
<idno type="ResearcherId">http://www.researcherid.com/rid/A-7746-2010</idno>
<idno type="IdRef">http://www.idref.fr/111726980</idno>
<idno type="ORCID">http://orcid.org/0000-0003-0871-0149</idno>
<orgName ref="#struct-364480"></orgName>
<affiliation ref="#struct-2350"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Laurent</forename>
<surname>Najman</surname>
</persName>
<email>laurent@najman.org</email>
<idno type="idHal">laurent-najman</idno>
<idno type="halAuthorId">533748</idno>
<idno type="ORCID">http://orcid.org/0000-0002-6190-0235</idno>
<idno type="arXiv">http://arxiv.org/a/lnajman</idno>
</author>
<author role="aut">
<persName>
<forename type="first">Romain</forename>
<surname>Ehrhard</surname>
</persName>
<idno type="halAuthorId">131732</idno>
<orgName ref="#struct-14430"></orgName>
<affiliation ref="#struct-14430"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Céline</forename>
<surname>Louis</surname>
</persName>
<idno type="halAuthorId">131733</idno>
<orgName ref="#struct-14430"></orgName>
<affiliation ref="#struct-14430"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Franck</forename>
<surname>Quélain</surname>
</persName>
<idno type="halAuthorId">131734</idno>
<orgName ref="#struct-14430"></orgName>
<affiliation ref="#struct-14430"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Nicolas</forename>
<surname>Rouyer</surname>
</persName>
<idno type="halAuthorId">131735</idno>
<orgName ref="#struct-14430"></orgName>
<affiliation ref="#struct-14430"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Nabil</forename>
<surname>Zeghache</surname>
</persName>
<idno type="halAuthorId">131736</idno>
<orgName ref="#struct-14430"></orgName>
<affiliation ref="#struct-14430"></affiliation>
</author>
</analytic>
<monogr>
<idno type="localRef">A01-R-077 || lamiroy01a</idno>
<meeting>
<title>Fourth IAPR International Workshop on Graphics Recognition</title>
<date type="start">2001</date>
<settlement>Kingston, Ontario, Canada</settlement>
</meeting>
<imprint>
<biblScope unit="pp">14 p</biblScope>
<date type="datePub">2001</date>
</imprint>
</monogr>
</biblStruct>
</sourceDesc>
<profileDesc>
<langUsage>
<language ident="en">English</language>
</langUsage>
<textClass>
<keywords scheme="author">
<term xml:lang="fr">component algebra</term>
<term xml:lang="fr">automated generation</term>
<term xml:lang="fr">hyperlink</term>
<term xml:lang="fr">génération automatique</term>
<term xml:lang="fr">hyper-lien</term>
<term xml:lang="fr">algèbre de composantes</term>
<term xml:lang="fr">document analysis</term>
<term xml:lang="fr">xml</term>
<term xml:lang="fr">analyse de documents</term>
</keywords>
<classCode scheme="halDomain" n="info.info-oh">Computer Science [cs]/Other [cs.OH]</classCode>
<classCode scheme="halTypology" n="COMM">Conference papers</classCode>
</textClass>
<abstract xml:lang="en">This paper describes an experimental setup, conducted in collaboration with the ISA research group of the LORIA laboratory, Océ-PLT, and students from the École des Mines de Nancy. The main objective is to experiment an approach to develop a high level document analysis platform by composing existing bricks from a comprehensive library of state-of-the art algorithms. The test-case of this methodology consists in the realization of a fully automated method of generating a browsable, hyper-linked document from a simple scanned image. We concentrated our work on cutaway diagrams. These documents present the advantage of containing simple browsing semantics, in the sense that they consist of a clearly identifiable legend containing index references, plus a drawing containing one or more occurrences of the same indices. The setup described in this paper starts from a raw binary image of a cutaway diagram, and delivers an XML description matching the references of the legend with the indices in the image, and a browser for interpreting the XML generated map. The complete document treatment pipeline is conceived within a combined scripting and compiled library environment.</abstract>
</profileDesc>
</hal>
</record>
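
A record of this kind can be read with a standard XML parser. The Python sketch below shows one possible way to extract the title and author names; it is only an illustration under stated assumptions. It works on a short fragment copied from the `<hal>` part of the record, because the full record as displayed uses the `wicri:` and `hal:` prefixes without visible namespace declarations, which a strict parser would likely reject. The helper name `list_authors` is hypothetical.

```python
import xml.etree.ElementTree as ET

# Fragment copied from the <hal> section of the record (two authors only).
FRAGMENT = """
<titleStmt>
<title xml:lang="en">Scan-to-XML for Vector Graphics: an experimental setup for intelligent browsable document generation</title>
<author role="aut">
<persName>
<forename type="first">Bart</forename>
<surname>Lamiroy</surname>
</persName>
</author>
<author role="aut">
<persName>
<forename type="first">Laurent</forename>
<surname>Najman</surname>
</persName>
</author>
</titleStmt>
"""


def list_authors(xml_text):
    """Return (title, [author names]) from a titleStmt-like fragment."""
    root = ET.fromstring(xml_text.strip())
    names = []
    for pers in root.iter("persName"):
        forename = pers.findtext("forename", default="")
        surname = pers.findtext("surname", default="")
        names.append(f"{forename} {surname}".strip())
    return root.findtext("title"), names


title, authors = list_authors(FRAGMENT)
print(title)
print(authors)
```

To process the whole record this way, the missing namespace declarations would first have to be supplied, for example on a wrapping root element.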

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Hal/Checkpoint
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 005D39 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Hal/Checkpoint/biblio.hfd -nk 005D39 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    Hal
   |étape=   Checkpoint
   |type=    RBID
   |clé=     Hal:inria-00100445
   |texte=   Scan-to-XML for Vector Graphics: an experimental setup for intelligent browsable document generation
}}

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022