OCR exploration server

Exploring Digital Libraries with Document Image Retrieval

Internal identifier: 000339 (Istex/Curation); previous: 000338; next: 000340

Authors: Simone Marinai [Italy]; Emanuele Marino [Italy]; Giovanni Soda [Italy]

Source: Lecture Notes in Computer Science (ISSN 0302-9743), 2007

RBID : ISTEX:5F3A004C275E411FB7FED9DA2A1A40971B6E44EB

Abstract

In this paper, we describe a system to perform Document Image Retrieval in Digital Libraries. The system allows users to retrieve digitized pages on the basis of layout similarities and to make textual searches on the documents without relying on OCR. The system is discussed in the context of recent applications of document image retrieval in the field of Digital Libraries. We present the different techniques in a single framework in which the emphasis is put on the representation level at which the similarity between the query and the indexed documents is computed. We also report the results of some recent experiments on the use of layout-based document image retrieval.

Url: https://api.istex.fr/document/5F3A004C275E411FB7FED9DA2A1A40971B6E44EB/fulltext/pdf
DOI: 10.1007/978-3-540-74851-9_31

The document in XML format

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Exploring Digital Libraries with Document Image Retrieval</title>
<author>
<name sortKey="Marinai, Simone" sort="Marinai, Simone" uniqKey="Marinai S" first="Simone" last="Marinai">Simone Marinai</name>
<affiliation wicri:level="1">
<mods:affiliation>Dipartimento di Sistemi e Informatica - Università di Firenze, Via S.Marta, 3 - 50139 Firenze, Italy</mods:affiliation>
<country xml:lang="fr">Italie</country>
<wicri:regionArea>Dipartimento di Sistemi e Informatica - Università di Firenze, Via S.Marta, 3 - 50139 Firenze</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1">
<mods:affiliation>E-mail: marinai@dsi.unifi.it</mods:affiliation>
<country wicri:rule="url">Italie</country>
</affiliation>
</author>
<author>
<name sortKey="Marino, Emanuele" sort="Marino, Emanuele" uniqKey="Marino E" first="Emanuele" last="Marino">Emanuele Marino</name>
<affiliation wicri:level="1">
<mods:affiliation>Dipartimento di Sistemi e Informatica - Università di Firenze, Via S.Marta, 3 - 50139 Firenze, Italy</mods:affiliation>
<country xml:lang="fr">Italie</country>
<wicri:regionArea>Dipartimento di Sistemi e Informatica - Università di Firenze, Via S.Marta, 3 - 50139 Firenze</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Soda, Giovanni" sort="Soda, Giovanni" uniqKey="Soda G" first="Giovanni" last="Soda">Giovanni Soda</name>
<affiliation wicri:level="1">
<mods:affiliation>Dipartimento di Sistemi e Informatica - Università di Firenze, Via S.Marta, 3 - 50139 Firenze, Italy</mods:affiliation>
<country xml:lang="fr">Italie</country>
<wicri:regionArea>Dipartimento di Sistemi e Informatica - Università di Firenze, Via S.Marta, 3 - 50139 Firenze</wicri:regionArea>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:5F3A004C275E411FB7FED9DA2A1A40971B6E44EB</idno>
<date when="2007" year="2007">2007</date>
<idno type="doi">10.1007/978-3-540-74851-9_31</idno>
<idno type="url">https://api.istex.fr/document/5F3A004C275E411FB7FED9DA2A1A40971B6E44EB/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000344</idno>
<idno type="wicri:Area/Istex/Curation">000339</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Exploring Digital Libraries with Document Image Retrieval</title>
<author>
<name sortKey="Marinai, Simone" sort="Marinai, Simone" uniqKey="Marinai S" first="Simone" last="Marinai">Simone Marinai</name>
<affiliation wicri:level="1">
<mods:affiliation>Dipartimento di Sistemi e Informatica - Università di Firenze, Via S.Marta, 3 - 50139 Firenze, Italy</mods:affiliation>
<country xml:lang="fr">Italie</country>
<wicri:regionArea>Dipartimento di Sistemi e Informatica - Università di Firenze, Via S.Marta, 3 - 50139 Firenze</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1">
<mods:affiliation>E-mail: marinai@dsi.unifi.it</mods:affiliation>
<country wicri:rule="url">Italie</country>
</affiliation>
</author>
<author>
<name sortKey="Marino, Emanuele" sort="Marino, Emanuele" uniqKey="Marino E" first="Emanuele" last="Marino">Emanuele Marino</name>
<affiliation wicri:level="1">
<mods:affiliation>Dipartimento di Sistemi e Informatica - Università di Firenze, Via S.Marta, 3 - 50139 Firenze, Italy</mods:affiliation>
<country xml:lang="fr">Italie</country>
<wicri:regionArea>Dipartimento di Sistemi e Informatica - Università di Firenze, Via S.Marta, 3 - 50139 Firenze</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Soda, Giovanni" sort="Soda, Giovanni" uniqKey="Soda G" first="Giovanni" last="Soda">Giovanni Soda</name>
<affiliation wicri:level="1">
<mods:affiliation>Dipartimento di Sistemi e Informatica - Università di Firenze, Via S.Marta, 3 - 50139 Firenze, Italy</mods:affiliation>
<country xml:lang="fr">Italie</country>
<wicri:regionArea>Dipartimento di Sistemi e Informatica - Università di Firenze, Via S.Marta, 3 - 50139 Firenze</wicri:regionArea>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2007</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
</series>
<idno type="istex">5F3A004C275E411FB7FED9DA2A1A40971B6E44EB</idno>
<idno type="DOI">10.1007/978-3-540-74851-9_31</idno>
<idno type="ChapterID">31</idno>
<idno type="ChapterID">Chap31</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: In this paper, we describe a system to perform Document Image Retrieval in Digital Libraries. The system allows users to retrieve digitized pages on the basis of layout similarities and to make textual searches on the documents without relying on OCR. The system is discussed in the context of recent applications of document image retrieval in the field of Digital Libraries. We present the different techniques in a single framework in which the emphasis is put on the representation level at which the similarity between the query and the indexed documents is computed. We also report the results of some recent experiments on the use of layout-based document image retrieval.</div>
</front>
</TEI>
</record>
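
The record above uses the `wicri:` and `mods:` prefixes without declaring them, so a strict XML parser will likely reject it as it stands. The following is a minimal Python sketch of one way to read it anyway: it injects placeholder namespace declarations into the `<record>` start tag before parsing. The namespace URIs, the function names `parse_record` and `summarize`, and the shortened `SAMPLE` string are assumptions made for illustration; they are not part of Dilib or of the Wicri tooling.

```python
import xml.etree.ElementTree as ET

# Assumption: placeholder namespace URIs, declared only so the parser accepts
# the wicri: and mods: prefixes that the record uses without declaring them.
PLACEHOLDER_NS = {
    "wicri": "urn:example:wicri",
    "mods": "urn:example:mods",
}


def parse_record(xml_text):
    """Parse a <record> string whose namespace prefixes are undeclared."""
    decls = " ".join(f'xmlns:{p}="{u}"' for p, u in PLACEHOLDER_NS.items())
    # Inject the declarations into the first <record> start tag only.
    patched = xml_text.replace("<record>", f"<record {decls}>", 1)
    return ET.fromstring(patched)


def summarize(record):
    """Collect title, authors, DOI and year from the parsed record."""
    analytic = record.find(".//analytic")
    return {
        "title": analytic.findtext("title"),
        "authors": [name.text for name in analytic.findall("author/name")],
        "doi": record.findtext(".//publicationStmt/idno[@type='doi']"),
        "year": record.findtext(".//publicationStmt/date"),
    }


# Shortened copy of the record above (hypothetical trimming, same element names).
SAMPLE = """<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<date when="2007" year="2007">2007</date>
<idno type="doi">10.1007/978-3-540-74851-9_31</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Exploring Digital Libraries with Document Image Retrieval</title>
<author>
<name sortKey="Marinai, Simone">Simone Marinai</name>
<affiliation wicri:level="1">
<mods:affiliation>E-mail: marinai@dsi.unifi.it</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Marino, Emanuele">Emanuele Marino</name>
</author>
<author>
<name sortKey="Soda, Giovanni">Giovanni Soda</name>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
</teiHeader>
</TEI>
</record>"""

if __name__ == "__main__":
    info = summarize(parse_record(SAMPLE))
    for key, value in info.items():
        print(f"{key}: {value}")
```

Authors are read from the `<analytic>` block rather than from `<titleStmt>` because the record lists each author in both places; querying a single block avoids counting them twice.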

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Istex/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000339 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Istex/Curation/biblio.hfd -nk 000339 | SxmlIndent | more

To link to this page from within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Istex
   |étape=   Curation
   |type=    RBID
   |clé=     ISTEX:5F3A004C275E411FB7FED9DA2A1A40971B6E44EB
   |texte=   Exploring Digital Libraries with Document Image Retrieval
}}

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024