Exploring Digital Libraries with Document Image Retrieval
Internal identifier: 000E55 (Main/Curation); previous: 000E54; next: 000E56
Authors: Simone Marinai [Italy]; Emanuele Marino [Italy]; Giovanni Soda [Italy]
Source:
- Lecture Notes in Computer Science [ 0302-9743 ] ; 2007.
Abstract
In this paper, we describe a system to perform Document Image Retrieval in Digital Libraries. The system allows users to retrieve digitized pages on the basis of layout similarities and to make textual searches on the documents without relying on OCR. The system is discussed in the context of recent applications of document image retrieval in the field of Digital Libraries. We present the different techniques in a single framework in which the emphasis is put on the representation level at which the similarity between the query and the indexed documents is computed. We also report the results of some recent experiments on the use of layout-based document image retrieval.
DOI: 10.1007/978-3-540-74851-9_31
Links toward previous steps (curation, corpus...)
- to stream Istex, to step Corpus: 000344
- to stream Istex, to step Curation: 000339
- to stream Istex, to step Checkpoint: 000870
- to stream Main, to step Merge: 000E68
Links to Exploration step
ISTEX:5F3A004C275E411FB7FED9DA2A1A40971B6E44EB
The document in XML format
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title xml:lang="en">Exploring Digital Libraries with Document Image Retrieval</title>
<author><name sortKey="Marinai, Simone" sort="Marinai, Simone" uniqKey="Marinai S" first="Simone" last="Marinai">Simone Marinai</name>
</author>
<author><name sortKey="Marino, Emanuele" sort="Marino, Emanuele" uniqKey="Marino E" first="Emanuele" last="Marino">Emanuele Marino</name>
</author>
<author><name sortKey="Soda, Giovanni" sort="Soda, Giovanni" uniqKey="Soda G" first="Giovanni" last="Soda">Giovanni Soda</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:5F3A004C275E411FB7FED9DA2A1A40971B6E44EB</idno>
<date when="2007" year="2007">2007</date>
<idno type="doi">10.1007/978-3-540-74851-9_31</idno>
<idno type="url">https://api.istex.fr/document/5F3A004C275E411FB7FED9DA2A1A40971B6E44EB/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000344</idno>
<idno type="wicri:Area/Istex/Curation">000339</idno>
<idno type="wicri:Area/Istex/Checkpoint">000870</idno>
<idno type="wicri:doubleKey">0302-9743:2007:Marinai S:exploring:digital:libraries</idno>
<idno type="wicri:Area/Main/Merge">000E68</idno>
<idno type="wicri:Area/Main/Curation">000E55</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">Exploring Digital Libraries with Document Image Retrieval</title>
<author><name sortKey="Marinai, Simone" sort="Marinai, Simone" uniqKey="Marinai S" first="Simone" last="Marinai">Simone Marinai</name>
<affiliation wicri:level="1"><country xml:lang="fr">Italie</country>
<wicri:regionArea>Dipartimento di Sistemi e Informatica - Università di Firenze, Via S.Marta, 3 - 50139 Firenze</wicri:regionArea>
<wicri:noRegion>3 - 50139 Firenze</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Italie</country>
</affiliation>
</author>
<author><name sortKey="Marino, Emanuele" sort="Marino, Emanuele" uniqKey="Marino E" first="Emanuele" last="Marino">Emanuele Marino</name>
<affiliation wicri:level="1"><country xml:lang="fr">Italie</country>
<wicri:regionArea>Dipartimento di Sistemi e Informatica - Università di Firenze, Via S.Marta, 3 - 50139 Firenze</wicri:regionArea>
<wicri:noRegion>3 - 50139 Firenze</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Soda, Giovanni" sort="Soda, Giovanni" uniqKey="Soda G" first="Giovanni" last="Soda">Giovanni Soda</name>
<affiliation wicri:level="1"><country xml:lang="fr">Italie</country>
<wicri:regionArea>Dipartimento di Sistemi e Informatica - Università di Firenze, Via S.Marta, 3 - 50139 Firenze</wicri:regionArea>
<wicri:noRegion>3 - 50139 Firenze</wicri:noRegion>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="s">Lecture Notes in Computer Science</title>
<imprint><date>2007</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">5F3A004C275E411FB7FED9DA2A1A40971B6E44EB</idno>
<idno type="DOI">10.1007/978-3-540-74851-9_31</idno>
<idno type="ChapterID">31</idno>
<idno type="ChapterID">Chap31</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Abstract: In this paper, we describe a system to perform Document Image Retrieval in Digital Libraries. The system allows users to retrieve digitized pages on the basis of layout similarities and to make textual searches on the documents without relying on OCR. The system is discussed in the context of recent applications of document image retrieval in the field of Digital Libraries. We present the different techniques in a single framework in which the emphasis is put on the representation level at which the similarity between the query and the indexed documents is computed. We also report the results of some recent experiments on the use of layout-based document image retrieval.</div>
</front>
</TEI>
</record>
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000E55 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/Main/Curation/biblio.hfd -nk 000E55 | SxmlIndent | more
To link to this page within the Wicri network
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Curation |type= RBID |clé= ISTEX:5F3A004C275E411FB7FED9DA2A1A40971B6E44EB |texte= Exploring Digital Libraries with Document Image Retrieval }}
This area was generated with Dilib version V0.6.32.