Exploring Digital Libraries with Document Image Retrieval
Internal identifier: 000E55 (Main/Exploration); previous: 000E54; next: 000E56
Authors: Simone Marinai [Italy]; Emanuele Marino [Italy]; Giovanni Soda [Italy]
Source:
- Lecture Notes in Computer Science [0302-9743]; 2007.
Abstract
In this paper, we describe a system to perform Document Image Retrieval in Digital Libraries. The system allows users to retrieve digitized pages on the basis of layout similarities and to make textual searches on the documents without relying on OCR. The system is discussed in the context of recent applications of document image retrieval in the field of Digital Libraries. We present the different techniques in a single framework in which the emphasis is put on the representation level at which the similarity between the query and the indexed documents is computed. We also report the results of some recent experiments on the use of layout-based document image retrieval.
DOI: 10.1007/978-3-540-74851-9_31
Links to previous steps (curation, corpus...)
- to stream Istex, to step Corpus: 000344
- to stream Istex, to step Curation: 000339
- to stream Istex, to step Checkpoint: 000870
- to stream Main, to step Merge: 000E68
- to stream Main, to step Curation: 000E55
The document in XML format
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title xml:lang="en">Exploring Digital Libraries with Document Image Retrieval</title>
<author><name sortKey="Marinai, Simone" sort="Marinai, Simone" uniqKey="Marinai S" first="Simone" last="Marinai">Simone Marinai</name>
</author>
<author><name sortKey="Marino, Emanuele" sort="Marino, Emanuele" uniqKey="Marino E" first="Emanuele" last="Marino">Emanuele Marino</name>
</author>
<author><name sortKey="Soda, Giovanni" sort="Soda, Giovanni" uniqKey="Soda G" first="Giovanni" last="Soda">Giovanni Soda</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:5F3A004C275E411FB7FED9DA2A1A40971B6E44EB</idno>
<date when="2007" year="2007">2007</date>
<idno type="doi">10.1007/978-3-540-74851-9_31</idno>
<idno type="url">https://api.istex.fr/document/5F3A004C275E411FB7FED9DA2A1A40971B6E44EB/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000344</idno>
<idno type="wicri:Area/Istex/Curation">000339</idno>
<idno type="wicri:Area/Istex/Checkpoint">000870</idno>
<idno type="wicri:doubleKey">0302-9743:2007:Marinai S:exploring:digital:libraries</idno>
<idno type="wicri:Area/Main/Merge">000E68</idno>
<idno type="wicri:Area/Main/Curation">000E55</idno>
<idno type="wicri:Area/Main/Exploration">000E55</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">Exploring Digital Libraries with Document Image Retrieval</title>
<author><name sortKey="Marinai, Simone" sort="Marinai, Simone" uniqKey="Marinai S" first="Simone" last="Marinai">Simone Marinai</name>
<affiliation wicri:level="1"><country xml:lang="fr">Italie</country>
<wicri:regionArea>Dipartimento di Sistemi e Informatica - Università di Firenze, Via S.Marta, 3 - 50139 Firenze</wicri:regionArea>
<wicri:noRegion>3 - 50139 Firenze</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Italie</country>
</affiliation>
</author>
<author><name sortKey="Marino, Emanuele" sort="Marino, Emanuele" uniqKey="Marino E" first="Emanuele" last="Marino">Emanuele Marino</name>
<affiliation wicri:level="1"><country xml:lang="fr">Italie</country>
<wicri:regionArea>Dipartimento di Sistemi e Informatica - Università di Firenze, Via S.Marta, 3 - 50139 Firenze</wicri:regionArea>
<wicri:noRegion>3 - 50139 Firenze</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Soda, Giovanni" sort="Soda, Giovanni" uniqKey="Soda G" first="Giovanni" last="Soda">Giovanni Soda</name>
<affiliation wicri:level="1"><country xml:lang="fr">Italie</country>
<wicri:regionArea>Dipartimento di Sistemi e Informatica - Università di Firenze, Via S.Marta, 3 - 50139 Firenze</wicri:regionArea>
<wicri:noRegion>3 - 50139 Firenze</wicri:noRegion>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="s">Lecture Notes in Computer Science</title>
<imprint><date>2007</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">5F3A004C275E411FB7FED9DA2A1A40971B6E44EB</idno>
<idno type="DOI">10.1007/978-3-540-74851-9_31</idno>
<idno type="ChapterID">31</idno>
<idno type="ChapterID">Chap31</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Abstract: In this paper, we describe a system to perform Document Image Retrieval in Digital Libraries. The system allows users to retrieve digitized pages on the basis of layout similarities and to make textual searches on the documents without relying on OCR. The system is discussed in the context of recent applications of document image retrieval in the field of Digital Libraries. We present the different techniques in a single framework in which the emphasis is put on the representation level at which the similarity between the query and the indexed documents is computed. We also report the results of some recent experiments on the use of layout-based document image retrieval.</div>
</front>
</TEI>
<affiliations><list><country><li>Italie</li>
</country>
</list>
<tree><country name="Italie"><noRegion><name sortKey="Marinai, Simone" sort="Marinai, Simone" uniqKey="Marinai S" first="Simone" last="Marinai">Simone Marinai</name>
</noRegion>
<name sortKey="Marinai, Simone" sort="Marinai, Simone" uniqKey="Marinai S" first="Simone" last="Marinai">Simone Marinai</name>
<name sortKey="Marino, Emanuele" sort="Marino, Emanuele" uniqKey="Marino E" first="Emanuele" last="Marino">Emanuele Marino</name>
<name sortKey="Soda, Giovanni" sort="Soda, Giovanni" uniqKey="Soda G" first="Giovanni" last="Soda">Giovanni Soda</name>
</country>
</tree>
</affiliations>
</record>
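As an illustration of how such a record might be read programmatically, here is a minimal Python sketch that pulls the title, author names, and DOI out of a trimmed copy of the record above. The function name `summarize` and the `SAMPLE` fragment are hypothetical and are not part of Dilib. The fragment deliberately omits the `wicri:`-prefixed attributes, because the dump shown here does not appear to declare that namespace prefix and a strict XML parser may reject it.

```python
import xml.etree.ElementTree as ET

# Trimmed, hypothetical excerpt of the record above (wicri:* attributes omitted).
SAMPLE = """<record><TEI><teiHeader><fileDesc>
<titleStmt><title xml:lang="en">Exploring Digital Libraries with Document Image Retrieval</title>
<author><name first="Simone" last="Marinai">Simone Marinai</name></author>
<author><name first="Emanuele" last="Marino">Emanuele Marino</name></author>
<author><name first="Giovanni" last="Soda">Giovanni Soda</name></author>
</titleStmt>
<publicationStmt><idno type="doi">10.1007/978-3-540-74851-9_31</idno></publicationStmt>
</fileDesc></teiHeader></TEI></record>"""


def summarize(xml_text):
    """Return the title, author names, and DOI found in a record string."""
    root = ET.fromstring(xml_text)
    # titleStmt holds the main title and one <author><name> per author.
    stmt = root.find(".//titleStmt")
    title = stmt.findtext("title")
    authors = [name.text for name in stmt.findall("author/name")]
    # The DOI is the <idno> element whose type attribute equals "doi".
    doi = root.findtext(".//idno[@type='doi']")
    return {"title": title, "authors": authors, "doi": doi}


if __name__ == "__main__":
    print(summarize(SAMPLE))
```

To run this against the full record, the `wicri` prefix would first have to be bound to some namespace URI or the prefixed attributes stripped; which of the two is appropriate depends on how the record is exported.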
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000E55 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000E55 | SxmlIndent | more
To add a link to this page within the Wicri network
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Exploration |type= RBID |clé= ISTEX:5F3A004C275E411FB7FED9DA2A1A40971B6E44EB |texte= Exploring Digital Libraries with Document Image Retrieval }}
This area was generated with Dilib version V0.6.32.