Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Automatic Feature Extraction and Recognition for Digital Access of Books of the Renaissance

Identifieur interne : 001E84 ( Main/Merge ); précédent : 001E83; suivant : 001E85

Automatic Feature Extraction and Recognition for Digital Access of Books of the Renaissance

Auteurs : F. Muge ; I. Granado ; M. Mengucci ; P. Pina ; V. Ramos ; N. Sirakov ; R. Caldas Pinto ; A. Marcolino ; Mário Ramalho ; P. Vieira ; A. Maia Do Amaral

Source :

RBID : ISTEX:363E5AD5B3AC83B30AA0AD9AC7C0B2CEBFB2C846

Abstract

Abstract: Antique printed books constitute a heritage that should be preserved and used. With novel digitising techniques is now possible to have these books stored in digital format and accessible to a wider public. However it remains the problem of how to use them. DEBORA (Digital accEss to BOoks of the RenAissance) is a European project that aims to develop a system to interact with these books through world-wide networks. The main issue is to build a database accessible through client computers. That will require to built accompanying metadata that should characterise different components of the books as illuminated letters, banners, figures and key words in order to simplify and speed up the remote access. To solve these problems, digital image analysis algorithms regarding filtering, segmentation, separation of text from non-text, lines and word segmentation and word recognition were developed. Some novel ideas are presented and illustrated through examples.

Url:
DOI: 10.1007/3-540-45268-0_1

Links toward previous steps (curation, corpus...)


Links to Exploration step

ISTEX:363E5AD5B3AC83B30AA0AD9AC7C0B2CEBFB2C846

Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Automatic Feature Extraction and Recognition for Digital Access of Books of the Renaissance</title>
<author>
<name sortKey="Muge, F" sort="Muge, F" uniqKey="Muge F" first="F." last="Muge">F. Muge</name>
</author>
<author>
<name sortKey="Granado, I" sort="Granado, I" uniqKey="Granado I" first="I." last="Granado">I. Granado</name>
</author>
<author>
<name sortKey="Mengucci, M" sort="Mengucci, M" uniqKey="Mengucci M" first="M." last="Mengucci">M. Mengucci</name>
</author>
<author>
<name sortKey="Pina, P" sort="Pina, P" uniqKey="Pina P" first="P." last="Pina">P. Pina</name>
</author>
<author>
<name sortKey="Ramos, V" sort="Ramos, V" uniqKey="Ramos V" first="V." last="Ramos">V. Ramos</name>
</author>
<author>
<name sortKey="Sirakov, N" sort="Sirakov, N" uniqKey="Sirakov N" first="N." last="Sirakov">N. Sirakov</name>
</author>
<author>
<name sortKey="Caldas Pinto, R" sort="Caldas Pinto, R" uniqKey="Caldas Pinto R" first="R." last="Caldas Pinto">R. Caldas Pinto</name>
</author>
<author>
<name sortKey="Marcolino, A" sort="Marcolino, A" uniqKey="Marcolino A" first="A." last="Marcolino">A. Marcolino</name>
</author>
<author>
<name sortKey="Ramalho, Mario" sort="Ramalho, Mario" uniqKey="Ramalho M" first="Mário" last="Ramalho">Mário Ramalho</name>
</author>
<author>
<name sortKey="Vieira, P" sort="Vieira, P" uniqKey="Vieira P" first="P." last="Vieira">P. Vieira</name>
</author>
<author>
<name sortKey="Maia Do Amaral, A" sort="Maia Do Amaral, A" uniqKey="Maia Do Amaral A" first="A." last="Maia Do Amaral">A. Maia Do Amaral</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:363E5AD5B3AC83B30AA0AD9AC7C0B2CEBFB2C846</idno>
<date when="2000" year="2000">2000</date>
<idno type="doi">10.1007/3-540-45268-0_1</idno>
<idno type="url">https://api.istex.fr/document/363E5AD5B3AC83B30AA0AD9AC7C0B2CEBFB2C846/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000930</idno>
<idno type="wicri:Area/Istex/Curation">000920</idno>
<idno type="wicri:Area/Istex/Checkpoint">001387</idno>
<idno type="wicri:doubleKey">0302-9743:2000:Muge F:automatic:feature:extraction</idno>
<idno type="wicri:Area/Main/Merge">001E84</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Automatic Feature Extraction and Recognition for Digital Access of Books of the Renaissance</title>
<author>
<name sortKey="Muge, F" sort="Muge, F" uniqKey="Muge F" first="F." last="Muge">F. Muge</name>
<affiliation>
<wicri:noCountry code="subField">Lisboa</wicri:noCountry>
</affiliation>
</author>
<author>
<name sortKey="Granado, I" sort="Granado, I" uniqKey="Granado I" first="I." last="Granado">I. Granado</name>
<affiliation>
<wicri:noCountry code="subField">Lisboa</wicri:noCountry>
</affiliation>
</author>
<author>
<name sortKey="Mengucci, M" sort="Mengucci, M" uniqKey="Mengucci M" first="M." last="Mengucci">M. Mengucci</name>
<affiliation>
<wicri:noCountry code="subField">Lisboa</wicri:noCountry>
</affiliation>
</author>
<author>
<name sortKey="Pina, P" sort="Pina, P" uniqKey="Pina P" first="P." last="Pina">P. Pina</name>
<affiliation>
<wicri:noCountry code="subField">Lisboa</wicri:noCountry>
</affiliation>
</author>
<author>
<name sortKey="Ramos, V" sort="Ramos, V" uniqKey="Ramos V" first="V." last="Ramos">V. Ramos</name>
<affiliation>
<wicri:noCountry code="subField">Lisboa</wicri:noCountry>
</affiliation>
</author>
<author>
<name sortKey="Sirakov, N" sort="Sirakov, N" uniqKey="Sirakov N" first="N." last="Sirakov">N. Sirakov</name>
<affiliation>
<wicri:noCountry code="subField">Lisboa</wicri:noCountry>
</affiliation>
</author>
<author>
<name sortKey="Caldas Pinto, R" sort="Caldas Pinto, R" uniqKey="Caldas Pinto R" first="R." last="Caldas Pinto">R. Caldas Pinto</name>
<affiliation>
<wicri:noCountry code="subField">Lisboa</wicri:noCountry>
</affiliation>
</author>
<author>
<name sortKey="Marcolino, A" sort="Marcolino, A" uniqKey="Marcolino A" first="A." last="Marcolino">A. Marcolino</name>
<affiliation>
<wicri:noCountry code="subField">Lisboa</wicri:noCountry>
</affiliation>
</author>
<author>
<name sortKey="Ramalho, Mario" sort="Ramalho, Mario" uniqKey="Ramalho M" first="Mário" last="Ramalho">Mário Ramalho</name>
<affiliation>
<wicri:noCountry code="subField">Lisboa</wicri:noCountry>
</affiliation>
</author>
<author>
<name sortKey="Vieira, P" sort="Vieira, P" uniqKey="Vieira P" first="P." last="Vieira">P. Vieira</name>
<affiliation>
<wicri:noCountry code="subField">Lisboa</wicri:noCountry>
</affiliation>
</author>
<author>
<name sortKey="Maia Do Amaral, A" sort="Maia Do Amaral, A" uniqKey="Maia Do Amaral A" first="A." last="Maia Do Amaral">A. Maia Do Amaral</name>
<affiliation>
<wicri:noCountry code="subField">Coimbra</wicri:noCountry>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2000</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">363E5AD5B3AC83B30AA0AD9AC7C0B2CEBFB2C846</idno>
<idno type="DOI">10.1007/3-540-45268-0_1</idno>
<idno type="ChapterID">1</idno>
<idno type="ChapterID">Chap1</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: Antique printed books constitute a heritage that should be preserved and used. With novel digitising techniques is now possible to have these books stored in digital format and accessible to a wider public. However it remains the problem of how to use them. DEBORA (Digital accEss to BOoks of the RenAissance) is a European project that aims to develop a system to interact with these books through world-wide networks. The main issue is to build a database accessible through client computers. That will require to built accompanying metadata that should characterise different components of the books as illuminated letters, banners, figures and key words in order to simplify and speed up the remote access. To solve these problems, digital image analysis algorithms regarding filtering, segmentation, separation of text from non-text, lines and word segmentation and word recognition were developed. Some novel ideas are presented and illustrated through examples.</div>
</front>
</TEI>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001E84 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Merge/biblio.hfd -nk 001E84 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Merge
   |type=    RBID
   |clé=     ISTEX:363E5AD5B3AC83B30AA0AD9AC7C0B2CEBFB2C846
   |texte=   Automatic Feature Extraction and Recognition for Digital Access of Books of the Renaissance
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024