Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

The processing of digitized works

Identifieur interne : 000308 ( PascalFrancis/Corpus ); précédent : 000307; suivant : 000309

The processing of digitized works

Auteurs : José Borbinha ; Joao Gil ; Gilberto Pedrosa ; Joao Penas

Source :

RBID : Francis:08-0091670

Descripteurs français

English descriptors

Abstract

This paper describes the processing of digitised works at the National Library of Portugal, as done in the scope of the National Digital Library initiate (BND). This comprises the normalization of the names of the images, the creation of technical metadata, image processing, OCR, indexing, and the creation of derived copies for preservation and copies for access in PNG, JPG, GIF, and PDF. The structural descriptions of all the objects an done in METS.

Notice en format standard (ISO 2709)

Pour connaître la documentation sur le format Inist Standard.

pA  
A08 01  1  ENG  @1 The processing of digitized works
A09 01  1  ENG  @1 6th ACM/IEEE-CS Joint Conference on Digital Libraries 2006 : opening information horizons : June 11-15, 2006, Chapel Hill NC
A11 01  1    @1 BORBINHA (José)
A11 02  1    @1 GIL (Joao)
A11 03  1    @1 PEDROSA (Gilberto)
A11 04  1    @1 PENAS (Joao)
A14 01      @1 INESC-ID -R. Alves Redol 9, Apartado 13069 @2 1000-029 Lisboa @3 PRT @Z 1 aut. @Z 2 aut. @Z 3 aut. @Z 4 aut.
A18 01  1    @1 Association for Computing Machinery. Special Interest Group on Information Retrieval @3 USA @9 org-cong.
A18 02  1    @1 Association for Computing Machinery. Special Interest Group on Hypertext, Hypermedia and Web @3 USA @9 org-cong.
A18 03  1    @1 IEEE Computer Society. Technical Committee on Digital Libraries @3 USA @9 org-cong.
A20       @1 103-104
A21       @1 2006
A23 01      @0 ENG
A25 01      @1 ACM Press @2 New York NY
A26 01      @0 1-59593-354-9
A30 01  1  ENG  @1 ACM/IEEE Joint Conference on Digital Libraries @2 6 @3 Chapel Hill NC USA @4 2006
A43 01      @1 INIST @2 Y 38968 @5 354000153512330140
A44       @0 0000 @1 © 2008 INIST-CNRS. All rights reserved.
A45       @0 3 ref.
A47 01  1    @0 08-0091670
A60       @1 C
A61       @0 A
A66 01      @0 USA
C01 01    ENG  @0 This paper describes the processing of digitised works at the National Library of Portugal, as done in the scope of the National Digital Library initiate (BND). This comprises the normalization of the names of the images, the creation of technical metadata, image processing, OCR, indexing, and the creation of derived copies for preservation and copies for access in PNG, JPG, GIF, and PDF. The structural descriptions of all the objects an done in METS.
C02 01  X    @0 790D02A @1 IV
C03 01  X  FRE  @0 Portugal @2 NG @5 01
C03 01  X  ENG  @0 Portugal @2 NG @5 01
C03 01  X  SPA  @0 Portugal @2 NG @5 01
C03 02  X  FRE  @0 Numérisation @5 04
C03 02  X  ENG  @0 Digitizing @5 04
C03 02  X  SPA  @0 Numerización @5 04
C03 03  X  FRE  @0 Bibliothèque nationale @5 05
C03 03  X  ENG  @0 National library @5 05
C03 03  X  SPA  @0 Biblioteca nacional @5 05
C03 04  X  FRE  @0 Accès information @5 06
C03 04  X  ENG  @0 Information access @5 06
C03 04  X  SPA  @0 Acceso información @5 06
C03 05  X  FRE  @0 Bibliothèque électronique @5 07
C03 05  X  ENG  @0 Electronic library @5 07
C03 05  X  SPA  @0 Biblioteca electronica @5 07
C03 06  X  FRE  @0 Reconnaissance optique caractère @5 10
C03 06  X  ENG  @0 Optical character recognition @5 10
C03 06  X  SPA  @0 Reconocimento óptico de caracteres @5 10
C03 07  X  FRE  @0 Préservation @5 11
C03 07  X  ENG  @0 Preservation @5 11
C03 07  X  SPA  @0 Preservación @5 11
C03 08  X  FRE  @0 Description @5 12
C03 08  X  ENG  @0 Description @5 12
C03 08  X  SPA  @0 Descripción @5 12
C07 01  X  FRE  @0 Europe @2 NG
C07 01  X  ENG  @0 Europe @2 NG
C07 01  X  SPA  @0 Europa @2 NG
N21       @1 052

Format Inist (serveur)

NO : FRANCIS 08-0091670 INIST
ET : The processing of digitized works
AU : BORBINHA (José); GIL (Joao); PEDROSA (Gilberto); PENAS (Joao)
AF : INESC-ID -R. Alves Redol 9, Apartado 13069/1000-029 Lisboa/Portugal (1 aut., 2 aut., 3 aut., 4 aut.)
DT : Congrès; Niveau analytique
SO : ACM/IEEE Joint Conference on Digital Libraries/6/2006/Chapel Hill NC USA; Etats-Unis; New York NY: ACM Press; Da. 2006; Pp. 103-104; ISBN 1-59593-354-9
LA : Anglais
EA : This paper describes the processing of digitised works at the National Library of Portugal, as done in the scope of the National Digital Library initiate (BND). This comprises the normalization of the names of the images, the creation of technical metadata, image processing, OCR, indexing, and the creation of derived copies for preservation and copies for access in PNG, JPG, GIF, and PDF. The structural descriptions of all the objects an done in METS.
CC : 790D02A
FD : Portugal; Numérisation; Bibliothèque nationale; Accès information; Bibliothèque électronique; Reconnaissance optique caractère; Préservation; Description
FG : Europe
ED : Portugal; Digitizing; National library; Information access; Electronic library; Optical character recognition; Preservation; Description
EG : Europe
SD : Portugal; Numerización; Biblioteca nacional; Acceso información; Biblioteca electronica; Reconocimento óptico de caracteres; Preservación; Descripción
LO : INIST-Y 38968.354000153512330140
ID : 08-0091670

Links to Exploration step

Francis:08-0091670

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">The processing of digitized works</title>
<author>
<name sortKey="Borbinha, Jose" sort="Borbinha, Jose" uniqKey="Borbinha J" first="José" last="Borbinha">José Borbinha</name>
<affiliation>
<inist:fA14 i1="01">
<s1>INESC-ID -R. Alves Redol 9, Apartado 13069</s1>
<s2>1000-029 Lisboa</s2>
<s3>PRT</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Gil, Joao" sort="Gil, Joao" uniqKey="Gil J" first="Joao" last="Gil">Joao Gil</name>
<affiliation>
<inist:fA14 i1="01">
<s1>INESC-ID -R. Alves Redol 9, Apartado 13069</s1>
<s2>1000-029 Lisboa</s2>
<s3>PRT</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Pedrosa, Gilberto" sort="Pedrosa, Gilberto" uniqKey="Pedrosa G" first="Gilberto" last="Pedrosa">Gilberto Pedrosa</name>
<affiliation>
<inist:fA14 i1="01">
<s1>INESC-ID -R. Alves Redol 9, Apartado 13069</s1>
<s2>1000-029 Lisboa</s2>
<s3>PRT</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Penas, Joao" sort="Penas, Joao" uniqKey="Penas J" first="Joao" last="Penas">Joao Penas</name>
<affiliation>
<inist:fA14 i1="01">
<s1>INESC-ID -R. Alves Redol 9, Apartado 13069</s1>
<s2>1000-029 Lisboa</s2>
<s3>PRT</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">08-0091670</idno>
<date when="2006">2006</date>
<idno type="stanalyst">FRANCIS 08-0091670 INIST</idno>
<idno type="RBID">Francis:08-0091670</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000308</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">The processing of digitized works</title>
<author>
<name sortKey="Borbinha, Jose" sort="Borbinha, Jose" uniqKey="Borbinha J" first="José" last="Borbinha">José Borbinha</name>
<affiliation>
<inist:fA14 i1="01">
<s1>INESC-ID -R. Alves Redol 9, Apartado 13069</s1>
<s2>1000-029 Lisboa</s2>
<s3>PRT</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Gil, Joao" sort="Gil, Joao" uniqKey="Gil J" first="Joao" last="Gil">Joao Gil</name>
<affiliation>
<inist:fA14 i1="01">
<s1>INESC-ID -R. Alves Redol 9, Apartado 13069</s1>
<s2>1000-029 Lisboa</s2>
<s3>PRT</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Pedrosa, Gilberto" sort="Pedrosa, Gilberto" uniqKey="Pedrosa G" first="Gilberto" last="Pedrosa">Gilberto Pedrosa</name>
<affiliation>
<inist:fA14 i1="01">
<s1>INESC-ID -R. Alves Redol 9, Apartado 13069</s1>
<s2>1000-029 Lisboa</s2>
<s3>PRT</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Penas, Joao" sort="Penas, Joao" uniqKey="Penas J" first="Joao" last="Penas">Joao Penas</name>
<affiliation>
<inist:fA14 i1="01">
<s1>INESC-ID -R. Alves Redol 9, Apartado 13069</s1>
<s2>1000-029 Lisboa</s2>
<s3>PRT</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Description</term>
<term>Digitizing</term>
<term>Electronic library</term>
<term>Information access</term>
<term>National library</term>
<term>Optical character recognition</term>
<term>Portugal</term>
<term>Preservation</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Portugal</term>
<term>Numérisation</term>
<term>Bibliothèque nationale</term>
<term>Accès information</term>
<term>Bibliothèque électronique</term>
<term>Reconnaissance optique caractère</term>
<term>Préservation</term>
<term>Description</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">This paper describes the processing of digitised works at the National Library of Portugal, as done in the scope of the National Digital Library initiate (BND). This comprises the normalization of the names of the images, the creation of technical metadata, image processing, OCR, indexing, and the creation of derived copies for preservation and copies for access in PNG, JPG, GIF, and PDF. The structural descriptions of all the objects an done in METS.</div>
</front>
</TEI>
<inist>
<standard h6="B">
<pA>
<fA08 i1="01" i2="1" l="ENG">
<s1>The processing of digitized works</s1>
</fA08>
<fA09 i1="01" i2="1" l="ENG">
<s1>6th ACM/IEEE-CS Joint Conference on Digital Libraries 2006 : opening information horizons : June 11-15, 2006, Chapel Hill NC</s1>
</fA09>
<fA11 i1="01" i2="1">
<s1>BORBINHA (José)</s1>
</fA11>
<fA11 i1="02" i2="1">
<s1>GIL (Joao)</s1>
</fA11>
<fA11 i1="03" i2="1">
<s1>PEDROSA (Gilberto)</s1>
</fA11>
<fA11 i1="04" i2="1">
<s1>PENAS (Joao)</s1>
</fA11>
<fA14 i1="01">
<s1>INESC-ID -R. Alves Redol 9, Apartado 13069</s1>
<s2>1000-029 Lisboa</s2>
<s3>PRT</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</fA14>
<fA18 i1="01" i2="1">
<s1>Association for Computing Machinery. Special Interest Group on Information Retrieval</s1>
<s3>USA</s3>
<s9>org-cong.</s9>
</fA18>
<fA18 i1="02" i2="1">
<s1>Association for Computing Machinery. Special Interest Group on Hypertext, Hypermedia and Web</s1>
<s3>USA</s3>
<s9>org-cong.</s9>
</fA18>
<fA18 i1="03" i2="1">
<s1>IEEE Computer Society. Technical Committee on Digital Libraries</s1>
<s3>USA</s3>
<s9>org-cong.</s9>
</fA18>
<fA20>
<s1>103-104</s1>
</fA20>
<fA21>
<s1>2006</s1>
</fA21>
<fA23 i1="01">
<s0>ENG</s0>
</fA23>
<fA25 i1="01">
<s1>ACM Press</s1>
<s2>New York NY</s2>
</fA25>
<fA26 i1="01">
<s0>1-59593-354-9</s0>
</fA26>
<fA30 i1="01" i2="1" l="ENG">
<s1>ACM/IEEE Joint Conference on Digital Libraries</s1>
<s2>6</s2>
<s3>Chapel Hill NC USA</s3>
<s4>2006</s4>
</fA30>
<fA43 i1="01">
<s1>INIST</s1>
<s2>Y 38968</s2>
<s5>354000153512330140</s5>
</fA43>
<fA44>
<s0>0000</s0>
<s1>© 2008 INIST-CNRS. All rights reserved.</s1>
</fA44>
<fA45>
<s0>3 ref.</s0>
</fA45>
<fA47 i1="01" i2="1">
<s0>08-0091670</s0>
</fA47>
<fA60>
<s1>C</s1>
</fA60>
<fA61>
<s0>A</s0>
</fA61>
<fA66 i1="01">
<s0>USA</s0>
</fA66>
<fC01 i1="01" l="ENG">
<s0>This paper describes the processing of digitised works at the National Library of Portugal, as done in the scope of the National Digital Library initiate (BND). This comprises the normalization of the names of the images, the creation of technical metadata, image processing, OCR, indexing, and the creation of derived copies for preservation and copies for access in PNG, JPG, GIF, and PDF. The structural descriptions of all the objects an done in METS.</s0>
</fC01>
<fC02 i1="01" i2="X">
<s0>790D02A</s0>
<s1>IV</s1>
</fC02>
<fC03 i1="01" i2="X" l="FRE">
<s0>Portugal</s0>
<s2>NG</s2>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="X" l="ENG">
<s0>Portugal</s0>
<s2>NG</s2>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="X" l="SPA">
<s0>Portugal</s0>
<s2>NG</s2>
<s5>01</s5>
</fC03>
<fC03 i1="02" i2="X" l="FRE">
<s0>Numérisation</s0>
<s5>04</s5>
</fC03>
<fC03 i1="02" i2="X" l="ENG">
<s0>Digitizing</s0>
<s5>04</s5>
</fC03>
<fC03 i1="02" i2="X" l="SPA">
<s0>Numerización</s0>
<s5>04</s5>
</fC03>
<fC03 i1="03" i2="X" l="FRE">
<s0>Bibliothèque nationale</s0>
<s5>05</s5>
</fC03>
<fC03 i1="03" i2="X" l="ENG">
<s0>National library</s0>
<s5>05</s5>
</fC03>
<fC03 i1="03" i2="X" l="SPA">
<s0>Biblioteca nacional</s0>
<s5>05</s5>
</fC03>
<fC03 i1="04" i2="X" l="FRE">
<s0>Accès information</s0>
<s5>06</s5>
</fC03>
<fC03 i1="04" i2="X" l="ENG">
<s0>Information access</s0>
<s5>06</s5>
</fC03>
<fC03 i1="04" i2="X" l="SPA">
<s0>Acceso información</s0>
<s5>06</s5>
</fC03>
<fC03 i1="05" i2="X" l="FRE">
<s0>Bibliothèque électronique</s0>
<s5>07</s5>
</fC03>
<fC03 i1="05" i2="X" l="ENG">
<s0>Electronic library</s0>
<s5>07</s5>
</fC03>
<fC03 i1="05" i2="X" l="SPA">
<s0>Biblioteca electronica</s0>
<s5>07</s5>
</fC03>
<fC03 i1="06" i2="X" l="FRE">
<s0>Reconnaissance optique caractère</s0>
<s5>10</s5>
</fC03>
<fC03 i1="06" i2="X" l="ENG">
<s0>Optical character recognition</s0>
<s5>10</s5>
</fC03>
<fC03 i1="06" i2="X" l="SPA">
<s0>Reconocimento óptico de caracteres</s0>
<s5>10</s5>
</fC03>
<fC03 i1="07" i2="X" l="FRE">
<s0>Préservation</s0>
<s5>11</s5>
</fC03>
<fC03 i1="07" i2="X" l="ENG">
<s0>Preservation</s0>
<s5>11</s5>
</fC03>
<fC03 i1="07" i2="X" l="SPA">
<s0>Preservación</s0>
<s5>11</s5>
</fC03>
<fC03 i1="08" i2="X" l="FRE">
<s0>Description</s0>
<s5>12</s5>
</fC03>
<fC03 i1="08" i2="X" l="ENG">
<s0>Description</s0>
<s5>12</s5>
</fC03>
<fC03 i1="08" i2="X" l="SPA">
<s0>Descripción</s0>
<s5>12</s5>
</fC03>
<fC07 i1="01" i2="X" l="FRE">
<s0>Europe</s0>
<s2>NG</s2>
</fC07>
<fC07 i1="01" i2="X" l="ENG">
<s0>Europe</s0>
<s2>NG</s2>
</fC07>
<fC07 i1="01" i2="X" l="SPA">
<s0>Europa</s0>
<s2>NG</s2>
</fC07>
<fN21>
<s1>052</s1>
</fN21>
</pA>
</standard>
<server>
<NO>FRANCIS 08-0091670 INIST</NO>
<ET>The processing of digitized works</ET>
<AU>BORBINHA (José); GIL (Joao); PEDROSA (Gilberto); PENAS (Joao)</AU>
<AF>INESC-ID -R. Alves Redol 9, Apartado 13069/1000-029 Lisboa/Portugal (1 aut., 2 aut., 3 aut., 4 aut.)</AF>
<DT>Congrès; Niveau analytique</DT>
<SO>ACM/IEEE Joint Conference on Digital Libraries/6/2006/Chapel Hill NC USA; Etats-Unis; New York NY: ACM Press; Da. 2006; Pp. 103-104; ISBN 1-59593-354-9</SO>
<LA>Anglais</LA>
<EA>This paper describes the processing of digitised works at the National Library of Portugal, as done in the scope of the National Digital Library initiate (BND). This comprises the normalization of the names of the images, the creation of technical metadata, image processing, OCR, indexing, and the creation of derived copies for preservation and copies for access in PNG, JPG, GIF, and PDF. The structural descriptions of all the objects an done in METS.</EA>
<CC>790D02A</CC>
<FD>Portugal; Numérisation; Bibliothèque nationale; Accès information; Bibliothèque électronique; Reconnaissance optique caractère; Préservation; Description</FD>
<FG>Europe</FG>
<ED>Portugal; Digitizing; National library; Information access; Electronic library; Optical character recognition; Preservation; Description</ED>
<EG>Europe</EG>
<SD>Portugal; Numerización; Biblioteca nacional; Acceso información; Biblioteca electronica; Reconocimento óptico de caracteres; Preservación; Descripción</SD>
<LO>INIST-Y 38968.354000153512330140</LO>
<ID>08-0091670</ID>
</server>
</inist>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/PascalFrancis/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000308 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Corpus/biblio.hfd -nk 000308 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    PascalFrancis
   |étape=   Corpus
   |type=    RBID
   |clé=     Francis:08-0091670
   |texte=   The processing of digitized works
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024