Digitalizálás - a Digitalizált Törvényhozási Tudástár projekt tapasztalatai
Identifieur interne : 000020 ( PascalFrancis/Corpus ); précédent : 000019; suivant : 000021Digitalizálás - a Digitalizált Törvényhozási Tudástár projekt tapasztalatai
Auteurs : Boros IldikoSource :
- Tudományos és müszaki tájékoztatás : (Nyomtatott) [ 0041-3917 ] ; 2013.
Descripteurs français
- Pascal (Inist)
English descriptors
Abstract
The Digitised Legislation Knowledge Store (DTT) project of the Library of the Hungarian Parliament was implemented in the period January 4, 2010 to November 30, 2012, as a priority project, in the framework of the Electronic Administration Operational Programme (EKOP), supported by the European Union and co-financed by the European Regional Development Fund. This article describes the workflow of digitisation. After creating the IT background the general technical parameters were defined, and the physical and logical presentation of digital documents started. This was followed by the workflow for mass uploading: METS/XML was prepared based on templates for various groups of materials, and plans for quality control were made using mathematical-statistical methods. Principles for the selection of works for digitisation were defined, with categories according to fields of science, document types, language, time, etc. Hungarian-language materials were selected from the main collecting areas of the Library (law, history and political science). The size, structure and condition of the volumes selected were registered on status sheets. Books were described individually, while journals, official gazettes and decisions were described in groups in a conservation status database. In order to select suitable copies it was necessary to identify the bibliographic, copy and publishing data in catalogues, and to check, improve and prepare for conversion the records of books and journals concerned, to enter their copyright status and collection organisation codes into the database. Metadata were prepared. The digitisation company was chosen within a public procurement process. Because of the value of works selected for digitisation, their uniqueness and in many cases irreplaceable nature, as well as preservation considerations, digitisation took place on site, mostly using a Kirtas KABIS III. robotic scanner. Large volumes and fold-out attachments were digitised by a flatbed scanner. The processing of the images, their cutting and correction was carried out by the Book Scan Editor (BSE) software. Metadata were assigned to each page of each volume by the GLOBE-Index software. This was followed by optical character recognition (OCR): after image processing the data were transferred into the OCR database, and the OCR Engine automatically created the final two-layer PDF file format. The pages prepared were then input into the DigiTool software in this format. The first step of checking was the automatic control of TIFF images. During manual inspection the general quality control of scanning results took place. Within the project two million pages have been digitised, the total number of volumes was 5272. The documents can be accessed in accordance with the copyright legislation in force. A major part of works (40%) is under copyright protection; consequently they can be displayed on the Library's computers only for scientific research or private study. The works that are not subject to copyright protection are available without any limitation to the public on the Internet. The DTT portal is barrier-free; those visually impaired can use it properly as well.
Notice en format standard (ISO 2709)
Pour connaître la documentation sur le format Inist Standard.
pA |
|
---|
Format Inist (serveur)
NO : | PASCAL 14-0131397 INIST |
---|---|
ET : | (Digitisation - experience from the Digitised Legislation Knowledge Store (DTT) project) |
OT : | Digitalizálás - a Digitalizált Törvényhozási Tudástár projekt tapasztalatai |
AU : | ILDIKO (Boros) |
AF : | Az Országgyülési Könyvtár Gyüjteményszervezési osztályának vezetöje/Hongrie (1 aut.) |
DT : | Publication en série; Niveau analytique |
SO : | Tudományos és müszaki tájékoztatás : (Nyomtatott); ISSN 0041-3917; Hongrie; Da. 2013; No. 7; 283-290, 282 [9 p.]; Abs. anglais |
LA : | Hongrois |
EA : | The Digitised Legislation Knowledge Store (DTT) project of the Library of the Hungarian Parliament was implemented in the period January 4, 2010 to November 30, 2012, as a priority project, in the framework of the Electronic Administration Operational Programme (EKOP), supported by the European Union and co-financed by the European Regional Development Fund. This article describes the workflow of digitisation. After creating the IT background the general technical parameters were defined, and the physical and logical presentation of digital documents started. This was followed by the workflow for mass uploading: METS/XML was prepared based on templates for various groups of materials, and plans for quality control were made using mathematical-statistical methods. Principles for the selection of works for digitisation were defined, with categories according to fields of science, document types, language, time, etc. Hungarian-language materials were selected from the main collecting areas of the Library (law, history and political science). The size, structure and condition of the volumes selected were registered on status sheets. Books were described individually, while journals, official gazettes and decisions were described in groups in a conservation status database. In order to select suitable copies it was necessary to identify the bibliographic, copy and publishing data in catalogues, and to check, improve and prepare for conversion the records of books and journals concerned, to enter their copyright status and collection organisation codes into the database. Metadata were prepared. The digitisation company was chosen within a public procurement process. Because of the value of works selected for digitisation, their uniqueness and in many cases irreplaceable nature, as well as preservation considerations, digitisation took place on site, mostly using a Kirtas KABIS III. robotic scanner. Large volumes and fold-out attachments were digitised by a flatbed scanner. The processing of the images, their cutting and correction was carried out by the Book Scan Editor (BSE) software. Metadata were assigned to each page of each volume by the GLOBE-Index software. This was followed by optical character recognition (OCR): after image processing the data were transferred into the OCR database, and the OCR Engine automatically created the final two-layer PDF file format. The pages prepared were then input into the DigiTool software in this format. The first step of checking was the automatic control of TIFF images. During manual inspection the general quality control of scanning results took place. Within the project two million pages have been digitised, the total number of volumes was 5272. The documents can be accessed in accordance with the copyright legislation in force. A major part of works (40%) is under copyright protection; consequently they can be displayed on the Library's computers only for scientific research or private study. The works that are not subject to copyright protection are available without any limitation to the public on the Internet. The DTT portal is barrier-free; those visually impaired can use it properly as well. |
CC : | 001A01C02 |
FD : | Hongrie; Projet; Bibliothèque |
FG : | Europe |
ED : | Hungary; Project; Library |
EG : | Europe |
SD : | Hungría; Proyecto; Biblioteca |
LO : | INIST-5087.354000505844410010 |
ID : | 14-0131397 |
Links to Exploration step
Pascal:14-0131397Le document en format XML
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="HUN" level="a">Digitalizálás - a Digitalizált Törvényhozási Tudástár projekt tapasztalatai</title>
<author><name sortKey="Ildiko, Boros" sort="Ildiko, Boros" uniqKey="Ildiko B" first="Boros" last="Ildiko">Boros Ildiko</name>
<affiliation><inist:fA14 i1="01"><s1>Az Országgyülési Könyvtár Gyüjteményszervezési osztályának vezetöje</s1>
<s3>HUN</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">INIST</idno>
<idno type="inist">14-0131397</idno>
<date when="2013">2013</date>
<idno type="stanalyst">PASCAL 14-0131397 INIST</idno>
<idno type="RBID">Pascal:14-0131397</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000020</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="HUN" level="a">Digitalizálás - a Digitalizált Törvényhozási Tudástár projekt tapasztalatai</title>
<author><name sortKey="Ildiko, Boros" sort="Ildiko, Boros" uniqKey="Ildiko B" first="Boros" last="Ildiko">Boros Ildiko</name>
<affiliation><inist:fA14 i1="01"><s1>Az Országgyülési Könyvtár Gyüjteményszervezési osztályának vezetöje</s1>
<s3>HUN</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
</analytic>
<series><title level="j" type="main">Tudományos és müszaki tájékoztatás : (Nyomtatott)</title>
<title level="j" type="abbreviated">Tud. müsz. táj. : (Nyomt.)</title>
<idno type="ISSN">0041-3917</idno>
<imprint><date when="2013">2013</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><title level="j" type="main">Tudományos és müszaki tájékoztatás : (Nyomtatott)</title>
<title level="j" type="abbreviated">Tud. müsz. táj. : (Nyomt.)</title>
<idno type="ISSN">0041-3917</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Hungary</term>
<term>Library</term>
<term>Project</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Hongrie</term>
<term>Projet</term>
<term>Bibliothèque</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">The Digitised Legislation Knowledge Store (DTT) project of the Library of the Hungarian Parliament was implemented in the period January 4, 2010 to November 30, 2012, as a priority project, in the framework of the Electronic Administration Operational Programme (EKOP), supported by the European Union and co-financed by the European Regional Development Fund. This article describes the workflow of digitisation. After creating the IT background the general technical parameters were defined, and the physical and logical presentation of digital documents started. This was followed by the workflow for mass uploading: METS/XML was prepared based on templates for various groups of materials, and plans for quality control were made using mathematical-statistical methods. Principles for the selection of works for digitisation were defined, with categories according to fields of science, document types, language, time, etc. Hungarian-language materials were selected from the main collecting areas of the Library (law, history and political science). The size, structure and condition of the volumes selected were registered on status sheets. Books were described individually, while journals, official gazettes and decisions were described in groups in a conservation status database. In order to select suitable copies it was necessary to identify the bibliographic, copy and publishing data in catalogues, and to check, improve and prepare for conversion the records of books and journals concerned, to enter their copyright status and collection organisation codes into the database. Metadata were prepared. The digitisation company was chosen within a public procurement process. Because of the value of works selected for digitisation, their uniqueness and in many cases irreplaceable nature, as well as preservation considerations, digitisation took place on site, mostly using a Kirtas KABIS III. robotic scanner. Large volumes and fold-out attachments were digitised by a flatbed scanner. The processing of the images, their cutting and correction was carried out by the Book Scan Editor (BSE) software. Metadata were assigned to each page of each volume by the GLOBE-Index software. This was followed by optical character recognition (OCR): after image processing the data were transferred into the OCR database, and the OCR Engine automatically created the final two-layer PDF file format. The pages prepared were then input into the DigiTool software in this format. The first step of checking was the automatic control of TIFF images. During manual inspection the general quality control of scanning results took place. Within the project two million pages have been digitised, the total number of volumes was 5272. The documents can be accessed in accordance with the copyright legislation in force. A major part of works (40%) is under copyright protection; consequently they can be displayed on the Library's computers only for scientific research or private study. The works that are not subject to copyright protection are available without any limitation to the public on the Internet. The DTT portal is barrier-free; those visually impaired can use it properly as well.</div>
</front>
</TEI>
<inist><standard h6="B"><pA><fA01 i1="01" i2="1"><s0>0041-3917</s0>
</fA01>
<fA03 i2="1"><s0>Tud. müsz. táj. : (Nyomt.)</s0>
</fA03>
<fA06><s2>7</s2>
</fA06>
<fA08 i1="01" i2="1" l="HUN"><s1>Digitalizálás - a Digitalizált Törvényhozási Tudástár projekt tapasztalatai</s1>
</fA08>
<fA11 i1="01" i2="1"><s1>ILDIKO (Boros)</s1>
</fA11>
<fA14 i1="01"><s1>Az Országgyülési Könyvtár Gyüjteményszervezési osztályának vezetöje</s1>
<s3>HUN</s3>
<sZ>1 aut.</sZ>
</fA14>
<fA20><s2>283-290, 282 [9 p.]</s2>
</fA20>
<fA21><s1>2013</s1>
</fA21>
<fA23 i1="01"><s0>HUN</s0>
</fA23>
<fA24 i1="01"><s0>eng</s0>
</fA24>
<fA43 i1="01"><s1>INIST</s1>
<s2>5087</s2>
<s5>354000505844410010</s5>
</fA43>
<fA44><s0>0000</s0>
<s1>© 2014 INIST-CNRS. All rights reserved.</s1>
</fA44>
<fA47 i1="01" i2="1"><s0>14-0131397</s0>
</fA47>
<fA60><s1>P</s1>
</fA60>
<fA61><s0>A</s0>
</fA61>
<fA64 i1="01" i2="1"><s0>Tudományos és müszaki tájékoztatás : (Nyomtatott)</s0>
</fA64>
<fA66 i1="01"><s0>HUN</s0>
</fA66>
<fA68 i1="01" i2="1" l="ENG"><s1>Digitisation - experience from the Digitised Legislation Knowledge Store (DTT) project</s1>
</fA68>
<fC01 i1="01" l="ENG"><s0>The Digitised Legislation Knowledge Store (DTT) project of the Library of the Hungarian Parliament was implemented in the period January 4, 2010 to November 30, 2012, as a priority project, in the framework of the Electronic Administration Operational Programme (EKOP), supported by the European Union and co-financed by the European Regional Development Fund. This article describes the workflow of digitisation. After creating the IT background the general technical parameters were defined, and the physical and logical presentation of digital documents started. This was followed by the workflow for mass uploading: METS/XML was prepared based on templates for various groups of materials, and plans for quality control were made using mathematical-statistical methods. Principles for the selection of works for digitisation were defined, with categories according to fields of science, document types, language, time, etc. Hungarian-language materials were selected from the main collecting areas of the Library (law, history and political science). The size, structure and condition of the volumes selected were registered on status sheets. Books were described individually, while journals, official gazettes and decisions were described in groups in a conservation status database. In order to select suitable copies it was necessary to identify the bibliographic, copy and publishing data in catalogues, and to check, improve and prepare for conversion the records of books and journals concerned, to enter their copyright status and collection organisation codes into the database. Metadata were prepared. The digitisation company was chosen within a public procurement process. Because of the value of works selected for digitisation, their uniqueness and in many cases irreplaceable nature, as well as preservation considerations, digitisation took place on site, mostly using a Kirtas KABIS III. robotic scanner. Large volumes and fold-out attachments were digitised by a flatbed scanner. The processing of the images, their cutting and correction was carried out by the Book Scan Editor (BSE) software. Metadata were assigned to each page of each volume by the GLOBE-Index software. This was followed by optical character recognition (OCR): after image processing the data were transferred into the OCR database, and the OCR Engine automatically created the final two-layer PDF file format. The pages prepared were then input into the DigiTool software in this format. The first step of checking was the automatic control of TIFF images. During manual inspection the general quality control of scanning results took place. Within the project two million pages have been digitised, the total number of volumes was 5272. The documents can be accessed in accordance with the copyright legislation in force. A major part of works (40%) is under copyright protection; consequently they can be displayed on the Library's computers only for scientific research or private study. The works that are not subject to copyright protection are available without any limitation to the public on the Internet. The DTT portal is barrier-free; those visually impaired can use it properly as well.</s0>
</fC01>
<fC02 i1="01" i2="X"><s0>001A01C02</s0>
</fC02>
<fC03 i1="01" i2="X" l="FRE"><s0>Hongrie</s0>
<s2>NG</s2>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="X" l="ENG"><s0>Hungary</s0>
<s2>NG</s2>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="X" l="SPA"><s0>Hungría</s0>
<s2>NG</s2>
<s5>01</s5>
</fC03>
<fC03 i1="02" i2="X" l="FRE"><s0>Projet</s0>
<s5>04</s5>
</fC03>
<fC03 i1="02" i2="X" l="ENG"><s0>Project</s0>
<s5>04</s5>
</fC03>
<fC03 i1="02" i2="X" l="SPA"><s0>Proyecto</s0>
<s5>04</s5>
</fC03>
<fC03 i1="03" i2="X" l="FRE"><s0>Bibliothèque</s0>
<s5>05</s5>
</fC03>
<fC03 i1="03" i2="X" l="ENG"><s0>Library</s0>
<s5>05</s5>
</fC03>
<fC03 i1="03" i2="X" l="SPA"><s0>Biblioteca</s0>
<s5>05</s5>
</fC03>
<fC07 i1="01" i2="X" l="FRE"><s0>Europe</s0>
<s2>NG</s2>
</fC07>
<fC07 i1="01" i2="X" l="ENG"><s0>Europe</s0>
<s2>NG</s2>
</fC07>
<fC07 i1="01" i2="X" l="SPA"><s0>Europa</s0>
<s2>NG</s2>
</fC07>
<fN21><s1>167</s1>
</fN21>
</pA>
</standard>
<server><NO>PASCAL 14-0131397 INIST</NO>
<ET>(Digitisation - experience from the Digitised Legislation Knowledge Store (DTT) project)</ET>
<OT>Digitalizálás - a Digitalizált Törvényhozási Tudástár projekt tapasztalatai</OT>
<AU>ILDIKO (Boros)</AU>
<AF>Az Országgyülési Könyvtár Gyüjteményszervezési osztályának vezetöje/Hongrie (1 aut.)</AF>
<DT>Publication en série; Niveau analytique</DT>
<SO>Tudományos és müszaki tájékoztatás : (Nyomtatott); ISSN 0041-3917; Hongrie; Da. 2013; No. 7; 283-290, 282 [9 p.]; Abs. anglais</SO>
<LA>Hongrois</LA>
<EA>The Digitised Legislation Knowledge Store (DTT) project of the Library of the Hungarian Parliament was implemented in the period January 4, 2010 to November 30, 2012, as a priority project, in the framework of the Electronic Administration Operational Programme (EKOP), supported by the European Union and co-financed by the European Regional Development Fund. This article describes the workflow of digitisation. After creating the IT background the general technical parameters were defined, and the physical and logical presentation of digital documents started. This was followed by the workflow for mass uploading: METS/XML was prepared based on templates for various groups of materials, and plans for quality control were made using mathematical-statistical methods. Principles for the selection of works for digitisation were defined, with categories according to fields of science, document types, language, time, etc. Hungarian-language materials were selected from the main collecting areas of the Library (law, history and political science). The size, structure and condition of the volumes selected were registered on status sheets. Books were described individually, while journals, official gazettes and decisions were described in groups in a conservation status database. In order to select suitable copies it was necessary to identify the bibliographic, copy and publishing data in catalogues, and to check, improve and prepare for conversion the records of books and journals concerned, to enter their copyright status and collection organisation codes into the database. Metadata were prepared. The digitisation company was chosen within a public procurement process. Because of the value of works selected for digitisation, their uniqueness and in many cases irreplaceable nature, as well as preservation considerations, digitisation took place on site, mostly using a Kirtas KABIS III. robotic scanner. Large volumes and fold-out attachments were digitised by a flatbed scanner. The processing of the images, their cutting and correction was carried out by the Book Scan Editor (BSE) software. Metadata were assigned to each page of each volume by the GLOBE-Index software. This was followed by optical character recognition (OCR): after image processing the data were transferred into the OCR database, and the OCR Engine automatically created the final two-layer PDF file format. The pages prepared were then input into the DigiTool software in this format. The first step of checking was the automatic control of TIFF images. During manual inspection the general quality control of scanning results took place. Within the project two million pages have been digitised, the total number of volumes was 5272. The documents can be accessed in accordance with the copyright legislation in force. A major part of works (40%) is under copyright protection; consequently they can be displayed on the Library's computers only for scientific research or private study. The works that are not subject to copyright protection are available without any limitation to the public on the Internet. The DTT portal is barrier-free; those visually impaired can use it properly as well.</EA>
<CC>001A01C02</CC>
<FD>Hongrie; Projet; Bibliothèque</FD>
<FG>Europe</FG>
<ED>Hungary; Project; Library</ED>
<EG>Europe</EG>
<SD>Hungría; Proyecto; Biblioteca</SD>
<LO>INIST-5087.354000505844410010</LO>
<ID>14-0131397</ID>
</server>
</inist>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/PascalFrancis/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000020 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Corpus/biblio.hfd -nk 000020 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= PascalFrancis |étape= Corpus |type= RBID |clé= Pascal:14-0131397 |texte= Digitalizálás - a Digitalizált Törvényhozási Tudástár projekt tapasztalatai }}
This area was generated with Dilib version V0.6.32. |