Importing Documents and Metadata into Digital Libraries: Requirements Analysis and an Extensible Architecture
Identifieur interne : 000293 ( Main/Merge ); précédent : 000292; suivant : 000294Importing Documents and Metadata into Digital Libraries: Requirements Analysis and an Extensible Architecture
Auteurs : H. Witten [Nouvelle-Zélande] ; David Bainbridge [Nouvelle-Zélande] ; Gordon Paynter [États-Unis] ; Stefan Boddie [Nouvelle-Zélande]Source :
- Lecture Notes in Computer Science [ 0302-9743 ] ; 2002.
Abstract
Abstract: Flexible digital library systems need to be able to accept, or “import,” documents and metadata in a variety of forms, and associate metadata with the appropriate documents. This paper analyzes the requirements of the import process for general digital libraries. The requirements include (a) format conversion for source documents, (b) the ability to incorporate existing conversion utilities, (c) provision for metadata to be specified in the document files themselves and/or in separate metadata files, (d) format conversion for metadata files, (e) provision for metadata to be computed from the document content, and (f) flexible ways of associating metadata with documents or sets of documents. We argue that these requirements are so open-ended that they are best met by an extensible architecture that facilitates the addition of new document formats and metadata facilities to existing digital library systems. An implementation of this architecture is briefly described.
Url:
DOI: 10.1007/3-540-45747-X_29
Links toward previous steps (curation, corpus...)
- to stream Istex, to step Corpus: 000437
- to stream Istex, to step Curation: 000437
- to stream Istex, to step Checkpoint: 000223
Links to Exploration step
ISTEX:A777AD93D793AB6C76C88DCC3E45E69662C2B5CCLe document en format XML
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title xml:lang="en">Importing Documents and Metadata into Digital Libraries: Requirements Analysis and an Extensible Architecture</title>
<author><name sortKey="Witten, H" sort="Witten, H" uniqKey="Witten H" first="H." last="Witten">H. Witten</name>
</author>
<author><name sortKey="Bainbridge, David" sort="Bainbridge, David" uniqKey="Bainbridge D" first="David" last="Bainbridge">David Bainbridge</name>
</author>
<author><name sortKey="Paynter, Gordon" sort="Paynter, Gordon" uniqKey="Paynter G" first="Gordon" last="Paynter">Gordon Paynter</name>
</author>
<author><name sortKey="Boddie, Stefan" sort="Boddie, Stefan" uniqKey="Boddie S" first="Stefan" last="Boddie">Stefan Boddie</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:A777AD93D793AB6C76C88DCC3E45E69662C2B5CC</idno>
<date when="2002" year="2002">2002</date>
<idno type="doi">10.1007/3-540-45747-X_29</idno>
<idno type="url">https://api.istex.fr/document/A777AD93D793AB6C76C88DCC3E45E69662C2B5CC/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000437</idno>
<idno type="wicri:Area/Istex/Curation">000437</idno>
<idno type="wicri:Area/Istex/Checkpoint">000223</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">000223</idno>
<idno type="wicri:doubleKey">0302-9743:2002:Witten H:importing:documents:and</idno>
<idno type="wicri:Area/Main/Merge">000293</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">Importing Documents and Metadata into Digital Libraries: Requirements Analysis and an Extensible Architecture</title>
<author><name sortKey="Witten, H" sort="Witten, H" uniqKey="Witten H" first="H." last="Witten">H. Witten</name>
<affiliation wicri:level="1"><country xml:lang="fr">Nouvelle-Zélande</country>
<wicri:regionArea>Computer Science Department, University of Waikato</wicri:regionArea>
<wicri:noRegion>University of Waikato</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Nouvelle-Zélande</country>
</affiliation>
</author>
<author><name sortKey="Bainbridge, David" sort="Bainbridge, David" uniqKey="Bainbridge D" first="David" last="Bainbridge">David Bainbridge</name>
<affiliation wicri:level="1"><country xml:lang="fr">Nouvelle-Zélande</country>
<wicri:regionArea>Computer Science Department, University of Waikato</wicri:regionArea>
<wicri:noRegion>University of Waikato</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Nouvelle-Zélande</country>
</affiliation>
</author>
<author><name sortKey="Paynter, Gordon" sort="Paynter, Gordon" uniqKey="Paynter G" first="Gordon" last="Paynter">Gordon Paynter</name>
<affiliation wicri:level="2"><country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Universtiy of California Science Library, Riverside, California</wicri:regionArea>
<placeName><region type="state">Californie</region>
</placeName>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">États-Unis</country>
</affiliation>
</author>
<author><name sortKey="Boddie, Stefan" sort="Boddie, Stefan" uniqKey="Boddie S" first="Stefan" last="Boddie">Stefan Boddie</name>
<affiliation wicri:level="1"><country xml:lang="fr">Nouvelle-Zélande</country>
<wicri:regionArea>Computer Science Department, University of Waikato</wicri:regionArea>
<wicri:noRegion>University of Waikato</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Nouvelle-Zélande</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="s">Lecture Notes in Computer Science</title>
<imprint><date>2002</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">A777AD93D793AB6C76C88DCC3E45E69662C2B5CC</idno>
<idno type="DOI">10.1007/3-540-45747-X_29</idno>
<idno type="ChapterID">29</idno>
<idno type="ChapterID">Chap29</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Abstract: Flexible digital library systems need to be able to accept, or “import,” documents and metadata in a variety of forms, and associate metadata with the appropriate documents. This paper analyzes the requirements of the import process for general digital libraries. The requirements include (a) format conversion for source documents, (b) the ability to incorporate existing conversion utilities, (c) provision for metadata to be specified in the document files themselves and/or in separate metadata files, (d) format conversion for metadata files, (e) provision for metadata to be computed from the document content, and (f) flexible ways of associating metadata with documents or sets of documents. We argue that these requirements are so open-ended that they are best met by an extensible architecture that facilitates the addition of new document formats and metadata facilities to existing digital library systems. An implementation of this architecture is briefly described.</div>
</front>
</TEI>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Wicri/Ticri/explor/TeiVM2/Data/Main/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000293 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Merge/biblio.hfd -nk 000293 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Wicri/Ticri |area= TeiVM2 |flux= Main |étape= Merge |type= RBID |clé= ISTEX:A777AD93D793AB6C76C88DCC3E45E69662C2B5CC |texte= Importing Documents and Metadata into Digital Libraries: Requirements Analysis and an Extensible Architecture }}
![]() | This area was generated with Dilib version V0.6.31. | ![]() |