Importing Documents and Metadata into Digital Libraries: Requirements Analysis and an Extensible Architecture
Identifieur interne : 000223 ( Istex/Checkpoint ); précédent : 000222; suivant : 000224Importing Documents and Metadata into Digital Libraries: Requirements Analysis and an Extensible Architecture
Auteurs : H. Witten [Nouvelle-Zélande] ; David Bainbridge [Nouvelle-Zélande] ; Gordon Paynter [États-Unis] ; Stefan Boddie [Nouvelle-Zélande]Source :
- Lecture Notes in Computer Science [ 0302-9743 ] ; 2002.
Abstract
Abstract: Flexible digital library systems need to be able to accept, or “import,” documents and metadata in a variety of forms, and associate metadata with the appropriate documents. This paper analyzes the requirements of the import process for general digital libraries. The requirements include (a) format conversion for source documents, (b) the ability to incorporate existing conversion utilities, (c) provision for metadata to be specified in the document files themselves and/or in separate metadata files, (d) format conversion for metadata files, (e) provision for metadata to be computed from the document content, and (f) flexible ways of associating metadata with documents or sets of documents. We argue that these requirements are so open-ended that they are best met by an extensible architecture that facilitates the addition of new document formats and metadata facilities to existing digital library systems. An implementation of this architecture is briefly described.
Url:
DOI: 10.1007/3-540-45747-X_29
Affiliations:
Links toward previous steps (curation, corpus...)
Links to Exploration step
ISTEX:A777AD93D793AB6C76C88DCC3E45E69662C2B5CCLe document en format XML
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title xml:lang="en">Importing Documents and Metadata into Digital Libraries: Requirements Analysis and an Extensible Architecture</title>
<author><name sortKey="Witten, H" sort="Witten, H" uniqKey="Witten H" first="H." last="Witten">H. Witten</name>
</author>
<author><name sortKey="Bainbridge, David" sort="Bainbridge, David" uniqKey="Bainbridge D" first="David" last="Bainbridge">David Bainbridge</name>
</author>
<author><name sortKey="Paynter, Gordon" sort="Paynter, Gordon" uniqKey="Paynter G" first="Gordon" last="Paynter">Gordon Paynter</name>
</author>
<author><name sortKey="Boddie, Stefan" sort="Boddie, Stefan" uniqKey="Boddie S" first="Stefan" last="Boddie">Stefan Boddie</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:A777AD93D793AB6C76C88DCC3E45E69662C2B5CC</idno>
<date when="2002" year="2002">2002</date>
<idno type="doi">10.1007/3-540-45747-X_29</idno>
<idno type="url">https://api.istex.fr/document/A777AD93D793AB6C76C88DCC3E45E69662C2B5CC/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000437</idno>
<idno type="wicri:Area/Istex/Curation">000437</idno>
<idno type="wicri:Area/Istex/Checkpoint">000223</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">000223</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">Importing Documents and Metadata into Digital Libraries: Requirements Analysis and an Extensible Architecture</title>
<author><name sortKey="Witten, H" sort="Witten, H" uniqKey="Witten H" first="H." last="Witten">H. Witten</name>
<affiliation wicri:level="1"><country xml:lang="fr">Nouvelle-Zélande</country>
<wicri:regionArea>Computer Science Department, University of Waikato</wicri:regionArea>
<wicri:noRegion>University of Waikato</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Nouvelle-Zélande</country>
</affiliation>
</author>
<author><name sortKey="Bainbridge, David" sort="Bainbridge, David" uniqKey="Bainbridge D" first="David" last="Bainbridge">David Bainbridge</name>
<affiliation wicri:level="1"><country xml:lang="fr">Nouvelle-Zélande</country>
<wicri:regionArea>Computer Science Department, University of Waikato</wicri:regionArea>
<wicri:noRegion>University of Waikato</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Nouvelle-Zélande</country>
</affiliation>
</author>
<author><name sortKey="Paynter, Gordon" sort="Paynter, Gordon" uniqKey="Paynter G" first="Gordon" last="Paynter">Gordon Paynter</name>
<affiliation wicri:level="2"><country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Universtiy of California Science Library, Riverside, California</wicri:regionArea>
<placeName><region type="state">Californie</region>
</placeName>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">États-Unis</country>
</affiliation>
</author>
<author><name sortKey="Boddie, Stefan" sort="Boddie, Stefan" uniqKey="Boddie S" first="Stefan" last="Boddie">Stefan Boddie</name>
<affiliation wicri:level="1"><country xml:lang="fr">Nouvelle-Zélande</country>
<wicri:regionArea>Computer Science Department, University of Waikato</wicri:regionArea>
<wicri:noRegion>University of Waikato</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Nouvelle-Zélande</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="s">Lecture Notes in Computer Science</title>
<imprint><date>2002</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">A777AD93D793AB6C76C88DCC3E45E69662C2B5CC</idno>
<idno type="DOI">10.1007/3-540-45747-X_29</idno>
<idno type="ChapterID">29</idno>
<idno type="ChapterID">Chap29</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Abstract: Flexible digital library systems need to be able to accept, or “import,” documents and metadata in a variety of forms, and associate metadata with the appropriate documents. This paper analyzes the requirements of the import process for general digital libraries. The requirements include (a) format conversion for source documents, (b) the ability to incorporate existing conversion utilities, (c) provision for metadata to be specified in the document files themselves and/or in separate metadata files, (d) format conversion for metadata files, (e) provision for metadata to be computed from the document content, and (f) flexible ways of associating metadata with documents or sets of documents. We argue that these requirements are so open-ended that they are best met by an extensible architecture that facilitates the addition of new document formats and metadata facilities to existing digital library systems. An implementation of this architecture is briefly described.</div>
</front>
</TEI>
<affiliations><list><country><li>Nouvelle-Zélande</li>
<li>États-Unis</li>
</country>
<region><li>Californie</li>
</region>
</list>
<tree><country name="Nouvelle-Zélande"><noRegion><name sortKey="Witten, H" sort="Witten, H" uniqKey="Witten H" first="H." last="Witten">H. Witten</name>
</noRegion>
<name sortKey="Bainbridge, David" sort="Bainbridge, David" uniqKey="Bainbridge D" first="David" last="Bainbridge">David Bainbridge</name>
<name sortKey="Bainbridge, David" sort="Bainbridge, David" uniqKey="Bainbridge D" first="David" last="Bainbridge">David Bainbridge</name>
<name sortKey="Boddie, Stefan" sort="Boddie, Stefan" uniqKey="Boddie S" first="Stefan" last="Boddie">Stefan Boddie</name>
<name sortKey="Boddie, Stefan" sort="Boddie, Stefan" uniqKey="Boddie S" first="Stefan" last="Boddie">Stefan Boddie</name>
<name sortKey="Witten, H" sort="Witten, H" uniqKey="Witten H" first="H." last="Witten">H. Witten</name>
</country>
<country name="États-Unis"><region name="Californie"><name sortKey="Paynter, Gordon" sort="Paynter, Gordon" uniqKey="Paynter G" first="Gordon" last="Paynter">Gordon Paynter</name>
</region>
<name sortKey="Paynter, Gordon" sort="Paynter, Gordon" uniqKey="Paynter G" first="Gordon" last="Paynter">Gordon Paynter</name>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Wicri/Ticri/explor/TeiVM2/Data/Istex/Checkpoint
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000223 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Istex/Checkpoint/biblio.hfd -nk 000223 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Wicri/Ticri |area= TeiVM2 |flux= Istex |étape= Checkpoint |type= RBID |clé= ISTEX:A777AD93D793AB6C76C88DCC3E45E69662C2B5CC |texte= Importing Documents and Metadata into Digital Libraries: Requirements Analysis and an Extensible Architecture }}
This area was generated with Dilib version V0.6.31. |