Exploration server for the TEI

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Encoding models for scholarly literature

Internal identifier: 000007 (Hal/Curation); previous: 000006; next: 000008

Authors: Martin Holmes [Canada]; Laurent Romary [Germany]

Source:

RBID : Hal:hal-00390966

Abstract

We examine the issue of digital formats for document encoding, archiving and publishing, through the specific example of "born-digital" scholarly journal articles. We will begin by looking at the traditional workflow of journal editing and publication, and how these practices have made the transition into the online domain. We will examine the range of different file formats in which electronic articles are currently stored and published. We will argue strongly that, despite the prevalence of binary and proprietary formats such as PDF and MS Word, XML is a far superior encoding choice for journal articles. Next, we look at the range of XML document structures (DTDs, Schemas) which are in common use for encoding journal articles, and consider some of their strengths and weaknesses. We will suggest that, despite the existence of specialized schemas intended specifically for journal articles (such as NLM), and more broadly-used publication-oriented schemas such as DocBook, there are strong arguments in favour of developing a subset or customization of the Text Encoding Initiative (TEI) schema for the purpose of journal-article encoding; TEI is already in use in a number of journal publication projects, and the scale and precision of the TEI tagset makes it particularly appropriate for encoding scholarly articles. We will outline the document structure of a TEI-encoded journal article, and look in detail at suggested markup patterns for specific features of journal articles.

Url:
DOI: 10.4018/978-1-60960-031-0

Links to previous steps (curation, corpus...)


Links to Exploration step

Hal:hal-00390966

The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Encoding models for scholarly literature</title>
<author>
<name sortKey="Holmes, Martin" sort="Holmes, Martin" uniqKey="Holmes M" first="Martin" last="Holmes">Martin Holmes</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-95238" status="VALID">
<orgName>Humanities Computing and Media Centre</orgName>
<orgName type="acronym">HCMC</orgName>
<desc>
<address>
<addrLine>University of Victoria 3800 Finnerty Road Clearihue Building, B043 Victoria, B.C. V8P 5C2</addrLine>
<country key="CA"></country>
</address>
<ref type="url">http://hcmc.uvic.ca/</ref>
</desc>
<listRelation>
<relation active="#struct-15325" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-15325" type="direct">
<org type="institution" xml:id="struct-15325" status="VALID">
<orgName>University of Victoria [Canada]</orgName>
<orgName type="acronym">UVIC</orgName>
<desc>
<address>
<addrLine>University of Victoria, 3800 Finnerty Road, Victoria BC V8P 5C2, Canada</addrLine>
<country key="CA"></country>
</address>
<ref type="url">http://www.uvic.ca</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Canada</country>
</affiliation>
</author>
<author>
<name sortKey="Romary, Laurent" sort="Romary, Laurent" uniqKey="Romary L" first="Laurent" last="Romary">Laurent Romary</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-95237" status="VALID">
<orgName>Institut für Deutsche Sprache und Linguistik</orgName>
<orgName type="acronym">IDSL</orgName>
<desc>
<address>
<addrLine>Dorotheenstraße 24, 10099 Berlin</addrLine>
<country key="DE"></country>
</address>
<ref type="url">http://www.linguistik.hu-berlin.de/</ref>
</desc>
<listRelation>
<relation active="#struct-139189" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-139189" type="direct">
<org type="institution" xml:id="struct-139189" status="VALID">
<orgName>Humboldt Universität zu Berlin [Berlin]</orgName>
<desc>
<address>
<addrLine>Unter den Linden 6, 10099 Berlin</addrLine>
<country key="DE"></country>
</address>
<ref type="url">https://www.hu-berlin.de/en/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Allemagne</country>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">HAL</idno>
<idno type="RBID">Hal:hal-00390966</idno>
<idno type="halId">hal-00390966</idno>
<idno type="halUri">https://hal.archives-ouvertes.fr/hal-00390966</idno>
<idno type="url">https://hal.archives-ouvertes.fr/hal-00390966</idno>
<idno type="doi">10.4018/978-1-60960-031-0</idno>
<date when="2010">2010</date>
<idno type="wicri:Area/Hal/Corpus">000007</idno>
<idno type="wicri:Area/Hal/Curation">000007</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Encoding models for scholarly literature</title>
<author>
<name sortKey="Holmes, Martin" sort="Holmes, Martin" uniqKey="Holmes M" first="Martin" last="Holmes">Martin Holmes</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-95238" status="VALID">
<orgName>Humanities Computing and Media Centre</orgName>
<orgName type="acronym">HCMC</orgName>
<desc>
<address>
<addrLine>University of Victoria 3800 Finnerty Road Clearihue Building, B043 Victoria, B.C. V8P 5C2</addrLine>
<country key="CA"></country>
</address>
<ref type="url">http://hcmc.uvic.ca/</ref>
</desc>
<listRelation>
<relation active="#struct-15325" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-15325" type="direct">
<org type="institution" xml:id="struct-15325" status="VALID">
<orgName>University of Victoria [Canada]</orgName>
<orgName type="acronym">UVIC</orgName>
<desc>
<address>
<addrLine>University of Victoria, 3800 Finnerty Road, Victoria BC V8P 5C2, Canada</addrLine>
<country key="CA"></country>
</address>
<ref type="url">http://www.uvic.ca</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Canada</country>
</affiliation>
</author>
<author>
<name sortKey="Romary, Laurent" sort="Romary, Laurent" uniqKey="Romary L" first="Laurent" last="Romary">Laurent Romary</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-95237" status="VALID">
<orgName>Institut für Deutsche Sprache und Linguistik</orgName>
<orgName type="acronym">IDSL</orgName>
<desc>
<address>
<addrLine>Dorotheenstraße 24, 10099 Berlin</addrLine>
<country key="DE"></country>
</address>
<ref type="url">http://www.linguistik.hu-berlin.de/</ref>
</desc>
<listRelation>
<relation active="#struct-139189" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-139189" type="direct">
<org type="institution" xml:id="struct-139189" status="VALID">
<orgName>Humboldt Universität zu Berlin [Berlin]</orgName>
<desc>
<address>
<addrLine>Unter den Linden 6, 10099 Berlin</addrLine>
<country key="DE"></country>
</address>
<ref type="url">https://www.hu-berlin.de/en/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Allemagne</country>
</affiliation>
</author>
</analytic>
<idno type="DOI">10.4018/978-1-60960-031-0</idno>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">We examine the issue of digital formats for document encoding, archiving and publishing, through the specific example of "born-digital" scholarly journal articles. We will begin by looking at the traditional workflow of journal editing and publication, and how these practices have made the transition into the online domain. We will examine the range of different file formats in which electronic articles are currently stored and published. We will argue strongly that, despite the prevalence of binary and proprietary formats such as PDF and MS Word, XML is a far superior encoding choice for journal articles. Next, we look at the range of XML document structures (DTDs, Schemas) which are in common use for encoding journal articles, and consider some of their strengths and weaknesses. We will suggest that, despite the existence of specialized schemas intended specifically for journal articles (such as NLM), and more broadly-used publication-oriented schemas such as DocBook, there are strong arguments in favour of developing a subset or customization of the Text Encoding Initiative (TEI) schema for the purpose of journal-article encoding; TEI is already in use in a number of journal publication projects, and the scale and precision of the TEI tagset makes it particularly appropriate for encoding scholarly articles. We will outline the document structure of a TEI-encoded journal article, and look in detail at suggested markup patterns for specific features of journal articles.</div>
</front>
</TEI>
<hal api="V3">
<titleStmt>
<title xml:lang="en">Encoding models for scholarly literature</title>
<author role="aut">
<persName>
<forename type="first">Martin</forename>
<surname>Holmes</surname>
</persName>
<email></email>
<idno type="halauthor">409827</idno>
<affiliation ref="#struct-95238"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Laurent</forename>
<surname>Romary</surname>
</persName>
<email>laurent.romary@inria.fr</email>
<idno type="idhal">laurentromary</idno>
<idno type="halauthor">49567</idno>
<idno type="arXiv">http://arxiv.org/a/Romary_L</idno>
<idno type="IdRef">http://www.idref.fr/060702494</idno>
<idno type="ORCID">http://orcid.org/0000-0002-0756-0508</idno>
<idno type="VIAF">http://viaf.org/viaf/VIAF282014122</idno>
<idno type="ISNI">http://isni.org/isni/0000 0003 8879 5444</idno>
<affiliation ref="#struct-95237"></affiliation>
<affiliation ref="#struct-118511"></affiliation>
</author>
<editor role="depositor">
<persName>
<forename>Laurent</forename>
<surname>Romary</surname>
</persName>
<email>laurent.romary@inria.fr</email>
</editor>
</titleStmt>
<editionStmt>
<edition n="v1">
<date type="whenSubmitted">2009-06-03 11:45:40</date>
</edition>
<edition n="v2" type="current">
<date type="whenSubmitted">2011-01-13 17:49:29</date>
<date type="whenModified">2011-01-14 09:49:25</date>
<date type="whenReleased">2011-01-14 09:49:25</date>
<date type="whenProduced">2010</date>
<date type="whenEndEmbargoed">2011-01-13</date>
<ref type="file" target="https://hal.archives-ouvertes.fr/hal-00390966v2/document">
<date notBefore="2011-01-13"></date>
</ref>
<ref type="file" subtype="publisherAgreement" n="1" target="https://hal.archives-ouvertes.fr/hal-00390966/file/romary_chap_iglezakis_book.pdf">
<date notBefore="2011-01-13"></date>
</ref>
</edition>
<respStmt>
<resp>contributor</resp>
<name key="105529">
<persName>
<forename>Laurent</forename>
<surname>Romary</surname>
</persName>
<email>laurent.romary@inria.fr</email>
</name>
</respStmt>
</editionStmt>
<publicationStmt>
<distributor>CCSD</distributor>
<idno type="halId">hal-00390966</idno>
<idno type="halUri">https://hal.archives-ouvertes.fr/hal-00390966</idno>
<idno type="halBibtex">holmes:hal-00390966</idno>
<idno type="halRefHtml">Ioannis Iglezakis, Tatiana-Eleni Synodinou, Sarantos Kapidakis. Publishing and digital libraries: Legal and organizational issues, IGI Global, pp.88-110, 2010, &lt;10.4018/978-1-60960-031-0&gt;</idno>
<idno type="halRef">Ioannis Iglezakis, Tatiana-Eleni Synodinou, Sarantos Kapidakis. Publishing and digital libraries: Legal and organizational issues, IGI Global, pp.88-110, 2010, &lt;10.4018/978-1-60960-031-0&gt;</idno>
</publicationStmt>
<seriesStmt>
<idno type="stamp" n="INRIA">INRIA - Institut National de Recherche en Informatique et en Automatique</idno>
<idno type="stamp" n="INRIA-SACLAY">INRIA Saclay - Ile de France</idno>
</seriesStmt>
<notesStmt>
<note type="commentary">Copyright 2010, IGI Global, www.igi-global.com. Posted by permission of the publisher.</note>
<note type="audience" n="1">Not set</note>
<note type="popular" n="0">No</note>
</notesStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Encoding models for scholarly literature</title>
<author role="aut">
<persName>
<forename type="first">Martin</forename>
<surname>Holmes</surname>
</persName>
<idno type="halAuthorId">409827</idno>
<affiliation ref="#struct-95238"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Laurent</forename>
<surname>Romary</surname>
</persName>
<email>laurent.romary@inria.fr</email>
<idno type="idHal">laurentromary</idno>
<idno type="halAuthorId">49567</idno>
<idno type="arXiv">http://arxiv.org/a/Romary_L</idno>
<idno type="IdRef">http://www.idref.fr/060702494</idno>
<idno type="ORCID">http://orcid.org/0000-0002-0756-0508</idno>
<idno type="VIAF">http://viaf.org/viaf/VIAF282014122</idno>
<idno type="ISNI">http://isni.org/isni/0000 0003 8879 5444</idno>
<affiliation ref="#struct-95237"></affiliation>
<affiliation ref="#struct-118511"></affiliation>
</author>
</analytic>
<monogr>
<title level="m">Publishing and digital libraries: Legal and organizational issues</title>
<editor>Ioannis Iglezakis, Tatiana-Eleni Synodinou, Sarantos Kapidakis</editor>
<imprint>
<publisher>IGI Global</publisher>
<biblScope unit="pp">88-110</biblScope>
<date type="datePub">2010</date>
</imprint>
</monogr>
<idno type="doi">10.4018/978-1-60960-031-0</idno>
</biblStruct>
</sourceDesc>
<profileDesc>
<langUsage>
<language ident="en">English</language>
</langUsage>
<textClass>
<classCode scheme="halDomain" n="info.info-cl">Computer Science [cs]/Computation and Language [cs.CL]</classCode>
<classCode scheme="halTypology" n="COUV">Book section</classCode>
</textClass>
<abstract xml:lang="en">We examine the issue of digital formats for document encoding, archiving and publishing, through the specific example of "born-digital" scholarly journal articles. We will begin by looking at the traditional workflow of journal editing and publication, and how these practices have made the transition into the online domain. We will examine the range of different file formats in which electronic articles are currently stored and published. We will argue strongly that, despite the prevalence of binary and proprietary formats such as PDF and MS Word, XML is a far superior encoding choice for journal articles. Next, we look at the range of XML document structures (DTDs, Schemas) which are in common use for encoding journal articles, and consider some of their strengths and weaknesses. We will suggest that, despite the existence of specialized schemas intended specifically for journal articles (such as NLM), and more broadly-used publication-oriented schemas such as DocBook, there are strong arguments in favour of developing a subset or customization of the Text Encoding Initiative (TEI) schema for the purpose of journal-article encoding; TEI is already in use in a number of journal publication projects, and the scale and precision of the TEI tagset makes it particularly appropriate for encoding scholarly articles. We will outline the document structure of a TEI-encoded journal article, and look in detail at suggested markup patterns for specific features of journal articles.</abstract>
</profileDesc>
</hal>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Ticri/explor/TeiVM2/Data/Hal/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000007 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Hal/Curation/biblio.hfd -nk 000007 | SxmlIndent | more
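
Where the Dilib tools are not installed, a record copied from this page can also be inspected with a general-purpose XML parser. The following is a minimal sketch in Python, not part of Dilib. It works on a trimmed copy of the record, and because the page does not declare the wicri: and hal: prefixes, the two namespace URIs in the sample are placeholders invented so that the fragment parses. The function name summarize is likewise hypothetical.

```python
import xml.etree.ElementTree as ET

# Trimmed copy of the record shown on this page. The xmlns:wicri and
# xmlns:hal URIs are placeholders (assumptions): the page does not declare
# these prefixes, and an XML parser rejects undeclared prefixes.
RECORD = """<record xmlns:wicri="urn:example:wicri" xmlns:hal="urn:example:hal">
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Encoding models for scholarly literature</title>
<author>
<name sortKey="Holmes, Martin" first="Martin" last="Holmes">Martin Holmes</name>
<affiliation wicri:level="1"><country>Canada</country></affiliation>
</author>
<author>
<name sortKey="Romary, Laurent" first="Laurent" last="Romary">Laurent Romary</name>
<affiliation wicri:level="1"><country>Allemagne</country></affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="halId">hal-00390966</idno>
<idno type="doi">10.4018/978-1-60960-031-0</idno>
</publicationStmt>
</fileDesc>
</teiHeader>
</TEI>
</record>"""


def summarize(record_xml):
    """Return the title, authors (name, country) and identifiers of one record."""
    root = ET.fromstring(record_xml)
    file_desc = root.find("./TEI/teiHeader/fileDesc")
    title_stmt = file_desc.find("titleStmt")
    # One (name, country) pair per <author> element of the title statement.
    authors = [
        (a.findtext("name"), a.findtext("affiliation/country"))
        for a in title_stmt.findall("author")
    ]
    # Index the <idno> elements by their type attribute.
    idnos = {i.get("type"): i.text for i in file_desc.findall("publicationStmt/idno")}
    return {
        "title": title_stmt.findtext("title"),
        "authors": authors,
        "halId": idnos.get("halId"),
        "doi": idnos.get("doi"),
    }


summary = summarize(RECORD)
print(summary["title"])
for name, country in summary["authors"]:
    print(f"{name} [{country}]")
print(summary["doi"])
```

To run this against the full record, the root element needs the same kind of prefix declarations added, using whatever URIs the local installation actually defines.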

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Wicri/Ticri
   |area=    TeiVM2
   |flux=    Hal
   |étape=   Curation
   |type=    RBID
   |clé=     Hal:hal-00390966
   |texte=   Encoding models for scholarly literature
}}

Wicri

This area was generated with Dilib version V0.6.31.
Data generation: Mon Oct 30 21:59:18 2017. Site generation: Sun Feb 11 23:16:06 2024