The inadequacy of embedded markup for cultural heritage texts
Identifieur interne : 000057 ( Main/Curation ); précédent : 000056; suivant : 000058The inadequacy of embedded markup for cultural heritage texts
Auteurs : Desmond Schmidt [Australie]Source :
- Literary and Linguistic Computing [ 0268-1145 ] ; 2010-09.
Abstract
Embedded generalized markup, as applied by digital humanists to the recording and studying of our textual cultural heritage, suffers from a number of serious technical drawbacks. As a result of its evolution from early printer control languages, generalized markup can only express a documents logical structure via a repertoire of permissible printed format structures. In addition to the well-researched overlap problem, the embedding of markup codes into texts that never had them when written leads to a number of further difficulties: the inclusion of potentially obsolescent technical and subjective information into texts that are supposed to be archivable for the long term, the manual encoding of information that could be better computed automatically, and the obscuring of the text by highly complex technical data. Many of these problems can be alleviated by asserting a separation between the versions of which many cultural heritage texts are composed, and their content. In this way the complex interconnections between versions can be handled automatically, leaving only simple markup for individual versions to be handled by the user.
Url:
DOI: 10.1093/llc/fqq007
Links toward previous steps (curation, corpus...)
- to stream Istex, to step Corpus: Pour aller vers cette notice dans l'étape Curation :000159
- to stream Istex, to step Curation: Pour aller vers cette notice dans l'étape Curation :000159
- to stream Istex, to step Checkpoint: Pour aller vers cette notice dans l'étape Curation :000028
- to stream Main, to step Merge: Pour aller vers cette notice dans l'étape Curation :000057
Links to Exploration step
ISTEX:F4113A3958C0A461179818E675591966FE17E0D4Le document en format XML
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title>The inadequacy of embedded markup for cultural heritage texts</title>
<author wicri:is="90%"><name sortKey="Schmidt, Desmond" sort="Schmidt, Desmond" uniqKey="Schmidt D" first="Desmond" last="Schmidt">Desmond Schmidt</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:F4113A3958C0A461179818E675591966FE17E0D4</idno>
<date when="2010" year="2010">2010</date>
<idno type="doi">10.1093/llc/fqq007</idno>
<idno type="url">https://api.istex.fr/document/F4113A3958C0A461179818E675591966FE17E0D4/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000159</idno>
<idno type="wicri:Area/Istex/Curation">000159</idno>
<idno type="wicri:Area/Istex/Checkpoint">000028</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">000028</idno>
<idno type="wicri:doubleKey">0268-1145:2010:Schmidt D:the:inadequacy:of</idno>
<idno type="wicri:Area/Main/Merge">000057</idno>
<idno type="wicri:Area/Main/Curation">000057</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a">The inadequacy of embedded markup for cultural heritage texts</title>
<author wicri:is="90%"><name sortKey="Schmidt, Desmond" sort="Schmidt, Desmond" uniqKey="Schmidt D" first="Desmond" last="Schmidt">Desmond Schmidt</name>
<affiliation wicri:level="1"><country xml:lang="fr">Australie</country>
<wicri:regionArea>Information Security Institute, Queensland University of Technology, Queensland</wicri:regionArea>
<wicri:noRegion>Queensland</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Australie</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="j">Literary and Linguistic Computing</title>
<idno type="ISSN">0268-1145</idno>
<idno type="eISSN">1477-4615</idno>
<imprint><publisher>Oxford University Press</publisher>
<date type="published" when="2010-09">2010-09</date>
<biblScope unit="volume">25</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="337">337</biblScope>
<biblScope unit="page" to="356">356</biblScope>
</imprint>
<idno type="ISSN">0268-1145</idno>
</series>
<idno type="istex">F4113A3958C0A461179818E675591966FE17E0D4</idno>
<idno type="DOI">10.1093/llc/fqq007</idno>
<idno type="ArticleID">fqq007</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0268-1145</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract">Embedded generalized markup, as applied by digital humanists to the recording and studying of our textual cultural heritage, suffers from a number of serious technical drawbacks. As a result of its evolution from early printer control languages, generalized markup can only express a documents logical structure via a repertoire of permissible printed format structures. In addition to the well-researched overlap problem, the embedding of markup codes into texts that never had them when written leads to a number of further difficulties: the inclusion of potentially obsolescent technical and subjective information into texts that are supposed to be archivable for the long term, the manual encoding of information that could be better computed automatically, and the obscuring of the text by highly complex technical data. Many of these problems can be alleviated by asserting a separation between the versions of which many cultural heritage texts are composed, and their content. In this way the complex interconnections between versions can be handled automatically, leaving only simple markup for individual versions to be handled by the user.</div>
</front>
</TEI>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Wicri/Ticri/explor/TeiVM2/Data/Main/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000057 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Curation/biblio.hfd -nk 000057 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Wicri/Ticri |area= TeiVM2 |flux= Main |étape= Curation |type= RBID |clé= ISTEX:F4113A3958C0A461179818E675591966FE17E0D4 |texte= The inadequacy of embedded markup for cultural heritage texts }}
This area was generated with Dilib version V0.6.31. |