Serveur d'exploration sur la TEI

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Towards a TEI-based encoding scheme for the annotation of parallel texts

Identifieur interne : 000108 ( Main/Merge ); précédent : 000107; suivant : 000109

Towards a TEI-based encoding scheme for the annotation of parallel texts

Auteurs : Peter Boot [Pays-Bas]

Source :

RBID : Francis:11-0223733

Descripteurs français

English descriptors

Abstract

Translation, adaptation, and other forms of appropriation of literary works can result in bodies of parallel texts. For the purpose of studying appropriation strategies, it is important to be able to annotate digital representations of these parallel text structures. This article uses early modern emblem culture (books of engravings or woodcuts, accompanied by mottos and explanatory texts) to investigate the forms this text parallelism may take. It defines requirements for annotation definition and proposes a TEI (Text Encoding Initiative) extension to implement these requirements. In the proposed encoding scheme, TEI feature structures will be used for storing annotation information. This scheme should be useful for annotating parallel text structures as well as for other annotation tasks. The annotation scheme assumes the annotated texts are available in XML. If this is not the case (there is no electronic version of the text at all or perhaps only a facsimile) the article suggests the definition of a TEI proxy document. A TEI proxy document contains enough of the structural aspects of the texts to serve as a basis for attaching annotations to the text. Outside of the annotation context, proxy documents may serve as a basis for adding functionality to image-based editions.

Links toward previous steps (curation, corpus...)


Links to Exploration step

Francis:11-0223733

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Towards a TEI-based encoding scheme for the annotation of parallel texts</title>
<author>
<name sortKey="Boot, Peter" sort="Boot, Peter" uniqKey="Boot P" first="Peter" last="Boot">Peter Boot</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Huygens Institute, Royal Netherlands Academy of Arts and Sciences</s1>
<s2>The Hague</s2>
<s3>NLD</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Pays-Bas</country>
<wicri:noRegion>The Hague</wicri:noRegion>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">11-0223733</idno>
<date when="2009">2009</date>
<idno type="stanalyst">FRANCIS 11-0223733 INIST</idno>
<idno type="RBID">Francis:11-0223733</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000009</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000036</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000005</idno>
<idno type="wicri:explorRef" wicri:stream="PascalFrancis" wicri:step="Checkpoint">000005</idno>
<idno type="wicri:doubleKey">0268-1145:2009:Boot P:towards:a:tei</idno>
<idno type="wicri:Area/Main/Merge">000108</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Towards a TEI-based encoding scheme for the annotation of parallel texts</title>
<author>
<name sortKey="Boot, Peter" sort="Boot, Peter" uniqKey="Boot P" first="Peter" last="Boot">Peter Boot</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Huygens Institute, Royal Netherlands Academy of Arts and Sciences</s1>
<s2>The Hague</s2>
<s3>NLD</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Pays-Bas</country>
<wicri:noRegion>The Hague</wicri:noRegion>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">Literary and linguistic computing</title>
<title level="j" type="abbreviated">Lit. linguist. comput.</title>
<idno type="ISSN">0268-1145</idno>
<imprint>
<date when="2009">2009</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Literary and linguistic computing</title>
<title level="j" type="abbreviated">Lit. linguist. comput.</title>
<idno type="ISSN">0268-1145</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Computational linguistics</term>
<term>Corpus annotation</term>
<term>Feature structure</term>
<term>Markup language</term>
<term>Parallel corpus</term>
<term>TEI</term>
<term>Text structure</term>
<term>Translation</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>TEI</term>
<term>Annotation de corpus</term>
<term>Corpus parallèle</term>
<term>Traduction</term>
<term>Structure de traits</term>
<term>Structure textuelle</term>
<term>Langage de balisage</term>
<term>Linguistique informatique</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr">
<term>Traduction</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Translation, adaptation, and other forms of appropriation of literary works can result in bodies of parallel texts. For the purpose of studying appropriation strategies, it is important to be able to annotate digital representations of these parallel text structures. This article uses early modern emblem culture (books of engravings or woodcuts, accompanied by mottos and explanatory texts) to investigate the forms this text parallelism may take. It defines requirements for annotation definition and proposes a TEI (Text Encoding Initiative) extension to implement these requirements. In the proposed encoding scheme, TEI feature structures will be used for storing annotation information. This scheme should be useful for annotating parallel text structures as well as for other annotation tasks. The annotation scheme assumes the annotated texts are available in XML. If this is not the case (there is no electronic version of the text at all or perhaps only a facsimile) the article suggests the definition of a TEI proxy document. A TEI proxy document contains enough of the structural aspects of the texts to serve as a basis for attaching annotations to the text. Outside of the annotation context, proxy documents may serve as a basis for adding functionality to image-based editions.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Pays-Bas</li>
</country>
</list>
<tree>
<country name="Pays-Bas">
<noRegion>
<name sortKey="Boot, Peter" sort="Boot, Peter" uniqKey="Boot P" first="Peter" last="Boot">Peter Boot</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Ticri/explor/TeiVM2/Data/Main/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000108 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Merge/biblio.hfd -nk 000108 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Ticri
   |area=    TeiVM2
   |flux=    Main
   |étape=   Merge
   |type=    RBID
   |clé=     Francis:11-0223733
   |texte=   Towards a TEI-based encoding scheme for the annotation of parallel texts
}}

Wicri

This area was generated with Dilib version V0.6.31.
Data generation: Mon Oct 30 21:59:18 2017. Site generation: Sun Feb 11 23:16:06 2024