Serveur d'exploration sur la TEI

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Towards a TEI-based encoding scheme for the annotation of parallel texts

Identifieur interne : 000320 ( Istex/Corpus ); précédent : 000319; suivant : 000321

Towards a TEI-based encoding scheme for the annotation of parallel texts

Auteurs : Peter Boot

Source :

RBID : ISTEX:C87E170B5B672608C2723A492673F975F0780E43

Abstract

Translation, adaptation, and other forms of appropriation of literary works can result in bodies of parallel texts. For the purpose of studying appropriation strategies, it is important to be able to annotate digital representations of these parallel text structures. This article uses early modern emblem culture (books of engravings or woodcuts, accompanied by mottos and explanatory texts) to investigate the forms this text parallelism may take. It defines requirements for annotation definition and proposes a TEI (Text Encoding Initiative) extension to implement these requirements. In the proposed encoding scheme, TEI feature structures will be used for storing annotation information. This scheme should be useful for annotating parallel text structures as well as for other annotation tasks. The annotation scheme assumes the annotated texts are available in XML. If this is not the case (there is no electronic version of the text at all or perhaps only a facsimile) the article suggests the definition of a TEI proxy document. A TEI proxy document contains enough of the structural aspects of the texts to serve as a basis for attaching annotations to the text. Outside of the annotation context, proxy documents may serve as a basis for adding functionality to image-based editions.

Url:
DOI: 10.1093/llc/fqp023

Links to Exploration step

ISTEX:C87E170B5B672608C2723A492673F975F0780E43

Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title>Towards a TEI-based encoding scheme for the annotation of parallel texts</title>
<author wicri:is="90%">
<name sortKey="Boot, Peter" sort="Boot, Peter" uniqKey="Boot P" first="Peter" last="Boot">Peter Boot</name>
<affiliation>
<mods:affiliation>Huygens Institute, Royal Netherlands Academy of Arts and Sciences, The Hague, The Netherlands</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: pboot@xs4all.nl</mods:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:C87E170B5B672608C2723A492673F975F0780E43</idno>
<date when="2009" year="2009">2009</date>
<idno type="doi">10.1093/llc/fqp023</idno>
<idno type="url">https://api.istex.fr/document/C87E170B5B672608C2723A492673F975F0780E43/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000320</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a">Towards a TEI-based encoding scheme for the annotation of parallel texts</title>
<author wicri:is="90%">
<name sortKey="Boot, Peter" sort="Boot, Peter" uniqKey="Boot P" first="Peter" last="Boot">Peter Boot</name>
<affiliation>
<mods:affiliation>Huygens Institute, Royal Netherlands Academy of Arts and Sciences, The Hague, The Netherlands</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: pboot@xs4all.nl</mods:affiliation>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j">Literary and Linguistic Computing</title>
<idno type="ISSN">0268-1145</idno>
<idno type="eISSN">1477-4615</idno>
<imprint>
<publisher>Oxford University Press</publisher>
<date type="published" when="2009-09">2009-09</date>
<biblScope unit="volume">24</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="347">347</biblScope>
<biblScope unit="page" to="361">361</biblScope>
</imprint>
<idno type="ISSN">0268-1145</idno>
</series>
<idno type="istex">C87E170B5B672608C2723A492673F975F0780E43</idno>
<idno type="DOI">10.1093/llc/fqp023</idno>
<idno type="ArticleID">fqp023</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0268-1145</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract">Translation, adaptation, and other forms of appropriation of literary works can result in bodies of parallel texts. For the purpose of studying appropriation strategies, it is important to be able to annotate digital representations of these parallel text structures. This article uses early modern emblem culture (books of engravings or woodcuts, accompanied by mottos and explanatory texts) to investigate the forms this text parallelism may take. It defines requirements for annotation definition and proposes a TEI (Text Encoding Initiative) extension to implement these requirements. In the proposed encoding scheme, TEI feature structures will be used for storing annotation information. This scheme should be useful for annotating parallel text structures as well as for other annotation tasks. The annotation scheme assumes the annotated texts are available in XML. If this is not the case (there is no electronic version of the text at all or perhaps only a facsimile) the article suggests the definition of a TEI proxy document. A TEI proxy document contains enough of the structural aspects of the texts to serve as a basis for attaching annotations to the text. Outside of the annotation context, proxy documents may serve as a basis for adding functionality to image-based editions.</div>
</front>
</TEI>
<istex>
<corpusName>oup</corpusName>
<author>
<json:item>
<name>Peter Boot</name>
<affiliations>
<json:string>Huygens Institute, Royal Netherlands Academy of Arts and Sciences, The Hague, The Netherlands</json:string>
<json:string>E-mail: pboot@xs4all.nl</json:string>
</affiliations>
</json:item>
</author>
<subject>
<json:item>
<lang>
<json:string>eng</json:string>
</lang>
<value>Original Articles</value>
</json:item>
</subject>
<articleId>
<json:string>fqp023</json:string>
</articleId>
<language>
<json:string>eng</json:string>
</language>
<originalGenre>
<json:string>research-article</json:string>
</originalGenre>
<abstract>Translation, adaptation, and other forms of appropriation of literary works can result in bodies of parallel texts. For the purpose of studying appropriation strategies, it is important to be able to annotate digital representations of these parallel text structures. This article uses early modern emblem culture (books of engravings or woodcuts, accompanied by mottos and explanatory texts) to investigate the forms this text parallelism may take. It defines requirements for annotation definition and proposes a TEI (Text Encoding Initiative) extension to implement these requirements. In the proposed encoding scheme, TEI feature structures will be used for storing annotation information. This scheme should be useful for annotating parallel text structures as well as for other annotation tasks. The annotation scheme assumes the annotated texts are available in XML. If this is not the case (there is no electronic version of the text at all or perhaps only a facsimile) the article suggests the definition of a TEI proxy document. A TEI proxy document contains enough of the structural aspects of the texts to serve as a basis for attaching annotations to the text. Outside of the annotation context, proxy documents may serve as a basis for adding functionality to image-based editions.</abstract>
<qualityIndicators>
<score>7.9</score>
<pdfVersion>1.4</pdfVersion>
<pdfPageSize>538.583 x 697.323 pts</pdfPageSize>
<refBibsNative>true</refBibsNative>
<keywordCount>1</keywordCount>
<abstractCharCount>1295</abstractCharCount>
<pdfWordCount>6221</pdfWordCount>
<pdfCharCount>40656</pdfCharCount>
<pdfPageCount>15</pdfPageCount>
<abstractWordCount>200</abstractWordCount>
</qualityIndicators>
<title>Towards a TEI-based encoding scheme for the annotation of parallel texts</title>
<genre>
<json:string>research-article</json:string>
</genre>
<host>
<volume>24</volume>
<publisherId>
<json:string>litlin</json:string>
</publisherId>
<pages>
<last>361</last>
<first>347</first>
</pages>
<issn>
<json:string>0268-1145</json:string>
</issn>
<issue>3</issue>
<genre>
<json:string>journal</json:string>
</genre>
<language>
<json:string>unknown</json:string>
</language>
<eissn>
<json:string>1477-4615</json:string>
</eissn>
<title>Literary and Linguistic Computing</title>
</host>
<categories>
<wos>
<json:string>LINGUISTICS</json:string>
<json:string>LITERATURE</json:string>
</wos>
</categories>
<publicationDate>2009</publicationDate>
<copyrightDate>2009</copyrightDate>
<doi>
<json:string>10.1093/llc/fqp023</json:string>
</doi>
<id>C87E170B5B672608C2723A492673F975F0780E43</id>
<score>0.16755493</score>
<fulltext>
<json:item>
<original>true</original>
<mimetype>application/pdf</mimetype>
<extension>pdf</extension>
<uri>https://api.istex.fr/document/C87E170B5B672608C2723A492673F975F0780E43/fulltext/pdf</uri>
</json:item>
<json:item>
<original>false</original>
<mimetype>application/zip</mimetype>
<extension>zip</extension>
<uri>https://api.istex.fr/document/C87E170B5B672608C2723A492673F975F0780E43/fulltext/zip</uri>
</json:item>
<istex:fulltextTEI uri="https://api.istex.fr/document/C87E170B5B672608C2723A492673F975F0780E43/fulltext/tei">
<teiHeader>
<fileDesc>
<titleStmt>
<title level="a">Towards a TEI-based encoding scheme for the annotation of parallel texts</title>
</titleStmt>
<publicationStmt>
<authority>ISTEX</authority>
<publisher>Oxford University Press</publisher>
<availability>
<p>OUP</p>
</availability>
<date>2009-05-18</date>
</publicationStmt>
<sourceDesc>
<biblStruct type="inbook">
<analytic>
<title level="a">Towards a TEI-based encoding scheme for the annotation of parallel texts</title>
<author>
<persName>
<forename type="first">Peter</forename>
<surname>Boot</surname>
</persName>
<email>pboot@xs4all.nl</email>
<affiliation>Huygens Institute, Royal Netherlands Academy of Arts and Sciences, The Hague, The Netherlands</affiliation>
</author>
</analytic>
<monogr>
<title level="j">Literary and Linguistic Computing</title>
<idno type="pISSN">0268-1145</idno>
<idno type="eISSN">1477-4615</idno>
<imprint>
<publisher>Oxford University Press</publisher>
<date type="published" when="2009-09"></date>
<biblScope unit="volume">24</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="347">347</biblScope>
<biblScope unit="page" to="361">361</biblScope>
</imprint>
</monogr>
<idno type="istex">C87E170B5B672608C2723A492673F975F0780E43</idno>
<idno type="DOI">10.1093/llc/fqp023</idno>
<idno type="ArticleID">fqp023</idno>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<creation>
<date>2009-05-18</date>
</creation>
<langUsage>
<language ident="en">en</language>
</langUsage>
<abstract>
<p>Translation, adaptation, and other forms of appropriation of literary works can result in bodies of parallel texts. For the purpose of studying appropriation strategies, it is important to be able to annotate digital representations of these parallel text structures. This article uses early modern emblem culture (books of engravings or woodcuts, accompanied by mottos and explanatory texts) to investigate the forms this text parallelism may take. It defines requirements for annotation definition and proposes a TEI (Text Encoding Initiative) extension to implement these requirements. In the proposed encoding scheme, TEI feature structures will be used for storing annotation information. This scheme should be useful for annotating parallel text structures as well as for other annotation tasks. The annotation scheme assumes the annotated texts are available in XML. If this is not the case (there is no electronic version of the text at all or perhaps only a facsimile) the article suggests the definition of a TEI proxy document. A TEI proxy document contains enough of the structural aspects of the texts to serve as a basis for attaching annotations to the text. Outside of the annotation context, proxy documents may serve as a basis for adding functionality to image-based editions.</p>
</abstract>
<textClass>
<keywords scheme="keyword">
<list>
<item>
<term>Original Articles</term>
</item>
</list>
</keywords>
</textClass>
</profileDesc>
<revisionDesc>
<change when="2009-05-18">Created</change>
<change when="2009-09">Published</change>
</revisionDesc>
</teiHeader>
</istex:fulltextTEI>
<json:item>
<original>false</original>
<mimetype>text/plain</mimetype>
<extension>txt</extension>
<uri>https://api.istex.fr/document/C87E170B5B672608C2723A492673F975F0780E43/fulltext/txt</uri>
</json:item>
</fulltext>
<metadata>
<istex:metadataXml wicri:clean="corpus oup" wicri:toSee="no header">
<istex:xmlDeclaration>version="1.0" encoding="utf-8"</istex:xmlDeclaration>
<istex:docType PUBLIC="-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" URI="journalpublishing.dtd" name="istex:docType"></istex:docType>
<istex:document>
<article article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">litlin</journal-id>
<journal-id journal-id-type="hwp">litlin</journal-id>
<journal-title>Literary and Linguistic Computing</journal-title>
<issn pub-type="ppub">0268-1145</issn>
<issn pub-type="epub">1477-4615</issn>
<publisher>
<publisher-name>Oxford University Press</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.1093/llc/fqp023</article-id>
<article-id pub-id-type="publisher-id">fqp023</article-id>
<article-categories>
<subj-group>
<subject>Original Articles</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Towards a TEI-based encoding scheme for the annotation of parallel texts</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Boot</surname>
<given-names>Peter</given-names>
</name>
</contrib>
</contrib-group>
<aff>Huygens Institute, Royal Netherlands Academy of Arts and Sciences, The Hague, The Netherlands</aff>
<author-notes>
<corresp>
<bold>Correspondence:</bold>
Peter Boot, Huygens Institute (KNAW), PO Box 90754, 2509 LT, The Hague, The Netherlands.
<bold>E-mail:</bold>
<email>pboot@xs4all.nl</email>
</corresp>
</author-notes>
<pub-date pub-type="ppub">
<month>9</month>
<year>2009</year>
</pub-date>
<pub-date pub-type="epub">
<day>18</day>
<month>5</month>
<year>2009</year>
</pub-date>
<volume>24</volume>
<issue>3</issue>
<fpage>347</fpage>
<lpage>361</lpage>
<permissions>
<copyright-statement>© The Author 2009. Published by Oxford University Press on behalf of ALLC and ACH. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org</copyright-statement>
<copyright-year>2009</copyright-year>
</permissions>
<abstract>
<p>Translation, adaptation, and other forms of appropriation of literary works can result in bodies of parallel texts. For the purpose of studying appropriation strategies, it is important to be able to annotate digital representations of these parallel text structures. This article uses early modern emblem culture (books of engravings or woodcuts, accompanied by mottos and explanatory texts) to investigate the forms this text parallelism may take. It defines requirements for annotation definition and proposes a TEI (Text Encoding Initiative) extension to implement these requirements. In the proposed encoding scheme, TEI feature structures will be used for storing annotation information. This scheme should be useful for annotating parallel text structures as well as for other annotation tasks. The annotation scheme assumes the annotated texts are available in XML. If this is not the case (there is no electronic version of the text at all or perhaps only a facsimile) the article suggests the definition of a TEI proxy document. A TEI proxy document contains enough of the structural aspects of the texts to serve as a basis for attaching annotations to the text. Outside of the annotation context, proxy documents may serve as a basis for adding functionality to image-based editions.</p>
</abstract>
</article-meta>
</front>
</article>
</istex:document>
</istex:metadataXml>
<mods version="3.6">
<titleInfo>
<title>Towards a TEI-based encoding scheme for the annotation of parallel texts</title>
</titleInfo>
<titleInfo type="alternative" contentType="CDATA">
<title>Towards a TEI-based encoding scheme for the annotation of parallel texts</title>
</titleInfo>
<name type="personal">
<namePart type="given">Peter</namePart>
<namePart type="family">Boot</namePart>
<affiliation>Huygens Institute, Royal Netherlands Academy of Arts and Sciences, The Hague, The Netherlands</affiliation>
<affiliation>E-mail: pboot@xs4all.nl</affiliation>
</name>
<typeOfResource>text</typeOfResource>
<genre type="research-article" displayLabel="research-article"></genre>
<subject>
<topic>Original Articles</topic>
</subject>
<originInfo>
<publisher>Oxford University Press</publisher>
<dateIssued encoding="w3cdtf">2009-09</dateIssued>
<dateCreated encoding="w3cdtf">2009-05-18</dateCreated>
<copyrightDate encoding="w3cdtf">2009</copyrightDate>
</originInfo>
<language>
<languageTerm type="code" authority="iso639-2b">eng</languageTerm>
<languageTerm type="code" authority="rfc3066">en</languageTerm>
</language>
<physicalDescription>
<internetMediaType>text/html</internetMediaType>
</physicalDescription>
<abstract>Translation, adaptation, and other forms of appropriation of literary works can result in bodies of parallel texts. For the purpose of studying appropriation strategies, it is important to be able to annotate digital representations of these parallel text structures. This article uses early modern emblem culture (books of engravings or woodcuts, accompanied by mottos and explanatory texts) to investigate the forms this text parallelism may take. It defines requirements for annotation definition and proposes a TEI (Text Encoding Initiative) extension to implement these requirements. In the proposed encoding scheme, TEI feature structures will be used for storing annotation information. This scheme should be useful for annotating parallel text structures as well as for other annotation tasks. The annotation scheme assumes the annotated texts are available in XML. If this is not the case (there is no electronic version of the text at all or perhaps only a facsimile) the article suggests the definition of a TEI proxy document. A TEI proxy document contains enough of the structural aspects of the texts to serve as a basis for attaching annotations to the text. Outside of the annotation context, proxy documents may serve as a basis for adding functionality to image-based editions.</abstract>
<relatedItem type="host">
<titleInfo>
<title>Literary and Linguistic Computing</title>
</titleInfo>
<genre type="journal">journal</genre>
<identifier type="ISSN">0268-1145</identifier>
<identifier type="eISSN">1477-4615</identifier>
<identifier type="PublisherID">litlin</identifier>
<identifier type="PublisherID-hwp">litlin</identifier>
<part>
<date>2009</date>
<detail type="volume">
<caption>vol.</caption>
<number>24</number>
</detail>
<detail type="issue">
<caption>no.</caption>
<number>3</number>
</detail>
<extent unit="pages">
<start>347</start>
<end>361</end>
</extent>
</part>
</relatedItem>
<identifier type="istex">C87E170B5B672608C2723A492673F975F0780E43</identifier>
<identifier type="DOI">10.1093/llc/fqp023</identifier>
<identifier type="ArticleID">fqp023</identifier>
<accessCondition type="use and reproduction" contentType="copyright">The Author 2009. Published by Oxford University Press on behalf of ALLC and ACH. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org</accessCondition>
<recordInfo>
<recordContentSource>OUP</recordContentSource>
</recordInfo>
</mods>
</metadata>
<covers>
<json:item>
<original>true</original>
<mimetype>image/tiff</mimetype>
<extension>tiff</extension>
<uri>https://api.istex.fr/document/C87E170B5B672608C2723A492673F975F0780E43/covers/tiff</uri>
</json:item>
</covers>
<annexes>
<json:item>
<original>true</original>
<mimetype>image/jpeg</mimetype>
<extension>jpeg</extension>
<uri>https://api.istex.fr/document/C87E170B5B672608C2723A492673F975F0780E43/annexes/jpeg</uri>
</json:item>
<json:item>
<original>true</original>
<mimetype>image/gif</mimetype>
<extension>gif</extension>
<uri>https://api.istex.fr/document/C87E170B5B672608C2723A492673F975F0780E43/annexes/gif</uri>
</json:item>
<json:item>
<original>true</original>
<mimetype>application/pdf</mimetype>
<extension>pdf</extension>
<uri>https://api.istex.fr/document/C87E170B5B672608C2723A492673F975F0780E43/annexes/pdf</uri>
</json:item>
</annexes>
<enrichments>
<istex:catWosTEI uri="https://api.istex.fr/document/C87E170B5B672608C2723A492673F975F0780E43/enrichments/catWos">
<teiHeader>
<profileDesc>
<textClass>
<classCode scheme="WOS">LINGUISTICS</classCode>
<classCode scheme="WOS">LITERATURE</classCode>
</textClass>
</profileDesc>
</teiHeader>
</istex:catWosTEI>
</enrichments>
<serie></serie>
</istex>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Ticri/explor/TeiVM2/Data/Istex/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000320 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Istex/Corpus/biblio.hfd -nk 000320 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Ticri
   |area=    TeiVM2
   |flux=    Istex
   |étape=   Corpus
   |type=    RBID
   |clé=     ISTEX:C87E170B5B672608C2723A492673F975F0780E43
   |texte=   Towards a TEI-based encoding scheme for the annotation of parallel texts
}}

Wicri

This area was generated with Dilib version V0.6.31.
Data generation: Mon Oct 30 21:59:18 2017. Site generation: Sun Feb 11 23:16:06 2024