Serveur d'exploration sur la TEI

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

TEI encoding and syntactic tagging of an old French text

Identifieur interne : 000059 ( PascalFrancis/Checkpoint ); précédent : 000058; suivant : 000060

TEI encoding and syntactic tagging of an old French text

Auteurs : D. Estival [Australie] ; N. Nicholas [Australie]

Source :

RBID : Francis:524-99-12135

Descripteurs français

English descriptors

Abstract

This paper report on some of the concrete outcomes of a larger research project on the study of syntactic change. In this part of the project, we are collecting and encoding historical texts and tagging them for syntactic analysis. We have so far produced a TEI-conformant version of an Old French text, "La Vie de Saint Louis" written by Jehan de Joinville around 1305, and we are in the process of adding syntactic tags to this text. Those syntactic tags are derived from the Penn-Helsinki coding scheme, which had been devised for the syntactic encoding of Middle English texts, and have been translated into TEI. Thus this paper addresses two issues: the development of a TEI encoding for the text, and the adaptation of the Penn-Helsinki syntactic coding scheme. While the first part of this work raises issues of a textual nature independently of the language of the text, and proposes concrete immediate solutions, the second part points to a more general extension of the PH tagset to other types of texts and to other languages


Affiliations:


Links toward previous steps (curation, corpus...)


Links to Exploration step

Francis:524-99-12135

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">TEI encoding and syntactic tagging of an old French text</title>
<author>
<name sortKey="Estival, D" sort="Estival, D" uniqKey="Estival D" first="D." last="Estival">D. Estival</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Department of Linguistics & Applied Linguistics, University of Melbourne</s1>
<s2>Parkville, Victoria 3052</s2>
<s3>AUS</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Australie</country>
<wicri:noRegion>Parkville, Victoria 3052</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Nicholas, N" sort="Nicholas, N" uniqKey="Nicholas N" first="N." last="Nicholas">N. Nicholas</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Department of Linguistics & Applied Linguistics, University of Melbourne</s1>
<s2>Parkville, Victoria 3052</s2>
<s3>AUS</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Australie</country>
<wicri:noRegion>Parkville, Victoria 3052</wicri:noRegion>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">524-99-12135</idno>
<date when="1999">1999</date>
<idno type="stanalyst">FRANCIS 524-99-12135 INIST</idno>
<idno type="RBID">Francis:524-99-12135</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000076</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000053</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000059</idno>
<idno type="wicri:explorRef" wicri:stream="PascalFrancis" wicri:step="Checkpoint">000059</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">TEI encoding and syntactic tagging of an old French text</title>
<author>
<name sortKey="Estival, D" sort="Estival, D" uniqKey="Estival D" first="D." last="Estival">D. Estival</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Department of Linguistics & Applied Linguistics, University of Melbourne</s1>
<s2>Parkville, Victoria 3052</s2>
<s3>AUS</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Australie</country>
<wicri:noRegion>Parkville, Victoria 3052</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Nicholas, N" sort="Nicholas, N" uniqKey="Nicholas N" first="N." last="Nicholas">N. Nicholas</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Department of Linguistics & Applied Linguistics, University of Melbourne</s1>
<s2>Parkville, Victoria 3052</s2>
<s3>AUS</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Australie</country>
<wicri:noRegion>Parkville, Victoria 3052</wicri:noRegion>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">Computers and the humanities</title>
<title level="j" type="abbreviated">Comput. humanit.</title>
<idno type="ISSN">0010-4817</idno>
<imprint>
<date when="1999">1999</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Computers and the humanities</title>
<title level="j" type="abbreviated">Comput. humanit.</title>
<idno type="ISSN">0010-4817</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Computational linguistics</term>
<term>Corpus annotation</term>
<term>Electronic text</term>
<term>Markup language</term>
<term>TEI</term>
<term>Tagging</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Linguistique informatique</term>
<term>Annotation de corpus</term>
<term>Changement syntaxique</term>
<term>Texte électronique</term>
<term>Etiquetage automatique</term>
<term>Français (ancien-)</term>
<term>Encodage</term>
<term>Joinville (J. de)</term>
<term>SGML</term>
<term>TEI</term>
<term>Langage de balisage</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">This paper report on some of the concrete outcomes of a larger research project on the study of syntactic change. In this part of the project, we are collecting and encoding historical texts and tagging them for syntactic analysis. We have so far produced a TEI-conformant version of an Old French text, "La Vie de Saint Louis" written by Jehan de Joinville around 1305, and we are in the process of adding syntactic tags to this text. Those syntactic tags are derived from the Penn-Helsinki coding scheme, which had been devised for the syntactic encoding of Middle English texts, and have been translated into TEI. Thus this paper addresses two issues: the development of a TEI encoding for the text, and the adaptation of the Penn-Helsinki syntactic coding scheme. While the first part of this work raises issues of a textual nature independently of the language of the text, and proposes concrete immediate solutions, the second part points to a more general extension of the PH tagset to other types of texts and to other languages</div>
</front>
</TEI>
<inist>
<standard h6="B">
<pA>
<fA01 i1="01" i2="1">
<s0>0010-4817</s0>
</fA01>
<fA02 i1="01">
<s0>COHUAD</s0>
</fA02>
<fA03 i2="1">
<s0>Comput. humanit.</s0>
</fA03>
<fA05>
<s2>33</s2>
</fA05>
<fA06>
<s2>1-2</s2>
</fA06>
<fA08 i1="01" i2="1" l="ENG">
<s1>TEI encoding and syntactic tagging of an old French text</s1>
</fA08>
<fA09 i1="01" i2="1" l="ENG">
<s1>Selected papers from TEI 10: Celebrating the tenth anniversary of the Text Encoding Initiative</s1>
</fA09>
<fA11 i1="01" i2="1">
<s1>ESTIVAL (D.)</s1>
</fA11>
<fA11 i1="02" i2="1">
<s1>NICHOLAS (N.)</s1>
</fA11>
<fA12 i1="01" i2="1">
<s1>MYLONAS (Elli)</s1>
<s9>ed.</s9>
</fA12>
<fA12 i1="02" i2="1">
<s1>RENEAR (Allen)</s1>
<s9>ed.</s9>
</fA12>
<fA14 i1="01">
<s1>Department of Linguistics & Applied Linguistics, University of Melbourne</s1>
<s2>Parkville, Victoria 3052</s2>
<s3>AUS</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</fA14>
<fA15 i1="01">
<s1>Scholarly Technology Group, Brown University</s1>
<s2>Providence, RI</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</fA15>
<fA20>
<s1>155-174</s1>
</fA20>
<fA21>
<s1>1999</s1>
</fA21>
<fA23 i1="01">
<s0>ENG</s0>
</fA23>
<fA43 i1="01">
<s1>INIST</s1>
<s2>14902</s2>
<s5>354000084333370110</s5>
</fA43>
<fA44>
<s0>0000</s0>
<s1>© 1999 INIST-CNRS. All rights reserved.</s1>
</fA44>
<fA45>
<s0>24 ref.</s0>
</fA45>
<fA47 i1="01" i2="1">
<s0>524-99-12135</s0>
</fA47>
<fA60>
<s1>P</s1>
<s2>C</s2>
</fA60>
<fA61>
<s0>A</s0>
</fA61>
<fA64 i1="01" i2="1">
<s0>Computers and the humanities</s0>
</fA64>
<fA66 i1="01">
<s0>NLD</s0>
</fA66>
<fA68 i1="01" i2="1" l="FRE">
<s1>L'encodage TEI et l'étiquetage syntaxique d'un texte en ancien français</s1>
</fA68>
<fA69 i1="01" i2="1" l="FRE">
<s1>Sélection d'articles célébrant le 10
<sup>e</sup>
anniversaire de la TEI</s1>
</fA69>
<fA99>
<s0>18 notes</s0>
</fA99>
<fC01 i1="01" l="ENG">
<s0>This paper report on some of the concrete outcomes of a larger research project on the study of syntactic change. In this part of the project, we are collecting and encoding historical texts and tagging them for syntactic analysis. We have so far produced a TEI-conformant version of an Old French text, "La Vie de Saint Louis" written by Jehan de Joinville around 1305, and we are in the process of adding syntactic tags to this text. Those syntactic tags are derived from the Penn-Helsinki coding scheme, which had been devised for the syntactic encoding of Middle English texts, and have been translated into TEI. Thus this paper addresses two issues: the development of a TEI encoding for the text, and the adaptation of the Penn-Helsinki syntactic coding scheme. While the first part of this work raises issues of a textual nature independently of the language of the text, and proposes concrete immediate solutions, the second part points to a more general extension of the PH tagset to other types of texts and to other languages</s0>
</fC01>
<fC02 i1="01" i2="L">
<s0>52478</s0>
<s1>XV</s1>
</fC02>
<fC02 i1="02" i2="L">
<s0>524</s0>
</fC02>
<fC03 i1="01" i2="L" l="FRE">
<s0>Linguistique informatique</s0>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="L" l="ENG">
<s0>Computational linguistics</s0>
<s5>01</s5>
</fC03>
<fC03 i1="02" i2="L" l="FRE">
<s0>Annotation de corpus</s0>
<s5>02</s5>
</fC03>
<fC03 i1="02" i2="L" l="ENG">
<s0>Corpus annotation</s0>
<s5>02</s5>
</fC03>
<fC03 i1="03" i2="L" l="FRE">
<s0>Changement syntaxique</s0>
<s5>03</s5>
</fC03>
<fC03 i1="04" i2="L" l="FRE">
<s0>Texte électronique</s0>
<s5>04</s5>
</fC03>
<fC03 i1="04" i2="L" l="ENG">
<s0>Electronic text</s0>
<s5>04</s5>
</fC03>
<fC03 i1="05" i2="L" l="FRE">
<s0>Etiquetage automatique</s0>
<s5>05</s5>
</fC03>
<fC03 i1="05" i2="L" l="ENG">
<s0>Tagging</s0>
<s5>05</s5>
</fC03>
<fC03 i1="06" i2="L" l="FRE">
<s0>Français (ancien-)</s0>
<s2>NL</s2>
<s5>08</s5>
</fC03>
<fC03 i1="07" i2="L" l="FRE">
<s0>Encodage</s0>
<s4>INC</s4>
<s5>31</s5>
</fC03>
<fC03 i1="08" i2="L" l="FRE">
<s0>Joinville (J. de)</s0>
<s4>INC</s4>
<s5>32</s5>
</fC03>
<fC03 i1="09" i2="L" l="FRE">
<s0>SGML</s0>
<s4>INC</s4>
<s5>33</s5>
</fC03>
<fC03 i1="10" i2="L" l="FRE">
<s0>TEI</s0>
<s4>CD</s4>
<s5>96</s5>
</fC03>
<fC03 i1="10" i2="L" l="ENG">
<s0>TEI</s0>
<s4>CD</s4>
<s5>96</s5>
</fC03>
<fC03 i1="11" i2="L" l="FRE">
<s0>Langage de balisage</s0>
<s4>CD</s4>
<s5>97</s5>
</fC03>
<fC03 i1="11" i2="L" l="ENG">
<s0>Markup language</s0>
<s4>CD</s4>
<s5>97</s5>
</fC03>
<fN21>
<s1>193</s1>
</fN21>
</pA>
<pR>
<fA30 i1="01" i2="1" l="ENG">
<s1>Text Encoding Initiative 10th Anniversary Conference</s1>
<s3>Providence, RI USA</s3>
<s4>1997-11</s4>
</fA30>
</pR>
</standard>
</inist>
<affiliations>
<list>
<country>
<li>Australie</li>
</country>
</list>
<tree>
<country name="Australie">
<noRegion>
<name sortKey="Estival, D" sort="Estival, D" uniqKey="Estival D" first="D." last="Estival">D. Estival</name>
</noRegion>
<name sortKey="Nicholas, N" sort="Nicholas, N" uniqKey="Nicholas N" first="N." last="Nicholas">N. Nicholas</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Ticri/explor/TeiVM2/Data/PascalFrancis/Checkpoint
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000059 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Checkpoint/biblio.hfd -nk 000059 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Ticri
   |area=    TeiVM2
   |flux=    PascalFrancis
   |étape=   Checkpoint
   |type=    RBID
   |clé=     Francis:524-99-12135
   |texte=   TEI encoding and syntactic tagging of an old French text
}}

Wicri

This area was generated with Dilib version V0.6.31.
Data generation: Mon Oct 30 21:59:18 2017. Site generation: Sun Feb 11 23:16:06 2024