Serveur d'exploration sur la TEI

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Practical considerations in the use of TEI headers in a large corpus

Identifieur interne : 000101 ( PascalFrancis/Corpus ); précédent : 000100; suivant : 000102

Practical considerations in the use of TEI headers in a large corpus

Auteurs : D. Dunlop

Source :

RBID : Pascal:96-0009176

Descripteurs français

English descriptors

Abstract

Many aspects of the guidelines of the Text Encoding Initiative (TEI) are applicable to corpora and text collections, and to the texts that these contain. As the first large corpus developed using mark-up conforming to the guidelines, the British National Corpus (BNC) is a test-bed for many TEI-developed mechanisms. This particularly true in the case of the TEI header, which has three intended applications-to describe a corpus, to describe an individual text, and as a free-standing bibliographic record- all of them used by the BNC. This paper describes the application of the TEI header to the BNC. It is intended that this information should, through a descriptions of experience on a practical project, serve as a guide for those wishing to use TEI headers in the documentation and management of other corpora and collections of texts

Notice en format standard (ISO 2709)

Pour connaître la documentation sur le format Inist Standard.

pA  
A01 01  1    @0 0010-4817
A02 01      @0 COHUAD
A03   1    @0 Comput. humanit.
A05       @2 29
A06       @2 1
A08 01  1  ENG  @1 Practical considerations in the use of TEI headers in a large corpus
A09 01  1  ENG  @1 The text encoding initiative : background and contexts
A11 01  1    @1 DUNLOP (D.)
A12 01  1    @1 IDE (Nancy) @9 ed.
A12 02  1    @1 VERONIS (Jean) @9 ed.
A14 01      @1 Oxford univ., computing serv. @2 Oxford OX2 6NN @3 GBR
A15 01      @1 Vassar coll., dep. computer sci. @2 Poughkeepsie NY 12601 @3 USA @Z 1 aut.
A15 02      @1 CNRS Univ. Provence, lab. parole langage @2 13621 Aix-en-Provence @3 FRA @Z 2 aut.
A20       @1 85-98
A21       @1 1995
A23 01      @0 ENG
A43 01      @1 INIST @2 14902 @5 354000053678760060
A44       @0 0000
A45       @0 14 ref.
A47 01  1    @0 96-0009176
A60       @1 P
A61       @0 A
A64 01  1    @0 Computers and the humanities
A66 01      @0 USA
C01 01    ENG  @0 Many aspects of the guidelines of the Text Encoding Initiative (TEI) are applicable to corpora and text collections, and to the texts that these contain. As the first large corpus developed using mark-up conforming to the guidelines, the British National Corpus (BNC) is a test-bed for many TEI-developed mechanisms. This particularly true in the case of the TEI header, which has three intended applications-to describe a corpus, to describe an individual text, and as a free-standing bibliographic record- all of them used by the BNC. This paper describes the application of the TEI header to the BNC. It is intended that this information should, through a descriptions of experience on a practical project, serve as a guide for those wishing to use TEI headers in the documentation and management of other corpora and collections of texts
C02 01  X    @0 001A01E02B
C02 02  X    @0 205
C03 01  X  FRE  @0 Texte @5 02
C03 01  X  ENG  @0 Text @5 02
C03 01  X  SPA  @0 Texto @5 02
C03 02  X  FRE  @0 Codage @5 03
C03 02  X  ENG  @0 Coding @5 03
C03 02  X  SPA  @0 Codificación @5 03
C03 03  X  FRE  @0 Marquage @5 04
C03 03  X  ENG  @0 Tagging @5 04
C03 03  X  GER  @0 Markieren @5 04
C03 03  X  SPA  @0 Marcación @5 04
C03 04  X  FRE  @0 Application @5 05
C03 04  X  ENG  @0 Application @5 05
C03 04  X  GER  @0 Anwendung @5 05
C03 04  X  SPA  @0 Aplicación @5 05
C03 05  X  FRE  @0 Grande dimension @5 06
C03 05  X  ENG  @0 Large dimension @5 06
C03 05  X  SPA  @0 Gran dimensión @5 06
C03 06  X  FRE  @0 Essai @5 07
C03 06  X  ENG  @0 Test @5 07
C03 06  X  SPA  @0 Ensayo @5 07
C03 07  X  FRE  @0 TEI (Text Encoding Initiative) @2 SI @4 INC @5 27
C03 08  X  FRE  @0 SGML (Standard Generalized Markup Language) @4 INC @5 28
C03 09  X  FRE  @0 BNC (British National Corpus) @2 SI @4 INC @5 29
C03 10  X  FRE  @0 Texte électronique @4 CD @5 96
C03 10  X  ENG  @0 Electronic text @4 CD @5 96
C03 11  X  FRE  @0 Corpus @4 CD @5 97
C03 11  X  ENG  @0 Corpus @4 CD @5 97
C03 12  X  FRE  @0 En tête @4 CD @5 98
C03 12  X  ENG  @0 Header @4 CD @5 98
C03 13  X  FRE  @0 Collection test @4 CD @5 99
C03 13  X  ENG  @0 Test collection @4 CD @5 99
C07 01  X  FRE  @0 Edition électronique @5 01
C07 01  X  ENG  @0 Electronic publishing @5 01
C07 01  X  SPA  @0 Edición electrónica @5 01
N21       @1 359

Format Inist (serveur)

NO : PASCAL 96-0009176 INIST
ET : Practical considerations in the use of TEI headers in a large corpus
AU : DUNLOP (D.); IDE (Nancy); VERONIS (Jean)
AF : Oxford univ., computing serv./Oxford OX2 6NN/Royaume-Uni; Vassar coll., dep. computer sci./Poughkeepsie NY 12601/Etats-Unis (1 aut.); CNRS Univ. Provence, lab. parole langage/13621 Aix-en-Provence/France (2 aut.)
DT : Publication en série; Niveau analytique
SO : Computers and the humanities; ISSN 0010-4817; Coden COHUAD; Etats-Unis; Da. 1995; Vol. 29; No. 1; Pp. 85-98; Bibl. 14 ref.
LA : Anglais
EA : Many aspects of the guidelines of the Text Encoding Initiative (TEI) are applicable to corpora and text collections, and to the texts that these contain. As the first large corpus developed using mark-up conforming to the guidelines, the British National Corpus (BNC) is a test-bed for many TEI-developed mechanisms. This particularly true in the case of the TEI header, which has three intended applications-to describe a corpus, to describe an individual text, and as a free-standing bibliographic record- all of them used by the BNC. This paper describes the application of the TEI header to the BNC. It is intended that this information should, through a descriptions of experience on a practical project, serve as a guide for those wishing to use TEI headers in the documentation and management of other corpora and collections of texts
CC : 001A01E02B; 205
FD : Texte; Codage; Marquage; Application; Grande dimension; Essai; TEI (Text Encoding Initiative); SGML (Standard Generalized Markup Language); BNC (British National Corpus); Texte électronique; Corpus; En tête; Collection test
FG : Edition électronique
ED : Text; Coding; Tagging; Application; Large dimension; Test; Electronic text; Corpus; Header; Test collection
EG : Electronic publishing
GD : Markieren; Anwendung
SD : Texto; Codificación; Marcación; Aplicación; Gran dimensión; Ensayo
LO : INIST-14902.354000053678760060
ID : 96-0009176

Links to Exploration step

Pascal:96-0009176

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Practical considerations in the use of TEI headers in a large corpus</title>
<author>
<name sortKey="Dunlop, D" sort="Dunlop, D" uniqKey="Dunlop D" first="D." last="Dunlop">D. Dunlop</name>
<affiliation>
<inist:fA14 i1="01">
<s1>Oxford univ., computing serv.</s1>
<s2>Oxford OX2 6NN</s2>
<s3>GBR</s3>
</inist:fA14>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">96-0009176</idno>
<date when="1995">1995</date>
<idno type="stanalyst">PASCAL 96-0009176 INIST</idno>
<idno type="RBID">Pascal:96-0009176</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000101</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Practical considerations in the use of TEI headers in a large corpus</title>
<author>
<name sortKey="Dunlop, D" sort="Dunlop, D" uniqKey="Dunlop D" first="D." last="Dunlop">D. Dunlop</name>
<affiliation>
<inist:fA14 i1="01">
<s1>Oxford univ., computing serv.</s1>
<s2>Oxford OX2 6NN</s2>
<s3>GBR</s3>
</inist:fA14>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">Computers and the humanities</title>
<title level="j" type="abbreviated">Comput. humanit.</title>
<idno type="ISSN">0010-4817</idno>
<imprint>
<date when="1995">1995</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Computers and the humanities</title>
<title level="j" type="abbreviated">Comput. humanit.</title>
<idno type="ISSN">0010-4817</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Application</term>
<term>Coding</term>
<term>Corpus</term>
<term>Electronic text</term>
<term>Header</term>
<term>Large dimension</term>
<term>Tagging</term>
<term>Test</term>
<term>Test collection</term>
<term>Text</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Texte</term>
<term>Codage</term>
<term>Marquage</term>
<term>Application</term>
<term>Grande dimension</term>
<term>Essai</term>
<term>TEI (Text Encoding Initiative)</term>
<term>SGML (Standard Generalized Markup Language)</term>
<term>BNC (British National Corpus)</term>
<term>Texte électronique</term>
<term>Corpus</term>
<term>En tête</term>
<term>Collection test</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Many aspects of the guidelines of the Text Encoding Initiative (TEI) are applicable to corpora and text collections, and to the texts that these contain. As the first large corpus developed using mark-up conforming to the guidelines, the British National Corpus (BNC) is a test-bed for many TEI-developed mechanisms. This particularly true in the case of the TEI header, which has three intended applications-to describe a corpus, to describe an individual text, and as a free-standing bibliographic record- all of them used by the BNC. This paper describes the application of the TEI header to the BNC. It is intended that this information should, through a descriptions of experience on a practical project, serve as a guide for those wishing to use TEI headers in the documentation and management of other corpora and collections of texts</div>
</front>
</TEI>
<inist>
<standard h6="B">
<pA>
<fA01 i1="01" i2="1">
<s0>0010-4817</s0>
</fA01>
<fA02 i1="01">
<s0>COHUAD</s0>
</fA02>
<fA03 i2="1">
<s0>Comput. humanit.</s0>
</fA03>
<fA05>
<s2>29</s2>
</fA05>
<fA06>
<s2>1</s2>
</fA06>
<fA08 i1="01" i2="1" l="ENG">
<s1>Practical considerations in the use of TEI headers in a large corpus</s1>
</fA08>
<fA09 i1="01" i2="1" l="ENG">
<s1>The text encoding initiative : background and contexts</s1>
</fA09>
<fA11 i1="01" i2="1">
<s1>DUNLOP (D.)</s1>
</fA11>
<fA12 i1="01" i2="1">
<s1>IDE (Nancy)</s1>
<s9>ed.</s9>
</fA12>
<fA12 i1="02" i2="1">
<s1>VERONIS (Jean)</s1>
<s9>ed.</s9>
</fA12>
<fA14 i1="01">
<s1>Oxford univ., computing serv.</s1>
<s2>Oxford OX2 6NN</s2>
<s3>GBR</s3>
</fA14>
<fA15 i1="01">
<s1>Vassar coll., dep. computer sci.</s1>
<s2>Poughkeepsie NY 12601</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
</fA15>
<fA15 i1="02">
<s1>CNRS Univ. Provence, lab. parole langage</s1>
<s2>13621 Aix-en-Provence</s2>
<s3>FRA</s3>
<sZ>2 aut.</sZ>
</fA15>
<fA20>
<s1>85-98</s1>
</fA20>
<fA21>
<s1>1995</s1>
</fA21>
<fA23 i1="01">
<s0>ENG</s0>
</fA23>
<fA43 i1="01">
<s1>INIST</s1>
<s2>14902</s2>
<s5>354000053678760060</s5>
</fA43>
<fA44>
<s0>0000</s0>
</fA44>
<fA45>
<s0>14 ref.</s0>
</fA45>
<fA47 i1="01" i2="1">
<s0>96-0009176</s0>
</fA47>
<fA60>
<s1>P</s1>
</fA60>
<fA61>
<s0>A</s0>
</fA61>
<fA64 i1="01" i2="1">
<s0>Computers and the humanities</s0>
</fA64>
<fA66 i1="01">
<s0>USA</s0>
</fA66>
<fC01 i1="01" l="ENG">
<s0>Many aspects of the guidelines of the Text Encoding Initiative (TEI) are applicable to corpora and text collections, and to the texts that these contain. As the first large corpus developed using mark-up conforming to the guidelines, the British National Corpus (BNC) is a test-bed for many TEI-developed mechanisms. This particularly true in the case of the TEI header, which has three intended applications-to describe a corpus, to describe an individual text, and as a free-standing bibliographic record- all of them used by the BNC. This paper describes the application of the TEI header to the BNC. It is intended that this information should, through a descriptions of experience on a practical project, serve as a guide for those wishing to use TEI headers in the documentation and management of other corpora and collections of texts</s0>
</fC01>
<fC02 i1="01" i2="X">
<s0>001A01E02B</s0>
</fC02>
<fC02 i1="02" i2="X">
<s0>205</s0>
</fC02>
<fC03 i1="01" i2="X" l="FRE">
<s0>Texte</s0>
<s5>02</s5>
</fC03>
<fC03 i1="01" i2="X" l="ENG">
<s0>Text</s0>
<s5>02</s5>
</fC03>
<fC03 i1="01" i2="X" l="SPA">
<s0>Texto</s0>
<s5>02</s5>
</fC03>
<fC03 i1="02" i2="X" l="FRE">
<s0>Codage</s0>
<s5>03</s5>
</fC03>
<fC03 i1="02" i2="X" l="ENG">
<s0>Coding</s0>
<s5>03</s5>
</fC03>
<fC03 i1="02" i2="X" l="SPA">
<s0>Codificación</s0>
<s5>03</s5>
</fC03>
<fC03 i1="03" i2="X" l="FRE">
<s0>Marquage</s0>
<s5>04</s5>
</fC03>
<fC03 i1="03" i2="X" l="ENG">
<s0>Tagging</s0>
<s5>04</s5>
</fC03>
<fC03 i1="03" i2="X" l="GER">
<s0>Markieren</s0>
<s5>04</s5>
</fC03>
<fC03 i1="03" i2="X" l="SPA">
<s0>Marcación</s0>
<s5>04</s5>
</fC03>
<fC03 i1="04" i2="X" l="FRE">
<s0>Application</s0>
<s5>05</s5>
</fC03>
<fC03 i1="04" i2="X" l="ENG">
<s0>Application</s0>
<s5>05</s5>
</fC03>
<fC03 i1="04" i2="X" l="GER">
<s0>Anwendung</s0>
<s5>05</s5>
</fC03>
<fC03 i1="04" i2="X" l="SPA">
<s0>Aplicación</s0>
<s5>05</s5>
</fC03>
<fC03 i1="05" i2="X" l="FRE">
<s0>Grande dimension</s0>
<s5>06</s5>
</fC03>
<fC03 i1="05" i2="X" l="ENG">
<s0>Large dimension</s0>
<s5>06</s5>
</fC03>
<fC03 i1="05" i2="X" l="SPA">
<s0>Gran dimensión</s0>
<s5>06</s5>
</fC03>
<fC03 i1="06" i2="X" l="FRE">
<s0>Essai</s0>
<s5>07</s5>
</fC03>
<fC03 i1="06" i2="X" l="ENG">
<s0>Test</s0>
<s5>07</s5>
</fC03>
<fC03 i1="06" i2="X" l="SPA">
<s0>Ensayo</s0>
<s5>07</s5>
</fC03>
<fC03 i1="07" i2="X" l="FRE">
<s0>TEI (Text Encoding Initiative)</s0>
<s2>SI</s2>
<s4>INC</s4>
<s5>27</s5>
</fC03>
<fC03 i1="08" i2="X" l="FRE">
<s0>SGML (Standard Generalized Markup Language)</s0>
<s4>INC</s4>
<s5>28</s5>
</fC03>
<fC03 i1="09" i2="X" l="FRE">
<s0>BNC (British National Corpus)</s0>
<s2>SI</s2>
<s4>INC</s4>
<s5>29</s5>
</fC03>
<fC03 i1="10" i2="X" l="FRE">
<s0>Texte électronique</s0>
<s4>CD</s4>
<s5>96</s5>
</fC03>
<fC03 i1="10" i2="X" l="ENG">
<s0>Electronic text</s0>
<s4>CD</s4>
<s5>96</s5>
</fC03>
<fC03 i1="11" i2="X" l="FRE">
<s0>Corpus</s0>
<s4>CD</s4>
<s5>97</s5>
</fC03>
<fC03 i1="11" i2="X" l="ENG">
<s0>Corpus</s0>
<s4>CD</s4>
<s5>97</s5>
</fC03>
<fC03 i1="12" i2="X" l="FRE">
<s0>En tête</s0>
<s4>CD</s4>
<s5>98</s5>
</fC03>
<fC03 i1="12" i2="X" l="ENG">
<s0>Header</s0>
<s4>CD</s4>
<s5>98</s5>
</fC03>
<fC03 i1="13" i2="X" l="FRE">
<s0>Collection test</s0>
<s4>CD</s4>
<s5>99</s5>
</fC03>
<fC03 i1="13" i2="X" l="ENG">
<s0>Test collection</s0>
<s4>CD</s4>
<s5>99</s5>
</fC03>
<fC07 i1="01" i2="X" l="FRE">
<s0>Edition électronique</s0>
<s5>01</s5>
</fC07>
<fC07 i1="01" i2="X" l="ENG">
<s0>Electronic publishing</s0>
<s5>01</s5>
</fC07>
<fC07 i1="01" i2="X" l="SPA">
<s0>Edición electrónica</s0>
<s5>01</s5>
</fC07>
<fN21>
<s1>359</s1>
</fN21>
</pA>
</standard>
<server>
<NO>PASCAL 96-0009176 INIST</NO>
<ET>Practical considerations in the use of TEI headers in a large corpus</ET>
<AU>DUNLOP (D.); IDE (Nancy); VERONIS (Jean)</AU>
<AF>Oxford univ., computing serv./Oxford OX2 6NN/Royaume-Uni; Vassar coll., dep. computer sci./Poughkeepsie NY 12601/Etats-Unis (1 aut.); CNRS Univ. Provence, lab. parole langage/13621 Aix-en-Provence/France (2 aut.)</AF>
<DT>Publication en série; Niveau analytique</DT>
<SO>Computers and the humanities; ISSN 0010-4817; Coden COHUAD; Etats-Unis; Da. 1995; Vol. 29; No. 1; Pp. 85-98; Bibl. 14 ref.</SO>
<LA>Anglais</LA>
<EA>Many aspects of the guidelines of the Text Encoding Initiative (TEI) are applicable to corpora and text collections, and to the texts that these contain. As the first large corpus developed using mark-up conforming to the guidelines, the British National Corpus (BNC) is a test-bed for many TEI-developed mechanisms. This particularly true in the case of the TEI header, which has three intended applications-to describe a corpus, to describe an individual text, and as a free-standing bibliographic record- all of them used by the BNC. This paper describes the application of the TEI header to the BNC. It is intended that this information should, through a descriptions of experience on a practical project, serve as a guide for those wishing to use TEI headers in the documentation and management of other corpora and collections of texts</EA>
<CC>001A01E02B; 205</CC>
<FD>Texte; Codage; Marquage; Application; Grande dimension; Essai; TEI (Text Encoding Initiative); SGML (Standard Generalized Markup Language); BNC (British National Corpus); Texte électronique; Corpus; En tête; Collection test</FD>
<FG>Edition électronique</FG>
<ED>Text; Coding; Tagging; Application; Large dimension; Test; Electronic text; Corpus; Header; Test collection</ED>
<EG>Electronic publishing</EG>
<GD>Markieren; Anwendung</GD>
<SD>Texto; Codificación; Marcación; Aplicación; Gran dimensión; Ensayo</SD>
<LO>INIST-14902.354000053678760060</LO>
<ID>96-0009176</ID>
</server>
</inist>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Ticri/explor/TeiVM2/Data/PascalFrancis/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000101 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Corpus/biblio.hfd -nk 000101 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Ticri
   |area=    TeiVM2
   |flux=    PascalFrancis
   |étape=   Corpus
   |type=    RBID
   |clé=     Pascal:96-0009176
   |texte=   Practical considerations in the use of TEI headers in a large corpus
}}

Wicri

This area was generated with Dilib version V0.6.31.
Data generation: Mon Oct 30 21:59:18 2017. Site generation: Sun Feb 11 23:16:06 2024