TeiVM2, PascalFrancis, Curation, bibRecord, 000035

TEI documents in the grid

Identifieur interne : 000035 ( PascalFrancis/Curation ); précédent : 000034; suivant : 000036

TEI documents in the grid

Auteurs : Andrea Zielinski [Allemagne] ; Wolfgang Pempe [Allemagne] ; Peter Gietz [Allemagne] ; Martin Haase [Allemagne] ; Stefan Funk [Allemagne] ; Christian Simon [Allemagne]

Source :

Literary and linguistic computing [ 0268-1145 ] ; 2009.

RBID : Francis:11-0223726

Descripteurs français

Pascal (Inist)
- TEI, Projet de recherche, Linguistique informatique, Numérisation, Annotation de corpus, Dictionnaire, Traitement automatique des langues naturelles, Bibliothèque électronique, Langage XML.
Wicri :
- topic : Projet de recherche, Numérisation, Dictionnaire.

English descriptors

KwdEn :
- Computational linguistics, Corpus annotation, Dictionary, Digitization, Natural language processing, Research project, TEI.

Abstract

This article describes the life cycle of a TEI Document within TextGrid, an eHumanities platform for scholarly text processing, in which structured search is based on the TEl framework and metadata with restricted values. A workbench is provided that offers tools for handling TEI documents, TextGridLab, making it easier to annotate, process, search, and persistently store new digitized texts. The digitization and annotation of the Campe dictionary¹ serves as a first test bed. The overall framework of TextGrid is very generic and can handle different types of text (literary editions, linguistic corpora, lexica) as well as heterogeneous data formats (plain text, XML/TEI, images). In fact, the TextGrid repository, TextGridRep, is designed as a digital virtual library over federated archives, where humanities projects are invited to participate. Sharing of data is enabled by means of a grid-based architecture. Specifically the middleware includes most of the treatment of authorization, search, and file management. TextGrid is entirely based on open source software including Eclipse² and Globus Toolkit.

A01	`01`	`1`		`@0 0268-1145`
A03		`1`		`@0 Lit. linguist. comput.`
A05				`@2 24`
A06				`@2 3`
A08	`01`	`1`	`ENG`	`@1 TEI documents in the grid`
A09	`01`	`1`	`ENG`	`@1 Selected papers from Text Encoding Initiative`
A11	`01`	`1`		`@1 ZIELINSKI (Andrea)`
A11	`02`	`1`		`@1 PEMPE (Wolfgang)`
A11	`03`	`1`		`@1 GIETZ (Peter)`
A11	`04`	`1`		`@1 HAASE (Martin)`
A11	`05`	`1`		`@1 FUNK (Stefan)`
A11	`06`	`1`		`@1 SIMON (Christian)`
A12	`01`	`1`		`@1 RAHTZ (Sebastian) @9 ed.`
A12	`02`	`1`		`@1 SCHREIBMAN (Susan) @9 ed.`
A14	`01`			`@1 Institut für Deutsche Sprache Mannheim @3 DEU @Z 1 aut. @Z 6 aut.`
A14	`02`			`@1 Saphor GmbH @3 DEU @Z 2 aut.`
A14	`03`			`@1 DAASI International GmbH @3 DEU @Z 3 aut. @Z 4 aut. @Z 5 aut.`
A15	`01`			`@1 Oxford University @3 GBR @Z 1 aut.`
A15	`02`			`@1 Digital Humanities Observatory, Royal Irish Academy @3 IRL @Z 2 aut.`
A20				`@1 267-279`
A21				`@1 2009`
A23	`01`			`@0 ENG`
A43	`01`			`@1 INIST @2 23967 @5 354000171896560020`
A44				`@0 0000 @1 © 2011 INIST-CNRS. All rights reserved.`
A45				`@0 3/4 p.`
A47	`01`	`1`		`@0 11-0223726`
A60				`@1 P @2 C`
A61				`@0 A`
A64	`01`	`1`		`@0 Literary and linguistic computing`
A66	`01`			`@0 GBR`
A68	`01`	`1`	`FRE`	`@1 Documents de la TEI dans la grille`
A69	`01`	`1`	`FRE`	`@1 Sélection d'articles de la TEI`
A99				`@0 27 notes`
C01	`01`		`ENG`	@0 This article describes the life cycle of a TEI Document within TextGrid, an eHumanities platform for scholarly text processing, in which structured search is based on the TEl framework and metadata with restricted values. A workbench is provided that offers tools for handling TEI documents, TextGridLab, making it easier to annotate, process, search, and persistently store new digitized texts. The digitization and annotation of the Campe dictionary¹ serves as a first test bed. The overall framework of TextGrid is very generic and can handle different types of text (literary editions, linguistic corpora, lexica) as well as heterogeneous data formats (plain text, XML/TEI, images). In fact, the TextGrid repository, TextGridRep, is designed as a digital virtual library over federated archives, where humanities projects are invited to participate. Sharing of data is enabled by means of a grid-based architecture. Specifically the middleware includes most of the treatment of authorization, search, and file management. TextGrid is entirely based on open source software including Eclipse² and Globus Toolkit.
C02	`01`	`L`		`@0 52478 @1 XV`
C02	`02`	`L`		`@0 524`
C03	`01`	`L`	`FRE`	`@0 TEI @2 NI @5 01`
C03	`01`	`L`	`ENG`	`@0 TEI @2 NI @5 01`
C03	`02`	`L`	`FRE`	`@0 Projet de recherche @2 NI @5 02`
C03	`02`	`L`	`ENG`	`@0 Research project @2 NI @5 02`
C03	`03`	`L`	`FRE`	`@0 Linguistique informatique @2 NI @5 03`
C03	`03`	`L`	`ENG`	`@0 Computational linguistics @2 NI @5 03`
C03	`04`	`L`	`FRE`	`@0 Numérisation @2 NI @5 04`
C03	`04`	`L`	`ENG`	`@0 Digitization @2 NI @5 04`
C03	`05`	`L`	`FRE`	`@0 Annotation de corpus @2 NI @5 05`
C03	`05`	`L`	`ENG`	`@0 Corpus annotation @2 NI @5 05`
C03	`06`	`L`	`FRE`	`@0 Dictionnaire @2 NI @5 06`
C03	`06`	`L`	`ENG`	`@0 Dictionary @2 NI @5 06`
C03	`07`	`L`	`FRE`	`@0 Traitement automatique des langues naturelles @2 NI @5 07`
C03	`07`	`L`	`ENG`	`@0 Natural language processing @2 NI @5 07`
C03	`08`	`L`	`FRE`	`@0 Bibliothèque électronique @4 INC @5 31`
C03	`09`	`L`	`FRE`	`@0 Langage XML @4 INC @5 32`
N21				`@1 150`

A30	`01`	`1`	`ENG`	`@1 TEI Consortium @2 6 @3 Maryland USA @4 2007-11`

Links toward previous steps (curation, corpus...)

to stream PascalFrancis, to step Corpus: Pour aller vers cette notice dans l'étape Curation :000010

Links to Exploration step

Francis:11-0223726

Le document en format XML

<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" level="a">TEI documents in the grid</title>
<author><name sortKey="Zielinski, Andrea" sort="Zielinski, Andrea" uniqKey="Zielinski A" first="Andrea" last="Zielinski">Andrea Zielinski</name>
<affiliation wicri:level="1"><inist:fA14 i1="01"><s1>Institut für Deutsche Sprache Mannheim</s1>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
</affiliation>
</author>
<author><name sortKey="Pempe, Wolfgang" sort="Pempe, Wolfgang" uniqKey="Pempe W" first="Wolfgang" last="Pempe">Wolfgang Pempe</name>
<affiliation wicri:level="1"><inist:fA14 i1="02"><s1>Saphor GmbH</s1>
<s3>DEU</s3>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
</affiliation>
</author>
<author><name sortKey="Gietz, Peter" sort="Gietz, Peter" uniqKey="Gietz P" first="Peter" last="Gietz">Peter Gietz</name>
<affiliation wicri:level="1"><inist:fA14 i1="03"><s1>DAASI International GmbH</s1>
<s3>DEU</s3>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
</affiliation>
</author>
<author><name sortKey="Haase, Martin" sort="Haase, Martin" uniqKey="Haase M" first="Martin" last="Haase">Martin Haase</name>
<affiliation wicri:level="1"><inist:fA14 i1="03"><s1>DAASI International GmbH</s1>
<s3>DEU</s3>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
</affiliation>
</author>
<author><name sortKey="Funk, Stefan" sort="Funk, Stefan" uniqKey="Funk S" first="Stefan" last="Funk">Stefan Funk</name>
<affiliation wicri:level="1"><inist:fA14 i1="03"><s1>DAASI International GmbH</s1>
<s3>DEU</s3>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
</affiliation>
</author>
<author><name sortKey="Simon, Christian" sort="Simon, Christian" uniqKey="Simon C" first="Christian" last="Simon">Christian Simon</name>
<affiliation wicri:level="1"><inist:fA14 i1="01"><s1>Institut für Deutsche Sprache Mannheim</s1>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">INIST</idno>
<idno type="inist">11-0223726</idno>
<date when="2009">2009</date>
<idno type="stanalyst">FRANCIS 11-0223726 INIST</idno>
<idno type="RBID">Francis:11-0223726</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000010</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000035</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a">TEI documents in the grid</title>
<author><name sortKey="Zielinski, Andrea" sort="Zielinski, Andrea" uniqKey="Zielinski A" first="Andrea" last="Zielinski">Andrea Zielinski</name>
<affiliation wicri:level="1"><inist:fA14 i1="01"><s1>Institut für Deutsche Sprache Mannheim</s1>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
</affiliation>
</author>
<author><name sortKey="Pempe, Wolfgang" sort="Pempe, Wolfgang" uniqKey="Pempe W" first="Wolfgang" last="Pempe">Wolfgang Pempe</name>
<affiliation wicri:level="1"><inist:fA14 i1="02"><s1>Saphor GmbH</s1>
<s3>DEU</s3>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
</affiliation>
</author>
<author><name sortKey="Gietz, Peter" sort="Gietz, Peter" uniqKey="Gietz P" first="Peter" last="Gietz">Peter Gietz</name>
<affiliation wicri:level="1"><inist:fA14 i1="03"><s1>DAASI International GmbH</s1>
<s3>DEU</s3>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
</affiliation>
</author>
<author><name sortKey="Haase, Martin" sort="Haase, Martin" uniqKey="Haase M" first="Martin" last="Haase">Martin Haase</name>
<affiliation wicri:level="1"><inist:fA14 i1="03"><s1>DAASI International GmbH</s1>
<s3>DEU</s3>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
</affiliation>
</author>
<author><name sortKey="Funk, Stefan" sort="Funk, Stefan" uniqKey="Funk S" first="Stefan" last="Funk">Stefan Funk</name>
<affiliation wicri:level="1"><inist:fA14 i1="03"><s1>DAASI International GmbH</s1>
<s3>DEU</s3>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
</affiliation>
</author>
<author><name sortKey="Simon, Christian" sort="Simon, Christian" uniqKey="Simon C" first="Christian" last="Simon">Christian Simon</name>
<affiliation wicri:level="1"><inist:fA14 i1="01"><s1>Institut für Deutsche Sprache Mannheim</s1>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
</affiliation>
</author>
</analytic>
<series><title level="j" type="main">Literary and linguistic computing</title>
<title level="j" type="abbreviated">Lit. linguist. comput.</title>
<idno type="ISSN">0268-1145</idno>
<imprint><date when="2009">2009</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><title level="j" type="main">Literary and linguistic computing</title>
<title level="j" type="abbreviated">Lit. linguist. comput.</title>
<idno type="ISSN">0268-1145</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Computational linguistics</term>
<term>Corpus annotation</term>
<term>Dictionary</term>
<term>Digitization</term>
<term>Natural language processing</term>
<term>Research project</term>
<term>TEI</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>TEI</term>
<term>Projet de recherche</term>
<term>Linguistique informatique</term>
<term>Numérisation</term>
<term>Annotation de corpus</term>
<term>Dictionnaire</term>
<term>Traitement automatique des langues naturelles</term>
<term>Bibliothèque électronique</term>
<term>Langage XML</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr"><term>Projet de recherche</term>
<term>Numérisation</term>
<term>Dictionnaire</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">This article describes the life cycle of a TEI Document within TextGrid, an eHumanities platform for scholarly text processing, in which structured search is based on the TEl framework and metadata with restricted values. A workbench is provided that offers tools for handling TEI documents, TextGridLab, making it easier to annotate, process, search, and persistently store new digitized texts. The digitization and annotation of the Campe dictionary<sup>1</sup>
 serves as a first test bed. The overall framework of TextGrid is very generic and can handle different types of text (literary editions, linguistic corpora, lexica) as well as heterogeneous data formats (plain text, XML/TEI, images). In fact, the TextGrid repository, TextGridRep, is designed as a digital virtual library over federated archives, where humanities projects are invited to participate. Sharing of data is enabled by means of a grid-based architecture. Specifically the middleware includes most of the treatment of authorization, search, and file management. TextGrid is entirely based on open source software including Eclipse<sup>2</sup>
 and Globus Toolkit.</div>
</front>
</TEI>
<inist><standard h6="B"><pA><fA01 i1="01" i2="1"><s0>0268-1145</s0>
</fA01>
<fA03 i2="1"><s0>Lit. linguist. comput.</s0>
</fA03>
<fA05><s2>24</s2>
</fA05>
<fA06><s2>3</s2>
</fA06>
<fA08 i1="01" i2="1" l="ENG"><s1>TEI documents in the grid</s1>
</fA08>
<fA09 i1="01" i2="1" l="ENG"><s1>Selected papers from Text Encoding Initiative</s1>
</fA09>
<fA11 i1="01" i2="1"><s1>ZIELINSKI (Andrea)</s1>
</fA11>
<fA11 i1="02" i2="1"><s1>PEMPE (Wolfgang)</s1>
</fA11>
<fA11 i1="03" i2="1"><s1>GIETZ (Peter)</s1>
</fA11>
<fA11 i1="04" i2="1"><s1>HAASE (Martin)</s1>
</fA11>
<fA11 i1="05" i2="1"><s1>FUNK (Stefan)</s1>
</fA11>
<fA11 i1="06" i2="1"><s1>SIMON (Christian)</s1>
</fA11>
<fA12 i1="01" i2="1"><s1>RAHTZ (Sebastian)</s1>
<s9>ed.</s9>
</fA12>
<fA12 i1="02" i2="1"><s1>SCHREIBMAN (Susan)</s1>
<s9>ed.</s9>
</fA12>
<fA14 i1="01"><s1>Institut für Deutsche Sprache Mannheim</s1>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
<sZ>6 aut.</sZ>
</fA14>
<fA14 i1="02"><s1>Saphor GmbH</s1>
<s3>DEU</s3>
<sZ>2 aut.</sZ>
</fA14>
<fA14 i1="03"><s1>DAASI International GmbH</s1>
<s3>DEU</s3>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</fA14>
<fA15 i1="01"><s1>Oxford University</s1>
<s3>GBR</s3>
<sZ>1 aut.</sZ>
</fA15>
<fA15 i1="02"><s1>Digital Humanities Observatory, Royal Irish Academy</s1>
<s3>IRL</s3>
<sZ>2 aut.</sZ>
</fA15>
<fA20><s1>267-279</s1>
</fA20>
<fA21><s1>2009</s1>
</fA21>
<fA23 i1="01"><s0>ENG</s0>
</fA23>
<fA43 i1="01"><s1>INIST</s1>
<s2>23967</s2>
<s5>354000171896560020</s5>
</fA43>
<fA44><s0>0000</s0>
<s1>© 2011 INIST-CNRS. All rights reserved.</s1>
</fA44>
<fA45><s0>3/4 p.</s0>
</fA45>
<fA47 i1="01" i2="1"><s0>11-0223726</s0>
</fA47>
<fA60><s1>P</s1>
<s2>C</s2>
</fA60>
<fA61><s0>A</s0>
</fA61>
<fA64 i1="01" i2="1"><s0>Literary and linguistic computing</s0>
</fA64>
<fA66 i1="01"><s0>GBR</s0>
</fA66>
<fA68 i1="01" i2="1" l="FRE"><s1>Documents de la TEI dans la grille</s1>
</fA68>
<fA69 i1="01" i2="1" l="FRE"><s1>Sélection d'articles de la TEI</s1>
</fA69>
<fA99><s0>27 notes</s0>
</fA99>
<fC01 i1="01" l="ENG"><s0>This article describes the life cycle of a TEI Document within TextGrid, an eHumanities platform for scholarly text processing, in which structured search is based on the TEl framework and metadata with restricted values. A workbench is provided that offers tools for handling TEI documents, TextGridLab, making it easier to annotate, process, search, and persistently store new digitized texts. The digitization and annotation of the Campe dictionary<sup>1</sup>
 serves as a first test bed. The overall framework of TextGrid is very generic and can handle different types of text (literary editions, linguistic corpora, lexica) as well as heterogeneous data formats (plain text, XML/TEI, images). In fact, the TextGrid repository, TextGridRep, is designed as a digital virtual library over federated archives, where humanities projects are invited to participate. Sharing of data is enabled by means of a grid-based architecture. Specifically the middleware includes most of the treatment of authorization, search, and file management. TextGrid is entirely based on open source software including Eclipse<sup>2</sup>
 and Globus Toolkit.</s0>
</fC01>
<fC02 i1="01" i2="L"><s0>52478</s0>
<s1>XV</s1>
</fC02>
<fC02 i1="02" i2="L"><s0>524</s0>
</fC02>
<fC03 i1="01" i2="L" l="FRE"><s0>TEI</s0>
<s2>NI</s2>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="L" l="ENG"><s0>TEI</s0>
<s2>NI</s2>
<s5>01</s5>
</fC03>
<fC03 i1="02" i2="L" l="FRE"><s0>Projet de recherche</s0>
<s2>NI</s2>
<s5>02</s5>
</fC03>
<fC03 i1="02" i2="L" l="ENG"><s0>Research project</s0>
<s2>NI</s2>
<s5>02</s5>
</fC03>
<fC03 i1="03" i2="L" l="FRE"><s0>Linguistique informatique</s0>
<s2>NI</s2>
<s5>03</s5>
</fC03>
<fC03 i1="03" i2="L" l="ENG"><s0>Computational linguistics</s0>
<s2>NI</s2>
<s5>03</s5>
</fC03>
<fC03 i1="04" i2="L" l="FRE"><s0>Numérisation</s0>
<s2>NI</s2>
<s5>04</s5>
</fC03>
<fC03 i1="04" i2="L" l="ENG"><s0>Digitization</s0>
<s2>NI</s2>
<s5>04</s5>
</fC03>
<fC03 i1="05" i2="L" l="FRE"><s0>Annotation de corpus</s0>
<s2>NI</s2>
<s5>05</s5>
</fC03>
<fC03 i1="05" i2="L" l="ENG"><s0>Corpus annotation</s0>
<s2>NI</s2>
<s5>05</s5>
</fC03>
<fC03 i1="06" i2="L" l="FRE"><s0>Dictionnaire</s0>
<s2>NI</s2>
<s5>06</s5>
</fC03>
<fC03 i1="06" i2="L" l="ENG"><s0>Dictionary</s0>
<s2>NI</s2>
<s5>06</s5>
</fC03>
<fC03 i1="07" i2="L" l="FRE"><s0>Traitement automatique des langues naturelles</s0>
<s2>NI</s2>
<s5>07</s5>
</fC03>
<fC03 i1="07" i2="L" l="ENG"><s0>Natural language processing</s0>
<s2>NI</s2>
<s5>07</s5>
</fC03>
<fC03 i1="08" i2="L" l="FRE"><s0>Bibliothèque électronique</s0>
<s4>INC</s4>
<s5>31</s5>
</fC03>
<fC03 i1="09" i2="L" l="FRE"><s0>Langage XML</s0>
<s4>INC</s4>
<s5>32</s5>
</fC03>
<fN21><s1>150</s1>
</fN21>
</pA>
<pR><fA30 i1="01" i2="1" l="ENG"><s1>TEI Consortium</s1>
<s2>6</s2>
<s3>Maryland USA</s3>
<s4>2007-11</s4>
</fA30>
</pR>
</standard>
</inist>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Ticri/explor/TeiVM2/Data/PascalFrancis/Curation

HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000035 | SxmlIndent | more

HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Curation/biblio.hfd -nk 000035 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Ticri
   |area=    TeiVM2
   |flux=    PascalFrancis
   |étape=   Curation
   |type=    RBID
   |clé=     Francis:11-0223726
   |texte=   TEI documents in the grid
}}

This area was generated with Dilib version V0.6.31.
Data generation: Mon Oct 30 21:59:18 2017. Site generation: Sun Feb 11 23:16:06 2024

	Serveur d'exploration sur la TEI
	Attention, ce site est en cours de développement ! Attention, site généré par des moyens informatiques à partir de corpus bruts. Les informations ne sont donc pas validées.

Serveur d'exploration sur la TEI

TEI documents in the grid

TEI documents in the grid

Source :

Descripteurs français

English descriptors

Abstract

Links toward previous steps (curation, corpus...)

Links to Exploration step

Le document en format XML

Pour manipuler ce document sous Unix (Dilib)

Pour mettre un lien sur cette page dans le réseau Wicri