Serveur d'exploration sur la TEI

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Data-processing modeling of dynamic structures of textual segments for the analysis of corpus

Identifieur interne : 000005 ( Hal/Curation ); précédent : 000004; suivant : 000006

Data-processing modeling of dynamic structures of textual segments for the analysis of corpus

Auteurs : François Daoust [France]

Source :

RBID : Hal:tel-00870410

Descripteurs français

English descriptors

Abstract

The objective of the thesis is to propose a data-processing model to represent, build and exploit textualstructures. The suggested model relies on a «type/token» form of text representation extended bysystems of lexical and contextual annotations. This model's establishment was carried out in the SATOsoftware -- of which the functionalities and the internal organization are presented. Reference to anumber of works give an account of the development and use of the software in various contexts.The formal assumption of the textual and discursive structures find an ally in the beaconing XMLlanguage and the proposals of the Text Encoding Initiative (TEI). Formally, the structures built on thetextual segments correspond to graphs. In a development driven textual analysis context, these graphsare multiple and partially deployed. Their resolution, within the fastening of the nodes to textualsegments or that of other graphs, is a dynamic process which can be sustained by various dataprocessingmechanisms. Examples drawn from textual linguistics are used to illustrate the principles ofstructural annotation. Prospective considerations for the data-processing establishment of amanagement system of the structural annotation are also exposed.

Url:

Links toward previous steps (curation, corpus...)


Links to Exploration step

Hal:tel-00870410

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Data-processing modeling of dynamic structures of textual segments for the analysis of corpus</title>
<title xml:lang="fr">Modélisation informatique de structures dynamiques de segments textuels pour l'analyse de corpus</title>
<author>
<name sortKey="Daoust, Francois" sort="Daoust, Francois" uniqKey="Daoust F" first="François" last="Daoust">François Daoust</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-202931" status="VALID">
<idno type="IdRef">168612100</idno>
<idno type="RNSR">201220083G</idno>
<orgName>Edition, Littératures, Langages, Informatique, Arts, Didactique, Discours - UFC</orgName>
<orgName type="acronym">ELLIADD</orgName>
<desc>
<address>
<addrLine>30 rue Mégevand, 25030 Besançon cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-fcomte.fr/pages/fr/menu1/recherche/la-recherche-a-l-ufc/ea-4661---elliadd-18229-17558.html</ref>
</desc>
<listRelation>
<relation name="EA4661" active="#struct-458810" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle name="EA4661" active="#struct-458810" type="direct">
<org type="institution" xml:id="struct-458810" status="VALID">
<idno type="IdRef">026403188</idno>
<idno type="ISNI">0000 0001 2188 3779 </idno>
<orgName>Université de Franche-Comté</orgName>
<orgName type="acronym">UFC</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-fcomte.fr</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city" wicri:auto="siege">Besançon</settlement>
<region type="region" nuts="2">Franche-Comté</region>
</placeName>
<orgName type="university">Université de Franche-Comté</orgName>
<orgName type="institution" wicri:auto="newGroup">Université de Bourgogne Franche-Comté</orgName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">HAL</idno>
<idno type="RBID">Hal:tel-00870410</idno>
<idno type="halId">tel-00870410</idno>
<idno type="halUri">https://tel.archives-ouvertes.fr/tel-00870410</idno>
<idno type="url">https://tel.archives-ouvertes.fr/tel-00870410</idno>
<date when="2011-01-10">2011-01-10</date>
<idno type="wicri:Area/Hal/Corpus">000005</idno>
<idno type="wicri:Area/Hal/Curation">000005</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Data-processing modeling of dynamic structures of textual segments for the analysis of corpus</title>
<title xml:lang="fr">Modélisation informatique de structures dynamiques de segments textuels pour l'analyse de corpus</title>
<author>
<name sortKey="Daoust, Francois" sort="Daoust, Francois" uniqKey="Daoust F" first="François" last="Daoust">François Daoust</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-202931" status="VALID">
<idno type="IdRef">168612100</idno>
<idno type="RNSR">201220083G</idno>
<orgName>Edition, Littératures, Langages, Informatique, Arts, Didactique, Discours - UFC</orgName>
<orgName type="acronym">ELLIADD</orgName>
<desc>
<address>
<addrLine>30 rue Mégevand, 25030 Besançon cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-fcomte.fr/pages/fr/menu1/recherche/la-recherche-a-l-ufc/ea-4661---elliadd-18229-17558.html</ref>
</desc>
<listRelation>
<relation name="EA4661" active="#struct-458810" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle name="EA4661" active="#struct-458810" type="direct">
<org type="institution" xml:id="struct-458810" status="VALID">
<idno type="IdRef">026403188</idno>
<idno type="ISNI">0000 0001 2188 3779 </idno>
<orgName>Université de Franche-Comté</orgName>
<orgName type="acronym">UFC</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-fcomte.fr</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city" wicri:auto="siege">Besançon</settlement>
<region type="region" nuts="2">Franche-Comté</region>
</placeName>
<orgName type="university">Université de Franche-Comté</orgName>
<orgName type="institution" wicri:auto="newGroup">Université de Bourgogne Franche-Comté</orgName>
</affiliation>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="mix" xml:lang="en">
<term>Computer Aided Text Analysis</term>
<term>Discourse analysis</term>
<term>SATO model</term>
<term>Structural annotation</term>
<term>Textometry</term>
</keywords>
<keywords scheme="mix" xml:lang="fr">
<term>Analyse de discours</term>
<term>Analyse de texte assistée par ordinateur</term>
<term>Annotation structurelle</term>
<term>Modèle SATO</term>
<term>TEI</term>
<term>Textométrie</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">The objective of the thesis is to propose a data-processing model to represent, build and exploit textualstructures. The suggested model relies on a «type/token» form of text representation extended bysystems of lexical and contextual annotations. This model's establishment was carried out in the SATOsoftware -- of which the functionalities and the internal organization are presented. Reference to anumber of works give an account of the development and use of the software in various contexts.The formal assumption of the textual and discursive structures find an ally in the beaconing XMLlanguage and the proposals of the Text Encoding Initiative (TEI). Formally, the structures built on thetextual segments correspond to graphs. In a development driven textual analysis context, these graphsare multiple and partially deployed. Their resolution, within the fastening of the nodes to textualsegments or that of other graphs, is a dynamic process which can be sustained by various dataprocessingmechanisms. Examples drawn from textual linguistics are used to illustrate the principles ofstructural annotation. Prospective considerations for the data-processing establishment of amanagement system of the structural annotation are also exposed.</div>
</front>
</TEI>
<hal api="V3">
<titleStmt>
<title xml:lang="en">Data-processing modeling of dynamic structures of textual segments for the analysis of corpus</title>
<title xml:lang="fr">Modélisation informatique de structures dynamiques de segments textuels pour l'analyse de corpus</title>
<author role="aut">
<persName>
<forename type="first">François</forename>
<forename type="middle">Daoust</forename>
<surname>Daoust</surname>
</persName>
<email></email>
<idno type="halauthor">1142857</idno>
<idno type="IdRef">http://www.idref.fr/156103869</idno>
<affiliation ref="#struct-202931"></affiliation>
</author>
<editor role="depositor">
<persName>
<forename>ABES</forename>
<surname>STAR</surname>
</persName>
<email>thelec@abes.fr</email>
</editor>
</titleStmt>
<editionStmt>
<edition n="v1" type="current">
<date type="whenSubmitted">2013-10-07 11:33:15</date>
<date type="whenModified">2015-05-21 18:13:38</date>
<date type="whenReleased">2013-10-08 15:23:46</date>
<date type="whenProduced">2011-01-10</date>
<date type="whenEndEmbargoed">2013-10-07</date>
<ref type="file" target="https://tel.archives-ouvertes.fr/tel-00870410/document">
<date notBefore="2013-10-07"></date>
</ref>
<ref type="file" subtype="author" n="1" target="https://tel.archives-ouvertes.fr/tel-00870410/file/these_A_DAOUST_Francois_2011.pdf">
<date notBefore="2013-10-07"></date>
</ref>
</edition>
<respStmt>
<resp>contributor</resp>
<name key="131274">
<persName>
<forename>ABES</forename>
<surname>STAR</surname>
</persName>
<email>thelec@abes.fr</email>
</name>
</respStmt>
</editionStmt>
<publicationStmt>
<distributor>CCSD</distributor>
<idno type="halId">tel-00870410</idno>
<idno type="halUri">https://tel.archives-ouvertes.fr/tel-00870410</idno>
<idno type="halBibtex">daoust:tel-00870410</idno>
<idno type="halRefHtml">Linguistique. Université de Franche-Comté, 2011. Français. <NNT : 2011BESA1013></idno>
<idno type="halRef">Linguistique. Université de Franche-Comté, 2011. Français. <NNT : 2011BESA1013></idno>
</publicationStmt>
<seriesStmt>
<idno type="stamp" n="STAR">STAR - Dépôt national des thèses électroniques</idno>
<idno type="stamp" n="UNIV-FCOMTE">Université de Franche-Comté</idno>
<idno type="stamp" n="SHS">Sciences de l'Homme et de la Société</idno>
<idno type="stamp" n="AO-LINGUISTIQUE">Archives ouvertes de la Linguistique</idno>
</seriesStmt>
<notesStmt></notesStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Data-processing modeling of dynamic structures of textual segments for the analysis of corpus</title>
<title xml:lang="fr">Modélisation informatique de structures dynamiques de segments textuels pour l'analyse de corpus</title>
<author role="aut">
<persName>
<forename type="first">François</forename>
<forename type="middle">Daoust</forename>
<surname>Daoust</surname>
</persName>
<idno type="halAuthorId">1142857</idno>
<idno type="IdRef">http://www.idref.fr/156103869</idno>
<affiliation ref="#struct-202931"></affiliation>
</author>
</analytic>
<monogr>
<idno type="nnt">2011BESA1013</idno>
<imprint>
<date type="dateDefended">2011-01-10</date>
</imprint>
<authority type="institution">Université de Franche-Comté</authority>
<authority type="school">Ecole doctorale Langages, Espaces, Temps, Sociétés (Besançon)</authority>
<authority type="supervisor">Jean-Marie Viprey</authority>
<authority type="supervisor">Yves Marcoux</authority>
<authority type="jury">Jules Duchastel [Président]</authority>
<authority type="jury">Lou Burnard [Rapporteur]</authority>
<authority type="jury">André Salem [Rapporteur]</authority>
</monogr>
</biblStruct>
</sourceDesc>
<profileDesc>
<langUsage>
<language ident="fr">French</language>
</langUsage>
<textClass>
<keywords scheme="author">
<term xml:lang="en">Textometry</term>
<term xml:lang="en">Structural annotation</term>
<term xml:lang="en">Discourse analysis</term>
<term xml:lang="en">SATO model</term>
<term xml:lang="en">Computer Aided Text Analysis</term>
<term xml:lang="fr">Textométrie</term>
<term xml:lang="fr">TEI</term>
<term xml:lang="fr">Annotation structurelle</term>
<term xml:lang="fr">Modèle SATO</term>
<term xml:lang="fr">Analyse de discours</term>
<term xml:lang="fr">Analyse de texte assistée par ordinateur</term>
</keywords>
<classCode scheme="halDomain" n="shs.langue">Humanities and Social Sciences/Linguistics</classCode>
<classCode scheme="halTypology" n="THESE">Theses</classCode>
</textClass>
<abstract xml:lang="en">The objective of the thesis is to propose a data-processing model to represent, build and exploit textualstructures. The suggested model relies on a «type/token» form of text representation extended bysystems of lexical and contextual annotations. This model's establishment was carried out in the SATOsoftware -- of which the functionalities and the internal organization are presented. Reference to anumber of works give an account of the development and use of the software in various contexts.The formal assumption of the textual and discursive structures find an ally in the beaconing XMLlanguage and the proposals of the Text Encoding Initiative (TEI). Formally, the structures built on thetextual segments correspond to graphs. In a development driven textual analysis context, these graphsare multiple and partially deployed. Their resolution, within the fastening of the nodes to textualsegments or that of other graphs, is a dynamic process which can be sustained by various dataprocessingmechanisms. Examples drawn from textual linguistics are used to illustrate the principles ofstructural annotation. Prospective considerations for the data-processing establishment of amanagement system of the structural annotation are also exposed.</abstract>
<abstract xml:lang="fr">L'objectif de la thèse est de proposer un modèle informatique pour représenter, construire et exploiterdes structures textuelles. Le modèle proposé s'appuie sur une représentation du texte sous la forme d'unplan lexique/occurrences augmenté de systèmes d'annotations lexicales et contextuelles, modèle dontune implantation a été réalisée dans le logiciel SATO dont on présente les fonctionnalités etl'organisation interne. La présentation d'un certain nombre de travaux rendent compte dudéveloppement et de l'utilisation du logiciel dans divers contextes.La prise en charge formelle des structures textuelles et discursives trouve un allié dans le langage debalisage XML et dans les propositions de la Text Encoding Initiative (TEI). Formellement, lesstructures construites sur les segments textuels correspondent à des graphes. Dans le contexte d'uneanalyse textuelle en élaboration, ces graphes sont multiples et partiellement déployés. La résolution deces graphes, au sens du rattachement des noeuds à des segments textuels ou à des noeuds d'autresgraphes, est un processus dynamique qui peut être soutenu par divers mécanismes informatiques. Desexemples tirés de la linguistique textuelle servent à illustrer les principes de l'annotation structurelle.Des considérations prospectives sur une implantation informatique d'un système de gestion del'annotation structurelle sont aussi exposées.</abstract>
</profileDesc>
</hal>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Ticri/explor/TeiVM2/Data/Hal/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000005 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Hal/Curation/biblio.hfd -nk 000005 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Ticri
   |area=    TeiVM2
   |flux=    Hal
   |étape=   Curation
   |type=    RBID
   |clé=     Hal:tel-00870410
   |texte=   Data-processing modeling of dynamic structures of textual segments for the analysis of corpus
}}

Wicri

This area was generated with Dilib version V0.6.31.
Data generation: Mon Oct 30 21:59:18 2017. Site generation: Sun Feb 11 23:16:06 2024