TEI exploration server

Warning: this site is under development!
Warning: this site was generated by automated means from raw corpora.
The information has therefore not been validated.

Testing Structural Properties in Textual Data: Beyond Document Grammars

Internal identifier: 000325 (Istex/Corpus); previous: 000324; next: 000326

Authors: Felix Sasaki; Jens Pönninghaus

Source: Literary and Linguistic Computing, vol. 18, no. 1 (2003-04), pp. 89-100

RBID : ISTEX:8EF65EB4296EEB019ADBA8EB4E0CE952A1CC6C95

Abstract

Schema languages concentrate on grammatical constraints on document structures, i.e. hierarchical relations between elements in a tree‐like structure. In this paper, we complement this concept with a methodology for defining and applying structural constraints from the perspective of a single element. These constraints can be used in addition to the existing constraints of a document grammar. There is no need to change the document grammar. Using a hierarchy of descriptions of such constraints allows for a classification of elements. These are important features for tasks such as visualizing, modelling, querying, and checking consistency in textual data. A document containing descriptions of such constraints we call a ‘context specification document’ (CSD). We describe the basic ideas of a CSD, its formal properties, the path language we are currently using, and related approaches. Then we show how to create and use a CSD. We give two example applications for a CSD. Modelling co‐referential relations between textual units with a CSD can help to maintain consistency in textual data and to explore the linguistic properties of co‐reference. In the area of textual, non‐hierarchical annotation, several annotations can be held in one document and interrelated by the CSD. In the future we want to explore the relation and interaction between the underlying path language of the CSD and document grammars.
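
The idea of testing constraints "from the perspective of a single element" can be illustrated with a minimal sketch. The abstract does not give the CSD syntax or its path language, so everything below is an assumption made for illustration: the class names, the `CONSTRAINTS` table, and the `classify` helper are hypothetical, and the limited XPath subset of Python's `xml.etree.ElementTree` stands in for the authors' path language. Each class names an element type plus relative paths that must select at least one node when evaluated from that element; the second class extends the first, which gives a small hierarchy of descriptions.

```python
import xml.etree.ElementTree as ET

# Hypothetical constraint descriptions (not the authors' CSD format):
# class name -> (element tag, relative paths that must each select
# at least one node when evaluated from an element with that tag).
CONSTRAINTS = {
    "titled-div": ("div", ["head"]),
    # Refines "titled-div": same constraint plus one more path.
    "titled-div-with-list": ("div", ["head", ".//item"]),
}


def classify(root):
    """Return {class name: [elements satisfying every path of that class]}."""
    result = {name: [] for name in CONSTRAINTS}
    for elem in root.iter():
        for name, (tag, paths) in CONSTRAINTS.items():
            if elem.tag == tag and all(elem.find(p) is not None for p in paths):
                result[name].append(elem)
    return result


# Sample document; its grammar (DTD or schema) is not touched by the checks.
SAMPLE = """<text>
  <div id="d1"><head>Intro</head><p>Text</p></div>
  <div id="d2"><head>Items</head><list><item>a</item></list></div>
  <div id="d3"><p>No heading</p></div>
</text>"""

root = ET.fromstring(SAMPLE)
classified_ids = {
    name: [elem.get("id") for elem in elems]
    for name, elems in classify(root).items()
}
for name, ids in classified_ids.items():
    print(name, ids)
```

The constraints live outside the document and its grammar, which mirrors the abstract's point that the document grammar does not need to change. An element such as `d3` simply falls into no class, which is the kind of result a consistency check could report.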

Url:
DOI: 10.1093/llc/18.1.89

Links to Exploration step

ISTEX:8EF65EB4296EEB019ADBA8EB4E0CE952A1CC6C95

The document in XML format

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Testing Structural Properties in Textual Data: Beyond Document Grammars</title>
<author>
<name sortKey="Sasaki, Felix" sort="Sasaki, Felix" uniqKey="Sasaki F" first="Felix" last="Sasaki">Felix Sasaki</name>
<affiliation>
<mods:affiliation>University of Bielefeld, Germany</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation></mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Po Nninghaus, Jens" sort="Po Nninghaus, Jens" uniqKey="Po Nninghaus J" first="Jens" last="Po Nninghaus">Jens Pönninghaus</name>
<affiliation>
<mods:affiliation>University of Bielefeld, Germany</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation></mods:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:8EF65EB4296EEB019ADBA8EB4E0CE952A1CC6C95</idno>
<date when="2003" year="2003">2003</date>
<idno type="doi">10.1093/llc/18.1.89</idno>
<idno type="url">https://api.istex.fr/document/8EF65EB4296EEB019ADBA8EB4E0CE952A1CC6C95/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000325</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Testing Structural Properties in Textual Data: Beyond Document Grammars</title>
<author>
<name sortKey="Sasaki, Felix" sort="Sasaki, Felix" uniqKey="Sasaki F" first="Felix" last="Sasaki">Felix Sasaki</name>
<affiliation>
<mods:affiliation>University of Bielefeld, Germany</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation></mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Po Nninghaus, Jens" sort="Po Nninghaus, Jens" uniqKey="Po Nninghaus J" first="Jens" last="Po Nninghaus">Jens Pönninghaus</name>
<affiliation>
<mods:affiliation>University of Bielefeld, Germany</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation></mods:affiliation>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j">Literary and Linguistic Computing</title>
<title level="j" type="abbrev">Lit Linguist Computing</title>
<idno type="ISSN">0268-1145</idno>
<idno type="eISSN">1477-4615</idno>
<imprint>
<publisher>Oxford University Press</publisher>
<date type="published" when="2003-04">2003-04</date>
<biblScope unit="volume">18</biblScope>
<biblScope unit="issue">1</biblScope>
<biblScope unit="page" from="89">89</biblScope>
<biblScope unit="page" to="100">100</biblScope>
</imprint>
<idno type="ISSN">0268-1145</idno>
</series>
<idno type="istex">8EF65EB4296EEB019ADBA8EB4E0CE952A1CC6C95</idno>
<idno type="DOI">10.1093/llc/18.1.89</idno>
<idno type="local">180089</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0268-1145</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Schema languages concentrate on grammatical constraints on document structures, i.e. hierarchical relations between elements in a tree‐like structure. In this paper, we complement this concept with a methodology for defining and applying structural constraints from the perspective of a single element. These constraints can be used in addition to the existing constraints of a document grammar. There is no need to change the document grammar. Using a hierarchy of descriptions of such constraints allows for a classification of elements. These are important features for tasks such as visualizing, modelling, querying, and checking consistency in textual data. A document containing descriptions of such constraints we call a ‘context specification document’ (CSD). We describe the basic ideas of a CSD, its formal properties, the path language we are currently using, and related approaches. Then we show how to create and use a CSD. We give two example applications for a CSD. Modelling co‐referential relations between textual units with a CSD can help to maintain consistency in textual data and to explore the linguistic properties of co‐reference. In the area of textual, non‐hierarchical annotation, several annotations can be held in one document and interrelated by the CSD. In the future we want to explore the relation and interaction between the underlying path language of the CSD and document grammars.</div>
</front>
</TEI>
<istex>
<corpusName>oup</corpusName>
<author>
<json:item>
<name>Felix Sasaki</name>
<affiliations>
<json:string>University of Bielefeld, Germany</json:string>
<json:null></json:null>
</affiliations>
</json:item>
<json:item>
<name>Jens Pönninghaus</name>
<affiliations>
<json:string>University of Bielefeld, Germany</json:string>
<json:null></json:null>
</affiliations>
</json:item>
</author>
<language>
<json:string>eng</json:string>
</language>
<originalGenre>
<json:string>research-article</json:string>
</originalGenre>
<abstract>Schema languages concentrate on grammatical constraints on document structures, i.e. hierarchical relations between elements in a tree‐like structure. In this paper, we complement this concept with a methodology for defining and applying structural constraints from the perspective of a single element. These constraints can be used in addition to the existing constraints of a document grammar. There is no need to change the document grammar. Using a hierarchy of descriptions of such constraints allows for a classification of elements. These are important features for tasks such as visualizing, modelling, querying, and checking consistency in textual data. A document containing descriptions of such constraints we call a ‘context specification document’ (CSD). We describe the basic ideas of a CSD, its formal properties, the path language we are currently using, and related approaches. Then we show how to create and use a CSD. We give two example applications for a CSD. Modelling co‐referential relations between textual units with a CSD can help to maintain consistency in textual data and to explore the linguistic properties of co‐reference. In the area of textual, non‐hierarchical annotation, several annotations can be held in one document and interrelated by the CSD. In the future we want to explore the relation and interaction between the underlying path language of the CSD and document grammars.</abstract>
<qualityIndicators>
<score>6.364</score>
<pdfVersion>1.3</pdfVersion>
<pdfPageSize>538.307 x 697.433 pts</pdfPageSize>
<refBibsNative>false</refBibsNative>
<keywordCount>0</keywordCount>
<abstractCharCount>1418</abstractCharCount>
<pdfWordCount>3772</pdfWordCount>
<pdfCharCount>22461</pdfCharCount>
<pdfPageCount>12</pdfPageCount>
<abstractWordCount>216</abstractWordCount>
</qualityIndicators>
<title>Testing Structural Properties in Textual Data: Beyond Document Grammars</title>
<genre>
<json:string>research-article</json:string>
</genre>
<host>
<volume>18</volume>
<publisherId>
<json:string>litlin</json:string>
</publisherId>
<pages>
<last>100</last>
<first>89</first>
</pages>
<issn>
<json:string>0268-1145</json:string>
</issn>
<issue>1</issue>
<genre>
<json:string>journal</json:string>
</genre>
<language>
<json:string>unknown</json:string>
</language>
<eissn>
<json:string>1477-4615</json:string>
</eissn>
<title>Literary and Linguistic Computing</title>
</host>
<categories>
<wos>
<json:string>LINGUISTICS</json:string>
<json:string>LITERATURE</json:string>
</wos>
</categories>
<publicationDate>2003</publicationDate>
<copyrightDate>2003</copyrightDate>
<doi>
<json:string>10.1093/llc/18.1.89</json:string>
</doi>
<id>8EF65EB4296EEB019ADBA8EB4E0CE952A1CC6C95</id>
<score>0.16590782</score>
<fulltext>
<json:item>
<original>true</original>
<mimetype>application/pdf</mimetype>
<extension>pdf</extension>
<uri>https://api.istex.fr/document/8EF65EB4296EEB019ADBA8EB4E0CE952A1CC6C95/fulltext/pdf</uri>
</json:item>
<json:item>
<original>false</original>
<mimetype>application/zip</mimetype>
<extension>zip</extension>
<uri>https://api.istex.fr/document/8EF65EB4296EEB019ADBA8EB4E0CE952A1CC6C95/fulltext/zip</uri>
</json:item>
<istex:fulltextTEI uri="https://api.istex.fr/document/8EF65EB4296EEB019ADBA8EB4E0CE952A1CC6C95/fulltext/tei">
<teiHeader>
<fileDesc>
<titleStmt>
<title level="a" type="main" xml:lang="en">Testing Structural Properties in Textual Data: Beyond Document Grammars</title>
<respStmt xml:id="ISTEX-API" resp="Références bibliographiques récupérées via GROBID" name="ISTEX-API (INIST-CNRS)"></respStmt>
<respStmt>
<resp>Références bibliographiques récupérées via GROBID</resp>
<name resp="ISTEX-API">ISTEX-API (INIST-CNRS)</name>
</respStmt>
</titleStmt>
<publicationStmt>
<authority>ISTEX</authority>
<publisher>Oxford University Press</publisher>
<availability>
<p>OUP</p>
</availability>
<date>2003</date>
</publicationStmt>
<sourceDesc>
<biblStruct type="inbook">
<analytic>
<title level="a" type="main" xml:lang="en">Testing Structural Properties in Textual Data: Beyond Document Grammars</title>
<author>
<persName>
<forename type="first">Felix</forename>
<surname>Sasaki</surname>
</persName>
<affiliation>University of Bielefeld, Germany</affiliation>
<affiliation></affiliation>
</author>
<author>
<persName>
<forename type="first">Jens</forename>
<surname>Pönninghaus</surname>
</persName>
<affiliation>University of Bielefeld, Germany</affiliation>
<affiliation></affiliation>
</author>
</analytic>
<monogr>
<title level="j">Literary and Linguistic Computing</title>
<title level="j" type="abbrev">Lit Linguist Computing</title>
<idno type="pISSN">0268-1145</idno>
<idno type="eISSN">1477-4615</idno>
<imprint>
<publisher>Oxford University Press</publisher>
<date type="published" when="2003-04"></date>
<biblScope unit="volume">18</biblScope>
<biblScope unit="issue">1</biblScope>
<biblScope unit="page" from="89">89</biblScope>
<biblScope unit="page" to="100">100</biblScope>
</imprint>
</monogr>
<idno type="istex">8EF65EB4296EEB019ADBA8EB4E0CE952A1CC6C95</idno>
<idno type="DOI">10.1093/llc/18.1.89</idno>
<idno type="local">180089</idno>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<creation>
<date>2003</date>
</creation>
<langUsage>
<language ident="en">en</language>
</langUsage>
<abstract xml:lang="en">
<p>Schema languages concentrate on grammatical constraints on document structures, i.e. hierarchical relations between elements in a tree‐like structure. In this paper, we complement this concept with a methodology for defining and applying structural constraints from the perspective of a single element. These constraints can be used in addition to the existing constraints of a document grammar. There is no need to change the document grammar. Using a hierarchy of descriptions of such constraints allows for a classification of elements. These are important features for tasks such as visualizing, modelling, querying, and checking consistency in textual data. A document containing descriptions of such constraints we call a ‘context specification document’ (CSD). We describe the basic ideas of a CSD, its formal properties, the path language we are currently using, and related approaches. Then we show how to create and use a CSD. We give two example applications for a CSD. Modelling co‐referential relations between textual units with a CSD can help to maintain consistency in textual data and to explore the linguistic properties of co‐reference. In the area of textual, non‐hierarchical annotation, several annotations can be held in one document and interrelated by the CSD. In the future we want to explore the relation and interaction between the underlying path language of the CSD and document grammars.</p>
</abstract>
</profileDesc>
<revisionDesc>
<change when="2003-04">Published</change>
<change xml:id="refBibs-istex" who="#ISTEX-API" when="2016-03-15">References added</change>
<change xml:id="refBibs-istex" who="#ISTEX-API" when="2016-03-21">References added</change>
<change xml:id="refBibs-istex" who="#ISTEX-API" when="2016-07-27">References added</change>
</revisionDesc>
</teiHeader>
</istex:fulltextTEI>
<json:item>
<original>false</original>
<mimetype>text/plain</mimetype>
<extension>txt</extension>
<uri>https://api.istex.fr/document/8EF65EB4296EEB019ADBA8EB4E0CE952A1CC6C95/fulltext/txt</uri>
</json:item>
</fulltext>
<metadata>
<istex:metadataXml wicri:clean="corpus oup" wicri:toSee="no header">
<istex:xmlDeclaration>version="1.0" encoding="US-ASCII"</istex:xmlDeclaration>
<istex:docType PUBLIC="-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" URI="journalpublishing.dtd" name="istex:docType"></istex:docType>
<istex:document>
<article xml:lang="en" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">litlin</journal-id>
<journal-id journal-id-type="hwp">litlin</journal-id>
<journal-title>Literary and Linguistic Computing</journal-title>
<abbrev-journal-title abbrev-type="publisher">Lit Linguist Computing</abbrev-journal-title>
<issn pub-type="ppub">0268-1145</issn>
<issn pub-type="epub">1477-4615</issn>
<publisher>
<publisher-name>Oxford University Press</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="other">180089</article-id>
<article-id pub-id-type="doi">10.1093/llc/18.1.89</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Testing Structural Properties in Textual Data: Beyond Document Grammars</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Sasaki</surname>
<given-names>Felix</given-names>
</name>
<xref rid="AFF1">1</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Pönninghaus</surname>
<given-names>Jens</given-names>
</name>
<xref rid="AFF1">1</xref>
</contrib>
<aff>
<target target-type="aff" id="AFF1"></target>
<label>1</label>
University of Bielefeld, Germany</aff>
</contrib-group>
<pub-date pub-type="ppub">
<month>04</month>
<year>2003</year>
</pub-date>
<volume>18</volume>
<issue>1</issue>
<fpage>89</fpage>
<lpage>100</lpage>
<permissions>
<copyright-statement>Copyright Association for Literary &amp; Linguistic Computing 2003</copyright-statement>
<copyright-year>2003</copyright-year>
</permissions>
<abstract xml:lang="en">
<p>Schema languages concentrate on grammatical constraints on document structures, i.e. hierarchical relations between elements in a tree‐like structure. In this paper, we complement this concept with a methodology for defining and applying structural constraints from the perspective of a single element. These constraints can be used in addition to the existing constraints of a document grammar. There is no need to change the document grammar. Using a hierarchy of descriptions of such constraints allows for a classification of elements. These are important features for tasks such as visualizing, modelling, querying, and checking consistency in textual data. A document containing descriptions of such constraints we call a ‘context specification document’ (CSD). We describe the basic ideas of a CSD, its formal properties, the path language we are currently using, and related approaches. Then we show how to create and use a CSD. We give two example applications for a CSD. Modelling co‐referential relations between textual units with a CSD can help to maintain consistency in textual data and to explore the linguistic properties of co‐reference. In the area of textual, non‐hierarchical annotation, several annotations can be held in one document and interrelated by the CSD. In the future we want to explore the relation and interaction between the underlying path language of the CSD and document grammars.</p>
</abstract>
<custom-meta-wrap>
<custom-meta>
<meta-name>hwp-legacy-fpage</meta-name>
<meta-value>89</meta-value>
</custom-meta>
<custom-meta>
<meta-name>hwp-legacy-dochead</meta-name>
<meta-value>Article</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
</article>
</istex:document>
</istex:metadataXml>
<mods version="3.6">
<titleInfo lang="en">
<title>Testing Structural Properties in Textual Data: Beyond Document Grammars</title>
</titleInfo>
<titleInfo type="alternative" lang="en" contentType="CDATA">
<title>Testing Structural Properties in Textual Data: Beyond Document Grammars</title>
</titleInfo>
<name type="personal">
<namePart type="given">Felix</namePart>
<namePart type="family">Sasaki</namePart>
<affiliation>University of Bielefeld, Germany</affiliation>
<affiliation></affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Jens</namePart>
<namePart type="family">Pönninghaus</namePart>
<affiliation>University of Bielefeld, Germany</affiliation>
<affiliation></affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<typeOfResource>text</typeOfResource>
<genre type="research-article" displayLabel="research-article"></genre>
<originInfo>
<publisher>Oxford University Press</publisher>
<dateIssued encoding="w3cdtf">2003-04</dateIssued>
<copyrightDate encoding="w3cdtf">2003</copyrightDate>
</originInfo>
<language>
<languageTerm type="code" authority="iso639-2b">eng</languageTerm>
<languageTerm type="code" authority="rfc3066">en</languageTerm>
</language>
<physicalDescription>
<internetMediaType>text/html</internetMediaType>
</physicalDescription>
<abstract lang="en">Schema languages concentrate on grammatical constraints on document structures, i.e. hierarchical relations between elements in a tree‐like structure. In this paper, we complement this concept with a methodology for defining and applying structural constraints from the perspective of a single element. These constraints can be used in addition to the existing constraints of a document grammar. There is no need to change the document grammar. Using a hierarchy of descriptions of such constraints allows for a classification of elements. These are important features for tasks such as visualizing, modelling, querying, and checking consistency in textual data. A document containing descriptions of such constraints we call a ‘context specification document’ (CSD). We describe the basic ideas of a CSD, its formal properties, the path language we are currently using, and related approaches. Then we show how to create and use a CSD. We give two example applications for a CSD. Modelling co‐referential relations between textual units with a CSD can help to maintain consistency in textual data and to explore the linguistic properties of co‐reference. In the area of textual, non‐hierarchical annotation, several annotations can be held in one document and interrelated by the CSD. In the future we want to explore the relation and interaction between the underlying path language of the CSD and document grammars.</abstract>
<relatedItem type="host">
<titleInfo>
<title>Literary and Linguistic Computing</title>
</titleInfo>
<titleInfo type="abbreviated">
<title>Lit Linguist Computing</title>
</titleInfo>
<genre type="journal">journal</genre>
<identifier type="ISSN">0268-1145</identifier>
<identifier type="eISSN">1477-4615</identifier>
<identifier type="PublisherID">litlin</identifier>
<identifier type="PublisherID-hwp">litlin</identifier>
<part>
<date>2003</date>
<detail type="volume">
<caption>vol.</caption>
<number>18</number>
</detail>
<detail type="issue">
<caption>no.</caption>
<number>1</number>
</detail>
<extent unit="pages">
<start>89</start>
<end>100</end>
</extent>
</part>
</relatedItem>
<identifier type="istex">8EF65EB4296EEB019ADBA8EB4E0CE952A1CC6C95</identifier>
<identifier type="DOI">10.1093/llc/18.1.89</identifier>
<identifier type="local">180089</identifier>
<accessCondition type="use and reproduction" contentType="copyright">Copyright Association for Literary &amp; Linguistic Computing 2003</accessCondition>
<recordInfo>
<recordContentSource>OUP</recordContentSource>
</recordInfo>
</mods>
</metadata>
<enrichments>
<istex:catWosTEI uri="https://api.istex.fr/document/8EF65EB4296EEB019ADBA8EB4E0CE952A1CC6C95/enrichments/catWos">
<teiHeader>
<profileDesc>
<textClass>
<classCode scheme="WOS">LINGUISTICS</classCode>
<classCode scheme="WOS">LITERATURE</classCode>
</textClass>
</profileDesc>
</teiHeader>
</istex:catWosTEI>
<json:item>
<type>refBibs</type>
<uri>https://api.istex.fr/document/8EF65EB4296EEB019ADBA8EB4E0CE952A1CC6C95/enrichments/refBibs</uri>
</json:item>
</enrichments>
<serie></serie>
</istex>
</record>
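
As a rough illustration of how the bibliographic fields in the record above could be read programmatically, the sketch below parses a trimmed excerpt of the `<mods>` element with Python's standard `xml.etree.ElementTree`. It works on an excerpt because the record as displayed uses prefixes such as `wicri:`, `mods:`, `json:` and `istex:` without visible namespace declarations, so a strict XML parser would probably reject the full dump. The `summarize` function and the `MODS_EXCERPT` name are hypothetical helpers, not part of Dilib or the ISTEX API.

```python
import xml.etree.ElementTree as ET

# Trimmed excerpt of the <mods> element shown in the record
# (affiliations, roles and host information are left out).
MODS_EXCERPT = """<mods version="3.6">
<titleInfo lang="en">
<title>Testing Structural Properties in Textual Data: Beyond Document Grammars</title>
</titleInfo>
<name type="personal">
<namePart type="given">Felix</namePart>
<namePart type="family">Sasaki</namePart>
</name>
<name type="personal">
<namePart type="given">Jens</namePart>
<namePart type="family">Pönninghaus</namePart>
</name>
<identifier type="DOI">10.1093/llc/18.1.89</identifier>
</mods>"""


def summarize(mods_xml):
    """Extract title, author names and DOI from a namespace-free MODS fragment."""
    mods = ET.fromstring(mods_xml)
    authors = []
    for name in mods.findall("name[@type='personal']"):
        given = name.findtext("namePart[@type='given']")
        family = name.findtext("namePart[@type='family']")
        authors.append(f"{given} {family}")
    return {
        "title": mods.findtext("titleInfo/title"),
        "authors": authors,
        "doi": mods.findtext("identifier[@type='DOI']"),
    }


summary = summarize(MODS_EXCERPT)
print(summary)
```

Reading the names from the MODS block rather than from the `sortKey` attributes of the TEI header avoids the "Po Nninghaus" form that appears there.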

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Ticri/explor/TeiVM2/Data/Istex/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000325 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Istex/Corpus/biblio.hfd -nk 000325 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Wicri/Ticri
   |area=    TeiVM2
   |flux=    Istex
   |étape=   Corpus
   |type=    RBID
   |clé=     ISTEX:8EF65EB4296EEB019ADBA8EB4E0CE952A1CC6C95
   |texte=   Testing Structural Properties in Textual Data: Beyond Document Grammars
}}
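
If many such links are needed, the template call above could be generated rather than typed by hand. The following is a small sketch under stated assumptions: `explor_link` and its Python parameter names are hypothetical, only the template name and its parameter keys (`wiki`, `area`, `flux`, `étape`, `type`, `clé`, `texte`) come from this page, and the column alignment is kept purely for readability.

```python
def explor_link(wiki, area, flux, etape, type_, cle, texte):
    """Build the text of an {{Explor lien}} template call (hypothetical helper)."""
    fields = [
        ("wiki", wiki),
        ("area", area),
        ("flux", flux),
        ("étape", etape),
        ("type", type_),
        ("clé", cle),
        ("texte", texte),
    ]
    lines = ["{{Explor lien"]
    for key, value in fields:
        # Pad "   |key=" to 13 characters so the values line up in one column.
        lines.append(f"   |{key}=".ljust(13) + value)
    lines.append("}}")
    return "\n".join(lines)


link = explor_link(
    wiki="Wicri/Ticri",
    area="TeiVM2",
    flux="Istex",
    etape="Corpus",
    type_="RBID",
    cle="ISTEX:8EF65EB4296EEB019ADBA8EB4E0CE952A1CC6C95",
    texte="Testing Structural Properties in Textual Data: Beyond Document Grammars",
)
print(link)
```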

Wicri

This area was generated with Dilib version V0.6.31.
Data generation: Mon Oct 30 21:59:18 2017. Site generation: Sun Feb 11 23:16:06 2024