Serveur d'exploration sur la TEI

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Sheffield Corpus of Chinese for Diachronic Linguistic Study1

Identifieur interne : 000394 ( Istex/Corpus ); précédent : 000393; suivant : 000395

Sheffield Corpus of Chinese for Diachronic Linguistic Study1

Auteurs : Xiaoling Hu ; Nigel Williamson ; Jamie Mclaughlin

Source :

RBID : ISTEX:256A35EE1B8079C5A29796E783FF3DD586ACD254

Abstract

The paper presents the outcome of the pilot phase of a major project which aims to build a digital resource for the study of historical Chinese texts with a view to facilitating linguistic analysis of the language, particularly from a diachronic point of view. The approach to general problems for a diachronic corpus is discussed. Details of the tag set and the tagging system devised are given. The development of a sophisticated automatic mark-up scheme for Chinese texts from widely different time periods and genres is indicated.

Url:
DOI: 10.1093/llc/fqi034

Links to Exploration step

ISTEX:256A35EE1B8079C5A29796E783FF3DD586ACD254

Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Sheffield Corpus of Chinese for Diachronic Linguistic Study1</title>
<author>
<name sortKey="Hu, Xiaoling" sort="Hu, Xiaoling" uniqKey="Hu X" first="Xiaoling" last="Hu">Xiaoling Hu</name>
<affiliation>
<mods:affiliation>University of Sheffield, UK</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Williamson, Nigel" sort="Williamson, Nigel" uniqKey="Williamson N" first="Nigel" last="Williamson">Nigel Williamson</name>
<affiliation>
<mods:affiliation>University of Sheffield, UK</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Mclaughlin, Jamie" sort="Mclaughlin, Jamie" uniqKey="Mclaughlin J" first="Jamie" last="Mclaughlin">Jamie Mclaughlin</name>
<affiliation>
<mods:affiliation>University of Sheffield, UK</mods:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:256A35EE1B8079C5A29796E783FF3DD586ACD254</idno>
<date when="2005" year="2005">2005</date>
<idno type="doi">10.1093/llc/fqi034</idno>
<idno type="url">https://api.istex.fr/document/256A35EE1B8079C5A29796E783FF3DD586ACD254/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000394</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Sheffield Corpus of Chinese for Diachronic Linguistic Study1</title>
<author>
<name sortKey="Hu, Xiaoling" sort="Hu, Xiaoling" uniqKey="Hu X" first="Xiaoling" last="Hu">Xiaoling Hu</name>
<affiliation>
<mods:affiliation>University of Sheffield, UK</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Williamson, Nigel" sort="Williamson, Nigel" uniqKey="Williamson N" first="Nigel" last="Williamson">Nigel Williamson</name>
<affiliation>
<mods:affiliation>University of Sheffield, UK</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Mclaughlin, Jamie" sort="Mclaughlin, Jamie" uniqKey="Mclaughlin J" first="Jamie" last="Mclaughlin">Jamie Mclaughlin</name>
<affiliation>
<mods:affiliation>University of Sheffield, UK</mods:affiliation>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j">Literary and Linguistic Computing</title>
<title level="j" type="abbrev">Lit Linguist Computing</title>
<idno type="ISSN">0268-1145</idno>
<idno type="eISSN">1477-4615</idno>
<imprint>
<publisher>Oxford University Press</publisher>
<date type="published" when="2005-09">2005-09</date>
<biblScope unit="volume">20</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="281">281</biblScope>
<biblScope unit="page" to="293">293</biblScope>
</imprint>
<idno type="ISSN">0268-1145</idno>
</series>
<idno type="istex">256A35EE1B8079C5A29796E783FF3DD586ACD254</idno>
<idno type="DOI">10.1093/llc/fqi034</idno>
<idno type="local">fqi034</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0268-1145</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">The paper presents the outcome of the pilot phase of a major project which aims to build a digital resource for the study of historical Chinese texts with a view to facilitating linguistic analysis of the language, particularly from a diachronic point of view. The approach to general problems for a diachronic corpus is discussed. Details of the tag set and the tagging system devised are given. The development of a sophisticated automatic mark-up scheme for Chinese texts from widely different time periods and genres is indicated.</div>
</front>
</TEI>
<istex>
<corpusName>oup</corpusName>
<author>
<json:item>
<name>Xiaoling Hu</name>
<affiliations>
<json:string>University of Sheffield, UK</json:string>
</affiliations>
</json:item>
<json:item>
<name>Nigel Williamson</name>
<affiliations>
<json:string>University of Sheffield, UK</json:string>
</affiliations>
</json:item>
<json:item>
<name>Jamie McLaughlin</name>
<affiliations>
<json:string>University of Sheffield, UK</json:string>
</affiliations>
</json:item>
</author>
<language>
<json:string>eng</json:string>
</language>
<originalGenre>
<json:string>research-article</json:string>
</originalGenre>
<abstract>The paper presents the outcome of the pilot phase of a major project which aims to build a digital resource for the study of historical Chinese texts with a view to facilitating linguistic analysis of the language, particularly from a diachronic point of view. The approach to general problems for a diachronic corpus is discussed. Details of the tag set and the tagging system devised are given. The development of a sophisticated automatic mark-up scheme for Chinese texts from widely different time periods and genres is indicated.</abstract>
<qualityIndicators>
<score>6.383</score>
<pdfVersion>1.4</pdfVersion>
<pdfPageSize>539 x 694 pts</pdfPageSize>
<refBibsNative>false</refBibsNative>
<keywordCount>0</keywordCount>
<abstractCharCount>534</abstractCharCount>
<pdfWordCount>4839</pdfWordCount>
<pdfCharCount>31136</pdfCharCount>
<pdfPageCount>13</pdfPageCount>
<abstractWordCount>87</abstractWordCount>
</qualityIndicators>
<title>Sheffield Corpus of Chinese for Diachronic Linguistic Study1</title>
<genre>
<json:string>research-article</json:string>
</genre>
<host>
<volume>20</volume>
<publisherId>
<json:string>litlin</json:string>
</publisherId>
<pages>
<last>293</last>
<first>281</first>
</pages>
<issn>
<json:string>0268-1145</json:string>
</issn>
<issue>3</issue>
<genre>
<json:string>journal</json:string>
</genre>
<language>
<json:string>unknown</json:string>
</language>
<eissn>
<json:string>1477-4615</json:string>
</eissn>
<title>Literary and Linguistic Computing</title>
</host>
<categories>
<wos>
<json:string>LINGUISTICS</json:string>
<json:string>LITERATURE</json:string>
</wos>
</categories>
<publicationDate>2005</publicationDate>
<copyrightDate>2005</copyrightDate>
<doi>
<json:string>10.1093/llc/fqi034</json:string>
</doi>
<id>256A35EE1B8079C5A29796E783FF3DD586ACD254</id>
<score>0.14212005</score>
<fulltext>
<json:item>
<original>true</original>
<mimetype>application/pdf</mimetype>
<extension>pdf</extension>
<uri>https://api.istex.fr/document/256A35EE1B8079C5A29796E783FF3DD586ACD254/fulltext/pdf</uri>
</json:item>
<json:item>
<original>false</original>
<mimetype>application/zip</mimetype>
<extension>zip</extension>
<uri>https://api.istex.fr/document/256A35EE1B8079C5A29796E783FF3DD586ACD254/fulltext/zip</uri>
</json:item>
<istex:fulltextTEI uri="https://api.istex.fr/document/256A35EE1B8079C5A29796E783FF3DD586ACD254/fulltext/tei">
<teiHeader>
<fileDesc>
<titleStmt>
<title level="a" type="main" xml:lang="en">Sheffield Corpus of Chinese for Diachronic Linguistic Study1</title>
<respStmt xml:id="ISTEX-API" resp="Références bibliographiques récupérées via GROBID" name="ISTEX-API (INIST-CNRS)"></respStmt>
<respStmt xml:id="ISTEX-API" resp="Références bibliographiques récupérées via GROBID" name="ISTEX-API (INIST-CNRS)"></respStmt>
<respStmt>
<resp>Références bibliographiques récupérées via GROBID</resp>
<name resp="ISTEX-API">ISTEX-API (INIST-CNRS)</name>
</respStmt>
</titleStmt>
<publicationStmt>
<authority>ISTEX</authority>
<publisher>Oxford University Press</publisher>
<availability>
<p>OUP</p>
</availability>
<date>2005</date>
</publicationStmt>
<notesStmt>
<note>Dr Xiaoling Hu, School of East Asian Studies, University of Sheffield, Floor 5, The Arts Tower, Western Bank, Sheffield S10 2TN. E-mail: x.l.hu@sheffield.ac.uk</note>
</notesStmt>
<sourceDesc>
<biblStruct type="inbook">
<analytic>
<title level="a" type="main" xml:lang="en">Sheffield Corpus of Chinese for Diachronic Linguistic Study1</title>
<author>
<persName>
<forename type="first">Xiaoling</forename>
<surname>Hu</surname>
</persName>
<affiliation>University of Sheffield, UK</affiliation>
</author>
<author>
<persName>
<forename type="first">Nigel</forename>
<surname>Williamson</surname>
</persName>
<affiliation>University of Sheffield, UK</affiliation>
</author>
<author>
<persName>
<forename type="first">Jamie</forename>
<surname>McLaughlin</surname>
</persName>
<affiliation>University of Sheffield, UK</affiliation>
</author>
</analytic>
<monogr>
<title level="j">Literary and Linguistic Computing</title>
<title level="j" type="abbrev">Lit Linguist Computing</title>
<idno type="pISSN">0268-1145</idno>
<idno type="eISSN">1477-4615</idno>
<imprint>
<publisher>Oxford University Press</publisher>
<date type="published" when="2005-09"></date>
<biblScope unit="volume">20</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="281">281</biblScope>
<biblScope unit="page" to="293">293</biblScope>
</imprint>
</monogr>
<idno type="istex">256A35EE1B8079C5A29796E783FF3DD586ACD254</idno>
<idno type="DOI">10.1093/llc/fqi034</idno>
<idno type="local">fqi034</idno>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<creation>
<date>2005</date>
</creation>
<langUsage>
<language ident="en">en</language>
</langUsage>
<abstract xml:lang="en">
<p>The paper presents the outcome of the pilot phase of a major project which aims to build a digital resource for the study of historical Chinese texts with a view to facilitating linguistic analysis of the language, particularly from a diachronic point of view. The approach to general problems for a diachronic corpus is discussed. Details of the tag set and the tagging system devised are given. The development of a sophisticated automatic mark-up scheme for Chinese texts from widely different time periods and genres is indicated.</p>
</abstract>
</profileDesc>
<revisionDesc>
<change when="2005-09">Published</change>
<change xml:id="refBibs-istex" who="#ISTEX-API" when="2016-3-14">References added</change>
<change xml:id="refBibs-istex" who="#ISTEX-API" when="2016-3-21">References added</change>
<change xml:id="refBibs-istex" who="#ISTEX-API" when="2016-07-27">References added</change>
</revisionDesc>
</teiHeader>
</istex:fulltextTEI>
<json:item>
<original>false</original>
<mimetype>text/plain</mimetype>
<extension>txt</extension>
<uri>https://api.istex.fr/document/256A35EE1B8079C5A29796E783FF3DD586ACD254/fulltext/txt</uri>
</json:item>
</fulltext>
<metadata>
<istex:metadataXml wicri:clean="corpus oup" wicri:toSee="no header">
<istex:xmlDeclaration>version="1.0" encoding="US-ASCII"</istex:xmlDeclaration>
<istex:docType PUBLIC="-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" URI="journalpublishing.dtd" name="istex:docType"></istex:docType>
<istex:document>
<article xml:lang="en" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">litlin</journal-id>
<journal-id journal-id-type="hwp">litlin</journal-id>
<journal-title>Literary and Linguistic Computing</journal-title>
<abbrev-journal-title abbrev-type="publisher">Lit Linguist Computing</abbrev-journal-title>
<issn pub-type="ppub">0268-1145</issn>
<issn pub-type="epub">1477-4615</issn>
<publisher>
<publisher-name>Oxford University Press</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="other">fqi034</article-id>
<article-id pub-id-type="doi">10.1093/llc/fqi034</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Articles</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Sheffield Corpus of Chinese for Diachronic Linguistic Study
<sup>1</sup>
<xref rid="FNT1"></xref>
</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Hu</surname>
<given-names>Xiaoling</given-names>
</name>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Williamson</surname>
<given-names>Nigel</given-names>
</name>
</contrib>
<contrib contrib-type="author">
<name>
<surname>McLaughlin</surname>
<given-names>Jamie</given-names>
</name>
</contrib>
<aff>University of Sheffield, UK</aff>
</contrib-group>
<author-notes>
<corresp id="COR1">Dr Xiaoling Hu, School of East Asian Studies, University of Sheffield, Floor 5, The Arts Tower, Western Bank, Sheffield S10 2TN.
<bold>E-mail:</bold>
<ext-link xlink:href="x.l.hu@sheffield.ac.uk" ext-link-type="email">x.l.hu@sheffield.ac.uk</ext-link>
</corresp>
</author-notes>
<pub-date pub-type="ppub">
<month>September</month>
<year>2005</year>
</pub-date>
<volume>20</volume>
<issue>3</issue>
<fpage>281</fpage>
<lpage>293</lpage>
<permissions>
<copyright-statement>© The Author. Published by Oxford University Press on behalf of ALLC and ACH. All rights reserved. For Permissions, please email: journals.permissions@oupjournals.org</copyright-statement>
<copyright-year>2005</copyright-year>
</permissions>
<abstract xml:lang="en">
<p>The paper presents the outcome of the pilot phase of a major project which aims to build a digital resource for the study of historical Chinese texts with a view to facilitating linguistic analysis of the language, particularly from a diachronic point of view. The approach to general problems for a diachronic corpus is discussed. Details of the tag set and the tagging system devised are given. The development of a sophisticated automatic mark-up scheme for Chinese texts from widely different time periods and genres is indicated.</p>
</abstract>
<custom-meta-wrap>
<custom-meta>
<meta-name>hwp-legacy-fpage</meta-name>
<meta-value>281</meta-value>
</custom-meta>
<custom-meta>
<meta-name>cover-date</meta-name>
<meta-value>September 2005</meta-value>
</custom-meta>
<custom-meta>
<meta-name>hwp-legacy-dochead</meta-name>
<meta-value>Articles</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
</article>
</istex:document>
</istex:metadataXml>
<mods version="3.6">
<titleInfo lang="en">
<title>Sheffield Corpus of Chinese for Diachronic Linguistic Study1</title>
</titleInfo>
<titleInfo type="alternative" lang="en" contentType="CDATA">
<title>Sheffield Corpus of Chinese for Diachronic Linguistic Study1</title>
</titleInfo>
<name type="personal">
<namePart type="given">Xiaoling</namePart>
<namePart type="family">Hu</namePart>
<affiliation>University of Sheffield, UK</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Nigel</namePart>
<namePart type="family">Williamson</namePart>
<affiliation>University of Sheffield, UK</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Jamie</namePart>
<namePart type="family">McLaughlin</namePart>
<affiliation>University of Sheffield, UK</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<typeOfResource>text</typeOfResource>
<genre type="research-article" displayLabel="research-article"></genre>
<originInfo>
<publisher>Oxford University Press</publisher>
<dateIssued encoding="w3cdtf">2005-09</dateIssued>
<copyrightDate encoding="w3cdtf">2005</copyrightDate>
</originInfo>
<language>
<languageTerm type="code" authority="iso639-2b">eng</languageTerm>
<languageTerm type="code" authority="rfc3066">en</languageTerm>
</language>
<physicalDescription>
<internetMediaType>text/html</internetMediaType>
</physicalDescription>
<abstract lang="en">The paper presents the outcome of the pilot phase of a major project which aims to build a digital resource for the study of historical Chinese texts with a view to facilitating linguistic analysis of the language, particularly from a diachronic point of view. The approach to general problems for a diachronic corpus is discussed. Details of the tag set and the tagging system devised are given. The development of a sophisticated automatic mark-up scheme for Chinese texts from widely different time periods and genres is indicated.</abstract>
<note type="author-notes">Dr Xiaoling Hu, School of East Asian Studies, University of Sheffield, Floor 5, The Arts Tower, Western Bank, Sheffield S10 2TN. E-mail: x.l.hu@sheffield.ac.uk</note>
<relatedItem type="host">
<titleInfo>
<title>Literary and Linguistic Computing</title>
</titleInfo>
<titleInfo type="abbreviated">
<title>Lit Linguist Computing</title>
</titleInfo>
<genre type="journal">journal</genre>
<identifier type="ISSN">0268-1145</identifier>
<identifier type="eISSN">1477-4615</identifier>
<identifier type="PublisherID">litlin</identifier>
<identifier type="PublisherID-hwp">litlin</identifier>
<part>
<date>2005</date>
<detail type="volume">
<caption>vol.</caption>
<number>20</number>
</detail>
<detail type="issue">
<caption>no.</caption>
<number>3</number>
</detail>
<extent unit="pages">
<start>281</start>
<end>293</end>
</extent>
</part>
</relatedItem>
<identifier type="istex">256A35EE1B8079C5A29796E783FF3DD586ACD254</identifier>
<identifier type="DOI">10.1093/llc/fqi034</identifier>
<identifier type="local">fqi034</identifier>
<accessCondition type="use and reproduction" contentType="copyright">© The Author. Published by Oxford University Press on behalf of ALLC and ACH. All rights reserved. For Permissions, please email: journals.permissions@oupjournals.org</accessCondition>
<recordInfo>
<recordContentSource>OUP</recordContentSource>
</recordInfo>
</mods>
</metadata>
<enrichments>
<istex:catWosTEI uri="https://api.istex.fr/document/256A35EE1B8079C5A29796E783FF3DD586ACD254/enrichments/catWos">
<teiHeader>
<profileDesc>
<textClass>
<classCode scheme="WOS">LINGUISTICS</classCode>
<classCode scheme="WOS">LITERATURE</classCode>
</textClass>
</profileDesc>
</teiHeader>
</istex:catWosTEI>
<json:item>
<type>refBibs</type>
<uri>https://api.istex.fr/document/256A35EE1B8079C5A29796E783FF3DD586ACD254/enrichments/refBibs</uri>
</json:item>
</enrichments>
<serie></serie>
</istex>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Ticri/explor/TeiVM2/Data/Istex/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000394 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Istex/Corpus/biblio.hfd -nk 000394 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Ticri
   |area=    TeiVM2
   |flux=    Istex
   |étape=   Corpus
   |type=    RBID
   |clé=     ISTEX:256A35EE1B8079C5A29796E783FF3DD586ACD254
   |texte=   Sheffield Corpus of Chinese for Diachronic Linguistic Study1
}}

Wicri

This area was generated with Dilib version V0.6.31.
Data generation: Mon Oct 30 21:59:18 2017. Site generation: Sun Feb 11 23:16:06 2024