Serveur d'exploration sur la TEI

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

A new agenda for corpus linguistics - working with all of the world's languages

Identifieur interne : 000456 ( Istex/Corpus ); précédent : 000455; suivant : 000457

A new agenda for corpus linguistics - working with all of the world's languages

Auteurs : T. Mcenery ; N. Ostler

Source :

RBID : ISTEX:47F3834E6E43DD4865BBDE99A7C450A55654D9D0

Abstract

In this paper we argue that corpus linguistics needs to expand to cover a wider set of languages. While the reasons that some languages have not been provided with corpus data to the date are clear, the intellectual and moral imperative to extend the range of corpus linguistics is strong. However, there are technical problems to be faced in such an extension of corpus linguistics. These problems are reviewed here and possible solutions to them explored. Following on from this, we consider what possible benefits the provision of appropriate corpus data may bring to languages currently untouched by the development of corpus linguistics.

Url:
DOI: 10.1093/llc/15.4.403

Links to Exploration step

ISTEX:47F3834E6E43DD4865BBDE99A7C450A55654D9D0

Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">A new agenda for corpus linguistics - working with all of the world's languages</title>
<author>
<name sortKey="Mcenery, T" sort="Mcenery, T" uniqKey="Mcenery T" first="T" last="Mcenery">T. Mcenery</name>
<affiliation>
<mods:affiliation>Foundation for Endangered Languages, Batheaston Villa, 172 Bailbrook Lane, Bath BA1 7AA, UK</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Ostler, N" sort="Ostler, N" uniqKey="Ostler N" first="N" last="Ostler">N. Ostler</name>
<affiliation>
<mods:affiliation>Corresponding author E-mail: nostler@chibcha.demon.co.uk</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>Foundation for Endangered Languages, Batheaston Villa, 172 Bailbrook Lane, Bath BA1 7AA, UK</mods:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:47F3834E6E43DD4865BBDE99A7C450A55654D9D0</idno>
<date when="2000" year="2000">2000</date>
<idno type="doi">10.1093/llc/15.4.403</idno>
<idno type="url">https://api.istex.fr/document/47F3834E6E43DD4865BBDE99A7C450A55654D9D0/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000456</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">A new agenda for corpus linguistics - working with all of the world's languages</title>
<author>
<name sortKey="Mcenery, T" sort="Mcenery, T" uniqKey="Mcenery T" first="T" last="Mcenery">T. Mcenery</name>
<affiliation>
<mods:affiliation>Foundation for Endangered Languages, Batheaston Villa, 172 Bailbrook Lane, Bath BA1 7AA, UK</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Ostler, N" sort="Ostler, N" uniqKey="Ostler N" first="N" last="Ostler">N. Ostler</name>
<affiliation>
<mods:affiliation>Corresponding author E-mail: nostler@chibcha.demon.co.uk</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>Foundation for Endangered Languages, Batheaston Villa, 172 Bailbrook Lane, Bath BA1 7AA, UK</mods:affiliation>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j">Literary and Linguistic Computing</title>
<title level="j" type="abbrev">Lit Linguist Computing</title>
<idno type="ISSN">0268-1145</idno>
<idno type="eISSN">1477-4615</idno>
<imprint>
<publisher>Oxford University Press</publisher>
<date type="published" when="2000-12">2000-12</date>
<biblScope unit="volume">15</biblScope>
<biblScope unit="issue">4</biblScope>
<biblScope unit="page" from="403">403</biblScope>
<biblScope unit="page" to="420">420</biblScope>
</imprint>
<idno type="ISSN">0268-1145</idno>
</series>
<idno type="istex">47F3834E6E43DD4865BBDE99A7C450A55654D9D0</idno>
<idno type="DOI">10.1093/llc/15.4.403</idno>
<idno type="local">2</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0268-1145</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">In this paper we argue that corpus linguistics needs to expand to cover a wider set of languages. While the reasons that some languages have not been provided with corpus data to the date are clear, the intellectual and moral imperative to extend the range of corpus linguistics is strong. However, there are technical problems to be faced in such an extension of corpus linguistics. These problems are reviewed here and possible solutions to them explored. Following on from this, we consider what possible benefits the provision of appropriate corpus data may bring to languages currently untouched by the development of corpus linguistics.</div>
</front>
</TEI>
<istex>
<corpusName>oup</corpusName>
<author>
<json:item>
<name>T McEnery</name>
<affiliations>
<json:string>Foundation for Endangered Languages, Batheaston Villa, 172 Bailbrook Lane, Bath BA1 7AA, UK</json:string>
</affiliations>
</json:item>
<json:item>
<name>N Ostler</name>
<affiliations>
<json:string>Corresponding author E-mail: nostler@chibcha.demon.co.uk</json:string>
<json:string>Foundation for Endangered Languages, Batheaston Villa, 172 Bailbrook Lane, Bath BA1 7AA, UK</json:string>
</affiliations>
</json:item>
</author>
<language>
<json:string>eng</json:string>
</language>
<originalGenre>
<json:string>research-article</json:string>
</originalGenre>
<abstract>In this paper we argue that corpus linguistics needs to expand to cover a wider set of languages. While the reasons that some languages have not been provided with corpus data to the date are clear, the intellectual and moral imperative to extend the range of corpus linguistics is strong. However, there are technical problems to be faced in such an extension of corpus linguistics. These problems are reviewed here and possible solutions to them explored. Following on from this, we consider what possible benefits the provision of appropriate corpus data may bring to languages currently untouched by the development of corpus linguistics.</abstract>
<qualityIndicators>
<score>6.236</score>
<pdfVersion>1.3</pdfVersion>
<pdfPageSize>519 x 702 pts</pdfPageSize>
<refBibsNative>false</refBibsNative>
<keywordCount>0</keywordCount>
<abstractCharCount>642</abstractCharCount>
<pdfWordCount>7292</pdfWordCount>
<pdfCharCount>44924</pdfCharCount>
<pdfPageCount>17</pdfPageCount>
<abstractWordCount>103</abstractWordCount>
</qualityIndicators>
<title>A new agenda for corpus linguistics - working with all of the world's languages</title>
<genre>
<json:string>research-article</json:string>
</genre>
<host>
<volume>15</volume>
<publisherId>
<json:string>litlin</json:string>
</publisherId>
<pages>
<last>420</last>
<first>403</first>
</pages>
<issn>
<json:string>0268-1145</json:string>
</issn>
<issue>4</issue>
<genre>
<json:string>journal</json:string>
</genre>
<language>
<json:string>unknown</json:string>
</language>
<eissn>
<json:string>1477-4615</json:string>
</eissn>
<title>Literary and Linguistic Computing</title>
</host>
<categories>
<wos>
<json:string>LINGUISTICS</json:string>
<json:string>LITERATURE</json:string>
</wos>
</categories>
<publicationDate>2000</publicationDate>
<copyrightDate>2000</copyrightDate>
<doi>
<json:string>10.1093/llc/15.4.403</json:string>
</doi>
<id>47F3834E6E43DD4865BBDE99A7C450A55654D9D0</id>
<score>0.11850558</score>
<fulltext>
<json:item>
<original>true</original>
<mimetype>application/pdf</mimetype>
<extension>pdf</extension>
<uri>https://api.istex.fr/document/47F3834E6E43DD4865BBDE99A7C450A55654D9D0/fulltext/pdf</uri>
</json:item>
<json:item>
<original>false</original>
<mimetype>application/zip</mimetype>
<extension>zip</extension>
<uri>https://api.istex.fr/document/47F3834E6E43DD4865BBDE99A7C450A55654D9D0/fulltext/zip</uri>
</json:item>
<istex:fulltextTEI uri="https://api.istex.fr/document/47F3834E6E43DD4865BBDE99A7C450A55654D9D0/fulltext/tei">
<teiHeader>
<fileDesc>
<titleStmt>
<title level="a" type="main" xml:lang="en">A new agenda for corpus linguistics - working with all of the world's languages</title>
<respStmt xml:id="ISTEX-API" resp="Références bibliographiques récupérées via GROBID" name="ISTEX-API (INIST-CNRS)"></respStmt>
<respStmt xml:id="ISTEX-API" resp="Références bibliographiques récupérées via GROBID" name="ISTEX-API (INIST-CNRS)"></respStmt>
<respStmt>
<resp>Références bibliographiques récupérées via GROBID</resp>
<name resp="ISTEX-API">ISTEX-API (INIST-CNRS)</name>
</respStmt>
</titleStmt>
<publicationStmt>
<authority>ISTEX</authority>
<publisher>Oxford University Press</publisher>
<availability>
<p>OUP</p>
</availability>
<date>2000</date>
</publicationStmt>
<sourceDesc>
<biblStruct type="inbook">
<analytic>
<title level="a" type="main" xml:lang="en">A new agenda for corpus linguistics - working with all of the world's languages</title>
<author>
<persName>
<forename type="first">T</forename>
<surname>McEnery</surname>
</persName>
<affiliation>Foundation for Endangered Languages, Batheaston Villa, 172 Bailbrook Lane, Bath BA1 7AA, UK</affiliation>
</author>
<author>
<persName>
<forename type="first">N</forename>
<surname>Ostler</surname>
</persName>
<email>nostler@chibcha.demon.co.uk</email>
<affiliation>Foundation for Endangered Languages, Batheaston Villa, 172 Bailbrook Lane, Bath BA1 7AA, UK</affiliation>
</author>
</analytic>
<monogr>
<title level="j">Literary and Linguistic Computing</title>
<title level="j" type="abbrev">Lit Linguist Computing</title>
<idno type="pISSN">0268-1145</idno>
<idno type="eISSN">1477-4615</idno>
<imprint>
<publisher>Oxford University Press</publisher>
<date type="published" when="2000-12"></date>
<biblScope unit="volume">15</biblScope>
<biblScope unit="issue">4</biblScope>
<biblScope unit="page" from="403">403</biblScope>
<biblScope unit="page" to="420">420</biblScope>
</imprint>
</monogr>
<idno type="istex">47F3834E6E43DD4865BBDE99A7C450A55654D9D0</idno>
<idno type="DOI">10.1093/llc/15.4.403</idno>
<idno type="local">2</idno>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<creation>
<date>2000</date>
</creation>
<langUsage>
<language ident="en">en</language>
</langUsage>
<abstract xml:lang="en">
<p>In this paper we argue that corpus linguistics needs to expand to cover a wider set of languages. While the reasons that some languages have not been provided with corpus data to the date are clear, the intellectual and moral imperative to extend the range of corpus linguistics is strong. However, there are technical problems to be faced in such an extension of corpus linguistics. These problems are reviewed here and possible solutions to them explored. Following on from this, we consider what possible benefits the provision of appropriate corpus data may bring to languages currently untouched by the development of corpus linguistics.</p>
</abstract>
</profileDesc>
<revisionDesc>
<change when="2000-12">Published</change>
<change xml:id="refBibs-istex" who="#ISTEX-API" when="2016-3-15">References added</change>
<change xml:id="refBibs-istex" who="#ISTEX-API" when="2016-3-21">References added</change>
<change xml:id="refBibs-istex" who="#ISTEX-API" when="2016-07-27">References added</change>
</revisionDesc>
</teiHeader>
</istex:fulltextTEI>
<json:item>
<original>false</original>
<mimetype>text/plain</mimetype>
<extension>txt</extension>
<uri>https://api.istex.fr/document/47F3834E6E43DD4865BBDE99A7C450A55654D9D0/fulltext/txt</uri>
</json:item>
</fulltext>
<metadata>
<istex:metadataXml wicri:clean="corpus oup" wicri:toSee="no header">
<istex:xmlDeclaration>version="1.0" encoding="US-ASCII"</istex:xmlDeclaration>
<istex:docType PUBLIC="-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" URI="journalpublishing.dtd" name="istex:docType"></istex:docType>
<istex:document>
<article xml:lang="en" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">litlin</journal-id>
<journal-id journal-id-type="hwp">litlin</journal-id>
<journal-title>Literary and Linguistic Computing</journal-title>
<abbrev-journal-title abbrev-type="publisher">Lit Linguist Computing</abbrev-journal-title>
<issn pub-type="ppub">0268-1145</issn>
<issn pub-type="epub">1477-4615</issn>
<publisher>
<publisher-name>Oxford University Press</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="other">2</article-id>
<article-id pub-id-type="doi">10.1093/llc/15.4.403</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>A new agenda for corpus linguistics - working with all of the world's languages</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>McEnery</surname>
<given-names>T</given-names>
</name>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Ostler</surname>
<given-names>N</given-names>
</name>
<xref rid="Z">Z</xref>
</contrib>
<aff> Foundation for Endangered Languages, Batheaston Villa, 172 Bailbrook Lane, Bath BA1 7AA, UK
<target target-type="aff" id="Z"></target>
<label>Z</label>
Corresponding author E-mail: nostler@chibcha.demon.co.uk </aff>
</contrib-group>
<pub-date pub-type="ppub">
<month>12</month>
<year>2000</year>
</pub-date>
<volume>15</volume>
<issue>4</issue>
<fpage>403</fpage>
<lpage>420</lpage>
<permissions>
<copyright-statement>Copyright 2000</copyright-statement>
<copyright-year>2000</copyright-year>
</permissions>
<abstract xml:lang="en">
<p>In this paper we argue that corpus linguistics needs to expand to cover a wider set of languages. While the reasons that some languages have not been provided with corpus data to the date are clear, the intellectual and moral imperative to extend the range of corpus linguistics is strong. However, there are technical problems to be faced in such an extension of corpus linguistics. These problems are reviewed here and possible solutions to them explored. Following on from this, we consider what possible benefits the provision of appropriate corpus data may bring to languages currently untouched by the development of corpus linguistics.</p>
</abstract>
<custom-meta-wrap>
<custom-meta>
<meta-name>hwp-legacy-fpage</meta-name>
<meta-value>403</meta-value>
</custom-meta>
<custom-meta>
<meta-name>hwp-legacy-dochead</meta-name>
<meta-value>Article</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
</article>
</istex:document>
</istex:metadataXml>
<mods version="3.6">
<titleInfo lang="en">
<title>A new agenda for corpus linguistics - working with all of the world's languages</title>
</titleInfo>
<titleInfo type="alternative" lang="en" contentType="CDATA">
<title>A new agenda for corpus linguistics - working with all of the world's languages</title>
</titleInfo>
<name type="personal">
<namePart type="given">T</namePart>
<namePart type="family">McEnery</namePart>
<affiliation>Foundation for Endangered Languages, Batheaston Villa, 172 Bailbrook Lane, Bath BA1 7AA, UK</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">N</namePart>
<namePart type="family">Ostler</namePart>
<affiliation>Corresponding author E-mail: nostler@chibcha.demon.co.uk</affiliation>
<affiliation>Foundation for Endangered Languages, Batheaston Villa, 172 Bailbrook Lane, Bath BA1 7AA, UK</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<typeOfResource>text</typeOfResource>
<genre type="research-article" displayLabel="research-article"></genre>
<originInfo>
<publisher>Oxford University Press</publisher>
<dateIssued encoding="w3cdtf">2000-12</dateIssued>
<copyrightDate encoding="w3cdtf">2000</copyrightDate>
</originInfo>
<language>
<languageTerm type="code" authority="iso639-2b">eng</languageTerm>
<languageTerm type="code" authority="rfc3066">en</languageTerm>
</language>
<physicalDescription>
<internetMediaType>text/html</internetMediaType>
</physicalDescription>
<abstract lang="en">In this paper we argue that corpus linguistics needs to expand to cover a wider set of languages. While the reasons that some languages have not been provided with corpus data to the date are clear, the intellectual and moral imperative to extend the range of corpus linguistics is strong. However, there are technical problems to be faced in such an extension of corpus linguistics. These problems are reviewed here and possible solutions to them explored. Following on from this, we consider what possible benefits the provision of appropriate corpus data may bring to languages currently untouched by the development of corpus linguistics.</abstract>
<relatedItem type="host">
<titleInfo>
<title>Literary and Linguistic Computing</title>
</titleInfo>
<titleInfo type="abbreviated">
<title>Lit Linguist Computing</title>
</titleInfo>
<genre type="journal">journal</genre>
<identifier type="ISSN">0268-1145</identifier>
<identifier type="eISSN">1477-4615</identifier>
<identifier type="PublisherID">litlin</identifier>
<identifier type="PublisherID-hwp">litlin</identifier>
<part>
<date>2000</date>
<detail type="volume">
<caption>vol.</caption>
<number>15</number>
</detail>
<detail type="issue">
<caption>no.</caption>
<number>4</number>
</detail>
<extent unit="pages">
<start>403</start>
<end>420</end>
</extent>
</part>
</relatedItem>
<identifier type="istex">47F3834E6E43DD4865BBDE99A7C450A55654D9D0</identifier>
<identifier type="DOI">10.1093/llc/15.4.403</identifier>
<identifier type="local">2</identifier>
<accessCondition type="use and reproduction" contentType="copyright">Copyright 2000</accessCondition>
<recordInfo>
<recordContentSource>OUP</recordContentSource>
</recordInfo>
</mods>
</metadata>
<enrichments>
<istex:catWosTEI uri="https://api.istex.fr/document/47F3834E6E43DD4865BBDE99A7C450A55654D9D0/enrichments/catWos">
<teiHeader>
<profileDesc>
<textClass>
<classCode scheme="WOS">LINGUISTICS</classCode>
<classCode scheme="WOS">LITERATURE</classCode>
</textClass>
</profileDesc>
</teiHeader>
</istex:catWosTEI>
<json:item>
<type>refBibs</type>
<uri>https://api.istex.fr/document/47F3834E6E43DD4865BBDE99A7C450A55654D9D0/enrichments/refBibs</uri>
</json:item>
</enrichments>
<serie></serie>
</istex>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Ticri/explor/TeiVM2/Data/Istex/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000456 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Istex/Corpus/biblio.hfd -nk 000456 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Ticri
   |area=    TeiVM2
   |flux=    Istex
   |étape=   Corpus
   |type=    RBID
   |clé=     ISTEX:47F3834E6E43DD4865BBDE99A7C450A55654D9D0
   |texte=   A new agenda for corpus linguistics - working with all of the world's languages
}}

Wicri

This area was generated with Dilib version V0.6.31.
Data generation: Mon Oct 30 21:59:18 2017. Site generation: Sun Feb 11 23:16:06 2024