Sustainability of annotated resources in linguistics: A web-platform for exploring, querying, and distributing linguistic corpora and other resources
Internal identifier: 000321 (Istex/Corpus); previous: 000320; next: 000322
Authors: Georg Rehm; Oliver Schonefeld; Andreas Witt; Erhard Hinrichs; Marga Reis
Source:
- Literary and Linguistic Computing [0268-1145]; 2009-06.
Abstract
We report on finished work in a project that is concerned with providing methods, tools, best practice guidelines, and solutions for sustainable linguistic resources. The article discusses several general aspects of sustainability and introduces an approach to normalizing corpus data and metadata records. Moreover, the architecture of the sustainability platform implemented by the authors is described.
DOI: 10.1093/llc/fqp003
Links to Exploration step
ISTEX:67F0DBA7CE468A79CB71C752F27B085C02394B54
The document in XML format
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title>Sustainability of annotated resources in linguistics: A web-platform for exploring, querying, and distributing linguistic corpora and other resources</title>
<author wicri:is="90%"><name sortKey="Rehm, Georg" sort="Rehm, Georg" uniqKey="Rehm G" first="Georg" last="Rehm">Georg Rehm</name>
<affiliation><mods:affiliation>vionto GmbH, Berlin, Germany</mods:affiliation>
</affiliation>
<affiliation><mods:affiliation>E-mail: georg.rehm@vionto.com</mods:affiliation>
</affiliation>
</author>
<author wicri:is="90%"><name sortKey="Schonefeld, Oliver" sort="Schonefeld, Oliver" uniqKey="Schonefeld O" first="Oliver" last="Schonefeld">Oliver Schonefeld</name>
<affiliation><mods:affiliation>German National Library of Medicine (ZB MED), Cologne, Germany</mods:affiliation>
</affiliation>
</author>
<author wicri:is="90%"><name sortKey="Witt, Andreas" sort="Witt, Andreas" uniqKey="Witt A" first="Andreas" last="Witt">Andreas Witt</name>
<affiliation><mods:affiliation>Institute for the German Language (IDS), Mannheim, Germany</mods:affiliation>
</affiliation>
</author>
<author wicri:is="90%"><name sortKey="Hinrichs, Erhard" sort="Hinrichs, Erhard" uniqKey="Hinrichs E" first="Erhard" last="Hinrichs">Erhard Hinrichs</name>
<affiliation><mods:affiliation>General and Computational Linguistics, Tübingen University, Tübingen, Germany</mods:affiliation>
</affiliation>
</author>
<author wicri:is="90%"><name sortKey="Reis, Marga" sort="Reis, Marga" uniqKey="Reis M" first="Marga" last="Reis">Marga Reis</name>
<affiliation><mods:affiliation>Deutsches Seminar, Tübingen University, Tübingen, Germany</mods:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:67F0DBA7CE468A79CB71C752F27B085C02394B54</idno>
<date when="2009" year="2009">2009</date>
<idno type="doi">10.1093/llc/fqp003</idno>
<idno type="url">https://api.istex.fr/document/67F0DBA7CE468A79CB71C752F27B085C02394B54/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000321</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a">Sustainability of annotated resources in linguistics: A web-platform for exploring, querying, and distributing linguistic corpora and other resources</title>
<author wicri:is="90%"><name sortKey="Rehm, Georg" sort="Rehm, Georg" uniqKey="Rehm G" first="Georg" last="Rehm">Georg Rehm</name>
<affiliation><mods:affiliation>vionto GmbH, Berlin, Germany</mods:affiliation>
</affiliation>
<affiliation><mods:affiliation>E-mail: georg.rehm@vionto.com</mods:affiliation>
</affiliation>
</author>
<author wicri:is="90%"><name sortKey="Schonefeld, Oliver" sort="Schonefeld, Oliver" uniqKey="Schonefeld O" first="Oliver" last="Schonefeld">Oliver Schonefeld</name>
<affiliation><mods:affiliation>German National Library of Medicine (ZB MED), Cologne, Germany</mods:affiliation>
</affiliation>
</author>
<author wicri:is="90%"><name sortKey="Witt, Andreas" sort="Witt, Andreas" uniqKey="Witt A" first="Andreas" last="Witt">Andreas Witt</name>
<affiliation><mods:affiliation>Institute for the German Language (IDS), Mannheim, Germany</mods:affiliation>
</affiliation>
</author>
<author wicri:is="90%"><name sortKey="Hinrichs, Erhard" sort="Hinrichs, Erhard" uniqKey="Hinrichs E" first="Erhard" last="Hinrichs">Erhard Hinrichs</name>
<affiliation><mods:affiliation>General and Computational Linguistics, Tübingen University, Tübingen, Germany</mods:affiliation>
</affiliation>
</author>
<author wicri:is="90%"><name sortKey="Reis, Marga" sort="Reis, Marga" uniqKey="Reis M" first="Marga" last="Reis">Marga Reis</name>
<affiliation><mods:affiliation>Deutsches Seminar, Tübingen University, Tübingen, Germany</mods:affiliation>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="j">Literary and Linguistic Computing</title>
<idno type="ISSN">0268-1145</idno>
<idno type="eISSN">1477-4615</idno>
<imprint><publisher>Oxford University Press</publisher>
<date type="published" when="2009-06">2009-06</date>
<biblScope unit="volume">24</biblScope>
<biblScope unit="issue">2</biblScope>
<biblScope unit="page" from="193">193</biblScope>
<biblScope unit="page" to="210">210</biblScope>
</imprint>
<idno type="ISSN">0268-1145</idno>
</series>
<idno type="istex">67F0DBA7CE468A79CB71C752F27B085C02394B54</idno>
<idno type="DOI">10.1093/llc/fqp003</idno>
<idno type="ArticleID">fqp003</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0268-1145</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract">We report on finished work in a project that is concerned with providing methods, tools, best practice guidelines, and solutions for sustainable linguistic resources. The article discusses several general aspects of sustainability and introduces an approach to normalizing corpus data and metadata records. Moreover, the architecture of the sustainability platform implemented by the authors is described.</div>
</front>
</TEI>
<istex><corpusName>oup</corpusName>
<author><json:item><name>Georg Rehm</name>
<affiliations><json:string>vionto GmbH, Berlin, Germany</json:string>
<json:string>E-mail: georg.rehm@vionto.com</json:string>
</affiliations>
</json:item>
<json:item><name>Oliver Schonefeld</name>
<affiliations><json:string>German National Library of Medicine (ZB MED), Cologne, Germany</json:string>
</affiliations>
</json:item>
<json:item><name>Andreas Witt</name>
<affiliations><json:string>Institute for the German Language (IDS), Mannheim, Germany</json:string>
</affiliations>
</json:item>
<json:item><name>Erhard Hinrichs</name>
<affiliations><json:string>General and Computational Linguistics, Tübingen University, Tübingen, Germany</json:string>
</affiliations>
</json:item>
<json:item><name>Marga Reis</name>
<affiliations><json:string>Deutsches Seminar, Tübingen University, Tübingen, Germany</json:string>
</affiliations>
</json:item>
</author>
<subject><json:item><lang><json:string>eng</json:string>
</lang>
<value>Original Articles</value>
</json:item>
</subject>
<articleId><json:string>fqp003</json:string>
</articleId>
<language><json:string>eng</json:string>
</language>
<originalGenre><json:string>research-article</json:string>
</originalGenre>
<abstract>We report on finished work in a project that is concerned with providing methods, tools, best practice guidelines, and solutions for sustainable linguistic resources. The article discusses several general aspects of sustainability and introduces an approach to normalizing corpus data and metadata records. Moreover, the architecture of the sustainability platform implemented by the authors is described.</abstract>
<qualityIndicators><score>6.172</score>
<pdfVersion>1.4</pdfVersion>
<pdfPageSize>538.583 x 697.323 pts</pdfPageSize>
<refBibsNative>true</refBibsNative>
<keywordCount>1</keywordCount>
<abstractCharCount>405</abstractCharCount>
<pdfWordCount>6640</pdfWordCount>
<pdfCharCount>43117</pdfCharCount>
<pdfPageCount>18</pdfPageCount>
<abstractWordCount>56</abstractWordCount>
</qualityIndicators>
<title>Sustainability of annotated resources in linguistics: A web-platform for exploring, querying, and distributing linguistic corpora and other resources</title>
<genre><json:string>research-article</json:string>
</genre>
<host><volume>24</volume>
<publisherId><json:string>litlin</json:string>
</publisherId>
<pages><last>210</last>
<first>193</first>
</pages>
<issn><json:string>0268-1145</json:string>
</issn>
<issue>2</issue>
<genre><json:string>journal</json:string>
</genre>
<language><json:string>unknown</json:string>
</language>
<eissn><json:string>1477-4615</json:string>
</eissn>
<title>Literary and Linguistic Computing</title>
</host>
<categories><wos><json:string>LINGUISTICS</json:string>
<json:string>LITERATURE</json:string>
</wos>
</categories>
<publicationDate>2009</publicationDate>
<copyrightDate>2009</copyrightDate>
<doi><json:string>10.1093/llc/fqp003</json:string>
</doi>
<id>67F0DBA7CE468A79CB71C752F27B085C02394B54</id>
<score>0.16755493</score>
<fulltext><json:item><original>true</original>
<mimetype>application/pdf</mimetype>
<extension>pdf</extension>
<uri>https://api.istex.fr/document/67F0DBA7CE468A79CB71C752F27B085C02394B54/fulltext/pdf</uri>
</json:item>
<json:item><original>false</original>
<mimetype>application/zip</mimetype>
<extension>zip</extension>
<uri>https://api.istex.fr/document/67F0DBA7CE468A79CB71C752F27B085C02394B54/fulltext/zip</uri>
</json:item>
<istex:fulltextTEI uri="https://api.istex.fr/document/67F0DBA7CE468A79CB71C752F27B085C02394B54/fulltext/tei"><teiHeader><fileDesc><titleStmt><title level="a">Sustainability of annotated resources in linguistics: A web-platform for exploring, querying, and distributing linguistic corpora and other resources</title>
</titleStmt>
<publicationStmt><authority>ISTEX</authority>
<publisher>Oxford University Press</publisher>
<availability><p>OUP</p>
</availability>
<date>2009-03-19</date>
</publicationStmt>
<sourceDesc><biblStruct type="inbook"><analytic><title level="a">Sustainability of annotated resources in linguistics: A web-platform for exploring, querying, and distributing linguistic corpora and other resources</title>
<author><persName><forename type="first">Georg</forename>
<surname>Rehm</surname>
</persName>
<email>georg.rehm@vionto.com</email>
<affiliation>vionto GmbH, Berlin, Germany</affiliation>
</author>
<author><persName><forename type="first">Oliver</forename>
<surname>Schonefeld</surname>
</persName>
<affiliation>German National Library of Medicine (ZB MED), Cologne, Germany</affiliation>
</author>
<author><persName><forename type="first">Andreas</forename>
<surname>Witt</surname>
</persName>
<affiliation>Institute for the German Language (IDS), Mannheim, Germany</affiliation>
</author>
<author><persName><forename type="first">Erhard</forename>
<surname>Hinrichs</surname>
</persName>
<affiliation>General and Computational Linguistics, Tübingen University, Tübingen, Germany</affiliation>
</author>
<author><persName><forename type="first">Marga</forename>
<surname>Reis</surname>
</persName>
<affiliation>Deutsches Seminar, Tübingen University, Tübingen, Germany</affiliation>
</author>
</analytic>
<monogr><title level="j">Literary and Linguistic Computing</title>
<idno type="pISSN">0268-1145</idno>
<idno type="eISSN">1477-4615</idno>
<imprint><publisher>Oxford University Press</publisher>
<date type="published" when="2009-06"></date>
<biblScope unit="volume">24</biblScope>
<biblScope unit="issue">2</biblScope>
<biblScope unit="page" from="193">193</biblScope>
<biblScope unit="page" to="210">210</biblScope>
</imprint>
</monogr>
<idno type="istex">67F0DBA7CE468A79CB71C752F27B085C02394B54</idno>
<idno type="DOI">10.1093/llc/fqp003</idno>
<idno type="ArticleID">fqp003</idno>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><creation><date>2009-03-19</date>
</creation>
<langUsage><language ident="en">en</language>
</langUsage>
<abstract><p>We report on finished work in a project that is concerned with providing methods, tools, best practice guidelines, and solutions for sustainable linguistic resources. The article discusses several general aspects of sustainability and introduces an approach to normalizing corpus data and metadata records. Moreover, the architecture of the sustainability platform implemented by the authors is described.</p>
</abstract>
<textClass><keywords scheme="keyword"><list><item><term>Original Articles</term>
</item>
</list>
</keywords>
</textClass>
</profileDesc>
<revisionDesc><change when="2009-03-19">Created</change>
<change when="2009-06">Published</change>
</revisionDesc>
</teiHeader>
</istex:fulltextTEI>
<json:item><original>false</original>
<mimetype>text/plain</mimetype>
<extension>txt</extension>
<uri>https://api.istex.fr/document/67F0DBA7CE468A79CB71C752F27B085C02394B54/fulltext/txt</uri>
</json:item>
</fulltext>
<metadata><istex:metadataXml wicri:clean="corpus oup" wicri:toSee="no header"><istex:xmlDeclaration>version="1.0" encoding="utf-8"</istex:xmlDeclaration>
<istex:docType PUBLIC="-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" URI="journalpublishing.dtd" name="istex:docType"></istex:docType>
<istex:document><article article-type="research-article"><front><journal-meta><journal-id journal-id-type="publisher-id">litlin</journal-id>
<journal-id journal-id-type="hwp">litlin</journal-id>
<journal-title>Literary and Linguistic Computing</journal-title>
<issn pub-type="ppub">0268-1145</issn>
<issn pub-type="epub">1477-4615</issn>
<publisher><publisher-name>Oxford University Press</publisher-name>
</publisher>
</journal-meta>
<article-meta><article-id pub-id-type="doi">10.1093/llc/fqp003</article-id>
<article-id pub-id-type="publisher-id">fqp003</article-id>
<article-categories><subj-group><subject>Original Articles</subject>
</subj-group>
</article-categories>
<title-group><article-title>Sustainability of annotated resources in linguistics: A web-platform for exploring, querying, and distributing linguistic corpora and other resources</article-title>
</title-group>
<contrib-group><contrib contrib-type="author" corresp="yes"><name><surname>Rehm</surname>
<given-names>Georg</given-names>
</name>
</contrib>
<aff>vionto GmbH, Berlin, Germany</aff>
</contrib-group>
<contrib-group><contrib contrib-type="author"><name><surname>Schonefeld</surname>
<given-names>Oliver</given-names>
</name>
</contrib>
<aff>German National Library of Medicine (ZB MED), Cologne, Germany</aff>
</contrib-group>
<contrib-group><contrib contrib-type="author"><name><surname>Witt</surname>
<given-names>Andreas</given-names>
</name>
</contrib>
<aff>Institute for the German Language (IDS), Mannheim, Germany</aff>
</contrib-group>
<contrib-group><contrib contrib-type="author"><name><surname>Hinrichs</surname>
<given-names>Erhard</given-names>
</name>
</contrib>
<aff>General and Computational Linguistics, Tübingen University, Tübingen, Germany</aff>
</contrib-group>
<contrib-group><contrib contrib-type="author"><name><surname>Reis</surname>
<given-names>Marga</given-names>
</name>
</contrib>
<aff>Deutsches Seminar, Tübingen University, Tübingen, Germany</aff>
</contrib-group>
<author-notes><corresp><bold>Correspondence:</bold>
Georg Rehm, vionto GmbH, Karl-Marx-Allee 90a, D-10243 Berlin, Germany <bold>E-mail:</bold>
<email>georg.rehm@vionto.com</email>
</corresp>
</author-notes>
<pub-date pub-type="ppub"><month>6</month>
<year>2009</year>
</pub-date>
<pub-date pub-type="epub"><day>19</day>
<month>3</month>
<year>2009</year>
</pub-date>
<volume>24</volume>
<issue>2</issue>
<issue-title>Special Issue 'Selected papers from Digital Humanities 2008, University of Oulu, Finland, June 25–29'</issue-title>
<fpage>193</fpage>
<lpage>210</lpage>
<permissions><copyright-statement>© The Author 2009. Published by Oxford University Press on behalf of ALLC and ACH. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org</copyright-statement>
<copyright-year>2009</copyright-year>
</permissions>
<abstract><p>We report on finished work in a project that is concerned with providing methods, tools, best practice guidelines, and solutions for sustainable linguistic resources. The article discusses several general aspects of sustainability and introduces an approach to normalizing corpus data and metadata records. Moreover, the architecture of the sustainability platform implemented by the authors is described.</p>
</abstract>
</article-meta>
</front>
</article>
</istex:document>
</istex:metadataXml>
<mods version="3.6"><titleInfo><title>Sustainability of annotated resources in linguistics: A web-platform for exploring, querying, and distributing linguistic corpora and other resources</title>
</titleInfo>
<titleInfo type="alternative" contentType="CDATA"><title>Sustainability of annotated resources in linguistics: A web-platform for exploring, querying, and distributing linguistic corpora and other resources</title>
</titleInfo>
<name type="personal"><namePart type="given">Georg</namePart>
<namePart type="family">Rehm</namePart>
<affiliation>vionto GmbH, Berlin, Germany</affiliation>
<affiliation>E-mail: georg.rehm@vionto.com</affiliation>
</name>
<name type="personal"><namePart type="given">Oliver</namePart>
<namePart type="family">Schonefeld</namePart>
<affiliation>German National Library of Medicine (ZB MED), Cologne, Germany</affiliation>
</name>
<name type="personal"><namePart type="given">Andreas</namePart>
<namePart type="family">Witt</namePart>
<affiliation>Institute for the German Language (IDS), Mannheim, Germany</affiliation>
</name>
<name type="personal"><namePart type="given">Erhard</namePart>
<namePart type="family">Hinrichs</namePart>
<affiliation>General and Computational Linguistics, Tübingen University, Tübingen, Germany</affiliation>
</name>
<name type="personal"><namePart type="given">Marga</namePart>
<namePart type="family">Reis</namePart>
<affiliation>Deutsches Seminar, Tübingen University, Tübingen, Germany</affiliation>
</name>
<typeOfResource>text</typeOfResource>
<genre type="research-article" displayLabel="research-article"></genre>
<subject><topic>Original Articles</topic>
</subject>
<originInfo><publisher>Oxford University Press</publisher>
<dateIssued encoding="w3cdtf">2009-06</dateIssued>
<dateCreated encoding="w3cdtf">2009-03-19</dateCreated>
<copyrightDate encoding="w3cdtf">2009</copyrightDate>
</originInfo>
<language><languageTerm type="code" authority="iso639-2b">eng</languageTerm>
<languageTerm type="code" authority="rfc3066">en</languageTerm>
</language>
<physicalDescription><internetMediaType>text/html</internetMediaType>
</physicalDescription>
<abstract>We report on finished work in a project that is concerned with providing methods, tools, best practice guidelines, and solutions for sustainable linguistic resources. The article discusses several general aspects of sustainability and introduces an approach to normalizing corpus data and metadata records. Moreover, the architecture of the sustainability platform implemented by the authors is described.</abstract>
<relatedItem type="host"><titleInfo><title>Literary and Linguistic Computing</title>
</titleInfo>
<genre type="journal">journal</genre>
<identifier type="ISSN">0268-1145</identifier>
<identifier type="eISSN">1477-4615</identifier>
<identifier type="PublisherID">litlin</identifier>
<identifier type="PublisherID-hwp">litlin</identifier>
<part><date>2009</date>
<detail type="title"><title>Special Issue 'Selected papers from Digital Humanities 2008, University of Oulu, Finland, June 25–29'</title>
</detail>
<detail type="volume"><caption>vol.</caption>
<number>24</number>
</detail>
<detail type="issue"><caption>no.</caption>
<number>2</number>
</detail>
<extent unit="pages"><start>193</start>
<end>210</end>
</extent>
</part>
</relatedItem>
<identifier type="istex">67F0DBA7CE468A79CB71C752F27B085C02394B54</identifier>
<identifier type="DOI">10.1093/llc/fqp003</identifier>
<identifier type="ArticleID">fqp003</identifier>
<accessCondition type="use and reproduction" contentType="copyright">© The Author 2009. Published by Oxford University Press on behalf of ALLC and ACH. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org</accessCondition>
<recordInfo><recordContentSource>OUP</recordContentSource>
</recordInfo>
</mods>
</metadata>
<covers><json:item><original>true</original>
<mimetype>image/tiff</mimetype>
<extension>tiff</extension>
<uri>https://api.istex.fr/document/67F0DBA7CE468A79CB71C752F27B085C02394B54/covers/tiff</uri>
</json:item>
</covers>
<annexes><json:item><original>true</original>
<mimetype>image/jpeg</mimetype>
<extension>jpeg</extension>
<uri>https://api.istex.fr/document/67F0DBA7CE468A79CB71C752F27B085C02394B54/annexes/jpeg</uri>
</json:item>
<json:item><original>true</original>
<mimetype>image/gif</mimetype>
<extension>gif</extension>
<uri>https://api.istex.fr/document/67F0DBA7CE468A79CB71C752F27B085C02394B54/annexes/gif</uri>
</json:item>
<json:item><original>true</original>
<mimetype>application/pdf</mimetype>
<extension>pdf</extension>
<uri>https://api.istex.fr/document/67F0DBA7CE468A79CB71C752F27B085C02394B54/annexes/pdf</uri>
</json:item>
</annexes>
<enrichments><istex:catWosTEI uri="https://api.istex.fr/document/67F0DBA7CE468A79CB71C752F27B085C02394B54/enrichments/catWos"><teiHeader><profileDesc><textClass><classCode scheme="WOS">LINGUISTICS</classCode>
<classCode scheme="WOS">LITERATURE</classCode>
</textClass>
</profileDesc>
</teiHeader>
</istex:catWosTEI>
</enrichments>
<serie></serie>
</istex>
</record>
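The record above uses prefixed names such as `wicri:is` and `mods:affiliation`, but the text as shown contains no matching `xmlns` declarations, so a strict XML parser would reject it with an unbound-prefix error. The following is a minimal sketch, not part of the Wicri or ISTEX tooling, of one way to read a few fields from an excerpt of the record: it strips the prefixes with a regular expression (a lossy shortcut) and then builds a short citation string. The names `FRAGMENT`, `strip_prefixes`, and `citation` are hypothetical and chosen only for this illustration; the excerpt is abridged from the `<biblStruct>` element of the record.

```python
import re
import xml.etree.ElementTree as ET

# Abridged excerpt of the <biblStruct> element from the record above.
FRAGMENT = """<biblStruct><analytic>
<author wicri:is="90%"><name sortKey="Rehm, Georg" first="Georg" last="Rehm">Georg Rehm</name>
<affiliation><mods:affiliation>vionto GmbH, Berlin, Germany</mods:affiliation>
</affiliation>
</author>
</analytic>
<series><title level="j">Literary and Linguistic Computing</title>
<imprint><date type="published" when="2009-06">2009-06</date>
<biblScope unit="volume">24</biblScope>
<biblScope unit="issue">2</biblScope>
<biblScope unit="page" from="193">193</biblScope>
<biblScope unit="page" to="210">210</biblScope>
</imprint>
</series>
<idno type="DOI">10.1093/llc/fqp003</idno>
</biblStruct>"""


def strip_prefixes(xml_text):
    """Drop namespace prefixes from tag and attribute names (lossy shortcut)."""
    # <mods:affiliation> -> <affiliation>, </mods:affiliation> -> </affiliation>
    xml_text = re.sub(r"<(/?)[A-Za-z_][\w.-]*:", r"<\1", xml_text)
    # wicri:is="90%" -> is="90%"
    return re.sub(r"(\s)[A-Za-z_][\w.-]*:([A-Za-z_][\w.-]*\s*=)", r"\1\2", xml_text)


def citation(xml_text):
    """Build a one-line citation from a biblStruct excerpt."""
    root = ET.fromstring(strip_prefixes(xml_text))
    scope = {}
    for bibl_scope in root.iter("biblScope"):
        unit = bibl_scope.get("unit")
        if unit == "page":
            # The record stores first and last page in two separate elements.
            unit = "first_page" if bibl_scope.get("from") else "last_page"
        scope[unit] = bibl_scope.text
    authors = "; ".join(name.text for name in root.iter("name"))
    journal = root.findtext("series/title")
    year = root.find(".//date").get("when")[:4]
    doi = root.findtext("idno[@type='DOI']")
    return (
        f"{authors} ({year}). {journal} {scope['volume']}({scope['issue']}): "
        f"{scope['first_page']}-{scope['last_page']}. doi:{doi}"
    )


print(citation(FRAGMENT))
```

Stripping prefixes keeps the sketch short but merges elements that differ only by namespace; declaring the namespaces on a wrapper element would preserve that distinction if it matters.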
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Wicri/Ticri/explor/TeiVM2/Data/Istex/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000321 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/Istex/Corpus/biblio.hfd -nk 000321 | SxmlIndent | more
To link to this page within the Wicri network
{{Explor lien |wiki= Wicri/Ticri |area= TeiVM2 |flux= Istex |étape= Corpus |type= RBID |clé= ISTEX:67F0DBA7CE468A79CB71C752F27B085C02394B54 |texte= Sustainability of annotated resources in linguistics: A web-platform for exploring, querying, and distributing linguistic corpora and other resources }}
This area was generated with Dilib version V0.6.31.