Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Adapting a Robust Multi-genre NE System for Automatic Content Extraction

Identifieur interne : 000860 ( Istex/Corpus ); précédent : 000859; suivant : 000861

Adapting a Robust Multi-genre NE System for Automatic Content Extraction

Auteurs : Diana Maynard ; Hamish Cunningham ; Kalina Bontcheva ; Marin Dimitrov

Source :

RBID : ISTEX:50643758F6A5345504D4B37A8BBA39C828D900BF

Abstract

Abstract: Many current information extraction systems tend to be designed with particular applications and domains in mind. With the increasing need for robust language engineering tools which can handle a variety of language processing demands, we have used the GATE architecture to design MUSE - a system for named entity recognition and related tasks. In this paper, we address the issue of how this general-purpose system can be adapted for particular applications with minimal time and effort, and how the set of resources used can be adapted dynamically and automatically. We focus specifically on the challenges of the ACE (Automatic Content Extraction) entity detection and tracking task, and preliminary results show promising figures.

Url:
DOI: 10.1007/3-540-46148-5_27

Links to Exploration step

ISTEX:50643758F6A5345504D4B37A8BBA39C828D900BF

Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct:series">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Adapting a Robust Multi-genre NE System for Automatic Content Extraction</title>
<author>
<name sortKey="Maynard, Diana" sort="Maynard, Diana" uniqKey="Maynard D" first="Diana" last="Maynard">Diana Maynard</name>
<affiliation>
<mods:affiliation>Dept of Computer Science, University of Sheffield, 211 Portobello St, S1 4DP, Sheffield, UK</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: diana@dcs.shef.ac.uk</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Cunningham, Hamish" sort="Cunningham, Hamish" uniqKey="Cunningham H" first="Hamish" last="Cunningham">Hamish Cunningham</name>
<affiliation>
<mods:affiliation>Dept of Computer Science, University of Sheffield, 211 Portobello St, S1 4DP, Sheffield, UK</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: hamish@dcs.shef.ac.uk</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Bontcheva, Kalina" sort="Bontcheva, Kalina" uniqKey="Bontcheva K" first="Kalina" last="Bontcheva">Kalina Bontcheva</name>
<affiliation>
<mods:affiliation>Dept of Computer Science, University of Sheffield, 211 Portobello St, S1 4DP, Sheffield, UK</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: kalina@dcs.shef.ac.uk</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Dimitrov, Marin" sort="Dimitrov, Marin" uniqKey="Dimitrov M" first="Marin" last="Dimitrov">Marin Dimitrov</name>
<affiliation>
<mods:affiliation>Sirma AI Ltd, Ontotext Lab, 38AHristo Botev Blvd, 1000, Sofia, Bulgaria</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: marin@sirma.bg</mods:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:50643758F6A5345504D4B37A8BBA39C828D900BF</idno>
<date when="2002" year="2002">2002</date>
<idno type="doi">10.1007/3-540-46148-5_27</idno>
<idno type="url">https://api.istex.fr/document/50643758F6A5345504D4B37A8BBA39C828D900BF/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000860</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Adapting a Robust Multi-genre NE System for Automatic Content Extraction</title>
<author>
<name sortKey="Maynard, Diana" sort="Maynard, Diana" uniqKey="Maynard D" first="Diana" last="Maynard">Diana Maynard</name>
<affiliation>
<mods:affiliation>Dept of Computer Science, University of Sheffield, 211 Portobello St, S1 4DP, Sheffield, UK</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: diana@dcs.shef.ac.uk</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Cunningham, Hamish" sort="Cunningham, Hamish" uniqKey="Cunningham H" first="Hamish" last="Cunningham">Hamish Cunningham</name>
<affiliation>
<mods:affiliation>Dept of Computer Science, University of Sheffield, 211 Portobello St, S1 4DP, Sheffield, UK</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: hamish@dcs.shef.ac.uk</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Bontcheva, Kalina" sort="Bontcheva, Kalina" uniqKey="Bontcheva K" first="Kalina" last="Bontcheva">Kalina Bontcheva</name>
<affiliation>
<mods:affiliation>Dept of Computer Science, University of Sheffield, 211 Portobello St, S1 4DP, Sheffield, UK</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: kalina@dcs.shef.ac.uk</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Dimitrov, Marin" sort="Dimitrov, Marin" uniqKey="Dimitrov M" first="Marin" last="Dimitrov">Marin Dimitrov</name>
<affiliation>
<mods:affiliation>Sirma AI Ltd, Ontotext Lab, 38AHristo Botev Blvd, 1000, Sofia, Bulgaria</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: marin@sirma.bg</mods:affiliation>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2002</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">50643758F6A5345504D4B37A8BBA39C828D900BF</idno>
<idno type="DOI">10.1007/3-540-46148-5_27</idno>
<idno type="ChapterID">27</idno>
<idno type="ChapterID">Chap27</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: Many current information extraction systems tend to be designed with particular applications and domains in mind. With the increasing need for robust language engineering tools which can handle a variety of language processing demands, we have used the GATE architecture to design MUSE - a system for named entity recognition and related tasks. In this paper, we address the issue of how this general-purpose system can be adapted for particular applications with minimal time and effort, and how the set of resources used can be adapted dynamically and automatically. We focus specifically on the challenges of the ACE (Automatic Content Extraction) entity detection and tracking task, and preliminary results show promising figures.</div>
</front>
</TEI>
<istex>
<corpusName>springer</corpusName>
<author>
<json:item>
<name>Diana Maynard</name>
<affiliations>
<json:string>Dept of Computer Science, University of Sheffield, 211 Portobello St, S1 4DP, Sheffield, UK</json:string>
<json:string>E-mail: diana@dcs.shef.ac.uk</json:string>
</affiliations>
</json:item>
<json:item>
<name>Hamish Cunningham</name>
<affiliations>
<json:string>Dept of Computer Science, University of Sheffield, 211 Portobello St, S1 4DP, Sheffield, UK</json:string>
<json:string>E-mail: hamish@dcs.shef.ac.uk</json:string>
</affiliations>
</json:item>
<json:item>
<name>Kalina Bontcheva</name>
<affiliations>
<json:string>Dept of Computer Science, University of Sheffield, 211 Portobello St, S1 4DP, Sheffield, UK</json:string>
<json:string>E-mail: kalina@dcs.shef.ac.uk</json:string>
</affiliations>
</json:item>
<json:item>
<name>Marin Dimitrov</name>
<affiliations>
<json:string>Sirma AI Ltd, Ontotext Lab, 38AHristo Botev Blvd, 1000, Sofia, Bulgaria</json:string>
<json:string>E-mail: marin@sirma.bg</json:string>
</affiliations>
</json:item>
</author>
<language>
<json:string>eng</json:string>
</language>
<abstract>Abstract: Many current information extraction systems tend to be designed with particular applications and domains in mind. With the increasing need for robust language engineering tools which can handle a variety of language processing demands, we have used the GATE architecture to design MUSE - a system for named entity recognition and related tasks. In this paper, we address the issue of how this general-purpose system can be adapted for particular applications with minimal time and effort, and how the set of resources used can be adapted dynamically and automatically. We focus specifically on the challenges of the ACE (Automatic Content Extraction) entity detection and tracking task, and preliminary results show promising figures.</abstract>
<qualityIndicators>
<score>5.21</score>
<pdfVersion>1.3</pdfVersion>
<pdfPageSize>648 x 864 pts</pdfPageSize>
<refBibsNative>false</refBibsNative>
<keywordCount>0</keywordCount>
<abstractCharCount>744</abstractCharCount>
<pdfWordCount>3854</pdfWordCount>
<pdfCharCount>22928</pdfCharCount>
<pdfPageCount>10</pdfPageCount>
<abstractWordCount>113</abstractWordCount>
</qualityIndicators>
<title>Adapting a Robust Multi-genre NE System for Automatic Content Extraction</title>
<genre.original>
<json:string>OriginalPaper</json:string>
</genre.original>
<chapterId>
<json:string>27</json:string>
<json:string>Chap27</json:string>
</chapterId>
<genre>
<json:string>conference [eBooks]</json:string>
</genre>
<serie>
<editor>
<json:item>
<name>G. Goos</name>
</json:item>
<json:item>
<name>J. Hartmanis</name>
</json:item>
<json:item>
<name>J. van Leeuwen</name>
</json:item>
</editor>
<issn>
<json:string>0302-9743</json:string>
</issn>
<language>
<json:string>unknown</json:string>
</language>
<title>Lecture Notes in Computer Science</title>
<copyrightDate>2002</copyrightDate>
</serie>
<host>
<editor>
<json:item>
<name>Donia Scott</name>
<affiliations>
<json:string>ITRI, University of Brighton, Lewes Road, BN2 4GJ, Brighton, UK</json:string>
<json:string>E-mail: donia.scott@itri.bton.ac.uk</json:string>
</affiliations>
</json:item>
</editor>
<subject>
<json:item>
<value>Computer Science</value>
</json:item>
<json:item>
<value>Computer Science</value>
</json:item>
<json:item>
<value>Artificial Intelligence (incl. Robotics)</value>
</json:item>
</subject>
<isbn>
<json:string>978-3-540-44127-4</json:string>
</isbn>
<language>
<json:string>unknown</json:string>
</language>
<title>Artificial Intelligence: Methodology, Systems, and Applications</title>
<genre.original>
<json:string>Proceedings</json:string>
</genre.original>
<bookId>
<json:string>3-540-46148-5</json:string>
</bookId>
<volume>2443</volume>
<pages>
<last>273</last>
<first>264</first>
</pages>
<issn>
<json:string>0302-9743</json:string>
</issn>
<genre>
<json:string>Book Series</json:string>
</genre>
<eisbn>
<json:string>978-3-540-46148-7</json:string>
</eisbn>
<copyrightDate>2002</copyrightDate>
<doi>
<json:string>10.1007/3-540-46148-5</json:string>
</doi>
</host>
<publicationDate>2002</publicationDate>
<copyrightDate>2002</copyrightDate>
<doi>
<json:string>10.1007/3-540-46148-5_27</json:string>
</doi>
<id>50643758F6A5345504D4B37A8BBA39C828D900BF</id>
<fulltext>
<json:item>
<original>true</original>
<mimetype>application/pdf</mimetype>
<extension>pdf</extension>
<uri>https://api.istex.fr/document/50643758F6A5345504D4B37A8BBA39C828D900BF/fulltext/pdf</uri>
</json:item>
<json:item>
<original>false</original>
<mimetype>application/zip</mimetype>
<extension>zip</extension>
<uri>https://api.istex.fr/document/50643758F6A5345504D4B37A8BBA39C828D900BF/fulltext/zip</uri>
</json:item>
<istex:fulltextTEI uri="https://api.istex.fr/document/50643758F6A5345504D4B37A8BBA39C828D900BF/fulltext/tei">
<teiHeader>
<fileDesc>
<titleStmt>
<title level="a" type="main" xml:lang="en">Adapting a Robust Multi-genre NE System for Automatic Content Extraction</title>
<respStmt xml:id="ISTEX-API" resp="Références bibliographiques récupérées via GROBID" name="ISTEX-API (INIST-CNRS)"></respStmt>
</titleStmt>
<publicationStmt>
<authority>ISTEX</authority>
<publisher>Springer Berlin Heidelberg</publisher>
<pubPlace>Berlin, Heidelberg</pubPlace>
<availability>
<p>SPRINGER</p>
</availability>
<date>2002</date>
</publicationStmt>
<sourceDesc>
<biblStruct type="inbook">
<analytic>
<title level="a" type="main" xml:lang="en">Adapting a Robust Multi-genre NE System for Automatic Content Extraction</title>
<author>
<persName>
<forename type="first">Diana</forename>
<surname>Maynard</surname>
</persName>
<email>diana@dcs.shef.ac.uk</email>
<affiliation>Dept of Computer Science, University of Sheffield, 211 Portobello St, S1 4DP, Sheffield, UK</affiliation>
</author>
<author>
<persName>
<forename type="first">Hamish</forename>
<surname>Cunningham</surname>
</persName>
<email>hamish@dcs.shef.ac.uk</email>
<affiliation>Dept of Computer Science, University of Sheffield, 211 Portobello St, S1 4DP, Sheffield, UK</affiliation>
</author>
<author>
<persName>
<forename type="first">Kalina</forename>
<surname>Bontcheva</surname>
</persName>
<email>kalina@dcs.shef.ac.uk</email>
<affiliation>Dept of Computer Science, University of Sheffield, 211 Portobello St, S1 4DP, Sheffield, UK</affiliation>
</author>
<author>
<persName>
<forename type="first">Marin</forename>
<surname>Dimitrov</surname>
</persName>
<email>marin@sirma.bg</email>
<affiliation>Sirma AI Ltd, Ontotext Lab, 38AHristo Botev Blvd, 1000, Sofia, Bulgaria</affiliation>
</author>
</analytic>
<monogr>
<title level="m">Artificial Intelligence: Methodology, Systems, and Applications</title>
<title level="m" type="sub">10th International Conference, AIMSA 2002 Varna, Bulgaria, September 4–6, 2002 Proceedings</title>
<idno type="pISBN">978-3-540-44127-4</idno>
<idno type="eISBN">978-3-540-46148-7</idno>
<idno type="pISSN">0302-9743</idno>
<idno type="DOI">10.1007/3-540-46148-5</idno>
<idno type="BookID">3-540-46148-5</idno>
<idno type="BookTitleID">72465</idno>
<idno type="BookSequenceNumber">2443</idno>
<idno type="BookVolumeNumber">2443</idno>
<idno type="BookChapterCount">28</idno>
<editor>
<persName>
<forename type="first">Donia</forename>
<surname>Scott</surname>
</persName>
<email>donia.scott@itri.bton.ac.uk</email>
<affiliation>ITRI, University of Brighton, Lewes Road, BN2 4GJ, Brighton, UK</affiliation>
</editor>
<imprint>
<publisher>Springer Berlin Heidelberg</publisher>
<pubPlace>Berlin, Heidelberg</pubPlace>
<date type="published" when="2002"></date>
<biblScope unit="volume">2443</biblScope>
<biblScope unit="page" from="264">264</biblScope>
<biblScope unit="page" to="273">273</biblScope>
</imprint>
</monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<editor>
<persName>
<forename type="first">G.</forename>
<surname>Goos</surname>
</persName>
</editor>
<editor>
<persName>
<forename type="first">J.</forename>
<surname>Hartmanis</surname>
</persName>
</editor>
<editor>
<persName>
<forename type="first">J.</forename>
<surname>van Leeuwen</surname>
</persName>
</editor>
<biblScope>
<date>2002</date>
</biblScope>
<idno type="pISSN">0302-9743</idno>
<idno type="seriesId">558</idno>
</series>
<series>
<title level="s">Lecture Notes in Artificial Intelligence</title>
<title level="s" type="sub">Subseries of Lecture Notes in Computer Science</title>
<editor>
<persName>
<forename type="first">G.</forename>
<surname>Goos</surname>
</persName>
</editor>
<editor>
<persName>
<forename type="first">J.</forename>
<surname>Hartmanis</surname>
</persName>
</editor>
<editor>
<persName>
<forename type="first">J.</forename>
<surname>van Leeuwen</surname>
</persName>
</editor>
<editor>
<persName>
<forename type="first">Donia</forename>
<surname>Scott</surname>
</persName>
<email>donia.scott@itri.bton.ac.uk</email>
<affiliation>ITRI, University of Brighton, Lewes Road, BN2 4GJ, Brighton, UK</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Jaime</forename>
<forename type="first">G.</forename>
<surname>Carbonell</surname>
</persName>
<affiliation>Carnegie Mellon University, Pittsburgh, PA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Jörg</forename>
<surname>Siekmann</surname>
</persName>
<affiliation>University of Saarland, Saarbrücken, Germany</affiliation>
</editor>
<biblScope type="seriesId">1244</biblScope>
</series>
<idno type="istex">50643758F6A5345504D4B37A8BBA39C828D900BF</idno>
<idno type="DOI">10.1007/3-540-46148-5_27</idno>
<idno type="ChapterID">27</idno>
<idno type="ChapterID">Chap27</idno>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<creation>
<date>2002</date>
</creation>
<langUsage>
<language ident="en">en</language>
</langUsage>
<abstract xml:lang="en">
<p>Abstract: Many current information extraction systems tend to be designed with particular applications and domains in mind. With the increasing need for robust language engineering tools which can handle a variety of language processing demands, we have used the GATE architecture to design MUSE - a system for named entity recognition and related tasks. In this paper, we address the issue of how this general-purpose system can be adapted for particular applications with minimal time and effort, and how the set of resources used can be adapted dynamically and automatically. We focus specifically on the challenges of the ACE (Automatic Content Extraction) entity detection and tracking task, and preliminary results show promising figures.</p>
</abstract>
<textClass>
<keywords scheme="Book Subject Collection">
<list>
<label>SUCO11645</label>
<item>
<term>Computer Science</term>
</item>
</list>
</keywords>
</textClass>
<textClass>
<keywords scheme="Book Subject Group">
<list>
<label>I</label>
<label>I21017</label>
<item>
<term>Computer Science</term>
</item>
<item>
<term>Artificial Intelligence (incl. Robotics)</term>
</item>
</list>
</keywords>
</textClass>
</profileDesc>
<revisionDesc>
<change when="2002">Published</change>
<change xml:id="refBibs-istex" who="#ISTEX-API" when="2016-3-19">References added</change>
</revisionDesc>
</teiHeader>
</istex:fulltextTEI>
<json:item>
<original>false</original>
<mimetype>text/plain</mimetype>
<extension>txt</extension>
<uri>https://api.istex.fr/document/50643758F6A5345504D4B37A8BBA39C828D900BF/fulltext/txt</uri>
</json:item>
</fulltext>
<metadata>
<istex:metadataXml wicri:clean="Springer, Publisher found" wicri:toSee="no header">
<istex:xmlDeclaration>version="1.0" encoding="UTF-8"</istex:xmlDeclaration>
<istex:docType PUBLIC="-//Springer-Verlag//DTD A++ V2.4//EN" URI="http://devel.springer.de/A++/V2.4/DTD/A++V2.4.dtd" name="istex:docType"></istex:docType>
<istex:document>
<Publisher>
<PublisherInfo>
<PublisherName>Springer Berlin Heidelberg</PublisherName>
<PublisherLocation>Berlin, Heidelberg</PublisherLocation>
</PublisherInfo>
<Series>
<SeriesInfo SeriesType="Series" TocLevels="0">
<SeriesID>558</SeriesID>
<SeriesPrintISSN>0302-9743</SeriesPrintISSN>
<SeriesTitle Language="En">Lecture Notes in Computer Science</SeriesTitle>
</SeriesInfo>
<SeriesHeader>
<EditorGroup>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>G.</GivenName>
<FamilyName>Goos</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>J.</GivenName>
<FamilyName>Hartmanis</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>J.</GivenName>
<Particle>van</Particle>
<FamilyName>Leeuwen</FamilyName>
</EditorName>
</Editor>
</EditorGroup>
</SeriesHeader>
<Book Language="En">
<BookInfo BookProductType="Proceedings" Language="En" MediaType="eBook" NumberingStyle="Unnumbered" TocLevels="0">
<BookID>3-540-46148-5</BookID>
<BookTitle>Artificial Intelligence: Methodology, Systems, and Applications</BookTitle>
<BookSubTitle>10th International Conference, AIMSA 2002 Varna, Bulgaria, September 4–6, 2002 Proceedings</BookSubTitle>
<BookVolumeNumber>2443</BookVolumeNumber>
<BookSequenceNumber>2443</BookSequenceNumber>
<BookDOI>10.1007/3-540-46148-5</BookDOI>
<BookTitleID>72465</BookTitleID>
<BookPrintISBN>978-3-540-44127-4</BookPrintISBN>
<BookElectronicISBN>978-3-540-46148-7</BookElectronicISBN>
<BookChapterCount>28</BookChapterCount>
<BookHistory>
<OnlineDate>
<Year>2002</Year>
<Month>8</Month>
<Day>21</Day>
</OnlineDate>
</BookHistory>
<BookCopyright>
<CopyrightHolderName>Springer-Verlag Berlin Heidelberg</CopyrightHolderName>
<CopyrightYear>2002</CopyrightYear>
</BookCopyright>
<BookSubjectGroup>
<BookSubject Code="I" Type="Primary">Computer Science</BookSubject>
<BookSubject Code="I21017" Priority="1" Type="Secondary">Artificial Intelligence (incl. Robotics)</BookSubject>
<SubjectCollection Code="SUCO11645">Computer Science</SubjectCollection>
</BookSubjectGroup>
<BookContext>
<SeriesID>558</SeriesID>
</BookContext>
</BookInfo>
<BookHeader>
<EditorGroup>
<Editor AffiliationIDS="Aff1">
<EditorName DisplayOrder="Western">
<GivenName>Donia</GivenName>
<FamilyName>Scott</FamilyName>
</EditorName>
<Contact>
<Email>donia.scott@itri.bton.ac.uk</Email>
</Contact>
</Editor>
<Affiliation ID="Aff1">
<OrgDivision>ITRI</OrgDivision>
<OrgName>University of Brighton</OrgName>
<OrgAddress>
<Street>Lewes Road</Street>
<Postcode>BN2 4GJ</Postcode>
<City>Brighton</City>
<Country>UK</Country>
</OrgAddress>
</Affiliation>
</EditorGroup>
</BookHeader>
<Chapter ID="Chap27" Language="En">
<ChapterInfo ChapterType="OriginalPaper" ContainsESM="No" Language="En" NumberingStyle="Unnumbered" TocLevels="0">
<ChapterID>27</ChapterID>
<ChapterDOI>10.1007/3-540-46148-5_27</ChapterDOI>
<ChapterSequenceNumber>27</ChapterSequenceNumber>
<ChapterTitle Language="En">Adapting a Robust Multi-genre NE System for Automatic Content Extraction</ChapterTitle>
<ChapterFirstPage>264</ChapterFirstPage>
<ChapterLastPage>273</ChapterLastPage>
<ChapterCopyright>
<CopyrightHolderName>Springer-Verlag Berlin Heidelberg</CopyrightHolderName>
<CopyrightYear>2002</CopyrightYear>
</ChapterCopyright>
<ChapterHistory>
<RegistrationDate>
<Year>2002</Year>
<Month>8</Month>
<Day>20</Day>
</RegistrationDate>
<OnlineDate>
<Year>2002</Year>
<Month>8</Month>
<Day>21</Day>
</OnlineDate>
</ChapterHistory>
<ChapterGrants Type="Regular">
<MetadataGrant Grant="OpenAccess"></MetadataGrant>
<AbstractGrant Grant="OpenAccess"></AbstractGrant>
<BodyPDFGrant Grant="Restricted"></BodyPDFGrant>
<BodyHTMLGrant Grant="Restricted"></BodyHTMLGrant>
<BibliographyGrant Grant="Restricted"></BibliographyGrant>
<ESMGrant Grant="Restricted"></ESMGrant>
</ChapterGrants>
<ChapterContext>
<SeriesID>558</SeriesID>
<BookID>3-540-46148-5</BookID>
<BookTitle>Artificial Intelligence: Methodology, Systems, and Applications</BookTitle>
</ChapterContext>
</ChapterInfo>
<ChapterHeader>
<AuthorGroup>
<Author AffiliationIDS="Aff4">
<AuthorName DisplayOrder="Western">
<GivenName>Diana</GivenName>
<FamilyName>Maynard</FamilyName>
</AuthorName>
<Contact>
<Email>diana@dcs.shef.ac.uk</Email>
<URL>http://nlp.shef.ac.uk</URL>
</Contact>
</Author>
<Author AffiliationIDS="Aff4">
<AuthorName DisplayOrder="Western">
<GivenName>Hamish</GivenName>
<FamilyName>Cunningham</FamilyName>
</AuthorName>
<Contact>
<Email>hamish@dcs.shef.ac.uk</Email>
</Contact>
</Author>
<Author AffiliationIDS="Aff4">
<AuthorName DisplayOrder="Western">
<GivenName>Kalina</GivenName>
<FamilyName>Bontcheva</FamilyName>
</AuthorName>
<Contact>
<Email>kalina@dcs.shef.ac.uk</Email>
</Contact>
</Author>
<Author AffiliationIDS="Aff5">
<AuthorName DisplayOrder="Western">
<GivenName>Marin</GivenName>
<FamilyName>Dimitrov</FamilyName>
</AuthorName>
<Contact>
<Email>marin@sirma.bg</Email>
</Contact>
</Author>
<Affiliation ID="Aff4">
<OrgDivision>Dept of Computer Science</OrgDivision>
<OrgName>University of Sheffield</OrgName>
<OrgAddress>
<Street>211 Portobello St</Street>
<Postcode>S1 4DP</Postcode>
<City>Sheffield</City>
<Country>UK</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff5">
<OrgName>Sirma AI Ltd, Ontotext Lab</OrgName>
<OrgAddress>
<Street>38AHristo Botev Blvd</Street>
<Postcode>1000</Postcode>
<City>Sofia</City>
<Country>Bulgaria</Country>
</OrgAddress>
</Affiliation>
</AuthorGroup>
<Abstract ID="Abs1" Language="En">
<Heading>Abstract</Heading>
<Para>Many current information extraction systems tend to be designed with particular applications and domains in mind. With the increasing need for robust language engineering tools which can handle a variety of language processing demands, we have used the GATE architecture to design MUSE - a system for named entity recognition and related tasks. In this paper, we address the issue of how this general-purpose system can be adapted for particular applications with minimal time and effort, and how the set of resources used can be adapted dynamically and automatically. We focus specifically on the challenges of the ACE (Automatic Content Extraction) entity detection and tracking task, and preliminary results show promising figures.</Para>
</Abstract>
<KeywordGroup Language="En">
<Heading>Keywords</Heading>
<Keyword>information extraction</Keyword>
<Keyword>named entity recognition</Keyword>
<Keyword>robust NLP</Keyword>
</KeywordGroup>
</ChapterHeader>
<NoBody></NoBody>
</Chapter>
</Book>
<SubSeries>
<SubSeriesInfo>
<SubSeriesID>1244</SubSeriesID>
<SubSeriesTitle Language="En">Lecture Notes in Artificial Intelligence</SubSeriesTitle>
<SubSeriesSubTitle Language="En">Subseries of Lecture Notes in Computer Science</SubSeriesSubTitle>
</SubSeriesInfo>
<SubSeriesHeader>
<EditorGroup>
<Editor AffiliationIDS="Aff2">
<EditorName DisplayOrder="Western">
<GivenName>Jaime</GivenName>
<GivenName>G.</GivenName>
<FamilyName>Carbonell</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff3">
<EditorName DisplayOrder="Western">
<GivenName>Jörg</GivenName>
<FamilyName>Siekmann</FamilyName>
</EditorName>
</Editor>
<Affiliation ID="Aff2">
<OrgName>Carnegie Mellon University</OrgName>
<OrgAddress>
<City>Pittsburgh</City>
<State>PA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff3">
<OrgName>University of Saarland</OrgName>
<OrgAddress>
<City>Saarbrücken</City>
<Country>Germany</Country>
</OrgAddress>
</Affiliation>
</EditorGroup>
</SubSeriesHeader>
</SubSeries>
</Series>
</Publisher>
</istex:document>
</istex:metadataXml>
<mods version="3.6">
<titleInfo lang="en">
<title>Adapting a Robust Multi-genre NE System for Automatic Content Extraction</title>
</titleInfo>
<titleInfo type="alternative" contentType="CDATA" lang="en">
<title>Adapting a Robust Multi-genre NE System for Automatic Content Extraction</title>
</titleInfo>
<name type="personal">
<namePart type="given">Diana</namePart>
<namePart type="family">Maynard</namePart>
<affiliation>Dept of Computer Science, University of Sheffield, 211 Portobello St, S1 4DP, Sheffield, UK</affiliation>
<affiliation>E-mail: diana@dcs.shef.ac.uk</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Hamish</namePart>
<namePart type="family">Cunningham</namePart>
<affiliation>Dept of Computer Science, University of Sheffield, 211 Portobello St, S1 4DP, Sheffield, UK</affiliation>
<affiliation>E-mail: hamish@dcs.shef.ac.uk</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Kalina</namePart>
<namePart type="family">Bontcheva</namePart>
<affiliation>Dept of Computer Science, University of Sheffield, 211 Portobello St, S1 4DP, Sheffield, UK</affiliation>
<affiliation>E-mail: kalina@dcs.shef.ac.uk</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Marin</namePart>
<namePart type="family">Dimitrov</namePart>
<affiliation>Sirma AI Ltd, Ontotext Lab, 38AHristo Botev Blvd, 1000, Sofia, Bulgaria</affiliation>
<affiliation>E-mail: marin@sirma.bg</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<typeOfResource>text</typeOfResource>
<genre type="conference [eBooks]" displayLabel="OriginalPaper"></genre>
<originInfo>
<publisher>Springer Berlin Heidelberg</publisher>
<place>
<placeTerm type="text">Berlin, Heidelberg</placeTerm>
</place>
<dateIssued encoding="w3cdtf">2002</dateIssued>
<copyrightDate encoding="w3cdtf">2002</copyrightDate>
</originInfo>
<language>
<languageTerm type="code" authority="rfc3066">en</languageTerm>
<languageTerm type="code" authority="iso639-2b">eng</languageTerm>
</language>
<physicalDescription>
<internetMediaType>text/html</internetMediaType>
</physicalDescription>
<abstract lang="en">Abstract: Many current information extraction systems tend to be designed with particular applications and domains in mind. With the increasing need for robust language engineering tools which can handle a variety of language processing demands, we have used the GATE architecture to design MUSE - a system for named entity recognition and related tasks. In this paper, we address the issue of how this general-purpose system can be adapted for particular applications with minimal time and effort, and how the set of resources used can be adapted dynamically and automatically. We focus specifically on the challenges of the ACE (Automatic Content Extraction) entity detection and tracking task, and preliminary results show promising figures.</abstract>
<relatedItem type="host">
<titleInfo>
<title>Artificial Intelligence: Methodology, Systems, and Applications</title>
<subTitle>10th International Conference, AIMSA 2002 Varna, Bulgaria, September 4–6, 2002 Proceedings</subTitle>
</titleInfo>
<name type="personal">
<namePart type="given">Donia</namePart>
<namePart type="family">Scott</namePart>
<affiliation>ITRI, University of Brighton, Lewes Road, BN2 4GJ, Brighton, UK</affiliation>
<affiliation>E-mail: donia.scott@itri.bton.ac.uk</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<genre type="Book Series" displayLabel="Proceedings"></genre>
<originInfo>
<copyrightDate encoding="w3cdtf">2002</copyrightDate>
<issuance>monographic</issuance>
</originInfo>
<subject>
<genre>Book Subject Collection</genre>
<topic authority="SpringerSubjectCodes" authorityURI="SUCO11645">Computer Science</topic>
</subject>
<subject>
<genre>Book Subject Group</genre>
<topic authority="SpringerSubjectCodes" authorityURI="I">Computer Science</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I21017">Artificial Intelligence (incl. Robotics)</topic>
</subject>
<identifier type="DOI">10.1007/3-540-46148-5</identifier>
<identifier type="ISBN">978-3-540-44127-4</identifier>
<identifier type="eISBN">978-3-540-46148-7</identifier>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="BookTitleID">72465</identifier>
<identifier type="BookID">3-540-46148-5</identifier>
<identifier type="BookChapterCount">28</identifier>
<identifier type="BookVolumeNumber">2443</identifier>
<identifier type="BookSequenceNumber">2443</identifier>
<part>
<date>2002</date>
<detail type="volume">
<number>2443</number>
<caption>vol.</caption>
</detail>
<extent unit="pages">
<start>264</start>
<end>273</end>
</extent>
</part>
<recordInfo>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2002</recordOrigin>
</recordInfo>
</relatedItem>
<relatedItem type="series">
<titleInfo>
<title>Lecture Notes in Computer Science</title>
</titleInfo>
<name type="personal">
<namePart type="given">G.</namePart>
<namePart type="family">Goos</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">J.</namePart>
<namePart type="family">Hartmanis</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">J.</namePart>
<namePart type="family">van Leeuwen</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<copyrightDate encoding="w3cdtf">2002</copyrightDate>
<issuance>serial</issuance>
</originInfo>
<relatedItem type="constituent">
<titleInfo>
<title>Lecture Notes in Artificial Intelligence</title>
<subTitle>Subseries of Lecture Notes in Computer Science</subTitle>
</titleInfo>
<name type="personal">
<namePart type="given">G.</namePart>
<namePart type="family">Goos</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">J.</namePart>
<namePart type="family">Hartmanis</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">J.</namePart>
<namePart type="family">van Leeuwen</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Donia</namePart>
<namePart type="family">Scott</namePart>
<affiliation>ITRI, University of Brighton, Lewes Road, BN2 4GJ, Brighton, UK</affiliation>
<affiliation>E-mail: donia.scott@itri.bton.ac.uk</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Jaime</namePart>
<namePart type="given">G.</namePart>
<namePart type="family">Carbonell</namePart>
<affiliation>Carnegie Mellon University, Pittsburgh, PA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Jörg</namePart>
<namePart type="family">Siekmann</namePart>
<affiliation>University of Saarland, Saarbrücken, Germany</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<genre type="Sub-Series"></genre>
<identifier type="SubSeriesID">1244</identifier>
</relatedItem>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="SeriesID">558</identifier>
<recordInfo>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2002</recordOrigin>
</recordInfo>
</relatedItem>
<identifier type="istex">50643758F6A5345504D4B37A8BBA39C828D900BF</identifier>
<identifier type="DOI">10.1007/3-540-46148-5_27</identifier>
<identifier type="ChapterID">27</identifier>
<identifier type="ChapterID">Chap27</identifier>
<accessCondition type="use and reproduction" contentType="copyright">Springer-Verlag Berlin Heidelberg, 2002</accessCondition>
<recordInfo>
<recordContentSource>SPRINGER</recordContentSource>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2002</recordOrigin>
</recordInfo>
</mods>
</metadata>
<enrichments>
<istex:refBibTEI uri="https://api.istex.fr/document/50643758F6A5345504D4B37A8BBA39C828D900BF/enrichments/refBib">
<teiHeader></teiHeader>
<text>
<front></front>
<body></body>
<back>
<listBibl>
<biblStruct xml:id="b0">
<analytic>
<title level="a" type="main">An Introduction to Information Extraction</title>
<author>
<persName>
<forename type="first">D</forename>
<surname>Appelt</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">Artificial Intelligence Communications</title>
<imprint>
<biblScope unit="volume">12</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="161" to="172"></biblScope>
<date type="published" when="1999"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b1">
<monogr>
<title level="m" type="main">Using human language technology for automatic annotation and indexing of digital library content</title>
<author>
<persName>
<forename type="first">K</forename>
<surname>Bontcheva</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<surname>Maynard</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">H</forename>
<surname>Saggion</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">H</forename>
<surname>Cunningham</surname>
</persName>
</author>
<imprint>
<date type="published" when="2002"></date>
</imprint>
</monogr>
<note>In. submitted to European Conference on Digital Libraries</note>
</biblStruct>
<biblStruct xml:id="b2">
<analytic>
<title></title>
<author>
<persName>
<forename type="first">J</forename>
<surname>Cowie</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">L</forename>
<surname>Guthrie</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">W</forename>
<surname>Jin</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">W</forename>
<surname>Odgen</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">J</forename>
<surname>Pustejowsky</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">R</forename>
<surname>Wanf</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">T</forename>
<surname>Wakao</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">S</forename>
<surname>Waterman</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">Y</forename>
<surname>Wilks</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">The Diderot System. In Proceedings of Tipster Text Program</title>
<meeting>
<address>
<addrLine>California</addrLine>
</address>
</meeting>
<imprint>
<publisher>Morgan Kaufmann</publisher>
<date type="published" when="1993"></date>
</imprint>
</monogr>
<note>Phase. I</note>
</biblStruct>
<biblStruct xml:id="b3">
<analytic>
<title level="a" type="main">Information Extraction</title>
<author>
<persName>
<forename type="first">J</forename>
<surname>Cowie</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">W</forename>
<surname>Lehnert</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">Communications of the ACM</title>
<imprint>
<biblScope unit="volume">39</biblScope>
<biblScope unit="issue">1</biblScope>
<biblScope unit="page" from="80" to="91"></biblScope>
<date type="published" when="1996"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b4">
<monogr>
<title level="m" type="main">Information Extraction: a User Guide (revised version)</title>
<author>
<persName>
<forename type="first">H</forename>
<surname>Cunningham</surname>
</persName>
</author>
<imprint>
<date type="published" when="1999-03"></date>
<biblScope unit="page" from="99" to="07"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b5">
<analytic>
<title level="a" type="main">GATE, a General Architecture for Text Engineering</title>
<author>
<persName>
<forename type="first">H</forename>
<surname>Cunningham</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">Computers and the Humanities</title>
<imprint>
<biblScope unit="volume">36</biblScope>
<biblScope unit="page" from="223" to="254"></biblScope>
<date type="published" when="2002"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b6">
<analytic>
<title level="a" type="main">GATE: A framework and graphical development environment for robust NLP tools and applications</title>
<author>
<persName>
<forename type="first">H</forename>
<surname>Cunningham</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<surname>Maynard</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">K</forename>
<surname>Bontcheva</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">V</forename>
<surname>Tablan</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of the 40th Anniversary Meeting of the Association for Computational Linguistics</title>
<meeting>the 40th Anniversary Meeting of the Association for Computational Linguistics</meeting>
<imprint>
<date type="published" when="2002"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b7">
<analytic>
<title></title>
<author>
<persName>
<forename type="first">H</forename>
<surname>Cunningham</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<surname>Maynard</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">K</forename>
<surname>Bontcheva</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">V</forename>
<surname>Tablan</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">C</forename>
<surname>Ursu</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">The GATE User Guide</title>
<imprint>
<date type="published" when="2002"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b8">
<analytic>
<title level="a" type="main">A Light-weight Approach to Coreference Resolution for Named Entities in Text</title>
<author>
<persName>
<forename type="first">M</forename>
<surname>Dimitrov</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Bulgaria</title>
<imprint>
<publisher>MSc Thesis, University of Sofia</publisher>
<date type="published" when="2002"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b9">
<analytic>
<title level="a" type="main">Nist's 1998 topic detection and tracking evaluation (tdt2)</title>
<author>
<persName>
<forename type="first">Jonathan</forename>
<forename type="middle">G</forename>
<surname>Fiscus</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">George</forename>
<surname>Doddington</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">John</forename>
<forename type="middle">S</forename>
<surname>Garofolo</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">Alvin</forename>
<surname>Martin</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proc. of the DARPA Broadcast News Workshop</title>
<meeting>. of the DARPA Broadcast News Workshop
<address>
<addrLine>Virginia, US</addrLine>
</address>
</meeting>
<imprint>
<date type="published" when="1998"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b10">
<monogr>
<title level="m" type="main">Information retrieval, data structures and algorithms</title>
<author>
<persName>
<forename type="first">W</forename>
<forename type="middle">B</forename>
<surname>Frakes</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">R</forename>
<surname>Baeza-Yates</surname>
</persName>
</author>
<imprint>
<date type="published" when="1992"></date>
<publisher>Prentice Hall</publisher>
<pubPlace>New York, Englewood Cliffs, N.J.</pubPlace>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b11">
<monogr>
<title level="m" type="main">Named Entity Recognition in Romanian</title>
<author>
<persName>
<forename type="first">O</forename>
<surname>Hamza</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">V</forename>
<surname>Tablan</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<surname>Maynard</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">C</forename>
<surname>Ursu</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">H</forename>
<surname>Cunningham</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">Y</forename>
<surname>Wilks</surname>
</persName>
</author>
<imprint>
<date type="published" when="2002"></date>
</imprint>
</monogr>
<note>Forthcoming</note>
</biblStruct>
<biblStruct xml:id="b12">
<analytic>
<title></title>
<author>
<persName>
<forename type="first">C</forename>
<forename type="middle">D</forename>
<surname>Manning</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">H</forename>
<surname>Schütze</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">Foundations of Statistical Natural Language Processing</title>
<imprint>
<publisher>MIT press</publisher>
<publisher>MIT press</publisher>
<date type="published" when="1999"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b13">
<analytic>
<title level="a" type="main">Named Entity Recognition from Diverse Text Types</title>
<author>
<persName>
<forename type="first">D</forename>
<surname>Maynard</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">V</forename>
<surname>Tablan</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">C</forename>
<surname>Ursu</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">H</forename>
<surname>Cunningham</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">Y</forename>
<surname>Wilks</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Recent Advances in Natural Language Processing 2001 Conference</title>
<meeting>
<address>
<addrLine>Tzigov Chark, Bulgaria</addrLine>
</address>
</meeting>
<imprint>
<date type="published" when="2001"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b14">
<analytic>
<title level="a" type="main">Using a text engineering framework to build an extendable and portable IE-based summarisation system</title>
<author>
<persName>
<forename type="first">Diana</forename>
<surname>Maynard</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">Kalina</forename>
<surname>Bontcheva</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">Horacio</forename>
<surname>Saggion</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">Hamish</forename>
<surname>Cunningham</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">Oana</forename>
<surname>Hamza</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of the ACL Workshop on Text Summarisation</title>
<meeting>the ACL Workshop on Text Summarisation</meeting>
<imprint>
<date type="published" when="2002"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b15">
<analytic>
<title level="a" type="main">Cost-benefit methodology for office systems</title>
<author>
<persName>
<forename type="first">Peter</forename>
<surname>Sassone</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">ACM Transactions on Office Information Systems</title>
<imprint>
<biblScope unit="volume">5</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="273" to="289"></biblScope>
<date type="published" when="1987"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b16">
<analytic>
<title level="a" type="main">Learning to extract text-based information from the world wide web</title>
<author>
<persName>
<forename type="first">S</forename>
<surname>Soderland</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of Third International Conference on Knowledge Discovery and Data Mining (KDD-97)</title>
<meeting>Third International Conference on Knowledge Discovery and Data Mining (KDD-97)</meeting>
<imprint>
<date type="published" when="1997"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b17">
<analytic>
<title></title>
</analytic>
<monogr>
<title level="m">Proceedings of the Sixth Message Understanding ConferenceMUC-6)</title>
<editor>Beth Sundheim</editor>
<meeting>the Sixth Message Understanding ConferenceMUC-6)
<address>
<addrLine>Columbia, MD</addrLine>
</address>
</meeting>
<imprint>
<publisher>ARPA Morgan Kaufmann</publisher>
<date type="published" when="1995"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b18">
<analytic>
<title level="a" type="main">An evaluation of statistical approaches to text categorization</title>
<author>
<persName>
<forename type="first">Yiming</forename>
<surname>Yang</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">Journal of Information Retrieval</title>
<imprint>
<biblScope unit="volume">1</biblScope>
<biblScope unit="page" from="67" to="88"></biblScope>
<date type="published" when="1998"></date>
</imprint>
</monogr>
</biblStruct>
</listBibl>
</back>
</text>
</istex:refBibTEI>
</enrichments>
</istex>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Istex/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000860 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Istex/Corpus/biblio.hfd -nk 000860 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Istex
   |étape=   Corpus
   |type=    RBID
   |clé=     ISTEX:50643758F6A5345504D4B37A8BBA39C828D900BF
   |texte=   Adapting a Robust Multi-genre NE System for Automatic Content Extraction
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024