OCR exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

A Paragraph Boundary Detection System

Internal identifier: 003181 (Istex/Corpus); previous: 003180; next: 003182

Authors: Dmitriy Genzel

Source:

RBID : ISTEX:E721D32157443575D3AA6F2BFAA2655A85232278

Abstract

We propose and motivate a novel task: paragraph segmentation. We discuss and compare this task with text segmentation and discourse parsing. We present a system that performs the task with high accuracy. A variety of features is proposed and examined in detail. The best models turn out to include lexical, coherence, and structural features.

Url:
DOI: 10.1007/978-3-540-30586-6_92

Links to Exploration step

ISTEX:E721D32157443575D3AA6F2BFAA2655A85232278

The document in XML format

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">A Paragraph Boundary Detection System</title>
<author>
<name sortKey="Genzel, Dmitriy" sort="Genzel, Dmitriy" uniqKey="Genzel D" first="Dmitriy" last="Genzel">Dmitriy Genzel</name>
<affiliation>
<mods:affiliation>Department of Computer Science, Brown University, Box 1910, 02912, Providence, RI, USA</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: dg@cs.brown.edu</mods:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:E721D32157443575D3AA6F2BFAA2655A85232278</idno>
<date when="2005" year="2005">2005</date>
<idno type="doi">10.1007/978-3-540-30586-6_92</idno>
<idno type="url">https://api.istex.fr/document/E721D32157443575D3AA6F2BFAA2655A85232278/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">003181</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">A Paragraph Boundary Detection System</title>
<author>
<name sortKey="Genzel, Dmitriy" sort="Genzel, Dmitriy" uniqKey="Genzel D" first="Dmitriy" last="Genzel">Dmitriy Genzel</name>
<affiliation>
<mods:affiliation>Department of Computer Science, Brown University, Box 1910, 02912, Providence, RI, USA</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: dg@cs.brown.edu</mods:affiliation>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2005</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
</series>
<idno type="istex">E721D32157443575D3AA6F2BFAA2655A85232278</idno>
<idno type="DOI">10.1007/978-3-540-30586-6_92</idno>
<idno type="ChapterID">92</idno>
<idno type="ChapterID">Chap92</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: We propose and motivate a novel task: paragraph segmentation. We discuss and compare this task with text segmentation and discourse parsing. We present a system that performs the task with high accuracy. A variety of features is proposed and examined in detail. The best models turn out to include lexical, coherence, and structural features.</div>
</front>
</TEI>
<istex>
<corpusName>springer</corpusName>
<author>
<json:item>
<name>Dmitriy Genzel</name>
<affiliations>
<json:string>Department of Computer Science, Brown University, Box 1910, 02912, Providence, RI, USA</json:string>
<json:string>E-mail: dg@cs.brown.edu</json:string>
</affiliations>
</json:item>
</author>
<language>
<json:string>eng</json:string>
</language>
<abstract>Abstract: We propose and motivate a novel task: paragraph segmentation. We discuss and compare this task with text segmentation and discourse parsing. We present a system that performs the task with high accuracy. A variety of features is proposed and examined in detail. The best models turn out to include lexical, coherence, and structural features.</abstract>
<qualityIndicators>
<score>4.152</score>
<pdfVersion>1.3</pdfVersion>
<pdfPageSize>430 x 660 pts</pdfPageSize>
<refBibsNative>false</refBibsNative>
<keywordCount>0</keywordCount>
<abstractCharCount>352</abstractCharCount>
<pdfWordCount>3492</pdfWordCount>
<pdfCharCount>20887</pdfCharCount>
<pdfPageCount>11</pdfPageCount>
<abstractWordCount>55</abstractWordCount>
</qualityIndicators>
<title>A Paragraph Boundary Detection System</title>
<chapterId>
<json:string>92</json:string>
<json:string>Chap92</json:string>
</chapterId>
<serie>
<editor>
<json:item>
<name>David Hutchison</name>
<affiliations>
<json:string>Lancaster University, UK</json:string>
</affiliations>
</json:item>
<json:item>
<name>Takeo Kanade</name>
<affiliations>
<json:string>Carnegie Mellon University, Pittsburgh, PA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Josef Kittler</name>
<affiliations>
<json:string>University of Surrey, Guildford, UK</json:string>
</affiliations>
</json:item>
<json:item>
<name>Jon M. Kleinberg</name>
<affiliations>
<json:string>Cornell University, Ithaca, NY, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Friedemann Mattern</name>
<affiliations>
<json:string>ETH Zurich, Switzerland</json:string>
</affiliations>
</json:item>
<json:item>
<name>John C. Mitchell</name>
<affiliations>
<json:string>Stanford University, CA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Moni Naor</name>
<affiliations>
<json:string>Weizmann Institute of Science, Rehovot, Israel</json:string>
</affiliations>
</json:item>
<json:item>
<name>Oscar Nierstrasz</name>
<affiliations>
<json:string>University of Bern, Switzerland</json:string>
</affiliations>
</json:item>
<json:item>
<name>C. Pandu Rangan</name>
<affiliations>
<json:string>Indian Institute of Technology, Madras, India</json:string>
</affiliations>
</json:item>
<json:item>
<name>Bernhard Steffen</name>
<affiliations>
<json:string>University of Dortmund, Germany</json:string>
</affiliations>
</json:item>
<json:item>
<name>Madhu Sudan</name>
<affiliations>
<json:string>Massachusetts Institute of Technology, MA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Demetri Terzopoulos</name>
<affiliations>
<json:string>New York University, NY, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Doug Tygar</name>
<affiliations>
<json:string>University of California, Berkeley, CA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Moshe Y. Vardi</name>
<affiliations>
<json:string>Rice University, Houston, TX, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Gerhard Weikum</name>
<affiliations>
<json:string>Max-Planck Institute of Computer Science, Saarbruecken, Germany</json:string>
</affiliations>
</json:item>
</editor>
<issn>
<json:string>0302-9743</json:string>
</issn>
<language>
<json:string>unknown</json:string>
</language>
<eissn>
<json:string>1611-3349</json:string>
</eissn>
<title>Lecture Notes in Computer Science</title>
<copyrightDate>2005</copyrightDate>
</serie>
<host>
<editor>
<json:item>
<name>Alexander Gelbukh</name>
<affiliations>
<json:string>National Polytechnic Institute, Center for Computing Research, 07738, Mexico City, México</json:string>
<json:string>E-mail: gelbukh@gelbukh.com</json:string>
</affiliations>
</json:item>
</editor>
<subject>
<json:item>
<value>Computer Science</value>
</json:item>
<json:item>
<value>Computer Science</value>
</json:item>
<json:item>
<value>Information Storage and Retrieval</value>
</json:item>
<json:item>
<value>Artificial Intelligence (incl. Robotics)</value>
</json:item>
<json:item>
<value>Language Translation and Linguistics</value>
</json:item>
<json:item>
<value>Mathematical Logic and Formal Languages</value>
</json:item>
</subject>
<isbn>
<json:string>978-3-540-24523-0</json:string>
</isbn>
<language>
<json:string>unknown</json:string>
</language>
<eissn>
<json:string>1611-3349</json:string>
</eissn>
<title>Computational Linguistics and Intelligent Text Processing</title>
<genre.original>
<json:string>Proceedings</json:string>
</genre.original>
<bookId>
<json:string>978-3-540-30586-6</json:string>
</bookId>
<volume>3406</volume>
<pages>
<last>826</last>
<first>816</first>
</pages>
<issn>
<json:string>0302-9743</json:string>
</issn>
<genre>
<json:string>Book Series</json:string>
</genre>
<eisbn>
<json:string>978-3-540-30586-6</json:string>
</eisbn>
<copyrightDate>2005</copyrightDate>
<doi>
<json:string>10.1007/b105772</json:string>
</doi>
</host>
<publicationDate>2005</publicationDate>
<copyrightDate>2005</copyrightDate>
<doi>
<json:string>10.1007/978-3-540-30586-6_92</json:string>
</doi>
<id>E721D32157443575D3AA6F2BFAA2655A85232278</id>
<fulltext>
<json:item>
<original>true</original>
<mimetype>application/pdf</mimetype>
<extension>pdf</extension>
<uri>https://api.istex.fr/document/E721D32157443575D3AA6F2BFAA2655A85232278/fulltext/pdf</uri>
</json:item>
<json:item>
<original>false</original>
<mimetype>application/zip</mimetype>
<extension>zip</extension>
<uri>https://api.istex.fr/document/E721D32157443575D3AA6F2BFAA2655A85232278/fulltext/zip</uri>
</json:item>
<istex:fulltextTEI uri="https://api.istex.fr/document/E721D32157443575D3AA6F2BFAA2655A85232278/fulltext/tei">
<teiHeader>
<fileDesc>
<titleStmt>
<title level="a" type="main" xml:lang="en">A Paragraph Boundary Detection System</title>
<respStmt xml:id="ISTEX-API" resp="Références bibliographiques récupérées via GROBID" name="ISTEX-API (INIST-CNRS)"></respStmt>
</titleStmt>
<publicationStmt>
<authority>ISTEX</authority>
<publisher>Springer Berlin Heidelberg</publisher>
<pubPlace>Berlin, Heidelberg</pubPlace>
<availability>
<p>SPRINGER</p>
</availability>
<date>2005</date>
</publicationStmt>
<sourceDesc>
<biblStruct type="inbook">
<analytic>
<title level="a" type="main" xml:lang="en">A Paragraph Boundary Detection System</title>
<author>
<persName>
<forename type="first">Dmitriy</forename>
<surname>Genzel</surname>
</persName>
<email>dg@cs.brown.edu</email>
<affiliation>Department of Computer Science, Brown University, Box 1910, 02912, Providence, RI, USA</affiliation>
</author>
</analytic>
<monogr>
<title level="m">Computational Linguistics and Intelligent Text Processing</title>
<title level="m" type="sub">6th International Conference, CICLing 2005, Mexico City, Mexico, February 13-19, 2005. Proceedings</title>
<idno type="pISBN">978-3-540-24523-0</idno>
<idno type="eISBN">978-3-540-30586-6</idno>
<idno type="pISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="DOI">10.1007/b105772</idno>
<idno type="BookID">978-3-540-30586-6</idno>
<idno type="BookTitleID">116551</idno>
<idno type="BookSequenceNumber">3406</idno>
<idno type="BookVolumeNumber">3406</idno>
<idno type="BookChapterCount">92</idno>
<editor>
<persName>
<forename type="first">Alexander</forename>
<surname>Gelbukh</surname>
</persName>
<email>gelbukh@gelbukh.com</email>
<affiliation>National Polytechnic Institute, Center for Computing Research, 07738, Mexico City, México</affiliation>
</editor>
<imprint>
<publisher>Springer Berlin Heidelberg</publisher>
<pubPlace>Berlin, Heidelberg</pubPlace>
<date type="published" when="2005"></date>
<biblScope unit="volume">3406</biblScope>
<biblScope unit="page" from="816">816</biblScope>
<biblScope unit="page" to="826">826</biblScope>
</imprint>
</monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<editor>
<persName>
<forename type="first">David</forename>
<surname>Hutchison</surname>
</persName>
<affiliation>Lancaster University, UK</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Takeo</forename>
<surname>Kanade</surname>
</persName>
<affiliation>Carnegie Mellon University, Pittsburgh, PA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Josef</forename>
<surname>Kittler</surname>
</persName>
<affiliation>University of Surrey, Guildford, UK</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Jon</forename>
<forename type="first">M.</forename>
<surname>Kleinberg</surname>
</persName>
<affiliation>Cornell University, Ithaca, NY, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Friedemann</forename>
<surname>Mattern</surname>
</persName>
<affiliation>ETH Zurich, Switzerland</affiliation>
</editor>
<editor>
<persName>
<forename type="first">John</forename>
<forename type="first">C.</forename>
<surname>Mitchell</surname>
</persName>
<affiliation>Stanford University, CA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Moni</forename>
<surname>Naor</surname>
</persName>
<affiliation>Weizmann Institute of Science, Rehovot, Israel</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Oscar</forename>
<surname>Nierstrasz</surname>
</persName>
<affiliation>University of Bern, Switzerland</affiliation>
</editor>
<editor>
<persName>
<forename type="first">C.</forename>
<surname>Pandu Rangan</surname>
</persName>
<affiliation>Indian Institute of Technology, Madras, India</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Bernhard</forename>
<surname>Steffen</surname>
</persName>
<affiliation>University of Dortmund, Germany</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Madhu</forename>
<surname>Sudan</surname>
</persName>
<affiliation>Massachusetts Institute of Technology, MA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Demetri</forename>
<surname>Terzopoulos</surname>
</persName>
<affiliation>New York University, NY, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Doug</forename>
<surname>Tygar</surname>
</persName>
<affiliation>University of California, Berkeley, CA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Moshe</forename>
<forename type="first">Y.</forename>
<surname>Vardi</surname>
</persName>
<affiliation>Rice University, Houston, TX, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Gerhard</forename>
<surname>Weikum</surname>
</persName>
<affiliation>Max-Planck Institute of Computer Science, Saarbruecken, Germany</affiliation>
</editor>
<biblScope>
<date>2005</date>
</biblScope>
<idno type="pISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="seriesId">558</idno>
</series>
<idno type="istex">E721D32157443575D3AA6F2BFAA2655A85232278</idno>
<idno type="DOI">10.1007/978-3-540-30586-6_92</idno>
<idno type="ChapterID">92</idno>
<idno type="ChapterID">Chap92</idno>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<creation>
<date>2005</date>
</creation>
<langUsage>
<language ident="en">en</language>
</langUsage>
<abstract xml:lang="en">
<p>Abstract: We propose and motivate a novel task: paragraph segmentation. We discuss and compare this task with text segmentation and discourse parsing. We present a system that performs the task with high accuracy. A variety of features is proposed and examined in detail. The best models turn out to include lexical, coherence, and structural features.</p>
</abstract>
<textClass>
<keywords scheme="Book Subject Collection">
<list>
<label>SUCO11645</label>
<item>
<term>Computer Science</term>
</item>
</list>
</keywords>
</textClass>
<textClass>
<keywords scheme="Book Subject Group">
<list>
<label>I</label>
<label>I18032</label>
<label>I21017</label>
<label>I21041</label>
<label>I16048</label>
<item>
<term>Computer Science</term>
</item>
<item>
<term>Information Storage and Retrieval</term>
</item>
<item>
<term>Artificial Intelligence (incl. Robotics)</term>
</item>
<item>
<term>Language Translation and Linguistics</term>
</item>
<item>
<term>Mathematical Logic and Formal Languages</term>
</item>
</list>
</keywords>
</textClass>
</profileDesc>
<revisionDesc>
<change when="2005">Published</change>
<change xml:id="refBibs-istex" who="#ISTEX-API" when="2016-3-20">References added</change>
</revisionDesc>
</teiHeader>
</istex:fulltextTEI>
<json:item>
<original>false</original>
<mimetype>text/plain</mimetype>
<extension>txt</extension>
<uri>https://api.istex.fr/document/E721D32157443575D3AA6F2BFAA2655A85232278/fulltext/txt</uri>
</json:item>
</fulltext>
<metadata>
<istex:metadataXml wicri:clean="Springer, Publisher found" wicri:toSee="no header">
<istex:xmlDeclaration>version="1.0" encoding="UTF-8"</istex:xmlDeclaration>
<istex:docType PUBLIC="-//Springer-Verlag//DTD A++ V2.4//EN" URI="http://devel.springer.de/A++/V2.4/DTD/A++V2.4.dtd" name="istex:docType"></istex:docType>
<istex:document>
<Publisher>
<PublisherInfo>
<PublisherName>Springer Berlin Heidelberg</PublisherName>
<PublisherLocation>Berlin, Heidelberg</PublisherLocation>
</PublisherInfo>
<Series>
<SeriesInfo SeriesType="Series" TocLevels="0">
<SeriesID>558</SeriesID>
<SeriesPrintISSN>0302-9743</SeriesPrintISSN>
<SeriesElectronicISSN>1611-3349</SeriesElectronicISSN>
<SeriesTitle Language="En">Lecture Notes in Computer Science</SeriesTitle>
</SeriesInfo>
<SeriesHeader>
<EditorGroup>
<Editor AffiliationIDS="Aff1">
<EditorName DisplayOrder="Western">
<GivenName>David</GivenName>
<FamilyName>Hutchison</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff2">
<EditorName DisplayOrder="Western">
<GivenName>Takeo</GivenName>
<FamilyName>Kanade</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff3">
<EditorName DisplayOrder="Western">
<GivenName>Josef</GivenName>
<FamilyName>Kittler</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff4">
<EditorName DisplayOrder="Western">
<GivenName>Jon</GivenName>
<GivenName>M.</GivenName>
<FamilyName>Kleinberg</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff5">
<EditorName DisplayOrder="Western">
<GivenName>Friedemann</GivenName>
<FamilyName>Mattern</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff6">
<EditorName DisplayOrder="Western">
<GivenName>John</GivenName>
<GivenName>C.</GivenName>
<FamilyName>Mitchell</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff7">
<EditorName DisplayOrder="Western">
<GivenName>Moni</GivenName>
<FamilyName>Naor</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff8">
<EditorName DisplayOrder="Western">
<GivenName>Oscar</GivenName>
<FamilyName>Nierstrasz</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff9">
<EditorName DisplayOrder="Western">
<GivenName>C.</GivenName>
<FamilyName>Pandu Rangan</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff10">
<EditorName DisplayOrder="Western">
<GivenName>Bernhard</GivenName>
<FamilyName>Steffen</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff11">
<EditorName DisplayOrder="Western">
<GivenName>Madhu</GivenName>
<FamilyName>Sudan</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff12">
<EditorName DisplayOrder="Western">
<GivenName>Demetri</GivenName>
<FamilyName>Terzopoulos</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff13">
<EditorName DisplayOrder="Western">
<GivenName>Doug</GivenName>
<FamilyName>Tygar</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff14">
<EditorName DisplayOrder="Western">
<GivenName>Moshe</GivenName>
<GivenName>Y.</GivenName>
<FamilyName>Vardi</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff15">
<EditorName DisplayOrder="Western">
<GivenName>Gerhard</GivenName>
<FamilyName>Weikum</FamilyName>
</EditorName>
</Editor>
<Affiliation ID="Aff1">
<OrgName>Lancaster University</OrgName>
<OrgAddress>
<Country>UK</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff2">
<OrgName>Carnegie Mellon University</OrgName>
<OrgAddress>
<City>Pittsburgh</City>
<State>PA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff3">
<OrgName>University of Surrey</OrgName>
<OrgAddress>
<City>Guildford</City>
<Country>UK</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff4">
<OrgName>Cornell University</OrgName>
<OrgAddress>
<City>Ithaca</City>
<State>NY</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff5">
<OrgName>ETH Zurich</OrgName>
<OrgAddress>
<Country>Switzerland</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff6">
<OrgName>Stanford University</OrgName>
<OrgAddress>
<State>CA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff7">
<OrgName>Weizmann Institute of Science</OrgName>
<OrgAddress>
<City>Rehovot</City>
<Country>Israel</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff8">
<OrgName>University of Bern</OrgName>
<OrgAddress>
<Country>Switzerland</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff9">
<OrgName>Indian Institute of Technology</OrgName>
<OrgAddress>
<City>Madras</City>
<Country>India</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff10">
<OrgName>University of Dortmund</OrgName>
<OrgAddress>
<Country>Germany</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff11">
<OrgName>Massachusetts Institute of Technology</OrgName>
<OrgAddress>
<State>MA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff12">
<OrgName>New York University</OrgName>
<OrgAddress>
<State>NY</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff13">
<OrgName>University of California</OrgName>
<OrgAddress>
<City>Berkeley</City>
<State>CA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff14">
<OrgName>Rice University</OrgName>
<OrgAddress>
<City>Houston</City>
<State>TX</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff15">
<OrgName>Max-Planck Institute of Computer Science</OrgName>
<OrgAddress>
<City>Saarbruecken</City>
<Country>Germany</Country>
</OrgAddress>
</Affiliation>
</EditorGroup>
</SeriesHeader>
<Book Language="En">
<BookInfo BookProductType="Proceedings" ContainsESM="No" Language="En" MediaType="eBook" NumberingDepth="2" NumberingStyle="ContentOnly" OutputMedium="All" TocLevels="0">
<BookID>978-3-540-30586-6</BookID>
<BookTitle>Computational Linguistics and Intelligent Text Processing</BookTitle>
<BookSubTitle>6th International Conference, CICLing 2005, Mexico City, Mexico, February 13-19, 2005. Proceedings</BookSubTitle>
<BookVolumeNumber>3406</BookVolumeNumber>
<BookSequenceNumber>3406</BookSequenceNumber>
<BookDOI>10.1007/b105772</BookDOI>
<BookTitleID>116551</BookTitleID>
<BookPrintISBN>978-3-540-24523-0</BookPrintISBN>
<BookElectronicISBN>978-3-540-30586-6</BookElectronicISBN>
<BookChapterCount>92</BookChapterCount>
<BookCopyright>
<CopyrightHolderName>Springer-Verlag Berlin Heidelberg</CopyrightHolderName>
<CopyrightYear>2005</CopyrightYear>
</BookCopyright>
<BookSubjectGroup>
<BookSubject Code="I" Type="Primary">Computer Science</BookSubject>
<BookSubject Code="I18032" Priority="1" Type="Secondary">Information Storage and Retrieval</BookSubject>
<BookSubject Code="I21017" Priority="2" Type="Secondary">Artificial Intelligence (incl. Robotics)</BookSubject>
<BookSubject Code="I21041" Priority="3" Type="Secondary">Language Translation and Linguistics</BookSubject>
<BookSubject Code="I16048" Priority="4" Type="Secondary">Mathematical Logic and Formal Languages</BookSubject>
<SubjectCollection Code="SUCO11645">Computer Science</SubjectCollection>
</BookSubjectGroup>
<BookContext>
<SeriesID>558</SeriesID>
</BookContext>
</BookInfo>
<BookHeader>
<EditorGroup>
<Editor AffiliationIDS="Aff16">
<EditorName DisplayOrder="Western">
<GivenName>Alexander</GivenName>
<FamilyName>Gelbukh</FamilyName>
</EditorName>
<Contact>
<Email>gelbukh@gelbukh.com</Email>
</Contact>
</Editor>
<Affiliation ID="Aff16">
<OrgDivision>National Polytechnic Institute</OrgDivision>
<OrgName>Center for Computing Research</OrgName>
<OrgAddress>
<Postcode>07738</Postcode>
<City>Mexico City</City>
<Country>México</Country>
</OrgAddress>
</Affiliation>
</EditorGroup>
</BookHeader>
<Part ID="Part2">
<PartInfo TocLevels="0">
<PartID>2</PartID>
<PartSequenceNumber>2</PartSequenceNumber>
<PartTitle>Intelligent Text Processing Applications</PartTitle>
<PartChapterCount>40</PartChapterCount>
<PartContext>
<SeriesID>558</SeriesID>
<BookTitle>Computational Linguistics and Intelligent Text Processing</BookTitle>
</PartContext>
</PartInfo>
<SubPart ID="SubPart19">
<SubPartInfo>
<SubPartID>19</SubPartID>
<SubPartSequenceNumber>19</SubPartSequenceNumber>
<SubPartTitle>Spelling and Style Checking</SubPartTitle>
<SubPartChapterCount>3</SubPartChapterCount>
</SubPartInfo>
<Chapter ID="Chap92" Language="En">
<ChapterInfo ChapterType="OriginalPaper" ContainsESM="No" NumberingDepth="2" NumberingStyle="ContentOnly" TocLevels="0">
<ChapterID>92</ChapterID>
<ChapterDOI>10.1007/978-3-540-30586-6_92</ChapterDOI>
<ChapterSequenceNumber>92</ChapterSequenceNumber>
<ChapterTitle Language="En">A Paragraph Boundary Detection System</ChapterTitle>
<ChapterFirstPage>816</ChapterFirstPage>
<ChapterLastPage>826</ChapterLastPage>
<ChapterCopyright>
<CopyrightHolderName>Springer-Verlag Berlin Heidelberg</CopyrightHolderName>
<CopyrightYear>2005</CopyrightYear>
</ChapterCopyright>
<ChapterGrants Type="Regular">
<MetadataGrant Grant="OpenAccess"></MetadataGrant>
<AbstractGrant Grant="OpenAccess"></AbstractGrant>
<BodyPDFGrant Grant="Restricted"></BodyPDFGrant>
<BodyHTMLGrant Grant="Restricted"></BodyHTMLGrant>
<BibliographyGrant Grant="Restricted"></BibliographyGrant>
<ESMGrant Grant="Restricted"></ESMGrant>
</ChapterGrants>
<ChapterContext>
<SeriesID>558</SeriesID>
<PartID>2</PartID>
<BookID>978-3-540-30586-6</BookID>
<BookTitle>Computational Linguistics and Intelligent Text Processing</BookTitle>
</ChapterContext>
</ChapterInfo>
<ChapterHeader>
<AuthorGroup>
<Author AffiliationIDS="Aff17">
<AuthorName DisplayOrder="Western">
<GivenName>Dmitriy</GivenName>
<FamilyName>Genzel</FamilyName>
</AuthorName>
<Contact>
<Email>dg@cs.brown.edu</Email>
</Contact>
</Author>
<Affiliation ID="Aff17">
<OrgDivision>Department of Computer Science</OrgDivision>
<OrgName>Brown University</OrgName>
<OrgAddress>
<Postbox>Box 1910</Postbox>
<City>Providence</City>
<State>RI</State>
<Postcode>02912</Postcode>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
</AuthorGroup>
<Abstract ID="Abs1" Language="En">
<Heading>Abstract</Heading>
<Para>We propose and motivate a novel task: paragraph segmentation. We discuss and compare this task with text segmentation and discourse parsing. We present a system that performs the task with high accuracy. A variety of features is proposed and examined in detail. The best models turn out to include lexical, coherence, and structural features.</Para>
</Abstract>
</ChapterHeader>
<NoBody></NoBody>
</Chapter>
</SubPart>
</Part>
</Book>
</Series>
</Publisher>
</istex:document>
</istex:metadataXml>
<mods version="3.6">
<titleInfo lang="en">
<title>A Paragraph Boundary Detection System</title>
</titleInfo>
<titleInfo type="alternative" contentType="CDATA" lang="en">
<title>A Paragraph Boundary Detection System</title>
</titleInfo>
<name type="personal">
<namePart type="given">Dmitriy</namePart>
<namePart type="family">Genzel</namePart>
<affiliation>Department of Computer Science, Brown University, Box 1910, 02912, Providence, RI, USA</affiliation>
<affiliation>E-mail: dg@cs.brown.edu</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<typeOfResource>text</typeOfResource>
<originInfo>
<publisher>Springer Berlin Heidelberg</publisher>
<place>
<placeTerm type="text">Berlin, Heidelberg</placeTerm>
</place>
<dateIssued encoding="w3cdtf">2005</dateIssued>
<copyrightDate encoding="w3cdtf">2005</copyrightDate>
</originInfo>
<language>
<languageTerm type="code" authority="rfc3066">en</languageTerm>
<languageTerm type="code" authority="iso639-2b">eng</languageTerm>
</language>
<physicalDescription>
<internetMediaType>text/html</internetMediaType>
</physicalDescription>
<abstract lang="en">Abstract: We propose and motivate a novel task: paragraph segmentation. We discuss and compare this task with text segmentation and discourse parsing. We present a system that performs the task with high accuracy. A variety of features is proposed and examined in detail. The best models turn out to include lexical, coherence, and structural features.</abstract>
<relatedItem type="host">
<titleInfo>
<title>Computational Linguistics and Intelligent Text Processing</title>
<subTitle>6th International Conference, CICLing 2005, Mexico City, Mexico, February 13-19, 2005. Proceedings</subTitle>
</titleInfo>
<name type="personal">
<namePart type="given">Alexander</namePart>
<namePart type="family">Gelbukh</namePart>
<affiliation>National Polytechnic Institute, Center for Computing Research, 07738, Mexico City, México</affiliation>
<affiliation>E-mail: gelbukh@gelbukh.com</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<genre type="Book Series" displayLabel="Proceedings"></genre>
<originInfo>
<copyrightDate encoding="w3cdtf">2005</copyrightDate>
<issuance>monographic</issuance>
</originInfo>
<subject>
<genre>Book Subject Collection</genre>
<topic authority="SpringerSubjectCodes" authorityURI="SUCO11645">Computer Science</topic>
</subject>
<subject>
<genre>Book Subject Group</genre>
<topic authority="SpringerSubjectCodes" authorityURI="I">Computer Science</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I18032">Information Storage and Retrieval</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I21017">Artificial Intelligence (incl. Robotics)</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I21041">Language Translation and Linguistics</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I16048">Mathematical Logic and Formal Languages</topic>
</subject>
<identifier type="DOI">10.1007/b105772</identifier>
<identifier type="ISBN">978-3-540-24523-0</identifier>
<identifier type="eISBN">978-3-540-30586-6</identifier>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="eISSN">1611-3349</identifier>
<identifier type="BookTitleID">116551</identifier>
<identifier type="BookID">978-3-540-30586-6</identifier>
<identifier type="BookChapterCount">92</identifier>
<identifier type="BookVolumeNumber">3406</identifier>
<identifier type="BookSequenceNumber">3406</identifier>
<identifier type="PartChapterCount">40</identifier>
<part>
<date>2005</date>
<detail type="part">
<title>Intelligent Text Processing Applications</title>
</detail>
<detail type="volume">
<number>3406</number>
<caption>vol.</caption>
</detail>
<extent unit="pages">
<start>816</start>
<end>826</end>
</extent>
</part>
<recordInfo>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2005</recordOrigin>
</recordInfo>
</relatedItem>
<relatedItem type="series">
<titleInfo>
<title>Lecture Notes in Computer Science</title>
</titleInfo>
<name type="personal">
<namePart type="given">David</namePart>
<namePart type="family">Hutchison</namePart>
<affiliation>Lancaster University, UK</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Takeo</namePart>
<namePart type="family">Kanade</namePart>
<affiliation>Carnegie Mellon University, Pittsburgh, PA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Josef</namePart>
<namePart type="family">Kittler</namePart>
<affiliation>University of Surrey, Guildford, UK</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Jon</namePart>
<namePart type="given">M.</namePart>
<namePart type="family">Kleinberg</namePart>
<affiliation>Cornell University, Ithaca, NY, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Friedemann</namePart>
<namePart type="family">Mattern</namePart>
<affiliation>ETH Zurich, Switzerland</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">John</namePart>
<namePart type="given">C.</namePart>
<namePart type="family">Mitchell</namePart>
<affiliation>Stanford University, CA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Moni</namePart>
<namePart type="family">Naor</namePart>
<affiliation>Weizmann Institute of Science, Rehovot, Israel</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Oscar</namePart>
<namePart type="family">Nierstrasz</namePart>
<affiliation>University of Bern, Switzerland</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">C.</namePart>
<namePart type="family">Pandu Rangan</namePart>
<affiliation>Indian Institute of Technology, Madras, India</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Bernhard</namePart>
<namePart type="family">Steffen</namePart>
<affiliation>University of Dortmund, Germany</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Madhu</namePart>
<namePart type="family">Sudan</namePart>
<affiliation>Massachusetts Institute of Technology, MA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Demetri</namePart>
<namePart type="family">Terzopoulos</namePart>
<affiliation>New York University, NY, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Doug</namePart>
<namePart type="family">Tygar</namePart>
<affiliation>University of California, Berkeley, CA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Moshe</namePart>
<namePart type="given">Y.</namePart>
<namePart type="family">Vardi</namePart>
<affiliation>Rice University, Houston, TX, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Gerhard</namePart>
<namePart type="family">Weikum</namePart>
<affiliation>Max-Planck Institute of Computer Science, Saarbruecken, Germany</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<copyrightDate encoding="w3cdtf">2005</copyrightDate>
<issuance>serial</issuance>
</originInfo>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="eISSN">1611-3349</identifier>
<identifier type="SeriesID">558</identifier>
<recordInfo>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2005</recordOrigin>
</recordInfo>
</relatedItem>
<identifier type="istex">E721D32157443575D3AA6F2BFAA2655A85232278</identifier>
<identifier type="DOI">10.1007/978-3-540-30586-6_92</identifier>
<identifier type="ChapterID">92</identifier>
<identifier type="ChapterID">Chap92</identifier>
<accessCondition type="use and reproduction" contentType="copyright">Springer-Verlag Berlin Heidelberg, 2005</accessCondition>
<recordInfo>
<recordContentSource>SPRINGER</recordContentSource>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2005</recordOrigin>
</recordInfo>
</mods>
</metadata>
<enrichments>
<istex:refBibTEI uri="https://api.istex.fr/document/E721D32157443575D3AA6F2BFAA2655A85232278/enrichments/refBib">
<teiHeader></teiHeader>
<text>
<front></front>
<body></body>
<back>
<listBibl>
<biblStruct xml:id="b0">
<analytic>
<title level="a" type="main">Microsoft natural language understanding system and grammar checker</title>
<author>
<persName>
<forename type="first">S</forename>
<surname>Richardson</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of Fifth Conference on Applied Natural Language Processing</title>
<meeting>Fifth Conference on Applied Natural Language Processing</meeting>
<imprint>
<date type="published" when="1997"></date>
<biblScope unit="page">97</biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b1">
<analytic>
<title level="a" type="main">Detecting shifts in news stories for paragraph extraction</title>
<author>
<persName>
<forename type="first">F</forename>
<surname>Fukumoto</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">Y</forename>
<surname>Suzuki</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of 19th International Conference on Computational Linguistics (COLING-02)</title>
<meeting>19th International Conference on Computational Linguistics (COLING-02)</meeting>
<imprint>
<date type="published" when="2002"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b2">
<analytic>
<title level="a" type="main">Variation of entropy and parse trees of sentences as a function of the sentence number</title>
<author>
<persName>
<forename type="first">D</forename>
<surname>Genzel</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">E</forename>
<surname>Charniak</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of EMNLP–03</title>
<meeting>EMNLP–03
<address>
<addrLine>Sapporo, Japan</addrLine>
</address>
</meeting>
<imprint>
<date type="published" when="2003"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b3">
<analytic>
<title level="a" type="main">Text segmentation into paragraphs based on local text cohesion</title>
<author>
<persName>
<forename type="first">I</forename>
<forename type="middle">A</forename>
<surname>Bolshakov</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">A</forename>
<forename type="middle">F</forename>
<surname>Gelbukh</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Lecture Notes in Artificial Intelligence #2166</title>
<imprint>
<publisher>Springer-Verlag</publisher>
<date type="published" when="2001"></date>
<biblScope unit="page" from="158" to="166"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b4">
<analytic>
<title level="a" type="main">TextTiling: Segmenting text into multi-paragraph subtopic passages</title>
</analytic>
<monogr>
<title level="j">Computational Linguistics</title>
<imprint>
<biblScope unit="volume">23</biblScope>
<date type="published" when="1997"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b5">
<analytic>
<title level="a" type="main">Text segmentation using exponential models</title>
<author>
<persName>
<forename type="first">D</forename>
<surname>Beeferman</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">A</forename>
<surname>Berger</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">J</forename>
<surname>Lafferty</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of EMNLP–97</title>
<meeting>EMNLP–97</meeting>
<imprint>
<date type="published" when="1997"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b6">
<analytic>
<title level="a" type="main">Sentence level discourse parsing using syntactic and lexical information</title>
<author>
<persName>
<forename type="first">R</forename>
<surname>Soricut</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<surname>Marcu</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of HLT/NAACL–03</title>
<meeting>HLT/NAACL–03</meeting>
<imprint>
<date type="published" when="2003"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b7">
<analytic>
<title level="a" type="main">Can text structure be incompatible with rhetorical structure?</title>
<author>
<persName>
<forename type="first">N</forename>
<surname>Bouayad-Agha</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">R</forename>
<surname>Power</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<surname>Scott</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of the International Natural Language Generation Conference</title>
<meeting>the International Natural Language Generation Conference</meeting>
<imprint>
<date type="published" when="2000"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b8">
<analytic>
<title level="a" type="main">Ranking algorithms for named-entity extraction: Boosting and the voted perceptron</title>
<author>
<persName>
<forename type="first">M</forename>
<surname>Collins</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of ACL–02</title>
<meeting>ACL–02</meeting>
<imprint>
<date type="published" when="2002"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b9">
<analytic>
<title level="a" type="main">The perceptron: A probabilistic model for information storage and organization in the brain</title>
<author>
<persName>
<forename type="first">F</forename>
<surname>Rosenblatt</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">Psychological Review</title>
<imprint>
<biblScope unit="volume">65</biblScope>
<biblScope unit="page" from="386" to="408"></biblScope>
<date type="published" when="1958"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b10">
<analytic>
<title level="a" type="main">Centering: a framework for modelling the local coherence of discourse</title>
<author>
<persName>
<forename type="first">B</forename>
<surname>Grosz</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">A</forename>
<surname>Joshi</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">S</forename>
<surname>Weinstein</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">Computational Linguistics</title>
<imprint>
<biblScope unit="volume">21</biblScope>
<biblScope unit="page" from="203" to="226"></biblScope>
<date type="published" when="1995"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b11">
<analytic>
<title level="a" type="main">Assigning function tags to parsed text</title>
<author>
<persName>
<forename type="first">D</forename>
<surname>Blaheta</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">E</forename>
<surname>Charniak</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of NAACL–00</title>
<meeting>NAACL–00</meeting>
<imprint>
<date type="published" when="2000"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b12">
<analytic>
<title level="a" type="main">Building a large annotated corpus of English: the Penn treebank</title>
<author>
<persName>
<forename type="first">M</forename>
<forename type="middle">P</forename>
<surname>Marcus</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">B</forename>
<surname>Santorini</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">M</forename>
<forename type="middle">A</forename>
<surname>Marcinkiewicz</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">Computational Linguistics</title>
<imprint>
<biblScope unit="volume">19</biblScope>
<biblScope unit="page" from="313" to="330"></biblScope>
<date type="published" when="1993"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b13">
<monogr>
<title level="m" type="main">War and Peace</title>
<author>
<persName>
<forename type="first">L</forename>
<surname>Tolstoy</surname>
</persName>
</author>
<imprint></imprint>
</monogr>
<note>Available online, in 4 languages (Russian, English, Spanish, Italian)</note>
</biblStruct>
<biblStruct xml:id="b14">
<analytic>
<title level="a" type="main">A maximum-entropy-inspired parser</title>
<author>
<persName>
<forename type="first">E</forename>
<surname>Charniak</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of ACL–01</title>
<meeting>ACL–01
<address>
<addrLine>Toulouse</addrLine>
</address>
</meeting>
<imprint>
<date type="published" when="2001"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b15">
<analytic>
<title level="a" type="main">A simple pattern-matching algorithm for recovering empty nodes and their antecedents</title>
<author>
<persName>
<forename type="first">M</forename>
<surname>Johnson</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of ACL–02</title>
<meeting>ACL–02</meeting>
<imprint>
<date type="published" when="2002"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b16">
<monogr>
<title level="m" type="main">Empirical methods for artificial intelligence</title>
<author>
<persName>
<forename type="first">P</forename>
<surname>Cohen</surname>
</persName>
</author>
<imprint>
<date type="published" when="1995"></date>
<publisher>MIT Press</publisher>
<pubPlace>Cambridge, MA</pubPlace>
</imprint>
</monogr>
</biblStruct>
</listBibl>
</back>
</text>
</istex:refBibTEI>
</enrichments>
</istex>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Istex/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 003181 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Istex/Corpus/biblio.hfd -nk 003181 | SxmlIndent | more
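
The commands above print the indented XML record. As a rough illustration of what can be done with that output, the following Python sketch lists the entries of a `<listBibl>` fragment like the one in the record. It assumes the fragment has been copied into a string on its own, because the full record uses prefixes such as `istex:` and `mods:` whose namespace declarations may not be present in the dump. The function name `summarize` and the `SAMPLE` string are hypothetical and not part of Dilib.

```python
import xml.etree.ElementTree as ET

# ElementTree exposes the built-in xml: prefix under its full namespace URI.
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"

# One entry copied from the record's bibliography, used as sample input.
SAMPLE = """<listBibl>
<biblStruct xml:id="b9">
<analytic>
<title level="a" type="main">The perceptron: A probabilistic model for information storage and organization in the brain</title>
<author>
<persName>
<forename type="first">F</forename>
<surname>Rosenblatt</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">Psychological Review</title>
<imprint>
<biblScope unit="volume">65</biblScope>
<biblScope unit="page" from="386" to="408"></biblScope>
<date type="published" when="1958"></date>
</imprint>
</monogr>
</biblStruct>
</listBibl>"""


def summarize(list_bibl_xml):
    """Return one dict per <biblStruct>: id, main title, year, author surnames."""
    root = ET.fromstring(list_bibl_xml)
    rows = []
    for bib in root.iter("biblStruct"):
        title = bib.find(".//title[@type='main']")
        date = bib.find(".//date[@type='published']")
        rows.append({
            "id": bib.get(XML_ID),
            "title": title.text if title is not None else None,
            "year": date.get("when") if date is not None else None,
            "authors": [s.text for s in bib.iter("surname")],
        })
    return rows


if __name__ == "__main__":
    for row in summarize(SAMPLE):
        print(row["id"], row["year"], ", ".join(row["authors"]), "-", row["title"])
```

Entries without a publication date or without authors (both occur in this record) yield `None` for the year or an empty author list rather than raising an error.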

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Istex
   |étape=   Corpus
   |type=    RBID
   |clé=     ISTEX:E721D32157443575D3AA6F2BFAA2655A85232278
   |texte=   A Paragraph Boundary Detection System
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024