A Paragraph Boundary Detection System
Identifieur interne : 003181 ( Istex/Corpus ); précédent : 003180; suivant : 003182A Paragraph Boundary Detection System
Auteurs : Dmitriy GenzelSource :
- Lecture Notes in Computer Science [ 0302-9743 ] ; 2005.
Abstract
Abstract: We propose and motivate a novel task: paragraph segmentation. We discuss and compare this task with text segmentation and discourse parsing. We present a system that performs the task with high accuracy. A variety of features is proposed and examined in detail. The best models turn out to include lexical, coherence, and structural features.
Url:
DOI: 10.1007/978-3-540-30586-6_92
Links to Exploration step
ISTEX:E721D32157443575D3AA6F2BFAA2655A85232278Le document en format XML
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title xml:lang="en">A Paragraph Boundary Detection System</title>
<author><name sortKey="Genzel, Dmitriy" sort="Genzel, Dmitriy" uniqKey="Genzel D" first="Dmitriy" last="Genzel">Dmitriy Genzel</name>
<affiliation><mods:affiliation>Department of Computer Science, Brown University, Box 1910, 02912, Providence, RI, USA</mods:affiliation>
</affiliation>
<affiliation><mods:affiliation>E-mail: dg@cs.brown.edu</mods:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:E721D32157443575D3AA6F2BFAA2655A85232278</idno>
<date when="2005" year="2005">2005</date>
<idno type="doi">10.1007/978-3-540-30586-6_92</idno>
<idno type="url">https://api.istex.fr/document/E721D32157443575D3AA6F2BFAA2655A85232278/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">003181</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">A Paragraph Boundary Detection System</title>
<author><name sortKey="Genzel, Dmitriy" sort="Genzel, Dmitriy" uniqKey="Genzel D" first="Dmitriy" last="Genzel">Dmitriy Genzel</name>
<affiliation><mods:affiliation>Department of Computer Science, Brown University, Box 1910, 02912, Providence, RI, USA</mods:affiliation>
</affiliation>
<affiliation><mods:affiliation>E-mail: dg@cs.brown.edu</mods:affiliation>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="s">Lecture Notes in Computer Science</title>
<imprint><date>2005</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">E721D32157443575D3AA6F2BFAA2655A85232278</idno>
<idno type="DOI">10.1007/978-3-540-30586-6_92</idno>
<idno type="ChapterID">92</idno>
<idno type="ChapterID">Chap92</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Abstract: We propose and motivate a novel task: paragraph segmentation. We discuss and compare this task with text segmentation and discourse parsing. We present a system that performs the task with high accuracy. A variety of features is proposed and examined in detail. The best models turn out to include lexical, coherence, and structural features.</div>
</front>
</TEI>
<istex><corpusName>springer</corpusName>
<author><json:item><name>Dmitriy Genzel</name>
<affiliations><json:string>Department of Computer Science, Brown University, Box 1910, 02912, Providence, RI, USA</json:string>
<json:string>E-mail: dg@cs.brown.edu</json:string>
</affiliations>
</json:item>
</author>
<language><json:string>eng</json:string>
</language>
<abstract>Abstract: We propose and motivate a novel task: paragraph segmentation. We discuss and compare this task with text segmentation and discourse parsing. We present a system that performs the task with high accuracy. A variety of features is proposed and examined in detail. The best models turn out to include lexical, coherence, and structural features.</abstract>
<qualityIndicators><score>4.152</score>
<pdfVersion>1.3</pdfVersion>
<pdfPageSize>430 x 660 pts</pdfPageSize>
<refBibsNative>false</refBibsNative>
<keywordCount>0</keywordCount>
<abstractCharCount>352</abstractCharCount>
<pdfWordCount>3492</pdfWordCount>
<pdfCharCount>20887</pdfCharCount>
<pdfPageCount>11</pdfPageCount>
<abstractWordCount>55</abstractWordCount>
</qualityIndicators>
<title>A Paragraph Boundary Detection System</title>
<chapterId><json:string>92</json:string>
<json:string>Chap92</json:string>
</chapterId>
<serie><editor><json:item><name>David Hutchison</name>
<affiliations><json:string>Lancaster University, UK</json:string>
</affiliations>
</json:item>
<json:item><name>Takeo Kanade</name>
<affiliations><json:string>Carnegie Mellon University, Pittsburgh, PA, USA</json:string>
</affiliations>
</json:item>
<json:item><name>Josef Kittler</name>
<affiliations><json:string>University of Surrey, Guildford, UK</json:string>
</affiliations>
</json:item>
<json:item><name>Jon M. Kleinberg</name>
<affiliations><json:string>Cornell University, Ithaca, NY, USA</json:string>
</affiliations>
</json:item>
<json:item><name>Friedemann Mattern</name>
<affiliations><json:string>ETH Zurich, Switzerland</json:string>
</affiliations>
</json:item>
<json:item><name>John C. Mitchell</name>
<affiliations><json:string>Stanford University, CA, USA</json:string>
</affiliations>
</json:item>
<json:item><name>Moni Naor</name>
<affiliations><json:string>Weizmann Institute of Science, Rehovot, Israel</json:string>
</affiliations>
</json:item>
<json:item><name>Oscar Nierstrasz</name>
<affiliations><json:string>University of Bern, Switzerland</json:string>
</affiliations>
</json:item>
<json:item><name>C. Pandu Rangan</name>
<affiliations><json:string>Indian Institute of Technology, Madras, India</json:string>
</affiliations>
</json:item>
<json:item><name>Bernhard Steffen</name>
<affiliations><json:string>University of Dortmund, Germany</json:string>
</affiliations>
</json:item>
<json:item><name>Madhu Sudan</name>
<affiliations><json:string>Massachusetts Institute of Technology, MA, USA</json:string>
</affiliations>
</json:item>
<json:item><name>Demetri Terzopoulos</name>
<affiliations><json:string>New York University, NY, USA</json:string>
</affiliations>
</json:item>
<json:item><name>Dough Tygar</name>
<affiliations><json:string>University of California, Berkeley, CA, USA</json:string>
</affiliations>
</json:item>
<json:item><name>Moshe Y. Vardi</name>
<affiliations><json:string>Rice University, Houston, TX, USA</json:string>
</affiliations>
</json:item>
<json:item><name>Gerhard Weikum</name>
<affiliations><json:string>Max-Planck Institute of Computer Science, Saarbruecken, Germany</json:string>
</affiliations>
</json:item>
</editor>
<issn><json:string>0302-9743</json:string>
</issn>
<language><json:string>unknown</json:string>
</language>
<eissn><json:string>1611-3349</json:string>
</eissn>
<title>Lecture Notes in Computer Science</title>
<copyrightDate>2005</copyrightDate>
</serie>
<host><editor><json:item><name>Alexander Gelbukh</name>
<affiliations><json:string>National Polytechnic Institute, Center for Computing Research, 07738, Mexico City, México</json:string>
<json:string>E-mail: gelbukh@gelbukh.com</json:string>
</affiliations>
</json:item>
</editor>
<subject><json:item><value>Computer Science</value>
</json:item>
<json:item><value>Computer Science</value>
</json:item>
<json:item><value>Information Storage and Retrieval</value>
</json:item>
<json:item><value>Artificial Intelligence (incl. Robotics)</value>
</json:item>
<json:item><value>Language Translation and Linguistics</value>
</json:item>
<json:item><value>Mathematical Logic and Formal Languages</value>
</json:item>
</subject>
<isbn><json:string>978-3-540-24523-0</json:string>
</isbn>
<language><json:string>unknown</json:string>
</language>
<eissn><json:string>1611-3349</json:string>
</eissn>
<title>Computational Linguistics and Intelligent Text Processing</title>
<genre.original><json:string>Proceedings</json:string>
</genre.original>
<bookId><json:string>978-3-540-30586-6</json:string>
</bookId>
<volume>3406</volume>
<pages><last>826</last>
<first>816</first>
</pages>
<issn><json:string>0302-9743</json:string>
</issn>
<genre><json:string>Book Series</json:string>
</genre>
<eisbn><json:string>978-3-540-30586-6</json:string>
</eisbn>
<copyrightDate>2005</copyrightDate>
<doi><json:string>10.1007/b105772</json:string>
</doi>
</host>
<publicationDate>2005</publicationDate>
<copyrightDate>2005</copyrightDate>
<doi><json:string>10.1007/978-3-540-30586-6_92</json:string>
</doi>
<id>E721D32157443575D3AA6F2BFAA2655A85232278</id>
<fulltext><json:item><original>true</original>
<mimetype>application/pdf</mimetype>
<extension>pdf</extension>
<uri>https://api.istex.fr/document/E721D32157443575D3AA6F2BFAA2655A85232278/fulltext/pdf</uri>
</json:item>
<json:item><original>false</original>
<mimetype>application/zip</mimetype>
<extension>zip</extension>
<uri>https://api.istex.fr/document/E721D32157443575D3AA6F2BFAA2655A85232278/fulltext/zip</uri>
</json:item>
<istex:fulltextTEI uri="https://api.istex.fr/document/E721D32157443575D3AA6F2BFAA2655A85232278/fulltext/tei"><teiHeader><fileDesc><titleStmt><title level="a" type="main" xml:lang="en">A Paragraph Boundary Detection System</title>
<respStmt xml:id="ISTEX-API" resp="Références bibliographiques récupérées via GROBID" name="ISTEX-API (INIST-CNRS)"></respStmt>
</titleStmt>
<publicationStmt><authority>ISTEX</authority>
<publisher>Springer Berlin Heidelberg</publisher>
<pubPlace>Berlin, Heidelberg</pubPlace>
<availability><p>SPRINGER</p>
</availability>
<date>2005</date>
</publicationStmt>
<sourceDesc><biblStruct type="inbook"><analytic><title level="a" type="main" xml:lang="en">A Paragraph Boundary Detection System</title>
<author><persName><forename type="first">Dmitriy</forename>
<surname>Genzel</surname>
</persName>
<email>dg@cs.brown.edu</email>
<affiliation>Department of Computer Science, Brown University, Box 1910, 02912, Providence, RI, USA</affiliation>
</author>
</analytic>
<monogr><title level="m">Computational Linguistics and Intelligent Text Processing</title>
<title level="m" type="sub">6th International Conference, CICLing 2005, Mexico City, Mexico, February 13-19, 2005. Proceedings</title>
<idno type="pISBN">978-3-540-24523-0</idno>
<idno type="eISBN">978-3-540-30586-6</idno>
<idno type="pISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="DOI">10.1007/b105772</idno>
<idno type="BookID">978-3-540-30586-6</idno>
<idno type="BookTitleID">116551</idno>
<idno type="BookSequenceNumber">3406</idno>
<idno type="BookVolumeNumber">3406</idno>
<idno type="BookChapterCount">92</idno>
<editor><persName><forename type="first">Alexander</forename>
<surname>Gelbukh</surname>
</persName>
<email>gelbukh@gelbukh.com</email>
<affiliation>National Polytechnic Institute, Center for Computing Research, 07738, Mexico City, México</affiliation>
</editor>
<imprint><publisher>Springer Berlin Heidelberg</publisher>
<pubPlace>Berlin, Heidelberg</pubPlace>
<date type="published" when="2005"></date>
<biblScope unit="volume">3406</biblScope>
<biblScope unit="page" from="816">816</biblScope>
<biblScope unit="page" to="826">826</biblScope>
</imprint>
</monogr>
<series><title level="s">Lecture Notes in Computer Science</title>
<editor><persName><forename type="first">David</forename>
<surname>Hutchison</surname>
</persName>
<affiliation>Lancaster University, UK</affiliation>
</editor>
<editor><persName><forename type="first">Takeo</forename>
<surname>Kanade</surname>
</persName>
<affiliation>Carnegie Mellon University, Pittsburgh, PA, USA</affiliation>
</editor>
<editor><persName><forename type="first">Josef</forename>
<surname>Kittler</surname>
</persName>
<affiliation>University of Surrey, Guildford, UK</affiliation>
</editor>
<editor><persName><forename type="first">Jon</forename>
<forename type="first">M.</forename>
<surname>Kleinberg</surname>
</persName>
<affiliation>Cornell University, Ithaca, NY, USA</affiliation>
</editor>
<editor><persName><forename type="first">Friedemann</forename>
<surname>Mattern</surname>
</persName>
<affiliation>ETH Zurich, Switzerland</affiliation>
</editor>
<editor><persName><forename type="first">John</forename>
<forename type="first">C.</forename>
<surname>Mitchell</surname>
</persName>
<affiliation>Stanford University, CA, USA</affiliation>
</editor>
<editor><persName><forename type="first">Moni</forename>
<surname>Naor</surname>
</persName>
<affiliation>Weizmann Institute of Science, Rehovot, Israel</affiliation>
</editor>
<editor><persName><forename type="first">Oscar</forename>
<surname>Nierstrasz</surname>
</persName>
<affiliation>University of Bern, Switzerland</affiliation>
</editor>
<editor><persName><forename type="first">C.</forename>
<surname>Pandu Rangan</surname>
</persName>
<affiliation>Indian Institute of Technology, Madras, India</affiliation>
</editor>
<editor><persName><forename type="first">Bernhard</forename>
<surname>Steffen</surname>
</persName>
<affiliation>University of Dortmund, Germany</affiliation>
</editor>
<editor><persName><forename type="first">Madhu</forename>
<surname>Sudan</surname>
</persName>
<affiliation>Massachusetts Institute of Technology, MA, USA</affiliation>
</editor>
<editor><persName><forename type="first">Demetri</forename>
<surname>Terzopoulos</surname>
</persName>
<affiliation>New York University, NY, USA</affiliation>
</editor>
<editor><persName><forename type="first">Dough</forename>
<surname>Tygar</surname>
</persName>
<affiliation>University of California, Berkeley, CA, USA</affiliation>
</editor>
<editor><persName><forename type="first">Moshe</forename>
<forename type="first">Y.</forename>
<surname>Vardi</surname>
</persName>
<affiliation>Rice University, Houston, TX, USA</affiliation>
</editor>
<editor><persName><forename type="first">Gerhard</forename>
<surname>Weikum</surname>
</persName>
<affiliation>Max-Planck Institute of Computer Science, Saarbruecken, Germany</affiliation>
</editor>
<biblScope><date>2005</date>
</biblScope>
<idno type="pISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="seriesId">558</idno>
</series>
<idno type="istex">E721D32157443575D3AA6F2BFAA2655A85232278</idno>
<idno type="DOI">10.1007/978-3-540-30586-6_92</idno>
<idno type="ChapterID">92</idno>
<idno type="ChapterID">Chap92</idno>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><creation><date>2005</date>
</creation>
<langUsage><language ident="en">en</language>
</langUsage>
<abstract xml:lang="en"><p>Abstract: We propose and motivate a novel task: paragraph segmentation. We discuss and compare this task with text segmentation and discourse parsing. We present a system that performs the task with high accuracy. A variety of features is proposed and examined in detail. The best models turn out to include lexical, coherence, and structural features.</p>
</abstract>
<textClass><keywords scheme="Book Subject Collection"><list><label>SUCO11645</label>
<item><term>Computer Science</term>
</item>
</list>
</keywords>
</textClass>
<textClass><keywords scheme="Book Subject Group"><list><label>I</label>
<label>I18032</label>
<label>I21017</label>
<label>I21041</label>
<label>I16048</label>
<item><term>Computer Science</term>
</item>
<item><term>Information Storage and Retrieval</term>
</item>
<item><term>Artificial Intelligence (incl. Robotics)</term>
</item>
<item><term>Language Translation and Linguistics</term>
</item>
<item><term>Mathematical Logic and Formal Languages</term>
</item>
</list>
</keywords>
</textClass>
</profileDesc>
<revisionDesc><change when="2005">Published</change>
<change xml:id="refBibs-istex" who="#ISTEX-API" when="2016-3-20">References added</change>
</revisionDesc>
</teiHeader>
</istex:fulltextTEI>
<json:item><original>false</original>
<mimetype>text/plain</mimetype>
<extension>txt</extension>
<uri>https://api.istex.fr/document/E721D32157443575D3AA6F2BFAA2655A85232278/fulltext/txt</uri>
</json:item>
</fulltext>
<metadata><istex:metadataXml wicri:clean="Springer, Publisher found" wicri:toSee="no header"><istex:xmlDeclaration>version="1.0" encoding="UTF-8"</istex:xmlDeclaration>
<istex:docType PUBLIC="-//Springer-Verlag//DTD A++ V2.4//EN" URI="http://devel.springer.de/A++/V2.4/DTD/A++V2.4.dtd" name="istex:docType"></istex:docType>
<istex:document><Publisher><PublisherInfo><PublisherName>Springer Berlin Heidelberg</PublisherName>
<PublisherLocation>Berlin, Heidelberg</PublisherLocation>
</PublisherInfo>
<Series><SeriesInfo SeriesType="Series" TocLevels="0"><SeriesID>558</SeriesID>
<SeriesPrintISSN>0302-9743</SeriesPrintISSN>
<SeriesElectronicISSN>1611-3349</SeriesElectronicISSN>
<SeriesTitle Language="En">Lecture Notes in Computer Science</SeriesTitle>
</SeriesInfo>
<SeriesHeader><EditorGroup><Editor AffiliationIDS="Aff1"><EditorName DisplayOrder="Western"><GivenName>David</GivenName>
<FamilyName>Hutchison</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff2"><EditorName DisplayOrder="Western"><GivenName>Takeo</GivenName>
<FamilyName>Kanade</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff3"><EditorName DisplayOrder="Western"><GivenName>Josef</GivenName>
<FamilyName>Kittler</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff4"><EditorName DisplayOrder="Western"><GivenName>Jon</GivenName>
<GivenName>M.</GivenName>
<FamilyName>Kleinberg</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff5"><EditorName DisplayOrder="Western"><GivenName>Friedemann</GivenName>
<FamilyName>Mattern</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff6"><EditorName DisplayOrder="Western"><GivenName>John</GivenName>
<GivenName>C.</GivenName>
<FamilyName>Mitchell</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff7"><EditorName DisplayOrder="Western"><GivenName>Moni</GivenName>
<FamilyName>Naor</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff8"><EditorName DisplayOrder="Western"><GivenName>Oscar</GivenName>
<FamilyName>Nierstrasz</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff9"><EditorName DisplayOrder="Western"><GivenName>C.</GivenName>
<FamilyName>Pandu Rangan</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff10"><EditorName DisplayOrder="Western"><GivenName>Bernhard</GivenName>
<FamilyName>Steffen</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff11"><EditorName DisplayOrder="Western"><GivenName>Madhu</GivenName>
<FamilyName>Sudan</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff12"><EditorName DisplayOrder="Western"><GivenName>Demetri</GivenName>
<FamilyName>Terzopoulos</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff13"><EditorName DisplayOrder="Western"><GivenName>Dough</GivenName>
<FamilyName>Tygar</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff14"><EditorName DisplayOrder="Western"><GivenName>Moshe</GivenName>
<GivenName>Y.</GivenName>
<FamilyName>Vardi</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff15"><EditorName DisplayOrder="Western"><GivenName>Gerhard</GivenName>
<FamilyName>Weikum</FamilyName>
</EditorName>
</Editor>
<Affiliation ID="Aff1"><OrgName>Lancaster University</OrgName>
<OrgAddress><Country>UK</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff2"><OrgName>Carnegie Mellon University</OrgName>
<OrgAddress><City>Pittsburgh</City>
<State>PA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff3"><OrgName>University of Surrey</OrgName>
<OrgAddress><City>Guildford</City>
<Country>UK</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff4"><OrgName>Cornell University</OrgName>
<OrgAddress><City>Ithaca</City>
<State>NY</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff5"><OrgName>ETH Zurich</OrgName>
<OrgAddress><Country>Switzerland</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff6"><OrgName>Stanford University</OrgName>
<OrgAddress><City>CA</City>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff7"><OrgName>Weizmann Institute of Science</OrgName>
<OrgAddress><City>Rehovot</City>
<Country>Israel</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff8"><OrgName>University of Bern</OrgName>
<OrgAddress><Country>Switzerland</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff9"><OrgName>Indian Institute of Technology</OrgName>
<OrgAddress><City>Madras</City>
<Country>India</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff10"><OrgName>University of Dortmund</OrgName>
<OrgAddress><Country>Germany</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff11"><OrgName>Massachusetts Institute of Technology</OrgName>
<OrgAddress><City>MA</City>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff12"><OrgName>New York University</OrgName>
<OrgAddress><City>NY</City>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff13"><OrgName>University of California</OrgName>
<OrgAddress><City>Berkeley</City>
<State>CA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff14"><OrgName>Rice University</OrgName>
<OrgAddress><City>Houston</City>
<State>TX</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff15"><OrgName>Max-Planck Institute of Computer Science</OrgName>
<OrgAddress><City>Saarbruecken</City>
<Country>Germany</Country>
</OrgAddress>
</Affiliation>
</EditorGroup>
</SeriesHeader>
<Book Language="En"><BookInfo BookProductType="Proceedings" ContainsESM="No" Language="En" MediaType="eBook" NumberingDepth="2" NumberingStyle="ContentOnly" OutputMedium="All" TocLevels="0"><BookID>978-3-540-30586-6</BookID>
<BookTitle>Computational Linguistics and Intelligent Text Processing</BookTitle>
<BookSubTitle>6th International Conference, CICLing 2005, Mexico City, Mexico, February 13-19, 2005. Proceedings</BookSubTitle>
<BookVolumeNumber>3406</BookVolumeNumber>
<BookSequenceNumber>3406</BookSequenceNumber>
<BookDOI>10.1007/b105772</BookDOI>
<BookTitleID>116551</BookTitleID>
<BookPrintISBN>978-3-540-24523-0</BookPrintISBN>
<BookElectronicISBN>978-3-540-30586-6</BookElectronicISBN>
<BookChapterCount>92</BookChapterCount>
<BookCopyright><CopyrightHolderName>Springer-Verlag Berlin Heidelberg</CopyrightHolderName>
<CopyrightYear>2005</CopyrightYear>
</BookCopyright>
<BookSubjectGroup><BookSubject Code="I" Type="Primary">Computer Science</BookSubject>
<BookSubject Code="I18032" Priority="1" Type="Secondary">Information Storage and Retrieval</BookSubject>
<BookSubject Code="I21017" Priority="2" Type="Secondary">Artificial Intelligence (incl. Robotics)</BookSubject>
<BookSubject Code="I21041" Priority="3" Type="Secondary">Language Translation and Linguistics</BookSubject>
<BookSubject Code="I16048" Priority="4" Type="Secondary">Mathematical Logic and Formal Languages</BookSubject>
<SubjectCollection Code="SUCO11645">Computer Science</SubjectCollection>
</BookSubjectGroup>
<BookContext><SeriesID>558</SeriesID>
</BookContext>
</BookInfo>
<BookHeader><EditorGroup><Editor AffiliationIDS="Aff16"><EditorName DisplayOrder="Western"><GivenName>Alexander</GivenName>
<FamilyName>Gelbukh</FamilyName>
</EditorName>
<Contact><Email>gelbukh@gelbukh.com</Email>
</Contact>
</Editor>
<Affiliation ID="Aff16"><OrgDivision>National Polytechnic Institute</OrgDivision>
<OrgName>Center for Computing Research</OrgName>
<OrgAddress><Postcode>07738</Postcode>
<City>Mexico City</City>
<Country>México</Country>
</OrgAddress>
</Affiliation>
</EditorGroup>
</BookHeader>
<Part ID="Part2"><PartInfo TocLevels="0"><PartID>2</PartID>
<PartSequenceNumber>2</PartSequenceNumber>
<PartTitle>Intelligent Text Processing Applications</PartTitle>
<PartChapterCount>40</PartChapterCount>
<PartContext><SeriesID>558</SeriesID>
<BookTitle>Computational Linguistics and Intelligent Text Processing</BookTitle>
</PartContext>
</PartInfo>
<SubPart ID="SubPart19"><SubPartInfo><SubPartID>19</SubPartID>
<SubPartSequenceNumber>19</SubPartSequenceNumber>
<SubPartTitle>Spelling and Style Checking</SubPartTitle>
<SubPartChapterCount>3</SubPartChapterCount>
</SubPartInfo>
<Chapter ID="Chap92" Language="En"><ChapterInfo ChapterType="OriginalPaper" ContainsESM="No" NumberingDepth="2" NumberingStyle="ContentOnly" TocLevels="0"><ChapterID>92</ChapterID>
<ChapterDOI>10.1007/978-3-540-30586-6_92</ChapterDOI>
<ChapterSequenceNumber>92</ChapterSequenceNumber>
<ChapterTitle Language="En">A Paragraph Boundary Detection System</ChapterTitle>
<ChapterFirstPage>816</ChapterFirstPage>
<ChapterLastPage>826</ChapterLastPage>
<ChapterCopyright><CopyrightHolderName>Springer-Verlag Berlin Heidelberg</CopyrightHolderName>
<CopyrightYear>2005</CopyrightYear>
</ChapterCopyright>
<ChapterGrants Type="Regular"><MetadataGrant Grant="OpenAccess"></MetadataGrant>
<AbstractGrant Grant="OpenAccess"></AbstractGrant>
<BodyPDFGrant Grant="Restricted"></BodyPDFGrant>
<BodyHTMLGrant Grant="Restricted"></BodyHTMLGrant>
<BibliographyGrant Grant="Restricted"></BibliographyGrant>
<ESMGrant Grant="Restricted"></ESMGrant>
</ChapterGrants>
<ChapterContext><SeriesID>558</SeriesID>
<PartID>2</PartID>
<BookID>978-3-540-30586-6</BookID>
<BookTitle>Computational Linguistics and Intelligent Text Processing</BookTitle>
</ChapterContext>
</ChapterInfo>
<ChapterHeader><AuthorGroup><Author AffiliationIDS="Aff17"><AuthorName DisplayOrder="Western"><GivenName>Dmitriy</GivenName>
<FamilyName>Genzel</FamilyName>
</AuthorName>
<Contact><Email>dg@cs.brown.edu</Email>
</Contact>
</Author>
<Affiliation ID="Aff17"><OrgDivision>Department of Computer Science</OrgDivision>
<OrgName>Brown University</OrgName>
<OrgAddress><Postbox>Box 1910</Postbox>
<City>Providence</City>
<State>RI</State>
<Postcode>02912</Postcode>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
</AuthorGroup>
<Abstract ID="Abs1" Language="En"><Heading>Abstract</Heading>
<Para>We propose and motivate a novel task: paragraph segmentation. We discuss and compare this task with text segmentation and discourse parsing. We present a system that performs the task with high accuracy. A variety of features is proposed and examined in detail. The best models turn out to include lexical, coherence, and structural features.</Para>
</Abstract>
</ChapterHeader>
<NoBody></NoBody>
</Chapter>
</SubPart>
</Part>
</Book>
</Series>
</Publisher>
</istex:document>
</istex:metadataXml>
<mods version="3.6"><titleInfo lang="en"><title>A Paragraph Boundary Detection System</title>
</titleInfo>
<titleInfo type="alternative" contentType="CDATA" lang="en"><title>A Paragraph Boundary Detection System</title>
</titleInfo>
<name type="personal"><namePart type="given">Dmitriy</namePart>
<namePart type="family">Genzel</namePart>
<affiliation>Department of Computer Science, Brown University, Box 1910, 02912, Providence, RI, USA</affiliation>
<affiliation>E-mail: dg@cs.brown.edu</affiliation>
<role><roleTerm type="text">author</roleTerm>
</role>
</name>
<typeOfResource>text</typeOfResource>
<originInfo><publisher>Springer Berlin Heidelberg</publisher>
<place><placeTerm type="text">Berlin, Heidelberg</placeTerm>
</place>
<dateIssued encoding="w3cdtf">2005</dateIssued>
<copyrightDate encoding="w3cdtf">2005</copyrightDate>
</originInfo>
<language><languageTerm type="code" authority="rfc3066">en</languageTerm>
<languageTerm type="code" authority="iso639-2b">eng</languageTerm>
</language>
<physicalDescription><internetMediaType>text/html</internetMediaType>
</physicalDescription>
<abstract lang="en">Abstract: We propose and motivate a novel task: paragraph segmentation. We discuss and compare this task with text segmentation and discourse parsing. We present a system that performs the task with high accuracy. A variety of features is proposed and examined in detail. The best models turn out to include lexical, coherence, and structural features.</abstract>
<relatedItem type="host"><titleInfo><title>Computational Linguistics and Intelligent Text Processing</title>
<subTitle>6th International Conference, CICLing 2005, Mexico City, Mexico, February 13-19, 2005. Proceedings</subTitle>
</titleInfo>
<name type="personal"><namePart type="given">Alexander</namePart>
<namePart type="family">Gelbukh</namePart>
<affiliation>National Polytechnic Institute, Center for Computing Research, 07738, Mexico City, México</affiliation>
<affiliation>E-mail: gelbukh@gelbukh.com</affiliation>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<genre type="Book Series" displayLabel="Proceedings"></genre>
<originInfo><copyrightDate encoding="w3cdtf">2005</copyrightDate>
<issuance>monographic</issuance>
</originInfo>
<subject><genre>Book Subject Collection</genre>
<topic authority="SpringerSubjectCodes" authorityURI="SUCO11645">Computer Science</topic>
</subject>
<subject><genre>Book Subject Group</genre>
<topic authority="SpringerSubjectCodes" authorityURI="I">Computer Science</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I18032">Information Storage and Retrieval</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I21017">Artificial Intelligence (incl. Robotics)</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I21041">Language Translation and Linguistics</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I16048">Mathematical Logic and Formal Languages</topic>
</subject>
<identifier type="DOI">10.1007/b105772</identifier>
<identifier type="ISBN">978-3-540-24523-0</identifier>
<identifier type="eISBN">978-3-540-30586-6</identifier>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="eISSN">1611-3349</identifier>
<identifier type="BookTitleID">116551</identifier>
<identifier type="BookID">978-3-540-30586-6</identifier>
<identifier type="BookChapterCount">92</identifier>
<identifier type="BookVolumeNumber">3406</identifier>
<identifier type="BookSequenceNumber">3406</identifier>
<identifier type="PartChapterCount">40</identifier>
<part><date>2005</date>
<detail type="part"><title>Intelligent Text Processing Applications</title>
</detail>
<detail type="volume"><number>3406</number>
<caption>vol.</caption>
</detail>
<extent unit="pages"><start>816</start>
<end>826</end>
</extent>
</part>
<recordInfo><recordOrigin>Springer-Verlag Berlin Heidelberg, 2005</recordOrigin>
</recordInfo>
</relatedItem>
<relatedItem type="series"><titleInfo><title>Lecture Notes in Computer Science</title>
</titleInfo>
<name type="personal"><namePart type="given">David</namePart>
<namePart type="family">Hutchison</namePart>
<affiliation>Lancaster University, UK</affiliation>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Takeo</namePart>
<namePart type="family">Kanade</namePart>
<affiliation>Carnegie Mellon University, Pittsburgh, PA, USA</affiliation>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Josef</namePart>
<namePart type="family">Kittler</namePart>
<affiliation>University of Surrey, Guildford, UK</affiliation>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Jon</namePart>
<namePart type="given">M.</namePart>
<namePart type="family">Kleinberg</namePart>
<affiliation>Cornell University, Ithaca, NY, USA</affiliation>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Friedemann</namePart>
<namePart type="family">Mattern</namePart>
<affiliation>ETH Zurich, Switzerland</affiliation>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">John</namePart>
<namePart type="given">C.</namePart>
<namePart type="family">Mitchell</namePart>
<affiliation>Stanford University, CA, USA</affiliation>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Moni</namePart>
<namePart type="family">Naor</namePart>
<affiliation>Weizmann Institute of Science, Rehovot, Israel</affiliation>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Oscar</namePart>
<namePart type="family">Nierstrasz</namePart>
<affiliation>University of Bern, Switzerland</affiliation>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">C.</namePart>
<namePart type="family">Pandu Rangan</namePart>
<affiliation>Indian Institute of Technology, Madras, India</affiliation>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Bernhard</namePart>
<namePart type="family">Steffen</namePart>
<affiliation>University of Dortmund, Germany</affiliation>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Madhu</namePart>
<namePart type="family">Sudan</namePart>
<affiliation>Massachusetts Institute of Technology, MA, USA</affiliation>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Demetri</namePart>
<namePart type="family">Terzopoulos</namePart>
<affiliation>New York University, NY, USA</affiliation>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Dough</namePart>
<namePart type="family">Tygar</namePart>
<affiliation>University of California, Berkeley, CA, USA</affiliation>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Moshe</namePart>
<namePart type="given">Y.</namePart>
<namePart type="family">Vardi</namePart>
<affiliation>Rice University, Houston, TX, USA</affiliation>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Gerhard</namePart>
<namePart type="family">Weikum</namePart>
<affiliation>Max-Planck Institute of Computer Science, Saarbruecken, Germany</affiliation>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<originInfo><copyrightDate encoding="w3cdtf">2005</copyrightDate>
<issuance>serial</issuance>
</originInfo>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="eISSN">1611-3349</identifier>
<identifier type="SeriesID">558</identifier>
<recordInfo><recordOrigin>Springer-Verlag Berlin Heidelberg, 2005</recordOrigin>
</recordInfo>
</relatedItem>
<identifier type="istex">E721D32157443575D3AA6F2BFAA2655A85232278</identifier>
<identifier type="DOI">10.1007/978-3-540-30586-6_92</identifier>
<identifier type="ChapterID">92</identifier>
<identifier type="ChapterID">Chap92</identifier>
<accessCondition type="use and reproduction" contentType="copyright">Springer-Verlag Berlin Heidelberg, 2005</accessCondition>
<recordInfo><recordContentSource>SPRINGER</recordContentSource>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2005</recordOrigin>
</recordInfo>
</mods>
</metadata>
<enrichments><istex:refBibTEI uri="https://api.istex.fr/document/E721D32157443575D3AA6F2BFAA2655A85232278/enrichments/refBib"><teiHeader></teiHeader>
<text><front></front>
<body></body>
<back><listBibl><biblStruct xml:id="b0"><analytic><title level="a" type="main">Microsoft natural language understanding system and grammar checker</title>
<author><persName><forename type="first">S</forename>
<surname>Richardson</surname>
</persName>
</author>
</analytic>
<monogr><title level="m">Proceedings of Fifth Conference on Applied Natural Language Processing</title>
<meeting>Fifth Conference on Applied Natural Language Processing</meeting>
<imprint><date type="published" when="1997"></date>
<biblScope unit="page">97</biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b1"><analytic><title level="a" type="main">Detecting shifts in news stories for paragraph extraction</title>
<author><persName><forename type="first">F</forename>
<surname>Fukumoto</surname>
</persName>
</author>
<author><persName><forename type="first">Y</forename>
<surname>Suzuki</surname>
</persName>
</author>
</analytic>
<monogr><title level="m">Proceedings of 19th International Conference on Computational Linguistics (COLING-02)</title>
<meeting>19th International Conference on Computational Linguistics (COLING-02)</meeting>
<imprint><date type="published" when="2002"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b2"><analytic><title level="a" type="main">Variation of entropy and parse trees of sentences as a function of the sentence number</title>
<author><persName><forename type="first">D</forename>
<surname>Genzel</surname>
</persName>
</author>
<author><persName><forename type="first">E</forename>
<surname>Charniak</surname>
</persName>
</author>
</analytic>
<monogr><title level="m">Proceedings of EMNLP–03</title>
<meeting>EMNLP–03<address><addrLine>Sapporo, Japan</addrLine>
</address>
</meeting>
<imprint><date type="published" when="2003"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b3"><analytic><title level="a" type="main">Text segmentation into paragraphs based on local text cohesion</title>
<author><persName><forename type="first">I</forename>
<forename type="middle">A</forename>
<surname>Bolshakov</surname>
</persName>
</author>
<author><persName><forename type="first">A</forename>
<forename type="middle">F</forename>
<surname>Gelbukh</surname>
</persName>
</author>
</analytic>
<monogr><title level="m">Lecture Notes in Artificial Intelligence #2166</title>
<imprint><publisher>Springer-Verlag</publisher>
<date type="published" when="2001"></date>
<biblScope unit="page" from="158" to="166"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b4"><analytic><title level="a" type="main">TextTiling: Segmenting text into multi-paragraph subtopic passages</title>
</analytic>
<monogr><title level="j">Computational Linguistics</title>
<imprint><biblScope unit="volume">23</biblScope>
<date type="published" when="1997"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b5"><analytic><title level="a" type="main">Text segmentation using exponential models</title>
<author><persName><forename type="first">D</forename>
<surname>Beeferman</surname>
</persName>
</author>
<author><persName><forename type="first">A</forename>
<surname>Berger</surname>
</persName>
</author>
<author><persName><forename type="first">J</forename>
<surname>Lafferty</surname>
</persName>
</author>
</analytic>
<monogr><title level="m">Proceedings of EMNLP–97</title>
<meeting>EMNLP–97</meeting>
<imprint><date type="published" when="1997"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b6"><analytic><title level="a" type="main">Sentence level discourse parsing using syntactic and lexical information</title>
<author><persName><forename type="first">R</forename>
<surname>Soricut</surname>
</persName>
</author>
<author><persName><forename type="first">D</forename>
<surname>Marcu</surname>
</persName>
</author>
</analytic>
<monogr><title level="m">Proceedings of HLT/NAACL–03</title>
<meeting>HLT/NAACL–03</meeting>
<imprint><date type="published" when="2003"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b7"><analytic><title level="a" type="main">Can text structure be incompatible with rhetorical structure?</title>
<author><persName><forename type="first">N</forename>
<surname>Bouayad-Agha</surname>
</persName>
</author>
<author><persName><forename type="first">R</forename>
<surname>Power</surname>
</persName>
</author>
<author><persName><forename type="first">D</forename>
<surname>Scott</surname>
</persName>
</author>
</analytic>
<monogr><title level="m">Proceedings of the International Natural Language Generation Conference</title>
<meeting>the International Natural Language Generation Conference</meeting>
<imprint><date type="published" when="2000"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b8"><analytic><title level="a" type="main">Ranking algorithms for named-entity extraction: Boosting and the voted perceptron</title>
<author><persName><forename type="first">M</forename>
<surname>Collins</surname>
</persName>
</author>
</analytic>
<monogr><title level="m">Proceedings of ACL–02</title>
<meeting>ACL–02</meeting>
<imprint><date type="published" when="2002"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b9"><analytic><title level="a" type="main">The perceptron: A probabilistic model for information storage and organization in the brain</title>
<author><persName><forename type="first">F</forename>
<surname>Rosenblatt</surname>
</persName>
</author>
</analytic>
<monogr><title level="j">Psychological Review</title>
<imprint><biblScope unit="volume">65</biblScope>
<biblScope unit="page" from="386" to="408"></biblScope>
<date type="published" when="1958"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b10"><analytic><title level="a" type="main">Centering: a framework for modelling the local coherence of discourse</title>
<author><persName><forename type="first">B</forename>
<surname>Grosz</surname>
</persName>
</author>
<author><persName><forename type="first">A</forename>
<surname>Joshi</surname>
</persName>
</author>
<author><persName><forename type="first">S</forename>
<surname>Weinstein</surname>
</persName>
</author>
</analytic>
<monogr><title level="j">Computational Linguistics</title>
<imprint><biblScope unit="volume">21</biblScope>
<biblScope unit="page" from="203" to="226"></biblScope>
<date type="published" when="1995"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b11"><analytic><title level="a" type="main">Assigning function tags to parsed text</title>
<author><persName><forename type="first">D</forename>
<surname>Blaheta</surname>
</persName>
</author>
<author><persName><forename type="first">E</forename>
<surname>Charniak</surname>
</persName>
</author>
</analytic>
<monogr><title level="m">Proceedings of NAACL–00</title>
<meeting>NAACL–00</meeting>
<imprint><date type="published" when="2000"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b12"><analytic><title level="a" type="main">Building a large annotated corpus of English: the Penn treebank</title>
<author><persName><forename type="first">M</forename>
<forename type="middle">P</forename>
<surname>Marcus</surname>
</persName>
</author>
<author><persName><forename type="first">B</forename>
<surname>Santorini</surname>
</persName>
</author>
<author><persName><forename type="first">M</forename>
<forename type="middle">A</forename>
<surname>Marcinkiewicz</surname>
</persName>
</author>
</analytic>
<monogr><title level="j">Computational Linguistics</title>
<imprint><biblScope unit="volume">19</biblScope>
<biblScope unit="page" from="313" to="330"></biblScope>
<date type="published" when="1993"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b13"><monogr><title level="m" type="main">War and Peace Available online, in 4 languages</title>
<author><persName><forename type="first">L</forename>
<surname>Tolstoy</surname>
</persName>
</author>
<imprint><pubPlace>Russian, English, Spanish, Italian</pubPlace>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b14"><analytic><title level="a" type="main">A maximum-entropy-inspired parser</title>
<author><persName><forename type="first">E</forename>
<surname>Charniak</surname>
</persName>
</author>
</analytic>
<monogr><title level="m">Proceedings of ACL–01</title>
<meeting>ACL–01<address><addrLine>Toulouse</addrLine>
</address>
</meeting>
<imprint><date type="published" when="2001"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b15"><analytic><title level="a" type="main">A simple pattern-matching algorithm for recovering empty nodes and their antecedents</title>
<author><persName><forename type="first">M</forename>
<surname>Johnson</surname>
</persName>
</author>
</analytic>
<monogr><title level="m">Proceedings of ACL–02</title>
<meeting>ACL–02</meeting>
<imprint><date type="published" when="2002"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b16"><monogr><title level="m" type="main">Empirical methods for artificial intelligence</title>
<author><persName><forename type="first">P</forename>
<surname>Cohen</surname>
</persName>
</author>
<imprint><date type="published" when="1995"></date>
<publisher>MIT Press</publisher>
<pubPlace>Cambridge, MA</pubPlace>
</imprint>
</monogr>
</biblStruct>
</listBibl>
</back>
</text>
</istex:refBibTEI>
</enrichments>
</istex>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Istex/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 003181 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Istex/Corpus/biblio.hfd -nk 003181 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Istex |étape= Corpus |type= RBID |clé= ISTEX:E721D32157443575D3AA6F2BFAA2655A85232278 |texte= A Paragraph Boundary Detection System }}
This area was generated with Dilib version V0.6.32. |