Correcting Different Types of Errors in Texts
Identifieur interne : 000049 ( Istex/Checkpoint ); précédent : 000048; suivant : 000050Correcting Different Types of Errors in Texts
Auteurs : Aminul Islam [Canada] ; Diana Inkpen [Canada]Source :
- Lecture Notes in Computer Science [ 0302-9743 ] ; 2011.
Abstract
Abstract: This paper proposes an unsupervised approach that automatically detects and corrects a text containing multiple errors of both syntactic and semantic nature. The number of errors that can be corrected is equal to the number of correct words in the text. Error types include, but are not limited to: spelling errors, real-word spelling errors, typographical errors, unwanted words, missing words, prepositional errors, punctuation errors, and many of the grammatical errors (e.g., errors in agreement and verb formation).
Url:
DOI: 10.1007/978-3-642-21043-3_23
Affiliations:
Links toward previous steps (curation, corpus...)
Links to Exploration step
ISTEX:A5985BE5EDF2A8F996A278699D22B8E28B3D8736Le document en format XML
<record><TEI wicri:istexFullTextTei="biblStruct:series"><teiHeader><fileDesc><titleStmt><title xml:lang="en">Correcting Different Types of Errors in Texts</title>
<author><name sortKey="Islam, Aminul" sort="Islam, Aminul" uniqKey="Islam A" first="Aminul" last="Islam">Aminul Islam</name>
</author>
<author><name sortKey="Inkpen, Diana" sort="Inkpen, Diana" uniqKey="Inkpen D" first="Diana" last="Inkpen">Diana Inkpen</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:A5985BE5EDF2A8F996A278699D22B8E28B3D8736</idno>
<date when="2011" year="2011">2011</date>
<idno type="doi">10.1007/978-3-642-21043-3_23</idno>
<idno type="url">https://api.istex.fr/document/A5985BE5EDF2A8F996A278699D22B8E28B3D8736/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">001671</idno>
<idno type="wicri:Area/Istex/Curation">001577</idno>
<idno type="wicri:Area/Istex/Checkpoint">000049</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">Correcting Different Types of Errors in Texts</title>
<author><name sortKey="Islam, Aminul" sort="Islam, Aminul" uniqKey="Islam A" first="Aminul" last="Islam">Aminul Islam</name>
<affiliation wicri:level="1"><country xml:lang="fr">Canada</country>
<wicri:regionArea>University of Ottawa, Ottawa</wicri:regionArea>
<wicri:noRegion>Ottawa</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Canada</country>
</affiliation>
</author>
<author><name sortKey="Inkpen, Diana" sort="Inkpen, Diana" uniqKey="Inkpen D" first="Diana" last="Inkpen">Diana Inkpen</name>
<affiliation wicri:level="1"><country xml:lang="fr">Canada</country>
<wicri:regionArea>University of Ottawa, Ottawa</wicri:regionArea>
<wicri:noRegion>Ottawa</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Canada</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="s">Lecture Notes in Computer Science</title>
<imprint><date>2011</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">A5985BE5EDF2A8F996A278699D22B8E28B3D8736</idno>
<idno type="DOI">10.1007/978-3-642-21043-3_23</idno>
<idno type="ChapterID">23</idno>
<idno type="ChapterID">Chap23</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Abstract: This paper proposes an unsupervised approach that automatically detects and corrects a text containing multiple errors of both syntactic and semantic nature. The number of errors that can be corrected is equal to the number of correct words in the text. Error types include, but are not limited to: spelling errors, real-word spelling errors, typographical errors, unwanted words, missing words, prepositional errors, punctuation errors, and many of the grammatical errors (e.g., errors in agreement and verb formation).</div>
</front>
</TEI>
<affiliations><list><country><li>Canada</li>
</country>
</list>
<tree><country name="Canada"><noRegion><name sortKey="Islam, Aminul" sort="Islam, Aminul" uniqKey="Islam A" first="Aminul" last="Islam">Aminul Islam</name>
</noRegion>
<name sortKey="Inkpen, Diana" sort="Inkpen, Diana" uniqKey="Inkpen D" first="Diana" last="Inkpen">Diana Inkpen</name>
<name sortKey="Inkpen, Diana" sort="Inkpen, Diana" uniqKey="Inkpen D" first="Diana" last="Inkpen">Diana Inkpen</name>
<name sortKey="Islam, Aminul" sort="Islam, Aminul" uniqKey="Islam A" first="Aminul" last="Islam">Aminul Islam</name>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Istex/Checkpoint
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000049 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Istex/Checkpoint/biblio.hfd -nk 000049 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Istex |étape= Checkpoint |type= RBID |clé= ISTEX:A5985BE5EDF2A8F996A278699D22B8E28B3D8736 |texte= Correcting Different Types of Errors in Texts }}
This area was generated with Dilib version V0.6.32. |