Symbolic Learning Techniques in Paper Document Processing
Identifieur interne : 001F70 ( Main/Merge ); précédent : 001F69; suivant : 001F71Symbolic Learning Techniques in Paper Document Processing
Auteurs : Oronzo Altamura [Italie] ; Floriana Esposito [Italie] ; A. Lisi [Italie] ; Donato Malerba [Italie]Source :
- Lecture Notes in Computer Science [ 0302-9743 ] ; 1999.
Abstract
Abstract: WISDOM++ is an intelligent document processing system that transforms a paper document into HTML/XML format. The main design requirement is adaptivity, which is realized through the application of machine learning methods. This paper illustrates the application of symbolic learning algorithms to the first three steps of document processing, namely document analysis, document classification and document understanding. Machine learning issues related to the application are: Efficient incremental induction of decision trees from numeric data, handling of both numeric and symbolic data in first-order rule learning, learning mutually dependent concepts. Experimental results obtained on a set of real-world documents are illustrated and commented.
Url:
DOI: 10.1007/3-540-48097-8_13
Links toward previous steps (curation, corpus...)
- to stream Istex, to step Corpus: 000640
- to stream Istex, to step Curation: 000632
- to stream Istex, to step Checkpoint: 001412
Links to Exploration step
ISTEX:77924B0D6E2EFA43ECD671FF043BF689719DBD73Le document en format XML
<record><TEI wicri:istexFullTextTei="biblStruct:series"><teiHeader><fileDesc><titleStmt><title xml:lang="en">Symbolic Learning Techniques in Paper Document Processing</title>
<author><name sortKey="Altamura, Oronzo" sort="Altamura, Oronzo" uniqKey="Altamura O" first="Oronzo" last="Altamura">Oronzo Altamura</name>
</author>
<author><name sortKey="Esposito, Floriana" sort="Esposito, Floriana" uniqKey="Esposito F" first="Floriana" last="Esposito">Floriana Esposito</name>
</author>
<author><name sortKey="Lisi, A" sort="Lisi, A" uniqKey="Lisi A" first="A." last="Lisi">A. Lisi</name>
</author>
<author><name sortKey="Malerba, Donato" sort="Malerba, Donato" uniqKey="Malerba D" first="Donato" last="Malerba">Donato Malerba</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:77924B0D6E2EFA43ECD671FF043BF689719DBD73</idno>
<date when="1999" year="1999">1999</date>
<idno type="doi">10.1007/3-540-48097-8_13</idno>
<idno type="url">https://api.istex.fr/document/77924B0D6E2EFA43ECD671FF043BF689719DBD73/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000640</idno>
<idno type="wicri:Area/Istex/Curation">000632</idno>
<idno type="wicri:Area/Istex/Checkpoint">001412</idno>
<idno type="wicri:doubleKey">0302-9743:1999:Altamura O:symbolic:learning:techniques</idno>
<idno type="wicri:Area/Main/Merge">001F70</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">Symbolic Learning Techniques in Paper Document Processing</title>
<author><name sortKey="Altamura, Oronzo" sort="Altamura, Oronzo" uniqKey="Altamura O" first="Oronzo" last="Altamura">Oronzo Altamura</name>
<affiliation wicri:level="1"><country xml:lang="fr">Italie</country>
<wicri:regionArea>Dipartimento di Informatica, Università degli Studi di Bari, via Orabona 4, 70126, Bari</wicri:regionArea>
<wicri:noRegion>Bari</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Italie</country>
</affiliation>
</author>
<author><name sortKey="Esposito, Floriana" sort="Esposito, Floriana" uniqKey="Esposito F" first="Floriana" last="Esposito">Floriana Esposito</name>
<affiliation wicri:level="1"><country xml:lang="fr">Italie</country>
<wicri:regionArea>Dipartimento di Informatica, Università degli Studi di Bari, via Orabona 4, 70126, Bari</wicri:regionArea>
<wicri:noRegion>Bari</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Italie</country>
</affiliation>
</author>
<author><name sortKey="Lisi, A" sort="Lisi, A" uniqKey="Lisi A" first="A." last="Lisi">A. Lisi</name>
<affiliation wicri:level="1"><country xml:lang="fr">Italie</country>
<wicri:regionArea>Dipartimento di Informatica, Università degli Studi di Bari, via Orabona 4, 70126, Bari</wicri:regionArea>
<wicri:noRegion>Bari</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Italie</country>
</affiliation>
</author>
<author><name sortKey="Malerba, Donato" sort="Malerba, Donato" uniqKey="Malerba D" first="Donato" last="Malerba">Donato Malerba</name>
<affiliation wicri:level="1"><country xml:lang="fr">Italie</country>
<wicri:regionArea>Dipartimento di Informatica, Università degli Studi di Bari, via Orabona 4, 70126, Bari</wicri:regionArea>
<wicri:noRegion>Bari</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Italie</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="s">Lecture Notes in Computer Science</title>
<title level="s" type="sub">Lecture Notes in Artificial Intelligence</title>
<imprint><date>1999</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">77924B0D6E2EFA43ECD671FF043BF689719DBD73</idno>
<idno type="DOI">10.1007/3-540-48097-8_13</idno>
<idno type="ChapterID">13</idno>
<idno type="ChapterID">Chap13</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Abstract: WISDOM++ is an intelligent document processing system that transforms a paper document into HTML/XML format. The main design requirement is adaptivity, which is realized through the application of machine learning methods. This paper illustrates the application of symbolic learning algorithms to the first three steps of document processing, namely document analysis, document classification and document understanding. Machine learning issues related to the application are: Efficient incremental induction of decision trees from numeric data, handling of both numeric and symbolic data in first-order rule learning, learning mutually dependent concepts. Experimental results obtained on a set of real-world documents are illustrated and commented.</div>
</front>
</TEI>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001F70 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Merge/biblio.hfd -nk 001F70 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Merge |type= RBID |clé= ISTEX:77924B0D6E2EFA43ECD671FF043BF689719DBD73 |texte= Symbolic Learning Techniques in Paper Document Processing }}
This area was generated with Dilib version V0.6.32. |