Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Symbolic Learning Techniques in Paper Document Processing

Identifieur interne : 001F70 ( Main/Merge ); précédent : 001F69; suivant : 001F71

Symbolic Learning Techniques in Paper Document Processing

Auteurs : Oronzo Altamura [Italie] ; Floriana Esposito [Italie] ; A. Lisi [Italie] ; Donato Malerba [Italie]

Source :

RBID : ISTEX:77924B0D6E2EFA43ECD671FF043BF689719DBD73

Abstract

Abstract: WISDOM++ is an intelligent document processing system that transforms a paper document into HTML/XML format. The main design requirement is adaptivity, which is realized through the application of machine learning methods. This paper illustrates the application of symbolic learning algorithms to the first three steps of document processing, namely document analysis, document classification and document understanding. Machine learning issues related to the application are: Efficient incremental induction of decision trees from numeric data, handling of both numeric and symbolic data in first-order rule learning, learning mutually dependent concepts. Experimental results obtained on a set of real-world documents are illustrated and commented.

Url:
DOI: 10.1007/3-540-48097-8_13

Links toward previous steps (curation, corpus...)


Links to Exploration step

ISTEX:77924B0D6E2EFA43ECD671FF043BF689719DBD73

Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct:series">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Symbolic Learning Techniques in Paper Document Processing</title>
<author>
<name sortKey="Altamura, Oronzo" sort="Altamura, Oronzo" uniqKey="Altamura O" first="Oronzo" last="Altamura">Oronzo Altamura</name>
</author>
<author>
<name sortKey="Esposito, Floriana" sort="Esposito, Floriana" uniqKey="Esposito F" first="Floriana" last="Esposito">Floriana Esposito</name>
</author>
<author>
<name sortKey="Lisi, A" sort="Lisi, A" uniqKey="Lisi A" first="A." last="Lisi">A. Lisi</name>
</author>
<author>
<name sortKey="Malerba, Donato" sort="Malerba, Donato" uniqKey="Malerba D" first="Donato" last="Malerba">Donato Malerba</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:77924B0D6E2EFA43ECD671FF043BF689719DBD73</idno>
<date when="1999" year="1999">1999</date>
<idno type="doi">10.1007/3-540-48097-8_13</idno>
<idno type="url">https://api.istex.fr/document/77924B0D6E2EFA43ECD671FF043BF689719DBD73/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000640</idno>
<idno type="wicri:Area/Istex/Curation">000632</idno>
<idno type="wicri:Area/Istex/Checkpoint">001412</idno>
<idno type="wicri:doubleKey">0302-9743:1999:Altamura O:symbolic:learning:techniques</idno>
<idno type="wicri:Area/Main/Merge">001F70</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Symbolic Learning Techniques in Paper Document Processing</title>
<author>
<name sortKey="Altamura, Oronzo" sort="Altamura, Oronzo" uniqKey="Altamura O" first="Oronzo" last="Altamura">Oronzo Altamura</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Italie</country>
<wicri:regionArea>Dipartimento di Informatica, Università degli Studi di Bari, via Orabona 4, 70126, Bari</wicri:regionArea>
<wicri:noRegion>Bari</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Italie</country>
</affiliation>
</author>
<author>
<name sortKey="Esposito, Floriana" sort="Esposito, Floriana" uniqKey="Esposito F" first="Floriana" last="Esposito">Floriana Esposito</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Italie</country>
<wicri:regionArea>Dipartimento di Informatica, Università degli Studi di Bari, via Orabona 4, 70126, Bari</wicri:regionArea>
<wicri:noRegion>Bari</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Italie</country>
</affiliation>
</author>
<author>
<name sortKey="Lisi, A" sort="Lisi, A" uniqKey="Lisi A" first="A." last="Lisi">A. Lisi</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Italie</country>
<wicri:regionArea>Dipartimento di Informatica, Università degli Studi di Bari, via Orabona 4, 70126, Bari</wicri:regionArea>
<wicri:noRegion>Bari</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Italie</country>
</affiliation>
</author>
<author>
<name sortKey="Malerba, Donato" sort="Malerba, Donato" uniqKey="Malerba D" first="Donato" last="Malerba">Donato Malerba</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Italie</country>
<wicri:regionArea>Dipartimento di Informatica, Università degli Studi di Bari, via Orabona 4, 70126, Bari</wicri:regionArea>
<wicri:noRegion>Bari</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Italie</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<title level="s" type="sub">Lecture Notes in Artificial Intelligence</title>
<imprint>
<date>1999</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">77924B0D6E2EFA43ECD671FF043BF689719DBD73</idno>
<idno type="DOI">10.1007/3-540-48097-8_13</idno>
<idno type="ChapterID">13</idno>
<idno type="ChapterID">Chap13</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: WISDOM++ is an intelligent document processing system that transforms a paper document into HTML/XML format. The main design requirement is adaptivity, which is realized through the application of machine learning methods. This paper illustrates the application of symbolic learning algorithms to the first three steps of document processing, namely document analysis, document classification and document understanding. Machine learning issues related to the application are: Efficient incremental induction of decision trees from numeric data, handling of both numeric and symbolic data in first-order rule learning, learning mutually dependent concepts. Experimental results obtained on a set of real-world documents are illustrated and commented.</div>
</front>
</TEI>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001F70 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Merge/biblio.hfd -nk 001F70 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Merge
   |type=    RBID
   |clé=     ISTEX:77924B0D6E2EFA43ECD671FF043BF689719DBD73
   |texte=   Symbolic Learning Techniques in Paper Document Processing
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024