Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Automatic table detection in document images

Identifieur interne : 000451 ( PascalFrancis/Corpus ); précédent : 000450; suivant : 000452

Automatic table detection in document images

Auteurs : Basilios Gatos ; Dimitrios Danatsas ; Ioannis Pratikakis ; Stavros J. Perantonis

Source :

RBID : Pascal:05-0390700

Descripteurs français

English descriptors

Abstract

In this paper, we propose a novel technique for automatic table detection in document images. Lines and tables are among the most frequent graphic, non-textual entities in documents and their detection is directly related to the OCR performance as well as to the document layout description. We propose a workflow for table detection that comprises three distinct steps: (i) image preprocessing; (ii) horizontal and vertical line detection and (iii) table detection. The efficiency of the proposed method is demonstrated by using a performance evaluation scheme which considers a great variety of documents such as forms, newspapers/magazines, scientific journals, tickets/bank cheques, certificates and handwritten documents.

Notice en format standard (ISO 2709)

Pour connaître la documentation sur le format Inist Standard.

pA  
A01 01  1    @0 0302-9743
A05       @2 3686
A08 01  1  ENG  @1 Automatic table detection in document images
A09 01  1  ENG  @1 Pattern recognition and data mining : Bath, 22-25 august 2005
A11 01  1    @1 GATOS (Basilios)
A11 02  1    @1 DANATSAS (Dimitrios)
A11 03  1    @1 PRATIKAKIS (Ioannis)
A11 04  1    @1 PERANTONIS (Stavros J.)
A12 01  1    @1 SINGH (Sameer) @9 ed.
A12 02  1    @1 SINGH (Maneesha) @9 ed.
A12 03  1    @1 APTE (Chid) @9 ed.
A12 04  1    @1 PERNER (Petra) @9 ed.
A14 01      @1 Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, National Center for Scientific Research "Demokritos" @2 15310 Athens @3 GRC @Z 1 aut. @Z 2 aut. @Z 3 aut. @Z 4 aut.
A20       @2 Part I, 609-618
A21       @1 2005
A23 01      @0 ENG
A26 01      @0 3-540-28757-4
A43 01      @1 INIST @2 16343 @5 354000124412760670
A44       @0 0000 @1 © 2005 INIST-CNRS. All rights reserved.
A45       @0 11 ref.
A47 01  1    @0 05-0390700
A60       @1 P @2 C
A61       @0 A
A64 01  1    @0 Lecture notes in computer science
A66 01      @0 DEU
C01 01    ENG  @0 In this paper, we propose a novel technique for automatic table detection in document images. Lines and tables are among the most frequent graphic, non-textual entities in documents and their detection is directly related to the OCR performance as well as to the document layout description. We propose a workflow for table detection that comprises three distinct steps: (i) image preprocessing; (ii) horizontal and vertical line detection and (iii) table detection. The efficiency of the proposed method is demonstrated by using a performance evaluation scheme which considers a great variety of documents such as forms, newspapers/magazines, scientific journals, tickets/bank cheques, certificates and handwritten documents.
C02 01  X    @0 001D02B07D
C02 02  X    @0 001D02B07B
C03 01  X  FRE  @0 Fouille donnée @5 01
C03 01  X  ENG  @0 Data mining @5 01
C03 01  X  SPA  @0 Busca dato @5 01
C03 02  X  FRE  @0 Reconnaissance forme @5 02
C03 02  X  ENG  @0 Pattern recognition @5 02
C03 02  X  SPA  @0 Reconocimiento patrón @5 02
C03 03  X  FRE  @0 Mesure automatique @5 06
C03 03  X  ENG  @0 Automatic measurement @5 06
C03 03  X  SPA  @0 Medición automática @5 06
C03 04  X  FRE  @0 Détecteur image @5 07
C03 04  X  ENG  @0 Image sensor @5 07
C03 04  X  SPA  @0 Detector imagen @5 07
C03 05  X  FRE  @0 Représentation graphique @5 08
C03 05  X  ENG  @0 Graphics @5 08
C03 05  X  SPA  @0 Grafo (curva) @5 08
C03 06  X  FRE  @0 Texte @5 09
C03 06  X  ENG  @0 Text @5 09
C03 06  X  SPA  @0 Texto @5 09
C03 07  X  FRE  @0 Reconnaissance caractère @5 10
C03 07  X  ENG  @0 Character recognition @5 10
C03 07  X  SPA  @0 Reconocimiento carácter @5 10
C03 08  X  FRE  @0 Reconnaissance optique caractère @5 11
C03 08  X  ENG  @0 Optical character recognition @5 11
C03 08  X  SPA  @0 Reconocimento óptico de caracteres @5 11
C03 09  X  FRE  @0 Collecticiel @5 12
C03 09  X  ENG  @0 Groupware @5 12
C03 09  X  SPA  @0 Groupware @5 12
C03 10  X  FRE  @0 Workflow @5 13
C03 10  X  ENG  @0 Workflow @5 13
C03 10  X  SPA  @0 Workflow @5 13
C03 11  X  FRE  @0 Détection contour @5 14
C03 11  X  ENG  @0 Edge detection @5 14
C03 11  X  SPA  @0 Detección contorno @5 14
C03 12  X  FRE  @0 Caractère manuscrit @5 15
C03 12  X  ENG  @0 Manuscript character @5 15
C03 12  X  SPA  @0 Carácter manuscrito @5 15
C03 13  X  FRE  @0 Présentation document @5 18
C03 13  X  ENG  @0 Document layout @5 18
C03 13  X  SPA  @0 Presentación documento @5 18
C03 14  X  FRE  @0 Evaluation performance @5 19
C03 14  X  ENG  @0 Performance evaluation @5 19
C03 14  X  SPA  @0 Evaluación prestación @5 19
C03 15  X  FRE  @0 Chèque bancaire @5 20
C03 15  X  ENG  @0 Bank check @5 20
C03 15  X  SPA  @0 Cheque bancario @5 20
N21       @1 276
N44 01      @1 OTO
N82       @1 OTO
pR  
A30 01  1  ENG  @1 ICAPR : international conference on advances in pattern recognition @3 Bath GBR @4 2005-08-22

Format Inist (serveur)

NO : PASCAL 05-0390700 INIST
ET : Automatic table detection in document images
AU : GATOS (Basilios); DANATSAS (Dimitrios); PRATIKAKIS (Ioannis); PERANTONIS (Stavros J.); SINGH (Sameer); SINGH (Maneesha); APTE (Chid); PERNER (Petra)
AF : Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, National Center for Scientific Research "Demokritos"/15310 Athens/Grèce (1 aut., 2 aut., 3 aut., 4 aut.)
DT : Publication en série; Congrès; Niveau analytique
SO : Lecture notes in computer science; ISSN 0302-9743; Allemagne; Da. 2005; Vol. 3686; Part I, 609-618; Bibl. 11 ref.
LA : Anglais
EA : In this paper, we propose a novel technique for automatic table detection in document images. Lines and tables are among the most frequent graphic, non-textual entities in documents and their detection is directly related to the OCR performance as well as to the document layout description. We propose a workflow for table detection that comprises three distinct steps: (i) image preprocessing; (ii) horizontal and vertical line detection and (iii) table detection. The efficiency of the proposed method is demonstrated by using a performance evaluation scheme which considers a great variety of documents such as forms, newspapers/magazines, scientific journals, tickets/bank cheques, certificates and handwritten documents.
CC : 001D02B07D; 001D02B07B
FD : Fouille donnée; Reconnaissance forme; Mesure automatique; Détecteur image; Représentation graphique; Texte; Reconnaissance caractère; Reconnaissance optique caractère; Collecticiel; Workflow; Détection contour; Caractère manuscrit; Présentation document; Evaluation performance; Chèque bancaire
ED : Data mining; Pattern recognition; Automatic measurement; Image sensor; Graphics; Text; Character recognition; Optical character recognition; Groupware; Workflow; Edge detection; Manuscript character; Document layout; Performance evaluation; Bank check
SD : Busca dato; Reconocimiento patrón; Medición automática; Detector imagen; Grafo (curva); Texto; Reconocimiento carácter; Reconocimento óptico de caracteres; Groupware; Workflow; Detección contorno; Carácter manuscrito; Presentación documento; Evaluación prestación; Cheque bancario
LO : INIST-16343.354000124412760670
ID : 05-0390700

Links to Exploration step

Pascal:05-0390700

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Automatic table detection in document images</title>
<author>
<name sortKey="Gatos, Basilios" sort="Gatos, Basilios" uniqKey="Gatos B" first="Basilios" last="Gatos">Basilios Gatos</name>
<affiliation>
<inist:fA14 i1="01">
<s1>Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, National Center for Scientific Research "Demokritos"</s1>
<s2>15310 Athens</s2>
<s3>GRC</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Danatsas, Dimitrios" sort="Danatsas, Dimitrios" uniqKey="Danatsas D" first="Dimitrios" last="Danatsas">Dimitrios Danatsas</name>
<affiliation>
<inist:fA14 i1="01">
<s1>Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, National Center for Scientific Research "Demokritos"</s1>
<s2>15310 Athens</s2>
<s3>GRC</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Pratikakis, Ioannis" sort="Pratikakis, Ioannis" uniqKey="Pratikakis I" first="Ioannis" last="Pratikakis">Ioannis Pratikakis</name>
<affiliation>
<inist:fA14 i1="01">
<s1>Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, National Center for Scientific Research "Demokritos"</s1>
<s2>15310 Athens</s2>
<s3>GRC</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Perantonis, Stavros J" sort="Perantonis, Stavros J" uniqKey="Perantonis S" first="Stavros J." last="Perantonis">Stavros J. Perantonis</name>
<affiliation>
<inist:fA14 i1="01">
<s1>Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, National Center for Scientific Research "Demokritos"</s1>
<s2>15310 Athens</s2>
<s3>GRC</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">05-0390700</idno>
<date when="2005">2005</date>
<idno type="stanalyst">PASCAL 05-0390700 INIST</idno>
<idno type="RBID">Pascal:05-0390700</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000451</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Automatic table detection in document images</title>
<author>
<name sortKey="Gatos, Basilios" sort="Gatos, Basilios" uniqKey="Gatos B" first="Basilios" last="Gatos">Basilios Gatos</name>
<affiliation>
<inist:fA14 i1="01">
<s1>Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, National Center for Scientific Research "Demokritos"</s1>
<s2>15310 Athens</s2>
<s3>GRC</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Danatsas, Dimitrios" sort="Danatsas, Dimitrios" uniqKey="Danatsas D" first="Dimitrios" last="Danatsas">Dimitrios Danatsas</name>
<affiliation>
<inist:fA14 i1="01">
<s1>Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, National Center for Scientific Research "Demokritos"</s1>
<s2>15310 Athens</s2>
<s3>GRC</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Pratikakis, Ioannis" sort="Pratikakis, Ioannis" uniqKey="Pratikakis I" first="Ioannis" last="Pratikakis">Ioannis Pratikakis</name>
<affiliation>
<inist:fA14 i1="01">
<s1>Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, National Center for Scientific Research "Demokritos"</s1>
<s2>15310 Athens</s2>
<s3>GRC</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Perantonis, Stavros J" sort="Perantonis, Stavros J" uniqKey="Perantonis S" first="Stavros J." last="Perantonis">Stavros J. Perantonis</name>
<affiliation>
<inist:fA14 i1="01">
<s1>Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, National Center for Scientific Research "Demokritos"</s1>
<s2>15310 Athens</s2>
<s3>GRC</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">Lecture notes in computer science</title>
<idno type="ISSN">0302-9743</idno>
<imprint>
<date when="2005">2005</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Lecture notes in computer science</title>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Automatic measurement</term>
<term>Bank check</term>
<term>Character recognition</term>
<term>Data mining</term>
<term>Document layout</term>
<term>Edge detection</term>
<term>Graphics</term>
<term>Groupware</term>
<term>Image sensor</term>
<term>Manuscript character</term>
<term>Optical character recognition</term>
<term>Pattern recognition</term>
<term>Performance evaluation</term>
<term>Text</term>
<term>Workflow</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Fouille donnée</term>
<term>Reconnaissance forme</term>
<term>Mesure automatique</term>
<term>Détecteur image</term>
<term>Représentation graphique</term>
<term>Texte</term>
<term>Reconnaissance caractère</term>
<term>Reconnaissance optique caractère</term>
<term>Collecticiel</term>
<term>Workflow</term>
<term>Détection contour</term>
<term>Caractère manuscrit</term>
<term>Présentation document</term>
<term>Evaluation performance</term>
<term>Chèque bancaire</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">In this paper, we propose a novel technique for automatic table detection in document images. Lines and tables are among the most frequent graphic, non-textual entities in documents and their detection is directly related to the OCR performance as well as to the document layout description. We propose a workflow for table detection that comprises three distinct steps: (i) image preprocessing; (ii) horizontal and vertical line detection and (iii) table detection. The efficiency of the proposed method is demonstrated by using a performance evaluation scheme which considers a great variety of documents such as forms, newspapers/magazines, scientific journals, tickets/bank cheques, certificates and handwritten documents.</div>
</front>
</TEI>
<inist>
<standard h6="B">
<pA>
<fA01 i1="01" i2="1">
<s0>0302-9743</s0>
</fA01>
<fA05>
<s2>3686</s2>
</fA05>
<fA08 i1="01" i2="1" l="ENG">
<s1>Automatic table detection in document images</s1>
</fA08>
<fA09 i1="01" i2="1" l="ENG">
<s1>Pattern recognition and data mining : Bath, 22-25 august 2005</s1>
</fA09>
<fA11 i1="01" i2="1">
<s1>GATOS (Basilios)</s1>
</fA11>
<fA11 i1="02" i2="1">
<s1>DANATSAS (Dimitrios)</s1>
</fA11>
<fA11 i1="03" i2="1">
<s1>PRATIKAKIS (Ioannis)</s1>
</fA11>
<fA11 i1="04" i2="1">
<s1>PERANTONIS (Stavros J.)</s1>
</fA11>
<fA12 i1="01" i2="1">
<s1>SINGH (Sameer)</s1>
<s9>ed.</s9>
</fA12>
<fA12 i1="02" i2="1">
<s1>SINGH (Maneesha)</s1>
<s9>ed.</s9>
</fA12>
<fA12 i1="03" i2="1">
<s1>APTE (Chid)</s1>
<s9>ed.</s9>
</fA12>
<fA12 i1="04" i2="1">
<s1>PERNER (Petra)</s1>
<s9>ed.</s9>
</fA12>
<fA14 i1="01">
<s1>Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, National Center for Scientific Research "Demokritos"</s1>
<s2>15310 Athens</s2>
<s3>GRC</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</fA14>
<fA20>
<s2>Part I, 609-618</s2>
</fA20>
<fA21>
<s1>2005</s1>
</fA21>
<fA23 i1="01">
<s0>ENG</s0>
</fA23>
<fA26 i1="01">
<s0>3-540-28757-4</s0>
</fA26>
<fA43 i1="01">
<s1>INIST</s1>
<s2>16343</s2>
<s5>354000124412760670</s5>
</fA43>
<fA44>
<s0>0000</s0>
<s1>© 2005 INIST-CNRS. All rights reserved.</s1>
</fA44>
<fA45>
<s0>11 ref.</s0>
</fA45>
<fA47 i1="01" i2="1">
<s0>05-0390700</s0>
</fA47>
<fA60>
<s1>P</s1>
<s2>C</s2>
</fA60>
<fA61>
<s0>A</s0>
</fA61>
<fA64 i1="01" i2="1">
<s0>Lecture notes in computer science</s0>
</fA64>
<fA66 i1="01">
<s0>DEU</s0>
</fA66>
<fC01 i1="01" l="ENG">
<s0>In this paper, we propose a novel technique for automatic table detection in document images. Lines and tables are among the most frequent graphic, non-textual entities in documents and their detection is directly related to the OCR performance as well as to the document layout description. We propose a workflow for table detection that comprises three distinct steps: (i) image preprocessing; (ii) horizontal and vertical line detection and (iii) table detection. The efficiency of the proposed method is demonstrated by using a performance evaluation scheme which considers a great variety of documents such as forms, newspapers/magazines, scientific journals, tickets/bank cheques, certificates and handwritten documents.</s0>
</fC01>
<fC02 i1="01" i2="X">
<s0>001D02B07D</s0>
</fC02>
<fC02 i1="02" i2="X">
<s0>001D02B07B</s0>
</fC02>
<fC03 i1="01" i2="X" l="FRE">
<s0>Fouille donnée</s0>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="X" l="ENG">
<s0>Data mining</s0>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="X" l="SPA">
<s0>Busca dato</s0>
<s5>01</s5>
</fC03>
<fC03 i1="02" i2="X" l="FRE">
<s0>Reconnaissance forme</s0>
<s5>02</s5>
</fC03>
<fC03 i1="02" i2="X" l="ENG">
<s0>Pattern recognition</s0>
<s5>02</s5>
</fC03>
<fC03 i1="02" i2="X" l="SPA">
<s0>Reconocimiento patrón</s0>
<s5>02</s5>
</fC03>
<fC03 i1="03" i2="X" l="FRE">
<s0>Mesure automatique</s0>
<s5>06</s5>
</fC03>
<fC03 i1="03" i2="X" l="ENG">
<s0>Automatic measurement</s0>
<s5>06</s5>
</fC03>
<fC03 i1="03" i2="X" l="SPA">
<s0>Medición automática</s0>
<s5>06</s5>
</fC03>
<fC03 i1="04" i2="X" l="FRE">
<s0>Détecteur image</s0>
<s5>07</s5>
</fC03>
<fC03 i1="04" i2="X" l="ENG">
<s0>Image sensor</s0>
<s5>07</s5>
</fC03>
<fC03 i1="04" i2="X" l="SPA">
<s0>Detector imagen</s0>
<s5>07</s5>
</fC03>
<fC03 i1="05" i2="X" l="FRE">
<s0>Représentation graphique</s0>
<s5>08</s5>
</fC03>
<fC03 i1="05" i2="X" l="ENG">
<s0>Graphics</s0>
<s5>08</s5>
</fC03>
<fC03 i1="05" i2="X" l="SPA">
<s0>Grafo (curva)</s0>
<s5>08</s5>
</fC03>
<fC03 i1="06" i2="X" l="FRE">
<s0>Texte</s0>
<s5>09</s5>
</fC03>
<fC03 i1="06" i2="X" l="ENG">
<s0>Text</s0>
<s5>09</s5>
</fC03>
<fC03 i1="06" i2="X" l="SPA">
<s0>Texto</s0>
<s5>09</s5>
</fC03>
<fC03 i1="07" i2="X" l="FRE">
<s0>Reconnaissance caractère</s0>
<s5>10</s5>
</fC03>
<fC03 i1="07" i2="X" l="ENG">
<s0>Character recognition</s0>
<s5>10</s5>
</fC03>
<fC03 i1="07" i2="X" l="SPA">
<s0>Reconocimiento carácter</s0>
<s5>10</s5>
</fC03>
<fC03 i1="08" i2="X" l="FRE">
<s0>Reconnaissance optique caractère</s0>
<s5>11</s5>
</fC03>
<fC03 i1="08" i2="X" l="ENG">
<s0>Optical character recognition</s0>
<s5>11</s5>
</fC03>
<fC03 i1="08" i2="X" l="SPA">
<s0>Reconocimento óptico de caracteres</s0>
<s5>11</s5>
</fC03>
<fC03 i1="09" i2="X" l="FRE">
<s0>Collecticiel</s0>
<s5>12</s5>
</fC03>
<fC03 i1="09" i2="X" l="ENG">
<s0>Groupware</s0>
<s5>12</s5>
</fC03>
<fC03 i1="09" i2="X" l="SPA">
<s0>Groupware</s0>
<s5>12</s5>
</fC03>
<fC03 i1="10" i2="X" l="FRE">
<s0>Workflow</s0>
<s5>13</s5>
</fC03>
<fC03 i1="10" i2="X" l="ENG">
<s0>Workflow</s0>
<s5>13</s5>
</fC03>
<fC03 i1="10" i2="X" l="SPA">
<s0>Workflow</s0>
<s5>13</s5>
</fC03>
<fC03 i1="11" i2="X" l="FRE">
<s0>Détection contour</s0>
<s5>14</s5>
</fC03>
<fC03 i1="11" i2="X" l="ENG">
<s0>Edge detection</s0>
<s5>14</s5>
</fC03>
<fC03 i1="11" i2="X" l="SPA">
<s0>Detección contorno</s0>
<s5>14</s5>
</fC03>
<fC03 i1="12" i2="X" l="FRE">
<s0>Caractère manuscrit</s0>
<s5>15</s5>
</fC03>
<fC03 i1="12" i2="X" l="ENG">
<s0>Manuscript character</s0>
<s5>15</s5>
</fC03>
<fC03 i1="12" i2="X" l="SPA">
<s0>Carácter manuscrito</s0>
<s5>15</s5>
</fC03>
<fC03 i1="13" i2="X" l="FRE">
<s0>Présentation document</s0>
<s5>18</s5>
</fC03>
<fC03 i1="13" i2="X" l="ENG">
<s0>Document layout</s0>
<s5>18</s5>
</fC03>
<fC03 i1="13" i2="X" l="SPA">
<s0>Presentación documento</s0>
<s5>18</s5>
</fC03>
<fC03 i1="14" i2="X" l="FRE">
<s0>Evaluation performance</s0>
<s5>19</s5>
</fC03>
<fC03 i1="14" i2="X" l="ENG">
<s0>Performance evaluation</s0>
<s5>19</s5>
</fC03>
<fC03 i1="14" i2="X" l="SPA">
<s0>Evaluación prestación</s0>
<s5>19</s5>
</fC03>
<fC03 i1="15" i2="X" l="FRE">
<s0>Chèque bancaire</s0>
<s5>20</s5>
</fC03>
<fC03 i1="15" i2="X" l="ENG">
<s0>Bank check</s0>
<s5>20</s5>
</fC03>
<fC03 i1="15" i2="X" l="SPA">
<s0>Cheque bancario</s0>
<s5>20</s5>
</fC03>
<fN21>
<s1>276</s1>
</fN21>
<fN44 i1="01">
<s1>OTO</s1>
</fN44>
<fN82>
<s1>OTO</s1>
</fN82>
</pA>
<pR>
<fA30 i1="01" i2="1" l="ENG">
<s1>ICAPR : international conference on advances in pattern recognition</s1>
<s3>Bath GBR</s3>
<s4>2005-08-22</s4>
</fA30>
</pR>
</standard>
<server>
<NO>PASCAL 05-0390700 INIST</NO>
<ET>Automatic table detection in document images</ET>
<AU>GATOS (Basilios); DANATSAS (Dimitrios); PRATIKAKIS (Ioannis); PERANTONIS (Stavros J.); SINGH (Sameer); SINGH (Maneesha); APTE (Chid); PERNER (Petra)</AU>
<AF>Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, National Center for Scientific Research "Demokritos"/15310 Athens/Grèce (1 aut., 2 aut., 3 aut., 4 aut.)</AF>
<DT>Publication en série; Congrès; Niveau analytique</DT>
<SO>Lecture notes in computer science; ISSN 0302-9743; Allemagne; Da. 2005; Vol. 3686; Part I, 609-618; Bibl. 11 ref.</SO>
<LA>Anglais</LA>
<EA>In this paper, we propose a novel technique for automatic table detection in document images. Lines and tables are among the most frequent graphic, non-textual entities in documents and their detection is directly related to the OCR performance as well as to the document layout description. We propose a workflow for table detection that comprises three distinct steps: (i) image preprocessing; (ii) horizontal and vertical line detection and (iii) table detection. The efficiency of the proposed method is demonstrated by using a performance evaluation scheme which considers a great variety of documents such as forms, newspapers/magazines, scientific journals, tickets/bank cheques, certificates and handwritten documents.</EA>
<CC>001D02B07D; 001D02B07B</CC>
<FD>Fouille donnée; Reconnaissance forme; Mesure automatique; Détecteur image; Représentation graphique; Texte; Reconnaissance caractère; Reconnaissance optique caractère; Collecticiel; Workflow; Détection contour; Caractère manuscrit; Présentation document; Evaluation performance; Chèque bancaire</FD>
<ED>Data mining; Pattern recognition; Automatic measurement; Image sensor; Graphics; Text; Character recognition; Optical character recognition; Groupware; Workflow; Edge detection; Manuscript character; Document layout; Performance evaluation; Bank check</ED>
<SD>Busca dato; Reconocimiento patrón; Medición automática; Detector imagen; Grafo (curva); Texto; Reconocimiento carácter; Reconocimento óptico de caracteres; Groupware; Workflow; Detección contorno; Carácter manuscrito; Presentación documento; Evaluación prestación; Cheque bancario</SD>
<LO>INIST-16343.354000124412760670</LO>
<ID>05-0390700</ID>
</server>
</inist>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/PascalFrancis/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000451 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Corpus/biblio.hfd -nk 000451 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    PascalFrancis
   |étape=   Corpus
   |type=    RBID
   |clé=     Pascal:05-0390700
   |texte=   Automatic table detection in document images
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024