Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Document Image Database Indexing with Pictorial Dictionary

Identifieur interne : 000172 ( PascalFrancis/Corpus ); précédent : 000171; suivant : 000173

Document Image Database Indexing with Pictorial Dictionary

Auteurs : Mohammad Akbari ; Reza Azmi

Source :

RBID : Pascal:10-0393832

Descripteurs français

English descriptors

Abstract

In this paper we introduce a new approach for information retrieval from Persian document image database without using Optical Character Recognition (OCR).At first an attribute called subword upper contour label is defined then, a pictorial dictionary is constructed based on this attribute for the subwords. By this approach we address two issues in document image retrieval: keyword spotting and retrieval according to the document similarities. The proposed methods have been evaluated on a Persian document image database. The results have proved the ability of this approach in document image information retrieval.

Notice en format standard (ISO 2709)

Pour connaître la documentation sur le format Inist Standard.

pA  
A01 01  1    @0 0277-786X
A02 01      @0 PSISDG
A03   1    @0 Proc. SPIE Int. Soc. Opt. Eng.
A05       @2 7546
A06       @3 p. 2
A08 01  1  ENG  @1 Document Image Database Indexing with Pictorial Dictionary
A09 01  1  ENG  @1 Second International Conference on Digital Image Processing : 26-28 February 2010, Singapore
A11 01  1    @1 AKBARI (Mohammad)
A11 02  1    @1 AZMI (Reza)
A12 01  1    @1 KAMARUZAMAN JUSOFF @9 ed.
A12 02  1    @1 XIE (Yi) @9 ed.
A14 01      @1 Engineering Department,I.A.U., Shahr-e-Qods Branch @2 Tehran @3 IRN @Z 1 aut.
A14 02      @1 Engineering Department, Alzahra University @2 Tehran @3 IRN @Z 2 aut.
A18 01  1    @1 SPIE @3 USA @9 org-cong.
A18 02  1    @1 International Association of Computer Science and Information Technology @3 USA @9 org-cong.
A20       @2 75462R.1-75462R.6
A21       @1 2010
A23 01      @0 ENG
A25 01      @1 SPIE @2 Bellingham, Wash.
A26 01      @0 978-0-8194-7942-6
A26 02      @0 0-8194-7942-X
A43 01      @1 INIST @2 21760 @5 354000174686700980
A44       @0 0000 @1 © 2010 INIST-CNRS. All rights reserved.
A45       @0 20 ref.
A47 01  1    @0 10-0393832
A60       @1 P @2 C
A61       @0 A
A64 01  1    @0 Proceedings of SPIE, the International Society for Optical Engineering
A66 01      @0 USA
C01 01    ENG  @0 In this paper we introduce a new approach for information retrieval from Persian document image database without using Optical Character Recognition (OCR).At first an attribute called subword upper contour label is defined then, a pictorial dictionary is constructed based on this attribute for the subwords. By this approach we address two issues in document image retrieval: keyword spotting and retrieval according to the document similarities. The proposed methods have been evaluated on a Persian document image database. The results have proved the ability of this approach in document image information retrieval.
C02 01  3    @0 001B00A30C
C02 02  3    @0 001B00G05P
C02 03  3    @0 001B40B30V
C03 01  3  FRE  @0 Recherche information @5 61
C03 01  3  ENG  @0 Information retrieval @5 61
C03 02  3  FRE  @0 Traitement image @5 62
C03 02  3  ENG  @0 Image processing @5 62
C03 03  X  FRE  @0 Image numérique @5 63
C03 03  X  ENG  @0 Digital image @5 63
C03 03  X  SPA  @0 Imagen numérica @5 63
C03 04  3  FRE  @0 Traitement image document @5 64
C03 04  3  ENG  @0 Document image processing @5 64
C03 05  X  FRE  @0 Banque image @5 65
C03 05  X  ENG  @0 Image databank @5 65
C03 05  X  SPA  @0 Banco imagen @5 65
C03 06  3  FRE  @0 Indexation @5 66
C03 06  3  ENG  @0 Indexing @5 66
C03 07  3  FRE  @0 Dictionnaire @5 67
C03 07  3  ENG  @0 Dictionaries @5 67
C03 08  3  FRE  @0 Reconnaissance optique caractère @5 68
C03 08  3  ENG  @0 Optical character recognition @5 68
C03 09  X  FRE  @0 Recherche documentaire @5 69
C03 09  X  ENG  @0 Document retrieval @5 69
C03 09  X  SPA  @0 Búsqueda documental @5 69
C03 10  3  FRE  @0 0130C @4 INC @5 83
C03 11  3  FRE  @0 0705P @4 INC @5 84
C03 12  3  FRE  @0 4230V @4 INC @5 91
N21       @1 256
N44 01      @1 OTO
N82       @1 OTO
pR  
A30 01  1  ENG  @1 International Conference on Digital Image Processing @2 02 @3 Singapore SGP @4 2010

Format Inist (serveur)

NO : PASCAL 10-0393832 INIST
ET : Document Image Database Indexing with Pictorial Dictionary
AU : AKBARI (Mohammad); AZMI (Reza); KAMARUZAMAN JUSOFF; XIE (Yi)
AF : Engineering Department,I.A.U., Shahr-e-Qods Branch/Tehran/Iran (1 aut.); Engineering Department, Alzahra University/Tehran/Iran (2 aut.)
DT : Publication en série; Congrès; Niveau analytique
SO : Proceedings of SPIE, the International Society for Optical Engineering; ISSN 0277-786X; Coden PSISDG; Etats-Unis; Da. 2010; Vol. 7546; No. p. 2; 75462R.1-75462R.6; Bibl. 20 ref.
LA : Anglais
EA : In this paper we introduce a new approach for information retrieval from Persian document image database without using Optical Character Recognition (OCR).At first an attribute called subword upper contour label is defined then, a pictorial dictionary is constructed based on this attribute for the subwords. By this approach we address two issues in document image retrieval: keyword spotting and retrieval according to the document similarities. The proposed methods have been evaluated on a Persian document image database. The results have proved the ability of this approach in document image information retrieval.
CC : 001B00A30C; 001B00G05P; 001B40B30V
FD : Recherche information; Traitement image; Image numérique; Traitement image document; Banque image; Indexation; Dictionnaire; Reconnaissance optique caractère; Recherche documentaire; 0130C; 0705P; 4230V
ED : Information retrieval; Image processing; Digital image; Document image processing; Image databank; Indexing; Dictionaries; Optical character recognition; Document retrieval
SD : Imagen numérica; Banco imagen; Búsqueda documental
LO : INIST-21760.354000174686700980
ID : 10-0393832

Links to Exploration step

Pascal:10-0393832

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Document Image Database Indexing with Pictorial Dictionary</title>
<author>
<name sortKey="Akbari, Mohammad" sort="Akbari, Mohammad" uniqKey="Akbari M" first="Mohammad" last="Akbari">Mohammad Akbari</name>
<affiliation>
<inist:fA14 i1="01">
<s1>Engineering Department,I.A.U., Shahr-e-Qods Branch</s1>
<s2>Tehran</s2>
<s3>IRN</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Azmi, Reza" sort="Azmi, Reza" uniqKey="Azmi R" first="Reza" last="Azmi">Reza Azmi</name>
<affiliation>
<inist:fA14 i1="02">
<s1>Engineering Department, Alzahra University</s1>
<s2>Tehran</s2>
<s3>IRN</s3>
<sZ>2 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">10-0393832</idno>
<date when="2010">2010</date>
<idno type="stanalyst">PASCAL 10-0393832 INIST</idno>
<idno type="RBID">Pascal:10-0393832</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000172</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Document Image Database Indexing with Pictorial Dictionary</title>
<author>
<name sortKey="Akbari, Mohammad" sort="Akbari, Mohammad" uniqKey="Akbari M" first="Mohammad" last="Akbari">Mohammad Akbari</name>
<affiliation>
<inist:fA14 i1="01">
<s1>Engineering Department,I.A.U., Shahr-e-Qods Branch</s1>
<s2>Tehran</s2>
<s3>IRN</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author>
<name sortKey="Azmi, Reza" sort="Azmi, Reza" uniqKey="Azmi R" first="Reza" last="Azmi">Reza Azmi</name>
<affiliation>
<inist:fA14 i1="02">
<s1>Engineering Department, Alzahra University</s1>
<s2>Tehran</s2>
<s3>IRN</s3>
<sZ>2 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">Proceedings of SPIE, the International Society for Optical Engineering</title>
<title level="j" type="abbreviated">Proc. SPIE Int. Soc. Opt. Eng.</title>
<idno type="ISSN">0277-786X</idno>
<imprint>
<date when="2010">2010</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Proceedings of SPIE, the International Society for Optical Engineering</title>
<title level="j" type="abbreviated">Proc. SPIE Int. Soc. Opt. Eng.</title>
<idno type="ISSN">0277-786X</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Dictionaries</term>
<term>Digital image</term>
<term>Document image processing</term>
<term>Document retrieval</term>
<term>Image databank</term>
<term>Image processing</term>
<term>Indexing</term>
<term>Information retrieval</term>
<term>Optical character recognition</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Recherche information</term>
<term>Traitement image</term>
<term>Image numérique</term>
<term>Traitement image document</term>
<term>Banque image</term>
<term>Indexation</term>
<term>Dictionnaire</term>
<term>Reconnaissance optique caractère</term>
<term>Recherche documentaire</term>
<term>0130C</term>
<term>0705P</term>
<term>4230V</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">In this paper we introduce a new approach for information retrieval from Persian document image database without using Optical Character Recognition (OCR).At first an attribute called subword upper contour label is defined then, a pictorial dictionary is constructed based on this attribute for the subwords. By this approach we address two issues in document image retrieval: keyword spotting and retrieval according to the document similarities. The proposed methods have been evaluated on a Persian document image database. The results have proved the ability of this approach in document image information retrieval.</div>
</front>
</TEI>
<inist>
<standard h6="B">
<pA>
<fA01 i1="01" i2="1">
<s0>0277-786X</s0>
</fA01>
<fA02 i1="01">
<s0>PSISDG</s0>
</fA02>
<fA03 i2="1">
<s0>Proc. SPIE Int. Soc. Opt. Eng.</s0>
</fA03>
<fA05>
<s2>7546</s2>
</fA05>
<fA06>
<s3>p. 2</s3>
</fA06>
<fA08 i1="01" i2="1" l="ENG">
<s1>Document Image Database Indexing with Pictorial Dictionary</s1>
</fA08>
<fA09 i1="01" i2="1" l="ENG">
<s1>Second International Conference on Digital Image Processing : 26-28 February 2010, Singapore</s1>
</fA09>
<fA11 i1="01" i2="1">
<s1>AKBARI (Mohammad)</s1>
</fA11>
<fA11 i1="02" i2="1">
<s1>AZMI (Reza)</s1>
</fA11>
<fA12 i1="01" i2="1">
<s1>KAMARUZAMAN JUSOFF</s1>
<s9>ed.</s9>
</fA12>
<fA12 i1="02" i2="1">
<s1>XIE (Yi)</s1>
<s9>ed.</s9>
</fA12>
<fA14 i1="01">
<s1>Engineering Department,I.A.U., Shahr-e-Qods Branch</s1>
<s2>Tehran</s2>
<s3>IRN</s3>
<sZ>1 aut.</sZ>
</fA14>
<fA14 i1="02">
<s1>Engineering Department, Alzahra University</s1>
<s2>Tehran</s2>
<s3>IRN</s3>
<sZ>2 aut.</sZ>
</fA14>
<fA18 i1="01" i2="1">
<s1>SPIE</s1>
<s3>USA</s3>
<s9>org-cong.</s9>
</fA18>
<fA18 i1="02" i2="1">
<s1>International Association of Computer Science and Information Technology</s1>
<s3>USA</s3>
<s9>org-cong.</s9>
</fA18>
<fA20>
<s2>75462R.1-75462R.6</s2>
</fA20>
<fA21>
<s1>2010</s1>
</fA21>
<fA23 i1="01">
<s0>ENG</s0>
</fA23>
<fA25 i1="01">
<s1>SPIE</s1>
<s2>Bellingham, Wash.</s2>
</fA25>
<fA26 i1="01">
<s0>978-0-8194-7942-6</s0>
</fA26>
<fA26 i1="02">
<s0>0-8194-7942-X</s0>
</fA26>
<fA43 i1="01">
<s1>INIST</s1>
<s2>21760</s2>
<s5>354000174686700980</s5>
</fA43>
<fA44>
<s0>0000</s0>
<s1>© 2010 INIST-CNRS. All rights reserved.</s1>
</fA44>
<fA45>
<s0>20 ref.</s0>
</fA45>
<fA47 i1="01" i2="1">
<s0>10-0393832</s0>
</fA47>
<fA60>
<s1>P</s1>
<s2>C</s2>
</fA60>
<fA61>
<s0>A</s0>
</fA61>
<fA64 i1="01" i2="1">
<s0>Proceedings of SPIE, the International Society for Optical Engineering</s0>
</fA64>
<fA66 i1="01">
<s0>USA</s0>
</fA66>
<fC01 i1="01" l="ENG">
<s0>In this paper we introduce a new approach for information retrieval from Persian document image database without using Optical Character Recognition (OCR).At first an attribute called subword upper contour label is defined then, a pictorial dictionary is constructed based on this attribute for the subwords. By this approach we address two issues in document image retrieval: keyword spotting and retrieval according to the document similarities. The proposed methods have been evaluated on a Persian document image database. The results have proved the ability of this approach in document image information retrieval.</s0>
</fC01>
<fC02 i1="01" i2="3">
<s0>001B00A30C</s0>
</fC02>
<fC02 i1="02" i2="3">
<s0>001B00G05P</s0>
</fC02>
<fC02 i1="03" i2="3">
<s0>001B40B30V</s0>
</fC02>
<fC03 i1="01" i2="3" l="FRE">
<s0>Recherche information</s0>
<s5>61</s5>
</fC03>
<fC03 i1="01" i2="3" l="ENG">
<s0>Information retrieval</s0>
<s5>61</s5>
</fC03>
<fC03 i1="02" i2="3" l="FRE">
<s0>Traitement image</s0>
<s5>62</s5>
</fC03>
<fC03 i1="02" i2="3" l="ENG">
<s0>Image processing</s0>
<s5>62</s5>
</fC03>
<fC03 i1="03" i2="X" l="FRE">
<s0>Image numérique</s0>
<s5>63</s5>
</fC03>
<fC03 i1="03" i2="X" l="ENG">
<s0>Digital image</s0>
<s5>63</s5>
</fC03>
<fC03 i1="03" i2="X" l="SPA">
<s0>Imagen numérica</s0>
<s5>63</s5>
</fC03>
<fC03 i1="04" i2="3" l="FRE">
<s0>Traitement image document</s0>
<s5>64</s5>
</fC03>
<fC03 i1="04" i2="3" l="ENG">
<s0>Document image processing</s0>
<s5>64</s5>
</fC03>
<fC03 i1="05" i2="X" l="FRE">
<s0>Banque image</s0>
<s5>65</s5>
</fC03>
<fC03 i1="05" i2="X" l="ENG">
<s0>Image databank</s0>
<s5>65</s5>
</fC03>
<fC03 i1="05" i2="X" l="SPA">
<s0>Banco imagen</s0>
<s5>65</s5>
</fC03>
<fC03 i1="06" i2="3" l="FRE">
<s0>Indexation</s0>
<s5>66</s5>
</fC03>
<fC03 i1="06" i2="3" l="ENG">
<s0>Indexing</s0>
<s5>66</s5>
</fC03>
<fC03 i1="07" i2="3" l="FRE">
<s0>Dictionnaire</s0>
<s5>67</s5>
</fC03>
<fC03 i1="07" i2="3" l="ENG">
<s0>Dictionaries</s0>
<s5>67</s5>
</fC03>
<fC03 i1="08" i2="3" l="FRE">
<s0>Reconnaissance optique caractère</s0>
<s5>68</s5>
</fC03>
<fC03 i1="08" i2="3" l="ENG">
<s0>Optical character recognition</s0>
<s5>68</s5>
</fC03>
<fC03 i1="09" i2="X" l="FRE">
<s0>Recherche documentaire</s0>
<s5>69</s5>
</fC03>
<fC03 i1="09" i2="X" l="ENG">
<s0>Document retrieval</s0>
<s5>69</s5>
</fC03>
<fC03 i1="09" i2="X" l="SPA">
<s0>Búsqueda documental</s0>
<s5>69</s5>
</fC03>
<fC03 i1="10" i2="3" l="FRE">
<s0>0130C</s0>
<s4>INC</s4>
<s5>83</s5>
</fC03>
<fC03 i1="11" i2="3" l="FRE">
<s0>0705P</s0>
<s4>INC</s4>
<s5>84</s5>
</fC03>
<fC03 i1="12" i2="3" l="FRE">
<s0>4230V</s0>
<s4>INC</s4>
<s5>91</s5>
</fC03>
<fN21>
<s1>256</s1>
</fN21>
<fN44 i1="01">
<s1>OTO</s1>
</fN44>
<fN82>
<s1>OTO</s1>
</fN82>
</pA>
<pR>
<fA30 i1="01" i2="1" l="ENG">
<s1>International Conference on Digital Image Processing</s1>
<s2>02</s2>
<s3>Singapore SGP</s3>
<s4>2010</s4>
</fA30>
</pR>
</standard>
<server>
<NO>PASCAL 10-0393832 INIST</NO>
<ET>Document Image Database Indexing with Pictorial Dictionary</ET>
<AU>AKBARI (Mohammad); AZMI (Reza); KAMARUZAMAN JUSOFF; XIE (Yi)</AU>
<AF>Engineering Department,I.A.U., Shahr-e-Qods Branch/Tehran/Iran (1 aut.); Engineering Department, Alzahra University/Tehran/Iran (2 aut.)</AF>
<DT>Publication en série; Congrès; Niveau analytique</DT>
<SO>Proceedings of SPIE, the International Society for Optical Engineering; ISSN 0277-786X; Coden PSISDG; Etats-Unis; Da. 2010; Vol. 7546; No. p. 2; 75462R.1-75462R.6; Bibl. 20 ref.</SO>
<LA>Anglais</LA>
<EA>In this paper we introduce a new approach for information retrieval from Persian document image database without using Optical Character Recognition (OCR).At first an attribute called subword upper contour label is defined then, a pictorial dictionary is constructed based on this attribute for the subwords. By this approach we address two issues in document image retrieval: keyword spotting and retrieval according to the document similarities. The proposed methods have been evaluated on a Persian document image database. The results have proved the ability of this approach in document image information retrieval.</EA>
<CC>001B00A30C; 001B00G05P; 001B40B30V</CC>
<FD>Recherche information; Traitement image; Image numérique; Traitement image document; Banque image; Indexation; Dictionnaire; Reconnaissance optique caractère; Recherche documentaire; 0130C; 0705P; 4230V</FD>
<ED>Information retrieval; Image processing; Digital image; Document image processing; Image databank; Indexing; Dictionaries; Optical character recognition; Document retrieval</ED>
<SD>Imagen numérica; Banco imagen; Búsqueda documental</SD>
<LO>INIST-21760.354000174686700980</LO>
<ID>10-0393832</ID>
</server>
</inist>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/PascalFrancis/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000172 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Corpus/biblio.hfd -nk 000172 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    PascalFrancis
   |étape=   Corpus
   |type=    RBID
   |clé=     Pascal:10-0393832
   |texte=   Document Image Database Indexing with Pictorial Dictionary
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024