Document Image Database Indexing with Pictorial Dictionary
Identifieur interne :
000172 ( PascalFrancis/Corpus );
précédent :
000171;
suivant :
000173
Document Image Database Indexing with Pictorial Dictionary
Auteurs : Mohammad Akbari ;
Reza AzmiSource :
-
Proceedings of SPIE, the International Society for Optical Engineering [ 0277-786X ] ; 2010.
RBID : Pascal:10-0393832
Descripteurs français
- Pascal (Inist)
- Recherche information,
Traitement image,
Image numérique,
Traitement image document,
Banque image,
Indexation,
Dictionnaire,
Reconnaissance optique caractère,
Recherche documentaire,
0130C,
0705P,
4230V.
English descriptors
Abstract
In this paper we introduce a new approach for information retrieval from Persian document image database without using Optical Character Recognition (OCR).At first an attribute called subword upper contour label is defined then, a pictorial dictionary is constructed based on this attribute for the subwords. By this approach we address two issues in document image retrieval: keyword spotting and retrieval according to the document similarities. The proposed methods have been evaluated on a Persian document image database. The results have proved the ability of this approach in document image information retrieval.
Notice en format standard (ISO 2709)
Pour connaître la documentation sur le format Inist Standard.
pA |
A01 | 01 | 1 | | @0 0277-786X |
---|
A02 | 01 | | | @0 PSISDG |
---|
A03 | | 1 | | @0 Proc. SPIE Int. Soc. Opt. Eng. |
---|
A05 | | | | @2 7546 |
---|
A06 | | | | @3 p. 2 |
---|
A08 | 01 | 1 | ENG | @1 Document Image Database Indexing with Pictorial Dictionary |
---|
A09 | 01 | 1 | ENG | @1 Second International Conference on Digital Image Processing : 26-28 February 2010, Singapore |
---|
A11 | 01 | 1 | | @1 AKBARI (Mohammad) |
---|
A11 | 02 | 1 | | @1 AZMI (Reza) |
---|
A12 | 01 | 1 | | @1 KAMARUZAMAN JUSOFF @9 ed. |
---|
A12 | 02 | 1 | | @1 XIE (Yi) @9 ed. |
---|
A14 | 01 | | | @1 Engineering Department,I.A.U., Shahr-e-Qods Branch @2 Tehran @3 IRN @Z 1 aut. |
---|
A14 | 02 | | | @1 Engineering Department, Alzahra University @2 Tehran @3 IRN @Z 2 aut. |
---|
A18 | 01 | 1 | | @1 SPIE @3 USA @9 org-cong. |
---|
A18 | 02 | 1 | | @1 International Association of Computer Science and Information Technology @3 USA @9 org-cong. |
---|
A20 | | | | @2 75462R.1-75462R.6 |
---|
A21 | | | | @1 2010 |
---|
A23 | 01 | | | @0 ENG |
---|
A25 | 01 | | | @1 SPIE @2 Bellingham, Wash. |
---|
A26 | 01 | | | @0 978-0-8194-7942-6 |
---|
A26 | 02 | | | @0 0-8194-7942-X |
---|
A43 | 01 | | | @1 INIST @2 21760 @5 354000174686700980 |
---|
A44 | | | | @0 0000 @1 © 2010 INIST-CNRS. All rights reserved. |
---|
A45 | | | | @0 20 ref. |
---|
A47 | 01 | 1 | | @0 10-0393832 |
---|
A60 | | | | @1 P @2 C |
---|
A61 | | | | @0 A |
---|
A64 | 01 | 1 | | @0 Proceedings of SPIE, the International Society for Optical Engineering |
---|
A66 | 01 | | | @0 USA |
---|
C01 | 01 | | ENG | @0 In this paper we introduce a new approach for information retrieval from Persian document image database without using Optical Character Recognition (OCR).At first an attribute called subword upper contour label is defined then, a pictorial dictionary is constructed based on this attribute for the subwords. By this approach we address two issues in document image retrieval: keyword spotting and retrieval according to the document similarities. The proposed methods have been evaluated on a Persian document image database. The results have proved the ability of this approach in document image information retrieval. |
---|
C02 | 01 | 3 | | @0 001B00A30C |
---|
C02 | 02 | 3 | | @0 001B00G05P |
---|
C02 | 03 | 3 | | @0 001B40B30V |
---|
C03 | 01 | 3 | FRE | @0 Recherche information @5 61 |
---|
C03 | 01 | 3 | ENG | @0 Information retrieval @5 61 |
---|
C03 | 02 | 3 | FRE | @0 Traitement image @5 62 |
---|
C03 | 02 | 3 | ENG | @0 Image processing @5 62 |
---|
C03 | 03 | X | FRE | @0 Image numérique @5 63 |
---|
C03 | 03 | X | ENG | @0 Digital image @5 63 |
---|
C03 | 03 | X | SPA | @0 Imagen numérica @5 63 |
---|
C03 | 04 | 3 | FRE | @0 Traitement image document @5 64 |
---|
C03 | 04 | 3 | ENG | @0 Document image processing @5 64 |
---|
C03 | 05 | X | FRE | @0 Banque image @5 65 |
---|
C03 | 05 | X | ENG | @0 Image databank @5 65 |
---|
C03 | 05 | X | SPA | @0 Banco imagen @5 65 |
---|
C03 | 06 | 3 | FRE | @0 Indexation @5 66 |
---|
C03 | 06 | 3 | ENG | @0 Indexing @5 66 |
---|
C03 | 07 | 3 | FRE | @0 Dictionnaire @5 67 |
---|
C03 | 07 | 3 | ENG | @0 Dictionaries @5 67 |
---|
C03 | 08 | 3 | FRE | @0 Reconnaissance optique caractère @5 68 |
---|
C03 | 08 | 3 | ENG | @0 Optical character recognition @5 68 |
---|
C03 | 09 | X | FRE | @0 Recherche documentaire @5 69 |
---|
C03 | 09 | X | ENG | @0 Document retrieval @5 69 |
---|
C03 | 09 | X | SPA | @0 Búsqueda documental @5 69 |
---|
C03 | 10 | 3 | FRE | @0 0130C @4 INC @5 83 |
---|
C03 | 11 | 3 | FRE | @0 0705P @4 INC @5 84 |
---|
C03 | 12 | 3 | FRE | @0 4230V @4 INC @5 91 |
---|
N21 | | | | @1 256 |
---|
N44 | 01 | | | @1 OTO |
---|
N82 | | | | @1 OTO |
---|
|
pR |
A30 | 01 | 1 | ENG | @1 International Conference on Digital Image Processing @2 02 @3 Singapore SGP @4 2010 |
---|
|
Format Inist (serveur)
NO : | PASCAL 10-0393832 INIST |
ET : | Document Image Database Indexing with Pictorial Dictionary |
AU : | AKBARI (Mohammad); AZMI (Reza); KAMARUZAMAN JUSOFF; XIE (Yi) |
AF : | Engineering Department,I.A.U., Shahr-e-Qods Branch/Tehran/Iran (1 aut.); Engineering Department, Alzahra University/Tehran/Iran (2 aut.) |
DT : | Publication en série; Congrès; Niveau analytique |
SO : | Proceedings of SPIE, the International Society for Optical Engineering; ISSN 0277-786X; Coden PSISDG; Etats-Unis; Da. 2010; Vol. 7546; No. p. 2; 75462R.1-75462R.6; Bibl. 20 ref. |
LA : | Anglais |
EA : | In this paper we introduce a new approach for information retrieval from Persian document image database without using Optical Character Recognition (OCR).At first an attribute called subword upper contour label is defined then, a pictorial dictionary is constructed based on this attribute for the subwords. By this approach we address two issues in document image retrieval: keyword spotting and retrieval according to the document similarities. The proposed methods have been evaluated on a Persian document image database. The results have proved the ability of this approach in document image information retrieval. |
CC : | 001B00A30C; 001B00G05P; 001B40B30V |
FD : | Recherche information; Traitement image; Image numérique; Traitement image document; Banque image; Indexation; Dictionnaire; Reconnaissance optique caractère; Recherche documentaire; 0130C; 0705P; 4230V |
ED : | Information retrieval; Image processing; Digital image; Document image processing; Image databank; Indexing; Dictionaries; Optical character recognition; Document retrieval |
SD : | Imagen numérica; Banco imagen; Búsqueda documental |
LO : | INIST-21760.354000174686700980 |
ID : | 10-0393832 |
Links to Exploration step
Pascal:10-0393832
Le document en format XML
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" level="a">Document Image Database Indexing with Pictorial Dictionary</title>
<author><name sortKey="Akbari, Mohammad" sort="Akbari, Mohammad" uniqKey="Akbari M" first="Mohammad" last="Akbari">Mohammad Akbari</name>
<affiliation><inist:fA14 i1="01"><s1>Engineering Department,I.A.U., Shahr-e-Qods Branch</s1>
<s2>Tehran</s2>
<s3>IRN</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Azmi, Reza" sort="Azmi, Reza" uniqKey="Azmi R" first="Reza" last="Azmi">Reza Azmi</name>
<affiliation><inist:fA14 i1="02"><s1>Engineering Department, Alzahra University</s1>
<s2>Tehran</s2>
<s3>IRN</s3>
<sZ>2 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">INIST</idno>
<idno type="inist">10-0393832</idno>
<date when="2010">2010</date>
<idno type="stanalyst">PASCAL 10-0393832 INIST</idno>
<idno type="RBID">Pascal:10-0393832</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000172</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a">Document Image Database Indexing with Pictorial Dictionary</title>
<author><name sortKey="Akbari, Mohammad" sort="Akbari, Mohammad" uniqKey="Akbari M" first="Mohammad" last="Akbari">Mohammad Akbari</name>
<affiliation><inist:fA14 i1="01"><s1>Engineering Department,I.A.U., Shahr-e-Qods Branch</s1>
<s2>Tehran</s2>
<s3>IRN</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Azmi, Reza" sort="Azmi, Reza" uniqKey="Azmi R" first="Reza" last="Azmi">Reza Azmi</name>
<affiliation><inist:fA14 i1="02"><s1>Engineering Department, Alzahra University</s1>
<s2>Tehran</s2>
<s3>IRN</s3>
<sZ>2 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
</analytic>
<series><title level="j" type="main">Proceedings of SPIE, the International Society for Optical Engineering</title>
<title level="j" type="abbreviated">Proc. SPIE Int. Soc. Opt. Eng.</title>
<idno type="ISSN">0277-786X</idno>
<imprint><date when="2010">2010</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><title level="j" type="main">Proceedings of SPIE, the International Society for Optical Engineering</title>
<title level="j" type="abbreviated">Proc. SPIE Int. Soc. Opt. Eng.</title>
<idno type="ISSN">0277-786X</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Dictionaries</term>
<term>Digital image</term>
<term>Document image processing</term>
<term>Document retrieval</term>
<term>Image databank</term>
<term>Image processing</term>
<term>Indexing</term>
<term>Information retrieval</term>
<term>Optical character recognition</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Recherche information</term>
<term>Traitement image</term>
<term>Image numérique</term>
<term>Traitement image document</term>
<term>Banque image</term>
<term>Indexation</term>
<term>Dictionnaire</term>
<term>Reconnaissance optique caractère</term>
<term>Recherche documentaire</term>
<term>0130C</term>
<term>0705P</term>
<term>4230V</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">In this paper we introduce a new approach for information retrieval from Persian document image database without using Optical Character Recognition (OCR).At first an attribute called subword upper contour label is defined then, a pictorial dictionary is constructed based on this attribute for the subwords. By this approach we address two issues in document image retrieval: keyword spotting and retrieval according to the document similarities. The proposed methods have been evaluated on a Persian document image database. The results have proved the ability of this approach in document image information retrieval.</div>
</front>
</TEI>
<inist><standard h6="B"><pA><fA01 i1="01" i2="1"><s0>0277-786X</s0>
</fA01>
<fA02 i1="01"><s0>PSISDG</s0>
</fA02>
<fA03 i2="1"><s0>Proc. SPIE Int. Soc. Opt. Eng.</s0>
</fA03>
<fA05><s2>7546</s2>
</fA05>
<fA06><s3>p. 2</s3>
</fA06>
<fA08 i1="01" i2="1" l="ENG"><s1>Document Image Database Indexing with Pictorial Dictionary</s1>
</fA08>
<fA09 i1="01" i2="1" l="ENG"><s1>Second International Conference on Digital Image Processing : 26-28 February 2010, Singapore</s1>
</fA09>
<fA11 i1="01" i2="1"><s1>AKBARI (Mohammad)</s1>
</fA11>
<fA11 i1="02" i2="1"><s1>AZMI (Reza)</s1>
</fA11>
<fA12 i1="01" i2="1"><s1>KAMARUZAMAN JUSOFF</s1>
<s9>ed.</s9>
</fA12>
<fA12 i1="02" i2="1"><s1>XIE (Yi)</s1>
<s9>ed.</s9>
</fA12>
<fA14 i1="01"><s1>Engineering Department,I.A.U., Shahr-e-Qods Branch</s1>
<s2>Tehran</s2>
<s3>IRN</s3>
<sZ>1 aut.</sZ>
</fA14>
<fA14 i1="02"><s1>Engineering Department, Alzahra University</s1>
<s2>Tehran</s2>
<s3>IRN</s3>
<sZ>2 aut.</sZ>
</fA14>
<fA18 i1="01" i2="1"><s1>SPIE</s1>
<s3>USA</s3>
<s9>org-cong.</s9>
</fA18>
<fA18 i1="02" i2="1"><s1>International Association of Computer Science and Information Technology</s1>
<s3>USA</s3>
<s9>org-cong.</s9>
</fA18>
<fA20><s2>75462R.1-75462R.6</s2>
</fA20>
<fA21><s1>2010</s1>
</fA21>
<fA23 i1="01"><s0>ENG</s0>
</fA23>
<fA25 i1="01"><s1>SPIE</s1>
<s2>Bellingham, Wash.</s2>
</fA25>
<fA26 i1="01"><s0>978-0-8194-7942-6</s0>
</fA26>
<fA26 i1="02"><s0>0-8194-7942-X</s0>
</fA26>
<fA43 i1="01"><s1>INIST</s1>
<s2>21760</s2>
<s5>354000174686700980</s5>
</fA43>
<fA44><s0>0000</s0>
<s1>© 2010 INIST-CNRS. All rights reserved.</s1>
</fA44>
<fA45><s0>20 ref.</s0>
</fA45>
<fA47 i1="01" i2="1"><s0>10-0393832</s0>
</fA47>
<fA60><s1>P</s1>
<s2>C</s2>
</fA60>
<fA64 i1="01" i2="1"><s0>Proceedings of SPIE, the International Society for Optical Engineering</s0>
</fA64>
<fA66 i1="01"><s0>USA</s0>
</fA66>
<fC01 i1="01" l="ENG"><s0>In this paper we introduce a new approach for information retrieval from Persian document image database without using Optical Character Recognition (OCR).At first an attribute called subword upper contour label is defined then, a pictorial dictionary is constructed based on this attribute for the subwords. By this approach we address two issues in document image retrieval: keyword spotting and retrieval according to the document similarities. The proposed methods have been evaluated on a Persian document image database. The results have proved the ability of this approach in document image information retrieval.</s0>
</fC01>
<fC02 i1="01" i2="3"><s0>001B00A30C</s0>
</fC02>
<fC02 i1="02" i2="3"><s0>001B00G05P</s0>
</fC02>
<fC02 i1="03" i2="3"><s0>001B40B30V</s0>
</fC02>
<fC03 i1="01" i2="3" l="FRE"><s0>Recherche information</s0>
<s5>61</s5>
</fC03>
<fC03 i1="01" i2="3" l="ENG"><s0>Information retrieval</s0>
<s5>61</s5>
</fC03>
<fC03 i1="02" i2="3" l="FRE"><s0>Traitement image</s0>
<s5>62</s5>
</fC03>
<fC03 i1="02" i2="3" l="ENG"><s0>Image processing</s0>
<s5>62</s5>
</fC03>
<fC03 i1="03" i2="X" l="FRE"><s0>Image numérique</s0>
<s5>63</s5>
</fC03>
<fC03 i1="03" i2="X" l="ENG"><s0>Digital image</s0>
<s5>63</s5>
</fC03>
<fC03 i1="03" i2="X" l="SPA"><s0>Imagen numérica</s0>
<s5>63</s5>
</fC03>
<fC03 i1="04" i2="3" l="FRE"><s0>Traitement image document</s0>
<s5>64</s5>
</fC03>
<fC03 i1="04" i2="3" l="ENG"><s0>Document image processing</s0>
<s5>64</s5>
</fC03>
<fC03 i1="05" i2="X" l="FRE"><s0>Banque image</s0>
<s5>65</s5>
</fC03>
<fC03 i1="05" i2="X" l="ENG"><s0>Image databank</s0>
<s5>65</s5>
</fC03>
<fC03 i1="05" i2="X" l="SPA"><s0>Banco imagen</s0>
<s5>65</s5>
</fC03>
<fC03 i1="06" i2="3" l="FRE"><s0>Indexation</s0>
<s5>66</s5>
</fC03>
<fC03 i1="06" i2="3" l="ENG"><s0>Indexing</s0>
<s5>66</s5>
</fC03>
<fC03 i1="07" i2="3" l="FRE"><s0>Dictionnaire</s0>
<s5>67</s5>
</fC03>
<fC03 i1="07" i2="3" l="ENG"><s0>Dictionaries</s0>
<s5>67</s5>
</fC03>
<fC03 i1="08" i2="3" l="FRE"><s0>Reconnaissance optique caractère</s0>
<s5>68</s5>
</fC03>
<fC03 i1="08" i2="3" l="ENG"><s0>Optical character recognition</s0>
<s5>68</s5>
</fC03>
<fC03 i1="09" i2="X" l="FRE"><s0>Recherche documentaire</s0>
<s5>69</s5>
</fC03>
<fC03 i1="09" i2="X" l="ENG"><s0>Document retrieval</s0>
<s5>69</s5>
</fC03>
<fC03 i1="09" i2="X" l="SPA"><s0>Búsqueda documental</s0>
<s5>69</s5>
</fC03>
<fC03 i1="10" i2="3" l="FRE"><s0>0130C</s0>
<s4>INC</s4>
<s5>83</s5>
</fC03>
<fC03 i1="11" i2="3" l="FRE"><s0>0705P</s0>
<s4>INC</s4>
<s5>84</s5>
</fC03>
<fC03 i1="12" i2="3" l="FRE"><s0>4230V</s0>
<s4>INC</s4>
<s5>91</s5>
</fC03>
<fN21><s1>256</s1>
</fN21>
<fN44 i1="01"><s1>OTO</s1>
</fN44>
<fN82><s1>OTO</s1>
</fN82>
</pA>
<pR><fA30 i1="01" i2="1" l="ENG"><s1>International Conference on Digital Image Processing</s1>
<s2>02</s2>
<s3>Singapore SGP</s3>
<s4>2010</s4>
</fA30>
</pR>
</standard>
<server><NO>PASCAL 10-0393832 INIST</NO>
<ET>Document Image Database Indexing with Pictorial Dictionary</ET>
<AU>AKBARI (Mohammad); AZMI (Reza); KAMARUZAMAN JUSOFF; XIE (Yi)</AU>
<AF>Engineering Department,I.A.U., Shahr-e-Qods Branch/Tehran/Iran (1 aut.); Engineering Department, Alzahra University/Tehran/Iran (2 aut.)</AF>
<DT>Publication en série; Congrès; Niveau analytique</DT>
<SO>Proceedings of SPIE, the International Society for Optical Engineering; ISSN 0277-786X; Coden PSISDG; Etats-Unis; Da. 2010; Vol. 7546; No. p. 2; 75462R.1-75462R.6; Bibl. 20 ref.</SO>
<LA>Anglais</LA>
<EA>In this paper we introduce a new approach for information retrieval from Persian document image database without using Optical Character Recognition (OCR).At first an attribute called subword upper contour label is defined then, a pictorial dictionary is constructed based on this attribute for the subwords. By this approach we address two issues in document image retrieval: keyword spotting and retrieval according to the document similarities. The proposed methods have been evaluated on a Persian document image database. The results have proved the ability of this approach in document image information retrieval.</EA>
<CC>001B00A30C; 001B00G05P; 001B40B30V</CC>
<FD>Recherche information; Traitement image; Image numérique; Traitement image document; Banque image; Indexation; Dictionnaire; Reconnaissance optique caractère; Recherche documentaire; 0130C; 0705P; 4230V</FD>
<ED>Information retrieval; Image processing; Digital image; Document image processing; Image databank; Indexing; Dictionaries; Optical character recognition; Document retrieval</ED>
<SD>Imagen numérica; Banco imagen; Búsqueda documental</SD>
<LO>INIST-21760.354000174686700980</LO>
<ID>10-0393832</ID>
</server>
</inist>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/PascalFrancis/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000172 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Corpus/biblio.hfd -nk 000172 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien
|wiki= Ticri/CIDE
|area= OcrV1
|flux= PascalFrancis
|étape= Corpus
|type= RBID
|clé= Pascal:10-0393832
|texte= Document Image Database Indexing with Pictorial Dictionary
}}
| This area was generated with Dilib version V0.6.32. Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024 | |