OcrV1, Istex, Corpus, bibRecord, 000C96

Learning Visual Shape Lexicon for Document Image Content Recognition

Identifieur interne : 000C96 ( Istex/Corpus ); précédent : 000C95; suivant : 000C97

Learning Visual Shape Lexicon for Document Image Content Recognition

Auteurs : Guangyu Zhu ; Xiaodong Yu ; Yi Li ; David Doermann

Source :

Lecture Notes in Computer Science [ 0302-9743 ] ; 2008.

RBID : ISTEX:000EA72B875137D2E35868AFB5C5FCB5D7A54937

Abstract

Abstract: Developing effective content recognition methods for diverse imagery continues to challenge computer vision researchers. We present a new approach for document image content categorization using a lexicon of shape features. Each lexical word corresponds to a scale and rotation invariant shape feature that is generic enough to be detected repeatably and segmentation free. We learn a concise, structurally indexed shape lexicon from training by clustering and partitioning feature types through graph cuts. We demonstrate our approach on two challenging document image content recognition problems: 1) The classification of 4,500 Web images crawled from Google Image Search into three content categories — pure image, image with text, and document image, and 2) Language identification of 8 languages (Arabic, Chinese, English, Hindi, Japanese, Korean, Russian, and Thai) on a 1,512 complex document image database composed of mixed machine printed text and handwriting. Our approach is capable to handle high intra-class variability and shows results that exceed other state-of-the-art approaches, allowing it to be used as a content recognizer in image indexing and retrieval systems.

Url:

https://api.istex.fr/document/000EA72B875137D2E35868AFB5C5FCB5D7A54937/fulltext/pdf

DOI: 10.1007/978-3-540-88688-4_55

Links to Exploration step

ISTEX:000EA72B875137D2E35868AFB5C5FCB5D7A54937

Le document en format XML

<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title xml:lang="en">Learning Visual Shape Lexicon for Document Image Content Recognition</title>
<author><name sortKey="Zhu, Guangyu" sort="Zhu, Guangyu" uniqKey="Zhu G" first="Guangyu" last="Zhu">Guangyu Zhu</name>
<affiliation><mods:affiliation>University of Maryland, MD 20742, College Park, USA</mods:affiliation>
</affiliation>
</author>
<author><name sortKey="Yu, Xiaodong" sort="Yu, Xiaodong" uniqKey="Yu X" first="Xiaodong" last="Yu">Xiaodong Yu</name>
<affiliation><mods:affiliation>University of Maryland, MD 20742, College Park, USA</mods:affiliation>
</affiliation>
</author>
<author><name sortKey="Li, Yi" sort="Li, Yi" uniqKey="Li Y" first="Yi" last="Li">Yi Li</name>
<affiliation><mods:affiliation>University of Maryland, MD 20742, College Park, USA</mods:affiliation>
</affiliation>
</author>
<author><name sortKey="Doermann, David" sort="Doermann, David" uniqKey="Doermann D" first="David" last="Doermann">David Doermann</name>
<affiliation><mods:affiliation>University of Maryland, MD 20742, College Park, USA</mods:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:000EA72B875137D2E35868AFB5C5FCB5D7A54937</idno>
<date when="2008" year="2008">2008</date>
<idno type="doi">10.1007/978-3-540-88688-4_55</idno>
<idno type="url">https://api.istex.fr/document/000EA72B875137D2E35868AFB5C5FCB5D7A54937/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000C96</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">Learning Visual Shape Lexicon for Document Image Content Recognition</title>
<author><name sortKey="Zhu, Guangyu" sort="Zhu, Guangyu" uniqKey="Zhu G" first="Guangyu" last="Zhu">Guangyu Zhu</name>
<affiliation><mods:affiliation>University of Maryland, MD 20742, College Park, USA</mods:affiliation>
</affiliation>
</author>
<author><name sortKey="Yu, Xiaodong" sort="Yu, Xiaodong" uniqKey="Yu X" first="Xiaodong" last="Yu">Xiaodong Yu</name>
<affiliation><mods:affiliation>University of Maryland, MD 20742, College Park, USA</mods:affiliation>
</affiliation>
</author>
<author><name sortKey="Li, Yi" sort="Li, Yi" uniqKey="Li Y" first="Yi" last="Li">Yi Li</name>
<affiliation><mods:affiliation>University of Maryland, MD 20742, College Park, USA</mods:affiliation>
</affiliation>
</author>
<author><name sortKey="Doermann, David" sort="Doermann, David" uniqKey="Doermann D" first="David" last="Doermann">David Doermann</name>
<affiliation><mods:affiliation>University of Maryland, MD 20742, College Park, USA</mods:affiliation>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="s">Lecture Notes in Computer Science</title>
<imprint><date>2008</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">000EA72B875137D2E35868AFB5C5FCB5D7A54937</idno>
<idno type="DOI">10.1007/978-3-540-88688-4_55</idno>
<idno type="ChapterID">55</idno>
<idno type="ChapterID">Chap55</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Abstract: Developing effective content recognition methods for diverse imagery continues to challenge computer vision researchers. We present a new approach for document image content categorization using a lexicon of shape features. Each lexical word corresponds to a scale and rotation invariant shape feature that is generic enough to be detected repeatably and segmentation free. We learn a concise, structurally indexed shape lexicon from training by clustering and partitioning feature types through graph cuts. We demonstrate our approach on two challenging document image content recognition problems: 1) The classification of 4,500 Web images crawled from Google Image Search into three content categories — pure image, image with text, and document image, and 2) Language identification of 8 languages (Arabic, Chinese, English, Hindi, Japanese, Korean, Russian, and Thai) on a 1,512 complex document image database composed of mixed machine printed text and handwriting. Our approach is capable to handle high intra-class variability and shows results that exceed other state-of-the-art approaches, allowing it to be used as a content recognizer in image indexing and retrieval systems.</div>
</front>
</TEI>
<istex><corpusName>springer</corpusName>
<author><json:item><name>Guangyu Zhu</name>
<affiliations><json:string>University of Maryland, MD 20742, College Park, USA</json:string>
</affiliations>
</json:item>
<json:item><name>Xiaodong Yu</name>
<affiliations><json:string>University of Maryland, MD 20742, College Park, USA</json:string>
</affiliations>
</json:item>
<json:item><name>Yi Li</name>
<affiliations><json:string>University of Maryland, MD 20742, College Park, USA</json:string>
</affiliations>
</json:item>
<json:item><name>David Doermann</name>
<affiliations><json:string>University of Maryland, MD 20742, College Park, USA</json:string>
</affiliations>
</json:item>
</author>
<language><json:string>eng</json:string>
</language>
<abstract>Abstract: Developing effective content recognition methods for diverse imagery continues to challenge computer vision researchers. We present a new approach for document image content categorization using a lexicon of shape features. Each lexical word corresponds to a scale and rotation invariant shape feature that is generic enough to be detected repeatably and segmentation free. We learn a concise, structurally indexed shape lexicon from training by clustering and partitioning feature types through graph cuts. We demonstrate our approach on two challenging document image content recognition problems: 1) The classification of 4,500 Web images crawled from Google Image Search into three content categories — pure image, image with text, and document image, and 2) Language identification of 8 languages (Arabic, Chinese, English, Hindi, Japanese, Korean, Russian, and Thai) on a 1,512 complex document image database composed of mixed machine printed text and handwriting. Our approach is capable to handle high intra-class variability and shows results that exceed other state-of-the-art approaches, allowing it to be used as a content recognizer in image indexing and retrieval systems.</abstract>
<qualityIndicators><score>6.99</score>
<pdfVersion>1.3</pdfVersion>
<pdfPageSize>430 x 660 pts</pdfPageSize>
<refBibsNative>false</refBibsNative>
<keywordCount>0</keywordCount>
<abstractCharCount>1197</abstractCharCount>
<pdfWordCount>4914</pdfWordCount>
<pdfCharCount>30335</pdfCharCount>
<pdfPageCount>14</pdfPageCount>
<abstractWordCount>173</abstractWordCount>
</qualityIndicators>
<title>Learning Visual Shape Lexicon for Document Image Content Recognition</title>
<genre.original><json:string>OriginalPaper</json:string>
</genre.original>
<chapterId><json:string>55</json:string>
<json:string>Chap55</json:string>
</chapterId>
<genre><json:string>conference [eBooks]</json:string>
</genre>
<serie><editor><json:item><name>David Hutchison</name>
</json:item>
<json:item><name>Takeo Kanade</name>
</json:item>
<json:item><name>Josef Kittler</name>
</json:item>
<json:item><name>Jon M. Kleinberg</name>
</json:item>
<json:item><name>Friedemann Mattern</name>
</json:item>
<json:item><name>John C. Mitchell</name>
</json:item>
<json:item><name>Moni Naor</name>
</json:item>
<json:item><name>Oscar Nierstrasz</name>
</json:item>
<json:item><name>C. Pandu Rangan</name>
</json:item>
<json:item><name>Bernhard Steffen</name>
</json:item>
<json:item><name>Madhu Sudan</name>
</json:item>
<json:item><name>Demetri Terzopoulos</name>
</json:item>
<json:item><name>Doug Tygar</name>
</json:item>
<json:item><name>Moshe Y. Vardi</name>
</json:item>
<json:item><name>Gerhard Weikum</name>
</json:item>
</editor>
<issn><json:string>0302-9743</json:string>
</issn>
<language><json:string>unknown</json:string>
</language>
<eissn><json:string>1611-3349</json:string>
</eissn>
<title>Lecture Notes in Computer Science</title>
<copyrightDate>2008</copyrightDate>
</serie>
<host><editor><json:item><name>David Forsyth</name>
<affiliations><json:string>Computer Science Department, University of Illinois at Urbana Champaign, 3310 Siebel Hall, IL 61801, Urbana, USA</json:string>
<json:string>E-mail: daf@cs.uiuc.edu</json:string>
</affiliations>
</json:item>
<json:item><name>Philip Torr</name>
<affiliations><json:string>Department of Computing, Oxford Brookes University, OX33 1HX, Wheatley, Oxford, UK</json:string>
<json:string>E-mail: philiptorr@brookes.ac.uk</json:string>
</affiliations>
</json:item>
<json:item><name>Andrew Zisserman</name>
<affiliations><json:string>Department of Engineering Science, University of Oxford, Parks Road, OX1 3PJ, Oxford, UK</json:string>
<json:string>E-mail: az@robots.ox.ac.uk</json:string>
</affiliations>
</json:item>
</editor>
<subject><json:item><value>Computer Science</value>
</json:item>
<json:item><value>Computer Science</value>
</json:item>
<json:item><value>Image Processing and Computer Vision</value>
</json:item>
<json:item><value>Computer Imaging, Vision, Pattern Recognition and Graphics</value>
</json:item>
<json:item><value>Computer Graphics</value>
</json:item>
<json:item><value>Pattern Recognition</value>
</json:item>
<json:item><value>Data Mining and Knowledge Discovery</value>
</json:item>
<json:item><value>Computer Appl. in Arts and Humanities</value>
</json:item>
</subject>
<isbn><json:string>978-3-540-88685-3</json:string>
</isbn>
<language><json:string>unknown</json:string>
</language>
<eissn><json:string>1611-3349</json:string>
</eissn>
<title>Computer Vision – ECCV 2008</title>
<genre.original><json:string>Proceedings</json:string>
</genre.original>
<bookId><json:string>978-3-540-88688-4</json:string>
</bookId>
<volume>5303</volume>
<pages><last>758</last>
<first>745</first>
</pages>
<issn><json:string>0302-9743</json:string>
</issn>
<genre><json:string>Book Series</json:string>
</genre>
<eisbn><json:string>978-3-540-88688-4</json:string>
</eisbn>
<copyrightDate>2008</copyrightDate>
<doi><json:string>10.1007/978-3-540-88688-4</json:string>
</doi>
</host>
<publicationDate>2008</publicationDate>
<copyrightDate>2008</copyrightDate>
<doi><json:string>10.1007/978-3-540-88688-4_55</json:string>
</doi>
<id>000EA72B875137D2E35868AFB5C5FCB5D7A54937</id>
<fulltext><json:item><original>true</original>
<mimetype>application/pdf</mimetype>
<extension>pdf</extension>
<uri>https://api.istex.fr/document/000EA72B875137D2E35868AFB5C5FCB5D7A54937/fulltext/pdf</uri>
</json:item>
<json:item><original>false</original>
<mimetype>application/zip</mimetype>
<extension>zip</extension>
<uri>https://api.istex.fr/document/000EA72B875137D2E35868AFB5C5FCB5D7A54937/fulltext/zip</uri>
</json:item>
<istex:fulltextTEI uri="https://api.istex.fr/document/000EA72B875137D2E35868AFB5C5FCB5D7A54937/fulltext/tei"><teiHeader><fileDesc><titleStmt><title level="a" type="main" xml:lang="en">Learning Visual Shape Lexicon for Document Image Content Recognition</title>
<respStmt xml:id="ISTEX-API" resp="Références bibliographiques récupérées via GROBID" name="ISTEX-API (INIST-CNRS)"></respStmt>
</titleStmt>
<publicationStmt><authority>ISTEX</authority>
<publisher>Springer Berlin Heidelberg</publisher>
<pubPlace>Berlin, Heidelberg</pubPlace>
<availability><p>SPRINGER</p>
</availability>
<date>2008</date>
</publicationStmt>
<sourceDesc><biblStruct type="inbook"><analytic><title level="a" type="main" xml:lang="en">Learning Visual Shape Lexicon for Document Image Content Recognition</title>
<author><persName><forename type="first">Guangyu</forename>
<surname>Zhu</surname>
</persName>
<affiliation>University of Maryland, MD 20742, College Park, USA</affiliation>
</author>
<author><persName><forename type="first">Xiaodong</forename>
<surname>Yu</surname>
</persName>
<affiliation>University of Maryland, MD 20742, College Park, USA</affiliation>
</author>
<author><persName><forename type="first">Yi</forename>
<surname>Li</surname>
</persName>
<affiliation>University of Maryland, MD 20742, College Park, USA</affiliation>
</author>
<author><persName><forename type="first">David</forename>
<surname>Doermann</surname>
</persName>
<affiliation>University of Maryland, MD 20742, College Park, USA</affiliation>
</author>
</analytic>
<monogr><title level="m">Computer Vision – ECCV 2008</title>
<title level="m" type="sub">10th European Conference on Computer Vision, Marseille, France, October 12-18, 2008, Proceedings, Part II</title>
<idno type="pISBN">978-3-540-88685-3</idno>
<idno type="eISBN">978-3-540-88688-4</idno>
<idno type="pISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="DOI">10.1007/978-3-540-88688-4</idno>
<idno type="BookID">978-3-540-88688-4</idno>
<idno type="BookTitleID">183879</idno>
<idno type="BookSequenceNumber">5303</idno>
<idno type="BookVolumeNumber">5303</idno>
<idno type="BookChapterCount">61</idno>
<editor><persName><forename type="first">David</forename>
<surname>Forsyth</surname>
</persName>
<email>daf@cs.uiuc.edu</email>
<affiliation>Computer Science Department, University of Illinois at Urbana Champaign, 3310 Siebel Hall, IL 61801, Urbana, USA</affiliation>
</editor>
<editor><persName><forename type="first">Philip</forename>
<surname>Torr</surname>
</persName>
<email>philiptorr@brookes.ac.uk</email>
<affiliation>Department of Computing, Oxford Brookes University, OX33 1HX, Wheatley, Oxford, UK</affiliation>
</editor>
<editor><persName><forename type="first">Andrew</forename>
<surname>Zisserman</surname>
</persName>
<email>az@robots.ox.ac.uk</email>
<affiliation>Department of Engineering Science, University of Oxford, Parks Road, OX1 3PJ, Oxford, UK</affiliation>
</editor>
<imprint><publisher>Springer Berlin Heidelberg</publisher>
<pubPlace>Berlin, Heidelberg</pubPlace>
<date type="published" when="2008"></date>
<biblScope unit="volume">5303</biblScope>
<biblScope unit="page" from="745">745</biblScope>
<biblScope unit="page" to="758">758</biblScope>
</imprint>
</monogr>
<series><title level="s">Lecture Notes in Computer Science</title>
<editor><persName><forename type="first">David</forename>
<surname>Hutchison</surname>
</persName>
</editor>
<editor><persName><forename type="first">Takeo</forename>
<surname>Kanade</surname>
</persName>
</editor>
<editor><persName><forename type="first">Josef</forename>
<surname>Kittler</surname>
</persName>
</editor>
<editor><persName><forename type="first">Jon</forename>
<forename type="first">M.</forename>
<surname>Kleinberg</surname>
</persName>
</editor>
<editor><persName><forename type="first">Friedemann</forename>
<surname>Mattern</surname>
</persName>
</editor>
<editor><persName><forename type="first">John</forename>
<forename type="first">C.</forename>
<surname>Mitchell</surname>
</persName>
</editor>
<editor><persName><forename type="first">Moni</forename>
<surname>Naor</surname>
</persName>
</editor>
<editor><persName><forename type="first">Oscar</forename>
<surname>Nierstrasz</surname>
</persName>
</editor>
<editor><persName><forename type="first">C.</forename>
<surname>Pandu Rangan</surname>
</persName>
</editor>
<editor><persName><forename type="first">Bernhard</forename>
<surname>Steffen</surname>
</persName>
</editor>
<editor><persName><forename type="first">Madhu</forename>
<surname>Sudan</surname>
</persName>
</editor>
<editor><persName><forename type="first">Demetri</forename>
<surname>Terzopoulos</surname>
</persName>
</editor>
<editor><persName><forename type="first">Doug</forename>
<surname>Tygar</surname>
</persName>
</editor>
<editor><persName><forename type="first">Moshe</forename>
<forename type="first">Y.</forename>
<surname>Vardi</surname>
</persName>
</editor>
<editor><persName><forename type="first">Gerhard</forename>
<surname>Weikum</surname>
</persName>
</editor>
<biblScope><date>2008</date>
</biblScope>
<idno type="pISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="seriesId">558</idno>
</series>
<idno type="istex">000EA72B875137D2E35868AFB5C5FCB5D7A54937</idno>
<idno type="DOI">10.1007/978-3-540-88688-4_55</idno>
<idno type="ChapterID">55</idno>
<idno type="ChapterID">Chap55</idno>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><creation><date>2008</date>
</creation>
<langUsage><language ident="en">en</language>
</langUsage>
<abstract xml:lang="en"><p>Abstract: Developing effective content recognition methods for diverse imagery continues to challenge computer vision researchers. We present a new approach for document image content categorization using a lexicon of shape features. Each lexical word corresponds to a scale and rotation invariant shape feature that is generic enough to be detected repeatably and segmentation free. We learn a concise, structurally indexed shape lexicon from training by clustering and partitioning feature types through graph cuts. We demonstrate our approach on two challenging document image content recognition problems: 1) The classification of 4,500 Web images crawled from Google Image Search into three content categories — pure image, image with text, and document image, and 2) Language identification of 8 languages (Arabic, Chinese, English, Hindi, Japanese, Korean, Russian, and Thai) on a 1,512 complex document image database composed of mixed machine printed text and handwriting. Our approach is capable to handle high intra-class variability and shows results that exceed other state-of-the-art approaches, allowing it to be used as a content recognizer in image indexing and retrieval systems.</p>
</abstract>
<textClass><keywords scheme="Book Subject Collection"><list><label>SUCO11645</label>
<item><term>Computer Science</term>
</item>
</list>
</keywords>
</textClass>
<textClass><keywords scheme="Book Subject Group"><list><label>I</label>
<label>I22021</label>
<label>I22005</label>
<label>I22013</label>
<label>I2203X</label>
<label>I18030</label>
<label>I23036</label>
<item><term>Computer Science</term>
</item>
<item><term>Image Processing and Computer Vision</term>
</item>
<item><term>Computer Imaging, Vision, Pattern Recognition and Graphics</term>
</item>
<item><term>Computer Graphics</term>
</item>
<item><term>Pattern Recognition</term>
</item>
<item><term>Data Mining and Knowledge Discovery</term>
</item>
<item><term>Computer Appl. in Arts and Humanities</term>
</item>
</list>
</keywords>
</textClass>
</profileDesc>
<revisionDesc><change when="2008">Published</change>
<change xml:id="refBibs-istex" who="#ISTEX-API" when="2016-3-20">References added</change>
</revisionDesc>
</teiHeader>
</istex:fulltextTEI>
<json:item><original>false</original>
<mimetype>text/plain</mimetype>
<extension>txt</extension>
<uri>https://api.istex.fr/document/000EA72B875137D2E35868AFB5C5FCB5D7A54937/fulltext/txt</uri>
</json:item>
</fulltext>
<metadata><istex:metadataXml wicri:clean="Springer, Publisher found" wicri:toSee="no header"><istex:xmlDeclaration>version="1.0" encoding="UTF-8"</istex:xmlDeclaration>
<istex:docType PUBLIC="-//Springer-Verlag//DTD A++ V2.4//EN" URI="http://devel.springer.de/A++/V2.4/DTD/A++V2.4.dtd" name="istex:docType"></istex:docType>
<istex:document><Publisher><PublisherInfo><PublisherName>Springer Berlin Heidelberg</PublisherName>
<PublisherLocation>Berlin, Heidelberg</PublisherLocation>
</PublisherInfo>
<Series><SeriesInfo SeriesType="Series" TocLevels="0"><SeriesID>558</SeriesID>
<SeriesPrintISSN>0302-9743</SeriesPrintISSN>
<SeriesElectronicISSN>1611-3349</SeriesElectronicISSN>
<SeriesTitle Language="En">Lecture Notes in Computer Science</SeriesTitle>
</SeriesInfo>
<SeriesHeader><EditorGroup><Editor><EditorName DisplayOrder="Western"><GivenName>David</GivenName>
<FamilyName>Hutchison</FamilyName>
</EditorName>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>Takeo</GivenName>
<FamilyName>Kanade</FamilyName>
</EditorName>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>Josef</GivenName>
<FamilyName>Kittler</FamilyName>
</EditorName>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>Jon</GivenName>
<GivenName>M.</GivenName>
<FamilyName>Kleinberg</FamilyName>
</EditorName>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>Friedemann</GivenName>
<FamilyName>Mattern</FamilyName>
</EditorName>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>John</GivenName>
<GivenName>C.</GivenName>
<FamilyName>Mitchell</FamilyName>
</EditorName>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>Moni</GivenName>
<FamilyName>Naor</FamilyName>
</EditorName>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>Oscar</GivenName>
<FamilyName>Nierstrasz</FamilyName>
</EditorName>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>C.</GivenName>
<FamilyName>Pandu Rangan</FamilyName>
</EditorName>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>Bernhard</GivenName>
<FamilyName>Steffen</FamilyName>
</EditorName>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>Madhu</GivenName>
<FamilyName>Sudan</FamilyName>
</EditorName>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>Demetri</GivenName>
<FamilyName>Terzopoulos</FamilyName>
</EditorName>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>Doug</GivenName>
<FamilyName>Tygar</FamilyName>
</EditorName>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>Moshe</GivenName>
<GivenName>Y.</GivenName>
<FamilyName>Vardi</FamilyName>
</EditorName>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>Gerhard</GivenName>
<FamilyName>Weikum</FamilyName>
</EditorName>
</Editor>
</EditorGroup>
</SeriesHeader>
<Book Language="En"><BookInfo BookProductType="Proceedings" ContainsESM="No" Language="En" MediaType="eBook" NumberingStyle="Unnumbered" OutputMedium="All" TocLevels="0"><BookID>978-3-540-88688-4</BookID>
<BookTitle>Computer Vision – ECCV 2008</BookTitle>
<BookSubTitle>10th European Conference on Computer Vision, Marseille, France, October 12-18, 2008, Proceedings, Part II</BookSubTitle>
<BookVolumeNumber>5303</BookVolumeNumber>
<BookSequenceNumber>5303</BookSequenceNumber>
<BookDOI>10.1007/978-3-540-88688-4</BookDOI>
<BookTitleID>183879</BookTitleID>
<BookPrintISBN>978-3-540-88685-3</BookPrintISBN>
<BookElectronicISBN>978-3-540-88688-4</BookElectronicISBN>
<BookChapterCount>61</BookChapterCount>
<BookCopyright><CopyrightHolderName>Springer Berlin Heidelberg</CopyrightHolderName>
<CopyrightYear>2008</CopyrightYear>
</BookCopyright>
<BookSubjectGroup><BookSubject Code="I" Type="Primary">Computer Science</BookSubject>
<BookSubject Code="I22021" Priority="1" Type="Secondary">Image Processing and Computer Vision</BookSubject>
<BookSubject Code="I22005" Priority="2" Type="Secondary">Computer Imaging, Vision, Pattern Recognition and Graphics</BookSubject>
<BookSubject Code="I22013" Priority="3" Type="Secondary">Computer Graphics</BookSubject>
<BookSubject Code="I2203X" Priority="4" Type="Secondary">Pattern Recognition</BookSubject>
<BookSubject Code="I18030" Priority="5" Type="Secondary">Data Mining and Knowledge Discovery</BookSubject>
<BookSubject Code="I23036" Priority="6" Type="Secondary">Computer Appl. in Arts and Humanities</BookSubject>
<SubjectCollection Code="SUCO11645">Computer Science</SubjectCollection>
</BookSubjectGroup>
</BookInfo>
<BookHeader><EditorGroup><Editor AffiliationIDS="Aff1"><EditorName DisplayOrder="Western"><GivenName>David</GivenName>
<FamilyName>Forsyth</FamilyName>
</EditorName>
<Contact><Email>daf@cs.uiuc.edu</Email>
</Contact>
</Editor>
<Editor AffiliationIDS="Aff2"><EditorName DisplayOrder="Western"><GivenName>Philip</GivenName>
<FamilyName>Torr</FamilyName>
</EditorName>
<Contact><Email>philiptorr@brookes.ac.uk</Email>
</Contact>
</Editor>
<Editor AffiliationIDS="Aff3"><EditorName DisplayOrder="Western"><GivenName>Andrew</GivenName>
<FamilyName>Zisserman</FamilyName>
</EditorName>
<Contact><Email>az@robots.ox.ac.uk</Email>
</Contact>
</Editor>
<Affiliation ID="Aff1"><OrgDivision>Computer Science Department</OrgDivision>
<OrgName>University of Illinois at Urbana Champaign</OrgName>
<OrgAddress><Street>3310 Siebel Hall</Street>
<Postcode>IL 61801</Postcode>
<City>Urbana</City>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff2"><OrgDivision>Department of Computing</OrgDivision>
<OrgName>Oxford Brookes University</OrgName>
<OrgAddress><Postcode>OX33 1HX</Postcode>
<City>Wheatley, Oxford</City>
<Country>UK</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff3"><OrgDivision>Department of Engineering Science</OrgDivision>
<OrgName>University of Oxford</OrgName>
<OrgAddress><Street>Parks Road</Street>
<Postcode>OX1 3PJ</Postcode>
<City>Oxford</City>
<Country>UK</Country>
</OrgAddress>
</Affiliation>
</EditorGroup>
</BookHeader>
<Part ID="Part3"><PartInfo TocLevels="0"><PartID>3</PartID>
<PartSequenceNumber>3</PartSequenceNumber>
<PartTitle>Poster Session II</PartTitle>
<PartChapterCount>50</PartChapterCount>
<PartContext><SeriesID>558</SeriesID>
<BookTitle>Computer Vision – ECCV 2008</BookTitle>
</PartContext>
</PartInfo>
<Chapter ID="Chap55" Language="En"><ChapterInfo ChapterType="OriginalPaper" ContainsESM="No" NumberingStyle="Unnumbered" TocLevels="0"><ChapterID>55</ChapterID>
<ChapterDOI>10.1007/978-3-540-88688-4_55</ChapterDOI>
<ChapterSequenceNumber>55</ChapterSequenceNumber>
<ChapterTitle Language="En">Learning Visual Shape Lexicon for Document Image Content Recognition</ChapterTitle>
<ChapterFirstPage>745</ChapterFirstPage>
<ChapterLastPage>758</ChapterLastPage>
<ChapterCopyright><CopyrightHolderName>Springer-Verlag Berlin Heidelberg</CopyrightHolderName>
<CopyrightYear>2008</CopyrightYear>
</ChapterCopyright>
<ChapterGrants Type="Regular"><MetadataGrant Grant="OpenAccess"></MetadataGrant>
<AbstractGrant Grant="OpenAccess"></AbstractGrant>
<BodyPDFGrant Grant="Restricted"></BodyPDFGrant>
<BodyHTMLGrant Grant="Restricted"></BodyHTMLGrant>
<BibliographyGrant Grant="Restricted"></BibliographyGrant>
<ESMGrant Grant="Restricted"></ESMGrant>
</ChapterGrants>
<ChapterContext><SeriesID>558</SeriesID>
<PartID>3</PartID>
<BookID>978-3-540-88688-4</BookID>
<BookTitle>Computer Vision – ECCV 2008</BookTitle>
</ChapterContext>
</ChapterInfo>
<ChapterHeader><AuthorGroup><Author AffiliationIDS="Aff4"><AuthorName DisplayOrder="Western"><GivenName>Guangyu</GivenName>
<FamilyName>Zhu</FamilyName>
</AuthorName>
</Author>
<Author AffiliationIDS="Aff4"><AuthorName DisplayOrder="Western"><GivenName>Xiaodong</GivenName>
<FamilyName>Yu</FamilyName>
</AuthorName>
</Author>
<Author AffiliationIDS="Aff4"><AuthorName DisplayOrder="Western"><GivenName>Yi</GivenName>
<FamilyName>Li</FamilyName>
</AuthorName>
</Author>
<Author AffiliationIDS="Aff4"><AuthorName DisplayOrder="Western"><GivenName>David</GivenName>
<FamilyName>Doermann</FamilyName>
</AuthorName>
</Author>
<Affiliation ID="Aff4"><OrgName>University of Maryland</OrgName>
<OrgAddress><City>College Park</City>
<Postcode>MD 20742</Postcode>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
</AuthorGroup>
<Abstract ID="Abs1" Language="En"><Heading>Abstract</Heading>
<Para>Developing effective content recognition methods for diverse imagery continues to challenge computer vision researchers. We present a new approach for document image content categorization using a lexicon of shape features. Each lexical word corresponds to a scale and rotation invariant shape feature that is generic enough to be detected repeatably and segmentation free. We learn a concise, structurally indexed shape lexicon from training by clustering and partitioning feature types through graph cuts. We demonstrate our approach on two challenging document image content recognition problems: 1) The classification of 4,500 Web images crawled from Google Image Search into three content categories — pure image, image with text, and document image, and 2) Language identification of 8 languages (Arabic, Chinese, English, Hindi, Japanese, Korean, Russian, and Thai) on a 1,512 complex document image database composed of mixed machine printed text and handwriting. Our approach is capable to handle high intra-class variability and shows results that exceed other state-of-the-art approaches, allowing it to be used as a content recognizer in image indexing and retrieval systems.</Para>
</Abstract>
</ChapterHeader>
<NoBody></NoBody>
</Chapter>
</Part>
</Book>
</Series>
</Publisher>
</istex:document>
</istex:metadataXml>
<mods version="3.6"><titleInfo lang="en"><title>Learning Visual Shape Lexicon for Document Image Content Recognition</title>
</titleInfo>
<titleInfo type="alternative" contentType="CDATA" lang="en"><title>Learning Visual Shape Lexicon for Document Image Content Recognition</title>
</titleInfo>
<name type="personal"><namePart type="given">Guangyu</namePart>
<namePart type="family">Zhu</namePart>
<affiliation>University of Maryland, MD 20742, College Park, USA</affiliation>
<role><roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Xiaodong</namePart>
<namePart type="family">Yu</namePart>
<affiliation>University of Maryland, MD 20742, College Park, USA</affiliation>
<role><roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Yi</namePart>
<namePart type="family">Li</namePart>
<affiliation>University of Maryland, MD 20742, College Park, USA</affiliation>
<role><roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">David</namePart>
<namePart type="family">Doermann</namePart>
<affiliation>University of Maryland, MD 20742, College Park, USA</affiliation>
<role><roleTerm type="text">author</roleTerm>
</role>
</name>
<typeOfResource>text</typeOfResource>
<genre type="conference [eBooks]" displayLabel="OriginalPaper"></genre>
<originInfo><publisher>Springer Berlin Heidelberg</publisher>
<place><placeTerm type="text">Berlin, Heidelberg</placeTerm>
</place>
<dateIssued encoding="w3cdtf">2008</dateIssued>
<copyrightDate encoding="w3cdtf">2008</copyrightDate>
</originInfo>
<language><languageTerm type="code" authority="rfc3066">en</languageTerm>
<languageTerm type="code" authority="iso639-2b">eng</languageTerm>
</language>
<physicalDescription><internetMediaType>text/html</internetMediaType>
</physicalDescription>
<abstract lang="en">Abstract: Developing effective content recognition methods for diverse imagery continues to challenge computer vision researchers. We present a new approach for document image content categorization using a lexicon of shape features. Each lexical word corresponds to a scale and rotation invariant shape feature that is generic enough to be detected repeatably and segmentation free. We learn a concise, structurally indexed shape lexicon from training by clustering and partitioning feature types through graph cuts. We demonstrate our approach on two challenging document image content recognition problems: 1) The classification of 4,500 Web images crawled from Google Image Search into three content categories — pure image, image with text, and document image, and 2) Language identification of 8 languages (Arabic, Chinese, English, Hindi, Japanese, Korean, Russian, and Thai) on a 1,512 complex document image database composed of mixed machine printed text and handwriting. Our approach is capable to handle high intra-class variability and shows results that exceed other state-of-the-art approaches, allowing it to be used as a content recognizer in image indexing and retrieval systems.</abstract>
<relatedItem type="host"><titleInfo><title>Computer Vision – ECCV 2008</title>
<subTitle>10th European Conference on Computer Vision, Marseille, France, October 12-18, 2008, Proceedings, Part II</subTitle>
</titleInfo>
<name type="personal"><namePart type="given">David</namePart>
<namePart type="family">Forsyth</namePart>
<affiliation>Computer Science Department, University of Illinois at Urbana Champaign, 3310 Siebel Hall, IL 61801, Urbana, USA</affiliation>
<affiliation>E-mail: daf@cs.uiuc.edu</affiliation>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Philip</namePart>
<namePart type="family">Torr</namePart>
<affiliation>Department of Computing, Oxford Brookes University, OX33 1HX, Wheatley, Oxford, UK</affiliation>
<affiliation>E-mail: philiptorr@brookes.ac.uk</affiliation>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Andrew</namePart>
<namePart type="family">Zisserman</namePart>
<affiliation>Department of Engineering Science, University of Oxford, Parks Road, OX1 3PJ, Oxford, UK</affiliation>
<affiliation>E-mail: az@robots.ox.ac.uk</affiliation>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<genre type="Book Series" displayLabel="Proceedings"></genre>
<originInfo><copyrightDate encoding="w3cdtf">2008</copyrightDate>
<issuance>monographic</issuance>
</originInfo>
<subject><genre>Book Subject Collection</genre>
<topic authority="SpringerSubjectCodes" authorityURI="SUCO11645">Computer Science</topic>
</subject>
<subject><genre>Book Subject Group</genre>
<topic authority="SpringerSubjectCodes" authorityURI="I">Computer Science</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I22021">Image Processing and Computer Vision</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I22005">Computer Imaging, Vision, Pattern Recognition and Graphics</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I22013">Computer Graphics</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I2203X">Pattern Recognition</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I18030">Data Mining and Knowledge Discovery</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I23036">Computer Appl. in Arts and Humanities</topic>
</subject>
<identifier type="DOI">10.1007/978-3-540-88688-4</identifier>
<identifier type="ISBN">978-3-540-88685-3</identifier>
<identifier type="eISBN">978-3-540-88688-4</identifier>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="eISSN">1611-3349</identifier>
<identifier type="BookTitleID">183879</identifier>
<identifier type="BookID">978-3-540-88688-4</identifier>
<identifier type="BookChapterCount">61</identifier>
<identifier type="BookVolumeNumber">5303</identifier>
<identifier type="BookSequenceNumber">5303</identifier>
<identifier type="PartChapterCount">50</identifier>
<part><date>2008</date>
<detail type="part"><title>Poster Session II</title>
</detail>
<detail type="volume"><number>5303</number>
<caption>vol.</caption>
</detail>
<extent unit="pages"><start>745</start>
<end>758</end>
</extent>
</part>
<recordInfo><recordOrigin>Springer Berlin Heidelberg, 2008</recordOrigin>
</recordInfo>
</relatedItem>
<relatedItem type="series"><titleInfo><title>Lecture Notes in Computer Science</title>
</titleInfo>
<name type="personal"><namePart type="given">David</namePart>
<namePart type="family">Hutchison</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Takeo</namePart>
<namePart type="family">Kanade</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Josef</namePart>
<namePart type="family">Kittler</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Jon</namePart>
<namePart type="given">M.</namePart>
<namePart type="family">Kleinberg</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Friedemann</namePart>
<namePart type="family">Mattern</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">John</namePart>
<namePart type="given">C.</namePart>
<namePart type="family">Mitchell</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Moni</namePart>
<namePart type="family">Naor</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Oscar</namePart>
<namePart type="family">Nierstrasz</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">C.</namePart>
<namePart type="family">Pandu Rangan</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Bernhard</namePart>
<namePart type="family">Steffen</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Madhu</namePart>
<namePart type="family">Sudan</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Demetri</namePart>
<namePart type="family">Terzopoulos</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Doug</namePart>
<namePart type="family">Tygar</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Moshe</namePart>
<namePart type="given">Y.</namePart>
<namePart type="family">Vardi</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Gerhard</namePart>
<namePart type="family">Weikum</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<originInfo><copyrightDate encoding="w3cdtf">2008</copyrightDate>
<issuance>serial</issuance>
</originInfo>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="eISSN">1611-3349</identifier>
<identifier type="SeriesID">558</identifier>
<recordInfo><recordOrigin>Springer Berlin Heidelberg, 2008</recordOrigin>
</recordInfo>
</relatedItem>
<identifier type="istex">000EA72B875137D2E35868AFB5C5FCB5D7A54937</identifier>
<identifier type="DOI">10.1007/978-3-540-88688-4_55</identifier>
<identifier type="ChapterID">55</identifier>
<identifier type="ChapterID">Chap55</identifier>
<accessCondition type="use and reproduction" contentType="copyright">Springer Berlin Heidelberg, 2008</accessCondition>
<recordInfo><recordContentSource>SPRINGER</recordContentSource>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2008</recordOrigin>
</recordInfo>
</mods>
</metadata>
<enrichments><istex:refBibTEI uri="https://api.istex.fr/document/000EA72B875137D2E35868AFB5C5FCB5D7A54937/enrichments/refBib"><teiHeader></teiHeader>
<text><front></front>
<body></body>
<back><listBibl><biblStruct xml:id="b0"><analytic><title level="a" type="main">A computational model for visual selection</title>
<author><persName><forename type="first">Y</forename>
<surname>Amit</surname>
</persName>
</author>
<author><persName><forename type="first">D</forename>
<surname>Geman</surname>
</persName>
</author>
</analytic>
<monogr><title level="j">Neural Computation</title>
<imprint><biblScope unit="volume">11</biblScope>
<biblScope unit="page" from="1691" to="1715"></biblScope>
<date type="published" when="1999"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b1"><analytic><title level="a" type="main">Shape matching and object recognition using shape contexts</title>
<author><persName><forename type="first">S</forename>
<surname>Belongie</surname>
</persName>
</author>
<author><persName><forename type="first">J</forename>
<surname>Malik</surname>
</persName>
</author>
<author><persName><forename type="first">J</forename>
<surname>Puzicha</surname>
</persName>
</author>
</analytic>
<monogr><title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint><biblScope unit="volume">24</biblScope>
<biblScope unit="issue">4</biblScope>
<biblScope unit="page" from="509" to="522"></biblScope>
<date type="published" when="2002"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b2"><analytic><title level="a" type="main">Efficient indexing for articulation invariant shape matching and retrieval</title>
<author><persName><forename type="first">S</forename>
<surname>Biswas</surname>
</persName>
</author>
<author><persName><forename type="first">G</forename>
<surname>Aggarwal</surname>
</persName>
</author>
<author><persName><forename type="first">R</forename>
<surname>Chellappa</surname>
</persName>
</author>
</analytic>
<monogr><title level="m">Proc. CVPR</title>
<meeting>. CVPR</meeting>
<imprint><date type="published" when="2007"></date>
<biblScope unit="page" from="1" to="8"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b3"><analytic><title level="a" type="main">Texture for script identification</title>
<author><persName><forename type="first">A</forename>
<surname>Busch</surname>
</persName>
</author>
<author><persName><forename type="first">W</forename>
<surname>Boles</surname>
</persName>
</author>
<author><persName><forename type="first">S</forename>
<surname>Sridharan</surname>
</persName>
</author>
</analytic>
<monogr><title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint><biblScope unit="volume">27</biblScope>
<biblScope unit="issue">11</biblScope>
<biblScope unit="page" from="1720" to="1732"></biblScope>
<date type="published" when="2005"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b4"><analytic><title level="a" type="main">A computational approach to edge detection</title>
<author><persName><forename type="first">J</forename>
<surname>Canny</surname>
</persName>
</author>
</analytic>
<monogr><title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint><biblScope unit="volume">8</biblScope>
<biblScope unit="issue">6</biblScope>
<biblScope unit="page" from="679" to="697"></biblScope>
<date type="published" when="1986"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b5"><analytic><title level="a" type="main">Histograms of oriented gradients for human detection</title>
<author><persName><forename type="first">N</forename>
<surname>Dalal</surname>
</persName>
</author>
<author><persName><forename type="first">B</forename>
<surname>Triggs</surname>
</persName>
</author>
</analytic>
<monogr><title level="m">Proc. CVPR</title>
<meeting>. CVPR</meeting>
<imprint><date type="published" when="2005"></date>
<biblScope unit="page" from="886" to="893"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b6"><analytic><title level="a" type="main">Classification of oriental and European scripts by using characteristic features</title>
<author><persName><forename type="first">J</forename>
<surname>Ding</surname>
</persName>
</author>
<author><persName><forename type="first">L</forename>
<surname>Lam</surname>
</persName>
</author>
<author><persName><forename type="first">C</forename>
<surname>Suen</surname>
</persName>
</author>
</analytic>
<monogr><title level="m">Proc. ICDAR</title>
<meeting>. ICDAR</meeting>
<imprint><date type="published" when="1997"></date>
<biblScope unit="page" from="1023" to="1027"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b7"><analytic><title level="a" type="main">Groups of adjacent contour segments for object detection</title>
<author><persName><forename type="first">V</forename>
<surname>Ferrari</surname>
</persName>
</author>
<author><persName><forename type="first">L</forename>
<surname>Fevrier</surname>
</persName>
</author>
<author><persName><forename type="first">F</forename>
<surname>Jurie</surname>
</persName>
</author>
<author><persName><forename type="first">C</forename>
<surname>Schmid</surname>
</persName>
</author>
</analytic>
<monogr><title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint><biblScope unit="volume">30</biblScope>
<biblScope unit="issue">1</biblScope>
<biblScope unit="page" from="1" to="16"></biblScope>
<date type="published" when="2008"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b8"><analytic><title level="a" type="main">Towards scalable representations of object categories: Learning a hierarchy of parts</title>
<author><persName><forename type="first">S</forename>
<surname>Fidler</surname>
</persName>
</author>
<author><persName><forename type="first">A</forename>
<surname>Leonardis</surname>
</persName>
</author>
</analytic>
<monogr><title level="m">Proc. CVPR</title>
<meeting>. CVPR</meeting>
<imprint><date type="published" when="2007"></date>
<biblScope unit="page" from="1" to="8"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b9"><analytic><title level="a" type="main">Flexible syntactic matching of curves and its application to automatic hierarchical classification of silhouettes</title>
<author><persName><forename type="first">Y</forename>
<surname>Gdalyahu</surname>
</persName>
</author>
<author><persName><forename type="first">D</forename>
<surname>Weinshall</surname>
</persName>
</author>
</analytic>
<monogr><title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint><biblScope unit="volume">21</biblScope>
<biblScope unit="issue">12</biblScope>
<biblScope unit="page" from="1312" to="1328"></biblScope>
<date type="published" when="1999"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b10"><analytic><title level="a" type="main">Automatic script identification from document images using cluster-based templates</title>
<author><persName><forename type="first">J</forename>
<surname>Hochberg</surname>
</persName>
</author>
<author><persName><forename type="first">P</forename>
<surname>Kelly</surname>
</persName>
</author>
<author><persName><forename type="first">T</forename>
<surname>Thomas</surname>
</persName>
</author>
<author><persName><forename type="first">L</forename>
<surname>Kerns</surname>
</persName>
</author>
</analytic>
<monogr><title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint><biblScope unit="volume">19</biblScope>
<biblScope unit="issue">2</biblScope>
<biblScope unit="page" from="176" to="181"></biblScope>
<date type="published" when="1997"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b11"><analytic><title level="a" type="main">Robust and efficient detection of salient convex groups</title>
<author><persName><forename type="first">D</forename>
<surname>Jacobs</surname>
</persName>
</author>
</analytic>
<monogr><title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint><biblScope unit="volume">18</biblScope>
<biblScope unit="issue">1</biblScope>
<biblScope unit="page" from="23" to="37"></biblScope>
<date type="published" when="1996"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b12"><analytic><title level="a" type="main">Shape descriptors for non-rigid shapes with a single closed contour</title>
<author><persName><forename type="first">L</forename>
<surname>Latecki</surname>
</persName>
</author>
<author><persName><forename type="first">R</forename>
<surname>Lakamper</surname>
</persName>
</author>
<author><persName><forename type="first">U</forename>
<surname>Eckhardt</surname>
</persName>
</author>
</analytic>
<monogr><title level="m">Proc. CVPR</title>
<meeting>. CVPR</meeting>
<imprint><date type="published" when="2000"></date>
<biblScope unit="page" from="424" to="429"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b13"><analytic><title level="a" type="main">Language Identification in Complex, Unoriented, and Degraded Document Images</title>
<author><persName><forename type="first">D</forename>
<surname>Lee</surname>
</persName>
</author>
<author><persName><forename type="first">C</forename>
<surname>Nohl</surname>
</persName>
</author>
<author><persName><forename type="first">H</forename>
<surname>Baird</surname>
</persName>
</author>
</analytic>
<monogr><title level="j">Document Analysis Systems II</title>
<imprint><date type="published" when="1998"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b14"><analytic><title level="a" type="main">Script-independent text line segmentation in freestyle handwritten documents</title>
<author><persName><forename type="first">Y</forename>
<surname>Li</surname>
</persName>
</author>
<author><persName><forename type="first">Y</forename>
<surname>Zheng</surname>
</persName>
</author>
<author><persName><forename type="first">D</forename>
<surname>Doermann</surname>
</persName>
</author>
<author><persName><forename type="first">S</forename>
<surname>Jaeger</surname>
</persName>
</author>
</analytic>
<monogr><title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint><biblScope unit="volume">30</biblScope>
<biblScope unit="issue">8</biblScope>
<biblScope unit="page" from="1313" to="1329"></biblScope>
<date type="published" when="2008"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b15"><analytic><title level="a" type="main">Shape classification using the inner-distance</title>
<author><persName><forename type="first">H</forename>
<surname>Ling</surname>
</persName>
</author>
<author><persName><forename type="first">D</forename>
<surname>Jacobs</surname>
</persName>
</author>
</analytic>
<monogr><title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint><biblScope unit="volume">29</biblScope>
<biblScope unit="issue">2</biblScope>
<biblScope unit="page" from="286" to="299"></biblScope>
<date type="published" when="2007"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b16"><analytic><title level="a" type="main">Three-dimensional object recognition from single two-dimensional images</title>
<author><persName><forename type="first">D</forename>
<surname>Lowe</surname>
</persName>
</author>
</analytic>
<monogr><title level="j">Artificial Intelligence</title>
<imprint><biblScope unit="volume">31</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="355" to="395"></biblScope>
<date type="published" when="1987"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b17"><analytic><title level="a" type="main">Script and language identification in noisy and degraded document images</title>
<author><persName><forename type="first">S</forename>
<surname>Lu</surname>
</persName>
</author>
<author><persName><forename type="first">C</forename>
<surname>Tan</surname>
</persName>
</author>
</analytic>
<monogr><title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint><biblScope unit="volume">30</biblScope>
<biblScope unit="issue">2</biblScope>
<biblScope unit="page" from="14" to="24"></biblScope>
<date type="published" when="2008"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b18"><analytic><title level="a" type="main">The IAM-database: An English sentence database for offline handwriting recognition</title>
<author><persName><forename type="first">U</forename>
<surname>Marti</surname>
</persName>
</author>
<author><persName><forename type="first">H</forename>
<surname>Bunke</surname>
</persName>
</author>
</analytic>
<monogr><title level="j">Int. J. Document Analysis and Recognition</title>
<imprint><biblScope unit="volume">5</biblScope>
<biblScope unit="page" from="39" to="46"></biblScope>
<date type="published" when="2006"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b19"><analytic><title level="a" type="main">Multiresolution gray-scale and rotation invariant texture classification with local binary patterns</title>
<author><persName><forename type="first">T</forename>
<surname>Ojala</surname>
</persName>
</author>
<author><persName><forename type="first">M</forename>
<surname>Pietikainen</surname>
</persName>
</author>
<author><persName><forename type="first">T</forename>
<surname>Maenpaa</surname>
</persName>
</author>
</analytic>
<monogr><title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint><biblScope unit="volume">24</biblScope>
<biblScope unit="issue">7</biblScope>
<biblScope unit="page" from="971" to="987"></biblScope>
<date type="published" when="2002"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b20"><analytic><title level="a" type="main">Modeling the shape of the scene: A holistic representation of the spatial envelope</title>
<author><persName><forename type="first">A</forename>
<surname>Oliva</surname>
</persName>
</author>
<author><persName><forename type="first">A</forename>
<surname>Torralba</surname>
</persName>
</author>
</analytic>
<monogr><title level="j">Int. J. Computer Vision</title>
<imprint><biblScope unit="volume">42</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="145" to="175"></biblScope>
<date type="published" when="2001"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b21"><analytic><title level="a" type="main">On-line and off-line handwriting recognition: A comprehensive survey</title>
<author><persName><forename type="first">R</forename>
<surname>Plamondon</surname>
</persName>
</author>
<author><persName><forename type="first">S</forename>
<surname>Srihari</surname>
</persName>
</author>
</analytic>
<monogr><title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint><biblScope unit="volume">22</biblScope>
<biblScope unit="issue">1</biblScope>
<biblScope unit="page" from="63" to="84"></biblScope>
<date type="published" when="2000"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b22"><monogr><title level="m" type="main">Optical Character Recognition: An Illustrated Guide to the Frontier</title>
<author><persName><forename type="first">S</forename>
<surname>Rice</surname>
</persName>
</author>
<author><persName><forename type="first">G</forename>
<surname>Nagy</surname>
</persName>
</author>
<author><persName><forename type="first">T</forename>
<surname>Nartker</surname>
</persName>
</author>
<imprint><date type="published" when="1999"></date>
<publisher>Kluwer Academic Publishers</publisher>
<pubPlace>Dordrecht</pubPlace>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b23"><analytic><title level="a" type="main">Planar object recognition using projective shape representation</title>
<author><persName><forename type="first">C</forename>
<surname>Rothwell</surname>
</persName>
</author>
<author><persName><forename type="first">A</forename>
<surname>Zisserman</surname>
</persName>
</author>
<author><persName><forename type="first">D</forename>
<surname>Forsyth</surname>
</persName>
</author>
<author><persName><forename type="first">J</forename>
<surname>Mundy</surname>
</persName>
</author>
</analytic>
<monogr><title level="j">Int. J. Computer Vision</title>
<imprint><biblScope unit="volume">16</biblScope>
<biblScope unit="issue">5</biblScope>
<biblScope unit="page" from="57" to="99"></biblScope>
<date type="published" when="1995"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b24"><analytic><title level="a" type="main">Symmetry-based indexing of image database</title>
<author><persName><forename type="first">D</forename>
<surname>Sharvit</surname>
</persName>
</author>
<author><persName><forename type="first">J</forename>
<surname>Chan</surname>
</persName>
</author>
<author><persName><forename type="first">H</forename>
<surname>Tek</surname>
</persName>
</author>
<author><persName><forename type="first">B</forename>
<surname>Kimia</surname>
</persName>
</author>
</analytic>
<monogr><title level="j">J. Visual Commun. and Image Representation</title>
<imprint><biblScope unit="volume">9</biblScope>
<biblScope unit="issue">4</biblScope>
<biblScope unit="page" from="366" to="380"></biblScope>
<date type="published" when="1998"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b25"><analytic><title level="a" type="main">Normalized cuts and image segmentation</title>
<author><persName><forename type="first">J</forename>
<surname>Shi</surname>
</persName>
</author>
<author><persName><forename type="first">J</forename>
<surname>Malik</surname>
</persName>
</author>
</analytic>
<monogr><title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint><biblScope unit="volume">22</biblScope>
<biblScope unit="issue">8</biblScope>
<biblScope unit="page" from="888" to="905"></biblScope>
<date type="published" when="2000"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b26"><analytic><title level="a" type="main">Determination of script and language content of document images</title>
<author><persName><forename type="first">A</forename>
<surname>Spitz</surname>
</persName>
</author>
</analytic>
<monogr><title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint><biblScope unit="volume">19</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="235" to="245"></biblScope>
<date type="published" when="1997"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b27"><analytic><title level="a" type="main">Categorizing document images into script and language classes</title>
<author><persName><forename type="first">C</forename>
<surname>Suen</surname>
</persName>
</author>
<author><persName><forename type="first">S</forename>
<surname>Bergler</surname>
</persName>
</author>
<author><persName><forename type="first">N</forename>
<surname>Nobile</surname>
</persName>
</author>
<author><persName><forename type="first">B</forename>
<surname>Waked</surname>
</persName>
</author>
<author><persName><forename type="first">C</forename>
<surname>Nadal</surname>
</persName>
</author>
<author><persName><forename type="first">A</forename>
<surname>Bloch</surname>
</persName>
</author>
</analytic>
<monogr><title level="m">Proc. ICDAR</title>
<meeting>. ICDAR</meeting>
<imprint><date type="published" when="1998"></date>
<biblScope unit="page" from="297" to="306"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b28"><analytic><title level="a" type="main">Rotation invariant texture features and their use in automatic script identification</title>
<author><persName><forename type="first">T</forename>
<surname>Tan</surname>
</persName>
</author>
</analytic>
<monogr><title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint><biblScope unit="volume">20</biblScope>
<biblScope unit="issue">7</biblScope>
<biblScope unit="page" from="751" to="756"></biblScope>
<date type="published" when="1998"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b29"><analytic><title level="a" type="main">Google Book Search: Document understanding on a massive scale</title>
<author><persName><forename type="first">L</forename>
<surname>Vincent</surname>
</persName>
</author>
</analytic>
<monogr><title level="m">Proc. ICDAR</title>
<meeting>. ICDAR</meeting>
<imprint><date type="published" when="2007"></date>
<biblScope unit="page" from="819" to="823"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b30"><analytic><title level="a" type="main">Multiclass spectral clustering</title>
<author><persName><forename type="first">S</forename>
<surname>Yu</surname>
</persName>
</author>
<author><persName><forename type="first">J</forename>
<surname>Shi</surname>
</persName>
</author>
</analytic>
<monogr><title level="m">Proc. ICCV</title>
<meeting>. ICCV</meeting>
<imprint><date type="published" when="2003"></date>
<biblScope unit="page" from="11" to="17"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b31"><analytic><title level="a" type="main">Extracting relevant named entities for automated expense reimbursement</title>
<author><persName><forename type="first">G</forename>
<surname>Zhu</surname>
</persName>
</author>
<author><persName><forename type="first">T</forename>
<forename type="middle">J</forename>
<surname>Bethea</surname>
</persName>
</author>
<author><persName><forename type="first">V</forename>
<surname>Krishna</surname>
</persName>
</author>
</analytic>
<monogr><title level="m">Proc. ACM SIGKDD Int. Conf. Knowledge Discovery and Data Mining</title>
<meeting>. ACM SIGKDD Int. Conf. Knowledge Discovery and Data Mining</meeting>
<imprint><date type="published" when="2007"></date>
<biblScope unit="page" from="1004" to="1012"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b32"><analytic><title level="a" type="main">Unconstrained language identification using a shape codebook</title>
<author><persName><forename type="first">G</forename>
<surname>Zhu</surname>
</persName>
</author>
<author><persName><forename type="first">X</forename>
<surname>Yu</surname>
</persName>
</author>
<author><persName><forename type="first">Y</forename>
<surname>Li</surname>
</persName>
</author>
<author><persName><forename type="first">D</forename>
<surname>Doermann</surname>
</persName>
</author>
</analytic>
<monogr><title level="m">Proc. ICFHR</title>
<meeting>. ICFHR</meeting>
<imprint><date type="published" when="2008"></date>
<biblScope unit="page" from="13" to="18"></biblScope>
</imprint>
</monogr>
</biblStruct>
</listBibl>
</back>
</text>
</istex:refBibTEI>
</enrichments>
</istex>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Istex/Corpus

HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000C96 | SxmlIndent | more

HfdSelect -h $EXPLOR_AREA/Data/Istex/Corpus/biblio.hfd -nk 000C96 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Istex
   |étape=   Corpus
   |type=    RBID
   |clé=     ISTEX:000EA72B875137D2E35868AFB5C5FCB5D7A54937
   |texte=   Learning Visual Shape Lexicon for Document Image Content Recognition
}}

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024

	Serveur d'exploration sur l'OCR
	Attention, ce site est en cours de développement ! Attention, site généré par des moyens informatiques à partir de corpus bruts. Les informations ne sont donc pas validées.

Serveur d'exploration sur l'OCR

Learning Visual Shape Lexicon for Document Image Content Recognition

Learning Visual Shape Lexicon for Document Image Content Recognition

Source :

Abstract

Links to Exploration step

Le document en format XML

Pour manipuler ce document sous Unix (Dilib)

Pour mettre un lien sur cette page dans le réseau Wicri