Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Learning Visual Shape Lexicon for Document Image Content Recognition

Identifieur interne : 000C96 ( Istex/Corpus ); précédent : 000C95; suivant : 000C97

Learning Visual Shape Lexicon for Document Image Content Recognition

Auteurs : Guangyu Zhu ; Xiaodong Yu ; Yi Li ; David Doermann

Source :

RBID : ISTEX:000EA72B875137D2E35868AFB5C5FCB5D7A54937

Abstract

Abstract: Developing effective content recognition methods for diverse imagery continues to challenge computer vision researchers. We present a new approach for document image content categorization using a lexicon of shape features. Each lexical word corresponds to a scale and rotation invariant shape feature that is generic enough to be detected repeatably and segmentation free. We learn a concise, structurally indexed shape lexicon from training by clustering and partitioning feature types through graph cuts. We demonstrate our approach on two challenging document image content recognition problems: 1) The classification of 4,500 Web images crawled from Google Image Search into three content categories — pure image, image with text, and document image, and 2) Language identification of 8 languages (Arabic, Chinese, English, Hindi, Japanese, Korean, Russian, and Thai) on a 1,512 complex document image database composed of mixed machine printed text and handwriting. Our approach is capable to handle high intra-class variability and shows results that exceed other state-of-the-art approaches, allowing it to be used as a content recognizer in image indexing and retrieval systems.

Url:
DOI: 10.1007/978-3-540-88688-4_55

Links to Exploration step

ISTEX:000EA72B875137D2E35868AFB5C5FCB5D7A54937

Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Learning Visual Shape Lexicon for Document Image Content Recognition</title>
<author>
<name sortKey="Zhu, Guangyu" sort="Zhu, Guangyu" uniqKey="Zhu G" first="Guangyu" last="Zhu">Guangyu Zhu</name>
<affiliation>
<mods:affiliation>University of Maryland, MD 20742, College Park, USA</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Yu, Xiaodong" sort="Yu, Xiaodong" uniqKey="Yu X" first="Xiaodong" last="Yu">Xiaodong Yu</name>
<affiliation>
<mods:affiliation>University of Maryland, MD 20742, College Park, USA</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Li, Yi" sort="Li, Yi" uniqKey="Li Y" first="Yi" last="Li">Yi Li</name>
<affiliation>
<mods:affiliation>University of Maryland, MD 20742, College Park, USA</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Doermann, David" sort="Doermann, David" uniqKey="Doermann D" first="David" last="Doermann">David Doermann</name>
<affiliation>
<mods:affiliation>University of Maryland, MD 20742, College Park, USA</mods:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:000EA72B875137D2E35868AFB5C5FCB5D7A54937</idno>
<date when="2008" year="2008">2008</date>
<idno type="doi">10.1007/978-3-540-88688-4_55</idno>
<idno type="url">https://api.istex.fr/document/000EA72B875137D2E35868AFB5C5FCB5D7A54937/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000C96</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Learning Visual Shape Lexicon for Document Image Content Recognition</title>
<author>
<name sortKey="Zhu, Guangyu" sort="Zhu, Guangyu" uniqKey="Zhu G" first="Guangyu" last="Zhu">Guangyu Zhu</name>
<affiliation>
<mods:affiliation>University of Maryland, MD 20742, College Park, USA</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Yu, Xiaodong" sort="Yu, Xiaodong" uniqKey="Yu X" first="Xiaodong" last="Yu">Xiaodong Yu</name>
<affiliation>
<mods:affiliation>University of Maryland, MD 20742, College Park, USA</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Li, Yi" sort="Li, Yi" uniqKey="Li Y" first="Yi" last="Li">Yi Li</name>
<affiliation>
<mods:affiliation>University of Maryland, MD 20742, College Park, USA</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Doermann, David" sort="Doermann, David" uniqKey="Doermann D" first="David" last="Doermann">David Doermann</name>
<affiliation>
<mods:affiliation>University of Maryland, MD 20742, College Park, USA</mods:affiliation>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2008</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">000EA72B875137D2E35868AFB5C5FCB5D7A54937</idno>
<idno type="DOI">10.1007/978-3-540-88688-4_55</idno>
<idno type="ChapterID">55</idno>
<idno type="ChapterID">Chap55</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: Developing effective content recognition methods for diverse imagery continues to challenge computer vision researchers. We present a new approach for document image content categorization using a lexicon of shape features. Each lexical word corresponds to a scale and rotation invariant shape feature that is generic enough to be detected repeatably and segmentation free. We learn a concise, structurally indexed shape lexicon from training by clustering and partitioning feature types through graph cuts. We demonstrate our approach on two challenging document image content recognition problems: 1) The classification of 4,500 Web images crawled from Google Image Search into three content categories — pure image, image with text, and document image, and 2) Language identification of 8 languages (Arabic, Chinese, English, Hindi, Japanese, Korean, Russian, and Thai) on a 1,512 complex document image database composed of mixed machine printed text and handwriting. Our approach is capable to handle high intra-class variability and shows results that exceed other state-of-the-art approaches, allowing it to be used as a content recognizer in image indexing and retrieval systems.</div>
</front>
</TEI>
<istex>
<corpusName>springer</corpusName>
<author>
<json:item>
<name>Guangyu Zhu</name>
<affiliations>
<json:string>University of Maryland, MD 20742, College Park, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Xiaodong Yu</name>
<affiliations>
<json:string>University of Maryland, MD 20742, College Park, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Yi Li</name>
<affiliations>
<json:string>University of Maryland, MD 20742, College Park, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>David Doermann</name>
<affiliations>
<json:string>University of Maryland, MD 20742, College Park, USA</json:string>
</affiliations>
</json:item>
</author>
<language>
<json:string>eng</json:string>
</language>
<abstract>Abstract: Developing effective content recognition methods for diverse imagery continues to challenge computer vision researchers. We present a new approach for document image content categorization using a lexicon of shape features. Each lexical word corresponds to a scale and rotation invariant shape feature that is generic enough to be detected repeatably and segmentation free. We learn a concise, structurally indexed shape lexicon from training by clustering and partitioning feature types through graph cuts. We demonstrate our approach on two challenging document image content recognition problems: 1) The classification of 4,500 Web images crawled from Google Image Search into three content categories — pure image, image with text, and document image, and 2) Language identification of 8 languages (Arabic, Chinese, English, Hindi, Japanese, Korean, Russian, and Thai) on a 1,512 complex document image database composed of mixed machine printed text and handwriting. Our approach is capable to handle high intra-class variability and shows results that exceed other state-of-the-art approaches, allowing it to be used as a content recognizer in image indexing and retrieval systems.</abstract>
<qualityIndicators>
<score>6.99</score>
<pdfVersion>1.3</pdfVersion>
<pdfPageSize>430 x 660 pts</pdfPageSize>
<refBibsNative>false</refBibsNative>
<keywordCount>0</keywordCount>
<abstractCharCount>1197</abstractCharCount>
<pdfWordCount>4914</pdfWordCount>
<pdfCharCount>30335</pdfCharCount>
<pdfPageCount>14</pdfPageCount>
<abstractWordCount>173</abstractWordCount>
</qualityIndicators>
<title>Learning Visual Shape Lexicon for Document Image Content Recognition</title>
<genre.original>
<json:string>OriginalPaper</json:string>
</genre.original>
<chapterId>
<json:string>55</json:string>
<json:string>Chap55</json:string>
</chapterId>
<genre>
<json:string>conference [eBooks]</json:string>
</genre>
<serie>
<editor>
<json:item>
<name>David Hutchison</name>
</json:item>
<json:item>
<name>Takeo Kanade</name>
</json:item>
<json:item>
<name>Josef Kittler</name>
</json:item>
<json:item>
<name>Jon M. Kleinberg</name>
</json:item>
<json:item>
<name>Friedemann Mattern</name>
</json:item>
<json:item>
<name>John C. Mitchell</name>
</json:item>
<json:item>
<name>Moni Naor</name>
</json:item>
<json:item>
<name>Oscar Nierstrasz</name>
</json:item>
<json:item>
<name>C. Pandu Rangan</name>
</json:item>
<json:item>
<name>Bernhard Steffen</name>
</json:item>
<json:item>
<name>Madhu Sudan</name>
</json:item>
<json:item>
<name>Demetri Terzopoulos</name>
</json:item>
<json:item>
<name>Doug Tygar</name>
</json:item>
<json:item>
<name>Moshe Y. Vardi</name>
</json:item>
<json:item>
<name>Gerhard Weikum</name>
</json:item>
</editor>
<issn>
<json:string>0302-9743</json:string>
</issn>
<language>
<json:string>unknown</json:string>
</language>
<eissn>
<json:string>1611-3349</json:string>
</eissn>
<title>Lecture Notes in Computer Science</title>
<copyrightDate>2008</copyrightDate>
</serie>
<host>
<editor>
<json:item>
<name>David Forsyth</name>
<affiliations>
<json:string>Computer Science Department, University of Illinois at Urbana Champaign, 3310 Siebel Hall, IL 61801, Urbana, USA</json:string>
<json:string>E-mail: daf@cs.uiuc.edu</json:string>
</affiliations>
</json:item>
<json:item>
<name>Philip Torr</name>
<affiliations>
<json:string>Department of Computing, Oxford Brookes University, OX33 1HX, Wheatley, Oxford, UK</json:string>
<json:string>E-mail: philiptorr@brookes.ac.uk</json:string>
</affiliations>
</json:item>
<json:item>
<name>Andrew Zisserman</name>
<affiliations>
<json:string>Department of Engineering Science, University of Oxford, Parks Road, OX1 3PJ, Oxford, UK</json:string>
<json:string>E-mail: az@robots.ox.ac.uk</json:string>
</affiliations>
</json:item>
</editor>
<subject>
<json:item>
<value>Computer Science</value>
</json:item>
<json:item>
<value>Computer Science</value>
</json:item>
<json:item>
<value>Image Processing and Computer Vision</value>
</json:item>
<json:item>
<value>Computer Imaging, Vision, Pattern Recognition and Graphics</value>
</json:item>
<json:item>
<value>Computer Graphics</value>
</json:item>
<json:item>
<value>Pattern Recognition</value>
</json:item>
<json:item>
<value>Data Mining and Knowledge Discovery</value>
</json:item>
<json:item>
<value>Computer Appl. in Arts and Humanities</value>
</json:item>
</subject>
<isbn>
<json:string>978-3-540-88685-3</json:string>
</isbn>
<language>
<json:string>unknown</json:string>
</language>
<eissn>
<json:string>1611-3349</json:string>
</eissn>
<title>Computer Vision – ECCV 2008</title>
<genre.original>
<json:string>Proceedings</json:string>
</genre.original>
<bookId>
<json:string>978-3-540-88688-4</json:string>
</bookId>
<volume>5303</volume>
<pages>
<last>758</last>
<first>745</first>
</pages>
<issn>
<json:string>0302-9743</json:string>
</issn>
<genre>
<json:string>Book Series</json:string>
</genre>
<eisbn>
<json:string>978-3-540-88688-4</json:string>
</eisbn>
<copyrightDate>2008</copyrightDate>
<doi>
<json:string>10.1007/978-3-540-88688-4</json:string>
</doi>
</host>
<publicationDate>2008</publicationDate>
<copyrightDate>2008</copyrightDate>
<doi>
<json:string>10.1007/978-3-540-88688-4_55</json:string>
</doi>
<id>000EA72B875137D2E35868AFB5C5FCB5D7A54937</id>
<fulltext>
<json:item>
<original>true</original>
<mimetype>application/pdf</mimetype>
<extension>pdf</extension>
<uri>https://api.istex.fr/document/000EA72B875137D2E35868AFB5C5FCB5D7A54937/fulltext/pdf</uri>
</json:item>
<json:item>
<original>false</original>
<mimetype>application/zip</mimetype>
<extension>zip</extension>
<uri>https://api.istex.fr/document/000EA72B875137D2E35868AFB5C5FCB5D7A54937/fulltext/zip</uri>
</json:item>
<istex:fulltextTEI uri="https://api.istex.fr/document/000EA72B875137D2E35868AFB5C5FCB5D7A54937/fulltext/tei">
<teiHeader>
<fileDesc>
<titleStmt>
<title level="a" type="main" xml:lang="en">Learning Visual Shape Lexicon for Document Image Content Recognition</title>
<respStmt xml:id="ISTEX-API" resp="Références bibliographiques récupérées via GROBID" name="ISTEX-API (INIST-CNRS)"></respStmt>
</titleStmt>
<publicationStmt>
<authority>ISTEX</authority>
<publisher>Springer Berlin Heidelberg</publisher>
<pubPlace>Berlin, Heidelberg</pubPlace>
<availability>
<p>SPRINGER</p>
</availability>
<date>2008</date>
</publicationStmt>
<sourceDesc>
<biblStruct type="inbook">
<analytic>
<title level="a" type="main" xml:lang="en">Learning Visual Shape Lexicon for Document Image Content Recognition</title>
<author>
<persName>
<forename type="first">Guangyu</forename>
<surname>Zhu</surname>
</persName>
<affiliation>University of Maryland, MD 20742, College Park, USA</affiliation>
</author>
<author>
<persName>
<forename type="first">Xiaodong</forename>
<surname>Yu</surname>
</persName>
<affiliation>University of Maryland, MD 20742, College Park, USA</affiliation>
</author>
<author>
<persName>
<forename type="first">Yi</forename>
<surname>Li</surname>
</persName>
<affiliation>University of Maryland, MD 20742, College Park, USA</affiliation>
</author>
<author>
<persName>
<forename type="first">David</forename>
<surname>Doermann</surname>
</persName>
<affiliation>University of Maryland, MD 20742, College Park, USA</affiliation>
</author>
</analytic>
<monogr>
<title level="m">Computer Vision – ECCV 2008</title>
<title level="m" type="sub">10th European Conference on Computer Vision, Marseille, France, October 12-18, 2008, Proceedings, Part II</title>
<idno type="pISBN">978-3-540-88685-3</idno>
<idno type="eISBN">978-3-540-88688-4</idno>
<idno type="pISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="DOI">10.1007/978-3-540-88688-4</idno>
<idno type="BookID">978-3-540-88688-4</idno>
<idno type="BookTitleID">183879</idno>
<idno type="BookSequenceNumber">5303</idno>
<idno type="BookVolumeNumber">5303</idno>
<idno type="BookChapterCount">61</idno>
<editor>
<persName>
<forename type="first">David</forename>
<surname>Forsyth</surname>
</persName>
<email>daf@cs.uiuc.edu</email>
<affiliation>Computer Science Department, University of Illinois at Urbana Champaign, 3310 Siebel Hall, IL 61801, Urbana, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Philip</forename>
<surname>Torr</surname>
</persName>
<email>philiptorr@brookes.ac.uk</email>
<affiliation>Department of Computing, Oxford Brookes University, OX33 1HX, Wheatley, Oxford, UK</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Andrew</forename>
<surname>Zisserman</surname>
</persName>
<email>az@robots.ox.ac.uk</email>
<affiliation>Department of Engineering Science, University of Oxford, Parks Road, OX1 3PJ, Oxford, UK</affiliation>
</editor>
<imprint>
<publisher>Springer Berlin Heidelberg</publisher>
<pubPlace>Berlin, Heidelberg</pubPlace>
<date type="published" when="2008"></date>
<biblScope unit="volume">5303</biblScope>
<biblScope unit="page" from="745">745</biblScope>
<biblScope unit="page" to="758">758</biblScope>
</imprint>
</monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<editor>
<persName>
<forename type="first">David</forename>
<surname>Hutchison</surname>
</persName>
</editor>
<editor>
<persName>
<forename type="first">Takeo</forename>
<surname>Kanade</surname>
</persName>
</editor>
<editor>
<persName>
<forename type="first">Josef</forename>
<surname>Kittler</surname>
</persName>
</editor>
<editor>
<persName>
<forename type="first">Jon</forename>
<forename type="first">M.</forename>
<surname>Kleinberg</surname>
</persName>
</editor>
<editor>
<persName>
<forename type="first">Friedemann</forename>
<surname>Mattern</surname>
</persName>
</editor>
<editor>
<persName>
<forename type="first">John</forename>
<forename type="first">C.</forename>
<surname>Mitchell</surname>
</persName>
</editor>
<editor>
<persName>
<forename type="first">Moni</forename>
<surname>Naor</surname>
</persName>
</editor>
<editor>
<persName>
<forename type="first">Oscar</forename>
<surname>Nierstrasz</surname>
</persName>
</editor>
<editor>
<persName>
<forename type="first">C.</forename>
<surname>Pandu Rangan</surname>
</persName>
</editor>
<editor>
<persName>
<forename type="first">Bernhard</forename>
<surname>Steffen</surname>
</persName>
</editor>
<editor>
<persName>
<forename type="first">Madhu</forename>
<surname>Sudan</surname>
</persName>
</editor>
<editor>
<persName>
<forename type="first">Demetri</forename>
<surname>Terzopoulos</surname>
</persName>
</editor>
<editor>
<persName>
<forename type="first">Doug</forename>
<surname>Tygar</surname>
</persName>
</editor>
<editor>
<persName>
<forename type="first">Moshe</forename>
<forename type="first">Y.</forename>
<surname>Vardi</surname>
</persName>
</editor>
<editor>
<persName>
<forename type="first">Gerhard</forename>
<surname>Weikum</surname>
</persName>
</editor>
<biblScope>
<date>2008</date>
</biblScope>
<idno type="pISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="seriesId">558</idno>
</series>
<idno type="istex">000EA72B875137D2E35868AFB5C5FCB5D7A54937</idno>
<idno type="DOI">10.1007/978-3-540-88688-4_55</idno>
<idno type="ChapterID">55</idno>
<idno type="ChapterID">Chap55</idno>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<creation>
<date>2008</date>
</creation>
<langUsage>
<language ident="en">en</language>
</langUsage>
<abstract xml:lang="en">
<p>Abstract: Developing effective content recognition methods for diverse imagery continues to challenge computer vision researchers. We present a new approach for document image content categorization using a lexicon of shape features. Each lexical word corresponds to a scale and rotation invariant shape feature that is generic enough to be detected repeatably and segmentation free. We learn a concise, structurally indexed shape lexicon from training by clustering and partitioning feature types through graph cuts. We demonstrate our approach on two challenging document image content recognition problems: 1) The classification of 4,500 Web images crawled from Google Image Search into three content categories — pure image, image with text, and document image, and 2) Language identification of 8 languages (Arabic, Chinese, English, Hindi, Japanese, Korean, Russian, and Thai) on a 1,512 complex document image database composed of mixed machine printed text and handwriting. Our approach is capable to handle high intra-class variability and shows results that exceed other state-of-the-art approaches, allowing it to be used as a content recognizer in image indexing and retrieval systems.</p>
</abstract>
<textClass>
<keywords scheme="Book Subject Collection">
<list>
<label>SUCO11645</label>
<item>
<term>Computer Science</term>
</item>
</list>
</keywords>
</textClass>
<textClass>
<keywords scheme="Book Subject Group">
<list>
<label>I</label>
<label>I22021</label>
<label>I22005</label>
<label>I22013</label>
<label>I2203X</label>
<label>I18030</label>
<label>I23036</label>
<item>
<term>Computer Science</term>
</item>
<item>
<term>Image Processing and Computer Vision</term>
</item>
<item>
<term>Computer Imaging, Vision, Pattern Recognition and Graphics</term>
</item>
<item>
<term>Computer Graphics</term>
</item>
<item>
<term>Pattern Recognition</term>
</item>
<item>
<term>Data Mining and Knowledge Discovery</term>
</item>
<item>
<term>Computer Appl. in Arts and Humanities</term>
</item>
</list>
</keywords>
</textClass>
</profileDesc>
<revisionDesc>
<change when="2008">Published</change>
<change xml:id="refBibs-istex" who="#ISTEX-API" when="2016-3-20">References added</change>
</revisionDesc>
</teiHeader>
</istex:fulltextTEI>
<json:item>
<original>false</original>
<mimetype>text/plain</mimetype>
<extension>txt</extension>
<uri>https://api.istex.fr/document/000EA72B875137D2E35868AFB5C5FCB5D7A54937/fulltext/txt</uri>
</json:item>
</fulltext>
<metadata>
<istex:metadataXml wicri:clean="Springer, Publisher found" wicri:toSee="no header">
<istex:xmlDeclaration>version="1.0" encoding="UTF-8"</istex:xmlDeclaration>
<istex:docType PUBLIC="-//Springer-Verlag//DTD A++ V2.4//EN" URI="http://devel.springer.de/A++/V2.4/DTD/A++V2.4.dtd" name="istex:docType"></istex:docType>
<istex:document>
<Publisher>
<PublisherInfo>
<PublisherName>Springer Berlin Heidelberg</PublisherName>
<PublisherLocation>Berlin, Heidelberg</PublisherLocation>
</PublisherInfo>
<Series>
<SeriesInfo SeriesType="Series" TocLevels="0">
<SeriesID>558</SeriesID>
<SeriesPrintISSN>0302-9743</SeriesPrintISSN>
<SeriesElectronicISSN>1611-3349</SeriesElectronicISSN>
<SeriesTitle Language="En">Lecture Notes in Computer Science</SeriesTitle>
</SeriesInfo>
<SeriesHeader>
<EditorGroup>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>David</GivenName>
<FamilyName>Hutchison</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>Takeo</GivenName>
<FamilyName>Kanade</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>Josef</GivenName>
<FamilyName>Kittler</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>Jon</GivenName>
<GivenName>M.</GivenName>
<FamilyName>Kleinberg</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>Friedemann</GivenName>
<FamilyName>Mattern</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>John</GivenName>
<GivenName>C.</GivenName>
<FamilyName>Mitchell</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>Moni</GivenName>
<FamilyName>Naor</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>Oscar</GivenName>
<FamilyName>Nierstrasz</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>C.</GivenName>
<FamilyName>Pandu Rangan</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>Bernhard</GivenName>
<FamilyName>Steffen</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>Madhu</GivenName>
<FamilyName>Sudan</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>Demetri</GivenName>
<FamilyName>Terzopoulos</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>Doug</GivenName>
<FamilyName>Tygar</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>Moshe</GivenName>
<GivenName>Y.</GivenName>
<FamilyName>Vardi</FamilyName>
</EditorName>
</Editor>
<Editor>
<EditorName DisplayOrder="Western">
<GivenName>Gerhard</GivenName>
<FamilyName>Weikum</FamilyName>
</EditorName>
</Editor>
</EditorGroup>
</SeriesHeader>
<Book Language="En">
<BookInfo BookProductType="Proceedings" ContainsESM="No" Language="En" MediaType="eBook" NumberingStyle="Unnumbered" OutputMedium="All" TocLevels="0">
<BookID>978-3-540-88688-4</BookID>
<BookTitle>Computer Vision – ECCV 2008</BookTitle>
<BookSubTitle>10th European Conference on Computer Vision, Marseille, France, October 12-18, 2008, Proceedings, Part II</BookSubTitle>
<BookVolumeNumber>5303</BookVolumeNumber>
<BookSequenceNumber>5303</BookSequenceNumber>
<BookDOI>10.1007/978-3-540-88688-4</BookDOI>
<BookTitleID>183879</BookTitleID>
<BookPrintISBN>978-3-540-88685-3</BookPrintISBN>
<BookElectronicISBN>978-3-540-88688-4</BookElectronicISBN>
<BookChapterCount>61</BookChapterCount>
<BookCopyright>
<CopyrightHolderName>Springer Berlin Heidelberg</CopyrightHolderName>
<CopyrightYear>2008</CopyrightYear>
</BookCopyright>
<BookSubjectGroup>
<BookSubject Code="I" Type="Primary">Computer Science</BookSubject>
<BookSubject Code="I22021" Priority="1" Type="Secondary">Image Processing and Computer Vision</BookSubject>
<BookSubject Code="I22005" Priority="2" Type="Secondary">Computer Imaging, Vision, Pattern Recognition and Graphics</BookSubject>
<BookSubject Code="I22013" Priority="3" Type="Secondary">Computer Graphics</BookSubject>
<BookSubject Code="I2203X" Priority="4" Type="Secondary">Pattern Recognition</BookSubject>
<BookSubject Code="I18030" Priority="5" Type="Secondary">Data Mining and Knowledge Discovery</BookSubject>
<BookSubject Code="I23036" Priority="6" Type="Secondary">Computer Appl. in Arts and Humanities</BookSubject>
<SubjectCollection Code="SUCO11645">Computer Science</SubjectCollection>
</BookSubjectGroup>
</BookInfo>
<BookHeader>
<EditorGroup>
<Editor AffiliationIDS="Aff1">
<EditorName DisplayOrder="Western">
<GivenName>David</GivenName>
<FamilyName>Forsyth</FamilyName>
</EditorName>
<Contact>
<Email>daf@cs.uiuc.edu</Email>
</Contact>
</Editor>
<Editor AffiliationIDS="Aff2">
<EditorName DisplayOrder="Western">
<GivenName>Philip</GivenName>
<FamilyName>Torr</FamilyName>
</EditorName>
<Contact>
<Email>philiptorr@brookes.ac.uk</Email>
</Contact>
</Editor>
<Editor AffiliationIDS="Aff3">
<EditorName DisplayOrder="Western">
<GivenName>Andrew</GivenName>
<FamilyName>Zisserman</FamilyName>
</EditorName>
<Contact>
<Email>az@robots.ox.ac.uk</Email>
</Contact>
</Editor>
<Affiliation ID="Aff1">
<OrgDivision>Computer Science Department</OrgDivision>
<OrgName>University of Illinois at Urbana Champaign</OrgName>
<OrgAddress>
<Street>3310 Siebel Hall</Street>
<Postcode>IL 61801</Postcode>
<City>Urbana</City>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff2">
<OrgDivision>Department of Computing</OrgDivision>
<OrgName>Oxford Brookes University</OrgName>
<OrgAddress>
<Postcode>OX33 1HX</Postcode>
<City>Wheatley, Oxford</City>
<Country>UK</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff3">
<OrgDivision>Department of Engineering Science</OrgDivision>
<OrgName>University of Oxford</OrgName>
<OrgAddress>
<Street>Parks Road</Street>
<Postcode>OX1 3PJ</Postcode>
<City>Oxford</City>
<Country>UK</Country>
</OrgAddress>
</Affiliation>
</EditorGroup>
</BookHeader>
<Part ID="Part3">
<PartInfo TocLevels="0">
<PartID>3</PartID>
<PartSequenceNumber>3</PartSequenceNumber>
<PartTitle>Poster Session II</PartTitle>
<PartChapterCount>50</PartChapterCount>
<PartContext>
<SeriesID>558</SeriesID>
<BookTitle>Computer Vision – ECCV 2008</BookTitle>
</PartContext>
</PartInfo>
<Chapter ID="Chap55" Language="En">
<ChapterInfo ChapterType="OriginalPaper" ContainsESM="No" NumberingStyle="Unnumbered" TocLevels="0">
<ChapterID>55</ChapterID>
<ChapterDOI>10.1007/978-3-540-88688-4_55</ChapterDOI>
<ChapterSequenceNumber>55</ChapterSequenceNumber>
<ChapterTitle Language="En">Learning Visual Shape Lexicon for Document Image Content Recognition</ChapterTitle>
<ChapterFirstPage>745</ChapterFirstPage>
<ChapterLastPage>758</ChapterLastPage>
<ChapterCopyright>
<CopyrightHolderName>Springer-Verlag Berlin Heidelberg</CopyrightHolderName>
<CopyrightYear>2008</CopyrightYear>
</ChapterCopyright>
<ChapterGrants Type="Regular">
<MetadataGrant Grant="OpenAccess"></MetadataGrant>
<AbstractGrant Grant="OpenAccess"></AbstractGrant>
<BodyPDFGrant Grant="Restricted"></BodyPDFGrant>
<BodyHTMLGrant Grant="Restricted"></BodyHTMLGrant>
<BibliographyGrant Grant="Restricted"></BibliographyGrant>
<ESMGrant Grant="Restricted"></ESMGrant>
</ChapterGrants>
<ChapterContext>
<SeriesID>558</SeriesID>
<PartID>3</PartID>
<BookID>978-3-540-88688-4</BookID>
<BookTitle>Computer Vision – ECCV 2008</BookTitle>
</ChapterContext>
</ChapterInfo>
<ChapterHeader>
<AuthorGroup>
<Author AffiliationIDS="Aff4">
<AuthorName DisplayOrder="Western">
<GivenName>Guangyu</GivenName>
<FamilyName>Zhu</FamilyName>
</AuthorName>
</Author>
<Author AffiliationIDS="Aff4">
<AuthorName DisplayOrder="Western">
<GivenName>Xiaodong</GivenName>
<FamilyName>Yu</FamilyName>
</AuthorName>
</Author>
<Author AffiliationIDS="Aff4">
<AuthorName DisplayOrder="Western">
<GivenName>Yi</GivenName>
<FamilyName>Li</FamilyName>
</AuthorName>
</Author>
<Author AffiliationIDS="Aff4">
<AuthorName DisplayOrder="Western">
<GivenName>David</GivenName>
<FamilyName>Doermann</FamilyName>
</AuthorName>
</Author>
<Affiliation ID="Aff4">
<OrgName>University of Maryland</OrgName>
<OrgAddress>
<City>College Park</City>
<Postcode>MD 20742</Postcode>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
</AuthorGroup>
<Abstract ID="Abs1" Language="En">
<Heading>Abstract</Heading>
<Para>Developing effective content recognition methods for diverse imagery continues to challenge computer vision researchers. We present a new approach for document image content categorization using a lexicon of shape features. Each lexical word corresponds to a scale and rotation invariant shape feature that is generic enough to be detected repeatably and segmentation free. We learn a concise, structurally indexed shape lexicon from training by clustering and partitioning feature types through graph cuts. We demonstrate our approach on two challenging document image content recognition problems: 1) The classification of 4,500 Web images crawled from Google Image Search into three content categories — pure image, image with text, and document image, and 2) Language identification of 8 languages (Arabic, Chinese, English, Hindi, Japanese, Korean, Russian, and Thai) on a 1,512 complex document image database composed of mixed machine printed text and handwriting. Our approach is capable to handle high intra-class variability and shows results that exceed other state-of-the-art approaches, allowing it to be used as a content recognizer in image indexing and retrieval systems.</Para>
</Abstract>
</ChapterHeader>
<NoBody></NoBody>
</Chapter>
</Part>
</Book>
</Series>
</Publisher>
</istex:document>
</istex:metadataXml>
<mods version="3.6">
<titleInfo lang="en">
<title>Learning Visual Shape Lexicon for Document Image Content Recognition</title>
</titleInfo>
<titleInfo type="alternative" contentType="CDATA" lang="en">
<title>Learning Visual Shape Lexicon for Document Image Content Recognition</title>
</titleInfo>
<name type="personal">
<namePart type="given">Guangyu</namePart>
<namePart type="family">Zhu</namePart>
<affiliation>University of Maryland, MD 20742, College Park, USA</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Xiaodong</namePart>
<namePart type="family">Yu</namePart>
<affiliation>University of Maryland, MD 20742, College Park, USA</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Yi</namePart>
<namePart type="family">Li</namePart>
<affiliation>University of Maryland, MD 20742, College Park, USA</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">David</namePart>
<namePart type="family">Doermann</namePart>
<affiliation>University of Maryland, MD 20742, College Park, USA</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<typeOfResource>text</typeOfResource>
<genre type="conference [eBooks]" displayLabel="OriginalPaper"></genre>
<originInfo>
<publisher>Springer Berlin Heidelberg</publisher>
<place>
<placeTerm type="text">Berlin, Heidelberg</placeTerm>
</place>
<dateIssued encoding="w3cdtf">2008</dateIssued>
<copyrightDate encoding="w3cdtf">2008</copyrightDate>
</originInfo>
<language>
<languageTerm type="code" authority="rfc3066">en</languageTerm>
<languageTerm type="code" authority="iso639-2b">eng</languageTerm>
</language>
<physicalDescription>
<internetMediaType>text/html</internetMediaType>
</physicalDescription>
<abstract lang="en">Abstract: Developing effective content recognition methods for diverse imagery continues to challenge computer vision researchers. We present a new approach for document image content categorization using a lexicon of shape features. Each lexical word corresponds to a scale and rotation invariant shape feature that is generic enough to be detected repeatably and segmentation free. We learn a concise, structurally indexed shape lexicon from training by clustering and partitioning feature types through graph cuts. We demonstrate our approach on two challenging document image content recognition problems: 1) The classification of 4,500 Web images crawled from Google Image Search into three content categories — pure image, image with text, and document image, and 2) Language identification of 8 languages (Arabic, Chinese, English, Hindi, Japanese, Korean, Russian, and Thai) on a 1,512 complex document image database composed of mixed machine printed text and handwriting. Our approach is capable to handle high intra-class variability and shows results that exceed other state-of-the-art approaches, allowing it to be used as a content recognizer in image indexing and retrieval systems.</abstract>
<relatedItem type="host">
<titleInfo>
<title>Computer Vision – ECCV 2008</title>
<subTitle>10th European Conference on Computer Vision, Marseille, France, October 12-18, 2008, Proceedings, Part II</subTitle>
</titleInfo>
<name type="personal">
<namePart type="given">David</namePart>
<namePart type="family">Forsyth</namePart>
<affiliation>Computer Science Department, University of Illinois at Urbana Champaign, 3310 Siebel Hall, IL 61801, Urbana, USA</affiliation>
<affiliation>E-mail: daf@cs.uiuc.edu</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Philip</namePart>
<namePart type="family">Torr</namePart>
<affiliation>Department of Computing, Oxford Brookes University, OX33 1HX, Wheatley, Oxford, UK</affiliation>
<affiliation>E-mail: philiptorr@brookes.ac.uk</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Andrew</namePart>
<namePart type="family">Zisserman</namePart>
<affiliation>Department of Engineering Science, University of Oxford, Parks Road, OX1 3PJ, Oxford, UK</affiliation>
<affiliation>E-mail: az@robots.ox.ac.uk</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<genre type="Book Series" displayLabel="Proceedings"></genre>
<originInfo>
<copyrightDate encoding="w3cdtf">2008</copyrightDate>
<issuance>monographic</issuance>
</originInfo>
<subject>
<genre>Book Subject Collection</genre>
<topic authority="SpringerSubjectCodes" authorityURI="SUCO11645">Computer Science</topic>
</subject>
<subject>
<genre>Book Subject Group</genre>
<topic authority="SpringerSubjectCodes" authorityURI="I">Computer Science</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I22021">Image Processing and Computer Vision</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I22005">Computer Imaging, Vision, Pattern Recognition and Graphics</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I22013">Computer Graphics</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I2203X">Pattern Recognition</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I18030">Data Mining and Knowledge Discovery</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I23036">Computer Appl. in Arts and Humanities</topic>
</subject>
<identifier type="DOI">10.1007/978-3-540-88688-4</identifier>
<identifier type="ISBN">978-3-540-88685-3</identifier>
<identifier type="eISBN">978-3-540-88688-4</identifier>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="eISSN">1611-3349</identifier>
<identifier type="BookTitleID">183879</identifier>
<identifier type="BookID">978-3-540-88688-4</identifier>
<identifier type="BookChapterCount">61</identifier>
<identifier type="BookVolumeNumber">5303</identifier>
<identifier type="BookSequenceNumber">5303</identifier>
<identifier type="PartChapterCount">50</identifier>
<part>
<date>2008</date>
<detail type="part">
<title>Poster Session II</title>
</detail>
<detail type="volume">
<number>5303</number>
<caption>vol.</caption>
</detail>
<extent unit="pages">
<start>745</start>
<end>758</end>
</extent>
</part>
<recordInfo>
<recordOrigin>Springer Berlin Heidelberg, 2008</recordOrigin>
</recordInfo>
</relatedItem>
<relatedItem type="series">
<titleInfo>
<title>Lecture Notes in Computer Science</title>
</titleInfo>
<name type="personal">
<namePart type="given">David</namePart>
<namePart type="family">Hutchison</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Takeo</namePart>
<namePart type="family">Kanade</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Josef</namePart>
<namePart type="family">Kittler</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Jon</namePart>
<namePart type="given">M.</namePart>
<namePart type="family">Kleinberg</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Friedemann</namePart>
<namePart type="family">Mattern</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">John</namePart>
<namePart type="given">C.</namePart>
<namePart type="family">Mitchell</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Moni</namePart>
<namePart type="family">Naor</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Oscar</namePart>
<namePart type="family">Nierstrasz</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">C.</namePart>
<namePart type="family">Pandu Rangan</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Bernhard</namePart>
<namePart type="family">Steffen</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Madhu</namePart>
<namePart type="family">Sudan</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Demetri</namePart>
<namePart type="family">Terzopoulos</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Doug</namePart>
<namePart type="family">Tygar</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Moshe</namePart>
<namePart type="given">Y.</namePart>
<namePart type="family">Vardi</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Gerhard</namePart>
<namePart type="family">Weikum</namePart>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<copyrightDate encoding="w3cdtf">2008</copyrightDate>
<issuance>serial</issuance>
</originInfo>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="eISSN">1611-3349</identifier>
<identifier type="SeriesID">558</identifier>
<recordInfo>
<recordOrigin>Springer Berlin Heidelberg, 2008</recordOrigin>
</recordInfo>
</relatedItem>
<identifier type="istex">000EA72B875137D2E35868AFB5C5FCB5D7A54937</identifier>
<identifier type="DOI">10.1007/978-3-540-88688-4_55</identifier>
<identifier type="ChapterID">55</identifier>
<identifier type="ChapterID">Chap55</identifier>
<accessCondition type="use and reproduction" contentType="copyright">Springer Berlin Heidelberg, 2008</accessCondition>
<recordInfo>
<recordContentSource>SPRINGER</recordContentSource>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2008</recordOrigin>
</recordInfo>
</mods>
</metadata>
<enrichments>
<istex:refBibTEI uri="https://api.istex.fr/document/000EA72B875137D2E35868AFB5C5FCB5D7A54937/enrichments/refBib">
<teiHeader></teiHeader>
<text>
<front></front>
<body></body>
<back>
<listBibl>
<biblStruct xml:id="b0">
<analytic>
<title level="a" type="main">A computational model for visual selection</title>
<author>
<persName>
<forename type="first">Y</forename>
<surname>Amit</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<surname>Geman</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">Neural Computation</title>
<imprint>
<biblScope unit="volume">11</biblScope>
<biblScope unit="page" from="1691" to="1715"></biblScope>
<date type="published" when="1999"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b1">
<analytic>
<title level="a" type="main">Shape matching and object recognition using shape contexts</title>
<author>
<persName>
<forename type="first">S</forename>
<surname>Belongie</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">J</forename>
<surname>Malik</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">J</forename>
<surname>Puzicha</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint>
<biblScope unit="volume">24</biblScope>
<biblScope unit="issue">4</biblScope>
<biblScope unit="page" from="509" to="522"></biblScope>
<date type="published" when="2002"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b2">
<analytic>
<title level="a" type="main">Efficient indexing for articulation invariant shape matching and retrieval</title>
<author>
<persName>
<forename type="first">S</forename>
<surname>Biswas</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">G</forename>
<surname>Aggarwal</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">R</forename>
<surname>Chellappa</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proc. CVPR</title>
<meeting>. CVPR</meeting>
<imprint>
<date type="published" when="2007"></date>
<biblScope unit="page" from="1" to="8"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b3">
<analytic>
<title level="a" type="main">Texture for script identification</title>
<author>
<persName>
<forename type="first">A</forename>
<surname>Busch</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">W</forename>
<surname>Boles</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">S</forename>
<surname>Sridharan</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint>
<biblScope unit="volume">27</biblScope>
<biblScope unit="issue">11</biblScope>
<biblScope unit="page" from="1720" to="1732"></biblScope>
<date type="published" when="2005"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b4">
<analytic>
<title level="a" type="main">A computational approach to edge detection</title>
<author>
<persName>
<forename type="first">J</forename>
<surname>Canny</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint>
<biblScope unit="volume">8</biblScope>
<biblScope unit="issue">6</biblScope>
<biblScope unit="page" from="679" to="697"></biblScope>
<date type="published" when="1986"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b5">
<analytic>
<title level="a" type="main">Histograms of oriented gradients for human detection</title>
<author>
<persName>
<forename type="first">N</forename>
<surname>Dalal</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">B</forename>
<surname>Triggs</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proc. CVPR</title>
<meeting>. CVPR</meeting>
<imprint>
<date type="published" when="2005"></date>
<biblScope unit="page" from="886" to="893"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b6">
<analytic>
<title level="a" type="main">Classification of oriental and European scripts by using characteristic features</title>
<author>
<persName>
<forename type="first">J</forename>
<surname>Ding</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">L</forename>
<surname>Lam</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">C</forename>
<surname>Suen</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proc. ICDAR</title>
<meeting>. ICDAR</meeting>
<imprint>
<date type="published" when="1997"></date>
<biblScope unit="page" from="1023" to="1027"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b7">
<analytic>
<title level="a" type="main">Groups of adjacent contour segments for object detection</title>
<author>
<persName>
<forename type="first">V</forename>
<surname>Ferrari</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">L</forename>
<surname>Fevrier</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">F</forename>
<surname>Jurie</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">C</forename>
<surname>Schmid</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint>
<biblScope unit="volume">30</biblScope>
<biblScope unit="issue">1</biblScope>
<biblScope unit="page" from="1" to="16"></biblScope>
<date type="published" when="2008"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b8">
<analytic>
<title level="a" type="main">Towards scalable representations of object categories: Learning a hierarchy of parts</title>
<author>
<persName>
<forename type="first">S</forename>
<surname>Fidler</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">A</forename>
<surname>Leonardis</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proc. CVPR</title>
<meeting>. CVPR</meeting>
<imprint>
<date type="published" when="2007"></date>
<biblScope unit="page" from="1" to="8"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b9">
<analytic>
<title level="a" type="main">Flexible syntactic matching of curves and its application to automatic hierarchical classification of silhouettes</title>
<author>
<persName>
<forename type="first">Y</forename>
<surname>Gdalyahu</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<surname>Weinshall</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint>
<biblScope unit="volume">21</biblScope>
<biblScope unit="issue">12</biblScope>
<biblScope unit="page" from="1312" to="1328"></biblScope>
<date type="published" when="1999"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b10">
<analytic>
<title level="a" type="main">Automatic script identification from document images using cluster-based templates</title>
<author>
<persName>
<forename type="first">J</forename>
<surname>Hochberg</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">P</forename>
<surname>Kelly</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">T</forename>
<surname>Thomas</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">L</forename>
<surname>Kerns</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint>
<biblScope unit="volume">19</biblScope>
<biblScope unit="issue">2</biblScope>
<biblScope unit="page" from="176" to="181"></biblScope>
<date type="published" when="1997"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b11">
<analytic>
<title level="a" type="main">Robust and efficient detection of salient convex groups</title>
<author>
<persName>
<forename type="first">D</forename>
<surname>Jacobs</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint>
<biblScope unit="volume">18</biblScope>
<biblScope unit="issue">1</biblScope>
<biblScope unit="page" from="23" to="37"></biblScope>
<date type="published" when="1996"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b12">
<analytic>
<title level="a" type="main">Shape descriptors for non-rigid shapes with a single closed contour</title>
<author>
<persName>
<forename type="first">L</forename>
<surname>Latecki</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">R</forename>
<surname>Lakamper</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">U</forename>
<surname>Eckhardt</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proc. CVPR</title>
<meeting>. CVPR</meeting>
<imprint>
<date type="published" when="2000"></date>
<biblScope unit="page" from="424" to="429"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b13">
<analytic>
<title level="a" type="main">Language Identification in Complex, Unoriented, and Degraded Document Images</title>
<author>
<persName>
<forename type="first">D</forename>
<surname>Lee</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">C</forename>
<surname>Nohl</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">H</forename>
<surname>Baird</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">Document Analysis Systems II</title>
<imprint>
<date type="published" when="1998"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b14">
<analytic>
<title level="a" type="main">Script-independent text line segmentation in freestyle handwritten documents</title>
<author>
<persName>
<forename type="first">Y</forename>
<surname>Li</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">Y</forename>
<surname>Zheng</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<surname>Doermann</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">S</forename>
<surname>Jaeger</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint>
<biblScope unit="volume">30</biblScope>
<biblScope unit="issue">8</biblScope>
<biblScope unit="page" from="1313" to="1329"></biblScope>
<date type="published" when="2008"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b15">
<analytic>
<title level="a" type="main">Shape classification using the inner-distance</title>
<author>
<persName>
<forename type="first">H</forename>
<surname>Ling</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<surname>Jacobs</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint>
<biblScope unit="volume">29</biblScope>
<biblScope unit="issue">2</biblScope>
<biblScope unit="page" from="286" to="299"></biblScope>
<date type="published" when="2007"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b16">
<analytic>
<title level="a" type="main">Three-dimensional object recognition from single two-dimensional images</title>
<author>
<persName>
<forename type="first">D</forename>
<surname>Lowe</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">Artificial Intelligence</title>
<imprint>
<biblScope unit="volume">31</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="355" to="395"></biblScope>
<date type="published" when="1987"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b17">
<analytic>
<title level="a" type="main">Script and language identification in noisy and degraded document images</title>
<author>
<persName>
<forename type="first">S</forename>
<surname>Lu</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">C</forename>
<surname>Tan</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint>
<biblScope unit="volume">30</biblScope>
<biblScope unit="issue">2</biblScope>
<biblScope unit="page" from="14" to="24"></biblScope>
<date type="published" when="2008"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b18">
<analytic>
<title level="a" type="main">The IAM-database: An English sentence database for offline handwriting recognition</title>
<author>
<persName>
<forename type="first">U</forename>
<surname>Marti</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">H</forename>
<surname>Bunke</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">Int. J. Document Analysis and Recognition</title>
<imprint>
<biblScope unit="volume">5</biblScope>
<biblScope unit="page" from="39" to="46"></biblScope>
<date type="published" when="2006"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b19">
<analytic>
<title level="a" type="main">Multiresolution gray-scale and rotation invariant texture classification with local binary patterns</title>
<author>
<persName>
<forename type="first">T</forename>
<surname>Ojala</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">M</forename>
<surname>Pietikainen</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">T</forename>
<surname>Maenpaa</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint>
<biblScope unit="volume">24</biblScope>
<biblScope unit="issue">7</biblScope>
<biblScope unit="page" from="971" to="987"></biblScope>
<date type="published" when="2002"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b20">
<analytic>
<title level="a" type="main">Modeling the shape of the scene: A holistic representation of the spatial envelope</title>
<author>
<persName>
<forename type="first">A</forename>
<surname>Oliva</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">A</forename>
<surname>Torralba</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">Int. J. Computer Vision</title>
<imprint>
<biblScope unit="volume">42</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="145" to="175"></biblScope>
<date type="published" when="2001"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b21">
<analytic>
<title level="a" type="main">On-line and off-line handwriting recognition: A comprehensive survey</title>
<author>
<persName>
<forename type="first">R</forename>
<surname>Plamondon</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">S</forename>
<surname>Srihari</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint>
<biblScope unit="volume">22</biblScope>
<biblScope unit="issue">1</biblScope>
<biblScope unit="page" from="63" to="84"></biblScope>
<date type="published" when="2000"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b22">
<monogr>
<title level="m" type="main">Optical Character Recognition: An Illustrated Guide to the Frontier</title>
<author>
<persName>
<forename type="first">S</forename>
<surname>Rice</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">G</forename>
<surname>Nagy</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">T</forename>
<surname>Nartker</surname>
</persName>
</author>
<imprint>
<date type="published" when="1999"></date>
<publisher>Kluwer Academic Publishers</publisher>
<pubPlace>Dordrecht</pubPlace>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b23">
<analytic>
<title level="a" type="main">Planar object recognition using projective shape representation</title>
<author>
<persName>
<forename type="first">C</forename>
<surname>Rothwell</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">A</forename>
<surname>Zisserman</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<surname>Forsyth</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">J</forename>
<surname>Mundy</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">Int. J. Computer Vision</title>
<imprint>
<biblScope unit="volume">16</biblScope>
<biblScope unit="issue">5</biblScope>
<biblScope unit="page" from="57" to="99"></biblScope>
<date type="published" when="1995"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b24">
<analytic>
<title level="a" type="main">Symmetry-based indexing of image database</title>
<author>
<persName>
<forename type="first">D</forename>
<surname>Sharvit</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">J</forename>
<surname>Chan</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">H</forename>
<surname>Tek</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">B</forename>
<surname>Kimia</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">J. Visual Commun. and Image Representation</title>
<imprint>
<biblScope unit="volume">9</biblScope>
<biblScope unit="issue">4</biblScope>
<biblScope unit="page" from="366" to="380"></biblScope>
<date type="published" when="1998"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b25">
<analytic>
<title level="a" type="main">Normalized cuts and image segmentation</title>
<author>
<persName>
<forename type="first">J</forename>
<surname>Shi</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">J</forename>
<surname>Malik</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint>
<biblScope unit="volume">22</biblScope>
<biblScope unit="issue">8</biblScope>
<biblScope unit="page" from="888" to="905"></biblScope>
<date type="published" when="2000"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b26">
<analytic>
<title level="a" type="main">Determination of script and language content of document images</title>
<author>
<persName>
<forename type="first">A</forename>
<surname>Spitz</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint>
<biblScope unit="volume">19</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="235" to="245"></biblScope>
<date type="published" when="1997"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b27">
<analytic>
<title level="a" type="main">Categorizing document images into script and language classes</title>
<author>
<persName>
<forename type="first">C</forename>
<surname>Suen</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">S</forename>
<surname>Bergler</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">N</forename>
<surname>Nobile</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">B</forename>
<surname>Waked</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">C</forename>
<surname>Nadal</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">A</forename>
<surname>Bloch</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proc. ICDAR</title>
<meeting>. ICDAR</meeting>
<imprint>
<date type="published" when="1998"></date>
<biblScope unit="page" from="297" to="306"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b28">
<analytic>
<title level="a" type="main">Rotation invariant texture features and their use in automatic script identification</title>
<author>
<persName>
<forename type="first">T</forename>
<surname>Tan</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">IEEE Trans. Pattern Anal. Mach. Intell</title>
<imprint>
<biblScope unit="volume">20</biblScope>
<biblScope unit="issue">7</biblScope>
<biblScope unit="page" from="751" to="756"></biblScope>
<date type="published" when="1998"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b29">
<analytic>
<title level="a" type="main">Google Book Search: Document understanding on a massive scale</title>
<author>
<persName>
<forename type="first">L</forename>
<surname>Vincent</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proc. ICDAR</title>
<meeting>. ICDAR</meeting>
<imprint>
<date type="published" when="2007"></date>
<biblScope unit="page" from="819" to="823"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b30">
<analytic>
<title level="a" type="main">Multiclass spectral clustering</title>
<author>
<persName>
<forename type="first">S</forename>
<surname>Yu</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">J</forename>
<surname>Shi</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proc. ICCV</title>
<meeting>. ICCV</meeting>
<imprint>
<date type="published" when="2003"></date>
<biblScope unit="page" from="11" to="17"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b31">
<analytic>
<title level="a" type="main">Extracting relevant named entities for automated expense reimbursement</title>
<author>
<persName>
<forename type="first">G</forename>
<surname>Zhu</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">T</forename>
<forename type="middle">J</forename>
<surname>Bethea</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">V</forename>
<surname>Krishna</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proc. ACM SIGKDD Int. Conf. Knowledge Discovery and Data Mining</title>
<meeting>. ACM SIGKDD Int. Conf. Knowledge Discovery and Data Mining</meeting>
<imprint>
<date type="published" when="2007"></date>
<biblScope unit="page" from="1004" to="1012"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b32">
<analytic>
<title level="a" type="main">Unconstrained language identification using a shape codebook</title>
<author>
<persName>
<forename type="first">G</forename>
<surname>Zhu</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">X</forename>
<surname>Yu</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">Y</forename>
<surname>Li</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<surname>Doermann</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proc. ICFHR</title>
<meeting>. ICFHR</meeting>
<imprint>
<date type="published" when="2008"></date>
<biblScope unit="page" from="13" to="18"></biblScope>
</imprint>
</monogr>
</biblStruct>
</listBibl>
</back>
</text>
</istex:refBibTEI>
</enrichments>
</istex>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Istex/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000C96 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Istex/Corpus/biblio.hfd -nk 000C96 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Istex
   |étape=   Corpus
   |type=    RBID
   |clé=     ISTEX:000EA72B875137D2E35868AFB5C5FCB5D7A54937
   |texte=   Learning Visual Shape Lexicon for Document Image Content Recognition
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024