Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

An Automatic Video Text Detection, Localization and Extraction Approach

Identifieur interne : 000229 ( Istex/Corpus ); précédent : 000228; suivant : 000230

An Automatic Video Text Detection, Localization and Extraction Approach

Auteurs : Chengjun Zhu ; Yuanxin Ouyang ; Lei Gao ; Zhenyong Chen ; Zhang Xiong

Source :

RBID : ISTEX:D7BE73F548C8DA59A7383DA67FF544008E430728

Abstract

Abstract: Text in video is a very compact and accurate clue for video indexing and summarization. This paper presents an algorithm regarding word group as a special symbol to detect, localize and extract video text using support vector machine (SVM) automatically. First, four sobel operators are applied to get the EM(edge map) of the video frame and the EM is segmented into N×2N size blocks. Then character features and characters group structure features are extracted to construct a 19-dimension feature vector. We use a pre-trained SVM to partition each block into two classes: text and non-text blocks. Secondly a dilatation-shrink process is employed to adjust the text position. Finally text regions are enhanced by multiple frame information. After binarization of enhanced text region, the text region with clean background is recognized by OCR software. Experimental results show that the proposed method can detect, localize, and extract video texts with high accuracy.

Url:
DOI: 10.1007/978-3-642-01350-8_1

Links to Exploration step

ISTEX:D7BE73F548C8DA59A7383DA67FF544008E430728

Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">An Automatic Video Text Detection, Localization and Extraction Approach</title>
<author>
<name sortKey="Zhu, Chengjun" sort="Zhu, Chengjun" uniqKey="Zhu C" first="Chengjun" last="Zhu">Chengjun Zhu</name>
<affiliation>
<mods:affiliation>School of Computer Science and Technology, Beihang University, No. 37 Xue Yuan Road, Haidian District, Beijing, P.R.China</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: zhucj@cse.buaa.edu.cn</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Ouyang, Yuanxin" sort="Ouyang, Yuanxin" uniqKey="Ouyang Y" first="Yuanxin" last="Ouyang">Yuanxin Ouyang</name>
<affiliation>
<mods:affiliation>School of Computer Science and Technology, Beihang University, No. 37 Xue Yuan Road, Haidian District, Beijing, P.R.China</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: oyyx@buaa.edu.cn</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Gao, Lei" sort="Gao, Lei" uniqKey="Gao L" first="Lei" last="Gao">Lei Gao</name>
<affiliation>
<mods:affiliation>School of Computer Science and Technology, Beihang University, No. 37 Xue Yuan Road, Haidian District, Beijing, P.R.China</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: gaol@cse.buaa.edu.cn</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Chen, Zhenyong" sort="Chen, Zhenyong" uniqKey="Chen Z" first="Zhenyong" last="Chen">Zhenyong Chen</name>
<affiliation>
<mods:affiliation>School of Computer Science and Technology, Beihang University, No. 37 Xue Yuan Road, Haidian District, Beijing, P.R.China</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: chzhyong@buaa.edu.cn</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Xiong, Zhang" sort="Xiong, Zhang" uniqKey="Xiong Z" first="Zhang" last="Xiong">Zhang Xiong</name>
<affiliation>
<mods:affiliation>School of Computer Science and Technology, Beihang University, No. 37 Xue Yuan Road, Haidian District, Beijing, P.R.China</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: xiongz@buaa.edu.cn</mods:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:D7BE73F548C8DA59A7383DA67FF544008E430728</idno>
<date when="2009" year="2009">2009</date>
<idno type="doi">10.1007/978-3-642-01350-8_1</idno>
<idno type="url">https://api.istex.fr/document/D7BE73F548C8DA59A7383DA67FF544008E430728/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000229</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">An Automatic Video Text Detection, Localization and Extraction Approach</title>
<author>
<name sortKey="Zhu, Chengjun" sort="Zhu, Chengjun" uniqKey="Zhu C" first="Chengjun" last="Zhu">Chengjun Zhu</name>
<affiliation>
<mods:affiliation>School of Computer Science and Technology, Beihang University, No. 37 Xue Yuan Road, Haidian District, Beijing, P.R.China</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: zhucj@cse.buaa.edu.cn</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Ouyang, Yuanxin" sort="Ouyang, Yuanxin" uniqKey="Ouyang Y" first="Yuanxin" last="Ouyang">Yuanxin Ouyang</name>
<affiliation>
<mods:affiliation>School of Computer Science and Technology, Beihang University, No. 37 Xue Yuan Road, Haidian District, Beijing, P.R.China</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: oyyx@buaa.edu.cn</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Gao, Lei" sort="Gao, Lei" uniqKey="Gao L" first="Lei" last="Gao">Lei Gao</name>
<affiliation>
<mods:affiliation>School of Computer Science and Technology, Beihang University, No. 37 Xue Yuan Road, Haidian District, Beijing, P.R.China</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: gaol@cse.buaa.edu.cn</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Chen, Zhenyong" sort="Chen, Zhenyong" uniqKey="Chen Z" first="Zhenyong" last="Chen">Zhenyong Chen</name>
<affiliation>
<mods:affiliation>School of Computer Science and Technology, Beihang University, No. 37 Xue Yuan Road, Haidian District, Beijing, P.R.China</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: chzhyong@buaa.edu.cn</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Xiong, Zhang" sort="Xiong, Zhang" uniqKey="Xiong Z" first="Zhang" last="Xiong">Zhang Xiong</name>
<affiliation>
<mods:affiliation>School of Computer Science and Technology, Beihang University, No. 37 Xue Yuan Road, Haidian District, Beijing, P.R.China</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: xiongz@buaa.edu.cn</mods:affiliation>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2009</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">D7BE73F548C8DA59A7383DA67FF544008E430728</idno>
<idno type="DOI">10.1007/978-3-642-01350-8_1</idno>
<idno type="ChapterID">1</idno>
<idno type="ChapterID">Chap1</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: Text in video is a very compact and accurate clue for video indexing and summarization. This paper presents an algorithm regarding word group as a special symbol to detect, localize and extract video text using support vector machine (SVM) automatically. First, four sobel operators are applied to get the EM(edge map) of the video frame and the EM is segmented into N×2N size blocks. Then character features and characters group structure features are extracted to construct a 19-dimension feature vector. We use a pre-trained SVM to partition each block into two classes: text and non-text blocks. Secondly a dilatation-shrink process is employed to adjust the text position. Finally text regions are enhanced by multiple frame information. After binarization of enhanced text region, the text region with clean background is recognized by OCR software. Experimental results show that the proposed method can detect, localize, and extract video texts with high accuracy.</div>
</front>
</TEI>
<istex>
<corpusName>springer</corpusName>
<author>
<json:item>
<name>Chengjun Zhu</name>
<affiliations>
<json:string>School of Computer Science and Technology, Beihang University, No. 37 Xue Yuan Road, Haidian District, Beijing, P.R.China</json:string>
<json:string>E-mail: zhucj@cse.buaa.edu.cn</json:string>
</affiliations>
</json:item>
<json:item>
<name>Yuanxin Ouyang</name>
<affiliations>
<json:string>School of Computer Science and Technology, Beihang University, No. 37 Xue Yuan Road, Haidian District, Beijing, P.R.China</json:string>
<json:string>E-mail: oyyx@buaa.edu.cn</json:string>
</affiliations>
</json:item>
<json:item>
<name>Lei Gao</name>
<affiliations>
<json:string>School of Computer Science and Technology, Beihang University, No. 37 Xue Yuan Road, Haidian District, Beijing, P.R.China</json:string>
<json:string>E-mail: gaol@cse.buaa.edu.cn</json:string>
</affiliations>
</json:item>
<json:item>
<name>Zhenyong Chen</name>
<affiliations>
<json:string>School of Computer Science and Technology, Beihang University, No. 37 Xue Yuan Road, Haidian District, Beijing, P.R.China</json:string>
<json:string>E-mail: chzhyong@buaa.edu.cn</json:string>
</affiliations>
</json:item>
<json:item>
<name>Zhang Xiong</name>
<affiliations>
<json:string>School of Computer Science and Technology, Beihang University, No. 37 Xue Yuan Road, Haidian District, Beijing, P.R.China</json:string>
<json:string>E-mail: xiongz@buaa.edu.cn</json:string>
</affiliations>
</json:item>
</author>
<language>
<json:string>eng</json:string>
</language>
<abstract>Abstract: Text in video is a very compact and accurate clue for video indexing and summarization. This paper presents an algorithm regarding word group as a special symbol to detect, localize and extract video text using support vector machine (SVM) automatically. First, four sobel operators are applied to get the EM(edge map) of the video frame and the EM is segmented into N×2N size blocks. Then character features and characters group structure features are extracted to construct a 19-dimension feature vector. We use a pre-trained SVM to partition each block into two classes: text and non-text blocks. Secondly a dilatation-shrink process is employed to adjust the text position. Finally text regions are enhanced by multiple frame information. After binarization of enhanced text region, the text region with clean background is recognized by OCR software. Experimental results show that the proposed method can detect, localize, and extract video texts with high accuracy.</abstract>
<qualityIndicators>
<score>6.138</score>
<pdfVersion>1.6</pdfVersion>
<pdfPageSize>430 x 660 pts</pdfPageSize>
<refBibsNative>false</refBibsNative>
<keywordCount>0</keywordCount>
<abstractCharCount>982</abstractCharCount>
<pdfWordCount>2826</pdfWordCount>
<pdfCharCount>16070</pdfCharCount>
<pdfPageCount>9</pdfPageCount>
<abstractWordCount>151</abstractWordCount>
</qualityIndicators>
<title>An Automatic Video Text Detection, Localization and Extraction Approach</title>
<genre.original>
<json:string>OriginalPaper</json:string>
</genre.original>
<chapterId>
<json:string>1</json:string>
<json:string>Chap1</json:string>
</chapterId>
<genre>
<json:string>conference [eBooks]</json:string>
</genre>
<serie>
<volume>I</volume>
<editor>
<json:item>
<name>David Hutchison</name>
<affiliations>
<json:string>Lancaster University, Lancaster, UK</json:string>
</affiliations>
</json:item>
<json:item>
<name>Takeo Kanade</name>
<affiliations>
<json:string>Carnegie Mellon University, Pittsburgh, PA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Josef Kittler</name>
<affiliations>
<json:string>University of Surrey, Guildford, UK</json:string>
</affiliations>
</json:item>
<json:item>
<name>Jon M. Kleinberg</name>
<affiliations>
<json:string>Cornell University, Ithaca, NY, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Friedemann Mattern</name>
<affiliations>
<json:string>ETH Zurich, Zurich, Switzerland</json:string>
</affiliations>
</json:item>
<json:item>
<name>John C. Mitchell</name>
<affiliations>
<json:string>Stanford University, Stanford, CA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Moni Naor</name>
<affiliations>
<json:string>Weizmann Institute of Science, Rehovot, Israel</json:string>
</affiliations>
</json:item>
<json:item>
<name>Oscar Nierstrasz</name>
<affiliations>
<json:string>University of Bern, Bern, Switzerland</json:string>
</affiliations>
</json:item>
<json:item>
<name>C. Pandu Rangan</name>
<affiliations>
<json:string>Indian Institute of Technology, Madras, India</json:string>
</affiliations>
</json:item>
<json:item>
<name>Bernhard Steffen</name>
<affiliations>
<json:string>University of Dortmund, Dortmund, Germany</json:string>
</affiliations>
</json:item>
<json:item>
<name>Madhu Sudan</name>
<affiliations>
<json:string>Massachusetts Institute of Technology, MA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Demetri Terzopoulos</name>
<affiliations>
<json:string>University of California, Los Angeles, CA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Doug Tygar</name>
<affiliations>
<json:string>University of California, Berkeley, CA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Moshe Y. Vardi</name>
<affiliations>
<json:string>Rice University, Houston, TX, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Gerhard Weikum</name>
<affiliations>
<json:string>Max-Planck Institute of Computer Science, Saarbrücken, Germany</json:string>
</affiliations>
</json:item>
</editor>
<issn>
<json:string>0302-9743</json:string>
</issn>
<language>
<json:string>unknown</json:string>
</language>
<eissn>
<json:string>1611-3349</json:string>
</eissn>
<title>Lecture Notes in Computer Science</title>
<copyrightDate>2009</copyrightDate>
</serie>
<host>
<editor>
<json:item>
<name>Ernesto Damiani</name>
<affiliations>
<json:string>Dipartemento Tecnologie dell’Informazione, Universitá degli Studi di Milano, Via Bramante 65, 26013, Crema, Italy</json:string>
<json:string>E-mail: damiani@dti.unimi.it</json:string>
</affiliations>
</json:item>
<json:item>
<name>Kokou Yetongnon</name>
<affiliations>
<json:string>LE2I-CNRS, Université de Bourgogne, Aile de l’Ingénieur, 21078, Dijon Cedex, France</json:string>
<json:string>E-mail: kokou.yetongnon@u-bourgogne.fr</json:string>
</affiliations>
</json:item>
<json:item>
<name>Richard Chbeir</name>
<affiliations>
<json:string>LE2I-CNRS, Université de Bourgogne, Aile de l’Ingénieur, 21078, Dijon Cedex, France</json:string>
<json:string>E-mail: richard.chbeir@u-bourgogne.fr</json:string>
</affiliations>
</json:item>
<json:item>
<name>Albert Dipanda</name>
<affiliations>
<json:string>LE2I-CNRS, Université de Bourgogne, Aile de l’Ingénieur, 21078, Dijon Cedex, France</json:string>
<json:string>E-mail: adipanda@u-bourgogne.fr</json:string>
</affiliations>
</json:item>
</editor>
<subject>
<json:item>
<value>Computer Science</value>
</json:item>
<json:item>
<value>Computer Science</value>
</json:item>
<json:item>
<value>Information Systems Applications (incl.Internet)</value>
</json:item>
<json:item>
<value>Data Mining and Knowledge Discovery</value>
</json:item>
<json:item>
<value>Database Management</value>
</json:item>
<json:item>
<value>Computer Communication Networks</value>
</json:item>
<json:item>
<value>Multimedia Information Systems</value>
</json:item>
<json:item>
<value>Artificial Intelligence (incl. Robotics)</value>
</json:item>
</subject>
<isbn>
<json:string>978-3-642-01349-2</json:string>
</isbn>
<language>
<json:string>unknown</json:string>
</language>
<eissn>
<json:string>1611-3349</json:string>
</eissn>
<title>Advanced Internet Based Systems and Applications</title>
<genre.original>
<json:string>Proceedings</json:string>
</genre.original>
<bookId>
<json:string>978-3-642-01350-8</json:string>
</bookId>
<volume>4879</volume>
<pages>
<last>9</last>
<first>1</first>
</pages>
<issn>
<json:string>0302-9743</json:string>
</issn>
<genre>
<json:string>Book Series</json:string>
</genre>
<eisbn>
<json:string>978-3-642-01350-8</json:string>
</eisbn>
<copyrightDate>2009</copyrightDate>
<doi>
<json:string>10.1007/978-3-642-01350-8</json:string>
</doi>
</host>
<publicationDate>2009</publicationDate>
<copyrightDate>2009</copyrightDate>
<doi>
<json:string>10.1007/978-3-642-01350-8_1</json:string>
</doi>
<id>D7BE73F548C8DA59A7383DA67FF544008E430728</id>
<fulltext>
<json:item>
<original>true</original>
<mimetype>application/pdf</mimetype>
<extension>pdf</extension>
<uri>https://api.istex.fr/document/D7BE73F548C8DA59A7383DA67FF544008E430728/fulltext/pdf</uri>
</json:item>
<json:item>
<original>false</original>
<mimetype>application/zip</mimetype>
<extension>zip</extension>
<uri>https://api.istex.fr/document/D7BE73F548C8DA59A7383DA67FF544008E430728/fulltext/zip</uri>
</json:item>
<istex:fulltextTEI uri="https://api.istex.fr/document/D7BE73F548C8DA59A7383DA67FF544008E430728/fulltext/tei">
<teiHeader>
<fileDesc>
<titleStmt>
<title level="a" type="main" xml:lang="en">An Automatic Video Text Detection, Localization and Extraction Approach</title>
<respStmt xml:id="ISTEX-API" resp="Références bibliographiques récupérées via GROBID" name="ISTEX-API (INIST-CNRS)"></respStmt>
</titleStmt>
<publicationStmt>
<authority>ISTEX</authority>
<publisher>Springer Berlin Heidelberg</publisher>
<pubPlace>Berlin, Heidelberg</pubPlace>
<availability>
<p>SPRINGER</p>
</availability>
<date>2009</date>
</publicationStmt>
<sourceDesc>
<biblStruct type="inbook">
<analytic>
<title level="a" type="main" xml:lang="en">An Automatic Video Text Detection, Localization and Extraction Approach</title>
<author>
<persName>
<forename type="first">Chengjun</forename>
<surname>Zhu</surname>
</persName>
<email>zhucj@cse.buaa.edu.cn</email>
<affiliation>School of Computer Science and Technology, Beihang University, No. 37 Xue Yuan Road, Haidian District, Beijing, P.R.China</affiliation>
</author>
<author>
<persName>
<forename type="first">Yuanxin</forename>
<surname>Ouyang</surname>
</persName>
<email>oyyx@buaa.edu.cn</email>
<affiliation>School of Computer Science and Technology, Beihang University, No. 37 Xue Yuan Road, Haidian District, Beijing, P.R.China</affiliation>
</author>
<author>
<persName>
<forename type="first">Lei</forename>
<surname>Gao</surname>
</persName>
<email>gaol@cse.buaa.edu.cn</email>
<affiliation>School of Computer Science and Technology, Beihang University, No. 37 Xue Yuan Road, Haidian District, Beijing, P.R.China</affiliation>
</author>
<author>
<persName>
<forename type="first">Zhenyong</forename>
<surname>Chen</surname>
</persName>
<email>chzhyong@buaa.edu.cn</email>
<affiliation>School of Computer Science and Technology, Beihang University, No. 37 Xue Yuan Road, Haidian District, Beijing, P.R.China</affiliation>
</author>
<author>
<persName>
<forename type="first">Zhang</forename>
<surname>Xiong</surname>
</persName>
<email>xiongz@buaa.edu.cn</email>
<affiliation>School of Computer Science and Technology, Beihang University, No. 37 Xue Yuan Road, Haidian District, Beijing, P.R.China</affiliation>
</author>
</analytic>
<monogr>
<title level="m">Advanced Internet Based Systems and Applications</title>
<title level="m" type="sub">Second International Conference on Signal-Image Technology and Internet-Based Systems, SITIS 2006, Hammamet, Tunisia, December 17-21, 2006, Revised Selected Papers</title>
<idno type="pISBN">978-3-642-01349-2</idno>
<idno type="eISBN">978-3-642-01350-8</idno>
<idno type="pISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="DOI">10.1007/978-3-642-01350-8</idno>
<idno type="BookID">978-3-642-01350-8</idno>
<idno type="BookTitleID">187561</idno>
<idno type="BookSequenceNumber">4879</idno>
<idno type="BookVolumeNumber">4879</idno>
<idno type="BookChapterCount">33</idno>
<editor>
<persName>
<forename type="first">Ernesto</forename>
<surname>Damiani</surname>
</persName>
<email>damiani@dti.unimi.it</email>
<affiliation>Dipartemento Tecnologie dell’Informazione, Universitá degli Studi di Milano, Via Bramante 65, 26013, Crema, Italy</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Kokou</forename>
<surname>Yetongnon</surname>
</persName>
<email>kokou.yetongnon@u-bourgogne.fr</email>
<affiliation>LE2I-CNRS, Université de Bourgogne, Aile de l’Ingénieur, 21078, Dijon Cedex, France</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Richard</forename>
<surname>Chbeir</surname>
</persName>
<email>richard.chbeir@u-bourgogne.fr</email>
<affiliation>LE2I-CNRS, Université de Bourgogne, Aile de l’Ingénieur, 21078, Dijon Cedex, France</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Albert</forename>
<surname>Dipanda</surname>
</persName>
<email>adipanda@u-bourgogne.fr</email>
<affiliation>LE2I-CNRS, Université de Bourgogne, Aile de l’Ingénieur, 21078, Dijon Cedex, France</affiliation>
</editor>
<imprint>
<publisher>Springer Berlin Heidelberg</publisher>
<pubPlace>Berlin, Heidelberg</pubPlace>
<date type="published" when="2009"></date>
<biblScope unit="volume">4879</biblScope>
<biblScope unit="page" from="1">1</biblScope>
<biblScope unit="page" to="9">9</biblScope>
</imprint>
</monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<editor>
<persName>
<forename type="first">David</forename>
<surname>Hutchison</surname>
</persName>
<affiliation>Lancaster University, Lancaster, UK</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Takeo</forename>
<surname>Kanade</surname>
</persName>
<affiliation>Carnegie Mellon University, Pittsburgh, PA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Josef</forename>
<surname>Kittler</surname>
</persName>
<affiliation>University of Surrey, Guildford, UK</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Jon</forename>
<forename type="first">M.</forename>
<surname>Kleinberg</surname>
</persName>
<affiliation>Cornell University, Ithaca, NY, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Friedemann</forename>
<surname>Mattern</surname>
</persName>
<affiliation>ETH Zurich, Zurich, Switzerland</affiliation>
</editor>
<editor>
<persName>
<forename type="first">John</forename>
<forename type="first">C.</forename>
<surname>Mitchell</surname>
</persName>
<affiliation>Stanford University, Stanford, CA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Moni</forename>
<surname>Naor</surname>
</persName>
<affiliation>Weizmann Institute of Science, Rehovot, Israel</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Oscar</forename>
<surname>Nierstrasz</surname>
</persName>
<affiliation>University of Bern, Bern, Switzerland</affiliation>
</editor>
<editor>
<persName>
<forename type="first">C.</forename>
<surname>Pandu Rangan</surname>
</persName>
<affiliation>Indian Institute of Technology, Madras, India</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Bernhard</forename>
<surname>Steffen</surname>
</persName>
<affiliation>University of Dortmund, Dortmund, Germany</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Madhu</forename>
<surname>Sudan</surname>
</persName>
<affiliation>Massachusetts Institute of Technology, MA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Demetri</forename>
<surname>Terzopoulos</surname>
</persName>
<affiliation>University of California, Los Angeles, CA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Doug</forename>
<surname>Tygar</surname>
</persName>
<affiliation>University of California, Berkeley, CA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Moshe</forename>
<forename type="first">Y.</forename>
<surname>Vardi</surname>
</persName>
<affiliation>Rice University, Houston, TX, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Gerhard</forename>
<surname>Weikum</surname>
</persName>
<affiliation>Max-Planck Institute of Computer Science, Saarbrücken, Germany</affiliation>
</editor>
<biblScope>
<date>2009</date>
</biblScope>
<biblScope unit="volume">I</biblScope>
<idno type="pISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="seriesId">558</idno>
</series>
<idno type="istex">D7BE73F548C8DA59A7383DA67FF544008E430728</idno>
<idno type="DOI">10.1007/978-3-642-01350-8_1</idno>
<idno type="ChapterID">1</idno>
<idno type="ChapterID">Chap1</idno>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<creation>
<date>2009</date>
</creation>
<langUsage>
<language ident="en">en</language>
</langUsage>
<abstract xml:lang="en">
<p>Abstract: Text in video is a very compact and accurate clue for video indexing and summarization. This paper presents an algorithm regarding word group as a special symbol to detect, localize and extract video text using support vector machine (SVM) automatically. First, four sobel operators are applied to get the EM(edge map) of the video frame and the EM is segmented into N×2N size blocks. Then character features and characters group structure features are extracted to construct a 19-dimension feature vector. We use a pre-trained SVM to partition each block into two classes: text and non-text blocks. Secondly a dilatation-shrink process is employed to adjust the text position. Finally text regions are enhanced by multiple frame information. After binarization of enhanced text region, the text region with clean background is recognized by OCR software. Experimental results show that the proposed method can detect, localize, and extract video texts with high accuracy.</p>
</abstract>
<textClass>
<keywords scheme="Book Subject Collection">
<list>
<label>SUCO11645</label>
<item>
<term>Computer Science</term>
</item>
</list>
</keywords>
</textClass>
<textClass>
<keywords scheme="Book Subject Group">
<list>
<label>I</label>
<label>I18040</label>
<label>I18030</label>
<label>I18024</label>
<label>I13022</label>
<label>I18059</label>
<label>I21017</label>
<item>
<term>Computer Science</term>
</item>
<item>
<term>Information Systems Applications (incl.Internet)</term>
</item>
<item>
<term>Data Mining and Knowledge Discovery</term>
</item>
<item>
<term>Database Management</term>
</item>
<item>
<term>Computer Communication Networks</term>
</item>
<item>
<term>Multimedia Information Systems</term>
</item>
<item>
<term>Artificial Intelligence (incl. Robotics)</term>
</item>
</list>
</keywords>
</textClass>
</profileDesc>
<revisionDesc>
<change when="2009">Published</change>
<change xml:id="refBibs-istex" who="#ISTEX-API" when="2016-3-20">References added</change>
</revisionDesc>
</teiHeader>
</istex:fulltextTEI>
<json:item>
<original>false</original>
<mimetype>text/plain</mimetype>
<extension>txt</extension>
<uri>https://api.istex.fr/document/D7BE73F548C8DA59A7383DA67FF544008E430728/fulltext/txt</uri>
</json:item>
</fulltext>
<metadata>
<istex:metadataXml wicri:clean="Springer, Publisher found" wicri:toSee="no header">
<istex:xmlDeclaration>version="1.0" encoding="UTF-8"</istex:xmlDeclaration>
<istex:docType PUBLIC="-//Springer-Verlag//DTD A++ V2.4//EN" URI="http://devel.springer.de/A++/V2.4/DTD/A++V2.4.dtd" name="istex:docType"></istex:docType>
<istex:document>
<Publisher>
<PublisherInfo>
<PublisherName>Springer Berlin Heidelberg</PublisherName>
<PublisherLocation>Berlin, Heidelberg</PublisherLocation>
</PublisherInfo>
<Series>
<SeriesInfo SeriesType="Series" TocLevels="0">
<SeriesID>558</SeriesID>
<SeriesPrintISSN>0302-9743</SeriesPrintISSN>
<SeriesElectronicISSN>1611-3349</SeriesElectronicISSN>
<SeriesTitle Language="En">Lecture Notes in Computer Science</SeriesTitle>
</SeriesInfo>
<SeriesHeader>
<EditorGroup>
<Editor AffiliationIDS="Aff1">
<EditorName DisplayOrder="Western">
<GivenName>David</GivenName>
<FamilyName>Hutchison</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff2">
<EditorName DisplayOrder="Western">
<GivenName>Takeo</GivenName>
<FamilyName>Kanade</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff3">
<EditorName DisplayOrder="Western">
<GivenName>Josef</GivenName>
<FamilyName>Kittler</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff4">
<EditorName DisplayOrder="Western">
<GivenName>Jon</GivenName>
<GivenName>M.</GivenName>
<FamilyName>Kleinberg</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff5">
<EditorName DisplayOrder="Western">
<GivenName>Friedemann</GivenName>
<FamilyName>Mattern</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff6">
<EditorName DisplayOrder="Western">
<GivenName>John</GivenName>
<GivenName>C.</GivenName>
<FamilyName>Mitchell</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff7">
<EditorName DisplayOrder="Western">
<GivenName>Moni</GivenName>
<FamilyName>Naor</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff8">
<EditorName DisplayOrder="Western">
<GivenName>Oscar</GivenName>
<FamilyName>Nierstrasz</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff9">
<EditorName DisplayOrder="Western">
<GivenName>C.</GivenName>
<FamilyName>Pandu Rangan</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff10">
<EditorName DisplayOrder="Western">
<GivenName>Bernhard</GivenName>
<FamilyName>Steffen</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff11">
<EditorName DisplayOrder="Western">
<GivenName>Madhu</GivenName>
<FamilyName>Sudan</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff12">
<EditorName DisplayOrder="Western">
<GivenName>Demetri</GivenName>
<FamilyName>Terzopoulos</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff13">
<EditorName DisplayOrder="Western">
<GivenName>Doug</GivenName>
<FamilyName>Tygar</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff14">
<EditorName DisplayOrder="Western">
<GivenName>Moshe</GivenName>
<GivenName>Y.</GivenName>
<FamilyName>Vardi</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff15">
<EditorName DisplayOrder="Western">
<GivenName>Gerhard</GivenName>
<FamilyName>Weikum</FamilyName>
</EditorName>
</Editor>
<Affiliation ID="Aff1">
<OrgName>Lancaster University</OrgName>
<OrgAddress>
<City>Lancaster</City>
<Country>UK</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff2">
<OrgName>Carnegie Mellon University</OrgName>
<OrgAddress>
<City>Pittsburgh</City>
<State>PA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff3">
<OrgName>University of Surrey</OrgName>
<OrgAddress>
<City>Guildford</City>
<Country>UK</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff4">
<OrgName>Cornell University</OrgName>
<OrgAddress>
<City>Ithaca</City>
<State>NY</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff5">
<OrgName>ETH Zurich</OrgName>
<OrgAddress>
<City>Zurich</City>
<Country>Switzerland</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff6">
<OrgName>Stanford University</OrgName>
<OrgAddress>
<City>Stanford</City>
<State>CA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff7">
<OrgName>Weizmann Institute of Science</OrgName>
<OrgAddress>
<City>Rehovot</City>
<Country>Israel</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff8">
<OrgName>University of Bern</OrgName>
<OrgAddress>
<City>Bern</City>
<Country>Switzerland</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff9">
<OrgName>Indian Institute of Technology</OrgName>
<OrgAddress>
<City>Madras</City>
<Country>India</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff10">
<OrgName>University of Dortmund</OrgName>
<OrgAddress>
<City>Dortmund</City>
<Country>Germany</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff11">
<OrgName>Massachusetts Institute of Technology</OrgName>
<OrgAddress>
<State>MA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff12">
<OrgName>University of California</OrgName>
<OrgAddress>
<City>Los Angeles</City>
<State>CA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff13">
<OrgName>University of California</OrgName>
<OrgAddress>
<City>Berkeley</City>
<State>CA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff14">
<OrgName>Rice University</OrgName>
<OrgAddress>
<City>Houston</City>
<State>TX</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff15">
<OrgName>Max-Planck Institute of Computer Science</OrgName>
<OrgAddress>
<City>Saarbrücken</City>
<Country>Germany</Country>
</OrgAddress>
</Affiliation>
</EditorGroup>
</SeriesHeader>
<Book Language="En">
<BookInfo BookProductType="Proceedings" ContainsESM="No" Language="En" MediaType="eBook" NumberingStyle="Unnumbered" OutputMedium="All" TocLevels="0">
<BookID>978-3-642-01350-8</BookID>
<BookTitle>Advanced Internet Based Systems and Applications</BookTitle>
<BookSubTitle>Second International Conference on Signal-Image Technology and Internet-Based Systems, SITIS 2006, Hammamet, Tunisia, December 17-21, 2006, Revised Selected Papers</BookSubTitle>
<BookVolumeNumber>4879</BookVolumeNumber>
<BookSequenceNumber>4879</BookSequenceNumber>
<BookDOI>10.1007/978-3-642-01350-8</BookDOI>
<BookTitleID>187561</BookTitleID>
<BookPrintISBN>978-3-642-01349-2</BookPrintISBN>
<BookElectronicISBN>978-3-642-01350-8</BookElectronicISBN>
<BookChapterCount>33</BookChapterCount>
<BookCopyright>
<CopyrightHolderName>Springer Berlin Heidelberg</CopyrightHolderName>
<CopyrightYear>2009</CopyrightYear>
</BookCopyright>
<BookSubjectGroup>
<BookSubject Code="I" Type="Primary">Computer Science</BookSubject>
<BookSubject Code="I18040" Priority="1" Type="Secondary">Information Systems Applications (incl.Internet)</BookSubject>
<BookSubject Code="I18030" Priority="2" Type="Secondary">Data Mining and Knowledge Discovery</BookSubject>
<BookSubject Code="I18024" Priority="3" Type="Secondary">Database Management</BookSubject>
<BookSubject Code="I13022" Priority="4" Type="Secondary">Computer Communication Networks</BookSubject>
<BookSubject Code="I18059" Priority="5" Type="Secondary">Multimedia Information Systems</BookSubject>
<BookSubject Code="I21017" Priority="6" Type="Secondary">Artificial Intelligence (incl. Robotics)</BookSubject>
<SubjectCollection Code="SUCO11645">Computer Science</SubjectCollection>
</BookSubjectGroup>
<BookContext>
<SeriesID>558</SeriesID>
</BookContext>
</BookInfo>
<BookHeader>
<EditorGroup>
<Editor AffiliationIDS="Aff16">
<EditorName DisplayOrder="Western">
<GivenName>Ernesto</GivenName>
<FamilyName>Damiani</FamilyName>
</EditorName>
<Contact>
<Email>damiani@dti.unimi.it</Email>
</Contact>
</Editor>
<Editor AffiliationIDS="Aff17">
<EditorName DisplayOrder="Western">
<GivenName>Kokou</GivenName>
<FamilyName>Yetongnon</FamilyName>
</EditorName>
<Contact>
<Email>kokou.yetongnon@u-bourgogne.fr</Email>
</Contact>
</Editor>
<Editor AffiliationIDS="Aff17">
<EditorName DisplayOrder="Western">
<GivenName>Richard</GivenName>
<FamilyName>Chbeir</FamilyName>
</EditorName>
<Contact>
<Email>richard.chbeir@u-bourgogne.fr</Email>
</Contact>
</Editor>
<Editor AffiliationIDS="Aff17">
<EditorName DisplayOrder="Western">
<GivenName>Albert</GivenName>
<FamilyName>Dipanda</FamilyName>
</EditorName>
<Contact>
<Email>adipanda@u-bourgogne.fr</Email>
</Contact>
</Editor>
<Affiliation ID="Aff16">
<OrgDivision>Dipartemento Tecnologie dell’Informazione</OrgDivision>
<OrgName>Universitá degli Studi di Milano</OrgName>
<OrgAddress>
<Street>Via Bramante 65</Street>
<Postcode>26013</Postcode>
<City>Crema</City>
<Country>Italy</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff17">
<OrgDivision>LE2I-CNRS</OrgDivision>
<OrgName>Université de Bourgogne</OrgName>
<OrgAddress>
<Street>Aile de l’Ingénieur</Street>
<Postcode>21078</Postcode>
<City>Dijon Cedex</City>
<Country>France</Country>
</OrgAddress>
</Affiliation>
</EditorGroup>
</BookHeader>
<Part ID="Part1">
<PartInfo TocLevels="0">
<PartID>1</PartID>
<PartNumber>I</PartNumber>
<PartSequenceNumber>1</PartSequenceNumber>
<PartTitle>Query Languages and Information Retrieval</PartTitle>
<PartChapterCount>7</PartChapterCount>
<PartContext>
<SeriesID>558</SeriesID>
<BookTitle>Advanced Internet Based Systems and Applications</BookTitle>
</PartContext>
</PartInfo>
<Chapter ID="Chap1" Language="En">
<ChapterInfo ChapterType="OriginalPaper" ContainsESM="No" NumberingStyle="Unnumbered" TocLevels="0">
<ChapterID>1</ChapterID>
<ChapterDOI>10.1007/978-3-642-01350-8_1</ChapterDOI>
<ChapterSequenceNumber>1</ChapterSequenceNumber>
<ChapterTitle Language="En">An Automatic Video Text Detection, Localization and Extraction Approach</ChapterTitle>
<ChapterFirstPage>1</ChapterFirstPage>
<ChapterLastPage>9</ChapterLastPage>
<ChapterCopyright>
<CopyrightHolderName>Springer-Verlag Berlin Heidelberg</CopyrightHolderName>
<CopyrightYear>2009</CopyrightYear>
</ChapterCopyright>
<ChapterGrants Type="Regular">
<MetadataGrant Grant="OpenAccess"></MetadataGrant>
<AbstractGrant Grant="OpenAccess"></AbstractGrant>
<BodyPDFGrant Grant="Restricted"></BodyPDFGrant>
<BodyHTMLGrant Grant="Restricted"></BodyHTMLGrant>
<BibliographyGrant Grant="Restricted"></BibliographyGrant>
<ESMGrant Grant="Restricted"></ESMGrant>
</ChapterGrants>
<ChapterContext>
<SeriesID>558</SeriesID>
<PartID>1</PartID>
<BookID>978-3-642-01350-8</BookID>
<BookTitle>Advanced Internet Based Systems and Applications</BookTitle>
</ChapterContext>
</ChapterInfo>
<ChapterHeader>
<AuthorGroup>
<Author AffiliationIDS="Aff18">
<AuthorName DisplayOrder="Western">
<GivenName>Chengjun</GivenName>
<FamilyName>Zhu</FamilyName>
</AuthorName>
<Contact>
<Email>zhucj@cse.buaa.edu.cn</Email>
</Contact>
</Author>
<Author AffiliationIDS="Aff18">
<AuthorName DisplayOrder="Western">
<GivenName>Yuanxin</GivenName>
<FamilyName>Ouyang</FamilyName>
</AuthorName>
<Contact>
<Email>oyyx@buaa.edu.cn</Email>
</Contact>
</Author>
<Author AffiliationIDS="Aff18">
<AuthorName DisplayOrder="Western">
<GivenName>Lei</GivenName>
<FamilyName>Gao</FamilyName>
</AuthorName>
<Contact>
<Email>gaol@cse.buaa.edu.cn</Email>
</Contact>
</Author>
<Author AffiliationIDS="Aff18">
<AuthorName DisplayOrder="Western">
<GivenName>Zhenyong</GivenName>
<FamilyName>Chen</FamilyName>
</AuthorName>
<Contact>
<Email>chzhyong@buaa.edu.cn</Email>
</Contact>
</Author>
<Author AffiliationIDS="Aff18">
<AuthorName DisplayOrder="Western">
<GivenName>Zhang</GivenName>
<FamilyName>Xiong</FamilyName>
</AuthorName>
<Contact>
<Email>xiongz@buaa.edu.cn</Email>
</Contact>
</Author>
<Affiliation ID="Aff18">
<OrgDivision>School of Computer Science and Technology</OrgDivision>
<OrgName>Beihang University</OrgName>
<OrgAddress>
<Street>No. 37 Xue Yuan Road, Haidian District</Street>
<City>Beijing</City>
<Country>P.R.China</Country>
</OrgAddress>
</Affiliation>
</AuthorGroup>
<Abstract ID="Abs1" Language="En">
<Heading>Abstract</Heading>
<Para>Text in video is a very compact and accurate clue for video indexing and summarization. This paper presents an algorithm regarding word group as a special symbol to detect, localize and extract video text using support vector machine (SVM) automatically. First, four sobel operators are applied to get the EM(edge map) of the video frame and the EM is segmented into N×2N size blocks. Then character features and characters group structure features are extracted to construct a 19-dimension feature vector. We use a pre-trained SVM to partition each block into two classes: text and non-text blocks. Secondly a dilatation-shrink process is employed to adjust the text position. Finally text regions are enhanced by multiple frame information. After binarization of enhanced text region, the text region with clean background is recognized by OCR software. Experimental results show that the proposed method can detect, localize, and extract video texts with high accuracy.</Para>
</Abstract>
<KeywordGroup Language="En">
<Heading>Keywords</Heading>
<Keyword>video text detection</Keyword>
<Keyword>support vector machine(SVM)</Keyword>
<Keyword>multilingual texts</Keyword>
<Keyword>video OCR</Keyword>
</KeywordGroup>
</ChapterHeader>
<NoBody></NoBody>
</Chapter>
</Part>
</Book>
</Series>
</Publisher>
</istex:document>
</istex:metadataXml>
<mods version="3.6">
<titleInfo lang="en">
<title>An Automatic Video Text Detection, Localization and Extraction Approach</title>
</titleInfo>
<titleInfo type="alternative" contentType="CDATA" lang="en">
<title>An Automatic Video Text Detection, Localization and Extraction Approach</title>
</titleInfo>
<name type="personal">
<namePart type="given">Chengjun</namePart>
<namePart type="family">Zhu</namePart>
<affiliation>School of Computer Science and Technology, Beihang University, No. 37 Xue Yuan Road, Haidian District, Beijing, P.R.China</affiliation>
<affiliation>E-mail: zhucj@cse.buaa.edu.cn</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Yuanxin</namePart>
<namePart type="family">Ouyang</namePart>
<affiliation>School of Computer Science and Technology, Beihang University, No. 37 Xue Yuan Road, Haidian District, Beijing, P.R.China</affiliation>
<affiliation>E-mail: oyyx@buaa.edu.cn</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Lei</namePart>
<namePart type="family">Gao</namePart>
<affiliation>School of Computer Science and Technology, Beihang University, No. 37 Xue Yuan Road, Haidian District, Beijing, P.R.China</affiliation>
<affiliation>E-mail: gaol@cse.buaa.edu.cn</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Zhenyong</namePart>
<namePart type="family">Chen</namePart>
<affiliation>School of Computer Science and Technology, Beihang University, No. 37 Xue Yuan Road, Haidian District, Beijing, P.R.China</affiliation>
<affiliation>E-mail: chzhyong@buaa.edu.cn</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Zhang</namePart>
<namePart type="family">Xiong</namePart>
<affiliation>School of Computer Science and Technology, Beihang University, No. 37 Xue Yuan Road, Haidian District, Beijing, P.R.China</affiliation>
<affiliation>E-mail: xiongz@buaa.edu.cn</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<typeOfResource>text</typeOfResource>
<genre type="conference [eBooks]" displayLabel="OriginalPaper"></genre>
<originInfo>
<publisher>Springer Berlin Heidelberg</publisher>
<place>
<placeTerm type="text">Berlin, Heidelberg</placeTerm>
</place>
<dateIssued encoding="w3cdtf">2009</dateIssued>
<copyrightDate encoding="w3cdtf">2009</copyrightDate>
</originInfo>
<language>
<languageTerm type="code" authority="rfc3066">en</languageTerm>
<languageTerm type="code" authority="iso639-2b">eng</languageTerm>
</language>
<physicalDescription>
<internetMediaType>text/html</internetMediaType>
</physicalDescription>
<abstract lang="en">Abstract: Text in video is a very compact and accurate clue for video indexing and summarization. This paper presents an algorithm regarding word group as a special symbol to detect, localize and extract video text using support vector machine (SVM) automatically. First, four sobel operators are applied to get the EM(edge map) of the video frame and the EM is segmented into N×2N size blocks. Then character features and characters group structure features are extracted to construct a 19-dimension feature vector. We use a pre-trained SVM to partition each block into two classes: text and non-text blocks. Secondly a dilatation-shrink process is employed to adjust the text position. Finally text regions are enhanced by multiple frame information. After binarization of enhanced text region, the text region with clean background is recognized by OCR software. Experimental results show that the proposed method can detect, localize, and extract video texts with high accuracy.</abstract>
<relatedItem type="host">
<titleInfo>
<title>Advanced Internet Based Systems and Applications</title>
<subTitle>Second International Conference on Signal-Image Technology and Internet-Based Systems, SITIS 2006, Hammamet, Tunisia, December 17-21, 2006, Revised Selected Papers</subTitle>
</titleInfo>
<name type="personal">
<namePart type="given">Ernesto</namePart>
<namePart type="family">Damiani</namePart>
<affiliation>Dipartemento Tecnologie dell’Informazione, Universitá degli Studi di Milano, Via Bramante 65, 26013, Crema, Italy</affiliation>
<affiliation>E-mail: damiani@dti.unimi.it</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Kokou</namePart>
<namePart type="family">Yetongnon</namePart>
<affiliation>LE2I-CNRS, Université de Bourgogne, Aile de l’Ingénieur, 21078, Dijon Cedex, France</affiliation>
<affiliation>E-mail: kokou.yetongnon@u-bourgogne.fr</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Richard</namePart>
<namePart type="family">Chbeir</namePart>
<affiliation>LE2I-CNRS, Université de Bourgogne, Aile de l’Ingénieur, 21078, Dijon Cedex, France</affiliation>
<affiliation>E-mail: richard.chbeir@u-bourgogne.fr</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Albert</namePart>
<namePart type="family">Dipanda</namePart>
<affiliation>LE2I-CNRS, Université de Bourgogne, Aile de l’Ingénieur, 21078, Dijon Cedex, France</affiliation>
<affiliation>E-mail: adipanda@u-bourgogne.fr</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<genre type="Book Series" displayLabel="Proceedings"></genre>
<originInfo>
<copyrightDate encoding="w3cdtf">2009</copyrightDate>
<issuance>monographic</issuance>
</originInfo>
<subject>
<genre>Book Subject Collection</genre>
<topic authority="SpringerSubjectCodes" authorityURI="SUCO11645">Computer Science</topic>
</subject>
<subject>
<genre>Book Subject Group</genre>
<topic authority="SpringerSubjectCodes" authorityURI="I">Computer Science</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I18040">Information Systems Applications (incl.Internet)</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I18030">Data Mining and Knowledge Discovery</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I18024">Database Management</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I13022">Computer Communication Networks</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I18059">Multimedia Information Systems</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I21017">Artificial Intelligence (incl. Robotics)</topic>
</subject>
<identifier type="DOI">10.1007/978-3-642-01350-8</identifier>
<identifier type="ISBN">978-3-642-01349-2</identifier>
<identifier type="eISBN">978-3-642-01350-8</identifier>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="eISSN">1611-3349</identifier>
<identifier type="BookTitleID">187561</identifier>
<identifier type="BookID">978-3-642-01350-8</identifier>
<identifier type="BookChapterCount">33</identifier>
<identifier type="BookVolumeNumber">4879</identifier>
<identifier type="BookSequenceNumber">4879</identifier>
<identifier type="PartChapterCount">7</identifier>
<part>
<date>2009</date>
<detail type="part">
<title>I: Query Languages and Information Retrieval</title>
</detail>
<detail type="volume">
<number>4879</number>
<caption>vol.</caption>
</detail>
<extent unit="pages">
<start>1</start>
<end>9</end>
</extent>
</part>
<recordInfo>
<recordOrigin>Springer Berlin Heidelberg, 2009</recordOrigin>
</recordInfo>
</relatedItem>
<relatedItem type="series">
<titleInfo>
<title>Lecture Notes in Computer Science</title>
</titleInfo>
<name type="personal">
<namePart type="given">David</namePart>
<namePart type="family">Hutchison</namePart>
<affiliation>Lancaster University, Lancaster, UK</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Takeo</namePart>
<namePart type="family">Kanade</namePart>
<affiliation>Carnegie Mellon University, Pittsburgh, PA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Josef</namePart>
<namePart type="family">Kittler</namePart>
<affiliation>University of Surrey, Guildford, UK</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Jon</namePart>
<namePart type="given">M.</namePart>
<namePart type="family">Kleinberg</namePart>
<affiliation>Cornell University, Ithaca, NY, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Friedemann</namePart>
<namePart type="family">Mattern</namePart>
<affiliation>ETH Zurich, Zurich, Switzerland</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">John</namePart>
<namePart type="given">C.</namePart>
<namePart type="family">Mitchell</namePart>
<affiliation>Stanford University, Stanford, CA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Moni</namePart>
<namePart type="family">Naor</namePart>
<affiliation>Weizmann Institute of Science, Rehovot, Israel</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Oscar</namePart>
<namePart type="family">Nierstrasz</namePart>
<affiliation>University of Bern, Bern, Switzerland</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">C.</namePart>
<namePart type="family">Pandu Rangan</namePart>
<affiliation>Indian Institute of Technology, Madras, India</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Bernhard</namePart>
<namePart type="family">Steffen</namePart>
<affiliation>University of Dortmund, Dortmund, Germany</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Madhu</namePart>
<namePart type="family">Sudan</namePart>
<affiliation>Massachusetts Institute of Technology, MA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Demetri</namePart>
<namePart type="family">Terzopoulos</namePart>
<affiliation>University of California, Los Angeles, CA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Doug</namePart>
<namePart type="family">Tygar</namePart>
<affiliation>University of California, Berkeley, CA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Moshe</namePart>
<namePart type="given">Y.</namePart>
<namePart type="family">Vardi</namePart>
<affiliation>Rice University, Houston, TX, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Gerhard</namePart>
<namePart type="family">Weikum</namePart>
<affiliation>Max-Planck Institute of Computer Science, Saarbrücken, Germany</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<copyrightDate encoding="w3cdtf">2009</copyrightDate>
<issuance>serial</issuance>
</originInfo>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="eISSN">1611-3349</identifier>
<identifier type="SeriesID">558</identifier>
<part>
<detail type="volume">
<number>I</number>
<caption>vol.</caption>
</detail>
</part>
<recordInfo>
<recordOrigin>Springer Berlin Heidelberg, 2009</recordOrigin>
</recordInfo>
</relatedItem>
<identifier type="istex">D7BE73F548C8DA59A7383DA67FF544008E430728</identifier>
<identifier type="DOI">10.1007/978-3-642-01350-8_1</identifier>
<identifier type="ChapterID">1</identifier>
<identifier type="ChapterID">Chap1</identifier>
<accessCondition type="use and reproduction" contentType="copyright">Springer Berlin Heidelberg, 2009</accessCondition>
<recordInfo>
<recordContentSource>SPRINGER</recordContentSource>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2009</recordOrigin>
</recordInfo>
</mods>
</metadata>
<enrichments>
<istex:refBibTEI uri="https://api.istex.fr/document/D7BE73F548C8DA59A7383DA67FF544008E430728/enrichments/refBib">
<teiHeader></teiHeader>
<text>
<front></front>
<body></body>
<back>
<listBibl>
<biblStruct xml:id="b0">
<analytic>
<title level="a" type="main">Techniques and systems for image and video retrieval</title>
<author>
<persName>
<forename type="first">Y</forename>
<forename type="middle">A</forename>
<surname>Aslandogan</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">C</forename>
<forename type="middle">T</forename>
<surname>Yu</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">IEEE Trans. Knowledge Data Eng</title>
<imprint>
<biblScope unit="volume">11</biblScope>
<biblScope unit="page" from="56" to="63"></biblScope>
<date type="published" when="1999"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b1">
<analytic>
<title level="a" type="main">Jiqiang Song; Min Cai: A comprehensive method for multilingual video text detection, localization, and extraction</title>
<author>
<persName>
<forename type="first">M</forename>
<forename type="middle">R</forename>
<surname>Lyu</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">IEEE Transactions on Circuits and Systems for Video Technology</title>
<imprint>
<date type="published" when="2005"></date>
<biblScope unit="page" from="243" to="255"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b2">
<analytic>
<title level="a" type="main">A Spatial-Temporal Approach for Video Caption Detection and Recognition</title>
<author>
<persName>
<forename type="first">X</forename>
<surname>Tang</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">X</forename>
<surname>Gao</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">J</forename>
<surname>Liu</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">IEEE Trans On Neural Networks</title>
<imprint>
<biblScope unit="page" from="961" to="971"></biblScope>
<date type="published" when="2002"></date>
</imprint>
</monogr>
<note>special. issue on Intelligent Multimedia Processing</note>
</biblStruct>
<biblStruct xml:id="b3">
<monogr>
<title level="m" type="main">Content-based video analysis, retrieval and browsing</title>
<author>
<persName>
<forename type="first">H</forename>
<forename type="middle">J</forename>
<surname>Zhang</surname>
</persName>
</author>
<imprint>
<date type="published" when="2001"></date>
<publisher>Microsoft Research Asia</publisher>
<pubPlace>Beijing</pubPlace>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b4">
<monogr>
<title level="m" type="main">Text Identification in Complex Back-ground Using SVM</title>
<author>
<persName>
<forename type="first">D</forename>
<surname>Chen</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">H</forename>
<surname>Bourlard</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">J.-P</forename>
<surname>Thiran</surname>
</persName>
</author>
<imprint>
<date type="published" when="2001"></date>
<biblScope unit="page" from="621" to="626"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b5">
<analytic>
<title></title>
<author>
<persName>
<forename type="first">V</forename>
<surname>Vapnik</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">The Nature of Statistical Learning Theory</title>
<imprint>
<date type="published" when="1996"></date>
<biblScope unit="page" from="581" to="585"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b6">
<analytic>
<title level="a" type="main">Video OCR: Indexing digital news libraries by recognition of superimposed captions</title>
<author>
<persName>
<forename type="first">T</forename>
<surname>Sato</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">T</forename>
<surname>Kanade</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">E</forename>
<forename type="middle">K</forename>
<surname>Kughes</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">M</forename>
<forename type="middle">A</forename>
<surname>Smith</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">S</forename>
<surname>Satoh</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">ACM Multimedia Syst (Special Is-sue on Video Libraries)</title>
<imprint>
<biblScope unit="volume">7</biblScope>
<biblScope unit="issue">5</biblScope>
<biblScope unit="page" from="385" to="395"></biblScope>
<date type="published" when="1999"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b7">
<analytic>
<title level="a" type="main">Text extraction, enhancement and OCR in digital video</title>
<author>
<persName>
<forename type="first">H</forename>
<forename type="middle">P</forename>
<surname>Li</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<surname>Doemann</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">O</forename>
<surname>Kia</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proc. 3rd IAPR Workshop</title>
<meeting>. 3rd IAPR Workshop
<address>
<addrLine>Nagoya, Japan</addrLine>
</address>
</meeting>
<imprint>
<date type="published" when="1998"></date>
<biblScope unit="page" from="363" to="377"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b8">
<analytic>
<title level="a" type="main">A Threshold Selection Method from Grey-Level Histograms</title>
<author>
<persName>
<forename type="first">N</forename>
<surname>Otsu</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">IEEE Trans. Systems , Man, and Cybernetics</title>
<imprint>
<biblScope unit="volume">9</biblScope>
<biblScope unit="issue">1</biblScope>
<biblScope unit="page" from="377" to="393"></biblScope>
<date type="published" when="1979"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b9">
<analytic>
<title level="a" type="main">A robust statistic method for classifying color polar-ity of video text</title>
<author>
<persName>
<forename type="first">J</forename>
<surname>Song</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">M</forename>
<surname>Cai</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">M</forename>
<forename type="middle">R</forename>
<surname>Lyu</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing</title>
<meeting>. IEEE International Conference on Acoustics, Speech, and Signal essing</meeting>
<imprint>
<date type="published" when="2003-04"></date>
<biblScope unit="page" from="581" to="584"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b10">
<monogr>
<title level="m" type="main">LIBSVM: a library for support vector machines</title>
<author>
<persName>
<forename type="first">C.-C</forename>
<surname>Chang</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">C.-J</forename>
<surname>Lin</surname>
</persName>
</author>
<imprint>
<date type="published" when="2001"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b11">
<analytic>
<title level="a" type="main">Automatic Performance Evaluation for Video Text Detection, icdar</title>
<author>
<persName>
<forename type="first">X</forename>
<forename type="middle">S</forename>
<surname>Hua</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">L</forename>
<surname>Wenyin</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">H.-J</forename>
<surname>Zhang</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Sixth International Conference on Document Analysis and Recognition</title>
<imprint>
<date type="published" when="2001"></date>
<biblScope unit="page">0545</biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b12">
<analytic>
<title level="a" type="main">Localization site prediction for membrane proteins by integrating rule and SVM classification</title>
<author>
<persName>
<forename type="first">S</forename>
<surname>Zhou</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">K</forename>
<surname>Wang</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">IEEE Transactions on Knowledge and Data Engineering</title>
<imprint>
<biblScope unit="volume">17</biblScope>
<biblScope unit="issue">12</biblScope>
<biblScope unit="page" from="1694" to="1705"></biblScope>
<date type="published" when="2005"></date>
</imprint>
</monogr>
</biblStruct>
</listBibl>
</back>
</text>
</istex:refBibTEI>
</enrichments>
</istex>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Istex/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000229 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Istex/Corpus/biblio.hfd -nk 000229 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Istex
   |étape=   Corpus
   |type=    RBID
   |clé=     ISTEX:D7BE73F548C8DA59A7383DA67FF544008E430728
   |texte=   An Automatic Video Text Detection, Localization and Extraction Approach
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024