OcrV1, France, Analysis, bibRecord, 000227

Extraction and recognition of artificial text in multimedia documents

Identifieur interne : 000227 ( France/Analysis ); précédent : 000226; suivant : 000228

Extraction and recognition of artificial text in multimedia documents

Auteurs : C. Wolf [France] ; J.-M. Jolion [France]

Source :

Pattern analysis and applications [ 1433-7541 ] ; 2004.

RBID : Pascal:04-0205881

Descripteurs français

Pascal (Inist)
- Reconnaissance caractère, Multimédia, Signal vidéo, Traitement image, Similitude, Indexation, Texte, Séquence image, Utilisation information, Recherche par contenu, Sémantique, Homme, Choix, Extraction forme.
Wicri :
- topic : Multimédia, Homme.

English descriptors

KwdEn :
- Character recognition, Choice, Content-based retrieval, Human, Image processing, Image sequence, Indexing, Information use, Multimedia, Pattern extraction, Semantics, Similarity, Text, Video signal.

Abstract

The systems currently available for content-based image and video retrieval work without semantic knowledge, i.e. they use image processing methods to extract low level features of the data. The similarity obtained by these approaches does not always correspond to the similarity a human user would expect. A way to include more semantic knowledge into the indexing process is to use the text included in the images and video sequences. It is rich in information but easy to use, e.g. by key word based queries. In this paper we present an algorithm to localise artificial text in images and videos using a measure of accumulated gradients and morphological processing. The quality of the localised text is improved by robust multiple frame integration. A new technique for the binarisation of the text boxes based on a criterion maximizing local contrast is proposed. Finally, detection and OCR results for a commercial OCR are presented, justifying the choice of the binarisation technique.

Affiliations:

Links toward previous steps (curation, corpus...)

to stream PascalFrancis, to step Corpus: 000552
to stream PascalFrancis, to step Curation: 000238
to stream PascalFrancis, to step Checkpoint: 000487
to stream Main, to step Merge: 001698
to stream Main, to step Curation: 001635
to stream Main, to step Exploration: 001635
to stream France, to step Extraction: 000227

Links to Exploration step

Pascal:04-0205881

Le document en format XML

<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" level="a">Extraction and recognition of artificial text in multimedia documents</title>
<author><name sortKey="Wolf, C" sort="Wolf, C" uniqKey="Wolf C" first="C." last="Wolf">C. Wolf</name>
<affiliation wicri:level="3"><inist:fA14 i1="01"><s1>Lyon Research Center for Images and Intelligent Information Systems, INSA de Lyon, Bat., Verne, 20, Av. Albert Einstein</s1>
<s2>69621 Villeurbanne</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>France</country>
<placeName><region type="region" nuts="2">Auvergne-Rhône-Alpes</region>
<region type="old region" nuts="2">Rhône-Alpes</region>
<settlement type="city">Villeurbanne</settlement>
</placeName>
</affiliation>
</author>
<author><name sortKey="Jolion, J M" sort="Jolion, J M" uniqKey="Jolion J" first="J.-M." last="Jolion">J.-M. Jolion</name>
<affiliation wicri:level="3"><inist:fA14 i1="01"><s1>Lyon Research Center for Images and Intelligent Information Systems, INSA de Lyon, Bat., Verne, 20, Av. Albert Einstein</s1>
<s2>69621 Villeurbanne</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>France</country>
<placeName><region type="region" nuts="2">Auvergne-Rhône-Alpes</region>
<region type="old region" nuts="2">Rhône-Alpes</region>
<settlement type="city">Villeurbanne</settlement>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">INIST</idno>
<idno type="inist">04-0205881</idno>
<date when="2004">2004</date>
<idno type="stanalyst">PASCAL 04-0205881 INIST</idno>
<idno type="RBID">Pascal:04-0205881</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000552</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000238</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000487</idno>
<idno type="wicri:doubleKey">1433-7541:2004:Wolf C:extraction:and:recognition</idno>
<idno type="wicri:Area/Main/Merge">001698</idno>
<idno type="wicri:Area/Main/Curation">001635</idno>
<idno type="wicri:Area/Main/Exploration">001635</idno>
<idno type="wicri:Area/France/Extraction">000227</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a">Extraction and recognition of artificial text in multimedia documents</title>
<author><name sortKey="Wolf, C" sort="Wolf, C" uniqKey="Wolf C" first="C." last="Wolf">C. Wolf</name>
<affiliation wicri:level="3"><inist:fA14 i1="01"><s1>Lyon Research Center for Images and Intelligent Information Systems, INSA de Lyon, Bat., Verne, 20, Av. Albert Einstein</s1>
<s2>69621 Villeurbanne</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>France</country>
<placeName><region type="region" nuts="2">Auvergne-Rhône-Alpes</region>
<region type="old region" nuts="2">Rhône-Alpes</region>
<settlement type="city">Villeurbanne</settlement>
</placeName>
</affiliation>
</author>
<author><name sortKey="Jolion, J M" sort="Jolion, J M" uniqKey="Jolion J" first="J.-M." last="Jolion">J.-M. Jolion</name>
<affiliation wicri:level="3"><inist:fA14 i1="01"><s1>Lyon Research Center for Images and Intelligent Information Systems, INSA de Lyon, Bat., Verne, 20, Av. Albert Einstein</s1>
<s2>69621 Villeurbanne</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>France</country>
<placeName><region type="region" nuts="2">Auvergne-Rhône-Alpes</region>
<region type="old region" nuts="2">Rhône-Alpes</region>
<settlement type="city">Villeurbanne</settlement>
</placeName>
</affiliation>
</author>
</analytic>
<series><title level="j" type="main">Pattern analysis and applications</title>
<idno type="ISSN">1433-7541</idno>
<imprint><date when="2004">2004</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><title level="j" type="main">Pattern analysis and applications</title>
<idno type="ISSN">1433-7541</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Character recognition</term>
<term>Choice</term>
<term>Content-based retrieval</term>
<term>Human</term>
<term>Image processing</term>
<term>Image sequence</term>
<term>Indexing</term>
<term>Information use</term>
<term>Multimedia</term>
<term>Pattern extraction</term>
<term>Semantics</term>
<term>Similarity</term>
<term>Text</term>
<term>Video signal</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Reconnaissance caractère</term>
<term>Multimédia</term>
<term>Signal vidéo</term>
<term>Traitement image</term>
<term>Similitude</term>
<term>Indexation</term>
<term>Texte</term>
<term>Séquence image</term>
<term>Utilisation information</term>
<term>Recherche par contenu</term>
<term>Sémantique</term>
<term>Homme</term>
<term>Choix</term>
<term>Extraction forme</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr"><term>Multimédia</term>
<term>Homme</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">The systems currently available for content-based image and video retrieval work without semantic knowledge, i.e. they use image processing methods to extract low level features of the data. The similarity obtained by these approaches does not always correspond to the similarity a human user would expect. A way to include more semantic knowledge into the indexing process is to use the text included in the images and video sequences. It is rich in information but easy to use, e.g. by key word based queries. In this paper we present an algorithm to localise artificial text in images and videos using a measure of accumulated gradients and morphological processing. The quality of the localised text is improved by robust multiple frame integration. A new technique for the binarisation of the text boxes based on a criterion maximizing local contrast is proposed. Finally, detection and OCR results for a commercial OCR are presented, justifying the choice of the binarisation technique.</div>
</front>
</TEI>
<affiliations><list><country><li>France</li>
</country>
<region><li>Auvergne-Rhône-Alpes</li>
<li>Rhône-Alpes</li>
</region>
<settlement><li>Villeurbanne</li>
</settlement>
</list>
<tree><country name="France"><region name="Auvergne-Rhône-Alpes"><name sortKey="Wolf, C" sort="Wolf, C" uniqKey="Wolf C" first="C." last="Wolf">C. Wolf</name>
</region>
<name sortKey="Jolion, J M" sort="Jolion, J M" uniqKey="Jolion J" first="J.-M." last="Jolion">J.-M. Jolion</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/France/Analysis

HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000227 | SxmlIndent | more

HfdSelect -h $EXPLOR_AREA/Data/France/Analysis/biblio.hfd -nk 000227 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    France
   |étape=   Analysis
   |type=    RBID
   |clé=     Pascal:04-0205881
   |texte=   Extraction and recognition of artificial text in multimedia documents
}}

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024

	Serveur d'exploration sur l'OCR
	Attention, ce site est en cours de développement ! Attention, site généré par des moyens informatiques à partir de corpus bruts. Les informations ne sont donc pas validées.

Serveur d'exploration sur l'OCR

Extraction and recognition of artificial text in multimedia documents

Extraction and recognition of artificial text in multimedia documents

Source :

Descripteurs français

English descriptors

Abstract

Links toward previous steps (curation, corpus...)

Links to Exploration step

Le document en format XML

Pour manipuler ce document sous Unix (Dilib)

Pour mettre un lien sur cette page dans le réseau Wicri