Text recognition in multimedia documents: a study of two neural-based OCRs using and avoiding character segmentation
Identifieur interne : 000009 ( PascalFrancis/Corpus ); précédent : 000008; suivant : 000010Text recognition in multimedia documents: a study of two neural-based OCRs using and avoiding character segmentation
Auteurs : Khaoula Elagouni ; Christophe Garcia ; Franck Mamalet ; Pascale SebillotSource :
- International journal on document analysis and recognition : (Print) [ 1433-2833 ] ; 2014.
Descripteurs français
- Pascal (Inist)
- Reconnaissance caractère, Texte, Reconnaissance forme, Analyse documentaire, Multimédia, Reconnaissance optique caractère, Accès contenu, Vision ordinateur, Linguistique mathématique, Signal vidéo, Analyse scène, Présentation document, Sémantique, Balayage, Sous titrage, Convolution, Réseau neuronal, Segmentation image, Scène naturelle.
English descriptors
- KwdEn :
- Caption, Character recognition, Computational linguistics, Computer vision, Content access, Convolution, Document analysis, Document layout, Image segmentation, Multimedia, Natural scenes, Neural network, Optical character recognition, Pattern recognition, Scanning, Scene analysis, Semantics, Text, Video signal.
Abstract
Text embedded in multimedia documents represents an important semantic information that helps to automatically access the content. This paper proposes two neural-based optical character recognition (OCR) systems that handle the text recognition problem in different ways. The first approach segments a text image into individual characters before recognizing them, while the second one avoids the segmentation step by integrating a multi-scale scanning scheme that allows to jointly localize and recognize characters at each position and scale. Some linguistic knowledge is also incorporated into the proposed schemes to remove errors due to recognition confusions. Both OCR systems are applied to caption texts embedded in videos and in natural scene images and provide outstanding results showing that the proposed approaches outperform the state-of-the-art methods.
Notice en format standard (ISO 2709)
Pour connaître la documentation sur le format Inist Standard.
pA |
|
---|
Format Inist (serveur)
NO : | PASCAL 14-0199549 INIST |
---|---|
ET : | Text recognition in multimedia documents: a study of two neural-based OCRs using and avoiding character segmentation |
AU : | ELAGOUNI (Khaoula); GARCIA (Christophe); MAMALET (Franck); SEBILLOT (Pascale) |
AF : | Orange Labs R&D/35512 Cesson-Sévigné/France (1 aut., 3 aut.); LIRIS/INSA de Lyon/69621 Villeurbanne/France (2 aut.); IRISA/INSA de Rennes/35042 Rennes/France (4 aut.) |
DT : | Publication en série; Niveau analytique |
SO : | International journal on document analysis and recognition : (Print); ISSN 1433-2833; Allemagne; Da. 2014; Vol. 17; No. 1; Pp. 19-31; Bibl. 50 ref. |
LA : | Anglais |
EA : | Text embedded in multimedia documents represents an important semantic information that helps to automatically access the content. This paper proposes two neural-based optical character recognition (OCR) systems that handle the text recognition problem in different ways. The first approach segments a text image into individual characters before recognizing them, while the second one avoids the segmentation step by integrating a multi-scale scanning scheme that allows to jointly localize and recognize characters at each position and scale. Some linguistic knowledge is also incorporated into the proposed schemes to remove errors due to recognition confusions. Both OCR systems are applied to caption texts embedded in videos and in natural scene images and provide outstanding results showing that the proposed approaches outperform the state-of-the-art methods. |
CC : | 001D02C04; 001D02C03; 001D02C06 |
FD : | Reconnaissance caractère; Texte; Reconnaissance forme; Analyse documentaire; Multimédia; Reconnaissance optique caractère; Accès contenu; Vision ordinateur; Linguistique mathématique; Signal vidéo; Analyse scène; Présentation document; Sémantique; Balayage; Sous titrage; Convolution; Réseau neuronal; Segmentation image; Scène naturelle |
ED : | Character recognition; Text; Pattern recognition; Document analysis; Multimedia; Optical character recognition; Content access; Computer vision; Computational linguistics; Video signal; Scene analysis; Document layout; Semantics; Scanning; Caption; Convolution; Neural network; Image segmentation; Natural scenes |
SD : | Reconocimiento carácter; Texto; Reconocimiento patrón; Análisis documental; Multimedia; Reconocimento óptico de caracteres; Acceso contenido; Visión ordenador; Linguística matemática; Señal video; Análisis escena; Presentación documento; Semántica; Exploración; Subtítulo; Convolución; Red neuronal; Segmentación de imágenes; Escena natural |
LO : | INIST-26790.354000501888130020 |
ID : | 14-0199549 |
Links to Exploration step
Pascal:14-0199549Le document en format XML
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" level="a">Text recognition in multimedia documents: a study of two neural-based OCRs using and avoiding character segmentation</title>
<author><name sortKey="Elagouni, Khaoula" sort="Elagouni, Khaoula" uniqKey="Elagouni K" first="Khaoula" last="Elagouni">Khaoula Elagouni</name>
<affiliation><inist:fA14 i1="01"><s1>Orange Labs R&D</s1>
<s2>35512 Cesson-Sévigné</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Garcia, Christophe" sort="Garcia, Christophe" uniqKey="Garcia C" first="Christophe" last="Garcia">Christophe Garcia</name>
<affiliation><inist:fA14 i1="02"><s1>LIRIS/INSA de Lyon</s1>
<s2>69621 Villeurbanne</s2>
<s3>FRA</s3>
<sZ>2 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Mamalet, Franck" sort="Mamalet, Franck" uniqKey="Mamalet F" first="Franck" last="Mamalet">Franck Mamalet</name>
<affiliation><inist:fA14 i1="01"><s1>Orange Labs R&D</s1>
<s2>35512 Cesson-Sévigné</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Sebillot, Pascale" sort="Sebillot, Pascale" uniqKey="Sebillot P" first="Pascale" last="Sebillot">Pascale Sebillot</name>
<affiliation><inist:fA14 i1="03"><s1>IRISA/INSA de Rennes</s1>
<s2>35042 Rennes</s2>
<s3>FRA</s3>
<sZ>4 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">INIST</idno>
<idno type="inist">14-0199549</idno>
<date when="2014">2014</date>
<idno type="stanalyst">PASCAL 14-0199549 INIST</idno>
<idno type="RBID">Pascal:14-0199549</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000009</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a">Text recognition in multimedia documents: a study of two neural-based OCRs using and avoiding character segmentation</title>
<author><name sortKey="Elagouni, Khaoula" sort="Elagouni, Khaoula" uniqKey="Elagouni K" first="Khaoula" last="Elagouni">Khaoula Elagouni</name>
<affiliation><inist:fA14 i1="01"><s1>Orange Labs R&D</s1>
<s2>35512 Cesson-Sévigné</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Garcia, Christophe" sort="Garcia, Christophe" uniqKey="Garcia C" first="Christophe" last="Garcia">Christophe Garcia</name>
<affiliation><inist:fA14 i1="02"><s1>LIRIS/INSA de Lyon</s1>
<s2>69621 Villeurbanne</s2>
<s3>FRA</s3>
<sZ>2 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Mamalet, Franck" sort="Mamalet, Franck" uniqKey="Mamalet F" first="Franck" last="Mamalet">Franck Mamalet</name>
<affiliation><inist:fA14 i1="01"><s1>Orange Labs R&D</s1>
<s2>35512 Cesson-Sévigné</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
<author><name sortKey="Sebillot, Pascale" sort="Sebillot, Pascale" uniqKey="Sebillot P" first="Pascale" last="Sebillot">Pascale Sebillot</name>
<affiliation><inist:fA14 i1="03"><s1>IRISA/INSA de Rennes</s1>
<s2>35042 Rennes</s2>
<s3>FRA</s3>
<sZ>4 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
</analytic>
<series><title level="j" type="main">International journal on document analysis and recognition : (Print)</title>
<title level="j" type="abbreviated">Int. j. doc. anal. recognit. : (Print)</title>
<idno type="ISSN">1433-2833</idno>
<imprint><date when="2014">2014</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><title level="j" type="main">International journal on document analysis and recognition : (Print)</title>
<title level="j" type="abbreviated">Int. j. doc. anal. recognit. : (Print)</title>
<idno type="ISSN">1433-2833</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Caption</term>
<term>Character recognition</term>
<term>Computational linguistics</term>
<term>Computer vision</term>
<term>Content access</term>
<term>Convolution</term>
<term>Document analysis</term>
<term>Document layout</term>
<term>Image segmentation</term>
<term>Multimedia</term>
<term>Natural scenes</term>
<term>Neural network</term>
<term>Optical character recognition</term>
<term>Pattern recognition</term>
<term>Scanning</term>
<term>Scene analysis</term>
<term>Semantics</term>
<term>Text</term>
<term>Video signal</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Reconnaissance caractère</term>
<term>Texte</term>
<term>Reconnaissance forme</term>
<term>Analyse documentaire</term>
<term>Multimédia</term>
<term>Reconnaissance optique caractère</term>
<term>Accès contenu</term>
<term>Vision ordinateur</term>
<term>Linguistique mathématique</term>
<term>Signal vidéo</term>
<term>Analyse scène</term>
<term>Présentation document</term>
<term>Sémantique</term>
<term>Balayage</term>
<term>Sous titrage</term>
<term>Convolution</term>
<term>Réseau neuronal</term>
<term>Segmentation image</term>
<term>Scène naturelle</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Text embedded in multimedia documents represents an important semantic information that helps to automatically access the content. This paper proposes two neural-based optical character recognition (OCR) systems that handle the text recognition problem in different ways. The first approach segments a text image into individual characters before recognizing them, while the second one avoids the segmentation step by integrating a multi-scale scanning scheme that allows to jointly localize and recognize characters at each position and scale. Some linguistic knowledge is also incorporated into the proposed schemes to remove errors due to recognition confusions. Both OCR systems are applied to caption texts embedded in videos and in natural scene images and provide outstanding results showing that the proposed approaches outperform the state-of-the-art methods.</div>
</front>
</TEI>
<inist><standard h6="B"><pA><fA01 i1="01" i2="1"><s0>1433-2833</s0>
</fA01>
<fA03 i2="1"><s0>Int. j. doc. anal. recognit. : (Print)</s0>
</fA03>
<fA05><s2>17</s2>
</fA05>
<fA06><s2>1</s2>
</fA06>
<fA08 i1="01" i2="1" l="ENG"><s1>Text recognition in multimedia documents: a study of two neural-based OCRs using and avoiding character segmentation</s1>
</fA08>
<fA11 i1="01" i2="1"><s1>ELAGOUNI (Khaoula)</s1>
</fA11>
<fA11 i1="02" i2="1"><s1>GARCIA (Christophe)</s1>
</fA11>
<fA11 i1="03" i2="1"><s1>MAMALET (Franck)</s1>
</fA11>
<fA11 i1="04" i2="1"><s1>SEBILLOT (Pascale)</s1>
</fA11>
<fA14 i1="01"><s1>Orange Labs R&D</s1>
<s2>35512 Cesson-Sévigné</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
<sZ>3 aut.</sZ>
</fA14>
<fA14 i1="02"><s1>LIRIS/INSA de Lyon</s1>
<s2>69621 Villeurbanne</s2>
<s3>FRA</s3>
<sZ>2 aut.</sZ>
</fA14>
<fA14 i1="03"><s1>IRISA/INSA de Rennes</s1>
<s2>35042 Rennes</s2>
<s3>FRA</s3>
<sZ>4 aut.</sZ>
</fA14>
<fA20><s1>19-31</s1>
</fA20>
<fA21><s1>2014</s1>
</fA21>
<fA23 i1="01"><s0>ENG</s0>
</fA23>
<fA43 i1="01"><s1>INIST</s1>
<s2>26790</s2>
<s5>354000501888130020</s5>
</fA43>
<fA44><s0>0000</s0>
<s1>© 2014 INIST-CNRS. All rights reserved.</s1>
</fA44>
<fA45><s0>50 ref.</s0>
</fA45>
<fA47 i1="01" i2="1"><s0>14-0199549</s0>
</fA47>
<fA60><s1>P</s1>
</fA60>
<fA61><s0>A</s0>
</fA61>
<fA64 i1="01" i2="1"><s0>International journal on document analysis and recognition : (Print)</s0>
</fA64>
<fA66 i1="01"><s0>DEU</s0>
</fA66>
<fC01 i1="01" l="ENG"><s0>Text embedded in multimedia documents represents an important semantic information that helps to automatically access the content. This paper proposes two neural-based optical character recognition (OCR) systems that handle the text recognition problem in different ways. The first approach segments a text image into individual characters before recognizing them, while the second one avoids the segmentation step by integrating a multi-scale scanning scheme that allows to jointly localize and recognize characters at each position and scale. Some linguistic knowledge is also incorporated into the proposed schemes to remove errors due to recognition confusions. Both OCR systems are applied to caption texts embedded in videos and in natural scene images and provide outstanding results showing that the proposed approaches outperform the state-of-the-art methods.</s0>
</fC01>
<fC02 i1="01" i2="X"><s0>001D02C04</s0>
</fC02>
<fC02 i1="02" i2="X"><s0>001D02C03</s0>
</fC02>
<fC02 i1="03" i2="X"><s0>001D02C06</s0>
</fC02>
<fC03 i1="01" i2="X" l="FRE"><s0>Reconnaissance caractère</s0>
<s5>06</s5>
</fC03>
<fC03 i1="01" i2="X" l="ENG"><s0>Character recognition</s0>
<s5>06</s5>
</fC03>
<fC03 i1="01" i2="X" l="SPA"><s0>Reconocimiento carácter</s0>
<s5>06</s5>
</fC03>
<fC03 i1="02" i2="X" l="FRE"><s0>Texte</s0>
<s5>07</s5>
</fC03>
<fC03 i1="02" i2="X" l="ENG"><s0>Text</s0>
<s5>07</s5>
</fC03>
<fC03 i1="02" i2="X" l="SPA"><s0>Texto</s0>
<s5>07</s5>
</fC03>
<fC03 i1="03" i2="X" l="FRE"><s0>Reconnaissance forme</s0>
<s5>08</s5>
</fC03>
<fC03 i1="03" i2="X" l="ENG"><s0>Pattern recognition</s0>
<s5>08</s5>
</fC03>
<fC03 i1="03" i2="X" l="SPA"><s0>Reconocimiento patrón</s0>
<s5>08</s5>
</fC03>
<fC03 i1="04" i2="X" l="FRE"><s0>Analyse documentaire</s0>
<s5>09</s5>
</fC03>
<fC03 i1="04" i2="X" l="ENG"><s0>Document analysis</s0>
<s5>09</s5>
</fC03>
<fC03 i1="04" i2="X" l="SPA"><s0>Análisis documental</s0>
<s5>09</s5>
</fC03>
<fC03 i1="05" i2="X" l="FRE"><s0>Multimédia</s0>
<s5>10</s5>
</fC03>
<fC03 i1="05" i2="X" l="ENG"><s0>Multimedia</s0>
<s5>10</s5>
</fC03>
<fC03 i1="05" i2="X" l="SPA"><s0>Multimedia</s0>
<s5>10</s5>
</fC03>
<fC03 i1="06" i2="X" l="FRE"><s0>Reconnaissance optique caractère</s0>
<s5>11</s5>
</fC03>
<fC03 i1="06" i2="X" l="ENG"><s0>Optical character recognition</s0>
<s5>11</s5>
</fC03>
<fC03 i1="06" i2="X" l="SPA"><s0>Reconocimento óptico de caracteres</s0>
<s5>11</s5>
</fC03>
<fC03 i1="07" i2="X" l="FRE"><s0>Accès contenu</s0>
<s5>12</s5>
</fC03>
<fC03 i1="07" i2="X" l="ENG"><s0>Content access</s0>
<s5>12</s5>
</fC03>
<fC03 i1="07" i2="X" l="SPA"><s0>Acceso contenido</s0>
<s5>12</s5>
</fC03>
<fC03 i1="08" i2="X" l="FRE"><s0>Vision ordinateur</s0>
<s5>13</s5>
</fC03>
<fC03 i1="08" i2="X" l="ENG"><s0>Computer vision</s0>
<s5>13</s5>
</fC03>
<fC03 i1="08" i2="X" l="SPA"><s0>Visión ordenador</s0>
<s5>13</s5>
</fC03>
<fC03 i1="09" i2="X" l="FRE"><s0>Linguistique mathématique</s0>
<s5>14</s5>
</fC03>
<fC03 i1="09" i2="X" l="ENG"><s0>Computational linguistics</s0>
<s5>14</s5>
</fC03>
<fC03 i1="09" i2="X" l="SPA"><s0>Linguística matemática</s0>
<s5>14</s5>
</fC03>
<fC03 i1="10" i2="X" l="FRE"><s0>Signal vidéo</s0>
<s5>15</s5>
</fC03>
<fC03 i1="10" i2="X" l="ENG"><s0>Video signal</s0>
<s5>15</s5>
</fC03>
<fC03 i1="10" i2="X" l="SPA"><s0>Señal video</s0>
<s5>15</s5>
</fC03>
<fC03 i1="11" i2="X" l="FRE"><s0>Analyse scène</s0>
<s5>16</s5>
</fC03>
<fC03 i1="11" i2="X" l="ENG"><s0>Scene analysis</s0>
<s5>16</s5>
</fC03>
<fC03 i1="11" i2="X" l="SPA"><s0>Análisis escena</s0>
<s5>16</s5>
</fC03>
<fC03 i1="12" i2="X" l="FRE"><s0>Présentation document</s0>
<s5>18</s5>
</fC03>
<fC03 i1="12" i2="X" l="ENG"><s0>Document layout</s0>
<s5>18</s5>
</fC03>
<fC03 i1="12" i2="X" l="SPA"><s0>Presentación documento</s0>
<s5>18</s5>
</fC03>
<fC03 i1="13" i2="X" l="FRE"><s0>Sémantique</s0>
<s5>19</s5>
</fC03>
<fC03 i1="13" i2="X" l="ENG"><s0>Semantics</s0>
<s5>19</s5>
</fC03>
<fC03 i1="13" i2="X" l="SPA"><s0>Semántica</s0>
<s5>19</s5>
</fC03>
<fC03 i1="14" i2="X" l="FRE"><s0>Balayage</s0>
<s5>20</s5>
</fC03>
<fC03 i1="14" i2="X" l="ENG"><s0>Scanning</s0>
<s5>20</s5>
</fC03>
<fC03 i1="14" i2="X" l="SPA"><s0>Exploración</s0>
<s5>20</s5>
</fC03>
<fC03 i1="15" i2="X" l="FRE"><s0>Sous titrage</s0>
<s5>21</s5>
</fC03>
<fC03 i1="15" i2="X" l="ENG"><s0>Caption</s0>
<s5>21</s5>
</fC03>
<fC03 i1="15" i2="X" l="SPA"><s0>Subtítulo</s0>
<s5>21</s5>
</fC03>
<fC03 i1="16" i2="X" l="FRE"><s0>Convolution</s0>
<s5>23</s5>
</fC03>
<fC03 i1="16" i2="X" l="ENG"><s0>Convolution</s0>
<s5>23</s5>
</fC03>
<fC03 i1="16" i2="X" l="SPA"><s0>Convolución</s0>
<s5>23</s5>
</fC03>
<fC03 i1="17" i2="X" l="FRE"><s0>Réseau neuronal</s0>
<s5>24</s5>
</fC03>
<fC03 i1="17" i2="X" l="ENG"><s0>Neural network</s0>
<s5>24</s5>
</fC03>
<fC03 i1="17" i2="X" l="SPA"><s0>Red neuronal</s0>
<s5>24</s5>
</fC03>
<fC03 i1="18" i2="X" l="FRE"><s0>Segmentation image</s0>
<s4>CD</s4>
<s5>96</s5>
</fC03>
<fC03 i1="18" i2="X" l="ENG"><s0>Image segmentation</s0>
<s4>CD</s4>
<s5>96</s5>
</fC03>
<fC03 i1="18" i2="X" l="SPA"><s0>Segmentación de imágenes</s0>
<s4>CD</s4>
<s5>96</s5>
</fC03>
<fC03 i1="19" i2="X" l="FRE"><s0>Scène naturelle</s0>
<s4>CD</s4>
<s5>97</s5>
</fC03>
<fC03 i1="19" i2="X" l="ENG"><s0>Natural scenes</s0>
<s4>CD</s4>
<s5>97</s5>
</fC03>
<fC03 i1="19" i2="X" l="SPA"><s0>Escena natural</s0>
<s4>CD</s4>
<s5>97</s5>
</fC03>
<fN21><s1>251</s1>
</fN21>
<fN44 i1="01"><s1>OTO</s1>
</fN44>
<fN82><s1>OTO</s1>
</fN82>
</pA>
</standard>
<server><NO>PASCAL 14-0199549 INIST</NO>
<ET>Text recognition in multimedia documents: a study of two neural-based OCRs using and avoiding character segmentation</ET>
<AU>ELAGOUNI (Khaoula); GARCIA (Christophe); MAMALET (Franck); SEBILLOT (Pascale)</AU>
<AF>Orange Labs R&D/35512 Cesson-Sévigné/France (1 aut., 3 aut.); LIRIS/INSA de Lyon/69621 Villeurbanne/France (2 aut.); IRISA/INSA de Rennes/35042 Rennes/France (4 aut.)</AF>
<DT>Publication en série; Niveau analytique</DT>
<SO>International journal on document analysis and recognition : (Print); ISSN 1433-2833; Allemagne; Da. 2014; Vol. 17; No. 1; Pp. 19-31; Bibl. 50 ref.</SO>
<LA>Anglais</LA>
<EA>Text embedded in multimedia documents represents an important semantic information that helps to automatically access the content. This paper proposes two neural-based optical character recognition (OCR) systems that handle the text recognition problem in different ways. The first approach segments a text image into individual characters before recognizing them, while the second one avoids the segmentation step by integrating a multi-scale scanning scheme that allows to jointly localize and recognize characters at each position and scale. Some linguistic knowledge is also incorporated into the proposed schemes to remove errors due to recognition confusions. Both OCR systems are applied to caption texts embedded in videos and in natural scene images and provide outstanding results showing that the proposed approaches outperform the state-of-the-art methods.</EA>
<CC>001D02C04; 001D02C03; 001D02C06</CC>
<FD>Reconnaissance caractère; Texte; Reconnaissance forme; Analyse documentaire; Multimédia; Reconnaissance optique caractère; Accès contenu; Vision ordinateur; Linguistique mathématique; Signal vidéo; Analyse scène; Présentation document; Sémantique; Balayage; Sous titrage; Convolution; Réseau neuronal; Segmentation image; Scène naturelle</FD>
<ED>Character recognition; Text; Pattern recognition; Document analysis; Multimedia; Optical character recognition; Content access; Computer vision; Computational linguistics; Video signal; Scene analysis; Document layout; Semantics; Scanning; Caption; Convolution; Neural network; Image segmentation; Natural scenes</ED>
<SD>Reconocimiento carácter; Texto; Reconocimiento patrón; Análisis documental; Multimedia; Reconocimento óptico de caracteres; Acceso contenido; Visión ordenador; Linguística matemática; Señal video; Análisis escena; Presentación documento; Semántica; Exploración; Subtítulo; Convolución; Red neuronal; Segmentación de imágenes; Escena natural</SD>
<LO>INIST-26790.354000501888130020</LO>
<ID>14-0199549</ID>
</server>
</inist>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/PascalFrancis/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000009 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Corpus/biblio.hfd -nk 000009 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= PascalFrancis |étape= Corpus |type= RBID |clé= Pascal:14-0199549 |texte= Text recognition in multimedia documents: a study of two neural-based OCRs using and avoiding character segmentation }}
This area was generated with Dilib version V0.6.32. |