A Model-Based Iterative Method for Caption Extraction in Compressed MPEG Video
Identifieur interne : 002B19 ( Istex/Corpus ); précédent : 002B18; suivant : 002B20A Model-Based Iterative Method for Caption Extraction in Compressed MPEG Video
Auteurs : Daniel Márquez ; Jesús Besc SSource :
- Lecture Notes in Computer Science [ 0302-9743 ] ; 2007.
Abstract
Abstract: We here describe a method for caption extraction that totally works in the MPEG compressed domain. As opposed to other compressed domain methods; it does not need to refine their results in the pixel domain. It consists of two phases: first, a selection of candidate frames with captions, based on a rigorous statistical design of an AC coefficients mask; second, an extraction of caption boxes from the pre-selected set of candidate frames. Caption extraction relies on a model-based approach to obtaining the caption mask, robust enough to avoid the use of any subsequent refinement.
Url:
DOI: 10.1007/978-3-540-77051-0_9
Links to Exploration step
ISTEX:AD6B91C5A61EC6049F897D79E17386EA9631E95FLe document en format XML
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title xml:lang="en">A Model-Based Iterative Method for Caption Extraction in Compressed MPEG Video</title>
<author><name sortKey="Marquez, Daniel" sort="Marquez, Daniel" uniqKey="Marquez D" first="Daniel" last="Márquez">Daniel Márquez</name>
<affiliation><mods:affiliation>Grupo de Tratamiento de Imágenes, Escuela Politécnica Superior, Universidad Autónoma de Madrid, E-28049 Madrid, Spain</mods:affiliation>
</affiliation>
<affiliation><mods:affiliation>E-mail: daniel.marquez@uam.es</mods:affiliation>
</affiliation>
</author>
<author><name sortKey="Besc S, Jesus" sort="Besc S, Jesus" uniqKey="Besc S J" first="Jesús" last="Besc S">Jesús Besc S</name>
<affiliation><mods:affiliation>Grupo de Tratamiento de Imágenes, Escuela Politécnica Superior, Universidad Autónoma de Madrid, E-28049 Madrid, Spain</mods:affiliation>
</affiliation>
<affiliation><mods:affiliation>E-mail: j.bescos@uam.es</mods:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:AD6B91C5A61EC6049F897D79E17386EA9631E95F</idno>
<date when="2007" year="2007">2007</date>
<idno type="doi">10.1007/978-3-540-77051-0_9</idno>
<idno type="url">https://api.istex.fr/document/AD6B91C5A61EC6049F897D79E17386EA9631E95F/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">002B19</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">A Model-Based Iterative Method for Caption Extraction in Compressed MPEG Video</title>
<author><name sortKey="Marquez, Daniel" sort="Marquez, Daniel" uniqKey="Marquez D" first="Daniel" last="Márquez">Daniel Márquez</name>
<affiliation><mods:affiliation>Grupo de Tratamiento de Imágenes, Escuela Politécnica Superior, Universidad Autónoma de Madrid, E-28049 Madrid, Spain</mods:affiliation>
</affiliation>
<affiliation><mods:affiliation>E-mail: daniel.marquez@uam.es</mods:affiliation>
</affiliation>
</author>
<author><name sortKey="Besc S, Jesus" sort="Besc S, Jesus" uniqKey="Besc S J" first="Jesús" last="Besc S">Jesús Besc S</name>
<affiliation><mods:affiliation>Grupo de Tratamiento de Imágenes, Escuela Politécnica Superior, Universidad Autónoma de Madrid, E-28049 Madrid, Spain</mods:affiliation>
</affiliation>
<affiliation><mods:affiliation>E-mail: j.bescos@uam.es</mods:affiliation>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="s">Lecture Notes in Computer Science</title>
<imprint><date>2007</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">AD6B91C5A61EC6049F897D79E17386EA9631E95F</idno>
<idno type="DOI">10.1007/978-3-540-77051-0_9</idno>
<idno type="ChapterID">9</idno>
<idno type="ChapterID">Chap9</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Abstract: We here describe a method for caption extraction that totally works in the MPEG compressed domain. As opposed to other compressed domain methods; it does not need to refine their results in the pixel domain. It consists of two phases: first, a selection of candidate frames with captions, based on a rigorous statistical design of an AC coefficients mask; second, an extraction of caption boxes from the pre-selected set of candidate frames. Caption extraction relies on a model-based approach to obtaining the caption mask, robust enough to avoid the use of any subsequent refinement.</div>
</front>
</TEI>
<istex><corpusName>springer</corpusName>
<author><json:item><name>Daniel Márquez</name>
<affiliations><json:string>Grupo de Tratamiento de Imágenes, Escuela Politécnica Superior, Universidad Autónoma de Madrid, E-28049 Madrid, Spain</json:string>
<json:string>E-mail: daniel.marquez@uam.es</json:string>
</affiliations>
</json:item>
<json:item><name>Jesús Bescós</name>
<affiliations><json:string>Grupo de Tratamiento de Imágenes, Escuela Politécnica Superior, Universidad Autónoma de Madrid, E-28049 Madrid, Spain</json:string>
<json:string>E-mail: j.bescos@uam.es</json:string>
</affiliations>
</json:item>
</author>
<language><json:string>eng</json:string>
</language>
<abstract>Abstract: We here describe a method for caption extraction that totally works in the MPEG compressed domain. As opposed to other compressed domain methods; it does not need to refine their results in the pixel domain. It consists of two phases: first, a selection of candidate frames with captions, based on a rigorous statistical design of an AC coefficients mask; second, an extraction of caption boxes from the pre-selected set of candidate frames. Caption extraction relies on a model-based approach to obtaining the caption mask, robust enough to avoid the use of any subsequent refinement.</abstract>
<qualityIndicators><score>2.445</score>
<pdfVersion>1.3</pdfVersion>
<pdfPageSize>430 x 660 pts</pdfPageSize>
<refBibsNative>false</refBibsNative>
<keywordCount>0</keywordCount>
<abstractCharCount>595</abstractCharCount>
<pdfWordCount>1305</pdfWordCount>
<pdfCharCount>7430</pdfCharCount>
<pdfPageCount>4</pdfPageCount>
<abstractWordCount>95</abstractWordCount>
</qualityIndicators>
<title>A Model-Based Iterative Method for Caption Extraction in Compressed MPEG Video</title>
<genre.original><json:string>OriginalPaper</json:string>
</genre.original>
<chapterId><json:string>9</json:string>
<json:string>Chap9</json:string>
</chapterId>
<genre><json:string>conference [eBooks]</json:string>
</genre>
<serie><editor><json:item><name>David Hutchison</name>
</json:item>
<json:item><name>Takeo Kanade</name>
</json:item>
<json:item><name>Josef Kittler</name>
</json:item>
<json:item><name>Jon M. Kleinberg</name>
</json:item>
<json:item><name>Friedemann Mattern</name>
</json:item>
<json:item><name>John C. Mitchell</name>
</json:item>
<json:item><name>Moni Naor</name>
</json:item>
<json:item><name>Oscar Nierstrasz</name>
</json:item>
<json:item><name>C. Pandu Rangan</name>
</json:item>
<json:item><name>Bernhard Steffen</name>
</json:item>
<json:item><name>Madhu Sudan</name>
</json:item>
<json:item><name>Demetri Terzopoulos</name>
</json:item>
<json:item><name>Doug Tygar</name>
</json:item>
<json:item><name>Moshe Y. Vardi</name>
</json:item>
<json:item><name>Gerhard Weikum</name>
</json:item>
</editor>
<issn><json:string>0302-9743</json:string>
</issn>
<language><json:string>unknown</json:string>
</language>
<eissn><json:string>1611-3349</json:string>
</eissn>
<title>Lecture Notes in Computer Science</title>
<copyrightDate>2007</copyrightDate>
</serie>
<host><editor><json:item><name>Bianca Falcidieno</name>
</json:item>
<json:item><name>Michela Spagnuolo</name>
</json:item>
<json:item><name>Yannis Avrithis</name>
</json:item>
<json:item><name>Ioannis Kompatsiaris</name>
</json:item>
<json:item><name>Paul Buitelaar</name>
</json:item>
</editor>
<subject><json:item><value>Computer Science</value>
</json:item>
<json:item><value>Computer Science</value>
</json:item>
<json:item><value>Multimedia Information Systems</value>
</json:item>
<json:item><value>Computer Communication Networks</value>
</json:item>
<json:item><value>Information Systems Applications (incl.Internet)</value>
</json:item>
<json:item><value>Data Mining and Knowledge Discovery</value>
</json:item>
<json:item><value>Document Preparation and Text Processing</value>
</json:item>
<json:item><value>Image Processing and Computer Vision</value>
</json:item>
</subject>
<isbn><json:string>978-3-540-77033-6</json:string>
</isbn>
<language><json:string>unknown</json:string>
</language>
<eissn><json:string>1611-3349</json:string>
</eissn>
<title>Semantic Multimedia</title>
<genre.original><json:string>Proceedings</json:string>
</genre.original>
<bookId><json:string>978-3-540-77051-0</json:string>
</bookId>
<volume>4816</volume>
<pages><last>94</last>
<first>91</first>
</pages>
<issn><json:string>0302-9743</json:string>
</issn>
<genre><json:string>Book Series</json:string>
</genre>
<eisbn><json:string>978-3-540-77051-0</json:string>
</eisbn>
<copyrightDate>2007</copyrightDate>
<doi><json:string>10.1007/978-3-540-77051-0</json:string>
</doi>
</host>
<publicationDate>2007</publicationDate>
<copyrightDate>2007</copyrightDate>
<doi><json:string>10.1007/978-3-540-77051-0_9</json:string>
</doi>
<id>AD6B91C5A61EC6049F897D79E17386EA9631E95F</id>
<fulltext><json:item><original>true</original>
<mimetype>application/pdf</mimetype>
<extension>pdf</extension>
<uri>https://api.istex.fr/document/AD6B91C5A61EC6049F897D79E17386EA9631E95F/fulltext/pdf</uri>
</json:item>
<json:item><original>false</original>
<mimetype>application/zip</mimetype>
<extension>zip</extension>
<uri>https://api.istex.fr/document/AD6B91C5A61EC6049F897D79E17386EA9631E95F/fulltext/zip</uri>
</json:item>
<istex:fulltextTEI uri="https://api.istex.fr/document/AD6B91C5A61EC6049F897D79E17386EA9631E95F/fulltext/tei"><teiHeader><fileDesc><titleStmt><title level="a" type="main" xml:lang="en">A Model-Based Iterative Method for Caption Extraction in Compressed MPEG Video</title>
<respStmt xml:id="ISTEX-API" resp="Références bibliographiques récupérées via GROBID" name="ISTEX-API (INIST-CNRS)"></respStmt>
</titleStmt>
<publicationStmt><authority>ISTEX</authority>
<publisher>Springer Berlin Heidelberg</publisher>
<pubPlace>Berlin, Heidelberg</pubPlace>
<availability><p>SPRINGER</p>
</availability>
<date>2007</date>
</publicationStmt>
<sourceDesc><biblStruct type="inbook"><analytic><title level="a" type="main" xml:lang="en">A Model-Based Iterative Method for Caption Extraction in Compressed MPEG Video</title>
<author><persName><forename type="first">Daniel</forename>
<surname>Márquez</surname>
</persName>
<email>daniel.marquez@uam.es</email>
<affiliation>Grupo de Tratamiento de Imágenes, Escuela Politécnica Superior, Universidad Autónoma de Madrid, E-28049 Madrid, Spain</affiliation>
</author>
<author><persName><forename type="first">Jesús</forename>
<surname>Bescós</surname>
</persName>
<email>j.bescos@uam.es</email>
<affiliation>Grupo de Tratamiento de Imágenes, Escuela Politécnica Superior, Universidad Autónoma de Madrid, E-28049 Madrid, Spain</affiliation>
</author>
</analytic>
<monogr><title level="m">Semantic Multimedia</title>
<title level="m" type="sub">Second International Conference on Semantic and Digital Media Technologies, SAMT 2007, Genoa, Italy, December 5-7, 2007. Proceedings</title>
<idno type="pISBN">978-3-540-77033-6</idno>
<idno type="eISBN">978-3-540-77051-0</idno>
<idno type="pISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="DOI">10.1007/978-3-540-77051-0</idno>
<idno type="BookID">978-3-540-77051-0</idno>
<idno type="BookTitleID">158365</idno>
<idno type="BookSequenceNumber">4816</idno>
<idno type="BookVolumeNumber">4816</idno>
<idno type="BookChapterCount">39</idno>
<editor><persName><forename type="first">Bianca</forename>
<surname>Falcidieno</surname>
</persName>
</editor>
<editor><persName><forename type="first">Michela</forename>
<surname>Spagnuolo</surname>
</persName>
</editor>
<editor><persName><forename type="first">Yannis</forename>
<surname>Avrithis</surname>
</persName>
</editor>
<editor><persName><forename type="first">Ioannis</forename>
<surname>Kompatsiaris</surname>
</persName>
</editor>
<editor><persName><forename type="first">Paul</forename>
<surname>Buitelaar</surname>
</persName>
</editor>
<imprint><publisher>Springer Berlin Heidelberg</publisher>
<pubPlace>Berlin, Heidelberg</pubPlace>
<date type="published" when="2007"></date>
<biblScope unit="volume">4816</biblScope>
<biblScope unit="page" from="91">91</biblScope>
<biblScope unit="page" to="94">94</biblScope>
</imprint>
</monogr>
<series><title level="s">Lecture Notes in Computer Science</title>
<editor><persName><forename type="first">David</forename>
<surname>Hutchison</surname>
</persName>
</editor>
<editor><persName><forename type="first">Takeo</forename>
<surname>Kanade</surname>
</persName>
</editor>
<editor><persName><forename type="first">Josef</forename>
<surname>Kittler</surname>
</persName>
</editor>
<editor><persName><forename type="first">Jon</forename>
<forename type="first">M.</forename>
<surname>Kleinberg</surname>
</persName>
</editor>
<editor><persName><forename type="first">Friedemann</forename>
<surname>Mattern</surname>
</persName>
</editor>
<editor><persName><forename type="first">John</forename>
<forename type="first">C.</forename>
<surname>Mitchell</surname>
</persName>
</editor>
<editor><persName><forename type="first">Moni</forename>
<surname>Naor</surname>
</persName>
</editor>
<editor><persName><forename type="first">Oscar</forename>
<surname>Nierstrasz</surname>
</persName>
</editor>
<editor><persName><forename type="first">C.</forename>
<surname>Pandu Rangan</surname>
</persName>
</editor>
<editor><persName><forename type="first">Bernhard</forename>
<surname>Steffen</surname>
</persName>
</editor>
<editor><persName><forename type="first">Madhu</forename>
<surname>Sudan</surname>
</persName>
</editor>
<editor><persName><forename type="first">Demetri</forename>
<surname>Terzopoulos</surname>
</persName>
</editor>
<editor><persName><forename type="first">Doug</forename>
<surname>Tygar</surname>
</persName>
</editor>
<editor><persName><forename type="first">Moshe</forename>
<forename type="first">Y.</forename>
<surname>Vardi</surname>
</persName>
</editor>
<editor><persName><forename type="first">Gerhard</forename>
<surname>Weikum</surname>
</persName>
</editor>
<biblScope><date>2007</date>
</biblScope>
<idno type="pISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="seriesId">558</idno>
</series>
<idno type="istex">AD6B91C5A61EC6049F897D79E17386EA9631E95F</idno>
<idno type="DOI">10.1007/978-3-540-77051-0_9</idno>
<idno type="ChapterID">9</idno>
<idno type="ChapterID">Chap9</idno>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><creation><date>2007</date>
</creation>
<langUsage><language ident="en">en</language>
</langUsage>
<abstract xml:lang="en"><p>Abstract: We here describe a method for caption extraction that totally works in the MPEG compressed domain. As opposed to other compressed domain methods; it does not need to refine their results in the pixel domain. It consists of two phases: first, a selection of candidate frames with captions, based on a rigorous statistical design of an AC coefficients mask; second, an extraction of caption boxes from the pre-selected set of candidate frames. Caption extraction relies on a model-based approach to obtaining the caption mask, robust enough to avoid the use of any subsequent refinement.</p>
</abstract>
<textClass><keywords scheme="Book Subject Collection"><list><label>SUCO11645</label>
<item><term>Computer Science</term>
</item>
</list>
</keywords>
</textClass>
<textClass><keywords scheme="Book Subject Group"><list><label>I</label>
<label>I18059</label>
<label>I13022</label>
<label>I18040</label>
<label>I18030</label>
<label>I21033</label>
<label>I22021</label>
<item><term>Computer Science</term>
</item>
<item><term>Multimedia Information Systems</term>
</item>
<item><term>Computer Communication Networks</term>
</item>
<item><term>Information Systems Applications (incl.Internet)</term>
</item>
<item><term>Data Mining and Knowledge Discovery</term>
</item>
<item><term>Document Preparation and Text Processing</term>
</item>
<item><term>Image Processing and Computer Vision</term>
</item>
</list>
</keywords>
</textClass>
</profileDesc>
<revisionDesc><change when="2007">Published</change>
<change xml:id="refBibs-istex" who="#ISTEX-API" when="2016-3-20">References added</change>
</revisionDesc>
</teiHeader>
</istex:fulltextTEI>
<json:item><original>false</original>
<mimetype>text/plain</mimetype>
<extension>txt</extension>
<uri>https://api.istex.fr/document/AD6B91C5A61EC6049F897D79E17386EA9631E95F/fulltext/txt</uri>
</json:item>
</fulltext>
<metadata><istex:metadataXml wicri:clean="Springer, Publisher found" wicri:toSee="no header"><istex:xmlDeclaration>version="1.0" encoding="UTF-8"</istex:xmlDeclaration>
<istex:docType PUBLIC="-//Springer-Verlag//DTD A++ V2.4//EN" URI="http://devel.springer.de/A++/V2.4/DTD/A++V2.4.dtd" name="istex:docType"></istex:docType>
<istex:document><Publisher><PublisherInfo><PublisherName>Springer Berlin Heidelberg</PublisherName>
<PublisherLocation>Berlin, Heidelberg</PublisherLocation>
</PublisherInfo>
<Series><SeriesInfo TocLevels="0" SeriesType="Series"><SeriesID>558</SeriesID>
<SeriesPrintISSN>0302-9743</SeriesPrintISSN>
<SeriesElectronicISSN>1611-3349</SeriesElectronicISSN>
<SeriesTitle Language="En">Lecture Notes in Computer Science</SeriesTitle>
</SeriesInfo>
<SeriesHeader><EditorGroup><Editor><EditorName DisplayOrder="Western"><GivenName>David</GivenName>
<FamilyName>Hutchison</FamilyName>
</EditorName>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>Takeo</GivenName>
<FamilyName>Kanade</FamilyName>
</EditorName>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>Josef</GivenName>
<FamilyName>Kittler</FamilyName>
</EditorName>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>Jon</GivenName>
<GivenName>M.</GivenName>
<FamilyName>Kleinberg</FamilyName>
</EditorName>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>Friedemann</GivenName>
<FamilyName>Mattern</FamilyName>
</EditorName>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>John</GivenName>
<GivenName>C.</GivenName>
<FamilyName>Mitchell</FamilyName>
</EditorName>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>Moni</GivenName>
<FamilyName>Naor</FamilyName>
</EditorName>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>Oscar</GivenName>
<FamilyName>Nierstrasz</FamilyName>
</EditorName>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>C.</GivenName>
<FamilyName>Pandu Rangan</FamilyName>
</EditorName>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>Bernhard</GivenName>
<FamilyName>Steffen</FamilyName>
</EditorName>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>Madhu</GivenName>
<FamilyName>Sudan</FamilyName>
</EditorName>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>Demetri</GivenName>
<FamilyName>Terzopoulos</FamilyName>
</EditorName>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>Doug</GivenName>
<FamilyName>Tygar</FamilyName>
</EditorName>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>Moshe</GivenName>
<GivenName>Y.</GivenName>
<FamilyName>Vardi</FamilyName>
</EditorName>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>Gerhard</GivenName>
<FamilyName>Weikum</FamilyName>
</EditorName>
</Editor>
</EditorGroup>
</SeriesHeader>
<Book Language="En"><BookInfo Language="En" NumberingStyle="Unnumbered" OutputMedium="All" TocLevels="0" ContainsESM="No" BookProductType="Proceedings" MediaType="eBook"><BookID>978-3-540-77051-0</BookID>
<BookTitle>Semantic Multimedia</BookTitle>
<BookSubTitle>Second International Conference on Semantic and Digital Media Technologies, SAMT 2007, Genoa, Italy, December 5-7, 2007. Proceedings</BookSubTitle>
<BookVolumeNumber>4816</BookVolumeNumber>
<BookSequenceNumber>4816</BookSequenceNumber>
<BookDOI>10.1007/978-3-540-77051-0</BookDOI>
<BookTitleID>158365</BookTitleID>
<BookPrintISBN>978-3-540-77033-6</BookPrintISBN>
<BookElectronicISBN>978-3-540-77051-0</BookElectronicISBN>
<BookChapterCount>39</BookChapterCount>
<BookCopyright><CopyrightHolderName>Springer-Verlag Berlin Heidelberg</CopyrightHolderName>
<CopyrightYear>2007</CopyrightYear>
</BookCopyright>
<BookSubjectGroup><BookSubject Type="Primary" Code="I">Computer Science</BookSubject>
<BookSubject Type="Secondary" Priority="1" Code="I18059">Multimedia Information Systems</BookSubject>
<BookSubject Type="Secondary" Priority="2" Code="I13022">Computer Communication Networks</BookSubject>
<BookSubject Type="Secondary" Priority="3" Code="I18040">Information Systems Applications (incl.Internet)</BookSubject>
<BookSubject Type="Secondary" Priority="4" Code="I18030">Data Mining and Knowledge Discovery</BookSubject>
<BookSubject Type="Secondary" Priority="5" Code="I21033">Document Preparation and Text Processing</BookSubject>
<BookSubject Type="Secondary" Priority="6" Code="I22021">Image Processing and Computer Vision</BookSubject>
<SubjectCollection Code="SUCO11645">Computer Science</SubjectCollection>
</BookSubjectGroup>
</BookInfo>
<BookHeader><EditorGroup><Editor><EditorName DisplayOrder="Western"><GivenName>Bianca</GivenName>
<FamilyName>Falcidieno</FamilyName>
</EditorName>
<Contact><Email>bianca.falcidieno@ge.imati.cnr.it</Email>
</Contact>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>Michela</GivenName>
<FamilyName>Spagnuolo</FamilyName>
</EditorName>
<Contact><Email>michela.spagnuolo@ge.imati.cnr.it</Email>
</Contact>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>Yannis</GivenName>
<FamilyName>Avrithis</FamilyName>
</EditorName>
<Contact><Email>iavr@image.ntua.gr</Email>
</Contact>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>Ioannis</GivenName>
<FamilyName>Kompatsiaris</FamilyName>
</EditorName>
<Contact><Email>ikom@iti.gr</Email>
</Contact>
</Editor>
<Editor><EditorName DisplayOrder="Western"><GivenName>Paul</GivenName>
<FamilyName>Buitelaar</FamilyName>
</EditorName>
<Contact><Email>paulb@dfki.de</Email>
</Contact>
</Editor>
</EditorGroup>
</BookHeader>
<Part ID="Part3"><PartInfo TocLevels="0"><PartID>3</PartID>
<PartSequenceNumber>3</PartSequenceNumber>
<PartTitle>Domain-Restricted Generation of Semantic Metadata from Multimodal Sources</PartTitle>
<PartChapterCount>7</PartChapterCount>
<PartContext><SeriesID>558</SeriesID>
<BookTitle>Semantic Multimedia</BookTitle>
</PartContext>
</PartInfo>
<Chapter ID="Chap9" Language="En"><ChapterInfo ChapterType="OriginalPaper" NumberingStyle="Unnumbered" TocLevels="0" ContainsESM="No"><ChapterID>9</ChapterID>
<ChapterDOI>10.1007/978-3-540-77051-0_9</ChapterDOI>
<ChapterSequenceNumber>9</ChapterSequenceNumber>
<ChapterTitle Language="En">A Model-Based Iterative Method for Caption Extraction in Compressed MPEG Video</ChapterTitle>
<ChapterFirstPage>91</ChapterFirstPage>
<ChapterLastPage>94</ChapterLastPage>
<ChapterCopyright><CopyrightHolderName>Springer-Verlag Berlin Heidelberg</CopyrightHolderName>
<CopyrightYear>2007</CopyrightYear>
</ChapterCopyright>
<ChapterGrants Type="Regular"><MetadataGrant Grant="OpenAccess"></MetadataGrant>
<AbstractGrant Grant="OpenAccess"></AbstractGrant>
<BodyPDFGrant Grant="Restricted"></BodyPDFGrant>
<BodyHTMLGrant Grant="Restricted"></BodyHTMLGrant>
<BibliographyGrant Grant="Restricted"></BibliographyGrant>
<ESMGrant Grant="Restricted"></ESMGrant>
</ChapterGrants>
<ChapterContext><SeriesID>558</SeriesID>
<PartID>3</PartID>
<BookID>978-3-540-77051-0</BookID>
<BookTitle>Semantic Multimedia</BookTitle>
</ChapterContext>
</ChapterInfo>
<ChapterHeader><AuthorGroup><Author AffiliationIDS="Aff1"><AuthorName DisplayOrder="Western"><GivenName>Daniel</GivenName>
<FamilyName>Márquez</FamilyName>
</AuthorName>
<Contact><Email>daniel.marquez@uam.es</Email>
</Contact>
</Author>
<Author AffiliationIDS="Aff1"><AuthorName DisplayOrder="Western"><GivenName>Jesús</GivenName>
<FamilyName>Bescós</FamilyName>
</AuthorName>
<Contact><Email>j.bescos@uam.es</Email>
</Contact>
</Author>
<Affiliation ID="Aff1"><OrgName>Grupo de Tratamiento de Imágenes, Escuela Politécnica Superior, Universidad Autónoma de Madrid, E-28049 Madrid</OrgName>
<OrgAddress><Country>Spain</Country>
</OrgAddress>
</Affiliation>
</AuthorGroup>
<Abstract ID="Abs1" Language="En"><Heading>Abstract</Heading>
<Para>We here describe a method for caption extraction that totally works in the MPEG compressed domain. As opposed to other compressed domain methods; it does not need to refine their results in the pixel domain. It consists of two phases: first, a selection of candidate frames with captions, based on a rigorous statistical design of an AC coefficients mask; second, an extraction of caption boxes from the pre-selected set of candidate frames. Caption extraction relies on a model-based approach to obtaining the caption mask, robust enough to avoid the use of any subsequent refinement.</Para>
</Abstract>
<ArticleNote Type="Misc"><SimplePara>Work partially supported by the European Commission under its 6<Superscript>th</Superscript>
Framework Programme (FP6-027685 - MESH Project) and by Spanish Institutions under projects TIN2004-07860-C02-01 and S-0505-TIC-0223.</SimplePara>
</ArticleNote>
</ChapterHeader>
<NoBody></NoBody>
</Chapter>
</Part>
</Book>
</Series>
</Publisher>
</istex:document>
</istex:metadataXml>
<mods version="3.6"><titleInfo lang="en"><title>A Model-Based Iterative Method for Caption Extraction in Compressed MPEG Video</title>
</titleInfo>
<titleInfo type="alternative" contentType="CDATA" lang="en"><title>A Model-Based Iterative Method for Caption Extraction in Compressed MPEG Video</title>
</titleInfo>
<name type="personal"><namePart type="given">Daniel</namePart>
<namePart type="family">Márquez</namePart>
<affiliation>Grupo de Tratamiento de Imágenes, Escuela Politécnica Superior, Universidad Autónoma de Madrid, E-28049 Madrid, Spain</affiliation>
<affiliation>E-mail: daniel.marquez@uam.es</affiliation>
<role><roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Jesús</namePart>
<namePart type="family">Bescós</namePart>
<affiliation>Grupo de Tratamiento de Imágenes, Escuela Politécnica Superior, Universidad Autónoma de Madrid, E-28049 Madrid, Spain</affiliation>
<affiliation>E-mail: j.bescos@uam.es</affiliation>
<role><roleTerm type="text">author</roleTerm>
</role>
</name>
<typeOfResource>text</typeOfResource>
<genre type="conference [eBooks]" displayLabel="OriginalPaper"></genre>
<originInfo><publisher>Springer Berlin Heidelberg</publisher>
<place><placeTerm type="text">Berlin, Heidelberg</placeTerm>
</place>
<dateIssued encoding="w3cdtf">2007</dateIssued>
<copyrightDate encoding="w3cdtf">2007</copyrightDate>
</originInfo>
<language><languageTerm type="code" authority="rfc3066">en</languageTerm>
<languageTerm type="code" authority="iso639-2b">eng</languageTerm>
</language>
<physicalDescription><internetMediaType>text/html</internetMediaType>
</physicalDescription>
<abstract lang="en">Abstract: We here describe a method for caption extraction that totally works in the MPEG compressed domain. As opposed to other compressed domain methods; it does not need to refine their results in the pixel domain. It consists of two phases: first, a selection of candidate frames with captions, based on a rigorous statistical design of an AC coefficients mask; second, an extraction of caption boxes from the pre-selected set of candidate frames. Caption extraction relies on a model-based approach to obtaining the caption mask, robust enough to avoid the use of any subsequent refinement.</abstract>
<relatedItem type="host"><titleInfo><title>Semantic Multimedia</title>
<subTitle>Second International Conference on Semantic and Digital Media Technologies, SAMT 2007, Genoa, Italy, December 5-7, 2007. Proceedings</subTitle>
</titleInfo>
<name type="personal"><namePart type="given">Bianca</namePart>
<namePart type="family">Falcidieno</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Michela</namePart>
<namePart type="family">Spagnuolo</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Yannis</namePart>
<namePart type="family">Avrithis</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Ioannis</namePart>
<namePart type="family">Kompatsiaris</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Paul</namePart>
<namePart type="family">Buitelaar</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<genre type="Book Series" displayLabel="Proceedings"></genre>
<originInfo><copyrightDate encoding="w3cdtf">2007</copyrightDate>
<issuance>monographic</issuance>
</originInfo>
<subject><genre>Book Subject Collection</genre>
<topic authority="SpringerSubjectCodes" authorityURI="SUCO11645">Computer Science</topic>
</subject>
<subject><genre>Book Subject Group</genre>
<topic authority="SpringerSubjectCodes" authorityURI="I">Computer Science</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I18059">Multimedia Information Systems</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I13022">Computer Communication Networks</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I18040">Information Systems Applications (incl.Internet)</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I18030">Data Mining and Knowledge Discovery</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I21033">Document Preparation and Text Processing</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I22021">Image Processing and Computer Vision</topic>
</subject>
<identifier type="DOI">10.1007/978-3-540-77051-0</identifier>
<identifier type="ISBN">978-3-540-77033-6</identifier>
<identifier type="eISBN">978-3-540-77051-0</identifier>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="eISSN">1611-3349</identifier>
<identifier type="BookTitleID">158365</identifier>
<identifier type="BookID">978-3-540-77051-0</identifier>
<identifier type="BookChapterCount">39</identifier>
<identifier type="BookVolumeNumber">4816</identifier>
<identifier type="BookSequenceNumber">4816</identifier>
<identifier type="PartChapterCount">7</identifier>
<part><date>2007</date>
<detail type="part"><title>Domain-Restricted Generation of Semantic Metadata from Multimodal Sources</title>
</detail>
<detail type="volume"><number>4816</number>
<caption>vol.</caption>
</detail>
<extent unit="pages"><start>91</start>
<end>94</end>
</extent>
</part>
<recordInfo><recordOrigin>Springer-Verlag Berlin Heidelberg, 2007</recordOrigin>
</recordInfo>
</relatedItem>
<relatedItem type="series"><titleInfo><title>Lecture Notes in Computer Science</title>
</titleInfo>
<name type="personal"><namePart type="given">David</namePart>
<namePart type="family">Hutchison</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Takeo</namePart>
<namePart type="family">Kanade</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Josef</namePart>
<namePart type="family">Kittler</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Jon</namePart>
<namePart type="given">M.</namePart>
<namePart type="family">Kleinberg</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Friedemann</namePart>
<namePart type="family">Mattern</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">John</namePart>
<namePart type="given">C.</namePart>
<namePart type="family">Mitchell</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Moni</namePart>
<namePart type="family">Naor</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Oscar</namePart>
<namePart type="family">Nierstrasz</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">C.</namePart>
<namePart type="family">Pandu Rangan</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Bernhard</namePart>
<namePart type="family">Steffen</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Madhu</namePart>
<namePart type="family">Sudan</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Demetri</namePart>
<namePart type="family">Terzopoulos</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Doug</namePart>
<namePart type="family">Tygar</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Moshe</namePart>
<namePart type="given">Y.</namePart>
<namePart type="family">Vardi</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal"><namePart type="given">Gerhard</namePart>
<namePart type="family">Weikum</namePart>
<role><roleTerm type="text">editor</roleTerm>
</role>
</name>
<originInfo><copyrightDate encoding="w3cdtf">2007</copyrightDate>
<issuance>serial</issuance>
</originInfo>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="eISSN">1611-3349</identifier>
<identifier type="SeriesID">558</identifier>
<recordInfo><recordOrigin>Springer-Verlag Berlin Heidelberg, 2007</recordOrigin>
</recordInfo>
</relatedItem>
<identifier type="istex">AD6B91C5A61EC6049F897D79E17386EA9631E95F</identifier>
<identifier type="DOI">10.1007/978-3-540-77051-0_9</identifier>
<identifier type="ChapterID">9</identifier>
<identifier type="ChapterID">Chap9</identifier>
<accessCondition type="use and reproduction" contentType="copyright">Springer-Verlag Berlin Heidelberg, 2007</accessCondition>
<recordInfo><recordContentSource>SPRINGER</recordContentSource>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2007</recordOrigin>
</recordInfo>
</mods>
</metadata>
<enrichments><istex:refBibTEI uri="https://api.istex.fr/document/AD6B91C5A61EC6049F897D79E17386EA9631E95F/enrichments/refBib"><teiHeader></teiHeader>
<text><front></front>
<body></body>
<back><listBibl><biblStruct xml:id="b0"><analytic><title level="a" type="main">Text Extraction in MPEG Compressed Video for Content-based Indexing</title>
<author><persName><forename type="first">Y</forename>
<forename type="middle">K</forename>
<surname>Lim</surname>
</persName>
</author>
<author><persName><forename type="first">S</forename>
<forename type="middle">H</forename>
<surname>Choi</surname>
</persName>
</author>
<author><persName><forename type="first">S</forename>
<forename type="middle">W</forename>
<surname>Lee</surname>
</persName>
</author>
</analytic>
<monogr><title level="m">Proc. ICPR</title>
<meeting>. ICPR</meeting>
<imprint><date type="published" when="2000"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b1"><analytic><title level="a" type="main">Robust Detection of Stylized Text-Events on Digital Video</title>
<author><persName><forename type="first">D</forename>
<surname>Crandall</surname>
</persName>
</author>
<author><persName><forename type="first">R</forename>
<surname>Kasturi</surname>
</persName>
</author>
</analytic>
<monogr><title level="m">Proc. 6th Int. Conf. on Document Analysis and Recognition</title>
<meeting>. 6th Int. Conf. on Document Analysis and Recognition</meeting>
<imprint><date type="published" when="2001"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b2"><analytic><title level="a" type="main">Automatic Caption Localization in Compressed Video</title>
<author><persName><forename type="first">Y</forename>
<surname>Zhong</surname>
</persName>
</author>
<author><persName><forename type="first">H</forename>
<surname>Zhang</surname>
</persName>
</author>
<author><persName><forename type="first">A</forename>
<forename type="middle">K</forename>
<surname>Jain</surname>
</persName>
</author>
</analytic>
<monogr><title level="j">IEEE Transactions on PAMI</title>
<imprint><date type="published" when="2000"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b3"><analytic><title level="a" type="main">Automatic Closed Caption Detection and Filtering in MPEG Vídeos for Vídeo Structuring</title>
<author><persName><forename type="first">D</forename>
<forename type="middle">Y</forename>
<surname>Chen</surname>
</persName>
</author>
<author><persName><forename type="first">M</forename>
<forename type="middle">H</forename>
<surname>Hsiao</surname>
</persName>
</author>
<author><persName><forename type="first">L</forename>
<surname>Suh-Yin</surname>
</persName>
</author>
</analytic>
<monogr><title level="j">Journal of Information Science and Engineering</title>
<imprint><biblScope unit="volume">22</biblScope>
<biblScope unit="issue">5</biblScope>
<date type="published" when="2006"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b4"><analytic><title level="a" type="main">Detection of Text Caption in Compressed Domain Vídeo</title>
<author><persName><forename type="first">Y</forename>
<surname>Zhang</surname>
</persName>
</author>
<author><persName><forename type="first">T</forename>
<surname>Chua</surname>
</persName>
</author>
</analytic>
<monogr><title level="m">Proc. ACM Workshop on Multimedia</title>
<meeting>. ACM Workshop on Multimedia</meeting>
<imprint><date type="published" when="2000"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b5"><analytic><title level="a" type="main">Fast Text Caption Localization on Vídeo Using Visual Rythm</title>
<author><persName><forename type="first">S</forename>
<surname>Chun</surname>
</persName>
</author>
<author><persName><forename type="first">H</forename>
<surname>Kim</surname>
</persName>
</author>
<author><persName><forename type="first">J</forename>
<forename type="middle">R</forename>
<surname>Kim</surname>
</persName>
</author>
<author><persName><forename type="first">S</forename>
<surname>Oh</surname>
</persName>
</author>
<author><persName><forename type="first">S</forename>
<surname>Sull</surname>
</persName>
</author>
</analytic>
<monogr><title level="m">Proc. 5th Intl. Conf. on Recent Advances in Visual Information Systems</title>
<meeting>. 5th Intl. Conf. on Recent Advances in Visual Information Systems</meeting>
<imprint><date type="published" when="2002"></date>
</imprint>
</monogr>
</biblStruct>
</listBibl>
</back>
</text>
</istex:refBibTEI>
</enrichments>
</istex>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Istex/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002B19 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Istex/Corpus/biblio.hfd -nk 002B19 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Istex |étape= Corpus |type= RBID |clé= ISTEX:AD6B91C5A61EC6049F897D79E17386EA9631E95F |texte= A Model-Based Iterative Method for Caption Extraction in Compressed MPEG Video }}
This area was generated with Dilib version V0.6.32. |