Indexing, Browsing, and Searching of Digital Video and Digital Audio Information
Identifieur interne : 001D47 ( Main/Exploration ); précédent : 001D46; suivant : 001D48Indexing, Browsing, and Searching of Digital Video and Digital Audio Information
Auteurs : F. Smeaton [Irlande (pays)]Source :
- Lecture Notes in Computer Science [ 0302-9743 ] ; 2001.
Abstract
Abstract: In this chapter we examine various techniques for providing content access to information stored in a continuous medium, namely digital audio and digital video. Our coverage of audio is centered around post-processing the output of automatic recognition of speech or phones and we describe the various approaches than have been taken in this area. In order to give reasonable coverage of the possibilities and limitations of content-based access to digital video information we sketch out at a high level, the approaches taken in various video compression algorithms, principally the MPEG family. We then address approaches to shot and scene boundary detection, choosing representative frames for browsing and for search, and various browsing interfaces that have been developed. We finish with an overview of the likely developments in this area in the future.
Url:
DOI: 10.1007/3-540-45368-7_5
Affiliations:
Links toward previous steps (curation, corpus...)
- to stream Istex, to step Corpus: 001042
- to stream Istex, to step Curation: 000F90
- to stream Istex, to step Checkpoint: 001350
- to stream Main, to step Merge: 001E47
- to stream Main, to step Curation: 001D47
Le document en format XML
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title xml:lang="en">Indexing, Browsing, and Searching of Digital Video and Digital Audio Information</title>
<author><name sortKey="Smeaton, F" sort="Smeaton, F" uniqKey="Smeaton F" first="F." last="Smeaton">F. Smeaton</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:04B6568D2583FB7E6F6A6F585DEEE61A58234AD5</idno>
<date when="2000" year="2000">2000</date>
<idno type="doi">10.1007/3-540-45368-7_5</idno>
<idno type="url">https://api.istex.fr/document/04B6568D2583FB7E6F6A6F585DEEE61A58234AD5/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">001042</idno>
<idno type="wicri:Area/Istex/Curation">000F90</idno>
<idno type="wicri:Area/Istex/Checkpoint">001350</idno>
<idno type="wicri:doubleKey">0302-9743:2000:Smeaton F:indexing:browsing:and</idno>
<idno type="wicri:Area/Main/Merge">001E47</idno>
<idno type="wicri:Area/Main/Curation">001D47</idno>
<idno type="wicri:Area/Main/Exploration">001D47</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">Indexing, Browsing, and Searching of Digital Video and Digital Audio Information</title>
<author><name sortKey="Smeaton, F" sort="Smeaton, F" uniqKey="Smeaton F" first="F." last="Smeaton">F. Smeaton</name>
<affiliation wicri:level="1"><country xml:lang="fr">Irlande (pays)</country>
<wicri:regionArea>School of Computer Applications, Dublin City University, Dublin 9, Glasnevin</wicri:regionArea>
<wicri:noRegion>Glasnevin</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Irlande (pays)</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="s">Lecture Notes in Computer Science</title>
<imprint><date>2001</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">04B6568D2583FB7E6F6A6F585DEEE61A58234AD5</idno>
<idno type="DOI">10.1007/3-540-45368-7_5</idno>
<idno type="ChapterID">5</idno>
<idno type="ChapterID">Chap5</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Abstract: In this chapter we examine various techniques for providing content access to information stored in a continuous medium, namely digital audio and digital video. Our coverage of audio is centered around post-processing the output of automatic recognition of speech or phones and we describe the various approaches than have been taken in this area. In order to give reasonable coverage of the possibilities and limitations of content-based access to digital video information we sketch out at a high level, the approaches taken in various video compression algorithms, principally the MPEG family. We then address approaches to shot and scene boundary detection, choosing representative frames for browsing and for search, and various browsing interfaces that have been developed. We finish with an overview of the likely developments in this area in the future.</div>
</front>
</TEI>
<affiliations><list><country><li>Irlande (pays)</li>
</country>
</list>
<tree><country name="Irlande (pays)"><noRegion><name sortKey="Smeaton, F" sort="Smeaton, F" uniqKey="Smeaton F" first="F." last="Smeaton">F. Smeaton</name>
</noRegion>
<name sortKey="Smeaton, F" sort="Smeaton, F" uniqKey="Smeaton F" first="F." last="Smeaton">F. Smeaton</name>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001D47 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001D47 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Exploration |type= RBID |clé= ISTEX:04B6568D2583FB7E6F6A6F585DEEE61A58234AD5 |texte= Indexing, Browsing, and Searching of Digital Video and Digital Audio Information }}
This area was generated with Dilib version V0.6.32. |