OCR Exploration Server

Using String Comparison in Context for Improved Relevance Feedback in Different Text Media

Internal identifier: 000256 (Istex/Corpus); previous: 000255; next: 000257

Authors: Adenike M. Lam-Adesina; Gareth J. F. Jones

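Source: String Processing and Information Retrieval, 13th International Conference, SPIRE 2006, Glasgow, UK, October 11-13, 2006, Proceedings. Lecture Notes in Computer Science, vol. 4209, pp. 229-241. Springer Berlin Heidelberg, 2006.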

RBID : ISTEX:66E6AA650E9D11341DE243DD7773075E3392078B

Abstract

Query expansion is a long-standing relevance feedback technique for improving the effectiveness of information retrieval systems. Previous investigations have shown it to be generally effective for electronic text, to give proportionally better improvement for automatic transcriptions of spoken documents, and to be at best of questionable utility for scanned text documents processed by optical character recognition (OCR). We introduce two corpus-based methods that use a string-edit distance measure in context to automatically detect and correct transcription errors. One method operates at query time and requires no modification of the document index file; the other operates at index time and uses the standard query-time expansion process. Experimental investigations show that these methods improve relevance feedback for all three media types and, most significantly, that relevance feedback can now be applied successfully to scanned text documents.
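
As an illustration of the string-edit distance idea mentioned in the abstract, the following is a minimal sketch and not the authors' method. It computes the standard Levenshtein distance and uses it to map a possibly misrecognized term onto the closest term drawn from a list of context terms. The function names, the `max_distance` threshold of 2, and the way context terms are supplied are all assumptions made for this example; the record does not describe how the paper selects context or thresholds.

```python
from typing import Iterable


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum number of single-character
    insertions, deletions, and substitutions turning a into b."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute (or match)
            ))
        previous = current
    return previous[-1]


def correct_term(term: str, context_terms: Iterable[str],
                 max_distance: int = 2) -> str:
    """Hypothetical helper: return the context term closest to `term`
    if it lies within `max_distance` edits, otherwise `term` unchanged.
    Ties are resolved in favour of the earliest candidate."""
    best, best_distance = term, max_distance + 1
    for candidate in context_terms:
        distance = edit_distance(term, candidate)
        if distance < best_distance:
            best, best_distance = candidate, distance
    return best
```

For instance, with the context terms "relevance", "retrieval", and "feedback", the OCR-style variant "retrieva1" is one substitution away from "retrieval" and would be mapped to it, while a term with no candidate within the threshold is left as it is.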

Url:
DOI: 10.1007/11880561_19

Links to Exploration step

ISTEX:66E6AA650E9D11341DE243DD7773075E3392078B

The document in XML format

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Using String Comparison in Context for Improved Relevance Feedback in Different Text Media</title>
<author>
<name sortKey="Lam Adesina, M" sort="Lam Adesina, M" uniqKey="Lam Adesina M" first="M." last="Lam-Adesina">M. Lam-Adesina</name>
<affiliation>
<mods:affiliation>Centre for Digital Video Processing &amp; School of Computing, Dublin City University, Dublin 9, Ireland</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: adenike@computing.dcu.ie</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Jones, F" sort="Jones, F" uniqKey="Jones F" first="F." last="Jones">F. Jones</name>
<affiliation>
<mods:affiliation>Centre for Digital Video Processing &amp; School of Computing, Dublin City University, Dublin 9, Ireland</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: gjones@computing.dcu.ie</mods:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:66E6AA650E9D11341DE243DD7773075E3392078B</idno>
<date when="2006" year="2006">2006</date>
<idno type="doi">10.1007/11880561_19</idno>
<idno type="url">https://api.istex.fr/document/66E6AA650E9D11341DE243DD7773075E3392078B/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000256</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Using String Comparison in Context for Improved Relevance Feedback in Different Text Media</title>
<author>
<name sortKey="Lam Adesina, M" sort="Lam Adesina, M" uniqKey="Lam Adesina M" first="M." last="Lam-Adesina">M. Lam-Adesina</name>
<affiliation>
<mods:affiliation>Centre for Digital Video Processing &amp; School of Computing, Dublin City University, Dublin 9, Ireland</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: adenike@computing.dcu.ie</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Jones, F" sort="Jones, F" uniqKey="Jones F" first="F." last="Jones">F. Jones</name>
<affiliation>
<mods:affiliation>Centre for Digital Video Processing &amp; School of Computing, Dublin City University, Dublin 9, Ireland</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: gjones@computing.dcu.ie</mods:affiliation>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2006</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
</series>
<idno type="istex">66E6AA650E9D11341DE243DD7773075E3392078B</idno>
<idno type="DOI">10.1007/11880561_19</idno>
<idno type="ChapterID">19</idno>
<idno type="ChapterID">Chap19</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: Query expansion is a long standing relevance feedback technique for improving the effectiveness of information retrieval systems. Previous investigations have shown it to be generally effective for electronic text, to give proportionally better improvement for automatic transcriptions of spoken documents, and to be at best of questionable utility for optical character recognized scanned text documents. We introduce two corpus-based methods based on using a string-edit distance measure in context to automatically detect and correct transcription errors. One method operates at query-time and requires no modification of the document index file, and the other at index-time and operates using the standard query-time expansion process. Experimental investigations show these methods to produce improvements in relevance feedback for all three media types, but most significantly mean that relevance feedback can now successfully be applied to scanned text documents.</div>
</front>
</TEI>
<istex>
<corpusName>springer</corpusName>
<author>
<json:item>
<name>Adenike M. Lam-Adesina</name>
<affiliations>
<json:string>Centre for Digital Video Processing &amp; School of Computing, Dublin City University, Dublin 9, Ireland</json:string>
<json:string>E-mail: adenike@computing.dcu.ie</json:string>
</affiliations>
</json:item>
<json:item>
<name>Gareth J. F. Jones</name>
<affiliations>
<json:string>Centre for Digital Video Processing &amp; School of Computing, Dublin City University, Dublin 9, Ireland</json:string>
<json:string>E-mail: gjones@computing.dcu.ie</json:string>
</affiliations>
</json:item>
</author>
<language>
<json:string>eng</json:string>
</language>
<abstract>Abstract: Query expansion is a long standing relevance feedback technique for improving the effectiveness of information retrieval systems. Previous investigations have shown it to be generally effective for electronic text, to give proportionally better improvement for automatic transcriptions of spoken documents, and to be at best of questionable utility for optical character recognized scanned text documents. We introduce two corpus-based methods based on using a string-edit distance measure in context to automatically detect and correct transcription errors. One method operates at query-time and requires no modification of the document index file, and the other at index-time and operates using the standard query-time expansion process. Experimental investigations show these methods to produce improvements in relevance feedback for all three media types, but most significantly mean that relevance feedback can now successfully be applied to scanned text documents.</abstract>
<qualityIndicators>
<score>6.632</score>
<pdfVersion>1.3</pdfVersion>
<pdfPageSize>430 x 660 pts</pdfPageSize>
<refBibsNative>false</refBibsNative>
<keywordCount>0</keywordCount>
<abstractCharCount>980</abstractCharCount>
<pdfWordCount>5860</pdfWordCount>
<pdfCharCount>35301</pdfCharCount>
<pdfPageCount>13</pdfPageCount>
<abstractWordCount>136</abstractWordCount>
</qualityIndicators>
<title>Using String Comparison in Context for Improved Relevance Feedback in Different Text Media</title>
<genre.original>
<json:string>OriginalPaper</json:string>
</genre.original>
<chapterId>
<json:string>19</json:string>
<json:string>Chap19</json:string>
</chapterId>
<genre>
<json:string>conference [eBooks]</json:string>
</genre>
<serie>
<editor>
<json:item>
<name>David Hutchison</name>
<affiliations>
<json:string>Lancaster University, UK</json:string>
</affiliations>
</json:item>
<json:item>
<name>Takeo Kanade</name>
<affiliations>
<json:string>Carnegie Mellon University, Pittsburgh, PA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Josef Kittler</name>
<affiliations>
<json:string>University of Surrey, Guildford, UK</json:string>
</affiliations>
</json:item>
<json:item>
<name>Jon M. Kleinberg</name>
<affiliations>
<json:string>Cornell University, Ithaca, NY, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Friedemann Mattern</name>
<affiliations>
<json:string>ETH Zurich, Switzerland</json:string>
</affiliations>
</json:item>
<json:item>
<name>John C. Mitchell</name>
<affiliations>
<json:string>Stanford University, CA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Moni Naor</name>
<affiliations>
<json:string>Weizmann Institute of Science, Rehovot, Israel</json:string>
</affiliations>
</json:item>
<json:item>
<name>Oscar Nierstrasz</name>
<affiliations>
<json:string>University of Bern, Switzerland</json:string>
</affiliations>
</json:item>
<json:item>
<name>C. Pandu Rangan</name>
<affiliations>
<json:string>Indian Institute of Technology, Madras, India</json:string>
</affiliations>
</json:item>
<json:item>
<name>Bernhard Steffen</name>
<affiliations>
<json:string>University of Dortmund, Germany</json:string>
</affiliations>
</json:item>
<json:item>
<name>Madhu Sudan</name>
<affiliations>
<json:string>Massachusetts Institute of Technology, MA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Demetri Terzopoulos</name>
<affiliations>
<json:string>University of California, Los Angeles, CA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Doug Tygar</name>
<affiliations>
<json:string>University of California, Berkeley, CA, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Moshe Y. Vardi</name>
<affiliations>
<json:string>Rice University, Houston, TX, USA</json:string>
</affiliations>
</json:item>
<json:item>
<name>Gerhard Weikum</name>
<affiliations>
<json:string>Max-Planck Institute of Computer Science, Saarbruecken, Germany</json:string>
</affiliations>
</json:item>
</editor>
<issn>
<json:string>0302-9743</json:string>
</issn>
<language>
<json:string>unknown</json:string>
</language>
<eissn>
<json:string>1611-3349</json:string>
</eissn>
<title>Lecture Notes in Computer Science</title>
<copyrightDate>2006</copyrightDate>
</serie>
<host>
<editor>
<json:item>
<name>Fabio Crestani</name>
<affiliations>
<json:string>Department of Computer and Information Science, University of Strathclyde, Scotland</json:string>
<json:string>E-mail: f.crestani@cis.strath.ac.uk</json:string>
</affiliations>
</json:item>
<json:item>
<name>Paolo Ferragina</name>
<affiliations>
<json:string>Dipartimento di Informatica, University of Pisa, Largo B. Pontecorvo 3, 56127, Pisa, Italy</json:string>
<json:string>E-mail: ferragina@di.unipi.it</json:string>
</affiliations>
</json:item>
<json:item>
<name>Mark Sanderson</name>
<affiliations>
<json:string>Department of Information Studies, University of Sheffield, Sheffield, UK</json:string>
<json:string>E-mail: m.sanderson@sheffield.ac.uk</json:string>
</affiliations>
</json:item>
</editor>
<subject>
<json:item>
<value>Computer Science</value>
</json:item>
<json:item>
<value>Computer Science</value>
</json:item>
<json:item>
<value>Information Storage and Retrieval</value>
</json:item>
<json:item>
<value>Artificial Intelligence (incl. Robotics)</value>
</json:item>
<json:item>
<value>Database Management</value>
</json:item>
<json:item>
<value>Data Structures</value>
</json:item>
<json:item>
<value>Coding and Information Theory</value>
</json:item>
<json:item>
<value>Algorithm Analysis and Problem Complexity</value>
</json:item>
</subject>
<isbn>
<json:string>978-3-540-45774-9</json:string>
</isbn>
<language>
<json:string>unknown</json:string>
</language>
<eissn>
<json:string>1611-3349</json:string>
</eissn>
<title>String Processing and Information Retrieval</title>
<genre.original>
<json:string>Proceedings</json:string>
</genre.original>
<bookId>
<json:string>978-3-540-45775-6</json:string>
</bookId>
<volume>4209</volume>
<pages>
<last>241</last>
<first>229</first>
</pages>
<issn>
<json:string>0302-9743</json:string>
</issn>
<genre>
<json:string>Book Series</json:string>
</genre>
<eisbn>
<json:string>978-3-540-45775-6</json:string>
</eisbn>
<copyrightDate>2006</copyrightDate>
<doi>
<json:string>10.1007/11880561</json:string>
</doi>
</host>
<publicationDate>2006</publicationDate>
<copyrightDate>2006</copyrightDate>
<doi>
<json:string>10.1007/11880561_19</json:string>
</doi>
<id>66E6AA650E9D11341DE243DD7773075E3392078B</id>
<fulltext>
<json:item>
<original>true</original>
<mimetype>application/pdf</mimetype>
<extension>pdf</extension>
<uri>https://api.istex.fr/document/66E6AA650E9D11341DE243DD7773075E3392078B/fulltext/pdf</uri>
</json:item>
<json:item>
<original>false</original>
<mimetype>application/zip</mimetype>
<extension>zip</extension>
<uri>https://api.istex.fr/document/66E6AA650E9D11341DE243DD7773075E3392078B/fulltext/zip</uri>
</json:item>
<istex:fulltextTEI uri="https://api.istex.fr/document/66E6AA650E9D11341DE243DD7773075E3392078B/fulltext/tei">
<teiHeader>
<fileDesc>
<titleStmt>
<title level="a" type="main" xml:lang="en">Using String Comparison in Context for Improved Relevance Feedback in Different Text Media</title>
<respStmt xml:id="ISTEX-API" resp="Références bibliographiques récupérées via GROBID" name="ISTEX-API (INIST-CNRS)"></respStmt>
</titleStmt>
<publicationStmt>
<authority>ISTEX</authority>
<publisher>Springer Berlin Heidelberg</publisher>
<pubPlace>Berlin, Heidelberg</pubPlace>
<availability>
<p>SPRINGER</p>
</availability>
<date>2006</date>
</publicationStmt>
<sourceDesc>
<biblStruct type="inbook">
<analytic>
<title level="a" type="main" xml:lang="en">Using String Comparison in Context for Improved Relevance Feedback in Different Text Media</title>
<author>
<persName>
<forename type="first">Adenike</forename>
<surname>Lam-Adesina</surname>
</persName>
<email>adenike@computing.dcu.ie</email>
<affiliation>Centre for Digital Video Processing &amp; School of Computing, Dublin City University, Dublin 9, Ireland</affiliation>
</author>
<author>
<persName>
<forename type="first">Gareth</forename>
<surname>Jones</surname>
</persName>
<email>gjones@computing.dcu.ie</email>
<affiliation>Centre for Digital Video Processing &amp; School of Computing, Dublin City University, Dublin 9, Ireland</affiliation>
</author>
</analytic>
<monogr>
<title level="m">String Processing and Information Retrieval</title>
<title level="m" type="sub">13th International Conference, SPIRE 2006, Glasgow, UK, October 11-13, 2006. Proceedings</title>
<idno type="pISBN">978-3-540-45774-9</idno>
<idno type="eISBN">978-3-540-45775-6</idno>
<idno type="pISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="DOI">10.1007/11880561</idno>
<idno type="BookID">978-3-540-45775-6</idno>
<idno type="BookTitleID">141649</idno>
<idno type="BookSequenceNumber">4209</idno>
<idno type="BookVolumeNumber">4209</idno>
<idno type="BookChapterCount">31</idno>
<editor>
<persName>
<forename type="first">Fabio</forename>
<surname>Crestani</surname>
</persName>
<email>f.crestani@cis.strath.ac.uk</email>
<affiliation>Department of Computer and Information Science, University of Strathclyde, Scotland</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Paolo</forename>
<surname>Ferragina</surname>
</persName>
<email>ferragina@di.unipi.it</email>
<affiliation>Dipartimento di Informatica, University of Pisa, Largo B. Pontecorvo 3, 56127, Pisa, Italy</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Mark</forename>
<surname>Sanderson</surname>
</persName>
<email>m.sanderson@sheffield.ac.uk</email>
<affiliation>Department of Information Studies, University of Sheffield, Sheffield, UK</affiliation>
</editor>
<imprint>
<publisher>Springer Berlin Heidelberg</publisher>
<pubPlace>Berlin, Heidelberg</pubPlace>
<date type="published" when="2006"></date>
<biblScope unit="volume">4209</biblScope>
<biblScope unit="page" from="229">229</biblScope>
<biblScope unit="page" to="241">241</biblScope>
</imprint>
</monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<editor>
<persName>
<forename type="first">David</forename>
<surname>Hutchison</surname>
</persName>
<affiliation>Lancaster University, UK</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Takeo</forename>
<surname>Kanade</surname>
</persName>
<affiliation>Carnegie Mellon University, Pittsburgh, PA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Josef</forename>
<surname>Kittler</surname>
</persName>
<affiliation>University of Surrey, Guildford, UK</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Jon</forename>
<forename type="first">M.</forename>
<surname>Kleinberg</surname>
</persName>
<affiliation>Cornell University, Ithaca, NY, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Friedemann</forename>
<surname>Mattern</surname>
</persName>
<affiliation>ETH Zurich, Switzerland</affiliation>
</editor>
<editor>
<persName>
<forename type="first">John</forename>
<forename type="first">C.</forename>
<surname>Mitchell</surname>
</persName>
<affiliation>Stanford University, CA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Moni</forename>
<surname>Naor</surname>
</persName>
<affiliation>Weizmann Institute of Science, Rehovot, Israel</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Oscar</forename>
<surname>Nierstrasz</surname>
</persName>
<affiliation>University of Bern, Switzerland</affiliation>
</editor>
<editor>
<persName>
<forename type="first">C.</forename>
<surname>Pandu Rangan</surname>
</persName>
<affiliation>Indian Institute of Technology, Madras, India</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Bernhard</forename>
<surname>Steffen</surname>
</persName>
<affiliation>University of Dortmund, Germany</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Madhu</forename>
<surname>Sudan</surname>
</persName>
<affiliation>Massachusetts Institute of Technology, MA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Demetri</forename>
<surname>Terzopoulos</surname>
</persName>
<affiliation>University of California, Los Angeles, CA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Doug</forename>
<surname>Tygar</surname>
</persName>
<affiliation>University of California, Berkeley, CA, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Moshe</forename>
<forename type="first">Y.</forename>
<surname>Vardi</surname>
</persName>
<affiliation>Rice University, Houston, TX, USA</affiliation>
</editor>
<editor>
<persName>
<forename type="first">Gerhard</forename>
<surname>Weikum</surname>
</persName>
<affiliation>Max-Planck Institute of Computer Science, Saarbruecken, Germany</affiliation>
</editor>
<biblScope>
<date>2006</date>
</biblScope>
<idno type="pISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="seriesId">558</idno>
</series>
<idno type="istex">66E6AA650E9D11341DE243DD7773075E3392078B</idno>
<idno type="DOI">10.1007/11880561_19</idno>
<idno type="ChapterID">19</idno>
<idno type="ChapterID">Chap19</idno>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<creation>
<date>2006</date>
</creation>
<langUsage>
<language ident="en">en</language>
</langUsage>
<abstract xml:lang="en">
<p>Abstract: Query expansion is a long standing relevance feedback technique for improving the effectiveness of information retrieval systems. Previous investigations have shown it to be generally effective for electronic text, to give proportionally better improvement for automatic transcriptions of spoken documents, and to be at best of questionable utility for optical character recognized scanned text documents. We introduce two corpus-based methods based on using a string-edit distance measure in context to automatically detect and correct transcription errors. One method operates at query-time and requires no modification of the document index file, and the other at index-time and operates using the standard query-time expansion process. Experimental investigations show these methods to produce improvements in relevance feedback for all three media types, but most significantly mean that relevance feedback can now successfully be applied to scanned text documents.</p>
</abstract>
<textClass>
<keywords scheme="Book Subject Collection">
<list>
<label>SUCO11645</label>
<item>
<term>Computer Science</term>
</item>
</list>
</keywords>
</textClass>
<textClass>
<keywords scheme="Book Subject Group">
<list>
<label>I</label>
<label>I18032</label>
<label>I21017</label>
<label>I18024</label>
<label>I15017</label>
<label>I15041</label>
<label>I16021</label>
<item>
<term>Computer Science</term>
</item>
<item>
<term>Information Storage and Retrieval</term>
</item>
<item>
<term>Artificial Intelligence (incl. Robotics)</term>
</item>
<item>
<term>Database Management</term>
</item>
<item>
<term>Data Structures</term>
</item>
<item>
<term>Coding and Information Theory</term>
</item>
<item>
<term>Algorithm Analysis and Problem Complexity</term>
</item>
</list>
</keywords>
</textClass>
</profileDesc>
<revisionDesc>
<change when="2006">Published</change>
<change xml:id="refBibs-istex" who="#ISTEX-API" when="2016-3-19">References added</change>
</revisionDesc>
</teiHeader>
</istex:fulltextTEI>
<json:item>
<original>false</original>
<mimetype>text/plain</mimetype>
<extension>txt</extension>
<uri>https://api.istex.fr/document/66E6AA650E9D11341DE243DD7773075E3392078B/fulltext/txt</uri>
</json:item>
</fulltext>
<metadata>
<istex:metadataXml wicri:clean="Springer, Publisher found" wicri:toSee="no header">
<istex:xmlDeclaration>version="1.0" encoding="UTF-8"</istex:xmlDeclaration>
<istex:docType PUBLIC="-//Springer-Verlag//DTD A++ V2.4//EN" URI="http://devel.springer.de/A++/V2.4/DTD/A++V2.4.dtd" name="istex:docType"></istex:docType>
<istex:document>
<Publisher>
<PublisherInfo>
<PublisherName>Springer Berlin Heidelberg</PublisherName>
<PublisherLocation>Berlin, Heidelberg</PublisherLocation>
</PublisherInfo>
<Series>
<SeriesInfo SeriesType="Series" TocLevels="0">
<SeriesID>558</SeriesID>
<SeriesPrintISSN>0302-9743</SeriesPrintISSN>
<SeriesElectronicISSN>1611-3349</SeriesElectronicISSN>
<SeriesTitle Language="En">Lecture Notes in Computer Science</SeriesTitle>
</SeriesInfo>
<SeriesHeader>
<EditorGroup>
<Editor AffiliationIDS="Aff1">
<EditorName DisplayOrder="Western">
<GivenName>David</GivenName>
<FamilyName>Hutchison</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff2">
<EditorName DisplayOrder="Western">
<GivenName>Takeo</GivenName>
<FamilyName>Kanade</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff3">
<EditorName DisplayOrder="Western">
<GivenName>Josef</GivenName>
<FamilyName>Kittler</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff4">
<EditorName DisplayOrder="Western">
<GivenName>Jon</GivenName>
<GivenName>M.</GivenName>
<FamilyName>Kleinberg</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff5">
<EditorName DisplayOrder="Western">
<GivenName>Friedemann</GivenName>
<FamilyName>Mattern</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff6">
<EditorName DisplayOrder="Western">
<GivenName>John</GivenName>
<GivenName>C.</GivenName>
<FamilyName>Mitchell</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff7">
<EditorName DisplayOrder="Western">
<GivenName>Moni</GivenName>
<FamilyName>Naor</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff8">
<EditorName DisplayOrder="Western">
<GivenName>Oscar</GivenName>
<FamilyName>Nierstrasz</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff9">
<EditorName DisplayOrder="Western">
<GivenName>C.</GivenName>
<FamilyName>Pandu Rangan</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff10">
<EditorName DisplayOrder="Western">
<GivenName>Bernhard</GivenName>
<FamilyName>Steffen</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff11">
<EditorName DisplayOrder="Western">
<GivenName>Madhu</GivenName>
<FamilyName>Sudan</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff12">
<EditorName DisplayOrder="Western">
<GivenName>Demetri</GivenName>
<FamilyName>Terzopoulos</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff13">
<EditorName DisplayOrder="Western">
<GivenName>Doug</GivenName>
<FamilyName>Tygar</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff14">
<EditorName DisplayOrder="Western">
<GivenName>Moshe</GivenName>
<GivenName>Y.</GivenName>
<FamilyName>Vardi</FamilyName>
</EditorName>
</Editor>
<Editor AffiliationIDS="Aff15">
<EditorName DisplayOrder="Western">
<GivenName>Gerhard</GivenName>
<FamilyName>Weikum</FamilyName>
</EditorName>
</Editor>
<Affiliation ID="Aff1">
<OrgName>Lancaster University</OrgName>
<OrgAddress>
<Country>UK</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff2">
<OrgName>Carnegie Mellon University</OrgName>
<OrgAddress>
<City>Pittsburgh</City>
<State>PA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff3">
<OrgName>University of Surrey</OrgName>
<OrgAddress>
<City>Guildford</City>
<Country>UK</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff4">
<OrgName>Cornell University</OrgName>
<OrgAddress>
<City>Ithaca</City>
<State>NY</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff5">
<OrgName>ETH Zurich</OrgName>
<OrgAddress>
<Country>Switzerland</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff6">
<OrgName>Stanford University</OrgName>
<OrgAddress>
<State>CA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff7">
<OrgName>Weizmann Institute of Science</OrgName>
<OrgAddress>
<City>Rehovot</City>
<Country>Israel</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff8">
<OrgName>University of Bern</OrgName>
<OrgAddress>
<Country>Switzerland</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff9">
<OrgName>Indian Institute of Technology</OrgName>
<OrgAddress>
<City>Madras</City>
<Country>India</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff10">
<OrgName>University of Dortmund</OrgName>
<OrgAddress>
<Country>Germany</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff11">
<OrgName>Massachusetts Institute of Technology</OrgName>
<OrgAddress>
<State>MA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff12">
<OrgName>University of California</OrgName>
<OrgAddress>
<City>Los Angeles</City>
<State>CA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff13">
<OrgName>University of California</OrgName>
<OrgAddress>
<City>Berkeley</City>
<State>CA</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff14">
<OrgName>Rice University</OrgName>
<OrgAddress>
<City>Houston</City>
<State>TX</State>
<Country>USA</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff15">
<OrgName>Max-Planck Institute of Computer Science</OrgName>
<OrgAddress>
<City>Saarbruecken</City>
<Country>Germany</Country>
</OrgAddress>
</Affiliation>
</EditorGroup>
</SeriesHeader>
<Book Language="En">
<BookInfo BookProductType="Proceedings" ContainsESM="No" Language="En" MediaType="eBook" NumberingDepth="2" NumberingStyle="ContentOnly" OutputMedium="All" TocLevels="0">
<BookID>978-3-540-45775-6</BookID>
<BookTitle>String Processing and Information Retrieval</BookTitle>
<BookSubTitle>13th International Conference, SPIRE 2006, Glasgow, UK, October 11-13, 2006. Proceedings</BookSubTitle>
<BookVolumeNumber>4209</BookVolumeNumber>
<BookSequenceNumber>4209</BookSequenceNumber>
<BookDOI>10.1007/11880561</BookDOI>
<BookTitleID>141649</BookTitleID>
<BookPrintISBN>978-3-540-45774-9</BookPrintISBN>
<BookElectronicISBN>978-3-540-45775-6</BookElectronicISBN>
<BookChapterCount>31</BookChapterCount>
<BookCopyright>
<CopyrightHolderName>Springer-Verlag Berlin Heidelberg</CopyrightHolderName>
<CopyrightYear>2006</CopyrightYear>
</BookCopyright>
<BookSubjectGroup>
<BookSubject Code="I" Type="Primary">Computer Science</BookSubject>
<BookSubject Code="I18032" Priority="1" Type="Secondary">Information Storage and Retrieval</BookSubject>
<BookSubject Code="I21017" Priority="2" Type="Secondary">Artificial Intelligence (incl. Robotics)</BookSubject>
<BookSubject Code="I18024" Priority="3" Type="Secondary">Database Management</BookSubject>
<BookSubject Code="I15017" Priority="4" Type="Secondary">Data Structures</BookSubject>
<BookSubject Code="I15041" Priority="5" Type="Secondary">Coding and Information Theory</BookSubject>
<BookSubject Code="I16021" Priority="6" Type="Secondary">Algorithm Analysis and Problem Complexity</BookSubject>
<SubjectCollection Code="SUCO11645">Computer Science</SubjectCollection>
</BookSubjectGroup>
<BookContext>
<SeriesID>558</SeriesID>
</BookContext>
</BookInfo>
<BookHeader>
<EditorGroup>
<Editor AffiliationIDS="Aff16">
<EditorName DisplayOrder="Western">
<GivenName>Fabio</GivenName>
<FamilyName>Crestani</FamilyName>
</EditorName>
<Contact>
<Email>f.crestani@cis.strath.ac.uk</Email>
</Contact>
</Editor>
<Editor AffiliationIDS="Aff17">
<EditorName DisplayOrder="Western">
<GivenName>Paolo</GivenName>
<FamilyName>Ferragina</FamilyName>
</EditorName>
<Contact>
<Email>ferragina@di.unipi.it</Email>
</Contact>
</Editor>
<Editor AffiliationIDS="Aff18">
<EditorName DisplayOrder="Western">
<GivenName>Mark</GivenName>
<FamilyName>Sanderson</FamilyName>
</EditorName>
<Contact>
<Email>m.sanderson@sheffield.ac.uk</Email>
</Contact>
</Editor>
<Affiliation ID="Aff16">
<OrgDivision>Department of Computer and Information Science</OrgDivision>
<OrgName>University of Strathclyde</OrgName>
<OrgAddress>
<Country>Scotland</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff17">
<OrgDivision>Dipartimento di Informatica</OrgDivision>
<OrgName>University of Pisa</OrgName>
<OrgAddress>
<Street>Largo B. Pontecorvo 3</Street>
<Postcode>56127</Postcode>
<City>Pisa</City>
<Country>Italy</Country>
</OrgAddress>
</Affiliation>
<Affiliation ID="Aff18">
<OrgDivision>Department of Information Studies</OrgDivision>
<OrgName>University of Sheffield</OrgName>
<OrgAddress>
<City>Sheffield</City>
<Country>UK</Country>
</OrgAddress>
</Affiliation>
</EditorGroup>
</BookHeader>
<Part ID="Part7">
<PartInfo TocLevels="0">
<PartID>7</PartID>
<PartSequenceNumber>7</PartSequenceNumber>
<PartTitle>Information Retrieval Applications</PartTitle>
<PartChapterCount>4</PartChapterCount>
<PartContext>
<SeriesID>558</SeriesID>
<BookTitle>String Processing and Information Retrieval</BookTitle>
</PartContext>
</PartInfo>
<Chapter ID="Chap19" Language="En">
<ChapterInfo ChapterType="OriginalPaper" ContainsESM="No" NumberingDepth="2" NumberingStyle="ContentOnly" TocLevels="0">
<ChapterID>19</ChapterID>
<ChapterDOI>10.1007/11880561_19</ChapterDOI>
<ChapterSequenceNumber>19</ChapterSequenceNumber>
<ChapterTitle Language="En">Using String Comparison in Context for Improved Relevance Feedback in Different Text Media</ChapterTitle>
<ChapterFirstPage>229</ChapterFirstPage>
<ChapterLastPage>241</ChapterLastPage>
<ChapterCopyright>
<CopyrightHolderName>Springer-Verlag Berlin Heidelberg</CopyrightHolderName>
<CopyrightYear>2006</CopyrightYear>
</ChapterCopyright>
<ChapterGrants Type="Regular">
<MetadataGrant Grant="OpenAccess"></MetadataGrant>
<AbstractGrant Grant="OpenAccess"></AbstractGrant>
<BodyPDFGrant Grant="Restricted"></BodyPDFGrant>
<BodyHTMLGrant Grant="Restricted"></BodyHTMLGrant>
<BibliographyGrant Grant="Restricted"></BibliographyGrant>
<ESMGrant Grant="Restricted"></ESMGrant>
</ChapterGrants>
<ChapterContext>
<SeriesID>558</SeriesID>
<PartID>7</PartID>
<BookID>978-3-540-45775-6</BookID>
<BookTitle>String Processing and Information Retrieval</BookTitle>
</ChapterContext>
</ChapterInfo>
<ChapterHeader>
<AuthorGroup>
<Author AffiliationIDS="Aff19">
<AuthorName DisplayOrder="Western">
<GivenName>Adenike</GivenName>
<GivenName>M.</GivenName>
<FamilyName>Lam-Adesina</FamilyName>
</AuthorName>
<Contact>
<Email>adenike@computing.dcu.ie</Email>
</Contact>
</Author>
<Author AffiliationIDS="Aff19">
<AuthorName DisplayOrder="Western">
<GivenName>Gareth</GivenName>
<GivenName>J.</GivenName>
<GivenName>F.</GivenName>
<FamilyName>Jones</FamilyName>
</AuthorName>
<Contact>
<Email>gjones@computing.dcu.ie</Email>
</Contact>
</Author>
<Affiliation ID="Aff19">
<OrgDivision>Centre for Digital Video Processing &amp; School of Computing</OrgDivision>
<OrgName>Dublin City University</OrgName>
<OrgAddress>
<City>Dublin 9</City>
<Country>Ireland</Country>
</OrgAddress>
</Affiliation>
</AuthorGroup>
<Abstract ID="Abs1" Language="En">
<Heading>Abstract</Heading>
<Para>Query expansion is a long standing relevance feedback technique for improving the effectiveness of information retrieval systems. Previous investigations have shown it to be generally effective for electronic text, to give proportionally better improvement for automatic transcriptions of spoken documents, and to be at best of questionable utility for optical character recognized scanned text documents. We introduce two corpus-based methods based on using a string-edit distance measure in context to automatically detect and correct transcription errors. One method operates at query-time and requires no modification of the document index file, and the other at index-time and operates using the standard query-time expansion process. Experimental investigations show these methods to produce improvements in relevance feedback for all three media types, but most significantly mean that relevance feedback can now successfully be applied to scanned text documents.</Para>
</Abstract>
</ChapterHeader>
<NoBody></NoBody>
</Chapter>
</Part>
</Book>
</Series>
</Publisher>
</istex:document>
</istex:metadataXml>
<mods version="3.6">
<titleInfo lang="en">
<title>Using String Comparison in Context for Improved Relevance Feedback in Different Text Media</title>
</titleInfo>
<titleInfo type="alternative" contentType="CDATA" lang="en">
<title>Using String Comparison in Context for Improved Relevance Feedback in Different Text Media</title>
</titleInfo>
<name type="personal">
<namePart type="given">Adenike</namePart>
<namePart type="given">M.</namePart>
<namePart type="family">Lam-Adesina</namePart>
<affiliation>Centre for Digital Video Processing &amp; School of Computing, Dublin City University, Dublin 9, Ireland</affiliation>
<affiliation>E-mail: adenike@computing.dcu.ie</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Gareth</namePart>
<namePart type="given">J.</namePart>
<namePart type="given">F.</namePart>
<namePart type="family">Jones</namePart>
<affiliation>Centre for Digital Video Processing &amp; School of Computing, Dublin City University, Dublin 9, Ireland</affiliation>
<affiliation>E-mail: gjones@computing.dcu.ie</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<typeOfResource>text</typeOfResource>
<genre type="conference [eBooks]" displayLabel="OriginalPaper"></genre>
<originInfo>
<publisher>Springer Berlin Heidelberg</publisher>
<place>
<placeTerm type="text">Berlin, Heidelberg</placeTerm>
</place>
<dateIssued encoding="w3cdtf">2006</dateIssued>
<copyrightDate encoding="w3cdtf">2006</copyrightDate>
</originInfo>
<language>
<languageTerm type="code" authority="rfc3066">en</languageTerm>
<languageTerm type="code" authority="iso639-2b">eng</languageTerm>
</language>
<physicalDescription>
<internetMediaType>text/html</internetMediaType>
</physicalDescription>
<abstract lang="en">Abstract: Query expansion is a long standing relevance feedback technique for improving the effectiveness of information retrieval systems. Previous investigations have shown it to be generally effective for electronic text, to give proportionally better improvement for automatic transcriptions of spoken documents, and to be at best of questionable utility for optical character recognized scanned text documents. We introduce two corpus-based methods based on using a string-edit distance measure in context to automatically detect and correct transcription errors. One method operates at query-time and requires no modification of the document index file, and the other at index-time and operates using the standard query-time expansion process. Experimental investigations show these methods to produce improvements in relevance feedback for all three media types, but most significantly mean that relevance feedback can now successfully be applied to scanned text documents.</abstract>
<relatedItem type="host">
<titleInfo>
<title>String Processing and Information Retrieval</title>
<subTitle>13th International Conference, SPIRE 2006, Glasgow, UK, October 11-13, 2006. Proceedings</subTitle>
</titleInfo>
<name type="personal">
<namePart type="given">Fabio</namePart>
<namePart type="family">Crestani</namePart>
<affiliation>Department of Computer and Information Science, University of Strathclyde, Scotland</affiliation>
<affiliation>E-mail: f.crestani@cis.strath.ac.uk</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Paolo</namePart>
<namePart type="family">Ferragina</namePart>
<affiliation>Dipartimento di Informatica, University of Pisa, Largo B. Pontecorvo 3, 56127, Pisa, Italy</affiliation>
<affiliation>E-mail: ferragina@di.unipi.it</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Mark</namePart>
<namePart type="family">Sanderson</namePart>
<affiliation>Department of Information Studies, University of Sheffield, Sheffield, UK</affiliation>
<affiliation>E-mail: m.sanderson@sheffield.ac.uk</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<genre type="Book Series" displayLabel="Proceedings"></genre>
<originInfo>
<copyrightDate encoding="w3cdtf">2006</copyrightDate>
<issuance>monographic</issuance>
</originInfo>
<subject>
<genre>Book Subject Collection</genre>
<topic authority="SpringerSubjectCodes" authorityURI="SUCO11645">Computer Science</topic>
</subject>
<subject>
<genre>Book Subject Group</genre>
<topic authority="SpringerSubjectCodes" authorityURI="I">Computer Science</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I18032">Information Storage and Retrieval</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I21017">Artificial Intelligence (incl. Robotics)</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I18024">Database Management</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I15017">Data Structures</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I15041">Coding and Information Theory</topic>
<topic authority="SpringerSubjectCodes" authorityURI="I16021">Algorithm Analysis and Problem Complexity</topic>
</subject>
<identifier type="DOI">10.1007/11880561</identifier>
<identifier type="ISBN">978-3-540-45774-9</identifier>
<identifier type="eISBN">978-3-540-45775-6</identifier>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="eISSN">1611-3349</identifier>
<identifier type="BookTitleID">141649</identifier>
<identifier type="BookID">978-3-540-45775-6</identifier>
<identifier type="BookChapterCount">31</identifier>
<identifier type="BookVolumeNumber">4209</identifier>
<identifier type="BookSequenceNumber">4209</identifier>
<identifier type="PartChapterCount">4</identifier>
<part>
<date>2006</date>
<detail type="part">
<title>Information Retrieval Applications</title>
</detail>
<detail type="volume">
<number>4209</number>
<caption>vol.</caption>
</detail>
<extent unit="pages">
<start>229</start>
<end>241</end>
</extent>
</part>
<recordInfo>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2006</recordOrigin>
</recordInfo>
</relatedItem>
<relatedItem type="series">
<titleInfo>
<title>Lecture Notes in Computer Science</title>
</titleInfo>
<name type="personal">
<namePart type="given">David</namePart>
<namePart type="family">Hutchison</namePart>
<affiliation>Lancaster University, UK</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Takeo</namePart>
<namePart type="family">Kanade</namePart>
<affiliation>Carnegie Mellon University, Pittsburgh, PA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Josef</namePart>
<namePart type="family">Kittler</namePart>
<affiliation>University of Surrey, Guildford, UK</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Jon</namePart>
<namePart type="given">M.</namePart>
<namePart type="family">Kleinberg</namePart>
<affiliation>Cornell University, Ithaca, NY, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Friedemann</namePart>
<namePart type="family">Mattern</namePart>
<affiliation>ETH Zurich, Switzerland</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">John</namePart>
<namePart type="given">C.</namePart>
<namePart type="family">Mitchell</namePart>
<affiliation>Stanford University, CA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Moni</namePart>
<namePart type="family">Naor</namePart>
<affiliation>Weizmann Institute of Science, Rehovot, Israel</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Oscar</namePart>
<namePart type="family">Nierstrasz</namePart>
<affiliation>University of Bern, Switzerland</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">C.</namePart>
<namePart type="family">Pandu Rangan</namePart>
<affiliation>Indian Institute of Technology, Madras, India</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Bernhard</namePart>
<namePart type="family">Steffen</namePart>
<affiliation>University of Dortmund, Germany</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Madhu</namePart>
<namePart type="family">Sudan</namePart>
<affiliation>Massachusetts Institute of Technology, MA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Demetri</namePart>
<namePart type="family">Terzopoulos</namePart>
<affiliation>University of California, Los Angeles, CA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Doug</namePart>
<namePart type="family">Tygar</namePart>
<affiliation>University of California, Berkeley, CA, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Moshe</namePart>
<namePart type="given">Y.</namePart>
<namePart type="family">Vardi</namePart>
<affiliation>Rice University, Houston, TX, USA</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Gerhard</namePart>
<namePart type="family">Weikum</namePart>
<affiliation>Max-Planck Institute of Computer Science, Saarbruecken, Germany</affiliation>
<role>
<roleTerm type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<copyrightDate encoding="w3cdtf">2006</copyrightDate>
<issuance>serial</issuance>
</originInfo>
<identifier type="ISSN">0302-9743</identifier>
<identifier type="eISSN">1611-3349</identifier>
<identifier type="SeriesID">558</identifier>
<recordInfo>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2006</recordOrigin>
</recordInfo>
</relatedItem>
<identifier type="istex">66E6AA650E9D11341DE243DD7773075E3392078B</identifier>
<identifier type="DOI">10.1007/11880561_19</identifier>
<identifier type="ChapterID">19</identifier>
<identifier type="ChapterID">Chap19</identifier>
<accessCondition type="use and reproduction" contentType="copyright">Springer-Verlag Berlin Heidelberg, 2006</accessCondition>
<recordInfo>
<recordContentSource>SPRINGER</recordContentSource>
<recordOrigin>Springer-Verlag Berlin Heidelberg, 2006</recordOrigin>
</recordInfo>
</mods>
</metadata>
<enrichments>
<istex:refBibTEI uri="https://api.istex.fr/document/66E6AA650E9D11341DE243DD7773075E3392078B/enrichments/refBib">
<teiHeader></teiHeader>
<text>
<front></front>
<body></body>
<back>
<listBibl>
<biblStruct xml:id="b0">
<analytic>
<title level="a" type="main">Examining and Improving the Effectiveness of Relevance Feedback for Retrieval of Scanned Text Documents</title>
<author>
<persName>
<forename type="first">A</forename>
<forename type="middle">M</forename>
<surname>Lam-Adesina</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">G</forename>
<forename type="middle">J F</forename>
<surname>Jones</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">Information Processing and Management</title>
<imprint>
<biblScope unit="volume">43</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="633" to="649"></biblScope>
<date type="published" when="2006"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b1">
<analytic>
<title level="a" type="main">The TREC Spoken Document Retrieval Track: A Success Story</title>
<author>
<persName>
<forename type="first">J</forename>
<forename type="middle">S</forename>
<surname>Garofolo</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">C</forename>
<forename type="middle">G P</forename>
<surname>Auzanne</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">E</forename>
<forename type="middle">M</forename>
<surname>Voorhees</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of the RIAO 2000 Conference: Content-Based Multimedia Information Access</title>
<meeting>the RIAO 2000 Conference: Content-Based Multimedia Information Access
<address>
<addrLine>Paris</addrLine>
</address>
</meeting>
<imprint>
<date type="published" when="2000"></date>
<biblScope unit="page" from="1" to="20"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b2">
<analytic>
<title level="a" type="main">Spoken Document Retrieval for TREC-8 at Cambridge University</title>
<author>
<persName>
<forename type="first">S</forename>
<forename type="middle">E</forename>
<surname>Johnson</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">P</forename>
<surname>Jourlin</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">K</forename>
<surname>Sparck Jones</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">P</forename>
<forename type="middle">C</forename>
<surname>Woodland</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of the Eighth Text REtrieval Conference (TREC-8)</title>
<meeting>the Eighth Text REtrieval Conference (TREC-8)
<address>
<addrLine>Gaithersburg, MD</addrLine>
</address>
</meeting>
<imprint>
<publisher>NIST</publisher>
<date type="published" when="2000"></date>
<biblScope unit="page" from="157" to="168"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b3">
<analytic>
<title level="a" type="main">Overview of the CLEF-2005 Cross-Language Speech Retrieval Track</title>
<author>
<persName>
<forename type="first">R</forename>
<forename type="middle">W</forename>
<surname>White</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<forename type="middle">W</forename>
<surname>Oard</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">G</forename>
<forename type="middle">J F</forename>
<surname>Jones</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<surname>Soergel</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">X</forename>
<surname>Huang</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of the CLEF 2005 Workshop</title>
<meeting>the CLEF 2005 Workshop
<address>
<addrLine>Vienna</addrLine>
</address>
</meeting>
<imprint>
<date type="published" when="2005"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b4">
<analytic>
<title level="a" type="main">The TREC-5 Confusion Track: Comparing Retrieval Methods for Scanned Text</title>
<author>
<persName>
<forename type="first">P</forename>
<forename type="middle">B</forename>
<surname>Kantor</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">E</forename>
<forename type="middle">M</forename>
<surname>Voorhees</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">Information Retrieval</title>
<imprint>
<biblScope unit="volume">2</biblScope>
<biblScope unit="page" from="165" to="176"></biblScope>
<date type="published" when="2000"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b5">
<analytic>
<title level="a" type="main">Evaluation of Model-Based Retrieval Effectiveness with OCR Text</title>
<author>
<persName>
<forename type="first">K</forename>
<surname>Taghva</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">J</forename>
<surname>Borsack</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">A</forename>
<surname>Condit</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">ACM Transactions on Information Systems</title>
<imprint>
<biblScope unit="volume">14</biblScope>
<biblScope unit="issue">1</biblScope>
<biblScope unit="page" from="64" to="93"></biblScope>
<date type="published" when="1996"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b6">
<analytic>
<title level="a" type="main">An Investigation of Mixed-Media Information Retrieval</title>
<author>
<persName>
<forename type="first">G</forename>
<forename type="middle">J F</forename>
<surname>Jones</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">A</forename>
<forename type="middle">M</forename>
<surname>Lam-Adesina</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of the 6th European Conference on Research and Development for Digital Libraries</title>
<meeting>the 6th European Conference on Research and Development for Digital Libraries
<address>
<addrLine>Rome</addrLine>
</address>
</meeting>
<imprint>
<publisher>Springer</publisher>
<date type="published" when="2002"></date>
<biblScope unit="page" from="463" to="478"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b7">
<analytic>
<title level="a" type="main">Okapi at TREC-3</title>
<author>
<persName>
<forename type="first">S</forename>
<forename type="middle">E</forename>
<surname>Robertson</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">S</forename>
<surname>Walker</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">S</forename>
<surname>Jones</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">M</forename>
<forename type="middle">M</forename>
<surname>Hancock-Beaulieu</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">M</forename>
<surname>Gatford</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of the Third Text REtrieval Conference (TREC-3)</title>
<meeting>the Third Text REtrieval Conference (TREC-3)</meeting>
<imprint>
<date type="published" when="1995"></date>
<biblScope unit="page" from="109" to="126"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b8">
<analytic>
<title level="a" type="main">Applying Summarization Techniques for Term Selection in Relevance Feedback</title>
<author>
<persName>
<forename type="first">A</forename>
<forename type="middle">M</forename>
<surname>Lam-Adesina</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">G</forename>
<forename type="middle">J F</forename>
<surname>Jones</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval</title>
<meeting>the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
<address>
<addrLine>New Orleans</addrLine>
</address>
</meeting>
<imprint>
<publisher>ACM</publisher>
<date type="published" when="2001"></date>
<biblScope unit="page" from="1" to="9"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b9">
<analytic>
<title level="a" type="main">Automatic Language Model Adaptation for Spoken Document Retrieval</title>
<author>
<persName>
<forename type="first">C</forename>
<surname>Auzanne</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">J</forename>
<forename type="middle">S</forename>
<surname>Garofolo</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">J</forename>
<forename type="middle">G</forename>
<surname>Fiscus</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">W</forename>
<forename type="middle">M</forename>
<surname>Fisher</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of the RIAO 2000 Conference: Content-Based Multimedia Information Access</title>
<meeting>the RIAO 2000 Conference: Content-Based Multimedia Information Access
<address>
<addrLine>Paris</addrLine>
</address>
</meeting>
<imprint>
<date type="published" when="2000"></date>
<biblScope unit="page" from="1" to="20"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b10">
<monogr>
<title level="m" type="main">Information Retrieval from Mixed-Media Collections: Report on Design and Indexing of a Scanned Document Collection</title>
<author>
<persName>
<forename type="first">G</forename>
<forename type="middle">J F</forename>
<surname>Jones</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">M</forename>
<surname>Han</surname>
</persName>
</author>
<imprint>
<date type="published" when="2001-01"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b11">
<analytic>
<title level="a" type="main">Information Retrieval can Cope with Many Errors</title>
<author>
<persName>
<forename type="first">E</forename>
<surname>Mittendorf</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">P</forename>
<surname>Schauble</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="j">Information Retrieval</title>
<imprint>
<biblScope unit="volume">3</biblScope>
<biblScope unit="page" from="189" to="216"></biblScope>
<date type="published" when="2000"></date>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b12">
<analytic>
<title level="a" type="main">Phonetic String Matching: Lessons from Information Retrieval</title>
<author>
<persName>
<forename type="first">J</forename>
<surname>Zobel</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">P</forename>
<surname>Dart</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval</title>
<meeting>the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
<address>
<addrLine>Zurich</addrLine>
</address>
</meeting>
<imprint>
<publisher>ACM</publisher>
<date type="published" when="1996"></date>
<biblScope unit="page" from="30" to="38"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b13">
<analytic>
<title level="a" type="main">Document Expansion for Speech Retrieval</title>
<author>
<persName>
<forename type="first">A</forename>
<surname>Singhal</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">F</forename>
<forename type="middle">C N</forename>
<surname>Pereira</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval</title>
<meeting>the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
<address>
<addrLine>Berkeley</addrLine>
</address>
</meeting>
<imprint>
<publisher>ACM</publisher>
<date type="published" when="1999"></date>
<biblScope unit="page" from="34" to="41"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b14">
<analytic>
<title level="a" type="main">A Statistical Approach to Automatic OCR Error Correction in Context</title>
<author>
<persName>
<forename type="first">X</forename>
<surname>Tong</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">D</forename>
<surname>Evans</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of the Fourth Workshop on Very Large Corpora</title>
<meeting>the Fourth Workshop on Very Large Corpora
<address>
<addrLine>Copenhagen</addrLine>
</address>
</meeting>
<imprint>
<date type="published" when="1996"></date>
<biblScope unit="page" from="88" to="100"></biblScope>
</imprint>
</monogr>
</biblStruct>
<biblStruct xml:id="b15">
<analytic>
<title level="a" type="main">Improved String Matching Under Noisy Channel Conditions</title>
<author>
<persName>
<forename type="first">K</forename>
<surname>Collins-Thompson</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">C</forename>
<surname>Schweizer</surname>
</persName>
</author>
<author>
<persName>
<forename type="first">S</forename>
<surname>Dumais</surname>
</persName>
</author>
</analytic>
<monogr>
<title level="m">Proceedings of the Tenth International Conference on Information and Knowledge Management</title>
<meeting>the Tenth International Conference on Information and Knowledge Management
<address>
<addrLine>Atlanta</addrLine>
</address>
</meeting>
<imprint>
<publisher>ACM</publisher>
<date type="published" when="2001"></date>
<biblScope unit="page" from="357" to="364"></biblScope>
</imprint>
</monogr>
</biblStruct>
</listBibl>
</back>
</text>
</istex:refBibTEI>
</enrichments>
</istex>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Istex/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000256 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Istex/Corpus/biblio.hfd -nk 000256 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Istex
   |étape=   Corpus
   |type=    RBID
   |clé=     ISTEX:66E6AA650E9D11341DE243DD7773075E3392078B
   |texte=   Using String Comparison in Context for Improved Relevance Feedback in Different Text Media
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024