Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

AN APPLICATION OF LANGUAGE PROCESSING FOR A SEARCH INTERFACE

Identifieur interne : 001F80 ( Istex/Corpus ); précédent : 001F79; suivant : 001F81

AN APPLICATION OF LANGUAGE PROCESSING FOR A SEARCH INTERFACE

Auteurs : Brian Vickery ; Alina Vickery

Source :

RBID : ISTEX:E43E2B499565C1802EBD1FB28168193D6C214145

Abstract

The paper describes techniques developed by Tome Associates to process natural language queries into search statements suitable for transmission to online text database systems. The problems discussed include word identification, the handling of unknown words, the contents and structure of system dictionaries, the use of semantic categories and classification, disambiguation of multimeaning words, stemming and truncation, noun compounds and indications of relationship between search terms.

Url:
DOI: 10.1108/eb026897

Links to Exploration step

ISTEX:E43E2B499565C1802EBD1FB28168193D6C214145

Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">AN APPLICATION OF LANGUAGE PROCESSING FOR A SEARCH INTERFACE</title>
<author>
<name sortKey="Vickery, Brian" sort="Vickery, Brian" uniqKey="Vickery B" first="Brian" last="Vickery">Brian Vickery</name>
<affiliation>
<mods:affiliation>Ealing, London</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Vickery, Alina" sort="Vickery, Alina" uniqKey="Vickery A" first="Alina" last="Vickery">Alina Vickery</name>
<affiliation>
<mods:affiliation>Ealing, London</mods:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:E43E2B499565C1802EBD1FB28168193D6C214145</idno>
<date when="1992" year="1992">1992</date>
<idno type="doi">10.1108/eb026897</idno>
<idno type="url">https://api.istex.fr/document/E43E2B499565C1802EBD1FB28168193D6C214145/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">001F80</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">AN APPLICATION OF LANGUAGE PROCESSING FOR A SEARCH INTERFACE</title>
<author>
<name sortKey="Vickery, Brian" sort="Vickery, Brian" uniqKey="Vickery B" first="Brian" last="Vickery">Brian Vickery</name>
<affiliation>
<mods:affiliation>Ealing, London</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Vickery, Alina" sort="Vickery, Alina" uniqKey="Vickery A" first="Alina" last="Vickery">Alina Vickery</name>
<affiliation>
<mods:affiliation>Ealing, London</mods:affiliation>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j">Journal of Documentation</title>
<idno type="ISSN">0022-0418</idno>
<imprint>
<publisher>MCB UP Ltd</publisher>
<date type="published" when="1992-03-01">1992-03-01</date>
<biblScope unit="volume">48</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="255">255</biblScope>
<biblScope unit="page" to="275">275</biblScope>
</imprint>
<idno type="ISSN">0022-0418</idno>
</series>
<idno type="istex">E43E2B499565C1802EBD1FB28168193D6C214145</idno>
<idno type="DOI">10.1108/eb026897</idno>
<idno type="filenameID">2780480301</idno>
<idno type="original-pdf">2780480301.pdf</idno>
<idno type="href">eb026897.pdf</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0022-0418</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">The paper describes techniques developed by Tome Associates to process natural language queries into search statements suitable for transmission to online text database systems. The problems discussed include word identification, the handling of unknown words, the contents and structure of system dictionaries, the use of semantic categories and classification, disambiguation of multimeaning words, stemming and truncation, noun compounds and indications of relationship between search terms.</div>
</front>
</TEI>
<istex>
<corpusName>emerald</corpusName>
<author>
<json:item>
<name>BRIAN VICKERY</name>
<affiliations>
<json:string>Ealing, London</json:string>
</affiliations>
</json:item>
<json:item>
<name>ALINA VICKERY</name>
<affiliations>
<json:string>Ealing, London</json:string>
</affiliations>
</json:item>
</author>
<language>
<json:string>eng</json:string>
</language>
<abstract>The paper describes techniques developed by Tome Associates to process natural language queries into search statements suitable for transmission to online text database systems. The problems discussed include word identification, the handling of unknown words, the contents and structure of system dictionaries, the use of semantic categories and classification, disambiguation of multimeaning words, stemming and truncation, noun compounds and indications of relationship between search terms.</abstract>
<qualityIndicators>
<score>6.28</score>
<pdfVersion>1.4</pdfVersion>
<pdfPageSize>498.96 x 706.8 pts</pdfPageSize>
<refBibsNative>false</refBibsNative>
<keywordCount>0</keywordCount>
<abstractCharCount>494</abstractCharCount>
<pdfWordCount>6703</pdfWordCount>
<pdfCharCount>41352</pdfCharCount>
<pdfPageCount>21</pdfPageCount>
<abstractWordCount>65</abstractWordCount>
</qualityIndicators>
<title>AN APPLICATION OF LANGUAGE PROCESSING FOR A SEARCH INTERFACE</title>
<genre.original>
<json:string>review-article</json:string>
</genre.original>
<genre>
<json:string>review-article</json:string>
</genre>
<host>
<volume>48</volume>
<publisherId>
<json:string>jd</json:string>
</publisherId>
<pages>
<last>275</last>
<first>255</first>
</pages>
<issn>
<json:string>0022-0418</json:string>
</issn>
<issue>3</issue>
<subject>
<json:item>
<value>Information & knowledge management</value>
</json:item>
<json:item>
<value>Information & communications technology</value>
</json:item>
<json:item>
<value>Information management & governance</value>
</json:item>
<json:item>
<value>Internet</value>
</json:item>
<json:item>
<value>Information management</value>
</json:item>
<json:item>
<value>Library & information science</value>
</json:item>
<json:item>
<value>Classification & cataloguing</value>
</json:item>
<json:item>
<value>Collection building & management</value>
</json:item>
<json:item>
<value>Information behaviour & retrieval</value>
</json:item>
<json:item>
<value>Records management & preservation</value>
</json:item>
<json:item>
<value>Scholarly communications/publishing</value>
</json:item>
<json:item>
<value>Document management</value>
</json:item>
</subject>
<genre>
<json:string>Journal</json:string>
</genre>
<language>
<json:string>unknown</json:string>
</language>
<title>Journal of Documentation</title>
<doi>
<json:string>10.1108/jd</json:string>
</doi>
</host>
<publicationDate>1992</publicationDate>
<copyrightDate>1992</copyrightDate>
<doi>
<json:string>10.1108/eb026897</json:string>
</doi>
<id>E43E2B499565C1802EBD1FB28168193D6C214145</id>
<fulltext>
<json:item>
<original>true</original>
<mimetype>application/pdf</mimetype>
<extension>pdf</extension>
<uri>https://api.istex.fr/document/E43E2B499565C1802EBD1FB28168193D6C214145/fulltext/pdf</uri>
</json:item>
<json:item>
<original>false</original>
<mimetype>application/zip</mimetype>
<extension>zip</extension>
<uri>https://api.istex.fr/document/E43E2B499565C1802EBD1FB28168193D6C214145/fulltext/zip</uri>
</json:item>
<istex:fulltextTEI uri="https://api.istex.fr/document/E43E2B499565C1802EBD1FB28168193D6C214145/fulltext/tei">
<teiHeader>
<fileDesc>
<titleStmt>
<title level="a" type="main" xml:lang="en">AN APPLICATION OF LANGUAGE PROCESSING FOR A SEARCH INTERFACE</title>
<respStmt xml:id="ISTEX-API" resp="Références bibliographiques récupérées via GROBID" name="ISTEX-API (INIST-CNRS)"></respStmt>
</titleStmt>
<publicationStmt>
<authority>ISTEX</authority>
<publisher>MCB UP Ltd</publisher>
<availability>
<p>EMERALD</p>
</availability>
<date>1992</date>
</publicationStmt>
<sourceDesc>
<biblStruct type="inbook">
<analytic>
<title level="a" type="main" xml:lang="en">AN APPLICATION OF LANGUAGE PROCESSING FOR A SEARCH INTERFACE</title>
<author>
<persName>
<forename type="first">BRIAN</forename>
<surname>VICKERY</surname>
</persName>
<affiliation>Ealing, London</affiliation>
</author>
<author>
<persName>
<forename type="first">ALINA</forename>
<surname>VICKERY</surname>
</persName>
<affiliation>Ealing, London</affiliation>
</author>
</analytic>
<monogr>
<title level="j">Journal of Documentation</title>
<idno type="pISSN">0022-0418</idno>
<idno type="DOI">10.1108/jd</idno>
<imprint>
<publisher>MCB UP Ltd</publisher>
<date type="published" when="1992-03-01"></date>
<biblScope unit="volume">48</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="255">255</biblScope>
<biblScope unit="page" to="275">275</biblScope>
</imprint>
</monogr>
<idno type="istex">E43E2B499565C1802EBD1FB28168193D6C214145</idno>
<idno type="DOI">10.1108/eb026897</idno>
<idno type="filenameID">2780480301</idno>
<idno type="original-pdf">2780480301.pdf</idno>
<idno type="href">eb026897.pdf</idno>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<creation>
<date>1992</date>
</creation>
<langUsage>
<language ident="en">en</language>
</langUsage>
<abstract xml:lang="en">
<p>The paper describes techniques developed by Tome Associates to process natural language queries into search statements suitable for transmission to online text database systems. The problems discussed include word identification, the handling of unknown words, the contents and structure of system dictionaries, the use of semantic categories and classification, disambiguation of multimeaning words, stemming and truncation, noun compounds and indications of relationship between search terms.</p>
</abstract>
<textClass>
<keywords scheme="Emerald Subject Group">
<list>
<label>cat-IKM</label>
<label>cat-ICT</label>
<label>cat-IMG</label>
<label>cat-INT</label>
<label>cat-IMAN</label>
<item>
<term>Information & knowledge management</term>
</item>
<item>
<term>Information & communications technology</term>
</item>
<item>
<term>Information management & governance</term>
</item>
<item>
<term>Internet</term>
</item>
<item>
<term>Information management</term>
</item>
</list>
</keywords>
</textClass>
<textClass>
<keywords scheme="Emerald Subject Group">
<list>
<label>cat-LISC</label>
<label>cat-CCAT</label>
<label>cat-CBM</label>
<label>cat-IBRT</label>
<label>cat-RMP</label>
<label>cat-SCPG</label>
<label>cat-DOCM</label>
<item>
<term>Library & information science</term>
</item>
<item>
<term>Classification & cataloguing</term>
</item>
<item>
<term>Collection building & management</term>
</item>
<item>
<term>Information behaviour & retrieval</term>
</item>
<item>
<term>Records management & preservation</term>
</item>
<item>
<term>Scholarly communications/publishing</term>
</item>
<item>
<term>Document management</term>
</item>
</list>
</keywords>
</textClass>
</profileDesc>
<revisionDesc>
<change when="1992-03-01">Published</change>
<change xml:id="refBibs-istex" who="#ISTEX-API" when="2016-3-13">References added</change>
</revisionDesc>
</teiHeader>
</istex:fulltextTEI>
<json:item>
<original>false</original>
<mimetype>text/plain</mimetype>
<extension>txt</extension>
<uri>https://api.istex.fr/document/E43E2B499565C1802EBD1FB28168193D6C214145/fulltext/txt</uri>
</json:item>
</fulltext>
<metadata>
<istex:metadataXml wicri:clean="corpus emerald not found" wicri:toSee="no header">
<istex:xmlDeclaration>version="1.0" encoding="UTF-8"</istex:xmlDeclaration>
<istex:document><!-- Auto generated NISO JATS XML created by Atypon out of MCB DTD source files. Do Not Edit! -->
<article dtd-version="1.0" xml:lang="en" article-type="review-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">jd</journal-id>
<journal-id journal-id-type="doi">10.1108/jd</journal-id>
<journal-title-group>
<journal-title>Journal of Documentation</journal-title>
</journal-title-group>
<issn pub-type="ppub">0022-0418</issn>
<publisher>
<publisher-name>MCB UP Ltd</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.1108/eb026897</article-id>
<article-id pub-id-type="original-pdf">2780480301.pdf</article-id>
<article-id pub-id-type="filename">2780480301</article-id>
<article-categories>
<subj-group subj-group-type="type-of-publication">
<compound-subject>
<compound-subject-part content-type="code">review-article</compound-subject-part>
<compound-subject-part content-type="label">General review</compound-subject-part>
</compound-subject>
</subj-group>
<subj-group subj-group-type="subject">
<compound-subject>
<compound-subject-part content-type="code">cat-IKM</compound-subject-part>
<compound-subject-part content-type="label">Information & knowledge management</compound-subject-part>
</compound-subject>
<subj-group>
<compound-subject>
<compound-subject-part content-type="code">cat-ICT</compound-subject-part>
<compound-subject-part content-type="label">Information & communications technology</compound-subject-part>
</compound-subject>
<subj-group>
<compound-subject>
<compound-subject-part content-type="code">cat-INT</compound-subject-part>
<compound-subject-part content-type="label">Internet</compound-subject-part>
</compound-subject>
</subj-group>
</subj-group>
<subj-group>
<compound-subject>
<compound-subject-part content-type="code">cat-IMG</compound-subject-part>
<compound-subject-part content-type="label">Information management & governance</compound-subject-part>
</compound-subject>
<subj-group>
<compound-subject>
<compound-subject-part content-type="code">cat-IMAN</compound-subject-part>
<compound-subject-part content-type="label">Information management</compound-subject-part>
</compound-subject>
</subj-group>
</subj-group>
</subj-group>
<subj-group subj-group-type="subject">
<compound-subject>
<compound-subject-part content-type="code">cat-LISC</compound-subject-part>
<compound-subject-part content-type="label">Library & information science</compound-subject-part>
</compound-subject>
<subj-group>
<compound-subject>
<compound-subject-part content-type="code">cat-CCAT</compound-subject-part>
<compound-subject-part content-type="label">Classification & cataloguing</compound-subject-part>
</compound-subject>
</subj-group>
<subj-group>
<compound-subject>
<compound-subject-part content-type="code">cat-CBM</compound-subject-part>
<compound-subject-part content-type="label">Collection building & management</compound-subject-part>
</compound-subject>
<subj-group>
<compound-subject>
<compound-subject-part content-type="code">cat-SCPG</compound-subject-part>
<compound-subject-part content-type="label">Scholarly communications/publishing</compound-subject-part>
</compound-subject>
</subj-group>
</subj-group>
<subj-group>
<compound-subject>
<compound-subject-part content-type="code">cat-IBRT</compound-subject-part>
<compound-subject-part content-type="label">Information behaviour & retrieval</compound-subject-part>
</compound-subject>
</subj-group>
<subj-group>
<compound-subject>
<compound-subject-part content-type="code">cat-RMP</compound-subject-part>
<compound-subject-part content-type="label">Records management & preservation</compound-subject-part>
</compound-subject>
<subj-group>
<compound-subject>
<compound-subject-part content-type="code">cat-DOCM</compound-subject-part>
<compound-subject-part content-type="label">Document management</compound-subject-part>
</compound-subject>
</subj-group>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>AN APPLICATION OF LANGUAGE PROCESSING FOR A SEARCH INTERFACE</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<string-name>
<given-names>BRIAN</given-names>
<surname>VICKERY</surname>
</string-name>
<aff>Ealing, London</aff>
</contrib>
<x></x>
<contrib contrib-type="author">
<string-name>
<given-names>ALINA</given-names>
<surname>VICKERY</surname>
</string-name>
<aff>Ealing, London</aff>
</contrib>
</contrib-group>
<pub-date pub-type="ppub">
<day>1</day>
<month>3</month>
<year>1992</year>
</pub-date>
<volume>48</volume>
<issue>3</issue>
<fpage>255</fpage>
<lpage>275</lpage>
<permissions>
<copyright-statement>© MCB UP Limited</copyright-statement>
<copyright-year>1992</copyright-year>
<license license-type="publisher">
<license-p></license-p>
</license>
</permissions>
<self-uri content-type="pdf" xlink:href="eb026897.pdf"></self-uri>
<abstract>
<p>The paper describes techniques developed by Tome Associates to process natural language queries into search statements suitable for transmission to online text database systems. The problems discussed include word identification, the handling of unknown words, the contents and structure of system dictionaries, the use of semantic categories and classification, disambiguation of multi‐meaning words, stemming and truncation, noun compounds and indications of relationship between search terms.</p>
</abstract>
<custom-meta-group>
<custom-meta>
<meta-name>peer-reviewed</meta-name>
<meta-value>no</meta-value>
</custom-meta>
<custom-meta>
<meta-name>academic-content</meta-name>
<meta-value>yes</meta-value>
</custom-meta>
<custom-meta>
<meta-name>rightslink</meta-name>
<meta-value>included</meta-value>
</custom-meta>
</custom-meta-group>
</article-meta>
</front>
</article>
</istex:document>
</istex:metadataXml>
<mods version="3.6">
<titleInfo lang="en">
<title>AN APPLICATION OF LANGUAGE PROCESSING FOR A SEARCH INTERFACE</title>
</titleInfo>
<titleInfo type="alternative" lang="en" contentType="CDATA">
<title>AN APPLICATION OF LANGUAGE PROCESSING FOR A SEARCH INTERFACE</title>
</titleInfo>
<name type="personal">
<namePart type="given">BRIAN</namePart>
<namePart type="family">VICKERY</namePart>
<affiliation>Ealing, London</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">ALINA</namePart>
<namePart type="family">VICKERY</namePart>
<affiliation>Ealing, London</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<typeOfResource>text</typeOfResource>
<genre type="review-article" displayLabel="review-article"></genre>
<originInfo>
<publisher>MCB UP Ltd</publisher>
<dateIssued encoding="w3cdtf">1992-03-01</dateIssued>
<copyrightDate encoding="w3cdtf">1992</copyrightDate>
</originInfo>
<language>
<languageTerm type="code" authority="iso639-2b">eng</languageTerm>
<languageTerm type="code" authority="rfc3066">en</languageTerm>
</language>
<physicalDescription>
<internetMediaType>text/html</internetMediaType>
</physicalDescription>
<abstract lang="en">The paper describes techniques developed by Tome Associates to process natural language queries into search statements suitable for transmission to online text database systems. The problems discussed include word identification, the handling of unknown words, the contents and structure of system dictionaries, the use of semantic categories and classification, disambiguation of multimeaning words, stemming and truncation, noun compounds and indications of relationship between search terms.</abstract>
<relatedItem type="host">
<titleInfo>
<title>Journal of Documentation</title>
</titleInfo>
<genre type="Journal">journal</genre>
<subject>
<genre>Emerald Subject Group</genre>
<topic authority="SubjectCodesPrimary" authorityURI="cat-IKM">Information & knowledge management</topic>
<topic authority="SubjectCodesSecondary" authorityURI="cat-ICT">Information & communications technology</topic>
<topic authority="SubjectCodesSecondary" authorityURI="cat-IMG">Information management & governance</topic>
<topic authority="SubjectCodesSecondary" authorityURI="cat-INT">Internet</topic>
<topic authority="SubjectCodesSecondary" authorityURI="cat-IMAN">Information management</topic>
</subject>
<subject>
<genre>Emerald Subject Group</genre>
<topic authority="SubjectCodesPrimary" authorityURI="cat-LISC">Library & information science</topic>
<topic authority="SubjectCodesSecondary" authorityURI="cat-CCAT">Classification & cataloguing</topic>
<topic authority="SubjectCodesSecondary" authorityURI="cat-CBM">Collection building & management</topic>
<topic authority="SubjectCodesSecondary" authorityURI="cat-IBRT">Information behaviour & retrieval</topic>
<topic authority="SubjectCodesSecondary" authorityURI="cat-RMP">Records management & preservation</topic>
<topic authority="SubjectCodesSecondary" authorityURI="cat-SCPG">Scholarly communications/publishing</topic>
<topic authority="SubjectCodesSecondary" authorityURI="cat-DOCM">Document management</topic>
</subject>
<identifier type="ISSN">0022-0418</identifier>
<identifier type="PublisherID">jd</identifier>
<identifier type="DOI">10.1108/jd</identifier>
<part>
<date>1992</date>
<detail type="volume">
<caption>vol.</caption>
<number>48</number>
</detail>
<detail type="issue">
<caption>no.</caption>
<number>3</number>
</detail>
<extent unit="pages">
<start>255</start>
<end>275</end>
</extent>
</part>
</relatedItem>
<identifier type="istex">E43E2B499565C1802EBD1FB28168193D6C214145</identifier>
<identifier type="DOI">10.1108/eb026897</identifier>
<identifier type="filenameID">2780480301</identifier>
<identifier type="original-pdf">2780480301.pdf</identifier>
<identifier type="href">eb026897.pdf</identifier>
<accessCondition type="use and reproduction" contentType="copyright">© MCB UP Limited</accessCondition>
<recordInfo>
<recordContentSource>EMERALD</recordContentSource>
</recordInfo>
</mods>
</metadata>
<serie></serie>
</istex>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Istex/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001F80 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Istex/Corpus/biblio.hfd -nk 001F80 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Istex
   |étape=   Corpus
   |type=    RBID
   |clé=     ISTEX:E43E2B499565C1802EBD1FB28168193D6C214145
   |texte=   AN APPLICATION OF LANGUAGE PROCESSING FOR A SEARCH INTERFACE
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024