Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Automatic thesaurus generation for an electronic community system

Identifieur interne : 002C93 ( Main/Merge ); précédent : 002C92; suivant : 002C94

Automatic thesaurus generation for an electronic community system

Auteurs : Hsinchun Chen [États-Unis] ; Tak Yim [États-Unis] ; David Fye [États-Unis] ; Bruce Schatz [États-Unis]

Source :

RBID : ISTEX:D27D40BBA08B13185741AC8C1973CA447E9DFA90

Abstract

This research reports an algorithmic approach to the automatic generation of thesauri for electronic community systems. The techniques used included term filtering, automatic indexing, and cluster analysis. The testbed for our research was the Worm Community System, which contains a comprehensive library of specialized community data and literature, currently in use by molecular biologists who study the nematode worm C. elegans. The resulting worm thesaurus included 2709 researchers' names, 798 gene names, 20 experimental methods, and 4302 subject descriptors. On average, each term had about 90 weighted neighboring terms indicating relevant concepts. The thesaurus was developed as an online search aide. We tested the worm thesaurus in an experiment with six worm researchers of varying degrees of expertise and background. The experiment showed that the thesaurus was an excellent “memory‐jogging” device and that it supported learning and serendipitous browsing. Despite some occurrences of obvious noise, the system was useful in suggesting relevant concepts for the researchers' queries and it helped improve concept recall. With a simple browsing interface, an automatic thesaurus can become a useful tool for online search and can assist researchers in exploring and traversing a dynamic and complex electronic community system. © 1995 John Wiley & Sons, Inc.

Url:
DOI: 10.1002/(SICI)1097-4571(199504)46:3<175::AID-ASI3>3.0.CO;2-U

Links toward previous steps (curation, corpus...)


Links to Exploration step

ISTEX:D27D40BBA08B13185741AC8C1973CA447E9DFA90

Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Automatic thesaurus generation for an electronic community system</title>
<author>
<name sortKey="Chen, Hsinchun" sort="Chen, Hsinchun" uniqKey="Chen H" first="Hsinchun" last="Chen">Hsinchun Chen</name>
</author>
<author>
<name sortKey="Yim, Tak" sort="Yim, Tak" uniqKey="Yim T" first="Tak" last="Yim">Tak Yim</name>
</author>
<author>
<name sortKey="Fye, David" sort="Fye, David" uniqKey="Fye D" first="David" last="Fye">David Fye</name>
</author>
<author>
<name sortKey="Schatz, Bruce" sort="Schatz, Bruce" uniqKey="Schatz B" first="Bruce" last="Schatz">Bruce Schatz</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:D27D40BBA08B13185741AC8C1973CA447E9DFA90</idno>
<date when="1995" year="1995">1995</date>
<idno type="doi">10.1002/(SICI)1097-4571(199504)46:3<175::AID-ASI3>3.0.CO;2-U</idno>
<idno type="url">https://api.istex.fr/document/D27D40BBA08B13185741AC8C1973CA447E9DFA90/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">002259</idno>
<idno type="wicri:Area/Istex/Curation">002106</idno>
<idno type="wicri:Area/Istex/Checkpoint">001E93</idno>
<idno type="wicri:doubleKey">0002-8231:1995:Chen H:automatic:thesaurus:generation</idno>
<idno type="wicri:Area/Main/Merge">002C93</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Automatic thesaurus generation for an electronic community system</title>
<author>
<name sortKey="Chen, Hsinchun" sort="Chen, Hsinchun" uniqKey="Chen H" first="Hsinchun" last="Chen">Hsinchun Chen</name>
<affiliation wicri:level="1">
<country wicri:rule="url">États-Unis</country>
<wicri:regionArea>University of Arizona, Management Information Systems Department, Karl Eller Graduate School of Management, McClelland Hall 430Z, Tucson</wicri:regionArea>
<wicri:noRegion>Tucson</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Yim, Tak" sort="Yim, Tak" uniqKey="Yim T" first="Tak" last="Yim">Tak Yim</name>
<affiliation wicri:level="1">
<country wicri:rule="url">États-Unis</country>
<wicri:regionArea>University of Arizona, Management Information Systems Department, Karl Eller Graduate School of Management, McClelland Hall 430Z, Tucson</wicri:regionArea>
<wicri:noRegion>Tucson</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Fye, David" sort="Fye, David" uniqKey="Fye D" first="David" last="Fye">David Fye</name>
<affiliation wicri:level="1">
<country wicri:rule="url">États-Unis</country>
<wicri:regionArea>University of Arizona, Management Information Systems Department, Karl Eller Graduate School of Management, McClelland Hall 430Z, Tucson</wicri:regionArea>
<wicri:noRegion>Tucson</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Schatz, Bruce" sort="Schatz, Bruce" uniqKey="Schatz B" first="Bruce" last="Schatz">Bruce Schatz</name>
<affiliation wicri:level="2">
<country xml:lang="fr">États-Unis</country>
<placeName>
<region type="state">Illinois</region>
</placeName>
<wicri:cityArea>University of Illinois, Graduate School of Library and Information Science, National Center for Supercomputing Applications, Beckman Institute, 405 N. Mathews Avenue, Urbana</wicri:cityArea>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">États-Unis</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j">Journal of the American Society for Information Science</title>
<title level="j" type="abbrev">J. Am. Soc. Inf. Sci.</title>
<idno type="ISSN">0002-8231</idno>
<idno type="eISSN">1097-4571</idno>
<imprint>
<publisher>Wiley Subscription Services, Inc., A Wiley Company</publisher>
<pubPlace>Washington, D.C.</pubPlace>
<date type="published" when="1995-04">1995-04</date>
<biblScope unit="volume">46</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="175">175</biblScope>
<biblScope unit="page" to="193">193</biblScope>
</imprint>
<idno type="ISSN">0002-8231</idno>
</series>
<idno type="istex">D27D40BBA08B13185741AC8C1973CA447E9DFA90</idno>
<idno type="DOI">10.1002/(SICI)1097-4571(199504)46:3<175::AID-ASI3>3.0.CO;2-U</idno>
<idno type="ArticleID">ASI3</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0002-8231</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">This research reports an algorithmic approach to the automatic generation of thesauri for electronic community systems. The techniques used included term filtering, automatic indexing, and cluster analysis. The testbed for our research was the Worm Community System, which contains a comprehensive library of specialized community data and literature, currently in use by molecular biologists who study the nematode worm C. elegans. The resulting worm thesaurus included 2709 researchers' names, 798 gene names, 20 experimental methods, and 4302 subject descriptors. On average, each term had about 90 weighted neighboring terms indicating relevant concepts. The thesaurus was developed as an online search aide. We tested the worm thesaurus in an experiment with six worm researchers of varying degrees of expertise and background. The experiment showed that the thesaurus was an excellent “memory‐jogging” device and that it supported learning and serendipitous browsing. Despite some occurrences of obvious noise, the system was useful in suggesting relevant concepts for the researchers' queries and it helped improve concept recall. With a simple browsing interface, an automatic thesaurus can become a useful tool for online search and can assist researchers in exploring and traversing a dynamic and complex electronic community system. © 1995 John Wiley & Sons, Inc.</div>
</front>
</TEI>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002C93 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Merge/biblio.hfd -nk 002C93 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Merge
   |type=    RBID
   |clé=     ISTEX:D27D40BBA08B13185741AC8C1973CA447E9DFA90
   |texte=   Automatic thesaurus generation for an electronic community system
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024