Serveur d'exploration sur SGML

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Comparing noun phrasing techniques for use with medical digital library tools

Identifieur interne : 004096 ( Istex/Corpus ); précédent : 004095; suivant : 004097

Comparing noun phrasing techniques for use with medical digital library tools

Auteurs : Kristin M. Tolle ; Hsinchun Chen

Source :

RBID : ISTEX:F89A78E6F114B4601DE09E6221D5DF8DC95B240F

English descriptors

Abstract

In an effort to assist medical researchers and professionals in accessing information necessary for their work, the A1 Lab at the University of Arizona is investigating the use of a natural language processing (NLP) technique called noun phrasing. The goal of this research is to determine whether noun phrasing could be a viable technique to include in medical information retrieval applications. Four noun phrase generation tools were evaluated as to their ability to isolate noun phrases from medical journal abstracts. Tests were conducted using the National Cancer Institute's CANCERLIT database. The NLP tools evaluated were Massachusetts Institute of Technology's (MIT's) Chopper, The University of Arizona's Automatic Indexer, Lingsoft's NPtool, and The University of Arizona's AZ Noun Phraser. In addition, the National Library of Medicine's SPECIALIST Lexicon was incorporated into two versions of the AZ Noun Phraser to be evaluated against the other tools as well as a nonaugmented version of the AZ Noun Phraser. Using the metrics relative subject recall and precision, our results show that, with the exception of Chopper, the phrasing tools were fairly comparable in recall and precision. It was also shown that augmenting the AZ Noun Phraser by including the SPECIALIST Lexicon from the National Library of Medicine resulted in improved recall and precision.

Url:
DOI: 10.1002/(SICI)1097-4571(2000)51:4<352::AID-ASI5>3.0.CO;2-8

Links to Exploration step

ISTEX:F89A78E6F114B4601DE09E6221D5DF8DC95B240F

Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Comparing noun phrasing techniques for use with medical digital library tools</title>
<author>
<name sortKey="Tolle, Kristin M" sort="Tolle, Kristin M" uniqKey="Tolle K" first="Kristin M." last="Tolle">Kristin M. Tolle</name>
<affiliation>
<mods:affiliation>E-mail: ktolle@bpa.arizona.edu</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>Management Information Systems Department, University of Arizona, Tucson, AZ 85721</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: ktolle@bpa.arizona.edu</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Chen, Hsinchun" sort="Chen, Hsinchun" uniqKey="Chen H" first="Hsinchun" last="Chen">Hsinchun Chen</name>
<affiliation>
<mods:affiliation>Management Information Systems Department, University of Arizona, Tucson, AZ 85721</mods:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:F89A78E6F114B4601DE09E6221D5DF8DC95B240F</idno>
<date when="2000" year="2000">2000</date>
<idno type="doi">10.1002/(SICI)1097-4571(2000)51:4<352::AID-ASI5>3.0.CO;2-8</idno>
<idno type="url">https://api.istex.fr/ark:/67375/WNG-RRGVX20X-M/fulltext.pdf</idno>
<idno type="wicri:Area/Istex/Corpus">004096</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Corpus" wicri:corpus="ISTEX">004096</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Comparing noun phrasing techniques for use with medical digital library tools</title>
<author>
<name sortKey="Tolle, Kristin M" sort="Tolle, Kristin M" uniqKey="Tolle K" first="Kristin M." last="Tolle">Kristin M. Tolle</name>
<affiliation>
<mods:affiliation>E-mail: ktolle@bpa.arizona.edu</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>Management Information Systems Department, University of Arizona, Tucson, AZ 85721</mods:affiliation>
</affiliation>
<affiliation>
<mods:affiliation>E-mail: ktolle@bpa.arizona.edu</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Chen, Hsinchun" sort="Chen, Hsinchun" uniqKey="Chen H" first="Hsinchun" last="Chen">Hsinchun Chen</name>
<affiliation>
<mods:affiliation>Management Information Systems Department, University of Arizona, Tucson, AZ 85721</mods:affiliation>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j" type="main">Journal of the American Society for Information Science</title>
<title level="j" type="sub">Digital Libraries: Part 2</title>
<title level="j" type="alt">JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE</title>
<idno type="ISSN">0002-8231</idno>
<idno type="eISSN">1097-4571</idno>
<imprint>
<biblScope unit="vol">51</biblScope>
<biblScope unit="issue">4</biblScope>
<biblScope unit="page" from="352">352</biblScope>
<biblScope unit="page" to="370">370</biblScope>
<biblScope unit="page-count">19</biblScope>
<publisher>John Wiley & Sons, Inc.</publisher>
<pubPlace>New York</pubPlace>
<date type="published" when="2000">2000</date>
</imprint>
<idno type="ISSN">0002-8231</idno>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0002-8231</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="Teeft" xml:lang="en">
<term>American society</term>
<term>Anick vaithyanathan</term>
<term>Annual symposium</term>
<term>Automatic indexing</term>
<term>Aznp</term>
<term>Boguraev</term>
<term>Boguraev pustejovski</term>
<term>Brill</term>
<term>Brill tagger</term>
<term>Brown corpus</term>
<term>Cambridge university press</term>
<term>Cancer research results</term>
<term>Cancerlit</term>
<term>Cancerlit collection</term>
<term>Chen</term>
<term>Chopper</term>
<term>Computational linguistics</term>
<term>Computer applications</term>
<term>Concept extraction</term>
<term>Concept space</term>
<term>Concept spaces</term>
<term>Current version</term>
<term>Data collections</term>
<term>Database</term>
<term>Detmer shortliffe</term>
<term>Different domains</term>
<term>Different phrases</term>
<term>Different techniques</term>
<term>Different versions</term>
<term>Digital libraries</term>
<term>Document collection</term>
<term>Document keywords</term>
<term>Document retrieval</term>
<term>Fewer phrases</term>
<term>Free text</term>
<term>Government reports</term>
<term>Great deal</term>
<term>High overlap</term>
<term>Human indexers</term>
<term>Human language technology</term>
<term>Hunting dogs</term>
<term>Indexer</term>
<term>Indexing</term>
<term>Information processing</term>
<term>Information producers</term>
<term>Information providers</term>
<term>Information retrieval</term>
<term>Information retrieval systems</term>
<term>Information science</term>
<term>Information seekers</term>
<term>Information sources</term>
<term>Interface</term>
<term>Interim noun phrases</term>
<term>Interim phrase generation tools</term>
<term>Interim phrases</term>
<term>Internet</term>
<term>Karlsson karttunen</term>
<term>Keyword</term>
<term>Keywords</term>
<term>Lexical</term>
<term>Lexical analysis</term>
<term>Lexical entries</term>
<term>Lexical knowledge</term>
<term>Lexical properties</term>
<term>Lexicon</term>
<term>Longest phrase</term>
<term>Longest phrase generation</term>
<term>Machine understanding group</term>
<term>Medical abstracts</term>
<term>Medical care</term>
<term>Medical domain</term>
<term>Medical goal</term>
<term>Medical information</term>
<term>Medical information retrieval</term>
<term>Medical journal abstracts</term>
<term>Medical professionals</term>
<term>Medical researchers</term>
<term>Medical terminology</term>
<term>Mesh terms</term>
<term>More phrases</term>
<term>National cancer institute</term>
<term>National library</term>
<term>Natural language</term>
<term>Natural language processing</term>
<term>Noun</term>
<term>Noun phrase</term>
<term>Noun phraser</term>
<term>Noun phraser noun phraser</term>
<term>Noun phraser versions</term>
<term>Noun phrasers</term>
<term>Noun phrases</term>
<term>Noun phrasing</term>
<term>Nptool</term>
<term>Nptool auto index</term>
<term>Other phrase generation techniques</term>
<term>Other techniques</term>
<term>Other tools</term>
<term>Overlap analysis</term>
<term>Patient record</term>
<term>Phrase</term>
<term>Phrase generation tools</term>
<term>Phrase output</term>
<term>Phraser</term>
<term>Phrasing</term>
<term>Phrasing tools</term>
<term>Precision comparison</term>
<term>Pustejovski</term>
<term>Query</term>
<term>Query expansion</term>
<term>Query formation</term>
<term>Relevant concepts</term>
<term>Relevant phrases</term>
<term>Research questions</term>
<term>Results show</term>
<term>Retrieval</term>
<term>Salton</term>
<term>Same document</term>
<term>Sample output</term>
<term>Schatz</term>
<term>Science alliance</term>
<term>Search engines</term>
<term>Semantic indexing</term>
<term>Semantic network</term>
<term>Semantic types</term>
<term>Sigir conference</term>
<term>Specialist lexicon</term>
<term>Statistical methods</term>
<term>Stochastic methods</term>
<term>Tagger</term>
<term>Test collection</term>
<term>Text corpora</term>
<term>Text documents</term>
<term>Textual data</term>
<term>Textual unit</term>
<term>Thesaurus</term>
<term>Total number</term>
<term>Training corpus</term>
<term>Umls</term>
<term>Umls knowledge sources</term>
<term>Unbiased estimates</term>
<term>Variance</term>
<term>Vector space representation</term>
<term>Vocabulary problem</term>
<term>Wall street journal</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">In an effort to assist medical researchers and professionals in accessing information necessary for their work, the A1 Lab at the University of Arizona is investigating the use of a natural language processing (NLP) technique called noun phrasing. The goal of this research is to determine whether noun phrasing could be a viable technique to include in medical information retrieval applications. Four noun phrase generation tools were evaluated as to their ability to isolate noun phrases from medical journal abstracts. Tests were conducted using the National Cancer Institute's CANCERLIT database. The NLP tools evaluated were Massachusetts Institute of Technology's (MIT's) Chopper, The University of Arizona's Automatic Indexer, Lingsoft's NPtool, and The University of Arizona's AZ Noun Phraser. In addition, the National Library of Medicine's SPECIALIST Lexicon was incorporated into two versions of the AZ Noun Phraser to be evaluated against the other tools as well as a nonaugmented version of the AZ Noun Phraser. Using the metrics relative subject recall and precision, our results show that, with the exception of Chopper, the phrasing tools were fairly comparable in recall and precision. It was also shown that augmenting the AZ Noun Phraser by including the SPECIALIST Lexicon from the National Library of Medicine resulted in improved recall and precision.</div>
</front>
</TEI>
<istex>
<corpusName>wiley</corpusName>
<keywords>
<teeft>
<json:string>phraser</json:string>
<json:string>noun</json:string>
<json:string>noun phraser</json:string>
<json:string>nptool</json:string>
<json:string>american society</json:string>
<json:string>noun phrases</json:string>
<json:string>specialist lexicon</json:string>
<json:string>retrieval</json:string>
<json:string>automatic indexing</json:string>
<json:string>tagger</json:string>
<json:string>cancerlit</json:string>
<json:string>chopper</json:string>
<json:string>schatz</json:string>
<json:string>information retrieval</json:string>
<json:string>database</json:string>
<json:string>keyword</json:string>
<json:string>internet</json:string>
<json:string>umls</json:string>
<json:string>national library</json:string>
<json:string>aznp</json:string>
<json:string>indexer</json:string>
<json:string>thesaurus</json:string>
<json:string>brill</json:string>
<json:string>keywords</json:string>
<json:string>information science</json:string>
<json:string>medical information</json:string>
<json:string>lexicon</json:string>
<json:string>noun phrasing</json:string>
<json:string>boguraev</json:string>
<json:string>pustejovski</json:string>
<json:string>relevant phrases</json:string>
<json:string>query</json:string>
<json:string>concept space</json:string>
<json:string>natural language processing</json:string>
<json:string>boguraev pustejovski</json:string>
<json:string>variance</json:string>
<json:string>indexing</json:string>
<json:string>noun phraser versions</json:string>
<json:string>text corpora</json:string>
<json:string>different versions</json:string>
<json:string>concept spaces</json:string>
<json:string>precision comparison</json:string>
<json:string>human indexers</json:string>
<json:string>medical information retrieval</json:string>
<json:string>phrasing tools</json:string>
<json:string>chen</json:string>
<json:string>lexical</json:string>
<json:string>salton</json:string>
<json:string>results show</json:string>
<json:string>brill tagger</json:string>
<json:string>text documents</json:string>
<json:string>medical researchers</json:string>
<json:string>cambridge university press</json:string>
<json:string>medical abstracts</json:string>
<json:string>other techniques</json:string>
<json:string>interim phrases</json:string>
<json:string>research questions</json:string>
<json:string>textual data</json:string>
<json:string>sigir conference</json:string>
<json:string>information sources</json:string>
<json:string>digital libraries</json:string>
<json:string>mesh terms</json:string>
<json:string>noun phrase</json:string>
<json:string>other tools</json:string>
<json:string>great deal</json:string>
<json:string>brown corpus</json:string>
<json:string>information producers</json:string>
<json:string>information seekers</json:string>
<json:string>total number</json:string>
<json:string>national cancer institute</json:string>
<json:string>longest phrase</json:string>
<json:string>more phrases</json:string>
<json:string>different techniques</json:string>
<json:string>lexical analysis</json:string>
<json:string>phrasing</json:string>
<json:string>phrase</json:string>
<json:string>computational linguistics</json:string>
<json:string>lexical entries</json:string>
<json:string>semantic indexing</json:string>
<json:string>lexical properties</json:string>
<json:string>document keywords</json:string>
<json:string>karlsson karttunen</json:string>
<json:string>training corpus</json:string>
<json:string>vocabulary problem</json:string>
<json:string>stochastic methods</json:string>
<json:string>statistical methods</json:string>
<json:string>wall street journal</json:string>
<json:string>natural language</json:string>
<json:string>medical domain</json:string>
<json:string>anick vaithyanathan</json:string>
<json:string>document collection</json:string>
<json:string>relevant concepts</json:string>
<json:string>patient record</json:string>
<json:string>free text</json:string>
<json:string>vector space representation</json:string>
<json:string>umls knowledge sources</json:string>
<json:string>semantic network</json:string>
<json:string>information retrieval systems</json:string>
<json:string>current version</json:string>
<json:string>semantic types</json:string>
<json:string>different phrases</json:string>
<json:string>machine understanding group</json:string>
<json:string>noun phraser noun phraser</json:string>
<json:string>interim noun phrases</json:string>
<json:string>test collection</json:string>
<json:string>same document</json:string>
<json:string>other phrase generation techniques</json:string>
<json:string>document retrieval</json:string>
<json:string>lexical knowledge</json:string>
<json:string>query formation</json:string>
<json:string>sample output</json:string>
<json:string>government reports</json:string>
<json:string>medical terminology</json:string>
<json:string>phrase output</json:string>
<json:string>detmer shortliffe</json:string>
<json:string>different domains</json:string>
<json:string>cancerlit collection</json:string>
<json:string>information providers</json:string>
<json:string>concept extraction</json:string>
<json:string>hunting dogs</json:string>
<json:string>phrase generation tools</json:string>
<json:string>medical goal</json:string>
<json:string>medical professionals</json:string>
<json:string>search engines</json:string>
<json:string>human language technology</json:string>
<json:string>fewer phrases</json:string>
<json:string>longest phrase generation</json:string>
<json:string>interim phrase generation tools</json:string>
<json:string>data collections</json:string>
<json:string>medical journal abstracts</json:string>
<json:string>unbiased estimates</json:string>
<json:string>overlap analysis</json:string>
<json:string>high overlap</json:string>
<json:string>noun phrasers</json:string>
<json:string>nptool auto index</json:string>
<json:string>science alliance</json:string>
<json:string>query expansion</json:string>
<json:string>textual unit</json:string>
<json:string>annual symposium</json:string>
<json:string>computer applications</json:string>
<json:string>medical care</json:string>
<json:string>information processing</json:string>
<json:string>cancer research results</json:string>
<json:string>interface</json:string>
</teeft>
</keywords>
<author>
<json:item>
<name>Kristin M. Tolle</name>
<affiliations>
<json:string>E-mail: ktolle@bpa.arizona.edu</json:string>
<json:string>Management Information Systems Department, University of Arizona, Tucson, AZ 85721</json:string>
<json:string>E-mail: ktolle@bpa.arizona.edu</json:string>
</affiliations>
</json:item>
<json:item>
<name>Hsinchun Chen</name>
<affiliations>
<json:string>Management Information Systems Department, University of Arizona, Tucson, AZ 85721</json:string>
</affiliations>
</json:item>
</author>
<articleId>
<json:string>ASI5</json:string>
</articleId>
<arkIstex>ark:/67375/WNG-RRGVX20X-M</arkIstex>
<language>
<json:string>eng</json:string>
</language>
<originalGenre>
<json:string>article</json:string>
</originalGenre>
<abstract>In an effort to assist medical researchers and professionals in accessing information necessary for their work, the A1 Lab at the University of Arizona is investigating the use of a natural language processing (NLP) technique called noun phrasing. The goal of this research is to determine whether noun phrasing could be a viable technique to include in medical information retrieval applications. Four noun phrase generation tools were evaluated as to their ability to isolate noun phrases from medical journal abstracts. Tests were conducted using the National Cancer Institute's CANCERLIT database. The NLP tools evaluated were Massachusetts Institute of Technology's (MIT's) Chopper, The University of Arizona's Automatic Indexer, Lingsoft's NPtool, and The University of Arizona's AZ Noun Phraser. In addition, the National Library of Medicine's SPECIALIST Lexicon was incorporated into two versions of the AZ Noun Phraser to be evaluated against the other tools as well as a nonaugmented version of the AZ Noun Phraser. Using the metrics relative subject recall and precision, our results show that, with the exception of Chopper, the phrasing tools were fairly comparable in recall and precision. It was also shown that augmenting the AZ Noun Phraser by including the SPECIALIST Lexicon from the National Library of Medicine resulted in improved recall and precision.</abstract>
<qualityIndicators>
<score>9.496</score>
<pdfWordCount>9239</pdfWordCount>
<pdfCharCount>59342</pdfCharCount>
<pdfVersion>1.2</pdfVersion>
<pdfPageCount>19</pdfPageCount>
<pdfPageSize>612 x 792 pts (letter)</pdfPageSize>
<refBibsNative>true</refBibsNative>
<abstractWordCount>208</abstractWordCount>
<abstractCharCount>1374</abstractCharCount>
<keywordCount>0</keywordCount>
</qualityIndicators>
<title>Comparing noun phrasing techniques for use with medical digital library tools</title>
<genre>
<json:string>article</json:string>
</genre>
<host>
<title>Journal of the American Society for Information Science</title>
<language>
<json:string>unknown</json:string>
</language>
<doi>
<json:string>10.1002/(ISSN)1097-4571</json:string>
</doi>
<issn>
<json:string>0002-8231</json:string>
</issn>
<eissn>
<json:string>1097-4571</json:string>
</eissn>
<publisherId>
<json:string>ASI</json:string>
</publisherId>
<volume>51</volume>
<issue>4</issue>
<pages>
<first>352</first>
<last>370</last>
<total>19</total>
</pages>
<genre>
<json:string>journal</json:string>
</genre>
<author>
<json:item>
<name>Hsinchun Chen</name>
<affiliations>
<json:string>McClelland Professor of MIS, Artificial Intelligence Lab, Management Information Systems Department, The University of Arizona, Tucson, AZ 85721</json:string>
</affiliations>
</json:item>
</author>
<subject>
<json:item>
<value>nouns</value>
</json:item>
<json:item>
<value>phrases</value>
</json:item>
<json:item>
<value>biomedical information</value>
</json:item>
<json:item>
<value>recall</value>
</json:item>
<json:item>
<value>natural language processing</value>
</json:item>
<json:item>
<value>precision</value>
</json:item>
<json:item>
<value>medical libraries</value>
</json:item>
<json:item>
<value>digital libraries</value>
</json:item>
<json:item>
<value>Research Article</value>
</json:item>
</subject>
</host>
<namedEntities>
<unitex>
<date></date>
<geogName></geogName>
<orgName></orgName>
<orgName_funder></orgName_funder>
<orgName_provider></orgName_provider>
<persName></persName>
<placeName></placeName>
<ref_url></ref_url>
<ref_bibl></ref_bibl>
<bibl></bibl>
</unitex>
</namedEntities>
<ark>
<json:string>ark:/67375/WNG-RRGVX20X-M</json:string>
</ark>
<categories>
<wos></wos>
<scienceMetrix>
<json:string>1 - economic & social sciences</json:string>
<json:string>2 - social sciences</json:string>
<json:string>3 - information & library sciences</json:string>
</scienceMetrix>
<scopus>
<json:string>1 - Physical Sciences</json:string>
<json:string>2 - Engineering</json:string>
<json:string>3 - General Engineering</json:string>
</scopus>
<inist>
<json:string>1 - sciences appliquees, technologies et medecines</json:string>
<json:string>2 - sciences biologiques et medicales</json:string>
<json:string>3 - sciences medicales</json:string>
<json:string>4 - nephrologie. maladies des voies urinaires</json:string>
</inist>
</categories>
<publicationDate>2000</publicationDate>
<copyrightDate>2000</copyrightDate>
<doi>
<json:string>10.1002/(SICI)1097-4571(2000)51:4>352::AID-ASI5>3.0.CO;2-8</json:string>
</doi>
<id>F89A78E6F114B4601DE09E6221D5DF8DC95B240F</id>
<score>1</score>
<fulltext>
<json:item>
<extension>pdf</extension>
<original>true</original>
<mimetype>application/pdf</mimetype>
<uri>https://api.istex.fr/ark:/67375/WNG-RRGVX20X-M/fulltext.pdf</uri>
</json:item>
<json:item>
<extension>zip</extension>
<original>false</original>
<mimetype>application/zip</mimetype>
<uri>https://api.istex.fr/ark:/67375/WNG-RRGVX20X-M/bundle.zip</uri>
</json:item>
<istex:fulltextTEI uri="https://api.istex.fr/ark:/67375/WNG-RRGVX20X-M/fulltext.tei">
<teiHeader>
<fileDesc>
<titleStmt>
<title level="a" type="main" xml:lang="en">Comparing noun phrasing techniques for use with medical digital library tools</title>
</titleStmt>
<publicationStmt>
<authority>ISTEX</authority>
<publisher>John Wiley & Sons, Inc.</publisher>
<pubPlace>New York</pubPlace>
<availability>
<licence>Copyright © 2000 John Wiley & Sons, Inc.</licence>
</availability>
<date type="published" when="2000"></date>
</publicationStmt>
<notesStmt>
<note type="content-type" subtype="article" source="article" scheme="https://content-type.data.istex.fr/ark:/67375/XTP-6N5SZHKN-D">article</note>
<note type="publication-type" subtype="journal" scheme="https://publication-type.data.istex.fr/ark:/67375/JMC-0GLKJH51-B">journal</note>
</notesStmt>
<sourceDesc>
<biblStruct type="article">
<analytic>
<title level="a" type="main" xml:lang="en">Comparing noun phrasing techniques for use with medical digital library tools</title>
<author xml:id="author-0000">
<persName>
<forename type="first">Kristin M.</forename>
<surname>Tolle</surname>
</persName>
<email>ktolle@bpa.arizona.edu</email>
<affiliation>
<orgName type="department">Management Information Systems Department</orgName>
<orgName type="institution">University of Arizona</orgName>
<address>
<addrLine>Tucson</addrLine>
<addrLine>AZ 85721</addrLine>
<country key="US"></country>
</address>
</affiliation>
</author>
<author xml:id="author-0001">
<persName>
<forename type="first">Hsinchun</forename>
<surname>Chen</surname>
</persName>
<affiliation>
<orgName type="department">Management Information Systems Department</orgName>
<orgName type="institution">University of Arizona</orgName>
<address>
<addrLine>Tucson</addrLine>
<addrLine>AZ 85721</addrLine>
<country key="US"></country>
</address>
</affiliation>
</author>
<idno type="istex">F89A78E6F114B4601DE09E6221D5DF8DC95B240F</idno>
<idno type="ark">ark:/67375/WNG-RRGVX20X-M</idno>
<idno type="DOI">10.1002/(SICI)1097-4571(2000)51:4<352::AID-ASI5>3.0.CO;2-8</idno>
<idno type="unit">ASI5</idno>
<idno type="toTypesetVersion">file:ASI.ASI5.pdf</idno>
</analytic>
<monogr>
<editor xml:id="editor-0000">
<persName>
<forename type="first">Hsinchun</forename>
<surname>Chen</surname>
</persName>
</editor>
<title level="j" type="main">Journal of the American Society for Information Science</title>
<title level="j" type="sub">Digital Libraries: Part 2</title>
<title level="j" type="alt">JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE</title>
<idno type="pISSN">0002-8231</idno>
<idno type="eISSN">1097-4571</idno>
<idno type="book-DOI">10.1002/(ISSN)1097-4571</idno>
<idno type="book-part-DOI">10.1002/(SICI)1097-4571(2000)51:4<>1.0.CO;2-B</idno>
<idno type="product">ASI</idno>
<imprint>
<biblScope unit="vol">51</biblScope>
<biblScope unit="issue">4</biblScope>
<biblScope unit="page" from="352">352</biblScope>
<biblScope unit="page" to="370">370</biblScope>
<biblScope unit="page-count">19</biblScope>
<publisher>John Wiley & Sons, Inc.</publisher>
<pubPlace>New York</pubPlace>
<date type="published" when="2000"></date>
</imprint>
</monogr>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<abstract xml:lang="en" style="main">
<head>Abstract</head>
<p>In an effort to assist medical researchers and professionals in accessing information necessary for their work, the A1 Lab at the University of Arizona is investigating the use of a natural language processing (NLP) technique called noun phrasing. The goal of this research is to determine whether noun phrasing could be a viable technique to include in medical information retrieval applications. Four noun phrase generation tools were evaluated as to their ability to isolate noun phrases from medical journal abstracts. Tests were conducted using the National Cancer Institute's CANCERLIT database. The NLP tools evaluated were Massachusetts Institute of Technology's (MIT's) Chopper, The University of Arizona's Automatic Indexer, Lingsoft's NPtool, and The University of Arizona's AZ Noun Phraser. In addition, the National Library of Medicine's SPECIALIST Lexicon was incorporated into two versions of the AZ Noun Phraser to be evaluated against the other tools as well as a nonaugmented version of the AZ Noun Phraser. Using the metrics relative subject recall and precision, our results show that, with the exception of Chopper, the phrasing tools were fairly comparable in recall and precision. It was also shown that augmenting the AZ Noun Phraser by including the SPECIALIST Lexicon from the National Library of Medicine resulted in improved recall and precision.</p>
</abstract>
<textClass>
<keywords rend="articleCategory">
<term>Research Article</term>
</keywords>
<keywords rend="tocHeading1">
<term>Research Article</term>
</keywords>
</textClass>
<textClass>
<keywords ana="subject">
<term ref="psi.asis.org/digital/nouns">nouns</term>
<term ref="psi.asis.org/digital/phrases">phrases</term>
<term ref="psi.asis.org/digital/biomedical+information">biomedical information</term>
<term ref="psi.asis.org/digital/recall">recall</term>
<term ref="psi.asis.org/digital/natural+language+processing">natural language processing</term>
<term ref="psi.asis.org/digital/precision">precision</term>
<term ref="psi.asis.org/digital/medical+libraries">medical libraries</term>
<term ref="psi.asis.org/digital/digital+libraries">digital libraries</term>
</keywords>
</textClass>
<langUsage>
<language ident="en"></language>
</langUsage>
</profileDesc>
</teiHeader>
</istex:fulltextTEI>
<json:item>
<extension>txt</extension>
<original>false</original>
<mimetype>text/plain</mimetype>
<uri>https://api.istex.fr/ark:/67375/WNG-RRGVX20X-M/fulltext.txt</uri>
</json:item>
</fulltext>
<metadata>
<istex:metadataXml wicri:clean="Wiley, elements deleted: body">
<istex:xmlDeclaration>version="1.0" encoding="UTF-8" standalone="yes"</istex:xmlDeclaration>
<istex:document>
<component version="2.0" type="serialArticle" xml:lang="en">
<header>
<publicationMeta level="product">
<publisherInfo>
<publisherName>John Wiley & Sons, Inc.</publisherName>
<publisherLoc>New York</publisherLoc>
</publisherInfo>
<doi registered="yes">10.1002/(ISSN)1097-4571</doi>
<issn type="print">0002-8231</issn>
<issn type="electronic">1097-4571</issn>
<idGroup>
<id type="product" value="ASI"></id>
</idGroup>
<titleGroup>
<title type="main" xml:lang="en" sort="JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE">Journal of the American Society for Information Science</title>
<title type="short">J. Am. Soc. Inf. Sci.</title>
</titleGroup>
</publicationMeta>
<publicationMeta level="part" position="40">
<doi origin="wiley" registered="yes">10.1002/(SICI)1097-4571(2000)51:4<>1.0.CO;2-B</doi>
<titleGroup>
<title type="specialIssueTitle">Digital Libraries: Part 2</title>
</titleGroup>
<numberingGroup>
<numbering type="journalVolume" number="51">51</numbering>
<numbering type="journalIssue">4</numbering>
</numberingGroup>
<creators>
<creator xml:id="sped1" creatorRole="sponsoringEditor" affiliationRef="#sp1">
<personName>
<givenNames>Hsinchun</givenNames>
<familyName>Chen</familyName>
</personName>
</creator>
</creators>
<affiliationGroup>
<affiliation xml:id="sp1" countryCode="US" type="organization">
<unparsedAffiliation>McClelland Professor of MIS, Artificial Intelligence Lab, Management Information Systems Department, The University of Arizona, Tucson, AZ 85721</unparsedAffiliation>
</affiliation>
</affiliationGroup>
<coverDate startDate="2000">2000</coverDate>
</publicationMeta>
<publicationMeta level="unit" type="article" position="5" status="forIssue">
<doi origin="wiley" registered="yes">10.1002/(SICI)1097-4571(2000)51:4<352::AID-ASI5>3.0.CO;2-8</doi>
<idGroup>
<id type="unit" value="ASI5"></id>
</idGroup>
<countGroup>
<count type="pageTotal" number="19"></count>
</countGroup>
<titleGroup>
<title type="articleCategory">Research Article</title>
<title type="tocHeading1">Research Article</title>
</titleGroup>
<copyright ownership="publisher">Copyright © 2000 John Wiley & Sons, Inc.</copyright>
<eventGroup>
<event type="firstOnline" date="2000-02-11"></event>
<event type="publishedOnlineFinalForm" date="2000-02-11"></event>
<event type="xmlConverted" agent="Converter:JWSART34_TO_WML3G version:2.4.7 mode:FullText source:FullText result:FullText mathml2tex" date="2011-02-24"></event>
<event type="xmlConverted" agent="Converter:WILEY_ML3G_TO_WILEY_ML3GV2 version:3.8.8" date="2014-01-06"></event>
<event type="xmlConverted" agent="Converter:WML3G_To_WML3G version:4.1.7 mode:FullText,remove_FC" date="2014-10-30"></event>
</eventGroup>
<numberingGroup>
<numbering type="pageFirst">352</numbering>
<numbering type="pageLast">370</numbering>
</numberingGroup>
<subjectInfo>
<subject href="psi.asis.org/digital/nouns">nouns</subject>
<subject href="psi.asis.org/digital/phrases">phrases</subject>
<subject href="psi.asis.org/digital/biomedical+information">biomedical information</subject>
<subject href="psi.asis.org/digital/recall">recall</subject>
<subject href="psi.asis.org/digital/natural+language+processing">natural language processing</subject>
<subject href="psi.asis.org/digital/precision">precision</subject>
<subject href="psi.asis.org/digital/medical+libraries">medical libraries</subject>
<subject href="psi.asis.org/digital/digital+libraries">digital libraries</subject>
</subjectInfo>
<linkGroup>
<link type="toTypesetVersion" href="file:ASI.ASI5.pdf"></link>
</linkGroup>
</publicationMeta>
<contentMeta>
<countGroup>
<count type="figureTotal" number="13"></count>
<count type="tableTotal" number="5"></count>
<count type="referenceTotal" number="49"></count>
<count type="wordTotal" number="10560"></count>
</countGroup>
<titleGroup>
<title type="main" xml:lang="en">Comparing noun phrasing techniques for use with medical digital library tools</title>
</titleGroup>
<creators>
<creator xml:id="au1" creatorRole="author" affiliationRef="#af1">
<personName>
<givenNames>Kristin M.</givenNames>
<familyName>Tolle</familyName>
</personName>
<contactDetails>
<email>ktolle@bpa.arizona.edu</email>
</contactDetails>
</creator>
<creator xml:id="au2" creatorRole="author" affiliationRef="#af1">
<personName>
<givenNames>Hsinchun</givenNames>
<familyName>Chen</familyName>
</personName>
</creator>
</creators>
<affiliationGroup>
<affiliation xml:id="af1" countryCode="US" type="organization">
<unparsedAffiliation>Management Information Systems Department, University of Arizona, Tucson, AZ 85721</unparsedAffiliation>
</affiliation>
</affiliationGroup>
<fundingInfo>
<fundingAgency>NSF/ARPA/NASA Digital Library Initiative</fundingAgency>
<fundingNumber>IRI‐9411318</fundingNumber>
</fundingInfo>
<fundingInfo>
<fundingAgency>NSF CISE</fundingAgency>
<fundingNumber>IRI‐9525790</fundingNumber>
</fundingInfo>
<fundingInfo>
<fundingAgency>National Computational Science Alliance (NCSA)</fundingAgency>
<fundingNumber>IRI970000N</fundingNumber>
<fundingNumber>IRI970002N</fundingNumber>
</fundingInfo>
<fundingInfo>
<fundingAgency>National Library of Medicine (NLM)</fundingAgency>
</fundingInfo>
<fundingInfo>
<fundingAgency>National Cancer Institute</fundingAgency>
</fundingInfo>
<fundingInfo>
<fundingAgency>National Institutes of Health</fundingAgency>
</fundingInfo>
<abstractGroup>
<abstract type="main" xml:lang="en">
<title type="main">Abstract</title>
<p>In an effort to assist medical researchers and professionals in accessing information necessary for their work, the A1 Lab at the University of Arizona is investigating the use of a natural language processing (NLP) technique called noun phrasing. The goal of this research is to determine whether noun phrasing could be a viable technique to include in medical information retrieval applications. Four noun phrase generation tools were evaluated as to their ability to isolate noun phrases from medical journal abstracts. Tests were conducted using the National Cancer Institute's CANCERLIT database. The NLP tools evaluated were Massachusetts Institute of Technology's (MIT's) Chopper, The University of Arizona's Automatic Indexer, Lingsoft's NPtool, and The University of Arizona's AZ Noun Phraser. In addition, the National Library of Medicine's SPECIALIST Lexicon was incorporated into two versions of the AZ Noun Phraser to be evaluated against the other tools as well as a nonaugmented version of the AZ Noun Phraser. Using the metrics relative subject recall and precision, our results show that, with the exception of Chopper, the phrasing tools were fairly comparable in recall and precision. It was also shown that augmenting the AZ Noun Phraser by including the SPECIALIST Lexicon from the National Library of Medicine resulted in improved recall and precision.</p>
</abstract>
</abstractGroup>
</contentMeta>
</header>
</component>
</istex:document>
</istex:metadataXml>
<mods version="3.6">
<titleInfo lang="en">
<title>Comparing noun phrasing techniques for use with medical digital library tools</title>
</titleInfo>
<titleInfo type="alternative" contentType="CDATA" lang="en">
<title>Comparing noun phrasing techniques for use with medical digital library tools</title>
</titleInfo>
<name type="personal">
<namePart type="given">Kristin M.</namePart>
<namePart type="family">Tolle</namePart>
<affiliation>E-mail: ktolle@bpa.arizona.edu</affiliation>
<affiliation>Management Information Systems Department, University of Arizona, Tucson, AZ 85721</affiliation>
<affiliation>E-mail: ktolle@bpa.arizona.edu</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Hsinchun</namePart>
<namePart type="family">Chen</namePart>
<affiliation>Management Information Systems Department, University of Arizona, Tucson, AZ 85721</affiliation>
<role>
<roleTerm type="text">author</roleTerm>
</role>
</name>
<typeOfResource>text</typeOfResource>
<genre type="article" displayLabel="article" authority="ISTEX" authorityURI="https://content-type.data.istex.fr" valueURI="https://content-type.data.istex.fr/ark:/67375/XTP-6N5SZHKN-D">article</genre>
<originInfo>
<publisher>John Wiley & Sons, Inc.</publisher>
<place>
<placeTerm type="text">New York</placeTerm>
</place>
<dateIssued encoding="w3cdtf">2000</dateIssued>
<copyrightDate encoding="w3cdtf">2000</copyrightDate>
</originInfo>
<language>
<languageTerm type="code" authority="rfc3066">en</languageTerm>
<languageTerm type="code" authority="iso639-2b">eng</languageTerm>
</language>
<physicalDescription>
<extent unit="figures">13</extent>
<extent unit="tables">5</extent>
<extent unit="references">49</extent>
<extent unit="words">10560</extent>
</physicalDescription>
<abstract lang="en">In an effort to assist medical researchers and professionals in accessing information necessary for their work, the A1 Lab at the University of Arizona is investigating the use of a natural language processing (NLP) technique called noun phrasing. The goal of this research is to determine whether noun phrasing could be a viable technique to include in medical information retrieval applications. Four noun phrase generation tools were evaluated as to their ability to isolate noun phrases from medical journal abstracts. Tests were conducted using the National Cancer Institute's CANCERLIT database. The NLP tools evaluated were Massachusetts Institute of Technology's (MIT's) Chopper, The University of Arizona's Automatic Indexer, Lingsoft's NPtool, and The University of Arizona's AZ Noun Phraser. In addition, the National Library of Medicine's SPECIALIST Lexicon was incorporated into two versions of the AZ Noun Phraser to be evaluated against the other tools as well as a nonaugmented version of the AZ Noun Phraser. Using the metrics relative subject recall and precision, our results show that, with the exception of Chopper, the phrasing tools were fairly comparable in recall and precision. It was also shown that augmenting the AZ Noun Phraser by including the SPECIALIST Lexicon from the National Library of Medicine resulted in improved recall and precision.</abstract>
<note type="funding">NSF/ARPA/NASA Digital Library Initiative - No. IRI‐9411318; </note>
<note type="funding">NSF CISE - No. IRI‐9525790; </note>
<note type="funding">National Computational Science Alliance (NCSA) - No. IRI970000N; No. IRI970002N; </note>
<note type="funding">National Library of Medicine (NLM)</note>
<note type="funding">National Cancer Institute</note>
<note type="funding">National Institutes of Health</note>
<relatedItem type="host">
<titleInfo>
<title>Journal of the American Society for Information Science</title>
</titleInfo>
<titleInfo type="abbreviated">
<title>J. Am. Soc. Inf. Sci.</title>
</titleInfo>
<name type="personal">
<namePart type="given">Hsinchun</namePart>
<namePart type="family">Chen</namePart>
<affiliation>McClelland Professor of MIS, Artificial Intelligence Lab, Management Information Systems Department, The University of Arizona, Tucson, AZ 85721</affiliation>
</name>
<genre type="journal" authority="ISTEX" authorityURI="https://publication-type.data.istex.fr" valueURI="https://publication-type.data.istex.fr/ark:/67375/JMC-0GLKJH51-B">journal</genre>
<subject>
<genre>index-terms</genre>
<topic authorityURI="psi.asis.org/digital/nouns">nouns</topic>
<topic authorityURI="psi.asis.org/digital/phrases">phrases</topic>
<topic authorityURI="psi.asis.org/digital/biomedical+information">biomedical information</topic>
<topic authorityURI="psi.asis.org/digital/recall">recall</topic>
<topic authorityURI="psi.asis.org/digital/natural+language+processing">natural language processing</topic>
<topic authorityURI="psi.asis.org/digital/precision">precision</topic>
<topic authorityURI="psi.asis.org/digital/medical+libraries">medical libraries</topic>
<topic authorityURI="psi.asis.org/digital/digital+libraries">digital libraries</topic>
</subject>
<subject>
<genre>article-category</genre>
<topic>Research Article</topic>
</subject>
<identifier type="ISSN">0002-8231</identifier>
<identifier type="eISSN">1097-4571</identifier>
<identifier type="DOI">10.1002/(ISSN)1097-4571</identifier>
<identifier type="PublisherID">ASI</identifier>
<part>
<date>2000</date>
<detail type="title">
<title>Digital Libraries: Part 2</title>
</detail>
<detail type="volume">
<caption>vol.</caption>
<number>51</number>
</detail>
<detail type="issue">
<caption>no.</caption>
<number>4</number>
</detail>
<extent unit="pages">
<start>352</start>
<end>370</end>
<total>19</total>
</extent>
</part>
</relatedItem>
<identifier type="istex">F89A78E6F114B4601DE09E6221D5DF8DC95B240F</identifier>
<identifier type="ark">ark:/67375/WNG-RRGVX20X-M</identifier>
<identifier type="DOI">10.1002/(SICI)1097-4571(2000)51:4<352::AID-ASI5>3.0.CO;2-8</identifier>
<identifier type="ArticleID">ASI5</identifier>
<accessCondition type="use and reproduction" contentType="copyright">Copyright © 2000 John Wiley & Sons, Inc.</accessCondition>
<recordInfo>
<recordContentSource authority="ISTEX" authorityURI="https://loaded-corpus.data.istex.fr" valueURI="https://loaded-corpus.data.istex.fr/ark:/67375/XBH-L0C46X92-X">wiley</recordContentSource>
<recordOrigin>John Wiley & Sons, Inc.</recordOrigin>
</recordInfo>
</mods>
<json:item>
<extension>json</extension>
<original>false</original>
<mimetype>application/json</mimetype>
<uri>https://api.istex.fr/ark:/67375/WNG-RRGVX20X-M/record.json</uri>
</json:item>
</metadata>
<serie></serie>
</istex>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Informatique/explor/SgmlV1/Data/Istex/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 004096 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Istex/Corpus/biblio.hfd -nk 004096 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Informatique
   |area=    SgmlV1
   |flux=    Istex
   |étape=   Corpus
   |type=    RBID
   |clé=     ISTEX:F89A78E6F114B4601DE09E6221D5DF8DC95B240F
   |texte=   Comparing noun phrasing techniques for use with medical digital library tools
}}

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jul 1 14:26:08 2019. Site generation: Wed Apr 28 21:40:44 2021