Serveur d'exploration sur SGML

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Semi-automated literature mining to identify putative biomarkers of disease from multiple biofluids

Identifieur interne : 000018 ( Pmc/Checkpoint ); précédent : 000017; suivant : 000019

Semi-automated literature mining to identify putative biomarkers of disease from multiple biofluids

Auteurs : Rick Jordan [États-Unis] ; Shyam Visweswaran [États-Unis] ; Vanathi Gopalakrishnan [États-Unis]

Source :

RBID : PMC:4215335

Abstract

Background

Computational methods for mining of biomedical literature can be useful in augmenting manual searches of the literature using keywords for disease-specific biomarker discovery from biofluids. In this work, we develop and apply a semi-automated literature mining method to mine abstracts obtained from PubMed to discover putative biomarkers of breast and lung cancers in specific biofluids.

Methodology

A positive set of abstracts was defined by the terms ‘breast cancer’ and ‘lung cancer’ in conjunction with 14 separate ‘biofluids’ (bile, blood, breastmilk, cerebrospinal fluid, mucus, plasma, saliva, semen, serum, synovial fluid, stool, sweat, tears, and urine), while a negative set of abstracts was defined by the terms ‘(biofluid) NOT breast cancer’ or ‘(biofluid) NOT lung cancer.’ More than 5.3 million total abstracts were obtained from PubMed and examined for biomarker-disease-biofluid associations (34,296 positive and 2,653,396 negative for breast cancer; 28,355 positive and 2,595,034 negative for lung cancer). Biological entities such as genes and proteins were tagged using ABNER, and processed using Python scripts to produce a list of putative biomarkers. Z-scores were calculated, ranked, and used to determine significance of putative biomarkers found. Manual verification of relevant abstracts was performed to assess our method’s performance.

Results

Biofluid-specific markers were identified from the literature, assigned relevance scores based on frequency of occurrence, and validated using known biomarker lists and/or databases for lung and breast cancer [NCBI’s On-line Mendelian Inheritance in Man (OMIM), Cancer Gene annotation server for cancer genomics (CAGE), NCBI’s Genes & Disease, NCI’s Early Detection Research Network (EDRN), and others]. The specificity of each marker for a given biofluid was calculated, and the performance of our semi-automated literature mining method assessed for breast and lung cancer.

Conclusions

We developed a semi-automated process for determining a list of putative biomarkers for breast and lung cancer. New knowledge is presented in the form of biomarker lists; ranked, newly discovered biomarker-disease-biofluid relationships; and biomarker specificity across biofluids.


Url:
DOI: 10.1186/2043-9113-4-13
PubMed: 25379168
PubMed Central: 4215335


Affiliations:


Links toward previous steps (curation, corpus...)


Links to Exploration step

PMC:4215335

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Semi-automated literature mining to identify putative biomarkers of disease from multiple biofluids</title>
<author>
<name sortKey="Jordan, Rick" sort="Jordan, Rick" uniqKey="Jordan R" first="Rick" last="Jordan">Rick Jordan</name>
<affiliation wicri:level="4">
<nlm:aff id="I1">Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA</wicri:regionArea>
<placeName>
<region type="state">Pennsylvanie</region>
<settlement type="city">Pittsburgh</settlement>
</placeName>
<orgName type="university">Université de Pittsburgh</orgName>
</affiliation>
</author>
<author>
<name sortKey="Visweswaran, Shyam" sort="Visweswaran, Shyam" uniqKey="Visweswaran S" first="Shyam" last="Visweswaran">Shyam Visweswaran</name>
<affiliation wicri:level="4">
<nlm:aff id="I1">Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA</wicri:regionArea>
<placeName>
<region type="state">Pennsylvanie</region>
<settlement type="city">Pittsburgh</settlement>
</placeName>
<orgName type="university">Université de Pittsburgh</orgName>
</affiliation>
<affiliation wicri:level="4">
<nlm:aff id="I2">Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA</wicri:regionArea>
<placeName>
<region type="state">Pennsylvanie</region>
<settlement type="city">Pittsburgh</settlement>
</placeName>
<orgName type="university">Université de Pittsburgh</orgName>
</affiliation>
<affiliation wicri:level="4">
<nlm:aff id="I3">Department of Computational & Systems Biology, University of Pittsburgh, Pittsburgh, PA, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Computational & Systems Biology, University of Pittsburgh, Pittsburgh, PA</wicri:regionArea>
<placeName>
<region type="state">Pennsylvanie</region>
<settlement type="city">Pittsburgh</settlement>
</placeName>
<orgName type="university">Université de Pittsburgh</orgName>
</affiliation>
</author>
<author>
<name sortKey="Gopalakrishnan, Vanathi" sort="Gopalakrishnan, Vanathi" uniqKey="Gopalakrishnan V" first="Vanathi" last="Gopalakrishnan">Vanathi Gopalakrishnan</name>
<affiliation wicri:level="4">
<nlm:aff id="I1">Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA</wicri:regionArea>
<placeName>
<region type="state">Pennsylvanie</region>
<settlement type="city">Pittsburgh</settlement>
</placeName>
<orgName type="university">Université de Pittsburgh</orgName>
</affiliation>
<affiliation wicri:level="4">
<nlm:aff id="I2">Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA</wicri:regionArea>
<placeName>
<region type="state">Pennsylvanie</region>
<settlement type="city">Pittsburgh</settlement>
</placeName>
<orgName type="university">Université de Pittsburgh</orgName>
</affiliation>
<affiliation wicri:level="4">
<nlm:aff id="I3">Department of Computational & Systems Biology, University of Pittsburgh, Pittsburgh, PA, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Computational & Systems Biology, University of Pittsburgh, Pittsburgh, PA</wicri:regionArea>
<placeName>
<region type="state">Pennsylvanie</region>
<settlement type="city">Pittsburgh</settlement>
</placeName>
<orgName type="university">Université de Pittsburgh</orgName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">25379168</idno>
<idno type="pmc">4215335</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4215335</idno>
<idno type="RBID">PMC:4215335</idno>
<idno type="doi">10.1186/2043-9113-4-13</idno>
<date when="2014">2014</date>
<idno type="wicri:Area/Pmc/Corpus">000158</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Corpus" wicri:corpus="PMC">000158</idno>
<idno type="wicri:Area/Pmc/Curation">000158</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Curation">000158</idno>
<idno type="wicri:Area/Pmc/Checkpoint">000018</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Checkpoint">000018</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">Semi-automated literature mining to identify putative biomarkers of disease from multiple biofluids</title>
<author>
<name sortKey="Jordan, Rick" sort="Jordan, Rick" uniqKey="Jordan R" first="Rick" last="Jordan">Rick Jordan</name>
<affiliation wicri:level="4">
<nlm:aff id="I1">Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA</wicri:regionArea>
<placeName>
<region type="state">Pennsylvanie</region>
<settlement type="city">Pittsburgh</settlement>
</placeName>
<orgName type="university">Université de Pittsburgh</orgName>
</affiliation>
</author>
<author>
<name sortKey="Visweswaran, Shyam" sort="Visweswaran, Shyam" uniqKey="Visweswaran S" first="Shyam" last="Visweswaran">Shyam Visweswaran</name>
<affiliation wicri:level="4">
<nlm:aff id="I1">Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA</wicri:regionArea>
<placeName>
<region type="state">Pennsylvanie</region>
<settlement type="city">Pittsburgh</settlement>
</placeName>
<orgName type="university">Université de Pittsburgh</orgName>
</affiliation>
<affiliation wicri:level="4">
<nlm:aff id="I2">Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA</wicri:regionArea>
<placeName>
<region type="state">Pennsylvanie</region>
<settlement type="city">Pittsburgh</settlement>
</placeName>
<orgName type="university">Université de Pittsburgh</orgName>
</affiliation>
<affiliation wicri:level="4">
<nlm:aff id="I3">Department of Computational & Systems Biology, University of Pittsburgh, Pittsburgh, PA, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Computational & Systems Biology, University of Pittsburgh, Pittsburgh, PA</wicri:regionArea>
<placeName>
<region type="state">Pennsylvanie</region>
<settlement type="city">Pittsburgh</settlement>
</placeName>
<orgName type="university">Université de Pittsburgh</orgName>
</affiliation>
</author>
<author>
<name sortKey="Gopalakrishnan, Vanathi" sort="Gopalakrishnan, Vanathi" uniqKey="Gopalakrishnan V" first="Vanathi" last="Gopalakrishnan">Vanathi Gopalakrishnan</name>
<affiliation wicri:level="4">
<nlm:aff id="I1">Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA</wicri:regionArea>
<placeName>
<region type="state">Pennsylvanie</region>
<settlement type="city">Pittsburgh</settlement>
</placeName>
<orgName type="university">Université de Pittsburgh</orgName>
</affiliation>
<affiliation wicri:level="4">
<nlm:aff id="I2">Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA</wicri:regionArea>
<placeName>
<region type="state">Pennsylvanie</region>
<settlement type="city">Pittsburgh</settlement>
</placeName>
<orgName type="university">Université de Pittsburgh</orgName>
</affiliation>
<affiliation wicri:level="4">
<nlm:aff id="I3">Department of Computational & Systems Biology, University of Pittsburgh, Pittsburgh, PA, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Computational & Systems Biology, University of Pittsburgh, Pittsburgh, PA</wicri:regionArea>
<placeName>
<region type="state">Pennsylvanie</region>
<settlement type="city">Pittsburgh</settlement>
</placeName>
<orgName type="university">Université de Pittsburgh</orgName>
</affiliation>
</author>
</analytic>
<series>
<title level="j">Journal of Clinical Bioinformatics</title>
<idno type="eISSN">2043-9113</idno>
<imprint>
<date when="2014">2014</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<sec>
<title>Background</title>
<p>Computational methods for mining of biomedical literature can be useful in augmenting manual searches of the literature using keywords for disease-specific biomarker discovery from biofluids. In this work, we develop and apply a semi-automated literature mining method to mine abstracts obtained from PubMed to discover putative biomarkers of breast and lung cancers in specific biofluids.</p>
</sec>
<sec>
<title>Methodology</title>
<p>A positive set of abstracts was defined by the terms ‘breast cancer’ and ‘lung cancer’ in conjunction with 14 separate ‘biofluids’ (bile, blood, breastmilk, cerebrospinal fluid, mucus, plasma, saliva, semen, serum, synovial fluid, stool, sweat, tears, and urine), while a negative set of abstracts was defined by the terms ‘(biofluid) NOT breast cancer’ or ‘(biofluid) NOT lung cancer.’ More than 5.3 million total abstracts were obtained from PubMed and examined for biomarker-disease-biofluid associations (34,296 positive and 2,653,396 negative for breast cancer; 28,355 positive and 2,595,034 negative for lung cancer). Biological entities such as genes and proteins were tagged using ABNER, and processed using Python scripts to produce a list of putative biomarkers. Z-scores were calculated, ranked, and used to determine significance of putative biomarkers found. Manual verification of relevant abstracts was performed to assess our method’s performance.</p>
</sec>
<sec>
<title>Results</title>
<p>Biofluid-specific markers were identified from the literature, assigned relevance scores based on frequency of occurrence, and validated using known biomarker lists and/or databases for lung and breast cancer [NCBI’s On-line Mendelian Inheritance in Man (OMIM), Cancer Gene annotation server for cancer genomics (CAGE), NCBI’s Genes & Disease, NCI’s Early Detection Research Network (EDRN), and others]. The specificity of each marker for a given biofluid was calculated, and the performance of our semi-automated literature mining method assessed for breast and lung cancer.</p>
</sec>
<sec>
<title>Conclusions</title>
<p>We developed a semi-automated process for determining a list of putative biomarkers for breast and lung cancer. New knowledge is presented in the form of biomarker lists; ranked, newly discovered biomarker-disease-biofluid relationships; and biomarker specificity across biofluids.</p>
</sec>
</div>
</front>
<back>
<div1 type="bibliography">
<listBibl>
<biblStruct>
<analytic>
<author>
<name sortKey="Hirschman, L" uniqKey="Hirschman L">L Hirschman</name>
</author>
<author>
<name sortKey="Park, Jc" uniqKey="Park J">JC Park</name>
</author>
<author>
<name sortKey="Tsujii, J" uniqKey="Tsujii J">J Tsujii</name>
</author>
<author>
<name sortKey="Wong, L" uniqKey="Wong L">L Wong</name>
</author>
<author>
<name sortKey="Wu, Ch" uniqKey="Wu C">CH Wu</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Adamic, La" uniqKey="Adamic L">LA Adamic</name>
</author>
<author>
<name sortKey="Wilkinson, D" uniqKey="Wilkinson D">D Wilkinson</name>
</author>
<author>
<name sortKey="Huberman, Ba" uniqKey="Huberman B">BA Huberman</name>
</author>
<author>
<name sortKey="Adar, E" uniqKey="Adar E">E Adar</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Wren, Jd" uniqKey="Wren J">JD Wren</name>
</author>
<author>
<name sortKey="Bekeredjian, R" uniqKey="Bekeredjian R">R Bekeredjian</name>
</author>
<author>
<name sortKey="Stewart, Ja" uniqKey="Stewart J">JA Stewart</name>
</author>
<author>
<name sortKey="Shohet, Rv" uniqKey="Shohet R">RV Shohet</name>
</author>
<author>
<name sortKey="Garner, Hr" uniqKey="Garner H">HR Garner</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Xuan, W" uniqKey="Xuan W">W Xuan</name>
</author>
<author>
<name sortKey="Wang, P" uniqKey="Wang P">P Wang</name>
</author>
<author>
<name sortKey="Watson, Sj" uniqKey="Watson S">SJ Watson</name>
</author>
<author>
<name sortKey="Meng, F" uniqKey="Meng F">F Meng</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Hristovski, D" uniqKey="Hristovski D">D Hristovski</name>
</author>
<author>
<name sortKey="Peterlin, B" uniqKey="Peterlin B">B Peterlin</name>
</author>
<author>
<name sortKey="Mitchell, Ja" uniqKey="Mitchell J">JA Mitchell</name>
</author>
<author>
<name sortKey="Humphrey, Sm" uniqKey="Humphrey S">SM Humphrey</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Novichkova, S" uniqKey="Novichkova S">S Novichkova</name>
</author>
<author>
<name sortKey="Egorov, S" uniqKey="Egorov S">S Egorov</name>
</author>
<author>
<name sortKey="Daraseila, N" uniqKey="Daraseila N">N Daraseila</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Srinivasan, P" uniqKey="Srinivasan P">P Srinivasan</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Leonard, Je" uniqKey="Leonard J">JE Leonard</name>
</author>
<author>
<name sortKey="Colombe, Jb" uniqKey="Colombe J">JB Colombe</name>
</author>
<author>
<name sortKey="Levy, Jl" uniqKey="Levy J">JL Levy</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Jensen, Lj" uniqKey="Jensen L">LJ Jensen</name>
</author>
<author>
<name sortKey="Saric, J" uniqKey="Saric J">J Saric</name>
</author>
<author>
<name sortKey="Bork, P" uniqKey="Bork P">P Bork</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Krallinger, M" uniqKey="Krallinger M">M Krallinger</name>
</author>
<author>
<name sortKey="Valencia, A" uniqKey="Valencia A">A Valencia</name>
</author>
<author>
<name sortKey="Hirschman, L" uniqKey="Hirschman L">L Hirschman</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Cohen, Am" uniqKey="Cohen A">AM Cohen</name>
</author>
<author>
<name sortKey="Hersh, Wr" uniqKey="Hersh W">WR Hersh</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Swanson, Dr" uniqKey="Swanson D">DR Swanson</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Zhu, S" uniqKey="Zhu S">S Zhu</name>
</author>
<author>
<name sortKey="Okuno, Y" uniqKey="Okuno Y">Y Okuno</name>
</author>
<author>
<name sortKey="Tsujimoto, G" uniqKey="Tsujimoto G">G Tsujimoto</name>
</author>
<author>
<name sortKey="Mamitsuka, H" uniqKey="Mamitsuka H">H Mamitsuka</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Frijters, R" uniqKey="Frijters R">R Frijters</name>
</author>
<author>
<name sortKey="Van Vugt, M" uniqKey="Van Vugt M">M Van Vugt</name>
</author>
<author>
<name sortKey="Smeets, R" uniqKey="Smeets R">R Smeets</name>
</author>
<author>
<name sortKey="Van Schaik, R" uniqKey="Van Schaik R">R Van Schaik</name>
</author>
<author>
<name sortKey="De Vlieg, J" uniqKey="De Vlieg J">J De Vlieg</name>
</author>
<author>
<name sortKey="Alkema, W" uniqKey="Alkema W">W Alkema</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Li, H" uniqKey="Li H">H Li</name>
</author>
<author>
<name sortKey="Liu, C" uniqKey="Liu C">C Liu</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Al Mubaid, H" uniqKey="Al Mubaid H">H Al-Mubaid</name>
</author>
<author>
<name sortKey="Singh, Rk" uniqKey="Singh R">RK Singh</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Andrade, Ma" uniqKey="Andrade M">MA Andrade</name>
</author>
<author>
<name sortKey="Valencia, A" uniqKey="Valencia A">A Valencia</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Younesi, E" uniqKey="Younesi E">E Younesi</name>
</author>
<author>
<name sortKey="Toldo, L" uniqKey="Toldo L">L Toldo</name>
</author>
<author>
<name sortKey="Muller, B" uniqKey="Muller B">B Muller</name>
</author>
<author>
<name sortKey="Friedrich, Cm" uniqKey="Friedrich C">CM Friedrich</name>
</author>
<author>
<name sortKey="Novac, N" uniqKey="Novac N">N Novac</name>
</author>
<author>
<name sortKey="Scheer, A" uniqKey="Scheer A">A Scheer</name>
</author>
<author>
<name sortKey="Hofmann Apitius, M" uniqKey="Hofmann Apitius M">M Hofmann-Apitius</name>
</author>
<author>
<name sortKey="Fluck, J" uniqKey="Fluck J">J Fluck</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Deyati, A" uniqKey="Deyati A">A Deyati</name>
</author>
<author>
<name sortKey="Younesi, E" uniqKey="Younesi E">E Younesi</name>
</author>
<author>
<name sortKey="Hofmann Apitius, M" uniqKey="Hofmann Apitius M">M Hofmann-Apitius</name>
</author>
<author>
<name sortKey="Novac, N" uniqKey="Novac N">N Novac</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Veenstra, T" uniqKey="Veenstra T">T Veenstra</name>
</author>
<author>
<name sortKey="Conrads, T" uniqKey="Conrads T">T Conrads</name>
</author>
<author>
<name sortKey="Hood, B" uniqKey="Hood B">B Hood</name>
</author>
<author>
<name sortKey="Avellino, A" uniqKey="Avellino A">A Avellino</name>
</author>
<author>
<name sortKey="Ellenbogen, R" uniqKey="Ellenbogen R">R Ellenbogen</name>
</author>
<author>
<name sortKey="Morrison, R" uniqKey="Morrison R">R Morrison</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Zhou, M" uniqKey="Zhou M">M Zhou</name>
</author>
<author>
<name sortKey="Conrads, T" uniqKey="Conrads T">T Conrads</name>
</author>
<author>
<name sortKey="Veenstra, T" uniqKey="Veenstra T">T Veenstra</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Lee, Y" uniqKey="Lee Y">Y Lee</name>
</author>
<author>
<name sortKey="Wong, D" uniqKey="Wong D">D Wong</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Gao, K" uniqKey="Gao K">K Gao</name>
</author>
<author>
<name sortKey="Zhou, H" uniqKey="Zhou H">H Zhou</name>
</author>
<author>
<name sortKey="Zhang, L" uniqKey="Zhang L">L Zhang</name>
</author>
<author>
<name sortKey="Lee, J" uniqKey="Lee J">J Lee</name>
</author>
<author>
<name sortKey="Zhou, Q" uniqKey="Zhou Q">Q Zhou</name>
</author>
<author>
<name sortKey="Hu, S" uniqKey="Hu S">S Hu</name>
</author>
<author>
<name sortKey="Wolinsky, L" uniqKey="Wolinsky L">L Wolinsky</name>
</author>
<author>
<name sortKey="Farrell, J" uniqKey="Farrell J">J Farrell</name>
</author>
<author>
<name sortKey="Eibl, G" uniqKey="Eibl G">G Eibl</name>
</author>
<author>
<name sortKey="Wong, D" uniqKey="Wong D">D Wong</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Xu, X" uniqKey="Xu X">X Xu</name>
</author>
<author>
<name sortKey="Veenstra, T" uniqKey="Veenstra T">T Veenstra</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Delaleu, N" uniqKey="Delaleu N">N Delaleu</name>
</author>
<author>
<name sortKey="Immervoll, H" uniqKey="Immervoll H">H Immervoll</name>
</author>
<author>
<name sortKey="Cornelius, J" uniqKey="Cornelius J">J Cornelius</name>
</author>
<author>
<name sortKey="Jonsson, R" uniqKey="Jonsson R">R Jonsson</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Alterovitz, G" uniqKey="Alterovitz G">G Alterovitz</name>
</author>
<author>
<name sortKey="Xiang, M" uniqKey="Xiang M">M Xiang</name>
</author>
<author>
<name sortKey="Liu, J" uniqKey="Liu J">J Liu</name>
</author>
<author>
<name sortKey="Chang, A" uniqKey="Chang A">A Chang</name>
</author>
<author>
<name sortKey="Ramoni, Mf" uniqKey="Ramoni M">MF Ramoni</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Camon, E" uniqKey="Camon E">E Camon</name>
</author>
<author>
<name sortKey="Magrane, M" uniqKey="Magrane M">M Magrane</name>
</author>
<author>
<name sortKey="Barrell, D" uniqKey="Barrell D">D Barrell</name>
</author>
<author>
<name sortKey="Lee, V" uniqKey="Lee V">V Lee</name>
</author>
<author>
<name sortKey="Dimmer, E" uniqKey="Dimmer E">E Dimmer</name>
</author>
<author>
<name sortKey="Maslen, J" uniqKey="Maslen J">J Maslen</name>
</author>
<author>
<name sortKey="Binns, D" uniqKey="Binns D">D Binns</name>
</author>
<author>
<name sortKey="Harte, N" uniqKey="Harte N">N Harte</name>
</author>
<author>
<name sortKey="Lopez, R" uniqKey="Lopez R">R Lopez</name>
</author>
<author>
<name sortKey="Apweiler, R" uniqKey="Apweiler R">R Apweiler</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Ashburner, M" uniqKey="Ashburner M">M Ashburner</name>
</author>
<author>
<name sortKey="Ball, Ca" uniqKey="Ball C">CA Ball</name>
</author>
<author>
<name sortKey="Blake, Ja" uniqKey="Blake J">JA Blake</name>
</author>
<author>
<name sortKey="Botstein, D" uniqKey="Botstein D">D Botstein</name>
</author>
<author>
<name sortKey="Butler, H" uniqKey="Butler H">H Butler</name>
</author>
<author>
<name sortKey="Cherry, Jm" uniqKey="Cherry J">JM Cherry</name>
</author>
<author>
<name sortKey="Davis, Ap" uniqKey="Davis A">AP Davis</name>
</author>
<author>
<name sortKey="Dolinski, K" uniqKey="Dolinski K">K Dolinski</name>
</author>
<author>
<name sortKey="Dwight, Ss" uniqKey="Dwight S">SS Dwight</name>
</author>
<author>
<name sortKey="Eppig, Jt" uniqKey="Eppig J">JT Eppig</name>
</author>
<author>
<name sortKey="Harris, Ma" uniqKey="Harris M">MA Harris</name>
</author>
<author>
<name sortKey="Hill, Dp" uniqKey="Hill D">DP Hill</name>
</author>
<author>
<name sortKey="Issel Tarver, L" uniqKey="Issel Tarver L">L Issel-Tarver</name>
</author>
<author>
<name sortKey="Kasarskis, A" uniqKey="Kasarskis A">A Kasarskis</name>
</author>
<author>
<name sortKey="Lewis, S" uniqKey="Lewis S">S Lewis</name>
</author>
<author>
<name sortKey="Matese, Jc" uniqKey="Matese J">JC Matese</name>
</author>
<author>
<name sortKey="Richardson, Je" uniqKey="Richardson J">JE Richardson</name>
</author>
<author>
<name sortKey="Ringwald, M" uniqKey="Ringwald M">M Ringwald</name>
</author>
<author>
<name sortKey="Rubin, Gm" uniqKey="Rubin G">GM Rubin</name>
</author>
<author>
<name sortKey="Sherlock, G" uniqKey="Sherlock G">G Sherlock</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Wheeler, Dl" uniqKey="Wheeler D">DL Wheeler</name>
</author>
<author>
<name sortKey="Barrett, T" uniqKey="Barrett T">T Barrett</name>
</author>
<author>
<name sortKey="Benson, Da" uniqKey="Benson D">DA Benson</name>
</author>
<author>
<name sortKey="Bryant, Sh" uniqKey="Bryant S">SH Bryant</name>
</author>
<author>
<name sortKey="Canese, K" uniqKey="Canese K">K Canese</name>
</author>
<author>
<name sortKey="Chetvernin, V" uniqKey="Chetvernin V">V Chetvernin</name>
</author>
<author>
<name sortKey="Church, Dm" uniqKey="Church D">DM Church</name>
</author>
<author>
<name sortKey="Dicuccio, M" uniqKey="Dicuccio M">M DiCuccio</name>
</author>
<author>
<name sortKey="Edgar, R" uniqKey="Edgar R">R Edgar</name>
</author>
<author>
<name sortKey="Federhen, S" uniqKey="Federhen S">S Federhen</name>
</author>
<author>
<name sortKey="Geer, Ly" uniqKey="Geer L">LY Geer</name>
</author>
<author>
<name sortKey="Kapustin, Y" uniqKey="Kapustin Y">Y Kapustin</name>
</author>
<author>
<name sortKey="Khovayko, O" uniqKey="Khovayko O">O Khovayko</name>
</author>
<author>
<name sortKey="Landsman, D" uniqKey="Landsman D">D Landsman</name>
</author>
<author>
<name sortKey="Lipman, Dj" uniqKey="Lipman D">DJ Lipman</name>
</author>
<author>
<name sortKey="Madden, Tl" uniqKey="Madden T">TL Madden</name>
</author>
<author>
<name sortKey="Maglott, Dr" uniqKey="Maglott D">DR Maglott</name>
</author>
<author>
<name sortKey="Ostell, J" uniqKey="Ostell J">J Ostell</name>
</author>
<author>
<name sortKey="Miller, V" uniqKey="Miller V">V Miller</name>
</author>
<author>
<name sortKey="Pruitt, Kd" uniqKey="Pruitt K">KD Pruitt</name>
</author>
<author>
<name sortKey="Schuler, Gd" uniqKey="Schuler G">GD Schuler</name>
</author>
<author>
<name sortKey="Sequeira, E" uniqKey="Sequeira E">E Sequeira</name>
</author>
<author>
<name sortKey="Sherry, St" uniqKey="Sherry S">ST Sherry</name>
</author>
<author>
<name sortKey="Sirotkin, K" uniqKey="Sirotkin K">K Sirotkin</name>
</author>
<author>
<name sortKey="Souvorov, A" uniqKey="Souvorov A">A Souvorov</name>
</author>
<author>
<name sortKey="Starchecko, G" uniqKey="Starchecko G">G Starchecko</name>
</author>
<author>
<name sortKey="Tatusov, Rl" uniqKey="Tatusov R">RL Tatusov</name>
</author>
<author>
<name sortKey="Tatusova, Ta" uniqKey="Tatusova T">TA Tatusova</name>
</author>
<author>
<name sortKey="Wagner, L" uniqKey="Wagner L">L Wagner</name>
</author>
<author>
<name sortKey="Yaschenko, E" uniqKey="Yaschenko E">E Yaschenko</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Hewett, M" uniqKey="Hewett M">M Hewett</name>
</author>
<author>
<name sortKey="Oliver, De" uniqKey="Oliver D">DE Oliver</name>
</author>
<author>
<name sortKey="Rubin, Dl" uniqKey="Rubin D">DL Rubin</name>
</author>
<author>
<name sortKey="Easton, Kl" uniqKey="Easton K">KL Easton</name>
</author>
<author>
<name sortKey="Stuart, Jm" uniqKey="Stuart J">JM Stuart</name>
</author>
<author>
<name sortKey="Altman, Rb" uniqKey="Altman R">RB Altman</name>
</author>
<author>
<name sortKey="Klein, Te" uniqKey="Klein T">TE Klein</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Settles, B" uniqKey="Settles B">B Settles</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Park, Yk" uniqKey="Park Y">YK Park</name>
</author>
<author>
<name sortKey="Kang, Tw" uniqKey="Kang T">TW Kang</name>
</author>
<author>
<name sortKey="Baek, Sj" uniqKey="Baek S">SJ Baek</name>
</author>
<author>
<name sortKey="Kim, Ki" uniqKey="Kim K">KI Kim</name>
</author>
<author>
<name sortKey="Kim, Sy" uniqKey="Kim S">SY Kim</name>
</author>
<author>
<name sortKey="Lee, D" uniqKey="Lee D">D Lee</name>
</author>
<author>
<name sortKey="Kim, Ys" uniqKey="Kim Y">YS Kim</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Wagner, Pd" uniqKey="Wagner P">PD Wagner</name>
</author>
<author>
<name sortKey="Srivastava, S" uniqKey="Srivastava S">S Srivastava</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Bigbee, Wl" uniqKey="Bigbee W">WL Bigbee</name>
</author>
<author>
<name sortKey="Gopalakrishnan, V" uniqKey="Gopalakrishnan V">V Gopalakrishnan</name>
</author>
<author>
<name sortKey="Weissfeld, Jl" uniqKey="Weissfeld J">JL Weissfeld</name>
</author>
<author>
<name sortKey="Wilson, Do" uniqKey="Wilson D">DO Wilson</name>
</author>
<author>
<name sortKey="Dacic, S" uniqKey="Dacic S">S Dacic</name>
</author>
<author>
<name sortKey="Lokshin, Ae" uniqKey="Lokshin A">AE Lokshin</name>
</author>
<author>
<name sortKey="Siegfried, Jm" uniqKey="Siegfried J">JM Siegfried</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
</listBibl>
</div1>
</back>
</TEI>
<pmc article-type="research-article" xml:lang="en">
<pmc-dir>properties open_access</pmc-dir>
<front>
<journal-meta>
<journal-id journal-id-type="nlm-ta">J Clin Bioinforma</journal-id>
<journal-id journal-id-type="iso-abbrev">J Clin Bioinforma</journal-id>
<journal-title-group>
<journal-title>Journal of Clinical Bioinformatics</journal-title>
</journal-title-group>
<issn pub-type="epub">2043-9113</issn>
<publisher>
<publisher-name>BioMed Central</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="pmid">25379168</article-id>
<article-id pub-id-type="pmc">4215335</article-id>
<article-id pub-id-type="publisher-id">2043-9113-4-13</article-id>
<article-id pub-id-type="doi">10.1186/2043-9113-4-13</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Research</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Semi-automated literature mining to identify putative biomarkers of disease from multiple biofluids</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes" id="A1">
<name>
<surname>Jordan</surname>
<given-names>Rick</given-names>
</name>
<xref ref-type="aff" rid="I1">1</xref>
<email>rmj12@pitt.edu</email>
</contrib>
<contrib contrib-type="author" id="A2">
<name>
<surname>Visweswaran</surname>
<given-names>Shyam</given-names>
</name>
<xref ref-type="aff" rid="I1">1</xref>
<xref ref-type="aff" rid="I2">2</xref>
<xref ref-type="aff" rid="I3">3</xref>
<email>shv3@pitt.edu</email>
</contrib>
<contrib contrib-type="author" id="A3">
<name>
<surname>Gopalakrishnan</surname>
<given-names>Vanathi</given-names>
</name>
<xref ref-type="aff" rid="I1">1</xref>
<xref ref-type="aff" rid="I2">2</xref>
<xref ref-type="aff" rid="I3">3</xref>
<email>vanathi@pitt.edu</email>
</contrib>
</contrib-group>
<aff id="I1">
<label>1</label>
Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA, USA</aff>
<aff id="I2">
<label>2</label>
Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA, USA</aff>
<aff id="I3">
<label>3</label>
Department of Computational & Systems Biology, University of Pittsburgh, Pittsburgh, PA, USA</aff>
<pub-date pub-type="collection">
<year>2014</year>
</pub-date>
<pub-date pub-type="epub">
<day>23</day>
<month>10</month>
<year>2014</year>
</pub-date>
<volume>4</volume>
<fpage>13</fpage>
<lpage>13</lpage>
<history>
<date date-type="received">
<day>26</day>
<month>6</month>
<year>2014</year>
</date>
<date date-type="accepted">
<day>2</day>
<month>10</month>
<year>2014</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright © 2014 Jordan et al.; licensee BioMed Central Ltd.</copyright-statement>
<copyright-year>2014</copyright-year>
<copyright-holder>Jordan et al.; licensee BioMed Central Ltd.</copyright-holder>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0">
<license-p>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (
<ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/4.0">http://creativecommons.org/licenses/by/4.0</ext-link>
), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (
<ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/publicdomain/zero/1.0/">http://creativecommons.org/publicdomain/zero/1.0/</ext-link>
) applies to the data made available in this article, unless otherwise stated.</license-p>
</license>
</permissions>
<self-uri xlink:href="http://www.jclinbioinformatics.com/content/4/1/13"></self-uri>
<abstract>
<sec>
<title>Background</title>
<p>Computational methods for mining of biomedical literature can be useful in augmenting manual searches of the literature using keywords for disease-specific biomarker discovery from biofluids. In this work, we develop and apply a semi-automated literature mining method to mine abstracts obtained from PubMed to discover putative biomarkers of breast and lung cancers in specific biofluids.</p>
</sec>
<sec>
<title>Methodology</title>
<p>A positive set of abstracts was defined by the terms ‘breast cancer’ and ‘lung cancer’ in conjunction with 14 separate ‘biofluids’ (bile, blood, breastmilk, cerebrospinal fluid, mucus, plasma, saliva, semen, serum, synovial fluid, stool, sweat, tears, and urine), while a negative set of abstracts was defined by the terms ‘(biofluid) NOT breast cancer’ or ‘(biofluid) NOT lung cancer.’ More than 5.3 million total abstracts were obtained from PubMed and examined for biomarker-disease-biofluid associations (34,296 positive and 2,653,396 negative for breast cancer; 28,355 positive and 2,595,034 negative for lung cancer). Biological entities such as genes and proteins were tagged using ABNER, and processed using Python scripts to produce a list of putative biomarkers. Z-scores were calculated, ranked, and used to determine significance of putative biomarkers found. Manual verification of relevant abstracts was performed to assess our method’s performance.</p>
</sec>
<sec>
<title>Results</title>
<p>Biofluid-specific markers were identified from the literature, assigned relevance scores based on frequency of occurrence, and validated using known biomarker lists and/or databases for lung and breast cancer [NCBI’s On-line Mendelian Inheritance in Man (OMIM), Cancer Gene annotation server for cancer genomics (CAGE), NCBI’s Genes & Disease, NCI’s Early Detection Research Network (EDRN), and others]. The specificity of each marker for a given biofluid was calculated, and the performance of our semi-automated literature mining method assessed for breast and lung cancer.</p>
</sec>
<sec>
<title>Conclusions</title>
<p>We developed a semi-automated process for determining a list of putative biomarkers for breast and lung cancer. New knowledge is presented in the form of biomarker lists; ranked, newly discovered biomarker-disease-biofluid relationships; and biomarker specificity across biofluids.</p>
</sec>
</abstract>
<kwd-group>
<kwd>Literature mining</kwd>
<kwd>Text mining</kwd>
<kwd>Lung cancer</kwd>
<kwd>Breast cancer</kwd>
<kwd>Biomarker</kwd>
<kwd>Biofluid</kwd>
</kwd-group>
</article-meta>
</front>
</pmc>
<affiliations>
<list>
<country>
<li>États-Unis</li>
</country>
<region>
<li>Pennsylvanie</li>
</region>
<settlement>
<li>Pittsburgh</li>
</settlement>
<orgName>
<li>Université de Pittsburgh</li>
</orgName>
</list>
<tree>
<country name="États-Unis">
<region name="Pennsylvanie">
<name sortKey="Jordan, Rick" sort="Jordan, Rick" uniqKey="Jordan R" first="Rick" last="Jordan">Rick Jordan</name>
</region>
<name sortKey="Gopalakrishnan, Vanathi" sort="Gopalakrishnan, Vanathi" uniqKey="Gopalakrishnan V" first="Vanathi" last="Gopalakrishnan">Vanathi Gopalakrishnan</name>
<name sortKey="Gopalakrishnan, Vanathi" sort="Gopalakrishnan, Vanathi" uniqKey="Gopalakrishnan V" first="Vanathi" last="Gopalakrishnan">Vanathi Gopalakrishnan</name>
<name sortKey="Gopalakrishnan, Vanathi" sort="Gopalakrishnan, Vanathi" uniqKey="Gopalakrishnan V" first="Vanathi" last="Gopalakrishnan">Vanathi Gopalakrishnan</name>
<name sortKey="Visweswaran, Shyam" sort="Visweswaran, Shyam" uniqKey="Visweswaran S" first="Shyam" last="Visweswaran">Shyam Visweswaran</name>
<name sortKey="Visweswaran, Shyam" sort="Visweswaran, Shyam" uniqKey="Visweswaran S" first="Shyam" last="Visweswaran">Shyam Visweswaran</name>
<name sortKey="Visweswaran, Shyam" sort="Visweswaran, Shyam" uniqKey="Visweswaran S" first="Shyam" last="Visweswaran">Shyam Visweswaran</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Informatique/explor/SgmlV1/Data/Pmc/Checkpoint
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000018 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Pmc/Checkpoint/biblio.hfd -nk 000018 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Informatique
   |area=    SgmlV1
   |flux=    Pmc
   |étape=   Checkpoint
   |type=    RBID
   |clé=     PMC:4215335
   |texte=   Semi-automated literature mining to identify putative biomarkers of disease from multiple biofluids
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Pmc/Checkpoint/RBID.i   -Sk "pubmed:25379168" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Pmc/Checkpoint/biblio.hfd   \
       | NlmPubMed2Wicri -a SgmlV1 

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jul 1 14:26:08 2019. Site generation: Wed Apr 28 21:40:44 2021