Serveur d'exploration Cyberinfrastructure

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Development of the Lymphoma Enterprise Architecture Database: A caBIG(tm) Silver level compliant System

Identifieur interne : 000512 ( Pmc/Corpus ); précédent : 000511; suivant : 000513

Development of the Lymphoma Enterprise Architecture Database: A caBIG(tm) Silver level compliant System

Auteurs : Taoying Huang ; Pareen J. Shenoy ; Rajni Sinha ; Michael Graiser ; Kevin W. Bumpers ; Christopher R. Flowers

Source :

RBID : PMC:2675136

Abstract

Lymphomas are the fifth most common cancer in United States with numerous histological subtypes. Integrating existing clinical information on lymphoma patients provides a platform for understanding biological variability in presentation and treatment response and aids development of novel therapies. We developed a cancer Biomedical Informatics Grid™ (caBIG™) Silver level compliant lymphoma database, called the Lymphoma Enterprise Architecture Data-system™ (LEAD™), which integrates the pathology, pharmacy, laboratory, cancer registry, clinical trials, and clinical data from institutional databases. We utilized the Cancer Common Ontological Representation Environment Software Development Kit (caCORE SDK) provided by National Cancer Institute’s Center for Bioinformatics to establish the LEAD™ platform for data management. The caCORE SDK generated system utilizes an n-tier architecture with open Application Programming Interfaces, controlled vocabularies, and registered metadata to achieve semantic integration across multiple cancer databases. We demonstrated that the data elements and structures within LEAD™ could be used to manage clinical research data from phase 1 clinical trials, cohort studies, and registry data from the Surveillance Epidemiology and End Results database. This work provides a clear example of how semantic technologies from caBIG™ can be applied to support a wide range of clinical and research tasks, and integrate data from disparate systems into a single architecture. This illustrates the central importance of caBIG™ to the management of clinical and biological data.


Url:
PubMed: 19492074
PubMed Central: 2675136

Links to Exploration step

PMC:2675136

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Development of the Lymphoma Enterprise Architecture Database: A caBIG(tm) Silver level compliant System</title>
<author>
<name sortKey="Huang, Taoying" sort="Huang, Taoying" uniqKey="Huang T" first="Taoying" last="Huang">Taoying Huang</name>
</author>
<author>
<name sortKey="Shenoy, Pareen J" sort="Shenoy, Pareen J" uniqKey="Shenoy P" first="Pareen J." last="Shenoy">Pareen J. Shenoy</name>
</author>
<author>
<name sortKey="Sinha, Rajni" sort="Sinha, Rajni" uniqKey="Sinha R" first="Rajni" last="Sinha">Rajni Sinha</name>
</author>
<author>
<name sortKey="Graiser, Michael" sort="Graiser, Michael" uniqKey="Graiser M" first="Michael" last="Graiser">Michael Graiser</name>
</author>
<author>
<name sortKey="Bumpers, Kevin W" sort="Bumpers, Kevin W" uniqKey="Bumpers K" first="Kevin W." last="Bumpers">Kevin W. Bumpers</name>
</author>
<author>
<name sortKey="Flowers, Christopher R" sort="Flowers, Christopher R" uniqKey="Flowers C" first="Christopher R." last="Flowers">Christopher R. Flowers</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">19492074</idno>
<idno type="pmc">2675136</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2675136</idno>
<idno type="RBID">PMC:2675136</idno>
<date when="2009">2009</date>
<idno type="wicri:Area/Pmc/Corpus">000512</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">Development of the Lymphoma Enterprise Architecture Database: A caBIG(tm) Silver level compliant System</title>
<author>
<name sortKey="Huang, Taoying" sort="Huang, Taoying" uniqKey="Huang T" first="Taoying" last="Huang">Taoying Huang</name>
</author>
<author>
<name sortKey="Shenoy, Pareen J" sort="Shenoy, Pareen J" uniqKey="Shenoy P" first="Pareen J." last="Shenoy">Pareen J. Shenoy</name>
</author>
<author>
<name sortKey="Sinha, Rajni" sort="Sinha, Rajni" uniqKey="Sinha R" first="Rajni" last="Sinha">Rajni Sinha</name>
</author>
<author>
<name sortKey="Graiser, Michael" sort="Graiser, Michael" uniqKey="Graiser M" first="Michael" last="Graiser">Michael Graiser</name>
</author>
<author>
<name sortKey="Bumpers, Kevin W" sort="Bumpers, Kevin W" uniqKey="Bumpers K" first="Kevin W." last="Bumpers">Kevin W. Bumpers</name>
</author>
<author>
<name sortKey="Flowers, Christopher R" sort="Flowers, Christopher R" uniqKey="Flowers C" first="Christopher R." last="Flowers">Christopher R. Flowers</name>
</author>
</analytic>
<series>
<title level="j">Cancer Informatics</title>
<idno type="eISSN">1176-9351</idno>
<imprint>
<date when="2009">2009</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<p>Lymphomas are the fifth most common cancer in United States with numerous histological subtypes. Integrating existing clinical information on lymphoma patients provides a platform for understanding biological variability in presentation and treatment response and aids development of novel therapies. We developed a cancer Biomedical Informatics Grid™ (caBIG™) Silver level compliant lymphoma database, called the Lymphoma Enterprise Architecture Data-system™ (LEAD™), which integrates the pathology, pharmacy, laboratory, cancer registry, clinical trials, and clinical data from institutional databases. We utilized the Cancer Common Ontological Representation Environment Software Development Kit (caCORE SDK) provided by National Cancer Institute’s Center for Bioinformatics to establish the LEAD™ platform for data management. The caCORE SDK generated system utilizes an n-tier architecture with open Application Programming Interfaces, controlled vocabularies, and registered metadata to achieve semantic integration across multiple cancer databases. We demonstrated that the data elements and structures within LEAD™ could be used to manage clinical research data from phase 1 clinical trials, cohort studies, and registry data from the Surveillance Epidemiology and End Results database. This work provides a clear example of how semantic technologies from caBIG™ can be applied to support a wide range of clinical and research tasks, and integrate data from disparate systems into a single architecture. This illustrates the central importance of caBIG™ to the management of clinical and biological data.</p>
</div>
</front>
<back>
<div1 type="bibliography">
<listBibl>
<biblStruct>
<analytic>
<author>
<name sortKey="Zhong, Y" uniqKey="Zhong Y">Y Zhong</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Jemal, A" uniqKey="Jemal A">A Jemal</name>
</author>
<author>
<name sortKey="Siegel, R" uniqKey="Siegel R">R Siegel</name>
</author>
<author>
<name sortKey="Ward, Em" uniqKey="Ward E">EM Ward</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Ottensmeier, C" uniqKey="Ottensmeier C">C Ottensmeier</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Graiser, M" uniqKey="Graiser M">M Graiser</name>
</author>
<author>
<name sortKey="Moore, Sg" uniqKey="Moore S">SG Moore</name>
</author>
<author>
<name sortKey="Victor, R" uniqKey="Victor R">R Victor</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Buetow, Kh" uniqKey="Buetow K">KH Buetow</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Covitz, P" uniqKey="Covitz P">P Covitz</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Deering, Mj" uniqKey="Deering M">MJ Deering</name>
</author>
<author>
<name sortKey="Derr, L" uniqKey="Derr L">L Derr</name>
</author>
<author>
<name sortKey="Hadfield, J" uniqKey="Hadfield J">J Hadfield</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Jaffe, Es" uniqKey="Jaffe E">ES Jaffe</name>
</author>
<author>
<name sortKey="Harris, Nl" uniqKey="Harris N">NL Harris</name>
</author>
<author>
<name sortKey="Stein, H" uniqKey="Stein H">H Stein</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Harris, Nl" uniqKey="Harris N">NL Harris</name>
</author>
<author>
<name sortKey="Jaffe, Es" uniqKey="Jaffe E">ES Jaffe</name>
</author>
<author>
<name sortKey="Stein, H" uniqKey="Stein H">H Stein</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Covitz, Pa" uniqKey="Covitz P">PA Covitz</name>
</author>
<author>
<name sortKey="Hartel, F" uniqKey="Hartel F">F Hartel</name>
</author>
<author>
<name sortKey="Schaefer, C" uniqKey="Schaefer C">C Schaefer</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Tobias, J" uniqKey="Tobias J">J Tobias</name>
</author>
<author>
<name sortKey="Chilukuri, R" uniqKey="Chilukuri R">R Chilukuri</name>
</author>
<author>
<name sortKey="Komatsoulis, G" uniqKey="Komatsoulis G">G Komatsoulis</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Wang, H" uniqKey="Wang H">H Wang</name>
</author>
<author>
<name sortKey="Lee, J" uniqKey="Lee J">J Lee</name>
</author>
<author>
<name sortKey="Buck, K" uniqKey="Buck K">K Buck</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Wang, H" uniqKey="Wang H">H Wang</name>
</author>
<author>
<name sortKey="Lee, J" uniqKey="Lee J">J Lee</name>
</author>
<author>
<name sortKey="Buck, K" uniqKey="Buck K">K Buck</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Patel, S" uniqKey="Patel S">S Patel</name>
</author>
<author>
<name sortKey="Dumitru, D" uniqKey="Dumitru D">D Dumitru</name>
</author>
<author>
<name sortKey="Griffin, C" uniqKey="Griffin C">C Griffin</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Phillips, J" uniqKey="Phillips J">J Phillips</name>
</author>
<author>
<name sortKey="Chilukuri, R" uniqKey="Chilukuri R">R Chilukuri</name>
</author>
<author>
<name sortKey="Fragoso, G" uniqKey="Fragoso G">G Fragoso</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Berners Lee, T" uniqKey="Berners Lee T">T Berners-Lee</name>
</author>
<author>
<name sortKey="Hendler, J" uniqKey="Hendler J">J Hendler</name>
</author>
<author>
<name sortKey="Lassila, O" uniqKey="Lassila O">O Lassila</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Wang, X" uniqKey="Wang X">X Wang</name>
</author>
<author>
<name sortKey="Gorlitsky, R" uniqKey="Gorlitsky R">R Gorlitsky</name>
</author>
<author>
<name sortKey="Almeida, Js" uniqKey="Almeida J">JS Almeida</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Stevens, R" uniqKey="Stevens R">R Stevens</name>
</author>
<author>
<name sortKey="Bodenreider, O" uniqKey="Bodenreider O">O Bodenreider</name>
</author>
<author>
<name sortKey="Lussier, Ya" uniqKey="Lussier Y">YA Lussier</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Fritz, A" uniqKey="Fritz A">A Fritz</name>
</author>
<author>
<name sortKey="Percy, C" uniqKey="Percy C">C Percy</name>
</author>
<author>
<name sortKey="Jack, A" uniqKey="Jack A">A Jack</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Jaffe, Es" uniqKey="Jaffe E">ES Jaffe</name>
</author>
<author>
<name sortKey="Harris, Nl" uniqKey="Harris N">NL Harris</name>
</author>
<author>
<name sortKey="Stein, H" uniqKey="Stein H">H Stein</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Buck, K" uniqKey="Buck K">K Buck</name>
</author>
<author>
<name sortKey="Lee, J" uniqKey="Lee J">J Lee</name>
</author>
<author>
<name sortKey="Wang, H" uniqKey="Wang H">H Wang</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Morton, Lm" uniqKey="Morton L">LM Morton</name>
</author>
<author>
<name sortKey="Wang, Ss" uniqKey="Wang S">SS Wang</name>
</author>
<author>
<name sortKey="Devesa, Ss" uniqKey="Devesa S">SS Devesa</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Abouyabis, An" uniqKey="Abouyabis A">AN AbouYabis</name>
</author>
<author>
<name sortKey="Shenoy, Pj" uniqKey="Shenoy P">PJ Shenoy</name>
</author>
<author>
<name sortKey="Lechowicz, Mj" uniqKey="Lechowicz M">MJ Lechowicz</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Baer, D" uniqKey="Baer D">D Baer</name>
</author>
<author>
<name sortKey="Groenewoud, P" uniqKey="Groenewoud P">P Groenewoud</name>
</author>
<author>
<name sortKey="Kapetanios, E" uniqKey="Kapetanios E">E Kapetanios</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Shironoshita, Ep" uniqKey="Shironoshita E">EP Shironoshita</name>
</author>
<author>
<name sortKey="Jean Ary, Yr" uniqKey="Jean Ary Y">YR Jean–Mary</name>
</author>
<author>
<name sortKey="Bradley, Rm" uniqKey="Bradley R">RM Bradley</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Karp, Pd" uniqKey="Karp P">PD Karp</name>
</author>
<author>
<name sortKey="Chaudhri, Vk" uniqKey="Chaudhri V">VK Chaudhri</name>
</author>
<author>
<name sortKey="Thomere, J" uniqKey="Thomere J">J Thomere</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
</listBibl>
</div1>
</back>
</TEI>
<pmc article-type="research-article">
<pmc-dir>properties open_access</pmc-dir>
<front>
<journal-meta>
<journal-id journal-id-type="nlm-ta">Cancer Inform</journal-id>
<journal-id journal-id-type="iso-abbrev">Cancer Inform</journal-id>
<journal-id journal-id-type="publisher-id">101258149</journal-id>
<journal-title-group>
<journal-title>Cancer Informatics</journal-title>
</journal-title-group>
<issn pub-type="epub">1176-9351</issn>
<publisher>
<publisher-name>Libertas Academica</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="pmid">19492074</article-id>
<article-id pub-id-type="pmc">2675136</article-id>
<article-id pub-id-type="publisher-id">cin-08-45</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Original Research</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Development of the Lymphoma Enterprise Architecture Database: A caBIG(tm) Silver level compliant System</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Huang</surname>
<given-names>Taoying</given-names>
</name>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Shenoy</surname>
<given-names>Pareen J.</given-names>
</name>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Sinha</surname>
<given-names>Rajni</given-names>
</name>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Graiser</surname>
<given-names>Michael</given-names>
</name>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Bumpers</surname>
<given-names>Kevin W.</given-names>
</name>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Flowers</surname>
<given-names>Christopher R.</given-names>
</name>
<xref ref-type="corresp" rid="c1-cin-08-45"></xref>
</contrib>
<aff id="af1-cin-08-45">Winship Cancer Institute, School of Medicine, Emory University, Atlanta, GA, U.S.A</aff>
</contrib-group>
<author-notes>
<corresp id="c1-cin-08-45">Correspondence: Christopher R. Flowers, Emory University, Winship Cancer Institute, 1365 Clifton Road, N.E. Building C, Suite 3006, Atlanta, GA, U.S.A. Tel: 404-778-5554; Fax: 404-778-5520; Email:
<email>christopher.flowers@emoryhealthcare.org</email>
</corresp>
</author-notes>
<pub-date pub-type="epub">
<day>3</day>
<month>4</month>
<year>2009</year>
</pub-date>
<pub-date pub-type="collection">
<year>2009</year>
</pub-date>
<volume>8</volume>
<fpage>45</fpage>
<lpage>64</lpage>
<permissions>
<copyright-statement>© 2009 The authors.</copyright-statement>
<copyright-year>2009</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/3.0">
<license-p>
<pmc-comment>CREATIVE COMMONS</pmc-comment>
This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (
<ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/3.0/">http://creativecommons.org/licenses/by/3.0/</ext-link>
).</license-p>
</license>
</permissions>
<abstract>
<p>Lymphomas are the fifth most common cancer in United States with numerous histological subtypes. Integrating existing clinical information on lymphoma patients provides a platform for understanding biological variability in presentation and treatment response and aids development of novel therapies. We developed a cancer Biomedical Informatics Grid™ (caBIG™) Silver level compliant lymphoma database, called the Lymphoma Enterprise Architecture Data-system™ (LEAD™), which integrates the pathology, pharmacy, laboratory, cancer registry, clinical trials, and clinical data from institutional databases. We utilized the Cancer Common Ontological Representation Environment Software Development Kit (caCORE SDK) provided by National Cancer Institute’s Center for Bioinformatics to establish the LEAD™ platform for data management. The caCORE SDK generated system utilizes an n-tier architecture with open Application Programming Interfaces, controlled vocabularies, and registered metadata to achieve semantic integration across multiple cancer databases. We demonstrated that the data elements and structures within LEAD™ could be used to manage clinical research data from phase 1 clinical trials, cohort studies, and registry data from the Surveillance Epidemiology and End Results database. This work provides a clear example of how semantic technologies from caBIG™ can be applied to support a wide range of clinical and research tasks, and integrate data from disparate systems into a single architecture. This illustrates the central importance of caBIG™ to the management of clinical and biological data.</p>
</abstract>
<kwd-group>
<kwd>biomedical informatics grid</kwd>
<kwd>non-Hodgkin’s lymphoma</kwd>
<kwd>large linked database</kwd>
<kwd>semantic integration</kwd>
<kwd>caCORE SDK</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec>
<title>Background</title>
<p>Lymphomas are a heterogeneous group of cancers that are characterized by abnormal growth of tissue in the lymphatic system. These disorders originate from B-lymphocytes, T-lymphocytes, and natural killer (NK) cells. Between 1950 and 1999, the incidence of Non-Hodgkin’s Lymphoma (NHL) increased by 90% in the United States,
<xref rid="b1-cin-08-45" ref-type="bibr">1</xref>
resulting in one of the largest documented increases in cancer. This rapid increase may be a result of improved diagnostic techniques and access to medical care, the rise in HIV-related NHLs, or other causes. Currently, NHL represents approximately 4% of all cancer diagnoses, being the fifth most common cancer among men and women.
<xref rid="b2-cin-08-45" ref-type="bibr">2</xref>
In 2008, 66,120 new cases of NHL and 8,220 new cases of Hodgkin lymphoma (HL) are expected to be diagnosed in the United States.
<xref rid="b2-cin-08-45" ref-type="bibr">2</xref>
</p>
<p>The World Health Organization (WHO) classification system divides lymphomas according to the cell of origin (B, T/NK) and incorporates morphology, immunophenotype, genetic, and clinical features to define subtypes. Approximately 85% of all NHLs are of B-cell origin and the remaining 15% of T-cell origin.
<xref rid="b3-cin-08-45" ref-type="bibr">3</xref>
The WHO classification schema for NHL was devised to help aid in prognosis and treatment decision making. The most frequent clinical entities recognized by the WHO classification are diffuse large B-cell lymphoma (DLBCL, 31%) and follicular lymphoma (FL, 22%). The WHO classification divides HL into 2 main types: classical and lymphocyte-predominant (
<xref ref-type="table" rid="t1-cin-08-45">Table 1</xref>
). Classical HL accounts for 95% of the cases and is further divided into 4 subtypes: nodular sclerosis, mixed cellularity, lymphocyte-depleted, and lymphocyte-rich. Current treatment modalities for lymphoma include conventional chemotherapy, immunotherapy, radioimmunotherapy, and stem cell transplant. Prognostic factors such as age, performance status, and number of relapses can influence how a patient will respond to certain treatments.</p>
<p>In order to expedite the development of innovative clinical and therapeutic strategies for lymphoma, our oncology informatics group has been developing means to integrate existing clinical information into database systems that support cancer research.
<xref rid="b4-cin-08-45" ref-type="bibr">4</xref>
To this end, we designed a platform for integrated clinical and biomedical informatics research using patient-level data linking the institution’s existing clinical trials, cancer registry, clinical, administrative, and pharmacy systems with biological databases. However, semantic integration of data from disparate systems remains challenging even when similar concepts are represented in different data systems.</p>
</sec>
<sec>
<title>The Cancer Biomedical Informatics Grid</title>
<p>The cancer Biomedical Informatics Grid (caBIG™)
<xref rid="b5-cin-08-45" ref-type="bibr">5</xref>
,
<xref rid="b6-cin-08-45" ref-type="bibr">6</xref>
is a voluntary network or grid linking individuals and institutions to promote the sharing of data and tools. The caBIG™ development and research covers clinical trials management systems, tissue banks, pathology tools, integrated cancer research, system architecture, vocabularies, common data elements, data sharing, and intellectual capital. The infrastructure and tools established by caBIG™ are likely also to have broad utility outside the cancer community. Currently, more than 900 individuals from over 50 National Cancer Institute (NCI)-designated cancer centers and a multitude of other organizations are working collaboratively on over 70 projects as part of the caBIG™ initiative.
<xref rid="b5-cin-08-45" ref-type="bibr">5</xref>
,
<xref rid="b7-cin-08-45" ref-type="bibr">7</xref>
</p>
<p>The systems developed within the caBIG™ community can be organized into four levels of maturity based on different degrees of interoperability defined by the caBIG™ Compatibility Guidelines: Legacy, Bronze, Silver, and Gold.
<xref rid="b7-cin-08-45" ref-type="bibr">7</xref>
,
<xref rid="b8-cin-08-45" ref-type="bibr">8</xref>
Legacy compliance implies no interoperability with an external system or resource. In order to achieve Bronze compatibility, the resource should provide at least programmatic access to data through a public, documented application programming interfaces (API). Silver compatibility requires more conditions, which must provide well-documented API that is based upon an object-oriented abstraction of the underlying data. Gold compatibility includes a service–oriented data and analytical service grid with standardized service advertising and discovery features, and grid–level security strategy.
<xref ref-type="table" rid="t2-cin-08-45">Table 2</xref>
details the pertinent categories that must be addressed to obtain caBIG™ Silver compliance: 1) programming and messaging interfaces; 2) vocabularies, terminologies, and ontologies; 3) data elements; and 4) information models (shown in
<xref ref-type="table" rid="t2-cin-08-45">Table 2</xref>
).
<xref rid="b7-cin-08-45" ref-type="bibr">7</xref>
</p>
<p>The caBIG™ is creating a common, extensible informatics platform that can integrate diverse data types and support interoperable analytic tools in areas including clinical trials management, tissue banks and pathology, imaging, and integrative cancer research.
<xref ref-type="table" rid="t3-cin-08-45">Table 3</xref>
displays the various tools, infrastructure, and data resources in caBIG™ and their roles.</p>
<p>The NCI’s Center for Biomedical Informatics and Information Technology (NCIBIIT) has developed a set of software packages to support application development for cancer research, the caCORE Software Development Kit (SDK).
<xref rid="b9-cin-08-45" ref-type="bibr">9</xref>
This SDK provides a platform for data management and semantic integration.</p>
</sec>
<sec>
<title>The Cancer Common Ontological Representation Environment SDK</title>
<p>To establish a common representation within this SDK, the cancer Common Ontological Representation Environment (caCORE) was established to provide a framework for developing syntactically and semantically interoperable biomedical information services. It has several key components: the Enterprise Vocabulary Services (EVS), the cancer Data Standards Repository (caDSR), the Cancer Bioinformatics Infrastructure Objects, and the Common Security Module. A brief description for these components is included in
<xref ref-type="table" rid="t4-cin-08-45">Table 4</xref>
. Complete documentation and updated information on caCORE and its components can be found on the NCI Center for Bioinformatics (NCICB) web site.</p>
<p>A caCORE SDK generated system has two characteristics: 1) a Model Driven Architecture that provides a conceptual framework and standards for expressing the model, relationships between models, and transformations between models using the Meta-Object Facility, Unified Modeling Language (UML), XML Metadata Interchange (XMI), and Common Warehouse Meta-model specifications, and 2) an n-tier architecture with open API. When a caCORE SDK generated system is combined with controlled vocabularies and registered metadata, the resulting system is semantically integrated with all exposed API elements having runtime accessible metadata that defines the meaning of the data elements using a controlled terminology.</p>
<p>The NCICB has developed the EVS to supply controlled vocabularies, and the caDSR to provide a dynamic metadata registry
<xref rid="b10-cin-08-45" ref-type="bibr">10</xref>
specifically for cancer informatics applications. Systems developed using the caCORE methodology use the same approach for defining, registering, and adopting data and representation of standards. Clients of those systems can therefore extract information from multiple data sources using similar API calls, and can rely upon the semantic equivalence of the data retrieved.</p>
</sec>
<sec>
<title>Semantic Structure of caBIG™</title>
<p>One of the problems confronting the biomedical data management community is the vast number of ways that similar or identical concepts are described. Such inconsistency in data descriptors (metadata) makes it challenging to aggregate and manage even modest-sized data sets and share data across current information resources. Consider for instance, the number of different ways that each of the 51 types of lymphoma shown in
<xref ref-type="table" rid="t1-cin-08-45">Table 1</xref>
can be coded if one or more is represented in various administrative, clinical, pathological, radiology, or clinical trials databases. Examples of different coding schemes include International Classification of Diseases, Tenth Revision (ICD-10) diagnosis codes,
<xref rid="b11-cin-08-45" ref-type="bibr">11</xref>
International Classification of Diseases for Oncology (ICD-O) topography and histology codes,
<xref rid="b12-cin-08-45" ref-type="bibr">12</xref>
and institutional representations within clinical trials systems that may be based on the WHO classification system
<xref rid="b13-cin-08-45" ref-type="bibr">13</xref>
or the older Working Formulation, Kiel, or Revised European-American Lymphoma classification systems.
<xref rid="b14-cin-08-45" ref-type="bibr">14</xref>
</p>
<p>In order to address these problems, the NCI created the EVS, which forms the semantic underpinnings of caCORE. Semantic interoperability lies in the UML model, the use of publicly accessible terminologies/vocabularies/ontologies (EVS-NCI Thesaurus) and the use of publicly accessible metadata repository (caDSR). The EVS organizes distinct but overlapping terminologies and thus provides a rich controlled vocabulary for data coding and retrieval including the NCI Thesaurus and the NCI Metathesaurus. The controlled terminology component of caBIG™ is maintained in the NCI Thesaurus. The NCI Metathesaurus is based on National Library Medicine’s UML System Metathesaurus supplemented with additional cancer-centric vocabulary. It maps many biomedical vocabularies useful to the cancer community and contains both public domain and proprietary vocabularies.
<xref rid="b15-cin-08-45" ref-type="bibr">15</xref>
</p>
<p>The caBIG™ organizes semantic metadata in three layers of abstraction, as illustrated in
<xref ref-type="fig" rid="f1-cin-08-45">Figure 1</xref>
. At the top level, semantic concepts are organized through the NCI Thesaurus, and accessed through the EVS. These concepts are related to each other through the use of Common Data Elements (CDEs) which are stored and accessed through the caDSR. The bottom layer is the Domain Model layer where each UML class is linked to a concept within the NCI Thesaurus, each relationship between UML classes is linked to an association, and each relationship between a UML class and an attribute value is linked to a CDE.</p>
<p>Using the caBIG™ semantic modeling methodology, Tobias et al developed a model by which the College of American Pathologists cancer protocols could be used as the basis for an electronic data standard in pathology.
<xref rid="b16-cin-08-45" ref-type="bibr">16</xref>
Wang et al. developed a Lung Cancer Clinical Database Application System using caCORE SDK.
<xref rid="b17-cin-08-45" ref-type="bibr">17</xref>
,
<xref rid="b18-cin-08-45" ref-type="bibr">18</xref>
There are approximately 69 Silver compliant systems registered with the caBIG™.
<xref rid="b19-cin-08-45" ref-type="bibr">19</xref>
The models investigate cancer registries, clinical trials, gene expression, genomics, and behavioral research data management.</p>
<p>The data system described herein is the first system that aids lymphoma research and is registered with caBIG™ (
<ext-link ext-link-type="uri" xlink:href="http://umlmodelbrowser.nci.nih.gov/umlmodelbrowser/">http://umlmodelbrowser.nci.nih.gov/umlmodelbrowser/</ext-link>
). As of writing of this paper, we are not aware of any other lymphoma databases developed using caCORE SDK. However, there are lymphoma databases developed but not registered with caBIG™. For example, the Lymphoma NCI Specialized Programs of Research Excellence initiated at the University of Iowa/Mayo Clinic is a highly successful lymphoma translational research program.
<xref rid="b20-cin-08-45" ref-type="bibr">20</xref>
The Lymphoma Foundation of America is an independent, nonprofit charitable organization that conducts lymphoma research especially on dietary factors, environmental factors, treatment, and genetics.
<xref rid="b21-cin-08-45" ref-type="bibr">21</xref>
Other national lymphoma databases include the Surveillance, Epidemiology, and End Result
<xref rid="b22-cin-08-45" ref-type="bibr">22</xref>
database and National Cancer Database.
<xref rid="b23-cin-08-45" ref-type="bibr">23</xref>
</p>
<p>Using the semantic metadata framework (
<xref ref-type="fig" rid="f1-cin-08-45">Fig. 1</xref>
), we have developed a caBIG™ silver compliant database, the Lymphoma Enterprise Architecture Data-system (LEAD™), that establishes domain specific ontologies and meta–data for lymphoma translational research and lymphoma clinical research. With LEAD™ deployed, we have established reusable data structures for institutional case-control studies, national SEER cohort studies, and lymphoid malignancy clinical trials. This work provides a clear example of how semantic technologies from caBIG™ can be applied to support a wide range of clinical and research tasks, and illustrates the central importance of caBIG™ to the management of clinical and biological data.</p>
</sec>
<sec sec-type="methods">
<title>Methods</title>
<sec>
<title>LEAD™ Development</title>
<p>As a member of caBIG™ community, we followed the caCORE SDK guidance and developed LEAD™ in accordance with the Silver compatibility guidelines. Steps involved in caCORE SDK workflow and the development of LEAD™ are shown in
<xref ref-type="fig" rid="f2-cin-08-45">Figure 2</xref>
.
<xref rid="b24-cin-08-45" ref-type="bibr">24</xref>
The major steps in the workflow include: using case development; information modeling; semantic annotation; metadata registration; code generation; and system deployment.
<xref rid="b25-cin-08-45" ref-type="bibr">25</xref>
LEAD™ development involved creating class diagrams and data models within Enterprise Architecture (EA). The structure and relationships between classes in the LEAD™ model are shown in
<xref ref-type="fig" rid="f3-cin-08-45">Figure 3</xref>
. The actual software code, such as API for data access, data services, is generated from the model.</p>
<p>In the model, classes represent discrete scientific entities. For instance, in LEAD™, a patient’s demographic information is denoted by class Registration, and his/her histological diagnosis is modeled by class Histology. Disparate methods for representing diagnosis (e.g. ICD-9 codes, ICD-O codes, pathology-free text reports) are semantically integrated by mapping to this class. Attributes of each class represent specific characteristics of the entities and become Data Elements in the software system. For example, one of the attributes of Histology is defined as ‘immune_phenotype’. In addition to classes and attributes, the model also specifies the associations between classes including cardinality and direction. For instance, each patient has only one registration record, but he can have multiple adverse events. Thus, the association between class Onstudy and class AdverseEvent is one–to–many relationship. It is important and required that the UML model be annotated with descriptions. This facilitates the subsequent semantic integration. In LEAD™, UML entities are matched to vocabulary concepts; and annotated by an expert in the subject area (CF).</p>
<p>After the UML model was created, the NCI EVS staff were involved the annotation of entities. Once the annotated UML model was approved by the model owner, it was loaded into the caDSR by the NCI EVS staff. The caCORE SDK automatically generates code for web services, an API for data access and a basic class browser. The class browser allows the developer to check the attributes in a class, and to search based on the criteria the user enters. The result set is displayed for all those records that meet the search criteria. Any other classes that have associations with this class are searched and displayed as well. After the data system is deployed biomedical researchers can query the system through well-documented API, web browsers, or web services.</p>
<p>In this manuscript we describe the use of semantic tools to integrate data from heterogeneous systems. LEAD™ compiles data from multiple data different sources with unique data elements including: the ONCORE
<sup>®</sup>
<xref rid="b26-cin-08-45" ref-type="bibr">26</xref>
data from clinical trials, clinical data from Emory University’s legacy administrative (Health Quest: hospital; IDX: clinic), cancer registry (IMPAC Medical Systems), electronic medical records (Cerner PowerChart), laboratory, pharmacy, clinical trials databases
<xref rid="b4-cin-08-45" ref-type="bibr">4</xref>
and data from populated SEER registry. All these data sources have unique representations of data elements that are integrated using caBIG™ semantic tools (
<xref ref-type="fig" rid="f3-cin-08-45">Fig. 3</xref>
). For example, the clinical trials data has information on patient identifiers, clinical and laboratory data, and data on the treatment and response. The Emory University clinical data on the other hand is comprised of linked data from the clinical, administrative, pharmacy and biological databases that contains clinical, laboratory, and treatment response data coded using different schemas and terminology. The SEER data has detailed sociodemographic information in addition to the details on the disease histology, treatments, and survival represented in yet another manner. Through LEAD™ we integrated the data from these data sources into a single comprehensive database with a unified semantic meaning for concepts shared across these databases.</p>
<p>The relationship between entities and classes in the model and data fields from lymphoma cohort studies, SEER registry data, and clinical trials case report forms are shown in
<xref ref-type="fig" rid="f4-cin-08-45">Figure 4</xref>
. The EA model was used to generate an XMI file and Data Definition Language Script. Classes and attributes within the model were iteratively annotated by authors TH, CF, and NCI’s EVS personnel and the final annotated XMI file was uploaded to the caDSR. Once the LEAD™ metadata passed compatibility review, the final APIs were created using the SDK code generator. Development was implemented on and supported by Dell™ PowerEdge™ 1800 web server. Once stabilized, the LEAD™ application was migrated to a Dell™ PowerEdge™ 6800 production server running an Oracle 10 g relational database.</p>
<p>The Semantic Web is a vision for the next generation of the Web.
<xref rid="b27-cin-08-45" ref-type="bibr">27</xref>
The first generation Web is characterized by static and handwritten HTML pages; the second generation is characterized by dynamics and interactive HTML pages. However, these two generations share a similar property; the information on Web is represented only in natural language for human processing. The goal of the next generation, namely Semantic Web, is to make information on the Web available for computational processing.</p>
<p>True semantic integration requires a common and shared vocabulary. The ontology serves this purpose and Semantic Web technology provides language for this goal. The World Wide Web consortium approved two key Semantic Web technologies: the Resource Description Framework (RDF) and the Web Ontology Language (OWL). The RDF provides a common data model so that when RDF is built on top of XML, systems can achieve the level of interoperability required by highly dynamic and integrated bioinformatics applications.
<xref rid="b28-cin-08-45" ref-type="bibr">28</xref>
Both RDF and OWL are Semantic Web standards that provide a framework for sharing the data on the Web. OWL builds on RDF and RDF Schema, and adds more vocabulary for describing properties and classes. Thus, OWL has strict and precise semantics that are not found in RDF Schema. As Semantic Web resources become mature and available, there is an increasing tendency in bioinformatics applications to use Semantic Web technologies.
<xref rid="b29-cin-08-45" ref-type="bibr">29</xref>
This higher level of semantic integration may be possible in future versions of LEAD™.</p>
<p>To use LEAD™, the researcher interacts with a web browser through the Internet to input and query the relevant information (
<xref ref-type="fig" rid="f5-cin-08-45">Fig. 5</xref>
). The web browser sends the user’s request to the web server, the web server parses the requests and transfers the requests to the application server, the application server accesses the backend database using object-relational mapping, generates the required content dynamically, and sends the response back to the web browser through the web server. In order for the outside cancer research community to access the data stored in the lymphoma clinical database application system, the system provides programmatic APIs generated using caCORE SDK code generator.</p>
</sec>
<sec sec-type="methods">
<title>Populating the database with lymphoma data</title>
<p>To populate LEAD™, we utilized data sources from three ongoing research studies: 1) a cohort studies of NHL patients previously treated at Emory University, 2) SEER registry data on patients with lymphoid malignancies, and 3) phase 1 lymphoma clinical trials data.</p>
<sec sec-type="methods">
<title>Emory University Clinical Data</title>
<p>Emory University clinical source data for LEAD™ was derived from a large linked database that represents a fully integrated platform combining clinical and administrative legacy databases.
<xref rid="b4-cin-08-45" ref-type="bibr">4</xref>
This platform interfaces Emory Healthcare’s existing legacy administrative (Health Quest: hospital; IDX: clinic), cancer registry (IMPAC Medical Systems), electronic medical records (Cerner PowerChart), laboratory, pharmacy, clinical trials databases creating a stand-alone structured query language-(SQL) based data warehouse.</p>
<p>We utilized a series of search strategies to identify a joint population of interest containing potential patients with selected NHL subtypes; FL, DLBCL, and mantle cell lymphoma (MCL). We searched the linked database using cancer registry ICD-O histology codes to identify patients: FL(9690, 9695, 9691, 9698), DLBCL (9680, 9684), and MCL (9693).
<xref rid="b30-cin-08-45" ref-type="bibr">30</xref>
The query also included the ICD-O behavior code 3 (malignant neoplasms, primary). This query is labeled Q1 in
<xref ref-type="table" rid="t5A-cin-08-45">Table 5A</xref>
. The next series of queries involved text searches of the electronic medical records using simple free text with the histological diagnosis or more complex free-text including synonyms from the UML System Metathesaurus Concept Search.
<xref rid="b4-cin-08-45" ref-type="bibr">4</xref>
Each text string search was conducted twice, once limited to anatomical pathology (AP) reports, and once accessing all medical records.
<xref ref-type="table" rid="t5A-cin-08-45">Table 5A</xref>
describes the phrases used to identify patients with FL, DLBCL, and MCL. A list of 2471 patients was obtained from the text search of AP reports and 3170 patients were identified from the text search of electronic medical records. Each query defines a different schema for representing patients with a lymphoma diagnosis within the clinical data repository.</p>
<p>Next, a group of trained abstractors (PS, KB, and TH) ascertained each patient’s histological diagnosis by reviewing pathology reports and medical charts. In all cases, WHO classification schema for NHL was utilized as the gold standard for diagnosis.
<xref rid="b31-cin-08-45" ref-type="bibr">31</xref>
A hematological oncologist (CF) resolved all cases where there was uncertainty as to whether the WHO criteria were met. The pathology-verified diagnosis status for each individual was used to calculate the sensitivity and specificity of each query strategy as follows:
<table-wrap id="d35e346" position="anchor">
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" rowspan="1" colspan="1"></th>
<th align="center" rowspan="1" colspan="1">Lymphoma Subtype Positive</th>
<th align="center" rowspan="1" colspan="1">Lymphoma Subtype Negative</th>
<th align="center" rowspan="1" colspan="1"></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" rowspan="1" colspan="1">Query</td>
<td align="center" rowspan="1" colspan="1">a</td>
<td align="center" rowspan="1" colspan="1">b</td>
<td align="center" rowspan="1" colspan="1">Sensitivity</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">Positive</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1">= a/a + c</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">Query</td>
<td align="center" rowspan="1" colspan="1">c</td>
<td align="center" rowspan="1" colspan="1">d</td>
<td align="center" rowspan="1" colspan="1">Specificity</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">Negative</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1">= d/b + d</td>
</tr>
</tbody>
</table>
</table-wrap>
</p>
<p>Individuals identified by AP and medical reports to have a definitive diagnosis of FL, DLBCL, or MCLwere integrated into LEAD™ using the semantic architecture to map individuals identified by the means described in each query into a unified definition of histological subtype. Data elements were mapped into LEAD™ representations using a Perl script and loaded into the LEAD™ Oracle 10 g database.</p>
</sec>
<sec sec-type="methods">
<title>SEER data</title>
<p>The SEER registry is an authoritative source of information on cancer incidence and survival in the United States. The SEER limited-use data include SEER incidence and population data. Eighty-two fields from the SEER dataset have been incorporated into LEAD™. Data were obtained in a tab-delimited format and mapped to LEAD™ data elements.</p>
<p>Next, we built a 2-tier web interface by using Java 2 Enterprise Edition (J2EE) technology to query the data in the relational database. This web interface simplifies reading and searching SEER data. The user interacts with a web interface that uses JSP and a set of rich, custom tag libraries to provide the view. The Java Server Page communicates with the database management system through object-relational mapping. This provides a powerful, flexible platform for allowing users to query SEER lymphoma data. Moreover, the data are stored in a semantic framework such that data analyses examining the SEER lymphoma populations can be readily compared to equivalent populations of Emory patients since they are mapped to the same LEAD™ concept. The generated LEAD™ application system is a distributed, web-based application which provides two interfaces: one is a web-based user interface for inputting and querying data; the second provides programmatic APIs for outside institutions to query the data stored in the backend database using object/relational mapping technology (
<xref ref-type="fig" rid="f5-cin-08-45">Fig. 5</xref>
).</p>
</sec>
<sec sec-type="methods">
<title>Lymphoma clinical trials data</title>
<p>Web forms for the clinical trial database were generated using a web application template developed in J2EE.
<xref rid="b32-cin-08-45" ref-type="bibr">32</xref>
This web application is meta-data driven, which means all the requirements that are used to generate a web form are stored in a relational database (e.g. column names, data types, data length, constraints, etc), allowing all of the web forms to be generated dynamically. If a change needs to be made on the form (e.g. converting a text field to a selection list) no programming effort is required. Through modifying the metadata in the database by SQL, the change is reflected immediately when the form is reloaded. The web application template was used to generate data entry forms for database table.</p>
</sec>
</sec>
</sec>
<sec sec-type="results">
<title>Results</title>
<p>We performed data integration across disparate data sources: 1) to illustrate integration capabilities of the LEAD™ infrastructure, and 2) to establish datasets that facilitate use by physicians and researchers with a common interface.</p>
<sec sec-type="methods">
<title>Integration of emory university clinical data for cohort studies</title>
<p>Query strategies varied in terms of their sensitivity and specificity across lymphoma subtypes, but strategies using ICD-O codes had the most favorable characteristics (
<xref ref-type="table" rid="t5B-cin-08-45">Table 5B</xref>
). A total of 930 patients (324 FL, 519 DLBCL, and 87 MCL) were verified by histological diagnosis. Queries based on cancer registry histology codes (Q1) had high specificity for FL and MCL but not for DLBCL. Simple free-text queries of pathology reports (Q2) or all medical records documents (Q4) had a high sensitivity for FL, DLBCL, and MCL. Simple free-text searches of all medical records identified 92% of potential FL cases, 64% of DLBCL and 89% of potential MCL cases but had varying sensitivity and specificity: FL(97%, 13%); DLBCL (55%, 50%); and MCL (44%, 6%) (
<xref ref-type="table" rid="t5B-cin-08-45">Table 5B</xref>
). Queries using additional phrases from the UML System synonym list (Q3, Q5) had higher specificity for FL and DLBCL and higher sensitivity for MCL.</p>
<p>No query strategy had ideal characteristics for all lymphoma histological subtypes, thus use of a combination of strategies, representations, and terminologies is required to best identify a robust patient population for clinical research. To render data gathered using heterogeneous terminologies useful, we integrated the resulting data set into LEAD™ by mapping heterogeneous representations for lymphoma histology into the unified meaning ‘Histology Type’ shown in
<xref ref-type="fig" rid="f4-cin-08-45">Figure 4</xref>
. The semantically integrated database allows for data management and data analysis where histological diagnosis has a unique clinical research meaning in LEAD™ regardless of how it was coded in the source data set.</p>
</sec>
<sec sec-type="methods|results">
<title>Integration of surveillance epidemiology and end results
<xref rid="b22-cin-08-45" ref-type="bibr">22</xref>
data</title>
<p>SEER registry data for the years 1973–2005 were extracted from the tab-delimited limited-use data set. The entire database contains more than 3.5 million tumors. Lymphoma patients were identified from this data set using ICD-O, Third Edition codes as described previously.
<xref rid="b33-cin-08-45" ref-type="bibr">33</xref>
Cases of lymphoma were identified during this time frame and classified into histological diagnosis categories using the WHO system shown in
<xref ref-type="table" rid="t1-cin-08-45">Table 1</xref>
. One or more ICD-O codes maps to each histology type.
<xref rid="b33-cin-08-45" ref-type="bibr">33</xref>
</p>
<p>Eighty–two fields from the SEER dataset were incorporated into LEAD™. Tab-delimited data were translated into clinical concepts using the SEER Data Dictionary (
<ext-link ext-link-type="uri" xlink:href="http://seer.cancer.gov/manuals/CD2_popdic.html">http://seer.cancer.gov/manuals/CD2_popdic.html</ext-link>
) and mapped to LEAD™ data elements.
<xref ref-type="fig" rid="f4-cin-08-45">Figure 4</xref>
shows the relationships between SEER and LEAD™ data elements. The common representation of entities and classes between SEER data and other sources (
<xref ref-type="fig" rid="f4-cin-08-45">Fig. 4</xref>
) and the relations between entities and classes across data sources permit data sharing and analysis of data elements with a common meaning. The architecture described permits comparative analyses of institutional and national datasets that address common clinical problems. We have used the representation schema and semantic integration from LEAD™ to investigate the incidence and outcomes for peripheral T-cell lymphoma.
<xref rid="b34-cin-08-45" ref-type="bibr">34</xref>
In this instance, LEAD™ provided a common user interface for researchers to examine data regardless of its original source or representation.</p>
</sec>
<sec sec-type="methods">
<title>Integration of Phase 1 clinical trials data</title>
<p>FL is the second most frequent lymphoma subtype worldwide with a rapidly increasing incidence in the Western world. The majority of patients with FL present with advanced disease. For these patients, there is no standard treatment and the clinical course is characterized by a pattern of multiple relapses and remissions and a median survival of 6.2 years.
<xref rid="b33-cin-08-45" ref-type="bibr">33</xref>
We have developed an early phase clinical trial investigating a novel combination chemotherapy regimen (bortezomib, rituximab, cyclophosphamide, adriamycin, vincristine, and prednisone) designed to improve outcomes for patients with FL. The primary objectives of this study are:
<list list-type="bullet">
<list-item>
<p>To identify the maximal tolerated doses of bortezomib and vincristine when used in this combination</p>
</list-item>
<list-item>
<p>To estimate the complete response rate associated with this regimen</p>
</list-item>
</list>
</p>
<p>
<xref ref-type="table" rid="t6-cin-08-45">Table 6</xref>
shows the schedule for data collections for this lymphoma clinical trial.
<xref ref-type="fig" rid="f6-cin-08-45">Figure 6</xref>
shows an example graphical user interface for entering clinical trial data into LEAD™. Data collected during the course of the trial are mapped to LEAD™ classes and entities as shown in
<xref ref-type="fig" rid="f4-cin-08-45">Figure 4</xref>
. Again, since LEAD™ relies on the caBIG™ architecture for semantic integration, patients with a histological diagnosis of FL within the clinical trial can be matched to patients who share the same histology in SEER or the Emory lymphoma cohort study. Moreover, since the latter dataset contains patients who overlap with those treated on the clinical trial, data elements that are common to the two studies may be stored once with a common representation and those unique to each study are stored as well with the relationships between the data elements made explicit as shown in
<xref ref-type="fig" rid="f4-cin-08-45">Figure 4</xref>
.</p>
</sec>
<sec>
<title>Semantic queries using LEAD™</title>
<p>Viewing caBIG™ semantic metadata as formal standard ontology, querying can be defined as the search of the members of classes from multiple data sources.
<xref rid="b35-cin-08-45" ref-type="bibr">35</xref>
The most important types of queries can be categorized as those that employ joins and merges. If the data services are built upon semantically rich metadata, individual members of different classes ought to be related to each other. Consider for example an individual member of class emory: OnStudy, and an individual of class emory: AdverseEvent, whenever a patient is registered in the clinical trial, this patient has a member of class emory: OnStudy. A query such as the one that seeks to retrieve information about adverse events and event details for a patient, then returns the join on the object property OnStudy has AdverseEvent in class emory: OnStudy and the object property AdverseEvent has AdverseEvent Detail in class emory: AdverseEvent. The strength of caBIG™ semantics in facilitating data integration is obvious, as the join conditions are directly specified by the class properties from the data sources.</p>
<p>The LEAD™ model has been registered in caDSR and can be searched by NCI UML model browser (
<ext-link ext-link-type="uri" xlink:href="http://umlmodelbrowser.nci.nih.gov/umlmodelbrowser/">http://umlmodelbrowser.nci.nih.gov/umlmodelbrowser/</ext-link>
). This model can be deployed by another institution to collect and store its lymphoma clinical data. Data stored in this manner will facilitate data integration and could thus promote the conduct of inter-institutional clinical trials and epidemiology studies. Moreover, the model browser permits queries of the model structure and CDEs. An example is querying the adverse events CDEs contained in LEAD™. The execution of this query would return every individual member of AdverseEvent based on the same conditions from these two data sources. It is worth noting that it is necessary to determine one or more data type properties of AdverseEvent, i.e. CDEs which have the class AdverseEvent as its subject that is shared by both subclasses. The researcher who adopts the current LEAD™ UML model can add more attributes or modify the existing attributes in the classes. Readers who are interested in query formulation techniques on semantic data can further reference papers by Shironoshita et al. and Baer et al.
<xref rid="b35-cin-08-45" ref-type="bibr">35</xref>
,
<xref rid="b36-cin-08-45" ref-type="bibr">36</xref>
</p>
</sec>
<sec>
<title>Investigational review board approval and controlling access</title>
<p>We obtained Investigational Review Board (IRB) approval for storing and maintaining patient-related data for cancer patients, and an IRB approved process has been developed for incorporating new patient data into our lymphoma database system. This database will use the same procedures for access control that is used for other Emory confidential databases. The research database is protected by the standard Emory Firewall. Dr. Flowers reviews any application for passwords and userIDs from researchers. The Informatics Project Manager (TH) assists investigators in applying for the appropriate level of access, in accordance with the researcher’s individual protocol.</p>
</sec>
</sec>
<sec sec-type="discussion">
<title>Discussion</title>
<p>In cancer research, we need to link clinical outcomes and tissue based research data to support the discovery of correlations between molecular studies and prognostic and treatment response profiles. Some of the challenges we face in cancer research include, but not limited to, data sharing, data complexity, and organizational complexity. We also need to use shared languages, such as XML, RDF, or XOL
<xref rid="b37-cin-08-45" ref-type="bibr">37</xref>
for reporting. Clinical trials are run over many sites, and data is held by multiple stakeholders and researchers. Thus we need to adopt semantic web technologies, including RDF and OWL to share and integrate data models, which enable us to report results and phenomena with shared languages. Moreover, we should and make it possible to interpret models with more complete and semantically integrated data. The emergence of grid technologies facilitates the collaboration of different centers and allows clinical trials to be scattered in different sites. However, to be meaningful grid technologies must integrate data from multiple stakeholders semantically. Successful semantic integration involves use of controlled vocabularies and structured, standard-based metadata to unambiguously describe diverse data sets.</p>
<sec>
<title>Significance</title>
<p>This paper successfully demonstrates the application of caCORE SDK in lymphoma research and this methodology serves as a model for adoption into other cancer research domains, such as lung cancer,
<xref rid="b32-cin-08-45" ref-type="bibr">32</xref>
prostate cancer, and breast cancer. The tool sets from the NCI facilitate rapid data modeling, data sharing and exchange, and comprehensive question answering. Hence the tools can expedite drug discovery, cancer detection, and improve disease diagnosis and treatment.</p>
</sec>
<sec>
<title>Limitations and future work</title>
<p>While there has been substantial progress within caBIG™, and in particular in the implementation of caGrid, there is still much work in progress. Although the caCORE SDK provides a rich set of tools to rapidly develop the silver-level compatible system, there is no robust search tool that allows biomedical researchers easily to search the underlying data. Programming effort is still required to extract data from the databases. DBsurfer
<xref rid="b38-cin-08-45" ref-type="bibr">38</xref>
is one of such tools that may allow complex queries of semantically integrated databases, but open source tools are needed in this area.</p>
<p>Another issue addressed by the LEAD™ system is the integration of data across numerous legacy systems: Health Quest, IDX, IMPAC Medical cancer registry, Cerner PowerChart electronic medical records, OnCore clinical trials data, and other databases. Other valuable data are contained in different systems, such as data in spreadsheets on personal computers, patient data in electronic systems such as The Emory Electronic Medical Record, and various other cancer database systems that are outside caGrid framework. In the future, caCORE SDK must provide tools for integrating data from legacy systems coded in various formats and performing data quality assurance evaluations. Our future work will concentrate on the provision of grid-enabled access to our lymphoma cancer data and web services within caGrid, through the public APIs, XML, SOAP to allow other data holders to exchange data without intimate knowledge of each other’s system implementations.</p>
</sec>
</sec>
<sec>
<title>Conclusion</title>
<p>Lymphomas are a heterogeneous group of cancers that require the development of focused research and clinical approaches for specific histological subtypes. Integrating existing clinical, genomic and proteomic information provides a platform for examining biological variability in the pathogenesis of lymphomas and their responses to treatment response and will promote the development of innovative treatment strategies. To expedite the development of novel therapeutics for lymphoma, our oncology informatics group developed a caBIG™ silver-level compliant information system that incorporates clinical and biological data into a semantically integrated structure, LEAD™, that supports information sharing between cancer research scientists and clinicians. The LEAD™ system links cancer registry, pathology, clinical, administrative, pharmacy, and clinical trials data with biological data elements at the patient–level. We demonstrated that the same data elements and structures in LEAD™ could be used for institutional cohort studies linking data across numerous legacy systems,
<xref rid="b4-cin-08-45" ref-type="bibr">4</xref>
early phase lymphoma clinical trials data collection, and national SEER registry data on lymphoid malignancies. This caCORE SDK generated system built upon an n–tier architecture with open APIs, utilizes controlled vocabularies and registered metadata to achieve semantic integration. For LEAD™ and other caBIG™ compliant systems the NCI’s EVS supplies the controlled vocabularies, and the caDSR provides dynamic metadata registry. Data from LEAD™ can be combined and reused by systems, programmers, and investigators in the broader cancer community. We have demonstrated that reusable data structures can be applied to a number of research settings and provided examples from institutional retrospective epidemiological studies, national cancer registry data queries, and clinical trials. LEAD™ was populated with data from three ongoing research studies 1) cohort studies of NHL patients previously treated at Emory University, 2) SEER registry data for lymphoma, and 3) case report forms data from phase 1 clinical trials. Source data for each of these research scenarios originated in a variety of formats and, once stored within LEAD™, can be shared and reused based on the standards of the caBIG™ architecture. We have shown how semantic technologies from caBIG™ were applied to support a wide range of clinical and research tasks, and our work serves to illustrate the central importance of caBIG™ to the management of clinical and biological data in cancer research. The infrastructure and tools created by caBIG™ can also benefit researchers outside the cancer community.</p>
</sec>
</body>
<back>
<ack>
<p>This work was supported by grant funding from Dr. Flowers’ Georgia Cancer Coalition Distinguished Scholar Award, The Leukemia and Lymphoma Society’s Translational Research grant, and Amos Medical Faculty Development Award from the American Society of Hematology/Robert Woods Johnson Foundation.</p>
</ack>
<fn-group>
<fn>
<p>
<bold>Disclosure</bold>
</p>
<p>The authors report no conflicts of interest.</p>
</fn>
<fn>
<p>
<bold>Copyright in this article, its metadata, and any supplementary data is held by its author or authors. It is published under the Creative Commons Attribution By licence. For further information go to:
<ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/3.0/">http://creativecommons.org/licenses/by/3.0/</ext-link>
.</bold>
</p>
</fn>
</fn-group>
<ref-list>
<title>References</title>
<ref id="b1-cin-08-45">
<label>1.</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhong</surname>
<given-names>Y</given-names>
</name>
</person-group>
<article-title>Non–Hodgkin Lymphoma: What Primary Care Professionals Need to Know</article-title>
<source>The Journal for Nurse Practitioners</source>
<year>2006</year>
<volume>2</volume>
<fpage>309</fpage>
<lpage>338</lpage>
</element-citation>
</ref>
<ref id="b2-cin-08-45">
<label>2.</label>
<element-citation publication-type="webpage">
<person-group person-group-type="author">
<name>
<surname>Jemal</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Siegel</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Ward</surname>
<given-names>EM</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Cancer Facts and Figures</article-title>
<comment>Accessed June 10, 2008.
<ext-link ext-link-type="uri" xlink:href="http://www.cancer.org/downloads/STT/2008CAFFfinalsecured.pdf">http://www.cancer.org/downloads/STT/2008CAFFfinalsecured.pdf</ext-link>
</comment>
<year>2008</year>
</element-citation>
</ref>
<ref id="b3-cin-08-45">
<label>3.</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ottensmeier</surname>
<given-names>C</given-names>
</name>
</person-group>
<article-title>The classification of lymphomas and leukemias</article-title>
<source>Chem Biol Interact</source>
<year>2001</year>
<fpage>135</fpage>
<lpage>136</lpage>
<fpage>653</fpage>
<lpage>664</lpage>
</element-citation>
</ref>
<ref id="b4-cin-08-45">
<label>4.</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Graiser</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Moore</surname>
<given-names>SG</given-names>
</name>
<name>
<surname>Victor</surname>
<given-names>R</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Development of Query Strategies to Identify a Histologic Lymphoma Subtype in a Large Linked Database System</article-title>
<source>Cancer Inform</source>
<year>2007</year>
<volume>2</volume>
<fpage>149</fpage>
<lpage>158</lpage>
<pub-id pub-id-type="pmid">19455241</pub-id>
</element-citation>
</ref>
<ref id="b5-cin-08-45">
<label>5.</label>
<element-citation publication-type="book">
<person-group person-group-type="author">
<collab>National Cancer Institute</collab>
</person-group>
<article-title>The Cancer Biomedical Informatics Grid (caBIG)™</article-title>
<comment>URL:
<ext-link ext-link-type="uri" xlink:href="https://cabig.nci.nih.gov/">https://cabig.nci.nih.gov/</ext-link>
. Accessed 18 March 2008.</comment>
</element-citation>
</ref>
<ref id="b6-cin-08-45">
<label>6.</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Buetow</surname>
<given-names>KH</given-names>
</name>
</person-group>
<article-title>Cyberinfrastructure: Empowering a “Third Way” in Biomedical Research</article-title>
<source>Science</source>
<year>2005</year>
<volume>308</volume>
<fpage>821</fpage>
<lpage>4</lpage>
<pub-id pub-id-type="pmid">15879210</pub-id>
</element-citation>
</ref>
<ref id="b7-cin-08-45">
<label>7.</label>
<element-citation publication-type="book">
<person-group person-group-type="author">
<name>
<surname>Covitz</surname>
<given-names>P</given-names>
</name>
</person-group>
<article-title>caBIG™ Compatibility Guidelines: Revision 2</article-title>
<comment>Accessed 18 March 2008. URL:
<ext-link ext-link-type="uri" xlink:href="https://cabig.nci.nih.gov/guidelines_documentation/caBIGCompatGuideRev2_final.pdf">https://cabig.nci.nih.gov/guidelines_documentation/caBIGCompatGuideRev2_final.pdf</ext-link>
</comment>
<year>2005</year>
</element-citation>
</ref>
<ref id="b8-cin-08-45">
<label>8.</label>
<element-citation publication-type="book">
<person-group person-group-type="author">
<name>
<surname>Deering</surname>
<given-names>MJ</given-names>
</name>
<name>
<surname>Derr</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Hadfield</surname>
<given-names>J</given-names>
</name>
<etal></etal>
</person-group>
<article-title>caBIG™ Primer: An Introduction to caBIG™</article-title>
<comment>Accessed 18 March 2008. URL:
<ext-link ext-link-type="uri" xlink:href="https://cabig.nci.nihgov/overview/cabig-primer">https://cabig.nci.nihgov/overview/cabig-primer</ext-link>
</comment>
<year>2006</year>
</element-citation>
</ref>
<ref id="b9-cin-08-45">
<label>9.</label>
<element-citation publication-type="webpage">
<person-group person-group-type="author">
<collab>National Cancer Institute Center for Bioinformatics</collab>
</person-group>
<article-title>caCORE Software Development Kit (SDK)</article-title>
<comment>URL:
<ext-link ext-link-type="uri" xlink:href="http://ncicb.nci.nih.gov/NCICB/infrastructure/cacoresdk">http://ncicb.nci.nih.gov/NCICB/infrastructure/cacoresdk</ext-link>
. Accessed 18 March 2008</comment>
</element-citation>
</ref>
<ref id="b10-cin-08-45">
<label>10.</label>
<element-citation publication-type="book">
<person-group person-group-type="author">
<collab>National Cancer Institute</collab>
</person-group>
<article-title>caCORE Software Development Kit 3.2: Programmer’s Guide</article-title>
<comment>Accessed April 01, 2008.
<ext-link ext-link-type="ftp" xlink:href="ftp://ftp1nci.nih.gov/pub/cacore/SDK/v3.2/caCORE_SDK_32_Programmers_Guide.pdf">ftp://ftp1nci.nih.gov/pub/cacore/SDK/v3.2/caCORE_SDK_32_Programmers_Guide.pdf</ext-link>
</comment>
<year>2007</year>
</element-citation>
</ref>
<ref id="b11-cin-08-45">
<label>11.</label>
<element-citation publication-type="book">
<person-group person-group-type="author">
<collab>World Health Organization</collab>
</person-group>
<source>International Statistical Classification of Diseases and Related Health Problems</source>
<edition>10th Revision</edition>
<publisher-loc>Geneva, Switzerland</publisher-loc>
<publisher-name>World Health Organization</publisher-name>
<year>2005</year>
</element-citation>
</ref>
<ref id="b12-cin-08-45">
<label>12.</label>
<element-citation publication-type="book">
<person-group person-group-type="editor">
<collab>World Health Organization</collab>
</person-group>
<source>ICD-O International Classification of Diseases for Oncology</source>
<publisher-loc>Geneva, Switzerland</publisher-loc>
<publisher-name>World Health Organization</publisher-name>
<year>1976</year>
</element-citation>
</ref>
<ref id="b13-cin-08-45">
<label>13.</label>
<element-citation publication-type="book">
<person-group person-group-type="author">
<name>
<surname>Jaffe</surname>
<given-names>ES</given-names>
</name>
<name>
<surname>Harris</surname>
<given-names>NL</given-names>
</name>
<name>
<surname>Stein</surname>
<given-names>H</given-names>
</name>
<etal></etal>
</person-group>
<source>World Health Organization Classification of Tumours. Pathology and Genetics of Tumours of Haematopoietic and Lymphoid Tissues</source>
<publisher-loc>Lyon</publisher-loc>
<publisher-name>IARC Press</publisher-name>
<year>2001</year>
</element-citation>
</ref>
<ref id="b14-cin-08-45">
<label>14.</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Harris</surname>
<given-names>NL</given-names>
</name>
<name>
<surname>Jaffe</surname>
<given-names>ES</given-names>
</name>
<name>
<surname>Stein</surname>
<given-names>H</given-names>
</name>
<etal></etal>
</person-group>
<article-title>A revised European–American classification of lymphoid neoplasms: a proposal from the International Lymphoma Study Group</article-title>
<source>Blood</source>
<year>1994</year>
<volume>84</volume>
<fpage>1361</fpage>
<lpage>92</lpage>
<pub-id pub-id-type="pmid">8068936</pub-id>
</element-citation>
</ref>
<ref id="b15-cin-08-45">
<label>15.</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Covitz</surname>
<given-names>PA</given-names>
</name>
<name>
<surname>Hartel</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Schaefer</surname>
<given-names>C</given-names>
</name>
<etal></etal>
</person-group>
<article-title>caCORE: a common infrastructure for cancer informatics</article-title>
<source>Bioinformatics</source>
<year>2003</year>
<volume>19</volume>
<fpage>2404</fpage>
<lpage>12</lpage>
<pub-id pub-id-type="pmid">14668224</pub-id>
</element-citation>
</ref>
<ref id="b16-cin-08-45">
<label>16.</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tobias</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Chilukuri</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Komatsoulis</surname>
<given-names>G</given-names>
</name>
<etal></etal>
</person-group>
<article-title>The CAP cancer protocols—a case study of caCORE based data standards implementation to integrate with the Cancer Biomedical Informatics Grid</article-title>
<source>BMC Med Inform Decis Mak</source>
<year>2006</year>
<volume>6</volume>
<fpage>25</fpage>
<pub-id pub-id-type="pmid">16787533</pub-id>
</element-citation>
</ref>
<ref id="b17-cin-08-45">
<label>17.</label>
<element-citation publication-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Buck</surname>
<given-names>K</given-names>
</name>
<etal></etal>
</person-group>
<conf-name>In caBIG Annual Meeting</conf-name>
<conf-loc>Washington DC</conf-loc>
<year>2007b</year>
</element-citation>
</ref>
<ref id="b18-cin-08-45">
<label>18.</label>
<element-citation publication-type="book">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Buck</surname>
<given-names>K</given-names>
</name>
<etal></etal>
</person-group>
<source>In Bioinformatics Research and Development (BIRD)</source>
<publisher-loc>Berlin, Germany</publisher-loc>
<publisher-name>Bioinformatics Research and Development</publisher-name>
<year>2007a</year>
</element-citation>
</ref>
<ref id="b19-cin-08-45">
<label>19.</label>
<element-citation publication-type="webpage">
<person-group person-group-type="author">
<collab>National Cancer Institute-Center for Bioinformatics</collab>
</person-group>
<article-title>Cancer Data Standards Registry and Repository (caDSR)—UML Model Browser</article-title>
<comment>URL:
<ext-link ext-link-type="uri" xlink:href="http://umlmodelbrowser.nci.nih.gov/umlmodelbrowser/">http://umlmodelbrowser.nci.nih.gov/umlmodelbrowser/</ext-link>
. Accessed 16 December 2008</comment>
</element-citation>
</ref>
<ref id="b20-cin-08-45">
<label>20.</label>
<element-citation publication-type="book">
<person-group person-group-type="author">
<collab>Holden Comprehensive Cancer Center</collab>
</person-group>
<source>NCI Specialized Program of Research Excellence (Lymphoma SPORE)</source>
<publisher-name>University of Iowa</publisher-name>
<comment>22 August 2008. URL:
<ext-link ext-link-type="uri" xlink:href="http://www.uihealthcare.com/depts/cancer-center/research/ncispore.html">http://www.uihealthcare.com/depts/cancer-center/research/ncispore.html</ext-link>
. Accessed 15 December 2008</comment>
</element-citation>
</ref>
<ref id="b21-cin-08-45">
<label>21.</label>
<element-citation publication-type="book">
<person-group person-group-type="author">
<collab>Lymphoma Foundation of America</collab>
</person-group>
<source>Research</source>
<publisher-name>Lymphoma Foundation of America</publisher-name>
<comment>URL:
<ext-link ext-link-type="uri" xlink:href="http://www.lymphomahelp.org/research.php">http://www.lymphomahelp.org/research.php</ext-link>
. Accessed 15 December, 2008</comment>
</element-citation>
</ref>
<ref id="b22-cin-08-45">
<label>22.</label>
<mixed-citation publication-type="webpage">
<comment>Surveillance Epidemiology and End Results (SEER) Program (2008, based on the November 2007 submission) SEER*Stat Database: Incidence—SEER 13 Regs + AK Public–Use, Nov 2007 Sub 1992–2005 —Linked to County Attributes—Total U.S., 1969–2005 Counties. National Cancer Institute, DCCPS, Surveillance Research Program, Cancer Statistics Branch</comment>
</mixed-citation>
</ref>
<ref id="b23-cin-08-45">
<label>23.</label>
<element-citation publication-type="webpage">
<person-group person-group-type="author">
<collab>Commission on Cancer, American Cancer Society</collab>
</person-group>
<article-title>National Cancer Data Base (NCDB)</article-title>
<source>American College of Surgeons</source>
<comment>
<ext-link ext-link-type="uri" xlink:href="http://www.facs.org/cancer/ncdb/index.html">http://www.facs.org/cancer/ncdb/index.html</ext-link>
. Accessed 15 December, 2008.</comment>
</element-citation>
</ref>
<ref id="b24-cin-08-45">
<label>24.</label>
<element-citation publication-type="book">
<person-group person-group-type="author">
<name>
<surname>Patel</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Dumitru</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Griffin</surname>
<given-names>C</given-names>
</name>
<etal></etal>
</person-group>
<article-title>caCORE SDK 4.0 Developer’s Guide</article-title>
<comment>Accessed 13 August 2008. URL:
<ext-link ext-link-type="uri" xlink:href="https://gforge.nci.nih.gov/docman/view.php/148/8650/caCORE%20SDK%204.0%20Developer's%20Guide_101007.pdf">https://gforge.nci.nih.gov/docman/view.php/148/8650/caCORE%20SDK%204.0%20Developer's%20Guide_101007.pdf</ext-link>
</comment>
<year>2007</year>
</element-citation>
</ref>
<ref id="b25-cin-08-45">
<label>25.</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Phillips</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Chilukuri</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Fragoso</surname>
<given-names>G</given-names>
</name>
<etal></etal>
</person-group>
<article-title>The caCORE Software Development Kit: Streamlining construction of interoperable biomedical information services</article-title>
<source>BMC Med Inform Decis Mak</source>
<year>2006</year>
<volume>6</volume>
<fpage>2</fpage>
<pub-id pub-id-type="pmid">16398930</pub-id>
</element-citation>
</ref>
<ref id="b26-cin-08-45">
<label>26.</label>
<element-citation publication-type="webpage">
<person-group person-group-type="author">
<collab>PercipEnz Technologies Inc</collab>
</person-group>
<article-title>In Preparation of caBIG™ Silver Compatibility Certification, Oncore® is Modularized</article-title>
<source>PercipEnz Technologies, Inc</source>
<comment>URL:
<ext-link ext-link-type="uri" xlink:href="http://www.percipenz.com/news.html#oncore">http://www.percipenz.com/news.html#oncore</ext-link>
. Accessed 17 Sept 2008</comment>
</element-citation>
</ref>
<ref id="b27-cin-08-45">
<label>27.</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Berners-Lee</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Hendler</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Lassila</surname>
<given-names>O</given-names>
</name>
</person-group>
<article-title>The Semantic Web</article-title>
<source>Sci Am</source>
<year>2001</year>
<volume>284</volume>
<fpage>34</fpage>
<pub-id pub-id-type="pmid">11396337</pub-id>
</element-citation>
</ref>
<ref id="b28-cin-08-45">
<label>28.</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Gorlitsky</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Almeida</surname>
<given-names>JS</given-names>
</name>
</person-group>
<article-title>From XML to RDF: how semantic web technologies will change the design of ‘omic’ standards</article-title>
<source>Nat Biotechnol</source>
<year>2005</year>
<volume>23</volume>
<fpage>1099</fpage>
<lpage>103</lpage>
<pub-id pub-id-type="pmid">16151403</pub-id>
</element-citation>
</ref>
<ref id="b29-cin-08-45">
<label>29.</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Stevens</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Bodenreider</surname>
<given-names>O</given-names>
</name>
<name>
<surname>Lussier</surname>
<given-names>YA</given-names>
</name>
</person-group>
<article-title>In Pacific Symposium on Biocomputing</article-title>
<source>World Scientific Publishing Co Pte Ltd</source>
<year>2006</year>
</element-citation>
</ref>
<ref id="b30-cin-08-45">
<label>30.</label>
<element-citation publication-type="book">
<person-group person-group-type="author">
<name>
<surname>Fritz</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Percy</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Jack</surname>
<given-names>A</given-names>
</name>
<etal></etal>
</person-group>
<source>ICD-O International Classification of Diseases for Oncology—Third Edition</source>
<publisher-loc>Geneva, Switzerland</publisher-loc>
<publisher-name>World Health Organization</publisher-name>
<year>2000</year>
</element-citation>
</ref>
<ref id="b31-cin-08-45">
<label>31.</label>
<element-citation publication-type="book">
<person-group person-group-type="author">
<name>
<surname>Jaffe</surname>
<given-names>ES</given-names>
</name>
<name>
<surname>Harris</surname>
<given-names>NL</given-names>
</name>
<name>
<surname>Stein</surname>
<given-names>H</given-names>
</name>
<etal></etal>
</person-group>
<source>Pathology and Genetics of Tumours of Hematopoietic and Lymphoid Tissues</source>
<publisher-loc>Lyon</publisher-loc>
<publisher-name>Iarc Press</publisher-name>
<year>2001</year>
</element-citation>
</ref>
<ref id="b32-cin-08-45">
<label>32.</label>
<element-citation publication-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Buck</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>H</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Lung Cancer Clinical Database Application System</article-title>
<conf-name>The caBIG™ Annual Meeting</conf-name>
<conf-loc>Washington, D.C</conf-loc>
<year>2007</year>
</element-citation>
</ref>
<ref id="b33-cin-08-45">
<label>33.</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Morton</surname>
<given-names>LM</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>SS</given-names>
</name>
<name>
<surname>Devesa</surname>
<given-names>SS</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Lymphoma incidence patterns by WHO subtype in the United States, 1992–2001</article-title>
<source>Blood</source>
<year>2006</year>
<volume>107</volume>
<fpage>265</fpage>
<lpage>276</lpage>
<pub-id pub-id-type="pmid">16150940</pub-id>
</element-citation>
</ref>
<ref id="b34-cin-08-45">
<label>34.</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>AbouYabis</surname>
<given-names>AN</given-names>
</name>
<name>
<surname>Shenoy</surname>
<given-names>PJ</given-names>
</name>
<name>
<surname>Lechowicz</surname>
<given-names>MJ</given-names>
</name>
<etal></etal>
</person-group>
<article-title>Incidence and outcomes of the peripheral T-cell lymphoma subtypes in the United States</article-title>
<source>Leuk Lymphoma</source>
<year>2008</year>
<volume>49</volume>
<fpage>2099</fpage>
<lpage>107</lpage>
<pub-id pub-id-type="pmid">19021052</pub-id>
</element-citation>
</ref>
<ref id="b35-cin-08-45">
<label>35.</label>
<element-citation publication-type="other">
<person-group person-group-type="author">
<name>
<surname>Baer</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Groenewoud</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Kapetanios</surname>
<given-names>E</given-names>
</name>
<etal></etal>
</person-group>
<article-title>A Semantics Based Interactive Query Formulation Technique. Second International Workshop on User Interfaces to Data Intensive Systems (UIDIS’01)</article-title>
<year>2001</year>
</element-citation>
</ref>
<ref id="b36-cin-08-45">
<label>36.</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Shironoshita</surname>
<given-names>EP</given-names>
</name>
<name>
<surname>Jean–Mary</surname>
<given-names>YR</given-names>
</name>
<name>
<surname>Bradley</surname>
<given-names>RM</given-names>
</name>
<etal></etal>
</person-group>
<article-title>semCDI: a query formulation for semantic data integration in caBIG</article-title>
<source>J Am Med Inform Assoc</source>
<year>2008</year>
<volume>15</volume>
<fpage>559</fpage>
<lpage>68</lpage>
<pub-id pub-id-type="pmid">18436897</pub-id>
</element-citation>
</ref>
<ref id="b37-cin-08-45">
<label>37.</label>
<element-citation publication-type="other">
<person-group person-group-type="author">
<name>
<surname>Karp</surname>
<given-names>PD</given-names>
</name>
<name>
<surname>Chaudhri</surname>
<given-names>VK</given-names>
</name>
<name>
<surname>Thomere</surname>
<given-names>J</given-names>
</name>
</person-group>
<article-title>XOL: An XML–Based Ontology Exchange Language</article-title>
<comment>Accessed 15 December 2008. URL:
<ext-link ext-link-type="uri" xlink:href="www.ai.sri.com/pubs/files/676.pdf">www.ai.sri.com/pubs/files/676.pdf</ext-link>
</comment>
<year>2000</year>
</element-citation>
</ref>
<ref id="b38-cin-08-45">
<label>38.</label>
<element-citation publication-type="webpage">
<person-group person-group-type="author">
<collab>IDX Datasystems Ltd. DBsurfer/WEBserver</collab>
</person-group>
<source>IDX Datasystems Ltd.</source>
<comment>URL:
<ext-link ext-link-type="uri" xlink:href="http://www.dbsurfer.com/">http://www.dbsurfer.com/</ext-link>
. Accessed 15 December 2008.</comment>
</element-citation>
</ref>
</ref-list>
</back>
<floats-group>
<fig id="f1-cin-08-45" position="float">
<label>Figure 1.</label>
<caption>
<p>
<bold>Layers of semantic interoperability in caBIG</bold>
™. Semantic interoperability lies in UML model, use of publicly accessible Terminologies/vocabularies/ontologies (EVS–NCI Thesaurus) and use of publicly accessible metadata repository (caDSR).</p>
</caption>
<graphic xlink:href="CIN-08-45-g001"></graphic>
</fig>
<fig id="f2-cin-08-45" position="float">
<label>Figure 2.</label>
<caption>
<p>
<bold>The caCORE workflow.</bold>
This figure describes the steps involved in creating a silver level compliant system. A UML object model is the input into the workflow. The model is exported from the format native to the tool it was developed into the standard XMI representation. The XMI file is then annotated with terminology services. Once the annotated XMI is reviewed and approved, it is used as input to generate code and public APIs, and it is deposited into production caDSR.</p>
</caption>
<graphic xlink:href="CIN-08-45-g002"></graphic>
</fig>
<fig id="f3-cin-08-45" position="float">
<label>Figure 3.</label>
<caption>
<p>
<bold>Logic Model for lymphoma clinical database developed using Enterprise Architecture.</bold>
This figure demonstrates the relationships between the key data elements in LEAD™. Components within each key element are not represented in this figure due to practical constraints of resolution and size.</p>
</caption>
<graphic xlink:href="CIN-08-45-g003"></graphic>
</fig>
<fig id="f4-cin-08-45" position="float">
<label>Figure 4.</label>
<caption>
<p>
<bold>Relations between entities and classes from all data sources.</bold>
This figure depicts the color–coded sources of various key data elements of LEAD™. The codes for colors are shown in the legend.</p>
<p>
<bold>Abbreviations:</bold>
ELCD, Emory Lymphoma Clinical Data; SEER, Surveillance, Epidemiology, and End Result; CT, Emory University Lymphoma Clinical Trials data.</p>
</caption>
<graphic xlink:href="CIN-08-45-g004"></graphic>
</fig>
<fig id="f5-cin-08-45" position="float">
<label>Figure 5.</label>
<caption>
<p>
<bold>Architecture for LEAD™.</bold>
This figure describes the architecture of the Lymphoma Enterprise Architecture Data–system. It contains the presentation tier, business tier and data source tier. The web server passes the requests from web browsers and transfers them to the application server which then accesses the backend database and generates the required content dynamically and sends the response back to the web browser through the web server. Outside community accesses the data through the provided programmatic API.</p>
</caption>
<graphic xlink:href="CIN-08-45-g005"></graphic>
</fig>
<fig id="f6-cin-08-45" position="float">
<label>Figure 6.</label>
<caption>
<p>
<bold>Graphical user interface for entering adverse event data into LEAD™.</bold>
This figure is a sample screen shot of the graphical user interface for entering clinical trial data into LEAD™.</p>
</caption>
<graphic xlink:href="CIN-08-45-g006"></graphic>
</fig>
<table-wrap id="t1-cin-08-45" position="float">
<label>Table 1.</label>
<caption>
<p>World Health Organization classification for lymphomas.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" colspan="2" rowspan="1">Non Hodgkin Lymphomas (NHL)
<hr></hr>
</th>
</tr>
<tr>
<th align="left" rowspan="1" colspan="1">B-cell Neoplasm, NOS</th>
<th align="left" rowspan="1" colspan="1">T-cell and NK-cell Neoplasm, NOS</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" rowspan="1" colspan="1">
<bold>Precursor B-cell Neoplasms</bold>
</td>
<td align="left" rowspan="1" colspan="1">
<bold>Precursor T-cell Neoplasm</bold>
</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">– Precursor B lymphoblastic leukemia (9835/3)</td>
<td align="left" rowspan="1" colspan="1">– Precursor T lymphoblastic leukemia (9837/3)</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">– Precursor B lymphoblastic lymphoma (9728/3)</td>
<td align="left" rowspan="1" colspan="1">– Precursor T lymphoblastic lymphoma (9729/3)</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1"></td>
<td align="left" rowspan="1" colspan="1">– Blastic NK cell lymphoma (9727/3)</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">
<bold>Peripheral (mature) B-cell Neoplasms</bold>
</td>
<td align="left" rowspan="1" colspan="1">
<bold>Peripheral (mature) T-cell Neoplasms</bold>
</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">– Chronic lymphocytic leukemia (9823/3)</td>
<td align="left" rowspan="1" colspan="1">– T-cell prolymphocytic leukemia (9834/3)</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">– Small lymphocytic lymphoma (9670/3)</td>
<td align="left" rowspan="1" colspan="1">– T-cell large granular lymphocytic leukemia (9831/3)</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">– B-cell prolymphocytic leukemia (9833/3)</td>
<td align="left" rowspan="1" colspan="1">– Aggressive NK cell leukemia (9948/3)</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">– Lymphoplasmacytic lymphoma (9671/3)</td>
<td align="left" rowspan="1" colspan="1">– Adult T-cell leukemia/lymphoma (9827/3)</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">– Splenic marginal zone lymphoma (9689/3)</td>
<td align="left" rowspan="1" colspan="1">– Extranodal NK/T cell lymphoma, nasal type (9719/3)</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">– Hairy cell leukemia (9940/3)</td>
<td align="left" rowspan="1" colspan="1">– Enteropathy type T-cell lymphoma (9717/3)</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">– Plasmacytoma/Multiple myeloma (9732/3)</td>
<td align="left" rowspan="1" colspan="1">– Hepatosplenic T-cell lymphoma (9716/3)</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">– Solitary plasmacytoma of bone (9731/3)</td>
<td align="left" rowspan="1" colspan="1">– Subcutaneous panniculitis-like T-cell lymphoma (9708/3)</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">– Extraosseous plasmacytoma (9734/3)</td>
<td align="left" rowspan="1" colspan="1">– Mycosis fungoides (9700/3)</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">– Extranodal marginal zone B-cell lymphoma of mucosa-associated lymphoid tissue (MALT-lymphoma) (9699/3)</td>
<td align="left" rowspan="1" colspan="1">– Sezary Syndrome (9701/3)</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">– Nodal marginal zone B-cell lymphoma (9699/3)</td>
<td align="left" rowspan="1" colspan="1">– Primary cutaneous anaplastic large cell lymphoma (9718/3)</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">– Follicular lymphoma, NOS (9690/3)</td>
<td align="left" rowspan="1" colspan="1">– Peripheral T-cell lymphoma, unspecified (9702/3)</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">– Follicular lymphoma Grade 1 (9690/3)</td>
<td align="left" rowspan="1" colspan="1">– Angioimmunoblastic T-cell lymphoma (9705/3)</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">– Follicular lymphoma Grade 2 (9690/3)</td>
<td align="left" rowspan="1" colspan="1">– Anaplastic large cell lymphoma (9714/3)</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">– Follicular lymphoma Grade 3 (9690/3)</td>
<td align="left" rowspan="1" colspan="1">– Lymphomatoid papulosis (9718/1)</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">– Mantle cell lymphoma (9673/3)</td>
<td align="left" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">– Diffuse large B-cell lymphoma (9680/3)</td>
<td align="left" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">– Mediastinal (thymic) large cell lymphoma (9679/3)</td>
<td align="left" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">– Intravascular large B-cell lymphoma (9680/3)</td>
<td align="left" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">– Primary effusion lymphoma (9678/3)</td>
<td align="left" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">– Burkitt lymphoma (9687/3)</td>
<td align="left" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">– Burkitt lymphoma leukemia (9826/3)</td>
<td align="left" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">– B-cell proliferations of uncertain malignant potential</td>
<td align="left" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">– Lymphomatoid granulomatosis (9766/1)</td>
<td align="left" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">– Post-transplant lymphoproliferative disorder, pleomorphic (9970/1)</td>
<td align="left" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="center" colspan="2" rowspan="1">
<hr></hr>
<bold>Hodgkin Lymphoma, NOS</bold>
<hr></hr>
</td>
</tr>
<tr>
<td align="left" colspan="2" rowspan="1">– Nodular lymphocyte predominant Hodgkin lymphoma (9659/3)</td>
</tr>
<tr>
<td align="left" colspan="2" rowspan="1">– Classical Hodgkin lymphoma (9650/3)</td>
</tr>
<tr>
<td align="left" colspan="2" rowspan="1">– Nodular sclerosis classical Hodgkin lymphoma (9663/3)</td>
</tr>
<tr>
<td align="left" colspan="2" rowspan="1">– Lymphocyte-rich classical Hodgkin lymphoma (9651/3)</td>
</tr>
<tr>
<td align="left" colspan="2" rowspan="1">– Mixed cellularity classical Hodgkin lymphoma (9652/3)</td>
</tr>
<tr>
<td align="left" colspan="2" rowspan="1">Lymphocyte-depleted classical Hodgkin lymphoma (9653/3)</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn1-cin-08-45">
<p>
<bold>Source:</bold>
Jaffe et al.
<xref rid="b31-cin-08-45" ref-type="bibr">31</xref>
</p>
</fn>
</table-wrap-foot>
</table-wrap>
<table-wrap id="t2-cin-08-45" position="float">
<label>Table 2.</label>
<caption>
<p>Silver level compatibility guidelines.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" rowspan="1" colspan="1">Sections</th>
<th align="left" rowspan="1" colspan="1">Compatibility Requirements</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top" rowspan="1" colspan="1">Programming and Messaging Interfaces</td>
<td align="left" valign="top" rowspan="1" colspan="1">
<list list-type="bullet">
<list-item>
<p>Well–described API approved by the caBIG™ ARCHWS that provide access to data in the form of data objects that are instances of classes represented by a domain model.</p>
</list-item>
<list-item>
<p>Electronic data formats reviewed and approved by the caBIG ARCHWS that are supported for both input to and output from the system.</p>
</list-item>
<list-item>
<p>Messaging protocols approved by the caBIG™ ARCHWS that are supported wherever messaging is indicated by the use cases.</p>
</list-item>
</list>
</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="1" colspan="1">Vocabularies/Terminologies and Ontologies</td>
<td align="left" valign="top" rowspan="1" colspan="1">
<list list-type="bullet">
<list-item>
<p>Terminologies reviewed and validated by the caBIG™ VCDEWS that are used for all appropriate data collection fields and attributes of data objects.</p>
</list-item>
<list-item>
<p>Term definitions must meet VCDEWS workspace guidelines.</p>
</list-item>
</list>
</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="1" colspan="1">Data Elements</td>
<td align="left" valign="top" rowspan="1" colspan="1">
<list list-type="bullet">
<list-item>
<p>CDEs built from controlled terminologies and according to practices validated by the VCDEWS that are used throughout.</p>
</list-item>
<list-item>
<p>CDEs are registered as ISO/IEC 11179 metadata components in the caBIG™ Context of the caDSR.</p>
</list-item>
</list>
</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="1" colspan="1">Information Models</td>
<td align="left" valign="top" rowspan="1" colspan="1">
<list list-type="bullet">
<list-item>
<p>Object-oriented domain information models are expressed in UML as class diagrams and as XMI files, and are reviewed and validated by VCDEWS.</p>
</list-item>
</list>
</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn2-cin-08-45">
<p>
<bold>Abbreviations:</bold>
API, application programming interfaces; caBIG™, Cancer Biomedical Informatics Grid™; ARCHWS, Architecture workspace; VCDEWS, Vocabulary/Common Data Elements Workspace; CDE, Common Data Elements; ISO, International Organization for Standardization; IEC, International Electrotechnical Commission; caDSR, cancer Data Standards Repository; UML, Unified Modeling Language; XMI, XML Metadata Interchange.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<table-wrap id="t3-cin-08-45" position="float">
<label>Table 3.</label>
<caption>
<p>caBIG™ tools, infrastructure and data resources.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" rowspan="1" colspan="1">Category</th>
<th align="center" rowspan="1" colspan="1">Role</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" rowspan="6" valign="top" colspan="1">Tools</td>
<td align="left" valign="top" rowspan="1" colspan="1">caIntegrator: a novel translational informatics platform that allows researchers and bioinformaticians to access and analyze clinical and experimental data across multiple clinical trials and studies.</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="1" colspan="1">caBIO: a domain model and architecture used to model the rapidly-changing genomics and proteomics domain and to integrate data from numerous sources providing a holistic view of the human and mouse genomes.</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="1" colspan="1">caTissue Core: a web tissue bank repository tool for biospecimen inventory, tracking, and basic annotation</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="1" colspan="1">caArray: a web and programmatically accessible array data management system</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="1" colspan="1">caXchange: a lab integration hub for clinical trials</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="1" colspan="1">C3PR: a web patient registration system for clinical trials</td>
</tr>
<tr>
<td align="left" rowspan="8" valign="top" colspan="1">Infrastructure</td>
<td align="left" valign="top" rowspan="1" colspan="1">caGrid: enables universal mechanisms for providing interoperable programmatic access to data and analytics in caBIG™, creates a self-described infrastructure wherein the structure and semantics of data can be programmatically determined, and provides a powerful means by which services available in caBIG™ can be programmatically discovered and leveraged</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="1" colspan="1">BRIDG: provides a shared view of the dynamic and static semantics that collectively define a shared domain-of-interest</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="1" colspan="1">CTODS: provides a single, unified set of APIs that can access clinical data from multiple data sources</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="1" colspan="1">caBIO: facilitates the communication and integration of information from the various initiatives supported by caBIG™ and NCI</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="1" colspan="1">caCORE: helps streamline the informatics development and providing a common data management and application development framework</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="1" colspan="1">caDSR: stores and manages CDEs developed by caBIG™ participants and various NCI-sponsored organizations</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="1" colspan="1">EVS: produces the NCI Thesaurus, Metathesaurus and provides NCI with licenses for MedDRA, SNOMED, ICD-O-3, and other proprietary vocabularies</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="1" colspan="1">caCORE SDK: a set of tools that aid in the design and creation of a “caCORE-like” software system.</td>
</tr>
<tr>
<td align="left" rowspan="6" valign="top" colspan="1">Data Resources</td>
<td align="left" valign="top" rowspan="1" colspan="1">caArray: an open-source, web and programmatically accessible array data management system</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="1" colspan="1">caBIO: a biomedical data system built using a model-driven approach to develop objects, data models middleware, vocabularies, and ontologies for biomedical research.</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="1" colspan="1">Cancer Gene Data Curation Pilot: creates a database of associations between genes and diseases and genes and drug compounds derived from the biomedical literature.</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="1" colspan="1">caIntegrator: provides a mechanism for integrating and aggregating biomedical research data and access to a variety of data types</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="1" colspan="1">caMOD: provides information about animal models for human cancer to the public research community</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="1" colspan="1">Pathway Interaction Database: a highly structured, curated collection of information about known biomolecular interactions and key cellular processes assembled into signaling pathways</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn3-cin-08-45">
<p>
<bold>Source:</bold>
<ext-link ext-link-type="uri" xlink:href="https://cabig.nci.nih.gov/inventory/">https://cabig.nci.nih.gov/inventory/</ext-link>
.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<table-wrap id="t4-cin-08-45" position="float">
<label>Table 4.</label>
<caption>
<p>The key caCORE components.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" rowspan="1" colspan="1">Component</th>
<th align="center" rowspan="1" colspan="1">Description</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" rowspan="1" colspan="1">EVS</td>
<td align="left" rowspan="1" colspan="1">A description-logic based thesaurus and ontology management system. It is a set of services and resources that address NCI’s needs for controlled vocabulary.</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">caDSR</td>
<td align="left" rowspan="1" colspan="1">A repository that the NCI and its partners use it to create, edit and deploy the Common Data Elements.</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">caBIO</td>
<td align="left" rowspan="1" colspan="1">A model driven information system using the cancer Common Ontological Representation Environment; a synthesis of software, vocabulary, and metadata models for cancer research. Each of the caBIO domain objects represents an entity found in biomedical research such as Gene, Chromosome, Single Nucleotide Polymorphisms.</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">CSM</td>
<td align="left" rowspan="1" colspan="1">A comprehensive and integrated solution to common security objectives. It helps eliminate the need for development teams to create their own security methodology.</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn4-cin-08-45">
<p>
<bold>Abbreviations:</bold>
EVS, Enterprise Vocabulary System; caDSR, cancer Data Standards Repository; caBIO, cancer Bioinformatics Infrastructure Objects; CSM, Common Security Module.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<table-wrap id="t5A-cin-08-45" position="float">
<label>Table 5A.</label>
<caption>
<p>Queries used to identify NHL cases within linked legacy databases.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" rowspan="2" valign="top" colspan="1">Query</th>
<th align="left" rowspan="2" valign="top" colspan="1">Source</th>
<th align="center" colspan="3" valign="top" rowspan="1">Criteria
<hr></hr>
</th>
<th align="center" colspan="3" valign="top" rowspan="1">Records Identified (% of all records)
<hr></hr>
</th>
</tr>
<tr>
<th align="center" valign="top" rowspan="1" colspan="1">FL</th>
<th align="center" valign="top" rowspan="1" colspan="1">DLBCL</th>
<th align="center" valign="top" rowspan="1" colspan="1">MCL</th>
<th align="center" valign="top" rowspan="1" colspan="1">FL</th>
<th align="center" valign="top" rowspan="1" colspan="1">DLBCL</th>
<th align="center" valign="top" rowspan="1" colspan="1">MCL</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top" rowspan="1" colspan="1">Q1</td>
<td align="center" valign="top" rowspan="1" colspan="1">Cancer Registry, ICD-O morphology codes plus behavior code 3</td>
<td align="center" valign="top" rowspan="1" colspan="1">Morphology codes 9690, 9691, 9695, 9698</td>
<td align="center" valign="top" rowspan="1" colspan="1">Morphology codes 9680, 9684</td>
<td align="center" valign="top" rowspan="1" colspan="1">Morphology codes 9693</td>
<td align="left" valign="top" rowspan="1" colspan="1">258 (40%)</td>
<td align="left" valign="top" rowspan="1" colspan="1">337 (38%)</td>
<td align="left" valign="top" rowspan="1" colspan="1">66 (8%)</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="1" colspan="1">Q2</td>
<td align="center" valign="top" rowspan="1" colspan="1">Text search-pathology reports</td>
<td align="center" valign="top" rowspan="1" colspan="1">‘follicular lymphoma’</td>
<td align="center" valign="top" rowspan="1" colspan="1">‘diffuse large B’ and ‘diffuse large cell’</td>
<td align="center" valign="top" rowspan="1" colspan="1">‘lymphoma lymphocytic’ OR ‘lymphocytic diffuse’ OR ‘lymphocytic lymphoma’</td>
<td align="left" valign="top" rowspan="1" colspan="1">441 (69%)</td>
<td align="left" valign="top" rowspan="1" colspan="1">367 (41%)</td>
<td align="left" valign="top" rowspan="1" colspan="1">710 (82%)</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="1" colspan="1">Q3</td>
<td align="center" valign="top" rowspan="1" colspan="1">Text search-pathology reports</td>
<td align="center" valign="top" rowspan="1" colspan="1">‘follicular lymphoma’ OR ‘nodular lymphoma’ OR ‘Brill-Symmers’ OR ‘reticulosarcoma— follicular’ OR ‘follicular lymphosarcoma’ OR ‘follicle center lymphoma’ OR ‘follicular non-Hodgkin’</td>
<td align="center" valign="top" rowspan="1" colspan="1">‘lymphoma diffuse histiocytic’ OR ‘lymphoma diffuse large’ OR ‘lymphoma large B’ OR ‘lymphoma large cell’ OR ‘lymphoma diffuse’ OR ‘diffuse non Hodgkin’ OR ‘lymphoma histiocytic’</td>
<td align="center" valign="top" rowspan="1" colspan="1">‘mantle cell lymphoma’ OR ‘lymphoma mantle cell’ OR ‘mantle zone lymphoma’ OR ‘lymphoma mantle zone’</td>
<td align="left" valign="top" rowspan="1" colspan="1">176 (27%)</td>
<td align="left" valign="top" rowspan="1" colspan="1">321 (36%)</td>
<td align="left" valign="top" rowspan="1" colspan="1">456 (52%)</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="1" colspan="1">Q4</td>
<td align="center" valign="top" rowspan="1" colspan="1">Text search-all medical reports</td>
<td align="center" valign="top" rowspan="1" colspan="1">Same as Q2</td>
<td align="center" valign="top" rowspan="1" colspan="1">Same as Q2</td>
<td align="center" valign="top" rowspan="1" colspan="1">Same as Q2</td>
<td align="left" valign="top" rowspan="1" colspan="1">591 (92%)</td>
<td align="left" valign="top" rowspan="1" colspan="1">567 (64%)</td>
<td align="left" valign="top" rowspan="1" colspan="1">775 (89%)</td>
</tr>
<tr>
<td align="left" rowspan="2" valign="top" colspan="1">Q5</td>
<td align="center" valign="top" rowspan="1" colspan="1">Text search-all medical reports</td>
<td align="center" valign="top" rowspan="1" colspan="1">Same as Q3</td>
<td align="center" valign="top" rowspan="1" colspan="1">Same as Q3</td>
<td align="center" valign="top" rowspan="1" colspan="1">Same as Q3</td>
<td align="left" valign="top" rowspan="1" colspan="1">272 (42%)</td>
<td align="left" valign="top" rowspan="1" colspan="1">471 (53%)</td>
<td align="left" valign="top" rowspan="1" colspan="1">494 (57%)</td>
</tr>
<tr>
<td align="center" valign="top" rowspan="1" colspan="1">Total cases reviewed</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="left" valign="top" rowspan="1" colspan="1">643</td>
<td align="left" valign="top" rowspan="1" colspan="1">886</td>
<td align="left" valign="top" rowspan="1" colspan="1">871</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn5-cin-08-45">
<p>
<bold>Abbreviations:</bold>
FL, Follicular lymphoma; DLBCL, Diffuse large B–cell lymphoma; MCL, Mantle cell lymphoma.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<table-wrap id="t5B-cin-08-45" position="float">
<label>Table 5B.</label>
<caption>
<p>Sensitivity and specificity for linked database queries.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" rowspan="1" colspan="1"></th>
<th align="center" colspan="2" rowspan="1">Follicular Lymphoma
<hr></hr>
</th>
<th align="center" colspan="2" rowspan="1">Diffuse Large B–cell Lymphoma
<hr></hr>
</th>
<th align="center" colspan="2" rowspan="1">Mantle Cell Lymphoma
<hr></hr>
</th>
</tr>
<tr>
<th align="left" rowspan="1" colspan="1">QUERY</th>
<th align="left" rowspan="1" colspan="1">Sensitivity (%)</th>
<th align="left" rowspan="1" colspan="1">Specificity (%)</th>
<th align="left" rowspan="1" colspan="1">Sensitivity (%)</th>
<th align="left" rowspan="1" colspan="1">Specificity (%)</th>
<th align="left" rowspan="1" colspan="1">Sensitivity (%)</th>
<th align="left" rowspan="1" colspan="1">Specificity (%)</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" rowspan="1" colspan="1">
<bold>Q1</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>53</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>73</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>65</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>60</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>54</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>98</bold>
</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">
<bold>Q2</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>70</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>33</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>38</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>66</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>72</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>17</bold>
</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">
<bold>Q3</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>37</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>82</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>55</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>78</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>85</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>51</bold>
</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">
<bold>Q4</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>97</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>13</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>55</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>50</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>44</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>6</bold>
</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">
<bold>Q5</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>57</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>73</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>78</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>56</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>97</bold>
</td>
<td align="center" rowspan="1" colspan="1">
<bold>48</bold>
</td>
</tr>
</tbody>
</table>
</table-wrap>
<table-wrap id="t6-cin-08-45" position="float">
<label>Table 6.</label>
<caption>
<p>Phase 1/Phase 2 Lymphoma clinical trial data collection schedule.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" rowspan="1" colspan="1">Procedures</th>
<th align="center" rowspan="1" colspan="1">Screen</th>
<th align="center" rowspan="1" colspan="1">Cycle 1–8 day 1</th>
<th align="center" rowspan="1" colspan="1">Cycle 1–8 day 8</th>
<th align="center" rowspan="1" colspan="1">End of cycle 2,4,6,8</th>
<th align="center" rowspan="1" colspan="1">End of induction</th>
<th align="center" rowspan="1" colspan="1">Every 3 months maintenance</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" rowspan="1" colspan="1">Physical exam</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1">X</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">ECG</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">ECOG Performance Status</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1">X</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">Chemotherapy administration</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">Bortezomib administration</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">Adverse Event assessment</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1">X</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">Tumor assessment</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">Chest, Abdomen, Pelvis CT scan</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1">X</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">Pathology slides</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">Tumor tissue block</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">Bone Marrow Biopsy</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1">X (if in remission)</td>
<td align="center" rowspan="1" colspan="1">X (if in remission)</td>
<td align="center" rowspan="1" colspan="1">X (if in remission)</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">Minimal Residual Disease Analysis</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1">X (if in remission)</td>
<td align="center" rowspan="1" colspan="1">X (if in remission)</td>
<td align="center" rowspan="1" colspan="1">X (if in remission)</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">
<bold>Labs</bold>
</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">Hematology (CBC)</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1">X</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">Clinical Chemistry</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1">X</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">(basic metabolic)</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">Clinical Chemistry (hepatic)</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">Uric Acid</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">LDH</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">Serum HCG</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">HIV–1</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">Hepatitis B</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">Hepatitis C</td>
<td align="center" rowspan="1" colspan="1">X</td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
<td align="center" rowspan="1" colspan="1"></td>
</tr>
</tbody>
</table>
</table-wrap>
</floats-group>
</pmc>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/CyberinfraV1/Data/Pmc/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000512 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd -nk 000512 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    CyberinfraV1
   |flux=    Pmc
   |étape=   Corpus
   |type=    RBID
   |clé=     PMC:2675136
   |texte=   Development of the Lymphoma Enterprise Architecture Database: A caBIG(tm) Silver level compliant System
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/RBID.i   -Sk "pubmed:19492074" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd   \
       | NlmPubMed2Wicri -a CyberinfraV1 

Wicri

This area was generated with Dilib version V0.6.25.
Data generation: Thu Oct 27 09:30:58 2016. Site generation: Sun Mar 10 23:08:40 2024