CyberinfraV1, Pmc, Corpus, bibRecord, 000336

Data harmonization and federated analysis of population-based studies: the BioSHaRE project

Identifieur interne : 000336 ( Pmc/Corpus ); précédent : 000335; suivant : 000337

Data harmonization and federated analysis of population-based studies: the BioSHaRE project

Auteurs : Dany Doiron ; Paul Burton ; Yannick Marcon ; Amadou Gaye ; Bruce H R. Wolffenbuttel ; Markus Perola ; Ronald P. Stolk ; Luisa Foco ; Cosetta Minelli ; Melanie Waldenberger ; Rolf Holle ; Kirsti Kval Y ; Hans L. Hillege ; Anne-Marie Tassé ; Vincent Ferretti ; Isabel Fortier

Source :

Emerging Themes in Epidemiology [ 1742-7622 ] ; 2013.

RBID : PMC:4175511

Abstract

AbstractsBackground

Individual-level data pooling of large population-based studies across research centres in international research projects faces many hurdles. The BioSHaRE (Biobank Standardisation and Harmonisation for Research Excellence in the European Union) project aims to address these issues by building a collaborative group of investigators and developing tools for data harmonization, database integration and federated data analyses.

Methods

Eight population-based studies in six European countries were recruited to participate in the BioSHaRE project. Through workshops, teleconferences and electronic communications, participating investigators identified a set of 96 variables targeted for harmonization to answer research questions of interest. Using each study’s questionnaires, standard operating procedures, and data dictionaries, harmonization potential was assessed. Whenever harmonization was deemed possible, processing algorithms were developed and implemented in an open-source software infrastructure to transform study-specific data into the target (i.e. harmonized) format. Harmonized datasets located on server in each research centres across Europe were interconnected through a federated database system to perform statistical analysis.

Results

Retrospective harmonization led to the generation of common format variables for 73% of matches considered (96 targeted variables across 8 studies). Authenticated investigators can now perform complex statistical analyses of harmonized datasets stored on distributed servers without actually sharing individual-level data using the DataSHIELD method.

Conclusion

New Internet-based networking technologies and database management systems are providing the means to support collaborative, multi-center research in an efficient and secure manner. The results from this pilot project show that, given a strong collaborative relationship between participating studies, it is possible to seamlessly co-analyse internationally harmonized research databases while allowing each study to retain full control over individual-level data. We encourage additional collaborative research networks in epidemiology, public health, and the social sciences to make use of the open source tools presented herein.

Url:

http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4175511

DOI: 10.1186/1742-7622-10-12
PubMed: 24257327
PubMed Central: 4175511

Links to Exploration step

PMC:4175511

Le document en format XML

<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">Data harmonization and federated analysis of population-based studies: the BioSHaRE project</title>
<author><name sortKey="Doiron, Dany" sort="Doiron, Dany" uniqKey="Doiron D" first="Dany" last="Doiron">Dany Doiron</name>
<affiliation><nlm:aff id="I1">Research Institute of the McGill University Health Centre, 2155 Guy, office 458, Montreal, Quebec H3H 2R9, Canada</nlm:aff>
</affiliation>
<affiliation><nlm:aff id="I2">Public Population Project in Genomics and Society, Montreal, Canada</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Burton, Paul" sort="Burton, Paul" uniqKey="Burton P" first="Paul" last="Burton">Paul Burton</name>
<affiliation><nlm:aff id="I4">D2K Research Group, School of Social and Community Medicine, University of Bristol, Bristol, UK</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Marcon, Yannick" sort="Marcon, Yannick" uniqKey="Marcon Y" first="Yannick" last="Marcon">Yannick Marcon</name>
<affiliation><nlm:aff id="I1">Research Institute of the McGill University Health Centre, 2155 Guy, office 458, Montreal, Quebec H3H 2R9, Canada</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Gaye, Amadou" sort="Gaye, Amadou" uniqKey="Gaye A" first="Amadou" last="Gaye">Amadou Gaye</name>
<affiliation><nlm:aff id="I4">D2K Research Group, School of Social and Community Medicine, University of Bristol, Bristol, UK</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Wolffenbuttel, Bruce H R" sort="Wolffenbuttel, Bruce H R" uniqKey="Wolffenbuttel B" first="Bruce H R" last="Wolffenbuttel">Bruce H R. Wolffenbuttel</name>
<affiliation><nlm:aff id="I5">Department of Endocrinology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Perola, Markus" sort="Perola, Markus" uniqKey="Perola M" first="Markus" last="Perola">Markus Perola</name>
<affiliation><nlm:aff id="I6">Department of Chronic Disease Prevention, Public Health Genomics Unit, National Institute for Health and Welfare, Helsinki, Finland</nlm:aff>
</affiliation>
<affiliation><nlm:aff id="I7">Institute for Molecular Medicine, University of Helsinki, Helsinki, Finland</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Stolk, Ronald P" sort="Stolk, Ronald P" uniqKey="Stolk R" first="Ronald P" last="Stolk">Ronald P. Stolk</name>
<affiliation><nlm:aff id="I8">Department of Epidemiology, University Medical Center Groningen, Groningen, The Netherlands</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Foco, Luisa" sort="Foco, Luisa" uniqKey="Foco L" first="Luisa" last="Foco">Luisa Foco</name>
<affiliation><nlm:aff id="I9">European Academy of Bolzano/Bozen (EURAC), Center for Biomedicine, Bolzano, Italy</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Minelli, Cosetta" sort="Minelli, Cosetta" uniqKey="Minelli C" first="Cosetta" last="Minelli">Cosetta Minelli</name>
<affiliation><nlm:aff id="I13">Respiratory Epidemiology, Occupational Medicine and Public Health, National Heart and Lung Institute, Imperial College, London, UK</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Waldenberger, Melanie" sort="Waldenberger, Melanie" uniqKey="Waldenberger M" first="Melanie" last="Waldenberger">Melanie Waldenberger</name>
<affiliation><nlm:aff id="I10">Helmholtz Zentrum München - German Research Center for Environmental Health, Neuherberg, Germany</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Holle, Rolf" sort="Holle, Rolf" uniqKey="Holle R" first="Rolf" last="Holle">Rolf Holle</name>
<affiliation><nlm:aff id="I10">Helmholtz Zentrum München - German Research Center for Environmental Health, Neuherberg, Germany</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Kval Y, Kirsti" sort="Kval Y, Kirsti" uniqKey="Kval Y K" first="Kirsti" last="Kval Y">Kirsti Kval Y</name>
<affiliation><nlm:aff id="I11">Department of Public Health and General Practice, HUNT Research Center, Norwegian University of Science and Technology, Trondheim, Norway</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Hillege, Hans L" sort="Hillege, Hans L" uniqKey="Hillege H" first="Hans L" last="Hillege">Hans L. Hillege</name>
<affiliation><nlm:aff id="I12">Department of Cardiology and Epidemiology, University Medical Centre Groningen, Groningen, The Netherlands</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Tasse, Anne Marie" sort="Tasse, Anne Marie" uniqKey="Tasse A" first="Anne-Marie" last="Tassé">Anne-Marie Tassé</name>
<affiliation><nlm:aff id="I2">Public Population Project in Genomics and Society, Montreal, Canada</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Ferretti, Vincent" sort="Ferretti, Vincent" uniqKey="Ferretti V" first="Vincent" last="Ferretti">Vincent Ferretti</name>
<affiliation><nlm:aff id="I3">Ontario Institute for Cancer Research, MaRS Centre, Toronto, Canada</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Fortier, Isabel" sort="Fortier, Isabel" uniqKey="Fortier I" first="Isabel" last="Fortier">Isabel Fortier</name>
<affiliation><nlm:aff id="I1">Research Institute of the McGill University Health Centre, 2155 Guy, office 458, Montreal, Quebec H3H 2R9, Canada</nlm:aff>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PMC</idno>
<idno type="pmid">24257327</idno>
<idno type="pmc">4175511</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4175511</idno>
<idno type="RBID">PMC:4175511</idno>
<idno type="doi">10.1186/1742-7622-10-12</idno>
<date when="2013">2013</date>
<idno type="wicri:Area/Pmc/Corpus">000336</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a" type="main">Data harmonization and federated analysis of population-based studies: the BioSHaRE project</title>
<author><name sortKey="Doiron, Dany" sort="Doiron, Dany" uniqKey="Doiron D" first="Dany" last="Doiron">Dany Doiron</name>
<affiliation><nlm:aff id="I1">Research Institute of the McGill University Health Centre, 2155 Guy, office 458, Montreal, Quebec H3H 2R9, Canada</nlm:aff>
</affiliation>
<affiliation><nlm:aff id="I2">Public Population Project in Genomics and Society, Montreal, Canada</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Burton, Paul" sort="Burton, Paul" uniqKey="Burton P" first="Paul" last="Burton">Paul Burton</name>
<affiliation><nlm:aff id="I4">D2K Research Group, School of Social and Community Medicine, University of Bristol, Bristol, UK</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Marcon, Yannick" sort="Marcon, Yannick" uniqKey="Marcon Y" first="Yannick" last="Marcon">Yannick Marcon</name>
<affiliation><nlm:aff id="I1">Research Institute of the McGill University Health Centre, 2155 Guy, office 458, Montreal, Quebec H3H 2R9, Canada</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Gaye, Amadou" sort="Gaye, Amadou" uniqKey="Gaye A" first="Amadou" last="Gaye">Amadou Gaye</name>
<affiliation><nlm:aff id="I4">D2K Research Group, School of Social and Community Medicine, University of Bristol, Bristol, UK</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Wolffenbuttel, Bruce H R" sort="Wolffenbuttel, Bruce H R" uniqKey="Wolffenbuttel B" first="Bruce H R" last="Wolffenbuttel">Bruce H R. Wolffenbuttel</name>
<affiliation><nlm:aff id="I5">Department of Endocrinology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Perola, Markus" sort="Perola, Markus" uniqKey="Perola M" first="Markus" last="Perola">Markus Perola</name>
<affiliation><nlm:aff id="I6">Department of Chronic Disease Prevention, Public Health Genomics Unit, National Institute for Health and Welfare, Helsinki, Finland</nlm:aff>
</affiliation>
<affiliation><nlm:aff id="I7">Institute for Molecular Medicine, University of Helsinki, Helsinki, Finland</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Stolk, Ronald P" sort="Stolk, Ronald P" uniqKey="Stolk R" first="Ronald P" last="Stolk">Ronald P. Stolk</name>
<affiliation><nlm:aff id="I8">Department of Epidemiology, University Medical Center Groningen, Groningen, The Netherlands</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Foco, Luisa" sort="Foco, Luisa" uniqKey="Foco L" first="Luisa" last="Foco">Luisa Foco</name>
<affiliation><nlm:aff id="I9">European Academy of Bolzano/Bozen (EURAC), Center for Biomedicine, Bolzano, Italy</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Minelli, Cosetta" sort="Minelli, Cosetta" uniqKey="Minelli C" first="Cosetta" last="Minelli">Cosetta Minelli</name>
<affiliation><nlm:aff id="I13">Respiratory Epidemiology, Occupational Medicine and Public Health, National Heart and Lung Institute, Imperial College, London, UK</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Waldenberger, Melanie" sort="Waldenberger, Melanie" uniqKey="Waldenberger M" first="Melanie" last="Waldenberger">Melanie Waldenberger</name>
<affiliation><nlm:aff id="I10">Helmholtz Zentrum München - German Research Center for Environmental Health, Neuherberg, Germany</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Holle, Rolf" sort="Holle, Rolf" uniqKey="Holle R" first="Rolf" last="Holle">Rolf Holle</name>
<affiliation><nlm:aff id="I10">Helmholtz Zentrum München - German Research Center for Environmental Health, Neuherberg, Germany</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Kval Y, Kirsti" sort="Kval Y, Kirsti" uniqKey="Kval Y K" first="Kirsti" last="Kval Y">Kirsti Kval Y</name>
<affiliation><nlm:aff id="I11">Department of Public Health and General Practice, HUNT Research Center, Norwegian University of Science and Technology, Trondheim, Norway</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Hillege, Hans L" sort="Hillege, Hans L" uniqKey="Hillege H" first="Hans L" last="Hillege">Hans L. Hillege</name>
<affiliation><nlm:aff id="I12">Department of Cardiology and Epidemiology, University Medical Centre Groningen, Groningen, The Netherlands</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Tasse, Anne Marie" sort="Tasse, Anne Marie" uniqKey="Tasse A" first="Anne-Marie" last="Tassé">Anne-Marie Tassé</name>
<affiliation><nlm:aff id="I2">Public Population Project in Genomics and Society, Montreal, Canada</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Ferretti, Vincent" sort="Ferretti, Vincent" uniqKey="Ferretti V" first="Vincent" last="Ferretti">Vincent Ferretti</name>
<affiliation><nlm:aff id="I3">Ontario Institute for Cancer Research, MaRS Centre, Toronto, Canada</nlm:aff>
</affiliation>
</author>
<author><name sortKey="Fortier, Isabel" sort="Fortier, Isabel" uniqKey="Fortier I" first="Isabel" last="Fortier">Isabel Fortier</name>
<affiliation><nlm:aff id="I1">Research Institute of the McGill University Health Centre, 2155 Guy, office 458, Montreal, Quebec H3H 2R9, Canada</nlm:aff>
</affiliation>
</author>
</analytic>
<series><title level="j">Emerging Themes in Epidemiology</title>
<idno type="eISSN">1742-7622</idno>
<imprint><date when="2013">2013</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass></textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en"><sec><title>Abstracts</title>
<sec><title>Background</title>
<p>Individual-level data pooling of large population-based studies across research centres in international research projects faces many hurdles. The BioSHaRE (Biobank Standardisation and Harmonisation for Research Excellence in the European Union) project aims to address these issues by building a collaborative group of investigators and developing tools for data harmonization, database integration and federated data analyses.</p>
</sec>
<sec><title>Methods</title>
<p>Eight population-based studies in six European countries were recruited to participate in the BioSHaRE project. Through workshops, teleconferences and electronic communications, participating investigators identified a set of 96 variables targeted for harmonization to answer research questions of interest. Using each study’s questionnaires, standard operating procedures, and data dictionaries, harmonization potential was assessed. Whenever harmonization was deemed possible, processing algorithms were developed and implemented in an open-source software infrastructure to transform study-specific data into the target (i.e. harmonized) format. Harmonized datasets located on server in each research centres across Europe were interconnected through a federated database system to perform statistical analysis.</p>
</sec>
<sec><title>Results</title>
<p>Retrospective harmonization led to the generation of common format variables for 73% of matches considered (96 targeted variables across 8 studies). Authenticated investigators can now perform complex statistical analyses of harmonized datasets stored on distributed servers without actually sharing individual-level data using the DataSHIELD method.</p>
</sec>
<sec><title>Conclusion</title>
<p>New Internet-based networking technologies and database management systems are providing the means to support collaborative, multi-center research in an efficient and secure manner. The results from this pilot project show that, given a strong collaborative relationship between participating studies, it is possible to seamlessly co-analyse internationally harmonized research databases while allowing each study to retain full control over individual-level data. We encourage additional collaborative research networks in epidemiology, public health, and the social sciences to make use of the open source tools presented herein.</p>
</sec>
</sec>
</div>
</front>
<back><div1 type="bibliography"><listBibl><biblStruct><analytic><author><name sortKey="Smith Warner, Sa" uniqKey="Smith Warner S">SA Smith-Warner</name>
</author>
<author><name sortKey="Spiegelman, D" uniqKey="Spiegelman D">D Spiegelman</name>
</author>
<author><name sortKey="Ritz, J" uniqKey="Ritz J">J Ritz</name>
</author>
<author><name sortKey="Albanes, D" uniqKey="Albanes D">D Albanes</name>
</author>
<author><name sortKey="Beeson, Wl" uniqKey="Beeson W">WL Beeson</name>
</author>
<author><name sortKey="Bernstein, L" uniqKey="Bernstein L">L Bernstein</name>
</author>
<author><name sortKey="Berrino, F" uniqKey="Berrino F">F Berrino</name>
</author>
<author><name sortKey="Van Den Brandt, Pa" uniqKey="Van Den Brandt P">PA van den Brandt</name>
</author>
<author><name sortKey="Buring, Je" uniqKey="Buring J">JE Buring</name>
</author>
<author><name sortKey="Cho, E" uniqKey="Cho E">E Cho</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Thompson, A" uniqKey="Thompson A">A Thompson</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Khoury, Mj" uniqKey="Khoury M">MJ Khoury</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Hamilton, Cm" uniqKey="Hamilton C">CM Hamilton</name>
</author>
<author><name sortKey="Strader, Lc" uniqKey="Strader L">LC Strader</name>
</author>
<author><name sortKey="Pratt, Jg" uniqKey="Pratt J">JG Pratt</name>
</author>
<author><name sortKey="Maiese, D" uniqKey="Maiese D">D Maiese</name>
</author>
<author><name sortKey="Hendershot, T" uniqKey="Hendershot T">T Hendershot</name>
</author>
<author><name sortKey="Kwok, Rk" uniqKey="Kwok R">RK Kwok</name>
</author>
<author><name sortKey="Hammond, Ja" uniqKey="Hammond J">JA Hammond</name>
</author>
<author><name sortKey="Huggins, W" uniqKey="Huggins W">W Huggins</name>
</author>
<author><name sortKey="Jackman, D" uniqKey="Jackman D">D Jackman</name>
</author>
<author><name sortKey="Pan, H" uniqKey="Pan H">H Pan</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Noale, M" uniqKey="Noale M">M Noale</name>
</author>
<author><name sortKey="Minicuci, N" uniqKey="Minicuci N">N Minicuci</name>
</author>
<author><name sortKey="Bardage, C" uniqKey="Bardage C">C Bardage</name>
</author>
<author><name sortKey="Gindin, J" uniqKey="Gindin J">J Gindin</name>
</author>
<author><name sortKey="Nikula, S" uniqKey="Nikula S">S Nikula</name>
</author>
<author><name sortKey="Pluijm, S" uniqKey="Pluijm S">S Pluijm</name>
</author>
<author><name sortKey="Rodriguez Laso, A" uniqKey="Rodriguez Laso A">A Rodríguez-Laso</name>
</author>
<author><name sortKey="Maggi, S" uniqKey="Maggi S">S Maggi</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Serra Majem, L" uniqKey="Serra Majem L">L Serra-Majem</name>
</author>
<author><name sortKey="Maclean, D" uniqKey="Maclean D">D MacLean</name>
</author>
<author><name sortKey="Ribas, L" uniqKey="Ribas L">L Ribas</name>
</author>
<author><name sortKey="Brule, D" uniqKey="Brule D">D Brulé</name>
</author>
<author><name sortKey="Sekula, W" uniqKey="Sekula W">W Sekula</name>
</author>
<author><name sortKey="Prattala, R" uniqKey="Prattala R">R Prattala</name>
</author>
<author><name sortKey="Garcia Closas, R" uniqKey="Garcia Closas R">R Garcia-Closas</name>
</author>
<author><name sortKey="Yngve, A" uniqKey="Yngve A">A Yngve</name>
</author>
<author><name sortKey="Lalonde, M" uniqKey="Lalonde M">M Lalonde</name>
</author>
<author><name sortKey="Petrasovits, A" uniqKey="Petrasovits A">A Petrasovits</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Bath, Pa" uniqKey="Bath P">PA Bath</name>
</author>
<author><name sortKey="Deeg, D" uniqKey="Deeg D">D Deeg</name>
</author>
<author><name sortKey="Poppelaars, J" uniqKey="Poppelaars J">J Poppelaars</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Sansone, S A" uniqKey="Sansone S">S-A Sansone</name>
</author>
<author><name sortKey="Rocca Serra, P" uniqKey="Rocca Serra P">P Rocca-Serra</name>
</author>
<author><name sortKey="Field, D" uniqKey="Field D">D Field</name>
</author>
<author><name sortKey="Maguire, E" uniqKey="Maguire E">E Maguire</name>
</author>
<author><name sortKey="Taylor, C" uniqKey="Taylor C">C Taylor</name>
</author>
<author><name sortKey="Hofmann, O" uniqKey="Hofmann O">O Hofmann</name>
</author>
<author><name sortKey="Fang, H" uniqKey="Fang H">H Fang</name>
</author>
<author><name sortKey="Neumann, S" uniqKey="Neumann S">S Neumann</name>
</author>
<author><name sortKey="Tong, W" uniqKey="Tong W">W Tong</name>
</author>
<author><name sortKey="Amaral Zettler, L" uniqKey="Amaral Zettler L">L Amaral-Zettler</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Schad, Pa" uniqKey="Schad P">PA Schad</name>
</author>
<author><name sortKey="Mobley, Lr" uniqKey="Mobley L">LR Mobley</name>
</author>
<author><name sortKey="Hamilton, Cm" uniqKey="Hamilton C">CM Hamilton</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Seminara, D" uniqKey="Seminara D">D Seminara</name>
</author>
<author><name sortKey="Khoury, Mj" uniqKey="Khoury M">MJ Khoury</name>
</author>
<author><name sortKey="O Rien, Tr" uniqKey="O Rien T">TR O’Brien</name>
</author>
<author><name sortKey="Manolio, T" uniqKey="Manolio T">T Manolio</name>
</author>
<author><name sortKey="Gwinn, Ml" uniqKey="Gwinn M">ML Gwinn</name>
</author>
<author><name sortKey="Little, J" uniqKey="Little J">J Little</name>
</author>
<author><name sortKey="Higgins, Jpt" uniqKey="Higgins J">JPT Higgins</name>
</author>
<author><name sortKey="Bernstein, Jl" uniqKey="Bernstein J">JL Bernstein</name>
</author>
<author><name sortKey="Boffetta, P" uniqKey="Boffetta P">P Boffetta</name>
</author>
<author><name sortKey="Bondy, M" uniqKey="Bondy M">M Bondy</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Budin Lj Sne, I" uniqKey="Budin Lj Sne I">I Budin-Ljøsne</name>
</author>
<author><name sortKey="Isaeva, J" uniqKey="Isaeva J">J Isaeva</name>
</author>
<author><name sortKey="Knoppers, Bm" uniqKey="Knoppers B">BM Knoppers</name>
</author>
<author><name sortKey="Tasse, Am" uniqKey="Tasse A">AM Tassé</name>
</author>
<author><name sortKey="Shen, H Y" uniqKey="Shen H">H-y Shen</name>
</author>
<author><name sortKey="Mccarthy, Mi" uniqKey="Mccarthy M">MI McCarthy</name>
</author>
<author><name sortKey="Harris, Jr" uniqKey="Harris J">JR Harris</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Bousquet, J" uniqKey="Bousquet J">J Bousquet</name>
</author>
<author><name sortKey="Anto, J" uniqKey="Anto J">J Anto</name>
</author>
<author><name sortKey="Sunyer, J" uniqKey="Sunyer J">J Sunyer</name>
</author>
<author><name sortKey="Nieuwenhuijsen, M" uniqKey="Nieuwenhuijsen M">M Nieuwenhuijsen</name>
</author>
<author><name sortKey="Vrijheid, M" uniqKey="Vrijheid M">M Vrijheid</name>
</author>
<author><name sortKey="Keil, T" uniqKey="Keil T">T Keil</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Harris, Jr" uniqKey="Harris J">JR Harris</name>
</author>
<author><name sortKey="Burton, P" uniqKey="Burton P">P Burton</name>
</author>
<author><name sortKey="Knoppers, Bm" uniqKey="Knoppers B">BM Knoppers</name>
</author>
<author><name sortKey="Lindpaintner, K" uniqKey="Lindpaintner K">K Lindpaintner</name>
</author>
<author><name sortKey="Bledsoe, M" uniqKey="Bledsoe M">M Bledsoe</name>
</author>
<author><name sortKey="Brookes, Aj" uniqKey="Brookes A">AJ Brookes</name>
</author>
<author><name sortKey="Budin Ljosne, I" uniqKey="Budin Ljosne I">I Budin-Ljosne</name>
</author>
<author><name sortKey="Chisholm, R" uniqKey="Chisholm R">R Chisholm</name>
</author>
<author><name sortKey="Cox, D" uniqKey="Cox D">D Cox</name>
</author>
<author><name sortKey="Deschenes, M" uniqKey="Deschenes M">M Deschenes</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Zika, E" uniqKey="Zika E">E Zika</name>
</author>
<author><name sortKey="Paci, D" uniqKey="Paci D">D Paci</name>
</author>
<author><name sortKey="Schulte In Den B Umen, T" uniqKey="Schulte In Den B Umen T">T Schulte in den Bäumen</name>
</author>
<author><name sortKey="Braun, A" uniqKey="Braun A">A Braun</name>
</author>
<author><name sortKey="Rijkers Defrasne, S" uniqKey="Rijkers Defrasne S">S RijKers-Defrasne</name>
</author>
<author><name sortKey="Deschenes, M" uniqKey="Deschenes M">M Deschênes</name>
</author>
<author><name sortKey="Fortier, I" uniqKey="Fortier I">I Fortier</name>
</author>
<author><name sortKey="Laage Hellman, J" uniqKey="Laage Hellman J">J Laage-Hellman</name>
</author>
<author><name sortKey="Scerri, Ca" uniqKey="Scerri C">CA Scerri</name>
</author>
<author><name sortKey="Ibarreta, D" uniqKey="Ibarreta D">D Ibarreta</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Gottweis, H" uniqKey="Gottweis H">H Gottweis</name>
</author>
<author><name sortKey="Kaye, J" uniqKey="Kaye J">J Kaye</name>
</author>
<author><name sortKey="Bignami, F" uniqKey="Bignami F">F Bignami</name>
</author>
<author><name sortKey="Rial Sebbag, E" uniqKey="Rial Sebbag E">E Rial-Sebbag</name>
</author>
<author><name sortKey="Lattanzi, R" uniqKey="Lattanzi R">R Lattanzi</name>
</author>
<author><name sortKey="Macek, M" uniqKey="Macek M">M Macek</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Bookman, Eb" uniqKey="Bookman E">EB Bookman</name>
</author>
<author><name sortKey="Mcallister, K" uniqKey="Mcallister K">K McAllister</name>
</author>
<author><name sortKey="Gillanders, E" uniqKey="Gillanders E">E Gillanders</name>
</author>
<author><name sortKey="Wanke, K" uniqKey="Wanke K">K Wanke</name>
</author>
<author><name sortKey="Balshaw, D" uniqKey="Balshaw D">D Balshaw</name>
</author>
<author><name sortKey="Rutter, J" uniqKey="Rutter J">J Rutter</name>
</author>
<author><name sortKey="Reedy, J" uniqKey="Reedy J">J Reedy</name>
</author>
<author><name sortKey="Shaughnessy, D" uniqKey="Shaughnessy D">D Shaughnessy</name>
</author>
<author><name sortKey="Agurs Collins, T" uniqKey="Agurs Collins T">T Agurs-Collins</name>
</author>
<author><name sortKey="Paltoo, D" uniqKey="Paltoo D">D Paltoo</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Khoury, Mj" uniqKey="Khoury M">MJ Khoury</name>
</author>
<author><name sortKey="Lam, Tk" uniqKey="Lam T">TK Lam</name>
</author>
<author><name sortKey="Ioannidis, Jpa" uniqKey="Ioannidis J">JPA Ioannidis</name>
</author>
<author><name sortKey="Hartge, P" uniqKey="Hartge P">P Hartge</name>
</author>
<author><name sortKey="Spitz, Mr" uniqKey="Spitz M">MR Spitz</name>
</author>
<author><name sortKey="Buring, Je" uniqKey="Buring J">JE Buring</name>
</author>
<author><name sortKey="Chanock, Sj" uniqKey="Chanock S">SJ Chanock</name>
</author>
<author><name sortKey="Croyle, R" uniqKey="Croyle R">R Croyle</name>
</author>
<author><name sortKey="Goddard, Kab" uniqKey="Goddard K">KAB Goddard</name>
</author>
<author><name sortKey="Ginsburg, Gs" uniqKey="Ginsburg G">GS Ginsburg</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Walport, M" uniqKey="Walport M">M Walport</name>
</author>
<author><name sortKey="Brest, P" uniqKey="Brest P">P Brest</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Pisani, E" uniqKey="Pisani E">E Pisani</name>
</author>
<author><name sortKey="Abouzahr, C" uniqKey="Abouzahr C">C AbouZahr</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Bennett, Sn" uniqKey="Bennett S">SN Bennett</name>
</author>
<author><name sortKey="Caporaso, N" uniqKey="Caporaso N">N Caporaso</name>
</author>
<author><name sortKey="Fitzpatrick, Al" uniqKey="Fitzpatrick A">AL Fitzpatrick</name>
</author>
<author><name sortKey="Agrawal, A" uniqKey="Agrawal A">A Agrawal</name>
</author>
<author><name sortKey="Barnes, K" uniqKey="Barnes K">K Barnes</name>
</author>
<author><name sortKey="Boyd, Ha" uniqKey="Boyd H">HA Boyd</name>
</author>
<author><name sortKey="Cornelis, Mc" uniqKey="Cornelis M">MC Cornelis</name>
</author>
<author><name sortKey="Hansel, Nn" uniqKey="Hansel N">NN Hansel</name>
</author>
<author><name sortKey="Heiss, G" uniqKey="Heiss G">G Heiss</name>
</author>
<author><name sortKey="Heit, Ja" uniqKey="Heit J">JA Heit</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Vickers, Aj" uniqKey="Vickers A">AJ Vickers</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Fortier, I" uniqKey="Fortier I">I Fortier</name>
</author>
<author><name sortKey="Burton, Pr" uniqKey="Burton P">PR Burton</name>
</author>
<author><name sortKey="Robson, Pj" uniqKey="Robson P">PJ Robson</name>
</author>
<author><name sortKey="Ferretti, V" uniqKey="Ferretti V">V Ferretti</name>
</author>
<author><name sortKey="Little, J" uniqKey="Little J">J Little</name>
</author>
<author><name sortKey="L Eureux, F" uniqKey="L Eureux F">F L’Heureux</name>
</author>
<author><name sortKey="Deschenes, M" uniqKey="Deschenes M">M Deschenes</name>
</author>
<author><name sortKey="Knoppers, Bm" uniqKey="Knoppers B">BM Knoppers</name>
</author>
<author><name sortKey="Doiron, D" uniqKey="Doiron D">D Doiron</name>
</author>
<author><name sortKey="Keers, Jc" uniqKey="Keers J">JC Keers</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Howe, D" uniqKey="Howe D">D Howe</name>
</author>
<author><name sortKey="Costanzo, M" uniqKey="Costanzo M">M Costanzo</name>
</author>
<author><name sortKey="Fey, P" uniqKey="Fey P">P Fey</name>
</author>
<author><name sortKey="Gojobori, T" uniqKey="Gojobori T">T Gojobori</name>
</author>
<author><name sortKey="Hannick, L" uniqKey="Hannick L">L Hannick</name>
</author>
<author><name sortKey="Hide, W" uniqKey="Hide W">W Hide</name>
</author>
<author><name sortKey="Hill, Dp" uniqKey="Hill D">DP Hill</name>
</author>
<author><name sortKey="Kania, R" uniqKey="Kania R">R Kania</name>
</author>
<author><name sortKey="Schaeffer, M" uniqKey="Schaeffer M">M Schaeffer</name>
</author>
<author><name sortKey="St Pierre, S" uniqKey="St Pierre S">S St Pierre</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct><analytic><author><name sortKey="Kaye, J" uniqKey="Kaye J">J Kaye</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Knoppers, B" uniqKey="Knoppers B">B Knoppers</name>
</author>
<author><name sortKey="Harris, J" uniqKey="Harris J">J Harris</name>
</author>
<author><name sortKey="Tasse, A" uniqKey="Tasse A">A Tasse</name>
</author>
<author><name sortKey="Budin Ljosne, I" uniqKey="Budin Ljosne I">I Budin-Ljosne</name>
</author>
<author><name sortKey="Kaye, J" uniqKey="Kaye J">J Kaye</name>
</author>
<author><name sortKey="Deschenes, M" uniqKey="Deschenes M">M Deschenes</name>
</author>
<author><name sortKey="Zawati, M" uniqKey="Zawati M">M Zawati</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Karr, Af" uniqKey="Karr A">AF Karr</name>
</author>
<author><name sortKey="Fulp, Wj" uniqKey="Fulp W">WJ Fulp</name>
</author>
<author><name sortKey="Vera, F" uniqKey="Vera F">F Vera</name>
</author>
<author><name sortKey="Young, Ss" uniqKey="Young S">SS Young</name>
</author>
<author><name sortKey="Lin, X" uniqKey="Lin X">X Lin</name>
</author>
<author><name sortKey="Reiter, Jp" uniqKey="Reiter J">JP Reiter</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Wolfson, M" uniqKey="Wolfson M">M Wolfson</name>
</author>
<author><name sortKey="Wallace, Se" uniqKey="Wallace S">SE Wallace</name>
</author>
<author><name sortKey="Masca, N" uniqKey="Masca N">N Masca</name>
</author>
<author><name sortKey="Rowe, G" uniqKey="Rowe G">G Rowe</name>
</author>
<author><name sortKey="Sheehan, Na" uniqKey="Sheehan N">NA Sheehan</name>
</author>
<author><name sortKey="Ferretti, V" uniqKey="Ferretti V">V Ferretti</name>
</author>
<author><name sortKey="Laflamme, P" uniqKey="Laflamme P">P LaFlamme</name>
</author>
<author><name sortKey="Tobin, Md" uniqKey="Tobin M">MD Tobin</name>
</author>
<author><name sortKey="Macleod, J" uniqKey="Macleod J">J Macleod</name>
</author>
<author><name sortKey="Little, J" uniqKey="Little J">J Little</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Muilu, J" uniqKey="Muilu J">J Muilu</name>
</author>
<author><name sortKey="Peltonen, L" uniqKey="Peltonen L">L Peltonen</name>
</author>
<author><name sortKey="Litton, Je" uniqKey="Litton J">JE Litton</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Yuille, M" uniqKey="Yuille M">M Yuille</name>
</author>
<author><name sortKey="Van Ommen, G J" uniqKey="Van Ommen G">G-J van Ommen</name>
</author>
<author><name sortKey="Brechot, C" uniqKey="Brechot C">C Bréchot</name>
</author>
<author><name sortKey="Cambon Thomsen, A" uniqKey="Cambon Thomsen A">A Cambon-Thomsen</name>
</author>
<author><name sortKey="Dagher, G" uniqKey="Dagher G">G Dagher</name>
</author>
<author><name sortKey="Landegren, U" uniqKey="Landegren U">U Landegren</name>
</author>
<author><name sortKey="Litton, J E" uniqKey="Litton J">J-E Litton</name>
</author>
<author><name sortKey="Pasterk, M" uniqKey="Pasterk M">M Pasterk</name>
</author>
<author><name sortKey="Peltonen, L" uniqKey="Peltonen L">L Peltonen</name>
</author>
<author><name sortKey="Taussig, M" uniqKey="Taussig M">M Taussig</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Ford, D" uniqKey="Ford D">D Ford</name>
</author>
<author><name sortKey="Jones, K" uniqKey="Jones K">K Jones</name>
</author>
<author><name sortKey="Verplancke, J P" uniqKey="Verplancke J">J-P Verplancke</name>
</author>
<author><name sortKey="Lyons, R" uniqKey="Lyons R">R Lyons</name>
</author>
<author><name sortKey="John, G" uniqKey="John G">G John</name>
</author>
<author><name sortKey="Brown, G" uniqKey="Brown G">G Brown</name>
</author>
<author><name sortKey="Brooks, C" uniqKey="Brooks C">C Brooks</name>
</author>
<author><name sortKey="Thompson, S" uniqKey="Thompson S">S Thompson</name>
</author>
<author><name sortKey="Bodger, O" uniqKey="Bodger O">O Bodger</name>
</author>
<author><name sortKey="Couch, T" uniqKey="Couch T">T Couch</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="El Emam, K" uniqKey="El Emam K">K El Emam</name>
</author>
<author><name sortKey="Samet, S" uniqKey="Samet S">S Samet</name>
</author>
<author><name sortKey="Arbuckle, L" uniqKey="Arbuckle L">L Arbuckle</name>
</author>
<author><name sortKey="Tamblyn, R" uniqKey="Tamblyn R">R Tamblyn</name>
</author>
<author><name sortKey="Earle, C" uniqKey="Earle C">C Earle</name>
</author>
<author><name sortKey="Kantarcioglu, M" uniqKey="Kantarcioglu M">M Kantarcioglu</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct><analytic><author><name sortKey="Karelis, Ad" uniqKey="Karelis A">AD Karelis</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Denis, Gv" uniqKey="Denis G">GV Denis</name>
</author>
<author><name sortKey="Obin, Ms" uniqKey="Obin M">MS Obin</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct><analytic><author><name sortKey="Fortier, I" uniqKey="Fortier I">I Fortier</name>
</author>
<author><name sortKey="Doiron, D" uniqKey="Doiron D">D Doiron</name>
</author>
<author><name sortKey="Little, J" uniqKey="Little J">J Little</name>
</author>
<author><name sortKey="Ferretti, V" uniqKey="Ferretti V">V Ferretti</name>
</author>
<author><name sortKey="L Eureux, F" uniqKey="L Eureux F">F L’Heureux</name>
</author>
<author><name sortKey="Stolk, Rp" uniqKey="Stolk R">RP Stolk</name>
</author>
<author><name sortKey="Knoppers, Bm" uniqKey="Knoppers B">BM Knoppers</name>
</author>
<author><name sortKey="Hudson, Tj" uniqKey="Hudson T">TJ Hudson</name>
</author>
<author><name sortKey="Burton, Pr" uniqKey="Burton P">PR Burton</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct><analytic><author><name sortKey="Doiron, D" uniqKey="Doiron D">D Doiron</name>
</author>
<author><name sortKey="Raina, P" uniqKey="Raina P">P Raina</name>
</author>
<author><name sortKey="Ferretti, V" uniqKey="Ferretti V">V Ferretti</name>
</author>
<author><name sortKey="L Eureux, F" uniqKey="L Eureux F">F L’Heureux</name>
</author>
<author><name sortKey="Fortier, I" uniqKey="Fortier I">I Fortier</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct><analytic><author><name sortKey="Knoppers, B" uniqKey="Knoppers B">B Knoppers</name>
</author>
<author><name sortKey="Fortier, I" uniqKey="Fortier I">I Fortier</name>
</author>
<author><name sortKey="Legault, D" uniqKey="Legault D">D Legault</name>
</author>
<author><name sortKey="Burton, P" uniqKey="Burton P">P Burton</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct><analytic><author><name sortKey="Flanagan, D" uniqKey="Flanagan D">D Flanagan</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Jones, E" uniqKey="Jones E">E Jones</name>
</author>
<author><name sortKey="Sheehan, N" uniqKey="Sheehan N">N Sheehan</name>
</author>
<author><name sortKey="Masca, N" uniqKey="Masca N">N Masca</name>
</author>
<author><name sortKey="Wallace, S" uniqKey="Wallace S">S Wallace</name>
</author>
<author><name sortKey="Murtagh, M" uniqKey="Murtagh M">M Murtagh</name>
</author>
<author><name sortKey="Burton, P" uniqKey="Burton P">P Burton</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Sutton, Aj" uniqKey="Sutton A">AJ Sutton</name>
</author>
<author><name sortKey="Kendrick, D" uniqKey="Kendrick D">D Kendrick</name>
</author>
<author><name sortKey="Coupland, Cac" uniqKey="Coupland C">CAC Coupland</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Gomatam, S" uniqKey="Gomatam S">S Gomatam</name>
</author>
<author><name sortKey="Karr, Af" uniqKey="Karr A">AF Karr</name>
</author>
<author><name sortKey="Reiter, Jp" uniqKey="Reiter J">JP Reiter</name>
</author>
<author><name sortKey="Sanil, Ap" uniqKey="Sanil A">AP Sanil</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
</listBibl>
</div1>
</back>
</TEI>
<pmc article-type="research-article" xml:lang="en"><pmc-dir>properties open_access</pmc-dir>
  <front><journal-meta><journal-id journal-id-type="nlm-ta">Emerg Themes Epidemiol</journal-id>
<journal-id journal-id-type="iso-abbrev">Emerg Themes Epidemiol</journal-id>
<journal-title-group><journal-title>Emerging Themes in Epidemiology</journal-title>
</journal-title-group>
<issn pub-type="epub">1742-7622</issn>
<publisher><publisher-name>BioMed Central</publisher-name>
</publisher>
</journal-meta>
<article-meta><article-id pub-id-type="pmid">24257327</article-id>
<article-id pub-id-type="pmc">4175511</article-id>
<article-id pub-id-type="publisher-id">1742-7622-10-12</article-id>
<article-id pub-id-type="doi">10.1186/1742-7622-10-12</article-id>
<article-categories><subj-group subj-group-type="heading"><subject>Analytic Perspective</subject>
</subj-group>
</article-categories>
<title-group><article-title>Data harmonization and federated analysis of population-based studies: the BioSHaRE project</article-title>
</title-group>
<contrib-group><contrib contrib-type="author" corresp="yes" id="A1"><name><surname>Doiron</surname>
<given-names>Dany</given-names>
</name>
<xref ref-type="aff" rid="I1">1</xref>
<xref ref-type="aff" rid="I2">2</xref>
<email>ddoiron@maelstrom-research.org</email>
</contrib>
<contrib contrib-type="author" id="A2"><name><surname>Burton</surname>
<given-names>Paul</given-names>
</name>
<xref ref-type="aff" rid="I4">4</xref>
<email>pb51@leicester.ac.uk</email>
</contrib>
<contrib contrib-type="author" id="A3"><name><surname>Marcon</surname>
<given-names>Yannick</given-names>
</name>
<xref ref-type="aff" rid="I1">1</xref>
<email>yannick.marcon@obiba.org</email>
</contrib>
<contrib contrib-type="author" id="A4"><name><surname>Gaye</surname>
<given-names>Amadou</given-names>
</name>
<xref ref-type="aff" rid="I4">4</xref>
<email>ag239@le.ac.uk</email>
</contrib>
<contrib contrib-type="author" id="A5"><name><surname>Wolffenbuttel</surname>
<given-names>Bruce H R</given-names>
</name>
<xref ref-type="aff" rid="I5">5</xref>
<email>bwo@umcg.nl</email>
</contrib>
<contrib contrib-type="author" id="A6"><name><surname>Perola</surname>
<given-names>Markus</given-names>
</name>
<xref ref-type="aff" rid="I6">6</xref>
<xref ref-type="aff" rid="I7">7</xref>
<email>markus.perola@thl.fi</email>
</contrib>
<contrib contrib-type="author" id="A7"><name><surname>Stolk</surname>
<given-names>Ronald P</given-names>
</name>
<xref ref-type="aff" rid="I8">8</xref>
<email>r.p.stolk@umcg.nl</email>
</contrib>
<contrib contrib-type="author" id="A8"><name><surname>Foco</surname>
<given-names>Luisa</given-names>
</name>
<xref ref-type="aff" rid="I9">9</xref>
<email>luisa.foco@eurac.edu</email>
</contrib>
<contrib contrib-type="author" id="A9"><name><surname>Minelli</surname>
<given-names>Cosetta</given-names>
</name>
<xref ref-type="aff" rid="I13">13</xref>
<email>cosetta.minelli@eurac.edu</email>
</contrib>
<contrib contrib-type="author" id="A10"><name><surname>Waldenberger</surname>
<given-names>Melanie</given-names>
</name>
<xref ref-type="aff" rid="I10">10</xref>
<email>waldenberger@helmholtz-muenchen.de</email>
</contrib>
<contrib contrib-type="author" id="A11"><name><surname>Holle</surname>
<given-names>Rolf</given-names>
</name>
<xref ref-type="aff" rid="I10">10</xref>
<email>holle@helmholtz-muenchen.de</email>
</contrib>
<contrib contrib-type="author" id="A12"><name><surname>Kvaløy</surname>
<given-names>Kirsti</given-names>
</name>
<xref ref-type="aff" rid="I11">11</xref>
<email>kirsti.kvaloy@ntnu.no</email>
</contrib>
<contrib contrib-type="author" id="A13"><name><surname>Hillege</surname>
<given-names>Hans L</given-names>
</name>
<xref ref-type="aff" rid="I12">12</xref>
<email>h.hillege@umcg.nl</email>
</contrib>
<contrib contrib-type="author" id="A14"><name><surname>Tassé</surname>
<given-names>Anne-Marie</given-names>
</name>
<xref ref-type="aff" rid="I2">2</xref>
<email>anne-marie.tasse@mcgill.ca</email>
</contrib>
<contrib contrib-type="author" equal-contrib="yes" id="A15"><name><surname>Ferretti</surname>
<given-names>Vincent</given-names>
</name>
<xref ref-type="aff" rid="I3">3</xref>
<email>vincent.ferretti@oicr.on.ca</email>
</contrib>
<contrib contrib-type="author" equal-contrib="yes" id="A16"><name><surname>Fortier</surname>
<given-names>Isabel</given-names>
</name>
<xref ref-type="aff" rid="I1">1</xref>
<email>isabel.fortier@mail.mcgill.ca</email>
</contrib>
</contrib-group>
<aff id="I1"><label>1</label>
Research Institute of the McGill University Health Centre, 2155 Guy, office 458, Montreal, Quebec H3H 2R9, Canada</aff>
<aff id="I2"><label>2</label>
Public Population Project in Genomics and Society, Montreal, Canada</aff>
<aff id="I3"><label>3</label>
Ontario Institute for Cancer Research, MaRS Centre, Toronto, Canada</aff>
<aff id="I4"><label>4</label>
D2K Research Group, School of Social and Community Medicine, University of Bristol, Bristol, UK</aff>
<aff id="I5"><label>5</label>
Department of Endocrinology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands</aff>
<aff id="I6"><label>6</label>
Department of Chronic Disease Prevention, Public Health Genomics Unit, National Institute for Health and Welfare, Helsinki, Finland</aff>
<aff id="I7"><label>7</label>
Institute for Molecular Medicine, University of Helsinki, Helsinki, Finland</aff>
<aff id="I8"><label>8</label>
Department of Epidemiology, University Medical Center Groningen, Groningen, The Netherlands</aff>
<aff id="I9"><label>9</label>
European Academy of Bolzano/Bozen (EURAC), Center for Biomedicine, Bolzano, Italy</aff>
<aff id="I10"><label>10</label>
Helmholtz Zentrum München - German Research Center for Environmental Health, Neuherberg, Germany</aff>
<aff id="I11"><label>11</label>
Department of Public Health and General Practice, HUNT Research Center, Norwegian University of Science and Technology, Trondheim, Norway</aff>
<aff id="I12"><label>12</label>
Department of Cardiology and Epidemiology, University Medical Centre Groningen, Groningen, The Netherlands</aff>
<aff id="I13"><label>13</label>
Respiratory Epidemiology, Occupational Medicine and Public Health, National Heart and Lung Institute, Imperial College, London, UK</aff>
<pub-date pub-type="collection"><year>2013</year>
</pub-date>
<pub-date pub-type="epub"><day>21</day>
<month>11</month>
<year>2013</year>
</pub-date>
<volume>10</volume>
<fpage>12</fpage>
<lpage>12</lpage>
<history><date date-type="received"><day>3</day>
<month>7</month>
<year>2013</year>
</date>
<date date-type="accepted"><day>11</day>
<month>11</month>
<year>2013</year>
</date>
</history>
<permissions><copyright-statement>Copyright © 2013 Doiron et al.; licensee BioMed Central Ltd.</copyright-statement>
<copyright-year>2013</copyright-year>
<copyright-holder>Doiron et al.; licensee BioMed Central Ltd.</copyright-holder>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/2.0"><license-p>This is an open access article distributed under the terms of the Creative Commons Attribution License (<ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/2.0">http://creativecommons.org/licenses/by/2.0</ext-link>
), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
</license>
</permissions>
<self-uri xlink:href="http://www.ete-online.com/content/10/1/12"></self-uri>
<abstract><sec><title>Abstracts</title>
<sec><title>Background</title>
<p>Individual-level data pooling of large population-based studies across research centres in international research projects faces many hurdles. The BioSHaRE (Biobank Standardisation and Harmonisation for Research Excellence in the European Union) project aims to address these issues by building a collaborative group of investigators and developing tools for data harmonization, database integration and federated data analyses.</p>
</sec>
<sec><title>Methods</title>
<p>Eight population-based studies in six European countries were recruited to participate in the BioSHaRE project. Through workshops, teleconferences and electronic communications, participating investigators identified a set of 96 variables targeted for harmonization to answer research questions of interest. Using each study’s questionnaires, standard operating procedures, and data dictionaries, harmonization potential was assessed. Whenever harmonization was deemed possible, processing algorithms were developed and implemented in an open-source software infrastructure to transform study-specific data into the target (i.e. harmonized) format. Harmonized datasets located on server in each research centres across Europe were interconnected through a federated database system to perform statistical analysis.</p>
</sec>
<sec><title>Results</title>
<p>Retrospective harmonization led to the generation of common format variables for 73% of matches considered (96 targeted variables across 8 studies). Authenticated investigators can now perform complex statistical analyses of harmonized datasets stored on distributed servers without actually sharing individual-level data using the DataSHIELD method.</p>
</sec>
<sec><title>Conclusion</title>
<p>New Internet-based networking technologies and database management systems are providing the means to support collaborative, multi-center research in an efficient and secure manner. The results from this pilot project show that, given a strong collaborative relationship between participating studies, it is possible to seamlessly co-analyse internationally harmonized research databases while allowing each study to retain full control over individual-level data. We encourage additional collaborative research networks in epidemiology, public health, and the social sciences to make use of the open source tools presented herein.</p>
</sec>
</sec>
</abstract>
</article-meta>
</front>
<body><sec sec-type="intro"><title>Introduction</title>
<p>The benefits of harmonizing and pooling research databases are numerous. Integrating harmonized data from different populations allows achieving sample sizes that could not be obtained with individual studies [<xref ref-type="bibr" rid="B1">1</xref>
-<xref ref-type="bibr" rid="B4">4</xref>
], improves the generalizability of results [<xref ref-type="bibr" rid="B3">3</xref>
-<xref ref-type="bibr" rid="B5">5</xref>
], helps ensure the validity of comparative research [<xref ref-type="bibr" rid="B6">6</xref>
,<xref ref-type="bibr" rid="B7">7</xref>
], encourages more efficient secondary usage of existing data [<xref ref-type="bibr" rid="B8">8</xref>
], and provides opportunities for collaborative and multi-centre research [<xref ref-type="bibr" rid="B9">9</xref>
-<xref ref-type="bibr" rid="B12">12</xref>
]. Governments, funders, and researchers alike have been stressing the importance of harmonization and collaborative use of data and samples in the population health and biobanking fields over the past half-decade [<xref ref-type="bibr" rid="B13">13</xref>
-<xref ref-type="bibr" rid="B21">21</xref>
]. However, managing and harmonizing very large amounts of data from different sources is a significant challenge [<xref ref-type="bibr" rid="B20">20</xref>
,<xref ref-type="bibr" rid="B22">22</xref>
-<xref ref-type="bibr" rid="B24">24</xref>
]. Further, ethical, legal, and consent-related restrictions associated with sharing or pooling of individual-level data represent a common dilemma faced by international research projects and networks [<xref ref-type="bibr" rid="B25">25</xref>
,<xref ref-type="bibr" rid="B26">26</xref>
]. Web-based networking technologies and new database management systems are at the forefront of providing solutions to some of these dilemmas [<xref ref-type="bibr" rid="B27">27</xref>
-<xref ref-type="bibr" rid="B32">32</xref>
]. When combined with strong collaboration between partners, such tools allow us to interconnect distributed databases through database federation systems and assure secure and effective analysis of complex datasets across research centres while retaining individual-level data within host institutions of participating studies.</p>
<p>BioSHaRE (Biobank Standardisation and Harmonisation for Research Excellence in the European Union) is a Seventh Framework Programme (FP7) funded project whose aim is developing data harmonization tools and standardized IT systems for existing biobanks and cohorts across Europe, and apply them to conduct pan-European epidemiological research [<xref ref-type="bibr" rid="B33">33</xref>
]. As a core project of BioSHaRE, the Healthy Obese Project (HOP) piloted retrospective data harmonization and database federation tools to effectively assess the compatibility of collected data and to safely federate research databases in order to conduct obesity-related research, with a focus on the characterization of metabolically healthy obese individuals [<xref ref-type="bibr" rid="B34">34</xref>
,<xref ref-type="bibr" rid="B35">35</xref>
]. Since ‘healthy obesity’ is rather rare, researchers need a large numbers of subjects to explore its determinants and consequences. To investigate subgroups, even larger numbers are needed, making the HOP a good case study for harmonization and co-analysing data from several large population-based studies.</p>
<p>The data harmonization and database federation methodology and infrastructure developed and piloted under BioSHaRE’s HOP is founded on the DataSHaPER (DataSchema and Harmonization Platform for Epidemiological Research) harmonization approach [<xref ref-type="bibr" rid="B22">22</xref>
,<xref ref-type="bibr" rid="B37">37</xref>
] and on information technology tools developed by OBiBa (Open Source Software for BioBanks) [<xref ref-type="bibr" rid="B38">38</xref>
]. These have been recently integrated into a platform to support retrospective harmonization and integration of data [<xref ref-type="bibr" rid="B39">39</xref>
] by the Maelstrom Research team [<xref ref-type="bibr" rid="B40">40</xref>
]. The current paper presents the stepwise data harmonization and database federation process employed for the HOP (Table <xref ref-type="table" rid="T1">1</xref>
) and the information technology tools developed to support it [<xref ref-type="bibr" rid="B38">38</xref>
]. Resources described in this paper are currently being used by BioSHaRE to harmonize, integrate and jointly analyse data collected by eight population-based cohorts across Europe. Additional studies are joining the project and making use of these tools on a regular basis. The infrastructure described in this paper is helping to create a collaborative environment for BioSHaRE investigators. It aims to facilitate: (1) transforming data collected by existing studies into a common format through the use of processing algorithms; (2) interconnecting harmonized databases located in different countries and institutions across Europe; and (3) achieving combined statistical analyses of these datasets without pooling or sharing individual-level data.</p>
<table-wrap position="float" id="T1"><label>Table 1</label>
<caption><p>The Healthy Obese Project data harmonization and database federation step-by-step process</p>
</caption>
<table frame="hsides" rules="groups" border="1"><colgroup><col align="left"></col>
<col align="left"></col>
</colgroup>
<thead valign="top"><tr><th align="left"><bold>Step</bold>
</th>
<th align="left"><bold>Description</bold>
</th>
</tr>
</thead>
<tbody valign="top"><tr><td align="left" valign="bottom">Study recruitment and documentation<hr></hr>
</td>
<td align="left" valign="bottom">Studies are recruited to participate in the HOP and their key characteristics (e.g. design, sampling frame) are catalogued on the BioSHaRE website (www.bioshare.eu).<hr></hr>
</td>
</tr>
<tr><td align="left" valign="bottom">Harmonized variable selection and definition<hr></hr>
</td>
<td align="left" valign="bottom">A set of ‘target’ variables required to answer obesity-related research questions is identified at workshops bringing together BioSHaRE investigators.<hr></hr>
</td>
</tr>
<tr><td align="left" valign="bottom">Study variable identification and harmonization potential assessment<hr></hr>
</td>
<td align="left" valign="bottom">By analysing participating studies’ questionnaires, standard operating procedures, and data dictionaries, the potential for each study to generate this set of target variables is determined. Study-specific variables required to generate target variables are identified.<hr></hr>
</td>
</tr>
<tr><td align="left" valign="bottom">Data processing<hr></hr>
</td>
<td align="left" valign="bottom">Secure servers are set-up in each study’s host institution and the subsets of data required to generate target variables are loaded onto each of these servers. Processing algorithms transforming study data into the target (i.e. harmonized) format are developed and implemented for each study whenever harmonization is deemed possible.<hr></hr>
</td>
</tr>
<tr><td align="left" valign="top">Harmonized data federation, dissemination and analysis</td>
<td align="left">A password protected web portal federates the servers found in the different study host institutions across Europe and allows remote retrieval of data summaries, descriptive statistics (frequencies, min, max, mean, standard deviation), and contingency tables. For more complex federated data analyses (e.g. linear regressions), the DataSHIELD method [<xref ref-type="bibr" rid="B28">28</xref>
] is employed in the R software environment [<xref ref-type="bibr" rid="B36">36</xref>
].</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec><title>Study recruitment and documentation</title>
<p>The first step in the data harmonization and database federation process was to recruit studies to participate in the project. To be eligible to participate in the HOP, studies needed to collect comprehensive health outcome, socio-demographic, behavioural, physical and biochemical measures, and allow remote access to aggregated data for statistical analyses. Studies were also required to make study metadata (i.e. questionnaires, data codebooks, standard operating procedures) and ethical and legal documents/policies available to the BioSHaRE coordinating group. A preliminary scan of consents, data access, and IP policies was conducted by the Public Population Project in Genomics and Society (P3G) [<xref ref-type="bibr" rid="B41">41</xref>
] to assess the potential for each study to participate. Study investigators then submitted formal requests to participate in the project to their respective research ethics or data access committees. Next, key characteristics of participating studies were documented using a standardized online description form found on the Mica-powered BioSHaRE website (see <italic>“What is Mica?”</italic>
 below) [<xref ref-type="bibr" rid="B33">33</xref>
]. These characteristics included general study design, number of participants, participant characteristics, methods of recruitment, number and type of biological samples collected, and data and sample access conditions. Cataloguing such information helped in better understanding the level of heterogeneity across study designs as well as potential sample sizes available for analyses. Table <xref ref-type="table" rid="T2">2</xref>
 lists the eight studies participating in the HOP to date.</p>
<table-wrap position="float" id="T2"><label>Table 2</label>
<caption><p>Healthy Obese Project participating studies to date, number of participants, host institutions, and location</p>
</caption>
<table frame="hsides" rules="groups" border="1"><colgroup><col align="left"></col>
<col align="left"></col>
<col align="left"></col>
<col align="left"></col>
<col align="left"></col>
</colgroup>
<thead valign="top"><tr><th align="left"><bold>Study name</bold>
</th>
<th align="left"><bold>Acronym</bold>
</th>
<th align="left"><bold>Number of participants in the HOP</bold>
</th>
<th align="left"><bold>Host institution</bold>
</th>
<th align="left"><bold>Location</bold>
</th>
</tr>
</thead>
<tbody valign="top"><tr><td align="left" valign="bottom">Cooperative Health Research in South Tyrol Study<hr></hr>
</td>
<td align="left" valign="bottom">CHRIS<hr></hr>
</td>
<td align="left" valign="bottom">1116<hr></hr>
</td>
<td align="left" valign="bottom">European Academy of Bolzano<hr></hr>
</td>
<td align="left" valign="bottom">Bolzano, Italy<hr></hr>
</td>
</tr>
<tr><td align="left" valign="bottom">KORA Cooperative Health Research in the Region of Augsburg<hr></hr>
</td>
<td align="left" valign="bottom">KORA<hr></hr>
</td>
<td align="left" valign="bottom">18 000<hr></hr>
</td>
<td align="left" valign="bottom">Helmholtz Center Munich<hr></hr>
</td>
<td align="left" valign="bottom">Augsburg, Germany<hr></hr>
</td>
</tr>
<tr><td align="left" valign="bottom">LifeLines Cohort Study<hr></hr>
</td>
<td align="left" valign="bottom">LifeLines<hr></hr>
</td>
<td align="left" valign="bottom">93 000<hr></hr>
</td>
<td align="left" valign="bottom">University Medical Center Groningen<hr></hr>
</td>
<td align="left" valign="bottom">Groningen, The Netherlands<hr></hr>
</td>
</tr>
<tr><td align="left" valign="bottom">Microisolates in South Tyrol Study<hr></hr>
</td>
<td align="left" valign="bottom">MICROS<hr></hr>
</td>
<td align="left" valign="bottom">1300<hr></hr>
</td>
<td align="left" valign="bottom">European Academy of Bolzano<hr></hr>
</td>
<td align="left" valign="bottom">Bolzano, Italy<hr></hr>
</td>
</tr>
<tr><td align="left" valign="bottom">National Child Development Study<hr></hr>
</td>
<td align="left" valign="bottom">NCDS<hr></hr>
</td>
<td align="left" valign="bottom">18 558<hr></hr>
</td>
<td align="left" valign="bottom">University of Leicester<hr></hr>
</td>
<td align="left" valign="bottom">Leicester, United Kingdom<hr></hr>
</td>
</tr>
<tr><td align="left" valign="bottom">FINRISK 2007 Study<hr></hr>
</td>
<td align="left" valign="bottom">FINRISK 2007<hr></hr>
</td>
<td align="left" valign="bottom">10 000<hr></hr>
</td>
<td align="left" valign="bottom">National Institute for Health and Welfare<hr></hr>
</td>
<td align="left" valign="bottom">Helsinki, Finland<hr></hr>
</td>
</tr>
<tr><td align="left" valign="bottom">Nord-Trøndelag Health Study<hr></hr>
</td>
<td align="left" valign="bottom">HUNT<hr></hr>
</td>
<td align="left" valign="bottom">78 968<hr></hr>
</td>
<td align="left" valign="bottom">Norwegian University of Science and Technology<hr></hr>
</td>
<td align="left" valign="bottom">Trondheim, Norway<hr></hr>
</td>
</tr>
<tr><td align="left">Prevention of REnal and Vascular ENd-stage Disease study</td>
<td align="left">PREVEND</td>
<td align="left">8592</td>
<td align="left">University Medical Centre Groningen</td>
<td align="left">Groningen, The Netherlands</td>
</tr>
</tbody>
</table>
</table-wrap>
<sec><title>What is Mica?</title>
<p>Mica [<xref ref-type="bibr" rid="B38">38</xref>
] is a software application developed to create web portals for individual epidemiological studies or for study consortia. Features supported by Mica include a standardized study catalogue, data dictionary browsers, online data access request forms, and communication tools (e.g. forums, events, news). When used in conjunction with the Opal software, Mica also allows authenticated users to perform distributed queries on the content of study databases hosted on remote servers and retrieve summary statistics and contingency tables.</p>
</sec>
</sec>
<sec><title>Harmonized variable selection and definition</title>
<p>In the second step of the process, HOP investigators convened to select and define a set of ‘target’ variables required to answer specific obesity-related research questions. This set of variables, or DataSchema [<xref ref-type="bibr" rid="B22">22</xref>
], acted as a template for the retrospective harmonization process by defining the common format measures to be derived using data of participating studies. In order to allow multiple studies to participate in a collaborative endeavour while ensuring validity of the scientific output, the development of a DataSchema requires a balance between uniformity (e.g. exact same question wording and data collection procedures) and acceptance of certain level of heterogeneity across studies (e.g. slightly different wording or procedures). Two workshops (March and June 2012) bringing together BioSHaRE investigators from across Europe and Canada were organized to identify and define target variables making up the HOP DataSchema. Each workshop respectively focused on selecting variables to answer the following research questions: (1) What is the prevalence of obese individuals not showing increased metabolic or cardiovascular risk in each study (i.e. the ‘healthy obese’)?; and (2) What are the lifestyle and behavioural risk factors associated with ‘healthy obesity’? Following the workshops, the DataSchema went through iterative rounds of revisions through teleconferences and electronic communication to arrive at a consensus on target variables (e.g. weight), definitions (e.g. measured weight), and format (e.g. weight in Kg). For certain areas of information, international standards and classifications were used to define target variables and thereby facilitate international comparison of key concepts. For example, education-related DataSchema variables were developed using UNESCO’s International Standard Classification of Education [<xref ref-type="bibr" rid="B42">42</xref>
], while the ‘current occupation’ variable was developed using the International Labour Organization’s International Standard Classification of Occupations [<xref ref-type="bibr" rid="B43">43</xref>
]. Once finalized, DataSchema variables were annotated in a designated section of the Mica-powered BioSHaRE website (see <ext-link ext-link-type="uri" xlink:href="https://www.bioshare.eu/content/healthy-obese-project-dataschema">https://www.bioshare.eu/content/healthy-obese-project-dataschema</ext-link>
). To date, 96 variables including anthropometric and biochemical measures, history of obesity-related disease outcomes, socio-demographic status, and lifestyle and risk factors make up HOP DataSchema. New variables, including constructs covering the physical activity domain, will be added to the DataSchema over the course of the project.</p>
</sec>
<sec><title>Study variable identification and harmonization potential assessment</title>
<p>As a third step, using study questionnaires, standard operating procedures, and data dictionaries, harmonization team research assistants identified study-specific data covering DataSchema variables and formally assessed the potential for each study to generate each of these variables (96 variables across 8 studies). This step consisted of comparing the full definition and format of a DataSchema variable to study-specific questions, collection procedures and data formats to determine their compatibility. For example, in order for a given study to generate the ‘weight’ DataSchema variables, this variable needed to be objectively measured by a doctor, nurse or technician rather than self-reported by the participant. Not all studies could generate all of the 96 targeted variables. When assessing the harmonization potential, there were two reasons for which a particular study could <italic>not</italic>
 generate a specific DataSchema variable: either because the study simply did not collect information on the construct measured by a particular targeted variable or because the information the study collected on this construct was deemed incompatible with the DataSchema variable definition (e.g. self-reported weight). Harmonization potential assessment allowed determining which DataSchema variables could be generated by each study and identifying what study-specific data needed to be extracted from central study data repositories to be used in the remainder of the harmonization exercise. The overall harmonization potential assessment showed that 73% of all matches evaluated (96 DataSchema variables for each of the 8 studies) were considered compatible. Some domains of information proved to be more problematic to harmonize than others. For example, the 30 nutritional habit variables showed a harmonization potential of only 37% for all matches evaluated. On the other hand, the nine variables covering disease history and medication use (i.e. stroke, diabetes, high blood pressure, myocardial infarction) were considered compatible with DataSchema formats 97% of the time.</p>
<sec><title>Data processing</title>
<p>The fourth step involved processing study-specific data under the DataSchema variable format. This was done with the help of OBiBa’s Opal software (see “<italic>What is Opal?”</italic>
 below), which was installed on secure servers within the respective host institutions of participating studies (see Table <xref ref-type="table" rid="T1">1</xref>
). Data dictionaries (i.e. codebooks) of each participating study were converted into a standardized format readable by Opal and loaded onto the server. Each study then extracted data required to generate DataSchema variables (identified in the previous step) from their main database and loaded it on their respective Opal servers. To guide data processing, the reference DataSchema structure (i.e. common variable names, labels, and coding for categories) was also loaded onto each study-specific Opal instance. By accessing aggregate data via remote connections to each study server, data processing was then centrally conducted by the harmonization team to transform study-specific data into the common format defined by the DataSchema. For each DataSchema-variable-to-study match, the rationale describing the procedure to generate the DataSchema variable was first established. This ‘processing rationale’ varied in nature and scope depending on the variable to be harmonized. For example, in some instances, simple recoding of study data categories was sufficient to generate a DataSchema variable in the appropriate format. In other situations, such as for the generation of the harmonized Fasting Glucose variable (Figure <xref ref-type="fig" rid="F1">1</xref>
), data processing had to be supported by a more detailed explanation, which was documented in Opal. Once the ‘processing rationale’ was established, study specific processing algorithms were developed, documented and implemented in Opal, putting to use the software’s ability to compute custom JavaScript code [<xref ref-type="bibr" rid="B44">44</xref>
] to derive variables. Once executed on study data, algorithms were validated by comparing the distribution and counts of harmonized datasets to the data originally collected by each study. The data processing step ultimately resulted in the creation of one harmonized dataset per participating study, hosted on each host institution’s firewall-protected server.</p>
<fig id="F1" position="float"><label>Figure 1</label>
<caption><p>Example of data processing to obtain a common format: deriving the harmonized Fasting Glucose DataSchema variable for two studies.</p>
</caption>
<graphic xlink:href="1742-7622-10-12-1"></graphic>
</fig>
</sec>
<sec><title>What is Opal?</title>
<p>Opal [<xref ref-type="bibr" rid="B38">38</xref>
] is an software application used to manage study data and includes a software infrastructure enabling data harmonization and data integration across studies. As such, Opal supports the development and implementation of processing algorithms required to transform study-specific data into a common harmonized format. Moreover, when connected to a Mica-web interface, Opal allows users to seamlessly and securely search distributed datasets across several Opal instances.</p>
</sec>
</sec>
<sec><title>Harmonized data federation, dissemination and analysis</title>
<p>The fifth and last step in the process aimed to co-analyse harmonized datasets while addressing ethical and legal restrictions associated with pooling individual-level data. To achieve this, the Opal and Mica software applications were used in parallel to create a federated infrastructure that allows researchers to jointly analyse harmonized data while retaining individual-level data within their respective host institutions. Hence, once harmonized datasets were generated on local Opal servers in each host institution, these servers were securely connected via encrypted remote connections (using HTTPS).</p>
<p>Two types of analyses are made available through this framework (see Figure <xref ref-type="fig" rid="F2">2</xref>
). Firstly, once logged on to a password protected section of the Mica-based BioSHaRE.eu website, investigators can securely execute queries allowing them to retrieve data summaries, descriptive statistics (frequencies, min, max, mean, standard deviation), or contingency tables of the harmonized databases hosted on each of the geographically-dispersed Opal servers. Multiple investigators can run such distributed queries simultaneously and in real time on the different Opal servers. Secondly, and to support more complex federated data analyses such as multiple linear regressions, logistic regressions, Poisson regressions, or for undertaking a simple analysis such as executing a <italic>t</italic>
-test, the Opal-Mica framework is fully compatible with the DataSHIELD method (see “<italic>What is DataSHIELD</italic>
?” below) [<xref ref-type="bibr" rid="B28">28</xref>
,<xref ref-type="bibr" rid="B45">45</xref>
]. When a joint analysis is to be undertaken using data from several sources, statistical efficiency and flexibility is often best served by working directly with individual-level data rather than by meta-analysing summarised results from each study [<xref ref-type="bibr" rid="B46">46</xref>
]. However, important ethico-legal constraints, intellectual property considerations, and/or the physical size of the data to be analysed, often prevent or delay the sharing of individual-level data [<xref ref-type="bibr" rid="B47">47</xref>
]. Based on parallelized analysis and modern distributed computing, DataSHIELD enables the analysis of harmonized individual-level data without the need to physically pool them [<xref ref-type="bibr" rid="B28">28</xref>
,<xref ref-type="bibr" rid="B45">45</xref>
].</p>
<fig id="F2" position="float"><label>Figure 2</label>
<caption><p>Data harmonization and federated infrastructure for three HOP studies.</p>
</caption>
<graphic xlink:href="1742-7622-10-12-2"></graphic>
</fig>
<sec><title>What is DataSHIELD?</title>
<p>DataSHIELD (www.datashield.org) acts as an interface module between the Opal software application and the R software environment [<xref ref-type="bibr" rid="B36">36</xref>
]. Under DataSHIELD, a central analysis computer (i.e. the computer from which analysis is carried out) coordinates a parallelized simultaneous analysis of the individual-level data on all the data computers (i.e. the secure servers where the individual-level data are stored) by sending blocks of code, in the form of simple analytic commands, to each data computer. These request each server to undertake a particular analysis and to return non-disclosive summary statistics to the analysis computer, that is data which cannot possibly lead to the identification of the individuals to which they relate. For analyses such as the fitting of a generalized linear model, DataSHIELD works iteratively. After each iteration, summary statistics (typically the score vector and information matrix) are returned by each data computer to the analysis computer and the estimates of the model are refined; the process ends when the estimates converge. This enables global updating of the estimated model parameters taking full account of the data from <italic>all</italic>
 studies simultaneously. In this way, it is possible to fit a mathematical model as if the individual-level data from all studies were pooled centrally on the analysis computer while - in reality – the data never leave their studies of origin, and all that <italic>does</italic>
 leave are the non-disclosive summary statistics.</p>
</sec>
<sec><title>IT requirement for DataSHIELD</title>
<p>The DataSHIELD approach places very few demands on the IT equipment required (Figure <xref ref-type="fig" rid="F2">2</xref>
). The analysis computer can be a standard laptop or desktop running any R console [<xref ref-type="bibr" rid="B36">36</xref>
] or a rich client such as RStudio [<xref ref-type="bibr" rid="B48">48</xref>
] with DataSHIELD R packages. The data servers must each be running Opal and R. Using this framework, each Opal instance receives, controls and forwards requests from R running on the analysis computer to R running on the server. The controlled and secured web-based links between the analysis computer and the data computers do not need to carry heavy traffic, and DataSHIELD therefore demands no more than a standard wireless link to a broadband access point. It is also possible to channel communications through study firewall configurations to allow only for analyses from computers at specific IP addresses.</p>
</sec>
</sec>
<sec sec-type="conclusions"><title>Conclusion</title>
<p>New Internet-based networking technologies and database management systems are providing the means to support collaborative, multi-centre research in an efficient and secure manner [<xref ref-type="bibr" rid="B27">27</xref>
-<xref ref-type="bibr" rid="B32">32</xref>
]. Since its inception in 2010, the BioSHaRE project works at harnessing such resources along with international expertise in order to facilitate cross-border collaborations in the biomedical sciences. The Healthy Obese Project has successfully served to pilot a suite of tools which facilitates: (1) transforming existing data collected by different studies into a common format through the use of processing algorithms; (2) interconnecting harmonized databases located across Europe via a federated web-based infrastructure; and (3) achieving joint statistical analyses of harmonized datasets without pooling or sharing individual-level data.</p>
<p>It must be noted that the data harmonization and database federation work conducted within the BioSHaRE project has required a high level of collaboration between different parties. Active involvement of study investigators, research centre staff, and the BioSHaRE coordinating group was pivotal for the software and information technologies to be of use. Though this initiative has proven to require a high level of coordination, the infrastructure that results from it has a number of strengths. First, using the Mica-Opal federated framework, studies retain all control over individual-level data since local Opal instances compute aggregate data before sending results to the central Mica web portal, or to the analysis computer running the DataSHIELD R packages. Since either Mica or the analysis computer act as brokers to securely fetch information from each Opal instance, investigators querying data therefore never connect directly to the servers hosting individual-level data. Secondly, once harmonized datasets are derived on each participating study’s server, they can be used and reused for multiple collaborative research projects. Third, allowing investigators to safely and remotely analyse data (i.e. produce summary statistics, contingency tables, logistic regressions) at their convenience and in real time limits the burden associated with filing multiple data access requests at multiple research centres, thereby saving principal investigators and study managers time and resources. Lastly, Opal-Mica federated infrastructure features such as encrypted remote connections (using HTTPS), user authentication, and control over user access and permissions (e.g. dataset visibility, import/export, data manipulation) effectively ensures that participant data privacy and confidentiality are respected across studies in a collaborative research context.</p>
<p>The HOP pilot project is helping to optimize the tools and methods presented herein and to add new data analysis features to these tools in the aim of constructing a more robust, efficient, scalable and automated framework to support secure analysis of harmonized data in BioSHaRE and other collaborative projects. Through this pilot project, we have shown that seamlessly and securely co-analysing internationally harmonized research databases is possible. We hope that the open source tools presented in this paper will be of interest to additional research networks in epidemiology, public health, and the social sciences in the future. Opal and Mica software as well as the DataSHIELD R packages are freely available to the research community under the GPL3 license at <ext-link ext-link-type="uri" xlink:href="https://www.obiba.org">https://www.obiba.org</ext-link>
.</p>
</sec>
<sec><title>Competing interests</title>
<p>The authors declare that they have no competing interests.</p>
</sec>
<sec><title>Authors’ contributions</title>
<p>DD, VF, PB, YM, AG, IF contributed to the conception, design and drafting of the manuscript. BW, MP, RS, AM contributed to coordination of the Healthy Obese Project and to the drafting of the manuscript. LF, CM, MW, RH, KK, HH contributed to the acquisition and interpretation of the study-specific data and to the drafting of the manuscript. All authors read and approved the final manuscript.</p>
</sec>
</body>
<back><sec><title>Acknowledgement</title>
<p>This work was supported by the European Union Seventh Framework Programme (FP7/2007-2013) [grant number 261433]. We would also like to thank the following staff in research centres across Europe who have made this work possible (in alphabetical order): Anusha Appanvel (Helmholtz Zentrum München – German Research Center for Environmental Health, Augsburg, Germany), Clemens Egger (European Academy of Bolzano/Bozen (EURAC), Bolzano, Italy), Tero Hiekkalinna (National Institute for Health and Welfare, Helsinki, Finland), Sveinung Marvik (Norwegian University of Science and Technology, Trondheim, Norway), Christopher Newby (University of Leicester, Leicester, UK), Marja-Liisa Nuotio (National Institute for Health and Welfare, Helsinki, Finland), Chao Pang (University Medical Center Groningen, Groningen, The Netherlands), Eva Reischl (Helmholtz Zentrum München – German Research Center for Environmental Health, Augsburg, Germany), Morris Swertz (University Medical Center Groningen, Groningen, The Netherlands).</p>
</sec>
<ref-list><ref id="B1"><mixed-citation publication-type="journal"><name><surname>Smith-Warner</surname>
<given-names>SA</given-names>
</name>
<name><surname>Spiegelman</surname>
<given-names>D</given-names>
</name>
<name><surname>Ritz</surname>
<given-names>J</given-names>
</name>
<name><surname>Albanes</surname>
<given-names>D</given-names>
</name>
<name><surname>Beeson</surname>
<given-names>WL</given-names>
</name>
<name><surname>Bernstein</surname>
<given-names>L</given-names>
</name>
<name><surname>Berrino</surname>
<given-names>F</given-names>
</name>
<name><surname>van den Brandt</surname>
<given-names>PA</given-names>
</name>
<name><surname>Buring</surname>
<given-names>JE</given-names>
</name>
<name><surname>Cho</surname>
<given-names>E</given-names>
</name>
<etal></etal>
<article-title>Methods for pooling results of epidemiologic studies: the pooling project of prospective studies of diet and cancer</article-title>
<source>Am J Epidemiol</source>
<year>2006</year>
<volume>163</volume>
<issue>11</issue>
<fpage>1053</fpage>
<lpage>1064</lpage>
<pub-id pub-id-type="doi">10.1093/aje/kwj127</pub-id>
<pub-id pub-id-type="pmid">16624970</pub-id>
</mixed-citation>
</ref>
<ref id="B2"><mixed-citation publication-type="journal"><name><surname>Thompson</surname>
<given-names>A</given-names>
</name>
<article-title>Thinking big: large-scale collaborative research in observational epidemiology</article-title>
<source>Eur J Epidemiol</source>
<year>2009</year>
<volume>24</volume>
<issue>12</issue>
<fpage>727</fpage>
<lpage>731</lpage>
<pub-id pub-id-type="doi">10.1007/s10654-009-9412-1</pub-id>
<pub-id pub-id-type="pmid">19967428</pub-id>
</mixed-citation>
</ref>
<ref id="B3"><mixed-citation publication-type="journal"><name><surname>Khoury</surname>
<given-names>MJ</given-names>
</name>
<article-title>The case for a global human genome epidemiology initiative</article-title>
<source>Nat Genet</source>
<year>2004</year>
<volume>36</volume>
<issue>10</issue>
<fpage>1027</fpage>
<lpage>1028</lpage>
<pub-id pub-id-type="doi">10.1038/ng1004-1027</pub-id>
<pub-id pub-id-type="pmid">15454932</pub-id>
</mixed-citation>
</ref>
<ref id="B4"><mixed-citation publication-type="journal"><name><surname>Hamilton</surname>
<given-names>CM</given-names>
</name>
<name><surname>Strader</surname>
<given-names>LC</given-names>
</name>
<name><surname>Pratt</surname>
<given-names>JG</given-names>
</name>
<name><surname>Maiese</surname>
<given-names>D</given-names>
</name>
<name><surname>Hendershot</surname>
<given-names>T</given-names>
</name>
<name><surname>Kwok</surname>
<given-names>RK</given-names>
</name>
<name><surname>Hammond</surname>
<given-names>JA</given-names>
</name>
<name><surname>Huggins</surname>
<given-names>W</given-names>
</name>
<name><surname>Jackman</surname>
<given-names>D</given-names>
</name>
<name><surname>Pan</surname>
<given-names>H</given-names>
</name>
<etal></etal>
<article-title>The PhenX toolkit: Get the most from your measures</article-title>
<source>Am J Epidemiol</source>
<year>2011</year>
<volume>174</volume>
<issue>3</issue>
<fpage>253</fpage>
<lpage>260</lpage>
<pub-id pub-id-type="doi">10.1093/aje/kwr193</pub-id>
<pub-id pub-id-type="pmid">21749974</pub-id>
</mixed-citation>
</ref>
<ref id="B5"><mixed-citation publication-type="journal"><name><surname>Noale</surname>
<given-names>M</given-names>
</name>
<name><surname>Minicuci</surname>
<given-names>N</given-names>
</name>
<name><surname>Bardage</surname>
<given-names>C</given-names>
</name>
<name><surname>Gindin</surname>
<given-names>J</given-names>
</name>
<name><surname>Nikula</surname>
<given-names>S</given-names>
</name>
<name><surname>Pluijm</surname>
<given-names>S</given-names>
</name>
<name><surname>Rodríguez-Laso</surname>
<given-names>A</given-names>
</name>
<name><surname>Maggi</surname>
<given-names>S</given-names>
</name>
<article-title>Predictors of mortality: an international comparison of socio-demographic and health characteristics from six longitudinal studies on aging: the CLESA project</article-title>
<source>Exp Gerontol</source>
<year>2005</year>
<volume>40</volume>
<issue>1</issue>
<fpage>89</fpage>
<lpage>99</lpage>
<pub-id pub-id-type="pmid">15732192</pub-id>
</mixed-citation>
</ref>
<ref id="B6"><mixed-citation publication-type="journal"><name><surname>Serra-Majem</surname>
<given-names>L</given-names>
</name>
<name><surname>MacLean</surname>
<given-names>D</given-names>
</name>
<name><surname>Ribas</surname>
<given-names>L</given-names>
</name>
<name><surname>Brulé</surname>
<given-names>D</given-names>
</name>
<name><surname>Sekula</surname>
<given-names>W</given-names>
</name>
<name><surname>Prattala</surname>
<given-names>R</given-names>
</name>
<name><surname>Garcia-Closas</surname>
<given-names>R</given-names>
</name>
<name><surname>Yngve</surname>
<given-names>A</given-names>
</name>
<name><surname>Lalonde</surname>
<given-names>M</given-names>
</name>
<name><surname>Petrasovits</surname>
<given-names>A</given-names>
</name>
<article-title>Comparative analysis of nutrition data from national, household, and individual levels: results from a WHO-CINDI collaborative project in Canada, Finland, Poland, and Spain</article-title>
<source>J Epidemiol Community Health</source>
<year>2003</year>
<volume>57</volume>
<issue>1</issue>
<fpage>74</fpage>
<lpage>80</lpage>
<pub-id pub-id-type="doi">10.1136/jech.57.1.74</pub-id>
<pub-id pub-id-type="pmid">12490653</pub-id>
</mixed-citation>
</ref>
<ref id="B7"><mixed-citation publication-type="journal"><name><surname>Bath</surname>
<given-names>PA</given-names>
</name>
<name><surname>Deeg</surname>
<given-names>D</given-names>
</name>
<name><surname>Poppelaars</surname>
<given-names>J</given-names>
</name>
<article-title>The harmonisation of longitudinal data: a case study using data from cohort studies in The Netherlands and the United Kingdom</article-title>
<source>Ageing Soc</source>
<year>2010</year>
<volume>30</volume>
<issue>08</issue>
<fpage>1419</fpage>
<lpage>1437</lpage>
<pub-id pub-id-type="doi">10.1017/S0144686X1000070X</pub-id>
</mixed-citation>
</ref>
<ref id="B8"><mixed-citation publication-type="journal"><name><surname>Sansone</surname>
<given-names>S-A</given-names>
</name>
<name><surname>Rocca-Serra</surname>
<given-names>P</given-names>
</name>
<name><surname>Field</surname>
<given-names>D</given-names>
</name>
<name><surname>Maguire</surname>
<given-names>E</given-names>
</name>
<name><surname>Taylor</surname>
<given-names>C</given-names>
</name>
<name><surname>Hofmann</surname>
<given-names>O</given-names>
</name>
<name><surname>Fang</surname>
<given-names>H</given-names>
</name>
<name><surname>Neumann</surname>
<given-names>S</given-names>
</name>
<name><surname>Tong</surname>
<given-names>W</given-names>
</name>
<name><surname>Amaral-Zettler</surname>
<given-names>L</given-names>
</name>
<article-title>Toward interoperable bioscience data</article-title>
<source>Nat Genet</source>
<year>2012</year>
<volume>44</volume>
<issue>2</issue>
<fpage>121</fpage>
<lpage>126</lpage>
<pub-id pub-id-type="doi">10.1038/ng.1054</pub-id>
<pub-id pub-id-type="pmid">22281772</pub-id>
</mixed-citation>
</ref>
<ref id="B9"><mixed-citation publication-type="journal"><name><surname>Schad</surname>
<given-names>PA</given-names>
</name>
<name><surname>Mobley</surname>
<given-names>LR</given-names>
</name>
<name><surname>Hamilton</surname>
<given-names>CM</given-names>
</name>
<article-title>Building a biomedical cyberinfrastructure for collaborative research</article-title>
<source>Am J Prev Med</source>
<year>2011</year>
<volume>40</volume>
<issue>5</issue>
<fpage>S144</fpage>
<lpage>S150</lpage>
<pub-id pub-id-type="doi">10.1016/j.amepre.2011.01.018</pub-id>
<pub-id pub-id-type="pmid">21521587</pub-id>
</mixed-citation>
</ref>
<ref id="B10"><mixed-citation publication-type="journal"><name><surname>Seminara</surname>
<given-names>D</given-names>
</name>
<name><surname>Khoury</surname>
<given-names>MJ</given-names>
</name>
<name><surname>O’Brien</surname>
<given-names>TR</given-names>
</name>
<name><surname>Manolio</surname>
<given-names>T</given-names>
</name>
<name><surname>Gwinn</surname>
<given-names>ML</given-names>
</name>
<name><surname>Little</surname>
<given-names>J</given-names>
</name>
<name><surname>Higgins</surname>
<given-names>JPT</given-names>
</name>
<name><surname>Bernstein</surname>
<given-names>JL</given-names>
</name>
<name><surname>Boffetta</surname>
<given-names>P</given-names>
</name>
<name><surname>Bondy</surname>
<given-names>M</given-names>
</name>
<etal></etal>
<article-title>The emergence of networks in human genome epidemiology: “challenges and opportunities”</article-title>
<source>Epidemiology</source>
<year>2007</year>
<volume>18</volume>
<issue>1</issue>
<fpage>1</fpage>
<lpage>8</lpage>
<pub-id pub-id-type="doi">10.1097/01.ede.0000249540.17855.b7</pub-id>
<pub-id pub-id-type="pmid">17179752</pub-id>
</mixed-citation>
</ref>
<ref id="B11"><mixed-citation publication-type="other"><name><surname>Budin-Ljøsne</surname>
<given-names>I</given-names>
</name>
<name><surname>Isaeva</surname>
<given-names>J</given-names>
</name>
<name><surname>Knoppers</surname>
<given-names>BM</given-names>
</name>
<name><surname>Tassé</surname>
<given-names>AM</given-names>
</name>
<name><surname>Shen</surname>
<given-names>H-y</given-names>
</name>
<name><surname>McCarthy</surname>
<given-names>MI</given-names>
</name>
<name><surname>Harris</surname>
<given-names>JR</given-names>
</name>
<article-title>Data sharing in large research consortia: experiences and recommendations from ENGAGE</article-title>
<source>Eur J Hum Genet</source>
<comment>Advance online publication 19 June 2013. doi:10.1038/ejhg.2013.131</comment>
</mixed-citation>
</ref>
<ref id="B12"><mixed-citation publication-type="journal"><name><surname>Bousquet</surname>
<given-names>J</given-names>
</name>
<name><surname>Anto</surname>
<given-names>J</given-names>
</name>
<name><surname>Sunyer</surname>
<given-names>J</given-names>
</name>
<name><surname>Nieuwenhuijsen</surname>
<given-names>M</given-names>
</name>
<name><surname>Vrijheid</surname>
<given-names>M</given-names>
</name>
<name><surname>Keil</surname>
<given-names>T</given-names>
</name>
<article-title>Pooling birth cohorts in allergy and asthma: European union-funded initiatives – a MeDALL, CHICOS, ENRIECO, and GA < sup > 2 LEN joint paper</article-title>
<source>Int Arch Allergy Immunol</source>
<year>2013</year>
<volume>161</volume>
<issue>1</issue>
<fpage>1</fpage>
<lpage>10</lpage>
<pub-id pub-id-type="doi">10.1159/000343018</pub-id>
<pub-id pub-id-type="pmid">23258290</pub-id>
</mixed-citation>
</ref>
<ref id="B13"><mixed-citation publication-type="journal"><name><surname>Harris</surname>
<given-names>JR</given-names>
</name>
<name><surname>Burton</surname>
<given-names>P</given-names>
</name>
<name><surname>Knoppers</surname>
<given-names>BM</given-names>
</name>
<name><surname>Lindpaintner</surname>
<given-names>K</given-names>
</name>
<name><surname>Bledsoe</surname>
<given-names>M</given-names>
</name>
<name><surname>Brookes</surname>
<given-names>AJ</given-names>
</name>
<name><surname>Budin-Ljosne</surname>
<given-names>I</given-names>
</name>
<name><surname>Chisholm</surname>
<given-names>R</given-names>
</name>
<name><surname>Cox</surname>
<given-names>D</given-names>
</name>
<name><surname>Deschenes</surname>
<given-names>M</given-names>
</name>
<etal></etal>
<article-title>Toward a roadmap in global biobanking for health</article-title>
<source>Eur J Hum Genet</source>
<year>2012</year>
<volume>20</volume>
<fpage>1105</fpage>
<lpage>1111</lpage>
<pub-id pub-id-type="doi">10.1038/ejhg.2012.96</pub-id>
<pub-id pub-id-type="pmid">22713808</pub-id>
</mixed-citation>
</ref>
<ref id="B14"><mixed-citation publication-type="book"><name><surname>Zika</surname>
<given-names>E</given-names>
</name>
<name><surname>Paci</surname>
<given-names>D</given-names>
</name>
<name><surname>Schulte in den Bäumen</surname>
<given-names>T</given-names>
</name>
<name><surname>Braun</surname>
<given-names>A</given-names>
</name>
<name><surname>RijKers-Defrasne</surname>
<given-names>S</given-names>
</name>
<name><surname>Deschênes</surname>
<given-names>M</given-names>
</name>
<name><surname>Fortier</surname>
<given-names>I</given-names>
</name>
<name><surname>Laage-Hellman</surname>
<given-names>J</given-names>
</name>
<name><surname>Scerri</surname>
<given-names>CA</given-names>
</name>
<name><surname>Ibarreta</surname>
<given-names>D</given-names>
</name>
<source>Biobanks in Europe: prospects for harmonisation and networking</source>
<year>2010</year>
<publisher-name>Luxembourg: European Union</publisher-name>
</mixed-citation>
</ref>
<ref id="B15"><mixed-citation publication-type="book"><name><surname>Gottweis</surname>
<given-names>H</given-names>
</name>
<name><surname>Kaye</surname>
<given-names>J</given-names>
</name>
<name><surname>Bignami</surname>
<given-names>F</given-names>
</name>
<name><surname>Rial-Sebbag</surname>
<given-names>E</given-names>
</name>
<name><surname>Lattanzi</surname>
<given-names>R</given-names>
</name>
<name><surname>Macek</surname>
<given-names>M</given-names>
<suffix>Jr</suffix>
</name>
<source>Biobanks for Europe: a challenge for governance</source>
<year>2012</year>
<publisher-name>European Union: Luxembourg</publisher-name>
</mixed-citation>
</ref>
<ref id="B16"><mixed-citation publication-type="journal"><name><surname>Bookman</surname>
<given-names>EB</given-names>
</name>
<name><surname>McAllister</surname>
<given-names>K</given-names>
</name>
<name><surname>Gillanders</surname>
<given-names>E</given-names>
</name>
<name><surname>Wanke</surname>
<given-names>K</given-names>
</name>
<name><surname>Balshaw</surname>
<given-names>D</given-names>
</name>
<name><surname>Rutter</surname>
<given-names>J</given-names>
</name>
<name><surname>Reedy</surname>
<given-names>J</given-names>
</name>
<name><surname>Shaughnessy</surname>
<given-names>D</given-names>
</name>
<name><surname>Agurs-Collins</surname>
<given-names>T</given-names>
</name>
<name><surname>Paltoo</surname>
<given-names>D</given-names>
</name>
<etal></etal>
<article-title>Gene-environment interplay in common complex diseases: forging an integrative model—recommendations from an NIH workshop</article-title>
<source>Genet Epidemiol</source>
<year>2011</year>
<volume>35</volume>
<issue>4</issue>
<fpage>217</fpage>
<lpage>225</lpage>
</mixed-citation>
</ref>
<ref id="B17"><mixed-citation publication-type="journal"><name><surname>Khoury</surname>
<given-names>MJ</given-names>
</name>
<name><surname>Lam</surname>
<given-names>TK</given-names>
</name>
<name><surname>Ioannidis</surname>
<given-names>JPA</given-names>
</name>
<name><surname>Hartge</surname>
<given-names>P</given-names>
</name>
<name><surname>Spitz</surname>
<given-names>MR</given-names>
</name>
<name><surname>Buring</surname>
<given-names>JE</given-names>
</name>
<name><surname>Chanock</surname>
<given-names>SJ</given-names>
</name>
<name><surname>Croyle</surname>
<given-names>R</given-names>
</name>
<name><surname>Goddard</surname>
<given-names>KAB</given-names>
</name>
<name><surname>Ginsburg</surname>
<given-names>GS</given-names>
</name>
<etal></etal>
<article-title>Transforming epidemiology for 21st century medicine and public health</article-title>
<source>Cancer Epidemiol Biomarkers Prev</source>
<year>2013</year>
<volume>22</volume>
<issue>4</issue>
<fpage>508</fpage>
<lpage>516</lpage>
<pub-id pub-id-type="doi">10.1158/1055-9965.EPI-13-0146</pub-id>
<pub-id pub-id-type="pmid">23462917</pub-id>
</mixed-citation>
</ref>
<ref id="B18"><mixed-citation publication-type="journal"><name><surname>Walport</surname>
<given-names>M</given-names>
</name>
<name><surname>Brest</surname>
<given-names>P</given-names>
</name>
<article-title>Sharing research data to improve public health</article-title>
<source>Lancet</source>
<year>2011</year>
<volume>377</volume>
<issue>9765</issue>
<fpage>537</fpage>
<lpage>539</lpage>
<pub-id pub-id-type="doi">10.1016/S0140-6736(10)62234-9</pub-id>
<pub-id pub-id-type="pmid">21216456</pub-id>
</mixed-citation>
</ref>
<ref id="B19"><mixed-citation publication-type="journal"><name><surname>Pisani</surname>
<given-names>E</given-names>
</name>
<name><surname>AbouZahr</surname>
<given-names>C</given-names>
</name>
<article-title>Sharing health data: good intentions are not enough</article-title>
<source>Bull World Health Organ</source>
<year>2010</year>
<volume>88</volume>
<fpage>462</fpage>
<lpage>466</lpage>
<pub-id pub-id-type="doi">10.2471/BLT.09.074393</pub-id>
<pub-id pub-id-type="pmid">20539861</pub-id>
</mixed-citation>
</ref>
<ref id="B20"><mixed-citation publication-type="journal"><name><surname>Bennett</surname>
<given-names>SN</given-names>
</name>
<name><surname>Caporaso</surname>
<given-names>N</given-names>
</name>
<name><surname>Fitzpatrick</surname>
<given-names>AL</given-names>
</name>
<name><surname>Agrawal</surname>
<given-names>A</given-names>
</name>
<name><surname>Barnes</surname>
<given-names>K</given-names>
</name>
<name><surname>Boyd</surname>
<given-names>HA</given-names>
</name>
<name><surname>Cornelis</surname>
<given-names>MC</given-names>
</name>
<name><surname>Hansel</surname>
<given-names>NN</given-names>
</name>
<name><surname>Heiss</surname>
<given-names>G</given-names>
</name>
<name><surname>Heit</surname>
<given-names>JA</given-names>
</name>
<etal></etal>
<article-title>Phenotype harmonization and cross-study collaboration in GWAS consortia: the GENEVA experience</article-title>
<source>Genet Epidemiol</source>
<year>2011</year>
<volume>35</volume>
<issue>3</issue>
<fpage>159</fpage>
<lpage>173</lpage>
<pub-id pub-id-type="doi">10.1002/gepi.20564</pub-id>
<pub-id pub-id-type="pmid">21284036</pub-id>
</mixed-citation>
</ref>
<ref id="B21"><mixed-citation publication-type="journal"><name><surname>Vickers</surname>
<given-names>AJ</given-names>
</name>
<article-title>Making raw data more widely available</article-title>
<source>BMJ</source>
<year>2011</year>
<volume>342</volume>
<fpage>d2323</fpage>
<pub-id pub-id-type="doi">10.1136/bmj.d2323</pub-id>
<pub-id pub-id-type="pmid">21543405</pub-id>
</mixed-citation>
</ref>
<ref id="B22"><mixed-citation publication-type="journal"><name><surname>Fortier</surname>
<given-names>I</given-names>
</name>
<name><surname>Burton</surname>
<given-names>PR</given-names>
</name>
<name><surname>Robson</surname>
<given-names>PJ</given-names>
</name>
<name><surname>Ferretti</surname>
<given-names>V</given-names>
</name>
<name><surname>Little</surname>
<given-names>J</given-names>
</name>
<name><surname>L’Heureux</surname>
<given-names>F</given-names>
</name>
<name><surname>Deschenes</surname>
<given-names>M</given-names>
</name>
<name><surname>Knoppers</surname>
<given-names>BM</given-names>
</name>
<name><surname>Doiron</surname>
<given-names>D</given-names>
</name>
<name><surname>Keers</surname>
<given-names>JC</given-names>
</name>
<etal></etal>
<article-title>Quality, quantity and harmony: the DataSHaPER approach to integrating data across bioclinical studies</article-title>
<source>Int J Epidemiol</source>
<year>2010</year>
<volume>39</volume>
<issue>5</issue>
<fpage>1383</fpage>
<lpage>1393</lpage>
<pub-id pub-id-type="doi">10.1093/ije/dyq139</pub-id>
<pub-id pub-id-type="pmid">20813861</pub-id>
</mixed-citation>
</ref>
<ref id="B23"><mixed-citation publication-type="journal"><name><surname>Howe</surname>
<given-names>D</given-names>
</name>
<name><surname>Costanzo</surname>
<given-names>M</given-names>
</name>
<name><surname>Fey</surname>
<given-names>P</given-names>
</name>
<name><surname>Gojobori</surname>
<given-names>T</given-names>
</name>
<name><surname>Hannick</surname>
<given-names>L</given-names>
</name>
<name><surname>Hide</surname>
<given-names>W</given-names>
</name>
<name><surname>Hill</surname>
<given-names>DP</given-names>
</name>
<name><surname>Kania</surname>
<given-names>R</given-names>
</name>
<name><surname>Schaeffer</surname>
<given-names>M</given-names>
</name>
<name><surname>St Pierre</surname>
<given-names>S</given-names>
</name>
<article-title>Big data: the future of biocuration</article-title>
<source>Nature</source>
<year>2008</year>
<volume>455</volume>
<issue>7209</issue>
<fpage>47</fpage>
<lpage>50</lpage>
<pub-id pub-id-type="doi">10.1038/455047a</pub-id>
<pub-id pub-id-type="pmid">18769432</pub-id>
</mixed-citation>
</ref>
<ref id="B24"><mixed-citation publication-type="journal"><collab>Science Staff</collab>
<article-title>Challenges and opportunities</article-title>
<source>Science</source>
<year>2011</year>
<volume>331</volume>
<issue>6018</issue>
<fpage>692</fpage>
<lpage>693</lpage>
<pub-id pub-id-type="pmid">21311002</pub-id>
</mixed-citation>
</ref>
<ref id="B25"><mixed-citation publication-type="journal"><name><surname>Kaye</surname>
<given-names>J</given-names>
</name>
<article-title>From single biobanks to international networks: developing e-governance</article-title>
<source>Hum Genet</source>
<year>2011</year>
<volume>130</volume>
<issue>3</issue>
<fpage>377</fpage>
<lpage>382</lpage>
<pub-id pub-id-type="doi">10.1007/s00439-011-1063-0</pub-id>
<pub-id pub-id-type="pmid">21785980</pub-id>
</mixed-citation>
</ref>
<ref id="B26"><mixed-citation publication-type="journal"><name><surname>Knoppers</surname>
<given-names>B</given-names>
</name>
<name><surname>Harris</surname>
<given-names>J</given-names>
</name>
<name><surname>Tasse</surname>
<given-names>A</given-names>
</name>
<name><surname>Budin-Ljosne</surname>
<given-names>I</given-names>
</name>
<name><surname>Kaye</surname>
<given-names>J</given-names>
</name>
<name><surname>Deschenes</surname>
<given-names>M</given-names>
</name>
<name><surname>Zawati</surname>
<given-names>M</given-names>
</name>
<article-title>Towards a data sharing code of conduct for international genomic research</article-title>
<source>Genome Med</source>
<year>2011</year>
<volume>3</volume>
<issue>7</issue>
<fpage>46</fpage>
<pub-id pub-id-type="doi">10.1186/gm262</pub-id>
<pub-id pub-id-type="pmid">21787442</pub-id>
</mixed-citation>
</ref>
<ref id="B27"><mixed-citation publication-type="journal"><name><surname>Karr</surname>
<given-names>AF</given-names>
</name>
<name><surname>Fulp</surname>
<given-names>WJ</given-names>
</name>
<name><surname>Vera</surname>
<given-names>F</given-names>
</name>
<name><surname>Young</surname>
<given-names>SS</given-names>
</name>
<name><surname>Lin</surname>
<given-names>X</given-names>
</name>
<name><surname>Reiter</surname>
<given-names>JP</given-names>
</name>
<article-title>Secure, privacy-preserving analysis of distributed databases</article-title>
<source>Technometrics</source>
<year>2007</year>
<volume>49</volume>
<issue>3</issue>
<fpage>335</fpage>
<lpage>345</lpage>
<pub-id pub-id-type="doi">10.1198/004017007000000209</pub-id>
</mixed-citation>
</ref>
<ref id="B28"><mixed-citation publication-type="journal"><name><surname>Wolfson</surname>
<given-names>M</given-names>
</name>
<name><surname>Wallace</surname>
<given-names>SE</given-names>
</name>
<name><surname>Masca</surname>
<given-names>N</given-names>
</name>
<name><surname>Rowe</surname>
<given-names>G</given-names>
</name>
<name><surname>Sheehan</surname>
<given-names>NA</given-names>
</name>
<name><surname>Ferretti</surname>
<given-names>V</given-names>
</name>
<name><surname>LaFlamme</surname>
<given-names>P</given-names>
</name>
<name><surname>Tobin</surname>
<given-names>MD</given-names>
</name>
<name><surname>Macleod</surname>
<given-names>J</given-names>
</name>
<name><surname>Little</surname>
<given-names>J</given-names>
</name>
<etal></etal>
<article-title>DataSHIELD: resolving a conflict in contemporary bioscience—performing a pooled analysis of individual-level data without sharing the data</article-title>
<source>Int J Epidemiol</source>
<year>2010</year>
<volume>39</volume>
<issue>5</issue>
<fpage>1372</fpage>
<lpage>1382</lpage>
<pub-id pub-id-type="doi">10.1093/ije/dyq111</pub-id>
<pub-id pub-id-type="pmid">20630989</pub-id>
</mixed-citation>
</ref>
<ref id="B29"><mixed-citation publication-type="journal"><name><surname>Muilu</surname>
<given-names>J</given-names>
</name>
<name><surname>Peltonen</surname>
<given-names>L</given-names>
</name>
<name><surname>Litton</surname>
<given-names>JE</given-names>
</name>
<article-title>The federated database–a basis for biobank-based post-genome studies, integrating phenome and genome data from 600 000 twin pairs in Europe</article-title>
<source>Eur J Hum Genet</source>
<year>2007</year>
<volume>15</volume>
<issue>7</issue>
<fpage>718</fpage>
<lpage>723</lpage>
<pub-id pub-id-type="doi">10.1038/sj.ejhg.5201850</pub-id>
<pub-id pub-id-type="pmid">17487219</pub-id>
</mixed-citation>
</ref>
<ref id="B30"><mixed-citation publication-type="journal"><name><surname>Yuille</surname>
<given-names>M</given-names>
</name>
<name><surname>van Ommen</surname>
<given-names>G-J</given-names>
</name>
<name><surname>Bréchot</surname>
<given-names>C</given-names>
</name>
<name><surname>Cambon-Thomsen</surname>
<given-names>A</given-names>
</name>
<name><surname>Dagher</surname>
<given-names>G</given-names>
</name>
<name><surname>Landegren</surname>
<given-names>U</given-names>
</name>
<name><surname>Litton</surname>
<given-names>J-E</given-names>
</name>
<name><surname>Pasterk</surname>
<given-names>M</given-names>
</name>
<name><surname>Peltonen</surname>
<given-names>L</given-names>
</name>
<name><surname>Taussig</surname>
<given-names>M</given-names>
</name>
<etal></etal>
<article-title>Biobanking for Europe</article-title>
<source>Brief Bioinform</source>
<year>2008</year>
<volume>9</volume>
<issue>1</issue>
<fpage>14</fpage>
<lpage>24</lpage>
<pub-id pub-id-type="pmid">17959611</pub-id>
</mixed-citation>
</ref>
<ref id="B31"><mixed-citation publication-type="journal"><name><surname>Ford</surname>
<given-names>D</given-names>
</name>
<name><surname>Jones</surname>
<given-names>K</given-names>
</name>
<name><surname>Verplancke</surname>
<given-names>J-P</given-names>
</name>
<name><surname>Lyons</surname>
<given-names>R</given-names>
</name>
<name><surname>John</surname>
<given-names>G</given-names>
</name>
<name><surname>Brown</surname>
<given-names>G</given-names>
</name>
<name><surname>Brooks</surname>
<given-names>C</given-names>
</name>
<name><surname>Thompson</surname>
<given-names>S</given-names>
</name>
<name><surname>Bodger</surname>
<given-names>O</given-names>
</name>
<name><surname>Couch</surname>
<given-names>T</given-names>
</name>
<etal></etal>
<article-title>The SAIL Databank: building a national architecture for e-health research and evaluation</article-title>
<source>BMC Health Serv Res</source>
<year>2009</year>
<volume>9</volume>
<issue>1</issue>
<fpage>157</fpage>
<pub-id pub-id-type="doi">10.1186/1472-6963-9-157</pub-id>
<pub-id pub-id-type="pmid">19732426</pub-id>
</mixed-citation>
</ref>
<ref id="B32"><mixed-citation publication-type="journal"><name><surname>El Emam</surname>
<given-names>K</given-names>
</name>
<name><surname>Samet</surname>
<given-names>S</given-names>
</name>
<name><surname>Arbuckle</surname>
<given-names>L</given-names>
</name>
<name><surname>Tamblyn</surname>
<given-names>R</given-names>
</name>
<name><surname>Earle</surname>
<given-names>C</given-names>
</name>
<name><surname>Kantarcioglu</surname>
<given-names>M</given-names>
</name>
<article-title>A secure distributed logistic regression protocol for the detection of rare adverse drug events</article-title>
<source>J Am Med Inform Assoc</source>
<year>2013</year>
<volume>20</volume>
<issue>3</issue>
<fpage>453</fpage>
<lpage>461</lpage>
<pub-id pub-id-type="doi">10.1136/amiajnl-2011-000735</pub-id>
<pub-id pub-id-type="pmid">22871397</pub-id>
</mixed-citation>
</ref>
<ref id="B33"><mixed-citation publication-type="other"><article-title>Biobank standardisation and harmonisation for research excellence in the European union</article-title>
<comment>[<ext-link ext-link-type="uri" xlink:href="https://www.bioshare.eu/">https://www.bioshare.eu/</ext-link>
]</comment>
</mixed-citation>
</ref>
<ref id="B34"><mixed-citation publication-type="journal"><name><surname>Karelis</surname>
<given-names>AD</given-names>
</name>
<article-title>Metabolically healthy but obese individuals</article-title>
<source>Lancet</source>
<year>2013</year>
<volume>372</volume>
<issue>9646</issue>
<fpage>1281</fpage>
<lpage>1283</lpage>
<pub-id pub-id-type="pmid">18929889</pub-id>
</mixed-citation>
</ref>
<ref id="B35"><mixed-citation publication-type="journal"><name><surname>Denis</surname>
<given-names>GV</given-names>
</name>
<name><surname>Obin</surname>
<given-names>MS</given-names>
</name>
<article-title>‘Metabolically healthy obesity’: origins and implications</article-title>
<source>Mol Aspects Med</source>
<year>2013</year>
<volume>34</volume>
<issue>1</issue>
<fpage>59</fpage>
<lpage>70</lpage>
<pub-id pub-id-type="doi">10.1016/j.mam.2012.10.004</pub-id>
<pub-id pub-id-type="pmid">23068072</pub-id>
</mixed-citation>
</ref>
<ref id="B36"><mixed-citation publication-type="book"><collab>R Core Team</collab>
<source>R: A language and environment for statistical computing</source>
<year>2013</year>
<publisher-name>Vienna, Austria: R Foundation for Statistical Computing</publisher-name>
</mixed-citation>
</ref>
<ref id="B37"><mixed-citation publication-type="journal"><name><surname>Fortier</surname>
<given-names>I</given-names>
</name>
<name><surname>Doiron</surname>
<given-names>D</given-names>
</name>
<name><surname>Little</surname>
<given-names>J</given-names>
</name>
<name><surname>Ferretti</surname>
<given-names>V</given-names>
</name>
<name><surname>L’Heureux</surname>
<given-names>F</given-names>
</name>
<name><surname>Stolk</surname>
<given-names>RP</given-names>
</name>
<name><surname>Knoppers</surname>
<given-names>BM</given-names>
</name>
<name><surname>Hudson</surname>
<given-names>TJ</given-names>
</name>
<name><surname>Burton</surname>
<given-names>PR</given-names>
</name>
<article-title>Is rigorous retrospective harmonization possible? Application of the DataSHaPER approach across 53 large studies</article-title>
<source>Int J Epidemiol</source>
<year>2011</year>
<volume>40</volume>
<issue>5</issue>
<fpage>1314</fpage>
<lpage>1328</lpage>
<pub-id pub-id-type="doi">10.1093/ije/dyr106</pub-id>
<pub-id pub-id-type="pmid">21804097</pub-id>
</mixed-citation>
</ref>
<ref id="B38"><mixed-citation publication-type="other"><article-title>Open Source Software for BioBanks</article-title>
<comment>[<ext-link ext-link-type="uri" xlink:href="http://www.obiba.org/">http://www.obiba.org/</ext-link>
]</comment>
</mixed-citation>
</ref>
<ref id="B39"><mixed-citation publication-type="journal"><name><surname>Doiron</surname>
<given-names>D</given-names>
</name>
<name><surname>Raina</surname>
<given-names>P</given-names>
</name>
<name><surname>Ferretti</surname>
<given-names>V</given-names>
</name>
<name><surname>L’Heureux</surname>
<given-names>F</given-names>
</name>
<name><surname>Fortier</surname>
<given-names>I</given-names>
</name>
<article-title>Facilitating collaborative research: implementing a platform supporting data harmonization and pooling</article-title>
<source>Norsk Epidemiologi</source>
<year>2012</year>
<volume>21</volume>
<issue>2</issue>
<fpage>221</fpage>
<lpage>224</lpage>
</mixed-citation>
</ref>
<ref id="B40"><mixed-citation publication-type="other"><article-title>Maelstrom Research</article-title>
<comment>[<ext-link ext-link-type="uri" xlink:href="http://maelstrom-research.org">http://maelstrom-research.org</ext-link>
]</comment>
</mixed-citation>
</ref>
<ref id="B41"><mixed-citation publication-type="journal"><name><surname>Knoppers</surname>
<given-names>B</given-names>
</name>
<name><surname>Fortier</surname>
<given-names>I</given-names>
</name>
<name><surname>Legault</surname>
<given-names>D</given-names>
</name>
<name><surname>Burton</surname>
<given-names>P</given-names>
</name>
<article-title>Population genomics: the public population project in genomics (P3G): a proof of concept?</article-title>
<source>Eur J Hum Genet</source>
<year>2008</year>
<volume>16</volume>
<issue>6</issue>
<fpage>664</fpage>
<lpage>665</lpage>
<pub-id pub-id-type="doi">10.1038/ejhg.2008.55</pub-id>
<pub-id pub-id-type="pmid">18382478</pub-id>
</mixed-citation>
</ref>
<ref id="B42"><mixed-citation publication-type="other"><article-title>ISCED: International Standard Classification of Education</article-title>
<comment>[<ext-link ext-link-type="uri" xlink:href="http://www.uis.unesco.org/Education/Pages/international-standard-classification-of-education.aspx">http://www.uis.unesco.org/Education/Pages/international-standard-classification-of-education.aspx</ext-link>
]</comment>
</mixed-citation>
</ref>
<ref id="B43"><mixed-citation publication-type="other"><article-title>International Standard Classification of Occupations (ISCO)</article-title>
<comment>[<ext-link ext-link-type="uri" xlink:href="http://www.ilo.org/public/english/bureau/stat/isco/">http://www.ilo.org/public/english/bureau/stat/isco/</ext-link>
]</comment>
</mixed-citation>
</ref>
<ref id="B44"><mixed-citation publication-type="book"><name><surname>Flanagan</surname>
<given-names>D</given-names>
</name>
<source>JavaScript: the definitive guide</source>
<year>2011</year>
<publisher-name>Sebastopol, California: O’Reilly Media</publisher-name>
</mixed-citation>
</ref>
<ref id="B45"><mixed-citation publication-type="journal"><name><surname>Jones</surname>
<given-names>E</given-names>
</name>
<name><surname>Sheehan</surname>
<given-names>N</given-names>
</name>
<name><surname>Masca</surname>
<given-names>N</given-names>
</name>
<name><surname>Wallace</surname>
<given-names>S</given-names>
</name>
<name><surname>Murtagh</surname>
<given-names>M</given-names>
</name>
<name><surname>Burton</surname>
<given-names>P</given-names>
</name>
<article-title>DataSHIELD–shared individual-level analysis without sharing the data: a biostatistical perspective</article-title>
<source>Norsk epidemiologi</source>
<year>2012</year>
<volume>21</volume>
<issue>2</issue>
<fpage>231</fpage>
<lpage>239</lpage>
</mixed-citation>
</ref>
<ref id="B46"><mixed-citation publication-type="journal"><name><surname>Sutton</surname>
<given-names>AJ</given-names>
</name>
<name><surname>Kendrick</surname>
<given-names>D</given-names>
</name>
<name><surname>Coupland</surname>
<given-names>CAC</given-names>
</name>
<article-title>Meta-analysis of individual- and aggregate-level data</article-title>
<source>Stat Med</source>
<year>2008</year>
<volume>27</volume>
<issue>5</issue>
<fpage>651</fpage>
<lpage>669</lpage>
<pub-id pub-id-type="doi">10.1002/sim.2916</pub-id>
<pub-id pub-id-type="pmid">17514698</pub-id>
</mixed-citation>
</ref>
<ref id="B47"><mixed-citation publication-type="journal"><name><surname>Gomatam</surname>
<given-names>S</given-names>
</name>
<name><surname>Karr</surname>
<given-names>AF</given-names>
</name>
<name><surname>Reiter</surname>
<given-names>JP</given-names>
</name>
<name><surname>Sanil</surname>
<given-names>AP</given-names>
</name>
<article-title>Data dissemination and disclosure limitation in a world without microdata: A risk-utility framework for remote access analysis servers</article-title>
<source>Stat Sci</source>
<year>2005</year>
<volume>20</volume>
<issue>2</issue>
<fpage>163</fpage>
<lpage>177</lpage>
<pub-id pub-id-type="doi">10.1214/088342305000000043</pub-id>
</mixed-citation>
</ref>
<ref id="B48"><mixed-citation publication-type="book"><collab>RStudio</collab>
<source>RStudio: Integrated development environment for R</source>
<year>2012</year>
<publisher-name>Boston, MA: (Version 0.97.551) [Computer software]</publisher-name>
<comment>[<ext-link ext-link-type="uri" xlink:href="http://www.rstudio.org/">http://www.rstudio.org/</ext-link>
]</comment>
</mixed-citation>
</ref>
</ref-list>
</back>
</pmc>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/CyberinfraV1/Data/Pmc/Corpus

HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000336 | SxmlIndent | more

HfdSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd -nk 000336 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    CyberinfraV1
   |flux=    Pmc
   |étape=   Corpus
   |type=    RBID
   |clé=     PMC:4175511
   |texte=   Data harmonization and federated analysis of population-based studies: the BioSHaRE project
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/RBID.i   -Sk "pubmed:24257327" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd   \
       | NlmPubMed2Wicri -a CyberinfraV1

This area was generated with Dilib version V0.6.25.
Data generation: Thu Oct 27 09:30:58 2016. Site generation: Sun Mar 10 23:08:40 2024

	Serveur d'exploration Cyberinfrastructure
	Attention, ce site est en cours de développement ! Attention, site généré par des moyens informatiques à partir de corpus bruts. Les informations ne sont donc pas validées.

Serveur d'exploration Cyberinfrastructure

Data harmonization and federated analysis of population-based studies: the BioSHaRE project

Data harmonization and federated analysis of population-based studies: the BioSHaRE project

Source :

Abstract

Links to Exploration step

Le document en format XML

Pour manipuler ce document sous Unix (Dilib)

Pour mettre un lien sur cette page dans le réseau Wicri

Pour générer des pages wiki