Cyberinfrastructure Exploration Server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Data publication with the structural biology data grid supports live analysis

Internal identifier: 000144 (Pmc/Corpus); previous: 000143; next: 000145

Authors: Peter A. Meyer; Stephanie Socias; Jason Key; Elizabeth Ransey; Emily C. Tjon; Alejandro Buschiazzo; Ming Lei; Chris Botka; James Withrow; David Neau; Kanagalaghatta Rajashankar; Karen S. Anderson; Richard H. Baxter; Stephen C. Blacklow; Titus J. Boggon; Alexandre M. J. J. Bonvin; Dominika Borek; Tom J. Brett; Amedeo Caflisch; Chung-I Chang; Walter J. Chazin; Kevin D. Corbett; Michael S. Cosgrove; Sean Crosson; Sirano Dhe-Paganon; Enrico Di Cera; Catherine L. Drennan; Michael J. Eck; Brandt F. Eichman; Qing R. Fan; Adrian R. Ferré-D'Amaré; J. Christopher Fromme; K. Christopher Garcia; Rachelle Gaudet; Peng Gong; Stephen C. Harrison; Ekaterina E. Heldwein; Zongchao Jia; Robert J. Keenan; Andrew C. Kruse; Marc Kvansakul; Jason S. Mclellan; Yorgo Modis; Yunsun Nam; Zbyszek Otwinowski; Emil F. Pai; Pedro José Barbosa Pereira; Carlo Petosa; C. S. Raman; Tom A. Rapoport; Antonina Roll-Mecak; Michael K. Rosen; Gabby Rudenko; Joseph Schlessinger; Thomas U. Schwartz; Yousif Shamoo; Holger Sondermann; Yizhi J. Tao; Niraj H. Tolia; Oleg V. Tsodikov; Kenneth D. Westover; Hao Wu; Ian Foster; James S. Fraser; Filipe R. N C. Maia; Tamir Gonen; Tom Kirchhausen; Kay Diederichs; Mercè Crosas; Piotr Sliz

Source:

RBID : PMC:4786681

Abstract

Access to experimental X-ray diffraction image data is fundamental for validation and reproduction of macromolecular models and indispensable for development of structural biology processing methods. Here, we established a diffraction data publication and dissemination system, Structural Biology Data Grid (SBDG; data.sbgrid.org), to preserve primary experimental data sets that support scientific publications. Data sets are accessible to researchers through a community-driven data grid, which facilitates global data access. Our analysis of a pilot collection of crystallographic data sets demonstrates that the information archived by SBDG is sufficient to reprocess data to statistics that meet or exceed the quality of the original published structures. SBDG has extended its services to the entire community and is used to develop support for other types of biomedical data sets. It is anticipated that access to the experimental data sets will enhance the paradigm shift in the community towards a much more dynamic body of continuously improving data analysis.


Url:
DOI: 10.1038/ncomms10882
PubMed: 26947396
PubMed Central: 4786681

Links to Exploration step

PMC:4786681

The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Data publication with the structural biology data grid supports live analysis</title>
<author>
<name sortKey="Meyer, Peter A" sort="Meyer, Peter A" uniqKey="Meyer P" first="Peter A." last="Meyer">Peter A. Meyer</name>
<affiliation>
<nlm:aff id="a1">
<institution>Department of Biological Chemistry and Molecular Pharmacology</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Socias, Stephanie" sort="Socias, Stephanie" uniqKey="Socias S" first="Stephanie" last="Socias">Stephanie Socias</name>
<affiliation>
<nlm:aff id="a1">
<institution>Department of Biological Chemistry and Molecular Pharmacology</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Key, Jason" sort="Key, Jason" uniqKey="Key J" first="Jason" last="Key">Jason Key</name>
<affiliation>
<nlm:aff id="a1">
<institution>Department of Biological Chemistry and Molecular Pharmacology</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Ransey, Elizabeth" sort="Ransey, Elizabeth" uniqKey="Ransey E" first="Elizabeth" last="Ransey">Elizabeth Ransey</name>
<affiliation>
<nlm:aff id="a1">
<institution>Department of Biological Chemistry and Molecular Pharmacology</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Tjon, Emily C" sort="Tjon, Emily C" uniqKey="Tjon E" first="Emily C." last="Tjon">Emily C. Tjon</name>
<affiliation>
<nlm:aff id="a1">
<institution>Department of Biological Chemistry and Molecular Pharmacology</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Buschiazzo, Alejandro" sort="Buschiazzo, Alejandro" uniqKey="Buschiazzo A" first="Alejandro" last="Buschiazzo">Alejandro Buschiazzo</name>
<affiliation>
<nlm:aff id="a2">
<institution>Laboratory of Molecular &amp; Structural Microbiology, Institut Pasteur de Montevideo</institution>
, Montevideo 11400,
<country>Uruguay</country>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="a3">
<institution>Department of Structural Biology &amp; Chemistry, Institut Pasteur</institution>
, 75015 Paris,
<country>France</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Lei, Ming" sort="Lei, Ming" uniqKey="Lei M" first="Ming" last="Lei">Ming Lei</name>
<affiliation>
<nlm:aff id="a4">
<institution>Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences</institution>
, Shanghai 200031,
<country>China</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Botka, Chris" sort="Botka, Chris" uniqKey="Botka C" first="Chris" last="Botka">Chris Botka</name>
<affiliation>
<nlm:aff id="a5">
<institution>Harvard Medical School</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Withrow, James" sort="Withrow, James" uniqKey="Withrow J" first="James" last="Withrow">James Withrow</name>
<affiliation>
<nlm:aff id="a6">
<institution>NE-CAT and Department of Chemistry and Chemical Biology, Cornell University</institution>
, Building 436E, Argonne National Laboratory, 9700 S. Cass Avenue, Argonne, Illinois 60439,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Neau, David" sort="Neau, David" uniqKey="Neau D" first="David" last="Neau">David Neau</name>
<affiliation>
<nlm:aff id="a6">
<institution>NE-CAT and Department of Chemistry and Chemical Biology, Cornell University</institution>
, Building 436E, Argonne National Laboratory, 9700 S. Cass Avenue, Argonne, Illinois 60439,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Rajashankar, Kanagalaghatta" sort="Rajashankar, Kanagalaghatta" uniqKey="Rajashankar K" first="Kanagalaghatta" last="Rajashankar">Kanagalaghatta Rajashankar</name>
<affiliation>
<nlm:aff id="a6">
<institution>NE-CAT and Department of Chemistry and Chemical Biology, Cornell University</institution>
, Building 436E, Argonne National Laboratory, 9700 S. Cass Avenue, Argonne, Illinois 60439,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Anderson, Karen S" sort="Anderson, Karen S" uniqKey="Anderson K" first="Karen S." last="Anderson">Karen S. Anderson</name>
<affiliation>
<nlm:aff id="a7">
<institution>Departments of Pharmacology and Molecular Biophysics and Biochemistry, Yale University School of Medicine</institution>
, New Haven, Connecticut 06520,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Baxter, Richard H" sort="Baxter, Richard H" uniqKey="Baxter R" first="Richard H." last="Baxter">Richard H. Baxter</name>
<affiliation>
<nlm:aff id="a8">
<institution>Department of Chemistry, Molecular Biophysics and Biochemistry, Yale University</institution>
, New Haven, Connecticut 06520,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Blacklow, Stephen C" sort="Blacklow, Stephen C" uniqKey="Blacklow S" first="Stephen C." last="Blacklow">Stephen C. Blacklow</name>
<affiliation>
<nlm:aff id="a1">
<institution>Department of Biological Chemistry and Molecular Pharmacology</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Boggon, Titus J" sort="Boggon, Titus J" uniqKey="Boggon T" first="Titus J." last="Boggon">Titus J. Boggon</name>
<affiliation>
<nlm:aff id="a7">
<institution>Departments of Pharmacology and Molecular Biophysics and Biochemistry, Yale University School of Medicine</institution>
, New Haven, Connecticut 06520,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Bonvin, Alexandre M J J" sort="Bonvin, Alexandre M J J" uniqKey="Bonvin A" first="Alexandre M. J. J." last="Bonvin">Alexandre M. J. J. Bonvin</name>
<affiliation>
<nlm:aff id="a9">
<institution>Bijvoet Center, Faculty of Science, Utrecht University</institution>
, 3584 CH Utrecht,
<country>The Netherlands</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Borek, Dominika" sort="Borek, Dominika" uniqKey="Borek D" first="Dominika" last="Borek">Dominika Borek</name>
<affiliation>
<nlm:aff id="a10">
<institution>Departments of Biophysics and Biochemistry, University of Texas Southwestern Medical Center</institution>
, Dallas, Texas 75390,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Brett, Tom J" sort="Brett, Tom J" uniqKey="Brett T" first="Tom J." last="Brett">Tom J. Brett</name>
<affiliation>
<nlm:aff id="a11">
<institution>Department of Internal Medicine, Washington University School of Medicine</institution>
, St Louis, Missouri 63110,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Caflisch, Amedeo" sort="Caflisch, Amedeo" uniqKey="Caflisch A" first="Amedeo" last="Caflisch">Amedeo Caflisch</name>
<affiliation>
<nlm:aff id="a12">
<institution>Department of Biochemistry, University of Zurich</institution>
, CH-8057 Zurich,
<country>Switzerland</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Chang, Chung I" sort="Chang, Chung I" uniqKey="Chang C" first="Chung-I" last="Chang">Chung-I Chang</name>
<affiliation>
<nlm:aff id="a13">
<institution>Institute of Biological Chemistry, Academia Sinica</institution>
, Taipei 11529,
<country>Taiwan</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Chazin, Walter J" sort="Chazin, Walter J" uniqKey="Chazin W" first="Walter J." last="Chazin">Walter J. Chazin</name>
<affiliation>
<nlm:aff id="a14">
<institution>Departments of Biochemistry and Chemistry, Center for Structural Biology, Vanderbilt University</institution>
, Nashville, Tennessee 37232,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Corbett, Kevin D" sort="Corbett, Kevin D" uniqKey="Corbett K" first="Kevin D." last="Corbett">Kevin D. Corbett</name>
<affiliation>
<nlm:aff id="a15">
<institution>Ludwig Institute for Cancer Research, San Diego Branch</institution>
, La Jolla, California 92093,
<country>USA</country>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="a16">
<institution>Department of Cellular and Molecular Medicine, University of California, San Diego</institution>
, La Jolla, California 92093,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Cosgrove, Michael S" sort="Cosgrove, Michael S" uniqKey="Cosgrove M" first="Michael S." last="Cosgrove">Michael S. Cosgrove</name>
<affiliation>
<nlm:aff id="a17">
<institution>Department of Biochemistry and Molecular Biology, SUNY Upstate Medical University</institution>
, Syracuse, New York 13210,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Crosson, Sean" sort="Crosson, Sean" uniqKey="Crosson S" first="Sean" last="Crosson">Sean Crosson</name>
<affiliation>
<nlm:aff id="a18">
<institution>Department of Biochemistry and Molecular Biology, University of Chicago</institution>
, Chicago, Illinois 60637,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Dhe Paganon, Sirano" sort="Dhe Paganon, Sirano" uniqKey="Dhe Paganon S" first="Sirano" last="Dhe-Paganon">Sirano Dhe-Paganon</name>
<affiliation>
<nlm:aff id="a19">
<institution>Department of Cancer Biology, Dana-Farber Cancer Institute</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Di Cera, Enrico" sort="Di Cera, Enrico" uniqKey="Di Cera E" first="Enrico" last="Di Cera">Enrico Di Cera</name>
<affiliation>
<nlm:aff id="a20">
<institution>Edward A. Doisy Department of Biochemistry and Molecular Biology, Saint Louis University School of Medicine</institution>
, St Louis, Missouri 63104,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Drennan, Catherine L" sort="Drennan, Catherine L" uniqKey="Drennan C" first="Catherine L." last="Drennan">Catherine L. Drennan</name>
<affiliation>
<nlm:aff id="a21">
<institution>Departments of Chemistry and Biology and the Howard Hughes Medical Institute, Massachusetts Institute of Technology</institution>
, Cambridge, Massachusetts 02139,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Eck, Michael J" sort="Eck, Michael J" uniqKey="Eck M" first="Michael J." last="Eck">Michael J. Eck</name>
<affiliation>
<nlm:aff id="a1">
<institution>Department of Biological Chemistry and Molecular Pharmacology</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="a19">
<institution>Department of Cancer Biology, Dana-Farber Cancer Institute</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Eichman, Brandt F" sort="Eichman, Brandt F" uniqKey="Eichman B" first="Brandt F." last="Eichman">Brandt F. Eichman</name>
<affiliation>
<nlm:aff id="a22">
<institution>Department of Biological Sciences and Center for Structural Biology, Vanderbilt University</institution>
, Nashville, Tennessee 37235,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Fan, Qing R" sort="Fan, Qing R" uniqKey="Fan Q" first="Qing R." last="Fan">Qing R. Fan</name>
<affiliation>
<nlm:aff id="a23">
<institution>Departments of Pharmacology and Pathology and Cell Biology, Columbia University</institution>
, New York, New York 10032,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Ferre D Amare, Adrian R" sort="Ferre D Amare, Adrian R" uniqKey="Ferre D Amare A" first="Adrian R." last="Ferré-D'Amaré">Adrian R. Ferré-D'Amaré</name>
<affiliation>
<nlm:aff id="a24">
<institution>Laboratory of RNA Biophysics, National Heart, Lung and Blood Institute, NIH</institution>
, Bethesda, Maryland 20892,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Christopher Fromme, J" sort="Christopher Fromme, J" uniqKey="Christopher Fromme J" first="J." last="Christopher Fromme">J. Christopher Fromme</name>
<affiliation>
<nlm:aff id="a25">
<institution>Department of Molecular Biology and Genetics, Weill Institute for Cell and Molecular Biology, Cornell University</institution>
, Ithaca, New York 14853,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Garcia, K Christopher" sort="Garcia, K Christopher" uniqKey="Garcia K" first="K. Christopher" last="Garcia">K. Christopher Garcia</name>
<affiliation>
<nlm:aff id="a26">
<institution>Howard Hughes Medical Institute, Stanford University School of Medicine</institution>
, Stanford, California 94305,
<country>USA</country>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="a27">
<institution>Department of Molecular and Cellular Physiology, Stanford University School of Medicine</institution>
, Stanford, California 94305,
<country>USA</country>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="a28">
<institution>Department of Structural Biology, Stanford University School of Medicine</institution>
, Stanford, California 94305,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Gaudet, Rachelle" sort="Gaudet, Rachelle" uniqKey="Gaudet R" first="Rachelle" last="Gaudet">Rachelle Gaudet</name>
<affiliation>
<nlm:aff id="a29">
<institution>Department of Molecular and Cellular Biology, Harvard University</institution>
, Cambridge, Massachusetts 02138,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Gong, Peng" sort="Gong, Peng" uniqKey="Gong P" first="Peng" last="Gong">Peng Gong</name>
<affiliation>
<nlm:aff id="a30">
<institution>Key Laboratory of Special Pathogens and Biosafety, Wuhan Institute of Virology, Chinese Academy of Sciences</institution>
, Wuhan 430071,
<country>China</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Harrison, Stephen C" sort="Harrison, Stephen C" uniqKey="Harrison S" first="Stephen C." last="Harrison">Stephen C. Harrison</name>
<affiliation>
<nlm:aff id="a1">
<institution>Department of Biological Chemistry and Molecular Pharmacology</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="a31">
<institution>Howard Hughes Medical Institute, Harvard Medical School</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="a32">
<institution>Laboratory of Molecular Medicine, Boston Children's Hospital, Harvard Medical School</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Heldwein, Ekaterina E" sort="Heldwein, Ekaterina E" uniqKey="Heldwein E" first="Ekaterina E." last="Heldwein">Ekaterina E. Heldwein</name>
<affiliation>
<nlm:aff id="a33">
<institution>Department of Molecular Biology and Microbiology, Tufts University School of Medicine</institution>
, Boston, Massachusetts 02111,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Jia, Zongchao" sort="Jia, Zongchao" uniqKey="Jia Z" first="Zongchao" last="Jia">Zongchao Jia</name>
<affiliation>
<nlm:aff id="a34">
<institution>Department of Biomedical and Molecular Sciences, Queen's University</institution>
, Kingston, Ontario,
<country>Canada</country>
K7M 3G5</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Keenan, Robert J" sort="Keenan, Robert J" uniqKey="Keenan R" first="Robert J." last="Keenan">Robert J. Keenan</name>
<affiliation>
<nlm:aff id="a18">
<institution>Department of Biochemistry and Molecular Biology, University of Chicago</institution>
, Chicago, Illinois 60637,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Kruse, Andrew C" sort="Kruse, Andrew C" uniqKey="Kruse A" first="Andrew C." last="Kruse">Andrew C. Kruse</name>
<affiliation>
<nlm:aff id="a1">
<institution>Department of Biological Chemistry and Molecular Pharmacology</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Kvansakul, Marc" sort="Kvansakul, Marc" uniqKey="Kvansakul M" first="Marc" last="Kvansakul">Marc Kvansakul</name>
<affiliation>
<nlm:aff id="a35">
<institution>Department of Biochemistry and Genetics, La Trobe University</institution>
, Melbourne, Victoria 3086,
<country>Australia</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Mclellan, Jason S" sort="Mclellan, Jason S" uniqKey="Mclellan J" first="Jason S." last="Mclellan">Jason S. Mclellan</name>
<affiliation>
<nlm:aff id="a36">
<institution>Department of Biochemistry, Geisel School of Medicine at Dartmouth</institution>
, Hanover, New Hampshire 03755,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Modis, Yorgo" sort="Modis, Yorgo" uniqKey="Modis Y" first="Yorgo" last="Modis">Yorgo Modis</name>
<affiliation>
<nlm:aff id="a37">
<institution>Department of Medicine, University of Cambridge, MRC Laboratory of Molecular Biology</institution>
, Francis Crick Avenue, Cambridge CB2 0QH,
<country>UK</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Nam, Yunsun" sort="Nam, Yunsun" uniqKey="Nam Y" first="Yunsun" last="Nam">Yunsun Nam</name>
<affiliation>
<nlm:aff id="a38">
<institution>University of Texas, Southwestern Medical Center</institution>
, Dallas, Texas 75390,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Otwinowski, Zbyszek" sort="Otwinowski, Zbyszek" uniqKey="Otwinowski Z" first="Zbyszek" last="Otwinowski">Zbyszek Otwinowski</name>
<affiliation>
<nlm:aff id="a10">
<institution>Departments of Biophysics and Biochemistry, University of Texas Southwestern Medical Center</institution>
, Dallas, Texas 75390,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Pai, Emil F" sort="Pai, Emil F" uniqKey="Pai E" first="Emil F." last="Pai">Emil F. Pai</name>
<affiliation>
<nlm:aff id="a39">
<institution>Departments of Biochemistry, Medical Biophysics and Molecular Genetics, University of Toronto</institution>
, Toronto, Ontario,
<country>Canada</country>
M5S 1A8</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="a40">
<institution>Campbell Family Institute for Cancer Research, Ontario Cancer Institute/University Health Network</institution>
, Toronto, Ontario,
<country>Canada</country>
M5G 2M9</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Pereira, Pedro Jose Barbosa" sort="Pereira, Pedro Jose Barbosa" uniqKey="Pereira P" first="Pedro José Barbosa" last="Pereira">Pedro José Barbosa Pereira</name>
<affiliation>
<nlm:aff id="a41">
<institution>IBMC—Instituto de Biologia Molecular e Celular and Instituto de Investigação e Inovação em Saúde, Universidade do Porto</institution>
, 4150 Porto,
<country>Portugal</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Petosa, Carlo" sort="Petosa, Carlo" uniqKey="Petosa C" first="Carlo" last="Petosa">Carlo Petosa</name>
<affiliation>
<nlm:aff id="a42">
<institution>Université Grenoble Alpes/CNRS/CEA, Institut de Biologie Structurale</institution>
, 38027 Grenoble,
<country>France</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Raman, C S" sort="Raman, C S" uniqKey="Raman C" first="C. S." last="Raman">C. S. Raman</name>
<affiliation>
<nlm:aff id="a43">
<institution>Department of Pharmaceutical Sciences, University of Maryland</institution>
, Baltimore, Maryland 21201,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Rapoport, Tom A" sort="Rapoport, Tom A" uniqKey="Rapoport T" first="Tom A." last="Rapoport">Tom A. Rapoport</name>
<affiliation>
<nlm:aff id="a44">
<institution>Howard Hughes Medical Institute and Harvard Medical School, Department of Cell Biology</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Roll Mecak, Antonina" sort="Roll Mecak, Antonina" uniqKey="Roll Mecak A" first="Antonina" last="Roll-Mecak">Antonina Roll-Mecak</name>
<affiliation>
<nlm:aff id="a45">
<institution>Cell Biology and Biophysics Unit, Porter Neuroscience Research Center, National Institute of Neurological Disorders and Stroke</institution>
, Bethesda, Maryland 20892,
<country>USA</country>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="a46">
<institution>National Heart, Lung and Blood Institute</institution>
, Bethesda, Maryland 20892,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Rosen, Michael K" sort="Rosen, Michael K" uniqKey="Rosen M" first="Michael K." last="Rosen">Michael K. Rosen</name>
<affiliation>
<nlm:aff id="a47">
<institution>Department of Biophysics and Howard Hughes Medical Institute, University of Texas Southwestern Medical Center</institution>
, Dallas, Texas 75390,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Rudenko, Gabby" sort="Rudenko, Gabby" uniqKey="Rudenko G" first="Gabby" last="Rudenko">Gabby Rudenko</name>
<affiliation>
<nlm:aff id="a48">
<institution>Department of Pharmacology and Toxicology, Sealy Center for Structural Biology and Molecular Biophysics, University of Texas Medical Branch</institution>
, Galveston, Texas 77555,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Schlessinger, Joseph" sort="Schlessinger, Joseph" uniqKey="Schlessinger J" first="Joseph" last="Schlessinger">Joseph Schlessinger</name>
<affiliation>
<nlm:aff id="a49">
<institution>Department of Pharmacology, Yale University School of Medicine</institution>
, New Haven, Connecticut 06520,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Schwartz, Thomas U" sort="Schwartz, Thomas U" uniqKey="Schwartz T" first="Thomas U." last="Schwartz">Thomas U. Schwartz</name>
<affiliation>
<nlm:aff id="a50">
<institution>Department of Biology, Massachusetts Institute of Technology</institution>
, Cambridge, Massachusetts 02139,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Shamoo, Yousif" sort="Shamoo, Yousif" uniqKey="Shamoo Y" first="Yousif" last="Shamoo">Yousif Shamoo</name>
<affiliation>
<nlm:aff id="a51">
<institution>Department of BioSciences, Rice University</institution>
, Houston, Texas 77005,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Sondermann, Holger" sort="Sondermann, Holger" uniqKey="Sondermann H" first="Holger" last="Sondermann">Holger Sondermann</name>
<affiliation>
<nlm:aff id="a52">
<institution>Department of Molecular Medicine, College of Veterinary Medicine, Cornell University</institution>
, Ithaca, New York 14853,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Tao, Yizhi J" sort="Tao, Yizhi J" uniqKey="Tao Y" first="Yizhi J." last="Tao">Yizhi J. Tao</name>
<affiliation>
<nlm:aff id="a51">
<institution>Department of BioSciences, Rice University</institution>
, Houston, Texas 77005,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Tolia, Niraj H" sort="Tolia, Niraj H" uniqKey="Tolia N" first="Niraj H." last="Tolia">Niraj H. Tolia</name>
<affiliation>
<nlm:aff id="a53">
<institution>Department of Molecular Microbiology, Washington University School of Medicine</institution>
, St Louis, Missouri 63110,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Tsodikov, Oleg V" sort="Tsodikov, Oleg V" uniqKey="Tsodikov O" first="Oleg V." last="Tsodikov">Oleg V. Tsodikov</name>
<affiliation>
<nlm:aff id="a54">
<institution>Department of Pharmaceutical Sciences, College of Pharmacy, University of Kentucky</institution>
, Lexington, Kentucky 40536,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Westover, Kenneth D" sort="Westover, Kenneth D" uniqKey="Westover K" first="Kenneth D." last="Westover">Kenneth D. Westover</name>
<affiliation>
<nlm:aff id="a55">
<institution>Departments of Biochemistry and Radiation Oncology, University of Texas, Southwestern Medical Center</institution>
, Dallas, Texas 75390,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Wu, Hao" sort="Wu, Hao" uniqKey="Wu H" first="Hao" last="Wu">Hao Wu</name>
<affiliation>
<nlm:aff id="a1">
<institution>Department of Biological Chemistry and Molecular Pharmacology</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="a56">
<institution>Program in Cellular and Molecular Medicine, Boston Children's Hospital</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Foster, Ian" sort="Foster, Ian" uniqKey="Foster I" first="Ian" last="Foster">Ian Foster</name>
<affiliation>
<nlm:aff id="a57">
<institution>Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, Illinois, and Department of Computer Science, University of Chicago</institution>
, Chicago, Illinois 60637,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Fraser, James S" sort="Fraser, James S" uniqKey="Fraser J" first="James S." last="Fraser">James S. Fraser</name>
<affiliation>
<nlm:aff id="a58">
<institution>Department of Bioengineering and Therapeutic Sciences, University of California San Francisco</institution>
, San Francisco, California 94158,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Maia, Filipe R N C" sort="Maia, Filipe R N C" uniqKey="Maia F" first="Filipe R. N C." last="Maia">Filipe R. N C. Maia</name>
<affiliation>
<nlm:aff id="a59">
<institution>Laboratory of Molecular Biophysics, Department of Cell and Molecular Biology, Uppsala University</institution>
, Husargatan 3 (Box 596), SE-751 24 Uppsala,
<country>Sweden</country>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="a60">
<institution>NERSC, Lawrence Berkeley National Laboratory</institution>
, Berkeley, California 94720,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Gonen, Tamir" sort="Gonen, Tamir" uniqKey="Gonen T" first="Tamir" last="Gonen">Tamir Gonen</name>
<affiliation>
<nlm:aff id="a61">
<institution>Janelia Research Campus, Howard Hughes Medical Institute</institution>
, Ashburn, Virginia 20147,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Kirchhausen, Tom" sort="Kirchhausen, Tom" uniqKey="Kirchhausen T" first="Tom" last="Kirchhausen">Tom Kirchhausen</name>
<affiliation>
<nlm:aff id="a62">
<institution>Program in Cellular and Molecular Medicine and Department of Pediatrics, Boston Children's Hospital</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="a63">
<institution>Departments of Cell Biology, Harvard Medical School</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Diederichs, Kay" sort="Diederichs, Kay" uniqKey="Diederichs K" first="Kay" last="Diederichs">Kay Diederichs</name>
<affiliation>
<nlm:aff id="a64">
<institution>Department of Biology, University of Konstanz</institution>
, D-78457 Konstanz,
<country>Germany</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Crosas, Merce" sort="Crosas, Merce" uniqKey="Crosas M" first="Mercè" last="Crosas">Mercè Crosas</name>
<affiliation>
<nlm:aff id="a65">
<institution>Institute for Quantitative Social Science, Harvard University</institution>
, Cambridge, Massachusetts 02138,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Sliz, Piotr" sort="Sliz, Piotr" uniqKey="Sliz P" first="Piotr" last="Sliz">Piotr Sliz</name>
<affiliation>
<nlm:aff id="a1">
<institution>Department of Biological Chemistry and Molecular Pharmacology</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">26947396</idno>
<idno type="pmc">4786681</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4786681</idno>
<idno type="RBID">PMC:4786681</idno>
<idno type="doi">10.1038/ncomms10882</idno>
<date when="2016">2016</date>
<idno type="wicri:Area/Pmc/Corpus">000144</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">Data publication with the structural biology data grid supports live analysis</title>
<author>
<name sortKey="Meyer, Peter A" sort="Meyer, Peter A" uniqKey="Meyer P" first="Peter A." last="Meyer">Peter A. Meyer</name>
<affiliation>
<nlm:aff id="a1">
<institution>Department of Biological Chemistry and Molecular Pharmacology</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Socias, Stephanie" sort="Socias, Stephanie" uniqKey="Socias S" first="Stephanie" last="Socias">Stephanie Socias</name>
<affiliation>
<nlm:aff id="a1">
<institution>Department of Biological Chemistry and Molecular Pharmacology</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Key, Jason" sort="Key, Jason" uniqKey="Key J" first="Jason" last="Key">Jason Key</name>
<affiliation>
<nlm:aff id="a1">
<institution>Department of Biological Chemistry and Molecular Pharmacology</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Ransey, Elizabeth" sort="Ransey, Elizabeth" uniqKey="Ransey E" first="Elizabeth" last="Ransey">Elizabeth Ransey</name>
<affiliation>
<nlm:aff id="a1">
<institution>Department of Biological Chemistry and Molecular Pharmacology</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Tjon, Emily C" sort="Tjon, Emily C" uniqKey="Tjon E" first="Emily C." last="Tjon">Emily C. Tjon</name>
<affiliation>
<nlm:aff id="a1">
<institution>Department of Biological Chemistry and Molecular Pharmacology</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Buschiazzo, Alejandro" sort="Buschiazzo, Alejandro" uniqKey="Buschiazzo A" first="Alejandro" last="Buschiazzo">Alejandro Buschiazzo</name>
<affiliation>
<nlm:aff id="a2">
<institution>Laboratory of Molecular &amp; Structural Microbiology, Institut Pasteur de Montevideo</institution>
, Montevideo 11400,
<country>Uruguay</country>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="a3">
<institution>Department of Structural Biology &amp; Chemistry, Institut Pasteur</institution>
, 75015 Paris,
<country>France</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Lei, Ming" sort="Lei, Ming" uniqKey="Lei M" first="Ming" last="Lei">Ming Lei</name>
<affiliation>
<nlm:aff id="a4">
<institution>Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences</institution>
, Shanghai 200031,
<country>China</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Botka, Chris" sort="Botka, Chris" uniqKey="Botka C" first="Chris" last="Botka">Chris Botka</name>
<affiliation>
<nlm:aff id="a5">
<institution>Harvard Medical School</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Withrow, James" sort="Withrow, James" uniqKey="Withrow J" first="James" last="Withrow">James Withrow</name>
<affiliation>
<nlm:aff id="a6">
<institution>NE-CAT and Department of Chemistry and Chemical Biology, Cornell University</institution>
, Building 436E, Argonne National Laboratory, 9700 S. Cass Avenue, Argonne, Illinois 60439,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Neau, David" sort="Neau, David" uniqKey="Neau D" first="David" last="Neau">David Neau</name>
<affiliation>
<nlm:aff id="a6">
<institution>NE-CAT and Department of Chemistry and Chemical Biology, Cornell University</institution>
, Building 436E, Argonne National Laboratory, 9700 S. Cass Avenue, Argonne, Illinois 60439,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Rajashankar, Kanagalaghatta" sort="Rajashankar, Kanagalaghatta" uniqKey="Rajashankar K" first="Kanagalaghatta" last="Rajashankar">Kanagalaghatta Rajashankar</name>
<affiliation>
<nlm:aff id="a6">
<institution>NE-CAT and Department of Chemistry and Chemical Biology, Cornell University</institution>
, Building 436E, Argonne National Laboratory, 9700 S. Cass Avenue, Argonne, Illinois 60439,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Anderson, Karen S" sort="Anderson, Karen S" uniqKey="Anderson K" first="Karen S." last="Anderson">Karen S. Anderson</name>
<affiliation>
<nlm:aff id="a7">
<institution>Departments of Pharmacology and Molecular Biophysics and Biochemistry, Yale University School of Medicine</institution>
, New Haven, Connecticut 06520,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Baxter, Richard H" sort="Baxter, Richard H" uniqKey="Baxter R" first="Richard H." last="Baxter">Richard H. Baxter</name>
<affiliation>
<nlm:aff id="a8">
<institution>Department of Chemistry, Molecular Biophysics and Biochemistry, Yale University</institution>
, New Haven, Connecticut 06520,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Blacklow, Stephen C" sort="Blacklow, Stephen C" uniqKey="Blacklow S" first="Stephen C." last="Blacklow">Stephen C. Blacklow</name>
<affiliation>
<nlm:aff id="a1">
<institution>Department of Biological Chemistry and Molecular Pharmacology</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Boggon, Titus J" sort="Boggon, Titus J" uniqKey="Boggon T" first="Titus J." last="Boggon">Titus J. Boggon</name>
<affiliation>
<nlm:aff id="a7">
<institution>Departments of Pharmacology and Molecular Biophysics and Biochemistry, Yale University School of Medicine</institution>
, New Haven, Connecticut 06520,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Bonvin, Alexandre M J J" sort="Bonvin, Alexandre M J J" uniqKey="Bonvin A" first="Alexandre M. J. J." last="Bonvin">Alexandre M. J. J. Bonvin</name>
<affiliation>
<nlm:aff id="a9">
<institution>Bijvoet Center, Faculty of Science, Utrecht University</institution>
, 3584 CH Utrecht,
<country>The Netherlands</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Borek, Dominika" sort="Borek, Dominika" uniqKey="Borek D" first="Dominika" last="Borek">Dominika Borek</name>
<affiliation>
<nlm:aff id="a10">
<institution>Departments of Biophysics and Biochemistry, University of Texas Southwestern Medical Center</institution>
, Dallas, Texas 75390,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Brett, Tom J" sort="Brett, Tom J" uniqKey="Brett T" first="Tom J." last="Brett">Tom J. Brett</name>
<affiliation>
<nlm:aff id="a11">
<institution>Department of Internal Medicine, Washington University School of Medicine</institution>
, St Louis, Missouri 63110,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Caflisch, Amedeo" sort="Caflisch, Amedeo" uniqKey="Caflisch A" first="Amedeo" last="Caflisch">Amedeo Caflisch</name>
<affiliation>
<nlm:aff id="a12">
<institution>Department of Biochemistry, University of Zurich</institution>
, CH-8057 Zurich,
<country>Switzerland</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Chang, Chung I" sort="Chang, Chung I" uniqKey="Chang C" first="Chung-I" last="Chang">Chung-I Chang</name>
<affiliation>
<nlm:aff id="a13">
<institution>Institute of Biological Chemistry, Academia Sinica</institution>
, Taipei 11529,
<country>Taiwan</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Chazin, Walter J" sort="Chazin, Walter J" uniqKey="Chazin W" first="Walter J." last="Chazin">Walter J. Chazin</name>
<affiliation>
<nlm:aff id="a14">
<institution>Departments of Biochemistry and Chemistry, Center for Structural Biology, Vanderbilt University</institution>
, Nashville, Tennessee 37232,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Corbett, Kevin D" sort="Corbett, Kevin D" uniqKey="Corbett K" first="Kevin D." last="Corbett">Kevin D. Corbett</name>
<affiliation>
<nlm:aff id="a15">
<institution>Ludwig Institute for Cancer Research, San Diego Branch</institution>
, La Jolla, California 92093,
<country>USA</country>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="a16">
<institution>Department of Cellular and Molecular Medicine, University of California, San Diego</institution>
, La Jolla, California 92093,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Cosgrove, Michael S" sort="Cosgrove, Michael S" uniqKey="Cosgrove M" first="Michael S." last="Cosgrove">Michael S. Cosgrove</name>
<affiliation>
<nlm:aff id="a17">
<institution>Department of Biochemistry and Molecular Biology, SUNY Upstate Medical University</institution>
, Syracuse, New York 13210,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Crosson, Sean" sort="Crosson, Sean" uniqKey="Crosson S" first="Sean" last="Crosson">Sean Crosson</name>
<affiliation>
<nlm:aff id="a18">
<institution>Department of Biochemistry and Molecular Biology, University of Chicago</institution>
, Chicago, Illinois 60637,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Dhe Paganon, Sirano" sort="Dhe Paganon, Sirano" uniqKey="Dhe Paganon S" first="Sirano" last="Dhe-Paganon">Sirano Dhe-Paganon</name>
<affiliation>
<nlm:aff id="a19">
<institution>Department of Cancer Biology, Dana-Farber Cancer Institute</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Di Cera, Enrico" sort="Di Cera, Enrico" uniqKey="Di Cera E" first="Enrico" last="Di Cera">Enrico Di Cera</name>
<affiliation>
<nlm:aff id="a20">
<institution>Edward A. Doisy Department of Biochemistry and Molecular Biology, Saint Louis University School of Medicine</institution>
, St Louis, Missouri 63104,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Drennan, Catherine L" sort="Drennan, Catherine L" uniqKey="Drennan C" first="Catherine L." last="Drennan">Catherine L. Drennan</name>
<affiliation>
<nlm:aff id="a21">
<institution>Departments of Chemistry and Biology and the Howard Hughes Medical Institute, Massachusetts Institute of Technology</institution>
, Cambridge, Massachusetts 02139,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Eck, Michael J" sort="Eck, Michael J" uniqKey="Eck M" first="Michael J." last="Eck">Michael J. Eck</name>
<affiliation>
<nlm:aff id="a1">
<institution>Department of Biological Chemistry and Molecular Pharmacology</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="a19">
<institution>Department of Cancer Biology, Dana-Farber Cancer Institute</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Eichman, Brandt F" sort="Eichman, Brandt F" uniqKey="Eichman B" first="Brandt F." last="Eichman">Brandt F. Eichman</name>
<affiliation>
<nlm:aff id="a22">
<institution>Department of Biological Sciences and Center for Structural Biology, Vanderbilt University</institution>
, Nashville, Tennessee 37235,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Fan, Qing R" sort="Fan, Qing R" uniqKey="Fan Q" first="Qing R." last="Fan">Qing R. Fan</name>
<affiliation>
<nlm:aff id="a23">
<institution>Departments of Pharmacology and Pathology and Cell Biology, Columbia University</institution>
, New York, New York 10032,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Ferre D Amare, Adrian R" sort="Ferre D Amare, Adrian R" uniqKey="Ferre D Amare A" first="Adrian R." last="Ferré-D'Amaré">Adrian R. Ferré-D'Amaré</name>
<affiliation>
<nlm:aff id="a24">
<institution>Laboratory of RNA Biophysics, National Heart, Lung and Blood Institute, NIH</institution>
, Bethesda, Maryland 20892,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Christopher Fromme, J" sort="Christopher Fromme, J" uniqKey="Christopher Fromme J" first="J." last="Christopher Fromme">J. Christopher Fromme</name>
<affiliation>
<nlm:aff id="a25">
<institution>Department of Molecular Biology and Genetics, Weill Institute for Cell and Molecular Biology, Cornell University</institution>
, Ithaca, New York 14853,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Garcia, K Christopher" sort="Garcia, K Christopher" uniqKey="Garcia K" first="K. Christopher" last="Garcia">K. Christopher Garcia</name>
<affiliation>
<nlm:aff id="a26">
<institution>Howard Hughes Medical Institute, Stanford University School of Medicine</institution>
, Stanford, California 94305,
<country>USA</country>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="a27">
<institution>Department of Molecular and Cellular Physiology, Stanford University School of Medicine</institution>
, Stanford, California 94305,
<country>USA</country>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="a28">
<institution>Department of Structural Biology, Stanford University School of Medicine</institution>
, Stanford, California 94305,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Gaudet, Rachelle" sort="Gaudet, Rachelle" uniqKey="Gaudet R" first="Rachelle" last="Gaudet">Rachelle Gaudet</name>
<affiliation>
<nlm:aff id="a29">
<institution>Department of Molecular and Cellular Biology, Harvard University</institution>
, Cambridge, Massachusetts 02138,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Gong, Peng" sort="Gong, Peng" uniqKey="Gong P" first="Peng" last="Gong">Peng Gong</name>
<affiliation>
<nlm:aff id="a30">
<institution>Key Laboratory of Special Pathogens and Biosafety, Wuhan Institute of Virology, Chinese Academy of Sciences</institution>
, Wuhan 430071,
<country>China</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Harrison, Stephen C" sort="Harrison, Stephen C" uniqKey="Harrison S" first="Stephen C." last="Harrison">Stephen C. Harrison</name>
<affiliation>
<nlm:aff id="a1">
<institution>Department of Biological Chemistry and Molecular Pharmacology</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="a31">
<institution>Howard Hughes Medical Institute, Harvard Medical School</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="a32">
<institution>Laboratory of Molecular Medicine, Boston Children's Hospital, Harvard Medical School</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Heldwein, Ekaterina E" sort="Heldwein, Ekaterina E" uniqKey="Heldwein E" first="Ekaterina E." last="Heldwein">Ekaterina E. Heldwein</name>
<affiliation>
<nlm:aff id="a33">
<institution>Department of Molecular Biology and Microbiology, Tufts University School of Medicine</institution>
, Boston, Massachusetts 02111,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Jia, Zongchao" sort="Jia, Zongchao" uniqKey="Jia Z" first="Zongchao" last="Jia">Zongchao Jia</name>
<affiliation>
<nlm:aff id="a34">
<institution>Department of Biomedical and Molecular Sciences, Queen's University</institution>
, Kingston, Ontario,
<country>Canada</country>
K7M 3G5</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Keenan, Robert J" sort="Keenan, Robert J" uniqKey="Keenan R" first="Robert J." last="Keenan">Robert J. Keenan</name>
<affiliation>
<nlm:aff id="a18">
<institution>Department of Biochemistry and Molecular Biology, University of Chicago</institution>
, Chicago, Illinois 60637,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Kruse, Andrew C" sort="Kruse, Andrew C" uniqKey="Kruse A" first="Andrew C." last="Kruse">Andrew C. Kruse</name>
<affiliation>
<nlm:aff id="a1">
<institution>Department of Biological Chemistry and Molecular Pharmacology</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Kvansakul, Marc" sort="Kvansakul, Marc" uniqKey="Kvansakul M" first="Marc" last="Kvansakul">Marc Kvansakul</name>
<affiliation>
<nlm:aff id="a35">
<institution>Department of Biochemistry and Genetics, La Trobe University</institution>
, Melbourne, Victoria 3086,
<country>Australia</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Mclellan, Jason S" sort="Mclellan, Jason S" uniqKey="Mclellan J" first="Jason S." last="McLellan">Jason S. McLellan</name>
<affiliation>
<nlm:aff id="a36">
<institution>Department of Biochemistry, Geisel School of Medicine at Dartmouth</institution>
, Hanover, New Hampshire 03755,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Modis, Yorgo" sort="Modis, Yorgo" uniqKey="Modis Y" first="Yorgo" last="Modis">Yorgo Modis</name>
<affiliation>
<nlm:aff id="a37">
<institution>Department of Medicine, University of Cambridge, MRC Laboratory of Molecular Biology</institution>
, Francis Crick Avenue, Cambridge CB2 0QH,
<country>UK</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Nam, Yunsun" sort="Nam, Yunsun" uniqKey="Nam Y" first="Yunsun" last="Nam">Yunsun Nam</name>
<affiliation>
<nlm:aff id="a38">
<institution>University of Texas, Southwestern Medical Center</institution>
, Dallas, Texas 75390,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Otwinowski, Zbyszek" sort="Otwinowski, Zbyszek" uniqKey="Otwinowski Z" first="Zbyszek" last="Otwinowski">Zbyszek Otwinowski</name>
<affiliation>
<nlm:aff id="a10">
<institution>Departments of Biophysics and Biochemistry, University of Texas Southwestern Medical Center</institution>
, Dallas, Texas 75390,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Pai, Emil F" sort="Pai, Emil F" uniqKey="Pai E" first="Emil F." last="Pai">Emil F. Pai</name>
<affiliation>
<nlm:aff id="a39">
<institution>Departments of Biochemistry, Medical Biophysics and Molecular Genetics, University of Toronto</institution>
, Toronto, Ontario,
<country>Canada</country>
M5S 1A8</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="a40">
<institution>Campbell Family Institute for Cancer Research, Ontario Cancer Institute/University Health Network</institution>
, Toronto, Ontario,
<country>Canada</country>
M5G 2M9</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Pereira, Pedro Jose Barbosa" sort="Pereira, Pedro Jose Barbosa" uniqKey="Pereira P" first="Pedro José Barbosa" last="Pereira">Pedro José Barbosa Pereira</name>
<affiliation>
<nlm:aff id="a41">
<institution>IBMC—Instituto de Biologia Molecular e Celular and Instituto de Investigação e Inovação em Saúde, Universidade do Porto</institution>
, 4150 Porto,
<country>Portugal</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Petosa, Carlo" sort="Petosa, Carlo" uniqKey="Petosa C" first="Carlo" last="Petosa">Carlo Petosa</name>
<affiliation>
<nlm:aff id="a42">
<institution>Université Grenoble Alpes/CNRS/CEA, Institut de Biologie Structurale</institution>
, 38027 Grenoble,
<country>France</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Raman, C S" sort="Raman, C S" uniqKey="Raman C" first="C. S." last="Raman">C. S. Raman</name>
<affiliation>
<nlm:aff id="a43">
<institution>Department of Pharmaceutical Sciences, University of Maryland</institution>
, Baltimore, Maryland 21201,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Rapoport, Tom A" sort="Rapoport, Tom A" uniqKey="Rapoport T" first="Tom A." last="Rapoport">Tom A. Rapoport</name>
<affiliation>
<nlm:aff id="a44">
<institution>Howard Hughes Medical Institute and Harvard Medical School, Department of Cell Biology</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Roll Mecak, Antonina" sort="Roll Mecak, Antonina" uniqKey="Roll Mecak A" first="Antonina" last="Roll-Mecak">Antonina Roll-Mecak</name>
<affiliation>
<nlm:aff id="a45">
<institution>Cell Biology and Biophysics Unit, Porter Neuroscience Research Center, National Institute of Neurological Disorders and Stroke</institution>
, Bethesda, Maryland 20892,
<country>USA</country>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="a46">
<institution>National Heart, Lung and Blood Institute</institution>
, Bethesda, Maryland 20892,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Rosen, Michael K" sort="Rosen, Michael K" uniqKey="Rosen M" first="Michael K." last="Rosen">Michael K. Rosen</name>
<affiliation>
<nlm:aff id="a47">
<institution>Department of Biophysics and Howard Hughes Medical Institute, University of Texas Southwestern Medical Center</institution>
, Dallas, Texas 75390,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Rudenko, Gabby" sort="Rudenko, Gabby" uniqKey="Rudenko G" first="Gabby" last="Rudenko">Gabby Rudenko</name>
<affiliation>
<nlm:aff id="a48">
<institution>Department of Pharmacology and Toxicology, Sealy Center for Structural Biology and Molecular Biophysics, University of Texas Medical Branch</institution>
, Galveston, Texas 77555,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Schlessinger, Joseph" sort="Schlessinger, Joseph" uniqKey="Schlessinger J" first="Joseph" last="Schlessinger">Joseph Schlessinger</name>
<affiliation>
<nlm:aff id="a49">
<institution>Department of Pharmacology, Yale University School of Medicine</institution>
, New Haven, Connecticut 06520,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Schwartz, Thomas U" sort="Schwartz, Thomas U" uniqKey="Schwartz T" first="Thomas U." last="Schwartz">Thomas U. Schwartz</name>
<affiliation>
<nlm:aff id="a50">
<institution>Department of Biology, Massachusetts Institute of Technology</institution>
, Cambridge, Massachusetts 02139,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Shamoo, Yousif" sort="Shamoo, Yousif" uniqKey="Shamoo Y" first="Yousif" last="Shamoo">Yousif Shamoo</name>
<affiliation>
<nlm:aff id="a51">
<institution>Department of BioSciences, Rice University</institution>
, Houston, Texas 77005,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Sondermann, Holger" sort="Sondermann, Holger" uniqKey="Sondermann H" first="Holger" last="Sondermann">Holger Sondermann</name>
<affiliation>
<nlm:aff id="a52">
<institution>Department of Molecular Medicine, College of Veterinary Medicine, Cornell University</institution>
, Ithaca, New York 14853,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Tao, Yizhi J" sort="Tao, Yizhi J" uniqKey="Tao Y" first="Yizhi J." last="Tao">Yizhi J. Tao</name>
<affiliation>
<nlm:aff id="a51">
<institution>Department of BioSciences, Rice University</institution>
, Houston, Texas 77005,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Tolia, Niraj H" sort="Tolia, Niraj H" uniqKey="Tolia N" first="Niraj H." last="Tolia">Niraj H. Tolia</name>
<affiliation>
<nlm:aff id="a53">
<institution>Department of Molecular Microbiology, Washington University School of Medicine</institution>
, St Louis, Missouri 63110,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Tsodikov, Oleg V" sort="Tsodikov, Oleg V" uniqKey="Tsodikov O" first="Oleg V." last="Tsodikov">Oleg V. Tsodikov</name>
<affiliation>
<nlm:aff id="a54">
<institution>Department of Pharmaceutical Sciences, College of Pharmacy, University of Kentucky</institution>
, Lexington, Kentucky 40536,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Westover, Kenneth D" sort="Westover, Kenneth D" uniqKey="Westover K" first="Kenneth D." last="Westover">Kenneth D. Westover</name>
<affiliation>
<nlm:aff id="a55">
<institution>Departments of Biochemistry and Radiation Oncology, University of Texas, Southwestern Medical Center</institution>
, Dallas, Texas 75390,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Wu, Hao" sort="Wu, Hao" uniqKey="Wu H" first="Hao" last="Wu">Hao Wu</name>
<affiliation>
<nlm:aff id="a1">
<institution>Department of Biological Chemistry and Molecular Pharmacology</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="a56">
<institution>Program in Cellular and Molecular Medicine, Boston Children's Hospital</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Foster, Ian" sort="Foster, Ian" uniqKey="Foster I" first="Ian" last="Foster">Ian Foster</name>
<affiliation>
<nlm:aff id="a57">
<institution>Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, Illinois, and Department of Computer Science, University of Chicago</institution>
, Chicago, Illinois 60637,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Fraser, James S" sort="Fraser, James S" uniqKey="Fraser J" first="James S." last="Fraser">James S. Fraser</name>
<affiliation>
<nlm:aff id="a58">
<institution>Department of Bioengineering and Therapeutic Sciences, University of California San Francisco</institution>
, San Francisco, California 94158,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Maia, Filipe R N C" sort="Maia, Filipe R N C" uniqKey="Maia F" first="Filipe R. N. C." last="Maia">Filipe R. N. C. Maia</name>
<affiliation>
<nlm:aff id="a59">
<institution>Laboratory of Molecular Biophysics, Department of Cell and Molecular Biology, Uppsala University</institution>
, Husargatan 3 (Box 596), SE-751 24 Uppsala,
<country>Sweden</country>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="a60">
<institution>NERSC, Lawrence Berkeley National Laboratory</institution>
, Berkeley, California 94720,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Gonen, Tamir" sort="Gonen, Tamir" uniqKey="Gonen T" first="Tamir" last="Gonen">Tamir Gonen</name>
<affiliation>
<nlm:aff id="a61">
<institution>Janelia Research Campus, Howard Hughes Medical Institute</institution>
, Ashburn, Virginia 20147,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Kirchhausen, Tom" sort="Kirchhausen, Tom" uniqKey="Kirchhausen T" first="Tom" last="Kirchhausen">Tom Kirchhausen</name>
<affiliation>
<nlm:aff id="a62">
<institution>Program in Cellular and Molecular Medicine and Department of Pediatrics, Boston Children's Hospital</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
<affiliation>
<nlm:aff id="a63">
<institution>Departments of Cell Biology, Harvard Medical School</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Diederichs, Kay" sort="Diederichs, Kay" uniqKey="Diederichs K" first="Kay" last="Diederichs">Kay Diederichs</name>
<affiliation>
<nlm:aff id="a64">
<institution>Department of Biology, University of Konstanz</institution>
, D-78457 Konstanz,
<country>Germany</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Crosas, Merce" sort="Crosas, Merce" uniqKey="Crosas M" first="Mercè" last="Crosas">Mercè Crosas</name>
<affiliation>
<nlm:aff id="a65">
<institution>Institute for Quantitative Social Science, Harvard University</institution>
, Cambridge, Massachusetts 02138,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Sliz, Piotr" sort="Sliz, Piotr" uniqKey="Sliz P" first="Piotr" last="Sliz">Piotr Sliz</name>
<affiliation>
<nlm:aff id="a1">
<institution>Department of Biological Chemistry and Molecular Pharmacology</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</nlm:aff>
</affiliation>
</author>
</analytic>
<series>
<title level="j">Nature Communications</title>
<idno type="eISSN">2041-1723</idno>
<imprint>
<date when="2016">2016</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<p>Access to experimental X-ray diffraction image data is fundamental for validation and reproduction of macromolecular models and indispensable for development of structural biology processing methods. Here, we established a diffraction data publication and dissemination system, Structural Biology Data Grid (SBDG; data.sbgrid.org), to preserve primary experimental data sets that support scientific publications. Data sets are accessible to researchers through a community-driven data grid, which facilitates global data access. Our analysis of a pilot collection of crystallographic data sets demonstrates that the information archived by SBDG is sufficient to reprocess data to statistics that meet or exceed the quality of the original published structures. SBDG has extended its services to the entire community and is used to develop support for other types of biomedical data sets. It is anticipated that access to the experimental data sets will enhance the paradigm shift in the community towards a much more dynamic body of continuously improving data analysis.</p>
</div>
</front>
<back>
<div1 type="bibliography">
<listBibl>
<biblStruct>
<analytic>
<author>
<name sortKey="Bilderback, D H" uniqKey="Bilderback D">D. H. Bilderback</name>
</author>
<author>
<name sortKey="Elleaume, P" uniqKey="Elleaume P">P. Elleaume</name>
</author>
<author>
<name sortKey="Weckert, E" uniqKey="Weckert E">E. Weckert</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Guss, J M" uniqKey="Guss J">J. M. Guss</name>
</author>
<author>
<name sortKey="Mcmahon, B" uniqKey="Mcmahon B">B. McMahon</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Meyer, G R" uniqKey="Meyer G">G. R. Meyer</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Elsliger, M A" uniqKey="Elsliger M">M.-A. Elsliger</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kroon Batenburg, L M J" uniqKey="Kroon Batenburg L">L. M. J. Kroon-Batenburg</name>
</author>
<author>
<name sortKey="Helliwell, J R" uniqKey="Helliwell J">J. R. Helliwell</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Terwilliger, T C" uniqKey="Terwilliger T">T. C. Terwilliger</name>
</author>
<author>
<name sortKey="Bricogne, G" uniqKey="Bricogne G">G. Bricogne</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Wall, M E" uniqKey="Wall M">M. E. Wall</name>
</author>
<author>
<name sortKey="Adams, P D" uniqKey="Adams P">P. D. Adams</name>
</author>
<author>
<name sortKey="Fraser, J S" uniqKey="Fraser J">J. S. Fraser</name>
</author>
<author>
<name sortKey="Sauter, N K" uniqKey="Sauter N">N. K. Sauter</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Joosten, R P" uniqKey="Joosten R">R. P. Joosten</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Karplus, P A" uniqKey="Karplus P">P. A. Karplus</name>
</author>
<author>
<name sortKey="Diederichs, K" uniqKey="Diederichs K">K. Diederichs</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Tanley, S W M" uniqKey="Tanley S">S. W. M. Tanley</name>
</author>
<author>
<name sortKey="Diederichs, K" uniqKey="Diederichs K">K. Diederichs</name>
</author>
<author>
<name sortKey="Kroon Batenburg, L M J" uniqKey="Kroon Batenburg L">L. M. J. Kroon-Batenburg</name>
</author>
<author>
<name sortKey="Schreurs, A M M" uniqKey="Schreurs A">A. M. M. Schreurs</name>
</author>
<author>
<name sortKey="Helliwell, J R" uniqKey="Helliwell J">J. R. Helliwell</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Matthews, B W" uniqKey="Matthews B">B. W. Matthews</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Janssen, B J" uniqKey="Janssen B">B. J. Janssen</name>
</author>
<author>
<name sortKey="Read, R J" uniqKey="Read R">R. J. Read</name>
</author>
<author>
<name sortKey="Brunger, A T" uniqKey="Brunger A">A. T. Brunger</name>
</author>
<author>
<name sortKey="Gros, P" uniqKey="Gros P">P. Gros</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Berman, H" uniqKey="Berman H">H. Berman</name>
</author>
<author>
<name sortKey="Henrick, K" uniqKey="Henrick K">K. Henrick</name>
</author>
<author>
<name sortKey="Nakamura, H" uniqKey="Nakamura H">H. Nakamura</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Berman, H" uniqKey="Berman H">H. Berman</name>
</author>
<author>
<name sortKey="Kleywegt, G" uniqKey="Kleywegt G">G. Kleywegt</name>
</author>
<author>
<name sortKey="Nakamura, H" uniqKey="Nakamura H">H. Nakamura</name>
</author>
<author>
<name sortKey="Markley, J" uniqKey="Markley J">J. Markley</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Morin, A" uniqKey="Morin A">A. Morin</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Foster, I" uniqKey="Foster I">I. Foster</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Foster, I" uniqKey="Foster I">I. Foster</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Chard, K" uniqKey="Chard K">K. Chard</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Stokes Rees, I" uniqKey="Stokes Rees I">I. Stokes-Rees</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Lee, D" uniqKey="Lee D">D. Lee</name>
</author>
<author>
<name sortKey="Raman, C" uniqKey="Raman C">C. Raman</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Rudenko, G" uniqKey="Rudenko G">G. Rudenko</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Biasini, M" uniqKey="Biasini M">M. Biasini</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Starr, J" uniqKey="Starr J">J. Starr</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Bourne, P E" uniqKey="Bourne P">P. E. Bourne</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Altman, M" uniqKey="Altman M">M. Altman</name>
</author>
<author>
<name sortKey="King, G" uniqKey="King G">G. King</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Altman, M" uniqKey="Altman M">M. Altman</name>
</author>
<author>
<name sortKey="Crosas, M" uniqKey="Crosas M">M. Crosas</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Socias, S" uniqKey="Socias S">S. Socias</name>
</author>
<author>
<name sortKey="Morin, A" uniqKey="Morin A">A. Morin</name>
</author>
<author>
<name sortKey="Timony, M" uniqKey="Timony M">M. Timony</name>
</author>
<author>
<name sortKey="Sliz, P" uniqKey="Sliz P">P. Sliz</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Hunter, J C" uniqKey="Hunter J">J. C. Hunter</name>
</author>
<author>
<name sortKey="Westover, K D" uniqKey="Westover K">K. D. Westover</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Gilman, M S A" uniqKey="Gilman M">M. S. A. Gilman</name>
</author>
<author>
<name sortKey="Mclellan, J S" uniqKey="Mclellan J">J. S. McLellan</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Feldkamp, M D" uniqKey="Feldkamp M">M. D. Feldkamp</name>
</author>
<author>
<name sortKey="Chazin, W J" uniqKey="Chazin W">W. J. Chazin</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Tolia, N H" uniqKey="Tolia N">N. H. Tolia</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Hunter, J C" uniqKey="Hunter J">J. C. Hunter</name>
</author>
<author>
<name sortKey="Westover, K D" uniqKey="Westover K">K. D. Westover</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Corbett, K D" uniqKey="Corbett K">K. D. Corbett</name>
</author>
<author>
<name sortKey="Harrison, S" uniqKey="Harrison S">S. Harrison</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Gajadeera, C S" uniqKey="Gajadeera C">C. S. Gajadeera</name>
</author>
<author>
<name sortKey="Tsodikov, O V" uniqKey="Tsodikov O">O. V. Tsodikov</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Winter, G" uniqKey="Winter G">G. Winter</name>
</author>
<author>
<name sortKey="Lobley, C M C" uniqKey="Lobley C">C. M. C. Lobley</name>
</author>
<author>
<name sortKey="Prince, S M" uniqKey="Prince S">S. M. Prince</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Evans, P R" uniqKey="Evans P">P. R. Evans</name>
</author>
<author>
<name sortKey="Murshudov, G N" uniqKey="Murshudov G">G. N. Murshudov</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Evans, P" uniqKey="Evans P">P. Evans</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kabsch, W" uniqKey="Kabsch W">W. Kabsch</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Winn, M D" uniqKey="Winn M">M. D. Winn</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Leslie, A G W" uniqKey="Leslie A">A. G. W. Leslie</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Waterman, D G" uniqKey="Waterman D">D. G. Waterman</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Battye, G T" uniqKey="Battye G">G. T. Battye</name>
</author>
<author>
<name sortKey="Kontogiannis, L" uniqKey="Kontogiannis L">L. Kontogiannis</name>
</author>
<author>
<name sortKey="Johnson, O" uniqKey="Johnson O">O. Johnson</name>
</author>
<author>
<name sortKey="Powell, H R" uniqKey="Powell H">H. R. Powell</name>
</author>
<author>
<name sortKey="Leslie, A G" uniqKey="Leslie A">A. G. Leslie</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Helliwell, J R" uniqKey="Helliwell J">J. R. Helliwell</name>
</author>
<author>
<name sortKey="Mitchell, E P" uniqKey="Mitchell E">E. P. Mitchell</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Welberry, T" uniqKey="Welberry T">T. Welberry</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Wall, M E" uniqKey="Wall M">M. E. Wall</name>
</author>
<author>
<name sortKey="Clarage, J B" uniqKey="Clarage J">J. B. Clarage</name>
</author>
<author>
<name sortKey="Phillips, G N" uniqKey="Phillips G">G. N. Phillips</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Wall, M E" uniqKey="Wall M">M. E. Wall</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Wall, M" uniqKey="Wall M">M. Wall</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Fraser, J S" uniqKey="Fraser J">J. S. Fraser</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Shi, D" uniqKey="Shi D">D. Shi</name>
</author>
<author>
<name sortKey="Nannenga, B L" uniqKey="Nannenga B">B. L. Nannenga</name>
</author>
<author>
<name sortKey="Iadanza, M G" uniqKey="Iadanza M">M. G. Iadanza</name>
</author>
<author>
<name sortKey="Gonen, T" uniqKey="Gonen T">T. Gonen</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Nannenga, B L" uniqKey="Nannenga B">B. L. Nannenga</name>
</author>
<author>
<name sortKey="Shi, D" uniqKey="Shi D">D. Shi</name>
</author>
<author>
<name sortKey="Leslie, A G" uniqKey="Leslie A">A. G. Leslie</name>
</author>
<author>
<name sortKey="Gonen, T" uniqKey="Gonen T">T. Gonen</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Reyes, F" uniqKey="Reyes F">F. Reyes</name>
</author>
<author>
<name sortKey="Rodriguez, J" uniqKey="Rodriguez J">J. Rodriguez</name>
</author>
<author>
<name sortKey="Gonen, T" uniqKey="Gonen T">T. Gonen</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="De La Cruz, J" uniqKey="De La Cruz J">J. de la Cruz</name>
</author>
<author>
<name sortKey="Shi, D" uniqKey="Shi D">D. Shi</name>
</author>
<author>
<name sortKey="Gonen, T" uniqKey="Gonen T">T. Gonen</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Shi, D" uniqKey="Shi D">D. Shi</name>
</author>
<author>
<name sortKey="Gonen, T" uniqKey="Gonen T">T. Gonen</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Vangone, A" uniqKey="Vangone A">A. Vangone</name>
</author>
<author>
<name sortKey="Bonvin, A M" uniqKey="Bonvin A">A. M. Bonvin</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Bowers, K" uniqKey="Bowers K">K. Bowers</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Sliz, P" uniqKey="Sliz P">P. Sliz</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Chen, B C" uniqKey="Chen B">B.-C. Chen</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kural, C" uniqKey="Kural C">C. Kural</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Upadhyayula, S" uniqKey="Upadhyayula S">S. Upadhyayula</name>
</author>
<author>
<name sortKey="Kirchhausen, T" uniqKey="Kirchhausen T">T. Kirchhausen</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Crosas, M" uniqKey="Crosas M">M. Crosas</name>
</author>
<author>
<name sortKey="Honaker, J" uniqKey="Honaker J">J. Honaker</name>
</author>
<author>
<name sortKey="King, G" uniqKey="King G">G. King</name>
</author>
<author>
<name sortKey="Sweeney, L" uniqKey="Sweeney L">L. Sweeney</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Crosas, M" uniqKey="Crosas M">M. Crosas</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Crosas, M" uniqKey="Crosas M">M. Crosas</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="King, G" uniqKey="King G">G. King</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Nicholls, R A" uniqKey="Nicholls R">R. A. Nicholls</name>
</author>
<author>
<name sortKey="Fischer, M" uniqKey="Fischer M">M. Fischer</name>
</author>
<author>
<name sortKey="Stuart, M" uniqKey="Stuart M">M. Stuart</name>
</author>
<author>
<name sortKey="Murshudov, G N" uniqKey="Murshudov G">G. N. Murshudov</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Chowdary, T K" uniqKey="Chowdary T">T. K. Chowdary</name>
</author>
</analytic>
</biblStruct>
</listBibl>
</div1>
</back>
</TEI>
<pmc article-type="research-article">
<pmc-dir>properties open_access</pmc-dir>
<front>
<journal-meta>
<journal-id journal-id-type="nlm-ta">Nat Commun</journal-id>
<journal-id journal-id-type="iso-abbrev">Nat Commun</journal-id>
<journal-title-group>
<journal-title>Nature Communications</journal-title>
</journal-title-group>
<issn pub-type="epub">2041-1723</issn>
<publisher>
<publisher-name>Nature Publishing Group</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="pmid">26947396</article-id>
<article-id pub-id-type="pmc">4786681</article-id>
<article-id pub-id-type="pii">ncomms10882</article-id>
<article-id pub-id-type="doi">10.1038/ncomms10882</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Data publication with the structural biology data grid supports live analysis</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Meyer</surname>
<given-names>Peter A.</given-names>
</name>
<xref ref-type="aff" rid="a1">1</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Socias</surname>
<given-names>Stephanie</given-names>
</name>
<xref ref-type="aff" rid="a1">1</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Key</surname>
<given-names>Jason</given-names>
</name>
<xref ref-type="aff" rid="a1">1</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Ransey</surname>
<given-names>Elizabeth</given-names>
</name>
<xref ref-type="aff" rid="a1">1</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Tjon</surname>
<given-names>Emily C.</given-names>
</name>
<xref ref-type="aff" rid="a1">1</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Buschiazzo</surname>
<given-names>Alejandro</given-names>
</name>
<xref ref-type="aff" rid="a2">2</xref>
<xref ref-type="aff" rid="a3">3</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Lei</surname>
<given-names>Ming</given-names>
</name>
<xref ref-type="aff" rid="a4">4</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Botka</surname>
<given-names>Chris</given-names>
</name>
<xref ref-type="aff" rid="a5">5</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Withrow</surname>
<given-names>James</given-names>
</name>
<xref ref-type="aff" rid="a6">6</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Neau</surname>
<given-names>David</given-names>
</name>
<xref ref-type="aff" rid="a6">6</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Rajashankar</surname>
<given-names>Kanagalaghatta</given-names>
</name>
<xref ref-type="aff" rid="a6">6</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Anderson</surname>
<given-names>Karen S.</given-names>
</name>
<xref ref-type="aff" rid="a7">7</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Baxter</surname>
<given-names>Richard H.</given-names>
</name>
<xref ref-type="aff" rid="a8">8</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Blacklow</surname>
<given-names>Stephen C.</given-names>
</name>
<xref ref-type="aff" rid="a1">1</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Boggon</surname>
<given-names>Titus J.</given-names>
</name>
<xref ref-type="aff" rid="a7">7</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Bonvin</surname>
<given-names>Alexandre M. J. J.</given-names>
</name>
<xref ref-type="aff" rid="a9">9</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Borek</surname>
<given-names>Dominika</given-names>
</name>
<xref ref-type="aff" rid="a10">10</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Brett</surname>
<given-names>Tom J.</given-names>
</name>
<xref ref-type="aff" rid="a11">11</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Caflisch</surname>
<given-names>Amedeo</given-names>
</name>
<xref ref-type="aff" rid="a12">12</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Chang</surname>
<given-names>Chung-I</given-names>
</name>
<xref ref-type="aff" rid="a13">13</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Chazin</surname>
<given-names>Walter J.</given-names>
</name>
<xref ref-type="aff" rid="a14">14</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Corbett</surname>
<given-names>Kevin D.</given-names>
</name>
<xref ref-type="aff" rid="a15">15</xref>
<xref ref-type="aff" rid="a16">16</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Cosgrove</surname>
<given-names>Michael S.</given-names>
</name>
<xref ref-type="aff" rid="a17">17</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Crosson</surname>
<given-names>Sean</given-names>
</name>
<xref ref-type="aff" rid="a18">18</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Dhe-Paganon</surname>
<given-names>Sirano</given-names>
</name>
<xref ref-type="aff" rid="a19">19</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Di Cera</surname>
<given-names>Enrico</given-names>
</name>
<xref ref-type="aff" rid="a20">20</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Drennan</surname>
<given-names>Catherine L.</given-names>
</name>
<xref ref-type="aff" rid="a21">21</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Eck</surname>
<given-names>Michael J.</given-names>
</name>
<xref ref-type="aff" rid="a1">1</xref>
<xref ref-type="aff" rid="a19">19</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Eichman</surname>
<given-names>Brandt F.</given-names>
</name>
<xref ref-type="aff" rid="a22">22</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Fan</surname>
<given-names>Qing R.</given-names>
</name>
<xref ref-type="aff" rid="a23">23</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Ferré-D'Amaré</surname>
<given-names>Adrian R.</given-names>
</name>
<xref ref-type="aff" rid="a24">24</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Fromme</surname>
<given-names>J. Christopher</given-names>
</name>
<xref ref-type="aff" rid="a25">25</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Garcia</surname>
<given-names>K. Christopher</given-names>
</name>
<xref ref-type="aff" rid="a26">26</xref>
<xref ref-type="aff" rid="a27">27</xref>
<xref ref-type="aff" rid="a28">28</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Gaudet</surname>
<given-names>Rachelle</given-names>
</name>
<xref ref-type="aff" rid="a29">29</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Gong</surname>
<given-names>Peng</given-names>
</name>
<xref ref-type="aff" rid="a30">30</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Harrison</surname>
<given-names>Stephen C.</given-names>
</name>
<xref ref-type="aff" rid="a1">1</xref>
<xref ref-type="aff" rid="a31">31</xref>
<xref ref-type="aff" rid="a32">32</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Heldwein</surname>
<given-names>Ekaterina E.</given-names>
</name>
<xref ref-type="aff" rid="a33">33</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Jia</surname>
<given-names>Zongchao</given-names>
</name>
<xref ref-type="aff" rid="a34">34</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Keenan</surname>
<given-names>Robert J.</given-names>
</name>
<xref ref-type="aff" rid="a18">18</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Kruse</surname>
<given-names>Andrew C.</given-names>
</name>
<xref ref-type="aff" rid="a1">1</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Kvansakul</surname>
<given-names>Marc</given-names>
</name>
<xref ref-type="aff" rid="a35">35</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>McLellan</surname>
<given-names>Jason S.</given-names>
</name>
<xref ref-type="aff" rid="a36">36</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Modis</surname>
<given-names>Yorgo</given-names>
</name>
<xref ref-type="aff" rid="a37">37</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Nam</surname>
<given-names>Yunsun</given-names>
</name>
<xref ref-type="aff" rid="a38">38</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Otwinowski</surname>
<given-names>Zbyszek</given-names>
</name>
<xref ref-type="aff" rid="a10">10</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Pai</surname>
<given-names>Emil F.</given-names>
</name>
<xref ref-type="aff" rid="a39">39</xref>
<xref ref-type="aff" rid="a40">40</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Pereira</surname>
<given-names>Pedro José Barbosa</given-names>
</name>
<xref ref-type="aff" rid="a41">41</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Petosa</surname>
<given-names>Carlo</given-names>
</name>
<xref ref-type="aff" rid="a42">42</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Raman</surname>
<given-names>C. S.</given-names>
</name>
<xref ref-type="aff" rid="a43">43</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Rapoport</surname>
<given-names>Tom A.</given-names>
</name>
<xref ref-type="aff" rid="a44">44</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Roll-Mecak</surname>
<given-names>Antonina</given-names>
</name>
<xref ref-type="aff" rid="a45">45</xref>
<xref ref-type="aff" rid="a46">46</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Rosen</surname>
<given-names>Michael K.</given-names>
</name>
<xref ref-type="aff" rid="a47">47</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Rudenko</surname>
<given-names>Gabby</given-names>
</name>
<xref ref-type="aff" rid="a48">48</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Schlessinger</surname>
<given-names>Joseph</given-names>
</name>
<xref ref-type="aff" rid="a49">49</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Schwartz</surname>
<given-names>Thomas U.</given-names>
</name>
<xref ref-type="aff" rid="a50">50</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Shamoo</surname>
<given-names>Yousif</given-names>
</name>
<xref ref-type="aff" rid="a51">51</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Sondermann</surname>
<given-names>Holger</given-names>
</name>
<xref ref-type="aff" rid="a52">52</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Tao</surname>
<given-names>Yizhi J.</given-names>
</name>
<xref ref-type="aff" rid="a51">51</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Tolia</surname>
<given-names>Niraj H.</given-names>
</name>
<xref ref-type="aff" rid="a53">53</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Tsodikov</surname>
<given-names>Oleg V.</given-names>
</name>
<xref ref-type="aff" rid="a54">54</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Westover</surname>
<given-names>Kenneth D.</given-names>
</name>
<xref ref-type="aff" rid="a55">55</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Wu</surname>
<given-names>Hao</given-names>
</name>
<xref ref-type="aff" rid="a1">1</xref>
<xref ref-type="aff" rid="a56">56</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Foster</surname>
<given-names>Ian</given-names>
</name>
<xref ref-type="aff" rid="a57">57</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Fraser</surname>
<given-names>James S.</given-names>
</name>
<xref ref-type="aff" rid="a58">58</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Maia</surname>
<given-names>Filipe R. N. C.</given-names>
</name>
<xref ref-type="aff" rid="a59">59</xref>
<xref ref-type="aff" rid="a60">60</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Gonen</surname>
<given-names>Tamir</given-names>
</name>
<xref ref-type="aff" rid="a61">61</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Kirchhausen</surname>
<given-names>Tom</given-names>
</name>
<xref ref-type="aff" rid="a62">62</xref>
<xref ref-type="aff" rid="a63">63</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Diederichs</surname>
<given-names>Kay</given-names>
</name>
<xref ref-type="aff" rid="a64">64</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Crosas</surname>
<given-names>Mercè</given-names>
</name>
<xref ref-type="aff" rid="a65">65</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Sliz</surname>
<given-names>Piotr</given-names>
</name>
<xref ref-type="corresp" rid="c1">a</xref>
<xref ref-type="aff" rid="a1">1</xref>
</contrib>
<aff id="a1">
<label>1</label>
<institution>Department of Biological Chemistry and Molecular Pharmacology</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</aff>
<aff id="a2">
<label>2</label>
<institution>Laboratory of Molecular &amp; Structural Microbiology, Institut Pasteur de Montevideo</institution>
, Montevideo 11400,
<country>Uruguay</country>
</aff>
<aff id="a3">
<label>3</label>
<institution>Department of Structural Biology &amp; Chemistry, Institut Pasteur</institution>
, 75015 Paris,
<country>France</country>
</aff>
<aff id="a4">
<label>4</label>
<institution>Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences</institution>
, Shanghai 200031,
<country>China</country>
</aff>
<aff id="a5">
<label>5</label>
<institution>Harvard Medical School</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</aff>
<aff id="a6">
<label>6</label>
<institution>NE-CAT and Department of Chemistry and Chemical Biology, Cornell University</institution>
, Building 436E, Argonne National Laboratory, 9700 S. Cass Avenue, Argonne, Illinois 60439,
<country>USA</country>
</aff>
<aff id="a7">
<label>7</label>
<institution>Departments of Pharmacology and Molecular Biophysics and Biochemistry, Yale University School of Medicine</institution>
, New Haven, Connecticut 06520,
<country>USA</country>
</aff>
<aff id="a8">
<label>8</label>
<institution>Department of Chemistry, Molecular Biophysics and Biochemistry, Yale University</institution>
, New Haven, Connecticut 06520,
<country>USA</country>
</aff>
<aff id="a9">
<label>9</label>
<institution>Bijvoet Center, Faculty of Science, Utrecht University</institution>
, 3584 CH Utrecht,
<country>The Netherlands</country>
</aff>
<aff id="a10">
<label>10</label>
<institution>Departments of Biophysics and Biochemistry, University of Texas Southwestern Medical Center</institution>
, Dallas, Texas 75390,
<country>USA</country>
</aff>
<aff id="a11">
<label>11</label>
<institution>Department of Internal Medicine, Washington University School of Medicine</institution>
, St Louis, Missouri 63110,
<country>USA</country>
</aff>
<aff id="a12">
<label>12</label>
<institution>Department of Biochemistry, University of Zurich</institution>
, CH-8057 Zurich,
<country>Switzerland</country>
</aff>
<aff id="a13">
<label>13</label>
<institution>Institute of Biological Chemistry, Academia Sinica</institution>
, Taipei 11529,
<country>Taiwan</country>
</aff>
<aff id="a14">
<label>14</label>
<institution>Departments of Biochemistry and Chemistry, Center for Structural Biology, Vanderbilt University</institution>
, Nashville, Tennessee 37232,
<country>USA</country>
</aff>
<aff id="a15">
<label>15</label>
<institution>Ludwig Institute for Cancer Research, San Diego Branch</institution>
, La Jolla, California 92093,
<country>USA</country>
</aff>
<aff id="a16">
<label>16</label>
<institution>Department of Cellular and Molecular Medicine, University of California, San Diego</institution>
, La Jolla, California 92093,
<country>USA</country>
</aff>
<aff id="a17">
<label>17</label>
<institution>Department of Biochemistry and Molecular Biology, SUNY Upstate Medical University</institution>
, Syracuse, New York 13210,
<country>USA</country>
</aff>
<aff id="a18">
<label>18</label>
<institution>Department of Biochemistry and Molecular Biology, University of Chicago</institution>
, Chicago, Illinois 60637,
<country>USA</country>
</aff>
<aff id="a19">
<label>19</label>
<institution>Department of Cancer Biology, Dana-Farber Cancer Institute</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</aff>
<aff id="a20">
<label>20</label>
<institution>Edward A. Doisy Department of Biochemistry and Molecular Biology, Saint Louis University School of Medicine</institution>
, St Louis, Missouri 63104,
<country>USA</country>
</aff>
<aff id="a21">
<label>21</label>
<institution>Departments of Chemistry and Biology and the Howard Hughes Medical Institute, Massachusetts Institute of Technology</institution>
, Cambridge, Massachusetts 02139,
<country>USA</country>
</aff>
<aff id="a22">
<label>22</label>
<institution>Department of Biological Sciences and Center for Structural Biology, Vanderbilt University</institution>
, Nashville, Tennessee 37235,
<country>USA</country>
</aff>
<aff id="a23">
<label>23</label>
<institution>Departments of Pharmacology and Pathology and Cell Biology, Columbia University</institution>
, New York, New York 10032,
<country>USA</country>
</aff>
<aff id="a24">
<label>24</label>
<institution>Laboratory of RNA Biophysics, National Heart, Lung and Blood Institute, NIH</institution>
, Bethesda, Maryland 20892,
<country>USA</country>
</aff>
<aff id="a25">
<label>25</label>
<institution>Department of Molecular Biology and Genetics, Weill Institute for Cell and Molecular Biology, Cornell University</institution>
, Ithaca, New York 14853,
<country>USA</country>
</aff>
<aff id="a26">
<label>26</label>
<institution>Howard Hughes Medical Institute, Stanford University School of Medicine</institution>
, Stanford, California 94305,
<country>USA</country>
</aff>
<aff id="a27">
<label>27</label>
<institution>Department of Molecular and Cellular Physiology, Stanford University School of Medicine</institution>
, Stanford, California 94305,
<country>USA</country>
</aff>
<aff id="a28">
<label>28</label>
<institution>Department of Structural Biology, Stanford University School of Medicine</institution>
, Stanford, California 94305,
<country>USA</country>
</aff>
<aff id="a29">
<label>29</label>
<institution>Department of Molecular and Cellular Biology, Harvard University</institution>
, Cambridge, Massachusetts 02138,
<country>USA</country>
</aff>
<aff id="a30">
<label>30</label>
<institution>Key Laboratory of Special Pathogens and Biosafety, Wuhan Institute of Virology, Chinese Academy of Sciences</institution>
, Wuhan 430071,
<country>China</country>
</aff>
<aff id="a31">
<label>31</label>
<institution>Howard Hughes Medical Institute, Harvard Medical School</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</aff>
<aff id="a32">
<label>32</label>
<institution>Laboratory of Molecular Medicine, Boston Children's Hospital, Harvard Medical School</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</aff>
<aff id="a33">
<label>33</label>
<institution>Department of Molecular Biology and Microbiology, Tufts University School of Medicine</institution>
, Boston, Massachusetts 02111,
<country>USA</country>
</aff>
<aff id="a34">
<label>34</label>
<institution>Department of Biomedical and Molecular Sciences, Queen's University</institution>
, Kingston, Ontario,
<country>Canada</country>
K7M 3G5</aff>
<aff id="a35">
<label>35</label>
<institution>Department of Biochemistry and Genetics, La Trobe University</institution>
, Melbourne, Victoria 3086,
<country>Australia</country>
</aff>
<aff id="a36">
<label>36</label>
<institution>Department of Biochemistry, Geisel School of Medicine at Dartmouth</institution>
, Hanover, New Hampshire 03755,
<country>USA</country>
</aff>
<aff id="a37">
<label>37</label>
<institution>Department of Medicine, University of Cambridge, MRC Laboratory of Molecular Biology</institution>
, Francis Crick Avenue, Cambridge CB2 0QH,
<country>UK</country>
</aff>
<aff id="a38">
<label>38</label>
<institution>University of Texas, Southwestern Medical Center</institution>
, Dallas, Texas 75390,
<country>USA</country>
</aff>
<aff id="a39">
<label>39</label>
<institution>Departments of Biochemistry, Medical Biophysics and Molecular Genetics, University of Toronto</institution>
, Toronto, Ontario,
<country>Canada</country>
M5S 1A8</aff>
<aff id="a40">
<label>40</label>
<institution>Campbell Family Institute for Cancer Research, Ontario Cancer Institute/University Health Network</institution>
, Toronto, Ontario,
<country>Canada</country>
M5G 2M9</aff>
<aff id="a41">
<label>41</label>
<institution>IBMC—Instituto de Biologia Molecular e Celular and Instituto de Investigação e Inovação em Saúde, Universidade do Porto</institution>
, 4150 Porto,
<country>Portugal</country>
</aff>
<aff id="a42">
<label>42</label>
<institution>Université Grenoble Alpes/CNRS/CEA, Institut de Biologie Structurale</institution>
, 38027 Grenoble,
<country>France</country>
</aff>
<aff id="a43">
<label>43</label>
<institution>Department of Pharmaceutical Sciences, University of Maryland</institution>
, Baltimore, Maryland 21201,
<country>USA</country>
</aff>
<aff id="a44">
<label>44</label>
<institution>Howard Hughes Medical Institute and Harvard Medical School, Department of Cell Biology</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</aff>
<aff id="a45">
<label>45</label>
<institution>Cell Biology and Biophysics Unit, Porter Neuroscience Research Center, National Institute of Neurological Disorders and Stroke</institution>
, Bethesda, Maryland 20892,
<country>USA</country>
</aff>
<aff id="a46">
<label>46</label>
<institution>National Heart, Lung and Blood Institute</institution>
, Bethesda, Maryland 20892,
<country>USA</country>
</aff>
<aff id="a47">
<label>47</label>
<institution>Department of Biophysics and Howard Hughes Medical Institute, University of Texas Southwestern Medical Center</institution>
, Dallas, Texas 75390,
<country>USA</country>
</aff>
<aff id="a48">
<label>48</label>
<institution>Department of Pharmacology and Toxicology, Sealy Center for Structural Biology and Molecular Biophysics, University of Texas Medical Branch</institution>
, Galveston, Texas 77555,
<country>USA</country>
</aff>
<aff id="a49">
<label>49</label>
<institution>Department of Pharmacology, Yale University School of Medicine</institution>
, New Haven, Connecticut 06520,
<country>USA</country>
</aff>
<aff id="a50">
<label>50</label>
<institution>Department of Biology, Massachusetts Institute of Technology</institution>
, Cambridge, Massachusetts 02139,
<country>USA</country>
</aff>
<aff id="a51">
<label>51</label>
<institution>Department of BioSciences, Rice University</institution>
, Houston, Texas 77005,
<country>USA</country>
</aff>
<aff id="a52">
<label>52</label>
<institution>Department of Molecular Medicine, College of Veterinary Medicine, Cornell University</institution>
, Ithaca, New York 14853,
<country>USA</country>
</aff>
<aff id="a53">
<label>53</label>
<institution>Department of Molecular Microbiology, Washington University School of Medicine</institution>
, St Louis, Missouri 63110,
<country>USA</country>
</aff>
<aff id="a54">
<label>54</label>
<institution>Department of Pharmaceutical Sciences, College of Pharmacy, University of Kentucky</institution>
, Lexington, Kentucky 40536,
<country>USA</country>
</aff>
<aff id="a55">
<label>55</label>
<institution>Departments of Biochemistry and Radiation Oncology, University of Texas, Southwestern Medical Center</institution>
, Dallas, Texas 75390,
<country>USA</country>
</aff>
<aff id="a56">
<label>56</label>
<institution>Program in Cellular and Molecular Medicine, Boston Children's Hospital</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</aff>
<aff id="a57">
<label>57</label>
<institution>Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, Illinois, and Department of Computer Science, University of Chicago</institution>
, Chicago, Illinois 60637,
<country>USA</country>
</aff>
<aff id="a58">
<label>58</label>
<institution>Department of Bioengineering and Therapeutic Sciences, University of California San Francisco</institution>
, San Francisco, California 94158,
<country>USA</country>
</aff>
<aff id="a59">
<label>59</label>
<institution>Laboratory of Molecular Biophysics, Department of Cell and Molecular Biology, Uppsala University</institution>
, Husargatan 3 (Box 596), SE-751 24 Uppsala,
<country>Sweden</country>
</aff>
<aff id="a60">
<label>60</label>
<institution>NERSC, Lawrence Berkeley National Laboratory</institution>
, Berkeley, California 94720,
<country>USA</country>
</aff>
<aff id="a61">
<label>61</label>
<institution>Janelia Research Campus, Howard Hughes Medical Institute</institution>
, Ashburn, Virginia 20147,
<country>USA</country>
</aff>
<aff id="a62">
<label>62</label>
<institution>Program in Cellular and Molecular Medicine and Department of Pediatrics, Boston Children's Hospital</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</aff>
<aff id="a63">
<label>63</label>
<institution>Departments of Cell Biology, Harvard Medical School</institution>
, Boston, Massachusetts 02115,
<country>USA</country>
</aff>
<aff id="a64">
<label>64</label>
<institution>Department of Biology, University of Konstanz</institution>
, D-78457 Konstanz,
<country>Germany</country>
</aff>
<aff id="a65">
<label>65</label>
<institution>Institute for Quantitative Social Science, Harvard University</institution>
, Cambridge, Massachusetts 02138,
<country>USA</country>
</aff>
</contrib-group>
<author-notes>
<corresp id="c1">
<label>a</label>
<email>sliz@hkl.hms.harvard.edu</email>
</corresp>
</author-notes>
<pub-date pub-type="epub">
<day>07</day>
<month>03</month>
<year>2016</year>
</pub-date>
<pub-date pub-type="collection">
<year>2016</year>
</pub-date>
<volume>7</volume>
<elocation-id>10882</elocation-id>
<history>
<date date-type="received">
<day>16</day>
<month>10</month>
<year>2015</year>
</date>
<date date-type="accepted">
<day>28</day>
<month>01</month>
<year>2016</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright © 2016, Nature Publishing Group, a division of Macmillan Publishers Limited. All Rights Reserved.</copyright-statement>
<copyright-year>2016</copyright-year>
<copyright-holder>Nature Publishing Group, a division of Macmillan Publishers Limited. All Rights Reserved.</copyright-holder>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/">
<pmc-comment>author-paid</pmc-comment>
<license-p>This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit
<ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/4.0/">http://creativecommons.org/licenses/by/4.0/</ext-link>
</license-p>
</license>
</permissions>
<abstract>
<p>Access to experimental X-ray diffraction image data is fundamental for validation and reproduction of macromolecular models and indispensable for development of structural biology processing methods. Here, we established a diffraction data publication and dissemination system, Structural Biology Data Grid (SBDG; data.sbgrid.org), to preserve primary experimental data sets that support scientific publications. Data sets are accessible to researchers through a community driven data grid, which facilitates global data access. Our analysis of a pilot collection of crystallographic data sets demonstrates that the information archived by SBDG is sufficient to reprocess data to statistics that meet or exceed the quality of the original published structures. SBDG has extended its services to the entire community and is used to develop support for other types of biomedical data sets. It is anticipated that access to the experimental data sets will enhance the paradigm shift in the community towards a much more dynamic body of continuously improving data analysis.</p>
</abstract>
<abstract abstract-type="web-summary">
<p>
<inline-graphic id="i1" xlink:href="ncomms10882-i1.jpg"></inline-graphic>
The validation and analysis of X-ray crystallographic data is essential for reproducibility and the development of crystallographic methods. Here, the authors describe a repository for crystallographic datasets and demonstrate some of the ways it could serve the crystallographic community.</p>
</abstract>
</article-meta>
</front>
<body>
<p>As one of the most powerful tools in structural biology, X-ray crystallography allows determination of the structure (atomic coordinates) of proteins, nucleic acids, small molecule compounds and macromolecular complexes to atomic-level resolution. Crystallographic data continue to be a primary source of mechanistic understanding of macromolecules, the implications of which extend from basic research to translational studies and the rational design of therapeutics. Reflecting the significance of the technique, the number of published macromolecular crystal structures has rapidly grown to >100,000 and numerous investigators within structural biology have been awarded the Nobel Prize, including Drs. Kendrew, Perutz, Watson, Crick, Wilkins, Hodgkin, Klug, Deisenhofer, Michel, Huber, Walker, MacKinnon, Kornberg, Ramakrishnan, Steitz, Yonath and Kobilka.</p>
<p>To support the needs of a growing structural biology community, a global network of synchrotron beamlines
<xref ref-type="bibr" rid="b1">1</xref>
has been established and made available to researchers. These facilities remain the predominant source for crystallographic data collection. While the data collection process has become increasingly streamlined, deployment of a data management infrastructure to archive original diffraction images has been slow and uncertain
<xref ref-type="bibr" rid="b2">2</xref>
. With the exception of a modest number of data storage systems dedicated to the support of individual synchrotron beamlines
<xref ref-type="bibr" rid="b3">3</xref>
, or specific structural genomics projects
<xref ref-type="bibr" rid="b4">4</xref>
, storage of diffraction image data sets is typically the responsibility of primary investigators. Access to these original experimental data sets is therefore dependent on the policies of individual laboratories, which vary in storage organization, institutional resources, and researcher turnover. There is no universal archiving system to store X-ray diffraction data sets, and raw data sets are rarely made publicly available. In the cases where data sets are available, their distribution format can vary significantly. A typical data set of 360 images collected on modern detectors is 5 GB, and structure determination can involve one to tens of data sets, making the logistics of storing diffraction data for many protein structures a daunting task.</p>
<p>The benefits of easy and public access to experimental data are numerous
<xref ref-type="bibr" rid="b5">5</xref>
. Access to primary data would support community efforts to continuously improve existing models and identify new features through complete reprocessing of experimental data
<xref ref-type="bibr" rid="b6">6</xref>
<xref ref-type="bibr" rid="b7">7</xref>
<xref ref-type="bibr" rid="b8">8</xref>
with modern software tools and improved criteria
<xref ref-type="bibr" rid="b9">9</xref>
. Further, original data may provide a basis for validating questionable existing structures while mistakes in structure determination may be identified earlier
<xref ref-type="bibr" rid="b10">10</xref>
<xref ref-type="bibr" rid="b11">11</xref>
<xref ref-type="bibr" rid="b12">12</xref>
. Additionally, access to a diverse volume of raw data can be used to develop improved software to address limitations of existing programs. Finally, access to a collection of varied experimental data will undoubtedly benefit the training and education of practitioners. The Worldwide Protein Data Bank
<xref ref-type="bibr" rid="b13">13</xref>
<xref ref-type="bibr" rid="b14">14</xref>
(wwPDB) has illustrated how these achievements can be realized with the collection of reduced experimental data, in the form of structure factor amplitudes. Complementing this resource by preserving raw experimental data and making it available to a broad community promises a profound scientific impact in structural biology and other biomedical disciplines that face the challenges of preserving large data sets.</p>
<p>While the primary role of the SBGrid Consortium (www.sbgrid.org) has been to curate and support a collection of data processing software applications and to organize community-wide computing support
<xref ref-type="bibr" rid="b15">15</xref>
, SBGrid has also been active in the management of raw, experimental data sets. In 2012, SBGrid prototyped a system based on Globus technology
<xref ref-type="bibr" rid="b16">16</xref>
<xref ref-type="bibr" rid="b17">17</xref>
<xref ref-type="bibr" rid="b18">18</xref>
<xref ref-type="bibr" rid="b19">19</xref>
to move diffraction data between Harvard, the Advanced Photon Source and the Stanford Synchrotron Radiation Lightsource
<xref ref-type="bibr" rid="b19">19</xref>
.</p>
<p>To support the outstanding needs of the global structural community, we have established a publication system for experimental diffraction data sets that supports published structural coordinates: the Structural Biology Data Grid (SBDG). The SBDG project was initiated with a collection of X-ray diffraction image data sets as well as a few additional data set types contributed by many SBGrid Consortium laboratories. The collection supports a diverse subset of over 68 peer-reviewed publications and represents a sampling of numerous structure determination approaches. To evaluate the utility of such a data grid, we reprocessed all published diffraction data sets in this initial collection with modern software and compared the derived statistics against those reported in the original publications. We also demonstrate that by integrating the storage resources of multiple research groups and institutions, the data grid is poised to deliver a novel community driven data preservation system to support various types of structural biology and biomedical data sets.</p>
<sec disp-level="1">
<title>Results</title>
<sec disp-level="2">
<title>Structural biology data grid</title>
<p>The SBDG is a centralized data publication service—a repository for discovering, downloading and depositing large structural biology data sets. We developed the SBDG to support the need of the SBGrid community to archive and disseminate X-ray diffraction image data sets, that is, images recorded on X-ray detectors, which support published structures. More than 90% of SBGrid laboratories use X-ray crystallography in their research, and SBGrid investigators have contributed over 11,000 X-ray structures to the PDB. The SBDG complements the PDB, which archives derived data—merged and post-refined data from diffraction images and the resulting refined coordinates of macromolecular structural models. The data grid has been developed in collaboration with the Data Science team at Harvard's Institute for Quantitative Social Science, and it conforms to progressive data science standards (
<xref ref-type="table" rid="t1">Table 1</xref>
). The SBDG limits its collection to data sets that support journal publications, referred to as ‘primary data’. For X-ray diffraction data, this primary data consists of experimental diffraction images supporting a derived structural model and journal publication. Release of this primary data by the SBDG coincides with publication of the resulting manuscript and, for structural biology data sets, with release of the related PDB files. As of 1 September 2015, the SBDG stores a diverse collection of 117 data sets, including 111 X-ray diffraction data sets and a handful of other data types, including computational decoys and data sets from MicroED, lattice light-sheet microscopy and molecular dynamics (
<xref ref-type="supplementary-material" rid="S1">Supplementary Table 1</xref>
). These published data sets, contributed by 50 laboratories with diffraction data sets collected at 11 synchrotron facilities (
<xref ref-type="fig" rid="f1">Fig. 1</xref>
) and several home sources, gave rise to 94 structures and 68 journal publications. The X-ray diffraction data sets range in size from 126 MB (ref.
<xref ref-type="bibr" rid="b20">20</xref>
) to 20 GB (ref.
<xref ref-type="bibr" rid="b21">21</xref>
) with a mean of 4.9 GB and a total of 573 GB of storage. Extrapolating from this initial collection, which is quite diverse and registers at just over 0.5 TB, our current 100 TB file system could immediately support roughly twenty thousand X-ray diffraction data sets (
<xref ref-type="fig" rid="f2">Fig. 2</xref>
).</p>
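<p>The extrapolation above can be checked with a few lines of arithmetic. The following is a minimal sketch and not part of the SBDG software; it assumes decimal units (1 TB = 1,000 GB) and uses only the figures quoted in the text (a 4.9 GB mean data set size and a 100 TB file system). The function and variable names are illustrative.</p>
<preformat>
```python
def datasets_supported(capacity_tb, mean_size_gb):
    """Estimate how many data sets of a given mean size fit on a file system."""
    capacity_gb = capacity_tb * 1000  # assumption: decimal units, 1 TB = 1,000 GB
    return int(capacity_gb // mean_size_gb)


MEAN_SIZE_GB = 4.9     # mean data set size reported in the text
FILE_SYSTEM_TB = 100   # file system capacity reported in the text

estimate = datasets_supported(FILE_SYSTEM_TB, MEAN_SIZE_GB)
print(f"Approximate capacity: {estimate} data sets")
```
</preformat>
<p>With these inputs the estimate is a little over 20,000 data sets, consistent with the figure of roughly twenty thousand given above; the real capacity would depend on how the size distribution of future deposits differs from the pilot collection.</p>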
<p>The SBDG's collection of data sets can be accessed from the data.sbgrid.org website. On the home page, deposited data sets are organized into laboratory and institutional collections (
<xref ref-type="fig" rid="f3">Fig. 3a</xref>
). Hyperlinked collection pages provide a list of selected data sets along with the data set's corresponding data Digital Object Identifier (DOI), a link to the journal publication, the PDB ID, a link to the PDB entry, and a link to the depositors' laboratory website. The website molecular viewer, PV
<xref ref-type="bibr" rid="b22">22</xref>
, offers visitors an option to view structures in a manipulatable cartoon representation (
<xref ref-type="fig" rid="f3">Fig. 3b</xref>
). With multiple high-quality viewing options and flexible search functionality, users of the SBDG website can easily identify a small subset of relevant data sets.</p>
<p>Persistent data set pages are an important element for any research data repository because they typically provide a landing URL, which resolves from a given DOI
<xref ref-type="bibr" rid="b23">23</xref>
. The SBDG does not advertise unique codes, but instead distinguishes data sets by fully qualified DOIs. From each SBDG collection page or viewer page a user can access those unique Data set Pages (
<xref ref-type="fig" rid="f4">Fig. 4</xref>
), which offer additional information for each data set including download instructions and the fully formatted data set citation for inclusion in manuscripts, following best practices set by the Joint Declaration of Data Citation Principles
<xref ref-type="bibr" rid="b24">24</xref>
. A Data set Page can also be located by searching the SBDG for a PDB code, although often several related data sets are used to determine a single set of macromolecular coordinates. As the Data Grid is developed, the Data set Pages will include additional functionality, with more information on how to reprocess data sets, extended data statistics, and discussion forums allowing users to annotate data sets after publication. Taken together, the uniquely defined Data set Pages provide a comprehensive and persistent location for individual data sets.</p>
</sec>
<sec disp-level="2">
<title>Data set access</title>
<p>All data sets in the SBDG are readily and freely accessible to the community. Access rights were formalized with adoption of the Creative Commons Zero (CC0) licence, which supports dedication of research results to the public domain and is used by many open-data projects. This licence allows use and redistribution of data for both commercial and non-commercial purposes without requiring additional agreements. The CC0 licence does not affect patent or trademark rights of contributors, and is similar to the licensing terms that are used for macromolecular models released by the wwPDB.</p>
<p>Although data sets can be downloaded individually, their size can make this cumbersome. Physical access to SBDG data sets is facilitated through a data grid infrastructure that is supported by members of the data access alliance (DAA;
<xref ref-type="fig" rid="f5">Fig. 5a</xref>
). The DAA is a voluntary and open organization of research-data-storage providers and is being developed in collaboration with the Globus Project. The DAA has two aims: (1) to minimize the chance of data loss by replicating SBDG data sets, and (2) to facilitate global data access through its members. Although it is expected that DAA membership and architecture will evolve rapidly, in its current state the DAA framework already provides a global solution for data dissemination. DAA centres in Europe, Asia, North America and South America replicate the entire SBDG collection and provide local access to members of regional communities. There are four DAA centres: Harvard Medical School in the USA, Uppsala University in Sweden, Shanghai Institutes for Biological Sciences in China, and Institut Pasteur de Montevideo in Uruguay. As a secondary service, DAA centres can provide local, direct access to data sets for their institutional research groups. For example, Harvard Medical School hosts the entire collection and provides direct access to all data from its computing center. The DAA infrastructure is further extended by the DAA satellites, which replicate fractions of SBDG data sets in their local storage for direct access by members of individual institutions. This mode of participation provides an attractive option for research institutions to develop local archives of all primary data generated by the local community. For example, the NE-CAT (Northeastern Collaborative Access Team; sector 24-ID) synchrotron beamline at the Advanced Photon Source, in Argonne, IL, replicates all SBDG data sets that originate from NE-CAT beamlines and makes them available to beamline staff and users. Another SBGrid member and DAA Satellite, Yale University, replicates all data sets from Yale laboratories on its institutional storage and makes them accessible to structural biology workstations through the Network File System. 
We expect that, as research storage infrastructure catches up with the capacities required to archive larger collections of diffraction data sets, some DAA satellites will elect to replicate a larger fraction of SBDG archives and make them available to the general community.</p>
<p>While the DAA offers a variety of data access options that will support growth of the repository, members of the community can also download individual data sets directly from SBGrid servers at Harvard using the rsync protocol. Instructions for downloading individual data sets are provided on the Data set View Pages, and effectively all data sets can be downloaded using the following command: ‘rsync -av rsync://data.sbgrid.org/DOI .’, where DOI is the digital object identifier for a particular data set and the trailing ‘.’ is the destination directory. The rsync utility, which is native to Linux and OS X systems, is particularly suitable for downloading large data files and can be restarted in case of interruption. After download, the data integrity of individual data sets can be verified by following instructions on the Data Grid website. With a well-defined and permissive CC0 access licence and multiple channels for accessing data (four DAA sites and the rsync download mechanism), our initial infrastructure is well suited to support expansion of the data collection.</p>
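<p>For scripted downloads, the rsync invocation described above can be assembled programmatically. The following is a minimal sketch under stated assumptions: the helper name build_rsync_command is hypothetical, the identifier shown is only a placeholder, and the destination directory (here the current directory, ‘.’) is passed as a separate argument. The sketch only builds and prints the command; it does not contact the server.</p>
<preformat>
```python
import shlex

SBDG_RSYNC_HOST = "data.sbgrid.org"  # host named in the text


def build_rsync_command(doi, destination="."):
    """Return the argument list for downloading one data set with rsync."""
    source = f"rsync://{SBDG_RSYNC_HOST}/{doi}"
    return ["rsync", "-av", source, destination]


# "10.15785/SBGRID/1" is a placeholder identifier used only for illustration.
command = build_rsync_command("10.15785/SBGRID/1")
print(shlex.join(command))
# To start the transfer, pass the list to subprocess.run(command, check=True).
```
</preformat>
<p>Because rsync skips files that are already complete, re-running the same command after an interruption resumes the download rather than starting over.</p>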
</sec>
<sec disp-level="2">
<title>Data publication cycle</title>
<p>For many SBGrid laboratories, interest in data deposition is driven by a desire to better organize research data and comply with institutional, federal, and project-specific data preservation requirements. During the pilot phase, data deposition privileges were limited to SBGrid member laboratories. With recent funding to further support the project, the Data Grid is now open to the entire structural biology community. Non-SBGrid groups would first need to register with the SBDG to obtain proper deposition credentials.</p>
<p>Wide adoption of data preservation systems is often hindered by the complexities involved in the data deposition process itself. To mitigate this problem, SBDG deposition involves two simple steps: registration and uploading (
<xref ref-type="fig" rid="f5">Fig. 5b</xref>
). To register a data set, the depositor completes a web form with basic information about the sample, data collection facility, related objects (for example, publication, PDB code), and authorship; this information is mapped to the DataCite schema (
<xref ref-type="fig" rid="f6">Fig. 6</xref>
). Many details necessary for data set reprocessing—beam center, distance, wavelength, and so on—are automatically included with most data sets in the form of an image header generated by the data collection software at the time of collection, simplifying the registration process. A principal investigator is authorized to sponsor depositions as a recognized member of the community and must approve each deposit. This system allows maximum flexibility when accepting data for deposition, facilitating the upload of complex data sets that otherwise could be challenging to validate. Following registration, a DOI is reserved for the data set and the user is provided with data transfer instructions. Data deposition is handled by an automated script provided by SBDG and run on the depositor's computer, which uploads the data and checks for data integrity after upload. Upon verification, the primary data are either released in the bi-weekly SBDG release or placed on hold. As with the PDB, release of data placed on hold will coincide with publication.</p>
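<p>The integrity check performed after upload can be illustrated with a checksum comparison. The sketch below is not the SBDG deposition script: the text does not state which algorithm that script uses, so SHA-256 is an assumption, and the function names are hypothetical. A digest is recorded before transfer and compared with the digest of the received file.</p>
<preformat>
```python
import hashlib
import os
import tempfile


def sha256_of_file(path, chunk_size=1024 * 1024):
    """Compute the SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_file(path, expected_digest):
    """Return True when the file's digest matches the recorded one."""
    return sha256_of_file(path) == expected_digest


# A small temporary file stands in for a diffraction image.
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(b"example diffraction image bytes")
    image_path = tmp.name

recorded = sha256_of_file(image_path)       # digest recorded before transfer
intact = verify_file(image_path, recorded)  # comparison after transfer
os.remove(image_path)
print("intact" if intact else "corrupted")
```
</preformat>
<p>Reading in fixed-size chunks keeps memory use constant, which matters for data sets of several gigabytes.</p>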
<p>The two-step publication process is complemented by behind-the-scenes data replication, DOI registrations, and data analysis. All X-ray diffraction images are currently post-processed using data processing pipelines that provide a post-publication data review that will be shared with depositors and the community in the next phase of the SBDG project. We are building additional tools to help increase data deposition rates, including automatic reminders sent to consortium members to encourage them to deposit data for previously published work.</p>
</sec>
<sec disp-level="2">
<title>Data citation</title>
<p>Research data are the legitimate and citable product of research
<xref ref-type="bibr" rid="b24">24</xref>
<xref ref-type="bibr" rid="b25">25</xref>
and, therefore, the SBDG recommends that depositors and data users cite all data deposited with the SBDG in the standard reference section of their manuscripts following well established community standards
<xref ref-type="bibr" rid="b24">24</xref>
<xref ref-type="bibr" rid="b26">26</xref>
<xref ref-type="bibr" rid="b27">27</xref>
. Data citation examples are provided on individual data set pages (
<xref ref-type="fig" rid="f4">Fig. 4</xref>
). The SBDG complements our AppCiter application
<xref ref-type="bibr" rid="b28">28</xref>
, which facilitates citation of research software. Both services are now presented to users in a unified publication support workflow (
<xref ref-type="fig" rid="f7">Fig. 7a</xref>
). In step 1, the user deposits research-related data that are put on hold until publication. A set of DOIs and corresponding data citations are then generated and provided to the end-user. Users can also use AppCiter to generate a list of software citations for all scientific software used in the project. In step 2, all research data and scientific software citations are included in the References section of the manuscript. In step 3 the user, anticipating manuscript publication, contacts relevant databases to request release of the primary and supporting data. This process should, ideally, take place before manuscript publication and be timed to coincide with the publication date, allowing the community to access the data when the manuscript is released. When preparing future publications that refer to completed structures, scientists should reference the relevant publications and macromolecular models, unless they are referring to a specific data set. For specific data sets, authors should explicitly reference experimental data using the corresponding data citation (
<xref ref-type="fig" rid="f7">Fig. 7b</xref>
). Citation metrics for published data sets will be comparable to those obtained for journal publications.</p>
</sec>
<sec disp-level="2">
<title>Data grid content</title>
<p>Ease of data deposition and community-wide interest facilitated growth of the initial collection of X-ray diffraction data sets when it opened to the SBGrid community in May 2015. The data sets deposited during the pilot collection phase represent a wide cross-section of structures and a diverse subset of journal articles and structure determination methods. For example, 68 structures derived from data deposited in the SBDG have been determined by molecular replacement, while 4 have been solved by Multiple-wavelength Anomalous Diffraction, 4 by Single Isomorphous Replacement with Anomalous Scattering and 15 by Single-wavelength Anomalous Diffraction. The highest resolution data set
<xref ref-type="bibr" rid="b29">29</xref>
extended to 1.04 Å, and the lowest resolution data set
<xref ref-type="bibr" rid="b30">30</xref>
to 5.5 Å. The structures ranged in molecular weight from 8.1 (ref.
<xref ref-type="bibr" rid="b31">31</xref>
) to 426 kDa (ref.
<xref ref-type="bibr" rid="b32">32</xref>
). The solvent content of these structures ranged from 32 (ref.
<xref ref-type="bibr" rid="b33">33</xref>
) to 85% (ref.
<xref ref-type="bibr" rid="b34">34</xref>
) and the longest unit cell edge was reported to be 525.29 Å (ref.
<xref ref-type="bibr" rid="b35">35</xref>
).</p>
<p>As a proof of concept, released data sets in the SBDG were reprocessed with
<italic>xia2</italic>
(refs
<xref ref-type="bibr" rid="b36">36</xref>
,
<xref ref-type="bibr" rid="b37">37</xref>
,
<xref ref-type="bibr" rid="b38">38</xref>
,
<xref ref-type="bibr" rid="b39">39</xref>
,
<xref ref-type="bibr" rid="b40">40</xref>
,
<xref ref-type="bibr" rid="b41">41</xref>
,
<xref ref-type="bibr" rid="b42">42</xref>
) in a fully automated manner (
<xref ref-type="fig" rid="f8">Fig. 8a</xref>
). In all, 90 of the 110 released data sets with a corresponding PDB ID were successfully reprocessed. Of those 90 data sets, 86 represented high-resolution, native data, and for 51 of those
<italic>xia2</italic>
decision making determined a high-resolution limit within 0.1 Å of the published structure (
<xref ref-type="fig" rid="f8">Fig. 8b</xref>
). The point group determined by reprocessing agreed with that of the published structure in 79 cases; for 65 of these the space groups agreed. The lower degree of recovery of space groups, in comparison to point groups, is attributed to ambiguity in screw axis determination at this stage of data processing. To provide insight into the most common failure modes, data sets for which
<italic>xia2</italic>
did not produce a set of integrated intensities were investigated using iMOSFLM
<xref ref-type="bibr" rid="b43">43</xref>
. Twelve of the failure cases could be attributed to absent or inaccurate information in the image headers: while accuracy of the beam center annotation varied within the pilot collection (
<xref ref-type="fig" rid="f8">Fig. 8c</xref>
), 10 data sets had visibly incorrect beam center information and two had missing header information. The cause of failure for the eight remaining data sets was not definitively determined from the data sets alone; however, consulting the reprocessing instructions provided by depositors clarified this for five of these data sets. The reprocessing instructions also suggested that, for many of the data sets for which
<italic>xia2</italic>
produced integrated intensities but with resolution or symmetry disagreeing with the deposited structure, the disagreement could be attributed to incorrect header information. One outlying reprocessing case for which a significantly higher resolution was determined than originally reported was also investigated. For this case, one of four reprocessing attempts for the data set reported a resolution higher than that supported by merging statistics. This discrepancy was resolved by a software update.</p>
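<p>The reprocessing outcome reported above can be summarized as simple proportions. The following sketch uses only the counts stated in the text; expressing the point-group and space-group agreement as fractions of the 90 reprocessed data sets is an illustrative choice, and the variable names are hypothetical.</p>
<preformat>
```python
released_with_pdb_id = 110   # released data sets with a corresponding PDB ID
reprocessed = 90             # successfully reprocessed
header_failures = 10 + 2     # incorrect beam center + missing header information
undetermined_failures = 8    # cause not determined from the data sets alone

failures = released_with_pdb_id - reprocessed
# The two failure categories account for every failed data set.
assert failures == header_failures + undetermined_failures


def percent(part, whole):
    """Express part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


summary = {
    "reprocessed": percent(reprocessed, released_with_pdb_id),
    "point group agrees": percent(79, reprocessed),
    "space group agrees": percent(65, reprocessed),
    "failures due to headers": percent(header_failures, failures),
}
for label, value in summary.items():
    print(f"{label}: {value}%")
```
</preformat>
<p>On these counts, header problems account for 12 of the 20 failures, which is why accurate image-header metadata is emphasized for deposition.</p>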
<p>In addition to estimates of the Bragg intensities, diffraction images can also be analysed for additional features
<xref ref-type="bibr" rid="b44">44</xref>
. A well-known example is the isotropic solvent ring that generally appears at ∼3–4 Å resolution
<xref ref-type="bibr" rid="b45">45</xref>
. However, diffraction images also contain anisotropic diffuse scattering signals under and between the Bragg peaks that derive from two-point correlations of electron density fluctuations
<xref ref-type="bibr" rid="b7">7</xref>
. Analysis of this diffuse scattering could therefore provide information about protein, nucleic acid, and lipid structural dynamics and correlated motions, potentially leading to new mechanistic insights
<xref ref-type="bibr" rid="b46">46</xref>
or to validating sampling schemes and energy functions for molecular dynamics simulations
<xref ref-type="bibr" rid="b47">47</xref>
. One data set on the model enzyme Cyclophilin A is currently deposited (
<xref ref-type="table" rid="t2">Table 2</xref>
) to be used as a ‘gold standard' to compare the influence of temperature on data collection
<xref ref-type="bibr" rid="b48">48</xref>
and to assess consistency between X-Ray Free-Electron Laser (XFEL) and synchrotron data
<xref ref-type="bibr" rid="b49">49</xref>
. This data set can now also be analysed for diffuse scattering features, which could distinguish between models of correlated motion suggested by NMR experiments.</p>
</sec>
<sec disp-level="2">
<title>X-ray diffraction reference subset and other collections</title>
<p>To take advantage of data grid diversity, we have selected a small subset of cases that could be used to support software development and teaching of data processing and diverse structure determination techniques (
<xref ref-type="table" rid="t2">Table 2</xref>
). This subset includes high-resolution (1.2 Å), low-resolution (4.5 and 7.0 Å), anisotropic and twinned data sets. Additionally, data sets that supported a variety of experimental phasing approaches (for example, phasing with selenium, zinc, uranium, barium/potassium) and molecular replacement cases (for example, with a 9 Å electron microscopy (EM) envelope) are included. The subset also incorporates diffraction data for crystals grown in lipidic cubic phase and an example of multi-crystal averaging.</p>
<p>Additionally, SBDG is suited to support various other primary data types that are being generated by members of the consortium, and those pilot collections will seed development of community-wide data analysis systems. MicroED is a promising new technique
<xref ref-type="bibr" rid="b50">50</xref>
<xref ref-type="bibr" rid="b51">51</xref>
and inclusion of the early microcrystal data sets might stimulate the community to explore this technique and to fine-tune data processing software. The pilot collection includes three MicroED data sets that were used to determine structures of the toxic core of α-synuclein
<xref ref-type="bibr" rid="b52">52</xref>
, catalase
<xref ref-type="bibr" rid="b53">53</xref>
and lysozyme
<xref ref-type="bibr" rid="b54">54</xref>
. Other types of data sets in our pilot collection include a 55 GB computational decoy data set for 55 complexes with associated HADDOCK scores
<xref ref-type="bibr" rid="b55">55</xref>
, a 2 μs Desmond
<xref ref-type="bibr" rid="b56">56</xref>
MD trajectory
<xref ref-type="bibr" rid="b57">57</xref>
, and a recently collected Lattice Light-Sheet Microscopy
<xref ref-type="bibr" rid="b58">58</xref>
<xref ref-type="bibr" rid="b59">59</xref>
data set with in-vivo imaging of zebrafish embryos
<xref ref-type="bibr" rid="b60">60</xref>
. Here, engagement with domain experts and the respective communities will also be required to establish data validation pipelines and effective DAA distribution models.</p>
</sec>
</sec>
<sec disp-level="1">
<title>Discussion</title>
<p>We have developed a flexible data publication system, the SBDG, to support deposition of a variety of large primary data sets. The data repository complements the wwPDB efforts by preserving the raw data that supports PDB-deposited structure models. The pilot phase of the project, which was limited to SBGrid laboratories, demonstrated both feasibility and strong participation, with the deposition and publication of 117 data sets (as of 1 September 2015, collected over 3 months). To support annotated data collection, we have established data processing pipelines that will evolve the post-deposition data-analysis process. For example, the pipeline presented in the results section allows depositors and SBDG curators to quickly identify image-header problems, and parameters that are refined or corrected will be included in the expanded Dataverse schema
<xref ref-type="bibr" rid="b61">61</xref>
<xref ref-type="bibr" rid="b62">62</xref>
<xref ref-type="bibr" rid="b63">63</xref>
<xref ref-type="bibr" rid="b64">64</xref>
. The outliers and failures of the current reprocessing pipeline illustrate areas of potential improvement to metadata accuracy and the pipeline itself. Data depositors and other community members will be able to provide data annotations to assist with the convergence of this process. Access to this growing collection of X-ray diffraction data sets will support the proposed paradigm shift in the community
<xref ref-type="bibr" rid="b6">6</xref>
from the static archive towards a much more dynamic body of continuously improving refined models.</p>
<p>Despite being in the age of ‘big data science', universal storage of large, biomedical data sets is an issue that has not yet been resolved, as infrastructure and support responsibilities have not been well defined. Shifting the burdens of data management from individual research groups and institutions to global infrastructures is an effective and economical strategy to address this issue that has previously been proven successful by the wwPDB and would now be demonstrated by the SBDG. By virtue of the consortium's global presence, SBDG is well positioned to stimulate community-wide participation. SBGrid may facilitate integration of the data grid with regional projects and facility-related efforts to preserve primary diffraction data sets. This data distribution model is similar to those established in other fields. For example, the Data Preservation Alliance (
<ext-link ext-link-type="uri" xlink:href="http://www.data-pass.org">www.data-pass.org</ext-link>
) replicates and indexes quantitative data for the social sciences. Data collected at the Large Hadron Collider are made available under a multi-tier processing and storage framework. As a large international consortium backed by diverse funding mechanisms and DAA storage contributions of its members, SBGrid is uniquely capable of bypassing grant limitations that would otherwise deter such a long-term global infrastructure effort. Given recently secured funding to support data curation and technology integration under the Dataverse research data management, and with gradual community investment, SBDG is poised to scale up to support the entire community.</p>
<p>While access to experimental data is critical to ensuring research reproducibility, metadata quality is also crucial. Data sets that are poorly annotated are of limited use to the research community. With a focus on deployment of a sustainable and flexible data management infrastructure, the SBDG takes a unique approach to metadata preservation. The repository employs an accommodating DataCite schema, which preserves basic information about experiments. The depositions are self-moderated by contributing laboratories, with data publication subject to approval of the principal investigators. As our results demonstrated, this approach worked well for the vast majority of data sets deposited in the SBDG: 82% were automatically reprocessed with current data processing software, and the majority of the remaining data sets could be easily reprocessed manually. This success rate for reprocessing diffraction data sets was achieved without any explicit quality control to ensure that the data sets contained sufficient information for reprocessing; in other words, image headers were the only source of experimental (geometry and detector) parameters. Two possibilities under consideration for maintaining and improving this success rate are allowing depositors to annotate updated experimental parameters (for example, beam center) and adding explicit checks for metadata required for reprocessing prior to data publication. To facilitate interoperability with other projects and further stimulate uniform data evaluations, we will work in parallel to develop tools that will support download of archived data sets in community-accepted master formats supporting intrinsic metadata, such as OME-TIFF or HDF5. This process will allow annotation of downloadable data sets with additional information from analysis pipelines, and will be guided by feedback from projects that interface with SBDG. 
Ideally, publication of data sets will encourage the communities to adopt standardized formats and ensure complete population of experimental metadata with adequate accuracy to support reprocessing.</p>
<p>While the SBDG immediately serves the well-defined area of X-ray crystallography, our pilot project has demonstrated that our infrastructure can preserve additional data types, such as decoy data sets for NMR computations or MicroED data sets. SBDG will duplicate XFEL data sets that are currently accessible through the Coherent X-ray Imaging Data Bank (
<ext-link ext-link-type="uri" xlink:href="http://www.cxidb.org/">http://www.cxidb.org/</ext-link>
) and support their distribution by DAA. In addition, SBDG will collaborate with MicroED and XFEL collection curators who will moderate development of community-driven efforts to automate data analysis pipelines to parallel automatic processing of X-ray diffraction data sets with packages like DIALS or
<italic>xia2</italic>
. We envision that the tools and technologies that arise from this project will ultimately lead to the development of a fully featured, primary data publication system. Features of such a system would include the capability of supporting a variety of experimental data types and automatic incorporation of pertinent data set information during data collection at local, regional and national facilities. The integration of primary data management with a base set of scientific software enables repositories to progress towards dynamically improving sources of knowledge, as well as providing an integrated computing environment for ongoing research.</p>
<p>In summary, we have presented the SBDG, a new system for the preservation and publication of large experimental data sets. The system is the latest product of SBGrid's mission to maintain a community-wide research-software infrastructure. Through disclosure, adoption, transparency, management of external dependencies, permissible licensing, and technical protection mechanisms, the SBDG is committed to compliance with evolving community standards of data preservation. We expect that the widespread sharing of experimental data will support methods development and will ultimately lead to better quality of structural models that are subject to continuous methods improvement.</p>
</sec>
<sec disp-level="1">
<title>Methods</title>
<sec disp-level="2">
<title>Current implementation</title>
<p>The databank deposition process involves five stages: (1) recording associated metadata, (2) local checksum calculation, (3) data transfer, (4) post-transfer verification and (5) public identifier registration.</p>
<p>A publicly accessible web frontend is used for handling user interactions with the databank. Built using the Python-based Django web framework, this frontend runs on an Ubuntu 14 LTS Server with a PostgreSQL 9.3 database. It collects the necessary metadata during deposition and informs the backend systems about deposition requests. A cryptographic checksum (SHA1 SUM, FIPS 180-4) is calculated before data transfer, so that any later change to the data set can be detected. Data transfer is handled by rsync over ssh. Once data transfer is complete, the databank verifies that the data set has been transferred uncorrupted, or reports a problem with the data set. If necessary, extraneous files (intermediate data files, processing or transfer scripts) are removed, data files are uncompressed and checksums re-computed. In the event of any modifications to the data set, an unmodified copy is stored in an offline file system. Upon data set release, the DOI reserved during data set registration is registered using the recorded metadata, and the data set (including checksum information) is made available for download over anonymous rsync.</p>
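<p>The checksum and post-transfer verification stages can be illustrated with a minimal sketch. It uses only the Python standard library; the function names and the manifest layout (relative path mapped to SHA-1 hex digest) are illustrative assumptions, not the databank's actual implementation.</p>
<preformat preformat-type="code"><![CDATA[
```python
import hashlib
from pathlib import Path


def sha1_of_file(path, chunk_size=1024 * 1024):
    """Return the SHA-1 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha1()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(dataset_dir):
    """Map each file's path (relative to dataset_dir) to its SHA-1 digest.

    Hypothetical helper: run on the depositor's side before transfer.
    """
    root = Path(dataset_dir)
    return {
        path.relative_to(root).as_posix(): sha1_of_file(path)
        for path in sorted(root.rglob("*"))
        if path.is_file()
    }


def verify_manifest(dataset_dir, manifest):
    """Return the sorted relative paths that are missing or whose digest differs.

    Hypothetical helper: run on the receiving side after transfer.
    An empty list means every file in the manifest arrived unchanged.
    """
    current = build_manifest(dataset_dir)
    return sorted(
        name for name, digest in manifest.items() if current.get(name) != digest
    )
```
]]></preformat>
<p>In this sketch the manifest is computed once before transfer and checked again afterwards; a non-empty result from <monospace>verify_manifest</monospace> corresponds to the case in which the databank reports a problem with the data set.</p>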
</sec>
<sec disp-level="2">
<title>Metadata schema</title>
<p>DOIs are issued through the EZID service, via the Harvard University Library and the California Digital Library. Metadata are organized following the DataCite schema (
<xref ref-type="fig" rid="f6">Fig. 6</xref>
).</p>
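<p>As a rough illustration of what a minimal DataCite-style record might contain, the sketch below assembles the core descriptive properties (identifier, creators, title, publisher, publication year) for one pilot data set, using values taken from its citation in the reference list. The XML namespace and any SBDG-specific fields are omitted, and the helper names are hypothetical, so this should not be read as the exact record registered by SBDG.</p>
<preformat preformat-type="code"><![CDATA[
```python
import xml.etree.ElementTree as ET

# Core descriptive properties checked by this sketch (namespace omitted).
REQUIRED = ("identifier", "creators", "titles", "publisher", "publicationYear")


def datacite_record(doi, creators, title, publisher, year):
    """Build a minimal DataCite-style <resource> element (illustrative only)."""
    resource = ET.Element("resource")
    identifier = ET.SubElement(resource, "identifier", identifierType="DOI")
    identifier.text = doi
    creators_el = ET.SubElement(resource, "creators")
    for name in creators:
        creator = ET.SubElement(creators_el, "creator")
        ET.SubElement(creator, "creatorName").text = name
    titles = ET.SubElement(resource, "titles")
    ET.SubElement(titles, "title").text = title
    ET.SubElement(resource, "publisher").text = publisher
    ET.SubElement(resource, "publicationYear").text = str(year)
    return resource


def missing_fields(resource):
    """Return the names of core properties absent from the record."""
    return [tag for tag in REQUIRED if resource.find(tag) is None]


# Values taken from the data set citation in the reference list.
record = datacite_record(
    doi="10.15785/SBGRID/137",
    creators=["Lee, D.", "Raman, C."],
    title="X-Ray Diffraction data for: Escherichia coli DOS Br complex. PDB Code 1V9Z",
    publisher="Structural Biology Data Grid",
    year=2015,
)
print(ET.tostring(record, encoding="unicode"))
print("missing:", missing_fields(record))
```
]]></preformat>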
</sec>
<sec disp-level="2">
<title>Reprocessing details</title>
<p>Data sets that had been publicly released by 1 September 2015 were reprocessed by
<italic>xia2</italic>
in a fully automated manner. For each data set, four attempts were made to reprocess using options ‘-2d', ‘-3d', ‘-3dii' and ‘-dials', using MOSFLM, XDS, XDS (indexing with peaks from all images) and DIALS, respectively. AIMLESS
<xref ref-type="bibr" rid="b37">37</xref>
and POINTLESS
<xref ref-type="bibr" rid="b38">38</xref>
were used by
<italic>xia2</italic>
for space group determination. A data set was considered successfully reprocessed if any of these attempts succeeded, and comparisons to the originally published structure were done with the best matching result. Investigation of unsuccessfully reprocessed data sets was performed using iMOSFLM
<xref ref-type="bibr" rid="b41">41</xref>
. This investigation was performed ‘blinded' to the reprocessing instructions provided by depositors, in order to better investigate the limits of relying solely on diffraction images.</p>
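<p>The reprocessing logic described above can be sketched as follows. The four option strings are those named in the text; the command-line form (option followed by an image directory), the helper names, and the use of the high-resolution limit to pick the ‘best matching' result are assumptions made for illustration, since the text does not define how the best match was chosen.</p>
<preformat preformat-type="code"><![CDATA[
```python
# xia2 options named in the text, with the processing engine each one selects.
PIPELINES = {
    "-2d": "MOSFLM",
    "-3d": "XDS",
    "-3dii": "XDS (indexing with peaks from all images)",
    "-dials": "DIALS",
}


def build_commands(image_dir):
    """Return one argument list per attempt.

    Assumed invocation form: xia2 OPTION IMAGE_DIR. The commands are only
    constructed here, not executed.
    """
    return [["xia2", option, image_dir] for option in PIPELINES]


def successfully_reprocessed(attempt_ok):
    """A data set counts as reprocessed if any of the four attempts succeeded.

    attempt_ok maps an option string to True/False; missing options count
    as failures.
    """
    return any(attempt_ok.get(option, False) for option in PIPELINES)


def best_match(resolutions, published_resolution):
    """Pick the attempt whose high-resolution limit is closest to the published one.

    resolutions maps an option string to a resolution in angstroms, or None
    for a failed attempt. Assumption: closeness in resolution stands in for
    the unspecified 'best matching' criterion.
    """
    candidates = {opt: res for opt, res in resolutions.items() if res is not None}
    if not candidates:
        return None
    return min(candidates, key=lambda opt: abs(candidates[opt] - published_resolution))
```
]]></preformat>
<p>For example, <monospace>build_commands("/data/example")</monospace> yields four argument lists, one per option, which could be passed to a process launcher on a system where <italic>xia2</italic> is installed.</p>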
</sec>
<sec disp-level="2">
<title>Data alliance</title>
<p>Released data sets are distributed to Data Alliance mirror sites using the same mechanism as individual data set distribution. Data set checksums allow the accuracy of each transfer to be verified. Users can select a mirror site by picking an appropriate rsync URL for data download.</p>
</sec>
</sec>
<sec disp-level="1">
<title>Additional information</title>
<p>
<bold>How to cite this article:</bold>
Meyer, P. A.
<italic>et al.</italic>
Data publication with the structural biology data grid supports live analysis.
<italic>Nat. Commun.</italic>
7:10882 doi: 10.1038/ncomms10882 (2016).</p>
</sec>
<sec sec-type="supplementary-material" id="S1">
<title>Supplementary Material</title>
<supplementary-material id="d33e18" content-type="local-data">
<caption>
<title>Supplementary Information</title>
<p>Supplementary Table 1.</p>
</caption>
<media xlink:href="ncomms10882-s1.pdf"></media>
</supplementary-material>
</sec>
</body>
<back>
<ack>
<p>Development of the Structural Biology Data Grid is funded by The Leona M. and Harry B. Helmsley Charitable Trust 2016PG-BRI002 to PS and MC. Development of citation workflows is supported by NSF 1448069 (to PS). DAA is being developed as a pilot project of the National Data Service, with additional funds to support storage and technology development, including NIH P41 GM103403 (NE-CAT) and 1S10RR028832 (HMS) and DOE DE-AC02-06CH11357; NIH 1U54EB020406-01, Big Data for Discovery Science Center; and NIST 60NANB15D077 (Globus Project). AB acknowledges Ariel Chaparro for assistance with the DAA setup (Inst Pasteur Montevideo). Collections of pilot data sets were supported by various grants (see Supplementary Table 1).</p>
</ack>
<ref-list>
<ref id="b1">
<mixed-citation publication-type="journal">
<name>
<surname>Bilderback</surname>
<given-names>D. H.</given-names>
</name>
,
<name>
<surname>Elleaume</surname>
<given-names>P.</given-names>
</name>
&
<name>
<surname>Weckert</surname>
<given-names>E.</given-names>
</name>
<article-title>Review of third and next generation synchrotron light sources</article-title>
.
<source>J. Phys. B: At. Mol. Opt. Phys.</source>
<volume>38</volume>
,
<fpage>S773</fpage>
<lpage>S797</lpage>
(
<year>2005</year>
).</mixed-citation>
</ref>
<ref id="b2">
<mixed-citation publication-type="journal">
<name>
<surname>Guss</surname>
<given-names>J. M.</given-names>
</name>
&
<name>
<surname>McMahon</surname>
<given-names>B.</given-names>
</name>
<article-title>How to make deposition of images a reality</article-title>
.
<source>Acta Crystallogr. D Biol. Crystallogr.</source>
<volume>70</volume>
,
<fpage>2520</fpage>
<lpage>2532</lpage>
(
<year>2014</year>
).
<pub-id pub-id-type="pmid">25286838</pub-id>
</mixed-citation>
</ref>
<ref id="b3">
<mixed-citation publication-type="journal">
<name>
<surname>Meyer</surname>
<given-names>G. R.</given-names>
</name>
<italic>et al.</italic>
<article-title>Operation of the Australian Store.Synchrotron for macromolecular crystallography</article-title>
.
<source>Acta Crystallogr. D Biol. Crystallogr.</source>
<volume>70</volume>
,
<fpage>2510</fpage>
<lpage>2519</lpage>
(
<year>2014</year>
).
<pub-id pub-id-type="pmid">25286837</pub-id>
</mixed-citation>
</ref>
<ref id="b4">
<mixed-citation publication-type="journal">
<name>
<surname>Elsliger</surname>
<given-names>M.-A.</given-names>
</name>
<italic>et al.</italic>
<article-title>The JCSG high-throughput structural biology pipeline</article-title>
.
<source>Acta Crystallogr. Sect. F Struct. Biol. Cryst. Commun.</source>
<volume>66</volume>
,
<fpage>1137</fpage>
<lpage>1142</lpage>
(
<year>2010</year>
).</mixed-citation>
</ref>
<ref id="b5">
<mixed-citation publication-type="journal">
<name>
<surname>Kroon-Batenburg</surname>
<given-names>L. M. J.</given-names>
</name>
&
<name>
<surname>Helliwell</surname>
<given-names>J. R.</given-names>
</name>
<article-title>Experiences with making diffraction image data available: what metadata do we need to archive?</article-title>
<source>Acta Crystallogr. D Biol. Crystallogr.</source>
<volume>70</volume>
,
<fpage>2502</fpage>
<lpage>2509</lpage>
(
<year>2014</year>
).
<pub-id pub-id-type="pmid">25286836</pub-id>
</mixed-citation>
</ref>
<ref id="b6">
<mixed-citation publication-type="journal">
<name>
<surname>Terwilliger</surname>
<given-names>T. C.</given-names>
</name>
&
<name>
<surname>Bricogne</surname>
<given-names>G.</given-names>
</name>
<article-title>Continuous mutual improvement of macromolecular structure models in the PDB and of X-ray crystallographic software: the dual role of deposited experimental data</article-title>
.
<source>Acta Crystallogr. D Biol. Crystallogr.</source>
<volume>70</volume>
,
<fpage>2533</fpage>
<lpage>2543</lpage>
(
<year>2014</year>
).
<pub-id pub-id-type="pmid">25286839</pub-id>
</mixed-citation>
</ref>
<ref id="b7">
<mixed-citation publication-type="journal">
<name>
<surname>Wall</surname>
<given-names>M. E.</given-names>
</name>
,
<name>
<surname>Adams</surname>
<given-names>P. D.</given-names>
</name>
,
<name>
<surname>Fraser</surname>
<given-names>J. S.</given-names>
</name>
&
<name>
<surname>Sauter</surname>
<given-names>N. K.</given-names>
</name>
<article-title>Diffuse X-ray scattering to model protein motions</article-title>
.
<source>Structure</source>
<volume>22</volume>
,
<fpage>182</fpage>
<lpage>184</lpage>
(
<year>2014</year>
).
<pub-id pub-id-type="pmid">24507780</pub-id>
</mixed-citation>
</ref>
<ref id="b8">
<mixed-citation publication-type="journal">
<name>
<surname>Joosten</surname>
<given-names>R. P.</given-names>
</name>
<italic>et al.</italic>
<article-title>PDB REDO: automated re-refinement of X-ray structure models in the PDB</article-title>
.
<source>J. Appl. Crystallogr.</source>
<volume>42</volume>
,
<fpage>376</fpage>
<lpage>384</lpage>
(
<year>2009</year>
).
<pub-id pub-id-type="pmid">22477769</pub-id>
</mixed-citation>
</ref>
<ref id="b9">
<mixed-citation publication-type="journal">
<name>
<surname>Karplus</surname>
<given-names>P. A.</given-names>
</name>
&
<name>
<surname>Diederichs</surname>
<given-names>K.</given-names>
</name>
<article-title>Linking crystallographic model and data quality</article-title>
.
<source>Science</source>
<volume>336</volume>
,
<fpage>1030</fpage>
<lpage>1033</lpage>
(
<year>2012</year>
).
<pub-id pub-id-type="pmid">22628654</pub-id>
</mixed-citation>
</ref>
<ref id="b10">
<mixed-citation publication-type="journal">
<name>
<surname>Tanley</surname>
<given-names>S. W. M.</given-names>
</name>
,
<name>
<surname>Diederichs</surname>
<given-names>K.</given-names>
</name>
,
<name>
<surname>Kroon-Batenburg</surname>
<given-names>L. M. J.</given-names>
</name>
,
<name>
<surname>Schreurs</surname>
<given-names>A. M. M.</given-names>
</name>
&
<name>
<surname>Helliwell</surname>
<given-names>J. R.</given-names>
</name>
<article-title>Experiences with archived raw diffraction images data: capturing cisplatin after chemical conversion of carboplatin in high salt conditions for a protein crystal</article-title>
.
<source>J. Synchrotron Radiat.</source>
<volume>20</volume>
,
<fpage>880</fpage>
<lpage>883</lpage>
(
<year>2013</year>
).
<pub-id pub-id-type="pmid">24121332</pub-id>
</mixed-citation>
</ref>
<ref id="b11">
<mixed-citation publication-type="journal">
<name>
<surname>Matthews</surname>
<given-names>B. W.</given-names>
</name>
<article-title>Five retracted structure reports: inverted or incorrect?</article-title>
<source>Protein Sci.</source>
<volume>16</volume>
,
<fpage>1013</fpage>
<lpage>1016</lpage>
(
<year>2007</year>
).
<pub-id pub-id-type="pmid">17473006</pub-id>
</mixed-citation>
</ref>
<ref id="b12">
<mixed-citation publication-type="journal">
<name>
<surname>Janssen</surname>
<given-names>B. J.</given-names>
</name>
,
<name>
<surname>Read</surname>
<given-names>R. J.</given-names>
</name>
,
<name>
<surname>Brunger</surname>
<given-names>A. T.</given-names>
</name>
&
<name>
<surname>Gros</surname>
<given-names>P.</given-names>
</name>
<article-title>Crystallography: crystallographic evidence for deviating C3b structure</article-title>
.
<source>Nature</source>
<volume>448</volume>
,
<fpage>E1</fpage>
<lpage>E2</lpage>
(
<year>2007</year>
).
<pub-id pub-id-type="pmid">17687277</pub-id>
</mixed-citation>
</ref>
<ref id="b13">
<mixed-citation publication-type="journal">
<name>
<surname>Berman</surname>
<given-names>H.</given-names>
</name>
,
<name>
<surname>Henrick</surname>
<given-names>K.</given-names>
</name>
&
<name>
<surname>Nakamura</surname>
<given-names>H.</given-names>
</name>
<article-title>Announcing the worldwide protein data bank</article-title>
.
<source>Nat. Struct. Biol.</source>
<volume>10</volume>
,
<fpage>980</fpage>
<lpage>980</lpage>
(
<year>2003</year>
).
<pub-id pub-id-type="pmid">14634627</pub-id>
</mixed-citation>
</ref>
<ref id="b14">
<mixed-citation publication-type="journal">
<name>
<surname>Berman</surname>
<given-names>H.</given-names>
</name>
,
<name>
<surname>Kleywegt</surname>
<given-names>G.</given-names>
</name>
,
<name>
<surname>Nakamura</surname>
<given-names>H.</given-names>
</name>
&
<name>
<surname>Markley</surname>
<given-names>J.</given-names>
</name>
<article-title>The protein data bank archive as an open data resource</article-title>
.
<source>J. Comput. Aided Mol. Des.</source>
<volume>28</volume>
,
<fpage>1009</fpage>
<lpage>1014</lpage>
(
<year>2014</year>
).
<pub-id pub-id-type="pmid">25062767</pub-id>
</mixed-citation>
</ref>
<ref id="b15">
<mixed-citation publication-type="journal">
<name>
<surname>Morin</surname>
<given-names>A.</given-names>
</name>
<italic>et al.</italic>
<article-title>Collaboration gets the most out of software</article-title>
.
<source>eLife</source>
<volume>2</volume>
,
<fpage>e01456</fpage>
(
<year>2013</year>
).
<pub-id pub-id-type="pmid">24040512</pub-id>
</mixed-citation>
</ref>
<ref id="b16">
<mixed-citation publication-type="journal">
<name>
<surname>Foster</surname>
<given-names>I.</given-names>
</name>
in
<source>Network and Parallel Computing</source>
eds Jin H., Reed D., Jiang W.
<volume>3779</volume>
,
<fpage>2</fpage>
<lpage>13</lpage>
Springer (
<year>2005</year>
).</mixed-citation>
</ref>
<ref id="b17">
<mixed-citation publication-type="journal">
<name>
<surname>Foster</surname>
<given-names>I.</given-names>
</name>
<article-title>Globus Online: Accelerating and democratizing science through cloud-based services</article-title>
.
<source>IEEE Internet Computing</source>
<volume>15</volume>
,
<fpage>70</fpage>
<lpage>73</lpage>
(
<year>2011</year>
).</mixed-citation>
</ref>
<ref id="b18">
<mixed-citation publication-type="other">
<name>
<surname>Chard</surname>
<given-names>K.</given-names>
</name>
<italic>et al.</italic>
in 2015 IEEE 11th International Conference on e-Science (e-Science), 401–410 (Munich, 2015).</mixed-citation>
</ref>
<ref id="b19">
<mixed-citation publication-type="journal">
<name>
<surname>Stokes-Rees</surname>
<given-names>I.</given-names>
</name>
<italic>et al.</italic>
<article-title>Adapting federated cyberinfrastructure for shared data collection facilities in structural biology</article-title>
.
<source>J. Synchrotron Radiat.</source>
<volume>19</volume>
,
<fpage>462</fpage>
<lpage>467</lpage>
(
<year>2012</year>
).
<pub-id pub-id-type="pmid">22514186</pub-id>
</mixed-citation>
</ref>
<ref id="b20">
<mixed-citation publication-type="other">
<name>
<surname>Lee</surname>
<given-names>D.</given-names>
</name>
&
<name>
<surname>Raman</surname>
<given-names>C.</given-names>
</name>
X-Ray Diffraction data for:
<italic>Escherichia coli</italic>
DOS Br complex. PDB Code 1V9Z, Structural Biology Data Grid, volume V1,
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.15785/SBGRID/137">http://dx.doi.org/10.15785/SBGRID/137</ext-link>
(2015).</mixed-citation>
</ref>
<ref id="b21">
<mixed-citation publication-type="other">
<name>
<surname>Rudenko</surname>
<given-names>G.</given-names>
</name>
X-Ray Diffraction data for: neurexin 1alpha extracellular domain. PDB Code 3QCW, Structural Biology Data Grid, volume V1,
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.15785/SBGRID/78">http://dx.doi.org/10.15785/SBGRID/78</ext-link>
(2015).</mixed-citation>
</ref>
<ref id="b22">
<mixed-citation publication-type="journal">
<name>
<surname>Biasini</surname>
<given-names>M.</given-names>
</name>
<article-title>PV-WebGL-based protein viewer</article-title>
.
<source>Zenodo</source>
doi:
<pub-id pub-id-type="doi">10.5281/zenodo.12620</pub-id>
(
<year>2014</year>
).</mixed-citation>
</ref>
<ref id="b23">
<mixed-citation publication-type="journal">
<name>
<surname>Starr</surname>
<given-names>J.</given-names>
</name>
<italic>et al.</italic>
<article-title>Achieving human and machine accessibility of cited data in scholarly publications</article-title>
.
<source>PeerJ Comput. Sci.</source>
<volume>1</volume>
,
<fpage>e1</fpage>
(
<year>2015</year>
).</mixed-citation>
</ref>
<ref id="b24">
<mixed-citation publication-type="other"> Martone M. (ed).
<article-title>Data citation synthesis group: Joint declaration of data citation principles</article-title>
. FORCE11
<ext-link ext-link-type="uri" xlink:href="https://www.force11.org/datacitation">https://www.force11.org/datacitation</ext-link>
(
<year>2014</year>
).</mixed-citation>
</ref>
<ref id="b25">
<mixed-citation publication-type="journal">
<name>
<surname>Bourne</surname>
<given-names>P. E.</given-names>
</name>
<italic>et al.</italic>
<article-title>Improving the future of research communications and e-Scholarship (Dagstuhl Perspectives Workshop 11331)</article-title>
.
<source>Dagstuhl Manifestos</source>
<volume>1</volume>
,
<fpage>41</fpage>
<lpage>60</lpage>
(
<year>2011</year>
).</mixed-citation>
</ref>
<ref id="b26">
<mixed-citation publication-type="journal">
<name>
<surname>Altman</surname>
<given-names>M.</given-names>
</name>
&
<name>
<surname>King</surname>
<given-names>G.</given-names>
</name>
<article-title>A proposed standard for the scholarly citation of quantitative data</article-title>
.
<source>D-lib Mag.</source>
<volume>13</volume>
, (
<year>2007</year>
).</mixed-citation>
</ref>
<ref id="b27">
<mixed-citation publication-type="journal">
<name>
<surname>Altman</surname>
<given-names>M.</given-names>
</name>
&
<name>
<surname>Crosas</surname>
<given-names>M.</given-names>
</name>
<article-title>The evolution of data citation: from principles to implementation</article-title>
.
<source>IASSIST Q.</source>
<volume>37</volume>
,
<fpage>62</fpage>
(
<year>2013</year>
).</mixed-citation>
</ref>
<ref id="b28">
<mixed-citation publication-type="journal">
<name>
<surname>Socias</surname>
<given-names>S.</given-names>
</name>
,
<name>
<surname>Morin</surname>
<given-names>A.</given-names>
</name>
,
<name>
<surname>Timony</surname>
<given-names>M.</given-names>
</name>
&
<name>
<surname>Sliz</surname>
<given-names>P.</given-names>
</name>
<article-title>AppCiter: a web application for increasing rates and accuracy of scientific software citation</article-title>
.
<source>Structure</source>
<volume>23</volume>
,
<fpage>807</fpage>
<lpage>808</lpage>
(
<year>2015</year>
).
<pub-id pub-id-type="pmid">25955101</pub-id>
</mixed-citation>
</ref>
<ref id="b29">
<mixed-citation publication-type="other">
<name>
<surname>Hunter</surname>
<given-names>J. C.</given-names>
</name>
&
<name>
<surname>Westover</surname>
<given-names>K. D.</given-names>
</name>
X-Ray Diffraction data for: Human GTPase KRAS G12R bound to GDP. PDB Code 4QL3, Structural Biology Data Grid, volume V1,
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.15785/SBGRID/160">http://dx.doi.org/10.15785/SBGRID/160</ext-link>
(2015).</mixed-citation>
</ref>
<ref id="b30">
<mixed-citation publication-type="other">
<name>
<surname>Gilman</surname>
<given-names>M. S. A.</given-names>
</name>
&
<name>
<surname>McLellan</surname>
<given-names>J. S.</given-names>
</name>
X-Ray diffraction data for: Motavizumab and AM14 in complex with prefusion RSV f. PDB code 4ZYP, Structural Biology Data Grid, volume V1,
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.15785/SBGRID/155">http://dx.doi.org/10.15785/SBGRID/155</ext-link>
(2015).</mixed-citation>
</ref>
<ref id="b31">
<mixed-citation publication-type="other">
<name>
<surname>Feldkamp</surname>
<given-names>M. D.</given-names>
</name>
&
<name>
<surname>Chazin</surname>
<given-names>W. J.</given-names>
</name>
X-Ray diffraction data for: human RPA32C. PDB code 4OU0, Structural Biology Data Grid, volume V1,
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.15785/SBGRID/92">http://dx.doi.org/10.15785/SBGRID/92</ext-link>
(2015).</mixed-citation>
</ref>
<ref id="b32">
<mixed-citation publication-type="other">
<name>
<surname>Tolia</surname>
<given-names>N. H.</given-names>
</name>
X-Ray diffraction data for: erythrocyte binding antigen 140. PDB code 4GF2, Structural Biology Data Grid, volume V1,
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.15785/SBGRID/115">http://dx.doi.org/10.15785/SBGRID/115</ext-link>
(2015).</mixed-citation>
</ref>
<ref id="b33">
<mixed-citation publication-type="other">
<name>
<surname>Hunter</surname>
<given-names>J. C.</given-names>
</name>
&
<name>
<surname>Westover</surname>
<given-names>K. D.</given-names>
</name>
X-Ray diffraction data for: human GTPase KRAS G12C bound to GDP. PDB code 4LDJ, Structural Biology Data Grid, volume V1,
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.15785/SBGRID/158">http://dx.doi.org/10.15785/SBGRID/158</ext-link>
(2015).</mixed-citation>
</ref>
<ref id="b34">
<mixed-citation publication-type="other">
<name>
<surname>Corbett</surname>
<given-names>K. D.</given-names>
</name>
&
<name>
<surname>Harrison</surname>
<given-names>S.</given-names>
</name>
X-Ray diffraction data for:
<italic>S. cerevisiae</italic>
Csm1-Mam1 complex. PDB code 4EMC, Structural Biology Data Grid, volume V1,
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.15785/SBGRID/24">http://dx.doi.org/10.15785/SBGRID/24</ext-link>
(2015).</mixed-citation>
</ref>
<ref id="b35">
<mixed-citation publication-type="other">
<name>
<surname>Gajadeera</surname>
<given-names>C. S.</given-names>
</name>
&
<name>
<surname>Tsodikov</surname>
<given-names>O. V.</given-names>
</name>
X-Ray diffraction data for: Inorganic pyrophosphatase from staphylococcus aureus in complex with mn2+. PDB code 4RPA, Structural Biology Data Grid, volume V1,
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.15785/SBGRID/22">http://dx.doi.org/10.15785/SBGRID/22</ext-link>
(2015).</mixed-citation>
</ref>
<ref id="b36">
<mixed-citation publication-type="journal">
<name>
<surname>Winter</surname>
<given-names>G.</given-names>
</name>
,
<name>
<surname>Lobley</surname>
<given-names>C. M. C.</given-names>
</name>
&
<name>
<surname>Prince</surname>
<given-names>S. M.</given-names>
</name>
<article-title>Decision making in xia2</article-title>
.
<source>Acta Crystallogr. D Biol. Crystallogr.</source>
<volume>69</volume>
,
<fpage>1260</fpage>
<lpage>1273</lpage>
(
<year>2013</year>
).
<pub-id pub-id-type="pmid">23793152</pub-id>
</mixed-citation>
</ref>
<ref id="b37">
<mixed-citation publication-type="journal">
<name>
<surname>Evans</surname>
<given-names>P. R.</given-names>
</name>
&
<name>
<surname>Murshudov</surname>
<given-names>G. N.</given-names>
</name>
<article-title>How good are my data and what is the resolution?</article-title>
<source>Acta Crystallogr. D Biol. Crystallogr.</source>
<volume>69</volume>
,
<fpage>1204</fpage>
<lpage>1214</lpage>
(
<year>2013</year>
).
<pub-id pub-id-type="pmid">23793146</pub-id>
</mixed-citation>
</ref>
<ref id="b38">
<mixed-citation publication-type="journal">
<name>
<surname>Evans</surname>
<given-names>P.</given-names>
</name>
<article-title>Scaling and assessment of data quality</article-title>
.
<source>Acta Crystallogr. D Biol. Crystallogr.</source>
<volume>62</volume>
,
<fpage>72</fpage>
<lpage>82</lpage>
(
<year>2006</year>
).
<pub-id pub-id-type="pmid">16369096</pub-id>
</mixed-citation>
</ref>
<ref id="b39">
<mixed-citation publication-type="journal">
<name>
<surname>Kabsch</surname>
<given-names>W.</given-names>
</name>
<article-title>XDS</article-title>
.
<source>Acta Crystallogr. D Biol. Crystallogr.</source>
<volume>66</volume>
,
<fpage>125</fpage>
<lpage>132</lpage>
(
<year>2010</year>
).</mixed-citation>
</ref>
<ref id="b40">
<mixed-citation publication-type="journal">
<name>
<surname>Winn</surname>
<given-names>M. D.</given-names>
</name>
<italic>et al.</italic>
<article-title>Overview of the CCP4 suite and current developments</article-title>
.
<source>Acta Crystallogr. D Biol. Crystallogr.</source>
<volume>67</volume>
,
<fpage>235</fpage>
<lpage>242</lpage>
(
<year>2011</year>
).
<pub-id pub-id-type="pmid">21460441</pub-id>
</mixed-citation>
</ref>
<ref id="b41">
<mixed-citation publication-type="journal">
<name>
<surname>Leslie</surname>
<given-names>A. G. W.</given-names>
</name>
<article-title>Integration of macromolecular diffraction data</article-title>
.
<source>Acta Crystallogr. D Biol. Crystallogr.</source>
<volume>55</volume>
,
<fpage>1696</fpage>
<lpage>1702</lpage>
(
<year>1999</year>
).
<pub-id pub-id-type="pmid">10531519</pub-id>
</mixed-citation>
</ref>
<ref id="b42">
<mixed-citation publication-type="journal">
<name>
<surname>Waterman</surname>
<given-names>D. G.</given-names>
</name>
<italic>et al.</italic>
<article-title>The DIALS framework for integration software</article-title>
.
<source>CCP4 Newslett. Protein Crystallogr.</source>
<volume>49</volume>
,
<fpage>13</fpage>
<lpage>15</lpage>
(
<year>2013</year>
).</mixed-citation>
</ref>
<ref id="b43">
<mixed-citation publication-type="journal">
<name>
<surname>Battye</surname>
<given-names>G. T.</given-names>
</name>
,
<name>
<surname>Kontogiannis</surname>
<given-names>L.</given-names>
</name>
,
<name>
<surname>Johnson</surname>
<given-names>O.</given-names>
</name>
,
<name>
<surname>Powell</surname>
<given-names>H. R.</given-names>
</name>
&
<name>
<surname>Leslie</surname>
<given-names>A. G.</given-names>
</name>
<article-title>iMOSFLM: a new graphical interface for diffraction-image processing with MOSFLM</article-title>
.
<source>Acta Crystallogr. D Biol. Crystallogr.</source>
<volume>67</volume>
,
<fpage>271</fpage>
<lpage>281</lpage>
(
<year>2011</year>
).
<pub-id pub-id-type="pmid">21460445</pub-id>
</mixed-citation>
</ref>
<ref id="b44">
<mixed-citation publication-type="journal">
<name>
<surname>Helliwell</surname>
<given-names>J. R.</given-names>
</name>
&
<name>
<surname>Mitchell</surname>
<given-names>E. P.</given-names>
</name>
<article-title>Synchrotron radiation macromolecular crystallography: science and spin-offs</article-title>
.
<source>IUCrJ</source>
<volume>2</volume>
,
<fpage>283</fpage>
<lpage>291</lpage>
(
<year>2015</year>
).</mixed-citation>
</ref>
<ref id="b45">
<mixed-citation publication-type="journal">
<name>
<surname>Welberry</surname>
<given-names>T.</given-names>
</name>
<source>Diffuse X-Ray Scattering and Models of Disorder</source>
OUP Oxford (
<year>2004</year>
).</mixed-citation>
</ref>
<ref id="b46">
<mixed-citation publication-type="journal">
<name>
<surname>Wall</surname>
<given-names>M. E.</given-names>
</name>
,
<name>
<surname>Clarage</surname>
<given-names>J. B.</given-names>
</name>
&
<name>
<surname>Phillips</surname>
<given-names>G. N.</given-names>
<suffix>Jr</suffix>
</name>
<article-title>Motions of calmodulin characterized using both bragg and diffuse X-ray scattering</article-title>
.
<source>Structure</source>
<volume>5</volume>
,
<fpage>1599</fpage>
<lpage>1612</lpage>
(
<year>1997</year>
).
<pub-id pub-id-type="pmid">9438860</pub-id>
</mixed-citation>
</ref>
<ref id="b47">
<mixed-citation publication-type="journal">
<name>
<surname>Wall</surname>
<given-names>M. E.</given-names>
</name>
<italic>et al.</italic>
<article-title>Conformational dynamics of a crystalline protein from microsecond-scale molecular dynamics simulations and diffuse X-ray scattering</article-title>
.
<source>Proc. Natl. Acad. Sci.</source>
<volume>111</volume>
,
<fpage>17887</fpage>
<lpage>17892</lpage>
(
<year>2014</year>
).
<pub-id pub-id-type="pmid">25453071</pub-id>
</mixed-citation>
</ref>
<ref id="b48">
<mixed-citation publication-type="journal">
<name>
<surname>Wall</surname>
<given-names>M.</given-names>
</name>
<article-title>Methods and software for diffuse X-ray scattering from protein crystals</article-title>
. in
<source>Micro and Nano Technologies in Bioanalysis, Methods in Molecular Biology</source>
eds Foote R. S., Lee J. W.
<volume>544</volume>
,
<fpage>269</fpage>
<lpage>279</lpage>
Humana Press (
<year>2009</year>
).</mixed-citation>
</ref>
<ref id="b49">
<mixed-citation publication-type="other">
<name>
<surname>Fraser</surname>
<given-names>J. S.</given-names>
</name>
X-Ray diffraction data for: Cyclophilin a. PDB code 4YUO, Structural Biology Data Grid, volume V1,
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.15785/SBGRID/68">http://dx.doi.org/10.15785/SBGRID/68</ext-link>
(2015).</mixed-citation>
</ref>
<ref id="b50">
<mixed-citation publication-type="journal">
<name>
<surname>Shi</surname>
<given-names>D.</given-names>
</name>
,
<name>
<surname>Nannenga</surname>
<given-names>B. L.</given-names>
</name>
,
<name>
<surname>Iadanza</surname>
<given-names>M. G.</given-names>
</name>
&
<name>
<surname>Gonen</surname>
<given-names>T.</given-names>
</name>
<article-title>Three-dimensional electron crystallography of protein microcrystals</article-title>
.
<source>Elife</source>
<volume>2</volume>
,
<fpage>e01345</fpage>
(
<year>2013</year>
).
<pub-id pub-id-type="pmid">24252878</pub-id>
</mixed-citation>
</ref>
<ref id="b51">
<mixed-citation publication-type="journal">
<name>
<surname>Nannenga</surname>
<given-names>B. L.</given-names>
</name>
,
<name>
<surname>Shi</surname>
<given-names>D.</given-names>
</name>
,
<name>
<surname>Leslie</surname>
<given-names>A. G.</given-names>
</name>
&
<name>
<surname>Gonen</surname>
<given-names>T.</given-names>
</name>
<article-title>High-resolution structure determination by continuous-rotation data collection in MicroED</article-title>
.
<source>Nat. Methods</source>
<volume>11</volume>
,
<fpage>927</fpage>
<lpage>930</lpage>
(
<year>2014</year>
).
<pub-id pub-id-type="pmid">25086503</pub-id>
</mixed-citation>
</ref>
<ref id="b52">
<mixed-citation publication-type="other">
<name>
<surname>Reyes</surname>
<given-names>F.</given-names>
</name>
,
<name>
<surname>Rodriguez</surname>
<given-names>J.</given-names>
</name>
&
<name>
<surname>Gonen</surname>
<given-names>T.</given-names>
</name>
Micro-Electron diffraction data for: alpha-synuclein. PDB code 4RIL, Structural Biology Data Grid, volume V1,
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.15785/SBGRID/193">http://dx.doi.org/10.15785/SBGRID/193</ext-link>
(2015).</mixed-citation>
</ref>
<ref id="b53">
<mixed-citation publication-type="other">
<name>
<surname>de la Cruz</surname>
<given-names>J.</given-names>
</name>
,
<name>
<surname>Shi</surname>
<given-names>D.</given-names>
</name>
&
<name>
<surname>Gonen</surname>
<given-names>T.</given-names>
</name>
Micro-Electron diffraction data for: bovine catalase. PDB code 3J7B, Structural Biology Data Grid, volume V1,
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.15785/SBGRID/186">http://dx.doi.org/10.15785/SBGRID/186</ext-link>
(2015).</mixed-citation>
</ref>
<ref id="b54">
<mixed-citation publication-type="other">
<name>
<surname>Shi</surname>
<given-names>D.</given-names>
</name>
&
<name>
<surname>Gonen</surname>
<given-names>T.</given-names>
</name>
Micro-Electron diffraction data for: Hen egg white lysozyme. PDB code 3J6K, Structural Biology Data Grid, volume V1,
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.15785/SBGRID/185">http://dx.doi.org/10.15785/SBGRID/185</ext-link>
(2015).</mixed-citation>
</ref>
<ref id="b55">
<mixed-citation publication-type="other">
<name>
<surname>Vangone</surname>
<given-names>A.</given-names>
</name>
&
<name>
<surname>Bonvin</surname>
<given-names>A. M.</given-names>
</name>
HADDOCK docking models, Structural Biology Data Grid, volume V1,
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.15785/SBGRID/131">http://dx.doi.org/10.15785/SBGRID/131</ext-link>
(2015).</mixed-citation>
</ref>
<ref id="b56">
<mixed-citation publication-type="other">
<name>
<surname>Bowers</surname>
<given-names>K.</given-names>
</name>
<italic>et al.</italic>
in
<italic>Proceedings of the ACM/IEEE SC 2006 Conference</italic>
, 43–43 (Tampa, FL, USA, 2006).</mixed-citation>
</ref>
<ref id="b57">
<mixed-citation publication-type="other">
<name>
<surname>Sliz</surname>
<given-names>P.</given-names>
</name>
Molecular dynamics trajectory of human O-GlcNAc transferase. PDB code 3PE4, Structural Biology Data Grid,
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.15785/SBGRID/190">http://dx.doi.org/10.15785/SBGRID/190</ext-link>
(2015).</mixed-citation>
</ref>
<ref id="b58">
<mixed-citation publication-type="journal">
<name>
<surname>Chen</surname>
<given-names>B.-C.</given-names>
</name>
<italic>et al.</italic>
<article-title>Lattice light-sheet microscopy: Imaging molecules to embryos at high spatiotemporal resolution</article-title>
.
<source>Science</source>
<volume>346</volume>
,
<fpage>1257998</fpage>
(
<year>2014</year>
).
<pub-id pub-id-type="pmid">25342811</pub-id>
</mixed-citation>
</ref>
<ref id="b59">
<mixed-citation publication-type="journal">
<name>
<surname>Kural</surname>
<given-names>C.</given-names>
</name>
<italic>et al.</italic>
<article-title>Asymmetric formation of coated pits on dorsal and ventral surfaces at the leading edges of motile cells and on protrusions of immobile cells</article-title>
.
<source>Mol. Biol. Cell</source>
<volume>26</volume>
,
<fpage>2044</fpage>
<lpage>2053</lpage>
(
<year>2015</year>
).
<pub-id pub-id-type="pmid">25851602</pub-id>
</mixed-citation>
</ref>
<ref id="b60">
<mixed-citation publication-type="other">
<name>
<surname>Upadhyayula</surname>
<given-names>S.</given-names>
</name>
&
<name>
<surname>Kirchhausen</surname>
<given-names>T.</given-names>
</name>
Lattice Light-Sheet microscopy data for: developing zebrafish embryo, Structural Biology Data Grid, V1,
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.15785/SBGRID/187">http://dx.doi.org/10.15785/SBGRID/187</ext-link>
(2015).</mixed-citation>
</ref>
<ref id="b61">
<mixed-citation publication-type="journal">
<name>
<surname>Crosas</surname>
<given-names>M.</given-names>
</name>
,
<name>
<surname>Honaker</surname>
<given-names>J.</given-names>
</name>
,
<name>
<surname>King</surname>
<given-names>G.</given-names>
</name>
&
<name>
<surname>Sweeney</surname>
<given-names>L.</given-names>
</name>
<article-title>Automating open science for big data</article-title>
.
<source>Ann. Am. Acad. Polit. Soc. Sci.</source>
<volume>659</volume>
,
<fpage>260</fpage>
<lpage>273</lpage>
(
<year>2015</year>
).</mixed-citation>
</ref>
<ref id="b62">
<mixed-citation publication-type="journal">
<name>
<surname>Crosas</surname>
<given-names>M.</given-names>
</name>
<article-title>A data sharing story</article-title>
.
<source>J. eScience Librariansh.</source>
<volume>1</volume>
,
<fpage>7</fpage>
(
<year>2013</year>
).</mixed-citation>
</ref>
<ref id="b63">
<mixed-citation publication-type="journal">
<name>
<surname>Crosas</surname>
<given-names>M.</given-names>
</name>
<article-title>The dataverse network: an open-source application for sharing, discovering and preserving data</article-title>
.
<source>D-lib Mag.</source>
<volume>17</volume>
,
<fpage>2</fpage>
(
<year>2011</year>
).</mixed-citation>
</ref>
<ref id="b64">
<mixed-citation publication-type="journal">
<name>
<surname>King</surname>
<given-names>G.</given-names>
</name>
<article-title>An introduction to the Dataverse Network as an infrastructure for data sharing</article-title>
.
<source>Sociol. Meth. Res.</source>
<volume>36</volume>
,
<fpage>173</fpage>
<lpage>199</lpage>
(
<year>2007</year>
).</mixed-citation>
</ref>
<ref id="b65">
<mixed-citation publication-type="journal">
<name>
<surname>Nicholls</surname>
<given-names>R. A.</given-names>
</name>
,
<name>
<surname>Fischer</surname>
<given-names>M.</given-names>
</name>
,
<name>
<surname>Stuart</surname>
<given-names>M.</given-names>
</name>
&
<name>
<surname>Murshudov</surname>
<given-names>G. N.</given-names>
</name>
<article-title>Conformation-independent structural comparison of macromolecules with ProSMART</article-title>
.
<source>Acta Crystallogr. D Biol. Crystallogr.</source>
<volume>70</volume>
,
<fpage>2487</fpage>
<lpage>2499</lpage>
(
<year>2014</year>
).
<pub-id pub-id-type="pmid">25195761</pub-id>
</mixed-citation>
</ref>
<ref id="b66">
<mixed-citation publication-type="journal">
<name>
<surname>Chowdary</surname>
<given-names>T. K.</given-names>
</name>
<italic>et al.</italic>
<article-title>Crystal structure of the conserved herpesvirus fusion regulator complex gH-gL</article-title>
.
<source>Nat. Struct. Mol. Biol.</source>
<volume>17</volume>
,
<fpage>882</fpage>
<lpage>888</lpage>
(
<year>2010</year>
).
<pub-id pub-id-type="pmid">20601960</pub-id>
</mixed-citation>
</ref>
</ref-list>
<fn-group>
<fn>
<p>
<bold>Author contributions</bold>
All authors contributed to the current study, including intellectual input and editing of the manuscript. P.A.M., S.S. and P.S. developed the data grid system and A.B., M.L., C.B., J.W., D.N., K.R., J.K., F.R.N.C.M., I.F., M.C. and P.S. implemented the Data Access Alliance infrastructure. P.A.M., K.D. and P.S. analysed the data. K.S.A., R.H.B., S.C.B., T.J.B., D.B., T.J.B., A.C., C.I.C., W.J.C., K.D.C., M.S.C., S.C., S.D.P., E.D.C., C.L.D., M.J.E., B.F.E., Q.R.F., A.R.F., J.S.F., J.C.F., K.C.G., R.G., P.G., S.C.H., E.E.H., Z.J., R.J.K., A.C.K., M.K., J.S.M., Y.M., Y.N., Z.O., E.F.P., P.J.B.P., C.P., C.S.R., T.A.R., A.R., M.K.R., G.R., J.S., T.U.S., Y.S., H.S., Y.J.T., N.H.T., O.V.T., K.D.W., H.W. and P.S. contributed X-ray diffraction data sets. A.M.J.J.B. contributed the HADDOCK docking decoys data set. T.G., T.K. and P.S. contributed MicroED, Lattice Light-Sheet Microscopy and Molecular Dynamics data sets, respectively. P.A.M., E.R. and P.S. analysed the data and wrote the paper.</p>
</fn>
</fn-group>
</back>
<floats-group>
<fig id="f1">
<label>Figure 1</label>
<caption>
<title>Data collection statistics for the pilot subset of 112 data sets.</title>
<p>(
<bold>a</bold>
,
<bold>b</bold>
) Data sets were collected from synchrotrons on four continents (in addition to laboratory sources, which are not broken down geographically) and originate from eleven synchrotron facilities: Advanced Light Source, Advanced Photon Source, Australian Synchrotron, Cornell High Energy Synchrotron Source, Canadian Light Source, European Synchrotron Radiation Facility, National Synchrotron Light Source, National Synchrotron Radiation Research Center, Swiss Light Source, Shanghai Synchrotron Radiation Facility, and Stanford Synchrotron Radiation Lightsource. World map image courtesy of the U.S. Geological Survey. (
<bold>c</bold>
) Breakdown of data sets collected at the Advanced Photon Source beamlines. (
<bold>d</bold>
) Data sets cover a range of detector types, including Area Detector Systems Corporation M300, Q210 and Q315, Rayonix MarMosaic, Dectris Pilatus 2M and 6M, R-AXIS HTC, and MAR345.</p>
</caption>
<graphic xlink:href="ncomms10882-f1"></graphic>
</fig>
<fig id="f2">
<label>Figure 2</label>
<caption>
<title>Estimation of storage requirements for different stages of the structural biology pipeline, based on the SBDG pilot collection.</title>
<p>For structure factor amplitudes and PDB models, file sizes were obtained from a subset of 96 PDB depositions derived from the pilot data sets. On average, SBDG stores 1.26 data sets per PDB file. Numbers in red indicate the estimated storage requirements to accommodate data sets for 100,000 structures. We estimate that for each primary data set, an additional 100 data sets are collected at national facilities. Primary data refers to original experimental diffraction images supporting the derived structural model, as distinguished from all experimental data (screening images, inferior quality data sets, and so on). For crystallographic experiments, reduced data refers to the integrated intensities (or amplitudes, which do not materially affect storage requirements).</p>
</caption>
<graphic xlink:href="ncomms10882-f2"></graphic>
</fig>
<fig id="f3">
<label>Figure 3</label>
<caption>
<title>Organized display of data collections at SBDG.</title>
<p>(
<bold>a</bold>
) Graphical view of Laboratory and Institutional Collections within the SBDG; (
<bold>b</bold>
) PV structure viewer, displaying a published model with links to its two primary deposited data sets.</p>
</caption>
<graphic xlink:href="ncomms10882-f3"></graphic>
</fig>
<fig id="f4">
<label>Figure 4</label>
<caption>
<title>SBDG persistent data set landing page (the target of a DOI resolver for a published data set).</title>
<p>Data set metadata are displayed, as are instructions for downloading and verifying the data set.</p>
</caption>
<graphic xlink:href="ncomms10882-f4"></graphic>
</fig>
<fig id="f5">
<label>Figure 5</label>
<caption>
<title>Experimental data flow and publication.</title>
<p>(
<bold>a</bold>
) Flow of Primary Experimental Data. Data sets collected at synchrotrons are moved to end-users' computers for processing and structure determination. Subsequently refined macromolecular models are deposited at PDB and primary data is uploaded to SBDG. From SBDG, data sets are replicated to DAA centres and eventually copied to DAA Satellites. End-users can access data sets by downloading from DAA centres and by direct access from Satellites. (
<bold>b</bold>
) Flowchart for data publication.</p>
</caption>
<graphic xlink:href="ncomms10882-f5"></graphic>
</fig>
<fig id="f6">
<label>Figure 6</label>
<caption>
<title>DataCite metadata schema used for primary data sets within the SBDG.</title>
<p>Information associated with the DOI record for a primary data set through the EZID system.</p>
</caption>
<graphic xlink:href="ncomms10882-f6"></graphic>
</fig>
<fig id="f7">
<label>Figure 7</label>
<caption>
<title>Data publication guidelines.</title>
<p>(
<bold>a</bold>
) Flowchart illustrating publication guidelines incorporating software and data citations. (
<bold>b</bold>
) Data Citation guidelines, adapted from Dataverse Best Practices Guidelines that were developed based on Force 11 Joint Declaration of Data Citation Principles.</p>
</caption>
<graphic xlink:href="ncomms10882-f7"></graphic>
</fig>
<fig id="f8">
<label>Figure 8</label>
<caption>
<title>Reprocessing of X-ray diffraction data sets.</title>
<p>(
<bold>a</bold>
) Analysis of 110 X-ray diffraction data sets that supported previously published PDB coordinates. Most of the failures (represented in red) were due to inaccurate or incomplete image-header information. In several of these cases, depositors provided annotations correcting this information; (
<bold>b</bold>
) Comparison of resolution determined by automated
<italic>xia2</italic>
reprocessing with published resolution. Includes data sets not used for final refinement of published structures; (
<bold>c</bold>
) Shift in direct beam position from image headers and refined value following successful reprocessing with
<italic>xia2</italic>
.</p>
</caption>
<graphic xlink:href="ncomms10882-f8"></graphic>
</fig>
<table-wrap position="float" id="t1">
<label>Table 1</label>
<caption>
<title>Data science standards.</title>
</caption>
<table frame="hsides" rules="groups" border="1">
<colgroup>
<col align="left"></col>
<col align="left"></col>
</colgroup>
<tbody valign="top">
<tr>
<td align="left" valign="top" charoff="50">Disclosure</td>
<td align="left" valign="top" charoff="50">Software tools developed under this program will be incorporated into open source software and released to the community. Manuscripts and white papers describing various phases of the project will be released on a regular basis.</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">Adoption</td>
<td align="left" valign="top" charoff="50">All biomedical image data will be converted to master formats, such as OME-TIFF or HDF5. Community tools to create, analyse, and manipulate diffraction images will be extended to include support for these formats. All biomedical data are assigned Digital Object Identifiers through the CDL EZID system, and follow modified DataCite and Dataverse metadata schemas. Associated metadata are registered with the International DOI Foundation, making them virtually permanent and independent of SBGrid and Harvard computing infrastructure. All data sets published through the SBDG will be citable using Force 11 recommendations.</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">Transparency</td>
<td align="left" valign="top" charoff="50">Files within individual data sets will be deposited in their original format (no archives or encryption allowed). Self-documentation: The majority of diffraction data sets are self-documented and include the basic information required for reprocessing in the images themselves. Additional information will be collected during deposition and will include data set representation (the ability to use the data to be processed), reference (relation to PDB files, publications, and other data sets), context (for example, a native data set or a derivative used for phasing), fixity (checksums), and provenance (typically the data collection facility and the project member who deposits the original data set). With conversion to master formats, all secondary information will be appended to the image metadata along with all original headers.</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">External dependencies</td>
<td align="left" valign="top" charoff="50">The ability to reprocess some older data sets and verify master format conversions could depend on access to a specific version of data processing software. As data sets enter our repository, they will be reprocessed with our Data Reprocessing Pipeline (one of several we will develop as part of our Data Mining Pipelines). Data Reprocessing Pipelines will be archived within our system, issued DOIs, and interlinked with the data sets. It is worth noting that, since 2002, SBGrid has been archiving structural biology applications and, therefore, has access to previous software versions that might be required to reprocess older data sets.</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">Licensing</td>
<td align="left" valign="top" charoff="50">Biomedical data sets will be deposited under the Creative Commons Zero licence, supporting future development of data validation services and database replications and migrations.</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">Technical protection mechanism</td>
<td align="left" valign="top" charoff="50">The security of the deposited data will be maintained by the DAA. The DAA will join with the Library of Congress-sponsored NDSA, and the data architect working on the project will ensure that NDSA recommendations are being followed.</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="t1-fn1">
<p>NDSA, National Digital Stewardship Alliance; SBDG, Structural Biology Data Grid.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<table-wrap position="float" id="t2">
<label>Table 2</label>
<caption>
<title>Reference subset.</title>
</caption>
<table frame="hsides" rules="groups" border="1">
<colgroup>
<col align="left"></col>
<col align="left"></col>
</colgroup>
<thead valign="bottom">
<tr>
<th align="left" valign="top" charoff="50">
<bold>Data set</bold>
</th>
<th align="left" valign="top" charoff="50">
<bold>Description</bold>
</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left" valign="top" charoff="50">10.15785/SBGRID/5; Boggon Laboratory; Reference Case 1: MR/Multi-crystal averaging.</td>
<td align="left" valign="top" charoff="50">Data sets from 5 crystals of SNX17 FERM domain in complex with a peptide corresponding to KRIT1's NPxY2 motif. Separate integration of the data sets and scaling together yields a complete 3.0 Å data set for molecular replacement (the original paper used 4GXB as a search model) and structure refinement.</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50"> </td>
<td> </td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">10.15785/SBGRID/117; Baxter Laboratory; Reference Case 2: MR/Low resolution, twinned with rotational pseudosymmetry.</td>
<td align="left" valign="top" charoff="50">3.70 Å data set collected on a crystal of thioester-containing protein 1 *S1 allele (TEP1*S1). Initial data processing suggested P4
<sub>3</sub>
2
<sub>1</sub>
2, but one of the two molecules (∼1300 aa. each) in the ASU overlapped with its symmetry-mate. Comparison of alternative scenarios in refinement identified the true space group as
<italic>P</italic>
4
<sub>3</sub>
with twinning and rotational pseudosymmetry. Refinement was completed with TLS, NCS (local) and external restraints derived by
<italic>ProSMART</italic>
<xref ref-type="bibr" rid="b65">65</xref>
using TEP1*R1 (PDB 4D94) as reference.</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50"> </td>
<td> </td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">10.15785/SBGRID/62; Modis Laboratory; Reference Case 3: U SAD/Low resolution.</td>
<td align="left" valign="top" charoff="50">4.5 Å data set of a uranyl acetate derivative used for a challenging structure determination by SAD. Certain images had streaky features and were excluded from data reprocessing. The height and definition of peaks in anomalous difference Patterson maps were improved by omitting certain images near the end of the data collection run.</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50"> </td>
<td> </td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">10.15785/SBGRID/111; Ferré-D'Amaré Laboratory; Reference Case 4: Ba/K SAD; 91 nt RNA-chromophore complex.</td>
<td align="left" valign="top" charoff="50">2.5 Å data set collected at ALS BL 5.0.2 using 6.0 keV X-rays from a crystal of 'Spinach', a fluorescent RNA analogue of GFP. Although the anomalous signal was very weak, a heavy atom substructure comprising one barium and six potassium ions resulted in good quality SAD electron density maps.</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50"> </td>
<td> </td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">10.15785/SBGRID/3; Sliz Laboratory; Reference Case 5: Zn SAD; 4 Zn/ASU; protein/RNA complex.</td>
<td align="left" valign="top" charoff="50">A 2.9 Å Zn SAD data set was sufficient to determine a crystal structure of the Lin28/let-7d protein-microRNA complex. X-ray beam size was adjusted to maximize flux and minimize radiation damage. One swapped dimer is located in each asymmetric unit. Two native zinc atoms are located in each tandem CCHC zinc knuckle domain.</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50"> </td>
<td> </td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">10.15785/SBGRID/123; Heldwein Laboratory; Reference Case 6: 3.29-Å SeMet SAD; 9 Se/ASU</td>
<td align="left" valign="top" charoff="50">This 3.29-Å selenomethionine SAD data set, collected at 0.9789 Å wavelength at the BNL X25 beamline, was sufficient to determine the phases and to trace the structure of the HSV-2 gH/gL complex
<xref ref-type="bibr" rid="b66">66</xref>
. There are 9 Se sites in the ASU. During integration in HKL2000, χ
<sup>2</sup>
appeared very large for some sectors of the data set. These correlated with crystal orientation and likely resulted from a large difference in cell edges (
<italic>a</italic>
=
<italic>b</italic>
=88 Å versus
<italic>c</italic>
=333 Å).</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50"> </td>
<td> </td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">10.15785/SBGRID/179; Schwartz Laboratory; Reference Case 7: MR-SAD at 7.0 Å</td>
<td align="left" valign="top" charoff="50">Contaminating
<italic>E. coli</italic>
protein 4FCC_A, acting as a crystallization chaperone, was found readily by MR. Using these MR phases, seven (Ta
<sub>6</sub>
Br
<sub>12</sub>
)
<sup>2+</sup>
-positions could be found in the 8.8 Å derivative data set 180. The combined MR-SAD phases were sufficient to position two copies of Nup37 (4FHL) and two copies of Nup120 in the asymmetric unit.</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50"> </td>
<td> </td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">10.15785/SBGRID/218; 10.15785/SBGRID/78; Rudenko Laboratory; Reference Case 8: MR-SAD at 2.65 Å (44 Se atoms/ASU)</td>
<td align="left" valign="top" charoff="50">A 3.25 Å data set (#218) from a crystal of the selenomethionyl neurexin 1alpha ectodomain and a higher-resolution 2.65 Å native data set (#78), both collected at APS using multiple settings. The structure has 2 molecules/ASU with a total of 14 ordered domains and ∼2,000 residues. Molecular replacement successfully placed 8 LNS domains (using a single LNS domain as a search model, i.e. ∼9% of the scattering mass), generating phases that could be used to reveal 37 out of 44 Se atoms/ASU in the 3.25 Å SeMet SAD data set. Refinement was completed using data set #78.</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">10.15785/SBGRID/9; Tao Laboratory; Reference Case 9: 3.25 Å data set used for MR with a 9-Å cryo-EM envelope</td>
<td align="left" valign="top" charoff="50">A 3.25-Å resolution data set was collected at APS LS-CAT. The structure was determined by molecular replacement using a 9-Å resolution cryo-EM reconstruction as a phasing model. Solvent flattening and 15-fold noncrystallographic symmetry averaging were applied during phase extension.</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50"> </td>
<td> </td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">10.15785/SBGRID/83; Drennan Laboratory; Reference Case 10: MR/large unit cell, anisotropic</td>
<td align="left" valign="top" charoff="50">Diffraction data from different regions of a crystal of Isobutyryl-coenzyme A mutase fused, a 250 kDa dimeric enzyme. This crystal had a large unit cell (
<italic>a</italic>
,
<italic>b</italic>
=319 Å,
<italic>c</italic>
=344 Å) and the data were anisotropic. Integrating the 6 wedges separately, each with an individually adjusted resolution limit, and then scaling them together yields a complete 3.35 Å data set that can be used for molecular replacement.</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50"> </td>
<td> </td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">10.15785/SBGRID/125; Kruse Laboratory (data collected in Kobilka Laboratory); Reference Case 11: MR, lipidic cubic phase</td>
<td align="left" valign="top" charoff="50">Diffraction data for lipidic cubic phase crystals of human M
<sub>2</sub>
muscarinic acetylcholine receptor bound to the agonist iperoxo, the allosteric modulator LY2119620, and the conformationally-selective nanobody Nb9-8.</td>
</tr>
<tr>
<td align="left" valign="top" charoff="50"> </td>
<td> </td>
</tr>
<tr>
<td align="left" valign="top" charoff="50">10.15785/SBGRID/68; Fraser Laboratory; Reference Case 12: X-ray diffuse scattering</td>
<td align="left" valign="top" charoff="50">A 1.2 Å data set collected at SSRL provides a high-resolution standard data set of the enzyme cyclophilin that can be used to examine the influence of data collection temperature, to compare with XFEL data, and to measure X-ray diffuse scattering.</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="t2-fn1">
<p>MR, molecular replacement; SAD, Single-wavelength Anomalous Diffraction.</p>
</fn>
<fn id="t2-fn2">
<p>Twelve X-ray diffraction data sets from the SBDG pilot collection were identified as particularly suitable for software testing and teaching activities. In addition, data sets from molecular dynamics, lattice light-sheet microscopy and MicroED represent an invaluable subset.</p>
</fn>
</table-wrap-foot>
</table-wrap>
</floats-group>
</pmc>
</record>

To work with this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/CyberinfraV1/Data/Pmc/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000144 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd -nk 000144 | SxmlIndent | more

To link to this page from within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    CyberinfraV1
   |flux=    Pmc
   |étape=   Corpus
   |type=    RBID
   |clé=     PMC:4786681
   |texte=   Data publication with the structural biology data grid supports live analysis
}}

To generate wiki pages

HfdIndexSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/RBID.i   -Sk "pubmed:26947396" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd   \
       | NlmPubMed2Wicri -a CyberinfraV1 

Wicri

This area was generated with Dilib version V0.6.25.
Data generation: Thu Oct 27 09:30:58 2016. Site generation: Sun Mar 10 23:08:40 2024