Exploration server on relations between France and Australia

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Statistical methods for handling unwanted variation in metabolomics data.

Internal identifier: 002E52 (PubMed/Corpus); previous: 002E51; next: 002E53

Authors: Alysha M. De Livera; Marko Sysi-Aho; Laurent Jacob; Johann A. Gagnon-Bartsch; Sandra Castillo; Julie A. Simpson; Terence P. Speed

Source: Analytical chemistry (eISSN 1520-6882), 2015

RBID : pubmed:25692814

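English descriptors: Humans; Mass Spectrometry (methods); Metabolomics (methods); Principal Component Analysis; Software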

Abstract

Metabolomics experiments are inevitably subject to a component of unwanted variation, due to factors such as batch effects, long runs of samples, and confounding biological variation. Although the removal of this unwanted variation is a vital step in the analysis of metabolomics data, it is considered a gray area in which there is a recognized need to develop a better understanding of the procedures and statistical methods required to achieve statistically relevant optimal biological outcomes. In this paper, we discuss the causes of unwanted variation in metabolomics experiments, review commonly used metabolomics approaches for handling this unwanted variation, and present a statistical approach for the removal of unwanted variation to obtain normalized metabolomics data. The advantages and performance of the approach relative to several widely used metabolomics normalization approaches are illustrated through two metabolomics studies, and recommendations are provided for choosing and assessing the most suitable normalization method for a given metabolomics experiment. Software for the approach is made freely available.

DOI: 10.1021/ac502439y
PubMed: 25692814

The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Statistical methods for handling unwanted variation in metabolomics data.</title>
<author>
<name sortKey="De Livera, Alysha M" sort="De Livera, Alysha M" uniqKey="De Livera A" first="Alysha M" last="De Livera">Alysha M. De Livera</name>
<affiliation>
<nlm:affiliation>†Biostatistics Unit, Centre for Epidemiology and Biostatistics, University of Melbourne, Melbourne, VIC 3800, Australia.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Sysi Aho, Marko" sort="Sysi Aho, Marko" uniqKey="Sysi Aho M" first="Marko" last="Sysi-Aho">Marko Sysi-Aho</name>
<affiliation>
<nlm:affiliation>‡Zora Biosciences Oy, FIN-02150 Espoo, Finland.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Jacob, Laurent" sort="Jacob, Laurent" uniqKey="Jacob L" first="Laurent" last="Jacob">Laurent Jacob</name>
<affiliation>
<nlm:affiliation>§Laboratoire de Biométrie et Biologie Evolutive, Université Lyon 1, CNRS, INRA, UMR5558, Villeurbanne, France.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Gagnon Bartsch, Johann A" sort="Gagnon Bartsch, Johann A" uniqKey="Gagnon Bartsch J" first="Johann A" last="Gagnon-Bartsch">Johann A. Gagnon-Bartsch</name>
<affiliation>
<nlm:affiliation>∥Department of Statistics, University of California, Berkeley, California United States, 94720.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Castillo, Sandra" sort="Castillo, Sandra" uniqKey="Castillo S" first="Sandra" last="Castillo">Sandra Castillo</name>
<affiliation>
<nlm:affiliation>¶VTT Technical Research Centre of Finland, P. O. Box 1000, FI-02044 VTT Espoo, Finland.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Simpson, Julie A" sort="Simpson, Julie A" uniqKey="Simpson J" first="Julie A" last="Simpson">Julie A. Simpson</name>
<affiliation>
<nlm:affiliation>†Biostatistics Unit, Centre for Epidemiology and Biostatistics, University of Melbourne, Melbourne, VIC 3800, Australia.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Speed, Terence P" sort="Speed, Terence P" uniqKey="Speed T" first="Terence P" last="Speed">Terence P. Speed</name>
<affiliation>
<nlm:affiliation>∥Department of Statistics, University of California, Berkeley, California United States, 94720.</nlm:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PubMed</idno>
<date when="2015">2015</date>
<idno type="RBID">pubmed:25692814</idno>
<idno type="pmid">25692814</idno>
<idno type="doi">10.1021/ac502439y</idno>
<idno type="wicri:Area/PubMed/Corpus">002E52</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Corpus" wicri:corpus="PubMed">002E52</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Statistical methods for handling unwanted variation in metabolomics data.</title>
<author>
<name sortKey="De Livera, Alysha M" sort="De Livera, Alysha M" uniqKey="De Livera A" first="Alysha M" last="De Livera">Alysha M. De Livera</name>
<affiliation>
<nlm:affiliation>†Biostatistics Unit, Centre for Epidemiology and Biostatistics, University of Melbourne, Melbourne, VIC 3800, Australia.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Sysi Aho, Marko" sort="Sysi Aho, Marko" uniqKey="Sysi Aho M" first="Marko" last="Sysi-Aho">Marko Sysi-Aho</name>
<affiliation>
<nlm:affiliation>‡Zora Biosciences Oy, FIN-02150 Espoo, Finland.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Jacob, Laurent" sort="Jacob, Laurent" uniqKey="Jacob L" first="Laurent" last="Jacob">Laurent Jacob</name>
<affiliation>
<nlm:affiliation>§Laboratoire de Biométrie et Biologie Evolutive, Université Lyon 1, CNRS, INRA, UMR5558, Villeurbanne, France.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Gagnon Bartsch, Johann A" sort="Gagnon Bartsch, Johann A" uniqKey="Gagnon Bartsch J" first="Johann A" last="Gagnon-Bartsch">Johann A. Gagnon-Bartsch</name>
<affiliation>
<nlm:affiliation>∥Department of Statistics, University of California, Berkeley, California United States, 94720.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Castillo, Sandra" sort="Castillo, Sandra" uniqKey="Castillo S" first="Sandra" last="Castillo">Sandra Castillo</name>
<affiliation>
<nlm:affiliation>¶VTT Technical Research Centre of Finland, P. O. Box 1000, FI-02044 VTT Espoo, Finland.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Simpson, Julie A" sort="Simpson, Julie A" uniqKey="Simpson J" first="Julie A" last="Simpson">Julie A. Simpson</name>
<affiliation>
<nlm:affiliation>†Biostatistics Unit, Centre for Epidemiology and Biostatistics, University of Melbourne, Melbourne, VIC 3800, Australia.</nlm:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Speed, Terence P" sort="Speed, Terence P" uniqKey="Speed T" first="Terence P" last="Speed">Terence P. Speed</name>
<affiliation>
<nlm:affiliation>∥Department of Statistics, University of California, Berkeley, California United States, 94720.</nlm:affiliation>
</affiliation>
</author>
</analytic>
<series>
<title level="j">Analytical chemistry</title>
<idno type="eISSN">1520-6882</idno>
<imprint>
<date when="2015" type="published">2015</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Humans</term>
<term>Mass Spectrometry (methods)</term>
<term>Metabolomics (methods)</term>
<term>Principal Component Analysis</term>
<term>Software</term>
</keywords>
<keywords scheme="MESH" qualifier="methods" xml:lang="en">
<term>Mass Spectrometry</term>
<term>Metabolomics</term>
</keywords>
<keywords scheme="MESH" xml:lang="en">
<term>Humans</term>
<term>Principal Component Analysis</term>
<term>Software</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Metabolomics experiments are inevitably subject to a component of unwanted variation, due to factors such as batch effects, long runs of samples, and confounding biological variation. Although the removal of this unwanted variation is a vital step in the analysis of metabolomics data, it is considered a gray area in which there is a recognized need to develop a better understanding of the procedures and statistical methods required to achieve statistically relevant optimal biological outcomes. In this paper, we discuss the causes of unwanted variation in metabolomics experiments, review commonly used metabolomics approaches for handling this unwanted variation, and present a statistical approach for the removal of unwanted variation to obtain normalized metabolomics data. The advantages and performance of the approach relative to several widely used metabolomics normalization approaches are illustrated through two metabolomics studies, and recommendations are provided for choosing and assessing the most suitable normalization method for a given metabolomics experiment. Software for the approach is made freely available.</div>
</front>
</TEI>
<pubmed>
<MedlineCitation Status="MEDLINE" Owner="NLM">
<PMID Version="1">25692814</PMID>
<DateCreated>
<Year>2015</Year>
<Month>04</Month>
<Day>07</Day>
</DateCreated>
<DateCompleted>
<Year>2015</Year>
<Month>09</Month>
<Day>18</Day>
</DateCompleted>
<DateRevised>
<Year>2016</Year>
<Month>10</Month>
<Day>25</Day>
</DateRevised>
<Article PubModel="Print-Electronic">
<Journal>
<ISSN IssnType="Electronic">1520-6882</ISSN>
<JournalIssue CitedMedium="Internet">
<Volume>87</Volume>
<Issue>7</Issue>
<PubDate>
<Year>2015</Year>
<Month>Apr</Month>
<Day>07</Day>
</PubDate>
</JournalIssue>
<Title>Analytical chemistry</Title>
<ISOAbbreviation>Anal. Chem.</ISOAbbreviation>
</Journal>
<ArticleTitle>Statistical methods for handling unwanted variation in metabolomics data.</ArticleTitle>
<Pagination>
<MedlinePgn>3606-15</MedlinePgn>
</Pagination>
<ELocationID EIdType="doi" ValidYN="Y">10.1021/ac502439y</ELocationID>
<Abstract>
<AbstractText>Metabolomics experiments are inevitably subject to a component of unwanted variation, due to factors such as batch effects, long runs of samples, and confounding biological variation. Although the removal of this unwanted variation is a vital step in the analysis of metabolomics data, it is considered a gray area in which there is a recognized need to develop a better understanding of the procedures and statistical methods required to achieve statistically relevant optimal biological outcomes. In this paper, we discuss the causes of unwanted variation in metabolomics experiments, review commonly used metabolomics approaches for handling this unwanted variation, and present a statistical approach for the removal of unwanted variation to obtain normalized metabolomics data. The advantages and performance of the approach relative to several widely used metabolomics normalization approaches are illustrated through two metabolomics studies, and recommendations are provided for choosing and assessing the most suitable normalization method for a given metabolomics experiment. Software for the approach is made freely available.</AbstractText>
</Abstract>
<AuthorList CompleteYN="Y">
<Author ValidYN="Y">
<LastName>De Livera</LastName>
<ForeName>Alysha M</ForeName>
<Initials>AM</Initials>
<AffiliationInfo>
<Affiliation>†Biostatistics Unit, Centre for Epidemiology and Biostatistics, University of Melbourne, Melbourne, VIC 3800, Australia.</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y">
<LastName>Sysi-Aho</LastName>
<ForeName>Marko</ForeName>
<Initials>M</Initials>
<AffiliationInfo>
<Affiliation>‡Zora Biosciences Oy, FIN-02150 Espoo, Finland.</Affiliation>
</AffiliationInfo>
<AffiliationInfo>
<Affiliation>¶VTT Technical Research Centre of Finland, P. O. Box 1000, FI-02044 VTT Espoo, Finland.</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y">
<LastName>Jacob</LastName>
<ForeName>Laurent</ForeName>
<Initials>L</Initials>
<AffiliationInfo>
<Affiliation>§Laboratoire de Biométrie et Biologie Evolutive, Université Lyon 1, CNRS, INRA, UMR5558, Villeurbanne, France.</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y">
<LastName>Gagnon-Bartsch</LastName>
<ForeName>Johann A</ForeName>
<Initials>JA</Initials>
<AffiliationInfo>
<Affiliation>∥Department of Statistics, University of California, Berkeley, California United States, 94720.</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y">
<LastName>Castillo</LastName>
<ForeName>Sandra</ForeName>
<Initials>S</Initials>
<AffiliationInfo>
<Affiliation>¶VTT Technical Research Centre of Finland, P. O. Box 1000, FI-02044 VTT Espoo, Finland.</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y">
<LastName>Simpson</LastName>
<ForeName>Julie A</ForeName>
<Initials>JA</Initials>
<AffiliationInfo>
<Affiliation>†Biostatistics Unit, Centre for Epidemiology and Biostatistics, University of Melbourne, Melbourne, VIC 3800, Australia.</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y">
<LastName>Speed</LastName>
<ForeName>Terence P</ForeName>
<Initials>TP</Initials>
<AffiliationInfo>
<Affiliation>∥Department of Statistics, University of California, Berkeley, California United States, 94720.</Affiliation>
</AffiliationInfo>
<AffiliationInfo>
<Affiliation>⊥Bioinformatics Division, Walter and Eliza Hall Institute, 1 G Royal Parade, Parkville, Victoria 3052, Australia.</Affiliation>
</AffiliationInfo>
<AffiliationInfo>
<Affiliation>⧧Department of Mathematics and Statistics, University of Melbourne, VIC 3800, Melbourne, Australia.</Affiliation>
</AffiliationInfo>
</Author>
</AuthorList>
<Language>eng</Language>
<GrantList CompleteYN="Y">
<Grant>
<GrantID>R01 GM083084</GrantID>
<Acronym>GM</Acronym>
<Agency>NIGMS NIH HHS</Agency>
<Country>United States</Country>
</Grant>
</GrantList>
<PublicationTypeList>
<PublicationType UI="D016428">Journal Article</PublicationType>
</PublicationTypeList>
<ArticleDate DateType="Electronic">
<Year>2015</Year>
<Month>03</Month>
<Day>06</Day>
</ArticleDate>
</Article>
<MedlineJournalInfo>
<Country>United States</Country>
<MedlineTA>Anal Chem</MedlineTA>
<NlmUniqueID>0370536</NlmUniqueID>
<ISSNLinking>0003-2700</ISSNLinking>
</MedlineJournalInfo>
<CitationSubset>IM</CitationSubset>
<CommentsCorrectionsList>
<CommentsCorrections RefType="Cites">
<RefSource>Anal Chem. 2006 Jan 15;78(2):567-74</RefSource>
<PMID Version="1">16408941</PMID>
</CommentsCorrections>
<CommentsCorrections RefType="Cites">
<RefSource>J Chromatogr B Analyt Technol Biomed Life Sci. 2008 Aug 15;871(2):299-305</RefSource>
<PMID Version="1">18579458</PMID>
</CommentsCorrections>
<CommentsCorrections RefType="Cites">
<RefSource>Sci Data. 2014;1:140012</RefSource>
<PMID Version="1">25977770</PMID>
</CommentsCorrections>
<CommentsCorrections RefType="Cites">
<RefSource>Bioinformatics. 2014 Aug 1;30(15):2155-61</RefSource>
<PMID Version="1">24711654</PMID>
</CommentsCorrections>
<CommentsCorrections RefType="Cites">
<RefSource>J Pharm Biomed Anal. 2014 Jan;87:1-11</RefSource>
<PMID Version="1">24091079</PMID>
</CommentsCorrections>
<CommentsCorrections RefType="Cites">
<RefSource>J Lipid Res. 2013 Oct;54(10):2898-908</RefSource>
<PMID Version="1">23868910</PMID>
</CommentsCorrections>
<CommentsCorrections RefType="Cites">
<RefSource>Methods Mol Biol. 2013;1055:291-307</RefSource>
<PMID Version="1">23963918</PMID>
</CommentsCorrections>
<CommentsCorrections RefType="Cites">
<RefSource>Curr Med Chem. 2013;20(2):257-71</RefSource>
<PMID Version="1">23210853</PMID>
</CommentsCorrections>
<CommentsCorrections RefType="Cites">
<RefSource>Anal Chem. 2013 Jan 15;85(2):1037-46</RefSource>
<PMID Version="1">23240878</PMID>
</CommentsCorrections>
<CommentsCorrections RefType="Cites">
<RefSource>Bioinformatics. 2004 Oct 12;20(15):2447-54</RefSource>
<PMID Version="1">15087312</PMID>
</CommentsCorrections>
<CommentsCorrections RefType="Cites">
<RefSource>Anal Biochem. 2004 Aug 15;331(2):283-95</RefSource>
<PMID Version="1">15265734</PMID>
</CommentsCorrections>
<CommentsCorrections RefType="Cites">
<RefSource>Anal Chem. 2003 Sep 15;75(18):4818-26</RefSource>
<PMID Version="1">14674459</PMID>
</CommentsCorrections>
<CommentsCorrections RefType="Cites">
<RefSource>Anal Chem. 2012 Dec 18;84(24):10768-76</RefSource>
<PMID Version="1">23150939</PMID>
</CommentsCorrections>
<CommentsCorrections RefType="Cites">
<RefSource>Biostatistics. 2012 Jul;13(3):539-52</RefSource>
<PMID Version="1">22101192</PMID>
</CommentsCorrections>
<CommentsCorrections RefType="Cites">
<RefSource>Anal Chem. 2012 Mar 20;84(6):2670-7</RefSource>
<PMID Version="1">22264131</PMID>
</CommentsCorrections>
<CommentsCorrections RefType="Cites">
<RefSource>Nat Protoc. 2011 Jul;6(7):1060-83</RefSource>
<PMID Version="1">21720319</PMID>
</CommentsCorrections>
<CommentsCorrections RefType="Cites">
<RefSource>Methods Mol Biol. 2011;708:247-57</RefSource>
<PMID Version="1">21207295</PMID>
</CommentsCorrections>
<CommentsCorrections RefType="Cites">
<RefSource>Proc Natl Acad Sci U S A. 2010 Sep 21;107(38):16465-70</RefSource>
<PMID Version="1">20810919</PMID>
</CommentsCorrections>
<CommentsCorrections RefType="Cites">
<RefSource>BMC Bioinformatics. 2010;11:395</RefSource>
<PMID Version="1">20650010</PMID>
</CommentsCorrections>
<CommentsCorrections RefType="Cites">
<RefSource>Mol Biosyst. 2010 Jan;6(1):108-20</RefSource>
<PMID Version="1">20024072</PMID>
</CommentsCorrections>
<CommentsCorrections RefType="Cites">
<RefSource>Anal Chem. 2009 Oct 1;81(19):7974-80</RefSource>
<PMID Version="1">19743813</PMID>
</CommentsCorrections>
<CommentsCorrections RefType="Cites">
<RefSource>Anal Chem. 2009 Feb 15;81(4):1357-64</RefSource>
<PMID Version="1">19170513</PMID>
</CommentsCorrections>
<CommentsCorrections RefType="Cites">
<RefSource>Anal Chem. 2006 Apr 1;78(7):2262-7</RefSource>
<PMID Version="1">16579606</PMID>
</CommentsCorrections>
<CommentsCorrections RefType="Cites">
<RefSource>BMC Bioinformatics. 2007;8:93</RefSource>
<PMID Version="1">17362505</PMID>
</CommentsCorrections>
<CommentsCorrections RefType="Cites">
<RefSource>PLoS Genet. 2007 Sep;3(9):1724-35</RefSource>
<PMID Version="1">17907809</PMID>
</CommentsCorrections>
<CommentsCorrections RefType="Cites">
<RefSource>Proteomics. 2008 Jan;8(1):21-7</RefSource>
<PMID Version="1">18095358</PMID>
</CommentsCorrections>
<CommentsCorrections RefType="Cites">
<RefSource>Biostatistics. 2016 Jan;17(1):16-28</RefSource>
<PMID Version="1">26286812</PMID>
</CommentsCorrections>
</CommentsCorrectionsList>
<MeshHeadingList>
<MeshHeading>
<DescriptorName UI="D006801" MajorTopicYN="N">Humans</DescriptorName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D013058" MajorTopicYN="N">Mass Spectrometry</DescriptorName>
<QualifierName UI="Q000379" MajorTopicYN="Y">methods</QualifierName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D055432" MajorTopicYN="N">Metabolomics</DescriptorName>
<QualifierName UI="Q000379" MajorTopicYN="Y">methods</QualifierName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D025341" MajorTopicYN="N">Principal Component Analysis</DescriptorName>
</MeshHeading>
<MeshHeading>
<DescriptorName UI="D012984" MajorTopicYN="Y">Software</DescriptorName>
</MeshHeading>
</MeshHeadingList>
<OtherID Source="NLM">NIHMS715285</OtherID>
<OtherID Source="NLM">PMC4544854</OtherID>
</MedlineCitation>
<PubmedData>
<History>
<PubMedPubDate PubStatus="entrez">
<Year>2015</Year>
<Month>2</Month>
<Day>19</Day>
<Hour>6</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="pubmed">
<Year>2015</Year>
<Month>2</Month>
<Day>19</Day>
<Hour>6</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="medline">
<Year>2015</Year>
<Month>9</Month>
<Day>19</Day>
<Hour>6</Hour>
<Minute>0</Minute>
</PubMedPubDate>
</History>
<PublicationStatus>ppublish</PublicationStatus>
<ArticleIdList>
<ArticleId IdType="pubmed">25692814</ArticleId>
<ArticleId IdType="doi">10.1021/ac502439y</ArticleId>
<ArticleId IdType="pmc">PMC4544854</ArticleId>
<ArticleId IdType="mid">NIHMS715285</ArticleId>
</ArticleIdList>
</PubmedData>
</pubmed>
</record>
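
As a minimal sketch of how the PubMed portion of the record above could be read programmatically, the following Python snippet parses a shortened excerpt of the `<pubmed>` element with the standard library and extracts the PMID, title, DOI, and author names. The excerpt is embedded as a string (with only two of the seven authors) so the example is self-contained, and the function name `summarize_citation` is hypothetical. The `<TEI>` part is deliberately left out: its `nlm:` and `wicri:` prefixes are not declared anywhere in the record, so a strict XML parser would likely reject it unless namespace declarations were added first.

```python
import xml.etree.ElementTree as ET

# Shortened excerpt of the <pubmed> element shown above (two of seven authors).
PUBMED_EXCERPT = """
<pubmed>
<MedlineCitation Status="MEDLINE" Owner="NLM">
<PMID Version="1">25692814</PMID>
<Article PubModel="Print-Electronic">
<ArticleTitle>Statistical methods for handling unwanted variation in metabolomics data.</ArticleTitle>
<ELocationID EIdType="doi" ValidYN="Y">10.1021/ac502439y</ELocationID>
<AuthorList CompleteYN="Y">
<Author ValidYN="Y">
<LastName>De Livera</LastName>
<ForeName>Alysha M</ForeName>
</Author>
<Author ValidYN="Y">
<LastName>Sysi-Aho</LastName>
<ForeName>Marko</ForeName>
</Author>
</AuthorList>
</Article>
</MedlineCitation>
</pubmed>
"""


def summarize_citation(xml_text):
    """Return PMID, title, DOI and author names from a <pubmed> element.

    Hypothetical helper: it assumes the element layout seen in this record.
    """
    root = ET.fromstring(xml_text.strip())
    citation = root.find("MedlineCitation")
    article = citation.find("Article")
    # Build "ForeName LastName" strings in the order the authors are listed.
    authors = [
        f"{a.findtext('ForeName')} {a.findtext('LastName')}"
        for a in article.findall("AuthorList/Author")
    ]
    return {
        "pmid": citation.findtext("PMID"),
        "title": article.findtext("ArticleTitle"),
        # Select the ELocationID whose EIdType attribute is "doi".
        "doi": article.findtext("ELocationID[@EIdType='doi']"),
        "authors": authors,
    }


if __name__ == "__main__":
    for key, value in summarize_citation(PUBMED_EXCERPT).items():
        print(f"{key}: {value}")
```

The same function should work on the full `<pubmed>` element if it is first cut out of the record, since that subtree uses no namespace prefixes.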

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Asie/explor/AustralieFrV1/Data/PubMed/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002E52 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/PubMed/Corpus/biblio.hfd -nk 002E52 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Wicri/Asie
   |area=    AustralieFrV1
   |flux=    PubMed
   |étape=   Corpus
   |type=    RBID
   |clé=     pubmed:25692814
   |texte=   Statistical methods for handling unwanted variation in metabolomics data.
}}

To generate wiki pages

HfdIndexSelect -h $EXPLOR_AREA/Data/PubMed/Corpus/RBID.i   -Sk "pubmed:25692814" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/PubMed/Corpus/biblio.hfd   \
       | NlmPubMed2Wicri -a AustralieFrV1 

This area was generated with Dilib version V0.6.33.
Data generation: Tue Dec 5 10:43:12 2017. Site generation: Tue Mar 5 14:07:20 2024