Data processing and classification analysis of proteomic changes: a case study of oil pollution in the mussel, Mytilus edulis
Internal identifier: 000165 (Ncbi/Curation); previous: 000164; next: 000166
Authors: Tiphaine Monsinjon [Norway, France]; Odd Ketil Andersen [Norway]; François Leboulenger [France]; Thomas Knigge [Norway, France]
Source:
- Proteome Science [1477-5956]; 2006.
Abstract
Proteomics may help to detect subtle pollution-related changes, such as responses to mixture pollution at low concentrations, where clear signs of toxicity are absent. The challenges associated with the analysis of large-scale multivariate proteomic datasets have been widely discussed in medical research and biomarker discovery. This concept has been introduced to ecotoxicology only recently, so data processing and classification analysis need to be refined before they can be readily applied in biomarker discovery and monitoring studies.
Data sets obtained from a case study of oil pollution in the blue mussel were investigated for differential protein expression by retentate chromatography-mass spectrometry and decision tree classification. Different tissues and different settings were used to evaluate the discriminatory power of the classifiers. It was found that, due to the intrinsic variability of the data sets, reliable classification of unknown samples could only be achieved on a broad statistical basis (n > 60), with the observed expression changes showing high statistical significance and sufficient amplitude. The application of stringent criteria to guard against overfitting of the models eventually allowed satisfactory classification for only one of the investigated data sets and settings.
Machine learning techniques provide a promising approach to process and extract informative expression signatures from high-dimensional mass-spectrometry data. Even though characterisation of the proteins forming the expression signatures would be ideal, knowledge of the specific proteins is not mandatory for effective class discrimination. This may constitute a new biomarker approach in ecotoxicology, where working with organisms that do not have sequenced genomes renders protein identification by database searching problematic. However, data processing has to be critically evaluated and statistical constraints have to be considered before supervised classification algorithms are employed.
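The overfitting concern raised in the abstract can be illustrated with a minimal sketch. It is not the authors' software or procedure: it assumes synthetic peak intensities, reduces the decision tree to a single split (a "decision stump"), and uses leave-one-out cross-validation as one common guard against overfitting. All function names (`fit_stump`, `loo_accuracy`, `make_noise_data`) are hypothetical.

```python
import random


def fit_stump(X, y):
    """Return (feature, threshold, label_if_above) with the fewest training errors."""
    best = None  # (errors, feature, threshold, label_if_above)
    for f in range(len(X[0])):
        values = sorted({row[f] for row in X})
        # Candidate thresholds: midpoints between consecutive distinct values.
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2
            for above in (0, 1):
                errors = sum(
                    (above if row[f] > t else 1 - above) != label
                    for row, label in zip(X, y)
                )
                if best is None or errors < best[0]:
                    best = (errors, f, t, above)
    return best[1:]


def predict(stump, row):
    f, t, above = stump
    return above if row[f] > t else 1 - above


def training_accuracy(X, y):
    """Accuracy on the same samples used for fitting (optimistic)."""
    stump = fit_stump(X, y)
    return sum(predict(stump, r) == l for r, l in zip(X, y)) / len(X)


def loo_accuracy(X, y):
    """Leave-one-out accuracy: each sample is classified by a stump fitted without it."""
    hits = 0
    for i in range(len(X)):
        stump = fit_stump(X[:i] + X[i + 1:], y[:i] + y[i + 1:])
        hits += predict(stump, X[i]) == y[i]
    return hits / len(X)


def make_noise_data(n_samples, n_peaks, seed):
    """Synthetic 'spectra' with no real class signal (assumption for illustration)."""
    rng = random.Random(seed)
    X = [[rng.gauss(0, 1) for _ in range(n_peaks)] for _ in range(n_samples)]
    y = [i % 2 for i in range(n_samples)]
    return X, y


if __name__ == "__main__":
    X, y = make_noise_data(n_samples=12, n_peaks=50, seed=1)
    print("training accuracy:", training_accuracy(X, y))
    print("leave-one-out accuracy:", loo_accuracy(X, y))
```

With few samples and many peaks, a split that fits pure noise well on the training data is easy to find, whereas the held-out estimate typically stays near chance. This is consistent with the abstract's point that reliable classification needs a broad sample basis and stringent validation.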
Url:
DOI: 10.1186/1477-5956-4-17
PubMed: 16970821
PubMed Central: 1592071
Links to previous steps (curation, corpus...)
- to stream Pmc, to step Corpus: to go to this record in the Curation step: 000142
- to stream Pmc, to step Curation: to go to this record in the Curation step: 000140
- to stream Pmc, to step Checkpoint: to go to this record in the Curation step: 000134
- to stream Ncbi, to step Merge: to go to this record in the Curation step: 000165
Links to Exploration step
PMC:1592071
The document in XML format
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">Data processing and classification analysis of proteomic changes: a case study of oil pollution in the mussel, <italic>Mytilus edulis</italic>
</title>
<author><name sortKey="Monsinjon, Tiphaine" sort="Monsinjon, Tiphaine" uniqKey="Monsinjon T" first="Tiphaine" last="Monsinjon">Tiphaine Monsinjon</name>
<affiliation wicri:level="1"><nlm:aff id="I1">IRIS – International Research Institute of Stavanger AS, Randaberg, Norway</nlm:aff>
<country xml:lang="fr">Norvège</country>
<wicri:regionArea>IRIS – International Research Institute of Stavanger AS, Randaberg</wicri:regionArea>
<wicri:noRegion>Randaberg</wicri:noRegion>
</affiliation>
<affiliation wicri:level="4"><nlm:aff id="I2">Laboratoire d'Ecotoxicologie – Milieux Aquatiques, Université du Havre, Le Havre, France</nlm:aff>
<country xml:lang="fr">France</country>
<wicri:regionArea>Laboratoire d'Ecotoxicologie – Milieux Aquatiques, Université du Havre, Le Havre</wicri:regionArea>
<placeName><settlement type="city">Le Havre</settlement>
</placeName>
<orgName type="university">Université du Havre</orgName>
<placeName><settlement type="city">Le Havre</settlement>
<region type="region" nuts="2">Région Normandie</region>
<region type="old region" nuts="2">Haute-Normandie</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Andersen, Odd Ketil" sort="Andersen, Odd Ketil" uniqKey="Andersen O" first="Odd Ketil" last="Andersen">Odd Ketil Andersen</name>
<affiliation wicri:level="1"><nlm:aff id="I1">IRIS – International Research Institute of Stavanger AS, Randaberg, Norway</nlm:aff>
<country xml:lang="fr">Norvège</country>
<wicri:regionArea>IRIS – International Research Institute of Stavanger AS, Randaberg</wicri:regionArea>
<wicri:noRegion>Randaberg</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Leboulenger, Francois" sort="Leboulenger, Francois" uniqKey="Leboulenger F" first="François" last="Leboulenger">François Leboulenger</name>
<affiliation wicri:level="4"><nlm:aff id="I2">Laboratoire d'Ecotoxicologie – Milieux Aquatiques, Université du Havre, Le Havre, France</nlm:aff>
<country xml:lang="fr">France</country>
<wicri:regionArea>Laboratoire d'Ecotoxicologie – Milieux Aquatiques, Université du Havre, Le Havre</wicri:regionArea>
<placeName><settlement type="city">Le Havre</settlement>
</placeName>
<orgName type="university">Université du Havre</orgName>
<placeName><settlement type="city">Le Havre</settlement>
<region type="region" nuts="2">Région Normandie</region>
<region type="old region" nuts="2">Haute-Normandie</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Knigge, Thomas" sort="Knigge, Thomas" uniqKey="Knigge T" first="Thomas" last="Knigge">Thomas Knigge</name>
<affiliation wicri:level="1"><nlm:aff id="I1">IRIS – International Research Institute of Stavanger AS, Randaberg, Norway</nlm:aff>
<country xml:lang="fr">Norvège</country>
<wicri:regionArea>IRIS – International Research Institute of Stavanger AS, Randaberg</wicri:regionArea>
<wicri:noRegion>Randaberg</wicri:noRegion>
</affiliation>
<affiliation wicri:level="4"><nlm:aff id="I2">Laboratoire d'Ecotoxicologie – Milieux Aquatiques, Université du Havre, Le Havre, France</nlm:aff>
<country xml:lang="fr">France</country>
<wicri:regionArea>Laboratoire d'Ecotoxicologie – Milieux Aquatiques, Université du Havre, Le Havre</wicri:regionArea>
<placeName><settlement type="city">Le Havre</settlement>
</placeName>
<orgName type="university">Université du Havre</orgName>
<placeName><settlement type="city">Le Havre</settlement>
<region type="region" nuts="2">Région Normandie</region>
<region type="old region" nuts="2">Haute-Normandie</region>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PMC</idno>
<idno type="pmid">16970821</idno>
<idno type="pmc">1592071</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1592071</idno>
<idno type="RBID">PMC:1592071</idno>
<idno type="doi">10.1186/1477-5956-4-17</idno>
<date when="2006">2006</date>
<idno type="wicri:Area/Pmc/Corpus">000142</idno>
<idno type="wicri:Area/Pmc/Curation">000140</idno>
<idno type="wicri:Area/Pmc/Checkpoint">000134</idno>
<idno type="wicri:Area/Ncbi/Merge">000165</idno>
<idno type="wicri:Area/Ncbi/Curation">000165</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a" type="main">Data processing and classification analysis of proteomic changes: a case study of oil pollution in the mussel, <italic>Mytilus edulis</italic>
</title>
<author><name sortKey="Monsinjon, Tiphaine" sort="Monsinjon, Tiphaine" uniqKey="Monsinjon T" first="Tiphaine" last="Monsinjon">Tiphaine Monsinjon</name>
<affiliation wicri:level="1"><nlm:aff id="I1">IRIS – International Research Institute of Stavanger AS, Randaberg, Norway</nlm:aff>
<country xml:lang="fr">Norvège</country>
<wicri:regionArea>IRIS – International Research Institute of Stavanger AS, Randaberg</wicri:regionArea>
<wicri:noRegion>Randaberg</wicri:noRegion>
</affiliation>
<affiliation wicri:level="4"><nlm:aff id="I2">Laboratoire d'Ecotoxicologie – Milieux Aquatiques, Université du Havre, Le Havre, France</nlm:aff>
<country xml:lang="fr">France</country>
<wicri:regionArea>Laboratoire d'Ecotoxicologie – Milieux Aquatiques, Université du Havre, Le Havre</wicri:regionArea>
<placeName><settlement type="city">Le Havre</settlement>
</placeName>
<orgName type="university">Université du Havre</orgName>
<placeName><settlement type="city">Le Havre</settlement>
<region type="region" nuts="2">Région Normandie</region>
<region type="old region" nuts="2">Haute-Normandie</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Andersen, Odd Ketil" sort="Andersen, Odd Ketil" uniqKey="Andersen O" first="Odd Ketil" last="Andersen">Odd Ketil Andersen</name>
<affiliation wicri:level="1"><nlm:aff id="I1">IRIS – International Research Institute of Stavanger AS, Randaberg, Norway</nlm:aff>
<country xml:lang="fr">Norvège</country>
<wicri:regionArea>IRIS – International Research Institute of Stavanger AS, Randaberg</wicri:regionArea>
<wicri:noRegion>Randaberg</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Leboulenger, Francois" sort="Leboulenger, Francois" uniqKey="Leboulenger F" first="François" last="Leboulenger">François Leboulenger</name>
<affiliation wicri:level="4"><nlm:aff id="I2">Laboratoire d'Ecotoxicologie – Milieux Aquatiques, Université du Havre, Le Havre, France</nlm:aff>
<country xml:lang="fr">France</country>
<wicri:regionArea>Laboratoire d'Ecotoxicologie – Milieux Aquatiques, Université du Havre, Le Havre</wicri:regionArea>
<placeName><settlement type="city">Le Havre</settlement>
</placeName>
<orgName type="university">Université du Havre</orgName>
<placeName><settlement type="city">Le Havre</settlement>
<region type="region" nuts="2">Région Normandie</region>
<region type="old region" nuts="2">Haute-Normandie</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Knigge, Thomas" sort="Knigge, Thomas" uniqKey="Knigge T" first="Thomas" last="Knigge">Thomas Knigge</name>
<affiliation wicri:level="1"><nlm:aff id="I1">IRIS – International Research Institute of Stavanger AS, Randaberg, Norway</nlm:aff>
<country xml:lang="fr">Norvège</country>
<wicri:regionArea>IRIS – International Research Institute of Stavanger AS, Randaberg</wicri:regionArea>
<wicri:noRegion>Randaberg</wicri:noRegion>
</affiliation>
<affiliation wicri:level="4"><nlm:aff id="I2">Laboratoire d'Ecotoxicologie – Milieux Aquatiques, Université du Havre, Le Havre, France</nlm:aff>
<country xml:lang="fr">France</country>
<wicri:regionArea>Laboratoire d'Ecotoxicologie – Milieux Aquatiques, Université du Havre, Le Havre</wicri:regionArea>
<placeName><settlement type="city">Le Havre</settlement>
</placeName>
<orgName type="university">Université du Havre</orgName>
<placeName><settlement type="city">Le Havre</settlement>
<region type="region" nuts="2">Région Normandie</region>
<region type="old region" nuts="2">Haute-Normandie</region>
</placeName>
</affiliation>
</author>
</analytic>
<series><title level="j">Proteome Science</title>
<idno type="eISSN">1477-5956</idno>
<imprint><date when="2006">2006</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass></textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en"><sec><title>Background</title>
<p>Proteomics may help to detect subtle pollution-related changes, such as responses to mixture pollution at low concentrations, where clear signs of toxicity are absent. The challenges associated with the analysis of large-scale multivariate proteomic datasets have been widely discussed in medical research and biomarker discovery. This concept has been introduced to ecotoxicology only recently, so data processing and classification analysis need to be refined before they can be readily applied in biomarker discovery and monitoring studies.</p>
</sec>
<sec><title>Results</title>
<p>Data sets obtained from a case study of oil pollution in the blue mussel were investigated for differential protein expression by retentate chromatography-mass spectrometry and decision tree classification. Different tissues and different settings were used to evaluate the discriminatory power of the classifiers. It was found that, due to the intrinsic variability of the data sets, reliable classification of unknown samples could only be achieved on a broad statistical basis (n > 60), with the observed expression changes showing high statistical significance and sufficient amplitude. The application of stringent criteria to guard against overfitting of the models eventually allowed satisfactory classification for only one of the investigated data sets and settings.</p>
</sec>
<sec><title>Conclusion</title>
<p>Machine learning techniques provide a promising approach to process and extract informative expression signatures from high-dimensional mass-spectrometry data. Even though characterisation of the proteins forming the expression signatures would be ideal, knowledge of the specific proteins is not mandatory for effective class discrimination. This may constitute a new biomarker approach in ecotoxicology, where working with organisms that do not have sequenced genomes renders protein identification by database searching problematic. However, data processing has to be critically evaluated and statistical constraints have to be considered before supervised classification algorithms are employed.</p>
</sec>
</div>
</front>
</TEI>
</record>
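As a rough illustration of how such a record might be read programmatically, the sketch below pulls the author names and DOI out of the XML with Python's standard library. It rests on two assumptions: the `wicri:` and `nlm:` prefixes are not declared in the record as shown, so placeholder namespace URIs are injected to let the parser accept it, and the helper names (`parse_record`, `summarize`) and the trimmed `SAMPLE` are hypothetical and not part of Dilib.

```python
import xml.etree.ElementTree as ET

# Placeholder namespace URIs (assumption): the record uses the wicri: and nlm:
# prefixes without declaring them, which a strict XML parser rejects.
ASSUMED_NS = 'xmlns:wicri="urn:example:wicri" xmlns:nlm="urn:example:nlm"'

# Trimmed sample following the element layout of the record above.
SAMPLE = """<record><TEI><teiHeader><fileDesc>
<publicationStmt><idno type="doi">10.1186/1477-5956-4-17</idno></publicationStmt>
<sourceDesc><biblStruct><analytic>
<author><name sortKey="Knigge, Thomas">Thomas Knigge</name>
<affiliation wicri:level="1"><nlm:aff id="I1">IRIS</nlm:aff></affiliation>
</author>
</analytic></biblStruct></sourceDesc>
</fileDesc></teiHeader></TEI></record>"""


def parse_record(xml_text):
    """Declare the assumed namespaces on the root element, then parse."""
    patched = xml_text.replace("<record>", f"<record {ASSUMED_NS}>", 1)
    return ET.fromstring(patched)


def summarize(record):
    """Collect author names from the analytic section and the DOI, if present."""
    analytic = record.find(".//sourceDesc/biblStruct/analytic")
    authors = [name.text for name in analytic.findall("author/name")]
    doi = record.findtext(".//publicationStmt/idno[@type='doi']")
    return {"authors": authors, "doi": doi}


if __name__ == "__main__":
    print(summarize(parse_record(SAMPLE)))
```

Reading authors from the `analytic` section only avoids counting each author twice, since the record repeats the author list under both `titleStmt` and `sourceDesc`.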
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Wicri/France/explor/LeHavreV1/Data/Ncbi/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000165 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/Ncbi/Curation/biblio.hfd -nk 000165 | SxmlIndent | more
To add a link to this page in the Wicri network
{{Explor lien |wiki= Wicri/France |area= LeHavreV1 |flux= Ncbi |étape= Curation |type= RBID |clé= PMC:1592071 |texte= Data processing and classification analysis of proteomic changes: a case study of oil pollution in the mussel, Mytilus edulis }}
To generate wiki pages
HfdIndexSelect -h $EXPLOR_AREA/Data/Ncbi/Curation/RBID.i -Sk "pubmed:16970821" \
  | HfdSelect -Kh $EXPLOR_AREA/Data/Ncbi/Curation/biblio.hfd \
  | NlmPubMed2Wicri -a LeHavreV1
This area was generated with Dilib version V0.6.25.