Cyberinfrastructure Exploration Server

Warning: this site is under development.
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Enabling Big Geoscience Data Analytics with a Cloud-Based, MapReduce-Enabled and Service-Oriented Workflow Framework

Internal identifier: 000110 (Pmc/Checkpoint); previous: 000109; next: 000111

Authors: Zhenlong Li [United States]; Chaowei Yang [United States]; Baoxuan Jin [United States, People's Republic of China]; Manzhu Yu [United States]; Kai Liu [United States]; Min Sun [United States]; Matthew Zhan [United States]

RBID: PMC:4351198

Abstract

Geoscience observations and model simulations are generating vast amounts of multi-dimensional data. Effectively analyzing these data is essential for geoscience studies. However, the task is challenging for geoscientists because processing such massive data is both computing- and data-intensive, and the analytics require complex procedures and multiple tools. To tackle these challenges, a scientific workflow framework is proposed for big geoscience data analytics. The framework leverages cloud computing, MapReduce, and Service-Oriented Architecture (SOA). Specifically, HBase is adopted for storing and managing big geoscience data across distributed computers; a MapReduce-based algorithm framework is developed to support parallel processing of geoscience data; and a service-oriented workflow architecture is built to support on-demand complex data analytics in the cloud environment. A proof-of-concept prototype tests the performance of the framework. Results show that the framework significantly improves the efficiency of big geoscience data analytics by reducing data processing time and simplifying analytical procedures for geoscientists.
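
As a rough illustration of the MapReduce-style parallel processing mentioned in the abstract, the Python sketch below computes a per-variable mean over gridded records using a map phase, a grouping step, and a reduce phase. It is not the authors' implementation: the record layout, the function names (`map_record`, `shuffle`, `reduce_values`, `mean_by_variable`), and the sample values are all assumptions made for illustration, and everything runs in a single process rather than on Hadoop or HBase.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

# Hypothetical record layout (an assumption, not taken from the paper):
# (variable name, time step, grid-cell value)
Record = Tuple[str, int, float]
Partial = Tuple[float, int]  # (running sum, count)


def map_record(record: Record) -> Tuple[str, Partial]:
    """Map phase: emit (variable, (value, 1)) for a single record."""
    variable, _time_step, value = record
    return variable, (value, 1)


def shuffle(pairs: Iterable[Tuple[str, Partial]]) -> Dict[str, List[Partial]]:
    """Group intermediate pairs by key, as a MapReduce runtime does between phases."""
    grouped: Dict[str, List[Partial]] = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return dict(grouped)


def reduce_values(values: List[Partial]) -> float:
    """Reduce phase: combine the partial sums and counts into a mean."""
    total = sum(value for value, _ in values)
    count = sum(n for _, n in values)
    return total / count


def mean_by_variable(records: Iterable[Record]) -> Dict[str, float]:
    """Run map, shuffle, and reduce sequentially in one process."""
    grouped = shuffle(map_record(r) for r in records)
    return {key: reduce_values(values) for key, values in grouped.items()}


# Made-up sample values, for illustration only.
SAMPLE: List[Record] = [
    ("temperature", 0, 280.0),
    ("temperature", 1, 284.0),
    ("precipitation", 0, 2.0),
    ("precipitation", 1, 4.0),
]

if __name__ == "__main__":
    print(mean_by_variable(SAMPLE))
```

Emitting (sum, count) pairs rather than finished means keeps the reduce step associative, so in a distributed setting partial results from different nodes could be combined in any order.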


DOI: 10.1371/journal.pone.0116781
PubMed: 25742012
PubMed Central: 4351198


The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Enabling Big Geoscience Data Analytics with a Cloud-Based, MapReduce-Enabled and Service-Oriented Workflow Framework</title>
<author>
<name sortKey="Li, Zhenlong" sort="Li, Zhenlong" uniqKey="Li Z" first="Zhenlong" last="Li">Zhenlong Li</name>
<affiliation wicri:level="2">
<nlm:aff id="aff001">
<addr-line>NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, VA, United States of America</addr-line>
</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, VA</wicri:regionArea>
<placeName>
<region type="state">Virginie</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Yang, Chaowei" sort="Yang, Chaowei" uniqKey="Yang C" first="Chaowei" last="Yang">Chaowei Yang</name>
<affiliation wicri:level="2">
<nlm:aff id="aff001">
<addr-line>NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, VA, United States of America</addr-line>
</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, VA</wicri:regionArea>
<placeName>
<region type="state">Virginie</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Jin, Baoxuan" sort="Jin, Baoxuan" uniqKey="Jin B" first="Baoxuan" last="Jin">Baoxuan Jin</name>
<affiliation wicri:level="2">
<nlm:aff id="aff001">
<addr-line>NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, VA, United States of America</addr-line>
</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, VA</wicri:regionArea>
<placeName>
<region type="state">Virginie</region>
</placeName>
</affiliation>
<affiliation wicri:level="1">
<nlm:aff id="aff002">
<addr-line>Yunnan Provincial Geomatics Center, Yunnan Bureau of Surveying, Mapping, and GeoInformation, Kunming, Yunnan, China</addr-line>
</nlm:aff>
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Yunnan Provincial Geomatics Center, Yunnan Bureau of Surveying, Mapping, and GeoInformation, Kunming, Yunnan</wicri:regionArea>
<wicri:noRegion>Yunnan</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Yu, Manzhu" sort="Yu, Manzhu" uniqKey="Yu M" first="Manzhu" last="Yu">Manzhu Yu</name>
<affiliation wicri:level="2">
<nlm:aff id="aff001">
<addr-line>NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, VA, United States of America</addr-line>
</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, VA</wicri:regionArea>
<placeName>
<region type="state">Virginie</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Liu, Kai" sort="Liu, Kai" uniqKey="Liu K" first="Kai" last="Liu">Kai Liu</name>
<affiliation wicri:level="2">
<nlm:aff id="aff001">
<addr-line>NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, VA, United States of America</addr-line>
</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, VA</wicri:regionArea>
<placeName>
<region type="state">Virginie</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Sun, Min" sort="Sun, Min" uniqKey="Sun M" first="Min" last="Sun">Min Sun</name>
<affiliation wicri:level="2">
<nlm:aff id="aff001">
<addr-line>NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, VA, United States of America</addr-line>
</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, VA</wicri:regionArea>
<placeName>
<region type="state">Virginie</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Zhan, Matthew" sort="Zhan, Matthew" uniqKey="Zhan M" first="Matthew" last="Zhan">Matthew Zhan</name>
<affiliation wicri:level="2">
<nlm:aff id="aff001">
<addr-line>NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, VA, United States of America</addr-line>
</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, VA</wicri:regionArea>
<placeName>
<region type="state">Virginie</region>
</placeName>
</affiliation>
<affiliation wicri:level="2">
<nlm:aff id="aff003">
<addr-line>Department of Computer Science, University of Texas—Austin, Austin, Texas, United States of America</addr-line>
</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Computer Science, University of Texas—Austin, Austin, Texas</wicri:regionArea>
<placeName>
<region type="state">Texas</region>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">25742012</idno>
<idno type="pmc">4351198</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4351198</idno>
<idno type="RBID">PMC:4351198</idno>
<idno type="doi">10.1371/journal.pone.0116781</idno>
<date when="2015">2015</date>
<idno type="wicri:Area/Pmc/Corpus">000069</idno>
<idno type="wicri:Area/Pmc/Curation">000069</idno>
<idno type="wicri:Area/Pmc/Checkpoint">000110</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">Enabling Big Geoscience Data Analytics with a Cloud-Based, MapReduce-Enabled and Service-Oriented Workflow Framework</title>
<author>
<name sortKey="Li, Zhenlong" sort="Li, Zhenlong" uniqKey="Li Z" first="Zhenlong" last="Li">Zhenlong Li</name>
<affiliation wicri:level="2">
<nlm:aff id="aff001">
<addr-line>NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, VA, United States of America</addr-line>
</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, VA</wicri:regionArea>
<placeName>
<region type="state">Virginie</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Yang, Chaowei" sort="Yang, Chaowei" uniqKey="Yang C" first="Chaowei" last="Yang">Chaowei Yang</name>
<affiliation wicri:level="2">
<nlm:aff id="aff001">
<addr-line>NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, VA, United States of America</addr-line>
</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, VA</wicri:regionArea>
<placeName>
<region type="state">Virginie</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Jin, Baoxuan" sort="Jin, Baoxuan" uniqKey="Jin B" first="Baoxuan" last="Jin">Baoxuan Jin</name>
<affiliation wicri:level="2">
<nlm:aff id="aff001">
<addr-line>NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, VA, United States of America</addr-line>
</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, VA</wicri:regionArea>
<placeName>
<region type="state">Virginie</region>
</placeName>
</affiliation>
<affiliation wicri:level="1">
<nlm:aff id="aff002">
<addr-line>Yunnan Provincial Geomatics Center, Yunnan Bureau of Surveying, Mapping, and GeoInformation, Kunming, Yunnan, China</addr-line>
</nlm:aff>
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Yunnan Provincial Geomatics Center, Yunnan Bureau of Surveying, Mapping, and GeoInformation, Kunming, Yunnan</wicri:regionArea>
<wicri:noRegion>Yunnan</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Yu, Manzhu" sort="Yu, Manzhu" uniqKey="Yu M" first="Manzhu" last="Yu">Manzhu Yu</name>
<affiliation wicri:level="2">
<nlm:aff id="aff001">
<addr-line>NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, VA, United States of America</addr-line>
</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, VA</wicri:regionArea>
<placeName>
<region type="state">Virginie</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Liu, Kai" sort="Liu, Kai" uniqKey="Liu K" first="Kai" last="Liu">Kai Liu</name>
<affiliation wicri:level="2">
<nlm:aff id="aff001">
<addr-line>NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, VA, United States of America</addr-line>
</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, VA</wicri:regionArea>
<placeName>
<region type="state">Virginie</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Sun, Min" sort="Sun, Min" uniqKey="Sun M" first="Min" last="Sun">Min Sun</name>
<affiliation wicri:level="2">
<nlm:aff id="aff001">
<addr-line>NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, VA, United States of America</addr-line>
</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, VA</wicri:regionArea>
<placeName>
<region type="state">Virginie</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Zhan, Matthew" sort="Zhan, Matthew" uniqKey="Zhan M" first="Matthew" last="Zhan">Matthew Zhan</name>
<affiliation wicri:level="2">
<nlm:aff id="aff001">
<addr-line>NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, VA, United States of America</addr-line>
</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, VA</wicri:regionArea>
<placeName>
<region type="state">Virginie</region>
</placeName>
</affiliation>
<affiliation wicri:level="2">
<nlm:aff id="aff003">
<addr-line>Department of Computer Science, University of Texas—Austin, Austin, Texas, United States of America</addr-line>
</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>Department of Computer Science, University of Texas—Austin, Austin, Texas</wicri:regionArea>
<placeName>
<region type="state">Texas</region>
</placeName>
</affiliation>
</author>
</analytic>
<series>
<title level="j">PLoS ONE</title>
<idno type="eISSN">1932-6203</idno>
<imprint>
<date when="2015">2015</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<p>Geoscience observations and model simulations are generating vast amounts of multi-dimensional data. Effectively analyzing these data is essential for geoscience studies. However, the task is challenging for geoscientists because processing such massive data is both computing- and data-intensive, and the analytics require complex procedures and multiple tools. To tackle these challenges, a scientific workflow framework is proposed for big geoscience data analytics. The framework leverages cloud computing, MapReduce, and Service-Oriented Architecture (SOA). Specifically, HBase is adopted for storing and managing big geoscience data across distributed computers; a MapReduce-based algorithm framework is developed to support parallel processing of geoscience data; and a service-oriented workflow architecture is built to support on-demand complex data analytics in the cloud environment. A proof-of-concept prototype tests the performance of the framework. Results show that the framework significantly improves the efficiency of big geoscience data analytics by reducing data processing time and simplifying analytical procedures for geoscientists.</p>
</div>
</front>
</TEI>
<pmc article-type="research-article">
<pmc-dir>properties open_access</pmc-dir>
<front>
<journal-meta>
<journal-id journal-id-type="nlm-ta">PLoS One</journal-id>
<journal-id journal-id-type="iso-abbrev">PLoS ONE</journal-id>
<journal-id journal-id-type="publisher-id">plos</journal-id>
<journal-id journal-id-type="pmc">plosone</journal-id>
<journal-title-group>
<journal-title>PLoS ONE</journal-title>
</journal-title-group>
<issn pub-type="epub">1932-6203</issn>
<publisher>
<publisher-name>Public Library of Science</publisher-name>
<publisher-loc>San Francisco, CA USA</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="pmid">25742012</article-id>
<article-id pub-id-type="pmc">4351198</article-id>
<article-id pub-id-type="doi">10.1371/journal.pone.0116781</article-id>
<article-id pub-id-type="publisher-id">PONE-D-14-42409</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Research Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Enabling Big Geoscience Data Analytics with a Cloud-Based, MapReduce-Enabled and Service-Oriented Workflow Framework</article-title>
<alt-title alt-title-type="running-head">Developing a New Framework to Enable Big Geoscience Data Analytics</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Li</surname>
<given-names>Zhenlong</given-names>
</name>
<xref ref-type="aff" rid="aff001">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Yang</surname>
<given-names>Chaowei</given-names>
</name>
<xref ref-type="aff" rid="aff001">
<sup>1</sup>
</xref>
<xref rid="cor001" ref-type="corresp">*</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Jin</surname>
<given-names>Baoxuan</given-names>
</name>
<xref ref-type="aff" rid="aff001">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff002">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Yu</surname>
<given-names>Manzhu</given-names>
</name>
<xref ref-type="aff" rid="aff001">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Liu</surname>
<given-names>Kai</given-names>
</name>
<xref ref-type="aff" rid="aff001">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Sun</surname>
<given-names>Min</given-names>
</name>
<xref ref-type="aff" rid="aff001">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Zhan</surname>
<given-names>Matthew</given-names>
</name>
<xref ref-type="aff" rid="aff001">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff003">
<sup>3</sup>
</xref>
</contrib>
</contrib-group>
<aff id="aff001">
<label>1</label>
<addr-line>NSF Spatiotemporal Innovation Center, George Mason University, Fairfax, VA, United States of America</addr-line>
</aff>
<aff id="aff002">
<label>2</label>
<addr-line>Yunnan Provincial Geomatics Center, Yunnan Bureau of Surveying, Mapping, and GeoInformation, Kunming, Yunnan, China</addr-line>
</aff>
<aff id="aff003">
<label>3</label>
<addr-line>Department of Computer Science, University of Texas—Austin, Austin, Texas, United States of America</addr-line>
</aff>
<contrib-group>
<contrib contrib-type="editor">
<name>
<surname>Gomez-Gesteira</surname>
<given-names>Moncho</given-names>
</name>
<role>Academic Editor</role>
<xref ref-type="aff" rid="edit1"></xref>
</contrib>
</contrib-group>
<aff id="edit1">
<addr-line>University of Vigo, SPAIN</addr-line>
</aff>
<author-notes>
<fn fn-type="conflict" id="coi001">
<p>
<bold>Competing Interests: </bold>
The authors have declared that no competing interests exist.</p>
</fn>
<fn fn-type="con" id="contrib001">
<p>Conceived and designed the experiments: CY ZL BJ. Performed the experiments: ZL KL MY MZ. Analyzed the data: ZL CY BJ KL MY. Contributed reagents/materials/analysis tools: ZL CY BJ MS KL. Wrote the paper: ZL CY MY BJ.</p>
</fn>
<corresp id="cor001">* E-mail:
<email>cyang3@gmu.edu</email>
</corresp>
</author-notes>
<pub-date pub-type="epub">
<day>5</day>
<month>3</month>
<year>2015</year>
</pub-date>
<pub-date pub-type="collection">
<year>2015</year>
</pub-date>
<volume>10</volume>
<issue>3</issue>
<elocation-id>e0116781</elocation-id>
<history>
<date date-type="received">
<day>20</day>
<month>9</month>
<year>2014</year>
</date>
<date date-type="accepted">
<day>14</day>
<month>12</month>
<year>2014</year>
</date>
</history>
<permissions>
<copyright-year>2015</copyright-year>
<copyright-holder>Li et al</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<license-p>This is an open access article distributed under the terms of the
<ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution License</ext-link>
, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.</license-p>
</license>
</permissions>
<self-uri content-type="pdf" xlink:type="simple" xlink:href="pone.0116781.pdf"></self-uri>
<abstract>
<p>Geoscience observations and model simulations are generating vast amounts of multi-dimensional data. Effectively analyzing these data is essential for geoscience studies. However, the task is challenging for geoscientists because processing such massive amounts of data is both computing and data intensive, and data analytics requires complex procedures and multiple tools. To tackle these challenges, a scientific workflow framework is proposed for big geoscience data analytics. The framework leverages cloud computing, MapReduce, and Service Oriented Architecture (SOA). Specifically, HBase is adopted for storing and managing big geoscience data across distributed computers, a MapReduce-based algorithm framework is developed to support parallel processing of geoscience data, and a service-oriented workflow architecture is built to support on-demand complex data analytics in the cloud environment. A proof-of-concept prototype tests the performance of the framework. Results show that the framework significantly improves the efficiency of big geoscience data analytics by reducing data processing time and simplifying data analytical procedures for geoscientists.</p>
</abstract>
<funding-group>
<funding-statement>This research is supported by NSF (PLR-1349259, IIP-1338925, CNS-1117300) and NASA (NNG12PP37I). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.</funding-statement>
</funding-group>
<counts>
<fig-count count="13"></fig-count>
<table-count count="2"></table-count>
<page-count count="23"></page-count>
</counts>
<custom-meta-group>
<custom-meta id="data-availability">
<meta-name>Data Availability</meta-name>
<meta-value>All relevant data are within the paper.</meta-value>
</custom-meta>
</custom-meta-group>
</article-meta>
<notes>
<title>Data Availability</title>
<p>All relevant data are within the paper.</p>
</notes>
</front>
</pmc>
<affiliations>
<list>
<country>
<li>République populaire de Chine</li>
<li>États-Unis</li>
</country>
<region>
<li>Texas</li>
<li>Virginie</li>
</region>
</list>
<tree>
<country name="États-Unis">
<region name="Virginie">
<name sortKey="Li, Zhenlong" sort="Li, Zhenlong" uniqKey="Li Z" first="Zhenlong" last="Li">Zhenlong Li</name>
</region>
<name sortKey="Jin, Baoxuan" sort="Jin, Baoxuan" uniqKey="Jin B" first="Baoxuan" last="Jin">Baoxuan Jin</name>
<name sortKey="Liu, Kai" sort="Liu, Kai" uniqKey="Liu K" first="Kai" last="Liu">Kai Liu</name>
<name sortKey="Sun, Min" sort="Sun, Min" uniqKey="Sun M" first="Min" last="Sun">Min Sun</name>
<name sortKey="Yang, Chaowei" sort="Yang, Chaowei" uniqKey="Yang C" first="Chaowei" last="Yang">Chaowei Yang</name>
<name sortKey="Yu, Manzhu" sort="Yu, Manzhu" uniqKey="Yu M" first="Manzhu" last="Yu">Manzhu Yu</name>
<name sortKey="Zhan, Matthew" sort="Zhan, Matthew" uniqKey="Zhan M" first="Matthew" last="Zhan">Matthew Zhan</name>
</country>
<country name="République populaire de Chine">
<noRegion>
<name sortKey="Jin, Baoxuan" sort="Jin, Baoxuan" uniqKey="Jin B" first="Baoxuan" last="Jin">Baoxuan Jin</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/CyberinfraV1/Data/Pmc/Checkpoint
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000110 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Pmc/Checkpoint/biblio.hfd -nk 000110 | SxmlIndent | more

To add a link to this page in the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    CyberinfraV1
   |flux=    Pmc
   |étape=   Checkpoint
   |type=    RBID
   |clé=     PMC:4351198
   |texte=   Enabling Big Geoscience Data Analytics with a Cloud-Based, MapReduce-Enabled and Service-Oriented Workflow Framework
}}

To generate wiki pages

HfdIndexSelect -h $EXPLOR_AREA/Data/Pmc/Checkpoint/RBID.i   -Sk "pubmed:25742012" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Pmc/Checkpoint/biblio.hfd   \
       | NlmPubMed2Wicri -a CyberinfraV1 

This area was generated with Dilib version V0.6.25.
Data generation: Thu Oct 27 09:30:58 2016. Site generation: Sun Mar 10 23:08:40 2024