Open data and open code for big science of science studies
Identifieur interne : 000278 ( Main/Exploration ); précédent : 000277; suivant : 000279Open data and open code for big science of science studies
Auteurs : Robert P. Light [États-Unis] ; David E. Polley [États-Unis] ; Katy Börner [États-Unis]Source :
- Scientometrics : (Print) [ 0138-9130 ] ; 2014.
Descripteurs français
- Pascal (Inist)
- Wicri :
- topic : Base de données, Brevet, Recherche scientifique.
English descriptors
- KwdEn :
Abstract
Historically, science of science (Sci2) studies have been performed by single investigators or small teams. As the size and complexity of data sets and analyses scales up, a "Big Science" approach (Price, Little science, big science, 1963) is required that exploits the expertise and resources of interdisciplinary teams spanning academic, government, and industry boundaries. Big Sci2 studies utilize "big data", i.e., large, complex, diverse, longitudinal, and/or distributed datasets that might be owned by different stake-holders. They apply a systems science approach to uncover hidden patterns, bursts of activity, correlations, and laws. They make available open data and open code in support of replication of results, iterative refinement of approaches and tools, and education. This paper introduces a database-tool infrastructure that was designed to support big Sci2 studies. The open access Scholarly Database (http://sdb.cns.iu.edu) provides easy access to 26 million paper, patent, grant, and clinical trial records. The open source Sci2 tool (http:// sci2.cns.iu.edu) supports temporal, geospatial, topical, and network studies. The scalability of the infrastructure is examined. Results show that temporal analyses scale linearly with the number of records and file size, while the geospatial algorithm showed quadratic growth. The number of edges rather than nodes determined performance for network based algorithms.
Affiliations:
Links toward previous steps (curation, corpus...)
- to stream PascalFrancis, to step Corpus: 000004
- to stream PascalFrancis, to step Corpus: 000013
- to stream PascalFrancis, to step Curation: 000131
- to stream PascalFrancis, to step Checkpoint: 000005
- to stream Main, to step Merge: 000278
- to stream Main, to step Curation: 000278
Le document en format XML
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" level="a">Open data and open code for big science of science studies</title>
<author><name sortKey="Light, Robert P" sort="Light, Robert P" uniqKey="Light R" first="Robert P." last="Light">Robert P. Light</name>
<affiliation wicri:level="2"><inist:fA14 i1="01"><s1>Cyberinfrastructure for Network Science Center, School of Informatics and Computing, Indiana University</s1>
<s2>Bloomington, IN</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName><region type="state">Indiana</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Polley, David E" sort="Polley, David E" uniqKey="Polley D" first="David E." last="Polley">David E. Polley</name>
<affiliation wicri:level="2"><inist:fA14 i1="01"><s1>Cyberinfrastructure for Network Science Center, School of Informatics and Computing, Indiana University</s1>
<s2>Bloomington, IN</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName><region type="state">Indiana</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Borner, Katy" sort="Borner, Katy" uniqKey="Borner K" first="Katy" last="Börner">Katy Börner</name>
<affiliation wicri:level="2"><inist:fA14 i1="01"><s1>Cyberinfrastructure for Network Science Center, School of Informatics and Computing, Indiana University</s1>
<s2>Bloomington, IN</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName><region type="state">Indiana</region>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">INIST</idno>
<idno type="inist">14-0270434</idno>
<date when="2014">2014</date>
<idno type="stanalyst">PASCAL 14-0270434 INIST</idno>
<idno type="RBID">Pascal:14-0270434</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000004</idno>
<idno type="stanalyst">FRANCIS 14-0270434 INIST</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000013</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000131</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000005</idno>
<idno type="wicri:doubleKey">0138-9130:2014:Light R:open:data:and</idno>
<idno type="wicri:Area/Main/Merge">000278</idno>
<idno type="wicri:Area/Main/Curation">000278</idno>
<idno type="wicri:Area/Main/Exploration">000278</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a">Open data and open code for big science of science studies</title>
<author><name sortKey="Light, Robert P" sort="Light, Robert P" uniqKey="Light R" first="Robert P." last="Light">Robert P. Light</name>
<affiliation wicri:level="2"><inist:fA14 i1="01"><s1>Cyberinfrastructure for Network Science Center, School of Informatics and Computing, Indiana University</s1>
<s2>Bloomington, IN</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName><region type="state">Indiana</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Polley, David E" sort="Polley, David E" uniqKey="Polley D" first="David E." last="Polley">David E. Polley</name>
<affiliation wicri:level="2"><inist:fA14 i1="01"><s1>Cyberinfrastructure for Network Science Center, School of Informatics and Computing, Indiana University</s1>
<s2>Bloomington, IN</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName><region type="state">Indiana</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Borner, Katy" sort="Borner, Katy" uniqKey="Borner K" first="Katy" last="Börner">Katy Börner</name>
<affiliation wicri:level="2"><inist:fA14 i1="01"><s1>Cyberinfrastructure for Network Science Center, School of Informatics and Computing, Indiana University</s1>
<s2>Bloomington, IN</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName><region type="state">Indiana</region>
</placeName>
</affiliation>
</author>
</analytic>
<series><title level="j" type="main">Scientometrics : (Print)</title>
<title level="j" type="abbreviated">Scientometrics : (Print)</title>
<idno type="ISSN">0138-9130</idno>
<imprint><date when="2014">2014</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><title level="j" type="main">Scientometrics : (Print)</title>
<title level="j" type="abbreviated">Scientometrics : (Print)</title>
<idno type="ISSN">0138-9130</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Algorithm</term>
<term>Data analysis</term>
<term>Database</term>
<term>Growth</term>
<term>Interdisciplinary field</term>
<term>Patents</term>
<term>Scientific research</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Analyse donnée</term>
<term>Interdisciplinaire</term>
<term>Base de données</term>
<term>Brevet</term>
<term>Algorithme</term>
<term>Croissance</term>
<term>Recherche scientifique</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr"><term>Base de données</term>
<term>Brevet</term>
<term>Recherche scientifique</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Historically, science of science (Sci2) studies have been performed by single investigators or small teams. As the size and complexity of data sets and analyses scales up, a "Big Science" approach (Price, Little science, big science, 1963) is required that exploits the expertise and resources of interdisciplinary teams spanning academic, government, and industry boundaries. Big Sci2 studies utilize "big data", i.e., large, complex, diverse, longitudinal, and/or distributed datasets that might be owned by different stake-holders. They apply a systems science approach to uncover hidden patterns, bursts of activity, correlations, and laws. They make available open data and open code in support of replication of results, iterative refinement of approaches and tools, and education. This paper introduces a database-tool infrastructure that was designed to support big Sci2 studies. The open access Scholarly Database (http://sdb.cns.iu.edu) provides easy access to 26 million paper, patent, grant, and clinical trial records. The open source Sci2 tool (http:// sci2.cns.iu.edu) supports temporal, geospatial, topical, and network studies. The scalability of the infrastructure is examined. Results show that temporal analyses scale linearly with the number of records and file size, while the geospatial algorithm showed quadratic growth. The number of edges rather than nodes determined performance for network based algorithms.</div>
</front>
</TEI>
<affiliations><list><country><li>États-Unis</li>
</country>
<region><li>Indiana</li>
</region>
</list>
<tree><country name="États-Unis"><region name="Indiana"><name sortKey="Light, Robert P" sort="Light, Robert P" uniqKey="Light R" first="Robert P." last="Light">Robert P. Light</name>
</region>
<name sortKey="Borner, Katy" sort="Borner, Katy" uniqKey="Borner K" first="Katy" last="Börner">Katy Börner</name>
<name sortKey="Polley, David E" sort="Polley, David E" uniqKey="Polley D" first="David E." last="Polley">David E. Polley</name>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Wicri/Belgique/explor/OpenAccessBelV2/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000278 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000278 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Wicri/Belgique |area= OpenAccessBelV2 |flux= Main |étape= Exploration |type= RBID |clé= Pascal:14-0270434 |texte= Open data and open code for big science of science studies }}
This area was generated with Dilib version V0.6.25. |