Serveur d'exploration autour du libre accès en Belgique

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Open data and open code for big science of science studies

Identifieur interne : 000005 ( PascalFrancis/Checkpoint ); précédent : 000004; suivant : 000006

Open data and open code for big science of science studies

Auteurs : Robert P. Light [États-Unis] ; David E. Polley [États-Unis] ; Katy Börner [États-Unis]

Source :

RBID : Pascal:14-0270434

Descripteurs français

English descriptors

Abstract

Historically, science of science (Sci2) studies have been performed by single investigators or small teams. As the size and complexity of data sets and analyses scales up, a "Big Science" approach (Price, Little science, big science, 1963) is required that exploits the expertise and resources of interdisciplinary teams spanning academic, government, and industry boundaries. Big Sci2 studies utilize "big data", i.e., large, complex, diverse, longitudinal, and/or distributed datasets that might be owned by different stake-holders. They apply a systems science approach to uncover hidden patterns, bursts of activity, correlations, and laws. They make available open data and open code in support of replication of results, iterative refinement of approaches and tools, and education. This paper introduces a database-tool infrastructure that was designed to support big Sci2 studies. The open access Scholarly Database (http://sdb.cns.iu.edu) provides easy access to 26 million paper, patent, grant, and clinical trial records. The open source Sci2 tool (http:// sci2.cns.iu.edu) supports temporal, geospatial, topical, and network studies. The scalability of the infrastructure is examined. Results show that temporal analyses scale linearly with the number of records and file size, while the geospatial algorithm showed quadratic growth. The number of edges rather than nodes determined performance for network based algorithms.


Affiliations:


Links toward previous steps (curation, corpus...)


Links to Exploration step

Pascal:14-0270434

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Open data and open code for big science of science studies</title>
<author>
<name sortKey="Light, Robert P" sort="Light, Robert P" uniqKey="Light R" first="Robert P." last="Light">Robert P. Light</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>Cyberinfrastructure for Network Science Center, School of Informatics and Computing, Indiana University</s1>
<s2>Bloomington, IN</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Indiana</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Polley, David E" sort="Polley, David E" uniqKey="Polley D" first="David E." last="Polley">David E. Polley</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>Cyberinfrastructure for Network Science Center, School of Informatics and Computing, Indiana University</s1>
<s2>Bloomington, IN</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Indiana</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Borner, Katy" sort="Borner, Katy" uniqKey="Borner K" first="Katy" last="Börner">Katy Börner</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>Cyberinfrastructure for Network Science Center, School of Informatics and Computing, Indiana University</s1>
<s2>Bloomington, IN</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Indiana</region>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">14-0270434</idno>
<date when="2014">2014</date>
<idno type="stanalyst">PASCAL 14-0270434 INIST</idno>
<idno type="RBID">Pascal:14-0270434</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000004</idno>
<idno type="stanalyst">FRANCIS 14-0270434 INIST</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000013</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000131</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000005</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Open data and open code for big science of science studies</title>
<author>
<name sortKey="Light, Robert P" sort="Light, Robert P" uniqKey="Light R" first="Robert P." last="Light">Robert P. Light</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>Cyberinfrastructure for Network Science Center, School of Informatics and Computing, Indiana University</s1>
<s2>Bloomington, IN</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Indiana</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Polley, David E" sort="Polley, David E" uniqKey="Polley D" first="David E." last="Polley">David E. Polley</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>Cyberinfrastructure for Network Science Center, School of Informatics and Computing, Indiana University</s1>
<s2>Bloomington, IN</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Indiana</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Borner, Katy" sort="Borner, Katy" uniqKey="Borner K" first="Katy" last="Börner">Katy Börner</name>
<affiliation wicri:level="2">
<inist:fA14 i1="01">
<s1>Cyberinfrastructure for Network Science Center, School of Informatics and Computing, Indiana University</s1>
<s2>Bloomington, IN</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
<placeName>
<region type="state">Indiana</region>
</placeName>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">Scientometrics : (Print)</title>
<title level="j" type="abbreviated">Scientometrics : (Print)</title>
<idno type="ISSN">0138-9130</idno>
<imprint>
<date when="2014">2014</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Scientometrics : (Print)</title>
<title level="j" type="abbreviated">Scientometrics : (Print)</title>
<idno type="ISSN">0138-9130</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Algorithm</term>
<term>Data analysis</term>
<term>Database</term>
<term>Growth</term>
<term>Interdisciplinary field</term>
<term>Patents</term>
<term>Scientific research</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Analyse donnée</term>
<term>Interdisciplinaire</term>
<term>Base de données</term>
<term>Brevet</term>
<term>Algorithme</term>
<term>Croissance</term>
<term>Recherche scientifique</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr">
<term>Base de données</term>
<term>Brevet</term>
<term>Recherche scientifique</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Historically, science of science (Sci2) studies have been performed by single investigators or small teams. As the size and complexity of data sets and analyses scales up, a "Big Science" approach (Price, Little science, big science, 1963) is required that exploits the expertise and resources of interdisciplinary teams spanning academic, government, and industry boundaries. Big Sci2 studies utilize "big data", i.e., large, complex, diverse, longitudinal, and/or distributed datasets that might be owned by different stake-holders. They apply a systems science approach to uncover hidden patterns, bursts of activity, correlations, and laws. They make available open data and open code in support of replication of results, iterative refinement of approaches and tools, and education. This paper introduces a database-tool infrastructure that was designed to support big Sci2 studies. The open access Scholarly Database (http://sdb.cns.iu.edu) provides easy access to 26 million paper, patent, grant, and clinical trial records. The open source Sci2 tool (http:// sci2.cns.iu.edu) supports temporal, geospatial, topical, and network studies. The scalability of the infrastructure is examined. Results show that temporal analyses scale linearly with the number of records and file size, while the geospatial algorithm showed quadratic growth. The number of edges rather than nodes determined performance for network based algorithms.</div>
</front>
</TEI>
<inist>
<standard h6="B">
<pA>
<fA01 i1="01" i2="1">
<s0>0138-9130</s0>
</fA01>
<fA02 i1="01">
<s0>SCNTDX</s0>
</fA02>
<fA03 i2="1">
<s0>Scientometrics : (Print)</s0>
</fA03>
<fA05>
<s2>101</s2>
</fA05>
<fA06>
<s2>2</s2>
</fA06>
<fA08 i1="01" i2="1" l="ENG">
<s1>Open data and open code for big science of science studies</s1>
</fA08>
<fA09 i1="01" i2="1" l="ENG">
<s1>Selected Papers of the 14th International Conference of the International Society for Scientometrics and Informetrics (ISSI), July 15-19, 2013, Vienna, Austria</s1>
</fA09>
<fA11 i1="01" i2="1">
<s1>LIGHT (Robert P.)</s1>
</fA11>
<fA11 i1="02" i2="1">
<s1>POLLEY (David E.)</s1>
</fA11>
<fA11 i1="03" i2="1">
<s1>BÖRNER (Katy)</s1>
</fA11>
<fA12 i1="01" i2="1">
<s1>GORRAIZ (Juan)</s1>
<s9>ed.</s9>
</fA12>
<fA12 i1="02" i2="1">
<s1>GUMPENBERGER (Christian)</s1>
<s9>ed.</s9>
</fA12>
<fA12 i1="03" i2="1">
<s1>HÖRLESBERGER (Marianne)</s1>
<s9>ed.</s9>
</fA12>
<fA12 i1="04" i2="1">
<s1>MOED (Henk)</s1>
<s9>ed.</s9>
</fA12>
<fA12 i1="05" i2="1">
<s1>SCHIEBEL (Edgar)</s1>
<s9>ed.</s9>
</fA12>
<fA14 i1="01">
<s1>Cyberinfrastructure for Network Science Center, School of Informatics and Computing, Indiana University</s1>
<s2>Bloomington, IN</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</fA14>
<fA15 i1="01">
<s1>Library and Archive Services, Bibliometrics Department, University of Vienna, Boltzmanngasse 5</s1>
<s2>1090 Vienna</s2>
<s3>AUT</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</fA15>
<fA15 i1="02">
<s1>AIT Austrian Institute of Technology GmbH, Tech Gate Vienna, Donau-City-Strasse 1</s1>
<s2>1220 Vienna</s2>
<s3>AUT</s3>
<sZ>3 aut.</sZ>
<sZ>5 aut.</sZ>
</fA15>
<fA15 i1="03">
<s1>Elsevier B.V., Radarweg 29</s1>
<s2>1043 NX Amsterdam</s2>
<s3>NLD</s3>
<sZ>4 aut.</sZ>
</fA15>
<fA18 i1="01" i2="1">
<s1>University of Vienna</s1>
<s3>AUT</s3>
<s9>org-cong.</s9>
</fA18>
<fA18 i1="02" i2="1">
<s1>AIT Austrian Institute of Technology</s1>
<s3>AUT</s3>
<s9>org-cong.</s9>
</fA18>
<fA18 i1="03" i2="1">
<s1>International Society for Scientometrics and Informetrics (ISSI)</s1>
<s2>Leuven</s2>
<s3>BEL</s3>
<s9>org-cong.</s9>
</fA18>
<fA20>
<s1>1535-1551</s1>
</fA20>
<fA21>
<s1>2014</s1>
</fA21>
<fA23 i1="01">
<s0>ENG</s0>
</fA23>
<fA43 i1="01">
<s1>INIST</s1>
<s2>19049</s2>
<s5>354000504566570370</s5>
</fA43>
<fA44>
<s0>0000</s0>
<s1>© 2014 INIST-CNRS. All rights reserved.</s1>
</fA44>
<fA45>
<s0>3/4 p.</s0>
</fA45>
<fA47 i1="01" i2="1">
<s0>14-0270434</s0>
</fA47>
<fA60>
<s1>P</s1>
<s2>C</s2>
</fA60>
<fA61>
<s0>A</s0>
</fA61>
<fA64 i1="01" i2="1">
<s0>Scientometrics : (Print)</s0>
</fA64>
<fA66 i1="01">
<s0>NLD</s0>
</fA66>
<fC01 i1="01" l="ENG">
<s0>Historically, science of science (Sci2) studies have been performed by single investigators or small teams. As the size and complexity of data sets and analyses scales up, a "Big Science" approach (Price, Little science, big science, 1963) is required that exploits the expertise and resources of interdisciplinary teams spanning academic, government, and industry boundaries. Big Sci2 studies utilize "big data", i.e., large, complex, diverse, longitudinal, and/or distributed datasets that might be owned by different stake-holders. They apply a systems science approach to uncover hidden patterns, bursts of activity, correlations, and laws. They make available open data and open code in support of replication of results, iterative refinement of approaches and tools, and education. This paper introduces a database-tool infrastructure that was designed to support big Sci2 studies. The open access Scholarly Database (http://sdb.cns.iu.edu) provides easy access to 26 million paper, patent, grant, and clinical trial records. The open source Sci2 tool (http:// sci2.cns.iu.edu) supports temporal, geospatial, topical, and network studies. The scalability of the infrastructure is examined. Results show that temporal analyses scale linearly with the number of records and file size, while the geospatial algorithm showed quadratic growth. The number of edges rather than nodes determined performance for network based algorithms.</s0>
</fC01>
<fC02 i1="01" i2="X">
<s0>001A01A02</s0>
</fC02>
<fC03 i1="01" i2="X" l="FRE">
<s0>Analyse donnée</s0>
<s5>04</s5>
</fC03>
<fC03 i1="01" i2="X" l="ENG">
<s0>Data analysis</s0>
<s5>04</s5>
</fC03>
<fC03 i1="01" i2="X" l="SPA">
<s0>Análisis datos</s0>
<s5>04</s5>
</fC03>
<fC03 i1="02" i2="X" l="FRE">
<s0>Interdisciplinaire</s0>
<s5>05</s5>
</fC03>
<fC03 i1="02" i2="X" l="ENG">
<s0>Interdisciplinary field</s0>
<s5>05</s5>
</fC03>
<fC03 i1="02" i2="X" l="SPA">
<s0>Interdisciplinario</s0>
<s5>05</s5>
</fC03>
<fC03 i1="03" i2="X" l="FRE">
<s0>Base de données</s0>
<s5>06</s5>
</fC03>
<fC03 i1="03" i2="X" l="ENG">
<s0>Database</s0>
<s5>06</s5>
</fC03>
<fC03 i1="03" i2="X" l="SPA">
<s0>Base dato</s0>
<s5>06</s5>
</fC03>
<fC03 i1="04" i2="X" l="FRE">
<s0>Brevet</s0>
<s5>07</s5>
</fC03>
<fC03 i1="04" i2="X" l="ENG">
<s0>Patents</s0>
<s5>07</s5>
</fC03>
<fC03 i1="04" i2="X" l="SPA">
<s0>Patente</s0>
<s5>07</s5>
</fC03>
<fC03 i1="05" i2="X" l="FRE">
<s0>Algorithme</s0>
<s5>08</s5>
</fC03>
<fC03 i1="05" i2="X" l="ENG">
<s0>Algorithm</s0>
<s5>08</s5>
</fC03>
<fC03 i1="05" i2="X" l="SPA">
<s0>Algoritmo</s0>
<s5>08</s5>
</fC03>
<fC03 i1="06" i2="X" l="FRE">
<s0>Croissance</s0>
<s5>09</s5>
</fC03>
<fC03 i1="06" i2="X" l="ENG">
<s0>Growth</s0>
<s5>09</s5>
</fC03>
<fC03 i1="06" i2="X" l="SPA">
<s0>Crecimiento</s0>
<s5>09</s5>
</fC03>
<fC03 i1="07" i2="X" l="FRE">
<s0>Recherche scientifique</s0>
<s5>10</s5>
</fC03>
<fC03 i1="07" i2="X" l="ENG">
<s0>Scientific research</s0>
<s5>10</s5>
</fC03>
<fC03 i1="07" i2="X" l="SPA">
<s0>Investigación científica</s0>
<s5>10</s5>
</fC03>
<fN21>
<s1>335</s1>
</fN21>
<fN44 i1="01">
<s1>OTO</s1>
</fN44>
<fN82>
<s1>OTO</s1>
</fN82>
</pA>
<pR>
<fA30 i1="01" i2="1" l="ENG">
<s1>International Conference of the International Society for Scientometrics and Informetrics</s1>
<s2>14</s2>
<s3>Vienna AUT</s3>
<s4>2013-07-15</s4>
</fA30>
</pR>
</standard>
</inist>
<affiliations>
<list>
<country>
<li>États-Unis</li>
</country>
<region>
<li>Indiana</li>
</region>
</list>
<tree>
<country name="États-Unis">
<region name="Indiana">
<name sortKey="Light, Robert P" sort="Light, Robert P" uniqKey="Light R" first="Robert P." last="Light">Robert P. Light</name>
</region>
<name sortKey="Borner, Katy" sort="Borner, Katy" uniqKey="Borner K" first="Katy" last="Börner">Katy Börner</name>
<name sortKey="Polley, David E" sort="Polley, David E" uniqKey="Polley D" first="David E." last="Polley">David E. Polley</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Belgique/explor/OpenAccessBelV2/Data/PascalFrancis/Checkpoint
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000005 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Checkpoint/biblio.hfd -nk 000005 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Belgique
   |area=    OpenAccessBelV2
   |flux=    PascalFrancis
   |étape=   Checkpoint
   |type=    RBID
   |clé=     Pascal:14-0270434
   |texte=   Open data and open code for big science of science studies
}}

Wicri

This area was generated with Dilib version V0.6.25.
Data generation: Thu Dec 1 00:43:49 2016. Site generation: Wed Mar 6 14:51:30 2024