Serveur d'exploration autour du libre accès en Belgique

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Open data and open code for big science of science studies

Identifieur interne : 000131 ( PascalFrancis/Curation ); précédent : 000130; suivant : 000132

Open data and open code for big science of science studies

Auteurs : Robert P. Light [États-Unis] ; David E. Polley [États-Unis] ; Katy Börner [États-Unis]

Source :

RBID : Pascal:14-0270434

Descripteurs français

English descriptors

Abstract

Historically, science of science (Sci2) studies have been performed by single investigators or small teams. As the size and complexity of data sets and analyses scales up, a "Big Science" approach (Price, Little science, big science, 1963) is required that exploits the expertise and resources of interdisciplinary teams spanning academic, government, and industry boundaries. Big Sci2 studies utilize "big data", i.e., large, complex, diverse, longitudinal, and/or distributed datasets that might be owned by different stake-holders. They apply a systems science approach to uncover hidden patterns, bursts of activity, correlations, and laws. They make available open data and open code in support of replication of results, iterative refinement of approaches and tools, and education. This paper introduces a database-tool infrastructure that was designed to support big Sci2 studies. The open access Scholarly Database (http://sdb.cns.iu.edu) provides easy access to 26 million paper, patent, grant, and clinical trial records. The open source Sci2 tool (http:// sci2.cns.iu.edu) supports temporal, geospatial, topical, and network studies. The scalability of the infrastructure is examined. Results show that temporal analyses scale linearly with the number of records and file size, while the geospatial algorithm showed quadratic growth. The number of edges rather than nodes determined performance for network based algorithms.
pA  
A01 01  1    @0 0138-9130
A02 01      @0 SCNTDX
A03   1    @0 Scientometrics : (Print)
A05       @2 101
A06       @2 2
A08 01  1  ENG  @1 Open data and open code for big science of science studies
A09 01  1  ENG  @1 Selected Papers of the 14th International Conference of the International Society for Scientometrics and Informetrics (ISSI), July 15-19, 2013, Vienna, Austria
A11 01  1    @1 LIGHT (Robert P.)
A11 02  1    @1 POLLEY (David E.)
A11 03  1    @1 BÖRNER (Katy)
A12 01  1    @1 GORRAIZ (Juan) @9 ed.
A12 02  1    @1 GUMPENBERGER (Christian) @9 ed.
A12 03  1    @1 HÖRLESBERGER (Marianne) @9 ed.
A12 04  1    @1 MOED (Henk) @9 ed.
A12 05  1    @1 SCHIEBEL (Edgar) @9 ed.
A14 01      @1 Cyberinfrastructure for Network Science Center, School of Informatics and Computing, Indiana University @2 Bloomington, IN @3 USA @Z 1 aut. @Z 2 aut. @Z 3 aut.
A15 01      @1 Library and Archive Services, Bibliometrics Department, University of Vienna, Boltzmanngasse 5 @2 1090 Vienna @3 AUT @Z 1 aut. @Z 2 aut.
A15 02      @1 AIT Austrian Institute of Technology GmbH, Tech Gate Vienna, Donau-City-Strasse 1 @2 1220 Vienna @3 AUT @Z 3 aut. @Z 5 aut.
A15 03      @1 Elsevier B.V., Radarweg 29 @2 1043 NX Amsterdam @3 NLD @Z 4 aut.
A18 01  1    @1 University of Vienna @3 AUT @9 org-cong.
A18 02  1    @1 AIT Austrian Institute of Technology @3 AUT @9 org-cong.
A18 03  1    @1 International Society for Scientometrics and Informetrics (ISSI) @2 Leuven @3 BEL @9 org-cong.
A20       @1 1535-1551
A21       @1 2014
A23 01      @0 ENG
A43 01      @1 INIST @2 19049 @5 354000504566570370
A44       @0 0000 @1 © 2014 INIST-CNRS. All rights reserved.
A45       @0 3/4 p.
A47 01  1    @0 14-0270434
A60       @1 P @2 C
A61       @0 A
A64 01  1    @0 Scientometrics : (Print)
A66 01      @0 NLD
C01 01    ENG  @0 Historically, science of science (Sci2) studies have been performed by single investigators or small teams. As the size and complexity of data sets and analyses scales up, a "Big Science" approach (Price, Little science, big science, 1963) is required that exploits the expertise and resources of interdisciplinary teams spanning academic, government, and industry boundaries. Big Sci2 studies utilize "big data", i.e., large, complex, diverse, longitudinal, and/or distributed datasets that might be owned by different stake-holders. They apply a systems science approach to uncover hidden patterns, bursts of activity, correlations, and laws. They make available open data and open code in support of replication of results, iterative refinement of approaches and tools, and education. This paper introduces a database-tool infrastructure that was designed to support big Sci2 studies. The open access Scholarly Database (http://sdb.cns.iu.edu) provides easy access to 26 million paper, patent, grant, and clinical trial records. The open source Sci2 tool (http:// sci2.cns.iu.edu) supports temporal, geospatial, topical, and network studies. The scalability of the infrastructure is examined. Results show that temporal analyses scale linearly with the number of records and file size, while the geospatial algorithm showed quadratic growth. The number of edges rather than nodes determined performance for network based algorithms.
C02 01  X    @0 001A01A02
C03 01  X  FRE  @0 Analyse donnée @5 04
C03 01  X  ENG  @0 Data analysis @5 04
C03 01  X  SPA  @0 Análisis datos @5 04
C03 02  X  FRE  @0 Interdisciplinaire @5 05
C03 02  X  ENG  @0 Interdisciplinary field @5 05
C03 02  X  SPA  @0 Interdisciplinario @5 05
C03 03  X  FRE  @0 Base de données @5 06
C03 03  X  ENG  @0 Database @5 06
C03 03  X  SPA  @0 Base dato @5 06
C03 04  X  FRE  @0 Brevet @5 07
C03 04  X  ENG  @0 Patents @5 07
C03 04  X  SPA  @0 Patente @5 07
C03 05  X  FRE  @0 Algorithme @5 08
C03 05  X  ENG  @0 Algorithm @5 08
C03 05  X  SPA  @0 Algoritmo @5 08
C03 06  X  FRE  @0 Croissance @5 09
C03 06  X  ENG  @0 Growth @5 09
C03 06  X  SPA  @0 Crecimiento @5 09
C03 07  X  FRE  @0 Recherche scientifique @5 10
C03 07  X  ENG  @0 Scientific research @5 10
C03 07  X  SPA  @0 Investigación científica @5 10
N21       @1 335
N44 01      @1 OTO
N82       @1 OTO
pR  
A30 01  1  ENG  @1 International Conference of the International Society for Scientometrics and Informetrics @2 14 @3 Vienna AUT @4 2013-07-15

Links toward previous steps (curation, corpus...)


Links to Exploration step

Pascal:14-0270434

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Open data and open code for big science of science studies</title>
<author>
<name sortKey="Light, Robert P" sort="Light, Robert P" uniqKey="Light R" first="Robert P." last="Light">Robert P. Light</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Cyberinfrastructure for Network Science Center, School of Informatics and Computing, Indiana University</s1>
<s2>Bloomington, IN</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
</affiliation>
</author>
<author>
<name sortKey="Polley, David E" sort="Polley, David E" uniqKey="Polley D" first="David E." last="Polley">David E. Polley</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Cyberinfrastructure for Network Science Center, School of Informatics and Computing, Indiana University</s1>
<s2>Bloomington, IN</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
</affiliation>
</author>
<author>
<name sortKey="Borner, Katy" sort="Borner, Katy" uniqKey="Borner K" first="Katy" last="Börner">Katy Börner</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Cyberinfrastructure for Network Science Center, School of Informatics and Computing, Indiana University</s1>
<s2>Bloomington, IN</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">14-0270434</idno>
<date when="2014">2014</date>
<idno type="stanalyst">PASCAL 14-0270434 INIST</idno>
<idno type="RBID">Pascal:14-0270434</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000004</idno>
<idno type="stanalyst">FRANCIS 14-0270434 INIST</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000013</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000131</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Open data and open code for big science of science studies</title>
<author>
<name sortKey="Light, Robert P" sort="Light, Robert P" uniqKey="Light R" first="Robert P." last="Light">Robert P. Light</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Cyberinfrastructure for Network Science Center, School of Informatics and Computing, Indiana University</s1>
<s2>Bloomington, IN</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
</affiliation>
</author>
<author>
<name sortKey="Polley, David E" sort="Polley, David E" uniqKey="Polley D" first="David E." last="Polley">David E. Polley</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Cyberinfrastructure for Network Science Center, School of Informatics and Computing, Indiana University</s1>
<s2>Bloomington, IN</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
</affiliation>
</author>
<author>
<name sortKey="Borner, Katy" sort="Borner, Katy" uniqKey="Borner K" first="Katy" last="Börner">Katy Börner</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Cyberinfrastructure for Network Science Center, School of Informatics and Computing, Indiana University</s1>
<s2>Bloomington, IN</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>États-Unis</country>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">Scientometrics : (Print)</title>
<title level="j" type="abbreviated">Scientometrics : (Print)</title>
<idno type="ISSN">0138-9130</idno>
<imprint>
<date when="2014">2014</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Scientometrics : (Print)</title>
<title level="j" type="abbreviated">Scientometrics : (Print)</title>
<idno type="ISSN">0138-9130</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Algorithm</term>
<term>Data analysis</term>
<term>Database</term>
<term>Growth</term>
<term>Interdisciplinary field</term>
<term>Patents</term>
<term>Scientific research</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Analyse donnée</term>
<term>Interdisciplinaire</term>
<term>Base de données</term>
<term>Brevet</term>
<term>Algorithme</term>
<term>Croissance</term>
<term>Recherche scientifique</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr">
<term>Base de données</term>
<term>Brevet</term>
<term>Recherche scientifique</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Historically, science of science (Sci2) studies have been performed by single investigators or small teams. As the size and complexity of data sets and analyses scales up, a "Big Science" approach (Price, Little science, big science, 1963) is required that exploits the expertise and resources of interdisciplinary teams spanning academic, government, and industry boundaries. Big Sci2 studies utilize "big data", i.e., large, complex, diverse, longitudinal, and/or distributed datasets that might be owned by different stake-holders. They apply a systems science approach to uncover hidden patterns, bursts of activity, correlations, and laws. They make available open data and open code in support of replication of results, iterative refinement of approaches and tools, and education. This paper introduces a database-tool infrastructure that was designed to support big Sci2 studies. The open access Scholarly Database (http://sdb.cns.iu.edu) provides easy access to 26 million paper, patent, grant, and clinical trial records. The open source Sci2 tool (http:// sci2.cns.iu.edu) supports temporal, geospatial, topical, and network studies. The scalability of the infrastructure is examined. Results show that temporal analyses scale linearly with the number of records and file size, while the geospatial algorithm showed quadratic growth. The number of edges rather than nodes determined performance for network based algorithms.</div>
</front>
</TEI>
<inist>
<standard h6="B">
<pA>
<fA01 i1="01" i2="1">
<s0>0138-9130</s0>
</fA01>
<fA02 i1="01">
<s0>SCNTDX</s0>
</fA02>
<fA03 i2="1">
<s0>Scientometrics : (Print)</s0>
</fA03>
<fA05>
<s2>101</s2>
</fA05>
<fA06>
<s2>2</s2>
</fA06>
<fA08 i1="01" i2="1" l="ENG">
<s1>Open data and open code for big science of science studies</s1>
</fA08>
<fA09 i1="01" i2="1" l="ENG">
<s1>Selected Papers of the 14th International Conference of the International Society for Scientometrics and Informetrics (ISSI), July 15-19, 2013, Vienna, Austria</s1>
</fA09>
<fA11 i1="01" i2="1">
<s1>LIGHT (Robert P.)</s1>
</fA11>
<fA11 i1="02" i2="1">
<s1>POLLEY (David E.)</s1>
</fA11>
<fA11 i1="03" i2="1">
<s1>BÖRNER (Katy)</s1>
</fA11>
<fA12 i1="01" i2="1">
<s1>GORRAIZ (Juan)</s1>
<s9>ed.</s9>
</fA12>
<fA12 i1="02" i2="1">
<s1>GUMPENBERGER (Christian)</s1>
<s9>ed.</s9>
</fA12>
<fA12 i1="03" i2="1">
<s1>HÖRLESBERGER (Marianne)</s1>
<s9>ed.</s9>
</fA12>
<fA12 i1="04" i2="1">
<s1>MOED (Henk)</s1>
<s9>ed.</s9>
</fA12>
<fA12 i1="05" i2="1">
<s1>SCHIEBEL (Edgar)</s1>
<s9>ed.</s9>
</fA12>
<fA14 i1="01">
<s1>Cyberinfrastructure for Network Science Center, School of Informatics and Computing, Indiana University</s1>
<s2>Bloomington, IN</s2>
<s3>USA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</fA14>
<fA15 i1="01">
<s1>Library and Archive Services, Bibliometrics Department, University of Vienna, Boltzmanngasse 5</s1>
<s2>1090 Vienna</s2>
<s3>AUT</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</fA15>
<fA15 i1="02">
<s1>AIT Austrian Institute of Technology GmbH, Tech Gate Vienna, Donau-City-Strasse 1</s1>
<s2>1220 Vienna</s2>
<s3>AUT</s3>
<sZ>3 aut.</sZ>
<sZ>5 aut.</sZ>
</fA15>
<fA15 i1="03">
<s1>Elsevier B.V., Radarweg 29</s1>
<s2>1043 NX Amsterdam</s2>
<s3>NLD</s3>
<sZ>4 aut.</sZ>
</fA15>
<fA18 i1="01" i2="1">
<s1>University of Vienna</s1>
<s3>AUT</s3>
<s9>org-cong.</s9>
</fA18>
<fA18 i1="02" i2="1">
<s1>AIT Austrian Institute of Technology</s1>
<s3>AUT</s3>
<s9>org-cong.</s9>
</fA18>
<fA18 i1="03" i2="1">
<s1>International Society for Scientometrics and Informetrics (ISSI)</s1>
<s2>Leuven</s2>
<s3>BEL</s3>
<s9>org-cong.</s9>
</fA18>
<fA20>
<s1>1535-1551</s1>
</fA20>
<fA21>
<s1>2014</s1>
</fA21>
<fA23 i1="01">
<s0>ENG</s0>
</fA23>
<fA43 i1="01">
<s1>INIST</s1>
<s2>19049</s2>
<s5>354000504566570370</s5>
</fA43>
<fA44>
<s0>0000</s0>
<s1>© 2014 INIST-CNRS. All rights reserved.</s1>
</fA44>
<fA45>
<s0>3/4 p.</s0>
</fA45>
<fA47 i1="01" i2="1">
<s0>14-0270434</s0>
</fA47>
<fA60>
<s1>P</s1>
<s2>C</s2>
</fA60>
<fA61>
<s0>A</s0>
</fA61>
<fA64 i1="01" i2="1">
<s0>Scientometrics : (Print)</s0>
</fA64>
<fA66 i1="01">
<s0>NLD</s0>
</fA66>
<fC01 i1="01" l="ENG">
<s0>Historically, science of science (Sci2) studies have been performed by single investigators or small teams. As the size and complexity of data sets and analyses scales up, a "Big Science" approach (Price, Little science, big science, 1963) is required that exploits the expertise and resources of interdisciplinary teams spanning academic, government, and industry boundaries. Big Sci2 studies utilize "big data", i.e., large, complex, diverse, longitudinal, and/or distributed datasets that might be owned by different stake-holders. They apply a systems science approach to uncover hidden patterns, bursts of activity, correlations, and laws. They make available open data and open code in support of replication of results, iterative refinement of approaches and tools, and education. This paper introduces a database-tool infrastructure that was designed to support big Sci2 studies. The open access Scholarly Database (http://sdb.cns.iu.edu) provides easy access to 26 million paper, patent, grant, and clinical trial records. The open source Sci2 tool (http:// sci2.cns.iu.edu) supports temporal, geospatial, topical, and network studies. The scalability of the infrastructure is examined. Results show that temporal analyses scale linearly with the number of records and file size, while the geospatial algorithm showed quadratic growth. The number of edges rather than nodes determined performance for network based algorithms.</s0>
</fC01>
<fC02 i1="01" i2="X">
<s0>001A01A02</s0>
</fC02>
<fC03 i1="01" i2="X" l="FRE">
<s0>Analyse donnée</s0>
<s5>04</s5>
</fC03>
<fC03 i1="01" i2="X" l="ENG">
<s0>Data analysis</s0>
<s5>04</s5>
</fC03>
<fC03 i1="01" i2="X" l="SPA">
<s0>Análisis datos</s0>
<s5>04</s5>
</fC03>
<fC03 i1="02" i2="X" l="FRE">
<s0>Interdisciplinaire</s0>
<s5>05</s5>
</fC03>
<fC03 i1="02" i2="X" l="ENG">
<s0>Interdisciplinary field</s0>
<s5>05</s5>
</fC03>
<fC03 i1="02" i2="X" l="SPA">
<s0>Interdisciplinario</s0>
<s5>05</s5>
</fC03>
<fC03 i1="03" i2="X" l="FRE">
<s0>Base de données</s0>
<s5>06</s5>
</fC03>
<fC03 i1="03" i2="X" l="ENG">
<s0>Database</s0>
<s5>06</s5>
</fC03>
<fC03 i1="03" i2="X" l="SPA">
<s0>Base dato</s0>
<s5>06</s5>
</fC03>
<fC03 i1="04" i2="X" l="FRE">
<s0>Brevet</s0>
<s5>07</s5>
</fC03>
<fC03 i1="04" i2="X" l="ENG">
<s0>Patents</s0>
<s5>07</s5>
</fC03>
<fC03 i1="04" i2="X" l="SPA">
<s0>Patente</s0>
<s5>07</s5>
</fC03>
<fC03 i1="05" i2="X" l="FRE">
<s0>Algorithme</s0>
<s5>08</s5>
</fC03>
<fC03 i1="05" i2="X" l="ENG">
<s0>Algorithm</s0>
<s5>08</s5>
</fC03>
<fC03 i1="05" i2="X" l="SPA">
<s0>Algoritmo</s0>
<s5>08</s5>
</fC03>
<fC03 i1="06" i2="X" l="FRE">
<s0>Croissance</s0>
<s5>09</s5>
</fC03>
<fC03 i1="06" i2="X" l="ENG">
<s0>Growth</s0>
<s5>09</s5>
</fC03>
<fC03 i1="06" i2="X" l="SPA">
<s0>Crecimiento</s0>
<s5>09</s5>
</fC03>
<fC03 i1="07" i2="X" l="FRE">
<s0>Recherche scientifique</s0>
<s5>10</s5>
</fC03>
<fC03 i1="07" i2="X" l="ENG">
<s0>Scientific research</s0>
<s5>10</s5>
</fC03>
<fC03 i1="07" i2="X" l="SPA">
<s0>Investigación científica</s0>
<s5>10</s5>
</fC03>
<fN21>
<s1>335</s1>
</fN21>
<fN44 i1="01">
<s1>OTO</s1>
</fN44>
<fN82>
<s1>OTO</s1>
</fN82>
</pA>
<pR>
<fA30 i1="01" i2="1" l="ENG">
<s1>International Conference of the International Society for Scientometrics and Informetrics</s1>
<s2>14</s2>
<s3>Vienna AUT</s3>
<s4>2013-07-15</s4>
</fA30>
</pR>
</standard>
</inist>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Belgique/explor/OpenAccessBelV2/Data/PascalFrancis/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000131 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Curation/biblio.hfd -nk 000131 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Belgique
   |area=    OpenAccessBelV2
   |flux=    PascalFrancis
   |étape=   Curation
   |type=    RBID
   |clé=     Pascal:14-0270434
   |texte=   Open data and open code for big science of science studies
}}

Wicri

This area was generated with Dilib version V0.6.25.
Data generation: Thu Dec 1 00:43:49 2016. Site generation: Wed Mar 6 14:51:30 2024