Serveur d'exploration sur Mozart

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Interactions of Cultures and Top People of Wikipedia from Ranking of 24 Language Editions

Identifieur interne : 000792 ( Ncbi/Merge ); précédent : 000791; suivant : 000793

Interactions of Cultures and Top People of Wikipedia from Ranking of 24 Language Editions

Auteurs : Young-Ho Eom [France] ; Pablo Arag N [Espagne] ; David Laniado [Espagne] ; Andreas Kaltenbrunner [Espagne] ; Sebastiano Vigna [Italie] ; Dima L. Shepelyansky [France]

Source :

RBID : PMC:4349893

Abstract

Wikipedia is a huge global repository of human knowledge that can be leveraged to investigate interwinements between cultures. With this aim, we apply methods of Markov chains and Google matrix for the analysis of the hyperlink networks of 24 Wikipedia language editions, and rank all their articles by PageRank, 2DRank and CheiRank algorithms. Using automatic extraction of people names, we obtain the top 100 historical figures, for each edition and for each algorithm. We investigate their spatial, temporal, and gender distributions in dependence of their cultural origins. Our study demonstrates not only the existence of skewness with local figures, mainly recognized only in their own cultures, but also the existence of global historical figures appearing in a large number of editions. By determining the birth time and place of these persons, we perform an analysis of the evolution of such figures through 35 centuries of human history for each language, thus recovering interactions and entanglement of cultures over time. We also obtain the distributions of historical figures over world countries, highlighting geographical aspects of cross-cultural links. Considering historical figures who appear in multiple editions as interactions between cultures, we construct a network of cultures and identify the most influential cultures according to this network.


Url:
DOI: 10.1371/journal.pone.0114825
PubMed: 25738291
PubMed Central: 4349893

Links toward previous steps (curation, corpus...)


Links to Exploration step

PMC:4349893

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Interactions of Cultures and Top People of Wikipedia from Ranking of 24 Language Editions</title>
<author>
<name sortKey="Eom, Young Ho" sort="Eom, Young Ho" uniqKey="Eom Y" first="Young-Ho" last="Eom">Young-Ho Eom</name>
<affiliation wicri:level="3">
<nlm:aff id="aff001">
<addr-line>Laboratoire de Physique Théorique du CNRS, IRSAMC, Université de Toulouse, UPS, F-31062 Toulouse, France</addr-line>
</nlm:aff>
<country xml:lang="fr">France</country>
<wicri:regionArea>Laboratoire de Physique Théorique du CNRS, IRSAMC, Université de Toulouse, UPS, F-31062 Toulouse</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Languedoc-Roussillon-Midi-Pyrénées</region>
<region type="old region" nuts="2">Midi-Pyrénées</region>
<settlement type="city">Toulouse</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Arag N, Pablo" sort="Arag N, Pablo" uniqKey="Arag N P" first="Pablo" last="Arag N">Pablo Arag N</name>
<affiliation wicri:level="3">
<nlm:aff id="aff002">
<addr-line>Barcelona Media Foundation, Barcelona, Spain</addr-line>
</nlm:aff>
<country xml:lang="fr">Espagne</country>
<wicri:regionArea>Barcelona Media Foundation, Barcelona</wicri:regionArea>
<placeName>
<settlement type="city">Barcelone</settlement>
<region nuts="2" type="region">Catalogne</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Laniado, David" sort="Laniado, David" uniqKey="Laniado D" first="David" last="Laniado">David Laniado</name>
<affiliation wicri:level="3">
<nlm:aff id="aff002">
<addr-line>Barcelona Media Foundation, Barcelona, Spain</addr-line>
</nlm:aff>
<country xml:lang="fr">Espagne</country>
<wicri:regionArea>Barcelona Media Foundation, Barcelona</wicri:regionArea>
<placeName>
<settlement type="city">Barcelone</settlement>
<region nuts="2" type="region">Catalogne</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Kaltenbrunner, Andreas" sort="Kaltenbrunner, Andreas" uniqKey="Kaltenbrunner A" first="Andreas" last="Kaltenbrunner">Andreas Kaltenbrunner</name>
<affiliation wicri:level="3">
<nlm:aff id="aff002">
<addr-line>Barcelona Media Foundation, Barcelona, Spain</addr-line>
</nlm:aff>
<country xml:lang="fr">Espagne</country>
<wicri:regionArea>Barcelona Media Foundation, Barcelona</wicri:regionArea>
<placeName>
<settlement type="city">Barcelone</settlement>
<region nuts="2" type="region">Catalogne</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Vigna, Sebastiano" sort="Vigna, Sebastiano" uniqKey="Vigna S" first="Sebastiano" last="Vigna">Sebastiano Vigna</name>
<affiliation wicri:level="1">
<nlm:aff id="aff003">
<addr-line>Dipartimento di Informatica, Università degli Studi di Milano, Milano, Italy</addr-line>
</nlm:aff>
<country xml:lang="fr">Italie</country>
<wicri:regionArea>Dipartimento di Informatica, Università degli Studi di Milano, Milano</wicri:regionArea>
<wicri:noRegion>Milano</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Shepelyansky, Dima L" sort="Shepelyansky, Dima L" uniqKey="Shepelyansky D" first="Dima L." last="Shepelyansky">Dima L. Shepelyansky</name>
<affiliation wicri:level="3">
<nlm:aff id="aff001">
<addr-line>Laboratoire de Physique Théorique du CNRS, IRSAMC, Université de Toulouse, UPS, F-31062 Toulouse, France</addr-line>
</nlm:aff>
<country xml:lang="fr">France</country>
<wicri:regionArea>Laboratoire de Physique Théorique du CNRS, IRSAMC, Université de Toulouse, UPS, F-31062 Toulouse</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Languedoc-Roussillon-Midi-Pyrénées</region>
<region type="old region" nuts="2">Midi-Pyrénées</region>
<settlement type="city">Toulouse</settlement>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">25738291</idno>
<idno type="pmc">4349893</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4349893</idno>
<idno type="RBID">PMC:4349893</idno>
<idno type="doi">10.1371/journal.pone.0114825</idno>
<date when="2015">2015</date>
<idno type="wicri:Area/Pmc/Corpus">000042</idno>
<idno type="wicri:Area/Pmc/Curation">000042</idno>
<idno type="wicri:Area/Pmc/Checkpoint">000039</idno>
<idno type="wicri:Area/Ncbi/Merge">000792</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">Interactions of Cultures and Top People of Wikipedia from Ranking of 24 Language Editions</title>
<author>
<name sortKey="Eom, Young Ho" sort="Eom, Young Ho" uniqKey="Eom Y" first="Young-Ho" last="Eom">Young-Ho Eom</name>
<affiliation wicri:level="3">
<nlm:aff id="aff001">
<addr-line>Laboratoire de Physique Théorique du CNRS, IRSAMC, Université de Toulouse, UPS, F-31062 Toulouse, France</addr-line>
</nlm:aff>
<country xml:lang="fr">France</country>
<wicri:regionArea>Laboratoire de Physique Théorique du CNRS, IRSAMC, Université de Toulouse, UPS, F-31062 Toulouse</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Languedoc-Roussillon-Midi-Pyrénées</region>
<region type="old region" nuts="2">Midi-Pyrénées</region>
<settlement type="city">Toulouse</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Arag N, Pablo" sort="Arag N, Pablo" uniqKey="Arag N P" first="Pablo" last="Arag N">Pablo Arag N</name>
<affiliation wicri:level="3">
<nlm:aff id="aff002">
<addr-line>Barcelona Media Foundation, Barcelona, Spain</addr-line>
</nlm:aff>
<country xml:lang="fr">Espagne</country>
<wicri:regionArea>Barcelona Media Foundation, Barcelona</wicri:regionArea>
<placeName>
<settlement type="city">Barcelone</settlement>
<region nuts="2" type="region">Catalogne</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Laniado, David" sort="Laniado, David" uniqKey="Laniado D" first="David" last="Laniado">David Laniado</name>
<affiliation wicri:level="3">
<nlm:aff id="aff002">
<addr-line>Barcelona Media Foundation, Barcelona, Spain</addr-line>
</nlm:aff>
<country xml:lang="fr">Espagne</country>
<wicri:regionArea>Barcelona Media Foundation, Barcelona</wicri:regionArea>
<placeName>
<settlement type="city">Barcelone</settlement>
<region nuts="2" type="region">Catalogne</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Kaltenbrunner, Andreas" sort="Kaltenbrunner, Andreas" uniqKey="Kaltenbrunner A" first="Andreas" last="Kaltenbrunner">Andreas Kaltenbrunner</name>
<affiliation wicri:level="3">
<nlm:aff id="aff002">
<addr-line>Barcelona Media Foundation, Barcelona, Spain</addr-line>
</nlm:aff>
<country xml:lang="fr">Espagne</country>
<wicri:regionArea>Barcelona Media Foundation, Barcelona</wicri:regionArea>
<placeName>
<settlement type="city">Barcelone</settlement>
<region nuts="2" type="region">Catalogne</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Vigna, Sebastiano" sort="Vigna, Sebastiano" uniqKey="Vigna S" first="Sebastiano" last="Vigna">Sebastiano Vigna</name>
<affiliation wicri:level="1">
<nlm:aff id="aff003">
<addr-line>Dipartimento di Informatica, Università degli Studi di Milano, Milano, Italy</addr-line>
</nlm:aff>
<country xml:lang="fr">Italie</country>
<wicri:regionArea>Dipartimento di Informatica, Università degli Studi di Milano, Milano</wicri:regionArea>
<wicri:noRegion>Milano</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Shepelyansky, Dima L" sort="Shepelyansky, Dima L" uniqKey="Shepelyansky D" first="Dima L." last="Shepelyansky">Dima L. Shepelyansky</name>
<affiliation wicri:level="3">
<nlm:aff id="aff001">
<addr-line>Laboratoire de Physique Théorique du CNRS, IRSAMC, Université de Toulouse, UPS, F-31062 Toulouse, France</addr-line>
</nlm:aff>
<country xml:lang="fr">France</country>
<wicri:regionArea>Laboratoire de Physique Théorique du CNRS, IRSAMC, Université de Toulouse, UPS, F-31062 Toulouse</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Languedoc-Roussillon-Midi-Pyrénées</region>
<region type="old region" nuts="2">Midi-Pyrénées</region>
<settlement type="city">Toulouse</settlement>
</placeName>
</affiliation>
</author>
</analytic>
<series>
<title level="j">PLoS ONE</title>
<idno type="e-ISSN">1932-6203</idno>
<imprint>
<date when="2015">2015</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<p>Wikipedia is a huge global repository of human knowledge that can be leveraged to investigate interwinements between cultures. With this aim, we apply methods of Markov chains and Google matrix for the analysis of the hyperlink networks of 24 Wikipedia language editions, and rank all their articles by PageRank, 2DRank and CheiRank algorithms. Using automatic extraction of people names, we obtain the top 100 historical figures, for each edition and for each algorithm. We investigate their spatial, temporal, and gender distributions in dependence of their cultural origins. Our study demonstrates not only the existence of skewness with local figures, mainly recognized only in their own cultures, but also the existence of global historical figures appearing in a large number of editions. By determining the birth time and place of these persons, we perform an analysis of the evolution of such figures through 35 centuries of human history for each language, thus recovering interactions and entanglement of cultures over time. We also obtain the distributions of historical figures over world countries, highlighting geographical aspects of cross-cultural links. Considering historical figures who appear in multiple editions as interactions between cultures, we construct a network of cultures and identify the most influential cultures according to this network.</p>
</div>
</front>
<back>
<div1 type="bibliography">
<listBibl>
<biblStruct>
<analytic>
<author>
<name sortKey="Rosenzweig, R" uniqKey="Rosenzweig R">R Rosenzweig</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Lavsa, Sm" uniqKey="Lavsa S">SM Lavsa</name>
</author>
<author>
<name sortKey="Corman, Sl" uniqKey="Corman S">SL Corman</name>
</author>
<author>
<name sortKey="Culley, Cm" uniqKey="Culley C">CM Culley</name>
</author>
<author>
<name sortKey="Pummer, Tl" uniqKey="Pummer T">TL Pummer</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Giles, J" uniqKey="Giles J">J Giles</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kittur, A" uniqKey="Kittur A">A Kittur</name>
</author>
<author>
<name sortKey="Chi, Eh" uniqKey="Chi E">EH Chi</name>
</author>
<author>
<name sortKey="Suh, B" uniqKey="Suh B">B Suh</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Priedhorsky, R" uniqKey="Priedhorsky R">R Priedhorsky</name>
</author>
<author>
<name sortKey="Chen, J" uniqKey="Chen J">J Chen</name>
</author>
<author>
<name sortKey="Lam, Stk" uniqKey="Lam S">STK Lam</name>
</author>
<author>
<name sortKey="Panciera, K" uniqKey="Panciera K">K Panciera</name>
</author>
<author>
<name sortKey="Terveen, L" uniqKey="Terveen L">L Terveen</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Yasseri, T" uniqKey="Yasseri T">T Yasseri</name>
</author>
<author>
<name sortKey="Sumi, R" uniqKey="Sumi R">R Sumi</name>
</author>
<author>
<name sortKey="Rung, A" uniqKey="Rung A">A Rung</name>
</author>
<author>
<name sortKey="Kornai, A" uniqKey="Kornai A">A Kornai</name>
</author>
<author>
<name sortKey="Kertesz, J" uniqKey="Kertesz J">J Kertész</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Laniado, D" uniqKey="Laniado D">D Laniado</name>
</author>
<author>
<name sortKey="Tasso, R" uniqKey="Tasso R">R Tasso</name>
</author>
<author>
<name sortKey="Volkovich Y Kaltenbrunner, A" uniqKey="Volkovich Y Kaltenbrunner A">A Volkovich Y. Kaltenbrunner</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Pfeil, U" uniqKey="Pfeil U">U Pfeil</name>
</author>
<author>
<name sortKey="Zaphiris, P" uniqKey="Zaphiris P">P Zaphiris</name>
</author>
<author>
<name sortKey="Ang C, A" uniqKey="Ang C A">A Ang C</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Callahan, Es" uniqKey="Callahan E">ES Callahan</name>
</author>
<author>
<name sortKey="Herriing, Sc" uniqKey="Herriing S">SC Herriing</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Hecht, B" uniqKey="Hecht B">B Hecht</name>
</author>
<author>
<name sortKey="Gergle, D" uniqKey="Gergle D">D Gergle</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Hecht, B" uniqKey="Hecht B">B Hecht</name>
</author>
<author>
<name sortKey="Gergle, D" uniqKey="Gergle D">D Gergle</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Nemoto, K" uniqKey="Nemoto K">K Nemoto</name>
</author>
<author>
<name sortKey="Gloor, Pa" uniqKey="Gloor P">PA Gloor</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Warncke Wang, M" uniqKey="Warncke Wang M">M Warncke-Wang</name>
</author>
<author>
<name sortKey="Uduwage, A" uniqKey="Uduwage A">A Uduwage</name>
</author>
<author>
<name sortKey="Dong, Z" uniqKey="Dong Z">Z Dong</name>
</author>
<author>
<name sortKey="Riedl, J" uniqKey="Riedl J">J Riedl</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Massa, P" uniqKey="Massa P">P Massa</name>
</author>
<author>
<name sortKey="Scrinzi, F" uniqKey="Scrinzi F">F Scrinzi</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Zhirov, Ao" uniqKey="Zhirov A">AO Zhirov</name>
</author>
<author>
<name sortKey="Zhirov, Ov" uniqKey="Zhirov O">OV Zhirov</name>
</author>
<author>
<name sortKey="Shepelyansky, Dl" uniqKey="Shepelyansky D">DL Shepelyansky</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Eom, Yh" uniqKey="Eom Y">YH Eom</name>
</author>
<author>
<name sortKey="Frahm, Km" uniqKey="Frahm K">KM Frahm</name>
</author>
<author>
<name sortKey="Benc Ur, A" uniqKey="Benc Ur A">A Bencźur</name>
</author>
<author>
<name sortKey="Shepelyansky, Dl" uniqKey="Shepelyansky D">DL Shepelyansky</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Arag N, P" uniqKey="Arag N P">P Aragón</name>
</author>
<author>
<name sortKey="Laniado, D" uniqKey="Laniado D">D Laniado</name>
</author>
<author>
<name sortKey="Kaltenbrunner, A" uniqKey="Kaltenbrunner A">A Kaltenbrunner</name>
</author>
<author>
<name sortKey="Volkovich, Y" uniqKey="Volkovich Y">Y Volkovich</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Eom, Yh" uniqKey="Eom Y">YH Eom</name>
</author>
<author>
<name sortKey="Shepelyansky, Dl" uniqKey="Shepelyansky D">DL Shepelyansky</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Skiena, S" uniqKey="Skiena S">S Skiena</name>
</author>
<author>
<name sortKey="Ward, Cb" uniqKey="Ward C">CB Ward</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Samoilenko, A" uniqKey="Samoilenko A">A Samoilenko</name>
</author>
<author>
<name sortKey="Yasseri, T" uniqKey="Yasseri T">T Yasseri</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Hart, Mh" uniqKey="Hart M">MH Hart</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Ermann, L" uniqKey="Ermann L">L Ermann</name>
</author>
<author>
<name sortKey="Chepelianskii, Ad" uniqKey="Chepelianskii A">AD Chepelianskii</name>
</author>
<author>
<name sortKey="Shepelyansky, Dl" uniqKey="Shepelyansky D">DL Shepelyansky</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Ermann, L" uniqKey="Ermann L">L Ermann</name>
</author>
<author>
<name sortKey="Frahm, Km" uniqKey="Frahm K">KM Frahm</name>
</author>
<author>
<name sortKey="Shepelyansky, Dl" uniqKey="Shepelyansky D">DL Shepelyansky</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Langville, Am" uniqKey="Langville A">AM Langville</name>
</author>
<author>
<name sortKey="Meyer, Cd" uniqKey="Meyer C">CD Meyer</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Brin, S" uniqKey="Brin S">S Brin</name>
</author>
<author>
<name sortKey="Page, L" uniqKey="Page L">L Page</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Chen, P" uniqKey="Chen P">P Chen</name>
</author>
<author>
<name sortKey="Xie, H" uniqKey="Xie H">H Xie</name>
</author>
<author>
<name sortKey="Maslov, S" uniqKey="Maslov S">S Maslov</name>
</author>
<author>
<name sortKey="Redner, S" uniqKey="Redner S">S Redner</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kwak, H" uniqKey="Kwak H">H Kwak</name>
</author>
<author>
<name sortKey="Lee, C" uniqKey="Lee C">C Lee</name>
</author>
<author>
<name sortKey="Park, H" uniqKey="Park H">H Park</name>
</author>
<author>
<name sortKey="Moon, S" uniqKey="Moon S">S Moon</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Ermann, L" uniqKey="Ermann L">L Ermann</name>
</author>
<author>
<name sortKey="Shepelyansky, Dl" uniqKey="Shepelyansky D">DL Shepelyansky</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kandiah, V" uniqKey="Kandiah V">V Kandiah</name>
</author>
<author>
<name sortKey="Shepelyansky, Dl" uniqKey="Shepelyansky D">DL Shepelyansky</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Lu, L" uniqKey="Lu L">L Lü</name>
</author>
<author>
<name sortKey="Zhang, Y C" uniqKey="Zhang Y">Y-C Zhang</name>
</author>
<author>
<name sortKey="Yeung, Ch" uniqKey="Yeung C">CH Yeung</name>
</author>
<author>
<name sortKey="Zhou, T" uniqKey="Zhou T">T Zhou</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Carr, Eh" uniqKey="Carr E">EH Carr</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Hale, Sa" uniqKey="Hale S">SA Hale</name>
</author>
</analytic>
</biblStruct>
</listBibl>
</div1>
</back>
</TEI>
<pmc article-type="research-article">
<pmc-dir>properties open_access</pmc-dir>
<front>
<journal-meta>
<journal-id journal-id-type="nlm-ta">PLoS One</journal-id>
<journal-id journal-id-type="iso-abbrev">PLoS ONE</journal-id>
<journal-id journal-id-type="publisher-id">plos</journal-id>
<journal-id journal-id-type="pmc">plosone</journal-id>
<journal-title-group>
<journal-title>PLoS ONE</journal-title>
</journal-title-group>
<issn pub-type="epub">1932-6203</issn>
<publisher>
<publisher-name>Public Library of Science</publisher-name>
<publisher-loc>San Francisco, CA USA</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="pmid">25738291</article-id>
<article-id pub-id-type="pmc">4349893</article-id>
<article-id pub-id-type="publisher-id">PONE-D-14-23746</article-id>
<article-id pub-id-type="doi">10.1371/journal.pone.0114825</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Research Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Interactions of Cultures and Top People of Wikipedia from Ranking of 24 Language Editions</article-title>
<alt-title alt-title-type="running-head">Interactions of Cultures and Top People of Wikipedia</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Eom</surname>
<given-names>Young-Ho</given-names>
</name>
<xref ref-type="aff" rid="aff001">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Aragón</surname>
<given-names>Pablo</given-names>
</name>
<xref ref-type="aff" rid="aff002">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Laniado</surname>
<given-names>David</given-names>
</name>
<xref ref-type="aff" rid="aff002">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Kaltenbrunner</surname>
<given-names>Andreas</given-names>
</name>
<xref ref-type="aff" rid="aff002">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Vigna</surname>
<given-names>Sebastiano</given-names>
</name>
<xref ref-type="aff" rid="aff003">
<sup>3</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Shepelyansky</surname>
<given-names>Dima L.</given-names>
</name>
<xref ref-type="aff" rid="aff001">
<sup>1</sup>
</xref>
<xref ref-type="corresp" rid="cor001">*</xref>
</contrib>
</contrib-group>
<aff id="aff001">
<label>1</label>
<addr-line>Laboratoire de Physique Théorique du CNRS, IRSAMC, Université de Toulouse, UPS, F-31062 Toulouse, France</addr-line>
</aff>
<aff id="aff002">
<label>2</label>
<addr-line>Barcelona Media Foundation, Barcelona, Spain</addr-line>
</aff>
<aff id="aff003">
<label>3</label>
<addr-line>Dipartimento di Informatica, Università degli Studi di Milano, Milano, Italy</addr-line>
</aff>
<contrib-group>
<contrib contrib-type="editor">
<name>
<surname>Gao</surname>
<given-names>Zhong-Ke</given-names>
</name>
<role>Academic Editor</role>
<xref ref-type="aff" rid="edit1"></xref>
</contrib>
</contrib-group>
<aff id="edit1">
<addr-line>Tianjin University, CHINA</addr-line>
</aff>
<author-notes>
<fn fn-type="conflict" id="coi001">
<p>
<bold>Competing Interests: </bold>
The authors have declared that no competing interests exist.</p>
</fn>
<fn fn-type="con" id="contrib001">
<p>Conceived and designed the experiments: YHE DLS. Performed the experiments: YHE PA DL AK SV. Analyzed the data: YHE PA DLS. Contributed reagents/materials/analysis tools: YHE PA DL AK SV. Wrote the paper: YHE DLS.</p>
</fn>
<corresp id="cor001">* E-mail:
<email>dima@irsamc.ups-tlse.fr</email>
</corresp>
</author-notes>
<pub-date pub-type="collection">
<year>2015</year>
</pub-date>
<pub-date pub-type="epub">
<day>4</day>
<month>3</month>
<year>2015</year>
</pub-date>
<volume>10</volume>
<issue>3</issue>
<elocation-id>e0114825</elocation-id>
<history>
<date date-type="received">
<day>30</day>
<month>5</month>
<year>2014</year>
</date>
<date date-type="accepted">
<day>14</day>
<month>11</month>
<year>2014</year>
</date>
</history>
<permissions>
<copyright-year>2015</copyright-year>
<copyright-holder>Eom et al</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<license-p>This is an open access article distributed under the terms of the
<ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution License</ext-link>
, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited</license-p>
</license>
</permissions>
<self-uri content-type="pdf" xlink:type="simple" xlink:href="pone.0114825.pdf"></self-uri>
<abstract>
<p>Wikipedia is a huge global repository of human knowledge that can be leveraged to investigate interwinements between cultures. With this aim, we apply methods of Markov chains and Google matrix for the analysis of the hyperlink networks of 24 Wikipedia language editions, and rank all their articles by PageRank, 2DRank and CheiRank algorithms. Using automatic extraction of people names, we obtain the top 100 historical figures, for each edition and for each algorithm. We investigate their spatial, temporal, and gender distributions in dependence of their cultural origins. Our study demonstrates not only the existence of skewness with local figures, mainly recognized only in their own cultures, but also the existence of global historical figures appearing in a large number of editions. By determining the birth time and place of these persons, we perform an analysis of the evolution of such figures through 35 centuries of human history for each language, thus recovering interactions and entanglement of cultures over time. We also obtain the distributions of historical figures over world countries, highlighting geographical aspects of cross-cultural links. Considering historical figures who appear in multiple editions as interactions between cultures, we construct a network of cultures and identify the most influential cultures according to this network.</p>
</abstract>
<funding-group>
<funding-statement>This research is supported in part by the EC FET Open project “New tools and algorithms for directed network analysis” (NADINE number 288956). No additional external or internal funding was received for this study. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.</funding-statement>
</funding-group>
<counts>
<fig-count count="10"></fig-count>
<table-count count="6"></table-count>
<page-count count="27"></page-count>
</counts>
<custom-meta-group>
<custom-meta id="data-availability">
<meta-name>Data Availability</meta-name>
<meta-value>All used computational data are publicly available at
<ext-link ext-link-type="uri" xlink:href="http://dumps.wikimedia.org/">http://dumps.wikimedia.org/</ext-link>
. All the raw data necessary to replicate the findings and conclusion of this study are within the paper, supporting information files and this Wikimedia web site.</meta-value>
</custom-meta>
</custom-meta-group>
</article-meta>
<notes>
<title>Data Availability</title>
<p>All used computational data are publicly available at
<ext-link ext-link-type="uri" xlink:href="http://dumps.wikimedia.org/">http://dumps.wikimedia.org/</ext-link>
. All the raw data necessary to replicate the findings and conclusion of this study are within the paper, supporting information files and this Wikimedia web site.</p>
</notes>
</front>
<body>
<sec sec-type="intro" id="sec001">
<title>Introduction</title>
<p>The influence of digital media on collective opinions, social relationships, and information dynamics is growing significantly with the advances of information technology. On the other hand, understanding how collective opinions are reflected in digital media has crucial importance. Among such a medium, Wikipedia, the open, free, and online encyclopedia, has crucial importance since it is not only the largest global knowledge repository but also the biggest collaborative knowledge platform on the Web. Thanks to its huge size, broad coverage and ease of use, Wikipedia is currently one of the most widely used knowledge references. However, since its beginning, there have been constant concerns about the reliability of Wikipedia because of its openness. Although professional scholars may not be affected by a possible skewness or bias of Wikipedia, students and the public can be affected significantly [
<xref rid="pone.0114825.ref001" ref-type="bibr">1</xref>
,
<xref rid="pone.0114825.ref002" ref-type="bibr">2</xref>
]. Extensive studies have examined the reliability of contents [
<xref rid="pone.0114825.ref001" ref-type="bibr">1</xref>
<xref rid="pone.0114825.ref003" ref-type="bibr">3</xref>
], topic coverage [
<xref rid="pone.0114825.ref004" ref-type="bibr">4</xref>
], vandalism [
<xref rid="pone.0114825.ref005" ref-type="bibr">5</xref>
], and conflict [
<xref rid="pone.0114825.ref006" ref-type="bibr">6</xref>
<xref rid="pone.0114825.ref008" ref-type="bibr">8</xref>
] in Wikipedia.</p>
<p>Wikipedia is available in different language editions; 287 language editions are currently active. This indicates that the same topic can be described in hundreds of articles written by different language user groups. Since language is one of the primary elements of culture [
<xref rid="pone.0114825.ref009" ref-type="bibr">9</xref>
], collective cultural biases may be reflected on the contents and organization of each Wikipedia edition. Although Wikipedia adopts a “neutral point of view” policy for the description of contents, aiming to provide unbiased information to the public [
<xref rid="pone.0114825.ref010" ref-type="bibr">10</xref>
], it is natural that each language edition presents reality from a different angle. To investigate differences and relationships among different language editions, we develop mathematical and statistical methods which treat the huge amount of information in Wikipedia, excluding cultural preferences of the investigators.</p>
<p>Cultural bias or differences across Wikipedia editions have been investigated in previous research [
<xref rid="pone.0114825.ref011" ref-type="bibr">11</xref>
<xref rid="pone.0114825.ref017" ref-type="bibr">17</xref>
]. A special emphasis was devoted to persons described in Wikipedia articles [
<xref rid="pone.0114825.ref012" ref-type="bibr">12</xref>
] and their ranking [
<xref rid="pone.0114825.ref018" ref-type="bibr">18</xref>
,
<xref rid="pone.0114825.ref019" ref-type="bibr">19</xref>
]. Indeed, human knowledge, as well as Wikipedia itself, was created by people who are the main actors of its development. Thus it is rather natural to analyze a ranking of people according to the Wikipedia hyper-link network of citations between articles (see network data description below). A cross-cultural study of biographical articles was presented in [
<xref rid="pone.0114825.ref020" ref-type="bibr">20</xref>
], by building a network of interlinked biographies. Another approach was proposed recently in [
<xref rid="pone.0114825.ref021" ref-type="bibr">21</xref>
]: the difference in importance of historical figures across Wikipedia language editions is assessed on the basis of the global ranking of Wikipedia articles about persons. This study, motivated by the question “Is an important person in a given culture also important in other cultures?”, showed that there are strong entanglements and local biases of historical figures in Wikipedia. Indeed, the results of the study show that each Wikipedia edition favors persons belonging to the same culture (language), but also that there are cross-Wikipedia top ranked persons, who can be signs of entanglement between cultures. These cross-language historical figures can be used to generate inter-culture networks demonstrating interactions between cultures [
<xref rid="pone.0114825.ref021" ref-type="bibr">21</xref>
]. Such an approach provides us novel insights on cross-cultural differences across Wikipedia editions. However, in [
<xref rid="pone.0114825.ref021" ref-type="bibr">21</xref>
] only 9 Wikipedia editions, mainly languages spoken in European, have been considered. Thus a broader set of language editions is needed to offer a more complete view on a global scale.</p>
<p>We note that the analysis of persons’ importance via Wikipedia becomes more and more popular. This is well visible from the appearance of new recent studies for the English Wikipedia [
<xref rid="pone.0114825.ref022" ref-type="bibr">22</xref>
] and for multiple languages [
<xref rid="pone.0114825.ref023" ref-type="bibr">23</xref>
]. The analysis of coverage of researchers and academics via Wikipedia is reported in [
<xref rid="pone.0114825.ref024" ref-type="bibr">24</xref>
].</p>
<p>Here we investigate interactions and skewness of cultures with a broader perspective, using global ranking of articles about persons in 24 Wikipedia language editions. According to Wikipedia [
<xref rid="pone.0114825.ref025" ref-type="bibr">25</xref>
] these 24 languages cover 59 percent of world population. Moreover, according to Wikipedia [
<xref rid="pone.0114825.ref026" ref-type="bibr">26</xref>
], our selection of 24 language editions covers the 68 percent of the total number of 30.9 millions of Wikipedia articles in all 287 languages. These 24 editions also cover languages which played an important role in human history including Western, Asian and Arabic cultures.</p>
<p>On the basis of this data set we analyze spatial, temporal, and gender skewness in Wikipedia by analyzing birth place, birth date, and gender of the top ranked historical figures in Wikipedia. We identified overall Western, modern, and male skewness of important historical figures across Wikipedia editions, a tendency towards local preference (i.e. each Wikipedia edition favors historical figures born in countries speaking that edition’s language), and the existence of global historical figures who are highly ranked in most of Wikipedia editions. We also constructed networks of cultures based on cross-cultural historical figures to represent interactions between cultures according to Wikipedia.</p>
<p>To obtain a unified ranking of historical figures for all 24 Wikipedia editions, we introduce an average ranking which gives us the top 100 persons of human history. To assess the alignment of our ranking with previous work by historians, we compare it with the Hart’s list of the top 100 people who, according to him, most influenced human history [
<xref rid="pone.0114825.ref027" ref-type="bibr">27</xref>
]. We note that Hart “ranked these 100 persons in order of importance: that is, according to the total amount of influence that each of them had on human history and on the everyday lives of other human beings”.</p>
</sec>
<sec sec-type="materials|methods" id="sec002">
<title>Methods</title>
<p>In this research, we consider each Wikipedia edition as a network of articles. Each article corresponds to a node of the network and hyperlinks between articles correspond to links of the network. For a given network, we can define an adjacency matrix
<italic>A</italic>
<sub>
<italic>ij</italic>
</sub>
. If there is a link (one or more) from node (article)
<italic>j</italic>
to node (article)
<italic>i</italic>
then
<italic>A</italic>
<sub>
<italic>ij</italic>
</sub>
= 1, otherwise,
<italic>A</italic>
<sub>
<italic>ij</italic>
</sub>
= 0. The out-degree
<italic>k</italic>
<sub>
<italic>out</italic>
</sub>
(
<italic>j</italic>
) is the number of links from node
<italic>j</italic>
to other nodes and the in-degree
<italic>k</italic>
<sub>
<italic>in</italic>
</sub>
(
<italic>j</italic>
) is the number of links to node
<italic>j</italic>
from other nodes. The links between articles are considered only inside a given Wikipedia edition, there are no links counted between editions. Thus each language edition is analyzed independently from others by the Google matrix methods described below. The transcriptions of names from English to the other 23 selected languages are harvested from WikiData (
<ext-link ext-link-type="uri" xlink:href="http://dumps.wikimedia.org/wikidatawiki">http://dumps.wikimedia.org/wikidatawiki</ext-link>
) and not directly from the text of articles.</p>
<p>To rank the articles of a Wikipedia edition, we use two ranking algorithms based on the articles network structure. Detailed descriptions of these algorithms and their use for Wikipedia editions are given in [
<xref rid="pone.0114825.ref018" ref-type="bibr">18</xref>
,
<xref rid="pone.0114825.ref019" ref-type="bibr">19</xref>
,
<xref rid="pone.0114825.ref028" ref-type="bibr">28</xref>
,
<xref rid="pone.0114825.ref029" ref-type="bibr">29</xref>
]. The methods used here are described in [
<xref rid="pone.0114825.ref021" ref-type="bibr">21</xref>
]; we keep the same notations.</p>
<sec id="sec003">
<title>Google matrix</title>
<p>First we construct the matrix
<italic>S</italic>
<sub>
<italic>ij</italic>
</sub>
of Markov transitions by normalizing the sum of the elements in each column of
<italic>A</italic>
to unity (
<italic>S</italic>
<sub>
<italic>ij</italic>
</sub>
=
<italic>A</italic>
<sub>
<italic>ij</italic>
</sub>
/∑
<sub>
<italic>i</italic>
</sub>
<italic>A</italic>
<sub>
<italic>ij</italic>
</sub>
, ∑
<sub>
<italic>i</italic>
</sub>
<italic>S</italic>
<sub>
<italic>ij</italic>
</sub>
= 1) and replacing columns with zero elements by elements 1/
<italic>N</italic>
with
<italic>N</italic>
being the matrix size. Then the Google matrix is given by the relation
<italic>G</italic>
<sub>
<italic>ij</italic>
</sub>
=
<italic>αS</italic>
<sub>
<italic>ij</italic>
</sub>
+ (1 −
<italic>α</italic>
)/
<italic>N</italic>
, where
<italic>α</italic>
is the damping factor [
<xref rid="pone.0114825.ref030" ref-type="bibr">30</xref>
]. As in [
<xref rid="pone.0114825.ref021" ref-type="bibr">21</xref>
] we use the conventional value
<italic>α</italic>
= 0.85. It is known that the variation of
<italic>α</italic>
in a range 0.5 ≤
<italic>α</italic>
< 0.95 does not significantly affect the probability distribution of ranks discussed below (see e.g. [
<xref rid="pone.0114825.ref018" ref-type="bibr">18</xref>
,
<xref rid="pone.0114825.ref019" ref-type="bibr">19</xref>
,
<xref rid="pone.0114825.ref030" ref-type="bibr">30</xref>
]).</p>
</sec>
<sec id="sec004">
<title>PageRank algorithm</title>
<p>PageRank is a widely used algorithm to rank nodes in a directed network. It was originally introduced for Google web search engine to rank web pages of the World Wide Web based on the idea of academic citations [
<xref rid="pone.0114825.ref031" ref-type="bibr">31</xref>
]. Currently PageRank is used to rank nodes of network systems from scientific papers [
<xref rid="pone.0114825.ref032" ref-type="bibr">32</xref>
] to social network services [
<xref rid="pone.0114825.ref033" ref-type="bibr">33</xref>
], world trade [
<xref rid="pone.0114825.ref034" ref-type="bibr">34</xref>
] and biological systems [
<xref rid="pone.0114825.ref035" ref-type="bibr">35</xref>
]. Here we briefly outline the iteration method of PageRank computation. The PageRank vector
<italic>P</italic>
(
<italic>i</italic>
,
<italic>t</italic>
) of a node
<italic>i</italic>
at iteration
<italic>t</italic>
in a network with
<italic>N</italic>
nodes is given by
<disp-formula id="pone.0114825.e001">
<alternatives>
<graphic xlink:href="pone.0114825.e001.jpg" id="pone.0114825.e001g" mimetype="image" position="anchor" orientation="portrait"></graphic>
<mml:math id="M1">
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd columnalign="right">
<mml:mrow>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:munder>
<mml:mo></mml:mo>
<mml:mi>j</mml:mi>
</mml:munder>
<mml:msub>
<mml:mi>G</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>j</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>-</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>-</mml:mo>
<mml:mi>α</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>/</mml:mo>
<mml:mi>N</mml:mi>
<mml:mo>+</mml:mo>
<mml:mi>α</mml:mi>
<mml:munder>
<mml:mo></mml:mo>
<mml:mi>j</mml:mi>
</mml:munder>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>j</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>-</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>/</mml:mo>
<mml:msub>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mi>o</mml:mi>
<mml:mi>u</mml:mi>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>j</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mspace width="0.277778em"></mml:mspace>
<mml:mo>.</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:math>
</alternatives>
<label>(1)</label>
</disp-formula>
</p>
<p>The stationary state
<italic>P</italic>
(
<italic>i</italic>
) of
<italic>P</italic>
(
<italic>i</italic>
,
<italic>t</italic>
) is the PageRank of node
<italic>i</italic>
. More detailed information about the PageRank algorithm is described in [
<xref rid="pone.0114825.ref030" ref-type="bibr">30</xref>
]. Ordering all nodes by their decreasing probability
<italic>P</italic>
(
<italic>i</italic>
), we obtain the PageRank ranking index
<italic>K</italic>
(
<italic>i</italic>
). In qualitative terms, the PageRank probability of a node is proportional to the number of incoming links weighted according to their own probability. A random network surfer spends on a given node a time given on average by the PageRank probability.</p>
</sec>
<sec id="sec005">
<title>CheiRank algorithm</title>
<p>In a directed network, outgoing links can be as important as ingoing links. In this sense, as a complementary to PageRank, the CheiRank algorithm is defined and used in [
<xref rid="pone.0114825.ref018" ref-type="bibr">18</xref>
,
<xref rid="pone.0114825.ref028" ref-type="bibr">28</xref>
,
<xref rid="pone.0114825.ref036" ref-type="bibr">36</xref>
]. The CheiRank vector
<italic>P</italic>
*(
<italic>i</italic>
,
<italic>t</italic>
) of a node at iteration time
<italic>t</italic>
is given by
<disp-formula id="pone.0114825.e002">
<alternatives>
<graphic xlink:href="pone.0114825.e002.jpg" id="pone.0114825.e002g" mimetype="image" position="anchor" orientation="portrait"></graphic>
<mml:math id="M2">
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd columnalign="right">
<mml:mrow>
<mml:msup>
<mml:mi>P</mml:mi>
<mml:mo>*</mml:mo>
</mml:msup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>-</mml:mo>
<mml:mi>α</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>/</mml:mo>
<mml:mi>N</mml:mi>
<mml:mo>+</mml:mo>
<mml:mi>α</mml:mi>
<mml:munder>
<mml:mo>Σ</mml:mo>
<mml:mi>j</mml:mi>
</mml:munder>
<mml:msub>
<mml:mi>A</mml:mi>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msup>
<mml:mi>P</mml:mi>
<mml:mo>*</mml:mo>
</mml:msup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>j</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>/</mml:mo>
<mml:msub>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>j</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:math>
</alternatives>
<label>(2)</label>
</disp-formula>
</p>
<p>Same as the case of PageRank, we consider the stationary state
<italic>P</italic>
*(
<italic>i</italic>
) of
<italic>P</italic>
*(
<italic>i</italic>
,
<italic>t</italic>
) as the CheiRank probability of node
<italic>i</italic>
with
<italic>α</italic>
= 0.85. High CheiRank nodes in the network have large out-degree. Ordering all nodes by their decreasing probability
<italic>P</italic>
*(
<italic>i</italic>
), we obtain the CheiRank ranking index
<italic>K</italic>
*(
<italic>i</italic>
). The PageRank probability of an article is proportional to the number of incoming links, while the CheiRank probability of an article is proportional to the number of outgoing links. Thus a top PageRank article is important since other articles refer to it, while a top CheiRank article is highly connected because it refers to other articles.</p>
</sec>
<sec id="sec006">
<title>2DRank algorithm</title>
<p>PageRank and CheiRank algorithms focus only on in-degree and out-degree of nodes, respectively. The 2DRank algorithm considers both types of information simultaneously to rank nodes with a balanced point of view in a directed network. Briefly speaking, nodes with both high PageRank and CheiRank get high 2DRank ranking. Consider a node
<italic>i</italic>
which is
<italic>K</italic>
<sub>
<italic>i</italic>
</sub>
-th ranked by PageRank and
<italic>K</italic>
*
<sub>
<italic>i</italic>
</sub>
ranked by CheiRank. Then we can assign a secondary ranking
<inline-formula id="pone.0114825.e003">
<mml:math id="M3">
<mml:mrow>
<mml:msubsup>
<mml:mi>K</mml:mi>
<mml:mi>i</mml:mi>
<mml:mo></mml:mo>
</mml:msubsup>
<mml:mo>=</mml:mo>
<mml:mi>m</mml:mi>
<mml:mi>a</mml:mi>
<mml:mi>x</mml:mi>
<mml:mo stretchy="false">{</mml:mo>
<mml:msub>
<mml:mi>K</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:msup>
<mml:mi>K</mml:mi>
<mml:mo>*</mml:mo>
</mml:msup>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo stretchy="false">}</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>
to the node. If
<inline-formula id="pone.0114825.e004">
<mml:math id="M4">
<mml:mrow>
<mml:msubsup>
<mml:mi>K</mml:mi>
<mml:mi>i</mml:mi>
<mml:mo></mml:mo>
</mml:msubsup>
<mml:mo><</mml:mo>
<mml:msubsup>
<mml:mi>K</mml:mi>
<mml:mi>j</mml:mi>
<mml:mo></mml:mo>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>
, then node
<italic>j</italic>
has lower 2DRank and vice versa. A detailed illustration and description of this algorithm is given in [
<xref rid="pone.0114825.ref018" ref-type="bibr">18</xref>
].</p>
<p>We note that the studies reported in [
<xref rid="pone.0114825.ref021" ref-type="bibr">21</xref>
] show that the overlap between top CheiRank persons of different editions is rather small and due to that the statistical accuracy of this data is not sufficient for determining interactions between different cultures for the CheiRank list. Moreover, CheiRank, based on outgoing links only, selects mainly persons from such activity fields like sports and arts where the historical trace is not so important. Due to these reasons we restrict our study to PageRank and 2DRank. It can be also interesting to use other algorithms of ranking, e.g. LeaderRank [
<xref rid="pone.0114825.ref037" ref-type="bibr">37</xref>
], but here we restrict ourselves to the methods which we already tested, leaving investigation of other raking methods for further studies.</p>
</sec>
</sec>
<sec id="sec007">
<title>Data preparation</title>
<p>We consider 24 different language editions of Wikipedia: English (EN), Dutch (NL), German (DE), French (FR), Spanish (ES), Italian (IT), Portuguese (PT), Greek (EL), Danish (DA), Swedish (SV), Polish (PL), Hungarian (HU), Russian (RU), Hebrew (HE), Turkish (TR), Arabic (AR), Persian (FA), Hindi (HI), Malaysian (MS), Thai (TH), Vietnamese (VI), Chinese (ZH), Korean (KO), and Japanese (JA). The Wikipedia data were collected in middle February 2013. The overview summary of each Wikipedia is represented in
<xref ref-type="table" rid="pone.0114825.t001">Table 1</xref>
.</p>
<table-wrap id="pone.0114825.t001" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0114825.t001</object-id>
<label>Table 1</label>
<caption>
<title>Wikipedia hyperlink networks from the 24 considered language editions.</title>
<p>Here
<italic>N</italic>
<sub>
<italic>a</italic>
</sub>
is the number of articles. Wikipedia data were collected in middle February 2013.</p>
</caption>
<alternatives>
<graphic id="pone.0114825.t001g" xlink:href="pone.0114825.t001"></graphic>
<table frame="box" rules="all" border="0">
<colgroup span="1">
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
</colgroup>
<thead>
<tr>
<th align="left" rowspan="1" colspan="1">
<bold>Edition</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>Language</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>
<italic>N</italic>
<sub>
<italic>a</italic>
</sub>
</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>Edition</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>Language</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>
<italic>N</italic>
<sub>
<italic>a</italic>
</sub>
</bold>
</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" rowspan="1" colspan="1">EN</td>
<td align="left" rowspan="1" colspan="1">English</td>
<td align="left" rowspan="1" colspan="1">4212493</td>
<td align="left" rowspan="1" colspan="1">RU</td>
<td align="left" rowspan="1" colspan="1">Russian</td>
<td align="left" rowspan="1" colspan="1">966284</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">NL</td>
<td align="left" rowspan="1" colspan="1">Dutch</td>
<td align="left" rowspan="1" colspan="1">1144615</td>
<td align="left" rowspan="1" colspan="1">HE</td>
<td align="left" rowspan="1" colspan="1">Hebrew</td>
<td align="left" rowspan="1" colspan="1">144959</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">DE</td>
<td align="left" rowspan="1" colspan="1">German</td>
<td align="left" rowspan="1" colspan="1">1532978</td>
<td align="left" rowspan="1" colspan="1">TR</td>
<td align="left" rowspan="1" colspan="1">Turkish</td>
<td align="left" rowspan="1" colspan="1">206311</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">FR</td>
<td align="left" rowspan="1" colspan="1">French</td>
<td align="left" rowspan="1" colspan="1">1352825</td>
<td align="left" rowspan="1" colspan="1">AR</td>
<td align="left" rowspan="1" colspan="1">Arabic</td>
<td align="left" rowspan="1" colspan="1">203328</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">ES</td>
<td align="left" rowspan="1" colspan="1">Spanish</td>
<td align="left" rowspan="1" colspan="1">974025</td>
<td align="left" rowspan="1" colspan="1">FA</td>
<td align="left" rowspan="1" colspan="1">Persian</td>
<td align="left" rowspan="1" colspan="1">295696</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">IT</td>
<td align="left" rowspan="1" colspan="1">Italian</td>
<td align="left" rowspan="1" colspan="1">1017953</td>
<td align="left" rowspan="1" colspan="1">HI</td>
<td align="left" rowspan="1" colspan="1">Hindi</td>
<td align="left" rowspan="1" colspan="1">96869</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">PT</td>
<td align="left" rowspan="1" colspan="1">Portuguese</td>
<td align="left" rowspan="1" colspan="1">758227</td>
<td align="left" rowspan="1" colspan="1">MS</td>
<td align="left" rowspan="1" colspan="1">Malaysian</td>
<td align="left" rowspan="1" colspan="1">180886</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">EL</td>
<td align="left" rowspan="1" colspan="1">Greek</td>
<td align="left" rowspan="1" colspan="1">82563</td>
<td align="left" rowspan="1" colspan="1">TH</td>
<td align="left" rowspan="1" colspan="1">Thai</td>
<td align="left" rowspan="1" colspan="1">78953</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">DA</td>
<td align="left" rowspan="1" colspan="1">Danish</td>
<td align="left" rowspan="1" colspan="1">175228</td>
<td align="left" rowspan="1" colspan="1">VI</td>
<td align="left" rowspan="1" colspan="1">Vietnamese</td>
<td align="left" rowspan="1" colspan="1">594089</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">SV</td>
<td align="left" rowspan="1" colspan="1">Swedish</td>
<td align="left" rowspan="1" colspan="1">780872</td>
<td align="left" rowspan="1" colspan="1">ZH</td>
<td align="left" rowspan="1" colspan="1">Chinese</td>
<td align="left" rowspan="1" colspan="1">663485</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">PL</td>
<td align="left" rowspan="1" colspan="1">Polish</td>
<td align="left" rowspan="1" colspan="1">949153</td>
<td align="left" rowspan="1" colspan="1">KO</td>
<td align="left" rowspan="1" colspan="1">Korean</td>
<td align="left" rowspan="1" colspan="1">231959</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">HU</td>
<td align="left" rowspan="1" colspan="1">Hungarian</td>
<td align="left" rowspan="1" colspan="1">235212</td>
<td align="left" rowspan="1" colspan="1">JA</td>
<td align="left" rowspan="1" colspan="1">Japanese</td>
<td align="left" rowspan="1" colspan="1">852087</td>
</tr>
</tbody>
</table>
</alternatives>
</table-wrap>
<p>We understand that our selection of Wikipedia editions does not represent a complete view of all the 287 languages of Wikipedia editions. However, this selection covers most of the largest language editions and allows us to perform quantitative and statistical analysis of important historical figures. Among the 20 largest editions (counted by their size, taken at the middle of 2014) we have not considered the following editions: Waray-Waray, Cebuano, Ukrainian, Catalan, Bokmal-Riksmal, and Finish.</p>
<p>First we ranked all the articles in a given Wikipedia edition by PageRank and 2DRank algorithms, and selected biographical articles about historical figures. To identify biographical articles, we considered all articles belonging to “Category:living people”, or to “Category:Deaths by year” or “Category:Birth by year” or their subcategories in the English Wikipedia. In this way, we obtained a list of about 1.1 million biographical articles. We identified birth place, birth date, and gender of each selected historical figure based on DBpedia [
<xref rid="pone.0114825.ref038" ref-type="bibr">38</xref>
] or a manual inspection of the corresponding Wikipedia biographical article, when for the considered historical figure no DBpedia data were available. We then started from the list of persons with their biographical article’s title on the English Wikipedia, and found the corresponding titles in other language editions using the inter-language links provided by WikiData. Using the corresponding articles, identified by the inter-languages links in different language editions, we extracted the top 100 persons from the rankings of all Wikipedia articles of each edition. At the end, for each Wikipedia edition and for each ranking algorithm, we have information about the top 100 historical figures with their corresponding name in the English Wikipedia, their birth place and date, and their gender. All 48 lists of the top 100 historical figures in PageRank and 2DRank for the 24 Wikipedia editions and for the two ranking algorithms are represented in [
<xref rid="pone.0114825.ref039" ref-type="bibr">39</xref>
] and Supporting Information (SI). The original network data for each edition are available at [
<xref rid="pone.0114825.ref039" ref-type="bibr">39</xref>
]. The automatic extraction of persons from PageRank and 2DRank listings of articles of each edition is performed by using the above whole list of person names in all 24 editions. This method implies a significantly higher recall compared to the manual selection of persons from the ranking list of articles for each edition used in [
<xref rid="pone.0114825.ref021" ref-type="bibr">21</xref>
].</p>
<p>We attribute each of the 100 historical figures to a birth place at the country level (actual country borders), to a birth date in year, to a gender, and to a cultural group. Historical figures are assigned to the countries currently at the locations where they were born. The cultural group of historical figures is assigned by the most spoken language of their birth place at the current country level. For example, if someone was born in “Constantinople” in the ancient Roman era, since the place is now Istanbul, Turkey, we assign her/his birth place as “Turkey” and since Turkish is the most spoken language in Turkey, we assign this person to the Turkish cultural group. If the birth country does not belong to any of the 24 cultures (languages) which we consider, we assign WR (world) as the culture of this person. We would like to point out that although a culture can not be defined only by language, we think that language is a suitable first-approximation of culture. All lists of top 100 historical figures with their birth place, birth date, gender, and cultural group for each Wikipedia edition and for each ranking algorithm are represented in [
<xref rid="pone.0114825.ref039" ref-type="bibr">39</xref>
]. A part of this information is also reported in SI.</p>
<p>To apply PageRank and 2DRank methods, we consider each edition as the network of articles of the given edition connected by hyper-links among the articles (see the details of ranking algorithms in Section
<xref ref-type="sec" rid="sec002">Methods</xref>
). The full list of considered Wikipedia language editions is given in
<xref ref-type="table" rid="pone.0114825.t001">Table 1</xref>
.
<xref ref-type="table" rid="pone.0114825.t002">Table 2</xref>
represents the top 10 historical figures by PageRank and 2DRank in the English Wikipedia. Roughly speaking, top PageRank articles imply highly cited articles in Wikipedia and top 2DRank articles imply articles which are both highly cited and highly citing in Wikipedia. In total, we identified 2400 top historical figures for each ranking algorithm. However, since some historical figures such as
<italic>Jesus</italic>
,
<italic>Aristotle</italic>
, or
<italic>Napoleon</italic>
appear in multiple Wikipedia editions, we have 1045 unique top PageRank historical figures and 1616 unique top 2Drank historical figures.</p>
<table-wrap id="pone.0114825.t002" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0114825.t002</object-id>
<label>Table 2</label>
<caption>
<title>List of top persons by PageRank and 2DRank for the English Wikipedia.</title>
<p>All names are represented by article titles in the English Wikipedia.</p>
</caption>
<alternatives>
<graphic id="pone.0114825.t002g" xlink:href="pone.0114825.t002"></graphic>
<table frame="box" rules="all" border="0">
<colgroup span="1">
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
</colgroup>
<thead>
<tr>
<th align="left" rowspan="1" colspan="1">
<bold>Rank</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>PageRank persons</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>2DRank persons</bold>
</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" rowspan="1" colspan="1">1st</td>
<td align="left" rowspan="1" colspan="1">Napoleon</td>
<td align="left" rowspan="1" colspan="1">Frank Sinatra</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">2nd</td>
<td align="left" rowspan="1" colspan="1">Barack Obama</td>
<td align="left" rowspan="1" colspan="1">Michael Jackson</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">3rd</td>
<td align="left" rowspan="1" colspan="1">Carl Linnaeus</td>
<td align="left" rowspan="1" colspan="1">Pope Pius XII</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">4th</td>
<td align="left" rowspan="1" colspan="1">Elizabeth II</td>
<td align="left" rowspan="1" colspan="1">Elton John</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">5th</td>
<td align="left" rowspan="1" colspan="1">George W. Bush</td>
<td align="left" rowspan="1" colspan="1">Elizabeth II</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">6th</td>
<td align="left" rowspan="1" colspan="1">Jesus</td>
<td align="left" rowspan="1" colspan="1">Pope John Paul II</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">7th</td>
<td align="left" rowspan="1" colspan="1">Aristotle</td>
<td align="left" rowspan="1" colspan="1">Beyoncé Knowles</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">8th</td>
<td align="left" rowspan="1" colspan="1">William Shakespeare</td>
<td align="left" rowspan="1" colspan="1">Jorge Luis Borges</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">9th</td>
<td align="left" rowspan="1" colspan="1">Adolf Hitler</td>
<td align="left" rowspan="1" colspan="1">Mariah Carey</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">10th</td>
<td align="left" rowspan="1" colspan="1">Franklin D. Roosevelt</td>
<td align="left" rowspan="1" colspan="1">Vladimir Putin</td>
</tr>
</tbody>
</table>
</alternatives>
</table-wrap>
<p>We should note that the extraction of persons and their information from a Wikipedia edition is not an easy task even for the English edition, and much more complicated for certain other language editions. Therefore, the above automatic method based on 1.1 million English names and their corresponding names seems to us to be the most adequate approach. Of course, it will miss people who do not have a biographical article on the English Wikipedia. Cross-checking investigation is done for Korean and Russian Wikipedia, which are native languages for two authors, by manually selecting top 100 persons from top lists of all articles ordered by PageRank and 2DRank in both Wikipedia editions. We find that our automatic search misses on average only 2 persons from 100 top persons for these two editions (the missed names are given in SI). The errors appear due to transcription changes of names or missing cases in our name-database based on English Wikipedia. For Western languages the number of errors is presumably reduced since transcription remains close to English. Based on the manual inspection for the Korean and the Russian Wikipedia, we expect that the errors of our automatic recovery of the top people from the whole articles ordered by PageRank and 2DRank are on a level of two percent.</p>
<p>We also note that our study is in compliance with Wikipedia’s Terms and Conditions.</p>
</sec>
<sec sec-type="results" id="sec008">
<title>Results</title>
<p>Above we described the methods used for the extraction of the top 100 persons in the ranking list of each edition. Below we present the obtained results describing the spatial, temporal and gender distributions of top ranked historical figures. We also determine the global and local persons and obtain the network of cultures based on the ranking of persons from a given language by other language editions of Wikipedia.</p>
<sec id="sec009">
<title>Spatial distribution</title>
<p>The birth places of historical figures are attributed to the country containing their geographical location of birth according to the present geographical territories of all world countries. The list of countries appeared for the top 100 persons in all editions is given in
<xref ref-type="table" rid="pone.0114825.t003">Table 3</xref>
. We also attribute each country to one of the 24 languages of the considered editions. This attribution is done according to the language spoken by the largest part of population in the given country. Thus e.g. Belgium is attributed to Dutch (NL) since the majority of the population speaks Dutch. If the main language of a country is not among our 24 languages, then this country is attributed to an additional section WR corresponding to the remaining world (e.g. Ukraine, Norway are attributed to WR). If the birth place of a person is not known, then it is also attributed to WR. The choice of attribution of a person to a given country in its current geographic territory, and as a result to a certain language, may have some fluctuations due to historical variations of country borders (e.g. Immanuel Kant was born in the current territory of Russia and hence is attributed to Russian language). However, the number of such cases is small, being on a level of 3.5 percent (see Section “
<xref ref-type="sec" rid="sec013">Network of cultures</xref>
” below). We think that the way in which a link between person, language and country is fixed by the birth place avoids much larger ambiguity of attribution of a person according to the native language which is not so easy to fix in an automatic manner.</p>
<table-wrap id="pone.0114825.t003" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0114825.t003</object-id>
<label>Table 3</label>
<caption>
<title>List of country code (CC), countries as birth places of historical figures, and language code (LC) for each country.</title>
<p>LC is determined by the most spoken language in the given country. Country codes are based on country codes of Internet top-level domains and language codes are based on language edition codes of Wikipedia; WR represents all languages other than the considered 24 languages.</p>
</caption>
<alternatives>
<graphic id="pone.0114825.t003g" xlink:href="pone.0114825.t003"></graphic>
<table frame="box" rules="all" border="0">
<colgroup span="1">
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
</colgroup>
<thead>
<tr>
<th align="left" rowspan="1" colspan="1">
<bold>CC</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>Country</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>LC</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>CC</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>Country</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>LC</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>CC</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>Country</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>LC</bold>
</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" rowspan="1" colspan="1">AE</td>
<td align="left" rowspan="1" colspan="1">United Arab Emirates</td>
<td align="left" rowspan="1" colspan="1">AR</td>
<td align="left" rowspan="1" colspan="1">AF</td>
<td align="left" rowspan="1" colspan="1">Afghanistan</td>
<td align="left" rowspan="1" colspan="1">FA</td>
<td align="left" rowspan="1" colspan="1">AL</td>
<td align="left" rowspan="1" colspan="1">Albania</td>
<td align="left" rowspan="1" colspan="1">WR</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">AR</td>
<td align="left" rowspan="1" colspan="1">Argentina</td>
<td align="left" rowspan="1" colspan="1">ES</td>
<td align="left" rowspan="1" colspan="1">AT</td>
<td align="left" rowspan="1" colspan="1">Austria</td>
<td align="left" rowspan="1" colspan="1">DE</td>
<td align="left" rowspan="1" colspan="1">AU</td>
<td align="left" rowspan="1" colspan="1">Australia</td>
<td align="left" rowspan="1" colspan="1">EN</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">AZ</td>
<td align="left" rowspan="1" colspan="1">Azerbaijan</td>
<td align="left" rowspan="1" colspan="1">TR</td>
<td align="left" rowspan="1" colspan="1">BE</td>
<td align="left" rowspan="1" colspan="1">Belgium</td>
<td align="left" rowspan="1" colspan="1">NL</td>
<td align="left" rowspan="1" colspan="1">BG</td>
<td align="left" rowspan="1" colspan="1">Bulgaria</td>
<td align="left" rowspan="1" colspan="1">WR</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">BR</td>
<td align="left" rowspan="1" colspan="1">Brazil</td>
<td align="left" rowspan="1" colspan="1">PT</td>
<td align="left" rowspan="1" colspan="1">BS</td>
<td align="left" rowspan="1" colspan="1">Bahamas</td>
<td align="left" rowspan="1" colspan="1">EN</td>
<td align="left" rowspan="1" colspan="1">BY</td>
<td align="left" rowspan="1" colspan="1">Belarus</td>
<td align="left" rowspan="1" colspan="1">RU</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">CA</td>
<td align="left" rowspan="1" colspan="1">Canada</td>
<td align="left" rowspan="1" colspan="1">EN</td>
<td align="left" rowspan="1" colspan="1">CH</td>
<td align="left" rowspan="1" colspan="1">Switzerland</td>
<td align="left" rowspan="1" colspan="1">DE</td>
<td align="left" rowspan="1" colspan="1">CL</td>
<td align="left" rowspan="1" colspan="1">Chile</td>
<td align="left" rowspan="1" colspan="1">ES</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">CN</td>
<td align="left" rowspan="1" colspan="1">China</td>
<td align="left" rowspan="1" colspan="1">ZH</td>
<td align="left" rowspan="1" colspan="1">CO</td>
<td align="left" rowspan="1" colspan="1">Colombia</td>
<td align="left" rowspan="1" colspan="1">ES</td>
<td align="left" rowspan="1" colspan="1">CU</td>
<td align="left" rowspan="1" colspan="1">Cuba</td>
<td align="left" rowspan="1" colspan="1">ES</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">CY</td>
<td align="left" rowspan="1" colspan="1">Cyprus</td>
<td align="left" rowspan="1" colspan="1">EL</td>
<td align="left" rowspan="1" colspan="1">CZ</td>
<td align="left" rowspan="1" colspan="1">Czech Rep.</td>
<td align="left" rowspan="1" colspan="1">WR</td>
<td align="left" rowspan="1" colspan="1">DE</td>
<td align="left" rowspan="1" colspan="1">Germany</td>
<td align="left" rowspan="1" colspan="1">DE</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">DK</td>
<td align="left" rowspan="1" colspan="1">Denmark</td>
<td align="left" rowspan="1" colspan="1">DA</td>
<td align="left" rowspan="1" colspan="1">DZ</td>
<td align="left" rowspan="1" colspan="1">Algeria</td>
<td align="left" rowspan="1" colspan="1">AR</td>
<td align="left" rowspan="1" colspan="1">EG</td>
<td align="left" rowspan="1" colspan="1">Egypt</td>
<td align="left" rowspan="1" colspan="1">AR</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">ES</td>
<td align="left" rowspan="1" colspan="1">Spain</td>
<td align="left" rowspan="1" colspan="1">ES</td>
<td align="left" rowspan="1" colspan="1">FI</td>
<td align="left" rowspan="1" colspan="1">Finland</td>
<td align="left" rowspan="1" colspan="1">WR</td>
<td align="left" rowspan="1" colspan="1">FR</td>
<td align="left" rowspan="1" colspan="1">France</td>
<td align="left" rowspan="1" colspan="1">FR</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">GE</td>
<td align="left" rowspan="1" colspan="1">Georgia</td>
<td align="left" rowspan="1" colspan="1">WR</td>
<td align="left" rowspan="1" colspan="1">GR</td>
<td align="left" rowspan="1" colspan="1">Greece</td>
<td align="left" rowspan="1" colspan="1">EL</td>
<td align="left" rowspan="1" colspan="1">HK</td>
<td align="left" rowspan="1" colspan="1">Hong Kong</td>
<td align="left" rowspan="1" colspan="1">ZH</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">HR</td>
<td align="left" rowspan="1" colspan="1">Croatia</td>
<td align="left" rowspan="1" colspan="1">WR</td>
<td align="left" rowspan="1" colspan="1">HU</td>
<td align="left" rowspan="1" colspan="1">Hungary</td>
<td align="left" rowspan="1" colspan="1">HU</td>
<td align="left" rowspan="1" colspan="1">ID</td>
<td align="left" rowspan="1" colspan="1">Indonesia</td>
<td align="left" rowspan="1" colspan="1">WR</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">IE</td>
<td align="left" rowspan="1" colspan="1">Ireland</td>
<td align="left" rowspan="1" colspan="1">EN</td>
<td align="left" rowspan="1" colspan="1">IL</td>
<td align="left" rowspan="1" colspan="1">Israel</td>
<td align="left" rowspan="1" colspan="1">HE</td>
<td align="left" rowspan="1" colspan="1">IN</td>
<td align="left" rowspan="1" colspan="1">India</td>
<td align="left" rowspan="1" colspan="1">HI</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">IQ</td>
<td align="left" rowspan="1" colspan="1">Iraq</td>
<td align="left" rowspan="1" colspan="1">AR</td>
<td align="left" rowspan="1" colspan="1">IR</td>
<td align="left" rowspan="1" colspan="1">Iran</td>
<td align="left" rowspan="1" colspan="1">FA</td>
<td align="left" rowspan="1" colspan="1">IS</td>
<td align="left" rowspan="1" colspan="1">Iceland</td>
<td align="left" rowspan="1" colspan="1">WR</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">IT</td>
<td align="left" rowspan="1" colspan="1">Italy</td>
<td align="left" rowspan="1" colspan="1">IT</td>
<td align="left" rowspan="1" colspan="1">JP</td>
<td align="left" rowspan="1" colspan="1">Japan</td>
<td align="left" rowspan="1" colspan="1">JA</td>
<td align="left" rowspan="1" colspan="1">KE</td>
<td align="left" rowspan="1" colspan="1">Kenya</td>
<td align="left" rowspan="1" colspan="1">EN</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">KG</td>
<td align="left" rowspan="1" colspan="1">Kyrgyzstan</td>
<td align="left" rowspan="1" colspan="1">WR</td>
<td align="left" rowspan="1" colspan="1">KH</td>
<td align="left" rowspan="1" colspan="1">Cambodia</td>
<td align="left" rowspan="1" colspan="1">WR</td>
<td align="left" rowspan="1" colspan="1">KO</td>
<td align="left" rowspan="1" colspan="1">S. Korea</td>
<td align="left" rowspan="1" colspan="1">KO</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">KP</td>
<td align="left" rowspan="1" colspan="1">N. Korea</td>
<td align="left" rowspan="1" colspan="1">KO</td>
<td align="left" rowspan="1" colspan="1">KW</td>
<td align="left" rowspan="1" colspan="1">Kuwait</td>
<td align="left" rowspan="1" colspan="1">AR</td>
<td align="left" rowspan="1" colspan="1">KZ</td>
<td align="left" rowspan="1" colspan="1">Kazakhstan</td>
<td align="left" rowspan="1" colspan="1">WR</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">LB</td>
<td align="left" rowspan="1" colspan="1">Lebanon</td>
<td align="left" rowspan="1" colspan="1">AR</td>
<td align="left" rowspan="1" colspan="1">LT</td>
<td align="left" rowspan="1" colspan="1">Lithuania</td>
<td align="left" rowspan="1" colspan="1">WR</td>
<td align="left" rowspan="1" colspan="1">LV</td>
<td align="left" rowspan="1" colspan="1">Latvia</td>
<td align="left" rowspan="1" colspan="1">WR</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">LY</td>
<td align="left" rowspan="1" colspan="1">Libya</td>
<td align="left" rowspan="1" colspan="1">AR</td>
<td align="left" rowspan="1" colspan="1">MK</td>
<td align="left" rowspan="1" colspan="1">Macedonia</td>
<td align="left" rowspan="1" colspan="1">WR</td>
<td align="left" rowspan="1" colspan="1">MM</td>
<td align="left" rowspan="1" colspan="1">Myanmar</td>
<td align="left" rowspan="1" colspan="1">WR</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">MN</td>
<td align="left" rowspan="1" colspan="1">Mongolia</td>
<td align="left" rowspan="1" colspan="1">WR</td>
<td align="left" rowspan="1" colspan="1">MX</td>
<td align="left" rowspan="1" colspan="1">Mexico</td>
<td align="left" rowspan="1" colspan="1">ES</td>
<td align="left" rowspan="1" colspan="1">MY</td>
<td align="left" rowspan="1" colspan="1">Malaysia</td>
<td align="left" rowspan="1" colspan="1">MS</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">NL</td>
<td align="left" rowspan="1" colspan="1">Netherlands</td>
<td align="left" rowspan="1" colspan="1">NL</td>
<td align="left" rowspan="1" colspan="1">NO</td>
<td align="left" rowspan="1" colspan="1">Norway</td>
<td align="left" rowspan="1" colspan="1">WR</td>
<td align="left" rowspan="1" colspan="1">NP</td>
<td align="left" rowspan="1" colspan="1">Nepal</td>
<td align="left" rowspan="1" colspan="1">WR</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">NZ</td>
<td align="left" rowspan="1" colspan="1">New Zealand</td>
<td align="left" rowspan="1" colspan="1">EN</td>
<td align="left" rowspan="1" colspan="1">OM</td>
<td align="left" rowspan="1" colspan="1">Oman</td>
<td align="left" rowspan="1" colspan="1">AR</td>
<td align="left" rowspan="1" colspan="1">PA</td>
<td align="left" rowspan="1" colspan="1">Panama</td>
<td align="left" rowspan="1" colspan="1">ES</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">PE</td>
<td align="left" rowspan="1" colspan="1">Peru</td>
<td align="left" rowspan="1" colspan="1">ES</td>
<td align="left" rowspan="1" colspan="1">PK</td>
<td align="left" rowspan="1" colspan="1">Pakistan</td>
<td align="left" rowspan="1" colspan="1">HI</td>
<td align="left" rowspan="1" colspan="1">PL</td>
<td align="left" rowspan="1" colspan="1">Poland</td>
<td align="left" rowspan="1" colspan="1">PL</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">PS</td>
<td align="left" rowspan="1" colspan="1">State of Palestine</td>
<td align="left" rowspan="1" colspan="1">AR</td>
<td align="left" rowspan="1" colspan="1">PT</td>
<td align="left" rowspan="1" colspan="1">Portugal</td>
<td align="left" rowspan="1" colspan="1">PT</td>
<td align="left" rowspan="1" colspan="1">RO</td>
<td align="left" rowspan="1" colspan="1">Romania</td>
<td align="left" rowspan="1" colspan="1">WR</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">RS</td>
<td align="left" rowspan="1" colspan="1">Serbia</td>
<td align="left" rowspan="1" colspan="1">WR</td>
<td align="left" rowspan="1" colspan="1">RU</td>
<td align="left" rowspan="1" colspan="1">Russia</td>
<td align="left" rowspan="1" colspan="1">RU</td>
<td align="left" rowspan="1" colspan="1">SA</td>
<td align="left" rowspan="1" colspan="1">Saudi Arabia</td>
<td align="left" rowspan="1" colspan="1">AR</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">SD</td>
<td align="left" rowspan="1" colspan="1">Sudan</td>
<td align="left" rowspan="1" colspan="1">AR</td>
<td align="left" rowspan="1" colspan="1">SE</td>
<td align="left" rowspan="1" colspan="1">Sweden</td>
<td align="left" rowspan="1" colspan="1">SV</td>
<td align="left" rowspan="1" colspan="1">SG</td>
<td align="left" rowspan="1" colspan="1">Singapore</td>
<td align="left" rowspan="1" colspan="1">ZH</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">SI</td>
<td align="left" rowspan="1" colspan="1">Slovenia</td>
<td align="left" rowspan="1" colspan="1">WR</td>
<td align="left" rowspan="1" colspan="1">SK</td>
<td align="left" rowspan="1" colspan="1">Slovakia</td>
<td align="left" rowspan="1" colspan="1">WR</td>
<td align="left" rowspan="1" colspan="1">SR</td>
<td align="left" rowspan="1" colspan="1">Suriname</td>
<td align="left" rowspan="1" colspan="1">NL</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">SY</td>
<td align="left" rowspan="1" colspan="1">Syria</td>
<td align="left" rowspan="1" colspan="1">AR</td>
<td align="left" rowspan="1" colspan="1">TH</td>
<td align="left" rowspan="1" colspan="1">Thailand</td>
<td align="left" rowspan="1" colspan="1">TH</td>
<td align="left" rowspan="1" colspan="1">TJ</td>
<td align="left" rowspan="1" colspan="1">Tajikistan</td>
<td align="left" rowspan="1" colspan="1">WR</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">TN</td>
<td align="left" rowspan="1" colspan="1">Tunisia</td>
<td align="left" rowspan="1" colspan="1">AR</td>
<td align="left" rowspan="1" colspan="1">TR</td>
<td align="left" rowspan="1" colspan="1">Turkey</td>
<td align="left" rowspan="1" colspan="1">TR</td>
<td align="left" rowspan="1" colspan="1">TW</td>
<td align="left" rowspan="1" colspan="1">Taiwan</td>
<td align="left" rowspan="1" colspan="1">ZH</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">TZ</td>
<td align="left" rowspan="1" colspan="1">Tanzania</td>
<td align="left" rowspan="1" colspan="1">WR</td>
<td align="left" rowspan="1" colspan="1">UA</td>
<td align="left" rowspan="1" colspan="1">Ukraine</td>
<td align="left" rowspan="1" colspan="1">WR</td>
<td align="left" rowspan="1" colspan="1">UK</td>
<td align="left" rowspan="1" colspan="1">United Kingdom</td>
<td align="left" rowspan="1" colspan="1">EN</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">US</td>
<td align="left" rowspan="1" colspan="1">United States</td>
<td align="left" rowspan="1" colspan="1">EN</td>
<td align="left" rowspan="1" colspan="1">UZ</td>
<td align="left" rowspan="1" colspan="1">Uzbekistan</td>
<td align="left" rowspan="1" colspan="1">WR</td>
<td align="left" rowspan="1" colspan="1">VE</td>
<td align="left" rowspan="1" colspan="1">Venezuela</td>
<td align="left" rowspan="1" colspan="1">ES</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">VN</td>
<td align="left" rowspan="1" colspan="1">Vietnam</td>
<td align="left" rowspan="1" colspan="1">VI</td>
<td align="left" rowspan="1" colspan="1">XX</td>
<td align="left" rowspan="1" colspan="1">Unknown</td>
<td align="left" rowspan="1" colspan="1">WR</td>
<td align="left" rowspan="1" colspan="1">YE</td>
<td align="left" rowspan="1" colspan="1">Yemen</td>
<td align="left" rowspan="1" colspan="1">AR</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">ZA</td>
<td align="left" rowspan="1" colspan="1">South Africa</td>
<td align="left" rowspan="1" colspan="1">WR</td>
<td align="left" rowspan="1" colspan="1"></td>
<td align="left" rowspan="1" colspan="1"></td>
<td align="left" rowspan="1" colspan="1"></td>
<td align="left" rowspan="1" colspan="1"></td>
<td align="left" rowspan="1" colspan="1"></td>
<td align="left" rowspan="1" colspan="1"></td>
</tr>
</tbody>
</table>
</alternatives>
</table-wrap>
<p>The obtained spatial distribution of historical figures of Wikipedia over countries is shown in
<xref ref-type="fig" rid="pone.0114825.g001">Fig. 1</xref>
. This averaged distribution gives the average number of top 100 persons born in a specific country as birth place, with averaging done over our 24 Wikipedia editions. Thus an average over the 24 editions gives for Germany (DE) approximately 9.7 persons in the top 100 of PageRank, being at the first position, followed by USA with approximately 9.5 persons. For 2DRank we have USA at the first position with an average of 9.8 persons and Germany at the second with an average of 8.0 persons.</p>
<fig id="pone.0114825.g001" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0114825.g001</object-id>
<label>Fig 1</label>
<caption>
<title>Birth place distribution of top historical figures averaged over 24 Wikipedia edition for (A) PageRank historical figures (71 countries) and (B) 2DRank historical figures (91 countries).</title>
<p>Two letter country codes are represented in
<xref ref-type="table" rid="pone.0114825.t003">Table 3</xref>
.</p>
</caption>
<graphic xlink:href="pone.0114825.g001"></graphic>
</fig>
<p>Western (Europe and USA) skewed patterns are observed in both top PageRank historical figures (
<xref ref-type="fig" rid="pone.0114825.g001">Fig. 1</xref>
. (A)) and top 2DRank historical figures (
<xref ref-type="fig" rid="pone.0114825.g001">Fig. 1</xref>
. (B)). This Western skewed pattern is remarkable since 11 Wikipedia editions of the 24 considered editions are not European language editions. Germany, USA, Italy, UK and France are the top five birth places of top PageRank historical figures among 71 countries. On the other hand, USA, Germany, UK, Italy and Japan are top five birth places of the top 2DRank historical figures among 91 countries.</p>
<p>In
<xref ref-type="fig" rid="pone.0114825.g002">Fig. 2</xref>
we show the world map of countries, where color indicates the number of persons from a given country among the 24 × 100 top persons for PageRank and 2DRank. Additional figures showing these distributions for different centuries are available at [
<xref rid="pone.0114825.ref039" ref-type="bibr">39</xref>
].</p>
<fig id="pone.0114825.g002" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0114825.g002</object-id>
<label>Fig 2</label>
<caption>
<title>Sum of appearances of historical figures from a given country in the 24 lists of top 100 persons for PageRank (top panel) and 2DRank (bottom panel).</title>
<p>Color changes from zero (white) to maximum (black). Maximal values are 233 appearances for Germany (top) and 236 for USA (bottom). Values are proportional to the averages per country shown in
<xref ref-type="fig" rid="pone.0114825.g001">Fig. 1</xref>
.</p>
</caption>
<graphic xlink:href="pone.0114825.g002"></graphic>
</fig>
<p>We also observed local skewness in the spatial distribution of the top historical figures for the PageRank (2DRank) ranking algorithm as shown in
<xref ref-type="fig" rid="pone.0114825.g003">Fig. 3A</xref>
(in
<xref ref-type="fig" rid="pone.0114825.g003">Fig. 3B</xref>
). For example, 47 percent of the top PageRank historical figures in the English Wikipedia were born in USA (25 percent) and UK (22 percent) and 56 percent of the top historical figures in the Hindi Wikipedia were born in India. A similar strong locality pattern of the top historical figures was observed in our previous research [
<xref rid="pone.0114825.ref021" ref-type="bibr">21</xref>
]. However it should be noted that in the previous study we considered the native language of the top historical figure as a criterion of locality, while in the current study we considered ‘birth place’ as criterion of locality.</p>
<fig id="pone.0114825.g003" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0114825.g003</object-id>
<label>Fig 3</label>
<caption>
<title>Birth place distributions over countries of top historical figures from each Wikipedia edition; two letter country codes are represented in
<xref ref-type="table" rid="pone.0114825.t003">Table 3</xref>
.</title>
<p>Panels: (A) distributions of PageRank historical figures over 71 countries for each Wikipedia edition; (B) distributions of 2DRank historical figures over 91 countries for each Wikipedia edition; (C) column normalized birth place distributions of PageRank historical figures of panel (A); (D) column normalized birth place distributions of 2DRank historical figures of panel (B).</p>
</caption>
<graphic xlink:href="pone.0114825.g003"></graphic>
</fig>
<p>Regional skewness, the preferences of Wikipedia editions for historical figures who were born in geographically or culturally related countries, is also observed. For example, 18 (5) of the top 100 PageRank historical figures in the Korean (Japanese) Wikipedia were born in China. Also 9 of the top 100 PageRank historical figures in the Persian Wikipedia were born in Saudi Arabia. The distribution of top persons from each Wikipedia edition over world countries is shown in
<xref ref-type="fig" rid="pone.0114825.g003">Fig. 3A and Fig. 3B</xref>
. The countries on a horizontal axis are grouped by clusters of corresponding language so that the links inside a given culture (or language) become well visible.</p>
<p>To observe patterns in a better way at low numbers of historical figures, we normalized each column of
<xref ref-type="fig" rid="pone.0114825.g003">Fig. 3A and Fig. 3B</xref>
corresponding to a given country. In this way we obtain a rescaled distribution with better visibility for each birth country level as shown in
<xref ref-type="fig" rid="pone.0114825.g003">Fig. 3C and Fig. 3D</xref>
, respectively. We can observe a clear birth pattern of top PageRank historical figures born in Lebanon, Libya, Oman, and Tunisia in the case of the Arabic Wikipedia, and historical figures born in N. Korea appearing not only in the Korean but also in the Japanese Wikipedia.</p>
<p>In the case of the top 2DRank historical figures shown in
<xref ref-type="fig" rid="pone.0114825.g003">Fig. 3B and Fig. 3D</xref>
, we observe overall patterns of locality and regions being similar to the case of PageRank, but the locality is stronger.</p>
<p>In short, we observed that most of the top historical figures in Wikipedia were born in Western countries, but also that each edition shows its own preference to the historical figures born in countries which are closely related to the corresponding language edition.</p>
</sec>
<sec id="sec010">
<title>Temporal distribution</title>
<p>The analysis of the temporal distribution of top historical figures is done based on their birth dates. As shown in
<xref ref-type="fig" rid="pone.0114825.g004">Fig. 4A</xref>
for PageRank, most of historical figures were born after the 17th century on average, which shows similar pattern with world population growth [
<xref rid="pone.0114825.ref040" ref-type="bibr">40</xref>
]. However, there are some distinctive peaks around BC 5th century and BC 1st century for the case of PageRank because of Greek scholars (
<italic>Socrates, Plato</italic>
, and
<italic>Herodotus</italic>
), Roman politicians (
<italic>Julius Caesar, Augustus</italic>
) and Christianity leaders (
<italic>Jesus, Paul the Apostle</italic>
, and
<italic>Mary (mother of Jesus)</italic>
). We also observe that the Arabic and the Persian Wikipedia have more historical figures than Western language Wikipedia editions from AD 6th century to AD 12th century. For the case of 2DRank in
<xref ref-type="fig" rid="pone.0114825.g004">Fig. 4B</xref>
, there is only one small peak around BC 1C, which is also smaller than the peak in the case of PageRank, and all the distribution is dominated by a strong growth on the 20th century.</p>
<fig id="pone.0114825.g004" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0114825.g004</object-id>
<label>Fig 4</label>
<caption>
<title>Birth date distributions of top historical figures.</title>
<p>(A) Birth date distribution of PageRank historical figures averaged over 24 Wikipedia editions (B) Birth date distribution of 2DRank historical figures averaged over 24 Wikipedia editions (C) Birth date distributions of PageRank historical figures for each Wikipedia edition. (D) Birth date distributions of 2DRank historical figures for each Wikipedia edition. (E) Column normalized birth date distributions of PageRank historical figures for each Wikipedia edition. (F) Column normalized birth date distributions of 2DRank historical figures for each Wikipedia edition.</p>
</caption>
<graphic xlink:href="pone.0114825.g004"></graphic>
</fig>
<p>The distributions of the top PageRank historical figures over the 24 Wikipedia editions for each century are shown in
<xref ref-type="fig" rid="pone.0114825.g004">Fig. 4C</xref>
. The same distribution, but normalized to unity over all editions for each century, is shown in
<xref ref-type="fig" rid="pone.0114825.g004">Fig. 4E</xref>
. The Persian (FA) and the Arabic (AR) Wikipedia have more historical figures than other language editions (in particular European language editions) from the 6th to the 12th century due to Islamic leaders and scholars. On the other hand, the Greek Wikipedia has more historical figures in BC 5th century because of Greek philosophers. Also most of western-southern European language editions, including English, Dutch, German, French, Spanish, Italian, Portuguese, and Greek, have more top historical figures because they have
<italic>Augustine the Hippo</italic>
and
<italic>Justinian I</italic>
in common. Similar distributions obtained from 2DRank are shown in
<xref ref-type="fig" rid="pone.0114825.g004">Fig. 4D and Fig. 4F</xref>
respectively.</p>
<p>The data of
<xref ref-type="fig" rid="pone.0114825.g004">Figs. 4E, F</xref>
clearly show well pronounced patterns, corresponding to strong interactions between cultures: from BC 5th century to AD 15th century for JA, KO, ZH, VI; from AD 6th century to AD 12th century for FA, AR; and a common birth pattern in EN, EL, PT, IT, ES, DE, NL (Western European languages) from BC 5th century to AD 6th century. In supporting Figure S1 we show distributions of historical figures over languages according to their birth place. In this case the above patterns become even more pronounced.</p>
<p>At a first glance from
<xref ref-type="fig" rid="pone.0114825.g004">Figs. 4E, F</xref>
we observe for persons born in AD 20th century a significantly more homogeneous distribution over cultures compared to early centuries. However, as noted in [
<xref rid="pone.0114825.ref021" ref-type="bibr">21</xref>
], each Wikipedia edition favors historical figures speaking the corresponding language. We investigate how this preference to same-language historical figures changes in time. For this analysis, we define two variables
<italic>M</italic>
<sub>
<italic>L</italic>
,
<italic>C</italic>
</sub>
and
<italic>N</italic>
<sub>
<italic>L</italic>
,
<italic>C</italic>
</sub>
for a given language edition
<italic>L</italic>
and a given century
<italic>C</italic>
. Here
<italic>M</italic>
<sub>
<italic>L</italic>
,
<italic>C</italic>
</sub>
is the number of historical figures born in all countries being attributed to a given language
<italic>L</italic>
, and
<italic>N</italic>
<sub>
<italic>L</italic>
,
<italic>C</italic>
</sub>
is the total number of historical figures for a given century
<italic>C</italic>
and a given language edition
<italic>L</italic>
. For example, among the 21 top PageRank historical figures from the English Wikipedia, who were born in AD 20th century, two historical figures (Pope John Paul II and Pope Benedict XVI) were not born in English speaking countries. Thus in this case
<italic>N</italic>
<sub>
<italic>EN</italic>
, 20</sub>
= 21 and
<italic>M</italic>
<sub>
<italic>EN</italic>
, 20</sub>
= 19.
<xref ref-type="fig" rid="pone.0114825.g005">Fig. 5</xref>
represents the ratio
<italic>r</italic>
<sub>
<italic>L</italic>
,
<italic>C</italic>
</sub>
=
<italic>M</italic>
<sub>
<italic>L</italic>
,
<italic>C</italic>
</sub>
/
<italic>N</italic>
<sub>
<italic>L</italic>
,
<italic>C</italic>
</sub>
for each edition and each century. In ancient times (i.e. before AD 5th century), most historical figures for each Wikipedia edition are not born in the same language region except for the Greek, Italian, Hebrew, and Chinese Wikipedia. However, after AD 5th century, the ratio of same language historical figures is rising. Thus, in AD 20th century, most Wikipedia editions have significant numbers of historical figures born in countries speaking the corresponding language. For PageRank persons and AD 20th century, we find that the English edition has the largest fraction of its own language, followed by Arabic and Persian editions while other editions have significantly large connections with other cultures. For the English edition this is related to a significant number of USA presidents appearing in the top 100 list (see [
<xref rid="pone.0114825.ref018" ref-type="bibr">18</xref>
,
<xref rid="pone.0114825.ref019" ref-type="bibr">19</xref>
]). For 2DRank persons the largest fractions were found for Greek, Arabic, Chinese and Japanese cultures. These data show that even in age of globalization there is a significant dominance of local historical figures for certain cultures.</p>
<fig id="pone.0114825.g005" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0114825.g005</object-id>
<label>Fig 5</label>
<caption>
<title>The locality property of cultures represented by the ratio
<italic>r</italic>
<sub>
<italic>L</italic>
,
<italic>C</italic>
</sub>
=
<italic>M</italic>
<sub>
<italic>L</italic>
,
<italic>C</italic>
</sub>
/
<italic>N</italic>
<sub>
<italic>L</italic>
,
<italic>C</italic>
</sub>
for each edition
<italic>L</italic>
and each century
<italic>C</italic>
.</title>
<p>Here
<italic>M</italic>
<sub>
<italic>L</italic>
,
<italic>C</italic>
</sub>
is the number of historical figures born in countries attributed to a given language edition
<italic>L</italic>
at century
<italic>C</italic>
and
<italic>N</italic>
<sub>
<italic>L</italic>
,
<italic>C</italic>
</sub>
is the total number of historical figures in a given edition at a given century, regardless of language of their birth countries. Black color (-0.2 in the color bars) shows that there is no historical figure at all for a given edition and century; blue (0 in the color bars) shows there there are some historical figures but no same language historical figures. Here (A) panel shows PageRank historical figures, and (B) panel shows 2DRank historical figures.</p>
</caption>
<graphic xlink:href="pone.0114825.g005"></graphic>
</fig>
</sec>
<sec id="sec011">
<title>Gender distribution</title>
<p>From the gender distributions of historical figures, we observe a strong male-skewed pattern across many Wikipedia editions regardless of the ranking algorithm. On average, 5.2(10.1) female historical figures are observed among the 100 top PageRank (2DRank) persons for each Wikipedia edition.
<xref ref-type="fig" rid="pone.0114825.g006">Fig. 6</xref>
shows the number of top female historical figures for each Wikipedia edition. Thai, Hindi, Swedish, and Hebrew have more female historical figures than the average over our 24 editions in the case of PageRank. On the other hand, the Greek and the Korean versions have a lower number of females than the average. In the case of 2DRank, English, Hindi, Thai, and Hungarian Wikipedia have more females than the average while German, Chinese, Korean, and Persian Wikipedia have less females than the average. In short, the top historical figures in Wikipedia are quite male-skewed. This is not surprising since females had little chance to be historical figures for most of human history. We compare the gender skewness to other cases such as the number of female editors in Wikipedia (9 percent) in 2011 [
<xref rid="pone.0114825.ref041" ref-type="bibr">41</xref>
] and the share of women in parliaments, which was 18.7 percent in 2012 by UN Statistics and indicators on women and men [
<xref rid="pone.0114825.ref042" ref-type="bibr">42</xref>
], the male skewness for the PageRank list is stronger in the contents of Wikipedia [
<xref rid="pone.0114825.ref043" ref-type="bibr">43</xref>
]. However, the ratio of females among the top historical figures is growing by time as shown in
<xref ref-type="fig" rid="pone.0114825.g006">Fig. 6 C</xref>
. It is notable that the peak in
<xref ref-type="fig" rid="pone.0114825.g006">Fig. 6C</xref>
at BC 1st is due to “Mary (mother of Jesus)”. In the 20th century 2DRank gives a larger percentage of women compared to PageRank. This is due to the fact that 2DRank has a larger fraction of singers and artists comparing to PageRank (see [
<xref rid="pone.0114825.ref018" ref-type="bibr">18</xref>
,
<xref rid="pone.0114825.ref019" ref-type="bibr">19</xref>
]) and that the fraction of women in these fields of activity is larger.</p>
<fig id="pone.0114825.g006" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0114825.g006</object-id>
<label>Fig 6</label>
<caption>
<title>Number of females of top historical figures from each Wikipedia edition (A) Top PageRank historical figures (B) Top 2DRank historical figures.</title>
<p>(C) The average female ratio of historical figures in given centuries across 24 Wikipedia editions.</p>
</caption>
<graphic xlink:href="pone.0114825.g006"></graphic>
</fig>
</sec>
<sec id="sec012">
<title>Global historical figures</title>
<p>Above we analyzed how top historical figures in Wikipedia are distributed in terms of space, time, and gender. Now we identify how these top historical figures are distributed in each Wikipedia edition and which are global historical figures. According to previous research [
<xref rid="pone.0114825.ref021" ref-type="bibr">21</xref>
], there are some global historical figures who are recognized as important historical figures across Wikipedia editions. We identify global historical figures based on the ranking score for a given person determined by her number of appearances and ranking index over our 24 Wikipedia editions.</p>
<p>Following [
<xref rid="pone.0114825.ref021" ref-type="bibr">21</xref>
], the ranking score Θ
<sub>
<italic>P</italic>
,
<italic>A</italic>
</sub>
of a historical figure
<italic>P</italic>
is given by
<disp-formula id="pone.0114825.e005">
<alternatives>
<graphic xlink:href="pone.0114825.e005.jpg" id="pone.0114825.e005g" mimetype="image" position="anchor" orientation="portrait"></graphic>
<mml:math id="M5">
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd columnalign="right">
<mml:mrow>
<mml:msub>
<mml:mo>Θ</mml:mo>
<mml:mrow>
<mml:mi>P</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>A</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:munder>
<mml:mo></mml:mo>
<mml:mi>E</mml:mi>
</mml:munder>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mn>101</mml:mn>
<mml:mo>-</mml:mo>
<mml:msub>
<mml:mi>R</mml:mi>
<mml:mrow>
<mml:mi>P</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>E</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>A</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:math>
</alternatives>
<label>(3)</label>
</disp-formula>
</p>
<p>Here
<italic>R</italic>
<sub>
<italic>P</italic>
,
<italic>E</italic>
,
<italic>A</italic>
</sub>
is the ranking of a historical figure
<italic>P</italic>
in Wikipedia edition
<italic>E</italic>
by ranking algorithm
<italic>A</italic>
. According to this definition, a historical figure who appears more often in the lists of top historical figures for the given 24 Wikipedia editions or has higher ranking in the lists gets a higher ranking score.
<xref ref-type="table" rid="pone.0114825.t004">Table 4</xref>
represents the top 10 global historical figures for PageRank and 2DRank.
<italic>Carl Linnaeus</italic>
is the 1st global historical figure by PageRank followed by
<italic>Jesus, Aristotle. Adolf Hitler</italic>
is the 1st global historical figure by 2DRank followed by
<italic>Michael Jackson, Madonna (entertainer)</italic>
. On the other hand, the lists of the top 10 local historical figures ordered by our ranking score for each language are represented in supporting Tables S1–S25 and [
<xref rid="pone.0114825.ref039" ref-type="bibr">39</xref>
].</p>
<table-wrap id="pone.0114825.t004" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0114825.t004</object-id>
<label>Table 4</label>
<caption>
<title>List of global historical figures by PageRank and 2DRank for all 24 Wikipedia editions.</title>
<p>All names are represented by the corresponding article titles in the English Wikipedia. Here, Θ
<sub>
<italic>A</italic>
</sub>
is the ranking score of algorithm
<italic>A</italic>
(
<xref ref-type="disp-formula" rid="pone.0114825.e005">3</xref>
);
<italic>N</italic>
<sub>
<italic>A</italic>
</sub>
is the number of appearances of a given person in the top 100 rank for all editions.</p>
</caption>
<alternatives>
<graphic id="pone.0114825.t004g" xlink:href="pone.0114825.t004"></graphic>
<table frame="box" rules="all" border="0">
<colgroup span="1">
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
</colgroup>
<thead>
<tr>
<th align="left" rowspan="1" colspan="1">
<bold>Rank</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>PageRank global figures</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>Θ
<sub>
<italic>PR</italic>
</sub>
</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>
<italic>N</italic>
<sub>
<italic>A</italic>
</sub>
</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>2DRank global figures</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>Θ
<sub>2
<italic>D</italic>
</sub>
</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>
<italic>N</italic>
<sub>
<italic>A</italic>
</sub>
</bold>
</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" rowspan="1" colspan="1">1st</td>
<td align="left" rowspan="1" colspan="1">Carl Linnaeus</td>
<td align="left" rowspan="1" colspan="1">2284</td>
<td align="left" rowspan="1" colspan="1">24</td>
<td align="left" rowspan="1" colspan="1">Adolf Hitler</td>
<td align="left" rowspan="1" colspan="1">1557</td>
<td align="left" rowspan="1" colspan="1">20</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">2nd</td>
<td align="left" rowspan="1" colspan="1">Jesus</td>
<td align="left" rowspan="1" colspan="1">2282</td>
<td align="left" rowspan="1" colspan="1">24</td>
<td align="left" rowspan="1" colspan="1">Michael Jackson</td>
<td align="left" rowspan="1" colspan="1">1315</td>
<td align="left" rowspan="1" colspan="1">17</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">3rd</td>
<td align="left" rowspan="1" colspan="1">Aristotle</td>
<td align="left" rowspan="1" colspan="1">2237</td>
<td align="left" rowspan="1" colspan="1">24</td>
<td align="left" rowspan="1" colspan="1">Madonna (entertainer)</td>
<td align="left" rowspan="1" colspan="1">991</td>
<td align="left" rowspan="1" colspan="1">14</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">4th</td>
<td align="left" rowspan="1" colspan="1">Napoleon</td>
<td align="left" rowspan="1" colspan="1">2208</td>
<td align="left" rowspan="1" colspan="1">24</td>
<td align="left" rowspan="1" colspan="1">Jesus</td>
<td align="left" rowspan="1" colspan="1">943</td>
<td align="left" rowspan="1" colspan="1">14</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">5th</td>
<td align="left" rowspan="1" colspan="1">Adolf Hitler</td>
<td align="left" rowspan="1" colspan="1">2112</td>
<td align="left" rowspan="1" colspan="1">24</td>
<td align="left" rowspan="1" colspan="1">Ludwig van Beethoven</td>
<td align="left" rowspan="1" colspan="1">872</td>
<td align="left" rowspan="1" colspan="1">14</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">6th</td>
<td align="left" rowspan="1" colspan="1">Julius Caesar</td>
<td align="left" rowspan="1" colspan="1">1952</td>
<td align="left" rowspan="1" colspan="1">23</td>
<td align="left" rowspan="1" colspan="1">Wolfgang Amadeus Mozart</td>
<td align="left" rowspan="1" colspan="1">853</td>
<td align="left" rowspan="1" colspan="1">11</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">7th</td>
<td align="left" rowspan="1" colspan="1">Plato</td>
<td align="left" rowspan="1" colspan="1">1949</td>
<td align="left" rowspan="1" colspan="1">24</td>
<td align="left" rowspan="1" colspan="1">Pope Benedict XVI</td>
<td align="left" rowspan="1" colspan="1">840</td>
<td align="left" rowspan="1" colspan="1">12</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">8th</td>
<td align="left" rowspan="1" colspan="1">William Shakespeare</td>
<td align="left" rowspan="1" colspan="1">1861</td>
<td align="left" rowspan="1" colspan="1">24</td>
<td align="left" rowspan="1" colspan="1">Alexander the Great</td>
<td align="left" rowspan="1" colspan="1">789</td>
<td align="left" rowspan="1" colspan="1">11</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">9th</td>
<td align="left" rowspan="1" colspan="1">Albert Eistein</td>
<td align="left" rowspan="1" colspan="1">1847</td>
<td align="left" rowspan="1" colspan="1">24</td>
<td align="left" rowspan="1" colspan="1">Charles Darwin</td>
<td align="left" rowspan="1" colspan="1">773</td>
<td align="left" rowspan="1" colspan="1">12</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">10th</td>
<td align="left" rowspan="1" colspan="1">Elizabeth II</td>
<td align="left" rowspan="1" colspan="1">1789</td>
<td align="left" rowspan="1" colspan="1">24</td>
<td align="left" rowspan="1" colspan="1">Barack Obama</td>
<td align="left" rowspan="1" colspan="1">754</td>
<td align="left" rowspan="1" colspan="1">16</td>
</tr>
</tbody>
</table>
</alternatives>
</table-wrap>
<p>The reason for a somewhat unexpected PageRank leader
<italic>Carl Linnaeus</italic>
is related to the fact that he laid the foundations for the modern biological naming scheme so that plenty of articles about animals, insects and plants point to the Wikipedia article about him, which strongly increases the PageRank probability. This happens for all 24 languages where
<italic>Carl Linnaeus</italic>
always appears on high positions since articles about animals and plants are an important fraction of Wikipedia. Even if in a given language the top persons are often politicians (e.g.
<italic>Napoleon, Barak Obama</italic>
at
<italic>K</italic>
= 1, 2 in EN), these politicians have mainly local importance and are not highly ranked in other languages (e.g. in ZH
<italic>Carl Linnaeus</italic>
is at
<italic>K</italic>
= 1,
<italic>Napoleon</italic>
at
<italic>K</italic>
= 3 and
<italic>Barak Obama</italic>
is at
<italic>K</italic>
= 24). As a result when the global contribution is counted over all 24 languages
<italic>Carl Linnaeus</italic>
appears on the top PageRank position.</p>
<p>Our analysis suggests that there might be three groups of historical figures.
<xref ref-type="fig" rid="pone.0114825.g007">Fig. 7</xref>
shows these three groups of top PageRank historical figures in Wikipedia: (i) global historical figures who appear in most of Wikipedia editions (
<italic>N</italic>
<sub>
<italic>A</italic>
</sub>
≥ 18) and are highly ranked (⟨
<italic>K</italic>
⟩ ≤ 50) for each Wikipedia such as Carl Linnaeus, Plato, Jesus, and Napoleon (Right-Top of the
<xref ref-type="fig" rid="pone.0114825.g007">Fig. 7A</xref>
); (ii) local-highly ranked historical figures who appear in a few Wikipedia editions (
<italic>N</italic>
<sub>
<italic>A</italic>
</sub>
< 18) but are highly ranked (⟨
<italic>K</italic>
⟩ ≤ 50) in the Wikipedia editions in which they appear, such as Tycho Brahe, Sejong the Great, and Sun Yat-sen (Left-Top of the
<xref ref-type="fig" rid="pone.0114825.g007">Fig. 7A</xref>
); (iii) locally-low ranked historical figures who appear in a few Wikipedia editions (
<italic>N</italic>
<sub>
<italic>A</italic>
</sub>
< 18) and who are not highly ranked (⟨
<italic>K</italic>
⟩ > 50). Here
<italic>N</italic>
<sub>
<italic>A</italic>
</sub>
is the number of appearances in different Wikipedia editions for a given person and ⟨
<italic>K</italic>
⟩ is the average ranking of the given persons across Wikipedia editions for each ranking algorithm. In the case of 2DRank historical figures, due to the absence of global historical figures, most of them belong to two types of local historical figures (i.e. local-highly ranked or local-lowly ranked).</p>
<fig id="pone.0114825.g007" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0114825.g007</object-id>
<label>Fig 7</label>
<caption>
<title>The distribution of 1045 top PageRank persons (A) and 1616 top 2DRank persons (B) as a function of number of appearances
<italic>N</italic>
<sub>
<italic>A</italic>
</sub>
of a given person and the rank ⟨
<italic>K</italic>
⟩ of this person averaged over Wikipedia editions where this person appeared.</title>
</caption>
<graphic xlink:href="pone.0114825.g007"></graphic>
</fig>
<p>Following ranking of persons via Θ
<sub>
<italic>P</italic>
,
<italic>A</italic>
</sub>
we determine also the top global female historical figures, presented in
<xref ref-type="table" rid="pone.0114825.t005">Table 5</xref>
for PageRank and 2DRank persons. The full lists of global female figures are available at [
<xref rid="pone.0114825.ref039" ref-type="bibr">39</xref>
] (63 and 165 names for PageRank and 2DRank).</p>
<table-wrap id="pone.0114825.t005" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0114825.t005</object-id>
<label>Table 5</label>
<caption>
<title>List of the top 10 global female historical figures by PageRank and 2DRank for all the 24 Wikipedia editions.</title>
<p>All names are represented by article titles in the English Wikipedia. Here, Θ
<sub>
<italic>A</italic>
</sub>
is the ranking score of the algorithm
<italic>A</italic>
(
<xref ref-type="disp-formula" rid="pone.0114825.e005">Eq.3</xref>
);
<italic>N</italic>
<sub>
<italic>A</italic>
</sub>
is the number of appearances of a given person in the top 100 rank for all editions. Here
<italic>CC</italic>
is the birth country code and
<italic>LC</italic>
is the language code of the given historical figure.</p>
</caption>
<alternatives>
<graphic id="pone.0114825.t005g" xlink:href="pone.0114825.t005"></graphic>
<table frame="box" rules="all" border="0">
<colgroup span="1">
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
</colgroup>
<thead>
<tr>
<th align="left" rowspan="1" colspan="1">
<bold>Rank</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>Θ
<sub>
<italic>PR</italic>
</sub>
</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>
<italic>N</italic>
<sub>
<italic>A</italic>
</sub>
</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>PageRank female figures</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>
<italic>CC</italic>
</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>Century</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>
<italic>LC</italic>
</bold>
</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" rowspan="1" colspan="1">1</td>
<td align="left" rowspan="1" colspan="1">1789</td>
<td align="left" rowspan="1" colspan="1">24</td>
<td align="left" rowspan="1" colspan="1">Elizabeth II</td>
<td align="left" rowspan="1" colspan="1">UK</td>
<td align="left" rowspan="1" colspan="1">20</td>
<td align="left" rowspan="1" colspan="1">EN</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">2</td>
<td align="left" rowspan="1" colspan="1">1094</td>
<td align="left" rowspan="1" colspan="1">17</td>
<td align="left" rowspan="1" colspan="1">Mary (mother of Jesus)</td>
<td align="left" rowspan="1" colspan="1">IL</td>
<td align="left" rowspan="1" colspan="1">-1</td>
<td align="left" rowspan="1" colspan="1">HE</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">3</td>
<td align="left" rowspan="1" colspan="1">404</td>
<td align="left" rowspan="1" colspan="1">12</td>
<td align="left" rowspan="1" colspan="1">Queen Victoria</td>
<td align="left" rowspan="1" colspan="1">UK</td>
<td align="left" rowspan="1" colspan="1">19</td>
<td align="left" rowspan="1" colspan="1">EN</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">4</td>
<td align="left" rowspan="1" colspan="1">234</td>
<td align="left" rowspan="1" colspan="1">6</td>
<td align="left" rowspan="1" colspan="1">Elizabeth I of England</td>
<td align="left" rowspan="1" colspan="1">UK</td>
<td align="left" rowspan="1" colspan="1">16</td>
<td align="left" rowspan="1" colspan="1">EN</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">5</td>
<td align="left" rowspan="1" colspan="1">128</td>
<td align="left" rowspan="1" colspan="1">2</td>
<td align="left" rowspan="1" colspan="1">Maria Theresa</td>
<td align="left" rowspan="1" colspan="1">AT</td>
<td align="left" rowspan="1" colspan="1">18</td>
<td align="left" rowspan="1" colspan="1">DE</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">6</td>
<td align="left" rowspan="1" colspan="1">100</td>
<td align="left" rowspan="1" colspan="1">1</td>
<td align="left" rowspan="1" colspan="1">Benazir Bhutto</td>
<td align="left" rowspan="1" colspan="1">PK</td>
<td align="left" rowspan="1" colspan="1">20</td>
<td align="left" rowspan="1" colspan="1">HI</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">7</td>
<td align="left" rowspan="1" colspan="1">94</td>
<td align="left" rowspan="1" colspan="1">1</td>
<td align="left" rowspan="1" colspan="1">Catherine the Great</td>
<td align="left" rowspan="1" colspan="1">PL</td>
<td align="left" rowspan="1" colspan="1">18</td>
<td align="left" rowspan="1" colspan="1">PL</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">8</td>
<td align="left" rowspan="1" colspan="1">91</td>
<td align="left" rowspan="1" colspan="1">1</td>
<td align="left" rowspan="1" colspan="1">Anne Frank</td>
<td align="left" rowspan="1" colspan="1">DE</td>
<td align="left" rowspan="1" colspan="1">20</td>
<td align="left" rowspan="1" colspan="1">DE</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">9</td>
<td align="left" rowspan="1" colspan="1">87</td>
<td align="left" rowspan="1" colspan="1">1</td>
<td align="left" rowspan="1" colspan="1">Indira Gandhi</td>
<td align="left" rowspan="1" colspan="1">IN</td>
<td align="left" rowspan="1" colspan="1">20</td>
<td align="left" rowspan="1" colspan="1">HI</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">10</td>
<td align="left" rowspan="1" colspan="1">86</td>
<td align="left" rowspan="1" colspan="1">1</td>
<td align="left" rowspan="1" colspan="1">Margrethe II of Denmark</td>
<td align="left" rowspan="1" colspan="1">DK</td>
<td align="left" rowspan="1" colspan="1">20</td>
<td align="left" rowspan="1" colspan="1">DA</td>
</tr>
</tbody>
</table>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" rowspan="1" colspan="1">
<bold>Rank</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>Θ
<sub>2
<italic>D</italic>
</sub>
</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>
<italic>N</italic>
<sub>
<italic>A</italic>
</sub>
</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>2DRank female figures</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>
<italic>CC</italic>
</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>Century</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>
<italic>LC</italic>
</bold>
</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" rowspan="1" colspan="1">1</td>
<td align="left" rowspan="1" colspan="1">991</td>
<td align="left" rowspan="1" colspan="1">14</td>
<td align="left" rowspan="1" colspan="1">Madonna (entertainer)</td>
<td align="left" rowspan="1" colspan="1">US</td>
<td align="left" rowspan="1" colspan="1">20</td>
<td align="left" rowspan="1" colspan="1">EN</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">2</td>
<td align="left" rowspan="1" colspan="1">664</td>
<td align="left" rowspan="1" colspan="1">9</td>
<td align="left" rowspan="1" colspan="1">Elizabeth II</td>
<td align="left" rowspan="1" colspan="1">UK</td>
<td align="left" rowspan="1" colspan="1">20</td>
<td align="left" rowspan="1" colspan="1">EN</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">3</td>
<td align="left" rowspan="1" colspan="1">580</td>
<td align="left" rowspan="1" colspan="1">8</td>
<td align="left" rowspan="1" colspan="1">Mary (mother of Jesus)</td>
<td align="left" rowspan="1" colspan="1">IL</td>
<td align="left" rowspan="1" colspan="1">-1</td>
<td align="left" rowspan="1" colspan="1">HE</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">4</td>
<td align="left" rowspan="1" colspan="1">550</td>
<td align="left" rowspan="1" colspan="1">9</td>
<td align="left" rowspan="1" colspan="1">Queen Victoria</td>
<td align="left" rowspan="1" colspan="1">UK</td>
<td align="left" rowspan="1" colspan="1">19</td>
<td align="left" rowspan="1" colspan="1">EN</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">5</td>
<td align="left" rowspan="1" colspan="1">225</td>
<td align="left" rowspan="1" colspan="1">5</td>
<td align="left" rowspan="1" colspan="1">Agatha Christie</td>
<td align="left" rowspan="1" colspan="1">UK</td>
<td align="left" rowspan="1" colspan="1">19</td>
<td align="left" rowspan="1" colspan="1">EN</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">6</td>
<td align="left" rowspan="1" colspan="1">211</td>
<td align="left" rowspan="1" colspan="1">4</td>
<td align="left" rowspan="1" colspan="1">Mariah Carey</td>
<td align="left" rowspan="1" colspan="1">US</td>
<td align="left" rowspan="1" colspan="1">20</td>
<td align="left" rowspan="1" colspan="1">EN</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">7</td>
<td align="left" rowspan="1" colspan="1">206</td>
<td align="left" rowspan="1" colspan="1">7</td>
<td align="left" rowspan="1" colspan="1">Britney Spears</td>
<td align="left" rowspan="1" colspan="1">US</td>
<td align="left" rowspan="1" colspan="1">20</td>
<td align="left" rowspan="1" colspan="1">EN</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">8</td>
<td align="left" rowspan="1" colspan="1">200</td>
<td align="left" rowspan="1" colspan="1">3</td>
<td align="left" rowspan="1" colspan="1">Margaret Thatcher</td>
<td align="left" rowspan="1" colspan="1">UK</td>
<td align="left" rowspan="1" colspan="1">20</td>
<td align="left" rowspan="1" colspan="1">EN</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">9</td>
<td align="left" rowspan="1" colspan="1">191</td>
<td align="left" rowspan="1" colspan="1">2</td>
<td align="left" rowspan="1" colspan="1">Martina Navratilova</td>
<td align="left" rowspan="1" colspan="1">CZ</td>
<td align="left" rowspan="1" colspan="1">20</td>
<td align="left" rowspan="1" colspan="1">WR</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">10</td>
<td align="left" rowspan="1" colspan="1">175</td>
<td align="left" rowspan="1" colspan="1">2</td>
<td align="left" rowspan="1" colspan="1">Elizabeth I of England</td>
<td align="left" rowspan="1" colspan="1">UK</td>
<td align="left" rowspan="1" colspan="1">16</td>
<td align="left" rowspan="1" colspan="1">EN</td>
</tr>
</tbody>
</table>
</alternatives>
</table-wrap>
<p>The comparison of our 100 global historical figures with the top 100 from Hart’s list [
<xref rid="pone.0114825.ref027" ref-type="bibr">27</xref>
] gives an overlap of 43 persons for PageRank and 26 persons for 2DRank. We note that for the top 100 from the English Wikipedia we obtain a lower overlap of 37 (PageRank) and 4 (2DRank) persons. Among all editions the highest overlaps with the Hart list are 42 (VI), 37 (EN, ES, PT, TR) and 33 (IT), 32 (DE), 31 (FR) for PageRank; while for 2DRank we find 18 (EL) and 17 (VI). We give the overlap numbers for all editions at [
<xref rid="pone.0114825.ref039" ref-type="bibr">39</xref>
]. This shows that the consideration of 24 editions provides us the global list of the top 100 persons with a more balanced selection of top historical figures. Our overlap of the top 100 global historical figures by PageRank with the top 100 people from Pantheon MIT ranking list [
<xref rid="pone.0114825.ref023" ref-type="bibr">23</xref>
] is 44 percent, while the overlap of this Pantheon list with Hart’s list is 43 percent. We note that the Pantheon method is significantly based on a number of page views while our approach is based on the network structure of the whole Wikipedia network. The top 100 persons from [
<xref rid="pone.0114825.ref022" ref-type="bibr">22</xref>
] are not publicly available but nevertheless we present the overlaps between the top 100 persons from the lists of Hart, Pantheon, Stony-Brook and our global PageRank and 2DRank lists in Figures S2, S3 (we received the Stony-Brook list as a private message from the authors of [
<xref rid="pone.0114825.ref022" ref-type="bibr">22</xref>
]). We have an average overlap between the 4 methods on a level of 40 percent (2DRank is on average lower by a few percent), we find a larger overlap between our PageRank list and the Stony-Brook list since the Stony-Brook method, applied only for the English Wikipedia, is significantly based on PageRank.</p>
<p>We also compared the distributions of our global top 100 persons of PageRank and 2DRank with the distribution of Hart’s top 100 over centuries and over 24 languages with the additional WR category (see Figure S4). We find that these 3 distributions have very similar shapes. Thus the largest number of persons appears in centuries AD 18th, 19th, 20th for the 3 distributions. Among languages, the main peaks for the 3 distributions appear for EN, DE, IT, EL, AR, ZH. The deviations from Hart’s distribution are larger for the 2DRank list. Thus the comparison of distributions over centuries and languages shows that the PageRank list has not only a strong overlap with the Hart list in the number of persons but that they also have very similar statistical distributions of the top 100 persons over centuries and languages.</p>
<p>The overlap of the top 100 global persons found here with the previous study [
<xref rid="pone.0114825.ref021" ref-type="bibr">21</xref>
] gives 54 and 47 percent for PageRank and 2DRank lists, respectively. However, we note that the global list in [
<xref rid="pone.0114825.ref021" ref-type="bibr">21</xref>
] was obtained from the top 30 persons in each edition while here we use the top 100 persons.</p>
<p>It is interesting to note that for the top 100 PageRank universities from the English Wikipedia edition the overlap with Shanghai top 100 list of universities is on a even higher level of 75 percent [
<xref rid="pone.0114825.ref018" ref-type="bibr">18</xref>
].</p>
<p>Finally, we note that the ranking of historical figures using the whole PageRank (or 2DRank) list of all Wikipedia articles of a given edition provides a more stable approach compared to the network of biographical articles used in [
<xref rid="pone.0114825.ref020" ref-type="bibr">20</xref>
]. Indeed, the number of nodes and links in such a biographical network is significantly smaller compared to the whole network of Wikipedia articles and thus the fluctuations become rather large. For example, from the biographical network of the Russian edition one finds as the top person
<italic>Napoleon III</italic>
(and even not
<italic>Napoleon I</italic>
) [
<xref rid="pone.0114825.ref020" ref-type="bibr">20</xref>
], who has a rather low importance for Russia. In contrast to that the present study gives us the top PageRank historical figure of the Russian edition to be
<italic>Peter the Great</italic>
, that has much more historical grounds. In a similar way for FR the results of [
<xref rid="pone.0114825.ref020" ref-type="bibr">20</xref>
] give at the first position
<italic>Adolf Hitler</italic>
, that is rather strange for the French culture, while we find a natural result
<italic>Napoleon</italic>
.</p>
</sec>
<sec id="sec013">
<title>Network of cultures</title>
<p>We consider the selected top persons from each Wikipedia edition as important historical figures recognized by people who speak the language of that Wikipedia edition. Therefore, if a top person from a language edition
<italic>A</italic>
appears in another edition
<italic>B</italic>
, then we can consider this as a ‘cultural’ influence from culture
<italic>A</italic>
to
<italic>B</italic>
. Here we consider each language as a proxy for a cultural group and assign each historical figure to one of these cultural groups based on the most spoken language of her/his birth place at the country level. For example,
<italic>Adolf Hitler</italic>
was born in modern Austria and since German language is the most spoken language in Austria, he is considered as a German historical figure in our analysis. This method may lead to some misguiding results due to discrepancy between territories of country and cultures, e.g.
<italic>Jesus</italic>
was born in the modern State of Palestine (Bethlehem), which is an Arabic speaking country. Thus
<italic>Jesus</italic>
is from the Arabic culture in our analysis while usually one would say that he belongs to the Hebrew culture. Other similar examples we find are:
<italic>Charlemagne</italic>
(Belgium—Dutch),
<italic>Immanuel Kant</italic>
(Russia—Russian, while usually he is attributed to DE),
<italic>Moses</italic>
(Egypt—Arabic),
<italic>Catherine the Great</italic>
(Poland—Polish, while usually she would be attributed to DE or RU).</p>
<p>In total there are such 36 cases from the global PageRank list of 1045 names (these 36 names are given in SI). However, in our knowledge, the birth place is the best way to assign a given historical figure to a certain cultural background computationally and systematically and with the data we have available. In total we have only about 3.4 percent of cases which can be discussed and where a native speaking language can be a better indicator of belonging to a given culture. For the global 2DRank list of 1616 names we identified 53 similar cases where an attribution to a culture via a native language or a birth place could be discussed (about 3.3 percent). These 53 names are given in SI. About half of such cases are linked with birth places in ancient Russian Empire where people from Belarus, Litvania and Ukraine moved to RU, IL, PL, WR. However, the percentage of such cases is small and the corresponding errors also remain small.</p>
<p>Based on the above assumption and following the approach developed in [
<xref rid="pone.0114825.ref021" ref-type="bibr">21</xref>
], we construct two weighted networks of cultures (or language groups) based on the top PageRank historical figures and top 2DRank historical figures respectively. Each culture (i.e. language) is represented as a node of the network, and the weight of a directed link from culture
<italic>A</italic>
to culture
<italic>B</italic>
is given by the number of historical figures belonging to culture
<italic>B</italic>
(e.g. French) appearing in the list of top 100 historical figures for a given culture
<italic>A</italic>
(e.g. English). The persons in a given edition, belonging to the language of the edition, are not taken into account since they do not create links between cultures. In
<xref ref-type="table" rid="pone.0114825.t006">Table 6</xref>
we give the number of such persons for each language. This table also gives the number of persons of a given language among the top 100 persons of the global PageRank and 2DRank listings.</p>
<table-wrap id="pone.0114825.t006" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0114825.t006</object-id>
<label>Table 6</label>
<caption>
<title>Numbers of certain historical figures for top 100 list of each language:
<italic>N</italic>
<sub>1</sub>
is the number of historical figures of a given language among the top 100 PageRank global historical figures;
<italic>N</italic>
<sub>2</sub>
is the number of historical figures of a given language among the top 100 PageRank historical figures for the given language edition;
<italic>N</italic>
<sub>3</sub>
is the number of historical figures of a given language among the top 100 2DRank global historical figures;
<italic>N</italic>
<sub>4</sub>
is the number of historical figures of a given language among the top 100 2DRank historical figures for the given language edition.</title>
</caption>
<alternatives>
<graphic id="pone.0114825.t006g" xlink:href="pone.0114825.t006"></graphic>
<table frame="box" rules="all" border="0">
<colgroup span="1">
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
<col align="left" valign="top" span="1"></col>
</colgroup>
<thead>
<tr>
<th align="left" rowspan="1" colspan="1">
<bold>Language</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>
<italic>N</italic>
<sub>1</sub>
</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>
<italic>N</italic>
<sub>2</sub>
</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>
<italic>N</italic>
<sub>3</sub>
</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>
<italic>N</italic>
<sub>4</sub>
</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>Language</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>
<italic>N</italic>
<sub>1</sub>
</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>
<italic>N</italic>
<sub>2</sub>
</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>
<italic>N</italic>
<sub>3</sub>
</bold>
</th>
<th align="left" rowspan="1" colspan="1">
<bold>
<italic>N</italic>
<sub>4</sub>
</bold>
</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" rowspan="1" colspan="1">EN</td>
<td align="left" rowspan="1" colspan="1">22</td>
<td align="left" rowspan="1" colspan="1">47</td>
<td align="left" rowspan="1" colspan="1">27</td>
<td align="left" rowspan="1" colspan="1">64</td>
<td align="left" rowspan="1" colspan="1">RU</td>
<td align="left" rowspan="1" colspan="1">2</td>
<td align="left" rowspan="1" colspan="1">29</td>
<td align="left" rowspan="1" colspan="1">3</td>
<td align="left" rowspan="1" colspan="1">27</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">NL</td>
<td align="left" rowspan="1" colspan="1">2</td>
<td align="left" rowspan="1" colspan="1">10</td>
<td align="left" rowspan="1" colspan="1">4</td>
<td align="left" rowspan="1" colspan="1">38</td>
<td align="left" rowspan="1" colspan="1">HE</td>
<td align="left" rowspan="1" colspan="1">2</td>
<td align="left" rowspan="1" colspan="1">17</td>
<td align="left" rowspan="1" colspan="1">2</td>
<td align="left" rowspan="1" colspan="1">22</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">DE</td>
<td align="left" rowspan="1" colspan="1">20</td>
<td align="left" rowspan="1" colspan="1">41</td>
<td align="left" rowspan="1" colspan="1">16</td>
<td align="left" rowspan="1" colspan="1">55</td>
<td align="left" rowspan="1" colspan="1">TR</td>
<td align="left" rowspan="1" colspan="1">2</td>
<td align="left" rowspan="1" colspan="1">27</td>
<td align="left" rowspan="1" colspan="1">2</td>
<td align="left" rowspan="1" colspan="1">54</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">FR</td>
<td align="left" rowspan="1" colspan="1">8</td>
<td align="left" rowspan="1" colspan="1">33</td>
<td align="left" rowspan="1" colspan="1">3</td>
<td align="left" rowspan="1" colspan="1">32</td>
<td align="left" rowspan="1" colspan="1">AR</td>
<td align="left" rowspan="1" colspan="1">8</td>
<td align="left" rowspan="1" colspan="1">42</td>
<td align="left" rowspan="1" colspan="1">5</td>
<td align="left" rowspan="1" colspan="1">69</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">ES</td>
<td align="left" rowspan="1" colspan="1">2</td>
<td align="left" rowspan="1" colspan="1">20</td>
<td align="left" rowspan="1" colspan="1">5</td>
<td align="left" rowspan="1" colspan="1">39</td>
<td align="left" rowspan="1" colspan="1">FA</td>
<td align="left" rowspan="1" colspan="1">0</td>
<td align="left" rowspan="1" colspan="1">46</td>
<td align="left" rowspan="1" colspan="1">1</td>
<td align="left" rowspan="1" colspan="1">64</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">IT</td>
<td align="left" rowspan="1" colspan="1">11</td>
<td align="left" rowspan="1" colspan="1">31</td>
<td align="left" rowspan="1" colspan="1">9</td>
<td align="left" rowspan="1" colspan="1">43</td>
<td align="left" rowspan="1" colspan="1">HI</td>
<td align="left" rowspan="1" colspan="1">1</td>
<td align="left" rowspan="1" colspan="1">65</td>
<td align="left" rowspan="1" colspan="1">0</td>
<td align="left" rowspan="1" colspan="1">76</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">PT</td>
<td align="left" rowspan="1" colspan="1">0</td>
<td align="left" rowspan="1" colspan="1">19</td>
<td align="left" rowspan="1" colspan="1">0</td>
<td align="left" rowspan="1" colspan="1">35</td>
<td align="left" rowspan="1" colspan="1">MS</td>
<td align="left" rowspan="1" colspan="1">0</td>
<td align="left" rowspan="1" colspan="1">15</td>
<td align="left" rowspan="1" colspan="1">0</td>
<td align="left" rowspan="1" colspan="1">40</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">EL</td>
<td align="left" rowspan="1" colspan="1">5</td>
<td align="left" rowspan="1" colspan="1">28</td>
<td align="left" rowspan="1" colspan="1">2</td>
<td align="left" rowspan="1" colspan="1">55</td>
<td align="left" rowspan="1" colspan="1">TH</td>
<td align="left" rowspan="1" colspan="1">0</td>
<td align="left" rowspan="1" colspan="1">46</td>
<td align="left" rowspan="1" colspan="1">0</td>
<td align="left" rowspan="1" colspan="1">53</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">DA</td>
<td align="left" rowspan="1" colspan="1">0</td>
<td align="left" rowspan="1" colspan="1">31</td>
<td align="left" rowspan="1" colspan="1">1</td>
<td align="left" rowspan="1" colspan="1">48</td>
<td align="left" rowspan="1" colspan="1">VI</td>
<td align="left" rowspan="1" colspan="1">0</td>
<td align="left" rowspan="1" colspan="1">7</td>
<td align="left" rowspan="1" colspan="1">0</td>
<td align="left" rowspan="1" colspan="1">30</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">SV</td>
<td align="left" rowspan="1" colspan="1">1</td>
<td align="left" rowspan="1" colspan="1">26</td>
<td align="left" rowspan="1" colspan="1">1</td>
<td align="left" rowspan="1" colspan="1">39</td>
<td align="left" rowspan="1" colspan="1">ZH</td>
<td align="left" rowspan="1" colspan="1">5</td>
<td align="left" rowspan="1" colspan="1">43</td>
<td align="left" rowspan="1" colspan="1">6</td>
<td align="left" rowspan="1" colspan="1">79</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">PL</td>
<td align="left" rowspan="1" colspan="1">1</td>
<td align="left" rowspan="1" colspan="1">20</td>
<td align="left" rowspan="1" colspan="1">2</td>
<td align="left" rowspan="1" colspan="1">26</td>
<td align="left" rowspan="1" colspan="1">KO</td>
<td align="left" rowspan="1" colspan="1">0</td>
<td align="left" rowspan="1" colspan="1">34</td>
<td align="left" rowspan="1" colspan="1">0</td>
<td align="left" rowspan="1" colspan="1">59</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">HU</td>
<td align="left" rowspan="1" colspan="1">0</td>
<td align="left" rowspan="1" colspan="1">18</td>
<td align="left" rowspan="1" colspan="1">0</td>
<td align="left" rowspan="1" colspan="1">18</td>
<td align="left" rowspan="1" colspan="1">JA</td>
<td align="left" rowspan="1" colspan="1">0</td>
<td align="left" rowspan="1" colspan="1">41</td>
<td align="left" rowspan="1" colspan="1">4</td>
<td align="left" rowspan="1" colspan="1">80</td>
</tr>
<tr>
<td align="left" rowspan="1" colspan="1">WR</td>
<td align="left" rowspan="1" colspan="1">8</td>
<td align="left" rowspan="1" colspan="1">-</td>
<td align="left" rowspan="1" colspan="1">7</td>
<td align="left" rowspan="1" colspan="1">-</td>
<td align="left" rowspan="1" colspan="1"></td>
<td align="left" rowspan="1" colspan="1"></td>
<td align="left" rowspan="1" colspan="1"></td>
<td align="left" rowspan="1" colspan="1"></td>
<td align="left" rowspan="1" colspan="1"></td>
</tr>
</tbody>
</table>
</alternatives>
</table-wrap>
<p>For example, there are 5 French historical figures among the top 100 PageRank historical figures of the English Wikipedia, so we can assign weight 5 to the link from English to French.
<xref ref-type="fig" rid="pone.0114825.g008">Fig. 8A and Fig. 8B</xref>
represent the constructed networks of cultures defined by appearances of the top PageRank historical figures and top 2DRank historical figures, respectively. In total we have two networks with 25 nodes which include our 24 editions and an additional node WR for all the other world cultures.</p>
<fig id="pone.0114825.g008" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0114825.g008</object-id>
<label>Fig 8</label>
<caption>
<title>Network of cultures obtained from 24 Wikipedia languages and the remaining world (WR) consider (A) top PageRank historical figures and (B) 2DRank historical figures.</title>
<p>The link width and darkness are proportional to a number of foreign historical figures quoted in top 100 of a given culture, the link direction goes from a given culture to cultures of quoted foreign historical figures, links inside cultures are not considered. The size of nodes is proportional to their PageRank.</p>
</caption>
<graphic xlink:href="pone.0114825.g008"></graphic>
</fig>
<p>The Google matrix
<italic>G</italic>
<sub>
<italic>ij</italic>
</sub>
for each network is constructed following the standard rules described in [
<xref rid="pone.0114825.ref021" ref-type="bibr">21</xref>
] and in the Methods Section. In a standard way we determine the PageRank index
<italic>K</italic>
and the CheiRank index
<italic>K</italic>
* that order all cultures according to decreasing PageRank and CheiRank probabilities (see
<xref ref-type="sec" rid="sec002">Methods</xref>
and Figure S5). The structure of matrix elements
<italic>G</italic>
<sub>
<italic>KK</italic>
<sup></sup>
</sub>
is shown in
<xref ref-type="fig" rid="pone.0114825.g009">Fig. 9</xref>
.</p>
<fig id="pone.0114825.g009" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0114825.g009</object-id>
<label>Fig 9</label>
<caption>
<title>Google matrix of network of cultures shown in
<xref ref-type="fig" rid="pone.0114825.g008">Fig. 8</xref>
respectively.</title>
<p>The matrix elements
<italic>G</italic>
<sub>
<italic>ij</italic>
</sub>
are shown by color with damping factor
<italic>α</italic>
= 0.85.</p>
</caption>
<graphic xlink:href="pone.0114825.g009"></graphic>
</fig>
<p>To identify which cultures (or language groups) are more influential than others, we calculated PageRank and CheiRank of the constructed networks of cultures by considering link weights. Briefly speaking, a culture has high PageRank (CheiRank) if it has many ingoing (outgoing) links from (to) other cultures (see
<xref ref-type="sec" rid="sec002">Methods</xref>
). The distribution of cultures on a PageRank-CheiRank plane is shown in
<xref ref-type="fig" rid="pone.0114825.g010">Fig. 10</xref>
. In both cases of PageRank and 2DRank historical figures, historical figures of English culture (i.e. born in English language spoken countries) are the most influential (highest PageRank) and German culture is the second one (
<xref ref-type="fig" rid="pone.0114825.g010">Fig. 10A, B</xref>
). Here we consider the historical figures for the whole range of centuries.
<xref ref-type="fig" rid="pone.0114825.g010">Fig. 10</xref>
represents the detailed features of how each culture is located on the plane of PageRank ranking
<italic>K</italic>
and CheiRank ranking
<italic>K</italic>
* based on the top PageRank historical figures (
<xref ref-type="fig" rid="pone.0114825.g010">Fig. 10A</xref>
) and top 2DRank historical figures (
<xref ref-type="fig" rid="pone.0114825.g010">Fig. 10B</xref>
). Here
<italic>K</italic>
indicates the ranking of a given culture ordered by how many of its own top historical figures appear in other Wikipedia editions, and
<italic>K</italic>
* indicates the ranking of a given culture according to how many of the top historical figures in the considered culture are from other cultures. As described above, English is on (
<italic>K</italic>
= 1,
<italic>K</italic>
* = 19) and German is on (
<italic>K</italic>
= 2,
<italic>K</italic>
* = 21) in the case of PageRank historical figures (
<xref ref-type="fig" rid="pone.0114825.g010">Fig. 10A</xref>
). In the case of 2DRank historical figures, English is on (
<italic>K</italic>
= 1,
<italic>K</italic>
* = 14) and German is on (
<italic>K</italic>
= 2,
<italic>K</italic>
* = 9).</p>
<fig id="pone.0114825.g010" orientation="portrait" position="float">
<object-id pub-id-type="doi">10.1371/journal.pone.0114825.g010</object-id>
<label>Fig 10</label>
<caption>
<title>PageRank ranking versus CheiRank ranking plane of cultures with corresponding indexes
<italic>K</italic>
and
<italic>K</italic>
* obtained from the network of cultures based on (A) all PageRank historical figures, (B) all 2DRank historical figures, (C) PageRank historical figure born before AD 19th century, and (D) 2DRank historical figure born before AD 19th century, respectively.</title>
</caption>
<graphic xlink:href="pone.0114825.g010"></graphic>
</fig>
<p>It is important to note that there is a significant difference compared to the previous study [
<xref rid="pone.0114825.ref021" ref-type="bibr">21</xref>
]: there, only 9 editions had been considered and the top positions were attributed to the world node WR which captured a significant fraction of the top persons. This indicated that 9 editions are not sufficient to cover the whole world. Now for 24 editions we see that the importance of the world node WR is much lower (it moves from
<italic>K</italic>
= 1 for 9 editions [
<xref rid="pone.0114825.ref021" ref-type="bibr">21</xref>
] to
<italic>K</italic>
= 4 and 3 in
<xref ref-type="fig" rid="pone.0114825.g010">Fig. 10A and Fig. 10B</xref>
). Thus our 24 editions cover the majority the world. Still it would be desirable to add a few additional editions (e.g. Ukraine, Baltic Republics, Serbia etc.) to fill certain gaps.</p>
<p>It is interesting to note that the ranking plane of cultures (
<italic>K</italic>
,
<italic>K</italic>
*) changes significantly in time. Indeed, if we take into account only persons born before the 19th century then the ranking is modified with EN going to 4th (
<xref ref-type="fig" rid="pone.0114825.g010">Fig. 10C</xref>
for PageRank figures) and 6th position (
<xref ref-type="fig" rid="pone.0114825.g010">Fig. 10C</xref>
for 2DRank figures) while the top positions are taken by IT, DE, FR and DE, IT, AR, respectively.</p>
<p>At the same time, we may also argue that for cultures it is important not only to be cited but also to be communicative with other cultures. To characterize communicative properties of nodes on the network of cultures shown in
<xref ref-type="fig" rid="pone.0114825.g008">Fig. 8</xref>
we use again the concepts of PageRank, CheiRank and 2DRank for these networks as described in Methods and [
<xref rid="pone.0114825.ref021" ref-type="bibr">21</xref>
]. Thus, for the network of cultures of
<xref ref-type="fig" rid="pone.0114825.g008">Fig. 8</xref>
, the 2DRank index of cultures highlights their influence in a more balanced way taking into account their importance (incoming links) and communicative (outgoing links) properties in a balanced manner.</p>
<p>Thus we find for all centuries at the top positions Greek, Turkish and Arabic (for PageRank persons) and French, Russian and Arabic (for 2DRank persons). For historical figures before the 19th century, we find respectively Arabic, Turkish and Greek (for PageRank) and Arabic, Greek and Hebrew (for 2DRank). The high position of Turkish is due to its close links both with Greek culture in ancient times and with Arabic culture in more recent times. We see also that with time the positions of Greek in 2DRank improves due to a global improved ranking of Western cultures closely connected with Greece.</p>
</sec>
</sec>
<sec sec-type="conclusions" id="sec014">
<title>Discussion</title>
<p>By investigating birth place, birth date, and gender of important historical figures determined by the network structure of Wikipedia, we identified spatial, temporal, and gender skewness in Wikipedia. Our analysis shows that the most important historical figures across Wikipedia language editions were born in Western countries after the 17th century, and are male. Also, each Wikipedia edition highlights local figures so that most of its own historical figures are born in the countries which use the language of the edition. The emergence of such pronounced accent to local figures seems to be natural since there are more links and interactions within one culture. This is also visible from to the fact that in many editions the main country for the given language is at the first PageRank position among all articles (e.g. Russia in RU edition) [
<xref rid="pone.0114825.ref021" ref-type="bibr">21</xref>
]. Despite such a locality feature, there are also global historical figures who appear in most of the considered Wikipedia editions with very high rankings. Based on the cross-cultural historical figures, who appear in multiple editions, we can construct a network of cultures which describes interactions and entanglement between cultures.</p>
<p>It is very difficult to describe history in an objective way and due to that it was argued that history is “an unending dialogue between the past and present” [
<xref rid="pone.0114825.ref044" ref-type="bibr">44</xref>
]. In a similar way we can say that history is an unending dialogue between different cultural groups.</p>
<p>We use a computational and data mining approach, based on rank vectors of the Google matrix of Wikipedia, to perform a statistical analysis of interactions and entanglement of cultures. We find that this approach can be used for selecting the most influential historical figures through an analysis of collectively generated links between articles on Wikipedia. Our results are coherent with studies conducted by historians [
<xref rid="pone.0114825.ref027" ref-type="bibr">27</xref>
], with an overlap of 43% of important historical figures. Thus, such a mathematical analysis of local and global historical figures can be a useful step towards the understanding of local and global history and interactions of world cultures. Our approach has some limitations, mainly caused by the data source and by the difficulty of defining culture boundaries across centuries. The ongoing improvement of structured content in Wikipedia through the WikiData project, eventually in conjunction with additional manual annotation, should allow to deal with these limitations. Furthermore, it would be useful to perform comparisons with other approaches to measure the interactions of cultures, such as the analysis of language crossings of multilingual users [
<xref rid="pone.0114825.ref045" ref-type="bibr">45</xref>
].</p>
<p>Influence of digital media on information dissemination and social collective opinions among the public is growing fast. Our research across Wikipedia language editions suggests a rigorous mathematical way, based on Markov chains and Google matrix, for the identification of important historical figures and for the analysis of interactions of cultures at different historical periods and in different world regions. We think that a further extension of this approach to a larger number of Wikipedia editions will provide a more detailed and balanced analysis of interactions of world cultures.</p>
</sec>
<sec sec-type="supplementary-material" id="sec015">
<title>Supporting Information</title>
<supplementary-material content-type="local-data" id="pone.0114825.s001">
<label>S1 File</label>
<caption>
<title>Supporting Information file S1 presents Figures S1–S5 with additional information discussed above in the main part of the paper, lists of top 100 global PageRank and 2DRank names; Tables S1–S25 of top 10 names of given language and remained world from the global PageRank and 2DRank ranking lists of persons ordered by the score Θ
<sub>
<italic>P</italic>
,
<italic>A</italic>
</sub>
of
<xref ref-type="disp-formula" rid="pone.0114825.e005">Eq.(3)</xref>
.</title>
<p>For a reader convenience the lists of all 100 ranked names for all 24 Wikipedia editions and corresponding network link data for each edition are also given at [
<xref rid="pone.0114825.ref039" ref-type="bibr">39</xref>
] in addition to Supporting Information file. All used computational data are publicly available at
<ext-link ext-link-type="uri" xlink:href="http://dumps.wikimedia.org/">http://dumps.wikimedia.org/</ext-link>
. All the raw data necessary to replicate the findings and conclusion of this study are within the paper, supporting information files and this Wikimedia web site.</p>
<p>(PDF)</p>
</caption>
<media xlink:href="pone.0114825.s001.pdf">
<caption>
<p>Click here for additional data file.</p>
</caption>
</media>
</supplementary-material>
</sec>
</body>
<back>
<ack>
<p>This research is supported in part by the EC FET Open project “New tools and algorithms for directed network analysis” (NADINE $No$ 288956)</p>
</ack>
<ref-list>
<title>References</title>
<ref id="pone.0114825.ref001">
<label>1</label>
<mixed-citation publication-type="journal">
<name>
<surname>Rosenzweig</surname>
<given-names>R</given-names>
</name>
(
<year>2006</year>
)
<article-title>Can history be open source?</article-title>
<source>Wikipedia and the future and the past, Journal of American History</source>
<volume>93</volume>
(
<issue>1</issue>
):
<fpage>117</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.2307/4486062">10.2307/4486062</ext-link>
</comment>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref002">
<label>2</label>
<mixed-citation publication-type="journal">
<name>
<surname>Lavsa</surname>
<given-names>SM</given-names>
</name>
,
<name>
<surname>Corman</surname>
<given-names>SL</given-names>
</name>
,
<name>
<surname>Culley</surname>
<given-names>CM</given-names>
</name>
,
<name>
<surname>Pummer</surname>
<given-names>TL</given-names>
</name>
(
<year>2011</year>
)
<article-title>Reliability of Wikipedia as a medication information source for pharmacy students</article-title>
,
<source>Currents in Pharmacy Teaching and Learning</source>
<volume>3</volume>
(
<issue>2</issue>
):
<fpage>154</fpage>
<lpage>158</lpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.1016/j.cptl.2011.01.007">10.1016/j.cptl.2011.01.007</ext-link>
</comment>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref003">
<label>3</label>
<mixed-citation publication-type="journal">
<name>
<surname>Giles</surname>
<given-names>J</given-names>
</name>
(
<year>2005</year>
)
<article-title>Internet encylopedia go head to head</article-title>
,
<source>Nature</source>
,
<volume>438</volume>
:
<fpage>900</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.1038/438900a">10.1038/438900a</ext-link>
</comment>
<pub-id pub-id-type="pmid">16355180</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref004">
<label>4</label>
<mixed-citation publication-type="book">
<name>
<surname>Kittur</surname>
<given-names>A</given-names>
</name>
,
<name>
<surname>Chi</surname>
<given-names>EH</given-names>
</name>
,
<name>
<surname>Suh</surname>
<given-names>B</given-names>
</name>
(
<year>2009</year>
)
<chapter-title>What’s in Wikipedia?: mapping topics and conflict using socially annotated category structure</chapter-title>
, In
<source>Proc. of SIGCHI Conference on Human Factors in Computing Systems, CHI’09</source>
,
<publisher-name>ACM</publisher-name>
,
<publisher-loc>New York</publisher-loc>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref005">
<label>5</label>
<mixed-citation publication-type="book">
<name>
<surname>Priedhorsky</surname>
<given-names>R</given-names>
</name>
,
<name>
<surname>Chen</surname>
<given-names>J</given-names>
</name>
,
<name>
<surname>Lam</surname>
<given-names>STK</given-names>
</name>
,
<name>
<surname>Panciera</surname>
<given-names>K</given-names>
</name>
,
<name>
<surname>Terveen</surname>
<given-names>L</given-names>
</name>
<etal>et al</etal>
(
<year>2007</year>
).
<chapter-title>Creating, Destroying, and Restoring Value in Wikipedia</chapter-title>
, In
<source>Proceedings of the Intl. Conf. on Supporting Group Work</source>
,
<volume>295</volume>
,
<publisher-name>ACM</publisher-name>
,
<publisher-loc>New York</publisher-loc>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref006">
<label>6</label>
<mixed-citation publication-type="journal">
<name>
<surname>Yasseri</surname>
<given-names>T</given-names>
</name>
,
<name>
<surname>Sumi</surname>
<given-names>R</given-names>
</name>
,
<name>
<surname>Rung</surname>
<given-names>A</given-names>
</name>
,
<name>
<surname>Kornai</surname>
<given-names>A</given-names>
</name>
,
<name>
<surname>Kertész</surname>
<given-names>J</given-names>
</name>
(
<year>2012</year>
)
<article-title>Dynamics of Conflicts in Wikipedia</article-title>
,
<source>PLoS ONE</source>
<volume>7</volume>
(
<issue>6</issue>
):
<fpage>e38869</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.1371/journal.pone.0038869">10.1371/journal.pone.0038869</ext-link>
</comment>
<pub-id pub-id-type="pmid">22745683</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref007">
<label>7</label>
<mixed-citation publication-type="other">Yasseri T, Spoerri A, Graham M, Kertész J (2013) The most controversial topics in Wikipedia: a multilingual and geographical analysis arXiv:1305.5566 [physics.soc-ph]</mixed-citation>
</ref>
<ref id="pone.0114825.ref008">
<label>8</label>
<mixed-citation publication-type="journal">
<name>
<surname>Laniado</surname>
<given-names>D</given-names>
</name>
,
<name>
<surname>Tasso</surname>
<given-names>R</given-names>
</name>
,
<name>
<surname>Volkovich Y. Kaltenbrunner</surname>
<given-names>A</given-names>
</name>
(
<year>2011</year>
)
<article-title>When the wikipedians talk: Network and tree structure of Wikipedia discussion pages</article-title>
,
<source>Proc. ICWSM</source>
<volume>2011</volume>
:
<fpage>177</fpage>
<lpage>184</lpage>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref009">
<label>9</label>
<mixed-citation publication-type="other">UNESCO World Report (2009) Investing in cultural diversity and intercultural dialogue, Available:
<ext-link ext-link-type="uri" xlink:href="http://www.unesco.org/new/en/culture/resources/report/the-unesco-world-report-on-cultural-diversity">http://www.unesco.org/new/en/culture/resources/report/the-unesco-world-report-on-cultural-diversity</ext-link>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref010">
<label>10</label>
<mixed-citation publication-type="other">Wikipedia: Neutral point of view. Retrived May 12, 2014 from
<ext-link ext-link-type="uri" xlink:href="http://en.wikipedia.org/wiki/Wikipedia:Neutral_point_of_view">http://en.wikipedia.org/wiki/Wikipedia:Neutral_point_of_view</ext-link>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref011">
<label>11</label>
<mixed-citation publication-type="journal">
<name>
<surname>Pfeil</surname>
<given-names>U</given-names>
</name>
,
<name>
<surname>Zaphiris</surname>
<given-names>P</given-names>
</name>
,
<name>
<surname>Ang C</surname>
<given-names>A</given-names>
</name>
, (
<year>2006</year>
)
<article-title>Cultural Differences in Collaborative Authoring of Wikipedia</article-title>
,
<source>J. Computer-Mediated Comm.</source>
<volume>12</volume>
(
<issue>1</issue>
):
<fpage>88</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.1111/j.1083-6101.2006.00316.x">10.1111/j.1083-6101.2006.00316.x</ext-link>
</comment>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref012">
<label>12</label>
<mixed-citation publication-type="journal">
<name>
<surname>Callahan</surname>
<given-names>ES</given-names>
</name>
,
<name>
<surname>Herriing</surname>
<given-names>SC</given-names>
</name>
(
<year>2011</year>
)
<article-title>Cultural bias in Wikipedia content on famous persons</article-title>
,
<source>Journal of the American society for information science and technology</source>
<volume>62</volume>
:
<fpage>1899</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.1002/asi.21577">10.1002/asi.21577</ext-link>
</comment>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref013">
<label>13</label>
<mixed-citation publication-type="book">
<name>
<surname>Hecht</surname>
<given-names>B</given-names>
</name>
,
<name>
<surname>Gergle</surname>
<given-names>D</given-names>
</name>
(
<year>2009</year>
)
<chapter-title>Measuring self-focus bias in community-maintained knowledge repositories</chapter-title>
,
<source>Proc. of the Fourth Intl Conf. Communities and technologies</source>
,
<publisher-name>ACM</publisher-name>
,
<publisher-loc>New York</publisher-loc>
<volume>2009</volume>
:
<fpage>11</fpage>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref014">
<label>14</label>
<mixed-citation publication-type="book">
<name>
<surname>Hecht</surname>
<given-names>B</given-names>
</name>
,
<name>
<surname>Gergle</surname>
<given-names>D</given-names>
</name>
(
<year>2010</year>
)
<chapter-title>The Tower of Babel Meets Web 2.0: User-Generated Content and Its Applications in a Multilingual Context</chapter-title>
,
<source>Proc. of SIGCHI Conference on Human Factors in Computing Systems, CHI’10</source>
,
<publisher-name>Atlanta, ACM</publisher-name>
,
<publisher-loc>New York</publisher-loc>
<fpage>291</fpage>
<lpage>300</lpage>
p</mixed-citation>
</ref>
<ref id="pone.0114825.ref015">
<label>15</label>
<mixed-citation publication-type="journal">
<name>
<surname>Nemoto</surname>
<given-names>K</given-names>
</name>
,
<name>
<surname>Gloor</surname>
<given-names>PA</given-names>
</name>
(
<year>2011</year>
)
<article-title>Analyzing cultural differences in collaborative innovation networks by analyzing editing behavior in different-language Wikipedias</article-title>
,
<source>Procedia—Social and Behavioral Sciences</source>
<volume>26</volume>
:
<fpage>180</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.1016/j.sbspro.2011.10.574">10.1016/j.sbspro.2011.10.574</ext-link>
</comment>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref016">
<label>16</label>
<mixed-citation publication-type="book">
<name>
<surname>Warncke-Wang</surname>
<given-names>M</given-names>
</name>
,
<name>
<surname>Uduwage</surname>
<given-names>A</given-names>
</name>
,
<name>
<surname>Dong</surname>
<given-names>Z</given-names>
</name>
,
<name>
<surname>Riedl</surname>
<given-names>J</given-names>
</name>
(
<year>2012</year>
)
<chapter-title>In search of the ur-Wikipedia: universality, similarity, and translation in the Wikipedia inter-language link network</chapter-title>
,
<source>Proceedings of the 8th Intl. Symposium on Wikis and Open Collaboration (WikiSym 2012)</source>
,
<publisher-name>ACM</publisher-name>
,
<publisher-loc>New York</publisher-loc>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref017">
<label>17</label>
<mixed-citation publication-type="book">
<name>
<surname>Massa</surname>
<given-names>P</given-names>
</name>
,
<name>
<surname>Scrinzi</surname>
<given-names>F</given-names>
</name>
(
<year>2012</year>
)
<chapter-title>Manypedia: Comparing language points of view of Wikipedia communities</chapter-title>
,
<source>Proceedings of the 8th Intl. Symposium on Wikis and Open Collaboration (WikiSym 2012)</source>
,
<publisher-name>ACM</publisher-name>
,
<publisher-loc>New York</publisher-loc>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref018">
<label>18</label>
<mixed-citation publication-type="journal">
<name>
<surname>Zhirov</surname>
<given-names>AO</given-names>
</name>
,
<name>
<surname>Zhirov</surname>
<given-names>OV</given-names>
</name>
,
<name>
<surname>Shepelyansky</surname>
<given-names>DL</given-names>
</name>
(
<year>2010</year>
)
<article-title>Two-dimensional ranking of Wikipedia articles</article-title>
,
<source>Eur. Phys. J. B</source>
<volume>77</volume>
:
<fpage>523</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.1140/epjb/e2010-10500-7">10.1140/epjb/e2010-10500-7</ext-link>
</comment>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref019">
<label>19</label>
<mixed-citation publication-type="journal">
<name>
<surname>Eom</surname>
<given-names>YH</given-names>
</name>
,
<name>
<surname>Frahm</surname>
<given-names>KM</given-names>
</name>
,
<name>
<surname>Bencźur</surname>
<given-names>A</given-names>
</name>
,
<name>
<surname>Shepelyansky</surname>
<given-names>DL</given-names>
</name>
(
<year>2013</year>
)
<article-title>Time evolution of Wikipedia network ranking</article-title>
,
<source>Eur. Phys. J. B</source>
,
<volume>86</volume>
:
<fpage>482</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.1140/epjb/e2013-40432-5">10.1140/epjb/e2013-40432-5</ext-link>
</comment>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref020">
<label>20</label>
<mixed-citation publication-type="book">
<name>
<surname>Aragón</surname>
<given-names>P</given-names>
</name>
,
<name>
<surname>Laniado</surname>
<given-names>D</given-names>
</name>
,
<name>
<surname>Kaltenbrunner</surname>
<given-names>A</given-names>
</name>
,
<name>
<surname>Volkovich</surname>
<given-names>Y</given-names>
</name>
(
<year>2012</year>
)
<chapter-title>Biographical social networks on Wikipedia: a cross-cultural study of links that made history</chapter-title>
,
<source>Proc. of the 8th Intl. Symposium on Wikis and Open Collaboration (WikiSym 2012)</source>
,
<publisher-name>ACM</publisher-name>
,
<publisher-loc>New York</publisher-loc>
<comment>No</comment>
<fpage>19</fpage>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref021">
<label>21</label>
<mixed-citation publication-type="journal">
<name>
<surname>Eom</surname>
<given-names>YH</given-names>
</name>
,
<name>
<surname>Shepelyansky</surname>
<given-names>DL</given-names>
</name>
(
<year>2013</year>
)
<article-title>Highlighting Entanglement of Cultures via Ranking of Multilingual Wikipedia Articles</article-title>
,
<source>PLoS ONE</source>
,
<volume>8</volume>
(
<issue>10</issue>
):
<fpage>e74554</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.1371/journal.pone.0074554">10.1371/journal.pone.0074554</ext-link>
</comment>
<pub-id pub-id-type="pmid">24098338</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref022">
<label>22</label>
<mixed-citation publication-type="book">
<name>
<surname>Skiena</surname>
<given-names>S</given-names>
</name>
,
<name>
<surname>Ward</surname>
<given-names>CB</given-names>
</name>
(
<year>2013</year>
)
<source>Who is Bigger?: Where Historical Figures Really Rank</source>
,
<publisher-name>Cambridge University Press</publisher-name>
,
<publisher-loc>Cambridge UK</publisher-loc>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref023">
<label>23</label>
<mixed-citation publication-type="other">MIT Pantheon project. Available:
<ext-link ext-link-type="uri" xlink:href="http://pantheon.media.mit.edu">http://pantheon.media.mit.edu</ext-link>
. Accessed 2014 May 12.</mixed-citation>
</ref>
<ref id="pone.0114825.ref024">
<label>24</label>
<mixed-citation publication-type="journal">
<name>
<surname>Samoilenko</surname>
<given-names>A</given-names>
</name>
,
<name>
<surname>Yasseri</surname>
<given-names>T</given-names>
</name>
(
<year>2014</year>
)
<article-title>The distorted mirror of Wikipedia: a quantitative analysis of Wikipedia coverage of academics</article-title>
,
<source>EPJ Data Sci.</source>
<volume>3</volume>
:
<fpage>1</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.1140/epjds20">10.1140/epjds20</ext-link>
</comment>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref025">
<label>25</label>
<mixed-citation publication-type="other">Wikipedia: List of languages by number of native speakers. Retrived May 12, 2014 from
<ext-link ext-link-type="uri" xlink:href="http://en.wikipedia.org/wiki/List_of_languages_by_number_of_native_speakers">http://en.wikipedia.org/wiki/List_of_languages_by_number_of_native_speakers</ext-link>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref026">
<label>26</label>
<mixed-citation publication-type="other">Wikipedia: Wikipedia. Retrived May 12, 2014 from
<ext-link ext-link-type="uri" xlink:href="http://en.wikipedia.org/wiki/Wikipedia">http://en.wikipedia.org/wiki/Wikipedia</ext-link>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref027">
<label>27</label>
<mixed-citation publication-type="book">
<name>
<surname>Hart</surname>
<given-names>MH</given-names>
</name>
(
<year>1992</year>
)
<source>The 100: ranking of the most influential persons in history</source>
,
<publisher-name>Citadel Press</publisher-name>
,
<publisher-loc>N.Y</publisher-loc>
.</mixed-citation>
</ref>
<ref id="pone.0114825.ref028">
<label>28</label>
<mixed-citation publication-type="journal">
<name>
<surname>Ermann</surname>
<given-names>L</given-names>
</name>
,
<name>
<surname>Chepelianskii</surname>
<given-names>AD</given-names>
</name>
,
<name>
<surname>Shepelyansky</surname>
<given-names>DL</given-names>
</name>
(
<year>2012</year>
)
<article-title>Toward two-dimensional search engines</article-title>
,
<source>J. Phys. A: Math. Theor.</source>
<volume>45</volume>
:
<fpage>275101</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.1088/1751-8113/45/27/275101">10.1088/1751-8113/45/27/275101</ext-link>
</comment>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref029">
<label>29</label>
<mixed-citation publication-type="journal">
<name>
<surname>Ermann</surname>
<given-names>L</given-names>
</name>
,
<name>
<surname>Frahm</surname>
<given-names>KM</given-names>
</name>
,
<name>
<surname>Shepelyansky</surname>
<given-names>DL</given-names>
</name>
(
<year>2013</year>
)
<article-title>Spectral properties of Google matrix of Wikipedia and other networks</article-title>
,
<source>Eur. Phys. J. D</source>
<volume>86</volume>
:
<fpage>193</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.1140/epjb/e2013-31090-8">10.1140/epjb/e2013-31090-8</ext-link>
</comment>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref030">
<label>30</label>
<mixed-citation publication-type="book">
<name>
<surname>Langville</surname>
<given-names>AM</given-names>
</name>
,
<name>
<surname>Meyer</surname>
<given-names>CD</given-names>
</name>
(
<year>2006</year>
)
<source>Google’s PageRank and Beyond: The Science of Search Engine Rankings</source>
,
<publisher-name>Princeton University Press</publisher-name>
,
<publisher-loc>Princeton</publisher-loc>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref031">
<label>31</label>
<mixed-citation publication-type="journal">
<name>
<surname>Brin</surname>
<given-names>S</given-names>
</name>
,
<name>
<surname>Page</surname>
<given-names>L</given-names>
</name>
(
<year>1998</year>
)
<article-title>The anatomy of a large-scale hypertextual Web search engine</article-title>
,
<source>Computer Networks and ISDN Systems</source>
<volume>30</volume>
:
<fpage>107</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.1016/S0169-7552(98)00110-X">10.1016/S0169-7552(98)00110-X</ext-link>
</comment>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref032">
<label>32</label>
<mixed-citation publication-type="journal">
<name>
<surname>Chen</surname>
<given-names>P</given-names>
</name>
,
<name>
<surname>Xie</surname>
<given-names>H</given-names>
</name>
,
<name>
<surname>Maslov</surname>
<given-names>S</given-names>
</name>
,
<name>
<surname>Redner</surname>
<given-names>S</given-names>
</name>
(
<year>2007</year>
)
<article-title>Finding scientific gems with Googleś PageRank algorithm</article-title>
,
<source>Jour. Informetrics</source>
,
<volume>1</volume>
:
<fpage>8</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.1016/j.joi.2006.06.001">10.1016/j.joi.2006.06.001</ext-link>
</comment>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref033">
<label>33</label>
<mixed-citation publication-type="book">
<name>
<surname>Kwak</surname>
<given-names>H</given-names>
</name>
,
<name>
<surname>Lee</surname>
<given-names>C</given-names>
</name>
,
<name>
<surname>Park</surname>
<given-names>H</given-names>
</name>
,
<name>
<surname>Moon</surname>
<given-names>S</given-names>
</name>
(
<year>2010</year>
)
<chapter-title>What is Twitter, a social network or a news media?</chapter-title>
,
<source>Proc. 19th Int. Conf. WWW2010</source>
,
<publisher-name>ACM</publisher-name>
,
<publisher-loc>New York</publisher-loc>
<fpage>591</fpage>
p</mixed-citation>
</ref>
<ref id="pone.0114825.ref034">
<label>34</label>
<mixed-citation publication-type="journal">
<name>
<surname>Ermann</surname>
<given-names>L</given-names>
</name>
,
<name>
<surname>Shepelyansky</surname>
<given-names>DL</given-names>
</name>
(
<year>2011</year>
)
<article-title>Google matrix of the world trade network</article-title>
,
<source>Acta Physica Polonica A</source>
<volume>120</volume>
(
<issue>6A</issue>
),
<fpage>A158</fpage>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref035">
<label>35</label>
<mixed-citation publication-type="journal">
<name>
<surname>Kandiah</surname>
<given-names>V</given-names>
</name>
,
<name>
<surname>Shepelyansky</surname>
<given-names>DL</given-names>
</name>
(
<year>2013</year>
)
<article-title>Google matrix analysis of DNA sequences</article-title>
,
<source>PLoS ONE</source>
<volume>8</volume>
(
<issue>5</issue>
):
<fpage>e61519</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.1371/journal.pone.0061519">10.1371/journal.pone.0061519</ext-link>
</comment>
<pub-id pub-id-type="pmid">23671568</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref036">
<label>36</label>
<mixed-citation publication-type="other">Chepelianskii AD (2010) Towards physical laws for software architecture, arXiv:1003.5455 [cs.SE]</mixed-citation>
</ref>
<ref id="pone.0114825.ref037">
<label>37</label>
<mixed-citation publication-type="journal">
<name>
<surname></surname>
<given-names>L</given-names>
</name>
,
<name>
<surname>Zhang</surname>
<given-names>Y-C</given-names>
</name>
,
<name>
<surname>Yeung</surname>
<given-names>CH</given-names>
</name>
,
<name>
<surname>Zhou</surname>
<given-names>T</given-names>
</name>
(
<year>2011</year>
)
<article-title>Leaders in social networks, the delicious case</article-title>
,
<source>PLoS ONE</source>
<volume>6</volume>
(
<issue>6</issue>
):
<fpage>e21202</fpage>
<comment>doi:
<ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.1371/journal.pone.0021202">10.1371/journal.pone.0021202</ext-link>
</comment>
<pub-id pub-id-type="pmid">21738620</pub-id>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref038">
<label>38</label>
<mixed-citation publication-type="other">
<ext-link ext-link-type="uri" xlink:href="http://dbpedia.org">http://dbpedia.org</ext-link>
. Accessed 2014 May 12</mixed-citation>
</ref>
<ref id="pone.0114825.ref039">
<label>39</label>
<mixed-citation publication-type="other">Top wikipeople. Available:
<ext-link ext-link-type="uri" xlink:href="http://www.quantware.ups-tlse.fr/QWLIB/topwikipeople/">http://www.quantware.ups-tlse.fr/QWLIB/topwikipeople/</ext-link>
. Accessed 2014 May 12.</mixed-citation>
</ref>
<ref id="pone.0114825.ref040">
<label>40</label>
<mixed-citation publication-type="other">United States Census Bureau. Retrieved May 12, 2014 from
<ext-link ext-link-type="uri" xlink:href="http://www.census.gov/population/international/data/worldpop/table_history.php">http://www.census.gov/population/international/data/worldpop/table_history.php</ext-link>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref041">
<label>41</label>
<mixed-citation publication-type="other">Wikipedia: Wikipedians. Retrived May 12, 2014 from
<ext-link ext-link-type="uri" xlink:href="http://en.wikipedia.org/wiki/Wikipedia:Wikipedians">http://en.wikipedia.org/wiki/Wikipedia:Wikipedians</ext-link>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref042">
<label>42</label>
<mixed-citation publication-type="other">Statistics and indicators on women and men by United Nation.
<ext-link ext-link-type="uri" xlink:href="http://unstats.un.org/unsd/Demographic/products/indwm/">http://unstats.un.org/unsd/Demographic/products/indwm/</ext-link>
(accesible May 12, 2014)</mixed-citation>
</ref>
<ref id="pone.0114825.ref043">
<label>43</label>
<mixed-citation publication-type="other">Lam STK, Uduwage A, Dong Z, Sen S (2011) WP:clubhouse?: an exploration of Wikipedia’s gender imbalance, Proc. of the 7th Intl. Symposium on Wikis and Open Collaboration, WikiSym’11, Moutain View 1–10p</mixed-citation>
</ref>
<ref id="pone.0114825.ref044">
<label>44</label>
<mixed-citation publication-type="book">
<name>
<surname>Carr</surname>
<given-names>EH</given-names>
</name>
(
<year>1961</year>
)
<source>What is History?</source>
,
<publisher-name>Vintage Books</publisher-name>
,
<publisher-loc>New York</publisher-loc>
</mixed-citation>
</ref>
<ref id="pone.0114825.ref045">
<label>45</label>
<mixed-citation publication-type="book">
<name>
<surname>Hale</surname>
<given-names>SA</given-names>
</name>
(
<year>2014</year>
)
<chapter-title>Multilinguals and Wikipedia editing</chapter-title>
,
<source>Proc. 6th Annual ACM Web Science Conf.</source>
<publisher-name>ACM</publisher-name>
<publisher-loc>New York</publisher-loc>
<volume>1</volume>
,
<fpage>99</fpage>
</mixed-citation>
</ref>
</ref-list>
</back>
</pmc>
<affiliations>
<list>
<country>
<li>Espagne</li>
<li>France</li>
<li>Italie</li>
</country>
<region>
<li>Catalogne</li>
<li>Languedoc-Roussillon-Midi-Pyrénées</li>
<li>Midi-Pyrénées</li>
</region>
<settlement>
<li>Barcelone</li>
<li>Toulouse</li>
</settlement>
</list>
<tree>
<country name="France">
<region name="Languedoc-Roussillon-Midi-Pyrénées">
<name sortKey="Eom, Young Ho" sort="Eom, Young Ho" uniqKey="Eom Y" first="Young-Ho" last="Eom">Young-Ho Eom</name>
</region>
<name sortKey="Shepelyansky, Dima L" sort="Shepelyansky, Dima L" uniqKey="Shepelyansky D" first="Dima L." last="Shepelyansky">Dima L. Shepelyansky</name>
</country>
<country name="Espagne">
<region name="Catalogne">
<name sortKey="Arag N, Pablo" sort="Arag N, Pablo" uniqKey="Arag N P" first="Pablo" last="Arag N">Pablo Arag N</name>
</region>
<name sortKey="Kaltenbrunner, Andreas" sort="Kaltenbrunner, Andreas" uniqKey="Kaltenbrunner A" first="Andreas" last="Kaltenbrunner">Andreas Kaltenbrunner</name>
<name sortKey="Laniado, David" sort="Laniado, David" uniqKey="Laniado D" first="David" last="Laniado">David Laniado</name>
</country>
<country name="Italie">
<noRegion>
<name sortKey="Vigna, Sebastiano" sort="Vigna, Sebastiano" uniqKey="Vigna S" first="Sebastiano" last="Vigna">Sebastiano Vigna</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Musique/explor/MozartV1/Data/Ncbi/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000792 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Ncbi/Merge/biblio.hfd -nk 000792 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Musique
   |area=    MozartV1
   |flux=    Ncbi
   |étape=   Merge
   |type=    RBID
   |clé=     PMC:4349893
   |texte=   Interactions of Cultures and Top People of Wikipedia from Ranking of 24 Language Editions
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Ncbi/Merge/RBID.i   -Sk "pubmed:25738291" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Ncbi/Merge/biblio.hfd   \
       | NlmPubMed2Wicri -a MozartV1 

Wicri

This area was generated with Dilib version V0.6.20.
Data generation: Sun Apr 10 15:06:14 2016. Site generation: Tue Feb 7 15:40:35 2023