Quantitative analysis of culture using millions of digitized books
Identifieur interne : 000095 ( Pmc/Corpus ); précédent : 000094; suivant : 000096Quantitative analysis of culture using millions of digitized books
Auteurs : Jean-Baptiste Michel ; Yuan Kui Shen ; Aviva P. Aiden ; Adrian Veres ; Matthew K. Gray ; Joseph P. Pickett ; Dale Hoiberg ; Dan Clancy ; Peter Norvig ; Jon Orwant ; Steven Pinker ; Martin A. Nowak ; Erez Lieberman AidenSource :
- Science (New York, N.y.) [ 0036-8075 ] ; 2010.
Abstract
We constructed a corpus of digitized texts containing about 4% of all books ever printed. Analysis of this corpus enables us to investigate cultural trends quantitatively. We survey the vast terrain of ‘culturomics’, focusing on linguistic and cultural phenomena that were reflected in the English language between 1800 and 2000. We show how this approach can provide insights about fields as diverse as lexicography, the evolution of grammar, collective memory, the adoption of technology, the pursuit of fame, censorship, and historical epidemiology. ‘Culturomics’ extends the boundaries of rigorous quantitative inquiry to a wide array of new phenomena spanning the social sciences and the humanities.
Url:
DOI: 10.1126/science.1199644
PubMed: 21163965
PubMed Central: 3279742
Links to Exploration step
PMC:3279742***** Acces problem to record *****\Le document en format XML
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Pmc/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000095 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd -nk 000095 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Pmc |étape= Corpus |type= RBID |clé= PMC:3279742 |texte= Quantitative analysis of culture using millions of digitized books }}
Pour générer des pages wiki
HfdIndexSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/RBID.i -Sk "pubmed:21163965" \ | HfdSelect -Kh $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd \ | NlmPubMed2Wicri -a OcrV1
This area was generated with Dilib version V0.6.32. |