Extracting scientific articles from a large digital archive: BioStor and the Biodiversity Heritage Library
Identifieur interne : 000149 ( Pmc/Corpus ); précédent : 000148; suivant : 000150Extracting scientific articles from a large digital archive: BioStor and the Biodiversity Heritage Library
Auteurs : Roderic Dm PageSource :
- BMC Bioinformatics [ 1471-2105 ] ; 2011.
Abstract
The Biodiversity Heritage Library (BHL) is a large digital archive of legacy biological literature, comprising over 31 million pages scanned from books, monographs, and journals. During the digitisation process basic metadata about the scanned items is recorded, but not article-level metadata. Given that the article is the standard unit of citation, this makes it difficult to locate cited literature in BHL. Adding the ability to easily find articles in BHL would greatly enhance the value of the archive.
A service was developed to locate articles in BHL based on matching article metadata to BHL metadata using approximate string matching, regular expressions, and string alignment. This article locating service is exposed as a standard OpenURL resolver on the BioStor web site
BioStor provides tools for extracting, annotating, and visualising articles from the Biodiversity Heritage Library. BioStor is available from
Url:
DOI: 10.1186/1471-2105-12-187
PubMed: 21605356
PubMed Central: 3129327
Links to Exploration step
PMC:3129327***** Acces problem to record *****\Le document en format XML
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Pmc/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000149 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd -nk 000149 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Pmc |étape= Corpus |type= RBID |clé= PMC:3129327 |texte= Extracting scientific articles from a large digital archive: BioStor and the Biodiversity Heritage Library }}
Pour générer des pages wiki
HfdIndexSelect -h $EXPLOR_AREA/Data/Pmc/Corpus/RBID.i -Sk "pubmed:21605356" \ | HfdSelect -Kh $EXPLOR_AREA/Data/Pmc/Corpus/biblio.hfd \ | NlmPubMed2Wicri -a OcrV1
This area was generated with Dilib version V0.6.32. |