Increasing the efficiency of digitization workflows for herbarium specimens
Internal identifier: 000260 (Main/Curation); previous: 000259; next: 000261
Authors: Melissa Tulig [United States]; Nicole Tarnowsky [United States]; Michael Bevans [United States]; Barbara M. Thiers [United States]
Source:
- ZooKeys [ 1313-2989 ] ; 2012.
Abstract
The New York Botanical Garden Herbarium has been databasing and imaging its estimated 7.3 million plant specimens for the past 17 years. Due to the size of the collection, we have been selectively digitizing fundable subsets of specimens, making successive passes through the herbarium with each new grant. With this strategy, the average rate for databasing complete records has been 10 specimens per hour. With 1.3 million specimens databased, this effort has taken about 130,000 hours of staff time. At this rate, to complete the herbarium and digitize the remaining 6 million specimens, another 600,000 hours would be needed. Given the current biodiversity and economic crises, there is neither the time nor money to complete the collection at this rate.
Through a combination of grants over the last few years, The New York Botanical Garden has been testing new protocols and tactics for increasing the rate of digitization through combinations of data collaboration, field book digitization, partial data entry and imaging, and optical character recognition (OCR) of specimen images. With the launch of the National Science Foundation’s new Advancing Digitization of Biological Collections program, we hope to move forward with larger, more efficient digitization projects, capturing data from larger portions of the herbarium at a fraction of the cost and time.
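The staff-time figures in the abstract follow from dividing specimen counts by the reported databasing rate. The short sketch below reproduces that arithmetic; the function and constant names are illustrative assumptions, and only the numbers come from the abstract.

```python
def staff_hours(specimens: int, rate_per_hour: int) -> int:
    """Hours of staff time needed to database `specimens` at `rate_per_hour`."""
    return specimens // rate_per_hour


# Figures quoted in the abstract.
TOTAL_SPECIMENS = 7_300_000   # estimated size of the herbarium
DATABASED = 1_300_000         # specimens databased so far
RATE = 10                     # complete records per hour

remaining = TOTAL_SPECIMENS - DATABASED

print(staff_hours(DATABASED, RATE))   # hours already spent: 130000
print(staff_hours(remaining, RATE))   # hours still needed: 600000
```

At the same rate, the remaining work is roughly 4.6 times the effort already invested, which is the gap the new workflows are meant to close.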
DOI: 10.3897/zookeys.209.3125
PubMed: 22859882
PubMed Central: 3406470
Links toward previous steps (curation, corpus...)
- to stream Pmc, to step Corpus: To go to this record in the Curation step: 000202
- to stream Pmc, to step Curation: To go to this record in the Curation step: 000202
- to stream Pmc, to step Checkpoint: To go to this record in the Curation step: 000104
- to stream PubMed, to step Corpus: To go to this record in the Curation step: 000027
- to stream PubMed, to step Curation: To go to this record in the Curation step: 000027
- to stream PubMed, to step Checkpoint: To go to this record in the Curation step: 000027
- to stream Ncbi, to step Merge: To go to this record in the Curation step: 000140
- to stream Ncbi, to step Curation: To go to this record in the Curation step: 000140
- to stream Ncbi, to step Checkpoint: To go to this record in the Curation step: 000140
- to stream Main, to step Merge: To go to this record in the Curation step: 000263
Links to Exploration step
PMC:3406470
The document in XML format
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">Increasing the efficiency of digitization workflows for herbarium specimens</title>
<author><name sortKey="Tulig, Melissa" sort="Tulig, Melissa" uniqKey="Tulig M" first="Melissa" last="Tulig">Melissa Tulig</name>
<affiliation wicri:level="2"><nlm:aff id="A1">William and Lynda Steere Herbarium, The New York Botanical Garden, Bronx, New York, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>William and Lynda Steere Herbarium, The New York Botanical Garden, Bronx, New York</wicri:regionArea>
<placeName><region type="state">État de New York</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Tarnowsky, Nicole" sort="Tarnowsky, Nicole" uniqKey="Tarnowsky N" first="Nicole" last="Tarnowsky">Nicole Tarnowsky</name>
<affiliation wicri:level="2"><nlm:aff id="A1">William and Lynda Steere Herbarium, The New York Botanical Garden, Bronx, New York, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>William and Lynda Steere Herbarium, The New York Botanical Garden, Bronx, New York</wicri:regionArea>
<placeName><region type="state">État de New York</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Bevans, Michael" sort="Bevans, Michael" uniqKey="Bevans M" first="Michael" last="Bevans">Michael Bevans</name>
<affiliation wicri:level="2"><nlm:aff id="A1">William and Lynda Steere Herbarium, The New York Botanical Garden, Bronx, New York, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>William and Lynda Steere Herbarium, The New York Botanical Garden, Bronx, New York</wicri:regionArea>
<placeName><region type="state">État de New York</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Thiers, Barbara M" sort="Thiers, Barbara M" uniqKey="Thiers " first=" Barbara M." last="Thiers"> Barbara M. Thiers</name>
<affiliation wicri:level="2"><nlm:aff id="A1">William and Lynda Steere Herbarium, The New York Botanical Garden, Bronx, New York, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>William and Lynda Steere Herbarium, The New York Botanical Garden, Bronx, New York</wicri:regionArea>
<placeName><region type="state">État de New York</region>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PMC</idno>
<idno type="pmid">22859882</idno>
<idno type="pmc">3406470</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3406470</idno>
<idno type="RBID">PMC:3406470</idno>
<idno type="doi">10.3897/zookeys.209.3125</idno>
<date when="2012">2012</date>
<idno type="wicri:Area/Pmc/Corpus">000202</idno>
<idno type="wicri:Area/Pmc/Curation">000202</idno>
<idno type="wicri:Area/Pmc/Checkpoint">000104</idno>
<idno type="wicri:source">PubMed</idno>
<idno type="wicri:Area/PubMed/Corpus">000027</idno>
<idno type="wicri:Area/PubMed/Curation">000027</idno>
<idno type="wicri:Area/PubMed/Checkpoint">000027</idno>
<idno type="wicri:Area/Ncbi/Merge">000140</idno>
<idno type="wicri:Area/Ncbi/Curation">000140</idno>
<idno type="wicri:Area/Ncbi/Checkpoint">000140</idno>
<idno type="wicri:doubleKey">1313-2989:2012:Tulig M:increasing:the:efficiency</idno>
<idno type="wicri:Area/Main/Merge">000263</idno>
<idno type="wicri:Area/Main/Curation">000260</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a" type="main">Increasing the efficiency of digitization workflows for herbarium specimens</title>
<author><name sortKey="Tulig, Melissa" sort="Tulig, Melissa" uniqKey="Tulig M" first="Melissa" last="Tulig">Melissa Tulig</name>
<affiliation wicri:level="2"><nlm:aff id="A1">William and Lynda Steere Herbarium, The New York Botanical Garden, Bronx, New York, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>William and Lynda Steere Herbarium, The New York Botanical Garden, Bronx, New York</wicri:regionArea>
<placeName><region type="state">État de New York</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Tarnowsky, Nicole" sort="Tarnowsky, Nicole" uniqKey="Tarnowsky N" first="Nicole" last="Tarnowsky">Nicole Tarnowsky</name>
<affiliation wicri:level="2"><nlm:aff id="A1">William and Lynda Steere Herbarium, The New York Botanical Garden, Bronx, New York, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>William and Lynda Steere Herbarium, The New York Botanical Garden, Bronx, New York</wicri:regionArea>
<placeName><region type="state">État de New York</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Bevans, Michael" sort="Bevans, Michael" uniqKey="Bevans M" first="Michael" last="Bevans">Michael Bevans</name>
<affiliation wicri:level="2"><nlm:aff id="A1">William and Lynda Steere Herbarium, The New York Botanical Garden, Bronx, New York, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>William and Lynda Steere Herbarium, The New York Botanical Garden, Bronx, New York</wicri:regionArea>
<placeName><region type="state">État de New York</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Thiers, Barbara M" sort="Thiers, Barbara M" uniqKey="Thiers " first=" Barbara M." last="Thiers"> Barbara M. Thiers</name>
<affiliation wicri:level="2"><nlm:aff id="A1">William and Lynda Steere Herbarium, The New York Botanical Garden, Bronx, New York, USA</nlm:aff>
<country xml:lang="fr">États-Unis</country>
<wicri:regionArea>William and Lynda Steere Herbarium, The New York Botanical Garden, Bronx, New York</wicri:regionArea>
<placeName><region type="state">État de New York</region>
</placeName>
</affiliation>
</author>
</analytic>
<series><title level="j">ZooKeys</title>
<idno type="ISSN">1313-2989</idno>
<idno type="eISSN">1313-2970</idno>
<imprint><date when="2012">2012</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass></textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en"><label>Abstract</label>
<p>The New York Botanical Garden Herbarium has been databasing and imaging its estimated 7.3 million plant specimens for the past 17 years. Due to the size of the collection, we have been selectively digitizing fundable subsets of specimens, making successive passes through the herbarium with each new grant. With this strategy, the average rate for databasing complete records has been 10 specimens per hour. With 1.3 million specimens databased, this effort has taken about 130,000 hours of staff time. At this rate, to complete the herbarium and digitize the remaining 6 million specimens, another 600,000 hours would be needed. Given the current biodiversity and economic crises, there is neither the time nor money to complete the collection at this rate.</p>
<p>Through a combination of grants over the last few years, The New York Botanical Garden has been testing new protocols and tactics for increasing the rate of digitization through combinations of data collaboration, field book digitization, partial data entry and imaging, and optical character recognition (OCR) of specimen images. With the launch of the National Science Foundation’s new Advancing Digitization of Biological Collections program, we hope to move forward with larger, more efficient digitization projects, capturing data from larger portions of the herbarium at a fraction of the cost and time.</p>
</div>
</front>
<back><div1 type="bibliography"><listBibl><biblStruct><analytic><author><name sortKey="Baird, R" uniqKey="Baird R">R Baird</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Chapman, Ad" uniqKey="Chapman A">AD Chapman</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Granzow De La Cerda, I" uniqKey="Granzow De La Cerda I">I Granzow-de la Cerda</name>
</author>
<author><name sortKey="Beach, Jh" uniqKey="Beach J">JH Beach</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Pyke, Gh" uniqKey="Pyke G">GH Pyke</name>
</author>
<author><name sortKey="Ehrlich, Pr" uniqKey="Ehrlich P">PR Ehrlich</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Rabeler, Rk" uniqKey="Rabeler R">RK Rabeler</name>
</author>
<author><name sortKey="Macklin, Ja" uniqKey="Macklin J">JA Macklin</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Scoble, Mj" uniqKey="Scoble M">MJ Scoble</name>
</author>
<author><name sortKey="T, Bourgoin" uniqKey="T B">Bourgoin T</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Specify Scatter Gather Reconcile, Sgr" uniqKey="Specify Scatter Gather Reconcile ">(SGR) Specify 6.4: Scatter Gather Reconcile</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Vollmar, A" uniqKey="Vollmar A">A Vollmar</name>
</author>
<author><name sortKey="Macklin, Ja" uniqKey="Macklin J">JA Macklin</name>
</author>
<author><name sortKey="Ford, Ls" uniqKey="Ford L">LS Ford</name>
</author>
</analytic>
</biblStruct>
<biblStruct><analytic><author><name sortKey="Wang, Z" uniqKey="Wang Z">Z Wang</name>
</author>
<author><name sortKey="Dong, H" uniqKey="Dong H">H Dong</name>
</author>
<author><name sortKey="Kelly, M" uniqKey="Kelly M">M Kelly</name>
</author>
<author><name sortKey="Macklin, Ja" uniqKey="Macklin J">JA Macklin</name>
</author>
<author><name sortKey="Morris, Pj" uniqKey="Morris P">PJ Morris</name>
</author>
<author><name sortKey="Morris, Ra" uniqKey="Morris R">RA Morris</name>
</author>
</analytic>
</biblStruct>
</listBibl>
</div1>
</back>
</TEI>
</record>
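As a rough illustration of how a record like the one above might be read outside Dilib, the following sketch extracts the title, authors, and identifiers with Python's standard library. It works on a trimmed copy of the record. The `wicri:` and `nlm:` prefixes are not declared in the record itself, so the namespace URIs added to the root element here are placeholders (an assumption), and `summarize` is a hypothetical helper, not part of any Wicri tool.

```python
import xml.etree.ElementTree as ET

# Trimmed copy of the record. The xmlns declarations use placeholder URIs
# (an assumption); without them the wicri: prefix would not parse.
RECORD = """<record xmlns:wicri="urn:example:wicri" xmlns:nlm="urn:example:nlm">
<TEI><teiHeader><fileDesc><titleStmt>
<title xml:lang="en">Increasing the efficiency of digitization workflows for herbarium specimens</title>
<author><name sortKey="Tulig, Melissa" first="Melissa" last="Tulig">Melissa Tulig</name>
<affiliation wicri:level="2"><country xml:lang="fr">États-Unis</country></affiliation>
</author>
<author><name sortKey="Thiers, Barbara M" first=" Barbara M." last="Thiers"> Barbara M. Thiers</name></author>
</titleStmt>
<publicationStmt>
<idno type="pmid">22859882</idno>
<idno type="doi">10.3897/zookeys.209.3125</idno>
<idno type="wicri:Area/Main/Curation">000260</idno>
</publicationStmt>
</fileDesc></teiHeader></TEI></record>"""


def summarize(record_xml: str) -> dict:
    """Return the title, author names, and idno values of a record."""
    root = ET.fromstring(record_xml)
    title_stmt = root.find(".//titleStmt")
    pub_stmt = root.find(".//publicationStmt")
    return {
        "title": title_stmt.findtext("title"),
        # Some names carry leading whitespace in the record, so strip them.
        "authors": [n.text.strip() for n in title_stmt.iterfind("author/name")],
        # Map each idno type (pmid, doi, wicri:Area/...) to its value.
        "ids": {i.get("type"): i.text for i in pub_stmt.iterfind("idno")},
    }


info = summarize(RECORD)
print(info["title"])
print(info["authors"])
print(info["ids"]["doi"])
```

The `idno` types such as `wicri:Area/Main/Curation` are plain attribute values, so they can be used directly as dictionary keys without any namespace handling.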
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000260 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/Main/Curation/biblio.hfd -nk 000260 | SxmlIndent | more
To link to this page within the Wicri network
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Curation |type= RBID |clé= PMC:3406470 |texte= Increasing the efficiency of digitization workflows for herbarium specimens }}
To generate wiki pages
HfdIndexSelect -h $EXPLOR_AREA/Data/Main/Curation/RBID.i -Sk "pubmed:22859882" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Main/Curation/biblio.hfd \
       | NlmPubMed2Wicri -a OcrV1
This area was generated with Dilib version V0.6.32.