Zitate per Mausklick? Das Textkorpus zum WORTERBUCH DER BAIRISCHEN MUNDARTEN IN OSTERREICH (WBO) als leistungsstarkes Werkzeug für die lexikographische Praxis
Identifieur interne : 000551 ( PascalFrancis/Curation ); précédent : 000550; suivant : 000552Zitate per Mausklick? Das Textkorpus zum WORTERBUCH DER BAIRISCHEN MUNDARTEN IN OSTERREICH (WBO) als leistungsstarkes Werkzeug für die lexikographische Praxis
Auteurs : Eveline Wandl-Vogt [Autriche]Source :
- Literary and linguistic computing [ 0268-1145 ] ; 2008.
Descripteurs français
- Pascal (Inist)
English descriptors
Abstract
The DATABASE OF ELECTRONIC TEXTS of the LEXICON OF BAVARIAN DIALECTS IN AUSTRIA (WBÖ) (TEXTKORPUS zum WBÖ)-The Lexicon of Bavarian Dialects in Austria (Wörterbuch der bairischen Mundarten in Österrreich [WBÖ]) is based on a collection of about 4 million single records. They represent the variety of the regional, social and historical Bavarian dialects. About 10% of all entries are excerpts from texts of various types. The lexicographer has to draw the illustrative quotations from the original texts. The digitized full-texts have been dated and localized, thus each quotation can be placed in time and area enabling the lexicogapher to choose a representative number of examples. For the definitions, the most appropriate and illustrative quotations have to be found, in order to actively support the lexicographers work, the Institute of Lexicography of Austrian Dialects and Names / Institut für Österreichische Dialekt- und Namenlexika (http://www.oeaw.ac.at/dinamlex) started a new project: the so-called DBÖ (Database of the Bavarian Dialects in Austria / Datenbank der bairischen Mundarten in Österreich), financed by the Österreichische Akademie der Wissenschaften / Austrian Academy of Sciences. One main database includes the so-called Hauptkatalog, the main archive; there are additional databases which are necessary to retrieve correct information about the questionnaire, the localization of the word and the date of its recording. One component of the DBÖ is the database of electronic texts of the WBÖ, including some 90 Austrian texts spanning several centuries, representing the Austrian dialects. The original texts are scanned using the Austrian OCR-programme proLector V1.20 (A.1) which allows the training of various fonts. The machine-readable texts are converted in TUSTEP (Tübinger System von Textverarbeitungs-Programmen / Tübingen System of Text Processing Programmes). The TUSTEP-files get an alphanumeric key, which allows one to retrieve each quotation from the database in a chronological order and to sort it according to its localization. Finally the texts are broken down into the requisite sized pieces for quoting in the WBÖ-entries. Using a special programme the quotations can easily be reconnected and replaced in the proper context from which they were drawn. The digital texts are a valuable source for the lexicographer freeing him from the monotony of checking over and over again, thus leaving him more time for his proper work, namely the writing of entries for the WBÖ. Furthermore, the digital texts are important for speeding up the dictionary's publication in accordance with the guidelines of the Straffungskon-: zept 1993 and 1998.
pA |
|
---|
Links toward previous steps (curation, corpus...)
- to stream PascalFrancis, to step Corpus: Pour aller vers cette notice dans l'étape Curation :000248
Links to Exploration step
Francis:09-0208472Le document en format XML
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="GER" level="a">Zitate per Mausklick? Das Textkorpus zum WORTERBUCH DER BAIRISCHEN MUNDARTEN IN OSTERREICH (WBO) als leistungsstarkes Werkzeug für die lexikographische Praxis</title>
<author><name sortKey="Wandl Vogt, Eveline" sort="Wandl Vogt, Eveline" uniqKey="Wandl Vogt E" first="Eveline" last="Wandl-Vogt">Eveline Wandl-Vogt</name>
<affiliation wicri:level="1"><inist:fA14 i1="01"><s1>Institut für Österreichische Dialekt-und Namenlexika, Österreichische Akademie der Wissenschaften, Wohllebengasse 12-14/2</s1>
<s2>1040 Wien</s2>
<s3>AUT</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Autriche</country>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">INIST</idno>
<idno type="inist">09-0208472</idno>
<date when="2008">2008</date>
<idno type="stanalyst">FRANCIS 09-0208472 INIST</idno>
<idno type="RBID">Francis:09-0208472</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000248</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000551</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="GER" level="a">Zitate per Mausklick? Das Textkorpus zum WORTERBUCH DER BAIRISCHEN MUNDARTEN IN OSTERREICH (WBO) als leistungsstarkes Werkzeug für die lexikographische Praxis</title>
<author><name sortKey="Wandl Vogt, Eveline" sort="Wandl Vogt, Eveline" uniqKey="Wandl Vogt E" first="Eveline" last="Wandl-Vogt">Eveline Wandl-Vogt</name>
<affiliation wicri:level="1"><inist:fA14 i1="01"><s1>Institut für Österreichische Dialekt-und Namenlexika, Österreichische Akademie der Wissenschaften, Wohllebengasse 12-14/2</s1>
<s2>1040 Wien</s2>
<s3>AUT</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Autriche</country>
</affiliation>
</author>
</analytic>
<series><title level="j" type="main">Literary and linguistic computing</title>
<title level="j" type="abbreviated">Lit. linguist. comput.</title>
<idno type="ISSN">0268-1145</idno>
<imprint><date when="2008">2008</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><title level="j" type="main">Literary and linguistic computing</title>
<title level="j" type="abbreviated">Lit. linguist. comput.</title>
<idno type="ISSN">0268-1145</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Computational lexicography</term>
<term>Dialect</term>
<term>Textual database</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Lexicographie informatique</term>
<term>Base de données textuelles</term>
<term>Dialecte</term>
<term>Bavarois-Autrichien</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">The DATABASE OF ELECTRONIC TEXTS of the LEXICON OF BAVARIAN DIALECTS IN AUSTRIA (WBÖ) (TEXTKORPUS zum WBÖ)-The Lexicon of Bavarian Dialects in Austria (Wörterbuch der bairischen Mundarten in Österrreich [WBÖ]) is based on a collection of about 4 million single records. They represent the variety of the regional, social and historical Bavarian dialects. About 10% of all entries are excerpts from texts of various types. The lexicographer has to draw the illustrative quotations from the original texts. The digitized full-texts have been dated and localized, thus each quotation can be placed in time and area enabling the lexicogapher to choose a representative number of examples. For the definitions, the most appropriate and illustrative quotations have to be found, in order to actively support the lexicographers work, the Institute of Lexicography of Austrian Dialects and Names / Institut für Österreichische Dialekt- und Namenlexika (http://www.oeaw.ac.at/dinamlex) started a new project: the so-called DBÖ (Database of the Bavarian Dialects in Austria / Datenbank der bairischen Mundarten in Österreich), financed by the Österreichische Akademie der Wissenschaften / Austrian Academy of Sciences. One main database includes the so-called Hauptkatalog, the main archive; there are additional databases which are necessary to retrieve correct information about the questionnaire, the localization of the word and the date of its recording. One component of the DBÖ is the database of electronic texts of the WBÖ, including some 90 Austrian texts spanning several centuries, representing the Austrian dialects. The original texts are scanned using the Austrian OCR-programme proLector V1.20 (A.1) which allows the training of various fonts. The machine-readable texts are converted in TUSTEP (Tübinger System von Textverarbeitungs-Programmen / Tübingen System of Text Processing Programmes). The TUSTEP-files get an alphanumeric key, which allows one to retrieve each quotation from the database in a chronological order and to sort it according to its localization. Finally the texts are broken down into the requisite sized pieces for quoting in the WBÖ-entries. Using a special programme the quotations can easily be reconnected and replaced in the proper context from which they were drawn. The digital texts are a valuable source for the lexicographer freeing him from the monotony of checking over and over again, thus leaving him more time for his proper work, namely the writing of entries for the WBÖ. Furthermore, the digital texts are important for speeding up the dictionary's publication in accordance with the guidelines of the Straffungskon-: zept 1993 and 1998.</div>
</front>
</TEI>
<inist><standard h6="B"><pA><fA01 i1="01" i2="1"><s0>0268-1145</s0>
</fA01>
<fA03 i2="1"><s0>Lit. linguist. comput.</s0>
</fA03>
<fA05><s2>23</s2>
</fA05>
<fA06><s2>2</s2>
</fA06>
<fA08 i1="01" i2="1" l="GER"><s1>Zitate per Mausklick? Das Textkorpus zum WORTERBUCH DER BAIRISCHEN MUNDARTEN IN OSTERREICH (WBO) als leistungsstarkes Werkzeug für die lexikographische Praxis</s1>
</fA08>
<fA11 i1="01" i2="1"><s1>WANDL-VOGT (Eveline)</s1>
</fA11>
<fA14 i1="01"><s1>Institut für Österreichische Dialekt-und Namenlexika, Österreichische Akademie der Wissenschaften, Wohllebengasse 12-14/2</s1>
<s2>1040 Wien</s2>
<s3>AUT</s3>
<sZ>1 aut.</sZ>
</fA14>
<fA20><s1>201-217</s1>
</fA20>
<fA21><s1>2008</s1>
</fA21>
<fA23 i1="01"><s0>GER</s0>
</fA23>
<fA24 i1="01"><s0>eng</s0>
</fA24>
<fA43 i1="01"><s1>INIST</s1>
<s2>23967</s2>
<s5>354000197945600050</s5>
</fA43>
<fA44><s0>0000</s0>
<s1>© 2009 INIST-CNRS. All rights reserved.</s1>
</fA44>
<fA45><s0>1 p.3/4</s0>
</fA45>
<fA47 i1="01" i2="1"><s0>09-0208472</s0>
</fA47>
<fA60><s1>P</s1>
</fA60>
<fA61><s0>A</s0>
</fA61>
<fA64 i1="01" i2="1"><s0>Literary and linguistic computing</s0>
</fA64>
<fA66 i1="01"><s0>GBR</s0>
</fA66>
<fA99><s0>33 notes</s0>
</fA99>
<fC01 i1="01" l="ENG"><s0>The DATABASE OF ELECTRONIC TEXTS of the LEXICON OF BAVARIAN DIALECTS IN AUSTRIA (WBÖ) (TEXTKORPUS zum WBÖ)-The Lexicon of Bavarian Dialects in Austria (Wörterbuch der bairischen Mundarten in Österrreich [WBÖ]) is based on a collection of about 4 million single records. They represent the variety of the regional, social and historical Bavarian dialects. About 10% of all entries are excerpts from texts of various types. The lexicographer has to draw the illustrative quotations from the original texts. The digitized full-texts have been dated and localized, thus each quotation can be placed in time and area enabling the lexicogapher to choose a representative number of examples. For the definitions, the most appropriate and illustrative quotations have to be found, in order to actively support the lexicographers work, the Institute of Lexicography of Austrian Dialects and Names / Institut für Österreichische Dialekt- und Namenlexika (http://www.oeaw.ac.at/dinamlex) started a new project: the so-called DBÖ (Database of the Bavarian Dialects in Austria / Datenbank der bairischen Mundarten in Österreich), financed by the Österreichische Akademie der Wissenschaften / Austrian Academy of Sciences. One main database includes the so-called Hauptkatalog, the main archive; there are additional databases which are necessary to retrieve correct information about the questionnaire, the localization of the word and the date of its recording. One component of the DBÖ is the database of electronic texts of the WBÖ, including some 90 Austrian texts spanning several centuries, representing the Austrian dialects. The original texts are scanned using the Austrian OCR-programme proLector V1.20 (A.1) which allows the training of various fonts. The machine-readable texts are converted in TUSTEP (Tübinger System von Textverarbeitungs-Programmen / Tübingen System of Text Processing Programmes). The TUSTEP-files get an alphanumeric key, which allows one to retrieve each quotation from the database in a chronological order and to sort it according to its localization. Finally the texts are broken down into the requisite sized pieces for quoting in the WBÖ-entries. Using a special programme the quotations can easily be reconnected and replaced in the proper context from which they were drawn. The digital texts are a valuable source for the lexicographer freeing him from the monotony of checking over and over again, thus leaving him more time for his proper work, namely the writing of entries for the WBÖ. Furthermore, the digital texts are important for speeding up the dictionary's publication in accordance with the guidelines of the Straffungskon-: zept 1993 and 1998.</s0>
</fC01>
<fC02 i1="01" i2="L"><s0>52474</s0>
<s1>XV</s1>
</fC02>
<fC02 i1="02" i2="L"><s0>524</s0>
</fC02>
<fC03 i1="01" i2="L" l="FRE"><s0>Lexicographie informatique</s0>
<s2>563</s2>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="L" l="ENG"><s0>Computational lexicography</s0>
<s2>563</s2>
<s5>01</s5>
</fC03>
<fC03 i1="02" i2="L" l="FRE"><s0>Base de données textuelles</s0>
<s2>NI</s2>
<s5>02</s5>
</fC03>
<fC03 i1="02" i2="L" l="ENG"><s0>Textual database</s0>
<s2>NI</s2>
<s5>02</s5>
</fC03>
<fC03 i1="03" i2="L" l="FRE"><s0>Dialecte</s0>
<s2>NI</s2>
<s5>03</s5>
</fC03>
<fC03 i1="03" i2="L" l="ENG"><s0>Dialect</s0>
<s2>NI</s2>
<s5>03</s5>
</fC03>
<fC03 i1="04" i2="L" l="FRE"><s0>Bavarois-Autrichien</s0>
<s2>NL</s2>
<s5>16</s5>
</fC03>
<fN21><s1>152</s1>
</fN21>
</pA>
</standard>
</inist>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/PascalFrancis/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000551 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Curation/biblio.hfd -nk 000551 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= PascalFrancis |étape= Curation |type= RBID |clé= Francis:09-0208472 |texte= Zitate per Mausklick? Das Textkorpus zum WORTERBUCH DER BAIRISCHEN MUNDARTEN IN OSTERREICH (WBO) als leistungsstarkes Werkzeug für die lexikographische Praxis }}
This area was generated with Dilib version V0.6.32. |