Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Zitate per Mausklick? Das Textkorpus zum WORTERBUCH DER BAIRISCHEN MUNDARTEN IN OSTERREICH (WBO) als leistungsstarkes Werkzeug für die lexikographische Praxis

Identifieur interne : 000551 ( PascalFrancis/Curation ); précédent : 000550; suivant : 000552

Zitate per Mausklick? Das Textkorpus zum WORTERBUCH DER BAIRISCHEN MUNDARTEN IN OSTERREICH (WBO) als leistungsstarkes Werkzeug für die lexikographische Praxis

Auteurs : Eveline Wandl-Vogt [Autriche]

Source :

RBID : Francis:09-0208472

Descripteurs français

English descriptors

Abstract

The DATABASE OF ELECTRONIC TEXTS of the LEXICON OF BAVARIAN DIALECTS IN AUSTRIA (WBÖ) (TEXTKORPUS zum WBÖ)-The Lexicon of Bavarian Dialects in Austria (Wörterbuch der bairischen Mundarten in Österrreich [WBÖ]) is based on a collection of about 4 million single records. They represent the variety of the regional, social and historical Bavarian dialects. About 10% of all entries are excerpts from texts of various types. The lexicographer has to draw the illustrative quotations from the original texts. The digitized full-texts have been dated and localized, thus each quotation can be placed in time and area enabling the lexicogapher to choose a representative number of examples. For the definitions, the most appropriate and illustrative quotations have to be found, in order to actively support the lexicographers work, the Institute of Lexicography of Austrian Dialects and Names / Institut für Österreichische Dialekt- und Namenlexika (http://www.oeaw.ac.at/dinamlex) started a new project: the so-called DBÖ (Database of the Bavarian Dialects in Austria / Datenbank der bairischen Mundarten in Österreich), financed by the Österreichische Akademie der Wissenschaften / Austrian Academy of Sciences. One main database includes the so-called Hauptkatalog, the main archive; there are additional databases which are necessary to retrieve correct information about the questionnaire, the localization of the word and the date of its recording. One component of the DBÖ is the database of electronic texts of the WBÖ, including some 90 Austrian texts spanning several centuries, representing the Austrian dialects. The original texts are scanned using the Austrian OCR-programme proLector V1.20 (A.1) which allows the training of various fonts. The machine-readable texts are converted in TUSTEP (Tübinger System von Textverarbeitungs-Programmen / Tübingen System of Text Processing Programmes). The TUSTEP-files get an alphanumeric key, which allows one to retrieve each quotation from the database in a chronological order and to sort it according to its localization. Finally the texts are broken down into the requisite sized pieces for quoting in the WBÖ-entries. Using a special programme the quotations can easily be reconnected and replaced in the proper context from which they were drawn. The digital texts are a valuable source for the lexicographer freeing him from the monotony of checking over and over again, thus leaving him more time for his proper work, namely the writing of entries for the WBÖ. Furthermore, the digital texts are important for speeding up the dictionary's publication in accordance with the guidelines of the Straffungskon-: zept 1993 and 1998.
pA  
A01 01  1    @0 0268-1145
A03   1    @0 Lit. linguist. comput.
A05       @2 23
A06       @2 2
A08 01  1  GER  @1 Zitate per Mausklick? Das Textkorpus zum WORTERBUCH DER BAIRISCHEN MUNDARTEN IN OSTERREICH (WBO) als leistungsstarkes Werkzeug für die lexikographische Praxis
A11 01  1    @1 WANDL-VOGT (Eveline)
A14 01      @1 Institut für Österreichische Dialekt-und Namenlexika, Österreichische Akademie der Wissenschaften, Wohllebengasse 12-14/2 @2 1040 Wien @3 AUT @Z 1 aut.
A20       @1 201-217
A21       @1 2008
A23 01      @0 GER
A24 01      @0 eng
A43 01      @1 INIST @2 23967 @5 354000197945600050
A44       @0 0000 @1 © 2009 INIST-CNRS. All rights reserved.
A45       @0 1 p.3/4
A47 01  1    @0 09-0208472
A60       @1 P
A61       @0 A
A64 01  1    @0 Literary and linguistic computing
A66 01      @0 GBR
A99       @0 33 notes
C01 01    ENG  @0 The DATABASE OF ELECTRONIC TEXTS of the LEXICON OF BAVARIAN DIALECTS IN AUSTRIA (WBÖ) (TEXTKORPUS zum WBÖ)-The Lexicon of Bavarian Dialects in Austria (Wörterbuch der bairischen Mundarten in Österrreich [WBÖ]) is based on a collection of about 4 million single records. They represent the variety of the regional, social and historical Bavarian dialects. About 10% of all entries are excerpts from texts of various types. The lexicographer has to draw the illustrative quotations from the original texts. The digitized full-texts have been dated and localized, thus each quotation can be placed in time and area enabling the lexicogapher to choose a representative number of examples. For the definitions, the most appropriate and illustrative quotations have to be found, in order to actively support the lexicographers work, the Institute of Lexicography of Austrian Dialects and Names / Institut für Österreichische Dialekt- und Namenlexika (http://www.oeaw.ac.at/dinamlex) started a new project: the so-called DBÖ (Database of the Bavarian Dialects in Austria / Datenbank der bairischen Mundarten in Österreich), financed by the Österreichische Akademie der Wissenschaften / Austrian Academy of Sciences. One main database includes the so-called Hauptkatalog, the main archive; there are additional databases which are necessary to retrieve correct information about the questionnaire, the localization of the word and the date of its recording. One component of the DBÖ is the database of electronic texts of the WBÖ, including some 90 Austrian texts spanning several centuries, representing the Austrian dialects. The original texts are scanned using the Austrian OCR-programme proLector V1.20 (A.1) which allows the training of various fonts. The machine-readable texts are converted in TUSTEP (Tübinger System von Textverarbeitungs-Programmen / Tübingen System of Text Processing Programmes). The TUSTEP-files get an alphanumeric key, which allows one to retrieve each quotation from the database in a chronological order and to sort it according to its localization. Finally the texts are broken down into the requisite sized pieces for quoting in the WBÖ-entries. Using a special programme the quotations can easily be reconnected and replaced in the proper context from which they were drawn. The digital texts are a valuable source for the lexicographer freeing him from the monotony of checking over and over again, thus leaving him more time for his proper work, namely the writing of entries for the WBÖ. Furthermore, the digital texts are important for speeding up the dictionary's publication in accordance with the guidelines of the Straffungskon-: zept 1993 and 1998.
C02 01  L    @0 52474 @1 XV
C02 02  L    @0 524
C03 01  L  FRE  @0 Lexicographie informatique @2 563 @5 01
C03 01  L  ENG  @0 Computational lexicography @2 563 @5 01
C03 02  L  FRE  @0 Base de données textuelles @2 NI @5 02
C03 02  L  ENG  @0 Textual database @2 NI @5 02
C03 03  L  FRE  @0 Dialecte @2 NI @5 03
C03 03  L  ENG  @0 Dialect @2 NI @5 03
C03 04  L  FRE  @0 Bavarois-Autrichien @2 NL @5 16
N21       @1 152

Links toward previous steps (curation, corpus...)


Links to Exploration step

Francis:09-0208472

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="GER" level="a">Zitate per Mausklick? Das Textkorpus zum WORTERBUCH DER BAIRISCHEN MUNDARTEN IN OSTERREICH (WBO) als leistungsstarkes Werkzeug für die lexikographische Praxis</title>
<author>
<name sortKey="Wandl Vogt, Eveline" sort="Wandl Vogt, Eveline" uniqKey="Wandl Vogt E" first="Eveline" last="Wandl-Vogt">Eveline Wandl-Vogt</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Institut für Österreichische Dialekt-und Namenlexika, Österreichische Akademie der Wissenschaften, Wohllebengasse 12-14/2</s1>
<s2>1040 Wien</s2>
<s3>AUT</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Autriche</country>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">09-0208472</idno>
<date when="2008">2008</date>
<idno type="stanalyst">FRANCIS 09-0208472 INIST</idno>
<idno type="RBID">Francis:09-0208472</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000248</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000551</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="GER" level="a">Zitate per Mausklick? Das Textkorpus zum WORTERBUCH DER BAIRISCHEN MUNDARTEN IN OSTERREICH (WBO) als leistungsstarkes Werkzeug für die lexikographische Praxis</title>
<author>
<name sortKey="Wandl Vogt, Eveline" sort="Wandl Vogt, Eveline" uniqKey="Wandl Vogt E" first="Eveline" last="Wandl-Vogt">Eveline Wandl-Vogt</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Institut für Österreichische Dialekt-und Namenlexika, Österreichische Akademie der Wissenschaften, Wohllebengasse 12-14/2</s1>
<s2>1040 Wien</s2>
<s3>AUT</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Autriche</country>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">Literary and linguistic computing</title>
<title level="j" type="abbreviated">Lit. linguist. comput.</title>
<idno type="ISSN">0268-1145</idno>
<imprint>
<date when="2008">2008</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Literary and linguistic computing</title>
<title level="j" type="abbreviated">Lit. linguist. comput.</title>
<idno type="ISSN">0268-1145</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Computational lexicography</term>
<term>Dialect</term>
<term>Textual database</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Lexicographie informatique</term>
<term>Base de données textuelles</term>
<term>Dialecte</term>
<term>Bavarois-Autrichien</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">The DATABASE OF ELECTRONIC TEXTS of the LEXICON OF BAVARIAN DIALECTS IN AUSTRIA (WBÖ) (TEXTKORPUS zum WBÖ)-The Lexicon of Bavarian Dialects in Austria (Wörterbuch der bairischen Mundarten in Österrreich [WBÖ]) is based on a collection of about 4 million single records. They represent the variety of the regional, social and historical Bavarian dialects. About 10% of all entries are excerpts from texts of various types. The lexicographer has to draw the illustrative quotations from the original texts. The digitized full-texts have been dated and localized, thus each quotation can be placed in time and area enabling the lexicogapher to choose a representative number of examples. For the definitions, the most appropriate and illustrative quotations have to be found, in order to actively support the lexicographers work, the Institute of Lexicography of Austrian Dialects and Names / Institut für Österreichische Dialekt- und Namenlexika (http://www.oeaw.ac.at/dinamlex) started a new project: the so-called DBÖ (Database of the Bavarian Dialects in Austria / Datenbank der bairischen Mundarten in Österreich), financed by the Österreichische Akademie der Wissenschaften / Austrian Academy of Sciences. One main database includes the so-called Hauptkatalog, the main archive; there are additional databases which are necessary to retrieve correct information about the questionnaire, the localization of the word and the date of its recording. One component of the DBÖ is the database of electronic texts of the WBÖ, including some 90 Austrian texts spanning several centuries, representing the Austrian dialects. The original texts are scanned using the Austrian OCR-programme proLector V1.20 (A.1) which allows the training of various fonts. The machine-readable texts are converted in TUSTEP (Tübinger System von Textverarbeitungs-Programmen / Tübingen System of Text Processing Programmes). The TUSTEP-files get an alphanumeric key, which allows one to retrieve each quotation from the database in a chronological order and to sort it according to its localization. Finally the texts are broken down into the requisite sized pieces for quoting in the WBÖ-entries. Using a special programme the quotations can easily be reconnected and replaced in the proper context from which they were drawn. The digital texts are a valuable source for the lexicographer freeing him from the monotony of checking over and over again, thus leaving him more time for his proper work, namely the writing of entries for the WBÖ. Furthermore, the digital texts are important for speeding up the dictionary's publication in accordance with the guidelines of the Straffungskon-: zept 1993 and 1998.</div>
</front>
</TEI>
<inist>
<standard h6="B">
<pA>
<fA01 i1="01" i2="1">
<s0>0268-1145</s0>
</fA01>
<fA03 i2="1">
<s0>Lit. linguist. comput.</s0>
</fA03>
<fA05>
<s2>23</s2>
</fA05>
<fA06>
<s2>2</s2>
</fA06>
<fA08 i1="01" i2="1" l="GER">
<s1>Zitate per Mausklick? Das Textkorpus zum WORTERBUCH DER BAIRISCHEN MUNDARTEN IN OSTERREICH (WBO) als leistungsstarkes Werkzeug für die lexikographische Praxis</s1>
</fA08>
<fA11 i1="01" i2="1">
<s1>WANDL-VOGT (Eveline)</s1>
</fA11>
<fA14 i1="01">
<s1>Institut für Österreichische Dialekt-und Namenlexika, Österreichische Akademie der Wissenschaften, Wohllebengasse 12-14/2</s1>
<s2>1040 Wien</s2>
<s3>AUT</s3>
<sZ>1 aut.</sZ>
</fA14>
<fA20>
<s1>201-217</s1>
</fA20>
<fA21>
<s1>2008</s1>
</fA21>
<fA23 i1="01">
<s0>GER</s0>
</fA23>
<fA24 i1="01">
<s0>eng</s0>
</fA24>
<fA43 i1="01">
<s1>INIST</s1>
<s2>23967</s2>
<s5>354000197945600050</s5>
</fA43>
<fA44>
<s0>0000</s0>
<s1>© 2009 INIST-CNRS. All rights reserved.</s1>
</fA44>
<fA45>
<s0>1 p.3/4</s0>
</fA45>
<fA47 i1="01" i2="1">
<s0>09-0208472</s0>
</fA47>
<fA60>
<s1>P</s1>
</fA60>
<fA61>
<s0>A</s0>
</fA61>
<fA64 i1="01" i2="1">
<s0>Literary and linguistic computing</s0>
</fA64>
<fA66 i1="01">
<s0>GBR</s0>
</fA66>
<fA99>
<s0>33 notes</s0>
</fA99>
<fC01 i1="01" l="ENG">
<s0>The DATABASE OF ELECTRONIC TEXTS of the LEXICON OF BAVARIAN DIALECTS IN AUSTRIA (WBÖ) (TEXTKORPUS zum WBÖ)-The Lexicon of Bavarian Dialects in Austria (Wörterbuch der bairischen Mundarten in Österrreich [WBÖ]) is based on a collection of about 4 million single records. They represent the variety of the regional, social and historical Bavarian dialects. About 10% of all entries are excerpts from texts of various types. The lexicographer has to draw the illustrative quotations from the original texts. The digitized full-texts have been dated and localized, thus each quotation can be placed in time and area enabling the lexicogapher to choose a representative number of examples. For the definitions, the most appropriate and illustrative quotations have to be found, in order to actively support the lexicographers work, the Institute of Lexicography of Austrian Dialects and Names / Institut für Österreichische Dialekt- und Namenlexika (http://www.oeaw.ac.at/dinamlex) started a new project: the so-called DBÖ (Database of the Bavarian Dialects in Austria / Datenbank der bairischen Mundarten in Österreich), financed by the Österreichische Akademie der Wissenschaften / Austrian Academy of Sciences. One main database includes the so-called Hauptkatalog, the main archive; there are additional databases which are necessary to retrieve correct information about the questionnaire, the localization of the word and the date of its recording. One component of the DBÖ is the database of electronic texts of the WBÖ, including some 90 Austrian texts spanning several centuries, representing the Austrian dialects. The original texts are scanned using the Austrian OCR-programme proLector V1.20 (A.1) which allows the training of various fonts. The machine-readable texts are converted in TUSTEP (Tübinger System von Textverarbeitungs-Programmen / Tübingen System of Text Processing Programmes). The TUSTEP-files get an alphanumeric key, which allows one to retrieve each quotation from the database in a chronological order and to sort it according to its localization. Finally the texts are broken down into the requisite sized pieces for quoting in the WBÖ-entries. Using a special programme the quotations can easily be reconnected and replaced in the proper context from which they were drawn. The digital texts are a valuable source for the lexicographer freeing him from the monotony of checking over and over again, thus leaving him more time for his proper work, namely the writing of entries for the WBÖ. Furthermore, the digital texts are important for speeding up the dictionary's publication in accordance with the guidelines of the Straffungskon-: zept 1993 and 1998.</s0>
</fC01>
<fC02 i1="01" i2="L">
<s0>52474</s0>
<s1>XV</s1>
</fC02>
<fC02 i1="02" i2="L">
<s0>524</s0>
</fC02>
<fC03 i1="01" i2="L" l="FRE">
<s0>Lexicographie informatique</s0>
<s2>563</s2>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="L" l="ENG">
<s0>Computational lexicography</s0>
<s2>563</s2>
<s5>01</s5>
</fC03>
<fC03 i1="02" i2="L" l="FRE">
<s0>Base de données textuelles</s0>
<s2>NI</s2>
<s5>02</s5>
</fC03>
<fC03 i1="02" i2="L" l="ENG">
<s0>Textual database</s0>
<s2>NI</s2>
<s5>02</s5>
</fC03>
<fC03 i1="03" i2="L" l="FRE">
<s0>Dialecte</s0>
<s2>NI</s2>
<s5>03</s5>
</fC03>
<fC03 i1="03" i2="L" l="ENG">
<s0>Dialect</s0>
<s2>NI</s2>
<s5>03</s5>
</fC03>
<fC03 i1="04" i2="L" l="FRE">
<s0>Bavarois-Autrichien</s0>
<s2>NL</s2>
<s5>16</s5>
</fC03>
<fN21>
<s1>152</s1>
</fN21>
</pA>
</standard>
</inist>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/PascalFrancis/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000551 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Curation/biblio.hfd -nk 000551 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    PascalFrancis
   |étape=   Curation
   |type=    RBID
   |clé=     Francis:09-0208472
   |texte=   Zitate per Mausklick? Das Textkorpus zum WORTERBUCH DER BAIRISCHEN MUNDARTEN IN OSTERREICH (WBO) als leistungsstarkes Werkzeug für die lexikographische Praxis
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024