Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Development of a written music recognition system using Java and open source technologies

Identifieur interne : 001404 ( Main/Exploration ); précédent : 001403; suivant : 001405

Development of a written music recognition system using Java and open source technologies

Auteurs : Gernot Loibner [Autriche] ; Andreas Schwarzl [Autriche] ; Matthias Kovac [Autriche] ; Dietmar Paulus [Autriche] ; Wolfgang Polzleitner [Autriche]

Source :

RBID : Pascal:06-0297695

Descripteurs français

English descriptors

Abstract

We report on the development of a software system to recognize and interpret printed music. The overall goal is to scan printed music sheets, analyze and recognize the notes, timing, and written text, and derive the all necessary information to use the computers MIDI sound system to play the music. This function is primarily useful for musicians who want to digitize printed music for editing purposes. There exist a number of commercial systems that offer such a functionality. However, on testing these systems, we were astonished on how weak they behave in their pattern recognition parts. Although we submitted very clear and rather flawless scanning input, none of these systems was able to e.g. recognize all notes, staff lines, and systems. They all require a high degree of interaction, post-processing, and editing to get a decent digital version of the hard copy material. In this paper we focus on the pattern recognition area. In a first approach we tested more or less standard methods of adaptive thresholding, blob detection, line detection, and comer detection to find the notes, staff lines, and candidate objects subject to OCR. Many of the objects on this type of material can be learned in a training phase. None of the commercial systems we saw offers the option to train special characters or unusual signatures. A second goal in this project is to use a modern software engineering platform. We were interested in how well Java and open source technologies are suitable for pattern recognition and machine vision. The scanning of music served as a case-study.


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Development of a written music recognition system using Java and open source technologies</title>
<author>
<name sortKey="Loibner, Gernot" sort="Loibner, Gernot" uniqKey="Loibner G" first="Gernot" last="Loibner">Gernot Loibner</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Kaindorf College of Computer Information Systems</s1>
<s3>AUT</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Autriche</country>
<wicri:noRegion>Kaindorf College of Computer Information Systems</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Schwarzl, Andreas" sort="Schwarzl, Andreas" uniqKey="Schwarzl A" first="Andreas" last="Schwarzl">Andreas Schwarzl</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Kaindorf College of Computer Information Systems</s1>
<s3>AUT</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Autriche</country>
<wicri:noRegion>Kaindorf College of Computer Information Systems</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Kovac, Matthias" sort="Kovac, Matthias" uniqKey="Kovac M" first="Matthias" last="Kovac">Matthias Kovac</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Kaindorf College of Computer Information Systems</s1>
<s3>AUT</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Autriche</country>
<wicri:noRegion>Kaindorf College of Computer Information Systems</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Paulus, Dietmar" sort="Paulus, Dietmar" uniqKey="Paulus D" first="Dietmar" last="Paulus">Dietmar Paulus</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Kaindorf College of Computer Information Systems</s1>
<s3>AUT</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Autriche</country>
<wicri:noRegion>Kaindorf College of Computer Information Systems</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Polzleitner, Wolfgang" sort="Polzleitner, Wolfgang" uniqKey="Polzleitner W" first="Wolfgang" last="Polzleitner">Wolfgang Polzleitner</name>
<affiliation wicri:level="1">
<inist:fA14 i1="02">
<s1>Sensotech GmbH</s1>
<s3>AUT</s3>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Autriche</country>
<wicri:noRegion>Sensotech GmbH</wicri:noRegion>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">06-0297695</idno>
<date when="2005">2005</date>
<idno type="stanalyst">PASCAL 06-0297695 INIST</idno>
<idno type="RBID">Pascal:06-0297695</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000384</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000402</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000417</idno>
<idno type="wicri:doubleKey">0277-786X:2005:Loibner G:development:of:a</idno>
<idno type="wicri:Area/Main/Merge">001447</idno>
<idno type="wicri:Area/Main/Curation">001404</idno>
<idno type="wicri:Area/Main/Exploration">001404</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Development of a written music recognition system using Java and open source technologies</title>
<author>
<name sortKey="Loibner, Gernot" sort="Loibner, Gernot" uniqKey="Loibner G" first="Gernot" last="Loibner">Gernot Loibner</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Kaindorf College of Computer Information Systems</s1>
<s3>AUT</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Autriche</country>
<wicri:noRegion>Kaindorf College of Computer Information Systems</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Schwarzl, Andreas" sort="Schwarzl, Andreas" uniqKey="Schwarzl A" first="Andreas" last="Schwarzl">Andreas Schwarzl</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Kaindorf College of Computer Information Systems</s1>
<s3>AUT</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Autriche</country>
<wicri:noRegion>Kaindorf College of Computer Information Systems</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Kovac, Matthias" sort="Kovac, Matthias" uniqKey="Kovac M" first="Matthias" last="Kovac">Matthias Kovac</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Kaindorf College of Computer Information Systems</s1>
<s3>AUT</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Autriche</country>
<wicri:noRegion>Kaindorf College of Computer Information Systems</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Paulus, Dietmar" sort="Paulus, Dietmar" uniqKey="Paulus D" first="Dietmar" last="Paulus">Dietmar Paulus</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Kaindorf College of Computer Information Systems</s1>
<s3>AUT</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Autriche</country>
<wicri:noRegion>Kaindorf College of Computer Information Systems</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Polzleitner, Wolfgang" sort="Polzleitner, Wolfgang" uniqKey="Polzleitner W" first="Wolfgang" last="Polzleitner">Wolfgang Polzleitner</name>
<affiliation wicri:level="1">
<inist:fA14 i1="02">
<s1>Sensotech GmbH</s1>
<s3>AUT</s3>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Autriche</country>
<wicri:noRegion>Sensotech GmbH</wicri:noRegion>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">Proceedings of SPIE, the International Society for Optical Engineering</title>
<idno type="ISSN">0277-786X</idno>
<imprint>
<date when="2005">2005</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Proceedings of SPIE, the International Society for Optical Engineering</title>
<idno type="ISSN">0277-786X</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Adaptive detection</term>
<term>Adaptive method</term>
<term>Character recognition</term>
<term>Computer vision</term>
<term>Image processing</term>
<term>Information use</term>
<term>JAVA language</term>
<term>Music</term>
<term>Open source software</term>
<term>Open systems</term>
<term>Optical character recognition</term>
<term>Pattern recognition</term>
<term>Robotics</term>
<term>Software development</term>
<term>Software engineering</term>
<term>Text</term>
<term>Threshold detection</term>
<term>Timed system</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Robotique</term>
<term>Système ouvert</term>
<term>Langage JAVA</term>
<term>Logiciel libre</term>
<term>Développement logiciel</term>
<term>Texte</term>
<term>Utilisation information</term>
<term>Reconnaissance forme</term>
<term>Reconnaissance caractère</term>
<term>Reconnaissance optique caractère</term>
<term>Génie logiciel</term>
<term>Vision ordinateur</term>
<term>Traitement image</term>
<term>Musique</term>
<term>Système temporisé</term>
<term>Détection adaptative</term>
<term>Détection seuil</term>
<term>Méthode adaptative</term>
<term>.</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr">
<term>Robotique</term>
<term>Génie logiciel</term>
<term>Musique</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">We report on the development of a software system to recognize and interpret printed music. The overall goal is to scan printed music sheets, analyze and recognize the notes, timing, and written text, and derive the all necessary information to use the computers MIDI sound system to play the music. This function is primarily useful for musicians who want to digitize printed music for editing purposes. There exist a number of commercial systems that offer such a functionality. However, on testing these systems, we were astonished on how weak they behave in their pattern recognition parts. Although we submitted very clear and rather flawless scanning input, none of these systems was able to e.g. recognize all notes, staff lines, and systems. They all require a high degree of interaction, post-processing, and editing to get a decent digital version of the hard copy material. In this paper we focus on the pattern recognition area. In a first approach we tested more or less standard methods of adaptive thresholding, blob detection, line detection, and comer detection to find the notes, staff lines, and candidate objects subject to OCR. Many of the objects on this type of material can be learned in a training phase. None of the commercial systems we saw offers the option to train special characters or unusual signatures. A second goal in this project is to use a modern software engineering platform. We were interested in how well Java and open source technologies are suitable for pattern recognition and machine vision. The scanning of music served as a case-study.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Autriche</li>
</country>
</list>
<tree>
<country name="Autriche">
<noRegion>
<name sortKey="Loibner, Gernot" sort="Loibner, Gernot" uniqKey="Loibner G" first="Gernot" last="Loibner">Gernot Loibner</name>
</noRegion>
<name sortKey="Kovac, Matthias" sort="Kovac, Matthias" uniqKey="Kovac M" first="Matthias" last="Kovac">Matthias Kovac</name>
<name sortKey="Paulus, Dietmar" sort="Paulus, Dietmar" uniqKey="Paulus D" first="Dietmar" last="Paulus">Dietmar Paulus</name>
<name sortKey="Polzleitner, Wolfgang" sort="Polzleitner, Wolfgang" uniqKey="Polzleitner W" first="Wolfgang" last="Polzleitner">Wolfgang Polzleitner</name>
<name sortKey="Schwarzl, Andreas" sort="Schwarzl, Andreas" uniqKey="Schwarzl A" first="Andreas" last="Schwarzl">Andreas Schwarzl</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001404 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001404 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Pascal:06-0297695
   |texte=   Development of a written music recognition system using Java and open source technologies
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024