OCR exploration server

Warning: this site is under development.
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Extraction of Logical Structure from Articles in Mathematics

Internal identifier: 000204 (Istex/Curation); previous: 000203; next: 000205

Authors: Koji Nakagawa [Japan]; Akihiro Nomura [Japan]; Masakazu Suzuki [Japan]

Source: Lecture Notes in Computer Science, 2004 (ISSN 0302-9743)

RBID : ISTEX:1E3DF0D2C722EA10F79418F82CCB9CF41C7E8BFB

Abstract

We propose a mathematical knowledge browser which helps people to read mathematical documents. By the browser printed mathematical documents can be scanned and recognized by OCR (Optical Character Recognition). Then the meta-information (e.g. title, author) and the logical structure (e.g. section, theorem) of the documents are automatically extracted. The purpose of this paper is to show the extraction method of logical structure specialized for mathematical documents. We implemented this method in INFTY which is an integrated OCR system for mathematical documents. In order to show the effectiveness of the method we made a correct database from an existing mathematical OCR database, and made an experiment.

DOI: 10.1007/978-3-540-27818-4_20

The document in XML format

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Extraction of Logical Structure from Articles in Mathematics</title>
<author>
<name sortKey="Nakagawa, Koji" sort="Nakagawa, Koji" uniqKey="Nakagawa K" first="Koji" last="Nakagawa">Koji Nakagawa</name>
<affiliation wicri:level="1">
<mods:affiliation>Faculty of Mathematics, Kyushu University, Kyushu Univ. 36, 812-8581, Fukuoka, Japan</mods:affiliation>
<country xml:lang="fr">Japon</country>
<wicri:regionArea>Faculty of Mathematics, Kyushu University, Kyushu Univ. 36, 812-8581, Fukuoka</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1">
<mods:affiliation>E-mail: nakagawa@math.kyushu-u.ac.jp</mods:affiliation>
<country wicri:rule="url">Japon</country>
</affiliation>
</author>
<author>
<name sortKey="Nomura, Akihiro" sort="Nomura, Akihiro" uniqKey="Nomura A" first="Akihiro" last="Nomura">Akihiro Nomura</name>
<affiliation wicri:level="4">
<mods:affiliation>Graduate School of Mathematics, Kyushu University</mods:affiliation>
<country>Japon</country>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
<author>
<name sortKey="Suzuki, Masakazu" sort="Suzuki, Masakazu" uniqKey="Suzuki M" first="Masakazu" last="Suzuki">Masakazu Suzuki</name>
<affiliation wicri:level="1">
<mods:affiliation>Faculty of Mathematics, Kyushu University, Kyushu Univ. 36, 812-8581, Fukuoka, Japan</mods:affiliation>
<country xml:lang="fr">Japon</country>
<wicri:regionArea>Faculty of Mathematics, Kyushu University, Kyushu Univ. 36, 812-8581, Fukuoka</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1">
<mods:affiliation>E-mail: suzuki@math.kyushu-u.ac.jp</mods:affiliation>
<country wicri:rule="url">Japon</country>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:1E3DF0D2C722EA10F79418F82CCB9CF41C7E8BFB</idno>
<date when="2004" year="2004">2004</date>
<idno type="doi">10.1007/978-3-540-27818-4_20</idno>
<idno type="url">https://api.istex.fr/document/1E3DF0D2C722EA10F79418F82CCB9CF41C7E8BFB/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000207</idno>
<idno type="wicri:Area/Istex/Curation">000204</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Extraction of Logical Structure from Articles in Mathematics</title>
<author>
<name sortKey="Nakagawa, Koji" sort="Nakagawa, Koji" uniqKey="Nakagawa K" first="Koji" last="Nakagawa">Koji Nakagawa</name>
<affiliation wicri:level="1">
<mods:affiliation>Faculty of Mathematics, Kyushu University, Kyushu Univ. 36, 812-8581, Fukuoka, Japan</mods:affiliation>
<country xml:lang="fr">Japon</country>
<wicri:regionArea>Faculty of Mathematics, Kyushu University, Kyushu Univ. 36, 812-8581, Fukuoka</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1">
<mods:affiliation>E-mail: nakagawa@math.kyushu-u.ac.jp</mods:affiliation>
<country wicri:rule="url">Japon</country>
</affiliation>
</author>
<author>
<name sortKey="Nomura, Akihiro" sort="Nomura, Akihiro" uniqKey="Nomura A" first="Akihiro" last="Nomura">Akihiro Nomura</name>
<affiliation wicri:level="4">
<mods:affiliation>Graduate School of Mathematics, Kyushu University</mods:affiliation>
<country>Japon</country>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
<author>
<name sortKey="Suzuki, Masakazu" sort="Suzuki, Masakazu" uniqKey="Suzuki M" first="Masakazu" last="Suzuki">Masakazu Suzuki</name>
<affiliation wicri:level="1">
<mods:affiliation>Faculty of Mathematics, Kyushu University, Kyushu Univ. 36, 812-8581, Fukuoka, Japan</mods:affiliation>
<country xml:lang="fr">Japon</country>
<wicri:regionArea>Faculty of Mathematics, Kyushu University, Kyushu Univ. 36, 812-8581, Fukuoka</wicri:regionArea>
</affiliation>
<affiliation wicri:level="1">
<mods:affiliation>E-mail: suzuki@math.kyushu-u.ac.jp</mods:affiliation>
<country wicri:rule="url">Japon</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2004</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">1E3DF0D2C722EA10F79418F82CCB9CF41C7E8BFB</idno>
<idno type="DOI">10.1007/978-3-540-27818-4_20</idno>
<idno type="ChapterID">20</idno>
<idno type="ChapterID">Chap20</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: We propose a mathematical knowledge browser which helps people to read mathematical documents. By the browser printed mathematical documents can be scanned and recognized by OCR (Optical Character Recognition). Then the meta-information (e.g. title, author) and the logical structure (e.g. section, theorem) of the documents are automatically extracted. The purpose of this paper is to show the extraction method of logical structure specialized for mathematical documents. We implemented this method in INFTY which is an integrated OCR system for mathematical documents. In order to show the effectiveness of the method we made a correct database from an existing mathematical OCR database, and made an experiment.</div>
</front>
</TEI>
</record>
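
The record above can also be read programmatically. The following is a minimal sketch in Python, not part of the Dilib toolchain. It assumes the record is available as a string, and it relies on one observation: as displayed here, the record uses the `wicri:` and `mods:` prefixes without declaring them, so a strict XML parser would reject it. The sketch wraps the record in a hypothetical `<wrapper>` element that binds both prefixes to placeholder URIs (the `urn:example:...` values are assumptions, not the real namespace URIs). `parse_record` and `SAMPLE` are illustrative names, and `SAMPLE` is an abbreviated copy of the record.

```python
import xml.etree.ElementTree as ET

# Placeholder namespace URIs (assumption): the record does not declare
# the wicri: and mods: prefixes, so any URI is enough to make it parse.
NS_DECLS = (
    'xmlns:wicri="urn:example:wicri" '
    'xmlns:mods="urn:example:mods"'
)

# Abbreviated copy of the record shown above, for illustration only.
SAMPLE = """<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader><fileDesc>
<titleStmt>
<title xml:lang="en">Extraction of Logical Structure from Articles in Mathematics</title>
<author>
<name sortKey="Nakagawa, Koji">Koji Nakagawa</name>
<affiliation wicri:level="1">
<mods:affiliation>Faculty of Mathematics, Kyushu University</mods:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Suzuki, Masakazu">Masakazu Suzuki</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="doi">10.1007/978-3-540-27818-4_20</idno>
</publicationStmt>
</fileDesc></teiHeader>
</TEI>
</record>"""


def parse_record(record_xml: str) -> dict:
    """Return the title, author names and DOI found in one <record>."""
    # Wrap the record in a root element that binds the undeclared prefixes.
    wrapped = f"<wrapper {NS_DECLS}>{record_xml}</wrapper>"
    root = ET.fromstring(wrapped)

    # Read title and authors from <titleStmt> only, because the same
    # authors are repeated under <sourceDesc> in the full record.
    title_stmt = root.find(".//titleStmt")
    title = title_stmt.findtext("title")
    authors = [a.findtext("name") for a in title_stmt.findall("author")]

    doi = root.findtext(".//publicationStmt/idno[@type='doi']")
    return {"title": title, "authors": authors, "doi": doi}


if __name__ == "__main__":
    info = parse_record(SAMPLE)
    print(info["title"])
    print("; ".join(info["authors"]))
    print(info["doi"])
```

Restricting the author lookup to `<titleStmt>` avoids counting each author twice, since the full record lists them again inside `<biblStruct>`.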

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Istex/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000204 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Istex/Curation/biblio.hfd -nk 000204 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Istex
   |étape=   Curation
   |type=    RBID
   |clé=     ISTEX:1E3DF0D2C722EA10F79418F82CCB9CF41C7E8BFB
   |texte=   Extraction of Logical Structure from Articles in Mathematics
}}

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024