A connectionist architecture that adapts its representation to complex tasks
Internal identifier: 000724 (PascalFrancis/Corpus); previous: 000723; next: 000725
A connectionist architecture that adapts its representation to complex tasks
Authors: Bruno Scherrer
Source: IEEE ... International Conference on Neural Networks [1098-7576]; 2002.
RBID : Pascal:04-0132603
French descriptors
English descriptors
Abstract
This paper presents an original connectionist architecture that is capable of adapting its representation to one or several reinforcement problems. We briefly describe the generic reinforcement learning theory it is based on. We focus on distributed algorithms that enable efficient planning. In this specific framework, we define the notion of task-specialisation and propose a procedure for adapting a task model without increasing its complexity. It consists of high-level learning of a representation in problems with possibly delayed reinforcements. We show that a single such architecture can adapt to multiple tasks. Finally, we stress its connectionist nature: most computations can be distributed and done in parallel. We illustrate and evaluate this adaptation paradigm in a continuous-space navigation environment.
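The abstract mentions distributed algorithms for planning in reinforcement problems (Markov decision processes). As a rough illustration of why such planning computations distribute well — a minimal sketch on a hypothetical toy problem, not the paper's actual architecture — here is asynchronous value iteration, where each state's update reads only the current values of its neighbours:

```python
# Minimal sketch (not the paper's algorithm): asynchronous value iteration
# on a small Markov decision process. Each state's Bellman update depends
# only on locally available values, so updates can be distributed across
# units and run in parallel, as in connectionist planning schemes.

import random

# Hypothetical 1-D corridor MDP: states 0..4, actions move left/right,
# reward 1.0 on entering the rightmost state (the "goal").
N_STATES = 5
GAMMA = 0.9
ACTIONS = (-1, +1)

def step(s, a):
    """Deterministic transition; reward is given on entering the goal."""
    s2 = min(max(s + a, 0), N_STATES - 1)
    r = 1.0 if (s2 == N_STATES - 1 and s != N_STATES - 1) else 0.0
    return s2, r

V = [0.0] * N_STATES

# Asynchronous sweeps: states updated one at a time in random order,
# always using the latest available values (Gauss-Seidel style).
for _ in range(100):
    for s in random.sample(range(N_STATES), N_STATES):
        V[s] = max(r + GAMMA * V[s2]
                   for s2, r in (step(s, a) for a in ACTIONS))

print([round(v, 3) for v in V])
```

Because each update is local and the Bellman operator is a contraction, the sweeps converge to the optimal values regardless of the (random) update order — the property that makes a parallel, connectionist implementation possible.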
Record in standard format (ISO 2709)
See the documentation on the Inist Standard format.
pA

| Field | i1 | i2 | Lang | Content |
|---|---|---|---|---|
| A01 | 01 | 1 | | @0 1098-7576 |
| A08 | 01 | 1 | ENG | @1 A connectionist architecture that adapts its representation to complex tasks |
| A09 | 01 | 1 | ENG | @1 IJCNN'02 : international joint conference on neural networks : Honolulu HI, 12-17 May 2002 |
| A11 | 01 | 1 | | @1 SCHERRER (Bruno) |
| A14 | 01 | | | @1 CORTEX/MAIA Teams, LORIA, B.P. 239 @2 54506 Vandoeuvre-Les-Nancy @3 FRA @Z 1 aut. |
| A18 | 01 | 1 | | @1 IEEE. Neural Networks Society @3 USA @9 patr. |
| A18 | 02 | 1 | | @1 International Neural Network Society @3 USA @9 patr. |
| A20 | | | | @1 2929-2934 |
| A21 | | | | @1 2002 |
| A23 | 01 | | | @0 ENG |
| A26 | 01 | | | @0 0-7803-7278-6 |
| A43 | 01 | | | @1 INIST @2 Y 37961 @5 354000117750885180 |
| A44 | | | | @0 0000 @1 © 2004 INIST-CNRS. All rights reserved. |
| A45 | | | | @0 13 ref. |
| A47 | 01 | 1 | | @0 04-0132603 |
| A60 | | | | @1 P @2 C |
| A61 | | | | @0 A |
| A64 | 01 | 1 | | @0 IEEE ... International Conference on Neural Networks |
| A66 | 01 | | | @0 USA |
| C01 | 01 | | ENG | @0 This paper presents an original connectionist architecture that is capable of adapting its representation to one or several reinforcement problems. We briefly describe the generic reinforcement learning theory it is based on. We focus on distributed algorithms that enable efficient planning. In this specific framework, we define the notion of task-specialisation and propose a procedure for adapting a task model without increasing its complexity. It consists of high-level learning of a representation in problems with possibly delayed reinforcements. We show that a single such architecture can adapt to multiple tasks. Finally, we stress its connectionist nature: most computations can be distributed and done in parallel. We illustrate and evaluate this adaptation paradigm in a continuous-space navigation environment. |
| C02 | 01 | X | | @0 001D02C06 |
| C03 | 01 | X | FRE | @0 Planification @5 01 |
| C03 | 01 | X | ENG | @0 Planning @5 01 |
| C03 | 01 | X | SPA | @0 Planificación @5 01 |
| C03 | 02 | X | FRE | @0 Apprentissage renforcé @5 02 |
| C03 | 02 | X | ENG | @0 Reinforcement learning @5 02 |
| C03 | 02 | X | SPA | @0 Aprendizaje reforzado @5 02 |
| C03 | 03 | X | FRE | @0 Algorithme réparti @5 03 |
| C03 | 03 | X | ENG | @0 Distributed algorithm @5 03 |
| C03 | 03 | X | SPA | @0 Algoritmo repartido @5 03 |
| C03 | 04 | X | FRE | @0 Calcul réparti @5 04 |
| C03 | 04 | X | ENG | @0 Distributed computing @5 04 |
| C03 | 04 | X | SPA | @0 Cálculo repartido @5 04 |
| C03 | 05 | X | FRE | @0 Connexionnisme @5 05 |
| C03 | 05 | X | ENG | @0 Connectionism @5 05 |
| C03 | 05 | X | SPA | @0 Conexionismo @5 05 |
| C03 | 06 | X | FRE | @0 Décision Markov @5 06 |
| C03 | 06 | X | ENG | @0 Markov decision @5 06 |
| C03 | 06 | X | SPA | @0 Decisión Markov @5 06 |
| C03 | 07 | X | FRE | @0 Architecture réseau @5 07 |
| C03 | 07 | X | ENG | @0 Network architecture @5 07 |
| C03 | 07 | X | SPA | @0 Arquitectura red @5 07 |
| N21 | | | | @1 082 |
| N82 | | | | @1 PSI |

pR

| Field | i1 | i2 | Lang | Content |
|---|---|---|---|---|
| A30 | 01 | 1 | ENG | @1 2002 International joint conference on neural networks @3 Honolulu HI USA @4 2002-05-12 |
Inist format (server)
NO : | PASCAL 04-0132603 INIST |
ET : | A connectionist architecture that adapts its representation to complex tasks |
AU : | SCHERRER (Bruno) |
AF : | CORTEX/MAIA Teams, LORIA, B.P. 239 /54506 Vandoeuvre-Les-Nancy/France (1 aut.) |
DT : | Publication en série; Congrès; Niveau analytique |
SO : | IEEE ... International Conference on Neural Networks; ISSN 1098-7576; Etats-Unis; Da. 2002; Pp. 2929-2934; Bibl. 13 ref. |
LA : | Anglais |
EA : | This paper presents an original connectionist architecture that is capable of adapting its representation to one or several reinforcement problems. We briefly describe the generic reinforcement learning theory it is based on. We focus on distributed algorithms that enable efficient planning. In this specific framework, we define the notion of task-specialisation and propose a procedure for adapting a task model without increasing its complexity. It consists of high-level learning of a representation in problems with possibly delayed reinforcements. We show that a single such architecture can adapt to multiple tasks. Finally, we stress its connectionist nature: most computations can be distributed and done in parallel. We illustrate and evaluate this adaptation paradigm in a continuous-space navigation environment. |
CC : | 001D02C06 |
FD : | Planification; Apprentissage renforcé; Algorithme réparti; Calcul réparti; Connexionnisme; Décision Markov; Architecture réseau |
ED : | Planning; Reinforcement learning; Distributed algorithm; Distributed computing; Connectionism; Markov decision; Network architecture |
SD : | Planificación; Aprendizaje reforzado; Algoritmo repartido; Cálculo repartido; Conexionismo; Decisión Markov; Arquitectura red |
LO : | INIST-Y 37961.354000117750885180 |
ID : | 04-0132603 |
Links to Exploration step
Pascal:04-0132603
The document in XML format
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" level="a">A connectionist architecture that adapts its representation to complex tasks</title>
<author><name sortKey="Scherrer, Bruno" sort="Scherrer, Bruno" uniqKey="Scherrer B" first="Bruno" last="Scherrer">Bruno Scherrer</name>
<affiliation><inist:fA14 i1="01"><s1>CORTEX/MAIA Teams, LORIA, B.P. 239 </s1>
<s2>54506 Vandoeuvre-Les-Nancy</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">INIST</idno>
<idno type="inist">04-0132603</idno>
<date when="2002">2002</date>
<idno type="stanalyst">PASCAL 04-0132603 INIST</idno>
<idno type="RBID">Pascal:04-0132603</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000724</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a">A connectionist architecture that adapts its representation to complex tasks</title>
<author><name sortKey="Scherrer, Bruno" sort="Scherrer, Bruno" uniqKey="Scherrer B" first="Bruno" last="Scherrer">Bruno Scherrer</name>
<affiliation><inist:fA14 i1="01"><s1>CORTEX/MAIA Teams, LORIA, B.P. 239 </s1>
<s2>54506 Vandoeuvre-Les-Nancy</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
</analytic>
<series><title level="j" type="main">IEEE ... International Conference on Neural Networks</title>
<idno type="ISSN">1098-7576</idno>
<imprint><date when="2002">2002</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><title level="j" type="main">IEEE ... International Conference on Neural Networks</title>
<idno type="ISSN">1098-7576</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Connectionism</term>
<term>Distributed algorithm</term>
<term>Distributed computing</term>
<term>Markov decision</term>
<term>Network architecture</term>
<term>Planning</term>
<term>Reinforcement learning</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Planification</term>
<term>Apprentissage renforcé</term>
<term>Algorithme réparti</term>
<term>Calcul réparti</term>
<term>Connexionnisme</term>
<term>Décision Markov</term>
<term>Architecture réseau</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">This paper presents an original connectionist architecture that is capable of adapting its representation to one or several reinforcement problems. We briefly describe the generic reinforcement learning theory it is based on. We focus on distributed algorithms that enable efficient planning. In this specific framework, we define the notion of task-specialisation and propose a procedure for adapting a task model without increasing its complexity. It consists of high-level learning of a representation in problems with possibly delayed reinforcements. We show that a single such architecture can adapt to multiple tasks. Finally, we stress its connectionist nature: most computations can be distributed and done in parallel. We illustrate and evaluate this adaptation paradigm in a continuous-space navigation environment.</div>
</front>
</TEI>
<inist><standard h6="B"><pA><fA01 i1="01" i2="1"><s0>1098-7576</s0>
</fA01>
<fA08 i1="01" i2="1" l="ENG"><s1>A connectionist architecture that adapts its representation to complex tasks</s1>
</fA08>
<fA09 i1="01" i2="1" l="ENG"><s1>IJCNN'02 : international joint conference on neural networks : Honolulu HI, 12-17 May 2002</s1>
</fA09>
<fA11 i1="01" i2="1"><s1>SCHERRER (Bruno)</s1>
</fA11>
<fA14 i1="01"><s1>CORTEX/MAIA Teams, LORIA, B.P. 239 </s1>
<s2>54506 Vandoeuvre-Les-Nancy</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
</fA14>
<fA18 i1="01" i2="1"><s1>IEEE. Neural Networks Society</s1>
<s3>USA</s3>
<s9>patr.</s9>
</fA18>
<fA18 i1="02" i2="1"><s1>International Neural Network Society</s1>
<s3>USA</s3>
<s9>patr.</s9>
</fA18>
<fA20><s1>2929-2934</s1>
</fA20>
<fA21><s1>2002</s1>
</fA21>
<fA23 i1="01"><s0>ENG</s0>
</fA23>
<fA26 i1="01"><s0>0-7803-7278-6</s0>
</fA26>
<fA43 i1="01"><s1>INIST</s1>
<s2>Y 37961</s2>
<s5>354000117750885180</s5>
</fA43>
<fA44><s0>0000</s0>
<s1>© 2004 INIST-CNRS. All rights reserved.</s1>
</fA44>
<fA45><s0>13 ref.</s0>
</fA45>
<fA47 i1="01" i2="1"><s0>04-0132603</s0>
</fA47>
<fA60><s1>P</s1>
<s2>C</s2>
</fA60>
<fA64 i1="01" i2="1"><s0>IEEE ... International Conference on Neural Networks</s0>
</fA64>
<fA66 i1="01"><s0>USA</s0>
</fA66>
<fC01 i1="01" l="ENG"><s0>This paper presents an original connectionist architecture that is capable of adapting its representation to one or several reinforcement problems. We briefly describe the generic reinforcement learning theory it is based on. We focus on distributed algorithms that enable efficient planning. In this specific framework, we define the notion of task-specialisation and propose a procedure for adapting a task model without increasing its complexity. It consists of high-level learning of a representation in problems with possibly delayed reinforcements. We show that a single such architecture can adapt to multiple tasks. Finally, we stress its connectionist nature: most computations can be distributed and done in parallel. We illustrate and evaluate this adaptation paradigm in a continuous-space navigation environment.</s0>
</fC01>
<fC02 i1="01" i2="X"><s0>001D02C06</s0>
</fC02>
<fC03 i1="01" i2="X" l="FRE"><s0>Planification</s0>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="X" l="ENG"><s0>Planning</s0>
<s5>01</s5>
</fC03>
<fC03 i1="01" i2="X" l="SPA"><s0>Planificación</s0>
<s5>01</s5>
</fC03>
<fC03 i1="02" i2="X" l="FRE"><s0>Apprentissage renforcé</s0>
<s5>02</s5>
</fC03>
<fC03 i1="02" i2="X" l="ENG"><s0>Reinforcement learning</s0>
<s5>02</s5>
</fC03>
<fC03 i1="02" i2="X" l="SPA"><s0>Aprendizaje reforzado</s0>
<s5>02</s5>
</fC03>
<fC03 i1="03" i2="X" l="FRE"><s0>Algorithme réparti</s0>
<s5>03</s5>
</fC03>
<fC03 i1="03" i2="X" l="ENG"><s0>Distributed algorithm</s0>
<s5>03</s5>
</fC03>
<fC03 i1="03" i2="X" l="SPA"><s0>Algoritmo repartido</s0>
<s5>03</s5>
</fC03>
<fC03 i1="04" i2="X" l="FRE"><s0>Calcul réparti</s0>
<s5>04</s5>
</fC03>
<fC03 i1="04" i2="X" l="ENG"><s0>Distributed computing</s0>
<s5>04</s5>
</fC03>
<fC03 i1="04" i2="X" l="SPA"><s0>Cálculo repartido</s0>
<s5>04</s5>
</fC03>
<fC03 i1="05" i2="X" l="FRE"><s0>Connexionnisme</s0>
<s5>05</s5>
</fC03>
<fC03 i1="05" i2="X" l="ENG"><s0>Connectionism</s0>
<s5>05</s5>
</fC03>
<fC03 i1="05" i2="X" l="SPA"><s0>Conexionismo</s0>
<s5>05</s5>
</fC03>
<fC03 i1="06" i2="X" l="FRE"><s0>Décision Markov</s0>
<s5>06</s5>
</fC03>
<fC03 i1="06" i2="X" l="ENG"><s0>Markov decision</s0>
<s5>06</s5>
</fC03>
<fC03 i1="06" i2="X" l="SPA"><s0>Decisión Markov</s0>
<s5>06</s5>
</fC03>
<fC03 i1="07" i2="X" l="FRE"><s0>Architecture réseau</s0>
<s5>07</s5>
</fC03>
<fC03 i1="07" i2="X" l="ENG"><s0>Network architecture</s0>
<s5>07</s5>
</fC03>
<fC03 i1="07" i2="X" l="SPA"><s0>Arquitectura red</s0>
<s5>07</s5>
</fC03>
<fN21><s1>082</s1>
</fN21>
<fN82><s1>PSI</s1>
</fN82>
</pA>
<pR><fA30 i1="01" i2="1" l="ENG"><s1>2002 International joint conference on neural networks</s1>
<s3>Honolulu HI USA</s3>
<s4>2002-05-12</s4>
</fA30>
</pR>
</standard>
<server><NO>PASCAL 04-0132603 INIST</NO>
<ET>A connectionist architecture that adapts its representation to complex tasks</ET>
<AU>SCHERRER (Bruno)</AU>
<AF>CORTEX/MAIA Teams, LORIA, B.P. 239 /54506 Vandoeuvre-Les-Nancy/France (1 aut.)</AF>
<DT>Publication en série; Congrès; Niveau analytique</DT>
<SO>IEEE ... International Conference on Neural Networks; ISSN 1098-7576; Etats-Unis; Da. 2002; Pp. 2929-2934; Bibl. 13 ref.</SO>
<LA>Anglais</LA>
<EA>This paper presents an original connectionist architecture that is capable of adapting its representation to one or several reinforcement problems. We briefly describe the generic reinforcement learning theory it is based on. We focus on distributed algorithms that enable efficient planning. In this specific framework, we define the notion of task-specialisation and propose a procedure for adapting a task model without increasing its complexity. It consists of high-level learning of a representation in problems with possibly delayed reinforcements. We show that a single such architecture can adapt to multiple tasks. Finally, we stress its connectionist nature: most computations can be distributed and done in parallel. We illustrate and evaluate this adaptation paradigm in a continuous-space navigation environment.</EA>
<CC>001D02C06</CC>
<FD>Planification; Apprentissage renforcé; Algorithme réparti; Calcul réparti; Connexionnisme; Décision Markov; Architecture réseau</FD>
<ED>Planning; Reinforcement learning; Distributed algorithm; Distributed computing; Connectionism; Markov decision; Network architecture</ED>
<SD>Planificación; Aprendizaje reforzado; Algoritmo repartido; Cálculo repartido; Conexionismo; Decisión Markov; Arquitectura red</SD>
<LO>INIST-Y 37961.354000117750885180</LO>
<ID>04-0132603</ID>
</server>
</inist>
</record>
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/PascalFrancis/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000724 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Corpus/biblio.hfd -nk 000724 | SxmlIndent | more
To add a link to this page in the Wicri network
{{Explor lien
|wiki= Wicri/Lorraine
|area= InforLorV4
|flux= PascalFrancis
|étape= Corpus
|type= RBID
|clé= Pascal:04-0132603
|texte= A connectionist architecture that adapts its representation to complex tasks
}}
This area was generated with Dilib version V0.6.33. Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022.