InforLorV4, Main, Merge, bibRecord, 008903

Learning to weigh basic behaviors in Scalable Agents

Identifieur interne : 008903 ( Main/Merge ); précédent : 008902; suivant : 008904

Learning to weigh basic behaviors in Scalable Agents

Auteurs : Olivier Buffet ; Alain Dutech ; François Charpillet

Source :

RBID : CRIN:buffet02a

English descriptors

KwdEn :
- adaptation, complex environments, multiagent systems, reinforcement learning, scalability.

Abstract

Agents, especially in the context of Multi-Agents Systems, are confronted to complex tasks. We propose a methodology for the automated design of such agents in the case where the global task can be decomposed into simpler sub-tasks that can be concurrent. This is accomplished by automatically combining basic behaviors using Reinforcement Learning methods. Basic behaviors are either learned or reused from previous tasks as they do not need to be tuned to the specific task being learned. Furthermore, the agents designed by our methodology are highly scalable as, without further refinement of the global behavior, they can automatically combine several instances of the same basic behavior to take into account concurrent occurences of the same subtask.

Links toward previous steps (curation, corpus...)

to stream Crin, to step Corpus: 003265
to stream Crin, to step Curation: 003265
to stream Crin, to step Checkpoint: 001135

Links to Exploration step

CRIN:buffet02a

Le document en format XML

<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" wicri:score="340">Learning to weigh basic behaviors in Scalable Agents</title>
</titleStmt>
<publicationStmt><idno type="RBID">CRIN:buffet02a</idno>
<date when="2002" year="2002">2002</date>
<idno type="wicri:Area/Crin/Corpus">003265</idno>
<idno type="wicri:Area/Crin/Curation">003265</idno>
<idno type="wicri:explorRef" wicri:stream="Crin" wicri:step="Curation">003265</idno>
<idno type="wicri:Area/Crin/Checkpoint">001135</idno>
<idno type="wicri:explorRef" wicri:stream="Crin" wicri:step="Checkpoint">001135</idno>
<idno type="wicri:Area/Main/Merge">008903</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en">Learning to weigh basic behaviors in Scalable Agents</title>
<author><name sortKey="Buffet, Olivier" sort="Buffet, Olivier" uniqKey="Buffet O" first="Olivier" last="Buffet">Olivier Buffet</name>
</author>
<author><name sortKey="Dutech, Alain" sort="Dutech, Alain" uniqKey="Dutech A" first="Alain" last="Dutech">Alain Dutech</name>
</author>
<author><name sortKey="Charpillet, Francois" sort="Charpillet, Francois" uniqKey="Charpillet F" first="François" last="Charpillet">François Charpillet</name>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>adaptation</term>
<term>complex environments</term>
<term>multiagent systems</term>
<term>reinforcement learning</term>
<term>scalability</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en" wicri:score="2251">Agents, especially in the context of Multi-Agents Systems, are confronted to complex tasks. We propose a methodology for the automated design of such agents in the case where the global task can be decomposed into simpler sub-tasks that can be concurrent. This is accomplished by automatically combining basic behaviors using Reinforcement Learning methods. Basic behaviors are either learned or reused from previous tasks as they do not need to be tuned to the specific task being learned. Furthermore, the agents designed by our methodology are highly scalable as, without further refinement of the global behavior, they can automatically combine several instances of the same basic behavior to take into account concurrent occurences of the same subtask.</div>
</front>
</TEI>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Merge

HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 008903 | SxmlIndent | more

HfdSelect -h $EXPLOR_AREA/Data/Main/Merge/biblio.hfd -nk 008903 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    Main
   |étape=   Merge
   |type=    RBID
   |clé=     CRIN:buffet02a
   |texte=   Learning to weigh basic behaviors in Scalable Agents
}}

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022

	Serveur d'exploration sur la recherche en informatique en Lorraine
	Attention, ce site est en cours de développement ! Attention, site généré par des moyens informatiques à partir de corpus bruts. Les informations ne sont donc pas validées.

Serveur d'exploration sur la recherche en informatique en Lorraine

Learning to weigh basic behaviors in Scalable Agents

Learning to weigh basic behaviors in Scalable Agents

Source :

English descriptors

Abstract

Links toward previous steps (curation, corpus...)

Links to Exploration step

Le document en format XML

Pour manipuler ce document sous Unix (Dilib)

Pour mettre un lien sur cette page dans le réseau Wicri