Learning to weigh basic behaviors in Scalable Agents
Identifieur interne : 008903 ( Main/Merge ); précédent : 008902; suivant : 008904Learning to weigh basic behaviors in Scalable Agents
Auteurs : Olivier Buffet ; Alain Dutech ; François CharpilletSource :
English descriptors
Abstract
Agents, especially in the context of Multi-Agents Systems, are confronted to complex tasks. We propose a methodology for the automated design of such agents in the case where the global task can be decomposed into simpler sub-tasks that can be concurrent. This is accomplished by automatically combining basic behaviors using Reinforcement Learning methods. Basic behaviors are either learned or reused from previous tasks as they do not need to be tuned to the specific task being learned. Furthermore, the agents designed by our methodology are highly scalable as, without further refinement of the global behavior, they can automatically combine several instances of the same basic behavior to take into account concurrent occurences of the same subtask.
Links toward previous steps (curation, corpus...)
- to stream Crin, to step Corpus: 003265
- to stream Crin, to step Curation: 003265
- to stream Crin, to step Checkpoint: 001135
Links to Exploration step
CRIN:buffet02aLe document en format XML
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" wicri:score="340">Learning to weigh basic behaviors in Scalable Agents</title>
</titleStmt>
<publicationStmt><idno type="RBID">CRIN:buffet02a</idno>
<date when="2002" year="2002">2002</date>
<idno type="wicri:Area/Crin/Corpus">003265</idno>
<idno type="wicri:Area/Crin/Curation">003265</idno>
<idno type="wicri:explorRef" wicri:stream="Crin" wicri:step="Curation">003265</idno>
<idno type="wicri:Area/Crin/Checkpoint">001135</idno>
<idno type="wicri:explorRef" wicri:stream="Crin" wicri:step="Checkpoint">001135</idno>
<idno type="wicri:Area/Main/Merge">008903</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en">Learning to weigh basic behaviors in Scalable Agents</title>
<author><name sortKey="Buffet, Olivier" sort="Buffet, Olivier" uniqKey="Buffet O" first="Olivier" last="Buffet">Olivier Buffet</name>
</author>
<author><name sortKey="Dutech, Alain" sort="Dutech, Alain" uniqKey="Dutech A" first="Alain" last="Dutech">Alain Dutech</name>
</author>
<author><name sortKey="Charpillet, Francois" sort="Charpillet, Francois" uniqKey="Charpillet F" first="François" last="Charpillet">François Charpillet</name>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>adaptation</term>
<term>complex environments</term>
<term>multiagent systems</term>
<term>reinforcement learning</term>
<term>scalability</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en" wicri:score="2251">Agents, especially in the context of Multi-Agents Systems, are confronted to complex tasks. We propose a methodology for the automated design of such agents in the case where the global task can be decomposed into simpler sub-tasks that can be concurrent. This is accomplished by automatically combining basic behaviors using Reinforcement Learning methods. Basic behaviors are either learned or reused from previous tasks as they do not need to be tuned to the specific task being learned. Furthermore, the agents designed by our methodology are highly scalable as, without further refinement of the global behavior, they can automatically combine several instances of the same basic behavior to take into account concurrent occurences of the same subtask.</div>
</front>
</TEI>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 008903 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Merge/biblio.hfd -nk 008903 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Wicri/Lorraine |area= InforLorV4 |flux= Main |étape= Merge |type= RBID |clé= CRIN:buffet02a |texte= Learning to weigh basic behaviors in Scalable Agents }}
![]() | This area was generated with Dilib version V0.6.33. | ![]() |