Serveur d'exploration sur la recherche en informatique en Lorraine

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Interac-DEC-MDP : Towards the use of interactions in DEC-MDP

Identifieur interne : 006D30 ( Main/Merge ); précédent : 006D29; suivant : 006D31

Interac-DEC-MDP : Towards the use of interactions in DEC-MDP

Auteurs : Vincent Thomas ; Christine Bourjot ; Vincent Chevrier

Source :

RBID : CRIN:thomas04b

English descriptors

Abstract

This article presents a new formalism Interac-DEC-MDP whose aim is to introduce the concept of interaction in Decentralized Markov Decision Process and which has been inspired by biology. The aim of this formalism, Interac-DEC-MDP, is to describe and represent interactions among agents. The outcome of interactions is decided collectively by two agents and is in charge of the distribution of local rewards. We have modeled a biological experiment within this formalism. A simple learning algorithm applied on this formalism generates a more efficient collective behavior than without interactions.

Links toward previous steps (curation, corpus...)


Links to Exploration step

CRIN:thomas04b

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" wicri:score="140">Interac-DEC-MDP : Towards the use of interactions in DEC-MDP</title>
</titleStmt>
<publicationStmt>
<idno type="RBID">CRIN:thomas04b</idno>
<date when="2004" year="2004">2004</date>
<idno type="wicri:Area/Crin/Corpus">003D97</idno>
<idno type="wicri:Area/Crin/Curation">003D97</idno>
<idno type="wicri:explorRef" wicri:stream="Crin" wicri:step="Curation">003D97</idno>
<idno type="wicri:Area/Crin/Checkpoint">000855</idno>
<idno type="wicri:explorRef" wicri:stream="Crin" wicri:step="Checkpoint">000855</idno>
<idno type="wicri:Area/Main/Merge">006D30</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Interac-DEC-MDP : Towards the use of interactions in DEC-MDP</title>
<author>
<name sortKey="Thomas, Vincent" sort="Thomas, Vincent" uniqKey="Thomas V" first="Vincent" last="Thomas">Vincent Thomas</name>
</author>
<author>
<name sortKey="Bourjot, Christine" sort="Bourjot, Christine" uniqKey="Bourjot C" first="Christine" last="Bourjot">Christine Bourjot</name>
</author>
<author>
<name sortKey="Chevrier, Vincent" sort="Chevrier, Vincent" uniqKey="Chevrier V" first="Vincent" last="Chevrier">Vincent Chevrier</name>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>interaction</term>
<term>learning</term>
<term>markov decision process</term>
<term>multi-agent system</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en" wicri:score="2672">This article presents a new formalism Interac-DEC-MDP whose aim is to introduce the concept of interaction in Decentralized Markov Decision Process and which has been inspired by biology. The aim of this formalism, Interac-DEC-MDP, is to describe and represent interactions among agents. The outcome of interactions is decided collectively by two agents and is in charge of the distribution of local rewards. We have modeled a biological experiment within this formalism. A simple learning algorithm applied on this formalism generates a more efficient collective behavior than without interactions.</div>
</front>
</TEI>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Merge
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 006D30 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Merge/biblio.hfd -nk 006D30 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    Main
   |étape=   Merge
   |type=    RBID
   |clé=     CRIN:thomas04b
   |texte=   Interac-DEC-MDP : Towards the use of interactions in DEC-MDP
}}

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022