Serveur d'exploration sur la recherche en informatique en Lorraine

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Algorithmes de co-évolution pour la résolution approchée de PDM multi-agent

Identifieur interne : 000438 ( PascalFrancis/Corpus ); précédent : 000437; suivant : 000439

Algorithmes de co-évolution pour la résolution approchée de PDM multi-agent

Auteurs : Iadine Chades

Source :

RBID : Pascal:06-0329057

Descripteurs français

English descriptors

Abstract

Markov Decision Processes (MDP) provide a formal approach for solving sequential decision problems under uncertainty. Decentralized Markov Decision Processes extend MDP. They are used to model the problem of several agents making decision under uncertainty. However, solving a decentralized markov decision process is NEXP-complete as soon as two agents are involved. In this paper we propose algorithms for solving approximately Multi-agent MDP problems in a centralized or decentralized way. The aim of our methods is to design a system in which agents coordinate to achieve a task in collaboration. Coordination is based on two major properties: subjectivity and empathy. While subjectivity allows an agent to deal with incomplete and local perceptions, thus to design memory less policies, empathy allows an agent to adapt its decisions in regards with the uncertainty of the decisions of the others.

Notice en format standard (ISO 2709)

Pour connaître la documentation sur le format Inist Standard.

pA  
A01 01  1    @0 0992-499X
A03   1    @0 Rev. intell. artif.
A05       @2 20
A06       @2 2-3
A08 01  1  FRE  @1 Algorithmes de co-évolution pour la résolution approchée de PDM multi-agent
A09 01  1  FRE  @1 Décision et planification dans l'incertain
A11 01  1    @1 CHADES (Iadine)
A12 01  1    @1 CHARPILLET (F.) @9 ed.
A12 02  1    @1 GARCIA (F.) @9 ed.
A12 03  1    @1 PERNY (Patrice) @9 ed.
A12 04  1    @1 SIGAUD (Olivier) @9 ed.
A14 01      @1 Equipe Méthodes Mathématiques et Informatique pour la Décision Unité de Biométrie et Intelligence Artificielle INRA, BP27 @2 31326 Castanet-Tolosan @3 FRA @Z 1 aut.
A15 01      @1 LORIA-INRIA @2 Nancy @3 FRA @Z 1 aut.
A15 02      @1 INRA-MIA @2 Toulouse @3 FRA @Z 2 aut.
A15 03      @1 LIP6 @2 Paris @3 FRA @Z 3 aut. @Z 4 aut.
A20       @1 345-382
A21       @1 2006
A23 01      @0 FRE
A24 01      @0 eng
A43 01      @1 INIST @2 21320 @5 354000142556580070
A44       @0 0000 @1 © 2006 INIST-CNRS. All rights reserved.
A45       @0 1 p.1/4
A47 01  1    @0 06-0329057
A60       @1 P @2 C
A61       @0 A
A64 01  1    @0 Revue d'intelligence artificielle
A66 01      @0 FRA
C01 01    ENG  @0 Markov Decision Processes (MDP) provide a formal approach for solving sequential decision problems under uncertainty. Decentralized Markov Decision Processes extend MDP. They are used to model the problem of several agents making decision under uncertainty. However, solving a decentralized markov decision process is NEXP-complete as soon as two agents are involved. In this paper we propose algorithms for solving approximately Multi-agent MDP problems in a centralized or decentralized way. The aim of our methods is to design a system in which agents coordinate to achieve a task in collaboration. Coordination is based on two major properties: subjectivity and empathy. While subjectivity allows an agent to deal with incomplete and local perceptions, thus to design memory less policies, empathy allows an agent to adapt its decisions in regards with the uncertainty of the decisions of the others.
C02 01  X    @0 001D01A08
C02 02  X    @0 001D02C
C03 01  X  FRE  @0 Prise décision @5 06
C03 01  X  ENG  @0 Decision making @5 06
C03 01  X  SPA  @0 Toma decision @5 06
C03 02  X  FRE  @0 Système incertain @5 07
C03 02  X  ENG  @0 Uncertain system @5 07
C03 02  X  SPA  @0 Sistema incierto @5 07
C03 03  X  FRE  @0 Système aide décision @5 08
C03 03  X  ENG  @0 Decision support system @5 08
C03 03  X  SPA  @0 Sistema ayuda decisíon @5 08
C03 04  X  FRE  @0 Système multiagent @5 09
C03 04  X  ENG  @0 Multiagent system @5 09
C03 04  X  SPA  @0 Sistema multiagente @5 09
C03 05  X  FRE  @0 Intelligence artificielle @5 10
C03 05  X  ENG  @0 Artificial intelligence @5 10
C03 05  X  SPA  @0 Inteligencia artificial @5 10
C03 06  X  FRE  @0 Coordination @5 11
C03 06  X  ENG  @0 Coordination @5 11
C03 06  X  SPA  @0 Coordinación @5 11
C03 07  X  FRE  @0 Empathie @5 12
C03 07  X  ENG  @0 Empathy @5 12
C03 07  X  SPA  @0 Empatía @5 12
C03 08  X  FRE  @0 Perception @5 13
C03 08  X  ENG  @0 Perception @5 13
C03 08  X  SPA  @0 Percepción @5 13
C03 09  X  FRE  @0 Planification @5 14
C03 09  X  ENG  @0 Planning @5 14
C03 09  X  SPA  @0 Planificación @5 14
C03 10  X  FRE  @0 Décision Markov @5 18
C03 10  X  ENG  @0 Markov decision @5 18
C03 10  X  SPA  @0 Decisión Markov @5 18
C03 11  X  FRE  @0 Décision séquentielle @5 19
C03 11  X  ENG  @0 Sequential decision @5 19
C03 11  X  SPA  @0 Decisión secuencial @5 19
C03 12  X  FRE  @0 Système décentralisé @5 20
C03 12  X  ENG  @0 Decentralized system @5 20
C03 12  X  SPA  @0 Sistema descentralizado @5 20
C03 13  X  FRE  @0 Processus Markov @5 23
C03 13  X  ENG  @0 Markov process @5 23
C03 13  X  SPA  @0 Proceso Markov @5 23
C03 14  X  FRE  @0 Résolution problème @5 24
C03 14  X  ENG  @0 Problem solving @5 24
C03 14  X  SPA  @0 Resolución problema @5 24
C03 15  X  FRE  @0 Modélisation @5 25
C03 15  X  ENG  @0 Modeling @5 25
C03 15  X  SPA  @0 Modelización @5 25
C03 16  X  FRE  @0 Système coordonnée @5 26
C03 16  X  ENG  @0 Coordinate system @5 26
C03 16  X  SPA  @0 Sistema coordenadas @5 26
C03 17  X  FRE  @0 . @4 INC @5 82
N21       @1 212
N44 01      @1 OTO
N82       @1 OTO
pR  
A30 01  1  FRE  @1 Décision dynamique et planification dans l'incertain. Journée @3 Paris FRA @4 2004-05-07

Format Inist (serveur)

NO : PASCAL 06-0329057 INIST
FT : Algorithmes de co-évolution pour la résolution approchée de PDM multi-agent
AU : CHADES (Iadine); CHARPILLET (F.); GARCIA (F.); PERNY (Patrice); SIGAUD (Olivier)
AF : Equipe Méthodes Mathématiques et Informatique pour la Décision Unité de Biométrie et Intelligence Artificielle INRA, BP27/31326 Castanet-Tolosan/France (1 aut.); LORIA-INRIA/Nancy/France (1 aut.); INRA-MIA/Toulouse/France (2 aut.); LIP6/Paris/France (3 aut., 4 aut.)
DT : Publication en série; Congrès; Niveau analytique
SO : Revue d'intelligence artificielle; ISSN 0992-499X; France; Da. 2006; Vol. 20; No. 2-3; Pp. 345-382; Abs. anglais; Bibl. 1 p.1/4
LA : Français
EA : Markov Decision Processes (MDP) provide a formal approach for solving sequential decision problems under uncertainty. Decentralized Markov Decision Processes extend MDP. They are used to model the problem of several agents making decision under uncertainty. However, solving a decentralized markov decision process is NEXP-complete as soon as two agents are involved. In this paper we propose algorithms for solving approximately Multi-agent MDP problems in a centralized or decentralized way. The aim of our methods is to design a system in which agents coordinate to achieve a task in collaboration. Coordination is based on two major properties: subjectivity and empathy. While subjectivity allows an agent to deal with incomplete and local perceptions, thus to design memory less policies, empathy allows an agent to adapt its decisions in regards with the uncertainty of the decisions of the others.
CC : 001D01A08; 001D02C
FD : Prise décision; Système incertain; Système aide décision; Système multiagent; Intelligence artificielle; Coordination; Empathie; Perception; Planification; Décision Markov; Décision séquentielle; Système décentralisé; Processus Markov; Résolution problème; Modélisation; Système coordonnée; .
ED : Decision making; Uncertain system; Decision support system; Multiagent system; Artificial intelligence; Coordination; Empathy; Perception; Planning; Markov decision; Sequential decision; Decentralized system; Markov process; Problem solving; Modeling; Coordinate system
SD : Toma decision; Sistema incierto; Sistema ayuda decisíon; Sistema multiagente; Inteligencia artificial; Coordinación; Empatía; Percepción; Planificación; Decisión Markov; Decisión secuencial; Sistema descentralizado; Proceso Markov; Resolución problema; Modelización; Sistema coordenadas
LO : INIST-21320.354000142556580070
ID : 06-0329057

Links to Exploration step

Pascal:06-0329057

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="fr" level="a">Algorithmes de co-évolution pour la résolution approchée de PDM multi-agent</title>
<author>
<name sortKey="Chades, Iadine" sort="Chades, Iadine" uniqKey="Chades I" first="Iadine" last="Chades">Iadine Chades</name>
<affiliation>
<inist:fA14 i1="01">
<s1>Equipe Méthodes Mathématiques et Informatique pour la Décision Unité de Biométrie et Intelligence Artificielle INRA, BP27</s1>
<s2>31326 Castanet-Tolosan</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">06-0329057</idno>
<date when="2006">2006</date>
<idno type="stanalyst">PASCAL 06-0329057 INIST</idno>
<idno type="RBID">Pascal:06-0329057</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000438</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="fr" level="a">Algorithmes de co-évolution pour la résolution approchée de PDM multi-agent</title>
<author>
<name sortKey="Chades, Iadine" sort="Chades, Iadine" uniqKey="Chades I" first="Iadine" last="Chades">Iadine Chades</name>
<affiliation>
<inist:fA14 i1="01">
<s1>Equipe Méthodes Mathématiques et Informatique pour la Décision Unité de Biométrie et Intelligence Artificielle INRA, BP27</s1>
<s2>31326 Castanet-Tolosan</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">Revue d'intelligence artificielle</title>
<title level="j" type="abbreviated">Rev. intell. artif.</title>
<idno type="ISSN">0992-499X</idno>
<imprint>
<date when="2006">2006</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Revue d'intelligence artificielle</title>
<title level="j" type="abbreviated">Rev. intell. artif.</title>
<idno type="ISSN">0992-499X</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Artificial intelligence</term>
<term>Coordinate system</term>
<term>Coordination</term>
<term>Decentralized system</term>
<term>Decision making</term>
<term>Decision support system</term>
<term>Empathy</term>
<term>Markov decision</term>
<term>Markov process</term>
<term>Modeling</term>
<term>Multiagent system</term>
<term>Perception</term>
<term>Planning</term>
<term>Problem solving</term>
<term>Sequential decision</term>
<term>Uncertain system</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Prise décision</term>
<term>Système incertain</term>
<term>Système aide décision</term>
<term>Système multiagent</term>
<term>Intelligence artificielle</term>
<term>Coordination</term>
<term>Empathie</term>
<term>Perception</term>
<term>Planification</term>
<term>Décision Markov</term>
<term>Décision séquentielle</term>
<term>Système décentralisé</term>
<term>Processus Markov</term>
<term>Résolution problème</term>
<term>Modélisation</term>
<term>Système coordonnée</term>
<term>.</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Markov Decision Processes (MDP) provide a formal approach for solving sequential decision problems under uncertainty. Decentralized Markov Decision Processes extend MDP. They are used to model the problem of several agents making decision under uncertainty. However, solving a decentralized markov decision process is NEXP-complete as soon as two agents are involved. In this paper we propose algorithms for solving approximately Multi-agent MDP problems in a centralized or decentralized way. The aim of our methods is to design a system in which agents coordinate to achieve a task in collaboration. Coordination is based on two major properties: subjectivity and empathy. While subjectivity allows an agent to deal with incomplete and local perceptions, thus to design memory less policies, empathy allows an agent to adapt its decisions in regards with the uncertainty of the decisions of the others.</div>
</front>
</TEI>
<inist>
<standard h6="B">
<pA>
<fA01 i1="01" i2="1">
<s0>0992-499X</s0>
</fA01>
<fA03 i2="1">
<s0>Rev. intell. artif.</s0>
</fA03>
<fA05>
<s2>20</s2>
</fA05>
<fA06>
<s2>2-3</s2>
</fA06>
<fA08 i1="01" i2="1" l="FRE">
<s1>Algorithmes de co-évolution pour la résolution approchée de PDM multi-agent</s1>
</fA08>
<fA09 i1="01" i2="1" l="FRE">
<s1>Décision et planification dans l'incertain</s1>
</fA09>
<fA11 i1="01" i2="1">
<s1>CHADES (Iadine)</s1>
</fA11>
<fA12 i1="01" i2="1">
<s1>CHARPILLET (F.)</s1>
<s9>ed.</s9>
</fA12>
<fA12 i1="02" i2="1">
<s1>GARCIA (F.)</s1>
<s9>ed.</s9>
</fA12>
<fA12 i1="03" i2="1">
<s1>PERNY (Patrice)</s1>
<s9>ed.</s9>
</fA12>
<fA12 i1="04" i2="1">
<s1>SIGAUD (Olivier)</s1>
<s9>ed.</s9>
</fA12>
<fA14 i1="01">
<s1>Equipe Méthodes Mathématiques et Informatique pour la Décision Unité de Biométrie et Intelligence Artificielle INRA, BP27</s1>
<s2>31326 Castanet-Tolosan</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
</fA14>
<fA15 i1="01">
<s1>LORIA-INRIA</s1>
<s2>Nancy</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
</fA15>
<fA15 i1="02">
<s1>INRA-MIA</s1>
<s2>Toulouse</s2>
<s3>FRA</s3>
<sZ>2 aut.</sZ>
</fA15>
<fA15 i1="03">
<s1>LIP6</s1>
<s2>Paris</s2>
<s3>FRA</s3>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
</fA15>
<fA20>
<s1>345-382</s1>
</fA20>
<fA21>
<s1>2006</s1>
</fA21>
<fA23 i1="01">
<s0>FRE</s0>
</fA23>
<fA24 i1="01">
<s0>eng</s0>
</fA24>
<fA43 i1="01">
<s1>INIST</s1>
<s2>21320</s2>
<s5>354000142556580070</s5>
</fA43>
<fA44>
<s0>0000</s0>
<s1>© 2006 INIST-CNRS. All rights reserved.</s1>
</fA44>
<fA45>
<s0>1 p.1/4</s0>
</fA45>
<fA47 i1="01" i2="1">
<s0>06-0329057</s0>
</fA47>
<fA60>
<s1>P</s1>
<s2>C</s2>
</fA60>
<fA61>
<s0>A</s0>
</fA61>
<fA64 i1="01" i2="1">
<s0>Revue d'intelligence artificielle</s0>
</fA64>
<fA66 i1="01">
<s0>FRA</s0>
</fA66>
<fC01 i1="01" l="ENG">
<s0>Markov Decision Processes (MDP) provide a formal approach for solving sequential decision problems under uncertainty. Decentralized Markov Decision Processes extend MDP. They are used to model the problem of several agents making decision under uncertainty. However, solving a decentralized markov decision process is NEXP-complete as soon as two agents are involved. In this paper we propose algorithms for solving approximately Multi-agent MDP problems in a centralized or decentralized way. The aim of our methods is to design a system in which agents coordinate to achieve a task in collaboration. Coordination is based on two major properties: subjectivity and empathy. While subjectivity allows an agent to deal with incomplete and local perceptions, thus to design memory less policies, empathy allows an agent to adapt its decisions in regards with the uncertainty of the decisions of the others.</s0>
</fC01>
<fC02 i1="01" i2="X">
<s0>001D01A08</s0>
</fC02>
<fC02 i1="02" i2="X">
<s0>001D02C</s0>
</fC02>
<fC03 i1="01" i2="X" l="FRE">
<s0>Prise décision</s0>
<s5>06</s5>
</fC03>
<fC03 i1="01" i2="X" l="ENG">
<s0>Decision making</s0>
<s5>06</s5>
</fC03>
<fC03 i1="01" i2="X" l="SPA">
<s0>Toma decision</s0>
<s5>06</s5>
</fC03>
<fC03 i1="02" i2="X" l="FRE">
<s0>Système incertain</s0>
<s5>07</s5>
</fC03>
<fC03 i1="02" i2="X" l="ENG">
<s0>Uncertain system</s0>
<s5>07</s5>
</fC03>
<fC03 i1="02" i2="X" l="SPA">
<s0>Sistema incierto</s0>
<s5>07</s5>
</fC03>
<fC03 i1="03" i2="X" l="FRE">
<s0>Système aide décision</s0>
<s5>08</s5>
</fC03>
<fC03 i1="03" i2="X" l="ENG">
<s0>Decision support system</s0>
<s5>08</s5>
</fC03>
<fC03 i1="03" i2="X" l="SPA">
<s0>Sistema ayuda decisíon</s0>
<s5>08</s5>
</fC03>
<fC03 i1="04" i2="X" l="FRE">
<s0>Système multiagent</s0>
<s5>09</s5>
</fC03>
<fC03 i1="04" i2="X" l="ENG">
<s0>Multiagent system</s0>
<s5>09</s5>
</fC03>
<fC03 i1="04" i2="X" l="SPA">
<s0>Sistema multiagente</s0>
<s5>09</s5>
</fC03>
<fC03 i1="05" i2="X" l="FRE">
<s0>Intelligence artificielle</s0>
<s5>10</s5>
</fC03>
<fC03 i1="05" i2="X" l="ENG">
<s0>Artificial intelligence</s0>
<s5>10</s5>
</fC03>
<fC03 i1="05" i2="X" l="SPA">
<s0>Inteligencia artificial</s0>
<s5>10</s5>
</fC03>
<fC03 i1="06" i2="X" l="FRE">
<s0>Coordination</s0>
<s5>11</s5>
</fC03>
<fC03 i1="06" i2="X" l="ENG">
<s0>Coordination</s0>
<s5>11</s5>
</fC03>
<fC03 i1="06" i2="X" l="SPA">
<s0>Coordinación</s0>
<s5>11</s5>
</fC03>
<fC03 i1="07" i2="X" l="FRE">
<s0>Empathie</s0>
<s5>12</s5>
</fC03>
<fC03 i1="07" i2="X" l="ENG">
<s0>Empathy</s0>
<s5>12</s5>
</fC03>
<fC03 i1="07" i2="X" l="SPA">
<s0>Empatía</s0>
<s5>12</s5>
</fC03>
<fC03 i1="08" i2="X" l="FRE">
<s0>Perception</s0>
<s5>13</s5>
</fC03>
<fC03 i1="08" i2="X" l="ENG">
<s0>Perception</s0>
<s5>13</s5>
</fC03>
<fC03 i1="08" i2="X" l="SPA">
<s0>Percepción</s0>
<s5>13</s5>
</fC03>
<fC03 i1="09" i2="X" l="FRE">
<s0>Planification</s0>
<s5>14</s5>
</fC03>
<fC03 i1="09" i2="X" l="ENG">
<s0>Planning</s0>
<s5>14</s5>
</fC03>
<fC03 i1="09" i2="X" l="SPA">
<s0>Planificación</s0>
<s5>14</s5>
</fC03>
<fC03 i1="10" i2="X" l="FRE">
<s0>Décision Markov</s0>
<s5>18</s5>
</fC03>
<fC03 i1="10" i2="X" l="ENG">
<s0>Markov decision</s0>
<s5>18</s5>
</fC03>
<fC03 i1="10" i2="X" l="SPA">
<s0>Decisión Markov</s0>
<s5>18</s5>
</fC03>
<fC03 i1="11" i2="X" l="FRE">
<s0>Décision séquentielle</s0>
<s5>19</s5>
</fC03>
<fC03 i1="11" i2="X" l="ENG">
<s0>Sequential decision</s0>
<s5>19</s5>
</fC03>
<fC03 i1="11" i2="X" l="SPA">
<s0>Decisión secuencial</s0>
<s5>19</s5>
</fC03>
<fC03 i1="12" i2="X" l="FRE">
<s0>Système décentralisé</s0>
<s5>20</s5>
</fC03>
<fC03 i1="12" i2="X" l="ENG">
<s0>Decentralized system</s0>
<s5>20</s5>
</fC03>
<fC03 i1="12" i2="X" l="SPA">
<s0>Sistema descentralizado</s0>
<s5>20</s5>
</fC03>
<fC03 i1="13" i2="X" l="FRE">
<s0>Processus Markov</s0>
<s5>23</s5>
</fC03>
<fC03 i1="13" i2="X" l="ENG">
<s0>Markov process</s0>
<s5>23</s5>
</fC03>
<fC03 i1="13" i2="X" l="SPA">
<s0>Proceso Markov</s0>
<s5>23</s5>
</fC03>
<fC03 i1="14" i2="X" l="FRE">
<s0>Résolution problème</s0>
<s5>24</s5>
</fC03>
<fC03 i1="14" i2="X" l="ENG">
<s0>Problem solving</s0>
<s5>24</s5>
</fC03>
<fC03 i1="14" i2="X" l="SPA">
<s0>Resolución problema</s0>
<s5>24</s5>
</fC03>
<fC03 i1="15" i2="X" l="FRE">
<s0>Modélisation</s0>
<s5>25</s5>
</fC03>
<fC03 i1="15" i2="X" l="ENG">
<s0>Modeling</s0>
<s5>25</s5>
</fC03>
<fC03 i1="15" i2="X" l="SPA">
<s0>Modelización</s0>
<s5>25</s5>
</fC03>
<fC03 i1="16" i2="X" l="FRE">
<s0>Système coordonnée</s0>
<s5>26</s5>
</fC03>
<fC03 i1="16" i2="X" l="ENG">
<s0>Coordinate system</s0>
<s5>26</s5>
</fC03>
<fC03 i1="16" i2="X" l="SPA">
<s0>Sistema coordenadas</s0>
<s5>26</s5>
</fC03>
<fC03 i1="17" i2="X" l="FRE">
<s0>.</s0>
<s4>INC</s4>
<s5>82</s5>
</fC03>
<fN21>
<s1>212</s1>
</fN21>
<fN44 i1="01">
<s1>OTO</s1>
</fN44>
<fN82>
<s1>OTO</s1>
</fN82>
</pA>
<pR>
<fA30 i1="01" i2="1" l="FRE">
<s1>Décision dynamique et planification dans l'incertain. Journée</s1>
<s3>Paris FRA</s3>
<s4>2004-05-07</s4>
</fA30>
</pR>
</standard>
<server>
<NO>PASCAL 06-0329057 INIST</NO>
<FT>Algorithmes de co-évolution pour la résolution approchée de PDM multi-agent</FT>
<AU>CHADES (Iadine); CHARPILLET (F.); GARCIA (F.); PERNY (Patrice); SIGAUD (Olivier)</AU>
<AF>Equipe Méthodes Mathématiques et Informatique pour la Décision Unité de Biométrie et Intelligence Artificielle INRA, BP27/31326 Castanet-Tolosan/France (1 aut.); LORIA-INRIA/Nancy/France (1 aut.); INRA-MIA/Toulouse/France (2 aut.); LIP6/Paris/France (3 aut., 4 aut.)</AF>
<DT>Publication en série; Congrès; Niveau analytique</DT>
<SO>Revue d'intelligence artificielle; ISSN 0992-499X; France; Da. 2006; Vol. 20; No. 2-3; Pp. 345-382; Abs. anglais; Bibl. 1 p.1/4</SO>
<LA>Français</LA>
<EA>Markov Decision Processes (MDP) provide a formal approach for solving sequential decision problems under uncertainty. Decentralized Markov Decision Processes extend MDP. They are used to model the problem of several agents making decision under uncertainty. However, solving a decentralized markov decision process is NEXP-complete as soon as two agents are involved. In this paper we propose algorithms for solving approximately Multi-agent MDP problems in a centralized or decentralized way. The aim of our methods is to design a system in which agents coordinate to achieve a task in collaboration. Coordination is based on two major properties: subjectivity and empathy. While subjectivity allows an agent to deal with incomplete and local perceptions, thus to design memory less policies, empathy allows an agent to adapt its decisions in regards with the uncertainty of the decisions of the others.</EA>
<CC>001D01A08; 001D02C</CC>
<FD>Prise décision; Système incertain; Système aide décision; Système multiagent; Intelligence artificielle; Coordination; Empathie; Perception; Planification; Décision Markov; Décision séquentielle; Système décentralisé; Processus Markov; Résolution problème; Modélisation; Système coordonnée; .</FD>
<ED>Decision making; Uncertain system; Decision support system; Multiagent system; Artificial intelligence; Coordination; Empathy; Perception; Planning; Markov decision; Sequential decision; Decentralized system; Markov process; Problem solving; Modeling; Coordinate system</ED>
<SD>Toma decision; Sistema incierto; Sistema ayuda decisíon; Sistema multiagente; Inteligencia artificial; Coordinación; Empatía; Percepción; Planificación; Decisión Markov; Decisión secuencial; Sistema descentralizado; Proceso Markov; Resolución problema; Modelización; Sistema coordenadas</SD>
<LO>INIST-21320.354000142556580070</LO>
<ID>06-0329057</ID>
</server>
</inist>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/PascalFrancis/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000438 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/PascalFrancis/Corpus/biblio.hfd -nk 000438 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    PascalFrancis
   |étape=   Corpus
   |type=    RBID
   |clé=     Pascal:06-0329057
   |texte=   Algorithmes de co-évolution pour la résolution approchée de PDM multi-agent
}}

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022