Serveur d'exploration sur la recherche en informatique en Lorraine

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Exploiting separability in multiagent planning with continuous-state mdps (extended abstract)

Identifieur interne : 002128 ( Hal/Corpus ); précédent : 002127; suivant : 002129

Exploiting separability in multiagent planning with continuous-state mdps (extended abstract)

Auteurs : Jilles Steeve Dibangoye ; Christopher Amato ; Olivier Buffet ; François Charpillet

Source :

RBID : Hal:hal-01188483

Abstract

Decentralized partially observable Markov deci- sion processes (Dec-POMDPs) provide a general model for decision-making under uncertainty in co- operative decentralized settings, but are difficult to solve optimally (NEXP-Complete). As a new way of solving these problems, we recently intro- duced a method for transforming a Dec-POMDP into a continuous-state deterministic MDP with a piecewise-linear and convex value function. This new Dec-POMDP formulation, which we call an occupancy MDP, allows powerful POMDP and continuous-state MDP methods to be used for the first time. However, scalability remains limited when the number of agents or problem variables becomes large. In this paper, we show that, un- der certain separability conditions of the optimal value function, the scalability of this approach can increase considerably. This separability is present when there is locality of interaction be- tween agents, which can be exploited to improve performance. Unlike most previous methods, the novel continuous-state MDP algorithm retains op- timality and convergence guarantees. Results show that the extension using separability can scale to a large number of agents and domain variables while maintaining optimality.

Url:

Links to Exploration step

Hal:hal-01188483

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Exploiting separability in multiagent planning with continuous-state mdps (extended abstract)</title>
<author>
<name sortKey="Dibangoye, Jilles Steeve" sort="Dibangoye, Jilles Steeve" uniqKey="Dibangoye J" first="Jilles Steeve" last="Dibangoye">Jilles Steeve Dibangoye</name>
<affiliation>
<hal:affiliation type="researchteam" xml:id="struct-417638" status="VALID">
<idno type="RNSR">201521334D</idno>
<orgName>Robots coopératifs et adaptés à la présence humaine en environnements dynamiques</orgName>
<orgName type="acronym">CHROMA</orgName>
<desc>
<address>
<addrLine>Centre de Recherche Inria655 Avenue de l'Europe, 38330 Montbonnot-Saint-Martin</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/equipes/chroma</ref>
</desc>
<listRelation>
<relation active="#struct-2497" type="direct"></relation>
<relation active="#struct-300009" type="indirect"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-2497" type="direct">
<org type="laboratory" xml:id="struct-2497" status="VALID">
<idno type="RNSR">199218244V</idno>
<orgName>Inria Grenoble - Rhône-Alpes</orgName>
<desc>
<address>
<addrLine>Inovallée655 avenue de l'Europe38330 Montbonnot</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/centre/grenoble</ref>
</desc>
<listRelation>
<relation active="#struct-300009" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-300009" type="indirect">
<org type="institution" xml:id="struct-300009" status="VALID">
<orgName>Institut National de Recherche en Informatique et en Automatique</orgName>
<orgName type="acronym">Inria</orgName>
<desc>
<address>
<addrLine>Domaine de VoluceauRocquencourt - BP 10578153 Le Chesnay Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/en/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Amato, Christopher" sort="Amato, Christopher" uniqKey="Amato C" first="Christopher" last="Amato">Christopher Amato</name>
<affiliation>
<hal:affiliation type="laboratory" xml:id="struct-406632" status="VALID">
<orgName>Computer Science and Artificial Intelligence Laboratory [Cambridge]</orgName>
<orgName type="acronym">CSAIL</orgName>
<desc>
<address>
<addrLine>The Stata Center, Building 32 32 Vassar Street Cambridge, MA 02139</addrLine>
<country key="US"></country>
</address>
<ref type="url">https://www.csail.mit.edu/</ref>
</desc>
<listRelation>
<relation active="#struct-22441" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-22441" type="direct">
<org type="institution" xml:id="struct-22441" status="VALID">
<orgName>Massachusetts Institute of technology [Cambridge]</orgName>
<orgName type="acronym">MIT</orgName>
<desc>
<address>
<addrLine>Massachusetts Avenue Cambridge, MA 02142</addrLine>
<country key="US"></country>
</address>
<ref type="url">http://mit.edu/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Buffet, Olivier" sort="Buffet, Olivier" uniqKey="Buffet O" first="Olivier" last="Buffet">Olivier Buffet</name>
<affiliation>
<hal:affiliation type="researchteam" xml:id="struct-205124" status="OLD">
<idno type="RNSR">200218290B</idno>
<orgName>Autonomous intelligent machine</orgName>
<orgName type="acronym">MAIA</orgName>
<date type="end">2014-12-31</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/equipes/maia</ref>
</desc>
<listRelation>
<relation active="#struct-129671" type="direct"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-423090" type="direct"></relation>
<relation active="#struct-206040" type="indirect"></relation>
<relation active="#struct-413289" type="indirect"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-129671" type="direct">
<org type="laboratory" xml:id="struct-129671" status="VALID">
<idno type="RNSR">198618246Y</idno>
<orgName>INRIA Nancy - Grand Est</orgName>
<desc>
<address>
<addrLine>615 rue du Jardin Botanique 54600 Villers-lès-Nancy</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/nancy</ref>
</desc>
<listRelation>
<relation active="#struct-300009" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-300009" type="indirect">
<org type="institution" xml:id="struct-300009" status="VALID">
<orgName>Institut National de Recherche en Informatique et en Automatique</orgName>
<orgName type="acronym">Inria</orgName>
<desc>
<address>
<addrLine>Domaine de VoluceauRocquencourt - BP 10578153 Le Chesnay Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/en/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-423090" type="direct">
<org type="department" xml:id="struct-423090" status="VALID">
<orgName>Department of Complex Systems, Artificial Intelligence & Robotics</orgName>
<orgName type="acronym">LORIA - AIS</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr/la-recherche-en/departements/complex-system-and-artificial-intelligence</ref>
</desc>
<listRelation>
<relation active="#struct-206040" type="direct"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-413289" type="indirect"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-206040" type="indirect">
<org type="laboratory" xml:id="struct-206040" status="VALID">
<idno type="IdRef">067077927</idno>
<idno type="RNSR">198912571S</idno>
<idno type="IdUnivLorraine">[UL]RSI--</idno>
<orgName>Laboratoire Lorrain de Recherche en Informatique et ses Applications</orgName>
<orgName type="acronym">LORIA</orgName>
<date type="start">2012-01-01</date>
<desc>
<address>
<addrLine>Campus Scientifique BP 239 54506 Vandoeuvre-lès-Nancy Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr</ref>
</desc>
<listRelation>
<relation active="#struct-300009" type="direct"></relation>
<relation active="#struct-413289" type="direct"></relation>
<relation name="UMR7503" active="#struct-441569" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-413289" type="indirect">
<org type="institution" xml:id="struct-413289" status="VALID">
<idno type="IdRef">157040569</idno>
<idno type="IdUnivLorraine">[UL]100--</idno>
<orgName>Université de Lorraine</orgName>
<orgName type="acronym">UL</orgName>
<date type="start">2012-01-01</date>
<desc>
<address>
<addrLine>34 cours Léopold - CS 25233 - 54052 Nancy cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lorraine.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle name="UMR7503" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="ISNI">0000000122597504</idno>
<idno type="IdRef">02636817X</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Charpillet, Francois" sort="Charpillet, Francois" uniqKey="Charpillet F" first="François" last="Charpillet">François Charpillet</name>
<affiliation>
<hal:affiliation type="researchteam" xml:id="struct-415996" status="VALID">
<idno type="RNSR">201521241C</idno>
<orgName>Lifelong Autonomy and interaction skills for Robots in a Sensing ENvironment</orgName>
<orgName type="acronym">LARSEN</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/equipes/larsen</ref>
</desc>
<listRelation>
<relation active="#struct-129671" type="direct"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-423090" type="direct"></relation>
<relation active="#struct-206040" type="indirect"></relation>
<relation active="#struct-413289" type="indirect"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-129671" type="direct">
<org type="laboratory" xml:id="struct-129671" status="VALID">
<idno type="RNSR">198618246Y</idno>
<orgName>INRIA Nancy - Grand Est</orgName>
<desc>
<address>
<addrLine>615 rue du Jardin Botanique 54600 Villers-lès-Nancy</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/nancy</ref>
</desc>
<listRelation>
<relation active="#struct-300009" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-300009" type="indirect">
<org type="institution" xml:id="struct-300009" status="VALID">
<orgName>Institut National de Recherche en Informatique et en Automatique</orgName>
<orgName type="acronym">Inria</orgName>
<desc>
<address>
<addrLine>Domaine de VoluceauRocquencourt - BP 10578153 Le Chesnay Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/en/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-423090" type="direct">
<org type="department" xml:id="struct-423090" status="VALID">
<orgName>Department of Complex Systems, Artificial Intelligence & Robotics</orgName>
<orgName type="acronym">LORIA - AIS</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr/la-recherche-en/departements/complex-system-and-artificial-intelligence</ref>
</desc>
<listRelation>
<relation active="#struct-206040" type="direct"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-413289" type="indirect"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-206040" type="indirect">
<org type="laboratory" xml:id="struct-206040" status="VALID">
<idno type="IdRef">067077927</idno>
<idno type="RNSR">198912571S</idno>
<idno type="IdUnivLorraine">[UL]RSI--</idno>
<orgName>Laboratoire Lorrain de Recherche en Informatique et ses Applications</orgName>
<orgName type="acronym">LORIA</orgName>
<date type="start">2012-01-01</date>
<desc>
<address>
<addrLine>Campus Scientifique BP 239 54506 Vandoeuvre-lès-Nancy Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr</ref>
</desc>
<listRelation>
<relation active="#struct-300009" type="direct"></relation>
<relation active="#struct-413289" type="direct"></relation>
<relation name="UMR7503" active="#struct-441569" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-413289" type="indirect">
<org type="institution" xml:id="struct-413289" status="VALID">
<idno type="IdRef">157040569</idno>
<idno type="IdUnivLorraine">[UL]100--</idno>
<orgName>Université de Lorraine</orgName>
<orgName type="acronym">UL</orgName>
<date type="start">2012-01-01</date>
<desc>
<address>
<addrLine>34 cours Léopold - CS 25233 - 54052 Nancy cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lorraine.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle name="UMR7503" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="ISNI">0000000122597504</idno>
<idno type="IdRef">02636817X</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">HAL</idno>
<idno type="RBID">Hal:hal-01188483</idno>
<idno type="halId">hal-01188483</idno>
<idno type="halUri">https://hal.inria.fr/hal-01188483</idno>
<idno type="url">https://hal.inria.fr/hal-01188483</idno>
<date when="2015-07-22">2015-07-22</date>
<idno type="wicri:Area/Hal/Corpus">002128</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Exploiting separability in multiagent planning with continuous-state mdps (extended abstract)</title>
<author>
<name sortKey="Dibangoye, Jilles Steeve" sort="Dibangoye, Jilles Steeve" uniqKey="Dibangoye J" first="Jilles Steeve" last="Dibangoye">Jilles Steeve Dibangoye</name>
<affiliation>
<hal:affiliation type="researchteam" xml:id="struct-417638" status="VALID">
<idno type="RNSR">201521334D</idno>
<orgName>Robots coopératifs et adaptés à la présence humaine en environnements dynamiques</orgName>
<orgName type="acronym">CHROMA</orgName>
<desc>
<address>
<addrLine>Centre de Recherche Inria655 Avenue de l'Europe, 38330 Montbonnot-Saint-Martin</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/equipes/chroma</ref>
</desc>
<listRelation>
<relation active="#struct-2497" type="direct"></relation>
<relation active="#struct-300009" type="indirect"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-2497" type="direct">
<org type="laboratory" xml:id="struct-2497" status="VALID">
<idno type="RNSR">199218244V</idno>
<orgName>Inria Grenoble - Rhône-Alpes</orgName>
<desc>
<address>
<addrLine>Inovallée655 avenue de l'Europe38330 Montbonnot</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/centre/grenoble</ref>
</desc>
<listRelation>
<relation active="#struct-300009" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-300009" type="indirect">
<org type="institution" xml:id="struct-300009" status="VALID">
<orgName>Institut National de Recherche en Informatique et en Automatique</orgName>
<orgName type="acronym">Inria</orgName>
<desc>
<address>
<addrLine>Domaine de VoluceauRocquencourt - BP 10578153 Le Chesnay Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/en/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Amato, Christopher" sort="Amato, Christopher" uniqKey="Amato C" first="Christopher" last="Amato">Christopher Amato</name>
<affiliation>
<hal:affiliation type="laboratory" xml:id="struct-406632" status="VALID">
<orgName>Computer Science and Artificial Intelligence Laboratory [Cambridge]</orgName>
<orgName type="acronym">CSAIL</orgName>
<desc>
<address>
<addrLine>The Stata Center, Building 32 32 Vassar Street Cambridge, MA 02139</addrLine>
<country key="US"></country>
</address>
<ref type="url">https://www.csail.mit.edu/</ref>
</desc>
<listRelation>
<relation active="#struct-22441" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-22441" type="direct">
<org type="institution" xml:id="struct-22441" status="VALID">
<orgName>Massachusetts Institute of technology [Cambridge]</orgName>
<orgName type="acronym">MIT</orgName>
<desc>
<address>
<addrLine>Massachusetts Avenue Cambridge, MA 02142</addrLine>
<country key="US"></country>
</address>
<ref type="url">http://mit.edu/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Buffet, Olivier" sort="Buffet, Olivier" uniqKey="Buffet O" first="Olivier" last="Buffet">Olivier Buffet</name>
<affiliation>
<hal:affiliation type="researchteam" xml:id="struct-205124" status="OLD">
<idno type="RNSR">200218290B</idno>
<orgName>Autonomous intelligent machine</orgName>
<orgName type="acronym">MAIA</orgName>
<date type="end">2014-12-31</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/equipes/maia</ref>
</desc>
<listRelation>
<relation active="#struct-129671" type="direct"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-423090" type="direct"></relation>
<relation active="#struct-206040" type="indirect"></relation>
<relation active="#struct-413289" type="indirect"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-129671" type="direct">
<org type="laboratory" xml:id="struct-129671" status="VALID">
<idno type="RNSR">198618246Y</idno>
<orgName>INRIA Nancy - Grand Est</orgName>
<desc>
<address>
<addrLine>615 rue du Jardin Botanique 54600 Villers-lès-Nancy</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/nancy</ref>
</desc>
<listRelation>
<relation active="#struct-300009" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-300009" type="indirect">
<org type="institution" xml:id="struct-300009" status="VALID">
<orgName>Institut National de Recherche en Informatique et en Automatique</orgName>
<orgName type="acronym">Inria</orgName>
<desc>
<address>
<addrLine>Domaine de VoluceauRocquencourt - BP 10578153 Le Chesnay Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/en/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-423090" type="direct">
<org type="department" xml:id="struct-423090" status="VALID">
<orgName>Department of Complex Systems, Artificial Intelligence & Robotics</orgName>
<orgName type="acronym">LORIA - AIS</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr/la-recherche-en/departements/complex-system-and-artificial-intelligence</ref>
</desc>
<listRelation>
<relation active="#struct-206040" type="direct"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-413289" type="indirect"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-206040" type="indirect">
<org type="laboratory" xml:id="struct-206040" status="VALID">
<idno type="IdRef">067077927</idno>
<idno type="RNSR">198912571S</idno>
<idno type="IdUnivLorraine">[UL]RSI--</idno>
<orgName>Laboratoire Lorrain de Recherche en Informatique et ses Applications</orgName>
<orgName type="acronym">LORIA</orgName>
<date type="start">2012-01-01</date>
<desc>
<address>
<addrLine>Campus Scientifique BP 239 54506 Vandoeuvre-lès-Nancy Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr</ref>
</desc>
<listRelation>
<relation active="#struct-300009" type="direct"></relation>
<relation active="#struct-413289" type="direct"></relation>
<relation name="UMR7503" active="#struct-441569" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-413289" type="indirect">
<org type="institution" xml:id="struct-413289" status="VALID">
<idno type="IdRef">157040569</idno>
<idno type="IdUnivLorraine">[UL]100--</idno>
<orgName>Université de Lorraine</orgName>
<orgName type="acronym">UL</orgName>
<date type="start">2012-01-01</date>
<desc>
<address>
<addrLine>34 cours Léopold - CS 25233 - 54052 Nancy cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lorraine.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle name="UMR7503" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="ISNI">0000000122597504</idno>
<idno type="IdRef">02636817X</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
</affiliation>
</author>
<author>
<name sortKey="Charpillet, Francois" sort="Charpillet, Francois" uniqKey="Charpillet F" first="François" last="Charpillet">François Charpillet</name>
<affiliation>
<hal:affiliation type="researchteam" xml:id="struct-415996" status="VALID">
<idno type="RNSR">201521241C</idno>
<orgName>Lifelong Autonomy and interaction skills for Robots in a Sensing ENvironment</orgName>
<orgName type="acronym">LARSEN</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/equipes/larsen</ref>
</desc>
<listRelation>
<relation active="#struct-129671" type="direct"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-423090" type="direct"></relation>
<relation active="#struct-206040" type="indirect"></relation>
<relation active="#struct-413289" type="indirect"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-129671" type="direct">
<org type="laboratory" xml:id="struct-129671" status="VALID">
<idno type="RNSR">198618246Y</idno>
<orgName>INRIA Nancy - Grand Est</orgName>
<desc>
<address>
<addrLine>615 rue du Jardin Botanique 54600 Villers-lès-Nancy</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/nancy</ref>
</desc>
<listRelation>
<relation active="#struct-300009" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-300009" type="indirect">
<org type="institution" xml:id="struct-300009" status="VALID">
<orgName>Institut National de Recherche en Informatique et en Automatique</orgName>
<orgName type="acronym">Inria</orgName>
<desc>
<address>
<addrLine>Domaine de VoluceauRocquencourt - BP 10578153 Le Chesnay Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/en/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-423090" type="direct">
<org type="department" xml:id="struct-423090" status="VALID">
<orgName>Department of Complex Systems, Artificial Intelligence & Robotics</orgName>
<orgName type="acronym">LORIA - AIS</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr/la-recherche-en/departements/complex-system-and-artificial-intelligence</ref>
</desc>
<listRelation>
<relation active="#struct-206040" type="direct"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-413289" type="indirect"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-206040" type="indirect">
<org type="laboratory" xml:id="struct-206040" status="VALID">
<idno type="IdRef">067077927</idno>
<idno type="RNSR">198912571S</idno>
<idno type="IdUnivLorraine">[UL]RSI--</idno>
<orgName>Laboratoire Lorrain de Recherche en Informatique et ses Applications</orgName>
<orgName type="acronym">LORIA</orgName>
<date type="start">2012-01-01</date>
<desc>
<address>
<addrLine>Campus Scientifique BP 239 54506 Vandoeuvre-lès-Nancy Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr</ref>
</desc>
<listRelation>
<relation active="#struct-300009" type="direct"></relation>
<relation active="#struct-413289" type="direct"></relation>
<relation name="UMR7503" active="#struct-441569" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-413289" type="indirect">
<org type="institution" xml:id="struct-413289" status="VALID">
<idno type="IdRef">157040569</idno>
<idno type="IdUnivLorraine">[UL]100--</idno>
<orgName>Université de Lorraine</orgName>
<orgName type="acronym">UL</orgName>
<date type="start">2012-01-01</date>
<desc>
<address>
<addrLine>34 cours Léopold - CS 25233 - 54052 Nancy cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lorraine.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle name="UMR7503" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="ISNI">0000000122597504</idno>
<idno type="IdRef">02636817X</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
</affiliation>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Decentralized partially observable Markov deci- sion processes (Dec-POMDPs) provide a general model for decision-making under uncertainty in co- operative decentralized settings, but are difficult to solve optimally (NEXP-Complete). As a new way of solving these problems, we recently intro- duced a method for transforming a Dec-POMDP into a continuous-state deterministic MDP with a piecewise-linear and convex value function. This new Dec-POMDP formulation, which we call an occupancy MDP, allows powerful POMDP and continuous-state MDP methods to be used for the first time. However, scalability remains limited when the number of agents or problem variables becomes large. In this paper, we show that, un- der certain separability conditions of the optimal value function, the scalability of this approach can increase considerably. This separability is present when there is locality of interaction be- tween agents, which can be exploited to improve performance. Unlike most previous methods, the novel continuous-state MDP algorithm retains op- timality and convergence guarantees. Results show that the extension using separability can scale to a large number of agents and domain variables while maintaining optimality.</div>
</front>
</TEI>
<hal api="V3">
<titleStmt>
<title xml:lang="en">Exploiting separability in multiagent planning with continuous-state mdps (extended abstract)</title>
<author role="aut">
<persName>
<forename type="first">Jilles Steeve</forename>
<surname>Dibangoye</surname>
</persName>
<email>jilles-steeve.dibangoye@insa-lyon.fr</email>
<ptr type="url" target="http://jilles.dibangoye.net/"></ptr>
<idno type="idhal">jilles-steeve-dibangoye</idno>
<idno type="halauthor">1123508</idno>
<affiliation ref="#struct-417638"></affiliation>
<affiliation ref="#struct-203831"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Christopher</forename>
<surname>Amato</surname>
</persName>
<email>camato@csail.mit.edu</email>
<ptr type="url" target="http://people.csail.mit.edu/camato/"></ptr>
<idno type="halauthor">946035</idno>
<affiliation ref="#struct-406632"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Olivier</forename>
<surname>Buffet</surname>
</persName>
<email>olivier.buffet@loria.fr</email>
<ptr type="url" target="http://www.loria.fr/~buffet/"></ptr>
<idno type="idhal">olivier-buffet</idno>
<idno type="halauthor">660501</idno>
<affiliation ref="#struct-205124"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">François</forename>
<surname>Charpillet</surname>
</persName>
<email>francois.charpillet@inria.fr</email>
<ptr type="url" target="http://www.loria.fr/~charp/"></ptr>
<idno type="idhal">francois-charpillet</idno>
<idno type="halauthor">1201239</idno>
<affiliation ref="#struct-415996"></affiliation>
</author>
<editor role="depositor">
<persName>
<forename>Jilles Steeve</forename>
<surname>Dibangoye</surname>
</persName>
<email>jilles-steeve.dibangoye@insa-lyon.fr</email>
</editor>
</titleStmt>
<editionStmt>
<edition n="v1" type="current">
<date type="whenSubmitted">2015-08-31 00:08:11</date>
<date type="whenModified">2015-11-18 01:12:41</date>
<date type="whenReleased">2015-08-31 00:08:11</date>
<date type="whenProduced">2015-07-22</date>
</edition>
<respStmt>
<resp>contributor</resp>
<name key="178863">
<persName>
<forename>Jilles Steeve</forename>
<surname>Dibangoye</surname>
</persName>
<email>jilles-steeve.dibangoye@insa-lyon.fr</email>
</name>
</respStmt>
</editionStmt>
<publicationStmt>
<distributor>CCSD</distributor>
<idno type="halId">hal-01188483</idno>
<idno type="halUri">https://hal.inria.fr/hal-01188483</idno>
<idno type="halBibtex">dibangoye:hal-01188483</idno>
<idno type="halRefHtml">Proceedings of the twenty-fourth international joint conference on artificial intelligence, IJCAI 2015, Jul 2015, Buenos Aires, Argentina. 2015</idno>
<idno type="halRef">Proceedings of the twenty-fourth international joint conference on artificial intelligence, IJCAI 2015, Jul 2015, Buenos Aires, Argentina. 2015</idno>
</publicationStmt>
<seriesStmt>
<idno type="stamp" n="CNRS">CNRS - Centre national de la recherche scientifique</idno>
<idno type="stamp" n="INRIA-RHA">INRIA Grenoble - Rhône-Alpes</idno>
<idno type="stamp" n="INRIA-LORRAINE">INRIA Nancy - Grand Est</idno>
<idno type="stamp" n="INRIA-NANCY-GRAND-EST">INRIA Nancy - Grand Est</idno>
<idno type="stamp" n="LORIA-CSAI" p="LORIA">Systèmes complexes et intelligence artificielle</idno>
<idno type="stamp" n="LORIA2">Publications du LORIA</idno>
<idno type="stamp" n="INRIA">INRIA - Institut National de Recherche en Informatique et en Automatique</idno>
<idno type="stamp" n="LORIA">LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications</idno>
<idno type="stamp" n="INRIA_TEST">INRIA - Institut National de Recherche en Informatique et en Automatique</idno>
<idno type="stamp" n="UNIV-LORRAINE">Université de Lorraine</idno>
<idno type="stamp" n="INSA-LYON">Institut National des Sciences Appliquées de Lyon</idno>
<idno type="stamp" n="INRIA2">INRIA 2</idno>
</seriesStmt>
<notesStmt>
<note type="audience" n="2">International</note>
<note type="invited" n="0">No</note>
<note type="popular" n="0">No</note>
<note type="peer" n="1">Yes</note>
<note type="proceedings" n="0">No</note>
</notesStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Exploiting separability in multiagent planning with continuous-state mdps (extended abstract)</title>
<author role="aut">
<persName>
<forename type="first">Jilles Steeve</forename>
<surname>Dibangoye</surname>
</persName>
<email>jilles-steeve.dibangoye@insa-lyon.fr</email>
<ptr type="url" target="http://jilles.dibangoye.net/"></ptr>
<idno type="idHal">jilles-steeve-dibangoye</idno>
<idno type="halAuthorId">1123508</idno>
<affiliation ref="#struct-417638"></affiliation>
<affiliation ref="#struct-203831"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Christopher</forename>
<surname>Amato</surname>
</persName>
<email>camato@csail.mit.edu</email>
<ptr type="url" target="http://people.csail.mit.edu/camato/"></ptr>
<idno type="halAuthorId">946035</idno>
<affiliation ref="#struct-406632"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">Olivier</forename>
<surname>Buffet</surname>
</persName>
<email>olivier.buffet@loria.fr</email>
<ptr type="url" target="http://www.loria.fr/~buffet/"></ptr>
<idno type="idHal">olivier-buffet</idno>
<idno type="halAuthorId">660501</idno>
<affiliation ref="#struct-205124"></affiliation>
</author>
<author role="aut">
<persName>
<forename type="first">François</forename>
<surname>Charpillet</surname>
</persName>
<email>francois.charpillet@inria.fr</email>
<ptr type="url" target="http://www.loria.fr/~charp/"></ptr>
<idno type="idHal">francois-charpillet</idno>
<idno type="halAuthorId">1201239</idno>
<affiliation ref="#struct-415996"></affiliation>
</author>
</analytic>
<monogr>
<meeting>
<title>Proceedings of the twenty-fourth international joint conference on artificial intelligence, IJCAI 2015</title>
<date type="start">2015-07-22</date>
<date type="end">2015-07-31</date>
<settlement>Buenos Aires</settlement>
<country key="AR">Argentina</country>
</meeting>
<imprint>
<date type="datePub">2015-07-22</date>
</imprint>
</monogr>
</biblStruct>
</sourceDesc>
<profileDesc>
<langUsage>
<language ident="en">English</language>
</langUsage>
<textClass>
<classCode scheme="halDomain" n="info.info-ai">Computer Science [cs]/Artificial Intelligence [cs.AI]</classCode>
<classCode scheme="halTypology" n="COMM">Conference papers</classCode>
</textClass>
<abstract xml:lang="en">Decentralized partially observable Markov deci- sion processes (Dec-POMDPs) provide a general model for decision-making under uncertainty in co- operative decentralized settings, but are difficult to solve optimally (NEXP-Complete). As a new way of solving these problems, we recently intro- duced a method for transforming a Dec-POMDP into a continuous-state deterministic MDP with a piecewise-linear and convex value function. This new Dec-POMDP formulation, which we call an occupancy MDP, allows powerful POMDP and continuous-state MDP methods to be used for the first time. However, scalability remains limited when the number of agents or problem variables becomes large. In this paper, we show that, un- der certain separability conditions of the optimal value function, the scalability of this approach can increase considerably. This separability is present when there is locality of interaction be- tween agents, which can be exploited to improve performance. Unlike most previous methods, the novel continuous-state MDP algorithm retains op- timality and convergence guarantees. Results show that the extension using separability can scale to a large number of agents and domain variables while maintaining optimality.</abstract>
</profileDesc>
</hal>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Hal/Corpus
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002128 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Hal/Corpus/biblio.hfd -nk 002128 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    Hal
   |étape=   Corpus
   |type=    RBID
   |clé=     Hal:hal-01188483
   |texte=   Exploiting separability in multiagent planning with continuous-state mdps (extended abstract)
}}

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022