Serveur d'exploration sur la recherche en informatique en Lorraine - Exploration (Accueil)

Index « Keywords » - entrée « reinforcement learning »
Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.
reinforcement < reinforcement learning < reinforcement learning.  Facettes :

List of bibliographic references indexed by reinforcement learning

Number of relevant bibliographic references: 50.
[0-20] [0 - 20][0 - 50][20-40]
Ident.Authors (with country if any)Title
000722 (2015) Bruno Scherrer [France] ; Mohammad Ghavamzadeh [France] ; Victor Gabillon [France] ; Boris Lesner [France] ; Matthieu Geist [France]Approximate Modified Policy Iteration and its Application to the Game of Tetris
001334 (2013-01-01) Bruno Scherrer [France]Performance Bounds for Lambda Policy Iteration and Application to the Game of Tetris
002C21 (2010-06-23) Sheila Becker [France] ; Humberto Abdelnur [France] ; Radu State [Luxembourg (pays)] ; Thomas Engel [Luxembourg (pays)]An Autonomic Testing Framework for IPv6 Configuration Protocols
003538 (2009-06-21) Vincent Thomas [France] ; Mahuna Akplogan [France]Using "social actions" and RL algorithms to build policies in Dec-POMDP
003732 (2009-01) François Klein [France] ; Christine Bourjot [France] ; Vincent Chevrier [France]Application of reinforcement learning to control a multi-agent system
003976 (2009) François Klein [France] ; Christine Bourjot [France] ; Vincent Chevrier [France]Contribution to the Control of a MAS’s Global Behaviour: Reinforcement Learning Tools
003C18 (2009) Vincent Thomas [France] ; Mahuna Akplogan [France]Using “Social actions” and RL-algorithms to build policies in DEC-POMDP
003C68 (2009) Christophe Thiery [France] ; Bruno Scherrer [France]Improvements on Learning Tetris with Cross Entropy
003C92 (2009) Christophe Thiery [France] ; Bruno Scherrer [France]Building Controllers for Tetris
003E77 (2008-09) François Klein [France] ; Christine Bourjot [France] ; Vincent Chevrier [France]Controlling the Global Behaviour of a Reactive MAS : Reinforcement Learning Tools
005D52 (2005) Bruno ScherrerAsynchronous Neurocomputing for optimal control and reinforcement learning with large state spaces
006693 (2004) Romaric Charton ; Anne Boyer ; François CharpilletApprentissage de stratégies de coordination dans les hSMA
006725 (2004) Alain Dutech ; Olivier Buffet ; François CharpilletDéveloppement autonome des comportements de base d'un agent
006792 (2004) Daniel Szer ; François CharpilletCommunication et apprentissage par renforcement pour une équipe d'agents
006795 (2004) Daniel Szer ; François CharpilletCoordination through Mutual Notification in Cooperative Multiagent Reinforcement Learning
006796 (2004) Daniel Szer ; François CharpilletCoordination through Mutual Notification in Cooperative Multiagent Reinforcement Learning
006861 (2004) Olivier Buffet ; Alain Dutech ; François CharpilletSelf-Growth of Basic Behaviors in an Action Selection Based Agent
006864 (2004) Rémi CoulomHigh-Accuracy Value-Function Approximation with Neural Networks Applied to the Acrobot
006E38 (2004) Romaric Charton [France] ; Anne Boyer [France] ; François Charpillet [France]Apprentissage de stratégies de coordination dans les hSMA
007133 (2003-12-02) Romaric Charton [France]Intelligent agents in multimedia communication environments: Towards the design of adaptive services
007165 (2003-09-10) Olivier Buffet [France]A Twofold Modular Approach of Reinforcement Learning for Adaptive Intelligent Agents

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Exploration
HfdIndexSelect -h $EXPLOR_AREA/Data/Main/Exploration/KwdEn.i -k "reinforcement learning" 
HfdIndexSelect -h $EXPLOR_AREA/Data/Main/Exploration/KwdEn.i  \
                -Sk "reinforcement learning" \
         | HfdSelect -Kh $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd 

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    Main
   |étape=   Exploration
   |type=    indexItem
   |index=    KwdEn.i
   |clé=    reinforcement learning
}}

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022