Recherche - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu

Filtrer vos résultats

179 résultats
keyword_s : Reinforcement learning
Image document

A Survey on Deep Learning for Skeleton‐Based Human Animation

Lucas Mourot , Ludovic Hoyet , François Le Clerc , François Schnitzler , Pierre Hellier
Computer Graphics Forum, 2022, 41 (1), pp.122-157. ⟨10.1111/cgf.14426⟩
Article dans une revue hal-03468599v1

Adaptive Combination of Behaviors in an Agent

Olivier Buffet , Alain Dutech , François Charpillet
European Conference on Artificial Intelligence - ECAI'02, 2002, Lyon, France, pp.48-52
Communication dans un congrès inria-00100766v1

Improving Coordination with Communication in Multiagent Reinforcement Learning

Daniel Szer , François Charpillet
16th IEEE International Conference on Tools with Artificial Intelligence - ICTAI'04, 2004, Boca Raton, USA, 5 p
Communication dans un congrès inria-00100165v1
Image document

High-Accuracy Value-Function Approximation with Neural Networks Applied to the Acrobot

Rémi Coulom
12th European Symposium on Artificial Neural Networks - ESANN'2004, Michel Verleysen, 2004, Bruges, Belgique, pp.7-12
Communication dans un congrès inria-00107776v1

A connectionist architecture that adpats its representation to complex tasks

Bruno Scherrer
International Joint Conference on Neural Networks - IJCNN 2002, 2002, Hilton hawaiian Village, Honolulu, HI, 6 p
Communication dans un congrès inria-00100735v1
Image document

Sur certaines méthodes raisonnées pour l'apprentissage par renforcement profond

Léonard Blier
Apprentissage [cs.LG]. Université Paris-Saclay, 2022. Français. ⟨NNT : 2022UPASG040⟩
Thèse tel-03829500v1
Image document

The Globus Pallidus Pars Interna in Goal-Oriented and Routine Behaviors: Resolving a Long-Standing Paradox

Camille Piron , Daisuke Kase , Meropi Topalidou , Michel Goillandeau , Hugues Orignac , et al.
Movement Disorders, 2016, ⟨10.1002/mds.26542⟩
Article dans une revue hal-01317968v1

Solving POMDPs using selected past events

Alain Dutech
European Conference on Artificial Intelligence, 2000, Berlin, Germany
Communication dans un congrès inria-00099378v1
Image document

DeepRoute: Herding Elephant and Mice Flows with Reinforcement Learning

Mariam Kiran , Bashir Mohammed , Nandini Krishnaswamy
MLN 2019 - 2nd International Conference on Machine Learning for Networking, Dec 2019, Paris, France. pp.296-314, ⟨10.1007/978-3-030-45778-5_20⟩
Communication dans un congrès hal-03266462v1
Image document

A Network-assisted Approach for RAT Selection in Heterogeneous Cellular Networks

Melhem El Helou , Marc Ibrahim , Samer Lahoud , Kinda Khawam , Dany Mezher , et al.
IEEE Journal on Selected Areas in Communications, 2015, 33 (6), pp.1055-1067. ⟨10.1109/JSAC.2015.2416987⟩
Article dans une revue hal-01141520v1
Image document

Automated Placement of In-Network ACL Rules

Wafik Zahwa , Abdelkader Lahmadi , Michael Rusinowitch , Mondher Ayadi
2023 IEEE 9th International Conference on Network Softwarization (NetSoft), Jun 2023, Madrid, Spain. pp.486-491, ⟨10.1109/NetSoft57336.2023.10175436⟩
Communication dans un congrès hal-04236850v1

QBSO-FS: A Reinforcement Learning Based Bee Swarm Optimization Metaheuristic for Feature Selection

Souhila Sadeg , Leila Hamdad , Amine Riad Remache , Mehdi Nedjmeddine Karech , Karima Benatchba , et al.
International Work-Conference on Artificial Neural Networks, 11507, pp.785-796, 2019, Advances in Computational Intelligence. IWANN 2019, 978-3-030-20517-1. ⟨10.1007/978-3-030-20518-8_65⟩
Chapitre d'ouvrage hal-03251457v1
Image document

On the role of Actions and Machine Learning in Artificial Agent Perception.

Hugo Caselles-Dupré
Machine Learning [cs.LG]. Institut Polytechnique de Paris, 2021. English. ⟨NNT : 2021IPPAE006⟩
Thèse tel-03352421v1
Image document

Computational modeling of cognitive control for rule-guided behavior

Snigdha Dagar
Modeling and Simulation. Université de Bordeaux, 2023. English. ⟨NNT : 2023BORD0106⟩
Thèse tel-04301585v1
Image document

Intrinsic Motivation for Autonomous Mental Development

Pierre-Yves Oudeyer , Frédéric Kaplan , Véréna Hafner
IEEE Transactions on Evolutionary Computation, 2007, 11 (2), pp.265-286. ⟨10.1109/TEVC.2006.890271⟩
Article dans une revue hal-00793610v1
Image document

Using Confounded Data in Latent Model-Based Reinforcement Learning

Maxime Gasse , Damien Grasset , Guillaume Gaudron , Pierre-Yves Oudeyer
Transactions on Machine Learning Research Journal, 2023
Article dans une revue hal-04404106v1
Image document

Information per unit of interaction in stochastic sequential decision making

Fabien Pesquerel
Artificial Intelligence [cs.AI]. Université de Lille, 2023. English. ⟨NNT : ⟩
Thèse tel-04501905v1
Image document

SMPyBandits: an Experimental Framework for Single and Multi-Players Multi-Arms Bandits Algorithms in Python

Lilian Besson
2018
Pré-publication, Document de travail hal-01840022v1
Image document

Whittle index based Q-learning for restless bandits with average reward

Konstantin E Avrachenkov , Vivek Borkar
Automatica, 2022, 139, pp.110186. ⟨10.1016/j.automatica.2022.110186⟩
Article dans une revue hal-03582664v1
Image document

Magnetic control of WEST plasmas through deep reinforcement learning

S Kerboua-Benlarbi , R Nouailletas , Blaise Faugeras , E Nardon , P Moreau
2023
Pré-publication, Document de travail hal-04393963v2
Image document

Anderson acceleration for reinforcement learning

Matthieu Geist , Bruno Scherrer
EWRL 2018 - 4th European workshop on Reinforcement Learning, Oct 2018, Lille, France
Communication dans un congrès hal-01928142v1

Using “Social actions” and RL-algorithms to build policies in DEC-POMDP

Vincent Thomas , Mahuna Akplogan
IADIS International Journal on Computer Science and Information Systems, 2009, 4 (3), pp.82-98
Article dans une revue inria-00536851v1
Image document

Une double approche modulaire de l'apprentissage par renforcement pour des agents intelligents adaptatifs

Olivier Buffet
Informatique [cs]. Université Henri Poincaré - Nancy I, 2003. Français. ⟨NNT : ⟩
Thèse tel-00509349v1
Image document

Asking for Knowledge : Training RL Agents to Query External Knowledge Using Language

Iou-Jen Liu , Xingdi Yuan , Marc-Alexandre Côté , Pierre-Yves Oudeyer , Alexander G Schwing
ICML 2022 - 39th International Conference on Machine Learning, Jul 2022, Baltimore, United States
Communication dans un congrès hal-03897379v1
Image document

Application of reinforcement learning to control a multi-agent system

François Klein , Christine Bourjot , Vincent Chevrier
International Conference on Agents and Artificial Intelligence - ICAART 09, Jan 2009, Porto, Portugal
Communication dans un congrès inria-00336173v1
Image document

Towards Scalable Adaptive Learning with Graph Neural Networks and Reinforcement Learning

Jean Vassoyan , Jill-Jênn Vie , Pirmin Lemberger
EDM 2023 - 16th International Conference on Educational Data Mining, Jul 2023, Bangalore, India
Communication dans un congrès hal-04108408v1
Image document

Finite Time Analysis of Linear Two-timescale Stochastic Approximation with Markovian Noise

Maxim Kaledin , Eric Moulines , Alexey Naumov , Vladislav Tadic , Hoi-To Wai
COLT 2020 - 33rd Conference on Learning Theory, Jul 2020, Graz / Virtual, Austria
Communication dans un congrès hal-03033458v1
Image document

Sample-efficient deep reinforcement learning for control, exploration and safety

Yannis Flet-Berliac
Machine Learning [cs.LG]. Université de Lille, 2021. English. ⟨NNT : 2021LILUB009⟩
Thèse tel-03526401v2

A Self-Made Agent Based on Action-Selection

Olivier Buffet , Alain Dutech
Sixth European Workshop on Reinforcement Learning - EWRL-6 2003, 2003, Nancy, France, pp.47-48
Communication dans un congrès inria-00099828v1
Image document

Finite-Sample Analysis of Least-Squares Policy Iteration

Alessandro Lazaric , Mohammad Ghavamzadeh , Rémi Munos
Journal of Machine Learning Research, 2012, 13, pp.3041-3074
Article dans une revue hal-00772060v1