Filtrer vos résultats
- 146
- 33
- 81
- 36
- 35
- 12
- 10
- 3
- 2
- 6
- 164
- 21
- 18
- 7
- 5
- 2
- 1
- 1
- 790
- 659
- 610
- 589
- 535
- 478
- 431
- 391
- 370
- 324
- 322
- 317
- 312
- 307
- 305
- 294
- 290
- 283
- 266
- 259
- 251
- 226
- 220
- 218
- 218
- 218
- 216
- 215
- 213
- 212
- 211
- 205
- 202
- 201
- 200
- 197
- 196
- 195
- 192
- 191
- 191
- 189
- 183
- 182
- 179
- 179
- 178
- 176
- 174
- 171
- 170
- 168
- 164
- 162
- 161
- 161
- 159
- 157
- 157
- 156
- 155
- 154
- 154
- 152
- 152
- 152
- 150
- 150
- 149
- 148
- 148
- 147
- 144
- 142
- 142
- 141
- 141
- 141
- 140
- 139
- 139
- 139
- 138
- 137
- 137
- 137
- 136
- 136
- 136
- 136
- 135
- 135
- 133
- 132
- 131
- 131
- 131
- 131
- 131
- 128
- 1
- 22
- 29
- 9
- 11
- 8
- 8
- 4
- 11
- 5
- 4
- 3
- 5
- 2
- 3
- 8
- 7
- 5
- 1
- 6
- 17
- 6
- 3
- 1
- 156
- 23
- 52
- 42
- 39
- 26
- 24
- 22
- 14
- 13
- 13
- 11
- 11
- 10
- 9
- 9
- 9
- 8
- 7
- 4
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 16
- 13
- 13
- 11
- 10
- 6
- 5
- 5
- 5
- 4
- 4
- 4
- 4
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
|
A Survey on Deep Learning for Skeleton‐Based Human AnimationComputer Graphics Forum, 2022, 41 (1), pp.122-157. ⟨10.1111/cgf.14426⟩
Article dans une revue
hal-03468599v1
|
||
Adaptive Combination of Behaviors in an AgentEuropean Conference on Artificial Intelligence - ECAI'02, 2002, Lyon, France, pp.48-52
Communication dans un congrès
inria-00100766v1
|
|||
Improving Coordination with Communication in Multiagent Reinforcement Learning16th IEEE International Conference on Tools with Artificial Intelligence - ICTAI'04, 2004, Boca Raton, USA, 5 p
Communication dans un congrès
inria-00100165v1
|
|||
|
High-Accuracy Value-Function Approximation with Neural Networks Applied to the Acrobot12th European Symposium on Artificial Neural Networks - ESANN'2004, Michel Verleysen, 2004, Bruges, Belgique, pp.7-12
Communication dans un congrès
inria-00107776v1
|
||
A connectionist architecture that adpats its representation to complex tasksInternational Joint Conference on Neural Networks - IJCNN 2002, 2002, Hilton hawaiian Village, Honolulu, HI, 6 p
Communication dans un congrès
inria-00100735v1
|
|||
|
Sur certaines méthodes raisonnées pour l'apprentissage par renforcement profondApprentissage [cs.LG]. Université Paris-Saclay, 2022. Français. ⟨NNT : 2022UPASG040⟩
Thèse
tel-03829500v1
|
||
|
The Globus Pallidus Pars Interna in Goal-Oriented and Routine Behaviors: Resolving a Long-Standing ParadoxMovement Disorders, 2016, ⟨10.1002/mds.26542⟩
Article dans une revue
hal-01317968v1
|
||
Solving POMDPs using selected past eventsEuropean Conference on Artificial Intelligence, 2000, Berlin, Germany
Communication dans un congrès
inria-00099378v1
|
|||
|
DeepRoute: Herding Elephant and Mice Flows with Reinforcement LearningMLN 2019 - 2nd International Conference on Machine Learning for Networking, Dec 2019, Paris, France. pp.296-314, ⟨10.1007/978-3-030-45778-5_20⟩
Communication dans un congrès
hal-03266462v1
|
||
|
A Network-assisted Approach for RAT Selection in Heterogeneous Cellular NetworksIEEE Journal on Selected Areas in Communications, 2015, 33 (6), pp.1055-1067. ⟨10.1109/JSAC.2015.2416987⟩
Article dans une revue
hal-01141520v1
|
||
|
Automated Placement of In-Network ACL Rules2023 IEEE 9th International Conference on Network Softwarization (NetSoft), Jun 2023, Madrid, Spain. pp.486-491, ⟨10.1109/NetSoft57336.2023.10175436⟩
Communication dans un congrès
hal-04236850v1
|
||
QBSO-FS: A Reinforcement Learning Based Bee Swarm Optimization Metaheuristic for Feature SelectionInternational Work-Conference on Artificial Neural Networks, 11507, pp.785-796, 2019, Advances in Computational Intelligence. IWANN 2019, 978-3-030-20517-1. ⟨10.1007/978-3-030-20518-8_65⟩
Chapitre d'ouvrage
hal-03251457v1
|
|||
|
On the role of Actions and Machine Learning in Artificial Agent Perception.Machine Learning [cs.LG]. Institut Polytechnique de Paris, 2021. English. ⟨NNT : 2021IPPAE006⟩
Thèse
tel-03352421v1
|
||
|
Computational modeling of cognitive control for rule-guided behaviorModeling and Simulation. Université de Bordeaux, 2023. English. ⟨NNT : 2023BORD0106⟩
Thèse
tel-04301585v1
|
||
|
Intrinsic Motivation for Autonomous Mental DevelopmentIEEE Transactions on Evolutionary Computation, 2007, 11 (2), pp.265-286. ⟨10.1109/TEVC.2006.890271⟩
Article dans une revue
hal-00793610v1
|
||
|
Using Confounded Data in Latent Model-Based Reinforcement LearningTransactions on Machine Learning Research Journal, 2023
Article dans une revue
hal-04404106v1
|
||
|
Information per unit of interaction in stochastic sequential decision makingArtificial Intelligence [cs.AI]. Université de Lille, 2023. English. ⟨NNT : ⟩
Thèse
tel-04501905v1
|
||
|
SMPyBandits: an Experimental Framework for Single and Multi-Players Multi-Arms Bandits Algorithms in Python2018
Pré-publication, Document de travail
hal-01840022v1
|
||
|
Whittle index based Q-learning for restless bandits with average rewardAutomatica, 2022, 139, pp.110186. ⟨10.1016/j.automatica.2022.110186⟩
Article dans une revue
hal-03582664v1
|
||
|
Magnetic control of WEST plasmas through deep reinforcement learning2023
Pré-publication, Document de travail
hal-04393963v2
|
||
|
Anderson acceleration for reinforcement learningEWRL 2018 - 4th European workshop on Reinforcement Learning, Oct 2018, Lille, France
Communication dans un congrès
hal-01928142v1
|
||
Using “Social actions” and RL-algorithms to build policies in DEC-POMDPIADIS International Journal on Computer Science and Information Systems, 2009, 4 (3), pp.82-98
Article dans une revue
inria-00536851v1
|
|||
|
Une double approche modulaire de l'apprentissage par renforcement pour des agents intelligents adaptatifsInformatique [cs]. Université Henri Poincaré - Nancy I, 2003. Français. ⟨NNT : ⟩
Thèse
tel-00509349v1
|
||
|
Asking for Knowledge : Training RL Agents to Query External Knowledge Using LanguageICML 2022 - 39th International Conference on Machine Learning, Jul 2022, Baltimore, United States
Communication dans un congrès
hal-03897379v1
|
||
|
Application of reinforcement learning to control a multi-agent systemInternational Conference on Agents and Artificial Intelligence - ICAART 09, Jan 2009, Porto, Portugal
Communication dans un congrès
inria-00336173v1
|
||
|
Towards Scalable Adaptive Learning with Graph Neural Networks and Reinforcement LearningEDM 2023 - 16th International Conference on Educational Data Mining, Jul 2023, Bangalore, India
Communication dans un congrès
hal-04108408v1
|
||
|
Finite Time Analysis of Linear Two-timescale Stochastic Approximation with Markovian NoiseCOLT 2020 - 33rd Conference on Learning Theory, Jul 2020, Graz / Virtual, Austria
Communication dans un congrès
hal-03033458v1
|
||
|
Sample-efficient deep reinforcement learning for control, exploration and safetyMachine Learning [cs.LG]. Université de Lille, 2021. English. ⟨NNT : 2021LILUB009⟩
Thèse
tel-03526401v2
|
||
A Self-Made Agent Based on Action-SelectionSixth European Workshop on Reinforcement Learning - EWRL-6 2003, 2003, Nancy, France, pp.47-48
Communication dans un congrès
inria-00099828v1
|
|||
|
Finite-Sample Analysis of Least-Squares Policy IterationJournal of Machine Learning Research, 2012, 13, pp.3041-3074
Article dans une revue
hal-00772060v1
|