Improving Generalization in Reinforcement Learning Through Forked Agents

Olivier Moulin, Vincent Francois-Lavet, Paul Elbers, Mark Hoogendoorn

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Abstract

An eco-system of agents, each having their own policy with limited generalizability, has proven to be a reliable approach to increase generalization across procedurally generated environments. In such an approach, new agents are regularly added to the eco-system when encountering a new environment that is outside of the scope of the eco-system. The speed of adaptation and general effectiveness of the eco-system approach highly depends on the initialization of new agents. In this paper we propose different initialization techniques, inspired from Deep Neural Network initialization and transfer learning, and study their impact.
Original languageEnglish
Title of host publicationAdvances and Trends in Artificial Intelligence. Theory and Applications - 36th International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2023, Proceedings, Part II
EditorsHamido Fujita, Yinglin Wang, Yanghua Xiao, Ali Moonis
PublisherSpringer Science and Business Media Deutschland GmbH
Pages249-260
Number of pages12
Volume13926 LNAI
ISBN (Print)9783031368219
DOIs
Publication statusPublished - 2023
Event36th International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2023 - Shanghai, Switzerland
Duration: 19 Jul 202322 Jul 2023

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13926 LNAI

Conference

Conference36th International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2023
Country/TerritorySwitzerland
CityShanghai
Period19/07/202322/07/2023

Keywords

  • Agents
  • Generalization
  • Reinforcement Learning

Cite this