TY - GEN
T1 - Improving generalization to new environments and removing catastrophic forgetting in Reinforcement Learning by using an eco-system of agents
AU - Moulin, Olivier
AU - Vincent-Lavet, Francois
AU - Elbers, Paul
AU - Hoogendoorn, Mark
PY - 2022
Y1 - 2022
N2 - Adapting a Reinforcement Learning (RL) agent to an unseen environment is a difficult task due to typical over-fitting on the training environment. RL agents are often capable of solving environments very close to the trained environment, but when environments become substantially different, their performance quickly drops. When agents are retrained on new environments, a second issue arises: there is a risk of catastrophic forgetting, where the performance on previously seen environments is seriously hampered. This paper proposes a novel approach that exploits an eco-system of agents to address both concerns. Hereby, the (limited) adaptive power of individual agents is harvested to build a highly adaptive eco-system.
AB - Adapting a Reinforcement Learning (RL) agent to an unseen environment is a difficult task due to typical over-fitting on the training environment. RL agents are often capable of solving environments very close to the trained environment, but when environments become substantially different, their performance quickly drops. When agents are retrained on new environments, a second issue arises: there is a risk of catastrophic forgetting, where the performance on previously seen environments is seriously hampered. This paper proposes a novel approach that exploits an eco-system of agents to address both concerns. Hereby, the (limited) adaptive power of individual agents is harvested to build a highly adaptive eco-system.
UR - https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85158876637&origin=inward
U2 - https://doi.org/10.1109/WI-IAT55865.2022.00032
DO - https://doi.org/10.1109/WI-IAT55865.2022.00032
M3 - Conference contribution
T3 - Proceedings - 2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology, WI-IAT 2022
SP - 166
EP - 173
BT - Proceedings - 2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology, WI-IAT 2022
A2 - Zhao, Jiashu
A2 - Fan, Yixing
A2 - Bagheri, Ebrahim
A2 - Fuhr, Norbert
A2 - Takasu, Atsuhiro
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology, WI-IAT 2022
Y2 - 17 November 2022 through 20 November 2022
ER -