SCIENCE SPIN-OFFS

Reply Private New

Next 10 Prev Next

Send PM Follow Ignore

Followers	28
Posts	7358
Boards Moderated	1
Alias Born	09/13/2010

dockzef

Re: None

Friday, 11/09/2018 1:45:45 PM

Friday, November 09, 2018 1:45:45 PM

Meta-Learning for Multi-objective Reinforcement Learning

Xi Chen, Ali Ghadirzadeh, Mårten Björkman, Patric Jensfelt
(Submitted on 8 Nov 2018)

Multi-objective reinforcement learning (MORL) is the generalization of standard reinforcement learning (RL) approaches to solve sequential decision making problems that consist of several, possibly conflicting, objectives. Generally, in such formulations, there is no single optimal policy which optimizes all the objectives simultaneously, and instead, a number of policies has to be found, each optimizing a preference of the objectives. In this paper, we introduce a novel MORL approach by training a meta-policy, a policy simultaneously trained with multiple tasks sampled from a task distribution, for a number of randomly sampled Markov decision processes (MDPs). In other words, the MORL is framed as a meta-learning problem, with the task distribution given by a distribution over the preferences. We demonstrate that such a formulation results in a better approximation of the Pareto optimal solutions, in terms of both the optimality and the computational efficiency. We evaluated our method on obtaining Pareto optimal policies using a number of continuous control problems with high degrees of freedom.

https://arxiv.org/abs/1811.03376

Keep Last Read

Next 10 Prev Next

Join the InvestorsHub Community

Register for free to join our community of investors and share your ideas. You will also get access to streaming quotes, interactive charts, trades, portfolio, live options flow and more tools.

Volume
Day Range:
Bid Price
Ask Price
Last Trade Time:

Boards:

Quotes:

Boards

News

Market Data

Markets

Discover

Discover

Boards:

Quotes:

Join the InvestorsHub Community