Machine Learning

Deep Reinforcement Learning with Surrogate Agent-Environment Interface

This topic contains 0 replies, has 1 voice, and was last updated by  arXiv 1 week, 3 days ago.


  • arXiv
    5 pts

    Deep Reinforcement Learning with Surrogate Agent-Environment Interface

    In this paper we propose surrogate agent-environment interface (SAEI) in reinforcement learning. We also state that learning based on probability surrogate agent-environment interface gives optimal policy of task agent-environment interface. We introduce surrogate probability action and develope the probability surrogate action deterministic policy gradient (PSADPG) algorithm based on SAEI. This algorithm enables continuous control of discrete action. The experiments show PSADPG achieves the performance of DQN in the long run for selected tasks.

    Deep Reinforcement Learning with Surrogate Agent-Environment Interface
    by Song Wang, Yu Jing
    https://arxiv.org/pdf/1709.03942v1.pdf

You must be logged in to reply to this topic.