杨银贤 (Yang Yinxian), Journal of Chongqing University (English Edition), 2005, 4(1): 50-54
1 Introduction

Reinforcement learning is a machine learning method by which agents autonomously acquire an optimal policy from the environment in which they act. When an action is executed, the agent receives a reinforcement signal by interacting with the environment. This technique has recently been used in many fields, such as robot control [1] and artificial intelligence [2], and especially in multi-agent systems [3,4]. Generally, when the state space of the environment is small enough and all states can be e…