上次openAI的人说dota是genetic algorithm + RL。
“At the beginning it is worth noting that OpenAI’s artificial intelligence
learns to play with itself. All the strategies noted by the researchers are
the result of many hours of sessions, during which two independent
instances are fighting each other. One of them is still learning and the
other one is blocked. When the learning bot achieves an advantage, it is
cloned and the researchers continue the process. The genetic algorithms work
underneath all the time, which on the basis of the results achieved
determine which behaviours bring the intended effect, and which are
meaningless and translate into a failure. In the following video, OpenAI
employees present strategies that their artificial intelligence used when
playing with real Dota 2 players”
Deepmind的方案现在有报道了吗?