@Greg Strong
It is named AlphaZero. Also, if you find good material on it, please share. I could not find any :(
@Joe Joyce
I don't think scalability is a problem, at least not a big one. Not even changing boards or weird piece properties: as long as they can be approximated with numbers, these algorithms are fine. And honestly, to my mind, everything can be approximated with numbers. The technical way to put it is that the Stone-Weierstrass theorem should be applicable, and then it works. Things could probably go wrong initially sometimes, but it will always be a matter of training time, and most likely never a matter of decades. Of course one could imagine something totally unfeasible for any hardware in this universe, like a game on a 10 billion by 10 billion board, but I think that is too far-fetched :)
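To make the Stone-Weierstrass point a bit more concrete, here is a minimal sketch of its classical special case (the Weierstrass approximation theorem): a continuous function on [0, 1] can be approximated uniformly by polynomials, and the worst-case error shrinks as the degree grows. This is only a toy illustration, not anything taken from AlphaZero; the names `bernstein`, `max_error` and `target` are made up for the example, and it says nothing about how long training a real network would take.

```python
from math import comb

def bernstein(f, n):
    """Return the degree-n Bernstein polynomial of f on [0, 1]."""
    # The coefficients are just samples of f on an evenly spaced grid.
    coeffs = [f(k / n) for k in range(n + 1)]

    def p(x):
        return sum(c * comb(n, k) * x**k * (1 - x)**(n - k)
                   for k, c in enumerate(coeffs))

    return p

def max_error(f, n, samples=201):
    """Largest gap between f and its degree-n approximation on a test grid."""
    p = bernstein(f, n)
    grid = [i / (samples - 1) for i in range(samples)]
    return max(abs(f(x) - p(x)) for x in grid)

def target(x):
    # Hypothetical test function: continuous, but with a sharp kink at 0.5.
    return abs(x - 0.5)

# Worst-case error for increasing polynomial degree.
errors = {n: max_error(target, n) for n in (4, 16, 64)}
for n, err in errors.items():
    print(f"degree {n:3d}: max error {err:.4f}")
```

The error keeps dropping as the degree goes up, even for a function with a kink, which is the sense in which "anything that can be written as numbers" can be approximated given enough capacity.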
@Kevin Pacey
These algorithms are indeed not perfect. By definition they are heuristic, so they will never try to achieve perfect play; instead, from a statistically relevant sample, they choose a solution that is very likely very good. To my knowledge there has been only one self-driving car accident ever on a public road in more than 1 billion kilometers. And it was not purely a software glitch, but rather a poor-visibility problem that led to insufficient information.
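As a rough illustration of "choosing a very likely very good move from a statistical sample", here is a minimal flat Monte Carlo sketch on a toy game I picked for the example: a pile of stones, each player takes 1 to 3, and whoever takes the last stone wins. This is not AlphaZero's actual method (that combines tree search with a trained neural network); it is a much simpler stand-in, and the function names are hypothetical.

```python
import random

def random_playout(stones, rng):
    """Play uniformly random moves; return True if the player to move wins."""
    player = 0  # 0 is the player to move at the start of the playout
    while stones > 0:
        stones -= rng.randint(1, min(3, stones))
        if stones == 0:
            return player == 0  # whoever took the last stone wins
        player = 1 - player
    return False  # no stones left: the player to move has already lost

def estimate_moves(stones, playouts, rng):
    """Estimate a win rate for each legal move from random playouts."""
    rates = {}
    for take in range(1, min(3, stones) + 1):
        left = stones - take
        # After our move the opponent is to move; we win if they lose.
        wins = sum(not random_playout(left, rng) for _ in range(playouts))
        rates[take] = wins / playouts
    return rates

rng = random.Random(0)  # fixed seed so the run is repeatable
rates = estimate_moves(5, 2000, rng)
best = max(rates, key=rates.get)
print(rates, "-> take", best)
```

With 5 stones the sampled win rates favour taking 1, which also happens to be the theoretically correct move here (it leaves the opponent with 4). The program never proves that; it just finds it very likely from the sample, which is the point.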