You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
To estimate values quickly on big boards, it is needed to do a more or less dirty trick:
If you encounter a move on point X,Y for a player, and you do playouts for that move, let other moves on that point also profit from the assigned value by a certain amount.
This does of course increase the noise level for winrate-distributions over the board, but also it gives very fast access to potential exploration-worthy nodes.
The current implementation is only marginal and needs to be reworked, if not simply thrown out and rewritten. This is pretty important, since the speed of value estimations on a 19x19 board is far too slow to play against.
The text was updated successfully, but these errors were encountered:
- sloppy AMAF implementation REFS #2@30m
- todo commentaries (outsorced as issues)
[ADDED]
- option to disable rating overlay when displaying the board
- generic tree structure REFS #8@8H
- additional display values (AMAFPlayouts)
- AMAF/RAVE implementation REFS #2@4h
- adapted GameManager to minimally support playing with the UCTreeThread instead of old uct implementation REFS #8@30m
[REMOVED]
- most playout heuristics (don't work too well with RAVE, bias to random playouts shouldn't be distributed among all nodes!)
[FIXED]
- some bug in UCT that caused the engine to select the global highest uct node instead of the best path (minimax) CLOSES#10@2h
[CHANGED]
- some weights
- no more "random" montecarlo playouts needed, everything done now by the UCT
To estimate values quickly on big boards, it is needed to do a more or less dirty trick:
If you encounter a move on point X,Y for a player, and you do playouts for that move, let other moves on that point also profit from the assigned value by a certain amount.
This does of course increase the noise level for winrate-distributions over the board, but also it gives very fast access to potential exploration-worthy nodes.
The current implementation is only marginal and needs to be reworked, if not simply thrown out and rewritten. This is pretty important, since the speed of value estimations on a 19x19 board is far too slow to play against.
The text was updated successfully, but these errors were encountered: