-
Notifications
You must be signed in to change notification settings - Fork 45.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Relation to the original XLNet implementation? #7843
Comments
Hi, this is a new implementation in TF2 and includes a few updates from the authors directly. |
@saberkun Off topic: Also XLnet sota results should be retried with swish / RAdam / gradient centralization / Ranger https://github.com/lessw2020/Ranger-Deep-Learning-Optimizer I sincerely hope that you, the tensorflow engineers can observe this research direction blind spot and correct it. Then you would be covered with glory for the induced empirical results. |
Hi @saberkun, @zihangdai, @graykode, @bzantium
The original zihangdai/XLNet repository doesn't get any update recently. Should we assume that the XLNet implementation here official/nlp/xlnet will replace the original one?
BTW, there are several PRs there that are not merged, including my latest PR zihangdai/xlnet#247 that enables pre-training XLNet using Cloud TPU and TPU Pod.
The text was updated successfully, but these errors were encountered: