-
Notifications
You must be signed in to change notification settings - Fork 19.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
'''Train a simple deep CNN on the CIFAR10 small images dataset." after a while it gets worse #7384
Comments
Accuracy on the validation set or on the training set? |
these are the result after 200 epoc: |
Are you running on theano, tensorflow or cntk backend? |
tensorflow 1.2
and no I didn't change anything in the code |
I started a run using the Theano backend and I am getting similar results. From the code, it seems that this might be caused by vanishing gradients, because if This being said, I just realised that the default value for the numerical constant Maybe you can try another optimizer to see if this is indeed a problem with rmsprop. However, it seems that all adaptive learning rate optimizers use the same epsilon that might just cause the same problems. Therefore, it might be wiser to just choose PS: I found this similar issue for torch |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed after 30 days if no further activity occurs, but feel free to re-open a closed issue if needed. |
I run the code it went high as 0.79% accuracy in 100 epoc but then it started to get worse and the loss goes back to what it started!(1.8) how to prevent that?
The text was updated successfully, but these errors were encountered: