Cannot reproduce README.md graphs/results on copy task [Commit: 5c5ce66] #2

jagleeso · 2018-03-15T14:42:37Z

Hello,

I'm just trying to reproduce the results/graphs shown in the README.md for the "copy" task.

I am running this on the latest master branch:
Commit 5c5ce66376e8032c38ef4327ca381fee145f4d0f

How I trained my model:

./train.py --seed 1000 --task copy --checkpoint_interval 500 --checkpoint-path ./notebooks/copy -pbatch_size=15

NOTE: I used a batch_size of 15 instead of the default of 1, since it seems to lead to more stable convergence rates.

I then used the Python notebook to generate the 3 graphs shown in the README.md.
For convenience when comparing, I've included both the graphs I got, and the graph I expect (taken from the README.md).

Graph 1: Training convergence

I got:

I expect:

Graph 2: Training convergence (per sequence length)

I got:

I expect:

Graph 3: Evaluate

I got:

(The expected graph being that Outputs matches the Targets)

Setup information

My setup is:

OS: Ubuntu 16.04.4 LTS
Python version: 3.6.4
PyTorch is installed using Anaconda, pip freeze reports the version as:
torch==0.3.0.post4
CUDA/cuDNN versions/libraries in use by pytorch at runtime:

/usr/lib/x86_64-linux-gnu/libcuda.so.384.111
/usr/local/cuda-9.0/lib64/libcublas.so.9.0.176
/usr/local/cuda-9.0/lib64/libcudart.so.9.0.176
/usr/local/cuda-9.0/lib64/libcudnn.so.7.0.5
/usr/local/cuda-9.0/lib64/libcurand.so.9.0.176
/usr/local/cuda-9.0/lib64/libcusparse.so.9.0.176
/usr/local/cuda-9.0/lib64/libnvrtc.so.9.0.176
/usr/local/cuda-9.0/lib64/libnvToolsExt.so.1.0.0
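
For anyone comparing environments, here is a minimal sketch of how a report like the one above could be collected from inside Python. It is not part of this repository: the function names `collect_environment` and `loaded_gpu_libraries` and the keyword list are hypothetical. The library listing assumes Linux, since it reads `/proc/self/maps`, and it only sees libraries the process has already loaded.

```python
import os
import platform
import sys


def collect_environment():
    """Return a dict describing the OS, Python, and PyTorch/CUDA/cuDNN versions."""
    info = {
        "os": platform.platform(),
        "python": sys.version.split()[0],
        "torch": None,
        "cuda": None,
        "cudnn": None,
    }
    try:
        import torch  # optional: the report still works without PyTorch
    except ImportError:
        return info
    info["torch"] = torch.__version__
    # torch.version.cuda is None on CPU-only builds.
    info["cuda"] = getattr(torch.version, "cuda", None)
    try:
        # Integer version such as 7005; may be None if cuDNN is unavailable.
        info["cudnn"] = torch.backends.cudnn.version()
    except Exception:
        pass  # leave as None if this build cannot query cuDNN
    return info


def loaded_gpu_libraries(
    maps_path="/proc/self/maps",
    keywords=("cuda", "cudnn", "cublas", "curand", "cusparse", "nvrtc", "nvtoolsext"),
):
    """List shared libraries mapped into this process whose file name matches a keyword.

    Hypothetical helper, Linux only. Returns an empty list if the maps file
    cannot be read.
    """
    libs = set()
    try:
        with open(maps_path) as maps:
            for line in maps:
                # Fields: address, perms, offset, dev, inode, pathname (optional).
                fields = line.split(None, 5)
                if len(fields) < 6:
                    continue  # anonymous mapping, no backing file
                path = fields[5].strip()
                name = os.path.basename(path).lower()
                if any(keyword in name for keyword in keywords):
                    libs.add(path)
    except OSError:
        pass
    return sorted(libs)


if __name__ == "__main__":
    for key, value in collect_environment().items():
        print("{}: {}".format(key, value))
    for lib in loaded_gpu_libraries():
        print(lib)
```

Running this after the model has executed at least one CUDA operation should give the most complete library list, since some libraries are only loaded on first use.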

Let me know if there is any other information I can provide.


loudinthecloud · 2018-03-16T19:21:19Z

Thanks for reporting it, @jagleeso. The same issue was reported to me earlier this week. The workaround for now is to use the earlier commit d8fedf6, which seemed to work. I plan to fix it in the coming days.

loudinthecloud self-assigned this Mar 17, 2018

loudinthecloud closed this as completed in 64f1d58 Mar 17, 2018
