Cannot reproduce the result #36

Open
Joyyang158 opened this issue Jul 12, 2024 · 4 comments

Comments

@Joyyang158

Hi authors,

I tried to reproduce the results reported in the paper using zephyr-7b-sft-full.

When I follow the algorithm described in the paper, performance improves only in the first iteration and then decreases over the next few iterations. I set the number of epochs to 3.

Could you give me some guidance? Thanks!

@angelahzyuan
Collaborator

@Joyyang158 To reproduce the results from our paper, you would need to:

  1. Use zephyr-7b-sft-full at revision ac6e600eefcce74f5e8bae1035d4f66019e93190.
  2. Use HF generation, as vLLM generation differs from what we used in the paper.
  3. Set the total number of epochs to 6, and stop as needed.
  4. For evaluation, use lm-evaluation-harness at version v0.4.0.

For settings that differ from our original configuration, you may need to adjust the parameters. We are currently tuning the parameters for vLLM generation and will provide updates once we have results.
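As a rough illustration of item 1, pinning the checkpoint to the stated revision might look like the sketch below. The Hub repo id `alignment-handbook/zephyr-7b-sft-full` is an assumption (the thread gives only the model name and the commit hash), and the helper names `is_full_commit_hash` and `load_pinned` are hypothetical. The `revision` argument of `from_pretrained` is the standard way to select a specific commit in `transformers`.

```python
import re

# Assumption: the repo id is not stated in the thread; only the model name
# and the revision hash are.
MODEL_ID = "alignment-handbook/zephyr-7b-sft-full"
REVISION = "ac6e600eefcce74f5e8bae1035d4f66019e93190"


def is_full_commit_hash(revision: str) -> bool:
    """Return True if `revision` looks like a full 40-character commit hash."""
    return re.fullmatch(r"[0-9a-f]{40}", revision) is not None


def load_pinned():
    """Load the tokenizer and model at the pinned revision (downloads weights)."""
    # Imported lazily so the constants above can be checked without transformers.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID, revision=REVISION)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, revision=REVISION)
    return tokenizer, model
```

Pinning a full commit hash rather than a branch name guards against later updates to the checkpoint silently changing the starting point.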

Thank you.

@Joyyang158
Author

Thanks for your reply. I will try epoch = 6. There are two things I want to check with you:

  1. At each iteration, do you use data from the previous two iterations, rather than from a single iteration as shown in the paper?
  2. At each iteration, is the base model the SFT model, or the iterative model (model-iter0, 1, 2, 3)?

Thanks!

@angelahzyuan
Collaborator

@Joyyang158

  1. Using data from the previous two iterations works better than using a single iteration.
  2. The base model is changed to the iterative models (iter0, 1, 2).
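One possible reading of these two answers is sketched below. It is only an interpretation, not the authors' script: it assumes iteration 0 starts from the SFT model, that each later iteration starts from the previous iteration's checkpoint, and that "previous two iterations' data" means the two most recent rounds of generated data. The function name `plan_iterations` and the `data-iterN` labels are hypothetical.

```python
def plan_iterations(num_iterations, sft_name="sft", window=2):
    """Build a per-iteration plan of base model and training data.

    Assumptions (not confirmed by the thread): iteration 0 trains from the
    SFT model, iteration t > 0 trains from checkpoint iter{t-1}, and training
    data is pooled from the `window` most recent data rounds.
    """
    plan = []
    for t in range(num_iterations):
        base_model = sft_name if t == 0 else f"iter{t - 1}"
        first_round = max(0, t - window + 1)
        train_data = [f"data-iter{i}" for i in range(first_round, t + 1)]
        plan.append(
            {
                "iteration": t,
                "base_model": base_model,
                "output_model": f"iter{t}",
                "train_data": train_data,
            }
        )
    return plan


if __name__ == "__main__":
    for step in plan_iterations(4):
        print(step)
```

Setting `window=1` gives the single-iteration variant that the question contrasts this with.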

@Joyyang158
Author

I see, thanks for your help and patience!
