-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Question about Project Status and Potential Contributions #1
Comments
@Hannibal046 Hi, yes, all of your understandings are correct.
Regarding post-training, I don't have extensive experience in this field yet. However, I'd be very glad if you could contribute in this area. I also plan to add support for post-training features in the future. |
@yzhangcs If you're planning to implement support for online data tokenization with shuffling, I'd like to share an elegant implementation from Meta Lingua for your reference. Their approach:
I'm not sure which specific features you need to implement, but relying solely on "2. online tokenization and reshuffling with a buffer" might not be sufficient for large-scale training. This is because some datasets from Hugging Face are chronologically ordered, and even with a large online buffer, the data would still be biased. I'm happy to help if you need any assistance! |
Thank you! I will be taking a look at it. |
Hi Team,
First, I want to express my appreciation for maintaining this repository and fla. I'm finding both projects very valuable.
I have several questions about the project:
Project Status
Development Direction
Technical Architecture
From my understanding:
fla
for model definitionfla
's HuggingFace compatibility, it should work withlm-eval-harness
for evaluationCould you confirm if this understanding is correct?
Future Plans
Looking forward to your response and potentially contributing to the project.
Best regards
The text was updated successfully, but these errors were encountered: