fix: bundle CUDA DLL into the release #62
base: main
Feeding.slowdown.mp4
@louisgv The callback to the UI for the fed tokens seems to slow down the feeding process significantly. (It should be instant.)
Yes haha - there's a 42ms artificial lag that I introduced to make the UI a bit smoother (ref: https://github.com/louisgv/local.ai/blob/main/apps/desktop/src/providers/thread.ts#L160-L161 ). For non-accelerated machines and models, this is needed to have something showing :d
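One way to keep the smoother UI without delaying the feed itself is to buffer tokens and flush them to the UI at most once per interval. The following is only a sketch under assumptions: `ThrottledEmitter`, `push`, and `flush` are hypothetical names and are not taken from `thread.ts`; the 42 ms default simply mirrors the lag mentioned above.

```typescript
// Hypothetical sketch: none of these names come from the local.ai codebase.
type Emit = (chunk: string) => void;

class ThrottledEmitter {
  private pending: string[] = [];
  private lastFlush = Number.NEGATIVE_INFINITY;
  private readonly emit: Emit;
  private readonly intervalMs: number;
  private readonly now: () => number;

  constructor(emit: Emit, intervalMs = 42, now: () => number = () => Date.now()) {
    this.emit = emit;
    this.intervalMs = intervalMs;
    this.now = now;
  }

  // Called from the token callback. It never waits, so feeding is not slowed.
  push(token: string): void {
    this.pending.push(token);
    if (this.now() - this.lastFlush >= this.intervalMs) {
      this.flush();
    }
  }

  // Forward everything buffered so far as a single UI update.
  flush(): void {
    if (this.pending.length === 0) return;
    this.emit(this.pending.join(""));
    this.pending = [];
    this.lastFlush = this.now();
  }
}
```

The caller would invoke `flush()` once more when feeding or generation ends, so the last buffered tokens still reach the UI.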
Currently this copies the CUDA DLLs next to the local.ai executable if the
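For illustration, copying DLLs next to an executable can be sketched with Node's `fs` module as below. This is not the PR's actual build step: `selectDlls`, `bundleDlls`, and both path parameters are hypothetical, and the condition under which the PR performs the copy is not shown in the comment above, so it is left out here.

```typescript
import { copyFileSync, mkdirSync, readdirSync } from "node:fs";
import { dirname, join } from "node:path";

// Hypothetical helper: keep only DLL files (case-insensitive extension match).
function selectDlls(fileNames: string[]): string[] {
  return fileNames.filter((name) => name.toLowerCase().endsWith(".dll"));
}

// Hypothetical helper: copy every DLL found in `sourceDir` into the
// directory that contains `exePath`, and return the copied file names.
function bundleDlls(sourceDir: string, exePath: string): string[] {
  const targetDir = dirname(exePath);
  mkdirSync(targetDir, { recursive: true });
  const dlls = selectDlls(readdirSync(sourceDir));
  for (const name of dlls) {
    copyFileSync(join(sourceDir, name), join(targetDir, name));
  }
  return dlls;
}
```

Placing the DLLs in the executable's own directory relies on the Windows default search order, which checks the application directory when loading a DLL.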
@louisgv What's the plan going forward on this? Can you take over and handle the auto-update stuff?
@LLukas22 yup, I'm on it now!
Kinda want to wait for the Metal fix to land. My main fear with this PR is that the build seems flaky :d (OOM?) Perhaps we should remove some of the flakiness by building for either CUDA or CL only?
Per the docs, there are still no Metal chips on the GitHub runner VMs yet: https://docs.github.com/en/actions/using-github-hosted-runners/about-github-hosted-runners#supported-runners-and-hardware-resources So we will still need a self-hosted Metal runner, I think :d
The last piece missing from this PR is a pipeline to upload the content of each zip artifact into a release.