Releases: predibase/lorax

v0.5: CUDA graph compilation

08 Jan 17:14
57d5470

🎉 Enhancements

🐛 Bugfixes

  • Fixed deadlock in sgmv_shrink kernel caused by imbalanced segments by @tgaddair in #156
  • Fixed loading adapter from absolute s3 path by @tgaddair in #161

📝 Docs

  • Update client docs with new endpoint source by @abidwael in #126
  • Update client docs with new endpoint source by @abidwael in #146

🔧 Maintenance

New Contributors

Full Changelog: v0.4.1...v0.5.0

v0.4.1

18 Dec 19:53
9ae65b3

🐛 Bugfixes

🔧 Maintenance

  • Optimize SGMV kernel code path to reduce mallocs by @tgaddair in #139
  • fix sync script to account for subfolder bucket paths by @noyoshi in #135

Full Changelog: v0.4.0...v0.4.1

v0.4.0

15 Dec 18:15
ce99dbf

🎉 Enhancements

🐛 Bugfixes

📝 Docs

🔧 Maintenance

New Contributors

Full Changelog: v0.3.0...v0.4.0

v0.3.0

07 Dec 18:56
bb950cc

What's Changed

Enhancements

Bugfixes

Maintenance

New Contributors

Full Changelog: v0.2.1...v0.3.0

lorax-0.3.0

07 Dec 18:51
bb950cc

LoRAX is the open-source framework for serving hundreds of fine-tuned LLMs in production for the price of one.

lorax-0.2.1

30 Nov 19:16
fb3cdb4

LoRAX is the open-source framework for serving hundreds of fine-tuned LLMs in production for the price of one.

v0.2.1

29 Nov 20:29
f1b9778

What's Changed

Bugfixes

  • add weight property to exllamav2 quanlinear by @flozi00 in #80
  • fix: Assign dtype of lora to base model dtype by @tgaddair in #82

Full Changelog: v0.2.0...v0.2.1

v0.2.0

28 Nov 22:01
cb96f12

What's Changed

Enhancements

Bugfixes

  • Updated to custom SGMV kernel to fix issue with certain ranks by @tgaddair in #70
  • fix: Allow using unsupported base models without adapter loading by @tgaddair in #76

Maintenance

  • Add DISABLE_SGMV env var to explicitly fallback to loop by @tgaddair in #69
  • Upgrade the README discord badge and use an invite link that doesn't expire. by @justinxzhao in #73
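
For readers unfamiliar with the DISABLE_SGMV switch listed above, here is a minimal sketch of how an environment-variable opt-out like this is commonly read. Only the variable name comes from the release notes. The helper name `sgmv_disabled` and the accepted truthy values are assumptions made for illustration, not LoRAX's actual parsing logic.

```python
import os
from typing import Mapping, Optional


def sgmv_disabled(env: Optional[Mapping[str, str]] = None) -> bool:
    """Return True when DISABLE_SGMV is set to a truthy value.

    Hypothetical helper: the accepted values ("1", "true", "yes") are an
    assumption, not taken from the LoRAX source.
    """
    env = os.environ if env is None else env
    value = env.get("DISABLE_SGMV", "")
    return value.strip().lower() in {"1", "true", "yes"}


# Choose a code path based on the flag (labels are illustrative only).
backend = "loop" if sgmv_disabled() else "sgmv"
```

Passing the environment as a parameter keeps the check easy to test without mutating the real process environment.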

New Contributors

Full Changelog: v0.1.2...v0.2.0

v0.1.2

26 Nov 21:33
8c8109c

What's Changed

  • Fixed adapter segments when batches contain multiple distinct adapters by @tgaddair in #62
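
To illustrate what "adapter segments" can mean for a batch with several distinct adapters, the sketch below groups consecutive requests that share an adapter into contiguous segments. It is an illustrative assumption about the general technique, not the code changed in #62. The function name `find_adapter_segments` and its return layout are hypothetical.

```python
from typing import List, Tuple


def find_adapter_segments(adapter_indices: List[int]) -> Tuple[List[int], List[int]]:
    """Split a batch into contiguous runs of requests sharing one adapter.

    Returns (boundaries, segment_adapters), where segment i covers batch
    positions boundaries[i] up to (not including) boundaries[i + 1] and
    uses adapter segment_adapters[i].
    """
    boundaries: List[int] = []
    segment_adapters: List[int] = []
    for pos, adapter in enumerate(adapter_indices):
        # A new segment starts at position 0 or whenever the adapter changes.
        if pos == 0 or adapter != adapter_indices[pos - 1]:
            boundaries.append(pos)
            segment_adapters.append(adapter)
    # Close the final segment with the batch length.
    boundaries.append(len(adapter_indices))
    return boundaries, segment_adapters


# Batch of six requests: adapter 0 twice, adapter 1 three times, adapter 0 again.
boundaries, adapters = find_adapter_segments([0, 0, 1, 1, 1, 0])
```

Note that an adapter appearing in two non-adjacent runs yields two separate segments. Sorting the batch by adapter first would merge them, at the cost of reordering requests.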

Full Changelog: v0.1.1...v0.1.2

v0.1.1

22 Nov 22:47
00675dc

What's Changed

New Contributors

Full Changelog: v0.1.0...v0.1.1