v0.8: Structured Output via Outlines
🎉 Enhancements
- Added Outlines logits processor for JSON schema validation by @tgaddair in #224
- Enable JSON guided generation via OpenAI-compatible API by @jeffreyftang in #243
- JSON schema for guided generation now optionally respects field order by @jeffreyftang in #264
- Set default adapter source by @magdyksaleh in #223
- Pad LoRA ranks to ensure compatibility with SGMV kernel by @tgaddair in #256
- Add model and adapter response headers by @magdyksaleh in #220
- Add Cors params by @magdyksaleh in #221
- Add expose headers by @magdyksaleh in #230
🐛 Bugfixes
- Properly split out model_id when retrieving adapter weights downloaded from S3 by @jeffreyftang in #246
- Fixed TIES merging to calculate sign before applying weights by @tgaddair in #239
- Update s3.py by @llama-shepard in #234
- Fix concatenate for flash batch by @tgaddair in #254
- Fixed batch merging and filtering to handle Outlines state by @tgaddair in #263
📝 Docs
- Add guide for guided generation by @jeffreyftang in #240
- Added contributing guide by @tgaddair in #226
- Update README to include model merging by @tgaddair in #225
- Updated structured output by @tgaddair in #258
- Minor corrections to development env setup instructions by @jeffreyftang in #228
🔧 Maintenance
- Upgrade docker to use rust 1.75 and ubuntu 22.04 by @tgaddair in #250
- Upgrading rust for dependency changes by @DhruvaBansal00 in #248
- fix paths on runner by @noyoshi in #242
New Contributors
- @jeffreyftang made their first contribution in #228
- @DhruvaBansal00 made their first contribution in #248
Full Changelog: v0.7.0...v0.8.0