Releases: helixml/helix
1.4.14 - improve app UI, add env var to disable logging
What's Changed
For security conscious orgs, it's now possible to globally disable LLM logging. We've also fixed some bugs in the app editor UI.
New env variable (on controlplane, aka api container) to disable LLM calls logs tables in both admin dashboard and app view:
DISABLE_LLM_CALL_LOGGING=true
Admin dashboard:
App view:
PRs:
- chore(deps): bump github.com/go-git/go-git/v5 from 5.12.0 to 5.13.0 by @dependabot in #693
- chore(smoke): increase rag app test timeout by @philwinder in #694
- fix(knowledge): guard against nil crawler options by @philwinder in #695
- test(smoke): smoke test example apps by @philwinder in #696
- refactor(smoke): remove parallel tests and chat timeout by @philwinder in #697
- refactor(smoke): missed a parallel and improve logging by @philwinder in #698
- refactor(smoke): remove unnecessary reloads that might be affecting s… by @philwinder in #699
- refactor(smoke): remove individual test timeouts by @philwinder in #704
- 701 app creation bug by @binocarlos in #708
- Revert "701 app creation bug" by @lukemarsden in #710
- feat(kind): allow user to specify helix env vars by @philwinder in #711
- Revert "Revert "701 app creation bug"" by @lukemarsden in #712
- 701 app creation bug by @binocarlos in #713
- Disable llm calls logging by @nessie993 in #714
Full Changelog: 1.4.13...1.4.14
1.4.13 - fixing regressions
What's Changed
Fixes nats error introduced by mistake in previous release on API restart. Also fixes regression in the helix k8s operator. Sorry for the turbulence, folks!
- Fix operator code and add CI by @milosgajdos in #683
- feat(smoke): add smoke test for api integrations by @philwinder in #684
- fix(smoke): don't check for specific text in responses, too unreliable by @philwinder in #685
- refactor(smoke): check for presence of response text by @philwinder in #686
- feat(installer): make cli installation path configurable and add to smoke test by @philwinder in #687
- test(smoke): add helix test smoke test by @philwinder in #689
- Bump NATS dependency by @milosgajdos in #690
- Fix subscription by @nessie993 in #691
Full Changelog: 1.4.12...1.4.13
1.4.12 - apps drag'n'drop filestore, initial MCP support
What's Changed
- make frontend work better with filestore knowledge by @nessie993 in #668
- Model Context Protocol (MCP) support by @nessie993 in #680
security/refactoring/quality changes:
- Revive vol2 by @milosgajdos in #657
- (security): Bump golang.org/x/net module (CVE-2024-45338) by @milosgajdos in #658
- chore(deps): bump google.golang.org/grpc from 1.64.0 to 1.64.1 by @dependabot in #660
- (security): Bump golang.org/x/net module (CVE-2024-45338) by @milosgajdos in #661
- Bump gRPC module by @milosgajdos in #662
- (refactor): Client context plumbing by @milosgajdos in #663
- Add context plumbing for sessions. by @milosgajdos in #664
- Revive linter fixes by @milosgajdos in #665
- Enable revive vol4 by @milosgajdos in #666
- Rename vars so they follow idiomatic Go by @milosgajdos in #667
- Rename vars vol2 by @milosgajdos in #669
- Add comments to blank imports by @milosgajdos in #670
- Last revive fixes by @milosgajdos in #671
- Remove TOC from local-dev README by @milosgajdos in #673
- less confusing error for app api keys by @nessie993 in #674
- feat(test): initial smoke test by @philwinder in #675
- feat(test): slack notification by @philwinder in #676
- test(smoke): speed up and make tests more resilient by @philwinder in #677
- test(smoke): add installer test by @philwinder in #678
Full Changelog: 1.4.11...1.4.12
1.4.11 - out of the box local models support for gptscript; fix RAG over files on k8s
What's Changed
- Enable errcheck linter by @milosgajdos in #643
- fix: document serverUrl parameter in sample values.yaml by @lukemarsden in #648
- Add linter step into CI by @milosgajdos in #649
- Allow RUNNER_TOKEN to perform chat completions by @milosgajdos in #647
- Fix stack script by @milosgajdos in #650
- Update docker-compose file so gptscript uses RUNNER_TOKEN by @milosgajdos in #651
- fixes pdf parsing by @nessie993 in #652
- Remove redundant http.Flush when writing chunks by @milosgajdos in #653
- Make Go code Go code again: revive linter checks by @milosgajdos in #655
- chore(deps): bump transformers from 4.31.0 to 4.38.0 in /cog by @dependabot in #644
- refactor(scheduler): fix flakey test and refactor to make more testable by @philwinder in #656
Full Changelog: 1.4.10...1.4.11
1.4.10 - rag improvements
What's Changed
- Small error updates by @milosgajdos in #638
- chore(deps): bump golang.org/x/crypto from 0.27.0 to 0.31.0 in /operator by @dependabot in #639
- chore(deps): bump golang.org/x/crypto from 0.27.0 to 0.31.0 by @dependabot in #641
- add limits to the crawler by @nessie993 in #642
- Introduce golangci linter by @milosgajdos in #640
- fix rag search by @nessie993 in #645
- fix frontend code blocks by @nessie993 in #646
New Contributors
- @nessie993 made their first contribution in #642
Full Changelog: 1.4.9...1.4.10
1.4.9 - support function calling in helix runners, add gptscript runner
What's Changed
- Add gptscript_runner service to helix stack by @milosgajdos in #635
- Bump ollama and update LLM inference with Tool calling by @milosgajdos in #637
Full Changelog: 1.4.8...1.4.9
1.4.8 - Runner slimming, add llama3.3:70b
What's Changed
Runner slimming, add llama3.3:70b by @lukemarsden in #634:
- switch default model back to
llama3.1:8b-instruct-q8_0
, it was set tollama3.1:8b-instruct-q4_K_M
by mistake - remove rarely used models to reduce size of -large model, remove image model from -small, prune default startup models
- switch to symlinks instead of copying in the case that the model cache isn't bind-mounted (will dramatically reduce ephemeral storage usage in production)
- add new llama3.3:70b model which is apparently as good as llama3.1:405b
- upgrade to ollama v0.5.1
- add new
OLLAMA_KV_CACHE_TYPE=q8_0
option to quantize kv store, should reduce long context memory usage (we can monitor memory usage in the dashboard and adjust the predicted memory values accordingly)
Full Changelog: 1.4.7...1.4.8
1.4.7 - [Breaking] Replace some Ollama Models to aid cache stability
Ollama regularly updates their "default" models to use new weights. For example llama3.1:7b gets updated roughly once every two weeks. Each time this happens we have to rebuild our base images from scratch.
To help avoid this, we've switched to use the same model, but with a more specific tag.
New sessions will use this new models by default. API users should change their model request to use the newer models.
Here is a list of changed models:
llama3.1:8b-instruct-q8_0
llama3.1:70b
aya:35b
Switching to:
llama3.1:8b-instruct-q4_K_M
llama3.1:70b-instruct-q4_K_M
aya:35b-23-q4_0
What's Changed
- refactor(runner): swap some ollama models to make them more stable and bust model cache to update by @philwinder in #631
Full Changelog: 1.4.6...1.4.7
1.4.6 - fix regression of frontend using chat models with togetherai
What's Changed
- Bump gptscript version by @milosgajdos in #624
- fix(scheduler): add background goroutine to check for dead runners by @philwinder in #623
- refactor(image): fix size of images by @philwinder in #626
- Use fork of gptscript and update prompt to user by @milosgajdos in #625
Full Changelog: 1.4.5...1.4.6
1.4.5 - Bugfix for openshift
What's Changed
- fix(diffusers): add --no-dev because it shouldn't install dev dependencies by @philwinder in #622
Full Changelog: 1.4.4...1.4.5