Adding Snowflake Provider Support + Distribution Template #351

Draft · sfc-gh-alherrera wants to merge 4 commits into main
Conversation

sfc-gh-alherrera

Adds support for inference with Snowflake's Cortex endpoint. See the docs for more background on Cortex: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-llm-rest-api
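For context, here is a rough sketch of the kind of request the remote::snowflake adapter issues against Cortex. The endpoint path, header names, and payload fields below are assumptions drawn from the linked docs rather than the exact code in this PR:

```python
# Hedged sketch only -- endpoint path, headers, and payload fields are
# assumptions based on the Cortex LLM REST API docs linked above.
import requests

account_url = "https://<account>.snowflakecomputing.com"  # placeholder account URL
api_token = "<snowflake-api-token>"                       # placeholder credential

resp = requests.post(
    f"{account_url}/api/v2/cortex/inference:complete",  # assumed endpoint path
    headers={
        "Authorization": f"Bearer {api_token}",
        "Content-Type": "application/json",
        "Accept": "application/json",
    },
    json={
        "model": "llama3.1-70b",  # one of the Llama models hosted by Cortex
        "messages": [{"role": "user", "content": "Hello from Llama Stack"}],
    },
)
resp.raise_for_status()
print(resp.text)
```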

Testing

  • Built a new stack from scratch using remote::snowflake for the Inference API
  • Ran python -m llama_stack.apis.inference.client localhost
  • Directly submitted a request to the Llama Stack server (see the sketch below)
[Two screenshots of test output, Oct 31 2024]
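For the last check, a direct request to the running server looks roughly like this; the port and route are assumptions based on the Llama Stack server defaults at the time, so adjust to your run config:

```python
# Hedged example of submitting a request directly to the Llama Stack server.
# Port 5000 and the /inference/chat_completion route are assumptions; use
# whatever your run configuration exposes.
import requests

response = requests.post(
    "http://localhost:5000/inference/chat_completion",
    json={
        "model": "Llama3.1-8B-Instruct",  # any model served via remote::snowflake
        "messages": [{"role": "user", "content": "What is Snowflake Cortex?"}],
        "stream": False,
    },
)
print(response.json())
```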

facebook-github-bot added the CLA Signed label on Oct 31, 2024.
providers:
  inference: remote::snowflake
  memory: meta-reference
  safety: meta-reference
Contributor
Are you able to offer llama guard inference as well instead of relying on meta-reference?

Contributor

@raghotham This is actually desirable. Safety impl should be meta-reference -- the important question is that there be a Llama-Guard model available for inference.

sfc-gh-alherrera (Author) · Nov 5, 2024

Unfortunately we don't have a llama-guard model hosted separately in our LLM service (see here for available models right now: https://docs.snowflake.com/user-guide/snowflake-cortex/llm-functions#availability).

We do use Llama Guard in Cortex Guard: https://www.snowflake.com/en/blog/snowflake-cortex-ai-cortex-guard-llm-safeguards/. However, it isn't exposed or hosted separately; it is offered as an optional argument on our normal completion endpoint, meaning the Inference API response is filtered with Llama Guard on the Cortex server side when that option is enabled.

Could we make that work somehow?
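One possible shape, sketched below: pass the guard through as an extra option on the completion request, the way the guardrails option works in the SQL COMPLETE function. Whether the REST endpoint accepts the same field, and what Llama Stack would call it, are assumptions here:

```python
# Hedged sketch: toggling Cortex Guard on the completion request itself.
# The "guardrails" field mirrors the options object of the SQL COMPLETE()
# function; its availability on the REST endpoint is an assumption.
import requests

payload = {
    "model": "llama3.1-70b",
    "messages": [{"role": "user", "content": "Tell me how to pick a lock."}],
    "guardrails": True,  # hypothetical: filter the response server-side with Llama Guard
}

resp = requests.post(
    "https://<account>.snowflakecomputing.com/api/v2/cortex/inference:complete",  # assumed path
    headers={"Authorization": "Bearer <token>", "Content-Type": "application/json"},
    json=payload,
)
print(resp.text)
```

From the Llama Stack side that would look more like a provider-specific inference option than a standalone Safety provider, which may be why it doesn't map cleanly onto the meta-reference safety impl.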

sfc-gh-alherrera (Author)

@ashwinb @raghotham please let me know if you have specific guidance or comments on updating the implementation given the above context, or if we are good to merge for now. In the future I'd like to build out additional components, like memory, which Snowflake can support as well.

sfc-gh-alherrera (Author)

Hi @ashwinb @raghotham, just circling back here. What do you need from our side to get this merged? Thanks.

Contributor

@sfc-gh-alherrera sorry about not responding sooner. We have been working to stabilize the APIs and automate verification for providers. Can you take a look at the latest code and rebase your PR?
