Commit

fix: use torch inference container (#1617)

---------

Co-authored-by: Holt Skinner <[email protected]>
inardini and holtskinner authored Jan 10, 2025
1 parent c37883e commit 4ca4987
Showing 1 changed file with 1 addition and 1 deletion.
@@ -1162,7 +1162,7 @@
"model = Model.upload(\n",
" display_name=\"google--gemma2-tgi-multi-lora-model\",\n",
" artifact_uri=str(serve_uri),\n",
-" serving_container_image_uri=\"us-docker.pkg.dev/deeplearning-platform-release/gcr.io/huggingface-text-generation-inference-cu124.2-3.ubuntu2204.py311\",\n",
+" serving_container_image_uri=\"us-docker.pkg.dev/deeplearning-platform-release/gcr.io/huggingface-pytorch-inference-cu121.2-3.transformers.4-46.ubuntu2204.py311\",\n",
" serving_container_ports=[8080],\n",
" serving_container_environment_variables={\n",
" \"HUGGING_FACE_HUB_TOKEN\": get_token(),\n",