Commit
[Inference] Integrate chat template in llm-on-ray (#199)
* integrate inference chat template Signed-off-by: minmingzhu <[email protected]>
* update Signed-off-by: minmingzhu <[email protected]>
* update Signed-off-by: minmingzhu <[email protected]>
* update Signed-off-by: minmingzhu <[email protected]>
* update Signed-off-by: minmingzhu <[email protected]>
* update Signed-off-by: minmingzhu <[email protected]>
* Update query_http_requests.py
* update Signed-off-by: minmingzhu <[email protected]>
* update Signed-off-by: minmingzhu <[email protected]>
* update
* update
* update
* update yaml file
* update yaml
* format yaml
* update
* Update mpt_deltatuner.yaml
* update
* Update neural-chat-7b-v3-1.yaml
* Update predictor_deployment.py
* 1. add jinja file 2. add chat template unit test 3. fix comments Signed-off-by: minmingzhu <[email protected]>
* add license header Signed-off-by: minmingzhu <[email protected]>
* update Signed-off-by: minmingzhu <[email protected]>
* update Signed-off-by: minmingzhu <[email protected]>
* update Signed-off-by: minmingzhu <[email protected]>
* Update bloom-560m-ci.yaml
* debug CI Signed-off-by: minmingzhu <[email protected]>
* debug CI Signed-off-by: minmingzhu <[email protected]>
* Update VLLM installation script and documentation (#212)
* Update VLLM installation script and documentation Signed-off-by: Wu, Xiaochang <[email protected]>
* nit Signed-off-by: Wu, Xiaochang <[email protected]>
* Update vLLM installation message Signed-off-by: Wu, Xiaochang <[email protected]>
* Update installation instructions for vLLM CPU Signed-off-by: Wu, Xiaochang <[email protected]>
* Update Dockerfile.vllm Signed-off-by: Wu, Xiaochang <[email protected]>
* Update VLLM version to 0.4.1 Signed-off-by: Wu, Xiaochang <[email protected]>
* update doc Signed-off-by: Wu, Xiaochang <[email protected]>
* nit Signed-off-by: Wu, Xiaochang <[email protected]>
* nit Signed-off-by: Wu, Xiaochang <[email protected]>
---------
Signed-off-by: Wu, Xiaochang <[email protected]>
* [Workflow] Unify Docker operations into bash (#123)
* docker2sh test
* codepath
* codepath
* codepath
* add
* add
* add
* add
* add
* add
* df
* docker.sh
* docker bash
* docker bash
* docker bash
* docker bash
* inference docker bash
* merge main0312
* merge main0312
* merge main0312
* test set-e
* fix test
* fix
* fix
* fix
* test error
* test error
* add map
* test install error
* test install error
* test install error
* test install error
* test
* test
* fix
* fix
* fix
* only inference
* fux
* fux
* fux
* target
* target
* target
* fix proxy
* fix proxy
* fix proxy
* fix proxy
* fix proxy
* fix proxy
* fix proxy
* fix fuc
* fix fuc
* fix fuc
* all inference
* add finetune
* fix
* fix
* fix
* fix
* fix finetune
* fix finetune
* fix review
* fix review
* fix review
* add info output
* Update proxy settings and Docker configurations Signed-off-by: Wu, Xiaochang <[email protected]>
* fix vllm pr212
* fix
* fix
* change name
---------
Signed-off-by: Wu, Xiaochang <[email protected]>
Co-authored-by: Wu, Xiaochang <[email protected]>
* fix comments Signed-off-by: minmingzhu <[email protected]>
* update code style Signed-off-by: minmingzhu <[email protected]>
* Fix openai response for vLLM (#213)
* [CI] Add llama2-70b inference workflow (#208)
* add llama-2-70b
* nit
* fix vllm inference ci
* Revert "fix vllm inference ci" This reverts commit 36062bd.
* Fix StoppingCriteriaSub parameters to be compatible with latest Transformers (#215)
* 1. fix CI 2. fix comments Signed-off-by: minmingzhu <[email protected]>
* format Signed-off-by: minmingzhu <[email protected]>
* modify jinja path Signed-off-by: minmingzhu <[email protected]>
* fix comments Signed-off-by: minmingzhu <[email protected]>
* update Signed-off-by: minmingzhu <[email protected]>
* fix comments Signed-off-by: minmingzhu <[email protected]>
* update Signed-off-by: minmingzhu <[email protected]>
* update Signed-off-by: minmingzhu <[email protected]>
* update jinja Signed-off-by: minmingzhu <[email protected]>
* update jinja file Signed-off-by: minmingzhu <[email protected]>
---------
Signed-off-by: minmingzhu <[email protected]>
Signed-off-by: Wu, Xiaochang <[email protected]>
Signed-off-by: minmingzhu <[email protected]>
Co-authored-by: Xiaochang Wu <[email protected]>
Co-authored-by: yutianchen <[email protected]>
Co-authored-by: KepingYan <[email protected]>
Co-authored-by: Yizhong Zhang <[email protected]>
Co-authored-by: Zhi Lin <[email protected]>