Testing how TensorFlow Serving handles variable-length sequences as input, in the context of session-based recommender models.
In particular, when a transformer encoder consumes the user's item-interaction sequence, padding every single request to the maximum sequence length wastes a lot of compute. This dummy model tests how tf-serving behaves when the sequence length varies.
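For reference, a minimal sketch of the kind of dummy model this tests. A Keras `Lambda` model is an assumption here (suggested by the `serving_default_lambda_input` tensor name in the batching error below); the point is only the variable-length input shape and the versioned export layout tf-serving expects:

```python
import tensorflow as tf

# Trivial model: sums each variable-length sequence into a single score.
# The Lambda layer is an assumption, chosen because the batching error
# below names the input tensor `serving_default_lambda_input:0`.
model = tf.keras.Sequential([
    tf.keras.layers.Lambda(
        lambda x: tf.reduce_sum(x, axis=-1, keepdims=True),
        input_shape=(None,),  # None = variable sequence length
    ),
])

# tf-serving expects a numeric version subdirectory under the model base path.
model.save("models/dummy_model/1")
```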
- You cannot have ragged inputs in a single request; this payload is rejected:

```json
{ "instances": [ [0, 1, 2, 3, 4], [0, 1, 2, 3, 4, 5] ] }
```
- You can have different sequence lengths in different requests, as long as request batching is disabled.
  If batching is enabled, you get this error:

```
{ "error": "Tensors with name 'serving_default_lambda_input:0' from different tasks have different shapes and padding is turned off. Set pad_variable_length_inputs to true, or ensure that all tensors with the same name have equal dimensions starting with the first dim." }
```
- There's no documentation for the `pad_variable_length_inputs` parameter (the one the error above tells you to set), but enabling it causes the server to abort with the following error:
```
2022-12-09 00:39:00.445797: F external/org_tensorflow/tensorflow/core/framework/tensor_util.cc:94] Check failed: offset + from_data.size() <= to_data.size() (1880 vs. 120)
/usr/bin/tf_serving_entrypoint.sh: line 3:     7 Aborted                 tensorflow_model_server --port=8500 --rest_api_port=8501 --model_name=${MODEL_NAME} --model_base_path=${MODEL_BASE_PATH}/${MODEL_NAME} "$@"
```
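For completeness, batching is configured through a protobuf text file passed via `--batching_parameters_file` alongside `--enable_batching`. A sketch of the kind of file that triggers the crash above; the numeric values are placeholders, and `pad_variable_length_inputs` is the field the batching error refers to:

```
max_batch_size { value: 32 }
batch_timeout_micros { value: 1000 }
pad_variable_length_inputs: true
```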