
Cache models in a class attribute to avoid one network request per provider #133

Merged: julien-nc merged 1 commit into main from enh/noid/factorize-getmodels-per-service-instance on Sep 27, 2024

Conversation

@julien-nc (Member) commented on Sep 27, 2024

As OpenAI apparently has a rate limit on the models endpoint, page loading and requests to /ocs/v2.php/apps/assistant/api/v1/task-types can be slowed down quite a bit, because each provider fetches the list of models to populate the enum values in its optional input shapes.

Thankfully, they all use the same instance of OpenAiAPIService, so the response of the models request can simply be cached in a class attribute.
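
A minimal sketch of the idea, assuming a per-instance property on the shared service (the property and method names here are hypothetical, not the actual OpenAiAPIService code):

```php
<?php

class OpenAiAPIService {
	/** Cached list of models, shared by every provider that uses this service instance */
	private ?array $modelsCache = null;

	public function getModels(string $userId): array {
		// Only the first call during a request hits the network; later calls reuse the cache
		if ($this->modelsCache === null) {
			$this->modelsCache = $this->fetchModelsFromApi($userId);
		}
		return $this->modelsCache;
	}

	// Hypothetical stand-in for the actual HTTP call to the models endpoint
	private function fetchModelsFromApi(string $userId): array {
		return [];
	}
}
```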

@kyteinsky (Contributor)

This cache will be lost for different requests though, no? Different request -> new instance of the service. Would be better to use the local cache with userid in the key, what say you?
Also, with a timeout.
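
For illustration only, a sketch of what this suggestion could look like with Nextcloud's OCP ICacheFactory/ICache; the cache prefix, key format, TTL, and helper method are assumptions, not code from this PR:

```php
<?php

use OCP\ICache;
use OCP\ICacheFactory;

class OpenAiAPIService {
	private ICache $cache;

	public function __construct(ICacheFactory $cacheFactory) {
		// Local (per-server) cache; the 'integration_openai' prefix is illustrative
		$this->cache = $cacheFactory->createLocal('integration_openai');
	}

	public function getModels(string $userId): array {
		$cacheKey = 'models_' . $userId;
		$models = $this->cache->get($cacheKey);
		if ($models !== null) {
			return $models;
		}
		$models = $this->fetchModelsFromApi($userId);
		// Expire after e.g. 5 minutes so an outdated model list eventually refreshes
		$this->cache->set($cacheKey, $models, 5 * 60);
		return $models;
	}

	// Hypothetical stand-in for the actual HTTP call to the models endpoint
	private function fetchModelsFromApi(string $userId): array {
		return [];
	}
}
```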

@julien-nc (Member, Author)

> This cache will be lost for different requests though, no?

Yes

> Would be better to use the local cache with userid in the key, what say you?

With a local cache we could get outdated values. Even if the model list should not change very often, we can't be sure it won't.

> Also, with a timeout.

Timeout on what? The request to the models endpoint?

@julien-nc merged commit a02bcd8 into main on Sep 27, 2024
8 checks passed
@kyteinsky deleted the enh/noid/factorize-getmodels-per-service-instance branch on September 27, 2024 14:40
@julien-nc mentioned this pull request on Sep 28, 2024
Labels: 3. to review, enhancement (New feature or request)
2 participants