You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When trying to benchmark bert model with latest ZenDNN release, I'm facing the similar degradation issue with this issue: #5. And I tried the fix in it but the issue still exist.
Do we have any benchmark result for Bert model on CPU, how much throughput improvement should I expect with ZenDNN.
To reproduce
No now.
Urgency
No response
Platform
Linux
OS Version
Ubuntu 22.04
ONNX Runtime Installation
Released Package
ONNX Runtime Version or Commit ID
v1.17
ONNX Runtime API
Python
Architecture
X64
Execution Provider
Default CPU, Other / Unknown
Execution Provider Library Version
No response
Model File
No response
Is this a quantized model?
No
The text was updated successfully, but these errors were encountered:
Hi @ajeet1203singh, yes I tried the recommended settings of that issue, and I also replied in that issue. I'm trying the same command in that issue (the bert varint, the batch size, and the sequence length).
Describe the issue
When trying to benchmark bert model with latest ZenDNN release, I'm facing the similar degradation issue with this issue: #5. And I tried the fix in it but the issue still exist.
Do we have any benchmark result for Bert model on CPU, how much throughput improvement should I expect with ZenDNN.
To reproduce
No now.
Urgency
No response
Platform
Linux
OS Version
Ubuntu 22.04
ONNX Runtime Installation
Released Package
ONNX Runtime Version or Commit ID
v1.17
ONNX Runtime API
Python
Architecture
X64
Execution Provider
Default CPU, Other / Unknown
Execution Provider Library Version
No response
Model File
No response
Is this a quantized model?
No
The text was updated successfully, but these errors were encountered: