
Implement some missing element wise Add/Sub/Mul/Div/Neg operations for CPU and CUDA EPs #23090

Open · wants to merge 1 commit into base: main

Conversation

@Zyrin commented Dec 12, 2024

Description

  • [CPU EP] Implement Add/Sub/Mul/Div element-wise operations for (u)int8, (u)int16, uint32, and uint64.
  • [CPU EP] Implement the Neg unary operation for int16.
  • [CUDA EP] Implement Add/Sub/Mul/Div element-wise operations for (u)int8 and (u)int16.

Motivation and Context

This solves #23051
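
For context, the operations in question are ordinary element-wise integer arithmetic over the newly supported fixed-width dtypes. A minimal numpy sketch (numpy is only an illustration here, standing in for the ORT kernels; note that Div on integer tensors is integer division):

```python
import numpy as np

# Element-wise Add/Sub/Mul/Div on uint8 tensors, one of the dtypes
# this PR adds kernels for (values chosen to avoid overflow).
a = np.array([10, 20, 30], dtype=np.uint8)
b = np.array([3, 4, 5], dtype=np.uint8)

add = a + b    # [13, 24, 35]
sub = a - b    # [7, 16, 25]
mul = a * b    # [30, 80, 150]
div = a // b   # integer division: [3, 5, 6]

# Neg is a unary op; the PR adds it for int16 on the CPU EP.
n = np.negative(np.array([1, -2, 3], dtype=np.int16))  # [-1, 2, -3]
```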

@tianleiwu (Contributor)

This will increase binary size. Are the missing types used in any real model?

@Zyrin (Author) commented Dec 12, 2024

@microsoft-github-policy-service agree company="Cellumation"

@Zyrin (Author) commented Dec 12, 2024

I do not know whether any "real" model uses these types for these operations.
I tried to use uint8 operations in my own model and found that onnxruntime did not support them, even though the ONNX documentation lists them as supported. So I went ahead and implemented all of the missing types.
The binary size grows by <0.6% for libonnxruntime.so and <0.3% for libonnxruntime_providers_cuda.so.
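
The quoted growth figures can be reproduced from the shared-library sizes before and after the change; a trivial sketch (the byte counts below are made-up placeholders, not measurements from this PR):

```python
import os

def size_growth_percent(before: int, after: int) -> float:
    """Percentage growth of a binary from `before` to `after` bytes."""
    return (after - before) / before * 100.0

# Hypothetical: a 20 MB libonnxruntime.so growing by ~100 KB.
# Real sizes would come from os.path.getsize() on the built libraries.
before = 20_000_000
after = 20_100_000
print(f"{size_growth_percent(before, after):.2f}%")  # 0.50%
```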

@xadupre (Member) commented Dec 17, 2024

This may increase the binary size. +@scottmckay

@tianleiwu (Contributor) commented Dec 17, 2024

@Zyrin, please follow https://github.com/microsoft/onnxruntime/blob/main/docs/Coding_Conventions_and_Standards.md#linting to format the code.

You will also need to update the documentation (you can find the updated documents in the artifacts of the Windows GPU Doc Gen CI Pipeline under Checks).

@tianleiwu (Contributor)

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline

@tianleiwu (Contributor)

/azp run Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,orttraining-linux-gpu-ci-pipeline,onnxruntime-binary-size-checks-ci-pipeline,Big Models,Linux Android Emulator QNN CI Pipeline,Android CI Pipeline

@tianleiwu (Contributor)

/azp run iOS CI Pipeline,ONNX Runtime React Native CI Pipeline,CoreML CI Pipeline,Linux DNNL CI Pipeline,Linux MIGraphX CI Pipeline,Linux ROCm CI Pipeline

Azure Pipelines successfully started running 6 pipeline(s).

Azure Pipelines successfully started running 9 pipeline(s).

Azure Pipelines successfully started running 10 pipeline(s).

@Zyrin force-pushed the main branch 2 times, most recently from 7fcfd3b to 92d0502 on December 18, 2024 08:58
@Zyrin (Author) commented Dec 18, 2024

I applied the linting fixes. @tianleiwu could you restart the pipelines?

@tianleiwu (Contributor)

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline

@tianleiwu (Contributor)

/azp run Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,onnxruntime-binary-size-checks-ci-pipeline,Big Models,Linux Android Emulator QNN CI Pipeline

@tianleiwu (Contributor)

/azp run Android CI Pipeline,iOS CI Pipeline,ONNX Runtime React Native CI Pipeline,CoreML CI Pipeline,Linux DNNL CI Pipeline,Linux MIGraphX CI Pipeline,Linux ROCm CI Pipeline

Azure Pipelines successfully started running 7 pipeline(s).

Azure Pipelines successfully started running 8 pipeline(s).

Azure Pipelines successfully started running 10 pipeline(s).

@tianleiwu (Contributor)

@Zyrin, some build pipelines failed. You need to update the unit tests to run on the CUDA and CPU providers only; see the existing examples in the same test file.

You will also need to update the operator documentation (you can get it from the artifacts of the Windows GPU Doc Gen CI Pipeline).

@Zyrin (Author) commented Jan 8, 2025

@tianleiwu I assume you want me to run the tests only on the CPU and CUDA EPs, as in the following code snippet from element_wise_ops_test.cc:1837:

if (nullptr != DefaultCpuExecutionProvider()) {
  std::vector<std::unique_ptr<IExecutionProvider>> execution_providers;
  execution_providers.push_back(DefaultCpuExecutionProvider());
  test.Run(OpTester::ExpectResult::kExpectSuccess, "", {}, nullptr, &execution_providers);
}
if (nullptr != DefaultCudaExecutionProvider()) {
  std::vector<std::unique_ptr<IExecutionProvider>> execution_providers;
  execution_providers.push_back(DefaultCudaExecutionProvider());
  test.Run(OpTester::ExpectResult::kExpectSuccess, "", {}, nullptr, &execution_providers);
}

Alternatively, I could exclude the TensorRT and DNNL EPs, but I do not know whether there are EPs that are not tested here and would therefore fail for someone else.

On that note, should I change only the failing tests or all of the tests I added?

@tianleiwu (Contributor)

> @tianleiwu I assume you want me to run the tests only on the CPU and CUDA EPs, as in the following code snippet from element_wise_ops_test.cc:1837:

Right. You can follow that code snippet to fix the failing tests introduced by this change.

…(u)int8, (u)int16, uint32 and uint64 as well as Neg unary operation for int16 on CPU EP and implement Add/Sub/Mul/Div element wise operations for (u)int8 and (u)int16 on CUDA EP
@Zyrin (Author) commented Jan 9, 2025

I fixed the tests. Is there a way for me to generate the docs locally, or is the easiest way to trigger the Windows GPU Doc Gen CI Pipeline?
