Profiler result is not consistent with each run #794

Open

RavikumarLav opened this issue Dec 17, 2024 · 0 comments

RavikumarLav commented Dec 17, 2024

Hello,

I am using the code below to capture the runtime of model inference:

// Import the TensorFlow Lite model.
armnnTfLiteParser::ITfLiteParserPtr parser = armnnTfLiteParser::ITfLiteParser::Create();

armnn::INetworkPtr network = parser->CreateNetworkFromBinaryFile("model_latest.tflite");

// Find the binding points for the input and output nodes  
armnnTfLiteParser::BindingPointInfo inputBindingInfo = parser->GetNetworkInputBindingInfo(0, "conv2d_input");
armnnTfLiteParser::BindingPointInfo outputBindingInfo = parser->GetNetworkOutputBindingInfo(0, "Identity");

// Create ArmNN runtime
armnn::IRuntime::CreationOptions options; // default options
armnn::IRuntimePtr runtime = armnn::IRuntime::Create(options);

armnn::Compute device = armnn::Compute::CpuAcc;
//armnn::Compute device = armnn::Compute::CpuRef;
armnn::IOptimizedNetworkPtr optNet = armnn::Optimize(*network, {device}, runtime->GetDeviceSpec());
// Load the optimized network onto the runtime device
armnn::NetworkId networkIdentifier;
runtime->LoadNetwork(networkIdentifier, std::move(optNet));

// Get the profiler registered for this network and enable profiling.
std::shared_ptr<armnn::IProfiler> profiler = runtime->GetProfiler(networkIdentifier);
profiler->EnableProfiling(true);

// Run Inference
armnn::InputTensors inputTensor = MakeInputTensors(inputBindingInfo, &input[0]);
armnn::OutputTensors outputTensor = MakeOutputTensors(outputBindingInfo, &output[0]);
armnn::Status ret = runtime->EnqueueWorkload(networkIdentifier, inputTensor, outputTensor);

// Print output
profiler->Print(std::cout);

With this I am able to see the profiler result for each layer in JSON format.
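As a side note, to compare the per-layer output of different runs more easily, I believe the same Print call can also be pointed at a file stream (the file name below is just an example):

#include <fstream>

std::ofstream profileFile("profile_run.json");  // example file name, one file per run
profiler->Print(profileFile);                   // same JSON report, written to a file for diffing runs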

Problem: When running the .tflite model on an Arm Cortex-A78 core with CpuAcc as the backend, the reported runtime is different for each run of the same model.

For one of the models it varies from 0.8 ms to 1.2 ms.

I need to know how the runtime is measured: using the system clock or Arm hardware counter registers?
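As an extra data point, this is a rough sketch of how I cross-check the numbers outside the ArmNN profiler, reusing the runtime, networkIdentifier, inputTensor and outputTensor created above; the warm-up and iteration counts are only illustrative:

#include <chrono>
#include <iostream>

constexpr int kWarmupRuns = 5;   // illustrative: discard the first, typically slower, iterations
constexpr int kTimedRuns  = 50;  // illustrative: average over repeated runs

for (int i = 0; i < kWarmupRuns; ++i)
{
    runtime->EnqueueWorkload(networkIdentifier, inputTensor, outputTensor);
}

double totalMs = 0.0;
for (int i = 0; i < kTimedRuns; ++i)
{
    auto start = std::chrono::steady_clock::now();
    runtime->EnqueueWorkload(networkIdentifier, inputTensor, outputTensor);
    auto stop = std::chrono::steady_clock::now();
    totalMs += std::chrono::duration<double, std::milli>(stop - start).count();
}

std::cout << "Average inference time: " << totalMs / kTimedRuns << " ms" << std::endl;

Even with this external measurement the individual iterations vary, so I would still like to know what clock source the profiler itself uses.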
