
Workload tracking improvement - emit event both at the beginning and end of each workflow operation #53

Open
ashvina opened this issue May 31, 2023 · 2 comments
ashvina commented May 31, 2023

An LST-Bench workload comprises various operations, including phases, tasks, and statements. Currently, for tracking purposes, the workload executor emits a telemetry event at the end of each operation. The event payload indicates the operation's start and end times, as well as the status of its execution. However, this approach poses a problem in error situations and makes it harder to track progress until an operation completes: it becomes unclear whether the operation even started. To address this, I propose enhancing workload execution tracking by emitting an additional event at the beginning of each operation, alongside the existing event at the end. Note that a missing end event would then indicate operation failure.

This approach partially borrows from popular event frameworks such as XEvents in SQL Server. For example, SQL Server emits an extended event when a SQL statement starts executing, and can emit another event when the statement completes or fails.

Another advantage of this change is its potential for extension to other components of the benchmark, going beyond just the workload tracking.

As such, each event will carry the following fields:

  1. experiment id (experiment run identifier)
  2. timestamp
  3. event name (e.g. operation started, operation completed, operation failed, etc.)
  4. event id (e.g. operation instance identifier. Start and end events of an operation carry the same id)
  5. operation name (e.g. operation identifier from the workload config file)
  6. event payload [optional]
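To make the pairing concrete, here is a minimal sketch of the proposed scheme. All names (`Event`, `run_operation`, the field names) are illustrative assumptions, not LST-Bench's actual API; the point is that the start and end events of one operation share the same event id, so a start event without a matching end event signals a failure or an in-flight operation.

```python
import time
import uuid
from dataclasses import dataclass
from typing import Callable, Optional

@dataclass
class Event:
    # Hypothetical field names mirroring the proposal above;
    # not LST-Bench's actual event schema.
    experiment_id: str
    timestamp: float
    event_name: str      # e.g. "operation_started", "operation_completed", "operation_failed"
    event_id: str        # shared by the start and end events of one operation
    operation_name: str  # operation identifier from the workload config file
    payload: Optional[dict] = None

def run_operation(experiment_id: str, operation_name: str,
                  body: Callable[[], None]) -> list:
    """Emit a start event, run the operation, then emit a matching end event."""
    events = []
    op_instance_id = str(uuid.uuid4())  # ties the start/end pair together
    events.append(Event(experiment_id, time.time(), "operation_started",
                        op_instance_id, operation_name))
    try:
        body()
        events.append(Event(experiment_id, time.time(), "operation_completed",
                            op_instance_id, operation_name))
    except Exception as e:
        events.append(Event(experiment_id, time.time(), "operation_failed",
                            op_instance_id, operation_name, {"error": str(e)}))
    return events
```

With this shape, an analysis script can compute an operation's duration by joining its start and end events on `event_id`, and can detect operations that never completed by looking for unmatched start events.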

This change will also require updates to the analysis scripts that depend on both start and end times.

@ashvina ashvina self-assigned this May 31, 2023
@ashvina ashvina changed the title Workload tracking improvement - emit event both at the beginning and end of each workflow building block Workload tracking improvement - emit event both at the beginning and end of each workflow operation May 31, 2023
ashvina commented May 31, 2023

@jcamachor what do you think about the proposal?

@jcamachor

Thanks for putting together this proposal, @ashvina! I think it makes sense and heads in the right direction. As the project evolves, if more flexibility is needed, maybe we can create more complex extensions, which, if I'm not mistaken, is something the XEvent framework also considers?
When it comes to the analysis scripts, I hope we can get testing for the metrics module shortly (#22), including some integration testing between the core and metrics modules as well. That way we can catch any issues more easily moving forward.

CCing @anjagruenheid for awareness.

ashvina added a commit to ashvina/lst-bench that referenced this issue Jun 6, 2023
This commit introduces two changes. First, it changes the type of the experiment
id from UUID to timestamp, mainly to reuse an existing id and improve the
readability of events. This simplifies experiment analysis when the database
contains results from many tests.

Second, it adds an initial test framework to validate the event stream
generated by the benchmark executor. The test does not actually execute any SQL
statements; a mock simulates the execution so that just the executor's
behavior can be validated.

Addresses microsoft#53
ashvina added a commit that referenced this issue Jun 15, 2023
This commit introduces two changes. First, it changes the type of the experiment
id from UUID to timestamp, mainly to reuse an existing id and improve the
readability of events. This simplifies experiment analysis when the database
contains results from many tests.

Second, it adds an initial test framework to validate the event stream
generated by the benchmark executor. The test does not actually execute any SQL
statements; a mock simulates the execution so that just the executor's
behavior can be validated.

Addresses #53