Skip to content

Question:Why use torch.cuda.Event instead of torch.profiler for performance measurement? #47

@study-never-alone

Description

@study-never-alone

Hi~ I've noticed that using torch.cuda.Event for measuring kernel performance can lead to significant timing variance across multiple runs, affecting stability. Given that torch.profiler offers more stable and robust measurements of pure kernel execution time, was there a specific reason for choosing cuda.Event in the benchmark's design?

Metadata

Metadata

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions