Description
Coincidentally, I worked on a very similar project a year ago, where I built a version of KernelBench focused on linear algebra kernels (Link) written in raw CUDA. As part of that effort, I also developed a tool called PowerAPI that measures power consumption using NVIDIA NVML. Its design and purpose are closely aligned with those of KernelBench.
If the community is interested, I would be happy to:
- Extend PowerAPI to provide Python bindings
- Integrate it directly into KernelBench as an optional module for per-kernel power measurement
This could help users benchmark kernels not only on execution time but also on energy cost per operation, enabling more sustainable and realistic comparisons.
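For illustration, per-kernel energy measurement could look roughly like the Python sketch below. It does not reflect PowerAPI's actual interface: `PowerSampler`, `integrate_energy`, and `nvml_power_reader` are hypothetical names, and I am assuming the `nvidia-ml-py` package (imported as `pynvml`) for NVML access. The sampler polls a power reader in a background thread and integrates the samples with the trapezoidal rule to estimate energy in joules.

```python
import threading
import time
from typing import Callable, List, Tuple


def nvml_power_reader(device_index: int = 0) -> Callable[[], float]:
    """Return a callable that reads GPU power draw in watts through NVML.

    Assumes the nvidia-ml-py package and an NVIDIA GPU are available.
    """
    import pynvml  # imported lazily so the rest of the sketch runs without a GPU

    pynvml.nvmlInit()
    handle = pynvml.nvmlDeviceGetHandleByIndex(device_index)
    # nvmlDeviceGetPowerUsage reports milliwatts; convert to watts.
    return lambda: pynvml.nvmlDeviceGetPowerUsage(handle) / 1000.0


def integrate_energy(samples: List[Tuple[float, float]]) -> float:
    """Trapezoidal integration of (timestamp_s, power_w) samples, in joules."""
    energy = 0.0
    for (t0, p0), (t1, p1) in zip(samples, samples[1:]):
        energy += 0.5 * (p0 + p1) * (t1 - t0)
    return energy


class PowerSampler:
    """Hypothetical context manager that polls a power reader while a kernel runs."""

    def __init__(self, read_power: Callable[[], float], interval_s: float = 0.01):
        self._read_power = read_power
        self._interval_s = interval_s
        self._stop = threading.Event()
        self._thread = None
        self.samples: List[Tuple[float, float]] = []

    def _run(self) -> None:
        # Poll until asked to stop; wait() doubles as an interruptible sleep.
        while not self._stop.wait(self._interval_s):
            self.samples.append((time.perf_counter(), self._read_power()))

    def __enter__(self) -> "PowerSampler":
        self._stop.clear()
        # Take a first sample immediately so the interval is bracketed.
        self.samples = [(time.perf_counter(), self._read_power())]
        self._thread = threading.Thread(target=self._run, daemon=True)
        self._thread.start()
        return self

    def __exit__(self, *exc) -> bool:
        self._stop.set()
        self._thread.join()
        # Take a closing sample so at least two points always exist.
        self.samples.append((time.perf_counter(), self._read_power()))
        return False

    @property
    def energy_j(self) -> float:
        return integrate_energy(self.samples)
```

With a GPU present, usage would be along the lines of `with PowerSampler(nvml_power_reader(0)) as s: run_kernel()` followed by reading `s.energy_j`, where `run_kernel` is a placeholder. Because CUDA launches are asynchronous, the kernel should be synchronized before the `with` block exits, and since NVML power readings refresh at a limited rate, very short kernels would likely need to be repeated many times to get a meaningful estimate.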
Please let me know if a PR in this direction would be welcomed.