Hi there,
I've deployed the library on a server with 20 cores and 40 physical threads in SMT, and interacting with the library with 40 logical threads. Due to the internal usage of OpenMP, this spawns 40 * 40 = 3600 logical threads in the machine, making the execution rather inefficient. Is there a way to constrain the number of threads used by stinger/openmp (e.g. worker pool), or am I better off using a single logical thread to operate with the library?
linux, gcc 9, libgomp