tf::RoundRobinOptimizer class

class to capture the described graph into a native cudaGraph using a greedy round-robin algorithm on a fixed number of streams

Constructors, destructors, conversion operators

RoundRobinOptimizer()
constructs a round-robin optimizer with 4 streams by default
RoundRobinOptimizer(size_t num_streams)
constructs a round-robin optimizer with the given number of streams

Public functions

auto num_streams() const -> size_t
queries the number of streams used by the optimizer
void num_streams(size_t n)
sets the number of streams used by the optimizer