Runtime intercept tables#

Although most tools will want to leverage the callback or buffer tracing services for tracing the HIP, HSA, and ROCTx APIs, rocprofiler-sdk does provide access to the raw API dispatch tables. Each of the aforementioned APIs are designed similar to the following sample.

Dispatch Table Overview#

Forward Declaration of public C API function#

extern "C"
{
// forward declaration of public C API function
int
foo(int) __attribute__((visibility("default")));
}

Internal Implementation of API function#

namespace impl
{
int
foo(int val)
{
    // real implementation
    return (2 * val);
}
}

Dispatch Table Implementation#

namespace impl
{
struct dispatch_table
{
    int (*foo_fn)(int) = nullptr;
};

// invoked once: populates the dispatch_table with function pointers to implementation
dispatch_table*&
construct_dispatch_table()
{
    static dispatch_table* tbl = new dispatch_table{};
    tbl->foo_fn                = impl::foo;

    // in between above and below, rocprofiler-sdk gets passed the pointer
    // to the dispatch table and has the opportunity to wrap the function
    // pointers for interception

    return tbl;
}

// constructs dispatch table and stores it in static variable
dispatch_table*
get_dispatch_table()
{
    static dispatch_table*& tbl = construct_dispatch_table();
    return tbl;
}
}  // namespace impl

Implementaiton of public C API function#

extern "C"
{
// implementation of public C API function
int
foo(int val)
{
    return impl::get_dispatch_table()->foo_fn(val);
}
}

Dispatch Table Chaining#

rocprofiler-sdk is given an opportunity within impl::construct_dispatch_table() to save the original value(s) of the function pointers such as foo_fn and install it’s own function pointers in its place – this results in the public C API function foo calling into the rocprofiler-sdk function pointer, which then in turn, calls the original function pointer to impl::foo (this is called “chaining”). Once rocprofiler-sdk has made any necessary modifications to the dispatch table, tools which indicated they also want access to the raw dispatch table via rocprofiler_at_intercept_table_registration will be passed the pointer to the dispatch table.

Sample#

For a demo of dispatch table chaining, please see the samples/intercept_table example in the rocprofiler-sdk GitHub repository.