Inference Generator

caffe2openvx: Convert a pre-trained CAFFE model into a C library for use by applications.

- Extract the neural network model from deploy.prototxt
  - generate C code that instantiates OpenVX kernels from the vx_nn module
  - generate build scripts that package the C code into a library
  - the generated C code or library can be integrated into an application for running inference
- Extract weights and biases from weights.caffemodel into separate folders for use by the C library during initialization
- Also generate a GDF for quick prototyping and kernel debugging

The generated C code declares two functions in annmodule.h: annGetTensorDimensions and one of the annCreateGraph variants listed after it:

void annGetTensorDimensions(
        vx_size dimInput[4],    // input tensor dimensions
        vx_size dimOutput[4]    // output tensor dimensions
    );

vx_graph annCreateGraph(
        vx_context context,     // OpenVX context
        vx_tensor input,        // input tensor
        vx_tensor output,       // output tensor
        const char * dataFolder // folder with weights and biases
    );
or
vx_graph annCreateGraphWithInputImage(
        vx_context context,     // OpenVX context
        vx_image input,         // input image (RGB or U8)
        vx_tensor output,       // output tensor
        const char * dataFolder // folder with weights and biases
    );
or
vx_graph annCreateGraphWithInputImageWithArgmaxTensor(
        vx_context context,     // OpenVX context
        vx_image input,         // input image (RGB or U8)
        vx_tensor output,       // output tensor
        const char * dataFolder // folder with weights and biases
    );
or
vx_graph annCreateGraphWithInputImageWithArgmaxImage(
        vx_context context,     // OpenVX context
        vx_image input,         // input image (RGB or U8)
        vx_image output,        // output image (U8)
        const char * dataFolder // folder with weights and biases
    );
or
vx_graph annCreateGraphWithInputImageWithArgmaxImageWithLut(
        vx_context context,     // OpenVX context
        vx_image input,         // input image (RGB or U8)
        vx_image output,        // output image (RGB)
        const char * dataFolder // folder with weights and biases
    );

annGetTensorDimensions: lets an application query the dimensions of the input and output tensors
annCreateGraph (or another variant above): creates and initializes a graph with the trained neural network for inference
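Once annGetTensorDimensions has filled dimInput and dimOutput, an application usually needs the element count of each tensor to size its host-side buffers. The following is a minimal sketch of that arithmetic, not part of the generated code: the helper names ann_tensor_element_count and ann_tensor_byte_size are hypothetical, and size_t stands in for vx_size so the block compiles without the OpenVX headers.

```c
#include <stddef.h>

/* Number of elements in a 4-D tensor, given the four dimension values
 * that annGetTensorDimensions() writes into dimInput or dimOutput.
 * Hypothetical helper: not generated by caffe2openvx. */
size_t ann_tensor_element_count(const size_t dims[4])
{
    size_t count = 1;
    for (int i = 0; i < 4; i++)
        count *= dims[i];
    return count;
}

/* Bytes needed for a host-side copy of the tensor, given the size of one
 * element (for example sizeof(float) for a 32-bit float tensor). */
size_t ann_tensor_byte_size(const size_t dims[4], size_t element_size)
{
    return ann_tensor_element_count(dims) * element_size;
}
```

For dimensions 32, 32, 3, 32 the element count is 98304. The product does not depend on the order in which the four dimensions are stored, so the helper works without assuming a particular layout.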

Command-line Usage

  % caffe2openvx
        [options]
        <net.prototxt|net.caffemodel>
        [n c H W [type fixed-point-position [convert-policy round-policy]]]

option	description
--(no-)error-messages	do/don’t enable error messages (default: ON)
--(no-)virtual-buffers	do/don’t use virtual buffers (default: ON)
--(no-)generate-gdf	do/don’t generate RunVX GDF with weight/bias initialization (default: ON)
--(no-)generate-vx-code	do/don’t generate OpenVX C Code with weight/bias initialization (default: ON)
--output-dir	specify output folder for weights/biases, GDF, and OpenVX C Code (default: current)
--input-rgb	convert input from RGB image into tensor using (a*x+b) conversion: rev=(BGR?1:0)
--input-u8	convert input from U8 image into tensor using (a*x+b) conversion
--argmax-tensor u8/u16 k	return argmax output with specified tensor type and top_k
--argmax-image u8/u16	return argmax output with specified image type
--argmax-lut <rgbLut.txt>	argmax color table: one R G B entry per label
--flags	specify custom flags (default: 0)
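The (a*x+b) conversion named for the image-input options can be pictured as a per-sample scale and offset, with rev selecting whether the channel order is reversed. The sketch below illustrates that arithmetic on the host. It is not the generated OpenVX code: the function name rgb_to_tensor, the interleaved 8-bit input, and the planar float output layout are all assumptions made for illustration.

```c
#include <stddef.h>
#include <stdint.h>

/* Convert an interleaved 8-bit RGB image into a planar float tensor,
 * applying out = a * x + b to every sample.
 * rev != 0 reverses the channel order, so the output planes are B, G, R.
 * Assumption: output is stored as three consecutive width*height planes. */
void rgb_to_tensor(const uint8_t *rgb, size_t width, size_t height,
                   float a, float b, int rev, float *tensor)
{
    size_t plane = width * height;
    for (size_t i = 0; i < plane; i++) {
        for (size_t c = 0; c < 3; c++) {
            /* Destination plane for source channel c. */
            size_t dst = rev ? (2 - c) : c;
            tensor[dst * plane + i] = a * (float)rgb[i * 3 + c] + b;
        }
    }
}
```

A common choice would be a = 1/255 and b = 0 to map pixel values into the range 0 to 1, but the right values depend on how the model was trained.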

Example

Make sure that the directories containing the executables and libraries are listed in the PATH and LD_LIBRARY_PATH environment variables.

% export PATH=$PATH:/opt/rocm/bin
% export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/opt/rocm/lib

The log below outlines a simple use case of the inference generator.

% caffe2openvx weights.caffemodel 1 3 32 32
% caffe2openvx deploy.prototxt 1 3 32 32
% ls
CMakeLists.txt   annmodule.txt   cmake              weights
annmodule.cpp    anntest.cpp     deploy.prototxt    weights.caffemodel
annmodule.h      bias            net.gdf
% mkdir build
% cd build
% cmake ..
% make
% cd ..
% ls build
CMakeCache.txt  Makefile        cmake_install.cmake
CMakeFiles      anntest         libannmodule.so
% ./build/anntest
OK: annGetTensorDimensions() => [input 32x32x3x32] [output 1x1x10x32]

anntest.cpp is a simple program that initializes and runs the neural network using the annmodule library.
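After inference, a classifier's output tensor still has to be turned into labels. The log above reports an output of 1x1x10x32, which this sketch reads as 10 scores for each of 32 batch items; that interpretation is an assumption. The helpers argmax_scores and argmax_batch are hypothetical and operate on a host-side float copy of the output. They do not show what anntest.cpp itself does.

```c
#include <stddef.h>

/* Index of the largest score among num_classes values (top-1 label).
 * Ties resolve to the lowest index. Returns 0 when num_classes is 0. */
size_t argmax_scores(const float *scores, size_t num_classes)
{
    size_t best = 0;
    for (size_t i = 1; i < num_classes; i++) {
        if (scores[i] > scores[best])
            best = i;
    }
    return best;
}

/* Top-1 label for every batch item. Assumption: the scores of each item
 * are stored as consecutive runs of num_classes floats. */
void argmax_batch(const float *scores, size_t batch, size_t num_classes,
                  size_t *labels)
{
    for (size_t n = 0; n < batch; n++)
        labels[n] = argmax_scores(scores + n * num_classes, num_classes);
}
```

When the labels are needed inside the graph rather than on the host, the --argmax-tensor and --argmax-image options listed under Command-line Usage request argmax output directly.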
