MIVisionX Live Image Classification

MIVisionX Live Image Classification#

This application runs know CNN image classifiers on live or pre-recorded video streams.

MIVisionX Image Classification Control#

MIVisionX Image Classification#

Usage#

Prerequisites#

Ubuntu 16.04 / 18.04 or CentOS 7.5 / 7.6
ROCm supported hardware
ROCm
Build & Install MIVisionX

Build#

git clone https://github.com/ROCm/MIVisionX.git
cd MIVisionX/apps/mivisionx_openvx_classifier
mkdir build
cd build
cmake ../
make

Run#

Usage: ./classifier	--label <label text> [required]
 					--video <video file> / --capture <0> [required]
 					--googlenet <googlenet weights.bin> [optional]
 					--inception <inceptionV4 weights.bin> [optional]
 					--resnet50 <resnet50 weights.bin> [optional]
 					--resnet101 <resnet101 weights.bin> [optional]
 					--resnet152 <resnet152 weights.bin> [optional]
 					--vgg16 <vgg16 weights.bin> [optional]
 					--vgg19 <vgg19 weights.bin> [optional]

Note* All the models are optional, but one of the supported model weights.bin is required

Supported Models#

Generating weights.bin for different Models#

Download or train your caffemodel for the supported models listed above.

Here is the sample download link that contains all the prototxt:

Use MIVisionX Model Compiler to extract weights.bin from the pre-trained caffe models

Note: MIVisionX installs all the model compiler scripts in /opt/rocm/libexec/mivisionx/model_compiler/python/ folder

Convert the pre-trained caffemodel into AMD NNIR model:

% python /opt/rocm/libexec/mivisionx/model_compiler/python/caffe_to_nnir.py <net.caffeModel> <nnirOutputFolder> --input-dims <n,c,h,w> [--verbose <0|1>]

Sample:

``` 
% python /opt/rocm/libexec/mivisionx/model_compiler/python/caffe_to_nnir.py VGG_ILSVRC_16_layers.caffemodel VGG16_NNIR --input-dims 1,3,224,224
```

Convert an AMD NNIR model into OpenVX C code:

% python /opt/rocm/libexec/mivisionx/model_compiler/python/nnir_to_openvx.py <nnirModelFolder> <nnirModelOutputFolder>

Sample:

``` 
% python /opt/rocm/libexec/mivisionx/model_compiler/python/nnir_to_openvx.py VGG16_NNIR VGG16_OpenVX
```

Note: The weights.bin file will be generated inside the OpenVX folder and you can use that as an input for this project.

–label #

Use labels.txt or simple_labels.txt file in the data folder

–video #

Run classification on pre-recorded video with this option.

–capture <0>#

Run classification on the live camera feed with this option.

Note: –video and –capture options are not supported concurrently

Sample Runs#

Run VGG 16 Classification on Live Video#

Step 1: Install all the Prerequisites