NVIDIA
NVIDIA
TAO Toolkit Computer Vision Sample Workflows
Resource
NVIDIA
NVIDIA
TAO Toolkit Computer Vision Sample Workflows

Sample notebooks for TAO Toolkit Computer Vision workflows.

TAO CV Sample Workflows

Train Adapt Optimize (TAO) Toolkit is a python based AI toolkit for taking purpose-built pre-trained AI models and customizing them with your own data. TAO adapts popular network architectures and backbones to your data, allowing you to train, fine tune, prune and export highly optimized and accurate AI models for edge deployment.

The pre-trained models accelerate the AI training process and reduce costs associated with large scale data collection, labeling, and training models from scratch. Transfer learning with pre-trained models can be used for AI applications in smart cities, retail, healthcare, industrial inspection and more.

Build end-to-end services and solutions for transforming pixels and sensor data to actionable insights using TAO, DeepStream SDK and TensorRT. TAO can train models for common vision AI tasks such as object detection, classification, instance segmentation as well as other complex tasks such as facial landmark, gaze estimation, heart rate estimation and others.

This resource lists out several sample notebooks to walk you through full training workflow using TAO 3.0.

Getting Started

To get started, first choose the model architecture that you want to build, then select the appropriate model card on NGC and then choose one of the supported backbones.

LOGO

Running TAO Toolkit

  1. Setup your python environment using python virtualenv and virtualenvwrapper.

  2. In TAO Toolkit, we have created an abstraction above the container, you will launch all your training jobs from the launcher. No need to manually pull the appropriate container, tao-launcher will handle that. You may install the launcher using pip with the following commands.

pip3 install nvidia-tao
  1. Download the sample jupyter notebooks using the command mentioned in the CLI tab and kick start a jupyter server using the following command
jupyter notebook --ip 0.0.0.0 --port 8888 --allow-root
Purpose-built ModelJupyter notebook
PeopleNetdetectnet_v2/detectnet_v2.ipynb
TrafficCamNetdetectnet_v2/detectnet_v2.ipynb
DashCamNetdetectnet_v2/detectnet_v2.ipynb
FaceDetectIRdetectnet_v2/detectnet_v2.ipynb
VehicleMakeNetclassification/classification.ipynb
VehicleTypeNetclassification/classification.ipynb
PeopleSegNetmask_rcnn/mask_rcnn.ipynb
PeopleSemSegNetunet/unet_isbi.ipynb
Bodypose Estimationbpnet/bpnet.ipynb
License Plate Detectiondetectnet_v2/detectnet_v2.ipynb
License Plate Recognitionlprnet/lprnet.ipynb
Gaze Estimationgazenet/gazenet.ipynb
Facial Landmarkfpenet/fpenet.ipynb
Heart Rate Estimationheartratenet/heartratenet.ipynb
Gesture Recognitiongesturenet/gesturenet.ipynb
Emotion Recognitionemotionnet/emotionnet.ipynb
FaceDetectfacenet/facenet.ipynb
ActionRecognitionNetaction_recognition_net/actionrecognitionnet.ipynb
PoseClassificationNetpose_classification_net/pose_classificationnet.ipynb
Pointpillarspointpillars/pointpillars.ipynb
Open model architectureJupyter notebook
DetectNet_v2detectnet_v2/detectnet_v2.ipynb
FasterRCNNfaster_rcnn/faster_rcnn.ipynb
YOLOV3yolo_v3/yolo_v3.ipynb
YOLOV4yolo_v4/yolo_v4.ipynb
YOLOv4-Tinyyolo_v4_tiny/yolo_v4_tiny.ipynb
SSDssd/ssd.ipynb
DSSDdssd/dssd.ipynb
RetinaNetretinanet/retinanet.ipynb
MaskRCNNmask_rcnn/mask_rcnn.ipynb
UNETunet/unet_isbi.ipynb
Image Classificationclassification/classification.ipynb
EfficientDetefficientdet/efficientdet.ipynb
Publisher
NVIDIA
NVIDIA
Latest Versionv1.4.1
UpdatedApril 4, 2023 UTC
Compressed Size684.72 KB