root@flatbrick:~# nvidia-smi mig -lgip
+-------------------------------------------------------------------------------+
| GPU instance profiles: |
| GPU Name ID Instances Memory P2P SM DEC ENC |
| Free/Total GiB CE JPEG OFA |
|===============================================================================|
| 0 MIG 3g.0gb 0 1/1 0.00 No 18 2 2 |
| 1 2 1 |
+-------------------------------------------------------------------------------+
root@flatbrick:~# nvidia-smi mig -cgi 0
Successfully created GPU instance ID 0 on GPU 0 using profile MIG 3g.0gb (ID 0)
root@flatbrick:~# nvidia-smi mig -lcip
+--------------------------------------------------------------------------------------+
| Compute instance profiles: |
| GPU GPU Name Profile Instances Exclusive Shared |
| Instance ID Free/Total SM DEC ENC OFA |
| ID CE JPEG |
|======================================================================================|
| 0 0 MIG 1c.3g.0gb 0 2/2 6 2 2 1 |
| 1 2 |
+--------------------------------------------------------------------------------------+
| 0 0 MIG 2c.3g.0gb 1 2/1 6 2 2 1 |
| 1 2 |
+--------------------------------------------------------------------------------------+
| 0 0 MIG 3g.0gb 2* 1/1 18 2 2 1 |
| 1 2 |
+--------------------------------------------------------------------------------------+
root@flatbrick:~# nvidia-smi mig -lgi
+---------------------------------------------------------+
| GPU instances: |
| GPU Name Profile Instance Placement |
| ID ID Start:Size |
|=========================================================|
| 0 MIG 3g.0gb 0 0 0:3 |
+---------------------------------------------------------+
root@flatbrick:~# nvidia-smi mig -cci 1 -gi 0
Unable to create a compute instance on GPU 0 GPU instance ID 0 using profile 1: Unknown Error
Failed to create compute instances: Unknown Error
root@flatbrick:~# nvidia-smi mig -cci 0 -gi 0
Unable to create a compute instance on GPU 0 GPU instance ID 0 using profile 0: Unknown Error
Failed to create compute instances: Unknown Error
root@flatbrick:~# nvidia-smi mig -cci 2 -gi 0
Unable to create a compute instance on GPU 0 GPU instance ID 0 using profile 2: Insufficient Resources
Failed to create compute instances: Insufficient Resources
root@flatbrick:~# nvidia-smi mig -cgi 0 -C
Successfully created GPU instance ID 0 on GPU 0 using profile MIG 3g.0gb (ID 0)
Unable to create a compute instance on GPU 0 GPU instance ID 0 using profile default: Insufficient Resources
Failed to create GPU instances: Insufficient Resources
root@flatbrick:~# nvidia-smi mig -lgi
+---------------------------------------------------------+
| GPU instances: |
| GPU Name Profile Instance Placement |
| ID ID Start:Size |
|=========================================================|
| 0 MIG 3g.0gb 0 0 0:3 |
+---------------------------------------------------------+
root@flatbrick:~# nvidia-smi mig -lci
No compute instances found: Not Found
- Going off a recent technical blog post it sounds like NVML/nvidia-smi is in active development and that mig isn’t fully developed yet.
Copy performance (Jetson Thor)
Avg. time: 0.488719 ms / Copy throughput: 95.281815 GB/s
Copy performance (6000 blackwell)
Avg. time: 0.023844 ms / Copy throughput: 1952.964983 GB/s.
Enable max performance profile and clocks
nvpmodel -m 0 && nvpmodel -q && jetson_clocks
Install actual required stuff to do anything
sudo apt update
sudo apt dist-upgrade
sudo apt install nvidia-jetpack
sudo apt install nvidia-cuda-dev
Setup environment
echo "export PATH=/usr/local/cuda/bin:$PATH" >> ~/.bashrc
echo "export LD_LIBRARY_PATH=/usr/local/cuda/lib64:$LD_LIBRARY_PATH" >> ~/.bashrc
Useful links
https://docs.nvidia.com/jetson/agx-thor-devkit/user-guide/0.1.0/setup_cuda.html
Install jetson-stats
sudo pip3 install --break-system-packages jetson-stats
sudo groupadd jtop
sudo vigr # Add user to jtop group
sudo vigr -s # Add user to jtop group
sudo ln -s /usr/local/jetson_stats/jtop.service /etc/systemd/system/jtop.service
sudo systemctl enable jtop.service
sudo systemctl restart jtop.service
sudo systemctl status jtop.service
Fix jtop JETPACK not installed message
44,45d43
< # -------- THOR -------
< "38.2.0": "7.0",
/usr/local/lib/python3.12/dist-packages/jtop/core/jetson_variables.py
OR (Use my fork that has the variables and nvml fixes)
I added nvml support to jtop so it now is able to get some metrics from the dGPU
