Hello,
I have a workstation running PopOS 20.04 with 4 different NVIDIA GPUs, along with onboard VGA graphics from ASPEED 2500. I have been booting linux into the GUI using the onboard VGA and binding all 4 NVIDIA gpus to vfio-pci for passthrough purposes, and that is working fine - I can passthrough whichever card I like to a Windows guest.
Now I would like to use one of my GPUs (Titan Black) for the linux host GUI. When I removed that card from the list of the vfio_pci.ids, the NVIDIA driver would not bind to it - it was left unbound to any driver. As an experiment, I removed all my GPUs from the vfio_pci.ids list. I found that two of my cards (a GT730 and a Quadro P4000) were bound to nvidia, while the other two (a GTX 1060 and the Titan Black) were left unbound.
Has anyone encountered a similar issue? I’m pretty new to linux, so not sure what logs I should post to provide more information, but want to learn and will follow suggestions.
Thanks in advance. Here are some key parameters of my workstation:
OS: Pop!_OS 20.04 LTS, kernel 5.11.0-7614-generic
MB: Supermicro X11SRA-F
CPU: Intel Xeon W-2175
Graphics:
- AST2500
- GTX 1060 6GB
- GT 730
- GTX Titan Black
- Quadro P4000
Nvidia driver: 460.73.01
lspci of unbound GPU (note that there is no entry for “Kernel driver in use”):
65:00.0 VGA compatible controller [0300]: NVIDIA Corporation GK110B [GeForce GTX TITAN Black] [10de:100c] (rev a1) (prog -if 00 [VGA controller]) Subsystem: NVIDIA Corporation GK110B [GeForce GTX TITAN Black] [10de:1066] Flags: fast devsel, IRQ 11, NUMA node 0 Memory at d2000000 (32-bit, non-prefetchable) [disabled] [size=16M] Memory at c8000000 (64-bit, prefetchable) [disabled] [size=128M] Memory at d0000000 (64-bit, prefetchable) [disabled] [size=32M] I/O ports at b000 [disabled] [size=128] Expansion ROM at d3000000 [disabled] [size=512K] Capabilities: [60] Power Management version 3 Capabilities: [68] MSI: Enable- Count=1/1 Maskable- 64bit+ Capabilities: [78] Express Endpoint, MSI 00 Capabilities: [100] Virtual Channel Capabilities: [128] Power Budgeting <?> Capabilities: [420] Advanced Error Reporting Capabilities: [600] Vendor Specific Information: ID=0001 Rev=1 Len=024 <?> Capabilities: [900] Secondary PCI Express Kernel modules: nvidiafb, nouveau, nvidia_drm, nvidia