I’ve spent the last two days (actual quarantine perks) trying to get TensorFlow 2.0 to work correctly with my GTX 1080. I’m on Ubuntu 18.04 LTS with Pop!_OS, using CUDA 9. The specific error message is:
2020-03-25 22:03:29.833389: W tensorflow/stream_executor/platform/default/dso_loader.cc:55] Could not load dynamic library 'libnvinfer.so.6'; dlerror: libcudart.so.10.2: cannot open shared object file: No such file or directory
2020-03-25 22:03:29.833533: W tensorflow/stream_executor/platform/default/dso_loader.cc:55] Could not load dynamic library 'libnvinfer_plugin.so.6'; dlerror: libnvrtc.so.10.2: cannot open shared object file: No such file or directory
2020-03-25 22:03:29.833544: W tensorflow/compiler/tf2tensorrt/utils/py_utils.cc:30] Cannot dlopen some TensorRT libraries. If you would like to use Nvidia GPU with TensorRT, please make sure the missing libraries mentioned above are installed properly.
2020-03-25 22:03:30.690848: I tensorflow/stream_executor/platform/default/dso_loader.cc:44] Successfully opened dynamic library libcuda.so.1
2020-03-25 22:03:30.694793: E tensorflow/stream_executor/cuda/cuda_driver.cc:351] failed call to cuInit: CUDA_ERROR_NO_DEVICE: no CUDA-capable device is detected
2020-03-25 22:03:30.694819: I tensorflow/stream_executor/cuda/cuda_diagnostics.cc:156] kernel driver does not appear to be running on this host (pop-os): /proc/driver/nvidia/version does not exist
2020-03-25 22:03:30.695105: I tensorflow/core/platform/cpu_feature_guard.cc:142] Your CPU supports instructions that this TensorFlow binary was not compiled to use: AVX2 FMA
2020-03-25 22:03:30.719751: I tensorflow/core/platform/profile_utils/cpu_utils.cc:94] CPU Frequency: 3193920000 Hz
2020-03-25 22:03:30.720465: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x4bc8360 initialized for platform Host (this does not guarantee that XLA will be used). Devices:
2020-03-25 22:03:30.720507: I tensorflow/compiler/xla/service/service.cc:176] StreamExecutor device (0): Host, Default Version
Train for 195 steps, validate for 3 steps
Epoch 1/30
2020-03-25 22:03:44.417531: I tensorflow/core/profiler/lib/profiler_session.cc:225] Profiler session started.
2020-03-25 22:03:44.420449: W tensorflow/stream_executor/platform/default/dso_loader.cc:55] Could not load dynamic library 'libcupti.so.10.1'; dlerror: libcupti.so.10.1: cannot open shared object file: No such file or directory
2020-03-25 22:03:44.420477: E tensorflow/core/profiler/internal/gpu/cupti_tracer.cc:1307] function cupti_interface_->Subscribe( &subscriber_, (CUpti_CallbackFunc)ApiCallback, this)failed with error CUPTI could not be loaded or symbol could not be found.
2020-03-25 22:03:44.420495: E tensorflow/core/profiler/internal/gpu/cupti_tracer.cc:1346] function cupti_interface_->ActivityRegisterCallbacks( AllocCuptiActivityBuffer, FreeCuptiActivityBuffer)failed with error CUPTI could not be loaded or symbol could not be found.
1/195 [..............................] - ETA: 31:23 - loss: 5.1677 - acc: 0.1094
2020-03-25 22:03:45.371496: E tensorflow/core/profiler/internal/gpu/cupti_tracer.cc:1329] function cupti_interface_->EnableCallback( 0 , subscriber_, CUPTI_CB_DOMAIN_DRIVER_API, cbid)failed with error CUPTI could not be loaded or symbol could not be found.
2020-03-25 22:03:45.371563: I tensorflow/core/profiler/internal/gpu/device_tracer.cc:88] GpuTracer has collected 0 callback api events and 0 activity events.
5/195 [..............................] - ETA: 8:33 - loss: 3.9591 - acc: 0.1484
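To make the log easier to read, here is a minimal sketch that pulls out which shared libraries failed to load and which CUDA version numbers the missing files carry. It uses only the Python standard library. The helper names (`missing_libraries`, `cuda_versions_wanted`) are made up for illustration and are not part of TensorFlow, and the regex assumes the `dso_loader` warning format shown in the log above.

```python
import re

# Assumed format of a dso_loader warning, taken from the log above:
#   Could not load dynamic library 'X'; dlerror: Y: cannot open shared object file
MISSING_LIB = re.compile(
    r"Could not load dynamic library '([^']+)'; "
    r"dlerror: ([^:\s]+): cannot open shared object file"
)


def missing_libraries(log_text):
    """Return (requested_library, file_that_could_not_be_opened) pairs."""
    return MISSING_LIB.findall(log_text)


def cuda_versions_wanted(log_text):
    """Collect the major.minor suffixes of the files that could not be opened."""
    versions = set()
    for _requested, missing_file in missing_libraries(log_text):
        match = re.search(r"\.so\.(\d+\.\d+)$", missing_file)
        if match:
            versions.add(match.group(1))
    return sorted(versions)


# Three warning lines copied from the log, with timestamps and paths trimmed.
SAMPLE_LOG = """\
Could not load dynamic library 'libnvinfer.so.6'; dlerror: libcudart.so.10.2: cannot open shared object file: No such file or directory
Could not load dynamic library 'libnvinfer_plugin.so.6'; dlerror: libnvrtc.so.10.2: cannot open shared object file: No such file or directory
Could not load dynamic library 'libcupti.so.10.1'; dlerror: libcupti.so.10.1: cannot open shared object file: No such file or directory
"""

if __name__ == "__main__":
    for requested, missing_file in missing_libraries(SAMPLE_LOG):
        print(f"{requested} failed because {missing_file} was not found")
    print("Version suffixes of the missing files:", cuda_versions_wanted(SAMPLE_LOG))
```

Run against the full log, this would list only 10.x library files as missing, which suggests the TensorFlow build is looking for CUDA 10.x libraries rather than CUDA 9.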
What I’ve tried:
- Completely uninstalling CUDA and anything NVIDIA; it didn’t seem to actually uninstall anything.
- Installing CUDA 10; it isn’t used or recognized by the system.
- Using System76 tools to install CUDA; nothing changed.
- Using NVIDIA’s deb and runfile installation methods.
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2017 NVIDIA Corporation
Built on Fri_Nov__3_21:07:56_CDT_2017
Cuda compilation tools, release 9.1, V9.1.85
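The two facts worth checking side by side are whether the NVIDIA kernel driver is loaded (the log complains that /proc/driver/nvidia/version does not exist) and which CUDA toolkit release nvcc reports. Below is a rough sketch of both checks using only the Python standard library. The function names are hypothetical helpers, not part of any NVIDIA or TensorFlow tool, and the release regex assumes the nvcc output format shown above.

```python
import os
import re


def nvidia_driver_loaded(proc_path="/proc/driver/nvidia/version"):
    """True if the file the TensorFlow log looks for exists on this host."""
    return os.path.exists(proc_path)


def nvcc_release(nvcc_output):
    """Extract the 'release X.Y' number from nvcc version text, or None."""
    match = re.search(r"release (\d+\.\d+)", nvcc_output)
    return match.group(1) if match else None


# nvcc output copied from the post.
NVCC_OUTPUT = """\
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2017 NVIDIA Corporation
Built on Fri_Nov__3_21:07:56_CDT_2017
Cuda compilation tools, release 9.1, V9.1.85
"""

if __name__ == "__main__":
    print("NVIDIA kernel driver loaded:", nvidia_driver_loaded())
    print("nvcc reports CUDA release:", nvcc_release(NVCC_OUTPUT))
```

If the first check is false, the problem is below CUDA entirely: no toolkit version will be able to reach the GPU until the kernel driver loads.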
I’d rather not reinstall Ubuntu to get this working, but that is my next logical option, and I’m hoping you (collectively) have a better idea than nuking it and starting over.