GCP AI Platform Notebook: NVIDIA driver too old?

Problem description

I am trying to run the following Hugging Face Transformers tutorial on a GCP AI Platform Notebook with 32 vCPUs, 208 GB of RAM, and 2 NVIDIA Tesla T4 GPUs.

However, when I try to run this part:

model = DistillBERTClass()

model.to(device)

I get the following assertion error:

AssertionError: The NVIDIA driver on your system is too old (found version 10010).
Please update your GPU driver by downloading and installing a new
version from the URL: http://www.nvidia.com/Download/index.aspx
Alternatively, go to: https://pytorch.org to install
a PyTorch version that has been compiled with your version
of the CUDA driver.

However, when I run !nvidia-smi, I get:

+-----------------------------------------------------------------------------+
| NVIDIA-SMI 418.87.01    Driver Version: 418.87.01    CUDA Version: 10.1     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  Tesla T4            Off  | 00000000:00:04.0 Off |                    0 |
| N/A   38C    P0    22W /  70W |     10MiB / 15079MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+
|   1  Tesla T4            Off  | 00000000:00:05.0 Off |                    0 |
| N/A   39C    P8    10W /  70W |     10MiB / 15079MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+
                                                                               
+-----------------------------------------------------------------------------+
| Processes:                                                       GPU Memory |
|  GPU       PID   Type   Process name                             Usage      |
|=============================================================================|
|  No running processes found                                                 |

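As far as I can tell, this NVIDIA driver version is compatible with the latest PyTorch version, which is the one I am using. Has anyone else run into this error, and is there a way to fix it?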

Tags: pytorch, nvidia, bert-language-model, huggingface-transformers, gcp-ai-platform-notebook

Solution


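You can do either of the following:

  • Update your GPU driver by downloading and installing a newer version from http://www.nvidia.com/Download/index.aspx

  • Alternatively, go to https://pytorch.org and install a PyTorch version that was compiled for your version of the CUDA driver.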
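To choose between the two options, it can help to read the number in the error message. The value 10010 appears to follow CUDA's integer version encoding (1000 × major + 10 × minor), which would make it CUDA 10.1, the same version that nvidia-smi reports. The sketch below decodes that number and compares it with the CUDA version a PyTorch build targets. The helper names are hypothetical, and the "driver version must be at least the build version" rule is a simplifying assumption, not PyTorch's exact check.

```python
def decode_cuda_version(code):
    """Split an integer such as 10010 into (major, minor).

    Assumes CUDA's encoding: 1000 * major + 10 * minor.
    """
    return code // 1000, (code % 1000) // 10


def parse_cuda_string(text):
    """Turn a version string such as "10.2" into (10, 2)."""
    major, minor = text.split(".")[:2]
    return int(major), int(minor)


def driver_supports_build(driver_code, build_cuda):
    """Rough check (an assumption): the driver's CUDA version should be
    at least the CUDA version the PyTorch binary was built with."""
    return decode_cuda_version(driver_code) >= parse_cuda_string(build_cuda)


if __name__ == "__main__":
    # The driver version reported in the assertion error above.
    print(decode_cuda_version(10010))

    # Optional: inspect the installed PyTorch build, if there is one.
    try:
        import torch
    except ImportError:
        print("PyTorch is not installed in this environment")
    else:
        print("PyTorch version:", torch.__version__)
        # torch.version.cuda is None for CPU-only builds.
        print("Built for CUDA:", torch.version.cuda)
        print("CUDA usable:", torch.cuda.is_available())
```

If the installed build targets a CUDA version newer than 10.1, that would explain the error even though the driver itself is working. In that case either a newer driver or a PyTorch build compiled for CUDA 10.1 should resolve it.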

