首页 > 解决方案 > Cannot use GPU in VM on GCP; GPU_ALL_REGIONS exceeded

问题描述

Trying to boot up a VM (notebook) with a GPU (any GPU type) in Google Cloud, I get this error:

"tensorflow-1-15-20210102-230326: Quota 'GPUS_ALL_REGIONS' exceeded. Limit: 0.0 globally. (if this notebook instance needs to use a resource outside of project recsys-296821, in the project that contains the resource, please grant the missed permission to the Cloud AI Platform Notebooks Service Account: service-64420327861@gcp-sa-notebooks.iam.gserviceaccount.com)".

I already tried to increase my total gpu quotas, having already upgraded my account from the Free tier (like in GCP error: Quota 'GPUS_ALL_REGIONS' exceeded. Limit: 0.0 globally and GPU Quota Error even when i have Quota for Nvidia P100 both for region Us-west1 and Europe-west4 from 0 to 1 etc), but the GPU_ALL_REGIONS is grayed out and I can't change it, while still being set to 1, so in theory I should be able to use 1 gpu, but I cannot.

I really don't know how to solve this issue and couldn't find an up to date answer. Thanks in advance.

标签: google-cloud-platform

解决方案


根据此 GCP 官方文档,我了解您可能有足够的配额来使用这些资源,但该地区可能没有可用资源。例如,您可能仍有配额在 us-central1 中使用 GPU 创建 VM 实例,但该区域可能没有可用的 GPU。

最好的解决方案是:

1- 这是一种临时情况,可以根据波动的需求随时更改,请稍后再试您的请求。

2-尝试在该区域的另一个区域或另一个区域中创建资源。

3-尝试更改您请求的 VM 的形状。例如,此时您需要 NVidia P100,如果可以,您可以使用另一个 Nvidia T4

4- 使用Compute Engine 预留来预留区域内的资源,以确保您需要的资源在您需要时可用。


推荐阅读