0

I am executing my programs on a server that is common to many students. Previously my program used to run without any issues. I am not sure what the colleagues installed on the server and my program is reporting the following error.

Skipping registering GPU devices...
2021-10-25 04:09:05.650355: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 AVX512F FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2021-10-25 04:09:05.949381: F tensorflow/core/platform/statusor.cc:33] Attempting to fetch value instead of handling error Internal: failed initializing StreamExecutor for CUDA device ordinal 1: Internal: failed call to cuDevicePrimaryCtxRetain: CUDA_ERROR_OUT_OF_MEMORY: out of memory; total memory reported: 11523260416
Fatal Python error: Aborted

Current thread 0x00007f33a81f9740 (most recent call first):
  File "/home/.../anaconda3/lib/python3.8/site-packages/tensorflow/python/client/session.py", line 711 in __init__
  File "/home/.../anaconda3/lib/python3.8/site-packages/tensorflow/python/client/session.py", line 1601 in __init__
  File "inception_score.py", line 207 in main
  File "/home/.../anaconda3/lib/python3.8/site-packages/absl/app.py", line 251 in _run_main
  File "/home/.../anaconda3/lib/python3.8/site-packages/absl/app.py", line 303 in run
  File "/home/.../anaconda3/lib/python3.8/site-packages/tensorflow/python/platform/app.py", line 40 in run
  File "Hello.py", line 240 in <module>
Aborted (core dumped)

What is the issue here? What can I do to correct it?

hanugm
  • 869
  • 1
  • 11
  • 29
  • Does this answer your question? [CUDA\_ERROR\_OUT\_OF\_MEMORY in tensorflow](https://stackoverflow.com/questions/39465503/cuda-error-out-of-memory-in-tensorflow) – o-90 Oct 25 '21 at 01:07

0 Answers0