-
Notifications
You must be signed in to change notification settings - Fork 2
Is this project abolished? #2
Description
Can I ask you a question about the error?
2023-07-06 13:21:02.317561: I tensorflow/stream_executor/dso_loader.cc:152] successfully opened CUDA library cublas64_100.dll locally
progress epoch 1 step 49 image/sec 0.4 remaining 12536m
f2s_disc_real_loss 0.14498748
f2s_disc_fake_loss 0.44297093
f2s_disc_loss_real_styl 0.111068994
f2s_disc_loss_real_char 0.06794464
f2s_gen_loss_GAN 0.15400665
f2s_gen_loss_L1 nan
Traceback (most recent call last):
File "C:\Users\Greon_Server.conda\envs\skelgan\lib\site-packages\tensorflow\python\client\session.py", line 1334, in _do_call
return fn(*args)
File "C:\Users\Greon_Server.conda\envs\skelgan\lib\site-packages\tensorflow\python\client\session.py", line 1319, in _run_fn
options, feed_dict, fetch_list, target_list, run_metadata)
File "C:\Users\Greon_Server.conda\envs\skelgan\lib\site-packages\tensorflow\python\client\session.py", line 1407, in _call_tf_sessionrun
run_metadata)
tensorflow.python.framework.errors_impl.InvalidArgumentError: Nan in summary histogram for: discriminator/layer_5/conv2d/kernel/values
[[{{node discriminator/layer_5/conv2d/kernel/values}}]]
[[{{node discriminator/layer_2/conv2d/kernel/read}}]]
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "main.py", line 256, in
main()
File "main.py", line 220, in main
results = sess.run(fetches, options=options, run_metadata=run_metadata)
File "C:\Users\Greon_Server.conda\envs\skelgan\lib\site-packages\tensorflow\python\client\session.py", line 929, in run
run_metadata_ptr)
File "C:\Users\Greon_Server.conda\envs\skelgan\lib\site-packages\tensorflow\python\client\session.py", line 1152, in _run
feed_dict_tensor, options, run_metadata)
File "C:\Users\Greon_Server.conda\envs\skelgan\lib\site-packages\tensorflow\python\client\session.py", line 1328, in _do_run
run_metadata)
File "C:\Users\Greon_Server.conda\envs\skelgan\lib\site-packages\tensorflow\python\client\session.py", line 1348, in _do_call
raise type(e)(node_def, op, message)
tensorflow.python.framework.errors_impl.InvalidArgumentError: Nan in summary histogram for: discriminator/layer_5/conv2d/kernel/values
[[node discriminator/layer_5/conv2d/kernel/values (defined at main.py:147) ]]
[[node discriminator/layer_2/conv2d/kernel/read (defined at C:\workspace\SkelGAN-main\ops.py:9) ]]
Caused by op 'discriminator/layer_5/conv2d/kernel/values', defined at:
File "main.py", line 256, in
main()
File "main.py", line 147, in main
tf.summary.histogram(var.op.name + "/values", var)
File "C:\Users\Greon_Server.conda\envs\skelgan\lib\site-packages\tensorflow\python\summary\summary.py", line 177, in histogram
tag=tag, values=values, name=scope)
File "C:\Users\Greon_Server.conda\envs\skelgan\lib\site-packages\tensorflow\python\ops\gen_logging_ops.py", line 312, in histogram_summary
"HistogramSummary", tag=tag, values=values, name=name)
File "C:\Users\Greon_Server.conda\envs\skelgan\lib\site-packages\tensorflow\python\framework\op_def_library.py", line 788, in _apply_op_helper
op_def=op_def)
File "C:\Users\Greon_Server.conda\envs\skelgan\lib\site-packages\tensorflow\python\util\deprecation.py", line 507, in new_func
return func(*args, **kwargs)
File "C:\Users\Greon_Server.conda\envs\skelgan\lib\site-packages\tensorflow\python\framework\ops.py", line 3300, in create_op
op_def=op_def)
File "C:\Users\Greon_Server.conda\envs\skelgan\lib\site-packages\tensorflow\python\framework\ops.py", line 1801, in init
self._traceback = tf_stack.extract_stack()
InvalidArgumentError (see above for traceback): Nan in summary histogram for: discriminator/layer_5/conv2d/kernel/values
[[node discriminator/layer_5/conv2d/kernel/values (defined at main.py:147) ]]
[[node discriminator/layer_2/conv2d/kernel/read (defined at C:\workspace\SkelGAN-main\ops.py:9) ]]
I don't know why this error occurs.
Considering that the quantity of data sets prepared is 22,234 and the number of data in the paper is 70,500, I don't think it's a problem with the quantity of learning data, can I know the test hardware environment?