tensorflow GPU训练loss与val loss值差距过大问题
问题 最近在ubuntu gpu上训练模型,训练十轮,结果如下 epoch,loss,lr,val_loss 200,nan,0.001,nan 200,0.002468767808750272,0.001,44.29948425292969 201,0.007177405059337616,0.001,49.16984176635742 202,0.012423301115632057,0.001,49.303058624…
2025-12-16