【Pytorch】深度学习Pytorch固定随机种子提高代码可复现性

深度学习Pytorch固定随机种子提高代码可复现性

FeverTwice

2213人浏览 · 2023-04-13 17:12:41

FeverTwice · 2023-04-13 17:12:41 发布

文章目录

代码结构
解释
写在最后

Pytorch在训练深度神经网络的过程中，有许多随机的操作，如基于numpy库的数组初始化、卷积核的初始化，以及一些学习超参数的选取，为了实验的可复现性，必须将整个训练过程固定住

固定随机种子的目的：

方便其他人复现我们的代码
方便模型验证
方便验证我们的模型是哪些超参数在起决定性效果
方便对比不同模型架构的差异
通常在模型训练后期，调整随机种子能有意想不到的结果

代码结构

# Fix random seed for reproducibility
def same_seeds(seed):
	  torch.manual_seed(seed)
	  if torch.cuda.is_available():
		    torch.cuda.manual_seed(seed)
		    torch.cuda.manual_seed_all(seed)
	  np.random.seed(seed)
	  random.seed(seed)
	  torch.backends.cudnn.benchmark = False
	  torch.backends.cudnn.deterministic = True
same_seeds(0)

解释

torch.manual_seed(seed)
援引官方文档Pytorch的manual_seed文档

Sets the seed for generating random numbers. Returns a torch.Generator object.

为CPU中设置种子，设置生成随机数的种子。返回一个 torch.Generator 对象。

注意这里考虑的是在CPU上面的计算，if条件里面考虑的才是GPU Cuda的计算，是为了**保证代码在没有GPU Cuda的环境下依旧能为运算单元设定随机种子，下面cuda的种子设置是一种更复杂的情况

torch.cuda.manual_seed(seed)
为特定GPU设置种子，生成随机数
torch.cuda.manual_seed_all(seed)
官方文档torch.cuda.manual_seed_all(seed)`

Sets the seed for generating random numbers on all GPUs. It’s safe to call this function if CUDA is not available; in that case, it is silently ignored.

为所有GPU设置种子，生成随机数，相比上一句代码，着重考虑了代码环境中有多个GPU并行计算单元的情况

如果没有GPU，执行将会被忽略

np.random.seed(seed)
固定Numpy产生的随机数，使得在相同的随机种子所产生随机数是相同的，这句话将会对所有在Numpy库中的随机函数产生作用
random.seed(seed)
上面一句主要是对Numpy库设置随机种子，这句话则是设置整个Python基础环境中的随机种子
torch.backends.cudnn.benchmark = False
benchmark 设置False，是为了保证不使用选择卷积算法的机制，使用固定的卷积算法
torch.backends.cudnn.deterministic = True
为了确定使用相同的算法，保证得到一样的结果