R/optimizers.R
larc_layer_lr.Rd
Computes the local lr before weight decay is applied
larc_layer_lr(p, lr, trust_coeff, wd, eps, clip = TRUE, ...)
p
learning rate
trust_coeff
weight decay
epsilon
clip
additional arguments to pass
None