Computes the local lr before weight decay is applied

larc_layer_lr(p, lr, trust_coeff, wd, eps, clip = TRUE, ...)

Arguments

p

p

lr

learning rate

trust_coeff

trust_coeff

wd

weight decay

eps

epsilon

clip

clip

...

additional arguments to pass

Value

None