Gradient descent convergence for α-strongly convex functions
Let
be an α-strongly convex function
and assume we have that, for all
,
.
If we run GD for
steps (with adaptive step sizes) we have:
Corollary
If
we have
See: Gradient
descent convergence bound
Also compare: Gradient
descent convergence for β-smooth functions