Online gradient descent regret bound
(OGD Regret Bound)
After
steps,
average regret over time is bounded by
,
goes
as
Note: no assumptions on how
relate to each other, allowing even for these to be chosen
adversarially, e.g. with
depending on our choice of
and all previous choices.
See: Regret bound, Online regret bound