{"ID":2855224,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2510.13560","arxiv_id":"2510.13560","title":"Multi-Objective $\\textit{min-max}$ Online Convex Optimization","abstract":"In this paper, we broaden the horizon of online convex optimization (OCO), and consider multi-objective OCO, where there are $K$ distinct loss function sequences, and an algorithm has to choose its action at time $t$, before the $K$ loss functions at time $t$ are revealed. To capture the tradeoff between tracking the $K$ different sequences, we consider the {\\it min-max} regret, where the benchmark (optimal offline algorithm) takes a static action across all time slots that minimizes the maximum of the total loss (summed across time slots) incurred by each of the $K$ sequences. An online algorithm is allowed to change its action across time slots, and its {\\it min-max} regret is defined as the difference between its {\\it min-max} cost and that of the benchmark. The {\\it min-max} regret is a stringent performance measure and an algorithm with small regret needs to `track' all loss functions simultaneously. We first show that with adversarial input, {\\it min-max} regret scales linearly with the time horizon $T$ for any online algorithm. Consequently, we consider a stochastic i.i.d. input model where all loss functions are i.i.d. generated from an unknown joint distribution and propose a simple algorithm that combines the well-known {\\it Hedge} and online gradient descent (OGD) and show via a remarkably simple proof that its expected {\\it min-max} regret is $O(\\sqrt{T \\log (T K)})$. Analogous results are also derived for Martingale difference and Markov input models.","short_abstract":"In this paper, we broaden the horizon of online convex optimization (OCO), and consider multi-objective OCO, where there are $K$ distinct loss function sequences, and an algorithm has to choose its action at time $t$, before the $K$ loss functions at time $t$ are revealed. To capture the tradeoff between tracking the $...","url_abs":"https://arxiv.org/abs/2510.13560","url_pdf":"https://arxiv.org/pdf/2510.13560v3","authors":"[\"Rahul Vaze\",\"Sumiran Mishra\"]","published":"2025-10-15T13:59:44Z","proceeding":"cs.LG","tasks":"[\"cs.LG\"]","methods":"[]","has_code":false}
