Skip to content

Commit 3bb97d9

Browse files
committed
update pub
1 parent b64244a commit 3bb97d9

3 files changed

Lines changed: 9 additions & 10 deletions

File tree

_publications/2024-02-02-decentralized_gradient_free.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -6,4 +6,4 @@ date: 2024-02-02
66
paperurl: 'https://ojs.aaai.org/index.php/AAAI/article/view/29697'
77
---
88

9-
We consider decentralized gradient-free optimization of minimizing Lipschitz continuous functions that satisfy neither smoothness nor convexity assumption. We propose two novel gradient-free algorithms, the Decentralized Gradient-Free Method (DGFM) and its variant, the Decentralized Gradient-Free Method+ (DGFM+). Based on the techniques of randomized smoothing and gradient tracking, DGFM requires the computation of the zeroth-order oracle of a single sample in each iteration, making it less demanding in terms of computational resources for individual computing nodes. Theoretically, DGFM achieves a complexity of $\mathcal{O}(d^{3/2}\delta^{-1}\epsilon^{-4})$ for obtaining a $(\delta,\epsilon)$-Goldstein stationary point. DGFM+, an advanced version of DGFM, incorporates variance reduction to further improve the convergence behavior. It samples a mini-batch at each iteration and periodically draws a larger batch of data, which improves the complexity to $\mathcal{O}(d^{3/2}\delta^{-1}\epsilon^{-3})$. Moreover, experimental results underscore the empirical advantages of our proposed algorithms when applied to real-world datasets.
9+
We consider decentralized gradient-free optimization of minimizing Lipschitz continuous functions that satisfy neither smoothness nor convexity assumption. We propose two novel gradient-free algorithms, the Decentralized Gradient-Free Method (DGFM) and its variant, the Decentralized Gradient-Free Method+ (DGFM+). Based on the techniques of randomized smoothing and gradient tracking, DGFM requires the computation of the zeroth-order oracle of a single sample in each iteration, making it less demanding in terms of computational resources for individual computing nodes. Theoretically, DGFM achieves a complexity of O(d³/²δ⁻¹ε⁻⁴) for obtaining a (δ,ε)-Goldstein stationary point. DGFM+, an advanced version of DGFM, incorporates variance reduction to further improve the convergence behavior. It samples a mini-batch at each iteration and periodically draws a larger batch of data, which improves the complexity to O(d³/²δ⁻¹ε⁻³). Moreover, experimental results underscore the empirical advantages of our proposed algorithms when applied to real-world datasets.

_publications/2024-10-20-apdpro.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -6,4 +6,4 @@ date: 2024-10-20
66
paperurl: 'https://openreview.net/forum?id=pG380vLYRU'
77
---
88

9-
In this paper, we introduce faster accelerated primal-dual algorithms for minimizing a convex function subject to strongly convex function constraints. Prior to our work, the best complexity bound was $\mathcal{O}(1/{\epsilon})$, regardless of the strong convexity of the constraint function. It is unclear whether the strong convexity assumption can enable even better convergence results. To address this issue, we have developed novel techniques to progressively estimate the strong convexity of the Lagrangian function. Our approach, for the first time, effectively leverages the constraint strong convexity, obtaining an improved complexity of $\mathcal{O}(1/\sqrt{\epsilon})$. This rate matches the complexity lower bound for strongly-convex-concave saddle point optimization and is therefore order-optimal. We show the superior performance of our methods in sparsity-inducing constrained optimization, notably Google’s personalized PageRank problem. Furthermore, we show that a restarted version of the proposed methods can effectively identify the optimal solution’s sparsity pattern within a finite number of steps, a result that appears to have independent significance.
9+
In this paper, we introduce faster accelerated primal-dual algorithms for minimizing a convex function subject to strongly convex function constraints. Prior to our work, the best complexity bound was O(ε⁻¹), regardless of the strong convexity of the constraint function. It is unclear whether the strong convexity assumption can enable even better convergence results. To address this issue, we have developed novel techniques to progressively estimate the strong convexity of the Lagrangian function. Our approach, for the first time, effectively leverages the constraint strong convexity, obtaining an improved complexity of O(ε⁻¹/²). This rate matches the complexity lower bound for strongly-convex-concave saddle point optimization and is therefore order-optimal. We show the superior performance of our methods in sparsity-inducing constrained optimization, notably Google’s personalized PageRank problem. Furthermore, we show that a restarted version of the proposed methods can effectively identify the optimal solution’s sparsity pattern within a finite number of steps, a result that appears to have independent significance.
Lines changed: 7 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -1,22 +1,21 @@
11
---
22
title: "Uniformly Optimal and Parameter-free First-order Methods for Convex and Function-constrained Optimization"
33
collection: publications
4-
category: Preprint
4+
category: preprint
55
date: 2024-12-09
66
paperurl: 'https://arxiv.org/pdf/2412.06319'
77
---
88

99
This paper presents new first-order methods for achieving optimal oracle complexities in convex optimization with convex functional constraints. Oracle complexities are measured by the number of function
1010
and gradient evaluations. To achieve this, we enable first-order methods to utilize computational oracles
11-
for solving diagonal quadratic programs in subproblems. For problems where the optimal value $f^*$
12-
is
11+
for solving diagonal quadratic programs in subproblems. For problems where the optimal value f* is
1312
known, such as those in overparameterized models and feasibility problems, we propose an accelerated
14-
first-order method that incorporates a modified Polyak step size and Nesterovs momentum. Notably, our
13+
first-order method that incorporates a modified Polyak step size and Nesterov's momentum. Notably, our
1514
method does not require knowledge of smoothness levels, Hölder continuity parameter of the gradient, or
16-
additional line search, yet achieves the optimal oracle complexity bound of $\mathcal{O}(\epsilon^{-\frac{2}{1+\rho}})$ under Hölder
17-
smoothness conditions. When $f^*$ is unknown, we reformulate the problem as finding the root of the optimal value function and develop inexact fixed-point iteration and secant method to compute $f^*$. These
15+
additional line search, yet achieves the optimal oracle complexity bound of O(ε⁻²/⁽¹⁺ᵖ⁾) under Hölder
16+
smoothness conditions. When f* is unknown, we reformulate the problem as finding the root of the optimal value function and develop inexact fixed-point iteration and secant method to compute f*. These
1817
root-finding subproblems are solved inexactly using first-order methods to a specified relative accuracy.
1918
We employ the accelerated prox-level (APL) method, which is proven to be uniformly optimal for convex optimization with simple constraints. Our analysis demonstrates that APL-based level-set methods
20-
also achieve the optimal oracle complexity of $\mathcal{O}(\epsilon^{-\frac{2}{1+\rho}})$ for convex function-constrained optimization, without requiring knowledge of any problem-specific structures. Through experiments on various
19+
also achieve the optimal oracle complexity of O(ε⁻²/⁽¹⁺ᵖ⁾) for convex function-constrained optimization, without requiring knowledge of any problem-specific structures. Through experiments on various
2120
tasks, we demonstrate the advantages of our methods over existing approaches in function-constrained
22-
optimization.
21+
optimization.

0 commit comments

Comments
 (0)