1 code implementation • 2 May 2024 • Chengqian Gao, William de Vazelhes, Hualin Zhang, Bin Gu, Zhiqiang Xu
Evolution Strategies (ES) have emerged as a competitive alternative for model-free reinforcement learning, showcasing exemplary performance in tasks like Mujoco and Atari.
no code implementations • 2 Feb 2024 • Hilal AlQuabeh, William de Vazelhes, Bin Gu
Recently, an OGD algorithm emerged, employing gradient computation involving prior and most recent examples, a step that effectively reduces algorithmic complexity to $O(T)$, with $T$ being the number of received examples.
1 code implementation • 19 Dec 2023 • William de Vazelhes, Bhaskar Mukhoty, Xiao-Tong Yuan, Bin Gu
However, most of those iterative methods are based on the $\ell_1$ norm which requires restrictive applicability conditions and could fail in many cases.
no code implementations • 2 Feb 2023 • Bhaskar Mukhoty, Velibor Bojkovic, William de Vazelhes, Giulia De Masi, Huan Xiong, Bin Gu
To circumvent the problem surrogate method uses a differentiable approximation of the Heaviside in the backward pass, while the forward pass uses the Heaviside as the spiking function.
no code implementations • 11 Oct 2022 • William de Vazelhes, Hualin Zhang, Huimin Wu, Xiao-Tong Yuan, Bin Gu
To solve this puzzle, in this paper, we focus on the $\ell_0$ constrained black-box stochastic optimization problems, and propose a new stochastic zeroth-order gradient hard-thresholding (SZOHT) algorithm with a general ZO gradient estimator powered by a novel random support sampling.
6 code implementations • 13 Aug 2019 • William de Vazelhes, CJ Carey, Yuan Tang, Nathalie Vauquier, Aurélien Bellet
metric-learn is an open source Python package implementing supervised and weakly-supervised distance metric learning algorithms.