A Variable Sample-Size Stochastic Quasi-Newton Method for Smooth and Nonsmooth Stochastic Convex Optimization

Classical theory for quasi-Newton schemes has focused on smooth, deterministic, unconstrained optimization, whereas recent forays into stochastic convex optimization have largely resided in smooth, unconstrained, and strongly convex regimes. Naturally, there is a compelling need to address nonsmoothness, the lack of strong convexity, and the presence of constraints. Accordingly, this paper presents a quasi-Newton framework that can process merely convex and possibly nonsmooth (but smoothable) stochastic convex problems. We propose a framework that combines iterative smoothing and regularization with a variance-reduced scheme reliant on using an increasing sample size of gradients. We make the following contributions. (i) We develop a regularized and smoothed variable sample-size BFGS update (rsL-BFGS) that generates a sequence of Hessian approximations and can accommodate nonsmooth convex objectives by utilizing iterative regularization and smoothing. (ii) In strongly convex regimes with state-dependent noise, the proposed variable sample-size stochastic quasi-Newton (VS-SQN) scheme admits a nonasymptotic linear rate of convergence, whereas the oracle complexity of computing an [Formula: see text]-solution is [Formula: see text], where [Formula: see text] denotes the condition number and [Formula: see text]. In nonsmooth (but smoothable) regimes, using Moreau smoothing retains the linear convergence rate for the resulting smoothed VS-SQN (or sVS-SQN) scheme. Notably, the nonsmooth regime allows for accommodating convex constraints. To contend with the possible unavailability of Lipschitzian and strong convexity parameters, we also provide sublinear rates for diminishing step-length variants that do not rely on the knowledge of such parameters. (iii) In merely convex but smooth settings, the regularized VS-SQN scheme rVS-SQN displays a rate of [Formula: see text] with an oracle complexity of [Formula: see text]. When the smoothness requirements are weakened, the rate for the regularized and smoothed VS-SQN scheme rsVS-SQN worsens to [Formula: see text]. Such statements allow for a state-dependent noise assumption under a quadratic growth property on the objective. To the best of our knowledge, the rate results are among the first available rates for QN methods in nonsmooth regimes. Preliminary numerical evidence suggests that the schemes compare well with accelerated gradient counterparts on selected problems in stochastic optimization and machine learning with significant benefits in ill-conditioned regimes.

Download Full-text

A Variable Sample-Size Stochastic Quasi-Newton Method for Smooth and Nonsmooth Stochastic Convex Optimization

2018 IEEE Conference on Decision and Control (CDC) ◽

10.1109/cdc.2018.8619209 ◽

2018 ◽

Author(s):

Afrooz Jalilzadeh ◽

Angelia Nedic ◽

Uday V. Shanbhag ◽

Farzad Yousefian

Keyword(s):

Convex Optimization ◽

Sample Size ◽

Newton Method ◽

Variable Sample Size ◽

Quasi Newton

Download Full-text

On the rate of convergence of alternating minimization for non-smooth non-strongly convex optimization in Banach spaces

Optimization Letters ◽

10.1007/s11590-021-01753-w ◽

2021 ◽

Author(s):

Jakub Wiktor Both

Keyword(s):

Convex Optimization ◽

Banach Spaces ◽

Optimization Problems ◽

Linear Convergence ◽

Rates Of Convergence ◽

Strong Convexity ◽

Quadratic Functional ◽

Alternating Minimization ◽

Strongly Convex ◽

Strongly Convex Optimization

AbstractIn this paper, the convergence of the fundamental alternating minimization is established for non-smooth non-strongly convex optimization problems in Banach spaces, and novel rates of convergence are provided. As objective function a composition of a smooth, and a block-separable, non-smooth part is considered, covering a large range of applications. For the former, three different relaxations of strong convexity are considered: (i) quasi-strong convexity; (ii) quadratic functional growth; and (iii) plain convexity. With new and improved rates benefiting from both separate steps of the scheme, linear convergence is proved for (i) and (ii), whereas sublinear convergence is showed for (iii).

Download Full-text

On the transient growth of Nesterov’s accelerated method for strongly convex optimization problems

2020 59th IEEE Conference on Decision and Control (CDC) ◽

10.1109/cdc42340.2020.9304412 ◽

2020 ◽

Author(s):

Samantha Samuelson ◽

Hesameddin Mohammadi ◽

Mihailo R. Jovanovic

Keyword(s):

Convex Optimization ◽

Optimization Problems ◽

Transient Growth ◽

Strongly Convex ◽

Convex Optimization Problems ◽

Strongly Convex Optimization

Download Full-text

Stochastic quasi-Newton methods for non-strongly convex problems: Convergence and rate analysis

2016 IEEE 55th Conference on Decision and Control (CDC) ◽

10.1109/cdc.2016.7798953 ◽

2016 ◽

Cited By ~ 2

Author(s):

Farzad Yousefian ◽

Angelia Nedic ◽

Uday V. Shanbhag

Keyword(s):

Newton Methods ◽

Strongly Convex ◽

Convex Problems ◽

Rate Analysis ◽

Quasi Newton

Download Full-text

A Novel Adaptive Control Chart with Variable Sample Size, Sampling Interval and Control Limits: A three Stage Variable Parameters (TSVP)

International Journal of Business and Management ◽

10.5539/ijbm.v8n2p38 ◽

2012 ◽

Vol 8 (2) ◽

Author(s):

Ali Deheshvar ◽

Hesam Shams ◽

Farshid Jamali ◽

Zohreh Movahedmanesh

Keyword(s):

Adaptive Control ◽

Sample Size ◽

Control Chart ◽

Sampling Interval ◽

Variable Parameters ◽

Variable Sample Size ◽

And Control ◽

Adaptive Control Chart ◽

Control Limits

Download Full-text

Fuzzy U control chart based on fuzzy rules and evaluating its performance using fuzzy OC curve

The TQM Journal ◽

10.1108/tqm-10-2017-0118 ◽

2018 ◽

Vol 30 (3) ◽

pp. 232-247 ◽

Cited By ~ 9

Author(s):

Somayeh Fadaei ◽

Alireza Pooya

Keyword(s):

Fuzzy Control ◽

Sample Size ◽

Control Chart ◽

Control Charts ◽

Operating Characteristic ◽

Fuzzy Rules ◽

Type Ii ◽

Type Ii Error ◽

Content Type ◽

Variable Sample Size

Purpose The purpose of this paper is to apply fuzzy spectrum in order to collect the vague and imprecise data and to employ the fuzzy U control chart in variable sample size using fuzzy rules. This approach is improved and developed by providing some new rules. Design/methodology/approach The fuzzy operating characteristic (FOC) curve is applied to investigate the performance of the fuzzy U control chart. The application of FOC presents fuzzy bounds of operating characteristic (OC) curve whose width depends on the ambiguity parameter in control charts. Findings To illustrate the efficiency of the proposed approach, a practical example is provided. Comparing performances of control charts indicates that OC curve of the crisp chart has been located between the FOC bounds, near the upper bound; as a result, for the crisp control chart, the probability of the type II error is of significant level. Also, a comparison of the crisp OC curve with OCavg curve and FOCα curve approved that the probability of the type II error for the crisp chart is more than the same amount for the fuzzy chart. Finally, the efficiency of the fuzzy chart is more than the crisp chart, and also it timely gives essential alerts by means of linguistic terms. Consequently, it is more capable of detecting process shifts. Originality/value This research develops the fuzzy U control chart with variable sample size whose output is fuzzy. After creating control charts, performance evaluation in the industry is important. The main contribution of this paper is to employs the FOC curve for evaluating the performance of the fuzzy control chart, while in prior studies in this area, the performance of fuzzy control chart has not been evaluated.

Download Full-text