This document provides an overview of subdifferentials and proximal operators for convex analysis. It defines subgradients and subdifferentials, and explains their relation to gradients of convex functions. It covers properties like monotonicity, maximal monotonicity, and strong monotonicity. It also discusses calculus rules for subdifferentials of sums and compositions. Finally, it introduces proximal operators and explains that evaluating a proximal operator is equivalent to computing an element of the subdifferential.
The document discusses convex functions and related concepts. It defines convex functions and provides examples of convex and concave functions on R and Rn, including norms, logarithms, and powers. It describes properties that preserve convexity, such as positive weighted sums and composition with affine functions. The conjugate function and quasiconvex functions are also introduced. Key concepts are illustrated with examples throughout.
This document discusses convex functions and some of their key properties. It defines a convex function as a function where the epigraph is a convex set. Some key properties are: convex functions have sublevel sets that are convex, their epigraph representation links them to convex geometry, and Jensen's inequality extends to them. Examples are given of convex functions on R and Rn like norms, quadratic over linear, log-sum-exp, and log-determinant functions. Convex functions satisfy useful first-order and second-order conditions and their properties help derive important inequalities like Holder's inequality.
This document discusses convex optimization and properties of convex functions and sets. It begins by defining a convex set as a set where the line segment between any two points in the set is also contained in the set. It then discusses properties of convex sets such as intersections of convex sets being convex and projections onto convex sets being unique. The document defines convex functions and shows examples, and discusses properties such as convex functions being continuous. It also discusses characterizations of convexity using first and second order conditions. Finally, it discusses how convex optimization problems can be solved provably to the global optimum using algorithms like gradient descent.
This document provides an overview of convex optimization. It begins by explaining that convex optimization can efficiently find global optima for certain functions called convex functions. It then defines convex sets as sets where linear combinations of points in the set are also in the set. Common examples of convex sets include norm balls and positive semidefinite matrices. Convex functions are defined as functions where linear combinations of points on the graph lie below the line connecting those points. Convex functions have properties like their first and second derivatives satisfying certain inequalities, allowing efficient optimization.
The document discusses complex differentiability and analytic functions. It shows that for a function f(z) to be complex differentiable, the Cauchy-Riemann equations relating the partial derivatives of the real and imaginary parts must be satisfied. It also discusses representing functions as power series and their radii of convergence. Multivalued functions like logarithms and roots are discussed, noting the need for branch cuts to define single-valued branches.
This document discusses differentiable and analytic functions of a complex variable z. It defines the derivative of a complex function f(z) and shows that for f(z) to be differentiable, the Cauchy-Riemann equations relating the partial derivatives of the real and imaginary parts must be satisfied. Examples are provided to illustrate calculating derivatives and determining differentiability. The document also covers power series representations of functions, elementary functions like exponential and logarithmic functions, and the concepts of branch points and cuts for multi-valued complex functions.
The document defines inverse functions and provides examples. An inverse function f-1(x) undoes the original function f(x) so that f-1(f(x)) = x. For a function to have an inverse, it must be one-to-one meaning each output of f(x) corresponds to only one input x. The document gives examples of linear functions that are invertible and the function y=x2 that is not invertible because it is not one-to-one. It also states that if a function f(x) is one-to-one on its domain, then it has an inverse function and the domain of f(x) is equal to the range of the
Image sciences, image processing, image restoration, photo manipulation. Image and videos representation. Digital versus analog imagery. Quantization and sampling. Sources and models of noises in digital CCD imagery: photon, thermal and readout noises. Sources and models of blurs. Convolutions and point spread functions. Overview of other standard models, problems and tasks: salt-and-pepper and impulse noises, half toning, inpainting, super-resolution, compressed sensing, high dynamic range imagery, demosaicing. Short introduction to other types of imagery: SAR, Sonar, ultrasound, CT and MRI. Linear and ill-posed restoration problems.
The document discusses convex functions and related concepts. It defines convex functions and provides examples of convex and concave functions on R and Rn, including norms, logarithms, and powers. It describes properties that preserve convexity, such as positive weighted sums and composition with affine functions. The conjugate function and quasiconvex functions are also introduced. Key concepts are illustrated with examples throughout.
This document discusses convex functions and some of their key properties. It defines a convex function as a function where the epigraph is a convex set. Some key properties are: convex functions have sublevel sets that are convex, their epigraph representation links them to convex geometry, and Jensen's inequality extends to them. Examples are given of convex functions on R and Rn like norms, quadratic over linear, log-sum-exp, and log-determinant functions. Convex functions satisfy useful first-order and second-order conditions and their properties help derive important inequalities like Holder's inequality.
This document discusses convex optimization and properties of convex functions and sets. It begins by defining a convex set as a set where the line segment between any two points in the set is also contained in the set. It then discusses properties of convex sets such as intersections of convex sets being convex and projections onto convex sets being unique. The document defines convex functions and shows examples, and discusses properties such as convex functions being continuous. It also discusses characterizations of convexity using first and second order conditions. Finally, it discusses how convex optimization problems can be solved provably to the global optimum using algorithms like gradient descent.
This document provides an overview of convex optimization. It begins by explaining that convex optimization can efficiently find global optima for certain functions called convex functions. It then defines convex sets as sets where linear combinations of points in the set are also in the set. Common examples of convex sets include norm balls and positive semidefinite matrices. Convex functions are defined as functions where linear combinations of points on the graph lie below the line connecting those points. Convex functions have properties like their first and second derivatives satisfying certain inequalities, allowing efficient optimization.
The document discusses complex differentiability and analytic functions. It shows that for a function f(z) to be complex differentiable, the Cauchy-Riemann equations relating the partial derivatives of the real and imaginary parts must be satisfied. It also discusses representing functions as power series and their radii of convergence. Multivalued functions like logarithms and roots are discussed, noting the need for branch cuts to define single-valued branches.
This document discusses differentiable and analytic functions of a complex variable z. It defines the derivative of a complex function f(z) and shows that for f(z) to be differentiable, the Cauchy-Riemann equations relating the partial derivatives of the real and imaginary parts must be satisfied. Examples are provided to illustrate calculating derivatives and determining differentiability. The document also covers power series representations of functions, elementary functions like exponential and logarithmic functions, and the concepts of branch points and cuts for multi-valued complex functions.
The document defines inverse functions and provides examples. An inverse function f-1(x) undoes the original function f(x) so that f-1(f(x)) = x. For a function to have an inverse, it must be one-to-one meaning each output of f(x) corresponds to only one input x. The document gives examples of linear functions that are invertible and the function y=x2 that is not invertible because it is not one-to-one. It also states that if a function f(x) is one-to-one on its domain, then it has an inverse function and the domain of f(x) is equal to the range of the
Image sciences, image processing, image restoration, photo manipulation. Image and videos representation. Digital versus analog imagery. Quantization and sampling. Sources and models of noises in digital CCD imagery: photon, thermal and readout noises. Sources and models of blurs. Convolutions and point spread functions. Overview of other standard models, problems and tasks: salt-and-pepper and impulse noises, half toning, inpainting, super-resolution, compressed sensing, high dynamic range imagery, demosaicing. Short introduction to other types of imagery: SAR, Sonar, ultrasound, CT and MRI. Linear and ill-posed restoration problems.
The document outlines key calculus concepts including:
- Functions, derivatives, differentiation rules, and the definition of a derivative as an infinitesimal change in a function with respect to a variable.
- Concepts related to derivatives such as local/absolute extrema, critical points, increasing/decreasing functions, concavity, asymptotes, and inflection points.
- How to use the first and second derivative tests to determine local extrema, concavity, and increasing/decreasing behavior.
The document discusses key concepts related to limits, continuity, and differentiation. It defines what it means for a variable x to approach a finite number a or infinity, and provides the formal definitions of one-sided limits and two-sided limits. It also discusses indeterminate forms when limits take on forms like 0/0, infinity/infinity, or infinity - infinity. The document outlines several properties of limits, including limits of even and odd functions. It distinguishes between the limit of a function as x approaches a, denoted limx→af(x), versus the function value at that point, f(a). Finally, it states standard theorems about limits, such as the sum and product of two functions whose limits exist
This document provides an outline for a calculus lecture on basic differentiation rules. It includes objectives to understand the derivative of constants, the constant multiple rule, the sum rule, the difference rule, and derivatives of sine and cosine. Examples are provided to find the derivatives of squaring, cubing, square root, and cube root functions using the definition of the derivative. Graphs and properties of these functions and their derivatives are also discussed.
This document provides an outline for a calculus lecture on basic differentiation rules. It includes objectives to understand key rules like the constant multiple rule, sum rule, and derivatives of sine and cosine. Examples are worked through to find the derivatives of functions like squaring, cubing, square root, and cube root using the definition of the derivative. Graphs and properties of derived functions are also discussed.
The document discusses limits and continuity of functions. It provides examples of computing one-sided limits, limits at points of discontinuity, and limits involving algebraic, trigonometric, exponential and logarithmic functions. The key rules for limits include the properties of limits, the sandwich theorem, and limits of compositions of functions. Continuity of functions is defined as a function having a limit equal to its value at a point. Polynomials, trigonometric functions and exponentials are shown to be continuous everywhere they are defined.
This document provides an overview of duality in convex optimization. It introduces the Lagrange dual problem and function, and describes how the dual problem can provide bounds on the optimal value of the original problem. It discusses conditions under which strong duality holds, meaning the dual optimal value equals the primal optimal value. It also introduces complementary slackness and the Karush-Kuhn-Tucker (KKT) conditions, which characterize optimal primal and dual solutions when strong duality holds. Examples discussed include linear programs and their duals.
This document discusses convexity properties of the gamma function. It begins with an introduction to the gamma function and its history. It then revisits earlier work by Krull on functional equations of the form f(x+1)-f(x)=g(x). Several lemmas are presented that characterize properties of convex functions satisfying certain conditions. These results are then applied to derive classical and new representations and characterizations of the gamma function based on its convexity.
This document discusses functions, limits, and continuity. It begins by defining functions, domains, ranges, and some standard real functions like constant, identity, modulus, and greatest integer functions. It then covers limits of functions including one-sided limits and properties of limits. Examples are provided to illustrate evaluating limits using substitution and factorization methods. The overall objectives are to understand functions, domains, ranges, limits of functions and methods to evaluate limits.
Convex Analysis and Duality (based on "Functional Analysis and Optimization" ...Katsuya Ito
In this presentation, we explain the monograph ”Functional Analysis and Optimization” by Kazufumi Ito
https://kito.wordpress.ncsu.edu/files/2018/04/funa3.pdf
Our goal in this presentation is to
-Understand the basic notions of functional analysis
lower-semicontinuous, subdifferential, conjugate functional
- Understand the formulation of duality problem
primal (P), perturbed (Py), and dual (P∗) problem
-Understand the primal-dual relationships
inf(P)≤sup(P∗), inf(P) = sup(P∗), inf supL≤sup inf L
This document summarizes an investigation into extending the Strong Maximum Principle for integral functionals involving Minkowski gauges. It begins by introducing the integral functional and definitions related to Minkowski gauges. It then discusses prior work establishing the Strong Maximum Principle under certain conditions, and extending the class of comparison functions used. The document aims to further generalize the Strong Maximum Principle by considering inf- and sup-convolutions of functions with the Minkowski gauge, which allows the principle to unify previous properties into a single extremal extension principle. It provides background, definitions, and auxiliary results to support the generalization proposed in Section 3.
This document discusses continuity and the Intermediate Value Theorem (IVT) in mathematics. It defines continuity, examines examples of continuous and discontinuous functions, and describes special types of discontinuities. The IVT states that if a function is continuous on a closed interval and takes on intermediate values, there exists a number in the interval where the function value is equal to the intermediate value. Examples are provided to illustrate the IVT, including proving the existence of the square root of two.
This document discusses continuity and the Intermediate Value Theorem (IVT) in mathematics. It defines continuity, examines examples of continuous and discontinuous functions, and establishes theorems about continuity. The IVT states that if a function is continuous on a closed interval and takes on intermediate values within its range, there exists at least one value in the domain where the function value is intermediate. An example proves the existence of the square root of two using the IVT and bisection method.
This document discusses continuity and the Intermediate Value Theorem (IVT) in mathematics. It defines continuity, examines examples of continuous and discontinuous functions, and establishes theorems about continuity. The IVT states that if a function is continuous on a closed interval and takes on intermediate values within its range, there exists at least one value in the domain where the function value is intermediate. An example proves the existence of the square root of two using the IVT and bisection method.
This document discusses concepts related to calculus including functions, limits, continuity, and derivatives. The objective is for students to be able to evaluate limits and determine derivatives of algebraic functions. It defines functions and function notation. It discusses limits, continuity, and the definition of the derivative. It provides examples of evaluating limits using theorems and the squeeze principle. It also defines types of discontinuities and conditions for continuity.
This document discusses partial derivatives, which are used to describe the rate of change of functions with multiple variables. It defines:
1) Partial derivatives as the rate of change of the dependent variable with respect to one independent variable, while holding other variables constant.
2) Functions of two variables have level curves where the function value is constant. Their graphs are surfaces in 3D space.
3) Higher order partial derivatives describe the rate of change of the first partial derivatives.
4) The chain rule extends differentiation to composite functions, allowing functions of variables that are themselves functions of other variables.
The document discusses convex minimization problems under submodular constraints. It begins with an overview of submodular functions and base polytopes. It then defines the convex minimization problem of minimizing a separable convex function g(x) over the base polytope B(f). Several examples of this problem are given, including lexicographically optimal bases and submodular utility allocation markets. It shows these problems are equivalent and discusses the α-curve representation of their optimal solutions.
Its states Periodic function, Fourier series for disontinous function, Fourier series, Intervals, Odd and even functions, Half range fourier series etc. Mostly used as active learning assignment in Degree 3rd sem students.
This document provides an overview of functions, limits, and continuity. It defines key concepts such as domain and range of functions, and examples of standard real functions. It also covers even and odd functions, and how to calculate limits, including left and right hand limits. Methods for evaluating algebraic limits using substitution, factorization, and rationalization are presented. The objectives are to understand functions, domains, ranges, and how to evaluate limits of functions.
The document discusses functions and their derivatives. It defines functions, different types of functions, and notation used for functions. It then covers the concept of limits, theorems on limits, and limits at infinity. The document defines the slope of a tangent line to a curve and increments. It provides definitions and rules for derivatives, including differentiation from first principles and various differentiation rules. It includes examples of finding derivatives using these rules and taking multiple derivatives.
This tutorial offers a step-by-step guide on how to effectively use Pinterest. It covers the basics such as account creation and navigation, as well as advanced techniques including creating eye-catching pins and optimizing your profile. The tutorial also explores collaboration and networking on the platform. With visual illustrations and clear instructions, this tutorial will equip you with the skills to navigate Pinterest confidently and achieve your goals.
The document outlines key calculus concepts including:
- Functions, derivatives, differentiation rules, and the definition of a derivative as an infinitesimal change in a function with respect to a variable.
- Concepts related to derivatives such as local/absolute extrema, critical points, increasing/decreasing functions, concavity, asymptotes, and inflection points.
- How to use the first and second derivative tests to determine local extrema, concavity, and increasing/decreasing behavior.
The document discusses key concepts related to limits, continuity, and differentiation. It defines what it means for a variable x to approach a finite number a or infinity, and provides the formal definitions of one-sided limits and two-sided limits. It also discusses indeterminate forms when limits take on forms like 0/0, infinity/infinity, or infinity - infinity. The document outlines several properties of limits, including limits of even and odd functions. It distinguishes between the limit of a function as x approaches a, denoted limx→af(x), versus the function value at that point, f(a). Finally, it states standard theorems about limits, such as the sum and product of two functions whose limits exist
This document provides an outline for a calculus lecture on basic differentiation rules. It includes objectives to understand the derivative of constants, the constant multiple rule, the sum rule, the difference rule, and derivatives of sine and cosine. Examples are provided to find the derivatives of squaring, cubing, square root, and cube root functions using the definition of the derivative. Graphs and properties of these functions and their derivatives are also discussed.
This document provides an outline for a calculus lecture on basic differentiation rules. It includes objectives to understand key rules like the constant multiple rule, sum rule, and derivatives of sine and cosine. Examples are worked through to find the derivatives of functions like squaring, cubing, square root, and cube root using the definition of the derivative. Graphs and properties of derived functions are also discussed.
The document discusses limits and continuity of functions. It provides examples of computing one-sided limits, limits at points of discontinuity, and limits involving algebraic, trigonometric, exponential and logarithmic functions. The key rules for limits include the properties of limits, the sandwich theorem, and limits of compositions of functions. Continuity of functions is defined as a function having a limit equal to its value at a point. Polynomials, trigonometric functions and exponentials are shown to be continuous everywhere they are defined.
This document provides an overview of duality in convex optimization. It introduces the Lagrange dual problem and function, and describes how the dual problem can provide bounds on the optimal value of the original problem. It discusses conditions under which strong duality holds, meaning the dual optimal value equals the primal optimal value. It also introduces complementary slackness and the Karush-Kuhn-Tucker (KKT) conditions, which characterize optimal primal and dual solutions when strong duality holds. Examples discussed include linear programs and their duals.
This document discusses convexity properties of the gamma function. It begins with an introduction to the gamma function and its history. It then revisits earlier work by Krull on functional equations of the form f(x+1)-f(x)=g(x). Several lemmas are presented that characterize properties of convex functions satisfying certain conditions. These results are then applied to derive classical and new representations and characterizations of the gamma function based on its convexity.
This document discusses functions, limits, and continuity. It begins by defining functions, domains, ranges, and some standard real functions like constant, identity, modulus, and greatest integer functions. It then covers limits of functions including one-sided limits and properties of limits. Examples are provided to illustrate evaluating limits using substitution and factorization methods. The overall objectives are to understand functions, domains, ranges, limits of functions and methods to evaluate limits.
Convex Analysis and Duality (based on "Functional Analysis and Optimization" ...Katsuya Ito
In this presentation, we explain the monograph ”Functional Analysis and Optimization” by Kazufumi Ito
https://kito.wordpress.ncsu.edu/files/2018/04/funa3.pdf
Our goal in this presentation is to
-Understand the basic notions of functional analysis
lower-semicontinuous, subdifferential, conjugate functional
- Understand the formulation of duality problem
primal (P), perturbed (Py), and dual (P∗) problem
-Understand the primal-dual relationships
inf(P)≤sup(P∗), inf(P) = sup(P∗), inf supL≤sup inf L
This document summarizes an investigation into extending the Strong Maximum Principle for integral functionals involving Minkowski gauges. It begins by introducing the integral functional and definitions related to Minkowski gauges. It then discusses prior work establishing the Strong Maximum Principle under certain conditions, and extending the class of comparison functions used. The document aims to further generalize the Strong Maximum Principle by considering inf- and sup-convolutions of functions with the Minkowski gauge, which allows the principle to unify previous properties into a single extremal extension principle. It provides background, definitions, and auxiliary results to support the generalization proposed in Section 3.
This document discusses continuity and the Intermediate Value Theorem (IVT) in mathematics. It defines continuity, examines examples of continuous and discontinuous functions, and describes special types of discontinuities. The IVT states that if a function is continuous on a closed interval and takes on intermediate values, there exists a number in the interval where the function value is equal to the intermediate value. Examples are provided to illustrate the IVT, including proving the existence of the square root of two.
This document discusses continuity and the Intermediate Value Theorem (IVT) in mathematics. It defines continuity, examines examples of continuous and discontinuous functions, and establishes theorems about continuity. The IVT states that if a function is continuous on a closed interval and takes on intermediate values within its range, there exists at least one value in the domain where the function value is intermediate. An example proves the existence of the square root of two using the IVT and bisection method.
This document discusses continuity and the Intermediate Value Theorem (IVT) in mathematics. It defines continuity, examines examples of continuous and discontinuous functions, and establishes theorems about continuity. The IVT states that if a function is continuous on a closed interval and takes on intermediate values within its range, there exists at least one value in the domain where the function value is intermediate. An example proves the existence of the square root of two using the IVT and bisection method.
This document discusses concepts related to calculus including functions, limits, continuity, and derivatives. The objective is for students to be able to evaluate limits and determine derivatives of algebraic functions. It defines functions and function notation. It discusses limits, continuity, and the definition of the derivative. It provides examples of evaluating limits using theorems and the squeeze principle. It also defines types of discontinuities and conditions for continuity.
This document discusses partial derivatives, which are used to describe the rate of change of functions with multiple variables. It defines:
1) Partial derivatives as the rate of change of the dependent variable with respect to one independent variable, while holding other variables constant.
2) Functions of two variables have level curves where the function value is constant. Their graphs are surfaces in 3D space.
3) Higher order partial derivatives describe the rate of change of the first partial derivatives.
4) The chain rule extends differentiation to composite functions, allowing functions of variables that are themselves functions of other variables.
The document discusses convex minimization problems under submodular constraints. It begins with an overview of submodular functions and base polytopes. It then defines the convex minimization problem of minimizing a separable convex function g(x) over the base polytope B(f). Several examples of this problem are given, including lexicographically optimal bases and submodular utility allocation markets. It shows these problems are equivalent and discusses the α-curve representation of their optimal solutions.
Its states Periodic function, Fourier series for disontinous function, Fourier series, Intervals, Odd and even functions, Half range fourier series etc. Mostly used as active learning assignment in Degree 3rd sem students.
This document provides an overview of functions, limits, and continuity. It defines key concepts such as domain and range of functions, and examples of standard real functions. It also covers even and odd functions, and how to calculate limits, including left and right hand limits. Methods for evaluating algebraic limits using substitution, factorization, and rationalization are presented. The objectives are to understand functions, domains, ranges, and how to evaluate limits of functions.
The document discusses functions and their derivatives. It defines functions, different types of functions, and notation used for functions. It then covers the concept of limits, theorems on limits, and limits at infinity. The document defines the slope of a tangent line to a curve and increments. It provides definitions and rules for derivatives, including differentiation from first principles and various differentiation rules. It includes examples of finding derivatives using these rules and taking multiple derivatives.
This tutorial offers a step-by-step guide on how to effectively use Pinterest. It covers the basics such as account creation and navigation, as well as advanced techniques including creating eye-catching pins and optimizing your profile. The tutorial also explores collaboration and networking on the platform. With visual illustrations and clear instructions, this tutorial will equip you with the skills to navigate Pinterest confidently and achieve your goals.
The cherry: beauty, softness, its heart-shaped plastic has inspired artists since Antiquity. Cherries and strawberries were considered the fruits of paradise and thus represented the souls of men.
Heart Touching Romantic Love Shayari In English with ImagesShort Good Quotes
Explore our beautiful collection of Romantic Love Shayari in English to express your love. These heartfelt shayaris are perfect for sharing with your loved one. Get the best words to show your love and care.
Fashionista Chic Couture Maze & Coloring Adventures is a coloring and activity book filled with many maze games and coloring activities designed to delight and engage young fashion enthusiasts. Each page offers a unique blend of fashion-themed mazes and stylish illustrations to color, inspiring creativity and problem-solving skills in children.
This document announces the winners of the 2024 Youth Poster Contest organized by MATFORCE. It lists the grand prize and age category winners for grades K-6, 7-12, and individual age groups from 5 years old to 18 years old.
2. Learning goals
• Be able to derive subdifferential and proximal operator formulas
• Understand that subdifferentials define affine minorizors
• Existence of subgradient for convex functions
• Understand maximal monotonicity and Minty’s theorem
• Know strong monotonicity and relation to strong convexity
• Know different characterizations of smoothness
• Understand and be able to use Fermat’s rule
• Know subdifferential calculus rules
• Understand that prox evaluates subdifferential
2
4. Gradients of convex functions
• Recall: A differentiable function f : Rn
→ R is convex iff
f(y) ≥ f(x) + ∇f(x)T
(y − x)
for all x, y ∈ Rn
f(y)
f(x) + ∇f(x)T (y − x)
(∇f(x), −1)
(x, f(x))
• Function f has for all x ∈ Rn
an affine minorizer that:
• has slope s defined by ∇f
• coincides with function f at x
• defines normal (∇f(x), −1) to epigraph of f
• What if function is nondifferentiable?
4
5. Subdifferentials and subgradients
• Subgradients s define affine minorizers to the function that:
(s, −1)
(s, −1)
(s, −1)
• coincide with f at x
• define normal vector (s, −1) to epigraph of f
• can be one of many affine minorizers at nondifferentiable points x
• Subdifferential of f : Rn
→ R at x is set of vectors s satisfying
f(y) ≥ f(x) + sT
(y − x) for all y ∈ Rn
, (1)
• Notation:
• subdifferential: ∂f : Rn
→ 2Rn
(power-set notation 2Rn
)
• subdifferential at x: ∂f(x) = {s : (1) holds}
• elements s ∈ ∂f(x) are called subgradients of f at x
5
6. Relation to gradient
• If f differentiable at x and ∂f(x) 6= ∅ then ∂f(x) = {∇f(x)}:
(s, −1)
x1
(∇f(x2), −1)
x2 (∇f(x3), −1)
x3
• i.e., subdifferential (if nonempty) at x consists of only gradient
6
7. Subgradient existence – Nonconvex example
• Function can be differentiable at x but ∂f(x) = ∅
x1
x2
x3
• x1: ∂f(x1) = {0}, ∇f(x1) = 0
• x2: ∂f(x2) = ∅, ∇f(x2) = 0
• x3: ∂f(x3) = ∅, ∇f(x3) = 0
• Gradient is a local concept, subdifferential is a global property
7
8. Subgradient existence – Convex example
• Consider the convex function:
x1
x2
x3
f(x) = |x|
• What are the subdifferentials at points x1, x2, x3?
• Subdifferential at x1 is -1 (affine minorizer with slope -1)
• Subdifferential at x2 is [-1,1] (affine minorizers with slope [-1,1])
• Subdifferential at x3 is 1 (affine minorizer with slope 1)
Fact:
• For finite-valued convex functions, a subgradient exists for every x
8
9. Existence for extended-valued convex functions
• Let f : Rn
→ R ∪ {∞} be convex, then:
1. Subgradients exist for all x in relative interior of domf
2. Subgradients sometimes exist for x on boundary of domf
3. No subgradient exists for x outside domf
• Examples for second case, boundary points of domf:
−
√
1 − x2 + ι[−1,1](x) x2 + ι[−2,2](x)
• No subgradient (affine minorizer) exists for left function at x = 1
9
10. Monotonicity
• Subdifferential operator is monotone:
(sx − sy)T
(x − y) ≥ 0
for all sx ∈ ∂f(x) and sy ∈ ∂f(y)
• Proof: Add two copies of subdifferential definition
f(y) ≥ f(x) + sT
x (y − x)
with x and y swapped
• ∂f : R → 2R
: Minimum slope 0 and maximum slope ∞
∂f
x
10
11. Monotonicity beyond subdifferentials
• Let A : Rn
→ 2Rn
be monotone, i.e.:
(u − v)T
(x − y) ≥ 0
for all u ∈ Ax and v ∈ Ay
• If n = 1, then A = ∂f for some function f : R → R ∪ {∞}
• If n ≥ 2 there exist monotone A that are not subdifferentials
11
12. Maximal monotonicity
• Let the set gph ∂f := {(x, u) : u ∈ ∂f(x)} be the graph of ∂f
• ∂f is maximally monotone if no other function g exists with
gph ∂f ⊂ gph ∂g,
with strict inclusion
• A result (due to Rockafellar):
f is closed convex if and only if ∂f is maximally monotone
12
13. Minty’s theorem
• Let ∂f : Rn
→ 2Rn
and α > 0
• ∂f is maximally monotone if and only if range(αI + ∂f) = Rn
∂f1
x
maximally monotone
∂f2
x
not maximally monotone
∂f1 + αI
x
full range
∂f2 + αI
x
not full range
• Interpretation: No “holes” in gph ∂f
13
14. Strong convexity
• Recall that f is σ-strongly convex if f − σ
2 k · k2
2 is convex
• If f is σ-strongly convex then
f(y) ≥ f(x) + sT
(y − x) + σ
2 kx − yk2
2
holds for all x ∈ dom∂f, s ∈ ∂f(x), and y ∈ Rn
• The function has convex quadratic minorizers instead of affine
f(y)
f(x1) + sT
1 (y − x1) + σ
2
kx1 − yk2
2
x1
(s1, −1)
f(x2) + sT
2,1(y − x2) + σ
2
kx2 − yk2
2
(s2,1, −1)
f(x2) + sT
2,2(y − x2) + σ
2
kx2 − yk2
2
x2
(s2,2, −1)
• Multiple lower bounds at x2 with subgradients s2,1 and s2,2
14
15. Strong monotonicity
• If f σ-strongly convex function, then ∂f is σ-strongly monotone:
(sx − sy)T
(x − y) ≥ σkx − yk2
2
for all sx ∈ ∂f(x) and sy ∈ ∂f(y)
• Proof: Add two copies of strong convexity inequality
f(y) ≥ f(x) + sT
x (y − x) + σ
2 kx − yk2
2
with x and y swapped
• ∂f is σ-strongly monotone if and only if ∂f − σI is monotone
• ∂f : R → 2R
: Minimum slope σ and maximum slope ∞
∂f
x
15
16. Strongly convex functions – An equivalence
The following are equivalent
(i) f is closed and σ-strongly convex
(ii) ∂f is maximally monotone and σ-strongly monotone
Proof:
(i)⇒(ii): we know this from before
(ii)⇒(i): (ii) ⇒ ∂f − σI = ∂(f − σ
2 k · k2
2) maximally monotone
⇒ f − σ
2 k · k2
2 closed convex
⇒ f closed and σ-strongly convex
16
17. Smoothness and convexity
• A differentiable function f : Rn
→ R is convex and smooth if
f(y) ≤ f(x) + ∇f(x)T
(y − x) + β
2 kx − yk2
2
f(y) ≥ f(x) + ∇f(x)T
(y − x)
holds for all x, y ∈ Rn
• f has convex quadratic majorizers and affine minorizers
f(x1) + ∇f(x1)T (y − x1) + β
2
kx1 − yk2
2
x1
(∇f(x2), −1)
f(x2) + ∇f(x2)T (y − x2) + β
2
kx2 − yk2
2
x2
(∇f(x2), −1)
f(y)
• Quadratic upper bound is called descent lemma 17
18. Gradient of smooth convex function
• Gradient of smooth convex function is monotone and Lipschitz
(∇f(x) − ∇f(y))T
(x − y) ≥ 0
k∇f(y) − ∇f(x)k2 ≤ βkx − yk2
• ∇f : R → R: Minimum slope 0 and maximum slope β
∇f
x
• Actually satisfies the stronger 1
β -cocoercivity property:
(∇f(x) − ∇f(y))T
(x − y) ≥ 1
β k∇f(y) − ∇f(x)k2
2
18
19. Smooth convex functions – Equivalences
Let f : Rn
→ R be differentiable. The following are equivalent:
(i) ∇f is 1
β -cocoercive
(ii) ∇f is maximally monotone and β-Lipschitz continuous
(iii) f is closed convex and satisfies descent lemma (is β-smooth)
• Implication (ii)⇒(i) is called the Baillon-Haddad theorem
• Will connect smoothness and strong convexity via conjugates in next lecture
19
20. Fermat’s rule
Let f : Rn
→ R ∪ {∞}, then x minimizes f if and only if
0 ∈ ∂f(x)
• Proof: x minimizes f if and only if
f(y) ≥ f(x) + 0T
(y − x) for all y ∈ Rn
which by definition of subdifferential is equivalent to 0 ∈ ∂f(x)
• Example: several subgradients at solution, including 0
(0, −1)
20
21. Fermat’s rule – Nonconvex example
• Fermat’s rule holds also for nonconvex functions
• Example:
x1
x2
(0, −1)
• ∂f(x1) = 0 and ∇f(x1) = 0 (global minimum)
• ∂f(x2) = ∅ and ∇f(x2) = 0 (local minimum)
• For nonconvex f, we can typically only hope to find local minima
21
23. Subdifferential of sum
If f1, f2 closed convex and relint domf1 ∩ relint domf2 6= ∅:
∂(f1 + f2) = ∂f1 + ∂f2
• One direction always holds: if x ∈ dom∂f1 ∩ dom∂f2:
∂(f1 + f2)(x) ⊇ ∂f1(x) + ∂f2(x)
Proof: let si ∈ ∂fi(x), add subdifferential definitions:
f1(y) + f2(y) ≥ f1(x) + f2(x) + (s1 + s2)T
(y − x)
i.e. s1 + s2 ∈ ∂(f1 + f2)(x)
• If f1 and f2 differentiable, we have (without convexity of f)
∇(f1 + f2) = ∇f1 + ∇f2
23
24. Subdifferential of composition
If f closed convex and relint dom(f ◦ L) 6= ∅:
∂(f ◦ L)(x) = LT
∂f(Lx)
• One direction always holds: If Lx ∈ domf, then
∂(f ◦ L)(x) ⊇ LT
∂f(Lx)
Proof: let s ∈ ∂f(Lx), then by definition of subgradient of f:
(f ◦ L)(y) ≥ (f ◦ L)(x) + sT
(Ly − Lx) = (f ◦ L)(x) + (LT
s)T
(y − x)
i.e., LT
s ∈ ∂(f ◦ L)(x)
• If f differentiable, we have chain rule (without convexity of f)
∇(f ◦ L)(x) = LT
∇f(Lx)
24
25. A sufficient optimality condition
Let f : Rm
→ R, g : Rn
→ R, and L ∈ Rm×n
then:
minimize f(Lx) + g(x) (1)
is solved by every x ∈ Rn
that satisfies
0 ∈ LT
∂f(Lx) + ∂g(x) (2)
• Subdifferential calculus inclusions say:
0 ∈ LT
∂f(Lx) + ∂g(x) ⊆ ∂((f ◦ L)(x) + g(x))
which by Fermat’s rule is equivalent to x solution to (1)
• Note: (1) can have solution but no x exists that satisfies (2)
25
26. A necessary and sufficient optimality condition
Let f : Rm
→ R, g : Rn
→ R, L ∈ Rm×n
with f, g closed convex
and assume relint dom(f ◦ L) ∩ relint domg 6= ∅ then:
minimize f(Lx) + g(x) (1)
is solved by x ∈ Rn
if and only if x satisfies
0 ∈ LT
∂f(Lx) + ∂g(x) (2)
• Subdifferential calculus equality rules say:
0 ∈ LT
∂f(Lx) + ∂g(x) = ∂((f ◦ L)(x) + g(x))
which by Fermat’s rule is equivalent to x solution to (1)
• Algorithms search for x that satisfy 0 ∈ LT
∂f(Lx) + ∂g(x)
26
27. A comment to constraint qualification
• The condition
relint dom(f ◦ L) ∩ relint domg 6= ∅
is called constraint qualification and referred to as CQ
• It is a mild condition that rarely is not satisfied
dom(f ◦ L)
domg
no solution
dom(f ◦ L)
domg
solution
no CQ
dom(f ◦ L)
domg
solution
CQ
27
28. Evaluating subgradients of convex functions
• Obviously need to evaluate subdifferentials to solve
0 ∈ LT
∂f(Lx) + ∂g(x)
• Explicit evaluation:
• If function is differentiable: ∇f (unique)
• If function is nondifferentiable: compute element in ∂f
• Implicit evaluation:
• Proximal operator (specific element of subdifferential)
28
30. Proximal operator
• Proximal operator of (convex) g defined as:
proxγg(z) = argmin
x
(g(x) + 1
2γ kx − zk2
2)
where γ > 0 is a parameter
• Evaluating prox requires solving optimization problem
• Objective is strongly convex ⇒ solution exists and is unique
30
31. Prox evaluates the subdifferential
• Fermat’s rule on prox definition: x = proxγg(z) if and only if
0 ∈ ∂g(x) + γ−1
(x − z) ⇔ γ−1
(z − x) ∈ ∂g(x)
Hence, γ−1
(z − x) is element in ∂g(x)
• A subgradient ∂g(x) where x = proxγg(z) is computed
• Often used in algorithms when g nonsmooth (no gradient exists)
31
32. Prox is generalization of projection
• Recall the indicator function of a set C
ιC(x) :=
(
0 if x ∈ C
∞ otherwise
• Then
ΠC(z) = argmin
x
(kx − zk2 : x ∈ C)
= argmin
x
(1
2 kx − zk2
2 : x ∈ C)
= argmin
x
(1
2 kx − zk2
2 + ιC(x))
= proxιC
(z)
• Projection onto C equals prox of indicator function of C
32
33. Proximal operator – Example 1
Let g(x) = 1
2 xT
Hx + hT
x with H positive semidefinite
• Gradient satisfies ∇g(x) = Hx + h
• Fermat’s rule for x = proxγgz:
0 = ∇g(x) + γ−1
(x − z) ⇔ 0 = Hx + h + γ−1
(x − z)
⇔ (I + γH)x = z − γh
⇔ x = (I + γH)−1
(z − γh)
• So proxγgz = (I + γH)−1
(z − γh)
33
34. Proximal operator – Example 2
• Consider the function g with subdifferential ∂g:
g(x) =
(
−x if x ≤ 0
0 if x ≥ 0
∂g(x) =
−1 if x < 0
[−1, 0] if x = 0
0 if x > 0
• Graphical representations
(−1, −1)
(−1, −1)
(−0.5, −1) (0, −1) (0, −1)
g(x)
x
∂g(x)
x
• Fermat’s rule for x = proxγgz:
0 ∈ ∂g(x) + γ−1
(x − z)
34
35. Proximal operator – Example 2 cont’d
• Let x < 0, then Fermat’s rule reads
0 = −1 + γ−1
(x − z) ⇔ x = z + γ
which is valid (x < 0) if z < −γ
• Let x = 0, then Fermat’s rule reads
0 = [−1, 0] + γ−1
(0 − z)
which is valid (x = 0) if z ∈ [−γ, 0]
• Let x > 0, then Fermat’s rule reads
0 = 0 + γ−1
(x − z) ⇔ x = z
which is valid (x > 0) if z > 0
• The prox satisfies
proxγg(z) =
z + γ if z < −γ
0 if z ∈ [−γ, 0]
z if z > 0
35
36. Computational cost
• Evaluating prox requires solving optimization problem
proxγg(z) = argmin
x
(g(x) + 1
2γ kx − zk2
2)
• Prox typically more expensive to evaluate than gradient
• Example: Quadratic g(x) = 1
2 xT
Hx + hT
x:
proxγg(z) = (I + γH)−1
(z − γh), ∇g(z) = Hz − h
36