{"id":35,"date":"2015-10-15T22:52:58","date_gmt":"2015-10-15T22:52:58","guid":{"rendered":"http:\/\/www.aarondefazio.com\/tangentially\/?p=35"},"modified":"2019-04-29T21:08:02","modified_gmt":"2019-04-29T21:08:02","slug":"the-many-ways-to-analyse-gradient-descent-part-2","status":"publish","type":"post","link":"https:\/\/www.aarondefazio.com\/tangentially\/?p=35","title":{"rendered":"The Many Ways to Analyse Gradient Descent: Part 2"},"content":{"rendered":"<p><!DOCTYPE html><br \/>\n<html><br \/>\n<head> <title><\/title><br \/>\n<meta charset=\"UTF-8\" \/><br \/>\n<meta name=\"generator\" content=\"TeX4ht (http:\/\/www.cse.ohio-state.edu\/~gurari\/TeX4ht\/)\" \/>\n<link rel=\"stylesheet\" type=\"text\/css\" href=\"GradientDescentNonSC.css\" \/>\n<!--l. 28--><\/p>\n<p class=\"noindent\" ><a href=\"http:\/\/www.aarondefazio.com\/tangentially\/?p=32\">The previous post<\/a> detailed a bunch of different ways of proving the convergence rate<br \/>\nof gradient descent:\n<\/p>\n<div class=\"math-display\"><!--l. 30--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n>\n                                      <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n> <mo \nclass=\"MathClass-rel\">=<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>\u03b1<\/mi><msup><mrow \n><mi \n>f<\/mi><\/mrow><mrow \n><mi \n>\u2032<\/mi><\/mrow><\/msup \n><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n>\n<mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><mo \nclass=\"MathClass-punc\">,<\/mo>\n<\/mrow><\/math><\/div>\n<p><!--l. 32--><\/p>\n<p class=\"nopar\" >\n<p><!--l. 35--><\/p>\n<p class=\"indent\" >   for strongly convex problems. This post considers the non-strongly convex, but<br \/>\nstill convex case.\n<\/p>\n<h3 class=\"likesectionHead\"><a \n id=\"x1-1000\"><\/a>Rehash of Basic Lemmas<\/h3>\n<p><!--l. 41--><\/p>\n<p class=\"noindent\" >These hold for any <!--l. 41--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mi \n>x<\/mi><\/math><br \/>\nand <!--l. 41--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mi \n>y<\/mi><\/math>.<br \/>\n<!--l. 41--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mi \n>L<\/mi><\/math><br \/>\nthe Lipschitz smoothness constant. These are completely<br \/>\nstandard, see Nesterov&#x2019;s book <span class=\"cite\">[<a \nhref=\"#Xnes-book\">2<\/a>]<\/span> for proofs. We use the notation<br \/>\n<!--l. 43--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/math> for an arbitrary<br \/>\nminimizer of <!--l. 44--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mi \n>f<\/mi><\/math>.\n<\/p>\n<table class=\"equation\">\n<tr>\n<td> <a \n id=\"x1-1001r1\"><\/a><br \/>\n<!--l. 47--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" class=\"equation\">\n                <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><mi \n>y<\/mi><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-rel\">\u2264<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><mi \n>x<\/mi><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">+<\/mo> <mfenced separators=\"\" \nopen=\"\u27e8\"  close=\"\u27e9\" ><mrow><msup><mrow \n><mi \n>f<\/mi><\/mrow><mrow \n><mi \n>\u2032<\/mi><\/mrow><\/msup \n><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><mi \n>x<\/mi><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><mo \nclass=\"MathClass-punc\">,<\/mo><mi \n>y<\/mi> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>x<\/mi><\/mrow><\/mfenced> <mo \nclass=\"MathClass-bin\">+<\/mo> <mfrac><mrow \n><mi \n>L<\/mi><\/mrow> \n<mrow \n><mn>2<\/mn><\/mrow><\/mfrac><msup><mrow \n> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><mi \n>x<\/mi> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>y<\/mi><\/mrow><\/mfenced><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n><mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/math><\/td>\n<td class=\"eq-no\">(1)<\/td>\n<\/tr>\n<\/table>\n<table class=\"equation\">\n<tr>\n<td> <a \n id=\"x1-1002r2\"><\/a><br \/>\n<!--l. 52--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" class=\"equation\">\n             <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><mi \n>y<\/mi><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-rel\">\u2265<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><mi \n>x<\/mi><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">+<\/mo> <mfenced separators=\"\" \nopen=\"\u27e8\"  close=\"\u27e9\" ><mrow><msup><mrow \n><mi \n>f<\/mi><\/mrow><mrow \n><mi \n>\u2032<\/mi><\/mrow><\/msup \n><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><mi \n>x<\/mi><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><mo \nclass=\"MathClass-punc\">,<\/mo><mi \n>y<\/mi> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>x<\/mi><\/mrow><\/mfenced> <mo \nclass=\"MathClass-bin\">+<\/mo>  <mfrac><mrow \n><mn>1<\/mn><\/mrow> \n<mrow \n><mn>2<\/mn><mi \n>L<\/mi><\/mrow><\/mfrac><msup><mrow \n> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msup><mrow \n><mi \n>f<\/mi><\/mrow><mrow \n><mi \n>\u2032<\/mi><\/mrow><\/msup \n><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><mi \n>x<\/mi><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msup><mrow \n><mi \n>f<\/mi><\/mrow><mrow \n><mi \n>\u2032<\/mi><\/mrow><\/msup \n><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><mi \n>y<\/mi><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/mrow><\/mfenced><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n><mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/math><\/td>\n<td class=\"eq-no\">(2)<\/td>\n<\/tr>\n<\/table>\n<table class=\"equation\">\n<tr>\n<td> <a \n id=\"x1-1003r3\"><\/a><br \/>\n<!--l. 57--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" class=\"equation\">\n                <mfenced separators=\"\" \nopen=\"\u27e8\"  close=\"\u27e9\" ><mrow><msup><mrow \n><mi \n>f<\/mi><\/mrow><mrow \n><mi \n>\u2032<\/mi><\/mrow><\/msup \n><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><mi \n>x<\/mi><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msup><mrow \n><mi \n>f<\/mi><\/mrow><mrow \n><mi \n>\u2032<\/mi><\/mrow><\/msup \n><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><mi \n>y<\/mi><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><mo \nclass=\"MathClass-punc\">,<\/mo><mi \n>x<\/mi> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>y<\/mi><\/mrow><\/mfenced> <mo \nclass=\"MathClass-rel\">\u2265<\/mo> <mfrac><mrow \n><mn>1<\/mn><\/mrow> \n<mrow \n><mi \n>L<\/mi><\/mrow><\/mfrac><msup><mrow \n> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msup><mrow \n><mi \n>f<\/mi><\/mrow><mrow \n><mi \n>\u2032<\/mi><\/mrow><\/msup \n><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><mi \n>x<\/mi><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msup><mrow \n><mi \n>f<\/mi><\/mrow><mrow \n><mi \n>\u2032<\/mi><\/mrow><\/msup \n><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><mi \n>y<\/mi><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/mrow><\/mfenced><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n><mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/math><\/td>\n<td class=\"eq-no\">(3)<\/td>\n<\/tr>\n<\/table>\n<p><!--l. 64--><\/p>\n<p class=\"noindent\" >\n<h3 class=\"sectionHead\"><span class=\"titlemark\">1   <\/span> <a \n id=\"x1-20001\"><\/a>Proximal Style Convergence Proof<\/h3>\n<p><!--l. 66--><\/p>\n<p class=\"noindent\" >The following argument gives a proof of convergence that is well suited to<br \/>\nmodi\ufb01cation for proving the convergence of proximal gradient methods. We start by<br \/>\nproving a useful lemma:\n<\/p>\n<div class=\"newtheorem\">\n<!--l. 69--><\/p>\n<p class=\"noindent\" ><span class=\"head\"><br \/>\n<a \n id=\"x1-2001r1\"><\/a><br \/>\n<span \nclass=\"ecbx-1000\">Lemma 1.<\/span>  <\/span><span \nclass=\"ecti-1000\">For any <\/span><!--l. 70--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/math><br \/>\n<span \nclass=\"ecti-1000\">and <\/span><!--l. 70--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mi \n>y<\/mi><\/math><span \nclass=\"ecti-1000\">,<\/span><br \/>\n<span \nclass=\"ecti-1000\">when <\/span><!--l. 70--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n> <mo \nclass=\"MathClass-rel\">=<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mfrac><mrow \n><mn>1<\/mn><\/mrow> \n<mrow \n><mi \n>L<\/mi><\/mrow><\/mfrac><msup><mrow \n><mi \n>f<\/mi><\/mrow><mrow \n><mi \n>\u2032<\/mi><\/mrow><\/msup \n><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n>\n<mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/math><span \nclass=\"ecti-1000\">:<\/span>\n<\/p>\n<div class=\"par-math-display\"><!--l. 72--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n>\n                      <mfrac><mrow \n><mn>2<\/mn><\/mrow> \n<mrow \n><mi \n>L<\/mi><\/mrow><\/mfrac> <mfenced separators=\"\" \nopen=\"[\"  close=\"]\" ><mrow><mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><mi \n>y<\/mi><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/mrow><\/mfenced> <mo \nclass=\"MathClass-rel\">\u2265<\/mo><msup><mrow \n> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><mi \n>y<\/mi> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><\/mfenced> <\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo><msup><mrow \n><mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n>\n<mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>y<\/mi><\/mrow><\/mfenced><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n><mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/mrow><\/math><\/div>\n<p><!--l. 74--><\/p>\n<p class=\"nopar\" >\n<\/p><\/div>\n<p><!--l. 76--><\/p>\n<p class=\"indent\" >\n<div class=\"proof\">\n<!--l. 77--><\/p>\n<p class=\"indent\" >   <span class=\"head\"><br \/>\n<span \nclass=\"ecti-1000\">Proof.<\/span> <\/span>We start with the Lipschitz upper bound around <!--l. 77--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/math><br \/>\nof <!--l. 77--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/math>:\n<\/p>\n<div class=\"math-display\"><!--l. 78--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n>\n                <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-rel\">\u2264<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">+<\/mo> <mfenced separators=\"\" \nopen=\"\u27e8\"  close=\"\u27e9\" ><mrow><msup><mrow \n><mi \n>f<\/mi><\/mrow><mrow \n><mi \n>\u2032<\/mi><\/mrow><\/msup \n><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n>\n<mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><mo \nclass=\"MathClass-punc\">,<\/mo><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><\/mfenced> <mo \nclass=\"MathClass-bin\">+<\/mo> <mfrac><mrow \n><mi \n>L<\/mi><\/mrow> \n<mrow \n><mn>2<\/mn><\/mrow><\/mfrac><msup><mrow \n> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><\/mfenced> <\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n><mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/mrow><\/math><\/div>\n<p><!--l. 80--><\/p>\n<p class=\"nopar\" >\n<p><!--l. 83--><\/p>\n<p class=\"indent\" >   Now we bound <!--l. 83--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/math><br \/>\nusing the negated convexity lower bound of <!--l. 84--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mi \n>y<\/mi><\/math><br \/>\naround <!--l. 84--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mi \n>x<\/mi><\/math><br \/>\n(i.e. <!--l. 84--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><mi \n>y<\/mi><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-rel\">\u2265<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">+<\/mo> <mfenced separators=\"\" \nopen=\"\u27e8\"  close=\"\u27e9\" ><mrow><msup><mrow \n><mi \n>f<\/mi><\/mrow><mrow \n><mi \n>\u2032<\/mi><\/mrow><\/msup \n><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><mo \nclass=\"MathClass-punc\">,<\/mo><mi \n>y<\/mi> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><\/mfenced><\/math>):\n<\/p>\n<div class=\"math-display\"><!--l. 85--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n>\n          <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-rel\">\u2264<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><mi \n>y<\/mi><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">+<\/mo> <mfenced separators=\"\" \nopen=\"\u27e8\"  close=\"\u27e9\" ><mrow><msup><mrow \n><mi \n>f<\/mi><\/mrow><mrow \n><mi \n>\u2032<\/mi><\/mrow><\/msup \n><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n>\n<mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><mo \nclass=\"MathClass-punc\">,<\/mo><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">+<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>y<\/mi><\/mrow><\/mfenced> <mo \nclass=\"MathClass-bin\">+<\/mo> <mfrac><mrow \n><mi \n>L<\/mi><\/mrow> \n<mrow \n><mn>2<\/mn><\/mrow><\/mfrac><msup><mrow \n> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><\/mfenced> <\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n><mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/mrow><\/math><\/div>\n<p><!--l. 87--><\/p>\n<p class=\"nopar\" >\n<p><!--l. 90--><\/p>\n<p class=\"indent\" >   Negating,        rearranging        and        multiplying        through        by<br \/>\n<!--l. 90--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" > <mfrac><mrow \n><mn>2<\/mn><\/mrow>\n<mrow \n><mi \n>L<\/mi><\/mrow><\/mfrac><\/math><br \/>\ngives:\n<\/p>\n<div class=\"math-display\"><!--l. 91--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n>\n                <mfrac><mrow \n><mn>2<\/mn><\/mrow> \n<mrow \n><mi \n>L<\/mi><\/mrow><\/mfrac> <mfenced separators=\"\" \nopen=\"[\"  close=\"]\" ><mrow><mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><mi \n>y<\/mi><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/mrow><\/mfenced> <mo \nclass=\"MathClass-rel\">\u2265<\/mo> <mfrac><mrow \n><mn>2<\/mn><\/mrow> \n<mrow \n><mi \n>L<\/mi><\/mrow><\/mfrac> <mfenced separators=\"\" \nopen=\"\u27e8\"  close=\"\u27e9\" ><mrow><msup><mrow \n><mi \n>f<\/mi><\/mrow><mrow \n><mi \n>\u2032<\/mi><\/mrow><\/msup \n><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n>\n<mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><mo \nclass=\"MathClass-punc\">,<\/mo><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>y<\/mi><\/mrow><\/mfenced> <mo \nclass=\"MathClass-bin\">+<\/mo><msup><mrow \n> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><\/mfenced> <\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n><mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/mrow><\/math><\/div>\n<p><!--l. 93--><\/p>\n<p class=\"nopar\" >\n<p><!--l. 96--><\/p>\n<p class=\"indent\" >   Now we replace <!--l. 96--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><msup><mrow \n><mi \n>f<\/mi><\/mrow><mrow \n><mi \n>\u2032<\/mi><\/mrow><\/msup \n><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/math><br \/>\nusing <!--l. 96--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n> <mo \nclass=\"MathClass-rel\">=<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mfrac><mrow \n><mn>1<\/mn><\/mrow> \n<mrow \n><mi \n>L<\/mi><\/mrow><\/mfrac><msup><mrow \n><mi \n>f<\/mi><\/mrow><mrow \n><mi \n>\u2032<\/mi><\/mrow><\/msup \n><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n>\n<mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/math>:<\/p>\n<p><!--tex4ht:inline--><\/p>\n<p><!--l. 97--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" >\n<mtable \nclass=\"eqnarray-star\" columnalign=\"right center left\" >\n<mtr><mtd \nclass=\"eqnarray-1\"> <mfrac><mrow \n><mn>2<\/mn><\/mrow>\n<mrow \n><mi \n>L<\/mi><\/mrow><\/mfrac> <mfenced separators=\"\" \nopen=\"[\"  close=\"]\" ><mrow><mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><mi \n>y<\/mi><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/mrow><\/mfenced><\/mtd><mtd \nclass=\"eqnarray-2\">   <mo \nclass=\"MathClass-rel\">\u2265<\/mo><\/mtd><mtd \nclass=\"eqnarray-3\">   <mn>2<\/mn> <mfenced separators=\"\" \nopen=\"\u27e8\"  close=\"\u27e9\" ><mrow><mi \n>y<\/mi> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><mspace width=\"0.3em\" class=\"thinspace\"\/><mo \nclass=\"MathClass-punc\">,<\/mo><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><\/mfenced> <mo \nclass=\"MathClass-bin\">\u2212<\/mo><msup><mrow \n><mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><\/mfenced> <\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n>         <\/mtd><mtd \nclass=\"eqnarray-4\"> <mtext class=\"eqnarray\"><\/mtext><\/mtd>\n<\/mtr><mtr><mtd \nclass=\"eqnarray-1\">                <\/mtd><mtd \nclass=\"eqnarray-2\">   <mo \nclass=\"MathClass-rel\">=<\/mo><\/mtd><mtd \nclass=\"eqnarray-3\">   <mn>2<\/mn> <mfenced separators=\"\" \nopen=\"\u27e8\"  close=\"\u27e9\" ><mrow><mi \n>y<\/mi> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">+<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><mspace width=\"0.3em\" class=\"thinspace\"\/><mo \nclass=\"MathClass-punc\">,<\/mo><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><\/mfenced> <mo \nclass=\"MathClass-bin\">\u2212<\/mo><msup><mrow \n><mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><\/mfenced> <\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n><\/mtd><mtd \nclass=\"eqnarray-4\"> <mtext class=\"eqnarray\"><\/mtext><\/mtd>\n<\/mtr><mtr><mtd \nclass=\"eqnarray-1\">                <\/mtd><mtd \nclass=\"eqnarray-2\">   <mo \nclass=\"MathClass-rel\">=<\/mo><\/mtd><mtd \nclass=\"eqnarray-3\">   <mn>2<\/mn> <mfenced separators=\"\" \nopen=\"\u27e8\"  close=\"\u27e9\" ><mrow><mi \n>y<\/mi> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><mo \nclass=\"MathClass-punc\">,<\/mo><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><\/mfenced> <mo \nclass=\"MathClass-bin\">+<\/mo> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><\/mfenced> <msup><mrow \n><mo \nclass=\"MathClass-punc\">.<\/mo><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n>          <\/mtd><mtd \nclass=\"eqnarray-4\"> <mtext class=\"eqnarray\"><\/mtext><\/mtd>    <\/mtr><\/mtable>\n<\/math><br \/>\n<!--l. 101--><\/p>\n<p class=\"nopar\" >\n<p><!--l. 104--><\/p>\n<p class=\"indent\" >   Now we complete the square using the quadratic<br \/>\n<!--l. 104--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><msup><mrow \n> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><mi \n>y<\/mi> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">+<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><\/mfenced> <\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n> <mo \nclass=\"MathClass-rel\">=<\/mo><msup><mrow \n> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><mi \n>y<\/mi> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><\/mfenced> <\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n> <mo \nclass=\"MathClass-bin\">+<\/mo> <mn>2<\/mn> <mfenced separators=\"\" \nopen=\"\u27e8\"  close=\"\u27e9\" ><mrow><mi \n>y<\/mi> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><mo \nclass=\"MathClass-punc\">,<\/mo><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><\/mfenced> <mo \nclass=\"MathClass-bin\">+<\/mo><msup><mrow \n> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><\/mfenced> <\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n><\/math>. So<br \/>\nwe have:<br \/>\n<!--tex4ht:inline--><\/p>\n<p><!--l. 106--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" >\n<mtable \nclass=\"eqnarray-star\" columnalign=\"right center left\" >\n<mtr><mtd \nclass=\"eqnarray-1\"> <mfrac><mrow \n><mn>2<\/mn><\/mrow>\n<mrow \n><mi \n>L<\/mi><\/mrow><\/mfrac> <mfenced separators=\"\" \nopen=\"[\"  close=\"]\" ><mrow><mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><mi \n>y<\/mi><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/mrow><\/mfenced><\/mtd><mtd \nclass=\"eqnarray-2\">   <mo \nclass=\"MathClass-rel\">\u2265<\/mo><\/mtd><mtd \nclass=\"eqnarray-3\">   <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><mi \n>y<\/mi> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">+<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><\/mfenced> <mo \nclass=\"MathClass-bin\">\u2212<\/mo><msup><mrow \n><mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><mi \n>y<\/mi> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><\/mfenced> <\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n><\/mtd><mtd \nclass=\"eqnarray-4\"> <mtext class=\"eqnarray\"><\/mtext><\/mtd>\n<\/mtr><mtr><mtd \nclass=\"eqnarray-1\">                <\/mtd><mtd \nclass=\"eqnarray-2\">   <mo \nclass=\"MathClass-rel\">=<\/mo><\/mtd><mtd \nclass=\"eqnarray-3\">   <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><mi \n>y<\/mi> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><\/mfenced> <mo \nclass=\"MathClass-bin\">\u2212<\/mo><msup><mrow \n><mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><mi \n>y<\/mi> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><\/mfenced> <\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n><mo \nclass=\"MathClass-punc\">.<\/mo>        <\/mtd><mtd \nclass=\"eqnarray-4\"> <mtext class=\"eqnarray\"><\/mtext><\/mtd>                 <\/mtr><\/mtable>\n<\/math><br \/>\n<!--l. 109--><\/p>\n<p class=\"nopar\" >\n                                                                 \u25a1\n<\/p>\n<\/p><\/div>\n<p><!--l. 112--><\/p>\n<p class=\"indent\" >   Using this lemma, the proof is quite simple. We apply it with<br \/>\n<!--l. 112--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mi \n>y<\/mi> <mo \nclass=\"MathClass-rel\">=<\/mo> <msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/math>:\n<\/p>\n<div class=\"par-math-display\"><!--l. 114--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n><msup><mrow \n>\n                    <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><\/mfenced><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo><msup><mrow \n><mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n>\n<mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><\/mfenced><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n> <mo \nclass=\"MathClass-rel\">\u2264<\/mo><mo \nclass=\"MathClass-bin\">\u2212<\/mo><mfrac><mrow \n><mn>2<\/mn><\/mrow>\n<mrow \n><mi \n>L<\/mi><\/mrow><\/mfrac> <mfenced separators=\"\" \nopen=\"[\"  close=\"]\" ><mrow><mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/mrow><\/mfenced><mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/mrow><\/math><\/div>\n<p><!--l. 116--><\/p>\n<p class=\"nopar\" >\n<p><!--l. 119--><\/p>\n<p class=\"indent\" >   Now we sum this between <!--l. 119--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mn>0<\/mn><\/math><br \/>\nand <!--l. 119--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mi \n>k<\/mi> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mn>1<\/mn><\/math>.<br \/>\nThe left hand side telescopes:\n<\/p>\n<div class=\"par-math-display\"><!--l. 121--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n><msup><mrow \n>\n              <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><\/mfenced><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo><msup><mrow \n><mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n>\n<mn>0<\/mn><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><\/mfenced><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n> <mo \nclass=\"MathClass-rel\">\u2264<\/mo><mo \nclass=\"MathClass-bin\">\u2212<\/mo><mfrac><mrow \n><mn>2<\/mn><\/mrow>\n<mrow \n><mi \n>L<\/mi><\/mrow><\/mfrac><munderover accentunder=\"false\" accent=\"false\"><mrow  \n><mo mathsize=\"big\" \n>\u2211<\/mo>\n  <\/mrow><mrow \n><mi \n>r<\/mi><mo \nclass=\"MathClass-rel\">=<\/mo><mn>0<\/mn><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">\u2212<\/mo><mn>1<\/mn><\/mrow><\/munderover \n> <mfenced separators=\"\" \nopen=\"[\"  close=\"]\" ><mrow><mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n>\n<mi \n>r<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/mrow><\/mfenced><mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/mrow><\/math><\/div>\n<p><!--l. 123--><\/p>\n<p class=\"nopar\" >\n<p><!--l. 126--><\/p>\n<p class=\"indent\" >   Now we use the fact that gradient <span \nclass=\"ecti-1000\">descent <\/span>is a descent method, which implies that<br \/>\n<!--l. 127--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-rel\">\u2264<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>r<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/math> for all<br \/>\n<!--l. 127--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mi \n>r<\/mi> <mo \nclass=\"MathClass-rel\">\u2264<\/mo> <mi \n>k<\/mi> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mn>1<\/mn><\/math>.<br \/>\nSo:\n<\/p>\n<div class=\"math-display\"><!--l. 129--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n><msup><mrow \n>\n                      <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><\/mfenced><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo><msup><mrow \n><mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n>\n<mn>0<\/mn><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><\/mfenced><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n> <mo \nclass=\"MathClass-rel\">\u2264<\/mo><mo \nclass=\"MathClass-bin\">\u2212<\/mo><mfrac><mrow \n><mn>2<\/mn><mi \n>k<\/mi><\/mrow>\n<mrow \n><mi \n>L<\/mi><\/mrow><\/mfrac>  <mfenced separators=\"\" \nopen=\"[\"  close=\"]\" ><mrow><mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/mrow><\/mfenced><mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/mrow><\/math><\/div>\n<p><!--l. 131--><\/p>\n<p class=\"nopar\" >\n<p><!--l. 134--><\/p>\n<p class=\"indent\" >   Now we just drop the <!--l. 134--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><msup><mrow \n> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><\/mfenced><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n><\/math><br \/>\nterm since it is positive and small. Leaving:\n<\/p>\n<div class=\"math-display\"><!--l. 136--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n>\n                                <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-rel\">\u2264<\/mo> <mfrac><mrow \n><mi \n>L<\/mi><\/mrow> \n<mrow \n><mn>2<\/mn><mi \n>k<\/mi><\/mrow><\/mfrac><msup><mrow \n> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mn>0<\/mn><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><\/mfenced><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n><mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/mrow><\/math><\/div>\n<p><!--l. 138--><\/p>\n<p class=\"nopar\" >\n<p><!--l. 142--><\/p>\n<p class=\"noindent\" >\n<h4 class=\"likesubsectionHead\"><a \n id=\"x1-30001\"><\/a>Comments<\/h4>\n<p><!--l. 144--><\/p>\n<p class=\"noindent\" >As far as I know, this proof is fairly modern <span class=\"cite\">[<a \nhref=\"#Xbeckt-gradient\">1<\/a>]<\/span>. Notice that<br \/>\nunlike the strongly convex case, the quantity we are bounding<br \/>\n<!--l. 146--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/math> does<br \/>\nnot appear on both sides of the bound. Unfortunately, without strong convexity<br \/>\nthere is necessarily a looseness to the bounds, and this takes the form of<br \/>\nbounding function value by distance to solution, with a large wiggle-factor. One<br \/>\nthing that is perhaps a little confusing is the use of distance to solution<br \/>\n<!--l. 150--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mi \n>x<\/mi> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/math>,<br \/>\nwhen it is not unique, as there are potentially multiple minimizers for<\/p>\n<p>non-strongly convex problems. The bound in fact holds for any chosen minimizer<br \/>\n<!--l. 153--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/math>. I<br \/>\nfound this to be a little confusing at \ufb01rst.\n<\/p>\n<p><!--l. 157--><\/p>\n<p class=\"noindent\" >\n<h3 class=\"sectionHead\"><span class=\"titlemark\">2   <\/span> <a \n id=\"x1-40002\"><\/a>Older Style Proof<\/h3>\n<p><!--l. 159--><\/p>\n<p class=\"noindent\" >This proof is from Nesterov <span class=\"cite\">[<a \nhref=\"#Xnes-book\">2<\/a>]<\/span>. I&#x2019;m not sure of the original source for it.\n<\/p>\n<p><!--l. 162--><\/p>\n<p class=\"indent\" >   We start with the function value descent equation, using<br \/>\n<!--l. 162--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mi \n>w<\/mi> <mo \nclass=\"MathClass-punc\">:<\/mo><mo \nclass=\"MathClass-rel\">=<\/mo> <mfenced separators=\"\" \nopen=\"[\"  close=\"]\" ><mrow><mi \n>\u03b1<\/mi> <mfenced separators=\"\" \nopen=\"(\"  close=\")\" ><mrow><mn>1<\/mn> <mo \nclass=\"MathClass-bin\">\u2212<\/mo><mfrac><mrow \n><mn>1<\/mn><\/mrow> \n<mrow \n><mn>2<\/mn><\/mrow><\/mfrac><mi \n>\u03b1<\/mi><mi \n>L<\/mi><\/mrow><\/mfenced><\/mrow><\/mfenced> <mo \nclass=\"MathClass-punc\">:<\/mo><\/math>\n<\/p>\n<div class=\"par-math-display\"><!--l. 164--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n>\n                                <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-rel\">\u2264<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>w<\/mi><msup><mrow \n> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msup><mrow \n><mi \n>f<\/mi><\/mrow><mrow \n><mi \n>\u2032<\/mi><\/mrow><\/msup \n><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n>\n<mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/mrow><\/mfenced><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n><mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/mrow><\/math><\/div>\n<p><!--l. 166--><\/p>\n<p class=\"nopar\" > We introduce the simpli\ufb01ed notation <!--l. 167--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-rel\">=<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/math><br \/>\nso that we have <\/p>\n<table class=\"equation\">\n<tr>\n<td> <a \n id=\"x1-4001r4\"><\/a><br \/>\n<!--l. 169--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" class=\"equation\">\n                      <msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n> <mo \nclass=\"MathClass-rel\">\u2264<\/mo> <msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>w<\/mi><msup><mrow \n> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msup><mrow \n><mi \n>f<\/mi><\/mrow><mrow \n><mi \n>\u2032<\/mi><\/mrow><\/msup \n><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n>\n<mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/mrow><\/mfenced><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n><mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/math><\/td>\n<td class=\"eq-no\">(4)<\/td>\n<\/tr>\n<\/table>\n<p><!--l. 172--><\/p>\n<p class=\"indent\" >   Now using the convexity lower bound around<br \/>\n<!--l. 172--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/math> evaluated<br \/>\nat <!--l. 173--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/math>,<br \/>\nnamely:\n<\/p>\n<div class=\"math-display\"><!--l. 174--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n>\n                                     <msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-rel\">\u2264<\/mo> <mfenced separators=\"\" \nopen=\"\u27e8\"  close=\"\u27e9\" ><mrow><msup><mrow \n><mi \n>f<\/mi><\/mrow><mrow \n><mi \n>\u2032<\/mi><\/mrow><\/msup \n><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n>\n<mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><mo \nclass=\"MathClass-punc\">,<\/mo><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><\/mfenced><mo \nclass=\"MathClass-punc\">,<\/mo>\n<\/mrow><\/math><\/div>\n<p><!--l. 176--><\/p>\n<p class=\"nopar\" > and applying Cauchy-Schwarz (note the spelling! there is no \u201ct\u201d in Schwarz) to<br \/>\nit:<br \/>\n<!--tex4ht:inline--><\/p>\n<p><!--l. 179--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" >\n<mtable \nclass=\"eqnarray-star\" columnalign=\"right center left\" >\n<mtr><mtd \nclass=\"eqnarray-1\"> <msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mtd><mtd \nclass=\"eqnarray-2\">   <mo \nclass=\"MathClass-rel\">\u2264<\/mo><\/mtd><mtd \nclass=\"eqnarray-3\">   <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><\/mfenced> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msup><mrow \n><mi \n>f<\/mi><\/mrow><mrow \n><mi \n>\u2032<\/mi><\/mrow><\/msup \n><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n>\n<mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/mrow><\/mfenced> <\/mtd><mtd \nclass=\"eqnarray-4\"> <mtext class=\"eqnarray\"><\/mtext><\/mtd>\n<\/mtr><mtr><mtd \nclass=\"eqnarray-1\">   <\/mtd><mtd \nclass=\"eqnarray-2\">   <mo \nclass=\"MathClass-rel\">\u2264<\/mo><\/mtd><mtd \nclass=\"eqnarray-3\">   <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mn>0<\/mn><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><\/mfenced> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msup><mrow \n><mi \n>f<\/mi><\/mrow><mrow \n><mi \n>\u2032<\/mi><\/mrow><\/msup \n><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n>\n<mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/mrow><\/mfenced><mo \nclass=\"MathClass-punc\">.<\/mo><\/mtd><mtd \nclass=\"eqnarray-4\"> <mtext class=\"eqnarray\"><\/mtext><\/mtd>                                          <\/mtr><\/mtable>\n<\/math><br \/>\n<!--l. 182--><\/p>\n<p class=\"nopar\" >\n<p><!--l. 185--><\/p>\n<p class=\"indent\" >   The last line is because gradient descent method descends in<br \/>\niterate distance each step. We now introduce the additional notation<br \/>\n<!--l. 186--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><msub><mrow \n><mi \n>r<\/mi><\/mrow><mrow \n><mn>0<\/mn><\/mrow><\/msub \n> <mo \nclass=\"MathClass-rel\">=<\/mo> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mn>0<\/mn><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><\/mfenced><\/math>.<br \/>\nUsing this notation and rearranging gives:\n<\/p>\n<div class=\"math-display\"><!--l. 188--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n>\n                                      <mo \nclass=\"MathClass-bin\">\u2212<\/mo><mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msup><mrow \n><mi \n>f<\/mi><\/mrow><mrow \n><mi \n>\u2032<\/mi><\/mrow><\/msup \n><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n>\n<mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/mrow><\/mfenced> <mo \nclass=\"MathClass-rel\">\u2264<\/mo><mo \nclass=\"MathClass-bin\">\u2212<\/mo><msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><mo \nclass=\"MathClass-bin\">\u2215<\/mo><msub><mrow \n><mi \n>r<\/mi><\/mrow><mrow \n><mn>0<\/mn><\/mrow><\/msub \n><mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/mrow><\/math><\/div>\n<p><!--l. 190--><\/p>\n<p class=\"nopar\" > We plug this into the function descent equation (Eq <a \nhref=\"#x1-4001r4\">4<!--tex4ht:ref: eq:delta-func-descent --><\/a>) above to get:\n<\/p>\n<div class=\"math-display\"><!--l. 193--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n>\n                                     <msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n> <mo \nclass=\"MathClass-rel\">\u2264<\/mo> <msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mfrac><mrow \n><mi \n>w<\/mi><\/mrow> \n<mrow \n><msubsup><mrow \n><mi \n>r<\/mi><\/mrow><mrow \n><mn>0<\/mn><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msubsup \n><\/mrow><\/mfrac><msubsup><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msubsup \n><mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/mrow><\/math><\/div>\n<p><!--l. 195--><\/p>\n<p class=\"nopar\" >\n<p><!--l. 198--><\/p>\n<p class=\"indent\" >   We now divide this through by <!--l. 198--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/math>:\n<\/p>\n<div class=\"math-display\"><!--l. 199--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n>\n                                     <mn>1<\/mn> <mo \nclass=\"MathClass-rel\">\u2264<\/mo> <mfrac><mrow \n><msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow> \n<mrow \n><msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><\/mfrac> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mfrac><mrow \n><mi \n>w<\/mi><\/mrow> \n<mrow \n><msubsup><mrow \n><mi \n>r<\/mi><\/mrow><mrow \n><mn>0<\/mn><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msubsup \n><\/mrow><\/mfrac>  <mfrac><mrow \n><msubsup><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msubsup \n><\/mrow> \n<mrow \n><msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><\/mfrac>\n<\/mrow><\/math><\/div>\n<p><!--l. 201--><\/p>\n<p class=\"nopar\" >\n<p><!--l. 204--><\/p>\n<p class=\"indent\" >   Then divide through by <!--l. 204--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/math><br \/>\nalso:\n<\/p>\n<div class=\"par-math-display\"><!--l. 206--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n>\n                                     <mfrac><mrow \n><mn>1<\/mn><\/mrow> \n<mrow \n><msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><\/mfrac> <mo \nclass=\"MathClass-rel\">\u2264<\/mo>  <mfrac><mrow \n><mn>1<\/mn><\/mrow> \n<mrow \n><msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><\/mfrac> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mfrac><mrow \n><mi \n>w<\/mi><\/mrow> \n<mrow \n><msubsup><mrow \n><mi \n>r<\/mi><\/mrow><mrow \n><mn>0<\/mn><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msubsup \n><\/mrow><\/mfrac>   <mfrac><mrow \n><msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow> \n<mrow \n><msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><\/mfrac><mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/mrow><\/math><\/div>\n<p><!--l. 208--><\/p>\n<p class=\"nopar\" >\n<p><!--l. 211--><\/p>\n<p class=\"indent\" >   Now we use the fact that gradient descent is a descent method again, which implies<br \/>\nthat <!--l. 212--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" >  <mfrac><mrow \n><msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow>\n<mrow \n><msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><\/mfrac>  <mo \nclass=\"MathClass-rel\">\u2264<\/mo> <mn>1<\/mn><mo \nclass=\"MathClass-punc\">,<\/mo><\/math><br \/>\nso:\n<\/p>\n<div class=\"par-math-display\"><!--l. 214--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n>\n                                        <mfrac><mrow \n><mn>1<\/mn><\/mrow> \n<mrow \n><msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><\/mfrac> <mo \nclass=\"MathClass-rel\">\u2264<\/mo>  <mfrac><mrow \n><mn>1<\/mn><\/mrow> \n<mrow \n><msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><\/mfrac> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mfrac><mrow \n><mi \n>w<\/mi><\/mrow> \n<mrow \n><msubsup><mrow \n><mi \n>r<\/mi><\/mrow><mrow \n><mn>0<\/mn><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msubsup \n><\/mrow><\/mfrac><mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/mrow><\/math><\/div>\n<p><!--l. 216--><\/p>\n<p class=\"nopar\" >\n<div class=\"math-display\"><!--l. 217--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n>\n                                <mo \nclass=\"MathClass-rel\">\u2234<\/mo>  <mfrac><mrow \n><mn>1<\/mn><\/mrow> \n<mrow \n><msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><\/mfrac> <mo \nclass=\"MathClass-rel\">\u2265<\/mo> <mfrac><mrow \n><mn>1<\/mn><\/mrow> \n<mrow \n><msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><\/mfrac> <mo \nclass=\"MathClass-bin\">+<\/mo>  <mfrac><mrow \n><mi \n>w<\/mi><\/mrow> \n<mrow \n><msubsup><mrow \n><mi \n>r<\/mi><\/mrow><mrow \n><mn>0<\/mn><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msubsup \n><\/mrow><\/mfrac><mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/mrow><\/math><\/div>\n<p><!--l. 219--><\/p>\n<p class=\"nopar\" >\n<p><!--l. 222--><\/p>\n<p class=\"indent\" >   We then chain this inequality for each<br \/>\n<!--l. 222--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mi \n>k<\/mi><\/math>:\n<\/p>\n<div class=\"math-display\"><!--l. 223--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n>\n                <mfrac><mrow \n><mn>1<\/mn><\/mrow> \n<mrow \n><msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><\/mfrac> <mo \nclass=\"MathClass-rel\">\u2265<\/mo> <mfrac><mrow \n><mn>1<\/mn><\/mrow> \n<mrow \n><msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><\/mfrac> <mo \nclass=\"MathClass-bin\">+<\/mo>  <mfrac><mrow \n><mi \n>w<\/mi><\/mrow> \n<mrow \n><msubsup><mrow \n><mi \n>r<\/mi><\/mrow><mrow \n><mn>0<\/mn><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msubsup \n><\/mrow><\/mfrac> <mo \nclass=\"MathClass-rel\">\u2265<\/mo>  <mfrac><mrow \n><mn>1<\/mn><\/mrow> \n<mrow \n><msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">\u2212<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><\/mfrac> <mo \nclass=\"MathClass-bin\">+<\/mo> <mn>2<\/mn> <mfrac><mrow \n><mi \n>w<\/mi><\/mrow>\n<mrow \n><msubsup><mrow \n><mi \n>r<\/mi><\/mrow><mrow \n><mn>0<\/mn><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msubsup \n><\/mrow><\/mfrac> <mo \nclass=\"MathClass-rel\">\u2265<\/mo><mo \nclass=\"MathClass-rel\">\u22ef<\/mo> <mo \nclass=\"MathClass-rel\">\u2265<\/mo> <mfrac><mrow \n><mn>1<\/mn><\/mrow> \n<mrow \n><msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mn>0<\/mn><\/mrow><\/msub \n><\/mrow><\/mfrac> <mo \nclass=\"MathClass-bin\">+<\/mo>  <mfrac><mrow \n><mi \n>w<\/mi><\/mrow> \n<mrow \n><msubsup><mrow \n><mi \n>r<\/mi><\/mrow><mrow \n><mn>0<\/mn><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msubsup \n><\/mrow><\/mfrac><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><mi \n>k<\/mi> <mo \nclass=\"MathClass-bin\">+<\/mo> <mn>1<\/mn><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow>\n<\/mrow><\/math><\/div>\n<p><!--l. 225--><\/p>\n<p class=\"nopar\" >\n<div class=\"math-display\"><!--l. 226--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n>\n                                 <mo \nclass=\"MathClass-rel\">\u2234<\/mo>  <mfrac><mrow \n><mn>1<\/mn><\/mrow> \n<mrow \n><msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><\/mfrac> <mo \nclass=\"MathClass-rel\">\u2265<\/mo> <mfrac><mrow \n><mn>1<\/mn><\/mrow> \n<mrow \n><msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mn>0<\/mn><\/mrow><\/msub \n><\/mrow><\/mfrac> <mo \nclass=\"MathClass-bin\">+<\/mo>  <mfrac><mrow \n><mi \n>w<\/mi><\/mrow> \n<mrow \n><msubsup><mrow \n><mi \n>r<\/mi><\/mrow><mrow \n><mn>0<\/mn><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msubsup \n><\/mrow><\/mfrac><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><mi \n>k<\/mi> <mo \nclass=\"MathClass-bin\">+<\/mo> <mn>1<\/mn><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/mrow><\/math><\/div>\n<p><!--l. 228--><\/p>\n<p class=\"nopar\" >\n<p><!--l. 231--><\/p>\n<p class=\"indent\" >   To get the \ufb01nal convergence rate we invert both sides:\n<\/p>\n<div class=\"math-display\"><!--l. 232--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n>\n                    <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-rel\">\u2264<\/mo>  <mfrac><mrow \n><mfenced separators=\"\" \nopen=\"[\"  close=\"]\" ><mrow><mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mn>0<\/mn><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/mrow><\/mfenced><msup><mrow \n> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n>\n<mn>0<\/mn><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><\/mfenced><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n><\/mrow> \n<mrow \n><msup><mrow \n> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mn>0<\/mn><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><\/mfenced><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n> <mo \nclass=\"MathClass-bin\">+<\/mo> <mi \n>w<\/mi> <mfenced separators=\"\" \nopen=\"[\"  close=\"]\" ><mrow><mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mn>0<\/mn><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/mrow><\/mfenced><mi \n>k<\/mi><\/mrow><\/mfrac><mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/mrow><\/math><\/div>\n<p><!--l. 234--><\/p>\n<p class=\"nopar\" >\n<p><!--l. 237--><\/p>\n<p class=\"indent\" >   This is quite a complex expression. To simplify even further, we can get rid of the<br \/>\n<!--l. 238--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mn>0<\/mn><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/math><br \/>\nterms on the right hand side using the Lipschitz upper bound about<br \/>\n<!--l. 239--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/math>:\n<\/p>\n<div class=\"math-display\"><!--l. 240--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n>\n                                 <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mn>0<\/mn><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-rel\">\u2264<\/mo> <mfrac><mrow \n><mi \n>L<\/mi><\/mrow> \n<mrow \n><mn>2<\/mn><\/mrow><\/mfrac><msup><mrow \n> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><mi \n>x<\/mi> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><\/mfenced><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n><mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/mrow><\/math><\/div>\n<p><!--l. 242--><\/p>\n<p class=\"nopar\" >\n<p><!--l. 245--><\/p>\n<p class=\"indent\" >   Plugging in the step size <!--l. 245--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mi \n>\u03b1<\/mi> <mo \nclass=\"MathClass-rel\">=<\/mo> <mfrac><mrow \n><mn>1<\/mn><\/mrow> \n<mrow \n><mi \n>L<\/mi><\/mrow><\/mfrac><\/math><br \/>\ngives <!--l. 245--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mi \n>w<\/mi> <mo \nclass=\"MathClass-rel\">=<\/mo>  <mfrac><mrow \n><mn>1<\/mn><\/mrow> \n<mrow \n><mn>2<\/mn><mi \n>L<\/mi><\/mrow><\/mfrac><\/math>,<br \/>\nyielding the following simpler convergence rate:\n<\/p>\n<div class=\"math-display\"><!--l. 247--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n>\n                                <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-rel\">\u2264<\/mo> <mfrac><mrow \n><mn>2<\/mn><mi \n>L<\/mi><msup><mrow \n> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mn>0<\/mn><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><\/mfenced><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n><\/mrow> \n      <mrow \n><mi \n>k<\/mi> <mo \nclass=\"MathClass-bin\">+<\/mo> <mn>4<\/mn><\/mrow><\/mfrac>     <mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/mrow><\/math><\/div>\n<p><!--l. 249--><\/p>\n<p class=\"nopar\" >\n<p><!--l. 252--><\/p>\n<p class=\"indent\" >   Compared to the rate from the previous proof,<br \/>\n<!--l. 252--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-rel\">\u2264<\/mo> <mfrac><mrow \n><mi \n>L<\/mi><\/mrow> \n<mrow \n><mn>2<\/mn><mi \n>k<\/mi><\/mrow><\/mfrac><msup><mrow \n> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mn>0<\/mn><\/mrow><\/msup \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><\/mfenced><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n><\/math>, this is slightly<br \/>\nbetter at <!--l. 253--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mi \n>k<\/mi> <mo \nclass=\"MathClass-rel\">=<\/mo> <mn>1<\/mn><\/math>,<br \/>\nand worse thereafter.\n<\/p>\n<p><!--l. 256--><\/p>\n<p class=\"noindent\" >\n<h4 class=\"likesubsectionHead\"><a \n id=\"x1-50002\"><\/a>Comments<\/h4>\n<p><!--l. 258--><\/p>\n<p class=\"noindent\" >I don&#x2019;t like this proof. It&#x2019;s feels like a random sequence of steps when<br \/>\nyou \ufb01rst look at it. The way the proof uses inverse quantities like<br \/>\n<!--l. 260--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" > <mfrac><mrow \n><mn>1<\/mn><\/mrow>\n<mrow \n><msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><\/mfrac><\/math><br \/>\nis also confusing. The key equation is really the direct bound on<br \/>\n<!--l. 261--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mi \n>\u0394<\/mi><\/math>:\n<\/p>\n<div class=\"math-display\"><!--l. 262--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n>\n                                     <msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n> <mo \nclass=\"MathClass-rel\">\u2264<\/mo> <msub><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mfrac><mrow \n><mi \n>w<\/mi><\/mrow> \n<mrow \n><msubsup><mrow \n><mi \n>r<\/mi><\/mrow><mrow \n><mn>0<\/mn><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msubsup \n><\/mrow><\/mfrac><msubsup><mrow \n><mi \n>\u0394<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msubsup \n><mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/mrow><\/math><\/div>\n<p><!--l. 264--><\/p>\n<p class=\"nopar\" >\n<p><!--l. 267--><\/p>\n<p class=\"indent\" >   Often this is the kind of equation you encounter when proving the properties of<br \/>\ndual methods for example. Equations of this kind can be encountered when applying<\/p>\n<p>proximal methods to non-di\ufb00erentiable functions also. It is also quite a clear<br \/>\nstatement about what is going on in terms of per-step convergence, a property that is<br \/>\nless clear in the previous proof.\n<\/p>\n<p><!--l. 275--><\/p>\n<p class=\"noindent\" >\n<h3 class=\"sectionHead\"><span class=\"titlemark\">3   <\/span> <a \n id=\"x1-60003\"><\/a>Gradient Concentration<\/h3>\n<p><!--l. 277--><\/p>\n<p class=\"noindent\" >When we don&#x2019;t even have convexity, just Lipschitz smoothness, we can still prove<br \/>\nsomething about convergence of the gradient norm. The Lipschitz upper bound holds<br \/>\nwithout the requirement of convexity:\n<\/p>\n<div class=\"math-display\"><!--l. 280--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n>\n                          <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><mi \n>y<\/mi><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-rel\">\u2264<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><mi \n>x<\/mi><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">+<\/mo> <mfenced separators=\"\" \nopen=\"\u27e8\"  close=\"\u27e9\" ><mrow><msup><mrow \n><mi \n>f<\/mi><\/mrow><mrow \n><mi \n>\u2032<\/mi><\/mrow><\/msup \n><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><mi \n>x<\/mi><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><mo \nclass=\"MathClass-punc\">,<\/mo><mi \n>y<\/mi> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>x<\/mi><\/mrow><\/mfenced> <mo \nclass=\"MathClass-bin\">+<\/mo> <mfrac><mrow \n><mi \n>L<\/mi><\/mrow> \n<mrow \n><mn>2<\/mn><\/mrow><\/mfrac><msup><mrow \n> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><mi \n>x<\/mi> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>y<\/mi><\/mrow><\/mfenced><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n><mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/mrow><\/math><\/div>\n<p><!--l. 282--><\/p>\n<p class=\"nopar\" >\n<p><!--l. 285--><\/p>\n<p class=\"indent\" >   Recall that from minimizing this bound with respect to<br \/>\n<!--l. 285--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mi \n>y<\/mi><\/math> we<br \/>\ncan prove the equation:\n<\/p>\n<div class=\"math-display\"><!--l. 287--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n>\n                               <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-rel\">\u2265<\/mo> <mfrac><mrow \n><mn>1<\/mn><\/mrow> \n<mrow \n><mn>2<\/mn><mi \n>L<\/mi><\/mrow><\/mfrac><msup><mrow \n> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msup><mrow \n><mi \n>f<\/mi><\/mrow><mrow \n><mi \n>\u2032<\/mi><\/mrow><\/msup \n><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n>\n<mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/mrow><\/mfenced><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n><mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/mrow><\/math><\/div>\n<p><!--l. 289--><\/p>\n<p class=\"nopar\" >\n<p><!--l. 292--><\/p>\n<p class=\"indent\" >   Examine this equation carefully. We have a bound on each gradient encountered<br \/>\nduring the optimization in terms of the di\ufb00erence in function values between steps.<br \/>\nThe sequence of function values is bounded below, so in fact we have a hard bound<br \/>\non the sum of the encountered gradient norms. E\ufb00ectively, we chain (telescope) the<br \/>\nabove inequality over steps:\n<\/p>\n<div class=\"math-display\"><!--l. 298--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n>\n       <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">\u2212<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">+<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-rel\">\u2265<\/mo> <mfrac><mrow \n><mn>1<\/mn><\/mrow> \n<mrow \n><mn>2<\/mn><mi \n>L<\/mi><\/mrow><\/mfrac><msup><mrow \n> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msup><mrow \n><mi \n>f<\/mi><\/mrow><mrow \n><mi \n>\u2032<\/mi><\/mrow><\/msup \n><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n>\n<mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/mrow><\/mfenced><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n> <mo \nclass=\"MathClass-bin\">+<\/mo>  <mfrac><mrow \n><mn>1<\/mn><\/mrow> \n<mrow \n><mn>2<\/mn><mi \n>L<\/mi><\/mrow><\/mfrac><msup><mrow \n> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msup><mrow \n><mi \n>f<\/mi><\/mrow><mrow \n><mi \n>\u2032<\/mi><\/mrow><\/msup \n><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n>\n<mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">\u2212<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/mrow><\/mfenced><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n><mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/mrow><\/math><\/div>\n<p><!--l. 300--><\/p>\n<p class=\"nopar\" >\n<div class=\"math-display\"><!--l. 301--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n>\n                                           <mo \nclass=\"MathClass-punc\">.<\/mo><mo \nclass=\"MathClass-punc\">.<\/mo><mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/mrow><\/math><\/div>\n<p><!--l. 303--><\/p>\n<p class=\"nopar\" >\n<div class=\"math-display\"><!--l. 304--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n>\n                       <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mn>0<\/mn><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-rel\">\u2265<\/mo> <mfrac><mrow \n><mn>1<\/mn><\/mrow> \n<mrow \n><mn>2<\/mn><mi \n>L<\/mi><\/mrow><\/mfrac><munderover accentunder=\"false\" accent=\"false\"><mrow  \n><mo mathsize=\"big\" \n>\u2211<\/mo>\n  <\/mrow><mrow \n><mi \n>i<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/munderover \n><msup><mrow \n> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msup><mrow \n><mi \n>f<\/mi><\/mrow><mrow \n><mi \n>\u2032<\/mi><\/mrow><\/msup \n><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n>\n<mi \n>i<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/mrow><\/mfenced><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n><mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/mrow><\/math><\/div>\n<p><!--l. 306--><\/p>\n<p class=\"nopar\" >\n<p><!--l. 309--><\/p>\n<p class=\"indent\" >   Now since <!--l. 309--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><mo \nclass=\"MathClass-bin\">+<\/mo><mn>1<\/mn><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-rel\">\u2265<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/math>:\n<\/p>\n<div class=\"math-display\"><!--l. 310--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n>\n                             <munderover accentunder=\"false\" accent=\"false\"><mrow  \n><mo mathsize=\"big\" \n>\u2211<\/mo>\n  <\/mrow><mrow \n><mi \n>i<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/munderover \n><msup><mrow \n> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msup><mrow \n><mi \n>f<\/mi><\/mrow><mrow \n><mi \n>\u2032<\/mi><\/mrow><\/msup \n><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n>\n<mi \n>i<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/mrow><\/mfenced><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n> <mo \nclass=\"MathClass-rel\">\u2264<\/mo> <mn>2<\/mn><mi \n>L<\/mi> <mfenced separators=\"\" \nopen=\"(\"  close=\")\" ><mrow><mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n>\n<mn>0<\/mn><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/mrow><\/mfenced><mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/mrow><\/math><\/div>\n<p><!--l. 312--><\/p>\n<p class=\"nopar\" >\n<p><!--l. 315--><\/p>\n<p class=\"indent\" >   Now to make this bound a little more concrete, we can put it in terms of the gradient<br \/>\n<!--l. 316--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><msub><mrow \n><mi \n>g<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/math><br \/>\nwith the smallest norm seen during the minimization<br \/>\n(<!--l. 317--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" > <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msub><mrow \n><mi \n>g<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><\/mfenced> <mo \nclass=\"MathClass-rel\">\u2264<\/mo> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msub><mrow \n><mi \n>g<\/mi><\/mrow><mrow \n><mi \n>i<\/mi><\/mrow><\/msub \n><\/mrow><\/mfenced><\/math> for all<br \/>\n<!--l. 318--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mi \n>i<\/mi><\/math>), so<br \/>\nthat <!--l. 318--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><msubsup><mrow \n><mo \nclass=\"MathClass-op\"> \u2211<\/mo>\n  <!--nolimits--><\/mrow><mrow \n><mi \n>i<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msubsup \n><msup><mrow \n> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msup><mrow \n><mi \n>f<\/mi><\/mrow><mrow \n><mi \n>\u2032<\/mi><\/mrow><\/msup \n><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mi \n>i<\/mi><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/mrow><\/mfenced><\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n> <mo \nclass=\"MathClass-rel\">\u2265<\/mo> <mi \n>k<\/mi><msup><mrow \n> <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msub><mrow \n><mi \n>g<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><\/mfenced> <\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n><\/math>,<br \/>\nso:\n<\/p>\n<div class=\"math-display\"><!--l. 320--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"block\" ><mrow \n><msup><mrow \n>\n                                   <mfenced separators=\"\" \nopen=\"\u2225\"  close=\"\u2225\" ><mrow><msub><mrow \n><mi \n>g<\/mi><\/mrow><mrow \n><mi \n>k<\/mi><\/mrow><\/msub \n><\/mrow><\/mfenced> <\/mrow><mrow \n><mn>2<\/mn><\/mrow><\/msup \n> <mo \nclass=\"MathClass-rel\">\u2264<\/mo> <mfrac><mrow \n><mn>2<\/mn><mi \n>L<\/mi><\/mrow> \n <mrow \n><mi \n>k<\/mi><\/mrow><\/mfrac>  <mfenced separators=\"\" \nopen=\"(\"  close=\")\" ><mrow><mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msub><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mn>0<\/mn><\/mrow><\/msub \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow> <mo \nclass=\"MathClass-bin\">\u2212<\/mo> <mi \n>f<\/mi><mrow ><mo \nclass=\"MathClass-open\">(<\/mo><mrow><msup><mrow \n><mi \n>x<\/mi><\/mrow><mrow \n><mo \nclass=\"MathClass-bin\">\u2217<\/mo><\/mrow><\/msup \n><\/mrow><mo \nclass=\"MathClass-close\">)<\/mo><\/mrow><\/mrow><\/mfenced><mo \nclass=\"MathClass-punc\">.<\/mo>\n<\/mrow><\/math><\/div>\n<p><!--l. 322--><\/p>\n<p class=\"nopar\" >\n<p><!--l. 326--><\/p>\n<p class=\"noindent\" >\n<h4 class=\"likesubsectionHead\"><a \n id=\"x1-70003\"><\/a>Comments<\/h4>\n<p><!--l. 328--><\/p>\n<p class=\"noindent\" >Notice that the core technique used in this proof is the same as the last 2 proofs.<br \/>\nWe have a single step inequality bounding one of the quantities we care<br \/>\nabout. By summing that inequality over each step of minimization, one side<br \/>\nof the inequality telescopes. We get an equality saying the the sum of the<br \/>\n<!--l. 332--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mi \n>k<\/mi><\/math> versions<br \/>\nof that quantity (one from each step) is less then some \ufb01xed constant independent of<br \/>\n<!--l. 334--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mi \n>k<\/mi><\/math>, for any<br \/>\n<!--l. 334--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mi \n>k<\/mi><\/math>. The convergence rate is<br \/>\nthus of the form <!--l. 334--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mn>1<\/mn><mo \nclass=\"MathClass-bin\">\u2215<\/mo><mi \n>k<\/mi><\/math>, because<br \/>\nthe summation of the <!--l. 335--><math \n xmlns=\"http:\/\/www.w3.org\/1998\/Math\/MathML\"  \ndisplay=\"inline\" ><mi \n>k<\/mi><\/math><br \/>\nquantities \ufb01ts in a \ufb01xed bound.\n<\/p>\n<p><!--l. 337--><\/p>\n<p class=\"indent\" >   Almost any proof for an optimization method that applies in the non-convex case<br \/>\nuses a similar proof technique. There is just not that many assumptions to work<br \/>\nwith, so the options are limited.\n<\/p>\n<p><!--l. 1--><\/p>\n<p class=\"noindent\" >\n<h3 class=\"likesectionHead\"><a \n id=\"x1-80003\"><\/a>References<\/h3>\n<p><!--l. 1--><\/p>\n<p class=\"noindent\" >\n<div class=\"thebibliography\">\n<p class=\"bibitem\" ><span class=\"biblabel\"><br \/>\n [1]<span class=\"bibsp\">\u00a0\u00a0\u00a0<\/span><\/span><a \n id=\"Xbeckt-gradient\"><\/a>Amir  Beck  and  Marc  Teboulle.     Gradient-based  algorithms  with<br \/>\n   applications to signal recovery problems.  <span \nclass=\"ecti-1000\">Convex Optimization in Signal<\/span><br \/>\n   <span \nclass=\"ecti-1000\">Processing and Communications<\/span>, 2009.<\/p>\n<p class=\"bibitem\" ><span class=\"biblabel\"><br \/>\n [2]<span class=\"bibsp\">\u00a0\u00a0\u00a0<\/span><\/span><a \n id=\"Xnes-book\"><\/a>Yu. Nesterov. <span \nclass=\"ecti-1000\">Introductory Lectures On Convex Programming<\/span>. Springer,<br \/>\n   1998.\n<\/p>\n<\/p><\/div>\n","protected":false},"excerpt":{"rendered":"<p>The previous post detailed a bunch of different ways of proving the convergence rate of gradient descent: xk+1 = xk \u2212 \u03b1f\u2032(x k), for strongly convex problems. This post considers the non-strongly convex, but still convex case. Rehash of Basic Lemmas These hold for any x and y. L the Lipschitz smoothness constant. These are &hellip; <a href=\"https:\/\/www.aarondefazio.com\/tangentially\/?p=35\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">The Many Ways to Analyse Gradient Descent: Part 2<\/span> <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"_links":{"self":[{"href":"https:\/\/www.aarondefazio.com\/tangentially\/index.php?rest_route=\/wp\/v2\/posts\/35"}],"collection":[{"href":"https:\/\/www.aarondefazio.com\/tangentially\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.aarondefazio.com\/tangentially\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.aarondefazio.com\/tangentially\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.aarondefazio.com\/tangentially\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=35"}],"version-history":[{"count":2,"href":"https:\/\/www.aarondefazio.com\/tangentially\/index.php?rest_route=\/wp\/v2\/posts\/35\/revisions"}],"predecessor-version":[{"id":37,"href":"https:\/\/www.aarondefazio.com\/tangentially\/index.php?rest_route=\/wp\/v2\/posts\/35\/revisions\/37"}],"wp:attachment":[{"href":"https:\/\/www.aarondefazio.com\/tangentially\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=35"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.aarondefazio.com\/tangentially\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=35"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.aarondefazio.com\/tangentially\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=35"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}