LO 4.2.2.G

Learning Objective: Describe the advantage of Lasso over Ridge regression.

Review:

<math xmlns="http://www.w3.org/1998/Math/MathML"><mtext mathvariant="italic">RIDGE&#xA0;Loss&#xA0;Function&#xA0;=&#xA0;</mtext><mi>R</mi><mi>S</mi><mi>S</mi><mo>+</mo><mi>&#x3BB;</mi><mstyle displaystyle="false"><mstyle displaystyle="true"><munderover><mo>&#x2211;</mo><mrow><mi>j</mi><mo>=</mo><mn>1</mn></mrow><mi>p</mi></munderover></mstyle><msubsup><mi>&#x3B2;</mi><mi>j</mi><mn>2</mn></msubsup></mstyle></math>

<math xmlns="http://www.w3.org/1998/Math/MathML"><mtext mathvariant="italic">LASSO&#xA0;Loss&#xA0;Function&#xA0;=&#xA0;</mtext><mi>R</mi><mi>S</mi><mi>S</mi><mo>+</mo><mi>&#x3BB;</mi><mstyle displaystyle="false"><munderover><mo>&#x2211;</mo><mrow><mi>j</mi><mo>=</mo><mn>1</mn></mrow><mi>p</mi></munderover><mfenced open="|" close="|"><msubsup><mi>&#x3B2;</mi><mi>j</mi><mrow/></msubsup></mfenced></mstyle></math>

<math xmlns="http://www.w3.org/1998/Math/MathML"><mi>&#x3BB;</mi><mstyle displaystyle="false"><munderover><mo>&#x2211;</mo><mrow><mi>j</mi><mo>=</mo><mn>1</mn></mrow><mi>p</mi></munderover><mfenced open="|" close="|"><msubsup><mi>&#x3B2;</mi><mi>j</mi><mrow/></msubsup></mfenced></mstyle></math>

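This penalty on the absolute values of the coefficients forces some of the coefficient estimates to be exactly zero when the tuning parameter λ is sufficiently large, so Lasso performs variable selection and yields a sparser, more interpretable model. The Ridge penalty shrinks the coefficients toward zero but does not set any of them exactly to zero.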
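To make the zeroing effect concrete, here is a minimal sketch in Python for the special case where the predictors are orthonormal and there is no intercept (assumptions made only for this illustration). In that case both loss functions above can be minimized one coefficient at a time: Ridge rescales each least-squares estimate by 1/(1 + λ), while Lasso soft-thresholds it at λ/2. The function names and the example estimates are hypothetical.

```python
def ridge_shrink(beta_ols, lam):
    """Ridge estimates for orthonormal predictors: proportional shrinkage."""
    return [b / (1.0 + lam) for b in beta_ols]


def lasso_shrink(beta_ols, lam):
    """Lasso estimates for orthonormal predictors: soft-thresholding at lam / 2."""
    shrunk = []
    for b in beta_ols:
        magnitude = max(abs(b) - lam / 2.0, 0.0)
        if magnitude == 0.0:
            shrunk.append(0.0)  # coefficient is set exactly to zero
        else:
            shrunk.append(magnitude if b > 0 else -magnitude)
    return shrunk


# Hypothetical least-squares estimates, chosen only for illustration.
beta_ols = [3.0, -1.5, 0.4, -0.2]

for lam in (0.0, 0.5, 1.0, 4.0):
    print("lambda =", lam)
    print("  ridge:", ridge_shrink(beta_ols, lam))
    print("  lasso:", lasso_shrink(beta_ols, lam))
```

With λ = 1, the Lasso estimates in this sketch are [2.5, -1.0, 0.0, 0.0]: the two smallest coefficients are dropped from the model. The Ridge estimates are all halved but remain nonzero.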

EXTRA

<math xmlns="http://www.w3.org/1998/Math/MathML"><mtext mathvariant="italic">LASSO&#xA0;Loss&#xA0;Function&#xA0;=&#xA0;</mtext><mi>R</mi><mi>S</mi><mi>S</mi><mo>+</mo><mi>&#x3BB;</mi><mstyle displaystyle="false"><munderover><mo>&#x2211;</mo><mrow><mi>j</mi><mo>=</mo><mn>1</mn></mrow><mi>p</mi></munderover><mfenced open="|" close="|"><msubsup><mi>&#x3B2;</mi><mi>j</mi><mrow/></msubsup></mfenced></mstyle></math>

With reference to the Lasso loss function and the figure shown below:

Source: Assigned reading

Curves: Income (black), Limit (red), Rating (blue), Student (yellow)

<math xmlns="http://www.w3.org/1998/Math/MathML"><mtext mathvariant="italic">RIDGE&#xA0;Loss&#xA0;Function&#xA0;=&#xA0;</mtext><mi>R</mi><mi>S</mi><mi>S</mi><mo>+</mo><mi>&#x3BB;</mi><mstyle displaystyle="false"><mstyle displaystyle="true"><munderover><mo>&#x2211;</mo><mrow><mi>j</mi><mo>=</mo><mn>1</mn></mrow><mi>p</mi></munderover></mstyle><msubsup><mi>&#x3B2;</mi><mi>j</mi><mn>2</mn></msubsup></mstyle></math>

With reference to the Ridge loss function and the figure shown below:

Source: Assigned reading