Partial Least Squares Explained

Partial Least Squares (PLS) is a widely used technique in chemometrics, especially in the case where

  1. there is multi-collinearity in the set of variables;
  2. the number of variables is larger than the number of data points; and
  3. there are multiple response variables.

There are many articles on PLS but the mathematical details of PLS do not always come out clearly in these treatments. I have attempted to describe PLS in clear and precise mathematical terms in this technical note

A Simple Explanation of Partial Least Squares.

In particular, I show that, given design matrix X and response matrix Y, the PLS algorithm seeks transformed variables of the design matrix X that have high variance (like in principal component analysis) and high correlation with the response matrix Y, as stated in Section 3.5 of The Elements of Statistical Learning.

Advertisements

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s