Journal article
Multiblock PLS: Block dependent prediction modeling for Python
Department of Applied Mathematics and Computer Science, Technical University of Denmark1
Statistics and Data Analysis, Department of Applied Mathematics and Computer Science, Technical University of Denmark2
Cognitive Systems, Department of Applied Mathematics and Computer Science, Technical University of Denmark3
Partial Least Squares (PLS) regression is a statistical method for supervised multivariate analysis. It relates two data blocks X and Y to each other with the aim of establishing a prediction model. When deployed in production, this model can be used to predict an outcome y from a newly measured feature vector x.
PLS is popular in chemometrics, process control and other analytic fields, due to its striking advantages, namely the ability to analyze small sample sizes and the ability to handle high-dimensional data with cross-correlated features (where Ordinary Least Squares regression typically fails). In addition, and in contrast to many other machine learning approaches, PLS models can be interpreted using its latent variable structure just like principal components can be interpreted for a PCA analysis.
Language: | English |
---|---|
Year: | 2019 |
ISSN: | 24759066 |
Types: | Journal article |
DOI: | 10.21105/joss.01190 |
ORCIDs: | Baum, Andreas and Vermue, Laurent |