Participants are expected to have knowledge of basic statistics, e.g. hypothesis testing, correlation and linear regression, and experience using R and RStudio.
Each day consists of lectures in the morning and practicals using R in the afternoon.
Day 1: Data pre-treatment, PCA and PCR Discussion of different data pre-treatment methods e.g. centering, autoscaling, pareto scaling and range scaling. Data exploration using Principal Component Analysis (PCA) and regression using the principal components from PCA in Principal Component Regression, PCR.
Day 2: Modern regression techniques and model validation Discussion of regression methods for high dimensional data: Partial Least Squares (PLS, a technique similar to PCR but with improvements) and regularized regression (ridge/lasso). Ways of assessing model accuracy will also be discussed.
Day 3: Clustering and classification; k-means, hierarchical clustering, LDA and PLS-DA Discussion of cluster analysis: choice of similarity measure, agglomerative methods, divisive methods, k-means & hierarchical clustering.
Date & duration:
20, 21 and 22 June 2022
The study load of this course is 1.5 ECTS credits.
Details will be announced later.
|PhD candidates affiliated with VLAG/WUR *||200 €|
|All other PhD candidates||400 €|
|Postdoc / staff from VLAG||400 €|
|Postdoc / non-profit staff not affiliated with VLAG||600 €|
|Industry / Non academics||1200 €|
Costs includes material, tea/coffee and lunches.
* VLAG/EPS/PE&RC/WASS/WIAS/WIMEK PhD candidates with an approved TSP.
For registration click here
No charge until 20 May 2022
- 25% of the course fee paid or due till 13 June 2022
- No refund after 13 June 2022
Substitutions for participants may be made at any time until the start of the course.
For more information please contact Suzanne van der Wielen