Schedule of Module 3 : Advanced topics in Machine Learning
Day 1, Monday, 12 May | |
16:00 - 18:00 | Explainability and transparency of Machine Learning Models. By HAINAUT Donatien |
Day 2, Thursday, 15 May | |
16:00 - 18:00 | Programming By VAN DAM Daniel, VAN ES Raymond |
Day 3, Thursday, 22 May | |
16:00 - 18:00 | Variable selection and model agnostic methods By VAN OIRBEEK Robin |
Day 4, Thursday, 5 June | |
16:00 - 18:00 | Programming By VAN DAM Daniel, VAN ES Raymond |
Day 5, Thursday, 12 June | |
16:00 - 18:00 | (Generative) Artificial Intelligence |
Day 6, Thursday, 19 June | |
16:00 - 18:00 | Programming By VAN DAM Daniel, VAN ES Raymond |
Day 7, Tuesday, 24 June | |
16:00 - 18:00 | AI & Ethics By ANTONIO Katrien |
Day 8, Thursday, 26 June | |
16:00 - 18:00 | Programming |
Day 9, Sunday, 31 August | |
16:00 - 23:59 | Third assignment |
-
The aim of this session is to cover the local and global approaches to interpret output of a ML algorithm. We will focus on:
- Partial dependence plots
- Permutation feature importance
- Friedman’s interactions
- Global surrogate models
- Local Interpretable Model-Agnostic explanations (LIME)
- Shapley’s value (SHAP)
-
- At the surface, variable selection seems like a very straightforward and easy-to-understand concept, however, there is (much) more to it than meets the eye. That’s why we will start this session by clearly defining what variable selection is all about while facing the inherent ambiguity of the concept heads-on!
- Next, the usual suspects will be covered. This entails forward/backward/stepwise selection and penalized regression methods (mainly LASSO) but also how variable selection practically is hardwired into widely used ML models such as (ensemble) tree models and deep learning.
- The final part of the session will be about some more involved methods like Boruta and the Genetic Algorithm. For the latter method, a convenient adaptation of the well-known concordance probability (specifically tailored to the needs of insurance data) will be presented, with a special focus on its use in variable selection.
-
- Risks
- Biases
- Discriminations