Depending on who you ask, this talk is about:
These are generally the same, but differ in the terminology used for the same concepts, certain convictions, and approaches.
Statistical tests are tools made for hypothesis testing.
They yield the p-value, but do not say anything about direction or size.
Examples are:
Statistical models are mathematical models that summarise the relationship between different variables
This relationship is quantified and can be used for a plethora of goals beyond hypothesis testing.
Statistical tests are just simple statistical models:
| Statistical test | Equivalent statistical model |
|---|---|
| One-sample t-test | Intercept-only linear regression |
| Wilcoxon signed-rank test | Ranked univariable linear regression |
| Two-sample t-test | Univariable linear regression |
| One-way ANOVA | Multivariable linear regression |
| Chi-square test | Univariable logistic regression w/ dichotomous independent variable |
Models allow us to:
Descriptive:
Describe some situation, such as the prevalence of a disease, or how two variables relate to each other.
Causal/etiological:
Determine whether some factor A causes some event B
Predictive (diagnostic & prognostic):
Predict the presence/future occurrence of some event or present/future value of some measurement.
These are often conflated!
Ramspek CL, et al. Prediction or causality? A scoping review of their conflation within current observational research. Eur J Epidemiol. 2021
This is problematic because, these different pillars require:
We want our models to accurately represent some situation
To achieve this, we need to fill our model with data: covariates & parameters.
Covariates are variables (e.g. age, sex, eGFR) that we add to the model to represent our situation of interest:
The way we select these covariates is important.
For causal & prediction research
For confounder selection: use DAGs
Directed acyclic graph (DAG):
Two important lessons that DAGs teach us:
Only for prediction research
Have the data select the most predictive variables (from a set of candidate predictors) through:
From worse to better:
Core idea: based on some measure, determine which variables aid prediction and which do not
Overfitting: the prediction model has incorporated patterns that are coincidentally present in the development data, but not in the target population
Result: model performance is worse in reality than observed during development
Some models require additional parameters (also called hyperparameters) that need to be decided:
Some of these are determined based on knowledge (e.g. link function), some of these are determined through ‘tuning’ (risk of overfitting!)
Sometimes, we may also add weights as an additional parameter to models.
Weights result in observations being counted more or less frequently than once.
Most of our models summarise our data to the model output. For instance:
Tip
Giving a weight to a coefficient is also called the model learning the weight for that coefficient (or feature), hence the term machine learning
Not all models are fully parametric:
This is relevant because:
The simplest architecture: \(y = ax + b\) (linear regression)
Say that \(ax + b = X\beta\)
Generalised linear model:
Cox regression:
\(y(t) = 1 - S_0(t)^{e^{X\beta}}, S_0(t) = e^{-H_0(t)}\)
Random forest: 
Artificial neural network: 
Now relating it to the pillars:

Bias-variance trade-off
Bias: how predictions match the truth (i.e. good performance)
Variance: how performance varies between settings
Machine learning vs. Statistics
Machine learning = statistics
| Statistics | Machine learning |
|---|---|
| Predictor | Feature |
| Outcome | Label |
| Estimation | Learning |
| Development data | Training + validation data |
| Validation data | Test data |
| Contingency table | Confusion matrix |
More @ Janse RJ, et al.. When the whole is greater than the sum of its parts: why machine learning and conventional statistics are complementary for predicting future health outcomes. Clin Kidney J. 2025 & Finlayson SG, et al. Machine Learning and Statistics in Clinical Research Articles-Moving Past the False Dichotomy. JAMA Pediatr. 2023 May
Use a programming language!
R: free, built for statistics, easy to program and read, great tools for data visualisation, used in most of medical statistics
Get started: my tutorial 😁
Python: free, built for general purposes, good statistical support, easy to program and read, used in most of computer sciences
Get started: w3schools
Julia: free, built for statistics, good statistical support, fast, relatively uncommon in use
Get started: JuliaLang
Do not use a syntax!
SPSS: €410.65/year, built for statistics, easy point-and-click, inflexible, poor syntax system, unstable (in my experience).
SAS: €?/year, built for statistics, not so flexible, syntax-reliant, many companies are reliant on it
STATA: €150.32/year, built for statistics, not so flexible, syntax-reliant, many companies are reliant on it
Models are only useful if we report on them.
For all epidemiological studies: STROBE.
Also:
Make sure to check the Equator network for more!
Contact me: r.j.janse-5@umcutrecht.nl
More about me: rjjanse.github.io
These slides: rjjanse.github.io/talks/modelling
Image for title slide by Environmental Graphiti:
350 Species at Risk from Climate Change
© 2025 Environmental Graphiti® All rights reserved.