Shiny dashboard "Statistical foundations of machine learning"

Estimation

Common left panel:

point estimation (1D Gaussian)

Goal: visualize the bias/variance of the sample average and sample variance estimators.

Data generating process: normal random variable ${\bf z} \sim {\mathcal N} (\mu,\sigma^2)$. Estimation of mean and variance.
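A minimal Monte Carlo sketch of what this panel shows (plain R, independent of the dashboard code; the values of $\mu$, $\sigma$, the sample size and the number of trials are illustrative):

```r
## Sampling distribution of the sample average and sample variance
set.seed(0)
mu <- 1; sigma <- 2        # parameters of the Gaussian (illustrative)
N <- 20                    # sample size
R <- 10000                 # number of Monte Carlo trials

mu.hat  <- numeric(R)
var.hat <- numeric(R)
for (r in 1:R) {
  z <- rnorm(N, mean = mu, sd = sigma)
  mu.hat[r]  <- mean(z)    # sample average
  var.hat[r] <- var(z)     # sample variance (unbiased, divides by N-1)
}

mean(mu.hat)  - mu         # bias of the sample average: close to 0
mean(var.hat) - sigma^2    # bias of the sample variance: close to 0
var(mu.hat)                # variance of the sample average: close to sigma^2/N
```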

Suggested manipulations:

point estimation (1D Uniform)

Goal: visualize the bias/variance of the sample average and sample variance estimators. Show that the unbiasedness of those estimators is independent of the distribution.

Data generating process: uniform random variable ${\bf z} \sim {\mathcal U} (a,b)$. Estimation of mean and variance.
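The same Monte Carlo exercise can be repeated for the uniform case, where $E[{\bf z}]=(a+b)/2$ and $\text{Var}[{\bf z}]=(b-a)^2/12$; a sketch with illustrative values of $a$ and $b$:

```r
## Unbiasedness does not depend on Gaussianity
set.seed(0)
a <- -1; b <- 3
mu.true  <- (a + b) / 2     # mean of U(a,b)
var.true <- (b - a)^2 / 12  # variance of U(a,b)
N <- 20; R <- 10000

mu.hat  <- replicate(R, mean(runif(N, a, b)))
var.hat <- replicate(R, var(runif(N, a, b)))

mean(mu.hat)  - mu.true     # close to 0: still unbiased
mean(var.hat) - var.true    # close to 0: still unbiased
```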

Suggested manipulations:

confidence interval (1D Gaussian)

Goal: visualize the notion of a confidence interval.

Data generating process: normal random variable ${\bf z} \sim {\mathcal N} (\mu,\sigma^2)$. Estimation of confidence interval of the mean.
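Assuming the dashboard uses the known-variance interval $\bar{z} \pm z_{1-\alpha/2}\,\sigma/\sqrt{N}$, a sketch checking its empirical coverage (all numeric values are illustrative):

```r
## Empirical coverage of a confidence interval for the mean (known sigma)
set.seed(0)
mu <- 0; sigma <- 1; N <- 30; alpha <- 0.1; R <- 10000

half <- qnorm(1 - alpha / 2) * sigma / sqrt(N)  # half-width of the interval
covered <- replicate(R, {
  z.bar <- mean(rnorm(N, mu, sigma))
  (z.bar - half <= mu) && (mu <= z.bar + half)
})
mean(covered)   # close to 1 - alpha = 0.9
```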

Suggested manipulations:

Likelihood (1 par)

Goal: visualize the relation between the accuracy of the estimation and the log-likelihood.

Data generating process: normal random variable ${\bf z} \sim {\mathcal N} (\mu,\sigma^2)$. Maximum likelihood estimation of the mean (known $\sigma^2$). We denote by $\hat{\mu}_{ml}$ the maximum-likelihood (m.l.) estimator.
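A sketch of the log-likelihood as a function of $\mu$: its maximizer coincides with the sample average, which is the closed-form $\hat{\mu}_{ml}$ (data and grid are illustrative):

```r
## Log-likelihood of mu with known sigma
set.seed(0)
sigma <- 1; N <- 50
z <- rnorm(N, mean = 2, sd = sigma)

loglik <- function(m) sum(dnorm(z, mean = m, sd = sigma, log = TRUE))
mu.grid <- seq(0, 4, by = 0.01)
ll <- sapply(mu.grid, loglik)

mu.grid[which.max(ll)]  # numerical maximizer of the log-likelihood
mean(z)                 # analytical ML estimator: the sample average
plot(mu.grid, ll, type = "l", xlab = expression(mu), ylab = "log-likelihood")
```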

Suggested manipulations:

Likelihood (2 pars)

Goal: visualize the relation between the accuracy of the estimation and the log-likelihood.

Data generating process: normal random variable ${\bf z} \sim {\mathcal N} (\mu,\sigma^2)$. Maximum likelihood estimation of both mean and variance. We denote by $\theta=[\mu,\sigma^2]$ the parameter vector and by $\hat{\theta}_{ml}$ the m.l. estimator.
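In this case the closed-form solution is $\hat{\mu}_{ml}=\bar{z}$ and $\hat{\sigma}^2_{ml}=\frac{1}{N}\sum_{i=1}^N (z_i-\bar{z})^2$ (note the division by $N$, which makes the ML variance estimator slightly biased). A numerical check, with illustrative data:

```r
## Two-parameter maximum likelihood for the Gaussian
set.seed(0)
N <- 100
z <- rnorm(N, mean = 2, sd = 1.5)

negloglik <- function(theta) {     # theta = c(mu, sigma2)
  if (theta[2] <= 0) return(Inf)   # keep the variance positive
  -sum(dnorm(z, mean = theta[1], sd = sqrt(theta[2]), log = TRUE))
}

fit <- optim(c(0, 1), negloglik)   # numerical ML estimate
fit$par                            # close to the closed-form solution below
c(mean(z), mean((z - mean(z))^2))  # sample mean and (biased) ML variance
```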

Suggested manipulations:

point estimation (2D Gaussian)

Goal: visualization of the multivariate variance (i.e., covariance) of the sampling distribution of a multivariate estimator.

Data generating process: normal 2D random vector ${\bf z} \sim {\mathcal N} (\mu,\Sigma^2)$, where $\mu=[0,0]^T$ is a $[2,1]$ vector and the covariance $\Sigma^2$ is a $[2,2]$ matrix.

Note that the identity covariance matrix corresponds to the eigenvalues $\lambda_1=\lambda_2=1$ and the rotation angle $\theta=0$.
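A sketch of this parameterization: the covariance matrix is built from the eigenvalues $\lambda_1,\lambda_2$ and a rotation angle $\theta$ as $R(\theta)\,\text{diag}(\lambda_1,\lambda_2)\,R(\theta)^T$, which reduces to the identity for $\lambda_1=\lambda_2=1$ and $\theta=0$ (the numeric values below are illustrative):

```r
## Covariance matrix from eigenvalues and rotation angle
lambda1 <- 2; lambda2 <- 0.5; theta <- pi / 6
Rot <- matrix(c(cos(theta), sin(theta),
               -sin(theta), cos(theta)), 2, 2)  # rotation by theta
Sigma <- Rot %*% diag(c(lambda1, lambda2)) %*% t(Rot)
Sigma                       # the resulting covariance matrix

## Sampling from the 2D Gaussian via the Cholesky factor
set.seed(0)
N <- 500
Z <- matrix(rnorm(2 * N), N, 2) %*% chol(Sigma)
cov(Z)                      # close to Sigma
plot(Z, asp = 1, xlab = "z1", ylab = "z2")
```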

Suggested manipulations:

Linear Regression

Goal: visualization of the sampling distribution (bias/variance) of the least-squares estimators versus the true parameters, illustrating the unbiasedness of least squares.

Data generating process: conditional distribution given by ${\bf y}=\beta_0+\beta_1 x+{\bf w}$, where ${\bf w} \sim {\mathcal N} (0,\sigma_w^2)$ and ${\bf x}$ is uniformly distributed.

Top left sliders:

Top middle: 3D visualization of the joint density $p({\bf x}=x,{\bf y}=y)$.

Top right: sampling distribution of the fitted lines, the function $f$ and its value.

Bottom left: sampling distribution of the estimator of the conditional expectation $E[{\bf y}|x]$ (in green).

Bottom right: sampling distributions of the estimators of:

  1. $\beta_0$
  2. $\beta_1$
  3. $\sigma^2_{w}$
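A Monte Carlo sketch of the sampling distributions visualized here, refitting a line with `lm` on each simulated dataset (all parameter values are illustrative):

```r
## Sampling distribution of the least-squares estimators
set.seed(0)
beta0 <- 1; beta1 <- -2; sigma.w <- 0.5
N <- 30; R <- 2000

est <- t(replicate(R, {
  x <- runif(N, -1, 1)                       # uniformly distributed input
  y <- beta0 + beta1 * x + rnorm(N, sd = sigma.w)
  fit <- lm(y ~ x)
  c(coef(fit), sigma2 = summary(fit)$sigma^2)
}))

colMeans(est)       # close to (beta0, beta1, sigma.w^2): unbiasedness
apply(est, 2, var)  # variance of each estimator
```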

Suggested manipulations:

Nonlinear Regression

Goal: visualization of the sampling distribution (bias/variance) of the predicted versus the true conditional expectation.

Data generating process: ${\bf y}=f(x)+{\bf w}$, where ${\bf w} \sim {\mathcal N} (0,\sigma_w^2)$. Estimator: $h(x)=\hat{\beta}_0 +\sum_{i=1}^m \hat{\beta}_i x^i$. Parameters are estimated by least squares.

Top left sliders:

Top right: 3D visualization of the joint density $p({\bf x}=x,{\bf y}=y)$

Bottom left: sampling distribution of the estimator of the conditional expectation $E[{\bf y}|x]$ (in green) for different $x$ values. The mean of the estimated conditional expectation is in blue.

Bottom right: sampling distribution of the estimator of the conditional expectation $E[{\bf y}|x]$ for a given $x$.
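A sketch of the bottom-right panel's content: the sampling distribution of the polynomial predictor $h(x)$ at a fixed query point, for an arbitrary nonlinear $f$ and an illustrative degree $m$:

```r
## Bias/variance of a degree-m polynomial predictor at a query point x0
set.seed(0)
f <- function(x) sin(2 * pi * x)  # illustrative nonlinear target
sigma.w <- 0.3; N <- 50; m <- 3; R <- 2000
x0 <- 0.25                        # query point

h.x0 <- replicate(R, {
  x <- runif(N)
  y <- f(x) + rnorm(N, sd = sigma.w)
  fit <- lm(y ~ poly(x, m, raw = TRUE))  # least-squares polynomial fit
  predict(fit, newdata = data.frame(x = x0))
})

mean(h.x0) - f(x0)  # bias of the predictor at x0
var(h.x0)           # variance of the predictor at x0
```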

Suggested manipulations:


