In dajmcdon/ubc-stat406-labs: Tutorials and labs for UBC Stat 406 in the 2020-2021 online year

library(knitr)
library(tidyverse)
library(cowplot)
opts_chunk$set(echo=FALSE, fig.align='center', fig.width=4, fig.height=3,
               cache=TRUE, autodep=TRUE, cache.comments=FALSE,
               message=FALSE, warning=FALSE)

GD for logistic regression

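Suppose $Y=1$ with probability $p(x)$ and $Y=0$ with probability $1-p(x)$. I want to model $P(Y=1 \mid X=x)$. I'll assume that the log-odds are linear in $x$, that is, $\log\left(p(x)/(1-p(x))\right) = ax$ for some scalar $a$, or equivalently $p(x) = e^{ax}/(1+e^{ax})$.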

We're going to estimate $a$ given data.

First, we need data.

set.seed(20200405)
n = 100
a = 2
x = runif(n)*10 - 5
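# NB: despite its name, this is the inverse logit (logistic) function;
# base R's plogis() computes the same quantity
logit <- function(x) exp(x)/(1+exp(x))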
p = logit(a*x)
y = rbinom(n, 1, p)
df = tibble(x=x, y=y)

ggplot(df, aes(x,y)) + geom_point(color="red") +
  stat_function(fun=function(x) logit(a*x)) + theme_cowplot(14)

The likelihood is given by \[ L(y \mid a, x) = \prod_{i=1}^n p(x_i)^{y_i}(1-p(x_i))^{1-y_i}. \]
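Products of many probabilities can underflow, so the likelihood is usually evaluated on the log scale. A minimal sketch, assuming the model $p(x) = e^{ax}/(1+e^{ax})$ used to simulate the data; `inv_logit`, `loglik`, `x_sim`, and `y_sim` are hypothetical names introduced here, not part of the lab:

```r
# Hypothetical helpers for illustration; not part of the original lab code.
inv_logit <- function(z) 1 / (1 + exp(-z))

# Log-likelihood of the slope a: sum of y*log(p) + (1-y)*log(1-p),
# computed through dbinom(..., log = TRUE) to stay on the log scale
loglik <- function(a, x, y) {
  p <- inv_logit(a * x)
  sum(dbinom(y, size = 1, prob = p, log = TRUE))
}

# Small simulated data set (same recipe as above, with its own seed)
set.seed(1)
x_sim <- runif(50) * 10 - 5
y_sim <- rbinom(50, 1, inv_logit(2 * x_sim))

# Evaluate on a grid; the grid maximizer is a crude estimate of the MLE
a_grid <- seq(0.1, 6, by = 0.01)
ll <- sapply(a_grid, loglik, x = x_sim, y = y_sim)
a_grid[which.max(ll)]
```

A grid search like this only works because $a$ is a scalar; it is meant as a visual check, not as a replacement for the gradient method.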

(Simple) gradient ascent (to maximize $L(a)$) is:

Input $a_0$, $\gamma>0$, $j_{\max}$, $\epsilon>0$, $\nabla L(a)$.
For $j=1,2,\ldots, j_{\max}$, \[ a_j = a_{j-1} + \gamma \nabla L(a_{j-1}) \]
Stop if $\epsilon > |a_j - a_{j-1}|$.
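The steps above can be sketched generically for any one-dimensional objective. This is only an illustration on a toy function whose maximizer is known; `grad_ascent`, `toy_grad`, and the argument names are assumptions made here, not the lab's solution:

```r
# Generic one-dimensional gradient ascent, following the steps above.
# `grad_ascent` and its argument names are hypothetical.
grad_ascent <- function(grad, a0, gam, jmax = 1000, eps = 1e-8) {
  a_old <- a0
  for (j in seq_len(jmax)) {
    a_new <- a_old + gam * grad(a_old)    # a_j = a_{j-1} + gamma * gradient
    if (abs(a_new - a_old) < eps) break   # stop once the update is tiny
    a_old <- a_new
  }
  a_new
}

# Toy concave objective f(a) = -(a - 3)^2 with gradient -2 * (a - 3);
# its maximizer is a = 3.
toy_grad <- function(a) -2 * (a - 3)
a_hat <- grad_ascent(toy_grad, a0 = 0, gam = 0.1)
a_hat
```

For this toy problem each step shrinks the distance to 3 by a factor of 0.8, so the iterates converge; a step size that is too large for the objective would make them overshoot instead.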

Write a function to find $a_{mle}$

Note that on the log scale, with $\ell(a) = \log L(y \mid a, x)$, the gradient is $\nabla \ell(a) = \sum_{i=1}^n (y_i - p_i) x_i$, where $p_i = p(x_i)$ is as above.
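For reference, the gradient can be derived in a few lines, writing $\ell(a)$ for the log of the likelihood and assuming $p_i = e^{a x_i}/(1+e^{a x_i})$, the form used to simulate the data:

```latex
\begin{aligned}
\ell(a) &= \sum_{i=1}^n \left[ y_i \log p_i + (1-y_i)\log(1-p_i) \right] \\
        &= \sum_{i=1}^n \left[ y_i a x_i - \log\left(1 + e^{a x_i}\right) \right], \\
\nabla \ell(a) &= \sum_{i=1}^n \left[ y_i x_i - \frac{e^{a x_i}}{1+e^{a x_i}}\, x_i \right]
               = \sum_{i=1}^n (y_i - p_i)\, x_i .
\end{aligned}
```

The second line uses $\log\left(p_i/(1-p_i)\right) = a x_i$ and $\log(1-p_i) = -\log\left(1+e^{a x_i}\right)$.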

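amle <- function(x, y, a0, gam=0.5, jmax=100, eps=1e-6){
  err = 1e8            # size of the last update; start large so the loop runs
  j = 1
  a = double(jmax)     # storage for the path of iterates
  a[1] = a0
  while(j < jmax && err>eps){
    j = j+1
    p = logit(a[j-1]*x)
    ell = sum((y-p)*x)             # gradient of the log-likelihood at a[j-1]
    a[j] = a[j-1] + gam * ell      # gradient ascent step
    err = abs(a[j] - a[j-1])
  }
  a[1:j]               # iterates visited; the last entry is the estimate
}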
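One way to sanity-check the gradient that drives the update in `amle` is to compare it with a central finite difference of the log-likelihood. The sketch below is self-contained and uses its own simulated data; `inv_logit`, `loglik`, `grad_loglik`, and the `_chk` variables are hypothetical names introduced for this check:

```r
# Hypothetical helpers for illustration; not part of the original lab code.
inv_logit <- function(z) 1 / (1 + exp(-z))
loglik <- function(a, x, y) {
  sum(dbinom(y, size = 1, prob = inv_logit(a * x), log = TRUE))
}
# Analytic gradient of the log-likelihood: sum of (y_i - p_i) * x_i
grad_loglik <- function(a, x, y) sum((y - inv_logit(a * x)) * x)

set.seed(1)
x_chk <- runif(50) * 10 - 5
y_chk <- rbinom(50, 1, inv_logit(2 * x_chk))

# Central finite difference at an arbitrary test point
a_test <- 1.3
h <- 1e-5
fd <- (loglik(a_test + h, x_chk, y_chk) -
       loglik(a_test - h, x_chk, y_chk)) / (2 * h)
c(analytic = grad_loglik(a_test, x_chk, y_chk), finite_diff = fd)
```

If the two numbers disagree by more than a small tolerance, the analytic gradient (or its sign) is the first thing to re-examine.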

Run your function and report the result

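amle(x, y, 5)       # a0 = 5, default step size gam = 0.5
amle(x, y, .1)      # a0 = 0.1, default step size
amle(x, y, 5, .1)   # a0 = 5, smaller step size
amle(x, y, 5, 1)    # a0 = 5, larger step size
coef(glm(y~x-1, data=df, family=binomial)) #just to check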

dajmcdon/ubc-stat406-labs documentation built on Aug. 18, 2020, 1:23 p.m.
