Self-exciting point process models

class: center, middle, inverse, title-slide

.title[
# Self-exciting point process models
]
.subtitle[
## Charlotte M. Jones-Todd
]
.author[
### 
]
.institute[
### <img src="img/uoa-logo.png" style="width:3.0%" /> University of Auckland
]
.date[
### .typed[]
]

---

--- 
# <svg aria-hidden="true" role="img" viewBox="0 0 512 512" style="height:1em;width:1em;vertical-align:-0.125em;margin-left:auto;margin-right:auto;font-size:inherit;fill:currentColor;overflow:visible;position:relative;"><path d="M470.6 105.4c12.5 12.5 12.5 32.8 0 45.3l-256 256c-12.5 12.5-32.8 12.5-45.3 0l-128-128c-12.5-12.5-12.5-32.8 0-45.3s32.8-12.5 45.3 0L192 338.7 425.4 105.4c12.5-12.5 32.8-12.5 45.3 0z"/></svg> I am not a robot

---

# Patterns in points

.center[
<img src="img/cancer_im.png" width="100%" style="display: block; margin: auto;" />
]

---

# <svg aria-hidden="true" role="img" viewBox="0 0 576 512" style="height:1em;width:1.12em;vertical-align:-0.125em;margin-left:auto;margin-right:auto;font-size:inherit;fill:currentColor;overflow:visible;position:relative;"><path d="M184.1 38.2c9.9 8.9 10.7 24 1.8 33.9l-72 80c-4.4 4.9-10.6 7.8-17.2 7.9s-12.9-2.4-17.6-7L39 113c-9.4-9.4-9.4-24.6 0-33.9s24.6-9.4 33.9 0l22.1 22.1 55.1-61.2c8.9-9.9 24-10.7 33.9-1.8zm0 160c9.9 8.9 10.7 24 1.8 33.9l-72 80c-4.4 4.9-10.6 7.8-17.2 7.9s-12.9-2.4-17.6-7L39 273c-9.4-9.4-9.4-24.6 0-33.9s24.6-9.4 33.9 0l22.1 22.1 55.1-61.2c8.9-9.9 24-10.7 33.9-1.8zM256 96c0-17.7 14.3-32 32-32H512c17.7 0 32 14.3 32 32s-14.3 32-32 32H288c-17.7 0-32-14.3-32-32zm0 160c0-17.7 14.3-32 32-32H512c17.7 0 32 14.3 32 32s-14.3 32-32 32H288c-17.7 0-32-14.3-32-32zM192 416c0-17.7 14.3-32 32-32H512c17.7 0 32 14.3 32 32s-14.3 32-32 32H224c-17.7 0-32-14.3-32-32zM80 464c-26.5 0-48-21.5-48-48s21.5-48 48-48s48 21.5 48 48s-21.5 48-48 48z"/></svg> `INLA`

Integrated Nested Laplace Approximation (INLA)

--
 
  - Approximate Bayesian inference for Latent Gaussian Models
  
  - `INLA`verse
    - `INLABMA`
    - `inlabru`
    - `INLAMSM`
    - `nmaINLA`
    - `rSPDE` 
    - ...

---

# Spatial LGCP

<br>
<br>
 
`$$\Lambda(\underline{s}) = \text{exp}(\boldsymbol{X \beta} + G(\underline{s}) + \epsilon)$$`

- `$\boldsymbol{X \beta}$`, design matrix
  - `$G(\underline{s})$`, Gaussian Markov Random Field (GMRF)
  - `$\epsilon$`, error

`$G(\underline{s})$` has a Matérn covariance with parameters `$\tau$` and `$\kappa$`, where  
  - `$r=\frac{\sqrt{8}}{\kappa}$` is the range, and 
  - `$\sigma=\frac{1}{\sqrt{4\pi\kappa^2\tau^2}}$` standard deviation

---

# INLA-SPDE

![](img/lgcp-fit.png)

---

Template Model builder (`TMB`) `R` package for fitting latent variable models
  - Calculates first and second order derivatives of the likelihood function by AD (or any objective function written in `C++`)
  - The objective function (and its derivatives) can be called from `R`
  - The user can specify that the Laplace approximation should be applied to any subset of the function arguments
     - Yields marginal likelihood in latent variable model
  - Standard deviations of any parameter, or derived parameter, obtained by the delta method

---

# `TMB` in summary

A TMB project consists of an `R` file and a `C++` file. The R file does pre- and post processing of data in addition to maximizing the log-likelihood contained in *.cpp.

The purpose of the C++ program is to evaluate the objective function. The program is compiled and called from R, where it can be fed to a function minimizer (e.g., `nlminb()`).

---

# `TMB` in summary

![](https://kaskr.github.io/adcomp/TMB_components.png)

---

# `stelfi`

.center[
<img src="img/hawked_fitted.png" width ="90%" />
]

---

# <svg aria-hidden="true" role="img" viewBox="0 0 512 512" style="height:1em;width:1em;vertical-align:-0.125em;margin-left:auto;margin-right:auto;font-size:inherit;fill:currentColor;overflow:visible;position:relative;"><path d="M256 0C114.6 0 0 114.6 0 256s114.6 256 256 256s256-114.6 256-256S397.4 0 256 0zM256 464c-114.7 0-208-93.31-208-208S141.3 48 256 48s208 93.31 208 208S370.7 464 256 464zM256 336c-18 0-32 14-32 32s13.1 32 32 32c17.1 0 32-14 32-32S273.1 336 256 336zM289.1 128h-51.1C199 128 168 159 168 198c0 13 11 24 24 24s24-11 24-24C216 186 225.1 176 237.1 176h51.1C301.1 176 312 186 312 198c0 8-4 14.1-11 18.1L244 251C236 256 232 264 232 272V288c0 13 11 24 24 24S280 301 280 288V286l45.1-28c21-13 34-36 34-60C360 159 329 128 289.1 128z"/></svg>

.center[
<img src="img/niwa.png" width ="60%" />
]

---

# A self-exciting point process (Hawkes model)

`$$\lambda(t) = \color{red} \mu + \color{blue} \alpha \Sigma_{i:\tau_i<t}\text{exp}(-\color{green}\beta * (t-\tau_i))$$`

- `$\color{red} \mu$`, background rate
 - `$\color{blue} \alpha$`, increase in intensity after an event
 - `$\color{green}\beta$`, exponential decay
 - `$\Sigma_{i:\tau_i<t} \cdots$`, historic dependence
 - `$\frac{\color{blue} \alpha}{\color{green}\beta}$`, branching ratio (average number events triggered by an event)
 - `$\frac{1}{\color{green}\beta}$`, rate of decay of self-excitement
 
.center[
<img src="img/hawkes_example.png" width ="50%" />
]

---

# Log-likelihood (it's nasty)

`$$\sum_{i=1}^k \text{log} \left [\lambda + \alpha \sum_{j=1}^{i-1}\text{exp}(-\beta(t_i - t_j))\right ] -\lambda t_k + \frac{\alpha}{\beta} \sum_{i=1}^k \left [ \text{exp}(-\beta(t_k - t_i)) - 1\right ]$$`
`$$\vdots$$`

`$$\sum_{i=1}^k \text{log} \left [\lambda + \alpha A(i)\right ] -\lambda t_k + \frac{\alpha}{\beta} \sum_{i=1}^k \left [ \text{exp}(-\beta(t_k - t_i)) - 1\right ]$$`
where,

`$$A(i) = \text{exp}(-\beta(t_i - t_{i-1}))(1 + A(i-1)).$$`

---

# Temporal rate of retweets

.center[
<img src="img/niwa_fit.png" width ="80%" />
]

---

# Temporal rate of retweets

`$$\lambda(t) = \color{red} \mu + \color{blue} \alpha \Sigma_{i:\tau_i<t}\text{exp}(-\color{green}\beta * (t-\tau_i))$$`

+  `$n = 4890$` retweets over `$\text{T} = 3143$` mins (~2 days)

+ `$\color{red} {\hat{\mu}} \text{T} = 0.063 \times 3143 \sim 198$` 'baseline' tweets

+ Expected number of retweets triggered by any one tweet `$\frac{\color{blue}{\hat{\alpha}}}{\color{green}{\hat{\beta}}} \sim 0.94$`
 
 + Rate of decay for the self-excitement `$\frac{1}{\color{green}{\hat{\beta}}} = \frac{1}{0.079} \sim 12$` mins

---

.center[
<img src="img/whale_data.png" width ="60%" />
]

---

# A self-exciting cue rate model

`$$\lambda(t) = \color{red} {\mu(t)} + \color{blue} \alpha \Sigma_{i:\tau_i<t}\text{exp}(-\color{green}\beta * (t-\tau_i))$$`

where

`$$\text{log}(\color{red} {\mu(t)}) = \beta_{1} I_{s(t)} + \beta_2 d(t) + \beta_3 r(t)$$`

- `$d(t)$`, depth
 - `$r(t)$`, rate of descent (+ve when descending, -ve when ascending)
 - `$I_{s(t)} = 0$` on 'surface' `$= 1$` when in 'deep water'

---

# Underdispersion

Now, a Hawkes Process is an inhomogeneous Poisson process:

+ at any time `$\tau_i$` the number of events `$N(t) \sim \text{Poisson}( \int_0^{\tau_i} \lambda(t) dt)$`,
 + therefore the difference between the compensators `$\int_0^{\tau_i} \lambda(t) dt$` for consecutive events assumes an exponential distribution with mean 1...

But, this is not the case here so, we assume a Weibull distribution with dispersion parameter `$k$`
 
  + when `$k = 1$` we have an exponential distribution again.

---

# Cue rate

.center[
<img src="img/whale_fit.jpeg" width ="70%" />
]

---

# Dispersion parameter

.center[
<img src="img/whale_dispersion.png" width ="90%" />
]

---

.center[
<img src="img/sasquatch.png" width ="60%" />
]

---

.pull-left[
<br>
<br>
![](https://upload.wikimedia.org/wikipedia/en/thumb/9/99/Patterson%E2%80%93Gimlin_film_frame_352.jpg/230px-Patterson%E2%80%93Gimlin_film_frame_352.jpg)]

.pull-right[<img src = https://upload.wikimedia.org/wikipedia/commons/thumb/1/14/Pikes_peak_highway_big_foot.jpg/450px-Pikes_peak_highway_big_foot.jpg width=60%/>]

[![Bigfoot Field Researchers Organization](https://www.bfro.net/images/bionics_v1_1213.jpg)](https://www.bfro.net/)
---

# A marked LGCP

Setting `$m_i = 1$` if the Bigfoot sighting is classified as clear (Class A) and  `$m_i = 0$` if the sighting is not (class B).

.center[
<img src="img/bf_marked.jpg" width ="80%" />
]

---

# A joint model

The model:

`$$\Lambda(\boldsymbol{s}) = \text{exp}(\beta_0 + \beta_1 x_{\text{elev}}(s) +  G(\boldsymbol{s}) + \epsilon)$$`
`$$\text{logit}(p(\boldsymbol{s}))^{-1} = \beta_{0^m} + \beta_1^m x_{\text{elev}}(s) + G_m(\boldsymbol{s}) +\alpha_{m}\; G(\boldsymbol{s}) + \epsilon_{m}$$`

where `$m(s) \sim \text{Bernoulli}(p(\boldsymbol{s}))$` and the spatial intensity of all sightings is, as previously, `$\Lambda(\boldsymbol{s})$`. A spatial covariate, `$x_{\text{elev}}(s)$` the elevation in kilometers.

---

# A joint model
 
.pull-left[
<br>
<br>
<br>

| Parameter | est.    | se   |
 |-----------|---------|------|
 | `$\beta_0$` | -0.823  | 0.358 | 
 | `$\beta_1$` | 0.114  | 0.251  |
 | `$\beta_0^m$` | 0.227  | 0.284 |
 | `$\beta_1^m$` | -0.372  | 0.178 |
 | `$\alpha_m$` | 0.044  | 0.063 |
]

.pull-right[
<img src="img/bf_joined.png" height ="70%" />
]

---
# Estimated mark distribution

.center[
<img src="img/bf_prob.png" height ="70%" />
]

---

# But, is seeing Bigfoot contagious?

.center[
<img src="img/hawked_fitted.png" width ="90%" />
]

---
# So, is seeing Bigfoot contagious?

`$$\lambda(t) = \color{red} \mu + \color{blue} \alpha \Sigma_{i:\tau_i<t}\text{exp}(-\color{green}\beta * (t-\tau_i))$$` 
<br>
<br>

+  `$n = 972$` sightings over `$\text{T} = 2188$` days

+ `$\color{red} {\hat{\mu}} \text{T} = 0.12 \times 2188 \sim 263$` baseline sightings

+ Expected number of sightings triggered by any one sighting `$\frac{ \color{blue}{\hat{\alpha}}}{ \color{green}{\hat{\beta}}} = \frac{0.06}{0.09} = \frac{2}{3}$`

+ Expected number of descendants per sighting `$\frac{\color{green}{\hat{\beta}}}{\color{green}{\hat{\beta}} - \color{blue}{\hat{\alpha}}} = \frac{0.09}{0.09 - 0.06} = 3$`
 
 + Rate of decay for the self-excitement `$\frac{1}{\color{green}{\hat{\beta}}} = \frac{1}{0.09} \sim 11$` days

---

# A spatiotemporal self-exciting process

`$$\lambda(\boldsymbol{s},t) = \color{red} \mu + \color{blue} \alpha \Sigma_{i:\tau_i<t}\left [ \text{exp}(-\color{green}\beta * (t-\tau_i)) \color{purple}{K_i(\boldsymbol{s} - \boldsymbol{x}_i, t - \tau_i)} \right ] + \color{orange} {G(\boldsymbol{s})} + \epsilon$$`

where `$\boldsymbol{x}_i$` are the spatial locations.

The spatial self-excitement kernel is given by `$\color{purple}{K_i(\boldsymbol{s} - \boldsymbol{x}_i, t - \tau_i)} \sim \text{Normal}(0, \boldsymbol{Q^{-1}})$`, can either be time-independent where `$\boldsymbol{Q^{-1}} = \begin{bmatrix} \sigma_x^2 & \rho \sigma_x \sigma_y \\ \rho \sigma_x \sigma_y & \sigma_y^2 \end{bmatrix}$` or time-dependent where `$\boldsymbol{Q^{-1}} = \begin{bmatrix} \sigma_x^2 & \rho \sigma_x \sigma_y \\ \rho \sigma_x \sigma_y & \sigma_y^2 \end{bmatrix}\times (t_j - t_i)$` for `$t_j > t_i$`.

`$\color{orange} {G(\boldsymbol{s})}$` is an (optional) Gaussian random field.

---
# Reported Bigfoot sightings

.center[
<img src="img/sasquatch.png" width ="60%" />
]

---

# Spatiotemporal self-exciting process

.center[
<img src="img/stelfi.png" width ="70%" />
]

---

# Other work:  self-exciting SCR

.footnote[.tiny[Alec van Helsdingen's work, don't ask me too many questions]]

The idea is to model `$\lambda_{ij}(t)$`, the 'intensity' of animal `$i$` at camera `$j$` over time `$t$`:
 
 `$$\lambda_{ij}(t) = \mu_0 \left[H_{ij} + \text{exp}(-\beta (t - \tau_i(t)))(S_{ij}(t) - H_{ij})\right]$$`
  - `$H_{ij}$` bivariate normal centered on the home range
  - `$S_{ij}(t)$` bivariate normal 'spike' in intensity immediately after detection centered on the camera
 
--

.pull-left[
<img src="img/t00.png" width ="70%" />
]

.pull-right[
<img src="img/gif.gif" width ="70%" />
]

---

# Other work: Multivariate Hawkes

<br>
<br> 
.center[
<img src="img/nessie_hists.png" width ="90%" />
]

.footnote[.tiny[<svg aria-hidden="true" role="img" viewBox="0 0 512 512" style="height:1em;width:1em;vertical-align:-0.125em;margin-left:auto;margin-right:auto;font-size:inherit;fill:currentColor;overflow:visible;position:relative;"><path d="M256 0C114.6 0 0 114.6 0 256s114.6 256 256 256s256-114.6 256-256S397.4 0 256 0zM256 464c-114.7 0-208-93.31-208-208S141.3 48 256 48s208 93.31 208 208S370.7 464 256 464zM256 336c-18 0-32 14-32 32s13.1 32 32 32c17.1 0 32-14 32-32S273.1 336 256 336zM289.1 128h-51.1C199 128 168 159 168 198c0 13 11 24 24 24s24-11 24-24C216 186 225.1 176 237.1 176h51.1C301.1 176 312 186 312 198c0 8-4 14.1-11 18.1L244 251C236 256 232 264 232 272V288c0 13 11 24 24 24S280 301 280 288V286l45.1-28c21-13 34-36 34-60C360 159 329 128 289.1 128z"/></svg> what data are these?]]

---

# Other work: Multivariate Hawkes

<br>
<br> 
.center[
<img src="img/nessie.png" width ="90%" />
]

# GOF

---

# [<span><i class="fab  fa-google faa-float animated "></i></span> cmjt.github.io](https://cmjt.github.io)

<table class="visual-citation">
  <tr>
    <td>
      <div style="width:600px">
        <div class="top-row" style="font-size: 10pt;font-family: 'Open Sans', 'Arial', 'Helvetica', sans-serif;font-weight: bold;text-align: left;color:#EE9900;margin-bottom: 10px;">
          <span>Statistics in Medicine (2018)</span>
        </div>
        <div class="title-row" style="font-size: 12pt;font-family: 'Bitter', 'Palatino', 'Georgia', 'Times New Roman', serif;font-weight: 600;text-align: left;color:#000000;margin-bottom:15px;">
          <span>Identifying prognostic structural features in tissue sections of colon cancer patients using point pattern analysis</span>
        </div>
        <div class="author-row" style="font-size: 8pt;font-family: 'Bitter‘, 'Georgia', 'Times New Roman', serif;font-weight: normal;text-align: left;color:#000000;">
          <span>Jones-Todd, Charlotte M. et. al.</span>
        </div>
      </div>
    </td>
    <td>
      <img src="qr/Jones-Todd_2018_qr.png" alt="QR code"/>
    </td>
  </tr>
</table>
 
<table class="visual-citation">
  <tr>
    <td>
      <div style="width:600px">
        <div class="top-row" style="font-size: 10pt;font-family: 'Open Sans', 'Arial', 'Helvetica', sans-serif;font-weight: bold;text-align: left;color:#EE9900;margin-bottom: 10px;">
          <span>Journal of the Royal Statistical Society: Series C (Applied Statistics) (2017)</span>
        </div>
        <div class="title-row" style="font-size: 12pt;font-family: 'Bitter', 'Palatino', 'Georgia', 'Times New Roman', serif;font-weight: 600;text-align: left;color:#000000;margin-bottom:15px;">
          <span>A spatiotemporal multispecies model of a semicontinuous response</span>
        </div>
        <div class="author-row" style="font-size: 8pt;font-family: 'Bitter‘, 'Georgia', 'Times New Roman', serif;font-weight: normal;text-align: left;color:#000000;">
          <span>Jones‐Todd, Charlotte M. et. al.</span>
        </div>
      </div>
    </td>
    <td>
      <img src="qr/Jones%E2%80%90Todd_2017_qr.png" alt="QR code"/>
    </td>
  </tr>
</table>

---

.pull-left[
.animate__animated.animate__bounceInDown[

```
 [1] " __________________"                                                         
 [2] "< Diolch am wrando >"                                                        
 [3] " ------------------"                                                         
 [4] "      \\                    / \\  //\\"                                      
 [5] "       \\    |\\___/|      /   \\//  \\\\"                                   
 [6] "            /0  0  \\__  /    //  | \\ \\    "                               
 [7] "           /     /  \\/_/    //   |  \\  \\  "                               
 [8] "           @_^_@'/   \\/_   //    |   \\   \\ "                              
 [9] "           //_^_/     \\/_ //     |    \\    \\"                             
[10] "        ( //) |        \\///      |     \\     \\"                           
[11] "      ( / /) _|_ /   )  //       |      \\     _\\"                          
[12] "    ( // /) '/,_ _ _/  ( ; -.    |    _ _\\.-~        .-~~~^-."              
[13] "  (( / / )) ,-{        _      `-.|.-~-.           .~         `."             
[14] " (( // / ))  '/\\      /                 ~-. _ .-~      .-~^-.  \\"          
[15] " (( /// ))      `.   {            }                   /      \\  \\"         
[16] "  (( / ))     .----~-.\\        \\-'                 .~         \\  `. \\^-."
[17] "             ///.----..>        \\             _ -~             `.  ^-`  ^-_"
[18] "               ///-._ _ _ _ _ _ _}^ - - - - ~                     ~-- ,.-~"  
[19] "                                                                  /.-~"      
```
]
]

.pull-right[
.center[
<img src="https://www.royalsociety.org.nz/assets/Uploads/Marsden-logo-rgb-96dpi.jpg" width ="20%" /> &nbsp; &nbsp;
<img src="https://inro.pdn.ac.lk/assets/images/opportunities/AOARD.png" width ="20%" />
]
.center[[<span><i class="fab  fa-twitter faa-float animated "></i>@cmjonestodd</span>](https://twitter.com/cmjonestodd)]
<br>
.center[[<span><i class="fab  fa-github faa-float animated "></i></span> @cmjt](https://github.com/cmjt)]
<br>
.center[[<span><i class="fab  fa-google faa-float animated "></i></span> cmjt.github.io](https://cmjt.github.io)]
]