Hidden Markov model

Hidden Markov models (HMM) describe the evolution of a sequence of random variables, {St}\{S_t\} (i.e. behavioural states), which are not directly observable, but can be inferred from another sequence of random variables, {Yt}\{Y_t\}, that are observable (i.e. locations). The two main characteristics of HMMs are (1) each observation is assumed to be generated by one of NN distributions, and (2) the hidden state sequence that determines which of the NN distributions is chosen at time tt is modelled as a Markov chain, where the probability of being in each state at time tt depends only on the state value at the previous time step.

State process

The state process {St}\{S_t\} of a NN-state HMM for TT time steps is characterised by its state transition probability matrix Γ(t)=(γi,j(t))\Gamma^{(t)} = (\gamma^{(t)}_{i,j}), where i,j=1,,Ni,j = 1, \dots, N and γi,j(t)=Pr(st+1=j|st=i)\gamma^{(t)}_{i,j} = \text{Pr}(s_{t+1} = j|s_t = i). The probability of transitioning to state sts_t from state st1s_{t-1} is

stCategorical(Γ(t1)),1tT. s_t \sim \text{Categorical}(\Gamma^{(t-1)}), \quad 1 \leq t \leq T.

Process model

The process equation for the true locations of the animal at regular time intervals tt, zt=[zt,lonzt,lat]z_t = \begin{bmatrix} z_{t, \text{lon}} \\ z_{t, \text{lat}} \end{bmatrix}, assumes that the animal’s location at time tt is not only dependent on the previous location, zt1z_{t-1}, but also on the animal’s previous displacement in each coordinate, zt1zt2z_{t-1} - z_{t-2}:

zt=zt1+λn(zt1zt2)+ϵt,ϵtN(0,Ω),1nN, z_t = z_{t-1} + \lambda_n (z_{t-1} - z_{t-2}) + \epsilon_t, \quad \epsilon_t \sim \text{N}(0, \Omega), \quad 1 \leq n \leq N,

where

Ω=[τϵ,lon200τϵ,lat2]. \Omega = \begin{bmatrix} \tau^2_{\epsilon, \text{lon}} & 0 \\ 0 & \tau^2_{\epsilon, \text{lat}} \end{bmatrix}.

The state-depended parameter, λn\lambda_n, can take values between 0 and 1 (i.e., 0λ10 \leq \lambda \leq 1), and controls the degree of correlation between steps. By default, movetrack estimates track-specific λn\lambda_n values, but it is also possible to use the same λn\lambda_n for all tracks by setting i_lambda = FALSE.

Observation model

The observed locations of an animal, yj=[yj,lonyj,lat]y_j = \begin{bmatrix} y_{j, \text{lon}} \\ y_{j, \text{lat}} \end{bmatrix}, often have irregular time intervals jj, with JJ representing the total number of observed locations. Therefore, the true location of the animal is linearly interpolated to the time of the observation, with wjw_j representing the proportion of the regular time interval between t1t-1 and tt when the observation yjy_j was made:

yj=wjzt+(1wj)zt1+θj,θjT(0,σj),1jJ, y_j = w_j z_t + (1 - w_j) z_{t-1} + \theta_j, \quad \theta_j \sim \text{T}(0, \sigma_j), \quad 1 \leq j \leq J,

where T(0,σj)\text{T}(0, \sigma_j) denotes a bivariate Student’s tt-distribution with measurement error σj=[σj,lonσj,lat]\sigma_j = \begin{bmatrix} \sigma_{j, \text{lon}} \\ \sigma_{j, \text{lat}} \end{bmatrix}.

References

Auger-Méthé, M., Newman, K., Cole, D., Empacher, F., Gryba, R., King, A. A., … & Thomas, L. (2021). A guide to state–space modeling of ecological time series. Ecological Monographs, 91(4), e01470. doi: 10.1002/ecm.1470