"""
State Space Representation - Initialization
Author: Chad Fulton
License: Simplified-BSD
"""
import warnings
import numpy as np
from . import tools
class Initialization(object):
r"""
State space initialization
Parameters
----------
k_states : int
exact_diffuse_initialization : bool, optional
Whether or not to use exact diffuse initialization; only has an effect
if some states are initialized as diffuse. Default is True.
approximate_diffuse_variance : float, optional
If using approximate diffuse initialization, the initial variance used.
Default is 1e6.
Notes
-----
As developed in Durbin and Koopman (2012), the state space recursions
must be initialized for the first time period. The general form of this
initialization is:
.. math::
\alpha_1 & = a + A \delta + R_0 \eta_0 \\
\delta & \sim N(0, \kappa I), \kappa \to \infty \\
\eta_0 & \sim N(0, Q_0)
Thus the state vector can be initialized with a known constant part
(elements of :math:`a`), with part modeled as a diffuse initial
distribution (as a part of :math:`\delta`), and with a part modeled as a
known (proper) initial distribution (as a part of :math:`\eta_0`).
There are two important restrictions:
1. An element of the state vector initialized as diffuse cannot be also
modeled with a stationary component. In the `validate` method,
violations of this cause an exception to be raised.
2. If an element of the state vector is initialized with both a known
constant part and with a diffuse initial distribution, the effect of
the diffuse initialization will essentially ignore the given known
constant value. In the `validate` method, violations of this cause a
warning to be given, since it is not technically invalid but may
indicate user error.
The :math:`\eta_0` compoenent is also referred to as the stationary part
because it is often set to the unconditional distribution of a stationary
process.
Initialization is specified for blocks (consecutive only, for now) of the
state vector, with the entire state vector and individual elements as
special cases. Denote the block in question as :math:`\alpha_1^{(i)}`. It
can be initialized in the following ways:
- 'known'
- 'diffuse' or 'exact_diffuse' or 'approximate_diffuse'
- 'stationary'
- 'mixed'
In the first three cases, the block's initialization is specified as an
instance of the `Initialization` class, with the `initialization_type`
attribute set to one of those three string values. In the 'mixed' cases,
the initialization is also an instance of the `Initialization` class, but
it will itself contain sub-blocks. Details of each type follow.
Regardless of the type, for each block, the following must be defined:
the `constant` array, an array `diffuse` with indices corresponding to
diffuse elements, an array `stationary` with indices corresponding to
stationary elements, and `stationary_cov`, a matrix with order equal to the
number of stationary elements in the block.
**Known**
If a block is initialized as known, then a known (possibly degenerate)
distribution is used; in particular, the block of states is understood to
be distributed
:math:`\alpha_1^{(i)} \sim N(a^{(i)}, Q_0^{(i)})`. Here, is is possible to
set :math:`a^{(i)} = 0`, and it is also possible that
:math:`Q_0^{(i)}` is only positive-semidefinite; i.e.
:math:`\alpha_1^{(i)}` may be degenerate. One particular example is
that if the entire block's initial values are known, then
:math:`R_0^{(i)} = 0`, and so `Var(\alpha_1^{(i)}) = 0`.
Here, `constant` must be provided (although it can be zeros), and
`stationary_cov` is optional (by default it is a matrix of zeros).
**Diffuse**
If a block is initialized as diffuse, then set
:math:`\alpha_1^{(i)} \sim N(a^{(i)}, \kappa^{(i)} I)`. If the block is
initialized using the exact diffuse initialization procedure, then it is
understood that :math:`\kappa^{(i)} \to \infty`.
If the block is initialized using the approximate diffuse initialization
procedure, then `\kappa^{(i)}` is set to some large value rather than
driven to infinity.
In the approximate diffuse initialization case, it is possible, although
unlikely, that a known constant value may have some effect on
initialization if :math:`\kappa^{(i)}` is not set large enough.
Here, `constant` may be provided, and `approximate_diffuse_variance` may be
provided.
**Stationary**
If a block is initialized as stationary, then the block of states is
understood to have the distribution
:math:`\alpha_1^{(i)} \sim N(a^{(i)}, Q_0^{(i)})`. :math:`a^{(i)}` is
the unconditional mean of the block, computed as
:math:`(I - T^{(i)})^{-1} c_t`. :math:`Q_0^{(i)}` is the unconditional
variance of the block, computed as the solution to the discrete Lyapunov
equation:
.. math::
T^{(i)} Q_0^{(i)} T^{(i)} + (R Q R')^{(i)} = Q_0^{(i)}
:math:`T^{(i)}` and :math:`(R Q R')^{(i)}` are the submatrices of
the corresponding state space system matrices corresponding to the given
block of states.
Here, no values can be provided.
**Mixed**
In this case, the block can be further broken down into sub-blocks.
Usually, only the top-level `Initialization` instance will be of 'mixed'
type, and in many cases, even the top-level instance will be purely
'known', 'diffuse', or 'stationary'.
For a block of type mixed, suppose that it has `J` sub-blocks,
:math:`\alpha_1^{(i,j)}`. Then
:math:`\alpha_1^{(i)} = a^{(i)} + A^{(i)} \delta + R_0^{(i)} \eta_0^{(i)}`.
Examples
--------
Basic examples have one specification for all of the states:
>>> Initialization(k_states=2, 'known', constant=[0, 1])
>>> Initialization(k_states=2, 'known', stationary_cov=np.eye(2))
>>> Initialization(k_states=2, 'known', constant=[0, 1],
stationary_cov=np.eye(2))
>>> Initialization(k_states=2, 'diffuse')
>>> Initialization(k_states=2, 'approximate_diffuse',
approximate_diffuse_variance=1e6)
>>> Initialization(k_states=2, 'stationary')
More complex examples initialize different blocks of states separately
>>> init = Initialization(k_states=3)
>>> init.set((0, 1), 'known', constant=[0, 1])
>>> init.set((0, 1), 'known', stationary_cov=np.eye(2))
>>> init.set((0, 1), 'known', constant=[0, 1],
stationary_cov=np.eye(2))
>>> init.set((0, 1), 'diffuse')
>>> init.set((0, 1), 'approximate_diffuse',
approximate_diffuse_variance=1e6)
>>> init.set((0, 1), 'stationary')
A still more complex example initializes a block using a previously
created `Initialization` object:
>>> init1 = Initialization(k_states=2, 'known', constant=[0, 1])
>>> init2 = Initialization(k_states=3)
>>> init2.set((1, 2), init1)
"""
def __init__(self, k_states, initialization_type=None,
initialization_classes=None, approximate_diffuse_variance=1e6,
constant=None, stationary_cov=None):
# Parameters
self.k_states = k_states
# Attributes handling blocks of states with different initializations
self._states = tuple(np.arange(k_states))
self._initialization = np.array([None] * k_states)
self.blocks = {}
# Attributes handling initialization of the entire set of states
# `constant` is a vector of constant values (i.e. it is the vector
# a from DK)
self.initialization_type = None
self.constant = np.zeros(self.k_states)
self.stationary_cov = np.zeros((self.k_states, self.k_states))
self.approximate_diffuse_variance = approximate_diffuse_variance
# Cython interface attributes
self.prefix_initialization_map = (
initialization_classes if initialization_classes is not None
else tools.prefix_initialization_map.copy())
self._representations = {}
self._initializations = {}
# If given a global initialization, use it now
if initialization_type is not None:
self.set(None, initialization_type, constant=constant,
stationary_cov=stationary_cov)
def __setitem__(self, index, initialization_type):
self.set(index, initialization_type)
def _initialize_initialization(self, prefix):
dtype = tools.prefix_dtype_map[prefix]
# If the dtype-specific representation matrices do not exist, create
# them
if prefix not in self._representations:
# Copy the statespace representation matrices
self._representations[prefix] = {
'constant': self.constant.astype(dtype),
'stationary_cov': np.asfortranarray(
self.stationary_cov.astype(dtype)),
}
# If they do exist, update them
else:
self._representations[prefix]['constant'][:] = (
self.constant.astype(dtype)[:])
self._representations[prefix]['stationary_cov'][:] = (
self.stationary_cov.astype(dtype)[:])
# Create if necessary
if prefix not in self._initializations:
# Setup the base statespace object
cls = self.prefix_initialization_map[prefix]
self._initializations[prefix] = cls(
self.k_states, self._representations[prefix]['constant'],
self._representations[prefix]['stationary_cov'],
self.approximate_diffuse_variance)
# Otherwise update
else:
self._initializations[prefix].approximate_diffuse_variance = (
self.approximate_diffuse_variance)
return prefix, dtype
def set(self, index, initialization_type, constant=None,
stationary_cov=None, approximate_diffuse_variance=None):
r"""
Set initialization for states, either globally or for a block
Parameters
----------
index : tuple or int or None
Arguments used to create a `slice` of states. Can be a tuple with
`(start, stop)` (note that for `slice`, stop is not inclusive), or
an integer (to select a specific state), or None (to select all the
states).
initialization_type : str
The type of initialization used for the states selected by `index`.
Must be one of 'known', 'diffuse', 'approximate_diffuse', or
'stationary'.
constant : array_like, optional
A vector of constant values, denoted :math:`a`. Most often used
with 'known' initialization, but may also be used with
'approximate_diffuse' (although it will then likely have little
effect).
stationary_cov : array_like, optional
The covariance matrix of the stationary part, denoted :math:`Q_0`.
Only used with 'known' initialization.
approximate_diffuse_variance : float, optional
The approximate diffuse variance, denoted :math:`\kappa`. Only
applicable with 'approximate_diffuse' initialization. Default is
1e6.
"""
# Construct the index, using a slice object as an intermediate step
# to enforce regularity
if not isinstance(index, slice):
if isinstance(index, (int, np.integer)):
index = int(index)
if index < 0 or index >= self.k_states:
raise ValueError('Invalid index.')
index = (index, index + 1)
elif index is None:
index = (index,)
elif not isinstance(index, tuple):
raise ValueError('Invalid index.')
if len(index) > 2:
raise ValueError('Cannot include a slice step in `index`.')
index = slice(*index)
index = self._states[index]
# Compatibility with zero-length slices (can make it easier to set up
# initialization without lots of if statements)
if len(index) == 0:
return
# Make sure that we are not setting a block when global initialization
# was previously set
if self.initialization_type is not None and not index == self._states:
raise ValueError('Cannot set initialization for the block of'
' states %s because initialization was'
' previously performed globally. You must either'
' re-initialize globally or'
' else unset the global initialization before'
' initializing specific blocks of states.'
% str(index))
# Make sure that we are not setting a block that *overlaps* with
# another block (although we are free to *replace* an entire block)
uninitialized = np.equal(self._initialization[index, ], None)
if index not in self.blocks and not np.all(uninitialized):
raise ValueError('Cannot set initialization for the state(s) %s'
' because they are a subset of a previously'
' initialized block. You must either'
' re-initialize the entire block as a whole or'
' else unset the entire block before'
' re-initializing the subset.'
% str(np.array(index)[~uninitialized]))
# If setting for all states, set this object's initialization
# attributes
k_states = len(index)
if k_states == self.k_states:
self.initialization_type = initialization_type
# General validation
if (approximate_diffuse_variance is not None and
not initialization_type == 'approximate_diffuse'):
raise ValueError('`approximate_diffuse_variance` can only be'
' provided when using approximate diffuse'
' initialization.')
if (stationary_cov is not None and
not initialization_type == 'known'):
raise ValueError('`stationary_cov` can only be provided when'
' using known initialization.')
# Specific initialization handling
if initialization_type == 'known':
# Make sure we were given some known initialization
if constant is None and stationary_cov is None:
raise ValueError('Must specify either the constant vector'
' or the stationary covariance matrix'
' (or both) if using known'
Loading ...