A causal 1D SSM is defined as
where
The eigenvalue decomposition (EVD) form. With EVD
Another modal decomposition is to stick to the domain of real numbers strictly. Say that
This sugguests that we always can block diagonalize
There are some rough notes here.
-
It supports sample rate conversion (see resample_up and resample_down settings).
-
It can enforce stability by pulling poles into the unit disc as
$$\lambda \rightarrow \lambda/\sqrt{|\lambda|^2 + 1}$$ By default, we do not enforce stability (could be unstable for long sequences).
-
May need to manually tune the scales of matrices
$B$ and$C$ for optimal performance. Their default scales could be too large for long sequences.
On a simple language problem we compare the training loss perplexities of our Complex SSM vs Real vs Mamba vs Attention