
in line 13, you write

but in mamba ssm architecture , A0'=exp(delta*ParameterA) . so do VIM drop exp ?
but in your code ,i see mamba is used from Mamba source code that uses exp.
So ,which is the true architecture ?
Can anyone help me solve the problem ,thanks very much!!