The 2-Minute Rule for mamba paper
We modified the Mamba's interior equations so to accept inputs from, and combine, two separate information streams. To the ideal of our expertise, This is actually the first try and adapt the equations of SSMs to a eyesight activity like style transfer devoid of demanding any other module like cross-consideration or tailor made normalization layers