Instrumental Model from Scratch (With Demo)
The architecture is the project's real showpiece: a 72-band non-uniform band-split BiMamba U-Net. Mamba scans keep memory at O(T) in sequence length, while attention layers interleaved in the bottleneck mix cross-frequency context, a clever tradeoff between the efficiency of the scans and the global reach of attention. The author ships a runnable demo and an explanatory write-up so you can reproduce the approach, but it's clearly hobby-scale (trained on ≈1k songs, with requests queued on a single home PC and slow cold starts), so expect experimental results rather than SOTA separation or instant throughput.
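To make "non-uniform band-split" concrete, here is a minimal sketch of cutting a spectrogram's frequency axis into 72 bands that start narrow at low frequencies and widen toward the top. This is not the author's layout: the 1025-bin spectrogram (as from a 2048-point FFT), the linear width ramp, the `ratio` parameter, and the names `band_widths` and `split_bands` are all assumptions chosen for illustration.

```python
import numpy as np


def band_widths(n_bins: int, n_bands: int = 72, ratio: float = 8.0) -> np.ndarray:
    """Return integer band widths that sum to n_bins and never shrink.

    Hypothetical scheme: widths ramp linearly so the last band is roughly
    `ratio` times wider than the first. Real band-split models usually
    hand-tune or perceptually motivate these edges.
    """
    if n_bins < n_bands:
        raise ValueError("need at least one frequency bin per band")
    weights = np.linspace(1.0, ratio, n_bands)
    spare = n_bins - n_bands  # bins left after giving every band one bin
    widths = np.floor(weights / weights.sum() * spare).astype(int) + 1
    widths[-1] += n_bins - widths.sum()  # rounding leftovers go to the top band
    return widths


def split_bands(spec: np.ndarray, n_bands: int = 72) -> list:
    """Split a (freq_bins, time_frames) array into n_bands sub-arrays."""
    widths = band_widths(spec.shape[0], n_bands)
    cuts = np.cumsum(widths)[:-1]  # interior band boundaries
    return np.split(spec, cuts, axis=0)


# Stand-in for a magnitude spectrogram: 1025 bins x 200 frames (assumed sizes).
spec = np.random.default_rng(0).standard_normal((1025, 200))
bands = split_bands(spec)
print(len(bands), bands[0].shape, bands[-1].shape)
```

In a band-split model each of these sub-arrays would typically be projected to a common feature width before the sequence layers run, which is what lets the bottleneck mix information across bands; that projection is omitted here.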
