
[RFC] Add RWKV7 kernels and models #105

yzhangcs opened this issue Jan 5, 2025 · 5 comments
yzhangcs (Member) commented Jan 5, 2025

No description provided.

@yzhangcs yzhangcs added the enhancement New feature or request label Jan 5, 2025
@yzhangcs yzhangcs added this to the FLA v1.0.0 release milestone Jan 5, 2025
sustcsonglin (Collaborator) commented:

The forward pass for the chunkwise implementation has been completed and tested in this commit: 7569595.

@yzhangcs yzhangcs changed the title Add RWKV7 kernels [RFC] Add RWKV7 kernels Jan 6, 2025
@sustcsonglin sustcsonglin self-assigned this Jan 6, 2025
sustcsonglin (Collaborator) commented:

The backward pass has been implemented in this commit: e582c28

TODO: implement the RWKV7 layer and model in the FLA format

@sustcsonglin sustcsonglin changed the title [RFC] Add RWKV7 kernels [RFC] Add RWKV7 kernels and models Jan 12, 2025
Triang-jyed-driung commented:

I think this might help: https://huggingface.co/SmerkyG/RWKV7-Goose-0.4B-Pile-HF/blob/main/modeling_rwkv7.py

Also, the triton kernels come from https://github.com/johanwind/wind_rwkv/tree/main/wind_rwkv/rwkv7

They use a technique called "backstepping" for the states to avoid recomputation.
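To make the "backstepping" idea concrete: for an invertible state recurrence, the backward pass can reconstruct earlier states by running the recurrence in reverse, instead of checkpointing and recomputing them. Below is a toy numpy illustration on a simplified decayed outer-product recurrence (RWKV-7's actual transition also includes a low-rank removal term, and the real kernels work in Triton/CUDA; this is only a sketch of the principle).

```python
import numpy as np

# Toy "backstepping" illustration on the simplified recurrence
#   S_t = diag(w_t) @ S_{t-1} + k_t^T v_t,  with w_t > 0.
# Because the transition is invertible, the backward pass can recover
# S_{t-1} from S_t exactly, so intermediate states need not be stored.

rng = np.random.default_rng(0)
T, dk, dv = 8, 4, 4
w = rng.uniform(0.5, 0.99, size=(T, dk))   # per-channel decay, kept > 0
k = rng.standard_normal((T, dk))
v = rng.standard_normal((T, dv))

# Forward pass: in a real kernel only the final state would be kept.
S = np.zeros((dk, dv))
states = [S.copy()]                        # ground truth, for checking only
for t in range(T):
    S = w[t][:, None] * S + np.outer(k[t], v[t])
    states.append(S.copy())

# Backward pass: invert each step to walk the states back in time.
S_rec = states[-1].copy()
for t in reversed(range(T)):
    S_rec = (S_rec - np.outer(k[t], v[t])) / w[t][:, None]
    assert np.allclose(S_rec, states[t], atol=1e-8)
```

The trade-off discussed in this thread follows directly: repeated division by small decays `w_t` can amplify rounding error, which is where the numerical-precision concern about backstepping kernels comes from.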

sustcsonglin (Collaborator) commented:

> I think this might help: https://huggingface.co/SmerkyG/RWKV7-Goose-0.4B-Pile-HF/blob/main/modeling_rwkv7.py
>
> Also, the triton kernels come from https://github.com/johanwind/wind_rwkv/tree/main/wind_rwkv/rwkv7
>
> They use a technique called "backstepping" for the states to avoid recomputation.

Hi @Triang-jyed-driung, thanks for the pointers! We're familiar with wind_rwkv's CUDA kernel but were unaware of the Triton kernel. However, I have some concerns about the numerical precision of wind's kernel. Has anyone measured its relative error against the full-FP32 recurrent kernels? Also, has anyone compared the speed of wind's Triton kernel with fla's kernel?
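The precision check asked about here is usually done by running the same recurrence at two precisions and reporting the relative error. As a hypothetical sketch (not any of the actual kernels), the snippet below uses float64 as the "full-precision reference" and float32 as a stand-in for a fast kernel:

```python
import numpy as np

# Sketch of a numerical-precision check: run one recurrence in float64
# (the reference) and once in float32 (standing in for a fast kernel),
# then report the maximum relative error. The simplified recurrence and
# the query-with-k_t readout are illustrative, not the RWKV7 kernel API.

def recurrence(w, k, v, dtype):
    w, k, v = (x.astype(dtype) for x in (w, k, v))
    T, dk = k.shape
    dv = v.shape[1]
    S = np.zeros((dk, dv), dtype=dtype)
    outs = []
    for t in range(T):
        S = w[t][:, None] * S + np.outer(k[t], v[t])
        outs.append(k[t] @ S)
    return np.stack(outs)

rng = np.random.default_rng(0)
T, dk, dv = 64, 16, 16
w = rng.uniform(0.9, 0.999, size=(T, dk))
k = rng.standard_normal((T, dk))
v = rng.standard_normal((T, dv))

ref = recurrence(w, k, v, np.float64)
fast = recurrence(w, k, v, np.float32)
rel_err = np.abs(fast - ref).max() / np.abs(ref).max()
print(f"max relative error: {rel_err:.2e}")
```

A real comparison would substitute the candidate kernel's output for `fast` and a pure FP32/FP64 recurrent loop for `ref`, over realistic sequence lengths and head dimensions.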

Triang-jyed-driung commented:

Smerky (Dan Goldstein) tested different kernels. The fastest kernel is 2x faster (end to end!) in terms of overall training speed, but at the risk of losing numerical precision. Please ask him (and Wind) for details.
