Riscv LMUL by dop-amin · Pull Request #2 · thisisjube/slothy

dop-amin · 2025-08-19T14:23:30Z

* Allow allocating register ranges following LMUL mandates
* For instructions using the <nf> specifier, merge parts of the logic with the one for LMUL
* Add tests and examples
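The allocation constraint behind the first bullet can be sketched as follows. In RVV, a register group for an integer LMUL of 2, 4, or 8 starts at a vector register whose number is a multiple of LMUL and spans LMUL consecutive registers. This is only an illustration of that rule: the helper names `valid_group_bases` and `register_group` are hypothetical and are not taken from this PR's code, and fractional LMUL is left out.

```python
NUM_VECTOR_REGS = 32        # RVV provides v0..v31
VALID_LMUL = (1, 2, 4, 8)   # integer LMUL only; fractional LMUL is not modeled


def valid_group_bases(lmul: int) -> list[int]:
    """Return the register numbers at which a group of size `lmul` may start."""
    if lmul not in VALID_LMUL:
        raise ValueError(f"unsupported LMUL: {lmul}")
    # A group base must be a multiple of LMUL.
    return list(range(0, NUM_VECTOR_REGS, lmul))


def register_group(base: int, lmul: int) -> list[str]:
    """Return the names of all registers in the group starting at v<base>."""
    if base not in valid_group_bases(lmul):
        raise ValueError(f"v{base} is not a valid base for LMUL={lmul}")
    return [f"v{base + i}" for i in range(lmul)]
```

For example, `register_group(4, 4)` yields `v4` through `v7`, while `register_group(2, 4)` is rejected because 2 is not a multiple of 4. An allocator that respects this rule only has to choose among `valid_group_bases(lmul)` rather than among all 32 registers.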

…gister extraction

- Add early return for simple cases (expansion_factor <= 1)
- Extract _extract_base_registers helper function to reduce code duplication
- Replace complex nested if-else logic with cleaner helper function calls
- Consolidate LMUL and NF instruction writing into unified _write_expanded_instruction
- Rename _write_lmul_instruction to _write_expanded_instruction for generality
- Simplify _expand_vector_registers_for_nf to auto-infer NF value from instruction

The refactoring reduces code complexity while maintaining the same functionality for both LMUL register grouping and NF load/store whole register operations.

🤖 Generated with the help of Claude Code
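To make the early return and the NF auto-inference concrete, here is a minimal sketch. It is not the code from this commit: `infer_nf` and `expand_registers` are hypothetical names, and the sketch assumes NF is read from the mnemonic of the RVV whole-register loads (`vl<nf>re<eew>.v`, or the `vl<nf>r.v` shorthand) and stores (`vs<nf>r.v`). Only `expansion_factor` echoes a name from the commit message.

```python
import re

# Whole-register loads: vl<nf>re<eew>.v or the vl<nf>r.v shorthand.
# Whole-register stores: vs<nf>r.v. In both cases nf is 1, 2, 4, or 8.
_WHOLE_REG_RE = re.compile(
    r"^(?:vl([1248])r(?:e(?:8|16|32|64))?\.v|vs([1248])r\.v)$"
)


def infer_nf(mnemonic: str) -> int:
    """Infer NF from a whole-register load/store mnemonic, defaulting to 1."""
    match = _WHOLE_REG_RE.match(mnemonic)
    if match is None:
        return 1
    return int(match.group(1) or match.group(2))


def expand_registers(base_reg: str, expansion_factor: int) -> list[str]:
    """Expand a base register such as 'v8' into its consecutive registers."""
    if expansion_factor <= 1:
        # Early return for the simple case: nothing to expand.
        return [base_reg]
    index = int(base_reg[1:])  # assumes the form "v<number>"
    return [f"v{index + i}" for i in range(expansion_factor)]
```

With these helpers, `expand_registers("v8", infer_nf("vl4re32.v"))` covers `v8` through `v11`, and an ordinary instruction such as `vadd.vv` takes the early-return path.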

When parsing a loop, SLOTHY tries to match each of the available loop types and takes the first one that matches. This could lead to the unfortunate situation that an Armv7E-M loop type matches AArch64 code. For example, this code would previously match an Armv7E-M BranchLoop:

    count .req x2
    mov count, #16
    start:
    add x5, x5, x4
    add x7, x5, x1
    ldr x5, [x0, #4]
    add x5, x5, x7
    subs count, count, #2
    b.ne start

This makes absolutely no sense and would ultimately result in a mysterious error message that b.ne is not a known instruction. This commit fixes that by only trying to parse loop types for the current architecture.
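The idea of the fix can be sketched as a first-match search restricted to the current architecture. Everything here is illustrative: the `LoopType` class, the `LOOP_TYPES` table, the architecture strings, and the deliberately loose regular expressions are assumptions made for the example and do not reflect SLOTHY's actual loop classes.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class LoopType:
    name: str
    arch: str
    pattern: re.Pattern

    def matches(self, source: str) -> bool:
        return self.pattern.search(source) is not None


# Hypothetical loop types. The Armv7E-M pattern is loose on purpose, so it
# also accepts the AArch64 loop below, which mimics the problem described.
LOOP_TYPES = [
    LoopType("armv7m_branch_loop", "armv7m",
             re.compile(r"^\s*subs\b", re.M)),
    LoopType("aarch64_subs_loop", "aarch64",
             re.compile(r"^\s*subs\b.*\n\s*b\.ne\b", re.M)),
]

AARCH64_LOOP = """\
start:
    add x5, x5, x4
    subs count, count, #2
    b.ne start
"""


def find_loop_type(source: str, arch: str):
    """Return the first matching loop type belonging to `arch`, else None."""
    for loop_type in LOOP_TYPES:
        if loop_type.arch != arch:
            continue  # never try loop types of another architecture
        if loop_type.matches(source):
            return loop_type
    return None
```

Without the architecture check, the first entry in the table would win for `AARCH64_LOOP`. With it, `find_loop_type(AARCH64_LOOP, "aarch64")` returns the AArch64 loop type, and an architecture with no registered loop types yields `None` instead of a misleading match.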

Add support for LMUL >=1

4490461

* Allow allocating register ranges following LMUL mandates
* For instructions using <nf> specifier, merge parts of the logic with the one for LMUL
* Add tests and examples

dop-amin force-pushed the riscv_lmul branch from 1e969a6 to 4490461 on August 20, 2025 09:09

thisisjube and others added 3 commits August 20, 2025 14:08

refactor: fix docstring

0b98baf

Refactor: simplify combination generation for RISCV RVV vector expand

7e665e5

dop-amin force-pushed the riscv_lmul branch from 93e2180 to b63492c on August 21, 2025 11:32

thisisjube added 2 commits August 27, 2025 13:46

refactor: linting

8ce62fa

feat: add fixed point arithmetic/ vector single-width scaling shift instr

2048431

thisisjube force-pushed the riscv_lmul branch from 7591aa5 to 2048431 on August 27, 2025 12:17

thisisjube and others added 8 commits August 27, 2025 15:23

feat: adjust loop for kyber_poly_reduce_rvv_vlen128.s

88a5fa9

Add support for LMUL >=1

82c79d3

* Allow allocating register ranges following LMUL mandates
* For instructions using <nf> specifier, merge parts of the logic with the one for LMUL
* Add tests and examples

refactor: fix docstring

39421fa

Refactor: simplify combination generation for RISCV RVV vector expand

d4f7e7c

refactor: linting

cdd53fb

feat: add fixed point arithmetic/ vector single-width scaling shift instr

c68489e

feat: adjust loop for kyber_poly_reduce_rvv_vlen128.s

e7c4777

dop-amin force-pushed the riscv_lmul branch from 88a5fa9 to e7c4777 on September 15, 2025 11:39

dop-amin and others added 5 commits September 15, 2025 10:37

fix: RVV whole register load/store follows lmul alloc convention

950fba8

broken: re-opt RVV Dilithium NTT. Problem: vrgather reg overlap

f35bcc8

Merge branch 'riscv_lmul' of github.com:thisisjube/slothy into riscv_lmul

b6fbdfa

feat: add lmul expansion to vector load + store, add vrgather.vv + vrgatherei16.vv

2ac323c

fix: RVV vrgather parsing

56b4e93

dop-amin force-pushed the riscv_lmul branch from d31c87d to 56b4e93 on October 1, 2025 15:27

dop-amin added 2 commits October 1, 2025 17:32

Refactor: Simplify lmul expansion logic

9d80cc8

feat: allow only expanding rvv vectors partially. don't expand mask for RISCVVectorIntegerVectorVectorMasked

9577a0f

* Needs same process to be applied for further classes

dop-amin wants to merge 21 commits into riscv from riscv_lmul
