SpacemiT X100 (SpacemiT K3) uarch-tool benchmarks

VLEN: 256

Detect all1s tail/mask policy with simple code snippet:
Tail agnostic policy: undisturbed
Mask agnostic policy: undisturbed
Is vl always set to min(AVL,VLMAX): yes
    Note: spec allows ceil(AVL/2)<=vl<=VLMAX for VLMAX<AVL<2*VLMAX
Measures how LMUL scheduling impacts when results are ready:
A) LMUL=8 v0 overlap with LMUL=1 v0:     351.9964199 cycles/iter
B) LMUL=8 v0 overlap with LMUL=1 v3:     352.9966640 cycles/iter
C) LMUL=8 v0 overlap with LMUL=1 v7:     368.9962501 cycles/iter
D) LMUL=8 v0 overlap with LMUL=1 v8:     352.9962234 cycles/iter
E) LMUL=8 v0 overlap with LMUL=1 v0..v8: 362.9964933 cycles/iter
Measures overhead of reinterpreting a mask as a vector:
A) reinterpret:       12.0001029 cycles/iter
B) don't reinterpret: 12.0001058 cycles/iter