Tenstorrent Ascalon-S veclibm benchmarks
| function | scalar libc fp64/cycle | RVV veclibm fp64/cycle | speedup |
| exp | 0.061245 | 0.066873 | 1.09x |
| exp2 | 0.066486 | 0.064968 | 0.98x |
| expm1 | 0.025792 | 0.052567 | 2.04x |
| log | 0.050702 | 0.055500 | 1.09x |
| log10 | 0.018939 | 0.051238 | 2.71x |
| log2 | 0.047842 | 0.053655 | 1.12x |
| log1p | 0.020178 | 0.056652 | 2.81x |
| sqrt | 0.037031 | 0.235193 | 6.35x |
| cbrt | 0.015656 | 0.050799 | 3.24x |
| sin | 0.032231 | 0.050764 | 1.58x |
| cos | 0.037304 | 0.049949 | 1.34x |
| tan | 0.028901 | 0.031673 | 1.10x |
| asin | 0.020033 | 0.042006 | 2.10x |
| acos | 0.019543 | 0.039438 | 2.02x |
| atan | 0.044810 | 0.030001 | 0.67x |
| sinh | 0.015196 | 0.036519 | 2.40x |
| cosh | 0.023139 | 0.044548 | 1.93x |
| tanh | 0.014320 | 0.034638 | 2.42x |
| asinh | 0.011081 | 0.030987 | 2.80x |
| acosh | 0.030543 | 0.029449 | 0.96x |
| atanh | 0.010305 | 0.047894 | 4.65x |
| erf | 0.047365 | 0.025465 | 0.54x |
| erfc | 0.040424 | 0.020057 | 0.50x |
| tgamma | 0.008332 | 0.012241 | 1.47x |
| lgamma | 0.016929 | 0.009627 | 0.57x |