Add code similar to mul_mm_cm2 to force alignment of strides, to avoid a performance regression. Add noncontiguous FA tests in test-backend-ops. Fixes #11268. |
||
|---|---|---|
| .. | ||
| include | ||
| src | ||
| .gitignore | ||
| CMakeLists.txt | ||
Add code similar to mul_mm_cm2 to force alignment of strides, to avoid a performance regression. Add noncontiguous FA tests in test-backend-ops. Fixes #11268. |
||
|---|---|---|
| .. | ||
| include | ||
| src | ||
| .gitignore | ||
| CMakeLists.txt | ||