Can the library CMSIS-DSP do matrix convolutional multiplication? When I looking at the documentation of the library I can see that it only supports regular convolutional vector multiplication.
Question:
Is there any way to do matrix convolutional multiplication in ARM systems for large kernels by using a proper library that take optimized benefits of the hardware?