I have some issues with my Ultra96-V2 board when I run and example of "vector add", the basic example that comes with Vitis.
1) I manage to compile and run the design but the performance of the kernel is very low, less than 10MOps when it is run with integer vectors. Also memory bandwidth is very low (not saturated). I tried to use higher buffers, and loop unroll with bigger factor but the performance stays very bad.
2) Also, according to the log of the compilation the kernel is synthesized for 200MHz. This should be small number, right? I noticed on the forum that there are some problem with synthesizing kernels for higher clock frequencies, but that is due to high performance of that kernel (people talked about DPU kernel). The vector-add kernel is much smaller, so I guess this is not a same issue.
3) Also, I noticed a problem in the host code when program tries to allocate buffer that is bigger than 4MB. The code exits with an allocation error.
If you have any comment on any of these issues I would really appreciate.
Thank you in advance for help.