Hi There,
VTA first conv layer is running on CPU and not get offload into FPGA, in most
case that is a performance bottle neck and need optimization, following are
some idea about the
optimization, please kindly comments.
Regards
Hua
1. training network to make first conv layer support int8
Hi zhanghaohit,
thanks for this proposal, it is a very intersting topic, this proposal seems
like be a very big change, but some parts of
this proposal i am not quite understand and need your help for clarify,
first about the motivation part, this topic mentioned
#1 about "cloud device ma
Hi @acapone13,
Thanks for following up this post and nice to know you are interested in VTA
performance optimization related topic, about the resenet18 pretrained model,
could I know which framework you use to generate
the model? and how much the accuracy lost is after the quantization?
Rega
Hi @acapone13,
to apply first conv2d layer into VTA, there are 2 solution/ steps, first is to
padding first conv2d from 3 channel into VTA hardware match channel for example
16, after that we
would can run first quantized conv2d layer on VTA , for sure
the padding would increase compute OP num