Speed Up Multiple Model Inference on Edge TPU


I have retrained a ResNet50 model for re-identification and compiled it for the Edge TPU. However, there seems to be no way to feed a batch of images to the Edge TPU.

My workaround is to run multiple instances of the same model, one per image.

However, is there any way to speed up inference across these multiple model instances? My threading approach is currently even slower than single-model inference.
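For reference, here is a minimal sketch of the multi-model threading setup I mean, using the tflite_runtime API. The model path and the device strings ':0' and ':1' are placeholders, and it assumes one interpreter per physical Edge TPU:

```python
# A minimal sketch of the multi-model threading idea, assuming one
# interpreter per physical Edge TPU. MODEL_PATH and the device strings
# ':0'/':1' are placeholders.
from concurrent.futures import ThreadPoolExecutor

import numpy as np
from tflite_runtime.interpreter import Interpreter, load_delegate

MODEL_PATH = 'resnet50_reid_edgetpu.tflite'  # hypothetical compiled model

def make_interpreter(device):
    # Bind this interpreter to one specific Edge TPU (e.g. ':0', 'usb:1').
    delegate = load_delegate('libedgetpu.so.1', {'device': device})
    interpreter = Interpreter(model_path=MODEL_PATH,
                              experimental_delegates=[delegate])
    interpreter.allocate_tensors()
    return interpreter

def infer_one(interpreter, image):
    # image: uint8 HxWxC array matching the model's input shape.
    in_idx = interpreter.get_input_details()[0]['index']
    out_idx = interpreter.get_output_details()[0]['index']
    interpreter.set_tensor(in_idx, np.expand_dims(image, 0))
    interpreter.invoke()
    return interpreter.get_tensor(out_idx)  # get_tensor returns a copy

interpreters = [make_interpreter(d) for d in (':0', ':1')]

def run_images(images):
    # Shard images across interpreters; each thread owns exactly one
    # interpreter, since Interpreter.invoke() is not thread-safe and a
    # single Edge TPU runs only one inference at a time anyway.
    shards = [images[i::len(interpreters)] for i in range(len(interpreters))]
    def worker(interpreter, shard):
        return [infer_one(interpreter, img) for img in shard]
    with ThreadPoolExecutor(max_workers=len(interpreters)) as pool:
        results = pool.map(worker, interpreters, shards)
    # Note: outputs come back grouped by shard, not in input order.
    return [out for shard_out in results for out in shard_out]
```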


There are 2 answers

Answered by dtlam26 (score 0):

Because batch inference is not available right now, pipelining is a secondary option. However, from experimenting with my own model, another option is to make a pseudo-batch by feeding multiple single inputs to the Edge TPU, one after another.

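As an illustration, here is a minimal sketch of this pseudo-batch approach (the model path is a placeholder; it assumes a quantized single-input model compiled for the Edge TPU):

```python
# A minimal sketch of the pseudo-batch idea: unroll the batch into
# sequential single-image invocations on one interpreter.
# MODEL_PATH is a placeholder for your compiled *_edgetpu.tflite file.
import numpy as np
from tflite_runtime.interpreter import Interpreter, load_delegate

MODEL_PATH = 'resnet50_reid_edgetpu.tflite'  # hypothetical

interpreter = Interpreter(
    model_path=MODEL_PATH,
    experimental_delegates=[load_delegate('libedgetpu.so.1')])
interpreter.allocate_tensors()

in_detail = interpreter.get_input_details()[0]
out_detail = interpreter.get_output_details()[0]

def pseudo_batch(images):
    """Run a 'batch' as N single-image inferences on one Edge TPU."""
    outputs = []
    for image in images:
        # The compiled model has batch dimension 1, so feed one image at a time.
        interpreter.set_tensor(in_detail['index'],
                               np.expand_dims(image, 0).astype(np.uint8))
        interpreter.invoke()
        outputs.append(interpreter.get_tensor(out_detail['index']))
    return np.vstack(outputs)
```

Since the Edge TPU executes one inference at a time, the idea is to reuse a single resident model rather than juggling several copies, not to gain true batch parallelism.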

Answered by Nam Vu (score 6):

Yeah, the Edge TPU's architecture doesn't allow batched processing. Have you tried model pipelining? https://coral.ai/docs/edgetpu/pipeline/

Unfortunately it's only available in C++ right now, but we're looking to extend it to Python in mid Q4.