[Question] Is infinity faster than sentence_transformers for inference? #327
Replies: 2 comments
-
To compare the inference speed of Infinity and SentenceTransformers, follow these steps:
Refer to References/.github/ISSUE_TEMPLATE/new-model-addition.yml
|
Beta Was this translation helpful? Give feedback.
-
Yes, https://michaelfeil.github.io/infinity/main/benchmarking/ For transformers 4.39.2. But it gets much faster if you implement no batching in your app. |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
All reactions