-
Notifications
You must be signed in to change notification settings - Fork 758
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Improved OpenSearchKNN setup for doing benchmarks #412
Conversation
Is this ready to be merged? |
Signed-off-by: Navneet Verma <[email protected]>
@erikbern Yes the code is ready for checked in. I resolved the conflicts which were present. |
Thanks @erikbern These checks were not running when I raised the PR. Let me see what is the issue. |
@erikbern I looked into the run, and it is using 5.2GB of the memory. I was thinking we were running with around 16.5GB of memory. Can you please provide me info like what is the right set of memory which will be provided for containers? As per documentation the tests are running on r6i.16xlarge and parallelization factor as 31. Which mean each container is getting ~16GB of Memory. The code in this PR is based on that. |
On thinking over this more, I can see if there is a way I can get docker memory in the docker file and then take a call what should be the heap config. If there is no way I can add comments and switch to 3GB as memory. @erikbern any thoughts |
It's the standard Github Actions runners right now: https://docs.github.com/en/actions/using-github-hosted-runners/about-github-hosted-runners It would be great if there's an easy way to configure it dynamically |
sure I will start lookin into this, but in meanwhile I flip to 3GB so that this PR can be merged. |
@erikbern updated the PR, we run github actions to see if they succeeds. |
It's failing with a weird issue now: |
taking a look. |
Signed-off-by: Navneet Verma <[email protected]>
@erikbern Seeing successful run on local.
|
@erikbern can we merge the changes? |
NIce – it's working now! |
@erikbern when would the benchmarks be trigged with the latest changes? |
I don't run it at any specific interval. Historically it's been 1y+ between runs. But given how fast the field is moving, I think it's probably worth running it again quite soon! Maybe in a few weeks? |
Description
Improved the OpenSearchKNN setup. The older setting of OpenSearch K-NN is not optimized for 1 CPU and 1 machine.
Changes involve:
_id
to store the id of the vector.