Adding search only mode for index loading to save memory #191

jaroslavgratz · 2020-01-18T22:22:45Z

If an index is loaded from disk and used only for searching (likely a typical use case), it is not necessary to initialize the link_list_locks_ and label_lookup_ data structures. Skipping them saves approximately 350MB of memory for a 5M dataset. This PR adds a search_only flag to enable search-only mode (off by default).
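To illustrate the idea, here is a minimal sketch of a loader that skips the insert-only structures when a search-only flag is set. `TinyIndex`, its `load` signature, and the `labels_` member are hypothetical stand-ins, not hnswlib's actual class or API; only the names link_list_locks_, label_lookup_, and search_only come from this PR.

```cpp
#include <cassert>
#include <cstddef>
#include <mutex>
#include <unordered_map>
#include <vector>

// Hypothetical miniature index used only to illustrate the flag.
struct TinyIndex {
    // One lock per element; needed only while inserting.
    std::vector<std::mutex> link_list_locks_;
    // External label -> internal id; needed only for updates.
    std::unordered_map<std::size_t, unsigned> label_lookup_;
    // Stands in for the per-element data read from the index file.
    std::vector<std::size_t> labels_;
    bool search_only_ = false;

    void load(const std::vector<std::size_t>& labels, bool search_only = false) {
        search_only_ = search_only;
        labels_ = labels;
        if (search_only_) {
            return;  // leave the insert-only structures empty
        }
        // Allocate one default-constructed mutex per element.
        std::vector<std::mutex>(labels.size()).swap(link_list_locks_);
        for (unsigned i = 0; i < labels.size(); ++i) {
            label_lookup_[labels[i]] = i;
        }
    }
};
```

In this sketch the members still exist in search-only mode but stay empty, so any insert path would have to check `search_only_` before touching them.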

yurymalkov · 2020-01-23T01:13:51Z

Hi @jaroslavgratz,

It seems that the code is missing something. I cannot see the part that omits creating the data structures.

jaroslavgratz · 2020-01-23T04:58:22Z

Hi @yurymalkov,

These lines omit creating the data structures when loading data:

fdfb030#diff-171628eaa21dab74ca44c386d5a17f05R684

fdfb030#diff-171628eaa21dab74ca44c386d5a17f05R699

In fact the link_list_locks_ and label_lookup_ data structures still exist but they are not loaded.

yurymalkov · 2020-01-28T00:39:40Z

@jaroslavgratz

Thanks, I see it. Does it give you a good saving?

There might be a simpler way to save the memory while still preserving insertions. One can lock the elements in buckets, e.g. use link_list_locks_[i>>8] instead of link_list_locks_[i], so that 256 consecutive elements share one lock. I am not sure whether it would cause any measurable performance loss.
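A minimal sketch of bucketed locking, assuming a right shift by 8 so that 256 consecutive element ids share one mutex. `BucketedLocks`, `lock_for`, and `kShift` are hypothetical names for illustration, not part of hnswlib.

```cpp
#include <cassert>
#include <cstddef>
#include <mutex>
#include <vector>

// Hypothetical helper: one mutex guards a bucket of 2^kShift consecutive
// element ids instead of a single element.
class BucketedLocks {
public:
    static constexpr unsigned kShift = 8;  // 256 ids per lock (assumed bucket size)

    explicit BucketedLocks(std::size_t max_elements)
        // Round up so the last, partially filled bucket also gets a lock.
        : locks_((max_elements + (std::size_t{1} << kShift) - 1) >> kShift) {}

    // Returns the mutex shared by every id in the same bucket as `id`.
    std::mutex& lock_for(std::size_t id) { return locks_[id >> kShift]; }

    std::size_t lock_count() const { return locks_.size(); }

private:
    std::vector<std::mutex> locks_;
};
```

Usage would look like `std::unique_lock<std::mutex> guard(locks.lock_for(id));`. One thing to watch: code that holds the locks of two elements at the same time would need to handle the case where both ids fall into the same bucket, since locking a std::mutex twice from one thread is undefined behavior.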

Also, memory use can be reduced further if the VisitedLists are replaced with hash maps. Would you like to see that in this repo?
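As a rough sketch of the hash-based alternative, assuming the existing visited list is a dense array with one slot per element: a hash set stores only the ids actually touched during a search, so its memory grows with the number of visited nodes rather than with the index size. `SparseVisitedSet` and its methods are hypothetical names, not hnswlib code.

```cpp
#include <cassert>
#include <cstddef>
#include <unordered_set>

// Hypothetical replacement for a dense per-element visited array.
class SparseVisitedSet {
public:
    // Marks `id` as visited; returns true if it was not visited before.
    bool mark(unsigned id) { return seen_.insert(id).second; }

    bool visited(unsigned id) const { return seen_.count(id) != 0; }

    // Forgets all visited ids so the set can be reused for the next query.
    void reset() { seen_.clear(); }

    std::size_t size() const { return seen_.size(); }

private:
    std::unordered_set<unsigned> seen_;
};
```

The trade-off is that each lookup costs a hash probe instead of a single array access, so the effect on search speed would need to be measured.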

add search only mode to save memory

fdfb030

jaroslavgratz force-pushed the master branch from b2b292e to fdfb030 · January 19, 2020 05:37
