how do you handle servers removing blobs? does it need to recalculate the whole bloom filter?
Login to reply
Replies (2)
Yes it currently does. I think a separate filter for added/removed blobs can make sense. I currently recalculate on every request but that works only because I have a manageable 1200 blobs which are all in memory.
I have also added a binary fuse endpoint now, whose data is noticeably larger but that is due the much lower error rate:
https://almond.slidestr.net/_fuse
cuckoo filters are similar to bloom but allow you to remove items