Count-min sketch is a sublinear space data structure for summarizing data streams.
Count min sketch helps you to understand how many times certain item has occured in your stream. It's like a database which you can only retrieve a count from, without being able to retrieve a precise value. So you can ask it "how many times have I seen Alex?" but you can never ask it "what items have you seen at all?".
The algorithm itself is quite straightforward:
You can play around with a live-updating implementation of a count min sketch below.
You enter a word, hit
add button, table gets updated. For each entered word, you
will see an estimated occurrence value.
Published on May 29, 2014
If you like my content, you might want to follow me on Twitter to subscribe for the future updates!