Artwork

169: HyperLogLog

Programming Throwdown

224 subscribers

published

iconShare
 
Manage episode 386214107 series 2417399
Content provided by Patrick Wheeler and Jason Gauci, Patrick Wheeler, and Jason Gauci. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Patrick Wheeler and Jason Gauci, Patrick Wheeler, and Jason Gauci or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://staging.podcastplayer.com/legal.

Intro topic: Testing your car battery

News/Links:

Book of the Show

Patreon Plug https://www.patreon.com/programmingthrowdown?ty=h

Tool of the Show

Topic: HyperLogLog

  • Motivation
    • Cardinality Counting
  • LinearCounting
    • Hash + expectation of collision based on how full
    • Bloom Filter
  • LogLog
    • Use first N bits as bucket
    • Use max sequential 0s in each bucket
    • Average
  • HyperLogLog
    • Handle empty buckets
    • Use correction factor like linear counting for low counts (number of empty buckets) and high counts
  • Distributing

    • Transfer bucket counts

★ Support this podcast on Patreon ★
  continue reading

184 episodes