Neural Network Quantization and Compression with Tijmen Blankevoort - TWIML Talk #292
Today we’re joined by Tijmen Blankevoort, a staff engineer at Qualcomm, who leads their compression and quantization research teams. Tijmen is also CTO at ML startup Scyfer, which was founded by Qualcomm colleague Max Welling, who we spoke with back on episode 267. In our conversation with Tijmen we discuss:
• The ins and outs of compression and quantization of ML models, specifically NNs,
• How much models can actually be compressed, and the best way to achieve compression,
• We also look at a few recent papers including “Lottery Hypothesis."
Check out the full show notes at twimlai.com/talk/292.
Create your
podcast in
minutes
It is Free