In this episode I explain how a community detection algorithm known as Markov clustering can be constructed by combining simple concepts such as random walks, graphs, and similarity matrices. I also show how to build a similarity graph from tabular data and then run a community detection algorithm on that graph to find clusters.
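To make the idea concrete, here is a minimal sketch in Python. It is not the snippet from the blog, only an illustration under a few assumptions: the similarity graph is built with a Gaussian kernel (the `sigma` parameter is my choice), the random walk is represented by a column-normalized matrix, and each node is assigned to the row holding most of its probability mass once the iteration settles, which ignores the overlapping clusters that Markov clustering can produce. The function names are hypothetical.

```python
import numpy as np


def similarity_graph(X, sigma=1.0):
    """Dense similarity matrix between the rows of X (Gaussian kernel, an assumption).

    The diagonal equals 1, so every node gets a self-loop.
    """
    sq_dists = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=-1)
    return np.exp(-sq_dists / (2.0 * sigma ** 2))


def normalize_columns(M):
    """Rescale each column to sum to 1, giving random-walk transition probabilities."""
    return M / M.sum(axis=0, keepdims=True)


def markov_clustering(A, expansion=2, inflation=2.0, max_iter=100, tol=1e-6):
    """Alternate expansion and inflation until the matrix stops changing."""
    M = normalize_columns(A.astype(float))
    for _ in range(max_iter):
        prev = M
        # Expansion: take several random-walk steps at once (matrix power).
        M = np.linalg.matrix_power(M, expansion)
        # Inflation: elementwise power, then renormalize, to favor strong links.
        M = normalize_columns(M ** inflation)
        if np.abs(M - prev).max() < tol:
            break
    # Simplified read-out: each node (column) joins the row with the most mass.
    return M.argmax(axis=0)


# Toy tabular data: two well-separated groups of three rows each.
X = np.array([[0.0, 0.0], [0.0, 1.0], [1.0, 0.0],
              [10.0, 10.0], [10.0, 11.0], [11.0, 10.0]])
labels = markov_clustering(similarity_graph(X))
print(labels)
```

Rows that share a label belong to the same cluster; the label itself is just the index of the node that attracted the others. Larger `inflation` values tend to produce more, smaller clusters.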
You can find a simple hands-on code snippet to play with on the Amethix Blog.
Enjoy the show!
References
[1] S. Fortunato, “Community detection in graphs”, Physics Reports, volume 486, issues 3-5, pages 75-174, February 2010.
[2] Z. Yang, et al., “A Comparative Analysis of Community Detection Algorithms on Artificial Networks”, Scientific Reports, volume 6, article number 30750, 2016.
[3] S. van Dongen, “A cluster algorithm for graphs”, Technical Report, CWI (Centre for Mathematics and Computer Science), Amsterdam, The Netherlands, 2000.
[4] A. J. Enright, et al., “An efficient algorithm for large-scale detection of protein families”, Nucleic Acids Research, volume 30, issue 7, pages 1575-1584, 2002.