In this episode I explain how a community detection algorithm known as Markov clustering can be constructed by combining simple concepts like random walks, graphs, similarity matrix. Moreover, I highlight how one can build a similarity graph and then run a community detection algorithm on such graph to find clusters in tabular data.
You can find a simple hands-on code snippet to play with on the Amethix Blog
Enjoy the show!
References
[1] S. Fortunato, “Community detection in graphs”, Physics Reports, volume 486, issues 3-5, pages 75-174, February 2010.
[2] Z. Yang, et al., “A Comparative Analysis of Community Detection Algorithms on Artificial Networks”, Scientific Reports volume 6, Article number: 30750 (2016)
[3] S. Dongen, “A cluster algorithm for graphs”, Technical Report, CWI (Centre for Mathematics and Computer Science) Amsterdam, The Netherlands, 2000.
[4] A. J. Enright, et al., “An efficient algorithm for large-scale detection of protein families”, Nucleic Acids Research, volume 30, issue 7, pages 1575-1584, 2002.
Episode 23: Why do ensemble methods work?
Episode 22: Parallelising and distributing Deep Learning
Episode 21: Additional optimisation strategies for deep learning
Episode 20: How to master optimisation in deep learning
Episode 19: How to completely change your data analytics strategy with deep learning
Episode 18: Machines that learn like humans
Episode 17: Protecting privacy and confidentiality in data and communications
Episode 16: 2017 Predictions in Data Science
Episode 15: Statistical analysis of phenomena that smell like chaos
Episode 14: The minimum required by a data scientist
Episode 13: Data Science and Fraud Detection at iZettle
Episode 12: EU Regulations and the rise of Data Hijackers
Episode 11: Representative Subsets For Big Data Learning
Episode 10: History and applications of Deep Learning
Episode 9: Markov Chain Montecarlo with full conditionals
Episode 8: Frequentists and Bayesians
Episode 7: 30 min with data scientist Sebastian Raschka
Episode 6: How to be data scientist
Episode 5: Development and Testing Practices in Data Science
Episode 1: Predictions in Data Science for 2016
Create your
podcast in
minutes
It is Free
Insight Story: Tech Trends Unpacked
Zero-Shot
Fast Forward by Tomorrow Unlocked: Tech past, tech future
The Unbelivable Truth - Series 1 - 26 including specials and pilot
Elliot in the Morning