Christopher, Peter R.
Sarkozy, Gabor N.
Data clustering is a powerful tool, and the analysis of big data has produced many clustering techniques. Among them is Regularity Clustering, a new technique based on the Regularity Lemma of Abel Prize winner Endre Szemerédi. Regularity Clustering has been shown to outperform industry-standard clustering techniques in many circumstances. In this report we present new methods of executing Regularity Clustering. One of these, which we call the most recurring construction method, outperforms the standard Regularity Clustering method by a significant margin. We also present empirical evidence indicating when Regularity Clustering performs well.
Worcester Polytechnic Institute
Major Qualifying Project
All authors have granted to WPI a nonexclusive royalty-free license to distribute copies of the work, subject to other agreements. Copyright is held by the author or authors, with all rights reserved, unless otherwise noted.