Research

My overall research goal is to develop fast, effective, theoretically-grounded algorithms to power data science applications. At the moment, my focus is on improving algorithms for clustering, similarity search, and numerical linear algebra.

Highlighted Projects

Improved Spectral Clustering

Spectral clustering has been used for over two decades for clustering graphs and other data. Unfortunately, it has suffered from some scalability issues which may have limited its adoption in practice. It also had limited theoretical guarantees until recently.

In my recent work, I have (i) improved the theoretical analysis of spectral clustering and (ii) provided improved algorithms which allow spectral clustering to scale to large datasets with large numbers of clusters.

[1] Macgregor, Peter, and He Sun. "A tighter analysis of spectral clustering, and beyond." International Conference on Machine Learning. PMLR, 2022.

[2] Macgregor, Peter, and He Sun. "Fast approximation of similarity graphs with kernel density estimation." 36th Advances in Neural Information Processing Systems (NeurIPS 2024).

[3] Macgregor, Peter. "Fast and simple spectral clustering in theory and practice." 36th Advances in Neural Information Processing Systems (NeurIPS 2024).

STAG Library

The Spectral Toolkit of Algorithms for Graphs (STAG) library is a C++ and Python library providing an easy-to-use implementation of several recently developed algorithms for graph and data analysis.

As well as generic methods for constructing and working with graphs, STAG provides implementations of local graph clustering and spectral clustering, locality-sensitive hashing (LSH), and kernel density estimation (KDE).

For more information on how to use STAG and to see some of its functionality, please refer to the technical reports available on the STAG website.

All Publications

Tech. Report

Spectral Toolkit of Algorithms for Graphs: Technical Report (2)

Peter Macgregor and He Sun

arXiv PDF code


Preprint

Polynomial-Time Algorithms for Weaver's Discrepancy Problem in a Dense Regime

Ben Jourdan, Peter Macgregor, and He Sun

arXiv


NeurIPS 2023 Spotlight

Fast Approximation of Similarity Graphs with Kernel Density Estimation

Peter Macgregor and He Sun


NeurIPS 2023

Fast and Simple Spectral Clustering in Theory and Practice

Peter Macgregor


ISAAC 2023

Is the Algorithmic Kadison-Singer Problem Hard?

Ben Jourdan, Peter Macgregor, and He Sun

arXiv conference


Tech. Report

Spectral Toolkit of Algorithms for Graphs: Technical Report (1)

Peter Macgregor and He Sun

arXiv PDF code


ICML 2022

A Tighter Analysis of Spectral Clustering, and Beyond

Peter Macgregor and He Sun


NeurIPS 2021

Finding Bipartite Components in Hypergraphs

Peter Macgregor and He Sun


ICML 2021 Oral

Local Algorithms for Finding Densely Connected Clusters

Peter Macgregor and He Sun


PhD Thesis

On Learning the Structure of Clusters in Graphs

Peter Macgregor, University of Edinburgh, 2022