A Tutorial Introduction to Lattice-based Cryptography and Homomorphic Encryption

A few of us have been working with homomorphic encryption for a number of years now, but we never found a paper / book that covers all the foundational mathematical material in one place. So we decided to write one — well my postdoc Kelvin Yang Li decided to write one and Mike Purcell and I assisted … More A Tutorial Introduction to Lattice-based Cryptography and Homomorphic Encryption

Bayesian Filtering on Structured Environments

A few colleagues and I have just completed a new research paper titled Factored Conditional Filtering: Tracking States and Estimating Parameters in High-Dimensional Spaces. The research took over 3 years and I am really excited about the underlying theory and its possible applications. In particular, the paper shows how we can lift’ Bayesian filtering to … More Bayesian Filtering on Structured Environments

A Note on Large Scale Data Matching and Entity Resolution

Data matching and entity resolution is a common first step in data preparation and there is a thousand academic papers written on the subject in the literature. In practice, for large datasets – anything more than a million records will do as a definition of large here because most data-matching algorithms can’t handle that because … More A Note on Large Scale Data Matching and Entity Resolution

Private Graph Data Release using Differential Privacy

A few colleagues and I have just put on arXiv a new survey paper on Private Graph Data Release, which took us nearly 9 months to write. Here’s the abstract: The application of graph analytics to various domains have yielded tremendous societal and economical benefits in recent years. However, the increasingly widespread adoption of graph … More Private Graph Data Release using Differential Privacy

Unsupervised 3D Object Segmentation

One of my PhD students has just released a paper titled Spatially Invariant Unsupervised 3D Object Segmentation Using Graph Neural Networks. Here’s the abstract: In this paper, we tackle the problem of unsupervised 3D object segmentation from a point cloud without RGB information. In particular, we propose a framework, SPAIR3D, to model a point cloud … More Unsupervised 3D Object Segmentation

Understanding Industry Structure and Competition

While going through some old documents this week, I rediscovered a set of notes I took while reading Michael Porter many years ago. On rereading the notes, I find (again) his ideas to be really simple and compelling, worthy of sharing as a mental model for both practising data scientists and a value investors. A … More Understanding Industry Structure and Competition