Split Count and Share: A Differentially Private Set Intersection Cardinality Algorithm

My colleagues Mike Purcell, Kelvin Yang Li and I have a new paper on differentially private set intersection cardinality algorithm accepted at this year’s Uncertainty in Artificial Intelligence conference. Here is the abstract:We describe a simple two-party protocol in which each party contributes a set as input. The output of the protocol is an estimate … More Split Count and Share: A Differentially Private Set Intersection Cardinality Algorithm

A Map of Machine Learning Principles and Algorithms

Here is my attempt to map out the major classes of algorithms in Machine Learning, organised around the associated induction principles and learning theory. The usual caveats apply around this being biased towards my own experience. At the highest level, we can distinguish between the Passive and Active learning settings. In the passive case, the … More A Map of Machine Learning Principles and Algorithms

A Direct Approximation of AIXI using Logical State Abstractions

Artificial Intelligence as a well-defined mathematical problem was solved a number of years ago through the formulation of the AIXI agent by Prof Marcus Hutter — see https://theconversation.com/to-create-a-super-intelligent-machine-start-with-an-equation-20756 for a quick introduction — but a key fundamental issue with the AIXI theory has always been the incomputability of the general solution. In a continuation of … More A Direct Approximation of AIXI using Logical State Abstractions

A Tutorial Introduction to Lattice-based Cryptography and Homomorphic Encryption

A few of us have been working with homomorphic encryption for a number of years now, but we never found a paper / book that covers all the foundational mathematical material in one place. So we decided to write one — well my postdoc Kelvin Yang Li decided to write one and Mike Purcell and I assisted … More A Tutorial Introduction to Lattice-based Cryptography and Homomorphic Encryption

Bayesian Filtering on Structured Environments

A few colleagues and I have just completed a new research paper titled Factored Conditional Filtering: Tracking States and Estimating Parameters in High-Dimensional Spaces. The research took over 3 years and I am really excited about the underlying theory and its possible applications. In particular, the paper shows how we can lift’ Bayesian filtering to … More Bayesian Filtering on Structured Environments

A Note on Large Scale Data Matching and Entity Resolution

Data matching and entity resolution is a common first step in data preparation and there is a thousand academic papers written on the subject in the literature. In practice, for large datasets – anything more than a million records will do as a definition of large here because most data-matching algorithms can’t handle that because … More A Note on Large Scale Data Matching and Entity Resolution