Split Count and Share: A Differentially Private Set Intersection Cardinality Algorithm

My colleagues Mike Purcell, Kelvin Yang Li and I have a new paper on differentially private set intersection cardinality algorithm accepted at this year’s Uncertainty in Artificial Intelligence conference. Here is the abstract:We describe a simple two-party protocol in which each party contributes a set as input. The output of the protocol is an estimate … More Split Count and Share: A Differentially Private Set Intersection Cardinality Algorithm

A Map of Machine Learning Principles and Algorithms

Here is my attempt to map out the major classes of algorithms in Machine Learning, organised around the associated induction principles and learning theory. The usual caveats apply around this being biased towards my own experience. At the highest level, we can distinguish between the Passive and Active learning settings. In the passive case, the … More A Map of Machine Learning Principles and Algorithms

A Direct Approximation of AIXI using Logical State Abstractions

Artificial Intelligence as a well-defined mathematical problem was solved a number of years ago through the formulation of the AIXI agent by Prof Marcus Hutter — see https://theconversation.com/to-create-a-super-intelligent-machine-start-with-an-equation-20756 for a quick introduction — but a key fundamental issue with the AIXI theory has always been the incomputability of the general solution. In a continuation of … More A Direct Approximation of AIXI using Logical State Abstractions

Bayesian Filtering on Structured Environments

A few colleagues and I have just completed a new research paper titled Factored Conditional Filtering: Tracking States and Estimating Parameters in High-Dimensional Spaces. The research took over 3 years and I am really excited about the underlying theory and its possible applications. In particular, the paper shows how we can lift’ Bayesian filtering to … More Bayesian Filtering on Structured Environments

A Note on Large Scale Data Matching and Entity Resolution

Data matching and entity resolution is a common first step in data preparation and there is a thousand academic papers written on the subject in the literature. In practice, for large datasets – anything more than a million records will do as a definition of large here because most data-matching algorithms can’t handle that because … More A Note on Large Scale Data Matching and Entity Resolution

Private Graph Data Release using Differential Privacy

A few colleagues and I have just put on arXiv a new survey paper on Private Graph Data Release, which took us nearly 9 months to write. Here’s the abstract: The application of graph analytics to various domains have yielded tremendous societal and economical benefits in recent years. However, the increasingly widespread adoption of graph … More Private Graph Data Release using Differential Privacy

Unsupervised 3D Object Segmentation

One of my PhD students has just released a paper titled Spatially Invariant Unsupervised 3D Object Segmentation Using Graph Neural Networks. Here’s the abstract: In this paper, we tackle the problem of unsupervised 3D object segmentation from a point cloud without RGB information. In particular, we propose a framework, SPAIR3D, to model a point cloud … More Unsupervised 3D Object Segmentation

Towards Fair and Privacy-Preserving Federated Deep Learning Models

My former postdoc Lingjuan Lyu has been working with a few research collaborators on a fair and privacy-preserving federated deep-learning framework and a paper describing the framework has just been published at the IEEE Transactions on Parallel and Distributed Systems. Here’s the paper details: Title: Towards Fair and Privacy-Preserving Federated Deep Models Abstract: The current … More Towards Fair and Privacy-Preserving Federated Deep Learning Models