Renaud Bourassa is a Staff Software Engineer on Slack’s Search and Discovery team, where he works on machine learning systems that help users find the most relevant information. Prior to Slack, Renaud worked at Google where he built systems to detect place visits from user location data. Renaud holds a degree in Software Engineering from the University of Waterloo.
Upcoming Abstract Summary
Building Machine Learning Models with Strict Privacy Boundaries
Every day, millions of people rely on Slack to get the information they need to do their jobs. To make their working lives more productive, we built a number of machine learning models to help users make sense of the data flowing through Slack. Although these models vary in structure and objective, they all share one common characteristic: they must deal with strict privacy boundaries inherent to the underlying dataset.
By policy, users can only be exposed to data that was publicly shared in their own Slack team. These restrictions must carry over into the machine learning models we build: not only must the models refrain from outputting data from foreign teams, patterns in foreign teams’ data must not be inferable from the usage of these models.
In this talk, I will discuss how Slack’s dataset differs from many traditional machine learning datasets. I will also present some techniques we developed to leverage our entire dataset to improve the performance of our models without jeopardizing the privacy boundaries we guarantee to our customers.