The Shapley algorithm is an interpretation algorithm that is well recognized by both industry and academia. However, because of its exponential runtime complexity, and because existing implementations take a very long time to generate feature contributions for a single instance, it has found limited practical use in industry. To explain model predictions at scale, we implemented the Shapley IME algorithm in Spark. To our knowledge, this is the first Spark implementation of the Shapley algorithm that scales to large datasets and works with most ML model objects.
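To make the idea concrete, here is a minimal single-machine sketch of a sampling-based Shapley approximation in the spirit of IME. It is not the Spark implementation described in the talk. The function name `approx_shapley`, its parameters, and the toy models are hypothetical and chosen only for illustration. For each sample, it draws a random background row and a random feature ordering, then measures how much the prediction changes when the feature of interest is switched from the background value to the instance value.

```python
import random


def approx_shapley(predict, instance, background, feature, n_samples=1000, seed=0):
    """Monte Carlo estimate of one feature's Shapley contribution.

    predict    -- callable taking a list of feature values, returning a number
    instance   -- list of feature values for the row being explained
    background -- list of rows (lists) to draw replacement values from
    feature    -- index of the feature whose contribution is estimated
    """
    rng = random.Random(seed)
    n_features = len(instance)
    total = 0.0
    for _ in range(n_samples):
        # Random background row and random feature ordering.
        z = rng.choice(background)
        order = list(range(n_features))
        rng.shuffle(order)
        # Features ordered before `feature` keep the instance's values;
        # all others take the background row's values.
        preceding = set(order[:order.index(feature)])
        x_minus = [instance[i] if i in preceding else z[i] for i in range(n_features)]
        # x_plus differs from x_minus only in the feature of interest.
        x_plus = list(x_minus)
        x_plus[feature] = instance[feature]
        total += predict(x_plus) - predict(x_minus)
    return total / n_samples


# Toy usage with an assumed linear model and a single all-zero background row.
def linear_model(row):
    return 2.0 * row[0] + 3.0 * row[1]


phi_0 = approx_shapley(linear_model, [1.0, 2.0], [[0.0, 0.0]], feature=0)
phi_1 = approx_shapley(linear_model, [1.0, 2.0], [[0.0, 0.0]], feature=1)
```

Each sample is computed independently of the others, which suggests why this kind of estimator is a natural fit for a distributed engine such as Spark. How the talk's implementation partitions the work is not described in this summary.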
Session Summary
Shparkley: Scaling Shapley values with Spark
MLconf Online 2020
Cristine Marsh
Affirm
Data Scientist
Isaac Joseph
Affirm
Software Engineer