Skip to content


Data valuation

  • Shapley values

    An introduction using the spotify dataset, showcasing grouped datasets and applied to improving model performance and identifying bogus data.

  • KNN Shapley

    A showcase of a fast model-specific valuation method using the iris dataset.

  • Data utility learning

    Learning a utility function from a few evaluations and using it to estimate the value of the remaining data.

  • Least Core

    An alternative solution concept from game theory, illustrated on a classification problem.

  • Data OOB

    A different and fast strategy for data valuation, using the out-of-bag error of a bagging model.

  • Faster Banzhaf values

    Using Banzhaf values to estimate the value of data points in MNIST, and evaluating convergence speed of MSR.

Influence functions