Examples
Data valuation¶
-
An introduction using the spotify dataset, showcasing grouped datasets and applied to improving model performance and identifying bogus data.
-
A showcase of a fast model-specific valuation method using the iris dataset.
-
Learning a utility function from a few evaluations and using it to estimate the value of the remaining data.
-
An alternative solution concept from game theory, illustrated on a classification problem.
-
A different and fast strategy for data valuation, using the out-of-bag error of a bagging model.
-
Using Banzhaf values to estimate the value of data points in MNIST, and evaluating convergence speed of MSR.
Influence functions¶
-
Detecting corrupted labels with influence functions on the ImageNet dataset.
-
Using the IMDB dataset for sentiment analysis and a fine-tuned BERT model.
-
Detecting corrupted labels using a synthetic dataset.
-
Using the wine dataset