The python library for data valuation¶
pyDVL collects algorithms for data valuation and influence function computation. For the full list see Methods. It supports out-of-core and distributed computation, as well as local or distributed caching of results.
If you're a first time user of pyDVL, we recommend you to go through Getting started.
-
Steps to install and requirements
-
Notebooks with worked-out examples of data valuation and influence functions
-
Basics of data valuation and description of the main algorithms
-
An introduction to the influence function and its computation with pyDVL
-
List of all methods implemented with references.
-
Full documentation of the API