The Python library for data valuation
pyDVL collects algorithms for data valuation and influence function computation. It runs most of them in parallel, either locally or in a cluster, and supports distributed caching of results.
If you are a first-time user of pyDVL, we recommend going through the Installing pyDVL and First Steps guides in the Getting Started section.
- Installation: Steps to install and requirements
- Data valuation: Basics of data valuation and description of the main algorithms
- Influence Function: An introduction to the influence function and its computation with pyDVL
- API Reference: Full documentation of the API