The Python library for data valuation

pyDVL collects algorithms for data valuation and influence function computation. Most of them can run in parallel, either locally or on a cluster, and the library supports distributed caching of results.

If you're a first-time user of pyDVL, we recommend going through the Installing pyDVL and First Steps guides in the Getting Started section.

  • Installation


    Steps to install and requirements


  • Data valuation


    Basics of data valuation and description of the main algorithms


  • Influence Function


    An introduction to the influence function and its computation with pyDVL


  • API Reference


    Full documentation of the API
