Skip to content

Data Shapley: Equitable Valuation of Data for Machine Learning

License

Notifications You must be signed in to change notification settings

connor11son/DataShapley

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Added conv_model example notebook to show how to estimate TMC Shap values with convolutional neural network

Data Shapley: Equitable Valuation of Data for Machine Learning

Code for implementation of "Data Shapley: Equitable Valuation of Data for Machine Learning".

Please cite the following work if you use this benchmark or the provided tools or implementations:

@inproceedings{ghorbani2019data,
  title={Data Shapley: Equitable Valuation of Data for Machine Learning},
  author={Ghorbani, Amirata and Zou, James},
  booktitle={International Conference on Machine Learning},
  pages={2242--2251},
  year={2019}
}

Prerequisites

  • Python, NumPy, Tensorflow 1.12, Scikit-learn, Matplotlib

Basic Usage

To divide value fairly between individual train data points/sources given the learning algorithm and a meausre of performance for the trained model (test accuracy, etc)

Authors

License

This project is licensed under the MIT License - see the LICENSE.md file for details

About

Data Shapley: Equitable Valuation of Data for Machine Learning

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 50.3%
  • Python 49.7%