Resources

Below is a (non-exhaustive) list of resources and fundamental papers we recommend to researchers and practitioners who want to learn more about Trustworthy ML. We categorize our resources as: (i) Introductory, aimed to serve as gentle introductions to high-level concepts and include tutorials, textbooks, and course webpages, and (ii) Advanced, aimed to be deeper dives into specific topics or concepts and include pointers to relevant influential papers.

While we try our best to ensure that this list of resources is up-to-date and comprehensive, it gets hard to keep up with all the great work out there. Did we miss a useful reference or an important paper? Email us at trustworthyml@gmail.com.

Introductory Resources

General




Interpretability and Explainability




Fairness



  • Books:

    1. Solon Barocas, Moritz Hardt, and Arvind Narayanan. “Fairness and machine learning: Limitations and Opportunities.” https://fairmlbook.org/


Adversarial Machine Learning


Differential Privacy





  • Other Forums:

  1. DifferentialPrivacy.org. https://differentialprivacy.org/

Causality




ADVANCED Resources

Interpretability & Explainability

Fairness

Adversarial Machine Learning

Differential Privacy

Causality