Privacy-Preserving Machine Learning (PPML)

An overview of techniques and approaches for training and deploying machine learning models while preserving data privacy.

Overview

Privacy-Preserving Machine Learning (PPML) encompasses techniques that enable machine learning on sensitive data without compromising individual privacy.

Key Techniques

Federated Learning

Train models across decentralized data sources without centralizing the data.

  • Horizontal FL - Same features, different samples
  • Vertical FL - Same samples, different features
  • Federated Transfer Learning - Different features and samples

Differential Privacy

Add calibrated noise during training to provide formal privacy guarantees.

Secure Multi-Party Computation (SMPC)

Multiple parties jointly compute a function over their inputs while keeping inputs private.

Homomorphic Encryption

Perform computations on encrypted data without decrypting it.

Applications

  • Healthcare ML
  • Financial fraud detection
  • Cross-organization collaboration
  • Personal data analytics

Challenges

  • Computational overhead
  • Communication costs
  • Privacy-utility tradeoff
  • Model accuracy

Further Reading

Coming soon.