Owasp Top 5 Machine Learning risks

View the original Working Session content

Why Must We Address Machine Learning Risks

Machine learning and deep learning are a vital part of critical systems such as self-driving cars, advanced authentication, and automated detection of lesions/tumors. However, research shows that such technologies have inherent risks originating from the process of how the models are being learnt or used.

What is Machine Learning?

ML is the ability to program a computer to do something without it being explicitly being told what to do.

First, we need data. The data is split between two sets, as follows:

70% of the data is put into the training set
30% of the data is put into the testing set.

Next, the learning algorithm is created, the system is trained using the training dataset to produce suitable outcomes, and performance is evaluated using the testing dataset.

What are the Main Security Concerns for ML?

Machine learning being used for security
The internal security of machine learning.

What Examples are There of Fooling an ML System?

Dolphin attack to inject covert sounds for fake voice commands
Toolbox to attack models, generate noise, impersonate celebrities.

What Does the Threat Landscape Look Like for ML?

The ML threat space can be divided into two parts.

Development

Training stage (poisoning of training data)
Code security flaws (in learning algorithm).

Production

Testing stage (adversaries and evasion on test data)
Explainability (understanding how the machine learning technique works, explain how conclusions are reached by the ML system)
Backdoor attack on learnt model
Model stealing via test output.

What Can Be Done to Secure ML?

Data protection on training sets, hashing, encryption
Apply human factor to review decisions taken by the machine
Protect the model from theft
Secure scanning for all machine levels
IBM toolbox for robustness against adversarial ML (inputs to machine learning models that an attacker has created to deliberately cause the model to make a mistake).

Key Takeaway

The top 5 machine learning risks are:

Poisoning of the classifier training data
Adversarial ML
“Explainability" of the learning model
Code security flaws
Model stealing.

References

Session page : • https://open-security-summit.org/tracks/owasp-projects/working-sessions/owasp-top-5-machine-learning-risks/

Additional/External References

https://www.owasp.org/index.php/OWASP_Top_5_Machine_Learning_Risks
https://arxiv.org/abs/1712.05526 (Targeted Backdoor Attacks on Deep Learning Systems Using Data Poisoning (Chen et al. 2017))

Outcomes