Ranking Loss: Maximizing the Success Rate in Deep Learning Side-Channel Analysis

Gabriel Zaid; Lilian Bossuet; François Dassance; Amaury Habrard; Alexandre Venelli

doi:10.46586/tches.v2021.i1.25-55

Authors

Gabriel Zaid: Univ Lyon, UJM-Saint-Etienne, CNRS Laboratoire Hubert Curien UMR 5516 F-42023, Saint-Etienne, France; Thales ITSEF, Toulouse, France
Lilian Bossuet: Univ Lyon, UJM-Saint-Etienne, CNRS Laboratoire Hubert Curien UMR 5516 F-42023, Saint-Etienne, France
François Dassance: Thales ITSEF, Toulouse, France
Amaury Habrard: Univ Lyon, UJM-Saint-Etienne, CNRS Laboratoire Hubert Curien UMR 5516 F-42023, Saint-Etienne, France
Alexandre Venelli: Thales ITSEF, Toulouse, France

DOI:

https://doi.org/10.46586/tches.v2021.i1.25-55

Keywords:

Side-Channel Attacks, Deep Learning, Learning to Rank, Loss function, Success Rate, Mutual Information

Abstract

The side-channel community has recently investigated a new approach, based on deep learning, to significantly improve profiled attacks against embedded systems. Compared to template attacks, deep learning techniques can deal with protected implementations, such as those using masking or desynchronization, without substantial preprocessing. However, important issues remain open. One challenging problem is adapting the methods classically used in the machine learning field (e.g., loss functions, performance metrics) to the specific side-channel context in order to obtain optimal results. We propose a new loss function, derived from the learning to rank approach, that helps prevent the approximation and estimation errors induced by the classical cross-entropy loss. We theoretically demonstrate that this new function, called Ranking Loss (RkL), maximizes the success rate by minimizing the ranking error of the secret key in comparison with all other hypotheses. The resulting model converges towards the optimal distinguisher when considering the mutual information between the secret and the leakage. Consequently, the approximation error is prevented. Furthermore, the estimation error induced by the cross-entropy is reduced by up to 23%. When the ranking loss is used, convergence towards the best solution is up to 23% faster than with a model using the cross-entropy loss function. We validate our theoretical propositions on public datasets.
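To make the idea of "minimizing the ranking error of the secret key in comparison with all other hypotheses" concrete, the following is a minimal sketch of a generic pairwise logistic ranking objective, together with the key rank it is meant to drive down. It is an illustration under stated assumptions, not the exact definition given in the paper: the function names `key_rank` and `pairwise_ranking_loss`, the scaling parameter `alpha`, and the toy score vector are all hypothetical.

```python
import numpy as np


def key_rank(scores, true_key):
    """Number of key hypotheses scoring strictly higher than the true key (0 = ranked first)."""
    return int(np.sum(scores > scores[true_key]))


def pairwise_ranking_loss(scores, true_key, alpha=1.0):
    """Generic pairwise logistic ranking loss (illustrative assumption, not the paper's exact form).

    Sums log2(1 + exp(-alpha * (s[true_key] - s[k]))) over all wrong hypotheses k,
    so the loss shrinks as the true key's score moves above every other score.
    """
    margins = scores[true_key] - np.delete(scores, true_key)
    # logaddexp(0, x) evaluates log(1 + exp(x)) in a numerically stable way
    return float(np.sum(np.logaddexp(0.0, -alpha * margins)) / np.log(2.0))


# Toy scores for four key hypotheses (hypothetical values)
scores = np.array([0.1, 2.0, -1.0, 0.5])
loss_if_key_1 = pairwise_ranking_loss(scores, true_key=1)
loss_if_key_2 = pairwise_ranking_loss(scores, true_key=2)
```

In this toy setting, hypothesis 1 has the highest score, so treating it as the true key gives rank 0 and a smaller loss than treating the lowest-scoring hypothesis 2 as the true key. An attack is counted as successful when the true key's rank reaches 0, which is why lowering a pairwise objective of this kind is tied to the success rate.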
