DARTS

2019-ICLR-DARTS Differentiable Architecture Search

Motivation

Current NAS method:

Computationally expensive: 2000/3000 GPU days
Discrete search space, leads to a large number of architecture evaluations required.

Our goal is to jointly learn the architecture α and the weights w within all the mixed operations (e.g. weights of the convolution filters).

Improve

discrepancies between the continuous architecture encoding and the derived discrete architecture. (softmax…)
It would also be interesting to investigate performance-aware architecture derivation schemes based on the shared parameters learned during the search process.