Log softmax activation function.
For each batch i and class j we have

    logsoftmax = logits - log(reduce_sum(exp(logits), axis))
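
The formula above can be sketched in plain Python as follows. This is a minimal illustration, not the library's implementation: the function name `log_softmax` and the list-of-lists input are assumptions, and the reduction is assumed to run over the last (class) axis. Subtracting the row maximum before exponentiating is an added numerical-stability step that the formula does not mention; it leaves the result mathematically unchanged.

```python
import math

def log_softmax(logits):
    """Row-wise log softmax for a batch of logits (hypothetical helper).

    `logits` is a list of rows, one row per batch element, one entry per class.
    """
    result = []
    for row in logits:
        # Assumption: shift by the row maximum so exp() cannot overflow.
        # log(sum(exp(x))) == m + log(sum(exp(x - m))) for any constant m.
        row_max = max(row)
        log_sum_exp = row_max + math.log(sum(math.exp(x - row_max) for x in row))
        # logsoftmax[i][j] = logits[i][j] - log(sum_k exp(logits[i][k]))
        result.append([x - log_sum_exp for x in row])
    return result

batch = [[1.0, 2.0, 3.0], [1000.0, 1000.0, 1000.0]]
out = log_softmax(batch)
```

Exponentiating any output row should give probabilities that sum to 1. The second row shows why the shift helps: evaluating `exp(1000.0)` directly would overflow.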