Sparsemax activation function [1].

activation_sparsemax(logits, axis = -1L)

Arguments

logits

Input tensor.

axis

Integer, axis along which the sparsemax operation is applied.

Value

Tensor, output of sparsemax transformation. Has the same type and shape as `logits`. Raises: ValueError: In case `dim(logits) == 1`.

Details

For each batch `i` and class `j` we have $$sparsemax[i, j] = max(logits[i, j] - tau(logits[i, :]), 0)$$ [1]: https://arxiv.org/abs/1602.02068

Raises

ValueError: In case `dim(logits) == 1`.