Which of the following functions can be used as an activation function in the output layer if we wish to predict the probabilities of n classes (p1, p2..pk) such that sum of p over all n equals to 1?
All Answers
total answers (1)
total answers (1)
A. Softmax
need an explanation for this answer? contact us directly to get an explanation for this answer