Which of the following functions can be used as an activation function in the output layer if we wish to predict the probabilities of n classes (p1, p2..pk) such that sum of p over all n equals to 1?
belongs to collection: Deep Learning MCQ Quiz (Multiple Choice Questions And Answers)
All Answers
total answers (1)
A. Softmax
need an explanation for this answer? contact us directly to get an explanation for this answer