Cost functions to estimate a posteriori probabilities in multiclass problems

Title	Cost functions to estimate a posteriori probabilities in multiclass problems
Publication Type	Journal Article
Year of Publication	1999
Authors	Cid-Sueiro, J., J. I. Arribas, S. Urban-Munoz, and A. R. Figueiras-Vidal
Journal	IEEE Transactions on Neural Networks
Volume	10
Pagination	645-656
ISSN	10459227
Keywords	Cost functions, Estimation, Functions, Learning algorithms, Multiclass problems, Neural networks, Pattern recognition, Probability, Problem solving, Random processes, Stochastic gradient learning rule
Abstract	The problem of designing cost functions to estimate a posteriori probabilities in multiclass problems is addressed in this paper. We establish necessary and sufficient conditions that these costs must satisfy in one-class one-output networks whose outputs are consistent with probability laws. We focus our attention on a particular subset of the corresponding cost functions; those which verify two usually interesting properties: symmetry and separability (well-known cost functions, such as the quadratic cost or the cross entropy are particular cases in this subset). Finally, we present a universal stochastic gradient learning rule for single-layer networks, in the sense of minimizing a general version of these cost functions for a wide family of nonlinear activation functions.
URL	http://www.scopus.com/inward/record.url?eid=2-s2.0-0032643080&partnerID=40&md5=d528195bd6ec84531e59ddd2ececcd46
DOI	10.1109/72.761724

You are here