Variational relevance vector machine for classification and regression problems with multidimensional feature arraysстатья
Информация о цитировании статьи получена из
Web of Science ,
Scopus
Статья опубликована в журнале из списка Web of Science и/или Scopus
Дата последнего поиска статьи во внешних источниках: 10 ноября 2014 г.
Автор:
Kropotov D.A.
Журнал:
Computational Mathematics and Mathematical Physics
Том:
51
Номер:
8
Год издания:
2011
Издательство:
Pleiades Publishing, Ltd
Местоположение издательства:
Road Town, United Kingdom
Первая страница:
1441
Последняя страница:
1459
DOI:
10.1134/S0965542511080124
Аннотация:
Problems of classification and regression estimation in which objects are represented by multidimensional arrays of features are considered. Many practical statements can be reduced to such problems, for example, the popular approach to the description of images as a set of patches and a set of descriptors in each patch or the description of an object in the form of a set of distances from it to certain support objects selected based on a set of features. For solving problems concerning the objects thus described, a generalization of the relevance vector model is proposed. In this generalization, specific regularization coefficients are defined for each dimension of the multidimensional array of the object description; the resultant regularization coefficient for a given element in the multidimensional array is determined as a combination of the regularization coefficients for all the dimensions. The models with the sum and product used for such combinations are examined. Algorithms based on the variational approach are proposed for learning in these models. These algorithms enable one to find the so-called "sparse" solutions, that is, exclude from the consideration the irrelevant dimensions in the multidimensional array of the object description. Compared with the classical relevance vector model, the proposed approach makes it possible to reduce the number of adjustable parameters because a sum of all the dimensions is considered instead of their product. As a result, the method becomes more robust under overfitting in the case of small samples. This property and the sparseness of the resulting solutions in the proposed models are demonstrated experimentally, in particular, in the case of the known face identification database called Labeled Faces in the Wild. В© 2011 Pleiades Publishing, Ltd.
Добавил в систему:
Кропотов Дмитрий Александрович