# 2008 Research Summary

## Classification of Images with Hierarchical Beta Processes

View Current Project Information

Romain Jean Thibaux, Michael Jordan and Erik Sudderth

The Indian buffet process [1] is a prior for binary vector observations. It is a nonparametric prior, allowing binary vectors of unbounded length. When representing data such as text as binary vectors--for instance each bit may represent the presence of a word--this prior gives Bayesian estimates of the probability of each bit. Used on texts of various categories or topics, a Naive Bayes classifier can be constructed based on these estimates.

A critical component of Naive Bayes is the smoothing and shrinking of parameters, through the sharing of statistical strengh between categories. Hierarchical beta processes [2] are an extension of Indian buffet processes that allow such sharing and outperform simpler smoothing methods. We now extend this work to categories of images, where obtaining a binary representation is less straightforward.

- [1]
- T. Griffiths and Z. Ghahramani, "Infinite Latent Feature Models and the Indian Buffet Process,"
*Advances in Neural Information Processing Systems (NIPS),*Vol. 18, 2005. - [2]
- R. Thibaux and M. I. Jordan, "Hierarchical Beta Processes and the Indian Buffet Process,"
*Eleventh International Conference on Artificial Intelligence and Statistics (AISTATS 2007),*San Juan, Puerto Rico, 2007.