hierarchical Dirichlet process

(1.2 hours to learn)

Summary

The Hierarchical Dirichlet Process (HDPs) is a stochastic process that can be used to define a nonparametric distribution on a mixture of mixtures (or admixture) model. That is, each grouping of data is a draw from a mixture model, and the mixture components are shared among the different groups. Using a hierarchy of Dirichlet processes allows the number of mixture components to be inferred from the data. HDPs are most commonly used in topic modeling, where the top mixture corresponds to the global set of topics shared among the entire corpus (all documents) and the secondary mixture corresponds to the topic mixture for a given document.

Context

This concept has the prerequisites:

Core resources (read/watch one of the following)

-Free-

Hierarchical Dirichlet Processes
Authors: Yee Whye Teh,Michael I Jordan,Matthew J Beal,David M Blei

See also