hierarchical Dirichlet process
(1.2 hours to learn)
Summary
The Hierarchical Dirichlet Process (HDPs) is a stochastic process that can be used to define a nonparametric distribution on a mixture of mixtures (or admixture) model. That is, each grouping of data is a draw from a mixture model, and the mixture components are shared among the different groups. Using a hierarchy of Dirichlet processes allows the number of mixture components to be inferred from the data. HDPs are most commonly used in topic modeling, where the top mixture corresponds to the global set of topics shared among the entire corpus (all documents) and the secondary mixture corresponds to the topic mixture for a given document.
Context
This concept has the prerequisites:
- Chinese restaurant franchise (The CRF is one canonical interpretation of the HDP.)
- Dirichlet process
Core resources (read/watch one of the following)
-Free-
→ Hierarchical Dirichlet Processes
See also
- The HDP topic model can be viewed as a nonparametric generalization of latent Dirichlet allocation