hierarchical Dirichlet process
(1.2 hours to learn)
The Hierarchical Dirichlet Process (HDPs) is a stochastic process that can be used to define a nonparametric distribution on a mixture of mixtures (or admixture) model. That is, each grouping of data is a draw from a mixture model, and the mixture components are shared among the different groups. Using a hierarchy of Dirichlet processes allows the number of mixture components to be inferred from the data. HDPs are most commonly used in topic modeling, where the top mixture corresponds to the global set of topics shared among the entire corpus (all documents) and the secondary mixture corresponds to the topic mixture for a given document.
This concept has the prerequisites:
Core resources (read/watch one of the following)
→ Hierarchical Dirichlet Processes
- The HDP topic model can be viewed as a nonparametric generalization of latent Dirichlet allocation