Comments on Eric Jang: A Beginner's Guide to Variational Methods: Mean-Field Approximation

2017-12-16T10:49:37.396-08:00

I am studying variational Bayes on my own, and this was very helpful. Thank you for writing it.

2017-06-04T03:42:26.678-07:00

Do you mind explaining where that negative comes from? I was anticipating a plus...

2017-05-12T13:01:16.053-07:00

I believe the last formula for reverse KL should be an expectation over q, not over p. Great post. Thanks for your effort.
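The distinction raised here can be checked numerically. The sketch below is not taken from the post: the helper `kl_divergence` and the toy distributions `p` and `q` are hypothetical, chosen only to show that the forward KL averages the log-ratio under p while the reverse KL averages it under q, and that the two generally differ.

```python
import numpy as np

def kl_divergence(a, b):
    """KL(a || b) = sum_z a(z) * log(a(z) / b(z)), i.e. an expectation under a."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    mask = a > 0  # terms with a(z) = 0 contribute nothing to the sum
    return float(np.sum(a[mask] * np.log(a[mask] / b[mask])))

# Hypothetical toy distributions over three states (illustration only).
p = np.array([0.7, 0.2, 0.1])
q = np.array([0.4, 0.4, 0.2])

forward_kl = kl_divergence(p, q)  # KL(p || q): expectation over p
reverse_kl = kl_divergence(q, p)  # KL(q || p): expectation over q

print(f"forward KL(p||q) = {forward_kl:.4f}")
print(f"reverse KL(q||p) = {reverse_kl:.4f}")
```

In variational inference it is the reverse form, KL(q || p), that is minimized, which is why the expectation should be taken under q.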

2017-05-07T02:53:22.770-07:00

Thanks for this, it is a key resource for our reading group discussion on VAE today: https://github.com/p-i-/machinelearning-IRC-freenode/blob/master/ReadingGroup/README.md

2017-05-07T02:52:09.541-07:00

Probabilities sum to 1. i.e. Given a probability distribution q over Z, summing q(z) over all possible z in Z must give 1.
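To spell out where this normalization is typically used: assuming equation (1) in the post is the standard expansion of KL(q(z) || p(z|x)) (an assumption, since the equation is not reproduced in this thread), the log p(x) term does not depend on z, so it can be pulled out of the sum, leaving a factor of sum_z q(z) = 1.

```latex
\begin{aligned}
KL(q(z) \,\|\, p(z \mid x))
  &= \sum_z q(z) \log \frac{q(z)}{p(z \mid x)} \\
  &= \sum_z q(z) \log \frac{q(z)\, p(x)}{p(z, x)} \\
  &= \sum_z q(z) \log \frac{q(z)}{p(z, x)} + \log p(x) \sum_z q(z) \\
  &= \sum_z q(z) \log \frac{q(z)}{p(z, x)} + \log p(x)
\end{aligned}
```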

2017-05-03T16:04:34.950-07:00

Hi, can you explain why the sum over q(z) is equal to 1 in equation (1)? Thanks, I don't quite follow it.

2017-04-25T22:09:19.233-07:00

I read a few blogs/articles/slides about variational autoencoders, and I personally think this is the best one. The key ideas are pointed out clearly. The technical terms (e.g., ELBO) are well explained, too. Thanks so much.

2016-11-06T22:47:49.612-08:00

I didn't know that! Thank you for sharing this. I hope that interested readers will scroll down and find your comment.

2016-10-18T13:01:55.459-07:00

Given the title of your post, it's worth giving some motivation behind the name "mean-field approximation".

From a statistical physics point of view, "mean-field" refers to the relaxation of a difficult optimization problem to a simpler one which ignores second-order effects. For example, in the context of graphical models, one can approximate the partition function of a Markov random field via maximization of the Gibbs free energy (i.e., log partition function minus relative entropy) over the set of product measures, which is significantly more tractable than global optimization over the space of all probability measures (see, e.g., M. Mezard and A. Montanari, Sect 4.4.2).

From an algorithmic point of view, "mean-field" refers to the naive mean field algorithm for computing marginals of a Markov random field. Recall that the fixed points of the naive mean field algorithm are optimizers of the mean-field approximation to the Gibbs variational problem. This approach is "mean" in that it is the average/expectation/LLN version of the Gibbs sampler, hence ignoring second-order (stochastic) effects (see, e.g., M. Wainwright and M. Jordan, (2.14) and (2.15)).
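The naive mean field algorithm described above can be sketched for a small Ising-type Markov random field. This is an illustrative sketch only: the model p(s) ∝ exp(Σ_{i<j} J_ij s_i s_j + Σ_i h_i s_i) with spins in {-1, +1}, the function names, and the toy couplings `J` and fields `h` are all assumptions, not something taken from the post or from the cited references. Each update replaces the neighbouring spins in the Gibbs sampler's conditional mean by their current mean values.

```python
import itertools
import numpy as np

def naive_mean_field(J, h, n_iters=200, tol=1e-10):
    """Iterate m_i <- tanh(h_i + sum_j J_ij m_j) until the means stop changing."""
    m = np.zeros(len(h))
    for _ in range(n_iters):
        m_old = m.copy()
        for i in range(len(h)):
            # Conditional mean of s_i with neighbours replaced by their means
            # (J is assumed symmetric with a zero diagonal).
            m[i] = np.tanh(h[i] + J[i] @ m)
        if np.max(np.abs(m - m_old)) < tol:
            break
    return m

def exact_means(J, h):
    """Exact E[s_i] by enumerating all 2^n spin configurations (small n only)."""
    states = np.array(list(itertools.product([-1, 1], repeat=len(h))), dtype=float)
    log_w = 0.5 * np.einsum("si,ij,sj->s", states, J, states) + states @ h
    w = np.exp(log_w - log_w.max())
    return (w / w.sum()) @ states

# Hypothetical toy model: a chain of 4 spins with weak couplings.
J = np.zeros((4, 4))
for i in range(3):
    J[i, i + 1] = J[i + 1, i] = 0.2
h = np.array([0.3, -0.1, 0.2, 0.0])

m_mf = naive_mean_field(J, h)
m_exact = exact_means(J, h)
print("mean-field means:", np.round(m_mf, 4))
print("exact means:     ", np.round(m_exact, 4))
```

With weak couplings the mean-field means stay close to the exact ones; the gap grows as the couplings strengthen, which is where the neglected second-order effects start to matter.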

2016-08-08T23:58:29.778-07:00

Thanks for the great post, Eric! Do you plan to write (or have a link to) a simple tutorial illustrating VB in practice?

2016-08-08T14:15:46.743-07:00

Thanks for your sharp eyes! I added the minus in front of the KL term.

2016-08-08T11:34:33.759-07:00

There should be a minus in equation (3) for E[log p(x|z)], i.e. E[-log p(x|z)]; otherwise your definition of KL-divergence isn't consistent.

Ankur.
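One way to see the sign question, assuming equation (3) is the usual evidence lower bound decomposition for a latent-variable model (an assumption, since the equation is not quoted in this thread): written as a bound to maximize, the expected log-likelihood carries a plus and the KL term a minus; both signs flip only when the same quantity is written as a loss to minimize.

```latex
\begin{aligned}
\mathcal{L}
  &= \log p(x) - KL(q(z \mid x) \,\|\, p(z \mid x)) \\
  &= \mathbb{E}_{z \sim q(z \mid x)}[\log p(x \mid z)] - KL(q(z \mid x) \,\|\, p(z)) \\
-\mathcal{L}
  &= \mathbb{E}_{z \sim q(z \mid x)}[-\log p(x \mid z)] + KL(q(z \mid x) \,\|\, p(z))
\end{aligned}
```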