본문 바로가기

통계/확률론 및 수리통계학

Measure theoretic description of KL divergence

In this writing, I consider the KL divergence with measure theoretic approach.

 

Suppose P and Q are probability measures on a measurable space (Ω,F), and PQ. That is, P is absolutely continuous with respect to Q. (Or, equivalently, Q dominates P)

 

KL divergence is defined as below:

DKL(P||Q):=EP[logdPdQ]=ΩlogdPdQdP

If we have density functions for P and Q with respect to Lebesgue measure:

dPdλ=fdQdλ=g

then the KL divergence can be expressed in the familiar form.

DKL(P||Q)=ΩlogdPdQdP=ΩlogdPdλdQdλdPdλdλ=Ω(logfg)fdλ

Just to be clear, dP=P(dω), dQ=Q(dω), and dλ=λ(dω).

Since the Lebesgue measure is defined over a real interval, for the last expression, Ω=R[some interval].