Metropolis–Hastings algorithm

The metropolis-Hastings algorithm is a typical algorithm that generates the samples for an arbitrary density function (i.e., target density function). It uses the proposal density (i.e., transition kernel) to generate the next sample candidate. The candidate is accepted with the acceptance ratio calculated by the target density function.

Notes

The proposal density function \(Q\) should satisfies the symmetry property, i.e., \(Q(x|y)==Q(y|x)\). A usual choice of the proposal density function \(Q\) is Gaussian/normal distribution or uniform distribution.

[1]:

# Import auxiliary libraries for demonstration

import matplotlib as mpl
import matplotlib.pyplot as plt
import numpy as np
from scipy import stats

plt.rcParams[ "figure.figsize" ] = [ 5, 4 ]

plt.rcParams[ "figure.dpi" ] = 80
plt.rcParams[ "font.family" ] = "Times New Roman"
plt.rcParams[ "font.size" ] = '14'

Metropolis–Hastings sampler

Class MetropolisHastingsSampler implements the Metropolis-Hastings sampler.

Based on the algorithm, three input parameters, namely initialVal, targetPdf, proposalCSampler are required to create a Metropolis-Hastings sampler.

initialVal is the first observed data point that is used to start the sampling procedure.

targetPdf is a target density function that returns the probability for a given value. It should be noted that the targetPdf function can be proportional to the target density function up to a multiplicative constant.

proposalCSampler is a sampler that can return a sample for a given observation. For example, for a given observation \(x_t\), the sampler can be \(x_{t+1} \sim \mathcal{N}(x_t, 1)\) and it can return a sample based on the current observation \(x_t\).

The following examples show the application of the Metropolis-Hastings sampler for exponential-like and lognormal-like target distribution.

Class initialization help

[2]:

from ffpack.rpm import MetropolisHastingsSampler
help( MetropolisHastingsSampler.__init__ )

Help on function __init__ in module ffpack.rpm.metropolisHastings:

__init__(self, initialVal=None, targetPdf=None, proposalCSampler=None, sampleDomain=None, randomSeed=None, **sdKwargs)
    Initialize the Metropolis-Hastings sampler

    Parameters
    ----------
    initialVal: scalar or array_like
        Initial observed data point.
    targetPdf: function
        Target probability density function or target distribution function.
        targetPdf takes one input parameter and return the corresponding
        probability. It will be called as targetPdf( X ) where X is the same
        type as initialVal, and a scalar value of probability should be returned.
    proposalCSampler: function
        Proposal conditional sampler (i.e., transition kernel). proposalCSampler
        is a sampler that will return a sample for the given observed data point.
        A usual choice is to let proposalCSampler be a Gaussian/normal
        distribution centered at the observed data point. It will be called as
        proposalCSampler( X ) where X is the same type as initialVal, and a
        sample with the same type of initialVal should be returned.
    sampleDomain: function, optional
        Sample domain function. sampleDomain is a function to determine if a
        sample is in the sample domain. For example, if the sample doamin is
        [ 0, inf ] and the sample is -2, the sample will be rejected. For the
        sampling on field of real numbers, it should return True regardless of
        the sample value. It called as sampleDomain( cur, nxt, \**sdKwargs )
        where cur, nxt are the same type as initivalVal, and a boolean value
        should be returned.
    randomSeed: integer, optional
        Random seed. If randomSeed is none or is not an integer, the random seed in
        global config will be used.

    Raises
    ------
    ValueError
        If initialVal, targetPdf, or proposalCSampler is None.
        If targetPdf returns negative value.

    Examples
    --------
    >>> from ffpack.rpm import MetropolisHastingsSampler
    >>> initialVal = 1.0
    >>> targetPdf = lambda x : 0 if x < 0 else np.exp( -x )
    >>> proposalCSampler = lambda x : np.random.normal( x, 1 )
    >>> mhSampler = MetropolisHastingsSampler( initialVal, targetPdf,
    ...                                        proposalCsampler )

Example with exponential-like distribution

[3]:

mhsInitialVal = 1.0

# def mhsTargetPdf( x ):
#     return 0 if x < 0 else np.exp( -x )

# We can also use lambda function
mhsTargetPdf = lambda x : 0 if x < 0 else np.exp( -x )

# def mhsProposalCSampler( x ):
#     return np.random.normal( x, 1 )

# We can also use lambda function
mhsProposalCSampler = lambda x : np.random.normal( x, 1 )

[4]:

mhSampler = MetropolisHastingsSampler( initialVal=mhsInitialVal,
                                       targetPdf=mhsTargetPdf,
                                       proposalCSampler=mhsProposalCSampler,
                                       randomSeed=2023 )

[5]:

mhsResults = np.zeros( 10000 )
for i in range( 10000 ):
    mhsResults[ i ] = mhSampler.getSample()

[6]:

fig, ax = plt.subplots()

ax.hist( np.array( mhsResults ), bins=20 )

ax.tick_params(axis='x', direction="in", length=5)
ax.tick_params(axis='y', direction="in", length=5)
ax.set_ylabel( "Count" )
ax.set_xlabel( "Value" )
ax.set_title( "Histogram of Metropolis–Hastings samples" )

plt.tight_layout()
plt.show()

../_images/moduleCookbook_rpmMetropolisHastingsAlgorithm_11_0.png

Example with lognormal-like distribution

[7]:

mhsInitialVal = 1.0

def mhsTargetPdf( x ):
    return 0 if x < 0 else 1 / x * np.exp( -1 * np.power( np.log( x ), 2 ) )

def mhsProposalCSampler( x ):
    return np.random.normal( x, 1 )

[8]:

mhSampler = MetropolisHastingsSampler( initialVal=mhsInitialVal,
                                       targetPdf=mhsTargetPdf,
                                       proposalCSampler=mhsProposalCSampler,
                                       randomSeed=2023 )

[9]:

mhsResults = np.zeros( 10000 )
for i in range( 10000 ):
    mhsResults[ i ] = mhSampler.getSample()

[10]:

fig, ax = plt.subplots()

ax.hist( np.array( mhsResults ), bins=20 )

ax.tick_params(axis='x', direction="in", length=5)
ax.tick_params(axis='y', direction="in", length=5)
ax.set_ylabel( "Count" )
ax.set_xlabel( "Value" )
ax.set_title( "Histogram of Metropolis–Hastings samples" )

plt.tight_layout()
plt.show()

../_images/moduleCookbook_rpmMetropolisHastingsAlgorithm_16_0.png

Au modified Metropolis–Hastings sampler

Class AuModifiedMHSampler implements the Metropolis-Hastings sampler.

The modified Metropolis-Hastings sampling algorithm proposed by Au and Beck samples data point at each dimension.

Reference:

Au, S.K. and Beck, J.L., 2001. Estimation of small failure probabilities in high dimensions by subset simulation. Probabilistic engineering mechanics, 16(4), pp.263-277.

Class initialization help

[11]:

from ffpack.rpm import AuModifiedMHSampler
help( AuModifiedMHSampler.__init__ )

Help on function __init__ in module ffpack.rpm.metropolisHastings:

__init__(self, initialVal=None, targetPdf=None, proposalCSampler=None, sampleDomain=None, randomSeed=None, **sdKwargs)
    Initialize the Au modified Metropolis-Hastings sampler

    Parameters
    ----------
    initialVal: array_like
        Initial observed data point.
    targetPdf: function list
        Target probability density function list. Each element targetPdf[ i ] in
        the list is a callable function referring the independent marginal.
        targetPdf[ i ] takes one input parameter and return the corresponding
        probability. It will be called as targetPdf[ i ]( X[ i ] ) where X is a
        list in which the element is same type as initialVal[ i ], and a scalar
        value of probability should be returned by targetPdf[ i ]( X[ i ] ).
    proposalCSampler: function list
        Proposal conditional sampler list (i.e., transition kernel list). Each
        element proposalCSampler[ i ] in the list is a callable function
        referring a sampler that will return a sample for the given observed
        data point.  A usual choice is to let proposalCSampler[ i ] be a
        Gaussian/normal distribution centered at the observed data point.
        It will be called as proposalCSampler[ i ]( X[ i ] ) where X is a list
        in which each element is the same type as initialVal[ i ], and a
        sample with the same type of initialVal[ i ] should be returned.
    sampleDomain: function, optional
        Sample domain function. sampleDomain is a function to determine if a
        sample is in the sample domain. For example, if the sample doamin is
        [ 0, inf ] and the sample is -2, the sample will be rejected. For the
        sampling on field of real numbers, it should return True regardless of
        the sample value. It called as sampleDomain( cur, nxt, \**sdKwargs )
        where cur, nxt are lists in which each element is the same type as
        initivalVal[ i ], and a boolean value should be returned.
    randomSeed: integer, optional
        Random seed. If randomSeed is none or is not an integer, the random seed in
        global config will be used.

    Raises
    ------
    ValueError
        If initialVal, targetPdf, or proposalCSampler is None.
        If dims of initialVal, targetPdf, and proposalCSampler are not equal.
        If targetPdf returns negative value.

    Examples
    --------
    >>> from ffpack.rpm import AuModifiedMHSampler
    >>> initialValList = [ 1.0, 1.0 ]
    >>> targetPdf = [ lambda x : 0 if x < 0 else np.exp( -x ),
    ...               lambda x : 0 if x < 0 else np.exp( -x ) ]
    >>> proposalCSampler = [ lambda x : np.random.normal( x, 1 ),
    ...                      lambda x : np.random.normal( x, 1 ) ]
    >>> auMMHSampler = AuModifiedMHSampler( initialVal, targetPdf,
    ...                                     proposalCsampler )

Example with exponential-like distribution

[12]:

aumhsInitialVal = [ 1.0, 1.0 ]


def autpdf( x ):
    return 0 if x < 0 else np.exp( -x )

aumhsTargetPdf = [ autpdf, autpdf ]

# We use normal distribution as proposal distribution
def aupcs( x ):
    return np.random.normal( x, 1 )

aumhsProposalCSampler = [ aupcs, aupcs ]

[13]:

aumhSampler = AuModifiedMHSampler( initialVal=aumhsInitialVal,
                                   targetPdf=aumhsTargetPdf,
                                   proposalCSampler=aumhsProposalCSampler,
                                   randomSeed=2023 )

[14]:

aumhsResults = np.zeros( [ 10000, 2 ] )
for i in range( 10000 ):
    aumhsResults[ i ] = aumhSampler.getSample()

[15]:

fig, ax = plt.subplots( figsize=( 5, 5 ) )

ax.plot( aumhsResults[ :, 0], aumhsResults[ :, 1 ], ".", markersize=2 )

ax.tick_params(axis='x', direction="in", length=5)
ax.tick_params(axis='y', direction="in", length=5)
ax.set_ylabel( "Y" )
ax.set_xlabel( "X" )
ax.set_title( "Histogram of Au modified Metropolis–Hastings samples" )
ax.set_xlim( [ -0.5, 6.5 ] )
ax.set_ylim( [ -0.5, 6.5 ] )

plt.tight_layout()
plt.show()

../_images/moduleCookbook_rpmMetropolisHastingsAlgorithm_25_0.png

Example with lognormal-like distribution

[16]:

aumhsInitialVal = [ 1.0, 1.0 ]

def autpdf( x ):
    return 0 if x < 0 else 1 / x * np.exp( -1 * np.power( np.log( x ), 2 ) )

aumhsTargetPdf = [ autpdf, autpdf ]

# We use uniform distribution as proposal distribution
def aupcs( x ):
    return np.random.uniform( x-0.5, x+0.5 )

aumhsProposalCSampler = [ aupcs, aupcs ]

[17]:

aumhSampler = AuModifiedMHSampler( initialVal=aumhsInitialVal,
                                   targetPdf=aumhsTargetPdf,
                                   proposalCSampler=aumhsProposalCSampler,
                                   randomSeed=2023 )

[18]:

aumhsResults = np.zeros( [ 10000, 2 ] )
for i in range( 10000 ):
    aumhsResults[ i ] = aumhSampler.getSample()

[19]:

fig, ax = plt.subplots( figsize=( 5, 5 ) )

ax.plot( aumhsResults[ :, 0], aumhsResults[ :, 1 ], ".", markersize=2 )

ax.tick_params(axis='x', direction="in", length=5)
ax.tick_params(axis='y', direction="in", length=5)
ax.set_ylabel( "Y" )
ax.set_xlabel( "X" )
ax.set_title( "Histogram of Au modified Metropolis–Hastings samples" )
ax.set_xlim( [ -0.5, 6.5 ] )
ax.set_ylim( [ -0.5, 6.5 ] )

plt.tight_layout()
plt.show()

../_images/moduleCookbook_rpmMetropolisHastingsAlgorithm_30_0.png