Hakaru


Hakaru is a simply-typed probabilistic programming language, designed for easy specification of probabilistic models and inference algorithms. This type of language is useful for the development of machine learning algorithms and stochastic modeling. Hakaru enables the design of modular probabilistic inference programs by providing:

This documentation provides information for installing and using Hakaru. Sample programs are included to demonstrate some of Hakaru’s functionality and Hakaru implementation details are included to help guide future Hakaru developments and extensions.

Warning: This code is alpha and experimental.

Contact us at ppaml@indiana.edu if you have any questions or concerns.

Introduction

The Introduction presents probabilistic programming and illustrates how Hakaru can be used to solve and describe these types of problems, how to install Hakaru on your machine, and some sample programs to get you started.

What is Probabilistic Programming?

Probabilistic programming systems allow us to write programs which describe probability distributions, and provide mechanisms to sample and condition the distributions they represent on data. In this page, we give a sense of the sorts of problems Hakaru is great at solving, and how you would describe them in Hakaru.

Installing Hakaru

You can install Hakaru on Linux, OSX, and Windows and extend its functionality using MapleSoft’s Maple.

Generating Samples from your Hakaru Program

You can use the hakaru command to generate samples from your probabilistic model.

Quick Start: A Mixture Model Example

This page will introduce you to Hakaru’s basic functionality by creating a program to sample and condition a mixture model of a coin toss.

Examples

Two examples, a Gaussian Mixture Model and a Latent Dirichlet Allocation (LDA) topic model, highlight the types of problems that Hakaru is uniquely suited to help you solve.

Compiling to Haskell

A Hakaru program can be ported into Haskell which can then be converted into machine code for other applications.

Compiling to C

Depending on the scale, a Hakaru program might be resource-intensive to run. In these situations, you could port your Hakaru program to C using the hkc command to take advantage of other tools such as OpenMP for parallel processing.

Language Guide

The Language Guide presents an overview of Hakaru’s language primitives and core functionality.

Primitive Probability Distributions

Common probability distributions, such as the normal distribution, are already encoded in Hakaru and are considered to be language primitives. This page provides usage instructions for accessing the primitive distributions in your programs.

Let and Bind

Let (=) and Bind (<~) enable the use of variables in Hakaru programs, which is essential for extracting a value from a probability distribution.

Conditionals

Hakaru supports a restricted if expression for selections between two conditions.

Functions

Hakaru supports both named and anonymous function definitions.

Types and Coercions

Hakaru has basic types which can also be combined to make complex ones. To aid in the communication of information between Hakaru functions, coercions are defined to allow conversions between compatible types.

Data Types and Match

Hakaru supports some built-in data types from Haskell. The match function is used for deconstructing them to extract their elements and to reconstructing data back into these data types.

Arrays and Plate

Hakaru has special syntax for arrays, which is considered distinct from the other supported data types. A specialized array, plate, is used for describing measures over arrays.

Loops

Hakaru loops are specialized to compute the summation or product of the elements in an array.

Expect

The expectation feature (expect) computes expectation of a measure with respect to a given function.

Transformations

Hakaru includes some inference algorithms that you can use to transform your probabilistic models into other forms to extract desireable information. Its inference algorithms are implemented predominantly as program transformations.

Note: By default, Hakaru assigns a weight to each generated sample. Typically a weight of one is used, but it is possible for the weights to vary between samples. This might result in differing results from the original and transformed programs when summarizing a program’s output by counting them.

Normalize

The normalization transformation (normalize) reweights a program so that it represents a normal distribution.

Disintegrate

The disintegration transformation (disintegrate) produces a program representing the conditional distribution based on a joint probability distribution. This command is equivalent to model conditioning in probability theory.

Density

The density transformation (density) is used to create a conditional distribution model that is used to estimate the density of the distribution at a particular point.

Hakaru-Maple

The simplify transformation (hk-maple -c Simplify) is used to improve Hakaru programs by simplifying probabilistic models using computer algebra. This transformation requires the use of Maple. Hakaru provides two other transformations also written in Maple.

Metropolis Hastings

The Metropolis Hastings transform (mh) is used to convert a Hakaru program into a Metropolis Hastings transition kernel.

Internals

The internals section of the manual provides some insight into how Hakaru is implemented and offers guidance into how the system can be extended.

Citing Us

When referring to Hakaru, please cite the following academic paper:

P. Narayanan, J. Carette, W. Romano, C. Shan and R. Zinkov, “Probabilistic Inference by Program Transformation in Hakaru (System Description)”, Functional and Logic Programming, pp. 62-79, 2016.

@inproceedings{narayanan2016probabilistic,
    title = {Probabilistic inference by program transformation in Hakaru (system description)},
    author = {Narayanan, Praveen and Carette, Jacques and Romano, Wren and Shan, Chung{-}chieh and Zinkov, Robert},
    booktitle = {International Symposium on Functional and Logic Programming - 13th International Symposium, {FLOPS} 2016, Kochi, Japan, March 4-6, 2016, Proceedings},
    pages = {62--79},
    year = {2016},
    organization = {Springer},
    url = {http://dx.doi.org/10.1007/978-3-319-29604-3_5},
    doi = {10.1007/978-3-319-29604-3_5},
}