In computer programming, a pure function is a function that has the following properties:[1][2]
The following examples of C++ functions are pure:
floor
max
void f() { static std::atomic<unsigned int> x = 0; ++x; }
x
f()
void f() {}
std::atomic
The following C++ functions are impure as they lack the above property 1:
int f() { static int x = 0; ++x; return x; }
int f() { return x; }
sin()
int f(int* x) { return *x; }
int f() { int x = 0; std::cin >> x; return x; }
The following C++ functions are impure as they lack the above property 2:
void f() { static int x = 0; ++x; }
void f() { ++x; }
void f(int* x) { ++*x; }
void f() { std::cout << "Hello, world!" << std::endl; }
The following C++ functions are impure as they lack both the above properties 1 and 2:
I/O is inherently impure: input operations undermine referential transparency, and output operations create side effects. Nevertheless, there is a sense in which a function can perform input or output and still be pure, if the sequence of operations on the relevant I/O devices is modeled explicitly as both an argument and a result, and I/O operations are taken to fail when the input sequence does not describe the operations actually taken since the program began execution.[clarification needed]
The second point ensures that the only sequence usable as an argument must change with each I/O action; the first allows different calls to an I/O-performing function to return different results on account of the sequence arguments having changed.[3][4]
The I/O monad is a programming idiom typically used to perform I/O in pure functional languages.
The outputs of a pure function can be cached in a look-up table. Any result that is returned from a given function is cached, and the next time the function is called with the same input parameters, the cached result is returned instead of computing the function again.
Memoization can be performed by wrapping the function in another function (wrapper function).[5]
By means of memoization, the computational effort involved in the computations of the function itself can be reduced, at the cost of the overhead for managing the cache and an increase of memory requirements.
A C program for cached computation of factorial (assert() aborts with an error message if its argument is false; on a 32-bit machine, values beyond fact(12) cannot be represented.[citation needed])
assert()
fact(12)
static int fact(int n) { return n <= 1 ? 1 : fact(n - 1) * n; } int fact_wrapper(int n) { static int cache[13]; assert(0 <= n && n < 13); if (cache[n] == 0) cache[n] = fact(n); return cache[n]; }
Functions that have just the above property 2 – that is, have no side effects – allow for compiler optimization techniques such as common subexpression elimination and loop optimization similar to arithmetic operators.[6] A C++ example is the length method, returning the size of a string, which depends on the memory contents where the string points to, therefore lacking the above property 1. Nevertheless, in a single-threaded environment, the following C++ code
length
std::string s = "Hello, world!"; int a[10] = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10}; int l = 0; for (int i = 0; i < 10; ++i) { l += s.length() + a[i]; }
can be optimized such that the value of s.length() is computed only once, before the loop.
s.length()
Some programming languages allow for declaring a pure property to a function:
pure
const
constexpr
Since pure functions have identical return values for identical arguments, they are well suited to unit testing.