Noughts And Crosses AI

Noughts and Crosses Minimax Algorithm Design

In this noughts and crosses project, the 'x' player is the maximiser (wins when the minimax value is 1) and the 'o' player is the minimiser (wins when the minimax value is -1). A minimax value of 0 represents a draw. The following figure gives an example board state and illustrates how the minimax algorithm chooses the next move for the maximiser. The starting board state is the root node of the tree.

X has only three possible moves, shown by the three child nodes under the root node. Along the left and middle paths, the leaves are board states that end in either a draw or a maximiser win. Back-propagating through each layer, however, shows that the minimiser will always choose the move that is best for its own objective, so both of these paths ultimately result in a draw. The right path results in a guaranteed maximiser win, so it is the correct move to make. This is how the minimax algorithm selects a move.
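The back-propagation described above can be sketched in a few lines of Python. This is an illustrative sketch rather than the project's source: the function names `winner` and `minimax`, the explicit `player` argument, and the use of a 9-character raster-scanned string with '-' for empty cells are assumptions made for the example.

```python
# Illustrative sketch only; names and board representation are assumptions.
# A board is a 9-character string in raster-scan order: 'x', 'o' or '-'.
WIN_LINES = [(0, 1, 2), (3, 4, 5), (6, 7, 8),   # rows
             (0, 3, 6), (1, 4, 7), (2, 5, 8),   # columns
             (0, 4, 8), (2, 4, 6)]              # diagonals


def winner(board):
    """Return 'x' or 'o' if that player has three in a row, else None."""
    for a, b, c in WIN_LINES:
        if board[a] != '-' and board[a] == board[b] == board[c]:
            return board[a]
    return None


def minimax(board, player):
    """Minimax value of `board` with `player` ('x' or 'o') to move.

    Returns 1 for a maximiser ('x') win, -1 for a minimiser ('o') win
    and 0 for a draw, assuming both sides play optimally.
    """
    w = winner(board)
    if w == 'x':
        return 1
    if w == 'o':
        return -1
    if '-' not in board:
        return 0  # board full, nobody won
    other = 'o' if player == 'x' else 'x'
    # Value of every child state reached by one move of `player`.
    values = [minimax(board[:i] + player + board[i + 1:], other)
              for i, cell in enumerate(board) if cell == '-']
    # The maximiser keeps the largest child value, the minimiser the smallest.
    return max(values) if player == 'x' else min(values)
```

For instance, `minimax("xx-oo----", "x")` evaluates to 1 because 'x' can complete the top row immediately, while the same board with 'o' to move evaluates to -1.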

Two modifications to the minimax algorithm have also been implemented: Alpha-Beta Pruning and Early Termination.

Alpha-Beta Pruning

Alpha-Beta pruning is not a new algorithm but an optimisation of the minimax algorithm. It returns the same minimax value while visiting fewer nodes, which can reduce computation time substantially and makes it practical to search deeper levels of the game tree. It cuts off branches of the game tree that need not be searched because a better move is already available. It is called Alpha-Beta pruning because it passes two extra parameters to the minimax function, namely alpha and beta.

Alpha is the best value that the maximiser can currently guarantee at that level or above. Beta is the best value that the minimiser can currently guarantee at that level or above. Consider the following example diagram of alpha-beta pruning.

The search proceeds normally until node E is reached while minimising node B. Because D was visited first and returned 5, B is guaranteed a minimax value <= 5. E is a maximising node, so as soon as one of its children returns 6, E's value is known to be at least 6, which exceeds the 5 that B can already secure. The remaining branches of E are pruned because they cannot change the result: a value below 6 would not be chosen by E, and a value above 6 would only raise E further above 5, so B picks 5 either way. The same happens at node A: once the tree is traversed to F, minimax = 2 is returned to C, so C's value is at most 2, which A will never choose because it is already guaranteed a minimax value >= 5. The subsequent branches of C are pruned away.

Early Termination

For a further speedup, the minimax tree traversal can be terminated early, with the board at the cut-off point scored by a heuristic evaluation function instead of being searched to the end of the game. The cost of early termination is that it makes the minimax values approximate, so it may yield suboptimal moves. A possible evaluation function for tic-tac-toe is as follows:

E(s) = M(s) - O(s)

where s is a board state, and M(s) and O(s) are the numbers of possible winning lines for Max and Min respectively in state s.

For example, see the following starting board state:

The maximiser has four possible winning lines, as shown:

Therefore M(s) = 4. Meanwhile, the minimiser has six possible winning lines, so O(s) = 6. Thus:

E(s) = 4 - 6 = -2

Because E(s) is negative, s is more advantageous for the minimiser.
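A minimal Python sketch of this evaluation function follows. It assumes that a "possible winning line" for a player is any row, column or diagonal that contains none of the opponent's marks, which is the usual reading but is not spelled out in the text. The names `open_lines` and `evaluate` are illustrative, and the board "-x--o----" is an assumption chosen because it happens to give the same counts as the worked example; the original figure may show a different board.

```python
# Illustrative sketch only; not the project's source code.
WIN_LINES = [(0, 1, 2), (3, 4, 5), (6, 7, 8),   # rows
             (0, 3, 6), (1, 4, 7), (2, 5, 8),   # columns
             (0, 4, 8), (2, 4, 6)]              # diagonals


def open_lines(board, player):
    """Count lines that `player` could still complete.

    Assumption: a line is still winnable if the opponent has no mark on it.
    """
    opponent = 'o' if player == 'x' else 'x'
    return sum(1 for line in WIN_LINES
               if all(board[i] != opponent for i in line))


def evaluate(board):
    """E(s) = M(s) - O(s): open lines for 'x' minus open lines for 'o'."""
    return open_lines(board, 'x') - open_lines(board, 'o')


# Hypothetical board: 'x' on the top edge, 'o' in the centre.
example = "-x--o----"
m, o = open_lines(example, 'x'), open_lines(example, 'o')
score = evaluate(example)
```

Here `m` is 4, `o` is 6 and `score` is -2. On an empty board both players have all eight lines open, so the score is 0.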

Program Design

The backend program consists of a main function and four subfunctions. It takes a board state as input from the user and calculates the best move using the minimax principle. The best move is printed to the terminal, and all visited nodes and their corresponding minimax values are written to an output text file. If more than one move has the optimal minimax value, the move that comes first in raster-scan order is chosen. The program can be compiled into a Linux executable binary called tictactoe.bin using the makefile. It can be compiled and run as follows:

              $ make tictactoe
              $ ./tictactoe.bin [state] [path]

where [state] is the raster-scanned board state and [path] is the path of the output text file that the program writes to. An example program call would be:

              $ ./tictactoe.bin oxxxo-ox- /home/visited.txt

main:
Checks the input arguments for early termination and alpha-beta pruning. Raster scans the input and generates the starting board state. Evaluates the board state to determine whether a winner already exists, and then calculates the optimal move for the current player.
bestMove:
Finds the best move for the current player by iterating through all potential moves and recursively calling the minimax algorithm for each resulting board state.
minimax:
Calculates the minimax value for a board state. Iterates through all potential moves, backtracking at the termination of each path, and returns once the optimal value has been found.
checkWinner:
Checks the current state of the board and returns an integer indicating which player has won or whether the game is a draw.
evaluate:
Function used for early termination. Calculates the number of potential winning lines for the current board state.
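The raster-scan tie-breaking rule can be illustrated with a short Python sketch of the move-selection step. This is not the project's implementation: `winner`, `minimax` and `best_move` are hypothetical Python counterparts of the functions described above, and passing the current player explicitly is an assumption. Scanning cells in index order and replacing the best move only on a strictly better value keeps the earliest move among equals.

```python
# Illustrative sketch only; names and signatures are assumptions.
WIN_LINES = [(0, 1, 2), (3, 4, 5), (6, 7, 8), (0, 3, 6),
             (1, 4, 7), (2, 5, 8), (0, 4, 8), (2, 4, 6)]


def winner(board):
    """Return 'x' or 'o' if that player has three in a row, else None."""
    for a, b, c in WIN_LINES:
        if board[a] != '-' and board[a] == board[b] == board[c]:
            return board[a]
    return None


def minimax(board, player):
    """Minimax value (1, 0 or -1) of `board` with `player` to move."""
    w = winner(board)
    if w is not None:
        return 1 if w == 'x' else -1
    if '-' not in board:
        return 0
    other = 'o' if player == 'x' else 'x'
    values = [minimax(board[:i] + player + board[i + 1:], other)
              for i, cell in enumerate(board) if cell == '-']
    return max(values) if player == 'x' else min(values)


def best_move(board, player):
    """Index (0-8, raster order) of the best move for `player`, or None."""
    other = 'o' if player == 'x' else 'x'
    sign = 1 if player == 'x' else -1   # lets one comparison serve both players
    best_index, best_score = None, None
    for i, cell in enumerate(board):    # raster-scan order
        if cell != '-':
            continue
        score = sign * minimax(board[:i] + player + board[i + 1:], other)
        # Strict '>' keeps the earliest move when several are equally good.
        if best_score is None or score > best_score:
            best_index, best_score = i, score
    return best_index
```

On the board "xoxoxo---" with 'x' to move, all three empty cells lead to a win for 'x', so `best_move` returns 6, the first of them in raster order.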

For Alpha-Beta pruning:

              $ ./tictactoe.bin [state] [path] prune

For early termination, also specify ply, a positive integer giving the maximum number of moves (in total from both players) to look ahead:

              $ ./tictactoe.bin [state] [path] prune [ply]
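How pruning and a ply limit might combine is sketched below in Python. This is a sketch under stated assumptions, not the program's actual logic: the function name `search`, the explicit `player` argument, and the "open lines" reading of the evaluation function are all assumptions. In particular, the text does not say how heuristic scores are reconciled with the +1/-1 win values, so this sketch scores real wins as +/-100 (the hypothetical constant `WIN_SCORE`) so that they always outrank a heuristic estimate.

```python
import math

# Illustrative sketch only; names, signatures and WIN_SCORE are assumptions.
WIN_LINES = [(0, 1, 2), (3, 4, 5), (6, 7, 8), (0, 3, 6),
             (1, 4, 7), (2, 5, 8), (0, 4, 8), (2, 4, 6)]
WIN_SCORE = 100  # assumption: larger than any heuristic value (at most 8)


def winner(board):
    """Return 'x' or 'o' if that player has three in a row, else None."""
    for a, b, c in WIN_LINES:
        if board[a] != '-' and board[a] == board[b] == board[c]:
            return board[a]
    return None


def evaluate(board):
    """E(s) = M(s) - O(s), counting lines free of the opponent's marks."""
    m = sum(all(board[i] != 'o' for i in line) for line in WIN_LINES)
    o = sum(all(board[i] != 'x' for i in line) for line in WIN_LINES)
    return m - o


def search(board, player, ply, alpha=-math.inf, beta=math.inf):
    """Depth-limited minimax with alpha-beta pruning."""
    w = winner(board)
    if w is not None:
        return WIN_SCORE if w == 'x' else -WIN_SCORE
    if '-' not in board:
        return 0
    if ply == 0:                        # early termination: use the heuristic
        return evaluate(board)
    other = 'o' if player == 'x' else 'x'
    value = -math.inf if player == 'x' else math.inf
    for i, cell in enumerate(board):
        if cell != '-':
            continue
        child = board[:i] + player + board[i + 1:]
        score = search(child, other, ply - 1, alpha, beta)
        if player == 'x':
            value = max(value, score)
            alpha = max(alpha, value)
        else:
            value = min(value, score)
            beta = min(beta, value)
        if alpha >= beta:               # remaining moves cannot affect the result
            break
    return value
```

With a one-ply look-ahead from the empty board, `search("---------", "x", 1)` returns 4, the heuristic value of taking the centre square. A ply of 9 or more always reaches the end of the game, so the heuristic is never used.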
