GameplayKit Programming Guide: The Minmax Strategist

The Minmax Strategist

Many games that have been popular since long before the days of video gaming—especially board games such as backgammon, chess, Tic-Tac-Toe, and Go—are essentially games of logic. A player succeeds in playing such games by planning the best possible move at every opportunity, taking into account the possible moves of the opposing player (or players). These kinds of games can be exciting to play whether the opponent is another person or a computer player.

In GameplayKit, a strategist is a simple form of artificial intelligence that you can employ to create a computer opponent for these types of games. The GKMinmaxStrategist class and supporting APIs implement a minmax strategy for playing games that have many (if not all) of the following characteristics:

Sequential. Each player must act in order—that is, the players take turns.
Adversarial. For one player to win, the other player (or players) must lose. The game design assumes that on each turn, a player will make a move that puts that player closer to winning, or the other players closer to losing.
Deterministic. Players can plan their moves confident in the assumption that the move will occur as planned—that is, player actions are not subject to chance.
Perfect information. All information crucial to the outcome of the game is visible to all players at all times. For example, a game of chess is won or lost based on the positions of the pieces on the board, and both players can see the positions of all pieces at all times. If there existed a chess variant where each player held in secret a hand of cards, with each card corresponding to a possible move, that variant would not be a perfect information game because the possible actions for one player would not be known to the other player.

It’s possible to use the minmax strategist with games (or elements in more complex games) that don’t fit all of the above characteristics, provided that you can translate your game design into a consistent description that’s compatible with the GameplayKit API. For example, in a single-player game (lacking an adversary), you can use the strategist to suggest optimal moves. Or, in a game where some elements remain hidden until revealed through gameplay, the strategist can plan moves within the constraints of available information. In general, though, the strategist produces more optimal gameplay (that is, comes closer to winning or tying every game) if your gameplay model fits more of these characteristics.

Designing Your Game for the Minmax Strategist

The minmax strategist works by predicting and rating the possible future states of your gameplay in a decision tree, then choosing a next move by finding the path through the tree with the highest rating. For each turn of its own, the strategist rates possible moves highly if those moves lead to a win; for each turn of the strategist’s opponent, the strategist assigns a low rating to possible moves that will lead to a win for the opponent. (The name “minmax” comes from the strategy of maximizing the rating of one’s own moves and minimizing the rating of an opponent’s moves.)
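The rating process described above can be sketched in plain C, which also compiles as Objective-C. This is a minimal illustration on a hypothetical toy game (players alternately take one to three stones, and whoever takes the last stone wins), not the GKMinmaxStrategist implementation. The names minimax, best_take, MAX_TAKE, and WIN_SCORE are assumptions made for this example.

```c
#include <limits.h>

/* Hypothetical toy game (not part of GameplayKit): players alternately take
   1 to 3 stones from a pile, and whoever takes the last stone wins. */
enum { MAX_TAKE = 3, WIN_SCORE = 100 };

/* Rate a position from the maximizing player's point of view.
   `maximizing` is nonzero when it is the maximizing player's turn. */
static int minimax(int stones, int depth, int maximizing) {
    if (stones == 0) {
        /* The previous mover took the last stone and won. */
        return maximizing ? -WIN_SCORE : WIN_SCORE;
    }
    if (depth == 0) {
        return 0; /* Lookahead exhausted: neutral rating. */
    }
    int best = maximizing ? INT_MIN : INT_MAX;
    for (int take = 1; take <= MAX_TAKE && take <= stones; take++) {
        int rating = minimax(stones - take, depth - 1, !maximizing);
        /* Maximize on our own turns, minimize on the opponent's turns. */
        if (maximizing ? rating > best : rating < best) {
            best = rating;
        }
    }
    return best;
}

/* Choose the move whose resulting position has the highest rating. */
static int best_take(int stones, int depth) {
    int best_move = 0, best_rating = INT_MIN;
    for (int take = 1; take <= MAX_TAKE && take <= stones; take++) {
        int rating = minimax(stones - take, depth - 1, 0);
        if (rating > best_rating) {
            best_rating = rating;
            best_move = take;
        }
    }
    return best_move;
}
```

With five stones and enough lookahead, best_take(5, 10) picks one stone, because leaving four stones gives the opponent no winning reply. A real game replaces the neutral rating at depth zero with a heuristic score.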

Consider a game in which each player (red or black) moves by dropping a colored chip into one column of a grid, and a player wins upon placing four chips in a row horizontally, vertically, or diagonally. Figure 5-1 shows a partial decision tree for such a game. The FourInARow sample code project implements this game.

The red player can move by dropping a piece in any of the seven columns. For each column, the strategist considers the state of the board after a move in that column, and rates the move based on the outcome. (The diagram above shows only a few of the possible moves.)

If the red player moves in column 1 (the leftmost column), the black player can move in that column on the next turn and win the game. This is not a good outcome for the red player, so the red player assigns a low rating to this move.
If the red player moves in column 7 (the rightmost column), the black player has no possible move on the next turn that wins the game, so the red player assigns an intermediate rating to this move.
The red player wins with a move in column 4 (the middle column), so that move has the highest possible rating.

From this example, it should be clear that using the minmax strategist requires three key features in your game’s architecture:

A representation of the game “board”; that is, an object that models the current state of all elements crucial to the game’s outcome.
A way to specify the possible moves for a player, and indicate how each move leads from one game state to another.
A means of scoring or rating each possible state of the game model, so that the states resulting from more advantageous moves can be distinguished from other states.

A well-architected game tends to already have these features. For example, the FourInARow sample code project uses a Model-View-Controller architecture to separate its gameplay model from the code that drives user interaction and display. Listing 5-1 shows the key features of the AAPLBoard class that implements the gameplay model:

Listing 5-1 Interface for a Four-In-A-Row Game Model
@interface AAPLBoard : NSObject
{
    // One chip value per cell of the grid.
    AAPLChip _cells[AAPLBoardWidth * AAPLBoardHeight];
}
// The player whose turn it currently is.
@property AAPLPlayer *currentPlayer;
- (AAPLChip)chipInColumn:(NSInteger)column row:(NSInteger)row;
// Finding and making moves.
- (BOOL)canMoveInColumn:(NSInteger)column;
- (void)addChip:(AAPLChip)chip inColumn:(NSInteger)column;
// Detecting a draw and measuring progress toward a win.
- (BOOL)isFull;
- (NSArray<NSNumber *> *)runCountsForPlayer:(AAPLPlayer *)player;
@end

      

These properties and methods expose all the information that a strategist needs to play the game:

An AAPLBoard object itself represents the state of the game (or a potential future state of a game). It contains a rectangular array of AAPLChip values (an integer enumeration with labeled cases for the red player, the black player, and an empty space). It also references an AAPLPlayer object representing the player whose turn it currently is.
The canMoveInColumn: and addChip:inColumn: methods provide a way to specify the possible moves at any given time and simulate the outcome of those moves. To speculate about a possible future state of the game, the strategist makes a copy of the “official” AAPLBoard object representing the current state, then calls the addChip:inColumn: method on the copy.
The isFull method determines whether the game has ended due to a draw, and the runCountsForPlayer method provides information that helps determine how close either player is to winning the game. If you add the ability to produce additional AAPLBoard objects for speculating about future states of the game, these methods can be the basis for scoring or rating those future states.
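To make the roles of these methods concrete, here is a minimal plain-C sketch of a comparable board model. It is not the AAPLBoard implementation. The six-row height, the bottom-up row order, and all names (Board, board_can_move, board_add_chip, and so on) are assumptions chosen for illustration.

```c
#include <string.h>

/* Hypothetical plain-C stand-in for the board model; the sizes and names
   below are assumptions, not taken from the sample project. */
enum { BOARD_WIDTH = 7, BOARD_HEIGHT = 6 };
typedef enum { CHIP_NONE = 0, CHIP_RED, CHIP_BLACK } Chip;

typedef struct {
    Chip cells[BOARD_WIDTH * BOARD_HEIGHT]; /* row 0 is the bottom row */
    Chip current_player;
} Board;

static void board_init(Board *b) {
    memset(b->cells, 0, sizeof b->cells); /* CHIP_NONE is 0 */
    b->current_player = CHIP_RED;
}

static Chip board_chip_at(const Board *b, int column, int row) {
    return b->cells[row * BOARD_WIDTH + column];
}

/* A column accepts a chip while its top cell is still empty. */
static int board_can_move(const Board *b, int column) {
    return board_chip_at(b, column, BOARD_HEIGHT - 1) == CHIP_NONE;
}

/* Drop a chip into the lowest empty cell; returns the row, or -1 if full. */
static int board_add_chip(Board *b, Chip chip, int column) {
    for (int row = 0; row < BOARD_HEIGHT; row++) {
        if (board_chip_at(b, column, row) == CHIP_NONE) {
            b->cells[row * BOARD_WIDTH + column] = chip;
            return row;
        }
    }
    return -1;
}

/* The board is full when no column accepts another chip. */
static int board_is_full(const Board *b) {
    for (int column = 0; column < BOARD_WIDTH; column++) {
        if (board_can_move(b, column)) return 0;
    }
    return 1;
}
```

Because the whole state lives in one small struct, speculating about a future position is a matter of copying the struct and calling board_add_chip on the copy.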

Using the Minmax Strategist in Your Game

You use an instance of the GKMinmaxStrategist class to implement the minmax strategist in your game. This class uses the gameplay model you’ve defined to reason about the state of the game, offering suggestions for the “best” next move for a player to take at any time. However, before the GKMinmaxStrategist class can reason about your gameplay model, you must describe that model in standardized terms that GameplayKit can understand. You do this by adopting the GKGameModel, GKGameModelUpdate, and GKGameModelPlayer protocols.

Adopt GameplayKit Protocols in Your Game Model

Three key protocols define the interface GameplayKit uses to reason about the state of your game.

The GKGameModel protocol applies to objects that represent a distinct state of your gameplay model. In the FourInARow example, the AAPLBoard class is a good candidate for adopting this protocol.
The GKGameModelPlayer protocol is an abstraction of the objects that represent players in your game. In the example, the AAPLPlayer class works well for this case.
The GKGameModelUpdate protocol is for objects that describe a transition between two game states. In the example, there is no such object—the addChip method performs the transition. In this case, adopting the GKGameModelUpdate protocol requires creating a new class.

The code listings below show how to extend the model classes in the FourInARow example to adopt the GameplayKit protocols. First, Listing 5-2 adds an AAPLMove class that adopts the GKGameModelUpdate protocol.

Listing 5-2 The Move Class Adopts the GKGameModelUpdate Protocol
@interface AAPLMove : NSObject <GKGameModelUpdate>
// Score that the strategist assigns when rating this move.
@property (nonatomic) NSInteger value;
// Column into which the chip is dropped.
@property (nonatomic) NSInteger column;
+ (AAPLMove *)moveInColumn:(NSInteger)column;
@end

      

The GKGameModelUpdate protocol requires one property: value. The strategist assigns a score to this property when weighing the desirability of each possible move. For the rest of your implementation of this protocol, create properties or methods that describe a move in terms particular to your game. In the FourInARow game, the only information needed to specify a move is which column to drop a chip into.

Next, Listing 5-3 adds GKGameModelPlayer conformance to the AAPLPlayer class.

Listing 5-3 The Player Class Adopts the GKGameModelPlayer Protocol
@interface AAPLPlayer (AAPLMinmaxStrategy) <GKGameModelPlayer>
@end
 
@implementation AAPLPlayer (AAPLMinmaxStrategy)
- (NSInteger)playerId { return self.chip; }
@end

      

The AAPLPlayer class simply exposes the integer value of its AAPLChip enumeration as its playerId property.

Adding GKGameModel conformance to the AAPLBoard class requires three areas of functionality: keeping track of players, handling and scoring moves, and copying state information. Listing 5-4 shows the first of these areas.

Listing 5-4 GKGameModel Implementation: Managing Players
- (NSArray<AAPLPlayer *> *)players { return [AAPLPlayer allPlayers]; }
- (AAPLPlayer *)activePlayer { return self.currentPlayer; }

      

The players getter always returns the two players in the game. For the FourInARow game, the number and identity of the players are constant, so this array is stored in the AAPLPlayer class. The activePlayer getter simply maps to the currentPlayer property already implemented in the AAPLBoard class.

A critical part of a GKGameModel implementation is simulating each player’s prospective moves. Listing 5-5 shows how the AAPLBoard class handles these tasks.

Listing 5-5 GKGameModel Implementation: Finding and Handling Moves
- (NSArray<AAPLMove *> *)gameModelUpdatesForPlayer:(AAPLPlayer *)player {
    // Collect one move for each column that can still accept a chip.
    NSMutableArray<AAPLMove *> *moves = [NSMutableArray arrayWithCapacity:AAPLBoardWidth];
    for (NSInteger column = 0; column < AAPLBoardWidth; column++) {
        if ([self canMoveInColumn:column]) {
            [moves addObject:[AAPLMove moveInColumn:column]];
        }
    }
    return moves; // will be empty if isFull
}

- (void)applyGameModelUpdate:(AAPLMove *)gameModelUpdate {
    // Drop the current player's chip, then pass the turn to the opponent.
    [self addChip:self.currentPlayer.chip inColumn:gameModelUpdate.column];
    self.currentPlayer = self.currentPlayer.opponent;
}

      

The gameModelUpdatesForPlayer: method returns the possible moves for the specified player. Each possible move is an instance of your game’s implementation of the GKGameModelUpdate protocol (in this example, the AAPLMove class). The FourInARow game implements this method by returning an array of AAPLMove objects, one for each non-full column.

The applyGameModelUpdate: method takes a GKGameModelUpdate object and performs whatever game actions that object specifies. In this example, a game model update is an AAPLMove object, so the board applies a move by calling its addChip:inColumn: method to drop a chip of the current player’s color in the column specified by the AAPLMove object. This method is also responsible for letting GameplayKit know the order of turns in your game, so that when the strategist plans several turns ahead it knows which player is moving on which turn. This example is a two-player game, so the applyGameModelUpdate: method switches the currentPlayer property, which backs the activePlayer getter, to the current player’s opponent on each move.
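The same two responsibilities, enumerating legal moves and applying one while advancing the turn, can be sketched in plain C on a reduced model. The GameState and Move types, the six-row height, and the function names are hypothetical stand-ins for illustration, not part of the sample project or GameplayKit.

```c
enum { BOARD_WIDTH = 7, BOARD_HEIGHT = 6 };
typedef enum { PLAYER_RED = 0, PLAYER_BLACK = 1 } Player;

/* Hypothetical reduced model: only column fill levels and whose turn it is. */
typedef struct {
    int heights[BOARD_WIDTH];
    Player current_player;
} GameState;

typedef struct { int column; } Move;

/* Fill `moves` with one Move per non-full column and return how many were
   found. The count is 0 when the board is full. */
static int legal_moves(const GameState *s, Move moves[BOARD_WIDTH]) {
    int count = 0;
    for (int column = 0; column < BOARD_WIDTH; column++) {
        if (s->heights[column] < BOARD_HEIGHT) {
            moves[count++].column = column;
        }
    }
    return count;
}

/* Apply a move, then hand the turn to the opponent so that lookahead
   alternates between the two players. */
static void apply_move(GameState *s, Move m) {
    s->heights[m.column]++;
    s->current_player =
        (s->current_player == PLAYER_RED) ? PLAYER_BLACK : PLAYER_RED;
}
```

Keeping the turn switch inside apply_move mirrors the design described above: whoever simulates a sequence of moves never has to track whose turn it is separately.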

For the minmax strategist to determine which of the possible moves is the best choice, your game model must indicate when it represents a more or less desirable state of the game, as shown in Listing 5-6.

Listing 5-6 GKGameModel Implementation: Evaluating Board State
- (BOOL)isWinForPlayer:(AAPLPlayer *)player {
    NSArray<NSNumber *> *runCounts = [self runCountsForPlayer:player];
    // The player wins if there are any runs of 4 (or more, but that shouldn't happen in a regular game).
    NSNumber *longestRun = [runCounts valueForKeyPath:@"@max.self"];
    return longestRun.integerValue >= AAPLCountToWin;
}
 
- (BOOL)isLossForPlayer:(AAPLPlayer *)player {
    return [self isWinForPlayer:player.opponent];
}
 
- (NSInteger)scoreForPlayer:(AAPLPlayer *)player {
    NSArray<NSNumber *> *playerRunCounts = [self runCountsForPlayer:player];
    NSNumber *playerTotal = [playerRunCounts valueForKeyPath:@"@sum.self"];
    NSArray<NSNumber *> *opponentRunCounts = [self runCountsForPlayer:player.opponent];
    NSNumber *opponentTotal = [opponentRunCounts valueForKeyPath:@"@sum.self"];
    // Return the sum of player runs minus the sum of opponent runs.
    return playerTotal.integerValue - opponentTotal.integerValue;
}

      

The first step in enabling a strategist to select optimal moves is identifying when a game model represents a win or a loss. You provide this information in the isWinForPlayer: and isLossForPlayer: methods. In the FourInARow example, the runCountsForPlayer method provides a convenient way to test for wins—as the name of the game suggests, the player wins with a run of four chips in a row.

Because this game is for two players only, a loss for the player is by definition a win for the player’s opponent, and vice versa, so you can implement the isLossForPlayer: method in terms of the isWinForPlayer: method. (In a game with more than two players, this assumption might not be true. For example, a winner might be determined only after all other players have lost.)

Armed with only the knowledge of winning and losing game states, the minmax strategist can play the game. The GKMinmaxStrategist class automatically ignores possible moves after a win or loss, and it weighs its decisions based on how many moves in the future an expected win or loss is. However, at this point the strategist does not play the game very well, because there are a number of possible states of the game with equal numbers of possible future wins or losses.

To improve the strategist’s gameplay, use the scoreForPlayer: method to implement a heuristic, that is, an estimate of how desirable the current state of the board is to the player. Alternatively, a score can represent an estimate of how likely the player is to win, or how soon the player can win. The FourInARow example uses a heuristic based on its runCountsForPlayer: method, assuming that, for example, a player with two runs of two chips each is more likely to win soon than a player with no runs. Because the player’s success or failure is related to that of the opponent, the scoreForPlayer: method totals the player’s number and length of runs and subtracts from it the opponent’s number and length of runs.
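As a rough plain-C illustration of the win test and the heuristic, the sketch below works on arrays of run lengths. It assumes each array entry is the length of one run, which is a guess at what runCountsForPlayer: returns. The function names and COUNT_TO_WIN are hypothetical.

```c
#include <stddef.h>

enum { COUNT_TO_WIN = 4 };

/* Longest run in an array of run lengths (0 for an empty array). */
static int longest_run(const int *runs, size_t n) {
    int longest = 0;
    for (size_t i = 0; i < n; i++) {
        if (runs[i] > longest) longest = runs[i];
    }
    return longest;
}

/* Sum of all run lengths. */
static int sum_runs(const int *runs, size_t n) {
    int total = 0;
    for (size_t i = 0; i < n; i++) total += runs[i];
    return total;
}

/* A player has won once any run reaches the winning length. */
static int is_win(const int *runs, size_t n) {
    return longest_run(runs, n) >= COUNT_TO_WIN;
}

/* Heuristic: the player's total run length minus the opponent's. */
static int score(const int *player_runs, size_t player_n,
                 const int *opponent_runs, size_t opponent_n) {
    return sum_runs(player_runs, player_n) - sum_runs(opponent_runs, opponent_n);
}
```

For a player with two runs of two chips facing an opponent with one run of three, this score is 4 - 3 = 1, a slight edge for the player even though the opponent is arguably closer to winning. The suggestions in the Experiment section address that kind of weakness.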

Experiment

This example heuristic is enough to produce moderately strong gameplay, but is not the best possible heuristic for this game. For a stronger computer-controlled opponent, try improving the heuristic:

Weight longer runs more strongly. For example, two runs of three chips in a row should be better than three runs of two chips in a row.
Account for arrangements where a player can win by filling a gap. For example, if the player has two chips in a row, then an open space, then another chip, that arrangement should score equivalently to one with three chips in a row and then a gap.
Discount arrangements where a run is already blocked. For example, if the player has three chips in a row, but the opponent has chips on either side of that row, that run cannot lead to a win.
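The first suggestion can be sketched in plain C. The squared weighting below is one arbitrary choice made for illustration, not a recommendation from the sample project, and the function names are hypothetical. It addresses only the weighting of longer runs, not gaps or blocked runs.

```c
#include <stddef.h>

/* Unweighted total: every chip in a run counts the same. */
static int plain_total(const int *runs, size_t n) {
    int total = 0;
    for (size_t i = 0; i < n; i++) total += runs[i];
    return total;
}

/* Hypothetical weighting: a run of length k contributes k * k, so longer
   runs count disproportionately more than shorter ones. */
static int weighted_total(const int *runs, size_t n) {
    int total = 0;
    for (size_t i = 0; i < n; i++) total += runs[i] * runs[i];
    return total;
}
```

With unweighted totals, two runs of three and three runs of two both come to 6. With squared weights they come to 18 and 12, so the longer runs rank higher.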

Finally, a game model class must support copying itself and its internal state, so that the strategist can make multiple copies of a board for examining the results of possible future moves. Listing 5-7 shows how the AAPLBoard class copies itself.

Listing 5-7 GKGameModel Implementation: Copying State Information
- (void)setGameModel:(AAPLBoard *)gameModel {
    // Copy the other board's cells and turn state into this instance.
    memcpy(_cells, gameModel->_cells, sizeof(_cells));
    self.currentPlayer = gameModel.currentPlayer;
}

- (id)copyWithZone:(NSZone *)zone {
    // Create a fresh board, then overwrite its state with this board's state.
    AAPLBoard *copy = [[[self class] allocWithZone:zone] init];
    [copy setGameModel:self];
    return copy;
}

      

The setGameModel: method copies the internal state of one GKGameModel object to another. Here, the AAPLBoard class simply copies its _cells array and currentPlayer property. The GKGameModel protocol requires conformance to the NSCopying protocol, so the copyWithZone: method uses the setGameModel: method to turn a new AAPLBoard instance into a copy.
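The copy-into-an-existing-instance pattern can be shown in isolation with a plain-C sketch. The Board struct, CELL_COUNT, and board_set_state are hypothetical names, and the 7 by 6 grid size is an assumption.

```c
#include <string.h>

enum { CELL_COUNT = 7 * 6 }; /* assumed grid size */

typedef struct {
    int cells[CELL_COUNT];
    int current_player;
} Board;

/* Overwrite dst's state with src's state, the role that setGameModel:
   plays in the listing above. No new Board is allocated. */
static void board_set_state(Board *dst, const Board *src) {
    memcpy(dst->cells, src->cells, sizeof dst->cells);
    dst->current_player = src->current_player;
}
```

A caller can keep one scratch Board, reset it with board_set_state before each speculative move, and modify it freely. The source board is unaffected because the cells are copied by value.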

To simulate possible moves, the strategist uses several copies of your GKGameModel class and calls the setGameModel: method many times. For example, looking ahead six turns in the FourInARow game, which has up to seven possible moves at any given time, can require examining more than a hundred thousand board states (up to 7^6 = 117,649 at the sixth turn alone). Rather than continually creating new instances of your game model class, GameplayKit makes only a few copies and reuses those instances with the setGameModel: method. This optimization improves memory efficiency.
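The growth of the search space is easy to check with a few lines of plain C. The helper names are hypothetical. The figures are upper bounds that assume seven legal moves on every turn and ignore games that end early, so an actual search may visit fewer states.

```c
/* Upper bound on positions exactly `depth` turns ahead: branching^depth. */
static unsigned long positions_at_depth(unsigned branching, unsigned depth) {
    unsigned long count = 1;
    for (unsigned i = 0; i < depth; i++) count *= branching;
    return count;
}

/* Upper bound on positions over turns 1 through `depth` combined. */
static unsigned long positions_up_to_depth(unsigned branching, unsigned depth) {
    unsigned long total = 0, level = 1;
    for (unsigned i = 0; i < depth; i++) {
        level *= branching; /* positions at turn i + 1 */
        total += level;
    }
    return total;
}
```

For a branching factor of seven, a depth of six gives 117,649 positions at the last turn and 137,256 in total. Going one turn deeper gives 823,543 and 960,799, which shows how quickly each extra turn of lookahead multiplies the work.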

Use a Strategist Object to Plan Moves

After you’ve adopted GKGameModel and related protocols in your gameplay model classes, using the minmax strategist requires only a few simple steps. To see how these steps are implemented in the FourInARow game, download the sample code project.

1. Keep a GKMinmaxStrategist instance in whatever controller code manages your gameplay model, for example, a view controller or SpriteKit scene.

2. Use the GKMinmaxStrategist object’s gameModel property to point to the current state of your game, that is, your model object that implements the GKGameModel protocol.

Typically, you need to do this only once, when initializing a new game, because the gameModel property will refer to the same GKGameModel object even as that object’s internal state changes. However, if you begin using a different instance of your model class (for example, when resetting the board for a new game), be sure to set the gameModel property again.

3. Set the maxLookAheadDepth property to the number of consecutive future moves you want to plan for.

In general, deeper lookahead results in a stronger computer-controlled player. However, deeper lookahead also exponentially increases the amount of work the strategist must do to plan a move. Choose a lookahead depth that provides the desired level of difficulty for your game without sacrificing performance.

For example, the FourInARow game requires four chips in a row to win. Therefore, if a computer-controlled player always makes the optimal move, it should theoretically always be able to either win or force a draw when the lookahead depth is seven: four of the strategist’s own turns plus the intervening three opponent turns. (Whether that theory holds true depends on the robustness of your scoreForPlayer: method.) For this game, a lookahead depth of seven results in a search space of nearly a million possible board states.

4. To plan the computer-controlled player’s next move, use the bestMoveForPlayer: method. This method returns a GKGameModelUpdate object (in the FourInARow example, an AAPLMove object) corresponding to the chosen move. To perform the move, use your gameplay model’s applyGameModelUpdate: method.

Alternatively, use one of the other methods listed in GKMinmaxStrategist Class Reference, which provide options for handling cases with multiple “best” moves or randomizing moves.

Note

Finding the best move can take a significant amount of time, especially when looking ahead several turns. To keep your game’s user interface performance smooth, use a background queue for strategist work. For details, see Concurrency Programming Guide.
