Moyoman: a Go playing program
Introduction
There is a long and successful history of Chess playing programs. One of the strongest,
Deep Blue by IBM, is as strong as or stronger than the best human players. In other games such as
Checkers or Othello, it is even easier to write a computer program that can outperform the
best human players. Yet for the game of Go, no one has written a program that can play as well as
even a mediocre human player.
The basic algorithm used by all serious Chess programs is a variation of the minimax search. That is,
the program examines all of its possible moves, each of its opponent's countermoves, and so on. Each possible
board position is then evaluated using a static evaluation function. This is a simple algebraic
formula: the sum of different factors, such as the number and type of pieces on the board,
each of which is multiplied by a weighting factor depending on its importance. There are two ways of
improving this procedure: searching more levels ahead, and improving the static evaluation function.
The ability to search more levels deep is the most important factor. As of 2002, Deep Blue
only uses four factors in its static evaluation function, but it is a massively parallel machine with
256 special purpose VLSI Chess processors and is capable of analyzing over 300 million board positions
per second. By searching the problem space enough moves ahead, even a very simple evaluation method
can produce very powerful results.
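The search described above can be sketched in a few lines of Java. This is only an illustration of plain minimax with a weighted-sum evaluation, not the code of any real Chess program; the names Position, TreeNode, and MinimaxSketch are hypothetical, and the features and weights are placeholders.

```java
import java.util.Arrays;
import java.util.List;

/** Hypothetical position abstraction, invented for this sketch. */
interface Position {
    List<Position> successors();   // positions reachable in one move
    int[] features();              // e.g. piece counts, seen from the maximizing side
}

/** Minimal explicit game tree so that the sketch can be run without a real game. */
final class TreeNode implements Position {
    private final int[] features;
    private final List<Position> children;

    TreeNode(int[] features, Position... children) {
        this.features = features;
        this.children = Arrays.asList(children);
    }

    @Override public List<Position> successors() { return children; }
    @Override public int[] features() { return features; }
}

final class MinimaxSketch {
    /** Static evaluation: the sum of each factor multiplied by its weight. */
    static int evaluate(Position p, int[] weights) {
        int[] f = p.features();
        int score = 0;
        for (int i = 0; i < f.length; i++) {
            score += weights[i] * f[i];
        }
        return score;
    }

    /** Plain minimax to a fixed depth; real programs add pruning on top of this. */
    static int minimax(Position p, int depth, boolean maximizing, int[] weights) {
        List<Position> next = p.successors();
        if (depth == 0 || next.isEmpty()) {
            return evaluate(p, weights);
        }
        int best = maximizing ? Integer.MIN_VALUE : Integer.MAX_VALUE;
        for (Position child : next) {
            int value = minimax(child, depth - 1, !maximizing, weights);
            best = maximizing ? Math.max(best, value) : Math.min(best, value);
        }
        return best;
    }
}
```

Searching one level deeper multiplies the number of positions by the number of legal moves at each level, which is why this approach depends so heavily on raw speed.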
The total number of possible games in Go is so much larger than the number of possible games in
Chess that the basic algorithm used by Deep Blue and other Chess playing programs is useless for
developing a Go program, at least for the game as a whole. In addition, even if the size of the search
space were not a problem, creating a simple static evaluation function for Go is an even more difficult
problem. Certain clearly defined problems, such as life and death, can make good use of the search
algorithm; however, a different approach is needed in order to write a program to play Go. Two
different subproblems need to be addressed: the design and implementation of the software, and the
organization needed to do the work.
Philosophy
The design for the Moyoman Go playing program described here is based on five major assumptions:
- Writing a Go playing program is a very large problem.
- The problem can be broken up into well-defined components.
- Analysis can be done incrementally.
- We understand what needs to be done much better than how to do it.
- There is a linear ordering of components.
Writing a Go playing program is a very large problem.
If you identify all of the types of processing that need to be done in order to have
an amateur dan level program, you will quickly realize that hundreds of person years
of effort will be required. A single person or small group of people will not be able
to write a good Go playing program, because the problem is just too big.
The problem can be broken up into well-defined components
Any competent Go player is aware of the standard way that the game of Go is analyzed.
There are a number of concepts, such as making good shape, specialized moves for the
opening and endgame, life and death, etc. The Moyoman framework treats each of these
problems independently of the others. That is, a module such as LifeAndDeath is
solved independently of a module such as Shape. The only interaction among modules
is that a module can take other modules as inputs. For example, the Life and Death
module may take the Board and Groups modules as inputs. There is an ordered list of
module types, and a module can use the output of any modules which precede it on that list.
Analysis can be done incrementally
If human players are shown a half completed game, it may take them several minutes
of concentrated study to analyze the game and determine the next move, while the players
who have been playing that game can make a reasonable move within several seconds.
It is assumed by the Moyoman framework that many modules can benefit by saving the results
of their analysis, and only doing an incremental analysis for the last move made.
Of course, in some cases such as when a large group of stones is captured, it may be easier
to analyze the board from scratch. It is expected that this will only happen once or twice
a game, and that for most moves the incremental approach will work better.
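The incremental idea can be sketched as follows. This is a minimal illustration, assuming a hypothetical base class IncrementalModuleSketch and an arbitrary capture threshold; the stone counter is deliberately trivial and stands in for an analysis that would be expensive to redo from scratch.

```java
/** Hypothetical base class: keeps saved results and chooses how to update them. */
abstract class IncrementalModuleSketch {
    /** Arbitrary threshold, chosen only for illustration. */
    static final int LARGE_CAPTURE = 8;

    int fullAnalyses = 0;
    int incrementalUpdates = 0;

    /** board holds one int per point (0 = empty, 1 = black, 2 = white) after the move. */
    final void moveMade(int[] board, int point, int capturedStones) {
        if (capturedStones >= LARGE_CAPTURE) {
            analyzeFromScratch(board);      // rare case: the saved results are discarded
            fullAnalyses++;
        } else {
            analyzeIncrementally(board, point, capturedStones);
            incrementalUpdates++;
        }
    }

    protected abstract void analyzeFromScratch(int[] board);

    protected abstract void analyzeIncrementally(int[] board, int point, int capturedStones);
}

/** Toy module that only tracks how many stones are on the board. */
final class StoneCountSketch extends IncrementalModuleSketch {
    private int stones = 0;

    int stones() { return stones; }

    @Override protected void analyzeFromScratch(int[] board) {
        stones = 0;
        for (int p : board) {
            if (p != 0) stones++;
        }
    }

    @Override protected void analyzeIncrementally(int[] board, int point, int capturedStones) {
        stones += 1 - capturedStones;       // one stone added, some possibly removed
    }
}
```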
We understand what needs to be done much better than how to do it.
As mentioned above, the definitions of the subproblems in analyzing the game of Go are well understood.
What is not at all well understood are the algorithms for solving them. A strong player
may be able to state for a given board position which moves would make good shape and which
would make bad shape, but he would be hard pressed to give an algorithm for generating
this information. Given this fact, the Moyoman framework allows for different implementations
of the same module to be part of the code base. Only one implementation of a given module type
would be executed at one time, but development work can proceed on each implementation
independently. It is easy to substitute one implementation for another since all implementations
of a given module must implement the same interface. This allows for a huge amount of
parallel development and exploration of ideas.
There is a linear ordering of components
The components defined would be given an ordering, where a component could use the
results of any components which preceded it on the list. Thus, the LifeAndDeath
component could use the results of the Board, Groups, Shape, and Tesuji components, but Tesuji
could not use the results of the LifeAndDeath component. This may be the most
problematic of the assumptions since there may appear to be circular dependencies
among the components. This problem will be dealt with as it occurs.
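The ordering rule is simple enough to state as code. The sketch below assumes that module types are identified by name and kept in a single list; ModuleOrderSketch and mayUse are hypothetical names, not part of the Moyoman code base.

```java
import java.util.Arrays;
import java.util.List;

final class ModuleOrderSketch {
    private final List<String> order;

    /** Module types are listed from the most basic to the most advanced. */
    ModuleOrderSketch(String... moduleTypes) {
        this.order = Arrays.asList(moduleTypes);
    }

    /** A module may use another module only if that module comes earlier in the list. */
    boolean mayUse(String user, String provider) {
        int userIndex = order.indexOf(user);
        int providerIndex = order.indexOf(provider);
        if (userIndex < 0 || providerIndex < 0) {
            throw new IllegalArgumentException("Unknown module type");
        }
        return providerIndex < userIndex;
    }
}
```

With the list Board, Groups, Shape, Tesuji, LifeAndDeath, a call such as mayUse("LifeAndDeath", "Tesuji") is allowed while mayUse("Tesuji", "LifeAndDeath") is not, so a cycle cannot be expressed at all.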
Organization
In the past, Go programs have been written primarily by one person or a small group of people. There are certain
myths that have arisen about Go playing programs. It is believed that it is very difficult to
write a good Go program. This is a result of many people trying, and none succeeding.
This does not necessarily mean that the problem is hard, merely that it is a large problem. It may
very well be that solving this problem is more than can be done by one or two people in one lifetime.
Of course, it is certainly a hard problem as well as being a big problem.
Given the success that projects such as the Linux operating system and the Apache web server have
enjoyed as a result of being open source and having the contributions of many people working together,
it would be interesting to try a similar strategy with developing a Go playing program. All of the
source code would be freely available. The most important challenge is
to develop a framework within which different people can work and develop their ideas. The difference
between developing a web server or even an operating system, and writing a Go playing program is that
the first two problems are well understood, even if the implementation is technically complex. No one
knows how to write a good Go playing program yet, and so a solution cannot be defined by a few people,
with outside contributors merely implementing it and fixing bugs. A flexible framework is crucial,
so that different people have a way of experimenting and trying out various approaches. Developers
would write components referred to here as modules. There is a list of well-defined module types,
and there can be more than one implementation of a given module type. This will
allow the program to evolve and use those modules that prove to be effective, while discarding those
that do not work well without having to redesign the program as a whole. While it would be
prohibitively expensive for a commercial project to take this approach, the large number of Go
players who are also software developers would make this approach feasible.
This flexible framework will be implemented with the help of two design rules. First, there
will be an abstract base class which all modules are required to extend. Second, the interface for each
specific module, such as LifeAndDeath, will be standardized. This not only allows other modules
to easily use the LifeAndDeath module, but it allows for multiple versions of the LifeAndDeath
module to be written using different approaches. Developers would have the choice of enhancing an
existing module, or writing a new one from scratch as long as it corresponds to the approved interface,
neither leaving out any functionality, nor adding any new functionality. In this way, modules which
use the LifeAndDeath module do not know or even care which implementation of that module type they are using,
since they all implement the same interface. Note that the crucial assumption here is that the
definitions of the modules can be determined up front and do not require very much experimentation.
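The two design rules might look roughly like this in Java. All names here (ModuleBase, LifeAndDeathApi, the two implementations, and LiveGroupCounter) are hypothetical, and the eye-counting rules are placeholders rather than real life and death analysis.

```java
/** Hypothetical base class; the real Moyoman base class may look quite different. */
abstract class ModuleBase {
    /** Called by the framework when a new move is ready to be analyzed. */
    abstract void moveMade(int point);
}

/** Hypothetical standardized interface for the LifeAndDeath module type. */
interface LifeAndDeathApi {
    boolean isAlive(int groupId);
}

/** One implementation: a group is called alive only with two eyes. */
final class TwoEyeLifeAndDeath extends ModuleBase implements LifeAndDeathApi {
    private final int[] eyesPerGroup;

    TwoEyeLifeAndDeath(int[] eyesPerGroup) { this.eyesPerGroup = eyesPerGroup; }

    @Override void moveMade(int point) { /* re-analysis omitted in this sketch */ }

    @Override public boolean isAlive(int groupId) { return eyesPerGroup[groupId] >= 2; }
}

/** A competing implementation with a more optimistic rule. */
final class OneEyeLifeAndDeath extends ModuleBase implements LifeAndDeathApi {
    private final int[] eyesPerGroup;

    OneEyeLifeAndDeath(int[] eyesPerGroup) { this.eyesPerGroup = eyesPerGroup; }

    @Override void moveMade(int point) { /* re-analysis omitted in this sketch */ }

    @Override public boolean isAlive(int groupId) { return eyesPerGroup[groupId] >= 1; }
}

/** A client module that sees only the interface, never the implementation. */
final class LiveGroupCounter {
    private final LifeAndDeathApi lifeAndDeath;

    LiveGroupCounter(LifeAndDeathApi lifeAndDeath) { this.lifeAndDeath = lifeAndDeath; }

    int count(int numberOfGroups) {
        int alive = 0;
        for (int g = 0; g < numberOfGroups; g++) {
            if (lifeAndDeath.isAlive(g)) alive++;
        }
        return alive;
    }
}
```

Because LiveGroupCounter is written against LifeAndDeathApi, either implementation can be passed to its constructor without changing the client.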
It is also useful to consider that there is a great deal of expertise in the Go playing community
that can be put to use, both software development talent, and Go playing talent. The software
developers will be of great use in developing new modules, debugging and optimizing old modules, and
reviewing the fundamental soundness of the software framework, and the modules that are running within
it. The strong Go players will be of great use in analyzing games played by the program and
analyzing the results of individual modules, as well as helping to devise the specifications
for new modules that need to be developed.
Ideally, there would be legions of very strong Go players with advanced computer skills to work
on this program. In practice, very few of the people working on this project would be very strong
at both Go and software development. Thus, it is important that the program be as easy to use as
possible, even when the user is trying to debug the internals of the program. This means that
each module will be required to produce debugging information which can be displayed graphically.
For example, a scoring module would produce output which would indicate which points are considered
black territory, which ones are considered white territory, which ones are dame points, and which
ones are considered to be partially the territory of one player or the other but not definite
territory. With the use of a graphical client program, a strong Go player can then assess the
accuracy of this analysis, and submit a bug report if necessary, without having any programming
knowledge whatsoever.
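As an illustration of the kind of debugging output meant here, the sketch below classifies each point and renders the result as text. The enum PointStatus and the class TerritoryDebugSketch are hypothetical, and a plain text grid stands in for the graphical display that a real client would provide.

```java
/** Hypothetical per-point classification that a scoring module might report. */
enum PointStatus { BLACK, WHITE, DAME, LIKELY_BLACK, LIKELY_WHITE }

final class TerritoryDebugSketch {
    /** Upper case marks definite territory, lower case marks partial territory. */
    static char symbol(PointStatus status) {
        switch (status) {
            case BLACK:        return 'B';
            case WHITE:        return 'W';
            case LIKELY_BLACK: return 'b';
            case LIKELY_WHITE: return 'w';
            default:           return '.';   // dame
        }
    }

    /** Renders one line of symbols per board row. */
    static String render(PointStatus[][] board) {
        StringBuilder out = new StringBuilder();
        for (PointStatus[] row : board) {
            for (PointStatus status : row) {
                out.append(symbol(status));
            }
            out.append('\n');
        }
        return out.toString();
    }
}
```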
Control over the development process would work in a distributed manner. A core person or small
group of people would control the development of the framework and integration of bug fixes and
enhancements into the framework. They would also review proposals for new modules, and finalize the
specifications for those modules. A person or group who creates a module according to the
specifications would have control over the enhancements for that module, but they would not
have control over the specifications for the module. This would allow other people to develop their
own versions of any particular module. Thus, people can choose to either cooperate on the development
of a given module, or compete in writing the best version of the module. In either case, since the
specification of the module is
the same, any of the different versions could be used interchangeably. In the case of competing
modules, they can be reviewed by the testing group, and the best one would be part of the
production release. A production release could contain multiple versions of the same module,
with the actual one being used being controlled by a configuration file. For example, the user could
be given the option of having the program make the best move, or the fastest move. A tool would
also be provided to allow the advanced user to have the ability to specify which version of each
module is to be used. Generally speaking, the higher the level of the module, the greater the
opportunity for people to write different versions of the module. For example, a module that
estimates the current score of the game would leave much more room for experimentation than a module
that determines the different groups on the board, and how many liberties they each have.
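Selecting an implementation from a configuration file could be sketched as below, using the standard java.util.Properties format. The property key module.score, the implementation names best and fast, and all class names are assumptions made for this example, not the actual Moyoman configuration.

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.HashMap;
import java.util.Map;
import java.util.Properties;
import java.util.function.Supplier;

/** Hypothetical interface for the score module type. */
interface ScorerApi {
    String describe();
}

final class BestScorer implements ScorerApi {
    @Override public String describe() { return "best"; }
}

final class FastScorer implements ScorerApi {
    @Override public String describe() { return "fast"; }
}

final class ModuleSelectorSketch {
    private final Map<String, Supplier<ScorerApi>> registry = new HashMap<>();

    static ModuleSelectorSketch withDefaults() {
        ModuleSelectorSketch selector = new ModuleSelectorSketch();
        selector.registry.put("best", BestScorer::new);
        selector.registry.put("fast", FastScorer::new);
        return selector;
    }

    /** Picks the implementation named in the configuration, falling back to "fast". */
    ScorerApi select(Properties config) {
        String name = config.getProperty("module.score", "fast");
        Supplier<ScorerApi> factory = registry.get(name);
        if (factory == null) {
            throw new IllegalArgumentException("Unknown implementation: " + name);
        }
        return factory.get();
    }

    /** Parses configuration text in the usual key=value properties format. */
    static Properties parse(String text) {
        Properties config = new Properties();
        try {
            config.load(new StringReader(text));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return config;
    }
}
```

A configuration line such as module.score=best would then switch the program from the quick implementation to the strongest one without recompiling anything.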
It would also be important to have a parallel testing and evaluation group, which is responsible
for ensuring that module implementations implement the specifications as intended, and for evaluating
which module of each particular type is best. It would be best to have an independent group which
is not involved in developing any modules to work on the evaluation. Note that some of these
people could be strong Go players without any programming knowledge.
It is possible to allow the interface for modules to evolve over time. That idea is explicitly
rejected. The reason is that if one of the implementers of a module wants to add new functionality,
then all of the other implementations of that same module would have to update their modules, or they
would become obsolete. The much preferred approach would be to have a new module which uses the previous
module, and adds the new functionality. Unlike most software, the requirements for the back end do not
change over time, so it should be possible for a well implemented module to be used for many years
without any changes required. This should make it clear that the most important ongoing
task facing this project is to thoroughly review module specifications before any implementations are done.
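The preferred approach, a new module built on top of an existing one, can be sketched as follows. GroupsApi, AtariApi, and the two classes are hypothetical names chosen for this example; the point is only that the existing interface is left untouched.

```java
/** Hypothetical, already approved interface; it is never changed. */
interface GroupsApi {
    int liberties(int groupId);
}

/** New functionality gets its own module type and interface. */
interface AtariApi {
    boolean inAtari(int groupId);
}

/** Trivial Groups implementation, included only so that the sketch can be run. */
final class FixedGroups implements GroupsApi {
    private final int[] libertiesPerGroup;

    FixedGroups(int[] libertiesPerGroup) { this.libertiesPerGroup = libertiesPerGroup; }

    @Override public int liberties(int groupId) { return libertiesPerGroup[groupId]; }
}

/** The new module uses the old one as an input instead of extending its interface. */
final class AtariModule implements AtariApi {
    private final GroupsApi groups;

    AtariModule(GroupsApi groups) { this.groups = groups; }

    @Override public boolean inAtari(int groupId) { return groups.liberties(groupId) == 1; }
}
```

Every existing implementation of GroupsApi keeps working, and AtariModule works with all of them.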
Features
The following is a list of the features that the Moyoman server program will provide:
- Play a game of 19x19 Go
- Provide a framework for experimenting with different ideas about how to write a Go program
- Provide human readable feedback about how the program evaluates moves
- Provide options for best play, quick play, or tournament play
- Provide internationalization support
- Allow for easy automation of computer - computer play
- Provide a graphical user interface
Play a game of 19x19 Go
The purpose of this program is to play standard Go. Whenever an attempt is made to make software
more flexible, it also becomes more complicated. The design of this program will be complicated
enough without worrying about playing 9x9 or 13x13 Go. The rule set is not hard-coded, and so
the user will be able to choose among a variety of rule sets for playing the game.
Provide a framework for experimenting with different ideas about how to write a Go program
This is the main idea which distinguishes Moyoman from a program such as GNU Go, and will be
discussed in much greater detail throughout the documentation for Moyoman. The basic idea is
to spread design responsibility among a large group of developers, and make the barrier to
contributing code as low as possible.
Provide human readable feedback about how the program evaluates moves
This is used for two major purposes: first to teach the weaker Go player about how to analyze
a board position, and second to provide debugging information. The developer can then use the
graphical debugging information from a module to determine how to improve the algorithm,
and the advanced Go player who is not a software developer can critique the performance of
individual modules without having to know anything about software. Once the program has reached
a certain level of play, this user will be crucial in allowing the software to become better.
Provide options for best play, quick play, or tournament play
Since there can be multiple implementations of the same module, the criteria for comparing these
modules can be somewhat arbitrary. Two of the obvious options are to select the module that produces
the best results regardless of the speed and memory resources required, and, for users with
less powerful computers, the module that gives the best ratio of results to resources. A third option
would be tournament play, which would be used to allow Moyoman to compete in tournaments. This
option would not be important until the program is better than 10-kyu in strength.
Provide internationalization support
There is no common language among Go players. When a user requests human readable feedback
about a module, he has a right to expect that he can understand the information he is
presented with. English will be supported, since it is the only language that the original
developers speak, but Chinese, Korean, and Japanese will be the primary language for many
of the users of the system. It will be important to support major European languages as
well. The Moyoman framework supports internationalization since Unicode is used as the
underlying representation for text. Tools are provided to support internationalization.
Any language for which someone volunteers to act as translator will be supported.
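A minimal sketch of translated module output is shown below, using java.text.MessageFormat from the standard library. The class MessagesSketch, the message key, and the fallback to English are assumptions for illustration; the actual Moyoman tools may work differently, and a real system would more likely load translations from resource files.

```java
import java.text.MessageFormat;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

final class MessagesSketch {
    /** Maps a language code such as "en" or "de" to that language's message patterns. */
    private final Map<String, Map<String, String>> byLanguage = new HashMap<>();

    void add(Locale locale, String key, String pattern) {
        byLanguage.computeIfAbsent(locale.getLanguage(), k -> new HashMap<>()).put(key, pattern);
    }

    /** Formats a message in the requested language, falling back to English. */
    String format(Locale locale, String key, Object... args) {
        String pattern = lookup(locale.getLanguage(), key);
        if (pattern == null) {
            pattern = lookup("en", key);
        }
        if (pattern == null) {
            return key;                       // no translation at all: show the key
        }
        return new MessageFormat(pattern, locale).format(args);
    }

    private String lookup(String language, String key) {
        Map<String, String> table = byLanguage.get(language);
        return table == null ? null : table.get(key);
    }
}
```

A translator then only has to supply patterns such as "Schwarzes Gebiet: {0} Punkte" for each key; no module code needs to change.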
Allow for easy automation of computer - computer play
Developers who are implementing learning algorithms, or testers who want to evaluate different
module implementations would like to have the program play against itself. The framework makes
it easy to automate this process, with two different configurations of the software playing
against itself, changing configurations automatically, or changing handicaps based on the
results of previous games.
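One way to picture the automation is a loop that plays a series of games and adjusts the handicap after each result. GameRunner and SelfPlaySketch are hypothetical names, and the rule of one handicap stone per result is an assumption made only to keep the example small.

```java
/** Hypothetical hook: plays one full game and reports whether configuration A won. */
interface GameRunner {
    boolean configAWins(int handicapGivenByA);
}

final class SelfPlaySketch {
    static final int MAX_HANDICAP = 9;

    /**
     * Plays a series of games between two configurations. Configuration A gives
     * one more stone after each win and one fewer after each loss.
     * Returns the handicap reached at the end of the series.
     */
    static int runSeries(GameRunner runner, int games, int startHandicap) {
        int handicap = startHandicap;
        for (int i = 0; i < games; i++) {
            boolean aWon = runner.configAWins(handicap);
            handicap = aWon
                    ? Math.min(MAX_HANDICAP, handicap + 1)
                    : Math.max(0, handicap - 1);
        }
        return handicap;
    }
}
```

Over a long series the handicap settles around the point where the two configurations win about equally often, which gives a rough measure of their difference in strength.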
Provide a graphical user interface
Although the focus of this project is on the back-end move generation algorithms, it is important
that the end user can view the results of the game in an easy to understand format. Since the
Moyoman program generates debug information in a way that is not supported by any existing client,
it is necessary to provide a client that the end user can run.
Framework
When the philosophy of incrementally computing more information about the board is considered in
conjunction with having many people involved in developing the different software modules, the
necessity for a framework for managing these modules is clear. The software framework will provide
the following features:
- Provide a standard base class that all modules must extend.
- Provide a standard interface for each specific module type.
- Provide a multi-threaded environment in which the modules can run.
- Determine the correct order in which to run the modules based on the dependencies among them.
- Allow games to be saved and restored from disk or the network.
In order to allow the different modules to work together, a base class will be defined which each
module must extend. It will provide for things such as letting the module register with the
framework and notifying the module when it is time to start processing the next move.
When loading a particular module, e.g., Endgame, the framework defines the interface that
the Endgame module must implement. The module is required to implement that interface exactly,
and not have any additional public methods.
In order to allow the different modules to run efficiently, the framework will start multiple
modules each in its own thread. Different modules may take different amounts of time to run, so new
modules can be started when others have completed their work.
Almost all modules will be dependent on the work of other modules. It will be the job of the
framework to determine the order in which the modules need to run based on their mutual dependencies.
This will also determine the extent to which multi-threading can occur at any given time.
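Dependency-driven scheduling of this kind can be sketched with the standard java.util.concurrent classes. The class ModuleSchedulerSketch and its method are hypothetical, and the sketch assumes that modules are supplied in an order in which every prerequisite appears before the modules that use it.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

final class ModuleSchedulerSketch {
    /**
     * Runs every module once. A module starts as soon as all of the modules it
     * depends on have finished, so independent modules can run in parallel.
     * The iteration order of deps must list prerequisites first.
     */
    static void runAll(Map<String, List<String>> deps, Map<String, Runnable> work, int threads) {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            Map<String, CompletableFuture<Void>> started = new HashMap<>();
            for (String name : deps.keySet()) {
                List<CompletableFuture<Void>> prerequisites = new ArrayList<>();
                for (String dependency : deps.get(name)) {
                    CompletableFuture<Void> f = started.get(dependency);
                    if (f == null) {
                        throw new IllegalStateException(name + " depends on a later module: " + dependency);
                    }
                    prerequisites.add(f);
                }
                CompletableFuture<Void> ready =
                        CompletableFuture.allOf(prerequisites.toArray(new CompletableFuture[0]));
                started.put(name, ready.thenRunAsync(work.get(name), pool));
            }
            // Wait until the last module has finished before returning.
            CompletableFuture.allOf(started.values().toArray(new CompletableFuture[0])).join();
        } finally {
            pool.shutdown();
        }
    }
}
```

With Groups and Shape both depending only on Board, the two can run at the same time, while LifeAndDeath waits for both of them.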
The framework should also have the ability to save the current state of the game, and the
ability to read it back later and restart the game from that point. This can be used to implement
the taking back of moves, and also for crash recovery.
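Saving and restoring can be illustrated with standard Java object serialization. The class GameStateSketch is hypothetical and stores only a list of moves; a real game state would also include the saved analysis of each module. A byte array is used here in place of a file or network stream.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;

final class GameStateSketch implements Serializable {
    private static final long serialVersionUID = 1L;

    private final List<Integer> moves = new ArrayList<>();

    void play(int point) { moves.add(point); }

    int moveCount() { return moves.size(); }

    int lastMove() { return moves.get(moves.size() - 1); }

    /** Writes the whole state to a byte array. */
    byte[] save() {
        try (ByteArrayOutputStream bytes = new ByteArrayOutputStream();
             ObjectOutputStream out = new ObjectOutputStream(bytes)) {
            out.writeObject(this);
            out.flush();
            return bytes.toByteArray();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** Reads back a state that was written by save(). */
    static GameStateSketch restore(byte[] data) {
        try (ObjectInputStream in = new ObjectInputStream(new ByteArrayInputStream(data))) {
            return (GameStateSketch) in.readObject();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        } catch (ClassNotFoundException e) {
            throw new IllegalStateException(e);
        }
    }
}
```

Taking back a move then amounts to restoring a snapshot that was saved before the move was played.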
The Java Programming Language
It has been decided to implement the Moyoman server program in Java for the following reasons:
- The program can be distributed in a ready to run, operating system independent format.
- It is easy to support system dependent functionality such as sockets and threads without
having to write platform-dependent code.
- It is easy to save objects as a serial stream and read them back in again.
Any inefficiency caused by running Java code will be more than offset by the accessibility
of the program. It is a goal of this project that setting up and running Moyoman should be as
easy as possible for the novice computer user. This means that the program should
not require many peripheral software products to run, such as web servers, databases, etc.
Installation on a single computer should be a point and click process to encourage the use
of the program.
A Word about the Design Philosophy
Often, when designing software using Java, the solution can become extremely complex.
Creating a distributed, fault-tolerant server using J2EE technology can involve having at
least a half dozen different middleware products to take advantage of such technologies
as Enterprise JavaBeans, Java Message Service, etc. For almost all users of our software, having a distributed,
fault-tolerant system is not a requirement. The typical user will be running the program on
their PC, and there will be just one human player. Our solution will be to make our software as
easy to install and configure as possible. There will be a single executable jar file which contains
the clients and a server. Installation will involve copying files
to a directory. Running the software will involve double-clicking on a jar file. This rule of
simplicity will not be violated without a compelling reason. For example, if a browser based
interface is to be implemented, then running a web server and servlet engine would be required
as part of installation. This optional installation would be more involved, but would not
make the standard installation any more complicated. This would be an option because it is valuable,
and provides useful functionality that a single executable program does not. We will not use EJB, JMS,
etc. unless the resulting system is no more complicated than the solution
already described, or unless they are part of an optional, advanced system in which the additional
functionality provided justifies the extra complexity.
Modules
The nuts and bolts of the software are the modules. These are small self contained units of code
that perform one Go related type of analysis. Whatever analysis has already been done is saved until
the next move is passed in as a parameter to a method, and analysis continues at that point.
Modules would mostly represent Go specific concepts, such as joseki or life and death, but they
could also represent concepts that are not Go specific, such as a geometric analysis of the board
to determine which groups are adjacent to other groups. The important points to note about modules are:
- A standard interface for a module is created and approved by the core development team, with
input from all interested members of the development community.
- The dependencies among modules are not part of the interface of a module, so one implementation
of the module that determines the score might use the results of the life and death module, and
another might not.
- The dependencies among modules are specified dynamically, and the order in which module types
are executed, and which module implementations are executed for a given module type are determined
at run-time by the framework.
- Module types are given an order, so that the subset of module types for which a given module
can use the results is defined. This prevents any circular dependencies from occurring.
Conclusion
By having a software framework that allows for the work of many different people to be integrated
together, the possibility of writing a high quality Go playing program is greatly enhanced. People
can contribute in the area where their talents and interests lie. People interested in low level
problems such as life and death or joseki can work on those areas. People interested in less well
defined problem areas could try to implement versions of shape or fight or kamikaze. Those with high level
interests can try their creativity by attempting to implement the strategy or move generator modules.
Still others could work on the specifications for new modules without actually implementing them.
Those who have an interest in user interface design can develop new front ends for the server.
The non-programmers can contribute by evaluating the output of each module in a given situation,
or by evaluating the play as a whole and critiquing the results. Others can translate the output strings
into different languages. By combining the efforts of many different people,
it may be possible to generate a high level of play within a reasonable time frame.