In graph theory, graph coloring is an arbitrary assignment of labels (colors) to objects within a graph. Such objects can be vertices, edges, faces, or a mixture of those. A graph coloring is distinct from a graph labeling in that in the former, the same label may be used more than once.
Graph coloring is a rich source of theoretical questions and practical applications. It has found applications in areas such as scheduling, register allocation in compilers, frequency assignment in mobile radios, and pattern matching. It has even become popular with the general public in the form of the popular number puzzle Sudoku. Graph coloring is still a very active field of research.
Vertex coloring, the process of assigning a color to each vertex of a graph, is one of the most fundamental types of coloring. Other coloring problems can be transformed into a vertex coloring problems: for example, an edge coloring of a graph is just the vertex coloring of its line graph. Likewise, a face coloring of a planar graph is just the vertex coloring of its (planar) dual.
When used without qualification, the term graph coloring generally means vertex coloring. Also (unless otherwise specified), a coloring is nearly always assumed to be proper, meaning that no two adjacent vertices are assigned the same color.
A coloring using at most k colors is called a (proper) k-coloring and is equivalent to the problem of partitioning the vertex set into k or fewer independent sets.
The least number of colors needed to color the graph is called its chromatic number χ. For example, the chromatic number of a complete graph of n vertices is . A graph that can be assigned a (proper) k-coloring is k-colorable, and it is k-chromatic if its chromatic number is exactly k.
The problem of finding a minimum coloring of a graph is NP-hard. The corresponding decision problem (is there a coloring which uses at most colors?) is NP-complete, and in fact was one of Karp's 21 NP-complete problems. It remains NP-complete even on planar graphs of degree at most 4, as shown by Garey and Johnson in 1974, although on planar graphs it's trivial for k ≠ 3 (due to the four color theorem). There are however some efficient approximation algorithms that employ semidefinite programming.
Some properties of χ(G):
Here Δ(G) is the maximum degree, and ω(G), the clique number.
About infinite graphs, much less is known. The following is one of the few results about infinite graph coloring:
The chromatic polynomial counts the number of ways a graph can be coloured using no more than a given number of colours. For example, using three colours, the graph in the image to the right can be coloured in 12 ways. With only two colours, it cannot be coloured at all. With four colours, it can be coloured in 24 + 4⋅12 = 72 ways: using all four colours, there are 4! = 24 valid colourings (every assignment of four colors to any 4-vertex graph is a proper coloring); and for every choice of three of the four colours, there are 12 valid 3-colourings. So, for the graph in the example, a table of the number of valid colourings would start like this:
Available colors | 1 | 2 | 3 | 4 | … |
Number of colorings | 0 | 0 | 12 | 72 | … |
The chromatic polynomial is a function that counts the number of -colorings of . As the name indicates, for a given the function is indeed a polynomial in . For the example graph, , and indeed .
The chromatic polynomial includes at least as much information about the colorability of as does the chromatic number. Indeed, χ is the smallest positive integer that is not a root of the chromatic polynomial,
It was first used by Birkhoff and Lewis in their attack on the four-color theorem. It remains an unsolved problem to characterize graphs which have the same chromatic polynomial and to determine precisely what polynomials are chromatic.
Triangle | |
Complete graph | ... |
Tree with vertices | |
Cycle | |
Petersen graph |
Whenever contains a loop, it cannot be legally colored at all, so
If not a loop, then the chromatic polynomial satisifes the recurrence relation
where is the graph with the edge removed, and is the graph with the edge's endpoints contracted into a single vertex.
The two expressions give raise to a recursive procedure, called the deletion–contraction algorithm. In the recursive step, the instance is transformed into two instances that are at least one edge smaller, so the running time comes within a polynomial factor of , where is the number of edges. The analysis can be improved to .
Other algorithms are known, but all are exponential in the size of the graph. For some classes of graphs, polynomial-time algorithms are known. These include the empty and complete graphs, forests, chordal graphs, wheels, and ladders. In general, computing the chromatic polynomial is Sharp-P-complete, so it is unlikely that a polynomial time algorithm for all graphs will be found.
Not only can the idea of vertex coloring be extended to edges, but also be added with different conditions to form new structures and problems.
Some improper colorings:
Main Article: register allocation by graph coloring
In Computer Science, compilers---computer programs which translate one formal language to another---can employ vertex coloring to determine a satisfactory solution to the problem of register allocation. Vertex coloring is the most popular approach, and is one of the most effective.
Typically, a compiler will translate a source written in a high-level language (HLL) to a target in assembly language or machine code. There are a few critical differences between the source and target languages. In particular, HLLs allow for a large or unlimited number of variables, while the typical microprocessor only allows for efficient access to a small number of variables, called registers, and inefficient access to main memory. Since only a small percentage of the variables within a computer program are used at one time, there is an opportunity to place multiple variables into the same register---but only if those variables are not used at the same time.
The register allocation problem can be viewed as a vertex coloring problem as follows: every variable in a program is treated as a vertex in an undirected graph. Then, if at any point during program execution two variables each hold some value that will later be read, then those two variables are said to interfere, and an edge is drawn between them. Unfortunately, it is not always possible to determine if two variables interfere (a consequence of the halting problem), and thus a compiler must act conservatively and assume that they do interfere unless they can be shown not to. This graph is called the interference graph.
As was mentioned earlier, determining a -coloring for a graph is NP-Hard. Nonetheless, the compiler must solve such a problem. Thus, a compiler uses a heuristic algorithm to produce a non-optimal solution to the vertex coloring problem, attempting to minimize the number of colors used. If, ultimately, the number of colors used is greater than the number of registers in the target language, then the graph must be transformed to reduce the number of necessary colors. A compiler accomplishes this by spilling one or more variables, storing them in a memory location instead of a register.