Shortest Path

Learning Outcomes

  • Find the shortest path through a graph using Dijkstra’s Algorithm

When you visit a website like Google Maps or use your Smartphone to ask for directions from home to your Aunt’s house in Pasadena, you are usually looking for a shortest path between the two locations. These computer applications use representations of the street maps as graphs, with estimated driving times as edge weights.

While often it is possible to find a shortest path on a small graph by guess-and-check, our goal in this chapter is to develop methods to solve complex problems in a systematic way by following algorithms. An algorithm is a step-by-step procedure for solving a problem. Dijkstra’s (pronounced dike-stra) algorithm will find the shortest path between two vertices.

Efficiency

Efficiency is a hallmark of mathematical practice. When mathematicians seek a proof for something they have conjectured to be true, the most elegant proof is often the most efficient one: an argument that packs the most information in the fewest words, so to speak.

The ideas explored in graph theory are frequently applied to computing algorithms: the language and instructions of software.  Since resources are limited (time, computing power), mathematicians and computer scientists seek the most efficient ways to compute. Graph theory helps them find the shortest path from A to B.

Dijkstra’s Algorithm

1.     Mark the ending vertex with a distance of zero. Designate this vertex as current.

2.     Find all vertices leading to the current vertex. Calculate their distances to the end. Since we already know the distance the current vertex is from the end, this will just require adding the most recent edge. Don’t record this distance if it is longer than a previously recorded distance.

3.     Mark the current vertex as visited. We will never look at this vertex again.

4.     Mark the vertex with the smallest distance as current, and repeat from step 2.

 

EXAMPLE

Suppose you need to travel from Tacoma, WA (vertex T) to Yakima, WA (vertex Y). Looking at a map, it looks like driving through Auburn (A) then Mount Rainier (MR) might be shortest, but it’s not totally clear since that road is probably slower than taking the major highway through North Bend (NB). A graph with travel times in minutes is shown below. An alternate route through Eatonville (E) and Packwood (P) is also shown.

Graph with 7 vertices marked T, A, NB, Y, MR, P, and E. The length between NB and Y is labeled 104, between NB and A is 36, between A and T is 20. Between T and E is labeled 57, and between E and P is 96. Length between P and Y is labeled 76. Between P and MR is labeled 27, and between MR and A is 79.

Step 1: Mark the ending vertex with a distance of zero. The distances will be recorded in [brackets] after the vertex name

Graph with 7 vertices marked T, A, NB, Y, MR, P, and E. The length between NB and Y is labeled 104, between NB and A is 36, between A and T is 20. Between T and E is labeled 57, and between E and P is 96. Length between P and Y is labeled 76. Between P and MR is labeled 27, and between MR and A is 79. Y is additionally labeled with [0].

Step 2: For each vertex leading to Y, we calculate the distance to the end. For example, NB is a distance of 104 from the end, and MR is 96 from the end. Remember that distances in this case refer to the travel time in minutes.

Graph with 7 vertices marked T, A, NB, Y, MR, P, and E. teh length between NB adn Y is labeled 104, between NB and A is 36, between A and T is 20. Between T and E is labeled 57, and between E and P is 96. Length between P and Y is labeled 76. Between P and MR is labeled 27, and between MR and A is 79. Additionally, Y is labeled with [0], NB is labeled with [104], MR is labeled with [96], and P is labeled with [76].

Step 3 & 4: We mark Y as visited, and mark the vertex with the smallest recorded distance as current. At this point, P will be designated current. Back to step 2.

Step 2 (#2): For each vertex leading to P (and not leading to a visited vertex) we find the distance from the end. Since E is 96 minutes from P, and we’ve already calculated P is 76 minutes from Y, we can compute that E is 96+76 = 172 minutes from Y.

Graph with 7 vertices marked T, A, NB, Y, MR, P, and E. The length between NB and Y is labeled 104, between NB and A is 36, between A and T is 20. Between T and E is labeled 57, and between E and P is 96. Length between P and Y is labeled 76. Between P and MR is labeled 27, and between MR and A is 79. Additionally, Y is labeled with [0], NB is labeled with [104], MR is labeled with [96], and P is labeled with [76], and E is labeled with [172].

Step 3 & 4 (#2): We mark P as visited, and designate the vertex with the smallest recorded distance as current: MR. Back to step 2.

Step 2 (#3): For each vertex leading to MR (and not leading to a visited vertex) we find the distance to the end. The only vertex to be considered is A, since we’ve already visited Y and P. Adding MR’s distance 96 to the length from A to MR gives the distance 96+79 = 175 minutes from A to Y.

Graph with 7 vertices marked T, A, NB, Y, MR, P, and E. The length between NB and Y is labeled 104, between NB and A is 36, between A and T is 20. Between T and E is labeled 57, and between E and P is 96. Length between P and Y is labeled 76. Between P and MR is labeled 27, and between MR and A is 79. Additionally, Y is labeled with [0], NB is labeled with [104], MR is labeled with [96], and P is labeled with [76], and E is labeled with [172], and A is labeled with [175]

Step 3 & 4 (#3): We mark MR as visited, and designate the vertex with smallest recorded distance as current: NB. Back to step 2.

Step 2 (#4):  For each vertex leading to NB, we find the distance to the end.  We know the shortest distance from NB to Y is 104 and the distance from A to NB is 36, so the distance from A to Y through NB is 104+36 = 140.  Since this distance is shorter than the previously calculated distance from Y to A through MR, we replace it.

Graph with 7 vertices marked T, A, NB, Y, MR, P, and E. The length between NB and Y is labeled 104, between NB and A is 36, between A and T is 20. Between T and E is labeled 57, and between E and P is 96. Length between P and Y is labeled 76. Between P and MR is labeled 27, and between MR and A is 79. Additionally, Y is labeled with [0], NB is labeled with [104], MR is labeled with [96], and P is labeled with [76], and E is labeled with [172]. A is labeled with [140]

Step 3 & 4 (#4): We mark NB as visited, and designate A as current, since it now has the shortest distance.

Step 2 (#5): T is the only non-visited vertex leading to A, so we calculate the distance from T to Y through A: 20+140 = 160 minutes.

Graph with 7 vertices marked T, A, NB, Y, MR, P, and E. The length between NB and Y is labeled 104, between NB and A is 36, between A and T is 20. Between T and E is labeled 57, and between E and P is 96. Length between P and Y is labeled 76. Between P and MR is labeled 27, and between MR and A is 79. Additionally, Y is labeled with [0], NB is labeled with [104], MR is labeled with [96], and P is labeled with [76], and E is labeled with [172]. T is labeled with [160]

Step 3 & 4 (#5): We mark A as visited, and designate E as current.

 

Step 2 (#6): The only non-visited vertex leading to E is T. Calculating the distance from T to Y through E, we compute 172+57 = 229 minutes. Since this is longer than the existing marked time, we do not replace it.

 

Step 3 (#6): We mark E as visited. Since all vertices have been visited, we are done.

From this, we know that the shortest path from Tacoma to Yakima will take 160 minutes. Tracking which sequence of edges yielded 160 minutes, we see the shortest path is T-A-NB-Y.

Dijkstra’s algorithm is an optimal algorithm, meaning that it always produces the actual shortest path, not just a path that is pretty short, provided one exists. This algorithm is also efficient, meaning that it can be implemented in a reasonable amount of time. Dijkstra’s algorithm takes around V2 calculations, where V is the number of vertices in a graph[1]. A graph with 100 vertices would take around 10,000 calculations. While that would be a lot to do by hand, it is not a lot for computer to handle. It is because of this efficiency that a GPS unit can compute driving directions in only a few seconds.

[1] It can be made to run faster through various optimizations to the implementation.

In contrast, an inefficient algorithm might try to list all possible paths then compute the length of each path. Trying to list all possible paths could easily take 1025 calculations to compute the shortest path with only 25 vertices; that’s a 1 with 25 zeros after it! To put that in perspective, the fastest computer in the world would still spend over 1000 years analyzing all those paths.

 

EXAMPLE

A shipping company needs to route a package from Washington, D.C. to San Diego, CA. To minimize costs, the package will first be sent to their processing center in Baltimore, MD then sent as part of mass shipments between their various processing centers, ending up in their processing center in Bakersfield, CA. From there it will be delivered in a small truck to San Diego.

The travel times, in hours, between their processing centers are shown in the table below. Three hours has been added to each travel time for processing. Find the shortest path from Baltimore to Bakersfield.

Baltimore Denver Dallas Chicago Atlanta Bakersfield
Baltimore * 15 14
Denver * 18 24 19
Dallas * 18 15 25
Chicago 15 18 18 * 14
Atlanta 14 24 15 14 *
Bakersfield 19 25 *

While we could draw a graph, we can also work directly from the table.

Step 1: The ending vertex, Bakersfield, is marked as current.

Step 2: All cities connected to Bakersfield, in this case Denver and Dallas, have their distances calculated; we’ll mark those distances in the column headers.

Step 3 & 4: Mark Bakersfield as visited. Here, we are doing it by shading the corresponding row and column of the table. We mark Denver as current, shown in bold, since it is the vertex with the shortest distance.

Baltimore Denver[19] Dallas[25] Chicago Atlanta Bakersfield[0]
Baltimore * 15 14
Denver * 18 24 19
Dallas * 18 15 25
Chicago 15 18 18 * 14
Atlanta 14 24 15 14 *
Bakersfield 19 25 *

Step 2 (#2): For cities connected to Denver, calculate distance to the end. For example, Chicago is 18 hours from Denver, and Denver is 19 hours from the end, the distance for Chicago to the end is 18+19 = 37 (Chicago to Denver to Bakersfield). Atlanta is 24 hours from Denver, so the distance to the end is 24+19 = 43 (Atlanta to Denver to Bakersfield).

Step 3 & 4 (#2): We mark Denver as visited and mark Dallas as current.

Baltimore Denver[19] Dallas[25] Chicago[37] Atlanta[43] Bakersfield[0]
Baltimore * 15 14
Denver * 18 24 19
Dallas * 18 15 25
Chicago 15 18 18 * 14
Atlanta 14 24 15 14 *
Bakersfield 19 25 *

Step 2 (#3): For cities connected to Dallas, calculate the distance to the end. For Chicago, the distance from Chicago to Dallas is 18 and from Dallas to the end is 25, so the distance from Chicago to the end through Dallas would be 18+25 = 43. Since this is longer than the currently marked distance for Chicago, we do not replace it. For Atlanta, we calculate 15+25 = 40. Since this is shorter than the currently marked distance for Atlanta, we replace the existing distance.

Step 3 & 4 (#3): We mark Dallas as visited, and mark Chicago as current.

Baltimore Denver[19] Dallas[25] Chicago[37] Atlanta[40] Bakersfield[0]
Baltimore * 15 14
Denver * 18 24 19
Dallas * 18 15 25
Chicago 15 18 18 * 14
Atlanta 14 24 15 14 *
Bakersfield 19 25 *

Step 2 (#4): Baltimore and Atlanta are the only non-visited cities connected to Chicago. For Baltimore, we calculate 15+37 = 52 and mark that distance. For Atlanta, we calculate 14+37 = 51. Since this is longer than the existing distance of 40 for Atlanta, we do not replace that distance.

Step 3 & 4 (#4): Mark Chicago as visited and Atlanta as current.

Baltimore[52] Denver[19] Dallas[25] Chicago[37] Atlanta[40] Bakersfield[0]
Baltimore * 15 14
Denver * 18 24 19
Dallas * 18 15 25
Chicago 15 18 18 * 14
Atlanta 14 24 15 14 *
Bakersfield 19 25 *

Step 2 (#5): The distance from Atlanta to Baltimore is 14. Adding that to the distance already calculated for Atlanta gives a total distance of 14+40 = 54 hours from Baltimore to Bakersfield through Atlanta. Since this is larger than the currently calculated distance, we do not replace the distance for Baltimore.

Step 3 & 4 (#5): We mark Atlanta as visited. All cities have been visited and we are done.

The shortest route from Baltimore to Bakersfield will take 52 hours, and will route through Chicago and Denver.

 

Try It

  • Find the shortest path between vertices A and G in the graph below.

Graph with 7 vertices labeled A, B, C, D, E, F, G. Edge between A, B is labeled 1, edge between A, C is 4, edge between B, E is 6, edge between E, G is 7, edge between G, F is 6, edge between f, c is 5, edge between c, a is 4, edge between c, d is 2, edge between b, d is 3 edge between e, d is 2, edge between e, f is 2, edge between f, d is 4.

The following video summarizes the topics covered on this page.