Merge sort

What is a merge sort?

Merge sort is an example of a divide and conquer algorithm that reduces the size of the problem into smaller problems that are more easily solvable by an algorithm
Specifically, the merge sort divides the list in half, creating two sub lists. Each sub list is then divided into two until each sub list contains one item or element
Groups of two sub lists are then sorted and merged together to form new, larger sub lists until all sub lists have been merged into a single sorted list
The merge sort therefore contains two parts:
- Part one: Divide the list into sub lists until each sub list contains only one element
- Part two: Repeatedly merge groups of two and sort them, until all lists have been merged into a single sorted list
The merge sort has been illustrated below using a list of ten items: [15, 5, 2, 7, 4, 9, 10, 1, 8, 3]

performing-the-merge-sort-revision-notes-computer-science Figure 1: Performing the Merge sort

Notice that halving the list sometimes produces odd numbers of elements in sub lists. When dividing, all sub lists must contain a single element before merging can begin.
When merging, only two sub lists can be merged at a time. Any left over sub lists are merged in the next merging iteration
In order to merge two sub lists, two pointers are required, one for each sub list. The pointer keeps track of which elements are to be compared. Once an element has been merged into the new list, the corresponding pointer is incremented. The process continues until a list contains no elements to compare, at which point the remaining elements are appended to the end of the new merged sub list.

Time complexity of a merge sort

To determine the algorithm’s execution time as measured by the number of instructions or steps carried out Big-O notation must be used
The time complexity of a merge sort is dependent on its divide and conquer nature $O (l o g n)$ and the fact that n sub lists must be merged. This means the overall worst case time complexity of the merge sort is $O (n l o g n)$
The best case and average case scenario time complexity of the merge sort is $O (n l o g n)$ as regardless of the number of elements or item, n must still be merged and halved

Space complexity of a merge sort

The merge sort requires twice the amount of space of other sorting algorithms due to holding copies of the left and right halves of the list
Its space complexity is therefore $O (n)$ original space and an additional $O (n)$ space to store the copies of the list

Tracing a merge sort

In the trace table below, several calls to merge sort are made which splits the list. Each call generates a stack frame which is put on the stack. Stack frames are removed when the current stack frame reaches the end of its code
The list is gradually broken down into sub lists, starting with the left side of each sub list. Sub lists are created until each sub list contains only one element which is the base case. When a base case is reached the stack frame is removed and the program continues execution on the line after the call to merge sort
Each sub list is gradually merged, two at a time and passed up the stack to be merged with another sub list in a later merge sort call
Pointers are incremented during the merge such that the next element to merge in each sub list is tracked and appropriately added to the new list. NewListPointer tracks the position of elements in the new list

Trace table for the merge sort

Row	Stack frame	len(list) > 1?	left	right	Split left, right or merge?	newList	mid	Left Pointer	Right Pointer	newListPointer
1	1	TRUE	15, 5, 2, 7, 4	9, 10, 1, 8, 3	LEFT		5
2	2	TRUE	15, 5	2, 7, 4	LEFT		2
3	3	TRUE	15	5	LEFT		0
4	4	FALSE			BASE CASE
5	3	TRUE	15	5	RIGHT		0
6	4	FALSE			BASE CASE
7	3	TRUE	15	5	MERGE	[5, 15]	0	0	0	0
8	3	TRUE	2	7, 4	LEFT		1
9	4	FALSE			BASE CASE
10	3	TRUE	7	4	LEFT		0
11	4	FALSE			BASE CASE
12	3	TRUE	7	4	RIGHT		0
13	4	FALSE			BASE CASE
14	3	TRUE	7	4	MERGE	[4, 7]	0	0	0	0
15	3	TRUE	2	4, 7	MERGE		1	0	0	0
16						[2]		1		1
17						[2, 4]			1	2
18						[2, 4, 7]			2	3
19	3	TRUE	5, 15	2, 4, 7	MERGE		2	0	0	0
20						[2]			1	1
21						[2, 4]			2	2
22						[2, 4, 5]		1	2	3
23						[2, 4, 5, 7]		1	3	4
24						[2, 4, 5, 7, 15]		1	4	5
25	2	TRUE	9, 10	1, 8 3	LEFT		2

Examiner Tips and Tricks

Merge sort is a complex algorithm. You will not be asked to derive its time or space complexity in detail.

You will however be expected to know and justify its time and space complexity

Implementing a merge sort

procedure mergesort(list)
	if len(list) > 1 then
		mid = len(list) div 2
		left = list[:mid]
		right = list[mid:]
		
		mergesort(left)
		mergesort(right)
		
		leftPointer = 0
		rightPointer = 0
		newListPointer = 0
		while leftPointer < len(left) and rightPointer < len(right)
			if left[leftPointer] < right[rightPointer] then
				list[newListPointer] = left[leftPointer]
				leftPointer = leftPointer + 1
			else
				list[newListPointer] = right[rightPointer]
				rightPointer = rightPointer + 1
			endif
		
			newListPointer = newListPointer + 1
		endwhile
		
		while leftPointer < len(left)
			list[newListPointer] = left[leftPointer]
			leftPointer = leftPointer + 1
			newListPointer = newListPointer + 1
		endwhile

		while rightPointer < len(right)
			list[newListPointer] = right[rightPointer]
			rightPointer = rightPointer + 1
			newListPointer = newListPointer + 1
		endwhile
	endif
endprocedure

list = [15, 5, 2, 7, 4, 9, 10, 1, 8, 3]
mergesort(list)
print(list)

It is important to note that merge sort relies on recursion to function
To divide the list into smaller sub lists, the merge sort algorithm must call itself repeatedly, using a base case of one element
Each recursive call is added to the call stack
Once the list is fully divided into individual elements, the stack frames are removed (popped off) and processed in reverse order to merge the sub lists back together in sorted order
It is also important to understand the concept of slicing in Python:
- [:mid] means all elements up to, but not including, the midindex
- [mid:] means all elements from the mid index to the end of the list
This list-splitting technique is known as slicing
Once the list has been split, and no more recursive calls are made, three new variables are defined per stack frame:
- leftPointer (keeps track of the left sub list)
- rightPointer (keeps track of the right sub list)
- newListPointer (keeps track of the next available space in the sorted list)
The new list is merged in two stages:
- Stage one: Merge all elements until one list is empty
  - The first while loop manages this process. The left and right pointers increment over the left and right lists, comparing elements one at a time. If one is smaller than the other then it is placed in the next available space in the new list
- Stage two: Append the non-empty list’s remaining elements to the new list
  - This is managed by the final two while loops. One merges if the left half has remaining elements. The other merges if the right half has remaining elements. Only one will be called, not both

def merge_sort(list):
  if len(list) > 1:
    mid = len(list) // 2
    left = list[:mid]
    right = list[mid:]
    merge_sort(left)
    merge_sort(right)
    left_pointer, right_pointer, new_list_pointer = 0, 0, 0
    new_list = [None] * len(list)
    while left_pointer < len(left) and right_pointer < len(right):
      if left[left_pointer] < right[right_pointer]:
        new_list[new_list_pointer] = left[left_pointer]
        left_pointer = left_pointer + 1
      else:
        new_list[new_list_pointer] = right[right_pointer]
        right_pointer = right_pointer + 1
      new_list_pointer = new_list_pointer + 1
    while left_pointer < len(left):
      new_list[new_list_pointer] = left[left_pointer]
      left_pointer = left_pointer + 1
      new_list_pointer = new_list_pointer + 1
    while right_pointer < len(right):
      new_list[new_list_pointer] = right[right_pointer]
      right_pointer = right_pointer + 1
      new_list_pointer = new_list_pointer + 1
    list[:] = new_list
  return list
 
list = [15, 5, 2, 7, 4, 9, 10, 1, 8, 3]
merge_sort(list)
print(list)  # Output: [1, 2, 3, 4, 5, 7, 8, 9, 10, 15]

CS Notes

Explorer

Merge sort

What is a merge sort?

Time complexity of a merge sort

Space complexity of a merge sort

Tracing a merge sort

Trace table for the merge sort

Implementing a merge sort

Graph View

Table of Contents

Backlinks