Not understanding slice length and capacity
It’s pretty common for Go developers to mix slice length and capacity or not understand them thoroughly. Assimilating these two concepts is essential for efficiently handling core operations such as slice initialization and adding elements with append, copying, or slicing. This misunderstanding can lead to using slices suboptimally or even to memory leaks.
In Go, a slice is backed by an array. That means the slice’s data is stored contiguously in an array data structure. A slice also handles the logic of adding an element if the backing array is full or shrinking the backing array if it’s almost empty.
Internally, a slice holds a pointer to the backing array plus a length and a capacity. The length is the number of elements the slice contains, whereas the capacity is the number of elements in the backing array, counting from the first element in the slice. Let’s go through a few examples to make things clearer. First, let’s initialize a slice with a given length and capacity:
The first argument, representing the length, is mandatory. However, the second argument representing the capacity is optional. Figure 1 shows the result of this code in memory.
In this case,
make creates an array of six elements (the capacity). But because the length was set to 3, Go initializes only the first three elements. Also, because the slice is an
int type, the first three elements are initialized to the zeroed value of an
int: 0. The grayed elements are allocated but not yet used.
If we print this slice, we get the elements within the range of the length,
[0 0 0]. If we set
s to 1, the second element of the slice updates without impacting its length or capacity. Figure 2 illustrates this.
However, accessing an element outside the length range is forbidden, even though it’s already allocated in memory. For example,
s = 0 would lead to the following panic:
How can we use the remaining space of the slice? By using the
append built-in function:
This code appends to the existing
s slice a new element. It uses the first grayed element (which was allocated but not yet used) to store element 2, as figure 3 shows.
The length of the slice is updated from 3 to 4 because the slice now contains four elements. Now, what happens if we add three more elements so that the backing array isn’t large enough?
If we run this code, we see that the slice was able to cope with our request:
Because an array is a fixed-size structure, it can store the new elements until element 4. When we want to insert element 5, the array is already full: Go internally creates another array by doubling the capacity, copying all the elements, and then inserting element 5. Figure 4 shows this process.
The slice now references the new backing array. What will happen to the previous backing array? If it’s no longer referenced, it’s eventually freed by the garbage collector (GC) if allocated on the heap. (We discuss heap memory in mistake #95, “Not understanding stack vs. heap,” and we look at how the GC works in mistake #99, “Not understanding how the GC works.”)
What happens with slicing? Slicing is an operation done on an array or a slice, providing a half-open range; the first index is included, whereas the second is excluded. The following example shows the impact, and figure 5 displays the result in memory:
s1 is created as a three-length, six-capacity slice. When
s2 is created by slicing
s1, both slices reference the same backing array. However,
s2 starts from a different index, 1. Therefore, its length and capacity (a two-length, five-capacity slice) differ from s1. If we update
s2, the change is made to the same array, hence, visible in both slices, as figure 6 shows.
Now, what happens if we append an element to
s2? Does the following code change
s1 as well?
The shared backing array is modified, but only the length of
s2 changes. Figure 7 shows the result of appending an element to
s1 remains a three-length, six-capacity slice. Therefore, if we print
s2, the added element is only visible for
It’s important to understand this behavior so that we don’t make wrong assumptions while using append.
In these examples, the backing array is internal and not available directly to the Go developer. The only exception is when a slice is created from slicing an existing array.
One last thing to note: what if we keep appending elements to
s2 until the backing array is full? What will the state be, memory-wise? Let’s add three more elements so that the backing array will not have enough capacity:
This code leads to creating another backing array. Figure 8 displays the results in memory.
s2 now reference two different arrays. As
s1 is still a three-length, six-capacity slice, it still has some available buffer, so it keeps referencing the initial array. Also, the new backing array was made by copying the initial one from the first index of
s2. That’s why the new array starts with element 1, not 0.
To summarize, the slice length is the number of available elements in the slice, whereas the slice capacity is the number of elements in the backing array. Adding an element to a full slice (length == capacity) leads to creating a new backing array with a new capacity, copying all the elements from the previous array, and updating the slice pointer to the new array.