Skip to content

Unit 17: Call by Value & Call by Reference

So far, in CS1010, we have considered a very "clean" notion of functions -- each function is a black box, it takes in zero or more parameters, and it returns zero or one value. We also said that within this black box, whatever happens inside the function stays inside, and the function has no effect on the variables outside. Such "effect-free" programming leads to code that is easy to reason about and understandable. For instance, suppose we have a code like this:

long x = 1;
foo(x);
// { x == 1 }

We can assert that, after calling foo, the value of x is still 1.

Such functions are called pure functions. There is a class of programming languages, called functional programming languages, that is built entirely based on such pure functions.

As much as possible, we should write functions that are pure as it is less bug-prone, is easier to understand, and easier to prove that the code is correct.

All the functions that we have written before are invoked using a mechanism called call by value. This is because the value of the variable is copied onto the call stack.

Functions with Side Effects

In Unit 15, we have seen a mechanism that breaks this rule. When we pass an array into a function, due to array decay, we are actually passing in the pointer to the first element of the array. So, what gets copied onto the call stack is the pointer, not the array. Now, with this pointer, we can modify the elements in the array directly.o

For example, the code below reset all elements in the array to 0.

void set_to_zeros(long length, long a[length])
{
  for (long i = 0; i < length; i += 1) {
    a[i] = 0;
  }
}

Whatever happens in the function no longer stays in the function. When you see code like this:

long a[10];
a[0] = 1;
foo(a);
// { a[0] == ?? }

We can no longer be certain that the value of a[0] after calling foo remains 1, since foo, or functions that it calls, could have modified a[0]. We have to trace through the code of foo and all the function it calls to understand and reason about that changes to a[0].

The keyword const

Since a function taking in an array parameter can potentially modify the content of the array, as a courtesy, it is useful to add the C keyword const in the declaration of the array parameter:

void foo(long length, const long a[]) {
    :
}
to signal to the readers of the code that this function only reads from the array and does not modify it. This simplifies the reasoning about the behavior of the code.

Call by Reference

The call-by-value mechanism has its limitation. Sometimes, it is useful for a function to return more than one results. You have seen an example before in your Assignment 2, where, for the collatz problem, you are supposed to find both the largest stopping time and the value with the largest stopping time.

C functions, however, can only return at most one value. One way to get around this limitation is to use call-by-reference, the other is to use struct. We will leave the discussion of struct for another day, so let's see how call by reference works in this unit.

Calling by reference works by passing the (value of a) pointer to a variable into a function, instead of the (value of) variable. Here is an example from the collatz problem.

Example: Collatz

void find_max_steps(long n, long *max_n, long *max_num_steps) {
  *max_num_steps = 0;
  *max_n = 1;
  for (long i = 1; i <= n; i += 1) {
    long num_of_steps = count_num_of_steps(i);
    if (num_of_steps >= *max_num_steps) {
      *max_n = i;
      *max_num_steps = num_of_steps;
    }
  }
}

The method find_max takes in two pointers. Inside the function, we use the deference operator * to modify the variable pointed to by the pointers (Lines 2, 3, 7 and 8).

To use this function, we have:

int main()
{
  long n = cs1010_read_long();
  long max_num_steps = 0;
  long max_n = 1;
  find_max_steps(n, &max_n, &max_num_steps);
  cs1010_println_long(max_n);
  cs1010_println_long(max_num_steps);
}

In Line 4 above, we pass in the address of max_n and max_num_steps into find_max_steps. find_max_steps updates both variables for us.

Example: Swapping Two Variables

Another example for call-by-reference is a function that swaps two variables. Here is one that swaps two long variables.

void swap(long *a, long *b) {
  long temp = *a;
  *a = *b;
  *b = temp;
}

To see swap in action, consider:

long a = 10;
long b = -4;
swap(&a, &b);

After calling swap, the value for a becomes -4, b becomes 10.

Documenting Call-by-Reference Parameters

A parameter passed as a pointer could be used in three different ways:

  • The parameter could be a read-only input, and the main purpose of passing in the value is so that the function has access to the value of the pointer.
  • The parameter could be used as a vessel for the function to pass a value to the caller, similar to the parameters max_n and max_num_steps in the function find_max_steps above. In this case, the value contained in the variable pointed to by the pointer does not matter.
  • The parameter could be used as both input and output. The value contained in the parameter is read inside the function, and the value is updated from inside the function. For example, the parameters passed to swap above.

In CS1010, we will be using Doxygen format to document our functions. There are three types of parameters, corresponding to the three situations above: @param[in] is used to document a read-only parameter (note that this applies to all read-only parameters, not just pointers). @param[out] is used to document an output-only parameter, and @param[in,out] is used to document a parameter that is both input and output.

Problem Set

Problem 17.1

Complete the function find_min_max that takes in a length and an array containing long values of size length, and update the parameter min and max with the minimum and the maximum value from this array, respectively. Show how to call this function from main.

void find_min_max(long length, long array[length], long *min, long *max)
{
  :
}

int main()
{
  long list[10] = {1, 2, 3, 4, -4, 5, 6, -8, 3, 1};
    :
}

Problem 17.2

Consider the program below:

void foo(double *ptr, double trouble) {
  ptr = &trouble;
  *ptr = 10.0;
}

int main() {
  double *ptr;
  double x = -3.0;
  double y = 7.0;
  ptr = &y;

  foo(ptr, x);

  cs1010_println_double(x);
  cs1010_println_double(y);
}

What would be printed?

Appendix: Complete Code

#include "cs1010.h"

void set_to_0s(long len, long a[len]) {
  for (long i = 0; i < len; i += 1) {
    a[i] = 0;
  }
}

int main()
{
  long a[5] = {1, 2, 3, 4, 5};
  int len = 5;
  set_to_0s(len, a);
  for (long i = 0; i < len; i += 1) {
    cs1010_println_long(a[i]);
  }
}
include "cs1010.h"

void max(const long list[], long length, long *max_value, long *max_pos)
{
  *max_value = list[0];
  for (long i = 1; i != length; i += 1) {
    if (list[i] > *max_value) {
      *max_value = list[i];
      *max_pos = i;
    }
  }
}

int main()
{
  long list[6] = {15, 8, 4, 16, 42, 23};
  long max_value;
  long max_pos;
  max(list, 6, &max_value, &max_pos);
  cs1010_println_long(max_value);
  cs1010_println_long(max_pos);
}