CSci 555, Functional Programming, Spring 2016
Functional Programming in Scala
Handling Errors without Exceptions

H. Conrad Cunningham

14 April 2016 (minor edit 4 February 2018)

Acknowledgements: This is a set of notes written to accompany my lectures on Chapter 4 of the book Functional Programming in Scala by Paul Chiusano and Runar Bjarnason (Manning, 2015). I constructed the notes around the ideas and Scala examples from that chapter. I encourage all students to study that chapter.

I also patterned some of the discussion of for-comprehensions on Chapter 10 of the document Scala by Example by Martin Odersky, on Chapter 23 of the book Programming in Scala, Second Edition, by Marin Odersky, Lex Spoon, and Bill Venners (Artima Press, 2010), on the relevant parts of the Scala language specification, and on a relevant FAQ in the Scala documentation.

Prerequisite: This discussion assumes the reader is familiar with the programming concepts and Scala features covered in my Notes on Scala for Java Programmers, Recursion Concepts and Terminology, and Functional Data Structures.

Advisory: The HTML version of this document uses MathML in a few locations. For best results, use a browser that supports the display of MathML. A good choice as of April 2016 seems to be a recent version of Firefox from Mozilla.

TODO: Change to use code blocks, restructure to match current style

Handling Errors without Exceptions

Introduction

A benefit of Java or Scala exception handling (i.e., using try-catch blocks) is that it consolidates error handling to well-defined places in the code.

However, code that throws exceptions typically exhibits two problems for functional programming.

It is not referentially transparent. It has side effects. Its meaning is dependent upon the context in which it is executed.
It is not type safe. Exceptions may cause effects that are of a different type than the return value of the function.

The key idea from Chapter 4 is to use ordinary data values to represent failures and exceptions in programs. This preserves referential transparency and type safety while also preserving the benefit of exception-handling mechanisms, that is, the consolidation of error-handling logic.

To do this, we introduce the Option and Either algebraic data types. These are standard types in the Scala library, but for pedagogical purposes Chapter 4 introduces its own definition that is similar as do the standard one.

This set of notes also introduces Scala features that we have not previously discussed extensively: type variance, call-by-name parameter passing, and for-comprehensions.

An `Option` Algebraic Data Type

We define an algebraic data type Option using sealed trait Option with case class Some to wrap a value and case object None to denote an empty value. We specify the operations on the data type using the method-chaining style.

    import scala.{Option => _, Either => _, _} // hide standard
    sealed trait Option[+A] {
      def map[B](f: A => B): Option[B] 
      def getOrElse[B >: A](default: => B): B
      def flatMap[B](f: A => Option[B]): Option[B]
      def orElse[B >: A](ob: => Option[B]): Option[B]
      def filter(f: A => Boolean): Option[A]
    }
    case class Some[+A](get: A) extends Option[A]
    case object None extends Option[Nothing]

The import statement above hides the standard definitions of Option and Either from the Scala standard library.

Before we move on to the definition of these methods, let’s consider two issues raised in the signatures of the getOrElse and orElse methods: one related to the generic parameters and the other to the default parameters.

Type variance issues

As we have discussed previously, the +A annotation in Option[+A] declares that parameter A is covariant. That is, if S is a subtype of T, then Option[S] is a subtype of Option[T]. Also remember that Nothing is a subtype of all other types.

For example, suppose type Beagle is a subtype of type Dog, which, in turn, is a subtype of type Mammal. Then, because of the covariant definition, Option[Beagle] is a subtype of Option[Dog], which is a subtype of Option[Mammal]. This is intuitively what we expect.

However, in getOrElse and orElse, we use type constraint B >: A. This means that B must be equal to or a supertype of A. We also define value parameter of these functions to have type Option[B] instead of Option[A].

Why must we have this constraint?

See the chapter notes for the Functional Programming in Scala book (https://github.com/fpinscala/fpinscala/wiki) for more detail on this complicated issue. We sketch the argument below.

In some sense, the +A annotation declares that, in all contexts, it is safe to cast this type A to some supertype of A. The Scala compiler does not allow us to use this annotation unless we can cast all members of a type safely.

Suppose we declare orElse (incorrectly) as follows:

    trait Option[+A] {
      def orElse(o: Option[A]): Option[A]
      ...
    }

We have a problem because this declaration of orElse only allows us to cast A to a subtype of A.

Why?

As with any function, orElse can only be passed a subtype of its declared argument type. That is, a function of type Dog => R can be passed an object of subtype like Beagle or of type Dog itself, but it cannot be passed a supertype object of Dog such as Mammal.

But orElse has a value parameter type of Option[A]. Because of the covariance of A (declared in the trait), this parameter only allows subtypes of A–not supertypes as required by the covariance.

So, we have a contradiction.

For the incorrect signature of orElse, the Scala compiler generates an error message such as “Covariant type A occurs in contravariant position.” We can get around this error by using a more complicated signature that does not mention A in any of the function arguments, such as:

      def orElse[B >: A](o: Option[B]): Option[B]

Now consider the second new feature appearing in the signatures of getOrElse and orElse–the => B and => Option[B] types for the parameters.

Parameter-passing modes

Scala’s primary parameter-passing mode is call by value as in Java and C. That is, the program evaluates the caller’s argument and the resulting value is bound to the corresponding formal parameter within the called function.

If the argument’s value is of a primitive type, then the value itself is passed. If the value is an object, then the address of (i.e., reference to) the object is passed.

A call-by-value parameter is called strict because the called function always requires that parameter’s value before it can execute. The corresponding argument must be evaluated eagerly before transferring control to the called function.

Scala also has call-by-name parameter passing. Consider the default: => B feature in the declaration

      def getOrElse[B >: A](default: => B): B

The type notation => B means the calling program leaves the argument of type B unevaluated. That is, the calling program wraps the argument expression in an parameterless function and passes the function to the called method. This automatically generated parameterless function is sometimes called a thunk.

Every reference to the corresponding parameter causes the thunk to be evaluated. If the method does not access the corresponding parameter during some execution, then the parameter is never evaluated.

As with all higher-order arguments in Scala, a thunk is passed as a closure. In addition to the function, the closure captures any free variables occurring in the expression–that is, the variables defined in the caller’s environment but not within the expression itself.

Note: The closure actually captures the variable itself, not its value. So if the free variable is either a reference to a var or to a mutable data structure, then changes in the value are seen inside the called function. But in this course, we normally use immutable data structures and val declarations.

A call-by-name parameter is called nonstrict because the called function does not always require that parameter’s value for its execution. The corresponding argument can thus be evaluated lazily, that is, evaluated only if and when its value is needed.

We look at more implications of strict and nonstrict functions in Chapter 5.

Now, let’s define the methods in the Option data type.

Implementing the `Option` methods

The Option data type is similar to a list that is either empty or has one element.

The map method applies a function to its implicit Option argument. If the implicit argument is a Some, the method applies the function to the wrapped value and returns the resulting Some. If it is a None, the method just returns None. We can implement map using pattern matching directly as follows:

      def map[B](f: A => B): Option[B] = this match {
        case None    => None
        case Some(a) => Some(f(a))
      }

Similarly, we can use pattern matching directly to implement the getOrElse function. If the implicit argument is of type Some, this function returns the value it wraps. If the implicit argument is of type None, this function returns the value denoted by the default argument. By passing default by name, the argument is only evaluated when its value is needed.

      def getOrElse[B >: A](default: => B): B = this match {
        case None    => default // evaluate the thunk
        case Some(a) => a
      }

Function flatMap applies its argument function f, which might fail, to its implicit Option argument when this value is not None. We can define flatMap in terms of map and getOrElse as shown below. (Reminder: If we apply method names as operators in an infix manner, they associate to the left.)

      def flatMap[B](f: A => Option[B]): Option[B] =
        map(f) getOrElse None

We can also define flatMap using pattern matching directly.

      def flatMap_1[B](f: A => Option[B]): Option[B] = this match {
        case None    => None
        case Some(a) => f(a)
      }

Function orElse returns the implicit Option argument if is not None; otherwise, it returns the explicit Option argument. We can define orElse in terms of map and getOrElse or by directly using pattern matching.

      def orElse[B >: A](ob: => Option[B]): Option[B] =
        this map (Some(_)) getOrElse ob

      def orElse_1[B>:A](ob: => Option[B]): Option[B] = 
        this match {
          case None => ob
          case _    => this
        }

The filter function converts its implicit argument from Some to None if it does not satisfy the boolean function p. We can define filter by using pattern matching directly or by using flatMap.

      def filter(p: A => Boolean): Option[A] = 
        this match {
          case Some(a) if p(a) => this
          case _               => None
        }

      def filter_1(p: A => Boolean): Option[A] =
        flatMap(a => if (p(a)) Some(a) else None)

Using `Option` for statistical `mean` and `variance`

Consider a function to calculate and return the mean (i.e., average value) of a list of numbers. This function must sum the list of numbers and divide by the number of elements. It might have a signature such as:

      def mean(xs: List[Double]): Double

But what should be returned for empty lists?

We can modify the signature to use Option and define the function as follows:

      def mean(xs: Seq[Double]): Option[Double] =
        if (xs.isEmpty) 
          None
        else 
          Some(xs.sum / xs.length)

The return type now allows the possibility that the mean may be undefined. We thus extend a partial function to a total function in a meaningful way.

Above we also generalize the List type to its supertype Seq from the Scala library. Type Seq denotes a family of sequential data types that includes the List type, array-like collections, etc. This type defines the methods isEmpty, sum, and length.

If the mean of a sequence $s$ is $m$ , then the (statistical) variance of the sequence $s$ is the mean of the sequence formed by the terms $(x-m)^{2}$ for each $x \in s$ (perserving the order).

Using the mean function defined above, we can compute the variance of a sequence of numbers as follows:

      def variance(xs: Seq[Double]): Option[Double] =
        mean(xs) flatMap 
          (m => mean(xs.map(x => math.pow(x - m, 2))))

Using `Option` in the labelled digraph

In the doubly labelled Digraph case study, we defined the following method to return the label on a vertex:

      def getVertexLabel(ov: A): B

In the case study’s DigraphList implementation, we defined this function as follows. The function terminates with an error when the the argument vertex is not present in the digraph.

     def getVertexLabel(ov: A): B = 
       (vs dropWhile (w => w._1 != ov)) match {
         case Nil => sys.error("Vertex " + ov + " not in digraph")
         case ((_,l)::_) => l
       }

We can avoid the error termination in this function by changing the method signature to return an Option[B] instead of B.

     def getVertexLabelOption(ov: A): Option[B] = 
       (vs dropWhile (w => w._1 != ov)) match {
         case Nil        => None
         case ((_,l)::_) => Some(l)
       }

Code that uses this new Digraph method can call the various Option methods (or directly use pattern matching) to process the result appropriately.

For example, if the vertex label is a string, it may be appropriate in some scenarios to just use a null string for the label of a nonexistent vertex. Let g be a Digraph and be v be a possible vertex, then

      (g getVertexLabelOption v) getOrElse ""

would either return the label string for v if it exists or the null string if v does not exist.

Similarly, the code

      (g getVertexLabelOption v) getOrElse 
        (sys.error("undefined vertex " + v))

would still terminate with an error. However, this design enables the user of the Digraph library to decide under what circumstances and at what point in the code to terminate.

Idiom: A common pattern for computing with the Option type is to use map, flatMap, and filter to transform values generated by a function like getVertexLabelOption and then to use getOrElse for error handling at the end.

Lifting

It seems that that deciding to use of Option could cause lots of changes to ripple through our code, much like the introduction of extensive exception-handling would. However, we can avoid that somewhat by using a technique called lifting.

For example, the Option type’s map function enables us to transform values of type Option[A] into values of type Option[B] using a function of type A => B.

Alternatively, we could consider map as transforming a function of type A => B into a function of type Option[A] => Option[B]. That is, we lift an ordinary function into a function on Option.

We can formalize this alternative view with the following function:

      def lift[A,B](f: A => B): Option[A] => Option[B] = _.map(f)

Thus any existing function can be transformed to work within the context of a single Option value. For example, we can lift the square root function from type Double to work with Option[Double] as follows:

     def sqrtO: Option[Double] => Option[Double] = lift(math.sqrt _)

We can now use sqrtO such as sqrtO(Some(4)). This evaluates to Some(2).

Chapter 4 of Functional Programming in Scala gives several examples where Option types can be used effectively in realistic scenarios.

For comprehensions

Another useful function on Option data types is the function map2 that combines two Option values by lifting a binary function. If either of the arguments are None, then the result should also be None. We can define map2 as follows:

      def map2[A,B,C](a: Option[A], b: Option[B])(f: (A, B) => C): Option[C] =
        a flatMap (aa => 
        b map (bb => 
        f(aa, bb)))

This function applies a series of map and flatMap calls.

Because lifting is so common in functional programming, Scala provides a syntactic construct called a for-comprehension to facilitate its use. This construct is really syntactic sugar for a series of applications of map, flatMap, and withFilter. (Method withFilter works like filter except it filters on demand, without creating a new collection as a result.)

Here is the same code expressed as a for-comprehension:

      def map2fc[A,B,C](a: Option[A], b: Option[B])(f: (A, B) => C): 
        Option[C] =
          for { 
            aa <- a
            bb <- b 
          } yield f(aa, bb)

Of course, for-comprehensions are more general than just their use with Option. They can be used for the lists, arrays, iterators, ranges, streams, and other types in the Scala standard library that support the map, flatMap, and withFilter (or filter) operations. m Consider a list persons of person objects with name and age fields. We can collect the names of all persons who are age 21 and above as follows:

    for (p <- persons if p.age >= 21) yield p.name

This is equivalent to the following List expression

    filter (p => p.age >= 21) map (p => p.name)

Translating (desugaring) for-comprehensions

In general, a for-comprehension

    for (enums) yield e

evaluates expression e for each binding generated by the enumerator sequence enums and collects the results. An enumerator sequence begins with a generator and may be followed by additional generators, value definitions, and guards.

A generator p <- e produces a sequence of zero or more bindings from expression e by matching each value against pattern p.
A value definition p = e binds the names in pattern p to the result of evaluating the expression e.
A guard if e restricts the bindings to those that satisfy the boolean expression e.

We can translate (or desugar) for-comprehension (more or less) as follows:

We replace every generator p <- e, where p is a pattern and e is an expression, by
```
    p <- e.withFilter { case p => true ; case _ => false }
```
Here we use withFilter to filter out those items that do not match the pattern p.
While all comprehension have not been eliminated, repeat the following:
1. Translate for-comprehension
```
    for (p <- e) yield e1
```
  to the expression
```
    e.map { case p => e1 }
```
2. Translate for-comprehension
```
    for (p <- e; p1 <- e1; ... ) yield e2 
```
  where ... is a (possibly empty) sequence of generators, definitions, or guards, to the expression
```
    e.flatMap { case p => for (p1 <- e1; ... ) yield e2 }
```
3. Translate a generator p <- e followed by a guard if g to a single generator
```
    p <- e.withFilter((x1,...,xn) => g)
```
  where x,...,xn are the free variables of p.
4. Translate a generator p <- e followed by a value definition p1 = e1 to the following generator of pairs of values, where x and x1 are fresh names:
```
    (p, p1) <- for (x@p <- e) yield
        { val x1@p1 = e1; (x, x1) }
```

The Scala notation x@p means that name x is bound to the value of the expression p.

Note: Above we do not consider the imperative for-loops. These can also be translated as above except that the imperative method forEach is also needed.

As an example, we can translate (desugar) the for-comprehension

    for(x <- e1; y <- e2; z <- e3) yield {...}

into expression:

    e1.flatMap(x => e2.flatMap(y => e3.map(z => {...})))

As a second example, we can also translate the for-comprehension

    for(x <- e; if p) yield {...}

into expression:

    e.withFilter(x => p).map(x => {...})

If no withFilter method is available, we can instead use:

    e.filter(x => p).map(x => {...})

As a third example, we can translate for-comprehension

    for(x <- e1; y = e2) yield {...}

into expression

    e1.map(x => (x, e2)).map((x,y) => {...})

Adding for-comprehensions to data types

The Scala language has no typing rules for the for-comprehensions themselves. The Scala compiler first translates for-comprehensions into calls on the various method and then checks the types. It does not require that methods map, flatMap, and withFilter have particular type signatures. However, a particular setup for some collection type C with elements of type A is the following:

    trait C[A] {
      def map[B](f: A => B): C[B]
      def flatMap[B](f: A => C[B]): C[B]
      def withFilter(p: A => Boolean): C[A]
    }

We can define our own data types to support for-comprehension by providing one or more of the required operations above.

If the data type defines just map, Scala allows for-comprehensions consisting of a single generator.
If the data type defines both flatMap and map, Scala allows for-comprehensions consisting of several generators.
If the data type defines withFilter, Scala allows for-comprehensions with guards. (If withFilter is not defined but filter is, Scala will currently use filter instead. However, this gives a deprecation warning, so this fallback feature may be eliminated in a future release of Scala.)

We added for-comprehensions to our own Option type earlier. We do the same for the Either type in the next section.

Note: A for-comprehension is, in general, convenient syntactic sugar for expressing compositions of monadic operators. If time allows, we will discuss monads later in the semester.

An `Either` Algebraic Data Type

We can use data type Option to encode that a failure or exception has occurred. However, it does not give any information about what went wrong.

We can encode this additional information using the algebraic data type Either.

    import scala.{Option => _, Either => _, _} // hide builtin
    sealed trait Either[+E,+A] {
      def map[B](f: A => B): Either[E,B]
      def flatMap[EE >: E, B](f: A => Either[EE,B]): Either[EE,B]
      def orElse[EE >: E, AA >: A](b: => Either[EE,AA]): Either[EE,AA]
      def map2[EE >: E, B, C](b: Either[EE, B])(f: (A,B) => C): Either[EE, C] 
    }
    case class Left[+E](get: E) extends Either[E,Nothing]
    case class Right[+A](get: A) extends Either[Nothing,A]

By convention, we use the constructor Right to denote success and constructor Left to denote failure.

We can implement map, flatMap, and orElse directly using pattern matching on the Either type.

     def map[B](f: A => B): Either[E, B] = 
       this match {
         case Left(e)  => Left(e)
         case Right(a) => Right(f(a))
       }

     def flatMap[EE >: E, B](f: A => Either[EE, B]): Either[EE, B] =
       this match {
         case Left(e)  => Left(e)
         case Right(a) => f(a)
       }
        
      def orElse[EE >: E, AA >: A](b: => Either[EE, AA]): Either[EE, AA] =
        this match {
          case Left(_)  => b
          case Right(a) => Right(a)
        }

The availability of flatMap and map enable us to use for-comprehension generators with Either. We can thus implement map2 using a for-comprehension, as follows:

      def map2[EE >: E, B, C](b: Either[EE, B])(f: (A, B) => C): Either[EE, C] =
        for { a <- this; b1 <- b } yield f(a,b1)

Let’s again use mean as an example and use a String to describe the failure in Left.

      def mean(xs: IndexedSeq[Double]): Either[String, Double] = 
        if (xs.isEmpty) 
          Left("mean of empty list!")
        else 
          Right(xs.sum / xs.length)

We can use the Left value to encode more information, such as the location of the error in the program. For example, we might catch and return the value of an Exception generated as we do in the safeDiv function below.

      def safeDiv(x: Int, y: Int): Either[Exception, Int] = 
        try Right(x / y)
        catch { case e: Exception => Left(e) }

We can abstract the computational pattern in the safeDiv function as function Try defined below:

      def Try[A](a: => A): Either[Exception, A] =
        try Right(a)  // evaluate thunk
        catch { case e: Exception => Left(e) }

Chapter 4 of Functional Programming in Scala describes other functions for type Either.

Standard Library

Both Option and Either appear in the standard library.

The standard library Option type is similar to the one developed here, but the library version is missing some of the extended functions described in Chapter 4.

The standard library Either type is similar but more complicated, using projections on the left and right. It is also missing some of the extended functions from Chapter 4.

Study the Scala API documentation for more information on these data types.

Summary

The big idea in this chapter is to use ordinary values to represent exceptions and use higher-order functions for handling and propagating errors. As examples, we considered the algebraic data types Option and Either and functions such as map, flatMap, filter, and orElse to process their values.

We will use this general technique of using values to represent effects in the subsequent studies in this course.

We introduced the idea of nonstrict functions. We examine the implications and use of these more in Chapter 5.