Thoughts on REST style APIs

written by João Cabrita on 08 December 2021

I’ve worked with RESTful APIs for much of my career; besides being pretty much unavoidable anyway, they’ve always had a warm place in my heart and I’m very interested in learning more about how to design them.

While, overall, I think they are a great default for starting to build an API (especially a public one), I’ve also started to see some issues that lead me to the realization that the REST style has many limitations when applied to APIs.

The issue essentially boils down to the fact that it’s impossible for heuristic based clients (like the aforementioned applications) to avoid coupling to the specifics of the API, thus making it almost impossible to build truly generic clients, which in turn prevents the architectural style from achieving its goals.

The golden standard

When defending the REST architectural style, I usually reach for the domination of the Internet as we know it as the prime example; the almost complete adoption by browsers of HTTP, HTML and other standards, means that users can use these browsers to consume every website in existance in a way that “just works” (because it subscribes to the Uniform Interface). This is the ideal that REST strives to achieve: a distributed but ubiquitous and performant system.

Why it doesn’t work for APIs

Despite their similarities, APIs and websites are different beasts because their consumers are of significantly different natures: while websites are tipically consumed by highly autonomous neural networks (i.e. humans), APIs are typically consumed by heuristic based applications which lack the ability to understand the semantics of the content they access and make decisions according to it.

In other words, while humans can easily navigate the Internet with little to no training, even in websites they’ve never been to before, it’s almost impossible to create an application that does the same.

For example, when displaying the following in a web page:

<form action="" method="post">
    <h1>Send message</h1>
    <label>Message:
        <input type="text" name="message"></label>
    <input type="submit" >
</form>

The browser generates the below (which intentionally doesn’t work):

Any person can look at the above and figure out that they can type a message and send it.

A computer? Not so much!

You can convert the above into machine-friendlier JSON (I’m drawing inspiration from HAL-FORMS here):

{
  "_links": {
    "self": {
        "href": "https://example.com",
    }
  },
  "_templates": {
      "default": {
          "method": "POST",
          "properties": [
              {
                  "name": "message",
                  "required": true
              }
          ]
      }
  }
}

It is at this point that one of two problems will occur.

Backend applications

Automated (i.e. backend) applications do not generally have an autonomous agent commanding it; instead, the agent navigates the API using preset rules.

This means that, even if the agent understands what a relationship (a hyperlink) is, it can only follow those links that it’s hardcoded to understand.

Consider the following (HAL based) example for an API response:

{
    "name": "John Doe",
    "age": 22,
    "_links": {
        "self": { "href": "https://example.com/john" },
        "friend-list": { "href": "https://example.com/john/friends" },
        "best-frient": { "href": "https://example.com/john/friends/jane" },
    }
}

For a machine, the only way to differentiate between the friend-list relationship and best-friend is if these are hardcoded into the program, so it has very specific expectations of when these appear and how they’re used.

Frontend applications

Another common use case for consuming REST APIs is in Front-End applications, i.e. applications that read data, display it to the user, as well as reading data from the user and sending it to the API.

Often, what these applications will do is translate JSON into user friendly HTML (similar to my initial example above); the fact that these delegate many choices to the (autonomous) user means that they, unlike the backend applications just discussed, don’t need to understand the relationships but can delegate that understanding to the user.

In practice, however, the User Experience of generic interfaces is pretty low; while the application wouldn’t need to understand what the hyperlinks mean, it would have to display them as links undistinguishable from each other:

Additionally, imagine that you’re trying to display a balance (sum of revenues and expenses) for a month; you might want to display positive values in green and negative ones in red:

Month	Balance
November	10€
December	-10€

Your usual API response would look something like (other months redacted):

{
    "balances": [
        ...,
        10,
        -10
    ]
}

To format the values like this, however, the frontend applications needs to have the knowledge that positive values are good (i.e. green) and that negative ones are bad (i.e. red) built-in by the programmer!

Useful still?

So, does the fact that REST has limitations on what can be achieved when applied to APIs mean that we should stop using it?

I don’t believe so.

In fact, I still believe it should be the default for most use cases.

The rising tide lifts all boats

Consider an alternative API design style that uses gRPC (i.e. Protocol Buffers over HTTP): what would it use for the Content-Type header?

Turns out it doesn’t use any specific value (and could even use application/octet-stream): the reasoning is that, by itself, knowing a given message is Protobuf is pretty much useless; you’d need the message specification to interpret it, a specification which is almost sure to be unique for any pair of APIs you could find.

On the other hand, media types that offer hypermedia capabilities (e.g. HAL, Hydra, JSON-LD) have a specific format, that clients may use to navigate all APIs that use those formats.

At this point, you might be inclined to ask but I don’t want to use all APIs in the world, I just need to use this specific API!. The thing is, if just one person builds a high-level API client that understands HAL, you can use it on any API that produces HAL and only have to work on the parts that are specific to that specific API.

As an example, here’s how you’d use RPC calls for sending a message:

const friendList = api.getListOfFriends();
const friendInbox = api.getInboxForUser(friendList[0].id);
api.sendMessageTo(friendInbox.id, {message: "Hi there"}));

By contrast, here’s how it’d look using a hypermedia format:

api.getRel("friendList")
  .getRel("item") // implicitly gets the first item of a plural relationship
  .getRel("inbox")
  .post({message: "Hi there"});

Another example, this time for buying something, RPC style:

const catalogPage = api.getFirstCatalogPage()
  .getNextPage()
  .getNextPage();
const product = catalogPage.products[5];
const order = api.placeOrder({payment_method: "credit_card"});
// waits for user to pay
api.confirmPayment(order.id);

And this time with a hypermedia format:

const order = api.getRel("catalog")
  .getRel("next")
  .getRel("next")
  .getRelAtIndex("item", 5)
  .getRel("checkout")
  .post({payment_method: "credit_card"});
// waits for user to pay
order.getRel("paymentStatus")
  .put({status: "CONFIRMED"});

To me, what’s apparent about these samples is how similar the hypermedia ones are (because most operations are traversing relationships), while the RPC ones are quite different and, additionally, rely on the structure of the data to navigate (e.g. when using friendList[0].id and catalogPage.products[5]).

While this may seem like a minor difference, implementing the client for the RPC style API would involve either hand-writing (with expensive engineering cost) or auto-generating it from a spec (assuming one exists, in which case you still have to set up the toolchain for it); by contrast, the hypermedia API requires only a dependency on a library that uses the correct media type.

In summary, what this means is that media types that allow for HATEOAS provide a built-in mechanism for navigation that you don’t have to build; this may not lower the majority of the engineering cost of developing an API client but it does reduce it a bit: over time, these small improvements can accumulate into a significant difference.

REST isn’t just the Uniform Interface

What I’ve described is just a limitation in one of the aspects of the REST architectural style: the Uniform Interface. However, REST also outlines other constraints in order to provide additional benefits like cacheability, which means that you still get those.

Conclusion

In this post, I’ve outlined some limitations I’ve identified from my work designing REST APIs, as well as what I think that could mean for the REST architectural style as applied to APIs. While APIs may never reap the full benefits (at least compared to “regular” websites), they still get most of them.