Dublin Tech: REST

Showing posts with label REST. Show all posts

Sunday, June 9, 2019

Defining a Resource

Fielding's dissertation describes a Resource as:

"Any information that can be named" ... "a document or image, a temporal service (e.g. “today’s weather in Los Angeles”), a collection of other resources, a non-virtual object (e.g. a person), and
so on. In other words, any concept that might be the target of an author’s hypertext
reference must fit within the definition of a resource. A resource is a conceptual mapping
to a set of entities, not the entity that corresponds to the mapping at any particular point in
time."

Defining a Resource is both a Science and an Art. It requires both Domain knowledge and API Architectural skills. The following points detailed below serve as a checklist which may help you determine the shape of your Resource, what data it should contain and how it should be presented to consumers of your API.

The Resource must contain a Business Description

The business description should be 3 - 4 sentences in simple prose which explain what the Resource is.
A developer with a moderate knowledge of your system should be able to understand the description
Any caveats of the Resource should be made clear

The Resource should be useful on its own

This is similar to the maxim of defining the boundary of a micro-service, where a micro-service should be considered to be useful on its own. Similarly, a Resource should be useful on its own.

For example, instead of:

/street-address/{id}

RESPONSE

{
    "street1": "String",
    "street2": "String"
}

and

/address-extra/{id}

RESPONSE 

{
    "city": "String",
    "country": "String"
}

It should be:

/address/{id}

RESPONSE

{
    "street1": "String",
    "street2": "String",
    "city": "String",
    "country": "String"
}

If a Resource on its own is not useful and always necessitates a subsequent request, it means code will inevitably become more complex as well as there being a performance impact incurred from the second request

Use an Appropriate Noun

Use of a simple noun over a compound noun is preferred. For example, Address is better than AddressInfo or AddressDetail. This is a general rule, there will always be exceptions.

If using multiple Resources to represent different views of the same data, for example: Address and AddressDetail, use the simple noun e.g Address first. Then if the second representation is more detailed use ResourceNameDetail or if it is less detailed use ResourceNameSummary. For example, suppose there is a requirement to introduce an Address type Resource:

Address is introduced first
If a subsequent view of Address is needed that is more detailed, the new Resource should be called AddressDetail
If a subsequent view of Address is needed that is less detailed, the new Resource should be called AddressSummary

If it is only used in a READ does it need to be a Resource?

If a Resource is only ever used in a Read request and never a Write (Create, Partial Update, Full Update, Delete, ...) request it is questionable if it needs to be defined as a Resource with its own URI. It could just be added to the parent payload and if there is a concern that payload then becomes too complex, the parent could just provide a sparse query - where the client can decide per API request what it wants returned.

Resources should conform to the uniform interface

The uniform interface is a very important part of good API design. It is not just about using special verbs for different requests but also ensuring the data shape is consistent.

If creates, reads, updates, deletes etc are done in a consistent way, it means code is more consistent, reusable and more maintainable.

This means:

GET /addresses/{id}

and

GET /addresses

must return the same address data structure to represent an Address.

GET /addresses/{id}
RESPONSE
{
    "id":"546",
    "street1": "String",
    "street2": "String",
    "city": "String",
    "country": "String"
}

and

GET /addresses

RESPONSE
{
    "elements": [
         {
              "id":"546",
              "street1": "String",
              "street2": "String",
              "city": "String",
              "country": "String"
         },
         ...
     ]
}

Similarly, for write payloads, the Data Structure should be the same. So, a partial update to change street1 would be:

PATCH /addresses/{id}
REQUEST

{
    "street1": "Walkview"
}

RESPONSE
{
    "id":"546",
    "street1": "Walkview",
    "street2": "Meadowbrook",
    "city": "Dublin",
    "country": "Ireland"
}

and not something like

PATCH /addresses/{id}
REQUEST

{
    "newStreet1Value": "Walkview"
}

From a Resource perspective, the data structure must be consistent. A different data structure means a different Resource, which should be named differently and have its own path.

Don't expose everything

If your DB model is quite sophisticated, you can be sure not all attributes need to be exposed at an API level. Some fields may only be getting persisted for back office processing and should never presented make it to any UI.

When adding an attribute to a Resource, consider:

to only include fields that you are sure the client is interested in
if you are not sure, leave the attribute out. It is much smaller problem to add an attribute later on, then to remove an attribute that has already been exposed.

API Models shouldn't blindly mirror DB Relational model or OO Models

In database modelling approaches such as normalizing data or collapsing inheritance hierarchies are used. In Object Orientated design, techniques such as polymorphism, inheritance hierarchies etc are used to promote things like code reuse and to reduce coupling.

Resource modelling does not have to follow theses techniques. The consumer of an API doesn't care if the data is all in one table, or normalized over multiple tables. In general, the API returns data in a format that is easy to use and does not require much additional mapping by the client before it can become useful.

Use Hierarchical data to Avoid repetition

One of the advantages of hierarchical data over flat formats such as CSV is that it provides a mechanism to avoid repetition. For example, consider a flat data structure which contains a list of persons and what team they are in. In CSV this is:

team, firstname, lastname
Liverpool, Mo, Salah
Liverpool, Andy, Roberston

In JSON this could be:

{
    "team": "Liverpool",
    "players": [
        {
            "firstName":"Mo",
            "lastName":"Salah"
        },
        {
            "firstName":"Andy",
            "lastName":"Roberston"
        },
         ...
     ]
}

Use Hierarchical Data to Make context clear

Another advantage of hierarchical data is that it helps provide context. To understand a flat data structure you need to know what the query was that generated the data to understand the meaning of it. For example, consider a bunch of rows that contain a date range.

name, fromDate, toDate, holidays
Tony, 2018-01-01, 2018-02-02, true
Tony, 2018-02-03, 2018-03-01, false

You could make assumptions that there is a new row when there is a change in Tony being on holidays. But, what if there is another column?

name, fromDate, toDate, holidays, sick
Tony, 2018-01-01, 2018-02-02, true, false
Tony, 2018-02-03, 2018-03-01, false, true

Are the date ranges corresponding to holidays, sickness or both?

If we get more data back maybe it might be clearer...

name, fromDate, toDate, holidays, sick,
Tony, 2018-01-01, 2018-02-02, true, false
Tony, 2018-02-03, 2018-03-01, false, true
Tony, 2018-03-02, 2018-04-01, false, false

Now it looks like it's sickness that the date range corresponds to and its only a coincidence it lines up with a holiday period. However, when we get more data back this theory also fails.

name, fromDate, toDate, holidays, sick,
Tony, 2018-01-01, 2018-02-02, true, false
Tony, 2018-02-03, 2018-03-01, false, true
Tony, 2018-03-02, 2018-04-01, false, false
Tony, 2018-04-02, 2018-05-01, true, false

It gets even more complicated when just don't have some information. For example:

name, fromDate, toDate, holidays, sick,
Tony, 2018-01-01, 2018-02-02, true, false
Tony, 2018-02-03, 2018-03-01, false, true
Tony, 2018-03-02, 2018-04-01, false, false
Tony, 2018-04-02, 2018-05-01, true, false
Tony, 2018-05-02, 2018-06-01, null, false
Tony, 2018-06-02, 2018-07-01, null, false
Tony, 2018-07-02, 2018-07-08, true, false
Tony, 2018-07-08, 2018-07-09, true, null

The limitation with flat data structures is not only lack of normalisation but that they can only go so far in making the data self-describing.

When it isn't clear what data means, it is inevitable processing the data will be buggy.

We could represent the same person data in hierarchical format as:

{
    "name":"tony",
    "holidays": [
         {
            "fromDate":"2018-01-01",
            "toDate":"2018-02-02"
         },
         {
             "fromDate":"2018-04-02",
             "toDate":"2018-05-01"
         }, 
         {
             "fromDate":"2018-07-02",
             "toDate":"2018-07-09"
         }
     ],
     "sick": [ 
         {
             "fromDate":"2018-02-03",
             "toDate":"2018-03-01"
         }
     ]
}

Now, the data is much more self describing and it is clear when a date range is for a holiday and when it is for a sick period.

Resource Relationships

Resources on their own only describe themselves. A Resource model describes relationships between Resources. This will give an indication of:

dependencies between Resources. What Resources are needed for a particular Resource to exist or what is impacted when a particular Resource changes: updated or deleted.
Data navigation - in a large domain model, it is much easier to understand and follow if navigational and directional sense is provided to consumers of the model. Especially, when to navigation across (Resources loosely connected) can be be differentiated from navigation down (Resources strongly connected)

Hypermedia links aren't only used to achieve HATEOAS. Resources that describe what they are linked to using hypermedia links demonstrate a very powerful mechanism to express the Resource model. Advantages include:

A large domain model is split into more manageable pieces. Typically users are only interested in a particular part of the model. When Resources self describe their own relationships, it means a large complex model is split up into more digestible chunks and users get the information they need quicker.
The Resource model is self-describing and kept in sync with code. Everything is co-located.

Make clear Parent - Child relationships
A Child Resource describes its Parent URL hierarchical name spacing. A Parent Resource has children of one or many types should make this clear by providing links to the children. For example, if a Team Resource has Players child Resources. The Team payload should make this clear.

REQUEST
https://api.server.com/teams/4676
RESPONSE

{
    "id":"34533",
    ...,
    "_links": {
          "self":"https://api.server.com/teams/4676",
          "players":"https://api.server.com/teams/4676/players"
    }
}

Make clear Peer relationships

This is similar to above except it is for Resources that exist in a different hierarchical name space. So for example, suppose the team is in division 1. A link should be included in the team's division attribute.

REQUEST
https://api.server.com/teams/4676

RESPONSE
{
    "id":"34533",
    "division": {
        "name":"Division 1",
        "_links": {
              "self":"https://api.server.com/divisions/1"
        }
     },
     ..., 
    "_links": {
        "self":"https://api.server.com/teams/4676",
        "players":"https://api.server.com/teams/4676/players"
    }
}

Make clear Links to Other Representations

If data is modeled to have multiple Resources which represent different representations of the data, the Resources should also include links to each other.

REQUEST
https://api.server.com/teams/4676

RESPONSE
{
    "id":"34533",
    "division": {
        "name":"Division 1",
        "_links": {
              "self":"https://api.server.com/divisions/1"
        }
     },
     ..., 
    "_links": {
        "self":"https://api.server.com/teams/4676",
        "players":"https://api.server.com/teams/4676/players",
        "teamDetails":"https://api.server.com/teamDetails/4676"
    }
}

Wednesday, December 19, 2018

What's the case for your API?

Disclaimer: In pure REST, API's are opaque and the URL should be whatever what was sent as a link in the response to a previous request. But, I'm not talking pure REST, I'm talking more pragmatic APIs which involve some concepts from REST and as well as general API best practices.

When writing an API, it starts simple. You identify the obvious resources and end up with endpoints such as:

/api.mycompany.com/tweet

Eventually, your API will have to capture more sophisticated concepts and model more complex resources that cannot be expressed in short single nouns. Some real world examples include:

Enabling request validation via a Request Validator resource (AWS API Gateway API)
Performing a customer search via a Customer Search resource (Google Customer Search API)
Running powerful checks against code via a Check Runs resource (Github API)

In English grammar, nouns that are really two nouns joined in some way are called compound nouns and in English grammar, compound nouns follow one of three patterns:

All the one word: haircut, toothpaste
Two words: rain forest, ice cream
Hyphenated: self-esteem, brother-in-law

In the API world there are different options to choose from but it is better for consistency that your APIs just pick one approach and stick to that it. So firstly, what are the options for compound nouns from an API perspective?

Camel Case

Camel case is the practise of writing each word in the phrase with a capital letter. There are two variations:

Initial upper case (also know as Pascal's case) is where the first letter is also a capital, for example: IceCream. Pascal's case is popular in programming languages for naming classes e.g. Java.
Initial lower case is where the initial letter is always lower case, for example: iceCream. This approach is popular in programming languages (again Java is a good example) for naming variables. When people say camel case, they are usually referring to the initial lower case format.

Kebab Case

In Kebab Case, the individual words are separated by hyphens. Ice cream is expressed as ice-cream. This approach is used in the Lisp programming language, in lots of URLs (for example, every blog post in www.blogger.com e.g. http://dublintech.blogspot.com/2018/08/oauth-20-authorisation-code-grant.html). The observant amongst you will note sometimes the word "dash" is sometimes used in technical references instead of "hyphen". So, what's the difference? In English grammar, hyphen is the thing used to join two words to make one whereas the dash is the thing used to usually add some sort stylistic emphasis to the end of a sentence such as: "I might have an interesting point here - you never know".

In programming we don't care whether the term is hyphen and dash. They are used interchangeably and mean the same thing.

The kebab case approach became popular in Web URIs because search engines knew that the hyphen meant separate words and could index the URI properly. This convention used by search engines meant hyphens became a de facto standard for URIs.

Snake Case

In this approach, an underscore is used to separate words. Ice cream becomes ice_cream. This approach is used in Python and Ruby for anything other than a class name or static constant.

Join words

In this approach the words are just joined. There is no -, no _ and no capitalisation of anything. This is not a popular approach with developers because it's difficult to read.

APIs

Should we use camelCase, kebab-case or snake_case in an API? Well unfortunately, Mr. Fielding's dissertation did not go into such detail. So what are people actually doing? And is the approach used consistent across the API's URL and the JSON Body. Let's take a look.

AWS

AWS have different API styles for different services. The API Gateway REST API reference shows that JSON payload uses camel case but the URL uses nothing, it's just:

/restapis/{id}/requestvalidators/{requestvalidatorId}

Google

Surprise, surprise Google also have lots of APIs. The Google

Custom Search API is similar to the AWS API Gateway API. The compound noun in the URL is just the one word and the JSON body is camel case.

The Google Gmail API has camel case in request body and in some URLs, for example the forwarding addresses API.

The Google youtube API sometimes will use kebab case in the URL e.g. yt-analytics but in other cases will use single word e.g. youtubepartner. But the JSON payload is camel case.

Github

The Github API is a good example where we get a reminder that if possible, you should try to avoid this issue by trying to avoid compound nouns as it avoids them by using some creative name spacing.

However, some more rooting around and you'll find a compound noun such as check run expressed using kebab case in the URL and he JSON body using snake case.

Stripe

Stripe use snake case in the URL and in the JSON body. For example the PaymentsIntents API.

 https://api.stripe.com/v1/payment_intents

and JSON body...

{
  "id": "pi_Aabcxyz01aDfoo",
  "object": "payment_intent",
  "allowed_source_types": [
    "card"
  ],
  "amount": 1099,
  "amount_capturable": 1000,

Paypal

Paypal have more compound nouns than the other APIs checked. APIs for resources such as billing agreement the API will use kebab case in the URL but then use snake case in the JSON payloads.

Twitter

Twitter use snake case in the URL e.g. /saved_searches/ and snake case in the JSON payloads.

Facebook

Facebook's Graph API tends to avoid resource naming in URLs and in JSON bodies it is snake case.

By this stage, you should be getting a little but confused. So let's recap via the table below.

API	URL	JSON Body
AWS API Gateway	No separator	camelCase
Facebook Graph API	N/A	snake_case
Github	Snake and Kebab	snake_case
Google custom search	No separator	camelCase
Google Gmail	camelCase	camelCase
LinkedIn	camelCase	camelCase
Pay pal	kebab-case	snake_case
Stripe	snake_case	snake_case
Twitter	snake_case	snake_case

Everyone is different, what should I do?

So there is a lack of consistency across the industry. However there are point worth making:

In general compound nouns are best avoided. In all the APIs checked (except PayPal), they appear in under 5% of the APIs. This means developers don't get upset when their favourite approach is not used.
The only Web API in the selection above that had more than 5% of its APIs using compound nouns was PayPal and they went for kebab-case in URIs.
kebab-case is never used in any JSON body. The syntax is allowed. So what drives this trend? It's more than likely because JavaScript Web UIs are possibly the mos popular client invoking API and the similarly the most popular back end language serving the API is Java and both of those dudes don't allow the - in any of their declarations.

Is there anyone else saying anything?

In the excellent REST API Design Cookbook, industry expert Mark Masse suggests:

Make your APIs lower case, which rules out camel case
Use kebab case when expressing compound terms in your URIs
Avoid using underscores as they may not display well since some browser render hyperlinks with an underline added

Make a decision

Avoid compound nouns if you can. This isn't always possible. Sticking to a ubiquitous language is important and helpful. If you have a complex business application you will have lots of compound nouns.
If you can't avoid compound nounds and more than 5% of the APIs are going to involve compound nouns use kebab case for your URIs. Why? Because if you have a complex business domain it's not only developers you need to think about. Lots of BA's, Product Architects, curious Managers will also be looking at your APIs. Kebab-case is the easiest to read for everyone.
For JSON body, I think it is okay to use camelCase because this is the easiest to map back to JavaScript and Java code. It is also a recommendation from Google to use camelCase in JSON.
If you have to use camelCase in your URI's, consider using the first letter capital approach for the URIs as the URIs are supposed to marking resources not attributes. Resources are more analogous to Java Classes which also use initial letter capital format; whereas the JSON payload attributes are analogous to Java attributes which use initial lower case.

Until the next time, take care of yourselves.

Friday, May 11, 2018

And some more REST tips

In previous blog posts I have covered some ideas and tips for achieving a REST architecture. In this post, I cover a few more ideas and tips.

Caching

Caching is a big part of the original dissertation. See section 5.1.4
Strategies include validation (client checks it has the latest version) and expiration (client assumes it has the latest version until a specified time)
Expiration:

Expires header tells client when resource is going to expire. The value 0 means avoid caching

Cache-Control

Use max-age directive to specify how long response should be considered valid for; s-maxage for shared caches
Can also be used in requests no-cache means re validate response with server

Validation

Etag - unique version of resource. Used in conjunction with If-none-match request header
Last-Modified - tells client when resource last changed

Controller APIs

When something does fit neatly to a CRUD operation, consider a Controller API

Handling Dates

Use ISO-8601 for your dates - better for natural sorting, handles timezone, locale nuetral, support from most programming languages
Accept any timezone as anyone in the world may call your API
Store in UTC, not in your server's timezone. There should be no offset when persisted.
Return in UTC. Allow the client to adjust to its timezone as necessary
Don't use time if you don't need it. If Date only suffices, only persist Date. This means, timezone complexity goes away.

HEAD

HEAD action should return response headers

Headers

Always return what headers are useful. Consider:

Content-Type
Content-Length
Last-Modified
ETag
Location

Hypermedia (advantages)

Less coupling
Consistent format for links => cleaner client code
Developer productivity: API's easier to navigate
Make easier to introduce services in a more granular way
Code easier to debug - messages always have the URL that created them via the self link

Hypermedia (choices)

HAL - reduces Address coupling
SIREN - reduces Address and Actions coupling
Collection+JSON (CJ) - reduces Address, Action and Object coupling

Idempotent

Can be called several times and return the same result
OPTIONS, GET, HEAD, PUT and DELETE are all idempotent

Long Running Requests

Some operations take a long time. In such cases, consider returning a 202 with the location field set to a URL the client can poll to check for operation progress.

Method not allowed

If an API only supports GET, it should return a 405 for any PUT, POST, DELETEs etc

Must Ignore Principle

Clients should ignore data they are not interested in. This makes it much easier for APIs to be backwardly compatible . If an API returns extra data and some clients aren't expecting it they will just ignore it.

Not acceptable

When a resource doesn't support a specific media type, it should return 406 (see Masse, Rule: 406 (“Not Acceptable”) must be used when the requested media type cannot be served

OPTIONS

OPTIONS should return what actions are available on a resource

Partial Update

Handle partial updates with PATCH

Query

The query component of a URI should be used to filter collections

Resource Creation

When a Resource has been successfully created a 201 should be returned
The location header should indicate the URL to get the Resource.

Safe

Actions are considered Safe if they Do not modify resources
OPTIONS, GET and HEAD are safe

Self link

Response bodies should always include a self link - the URL that was used to return the resource.

Singular or Plural?

Use Singular for Singular Document type resource - when there can only be one. For example: /humans/12343343/head
Otherwise plural

REST: Using a Controller endpoint?

In REST architectures, the fundamental concept is a Resource. A Resource represents anything that’s important enough to be referenced as a thing in itself. For example, a Shopping Cart, a Book or a Car. The next fundamental concept is the Uniform Interface for accessing and manipulating the Resources. In HTTP land usually means:

Create is POST
Read is GET
Update is PUT (or PATCH for Partial Update)
Delete is DELETE

There are of course other concepts (statelessness, caching etc) but for this blog post, let's just focus on Resources.

In the real world, many things map nicely to Resources. However, inevitably somethings won't map so nicely to resources. This is usually a minority of operations for example reset password. It's possible to model these as either

a PUT on /password/

or as

a Controller endpoint and a POST to /resetpassword

The latter may be considered to be closer to programmatic REST than pure REST, but there are times when clients and customers will want you to be pragmatic. This article gives suggestions regarding when to consider using the Controller option.

Does the action Map to a CRUD?

Several actions in a real world application will not map nicely to a Create Read Update Delete (CRUD). For example, Paypal's cancel billing agreement API is:

POST /v1/payments/billing-agreements/agreement_id/cancel

The cancel action rarely maps nicely to a CRUD for a resource. It could be interpreted as:

some resource gets be created (A cancel record)
some resource gets updated (some status column could be getting set to cancelled)
or some resource gets deleted (a order request gets deleted).

Why should the client have to care about how cancel is handled? Couldn't it always change? In some case API's have got around the doesn't map nicely to a CRUD problem using HTTP tunneling. For cancelling a billing agreement this would like:

POST /v1/payments/billing-agreements/agreement_id

with body:

{
  "operation":"cancel"
}

This is considered an anti-pattern and should never be used. Instead a Controller end point should be used.

Resource State or Workflow?

In a REST architecture, every request between Client or Server will usually change a Resource State (write operation) or the Application State (a query or read operation). However, in the real world workflows are inevitable. For example, a reset password flow usually consists of:

Asking the user for the userId (usually email)
System checking that email exists on the system
Sending the user an email with a link to reset the password
Ensuring the user only has a set amount of time to click the link
When the user clicks the link they may be asked a bunch of questions
They will be asked to retype their new password to ensure there's no typos

When an client action is part of a complex workflow, Resource state and Application state changes may not be easy to model. They may not happen synchronously and they could change based on how the workflow is modelled or when the workflow needs to add an extra step. In such scenarios, consider using a Controller end point.

REST without PUT

In some situations, arguments can be made for avoiding PUT and instead using POST to a different endpoint which signifies intent. For example, to change address instead of invoking a PUT to /address/, the client would invoke a POST to /changeaddress and avoid PUTs altogether. One example where this approach is useful is when handling asynchronous operations and you are trying to make clear atomic consistent operation. So for example, if changing address takes a long time and you would rather return a 202, with a location field for the client to poll, if you use the /changeaddress you can then leave /address endpoints as those that are only atomically consistent.

So, any PUT or POST to address, means if you were to immediately do a GET you would get the consistent view of the Resource. This approach is also useful if you want to model the Business event rather than the actual resource that is changing. So for example, suppose 6 or 7 things need to take place when a Bank account has been closed for a Business process perspective. All on the back end in the same thread / transaction. Again, here POST to controller endpoint such as /accountclosed makes more sense then /DELETE to /account.

See this article for more info.

Summary

So why there may be subjectivity involved on when to use a controller style endpoint. The above may at least help to you to make a decision. Remember, it should always only be a minority of APIs where you consider this approach. You are outside the conventional Uniform Interface for unique style operations but you want to still make them feel intuitive to clients of the API.

Friday, January 12, 2018

When a REST Resource should get its own Address?

Background

Author's note

In a purist REST approach, all endpoints (except the starting endpoint) are opaque and their various details shouldn't need to be published. Even if this approach is being used, the points in this article are relevant as Server logic will have to determine when something requires a end point or not.

Introduction

In a REST architecture an entity or a resource (for the rest of the article the term entity will be used) may or may not have its own address. For example, suppose we have an inventory application merchants use to sell their products. Immediately it is possible to see a Product entity. It's URL will look something like: /product/{id}

Now, it is possible for the merchant selling the Products to add his / her own comments to the Products. For example, "Sells very well on Fridays" or "Consider changing price if product doesn't start selling". A Product can have 0..* Comments. As stated, the Product has its own address: /product/{id} for example /product/1231233

and a response payload like this

{

    "id":"1231233",

    "type":"Beer",

    "comments": [{

             "id":"1",

             "comment":"Sells very well on Fridays"                 

     }, {

             "id":"2",

             "comment":"Consider changing price if product doesn't start selling"  

     }]

}

As can be seen, the payload returns a collection of Comment Objects. Should the individual comments each have their own address as well or is it okay that they are just embedded into the Product response? To help the answer this question the following should be considered.

Does the Entity have any meaning outside the Containing Entity Context?

If an Entity (for example Comment) has meaning outside their containing Entity (for example Product) then they should have their own address. For example, suppose the Entity was Student and the Student returned a list of Universities he / she had studied. These Universities have their own meaning outside the Student. So obviously the University should have its own address. In the Activity / Comments scenario, the Comments only exist for the activity. No other Entity will ever reference them or need to reference them. Therefore further aspects needs to be considered.

Is it desirable to perform actions on the individual entities?

Should the client be allowed to create, read, update or delete the individual entity? These have to be considered separately.

Writes: Create, Update, Delete

In the Product / Comments scenario, a Comment would never be created outside or without an Product. It is essentially added to an Product. This could be considered as a partial update to the Product. However, an update or delete to an existing Comment could also be considered a partial update to the Product. This creates complexity on how to differentiate between Create / Updates and Deletes of a Comment using a partial update on the Product. If this is required, it would be much simpler to create a contextual address for the Comment (which indicates the hierarchical nature of the Product / Comment) then allow the Client sent POST, PUT, PATCH, DELETES to that.

Example URL: /product/1231233/comment/1

Reads

In some scenarios the parent containing Entity may not return all the information about the child Entities. For example, again consider the Product --> Comment scenario. Suppose the comment was very large. This would mean the payload for the Product was also very large. In such cases, it might be more prudent for the Product to just return a summary of the Comment and if the client wants the full Entity to make an individual request. Similarly, if there's a big performance cost to get an individual Entity (for example a 3rd party API has to be invoked to get all the information about the comment), it can make more sense to just send a URL link to the Entity rather the than the actual entity contents.

N+1 Problem

If individual Reads are required, be careful that the N+1 problem doesn't then get introduced. For example, suppose a Product could have 100 Comments. the Product API will only return a summary of the Comment and a link to each individual comment if the client wants all the information. However, if the client wants every single comment, this means there will now be 100 HTTP requests. If this is a potential scenario, then a secondary endpoint which aggregates all the comments into the Product should be considered. This is similar to the API Gateway pattern.

Surface Area of Endpoints

In any architecture when contracts are published, if there are too many it can become very unwieldy for developers to understand. Most well known APIs (e.g. PayPal, Amazon, Twitter, Google) usually only have about 20 - 30 addresses. This is a good aim to have. If there are 5,000 different addresses it can become way too large and difficult to control etc.

In summary, the decision diagram provides guidance on what you should do.

Thursday, October 5, 2017

Book Review: RESTful Web Clients

RESTful Web Clients is written by guru Mike Amundsen who amongst other things co-authored RESTful Web APIs with REST guru Leonard Richardson and Sam Ruby.

The book's primary focus is on the hypermedia aspect of REST, particularly from the client's perspective. As Roy Fielding detailed in this famous blog post "if the engine of application state (and hence the API) is not being driven by hypertext, then it cannot be RESTful and cannot be a REST API" and let's face it, we have all seen APIs purporting to be REST with no hypermedia whatsoever with lots of coupling between client and server. Some of this is just down to basic ignorance and some of it probably down to misunderstanding the Richardson Maturity Model

Rather than begin with a summary of Fielding's dissertation like most material on REST, this book begins with details of a simple web application that uses JSON RPC APIs. From the simple example Amundsen shows that while the JSON RPC approach functionally works, it results in a lot of coupling between client and server meaning that if the APIs need to change it will be difficult to do that easily as the client(s) with all its hardcoded of contracts will be impacted. And we know software does need to change from time to time right?

Amundsen distills the coupling with the JSON RPC approach into three distinct types which can be considered and assessed individually:

Objects - the JSON objects that appear in API responses. Clients need to be able to understand them to handle a simple a response to a GET request
Addresses - the URLs clients needs to know to invoke requests
Actions - details methods and arguments for all non-trivial operations. Again clients need to know this before invoking requests.

With the coupling clearly demonstrated, the scene is nicely set to move onto one of key advantages of a REST style archictecture: reducing coupling through hypermedia.

To explain this advantage, Amundsen again uses the approach of specific examples. Firstly, by detailing the JSON hypermedia type HAL. Using this approach reduces the Address coupling and examples of how generic response handling can be written on the client to leverage and take advantage of this decoupling are detailed. However HAL doesn't solve everything. Without a custom extension there is still coupling to the JSON Objects and the possible Actions available to the client. A work around to this is given and I would highly recommend anyone considering using HAL to read Chapter 4.

Next up is another JSON hypermedia type known as Siren. Kevin Swiber designed Siren and registered it with IANA in 2012.

Siren splits response entities into:

class - this is an array, the values of which indicate what the current resource represents e.g. Customer, Person
properties - set of name-value pairs
entities - a list of linked and representational sub entities
actions - contains a set of valid operations of the associated entity and how to invoke those operations including a list of fields which match HTML5 input types (hidden, text, number). This is something not in HAL that helps reduce client-server coupling further
links - links to other resources. Each link has a class, href, rel, title, type property

Siren reduces coupling to Addresses and Actions, however it does not reduce coupling to Objects. There is no meta-data specification for the class type meaning the client has to hardcode the structure of the object somewhere. Like HAL it is possible to create a custom extension but this is not part of the Siren specification.

The third hypermedia type detailed is Collection+JSON format (Cj). Interestingly, this format was designed by the author himself. The basic elements of a Cj message are:

Links - Simlar to HAL and Siren links
Items - Similar to HAL and Siren properties and also includes meta data about the properties
Queries - Information on how to construct various reads (HTTP GETs)
Templates - Information on how to construct various writes (HTTP POSTs, PUTs, DELETEs...)
error - information ref errors

The key point here is that since Cj includes the metadata about the items, it decouples the client from the Objects in the JSON responses something both HAL and Siren could only achieve with custom extensions.

So which format? Well two good points to make here:

That can be a practical decision and not just a technical one. You may prefer Cj because out of the box it achieves most decoupling, but your customer may be used to and prefer HAL.
Rather than trying to support every possible format, think about architecting so it possible to support extra formats if you need to. The approach suggested is described in the Amundsen's Representor pattern - which is inspired from the Message Translator Pattern

So in summary, this is another great REST book from O'Reilly. The style of the book in general is pragmatic rather than academic. It really emphasizes and demostrates the importance of hypermedia in REST APIs and is backed up with practical examples. The central argument in the book is that Cj achieves the most decoupling. Even if it was written by the author, the argument is well made and I don't think it would be fair to make accusations of any selection bias since he does detail how you can extend Siren and HAL to achieve the same level of decoupling.

Bottom line - if you want to understand the hypermedia aspects of REST, read this book.

Thursday, May 19, 2016

Immutable pointers - a pattern for modular design to help achieve microservices

Modular design has a lot of benefits, including:

making it easier to predict the impacts from a change
helping developers work in parallel

But, it is much much much harder than people think. For a start, the abstractions have to be at the right level. Too high and they can become meaningless and too low and they cease to be abstractions, as they will end up having way too many dependencies (efferent and afferent)

In a recent green field project, which was in the area of digital commerce, I was intent on achieving good modular design in the back end for the reasons outlined above and because the project had potential to grow into a platform (more than likely microservices) that would be used by multiple teams. To achieve the modular design, after some white boarding the back end was split up into a bunch of conceptual components to achieve shopping functionality. The core components are listed below.

Shopping component - Core shopping functionality e.g. Shopping Cart management
User component - user management, names, addresses etc
Purchases - details about previous transactions
Merchant - Gateway to merchant API to get information about merchant products etc.
Favourites - Similar to an Amazon wishlist

Now, every software project uses the word "component" differently. In this project, a component was strictly defined as something that did something useful and contained:

domain classes
services
a database schema (could be its own database, but the idea was to isolate it from any other components persistence)
its own configuration
its own dedicated tests
its own exception codes domain

The outside world could access a component via a bunch of ReSTful endpoints.

Any component could be individually packaged, deployed etc. Now, the astute out there will

be thinking, "this sounds like microservices", well they almost were. For this project,

some of them were co-located but they were architected so that deploying them out into

individual deployed artefacts (and hence a microservices approach would be easy).

Ok, to reiterate, the goal was achieve a very clean modular design. This meant, that I didn't want

any dependencies from one component's database scheme to another and for this blog post we are only going to focus on how that aspect of the modularity was achieved.

Now, looking at the above components, it doesn't take long to see that isn't going to be so easy. For example:

A shopping cart (in the Shopping component) will have a reference to a User (in the User component)
A shopping cart item (Shopping component) will have a reference to a Product (Merchant component)
A Shopping cart (Shopping component) will have a reference to a shipping Address (User component)

So the challenge of achieving modularity in the persistence tier should now be becoming more clear. References of some sort across components need to be persisted. Immediately, any developer will ask,

"Wait a sec, if we just use foreign keys for these inter schema references we get ACID and referential

integrity for free!" True. But then you are loosing modularity and introducing coupling. Say you want to move your products (in the Merchant Component) away from a relational database and use something like Elastic or Mongo DB instead - to leverage their searching capabilities. That foreign key isn't so useful now is it?

Ok, so first of all in looking for a solution here, I thought about all the references that were across

components to see if there was anything in common with them. One thing that was obvious was that

they were generally all immutable in nature. For example:

When a Cart Item (Shopping component) points to a Product (Merchant component) it points to that product only. It never changes to point to another product.
When a Shopping Cart (Shopping component) points to a User (User component), it is also immutable. My shopping cart is always mine, it never changes to be someone else's.

So I was now starting to think about preferences:

Avoid cross component dependencies if you can (this should be kinda obvious)
If you have to have them, strive for immutable references.

So, next up was to have a name for this type of relationship - which I was calling "Immutable pointer" in design and architecture documents. But, for actual code I needed something more succint. The database schema was already using "id" for primary keys and "{name_other_relationship}_id" for foreign keys. So I decided all cross component relationships were named as the the name of the entity being pointed to and "ref".

So, some concrete examples:

userRef (ShoppingCart pointing to the user)
productRef (CartItem pointing to the product)
shippingAddressRef (ShoppingCart pointing to the ShippingAddress)

This meant anytime anyone saw something like "xyzRef" in code, schema or logfiles they knew it was a cross component reference. In case it wasn't obvious, Ref was an abbreviation for Reference.

Next up was to decide on the format for the actual Refs. I took a bit of inspirational from that thing call the internet, which of course has similar concepts where abstractions: web sites of web pages contain immutable pointers to other abstractions: hyperlinks to web pages in other web sites.

So similar to hyperlinks and URLs, the refs would follow a hierarchical name spacing format.

Some good team input from senior technical member suggested to continue the inspiration from the Web and start the hierarchical names with cp://. CP for Commerce Platform the name of the project. This was analogous to http:// I thought this was good idea as it indicates that our platform generated the reference. Again, this meant they stood out in logfiles etc and could be differentiated from any downstream components also using hierarchical type references but in a different context.

The key point of the ref was that when generated it should of course be unique. To achieve this

a mixture of database primary keys or something unique about the data (e.g. product skus were used)

So some examples:

userRef -> cp://user/{uuid of user}
productRef -> cp://merchant/{uuid of merchant}/product/{sku of product}
cardRef -> cp://user/{uuid of user}/card/{uuid of card}

Always immutable?

As stated, the first preference was to avoid the cross component reference. The second preference

was to use the immutable pointer (ref pattern). However, what about an edge case where the cross component reference could be mutable. Could this happen? Well it could. Easiest to explain by an example.

Every shopping cart doesn't just have a User, it also has a selected shipping address where the contents purchased will be shipped to. In the domain model, the user's address lived in the User component. But unlike the other cross component references the shipping address could change. Consider your Amazon shopping cart. Imagine you are on the checkout screen with your selected card and your selected address, but before you proceed to checkout you go into your user preferences and delete your card and address. This project had to facilitate similar scenarios. So the User deletes the address that a shopping card id pointing to what should happen?

So for inspiration for solution here, we can look to the various NoSQL patterns and in particular

one of the most popular is eventually consistency. What this says is that unlike ACID you don't

always need consistency straight away, all the time. In certain cases it is okay to allow inconsistency

on the basis that the system is able to reconcile itself.

So in this case:

The shopping cart is pointing to a specific shopping address using an addressRef.
The user deletes that address by hitting a ReST endpoint in the user component.
This means the shopping cart will point to an address that doesn't exist. The system is inconsistent.
The next time the user reads the shopping cart, in the request handling the shopping component asks the user component if the address with this address ref sill exists and if it doesn't removes the pointer.
This system is now consistent

So with the architect hat on it is really important we get this all right. Otherwise the goal

of modular design falls apart.

In this case, it is worth reiterating the strategy one more time:

Avoid cross component references. Doesn't matter how great your pattern is, if you have a lot of cross component references, it is more than likely you have got the component abstractions at too low a level.
Favour immutability. In general immutability means less code paths, less edge cases, less complexity in code.
Eventual consistency.

Software Architecture is about trade offs, and finding the right balance. In this case, it was the balance between clean modular design but not going over board with it so that it become impossible to achieve.

For anyone trying to do microservices, I would strongly recommend trying to master how you would do modular design in your architecture first. If you can't do this, when you add in the complexity of the network things get very complicated.

Sunday, June 29, 2014

Book Review: REST API Design Rulebook (Mark Masse)

RESTful style architectures are becoming more and more ubiquitous. There are many reasons for this:

Web Tiers are full of JavaScript yearning to make nice simple AJAX requests
The obvious shortcomings of SOAP style strong contracts
They are a nice alternative to ESB's for integrating heterogeneous architectures

However, like the MVC design pattern or the Agile Software Methodology, while many projects may be claiming to be using the RESTful architectural approach everyone is doing it differently.

In 2010, Martin Fowler - in his excellent blog - discussed the Richardson Maturity Model. This model provided a very good categorisation technique to assess the degree of RESTfullness based on how many core REST principles were being used. Although, that same model gets a reference in Mark Masse's REST API Design Rulebook, Masse's book goes much more into low level detail about various REST best practises.

For example:

Negative caching: adding cache headers not just for positive responses but to 3xx and 4xx responses. This is something that may not be obvious, but could be a good performance / scalability boost depending on the nature of your application and user usage patterns etc.
How to version your URIs, your representational forms and your resource objects
Using a consistent forms to represent link relations

In addition, there is a abundance of other ideas and guidelines, some pretty simple but nonetheless important:

Don't end URI's with a slash
Don't use underscores in URI paths
Put "api" in the domain part of your rest resource path, e.g.

https://api.dropbox.com/1/oauth/request_token

The main reason why I think it is good to have a book like this is because when a development team are trying to use a REST style architecture, disagreements, misunderstanding will inevitably happen. For example, the proverbial: 'Should a URI end in a plural or singular noun?'
It is always good to be able to reference a respected industry resource to prevent rabbit holes appearing and eating into your valuable project time.

Furthermore, there are some really quick and easy things you can do to make a much a better REST API that are discussed in the book. For example:

Adding an ETag HTTP header to that shopping cart resource as items go in and out of it.
Using query fields to generate partial responses and using the ! for an excludes option.

Now for some constructive criticism. Firstly, I don't there will ever be complete consistency in REST approaches. Some of the so called best practises could be argued to be just subjective or nice-to-haves. They are not something that are going to make a big difference to either functional or non-functional aspects of your architecture. Some of the industry leaders not only take different approaches in their REST APIs, but they are also sometimes doing the opposite of what Massé suggests. For example, Massé suggests not to include a file extension type in your REST URLs (see Chapter 2), but (at the time of writing) Twitter do just that (the URI is: https://api.twitter.com/1.1/statuses/mentions_timeline.json)

Furthermore, in a lot of projects you will be writing a REST API purely to serve JSON back to a Web client. This is quite different to a public facing API on the scale of Twitter, Amazon etc. Therefore you need to ask yourself if you really need to put the time investment in so that you are adhering to every single REST best practise instead of just doing what makes sense for your project. Some obviously make sense, some could be argued to be overkill. For example making sure you are sending back the correct HTTP code when you know the client never even checks if it is a 403 or 405.

I do think this book is written more as if you were designing a public facing API. If you find yourself in that situation you should definitely, definitely, definitely be considering everything Massé is saying in the book. But note, the emphasis is on considering which doesn't always mean adhering or adopting.

The book does a very good job in covering best practises that a lot of developers just wouldn't think of (setting response headers such as content-length, setting the location header in responses for newly created resources, how to support HTTP 1.0 when you are trying to discourage caching) and is definitely worth a read but you really have to think about what makes sense for your project. As stated, some of the suggestions are quick wins others you have to assess the cost and the benefit. Following the core basic REST principles (statelessness, a good resource model, uniform interface etc.) is what is really important after that the challenge is figuring out what works best for each specific project and how to make the most of your time. That will vary from project to project and will depend on project scope, scale etc. A good architectural decision should always consider the various options but make the appropriate decision for the appropriate project.

Until the next time, take care of yourselves.

Sunday, November 13, 2011

REST Spring 3.0 - Simple Introduction

This post is going to be a very simple introduction to the RestTemplate API introduced in Spring 3.0.

Dogs know how to REST!

The RestTemplate is similar to other Spring templates such as JmsTemplate and JdbcTemplate in that Spring eliminates a lot of boot strap code and thus makes your code much cleaner. When applications use the RestTemplate they do not need to worry about HTTP connections, that is all encapsulated by the template. They also get a range of APIs from the RestTemplate which correspond to the well know HTTP methods (GET, PUT, POST, DELETE, HEAD, OPTIONS). These APIs are overloaded to cater for things like different ways of passing parameters to the actual REST API. Ok, so let's look a very simple example. Suppose we want to invoke a well know Twitter REST API to check the Twitter timeline for the user "dublintech". This API would takes form: http://twitter.com/statuses/user_timeline.xml?id=dublintech

Fire it into your web browser and you'll see you get back the timeline for dublintech in XML. But you don't want to fire it into your browser, you want to invoke the API in a Java program! Let's do that.

That's it. Isn't that incredibly simple. No HTTP Connection handling to worry about. Import one Spring 3.0 class, use it and that's it! Yes, this post was a very simple introduction. In the real world, if you were using this API you'd obviously be doing something a little more sophisticated for example using XPath to get specific information out of the XML returned. But we don't need to cover that. The point of this post was really to point how easy it is to use some of the REST 3.0 Spring features. In future posts we'll be looking at more advanced features.

For now - keep it simple!

References:
1. http://static.springsource.org/spring/docs/3.0.x/javadoc-api/
2. http://blog.springsource.org/2009/03/27/rest-in-spring-3-resttemplate/
3. https://dev.twitter.com/docs/api Twitter REST API
4. Roy Fielding's original REST Paper: http://www.ics.uci.edu/~fielding/pubs/dissertation/rest_arch_style.htm