Spring for GraphQL: How to solve the N+1 Problem?
GraphQL promises to solve some major shortcomings associated with REST API, namely over-fetching and under-fetching. But, GraphQL is also prone to one problem, commonly known as N + 1 problem. In short, if the GraphQL service is not implemented correctly, it can cause significant latency in the API calls.
Not surprisingly, Spring for GraphQL provides an elegant solution to solve N + 1 problem.
So, what exactly is N + 1 problem in GraphQL?
What is the N + 1 problem?
Firstly, it’s not as dreaded as it sounds 😑. Secondly, it has nothing to do with Big O, used to describe the algorithm’s performance. Thirdly, the solution to the N + 1 problem is surprisingly straightforward.
But, before we talk about the solution let’s understand the problem first.
Let’s try to understand this problem with one example.
Assuming, you have defined the following GraphQL Apis.
And on the service side, it’s implemented as:
In the above code, we have defined two Data Fetcher
s
books()
for field books of the GraphQL object typeQuery
.ratings(..)
for field ratings of the typeBook
.
The important point to note here is — if you don’t specify a Data Fetcher for a field then the GraphQL assigns a default PropertyDataFetcher
which, in turn, looks for the public XXX getXXX()
method in the POJO object defined by the Type
.
Therefore, in the above example, GraphQL resolves the field name
from public String getName()
method of the Book
object.
Also, when the GraphQL service executes a query it calls a Data Fetcher for each and every field
.
So, if a GraphQL client requests the following data,
then the GraphQL runtime engine does the following.
- Parses the request and validates the request against the schema.
- Then it calls the
book
Data Fetcher (handler methodbooks()
) to fetch book information once. - And, then it calls the
ratings
Data Fetcher for each book.
Now, it’s very much possible that Books
and Ratings
are stored in different databases, or books and ratings are different microservices altogether. In any case, this results in 1 + N network calls.
The n+1 problem means that the GraphQL server executes many unnecessary round trips to fetch nested data.
If you check the service log then you will see that rating
Data Fetcher is called sequentially for each and every call to book.
How to solve N + 1 problem?
In Spring for GraphQL, you can solve this problem using @BatchMapping
annotation.
You just need to declare @BatchMapping
annotation on a handler method, that takes List<Book>
and returns a Map
containing Book
and List<Rating>
as:
In
@Bathmapping
, Spring batches the call to therating
Data Fetcher.
You can also leave out the parameters field
and typeName
as the field name defaults to the method name, while the type name defaults to the simple class name of the input List
element type.
If you run the above query in GraphiQL at http://localhost:8080/graphiql?path=/graphql, you will see that the GraphQL engine batches the call to the ratings
field.
Another approach to solve the ‘N + 1’ problem
There is another way to solve N+1 in Spring for GraphQL — using low-level GraphQL Java Data Loader API. A Data Loader
is a utility class that allows batch loading of data that is identified by a set of unique keys. It's a pure Java port of Facebook DataLoader.
This is how it works.
- Define a Data Loader that takes a set of unique keys and returns results.
- Register Data Loader in the
DataLoaderRegistery
. - Pass Data Loader to
@SchemaMapping
handler method.
A DataLoader
defers loading by returning a future so it can be done in a batch. A DataLoader
maintains a per request cache of loaded entities that can further improve performance.
Since BatchLoaderRegistry
is available as a bean, you can inject that anywhere (in our case we have injected it in the Controller
class for convenience).
And then define a handler method that takes Data Loader as an argument.
If you run the above query in GraphiQL at http://localhost:8080/graphiql?path=/graphql, you will see that even though rating
Data Fetcher is called for every book
but GraphQL engine batches call to Data Loader
as a single call.
Code example
The working code example of this article is listed on GitHub. To run the example, clone the repository, and import graphql-spring-batch as a project in your favorite IDE as a Gradle
project.
Summary
A GraphQL service can be susceptible to N + 1 problem if not implemented correctly. The n+1 problem means that the GraphQL server executes many unnecessary round trips to fetch nested data. In Spring for GraphQL, we can solve this problem by defining a handler method with @BatchMapping
annotation.
Originally published at https://techdozo.dev on August 7, 2022.