One of my favourite questions from the November cohort of Production-Ready Serverless [1] is, "How do you handle e2e tests involving multiple services across bounded contexts?" In a microservices environment, testing user journeys that span across multiple bounded contexts requires collaboration and a clear delineation of responsibilities. Depending on how your organisation is structured, different teams are responsible for testing parts or the entirety of the user journey. For example... The microservice teamsIndividual microservice teams are typically aligned with bounded contexts. They may own one or more microservices within their designated bounded context and should be responsible for testing them. Their job is to ensure that their microservices function correctly and adhere to the defined contracts with other microservices. Tools like Pact [2] are great for verifying that a microservice meets consumers' expectations (i.e. downstream services). This is a great way to catch integration problems between different microservices. But these tests alone does not guarantee the user journey is going to work. These service-centric tests can be very detailed and comprehensive. They are great at catching edge cases that might be missed from testing user journeys. However, they are often not broad enough to cover entire user journeys. The QA teamsIn some organizations, a cross-functional team, which may include QA engineers, developers, and product owners, will own the user journeys. This team is responsible for end-to-end testing across bounded contexts. They test the user journey as a whole to ensure that all system components (the frontend and the microservices) integrate seamlessly and meet business requirements. Tools like Cypress [3] or Selenium [4] can drive these end-to-end tests from the frontend. These tests cover entire user journeys and will test both the frontend and backend. However, they do not replace the service-centric tests the microservices teams would run and own. Full-stack teamsIn many organizations, full-stack teams own both the frontend pages and microservices within a domain. In some cases, they would use microfrontends to allow more team autonomy. This is so that teams can more easily make changes on their pages without impacting others. If you want to learn more about microfrontends, check out my conversation [5] with mr. Microfrontend himself, Luca Mezzalira. In situations like this, the full-stack teams can use end-to-end tests to test their parts of the user journey. Again, tools like Cypress and Selenium are a good choice here. But here, we reach the heart of the question, "How do you handle e2e tests involving multiple services across bounded contexts?" (for example, for step 2. above) Do you deploy related services in your own environment? For example, if each team has its own AWS account, it can deploy other teams' microservices into its accounts and test them together. However, this becomes cumbersome as the system grows. To make it work, you need more and more automation. Also, what version of the other teams' code do you deploy? If you deploy the latest code in development, then you risk instability. If you deploy what's in production, then you miss out on agreed-upon changes that have not yet made it into production. Do you run these tests in dedicated integration environments? For example, a QA/test environment designed for QA teams to run user journey tests. And where necessary, for them to run manual tests. However, this delays the testing of integration across bounded contexts. The full-stack teams want to test their part of the user journey and identify integration problems early. The answer shouldn't be "let the QA team worry about it"! From the full-stack team's perspective, we're back to running other teams' services, or... Do you use mocks so that each team can exercise their tests independently? This is my preferred approach. Tools like WireMock Cloud [6] are very helpful here. You can start by uploading an OpenAPI spec or manually configuring the routes. They also support chaos testing - you can add delays or error responses for specific routes. I find WireMock Cloud a great way to test third-party APIs. It can also be used to test integration with other teams' services without hosting them yourself. For example, in your dev environment:
Sometimes, a feature requires coordinated changes across multiple microservices. You can update the mock API to reflect the agreed-upon change in other microservices and start testing your code right away. It reduces the amount of coordination and interdependence between teams. However, this approach also requires extensive automation to keep the mock APIs up to date. It also demands that teams document their APIs rigorously—and that's a good thing! I wish more teams practised contract-first development. SummaryEnd-to-end testing across bounded contexts in a microservices environment requires a careful balance of responsibilities and collaboration. Microservice teams should focus on service-level testing. Tools like Pact help catch integration problems early while you're still making code changes. QA teams often oversee user journeys and use tools like Cypress or Selenium to test integration across the system. Full-stack teams own both the frontend and backend. So they can test their part of the user journey autonomously. They can host other teams' microservices to support end-to-end tests or use tools like WireMock to test their parts of the user journey in isolation. There is more than one way to test user journeys that span multiple bounded contexts. Your choice depends on organizational structure, team responsibilities, and the maturity of your testing practices. Ultimately, every part of the user journey should be tested, whether it's done piecemeal by individual teams or centrally by a QA/cross-functional team. And it's OK to have both! Service teams should test their parts of the user journey independently. This will allow them to be more exhaustive, help them catch edge cases, and identify problems early. The QA team's tests will be broader in scope and can catch integration problems that are difficult to spot at the service level, especially when 3rd party integrations and manual steps are involved. After all, small amounts of overlapping tests will hurt a lot less than insufficient test coverage of important user journeys! Links[1] Production-Ready Serverless workshop [2] Pact [3] Cypress [4] Selenium [5] Microfrontends with Luca Mezzalira [6] WireMock Cloud |
Join 13K readers and level up you AWS game with just 5 mins a week. Every Monday, I share practical tips, tutorials and best practices for building serverless architectures on AWS.
2024 was the year I got back and amongst the community, and it felt great to be back! Blog I published 33 new blog posts. As a whole, my blog garnered 353k views from 255k visitors. About half of them came through Google search. This is down from 2023... but the decline is offset by more people reading my content through my newsletter nowadays. Most read blog posts: Hit the 6MB Lambda payload limit? Here’s what you can do When to use Step Functions vs. doing it all in a Lambda function How to...
The ability to invalidate a user's session with immediate effect is a common enterprise requirement. For example: If a user's credentials are compromised, we need to immediately revoke the user's access and force the user to change credentials. If an employee is terminated or an external contractor's access is revoked, their session should be invalidated immediately to prevent misuse. Many regulations mandate strict access controls and the ability to prevent unauthorized access in real time....
So that's it for this year's re:Invent. Werner delivered his usual insightful keynote and shared some key lessons in dealing with complexity and warning signs to look out for. As Werner said, the number of moving parts is not a good measure of complexity on its own. And that's something so many people get wrong about serverless - that they judge complexity by the no. of components on an architecture diagram. Ok, on with the serverless announcements. Aurora DSQL (pronounced "dee-sequel")...