Building the perfect GitHub CI workflow for your frontend team
August 3, 2021 / 15 min read
Last Updated: August 3, 2021You've probably noticed if you've been following me for a while that I'm a āØ big fan āØ of automation. I wrote about automated CI/CD a year ago and also talked a lot about the concepts surrounding this subject, but never really touched upon the tools I use for my CI jobs and how I use them. One such tool that has really worked for me, especially as a frontend engineer, is GitHub CI.
For over a year now, it's been my service of choice for automated CI/CD pipelines. The workflow
syntax is easy to get started with, and has an extended set of features to help you craft your CI experience the way you and your team may want it.
However, even after a year, there still is a lot that I'm learning about this tool every day. When I got started with it, there was no set rule on how to properly architect your workflows, and there's a lot of tips, tricks I discovered along the way to build what I would qualify as "the perfect GitHub CI workflow" (at least to my eyes š). This article aims to gather those tips and good practices I've been using for personal projects and at work and show you how you can use all of those in a single workflow to power the CI/CD pipeline of your frontend team.
What would constitute a "good" GitHub CI workflow?
I'm going to throw my best "engineer response" at this question: it depends! Your team might have specific needs or objectives that would make some of my tips not as useful to you as they could be. However, for this article, we need some guidelines that I think would be universal when it comes to building efficient GitHub workflows, such as:
- cost-saving: bring the "build minutes" down to the lowest possible value to not have a massive bill at the end of the month.
- efficient: your team's time is precious, the workflow should be as fast as possible, but also fast to fail if something were to go wrong
- well-architected: each step has a purpose, and might depend on other steps. This also means not running "useless steps".
Now that we've established those guidelines, let's take a look at one of the most important tips of this article.
One workflow to rule them all
Let's consider a typical set of tasks a frontend team would run on every PR:
- Lint
- Formatting
- Type checking
- Unit test
- Build
- End-to-end tests, maybe on different browsers
Running those in separate workflows might look like the most straightforward way to architect those tasks. However, if something as simple as the lint
task fails, there's no way you can stop your expensive tasks like build
or your end-to-end tests from running. And that, my friends, is not very efficient.
Workflows run in parallel, and there's no way for them to interact with one another. Thus, you can't cancel a workflow due to another workflow's failed state. You're stuck running all the workflows in every PR.
To address this, I chose to combine all my workflows into one. All the tasks that were independent workflows before became part of the same unique workflow, but this time, as jobs.
Excerpt of a Github CI workflow job.
1# In this example, lint-format is a job among many others in a bigger GitHub workflow.2# This job has 3 steps: Checking out the code, running the lint command, and running the formatting command.34jobs:5lint-format:6runs-on: ubuntu-latest7strategy:8matrix:9node: [12]10steps:11- name: Checkout Commit12uses: actions/checkout@v213- name: Use Node.js ${{ matrix.node }}14uses: actions/setup-node@v115with:16node-version: ${{ matrix.node }}17- name: Run lint18run: |19yarn lint20- name: Run prettier21run: |22yarn format
The cool thing about jobs is that you can run them sequentially or parallel as you please! GitHub provides a handy keyword called needs
that lets you set one or several jobs as dependencies, thus preventing a given job to start unless the dependent jobs have successfully run. This allows us to:
- Fail the workflow fast. If a key job fails, the workflow is marked as failed on your PR as soon as possible
- Avoid running useless expensive tasks on a "doomed to fail" workflow run
Example of jobs running in parallel and sequentially
1# In this workflow excerpt, the type-check and unit-test jobs run in parallel whereas the2# build job "needs" these 2 jobs to be successful to be kicked off.3# Thus, if any of type-check or unit-test were to fail, the build job will not start and the4# whole workflow will be marked as "failed".56jobs:7type-check:8runs-on: ubuntu-latest9strategy:10matrix:11node: [12]12steps:13- name: Checkout Commit14uses: actions/checkout@v215- name: Use Node.js ${{ matrix.node }}16uses: actions/setup-node@v117with:18node-version: ${{ matrix.node }}19- name: Check types20run: |21yarn type-check22unit-test:23runs-on: ubuntu-latest24strategy:25matrix:26node: [12]27steps:28- name: Checkout Commit29uses: actions/checkout@v230- name: Use Node.js ${{ matrix.node }}31uses: actions/setup-node@v132with:33node-version: ${{ matrix.node }}34- name: Run test35run: |36yarn test37build:38runs-on: ubuntu-latest39strategy:40matrix:41node: [12]42needs: [type-check, unit-test]43steps:44- name: Checkout Commit45uses: actions/checkout@v246- name: Use Node.js ${{ matrix.node }}47uses: actions/setup-node@v148with:49node-version: ${{ matrix.node }}50- name: Run build51run: |52yarn build
You may be wondering: what job should be run in parallel and what job needs to be run sequentially? That will depend on the needs of your team.
On my end, I tend to parallelize unit testing, linting, and type-checking for example. These steps are generally fast and inexpensive to run thus I do not feel they need to depend on each other in most cases. However, I'd require a job such as build to only run if those three jobs above are successful, i.e. run it sequentially.
The screenshot below features the GitHub Workflow powering the CI for this blog. Yours will probably end up sharing some similarities:
As you can see, by combining all our workflows into one, and carefully choosing which job to parallelize or run sequentially, we end up having better visibility on how our CI pipeline functions and the dependencies between each of its steps.
"You are terminated"
My last tip, and perhaps my favorite due to its simplicity is terminating duplicate workflow runs.
It happens to me very often: I'm done with a current branch and decide to push my code and open a PR, thus triggering a workflow run. Then a few seconds later noticed I forgot to run that one console.log
or made a typo somewhere and need to push an extra change, thus triggering yet another workflow run.
By default, there's nothing that will stop the first workflow to run, it will continue until it's finished, thus wasting precious billing minutes that could have had a better use.
To prevent such a thing from happening, GitHub recently introduced the notion of workflow concurrency.
With the concurrency
keyword you can create a concurrency group for your workflow (or a job). This will mark any workflow run from that same concurrency group as "pending" if any run is currently in progress. You can also decide to cancel any in-progress workflow of the same concurrency group whenever a new workflow is added to the queue.
Example of GitHub workflow using concurrency groups
1name: CI23on:4pull_request:5branches:6- main78concurrency:9# Here the group is defined by the head_ref of the PR10group: ${{ github.head_ref }}11# Here we specify that we'll cancel any "in progress" workflow of the same group. Thus if we push, ammend a commit and push12# again the previous workflow will be cancelled, thus saving us github action build minutes and avoid any conflicts13cancel-in-progress: true1415jobs:16install-cache:17...
Doing this at the workflow level will ensure that any old or outdated workflows that are in progress will get canceled when we push a new change and triggering a new workflow thus saving your team's precious time and money.
Conclusion
So now that we went through all the tips to build the perfect Github CI workflow to power the CI needs of a frontend team, let's take a look at how they hold up against the guidelines we've established earlier:
Is it cost-saving?
Yes! We made sure to share the output of expensive steps such as build
and to cache repetitive steps that we would have needed to run throughout the workflow like installing our dependencies.
Is it efficient?
More efficient than running every job in a separate workflow for sure! Not only we are parallelizing independent jobs like e2e-tests-firefox
and e2e-tests-chrome
, we're also making sure to cancel any duplicate workflows thanks to the use of concurrency groups.
Is it well architected?
As we saw in the screenshot showcased earlier in this blog post, it's now easy to visualize all the steps and their dependencies. Combining every task into one workflow and architecting those jobs using the needs
keyword made the whole CI pipeline way easier to understand.
Need a full example? Don't you worry, I got you covered š! You can find my Github CI workflow featuring all the tips and examples of this article on the GitHub repository of this blog. It's fairly dense and long, thus why I did not directly integrate it here directly as it might have been distracting.
I hope some of the tips I introduced in this blog post will help you and your team perfect your own GitHub workflows and thus achieve a fast and reliable CI pipeline for your favorite frontend projects! Are there any other tips that you wish I had introduced in this article? Other GitHub CI secrets that empowered your team worth mentioning? As always, do not hesitate to reach out! I'd love learn more about what worked for you and test them out to further improve this article!
Liked this article? Share it with a friend on Twitter or support me to take on more ambitious projects to write about. Have a question, feedback or simply wish to contact me privately? Shoot me a DM and I'll do my best to get back to you.
Have a wonderful day.
ā Maxime
A guide with some of the best Github CI tips I could find to the power best CI pipeline for your frontend team's needs.