All Posts

The Test Automation Snowman

ByAlan Page May 7, 2018May 7, 2018

I was nothing short of blown away over the past few days, when some comments I made on twitter about UI automation caused a lot of folks to raise their eyebrows.

Here’s the tweet in question.

I’m not against discussions on the invalidity of the test automation pyramid.

If you don’t like it, you use whatever model you want as long as it suggests you write AS FEW UI TESTS AS POSSIBLE.

seriously – stop your infatuation with UI tests

— Alan Page (@alanpage) May 3, 2018

Feedback (blowback?) ranged from accusations of harmful blanket statements to lectures on how what I really meant was “checking”, not testing – with a handful of folks who seemed worried that the world I was describing was too scary compared to the world where they lived. Testing evolves at different speeds in different places, so the last point, at least, was expected. But I stand by the sentiment in that tweet.

The test automation pyramid is a model for thinking about distribution of your tests. Mike Cohn first (I think) wrote about it here. In my opinion (and observation), there is an unhealthy obsession in software testing with writing “automation” (where “automation” means UI automation – i.e. using tools like Selenium to manipulate the UI to test the system under test). A few folks recently have complained about the pyramid (“it’s not a pyramid, it’s a triangle!”; “There are more than 3 types of testing”, etc.).

“All Models are wrong, some are useful” — George Box

It’s a good model, with a lot of practical application. Two key takeaways from the pyramid model are:

Write tests at the lowest level where they can find the bug
Minimize the amount of top/UI level tests

I’m passionate about the second point for three main reasons:

UI Tests are flaky. This was true 25 years ago when I first wrote UI automation, and it’s true today. They’re just not as reliable and trustworthy as lower level tests.
Despite the fact that reliable UI Tests are difficult to write, we (industry) seem to think that UI automation is a reasonable entry point to the world of coding for “manual” testers. UI automation is a horrible way to start programming. Shell (or other) scripts to help set up test environments or generate test data would be a much better use of time (and achieve more success) than learning to code by writing UI tests.
UI tests are s l o w. This is fine if you have a handful of tests, but a huge issue if you have hundreds, or thousands of UI tests.
Note – if you have a large amount of testing that can only be done at the UI level, that’s a big red testability issue you should probably address before investing in expensive testing.

Let’s assume for a moment that the problems with UI automation stability have been solved (companies like testim.io have used ML to make some strides in this area, and despite the entry path problem I mentioned above, there is improvement in automation tools and tester skills). If we go with this assumption, then point #1 – and possibly point #2 above are no longer an issue.

Point #3, however, is not solvable. Tests that automate the UI are slow. Way slow. Like a glacier stuck in molasses slow. I once wrote a UI based networking test to create a folder, share it, connect to it, write files to it, delete files, and then unshare the folder. That test took a little less than two minutes. Problem was, that I needed to test that process for every character possible in isolation (due to issues with DBCS code pages on non-Unicode Windows where details would fill pages of no longer relevant information). On Chinese windows, for example, this was (IIRC), somewhere near 8000 characters.

I wrote an API level test that tested the entire code page – including varying lengths of folder and share names that ran in under 5 minutes (and less than a minute for Western code pages). Of course, we still did spot checking (both exploratory, and via some UI automation), but testing at the level closest to where we could find bugs was the most efficient – both in proximity and speed.

Another view of tests I like is the size model from google. Rather than dwell too much on what makes a test a unit or integration test, think of tests in sizes – where tests of a certain duration are classified at different levels. This model works well (and solves the pyramid complaints I’ve seen recently), but it doesn’t have a visualization.

So – without further babbling, I created this alternate view – The Test Automation Snowman.

Use it, or ignore it. But I still beg you to consider writing far fewer UI based tests.

All Posts

Five for Friday – Juneteenth, 2020
ByAlan Page June 19, 2020June 19, 2020

Happy Juneteenth everyone – I hope you all find your own way to celebrate today. Here are some interesting things I found on the internet this week. First, I want to share an article about Juneteenth that I liked. Juneteenth: are we really woke this time? I enjoyed this article on the relationship between culture…

Like this:
Like Loading…

Read More Five for Friday – Juneteenth, 2020
All Posts

Five for Friday – May 10, 2019
ByAlan Page May 10, 2019May 10, 2019

Still loving my new role at Unity. As I look at the links I saved this week, I’m wondering if an analysis of my FfF posts over the last 18 months will show trends in my actual day-to-day work. I’ll leave that random thought for those smarter than me to investigate. If this isn’t the…

Like this:
Like Loading…

Read More Five for Friday – May 10, 2019
All Posts

Five for Friday – October 21, 2022
ByAlan Page October 21, 2022October 20, 2022

We have made it to another Friday. Air quality in the bright green pacific northwest got to pretty unhealthy levels this week, but it’s better (finally) today. I feel like I’ve been licking a dustpan. On the times when i could see my computer screen through the haze, here’s what I found this week. First…

Like this:
Like Loading…

Read More Five for Friday – October 21, 2022
All Posts

Beyond Regression Tests
ByAlan Page March 13, 2011

In a recent talk on test design (link), I discussed the concept of "useful tests". In my definition, useful tests are tests that provide new information. Almost every test is useful…once – typically the first time it’s run where it shows that the underlying functionality is working. From that point on, many tests function primarily…

Like this:
Like Loading…

Read More Beyond Regression Tests
All Posts

Five for Friday – December 6, 2019
ByAlan Page December 6, 2019December 6, 2019

It’s that time again. Kathy Keating has a great article this week on how Engineering Leaders are Failing Themselves Netflix has open-sourced another interesting tool – Open-Sourcing Metaflow. This is a short and good video summary of one-on-one conversations with remote employees Once again, I’m working my way through Advent of Code – Day 5…

Like this:
Like Loading…

Read More Five for Friday – December 6, 2019
All Posts

Five for Friday – August 12, 2022
ByAlan Page August 12, 2022August 12, 2022

Today has been an interesting day in American politics. Not sure what happens next, but I hope it gets better. Here are a few things I found this week that I thought were interesting. This post on PM & EM: Rules of Engagement echoes something I’ve noticed over the past several years – there’s a…

Like this:
Like Loading…

Read More Five for Friday – August 12, 2022

10 Comments

Drew says:

May 7, 2018 at 11:05 am

Ancient Google Test Sizes:
https://testing.googleblog.com/2010/12/test-sizes.html

I can’t speak to the rest of the post insulting someone. Possibly I already have.

Reply
Nikolay Advolodkin says:

May 7, 2018 at 11:27 am

I think your argument makes complete sense. Your automation snowman is a good diagram as well. One thing I feel is missing from the diagram is a quantification of “slow”, “slightly slower” and so on. I believe that for people that have never written a unit or integration test have a different concept of what is slow. Someone that writes only UI tests thinks that a 30 minute UI test is slow. While for someone like myself, that knows that a unit test can run in milliseconds, I consider anything longer than a 30 second UI test to be slow. I used to be the former too 🙂

So I think that by adding quantification to your snowman, you could possibly avoid the miscommunication that can result from different perspectives.

Reply
1. Alan Page says:
  
  May 7, 2018 at 2:12 pm
  
  The only reason I didn’t add numbers because I think it’s fair to have some variance between types of applications or release cadence (for something shipping quarterly, I think it’s ok to have a suite of tests that takes 2 hours – but I’d never tolerate that for something deploying continuously.
  
  But I think it’s worth calling it out now. Thanks.
  
  Reply
Barret Vasilchik says:

May 8, 2018 at 10:37 pm

I think one possible reason for the large amount of ui automation could be that testers are still black box testing for many things so using the browser is all they know.

I myself want to grow and move more towards learning to write tests near the lower levels, since the majority of my experience is writing ui tests

Reply
adi says:

May 9, 2018 at 5:20 am

You wrote UI automation 25 years ago?

Reply
1. Alan Page says:
  
  May 11, 2018 at 8:08 am
  
  Yep. For Windows apps using MS Test (later became Visual Test). The beta version was available in 1993, and I used it frequently.
  
  Reply
Pingback: Good and Bad UI Test Automation explained – Inspired by Richard Bradshaw’s Tweets – Chris Kenst
Pingback: Testing is like a box of rocks – Gregory Testing
Pingback: Java Testing Weekly 20 / 2018
Pingback: Testing is like a box of rocks | greg.dev

Like this:

Similar Posts

Like this:

Like this:

Like this:

Like this:

Like this:

Like this:

10 Comments

Leave a Reply Cancel reply