
Build a moat with browser automation

Everybody can use a Next.js boilerplate. Everybody can use an API. Everybody can add an AI chatbot to their product.

Even if you land on a business that works with these methods, you still don't have a moat. You still aren't differentiated from the competition, let alone all the copycats that come after you.

If you really want to make something that sticks, you need to look for features that are too hard to replicate. Luckily, these also tend to be the features that are the most valuable to your users.

The question is, where do you find these features? And how do you build them?

Browser automation is unique

When I think about a moat, my mind always goes to the browser. Why? Because everything happens in the browser, even the things that you can't do with APIs.

For example, you can't use an API to sign up for a new account on a website, fill out an arbitrary form, or click a button on someone else's page.

You also can't scrape a website's most valuable data, interact on social media, track competitors' prices, monitor a website's performance, or generate beautiful images, videos, and interactive PDFs.

And you can't give an LLM direct access to a persistent browser session so it can take actions on a user's behalf.

Yet these are all things you can do with a headless browser and a little bit of code.
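
To make that concrete, here's a minimal sketch of that "little bit of code," using the open-source Playwright library (the same one I recommend at the end of this post). The URL and selectors are hypothetical placeholders, not a real signup flow:

```ts
import { chromium } from 'playwright';

// Hypothetical example: sign up for an account on a site that offers no API.
// The URL and selectors are placeholders; adapt them to your target site.
async function signUp(email: string, password: string) {
  const browser = await chromium.launch();
  const page = await browser.newPage();

  await page.goto('https://example.com/signup');
  await page.fill('input[name="email"]', email);
  await page.fill('input[name="password"]', password);
  await page.click('button[type="submit"]');

  // Wait for the post-signup redirect before declaring success.
  await page.waitForURL('**/welcome');
  await browser.close();
}

signUp('me@example.com', 'hunter2').catch(console.error);
```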

Browser automation is defensible

Why, then, aren't more people building these features?

Well, browser automation is a bit of a pain.

First, the web is constantly changing, so you need to write automation code that's resilient to change, and you need to ensure you're notified when your resilience measures start to fail.

Second, your target websites probably don't want you to interact with them programmatically. US law is clear: your code is free to interact with the public web just as a human is. But the law doesn't prohibit sites from finding ways to block or ban you. So you also need a bit of skill in avoiding detection and working around anti-bot measures.

Third, it's a pain to host headless browsers. They're very large, very heavy, and they don't scale in the same way as your typical API. For example, your API may support 1,000 requests per second, but with the same CPU/memory profile, your headless browser may only support 10 requests per second. They also don't work so well in serverless environments. So it's better to host the browsers separately.
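
In practice, "hosting separately" means your code connects to a browser running elsewhere. Playwright supports this over a WebSocket; in this sketch, the endpoint URL is a hypothetical placeholder for whatever your browser host gives you:

```ts
import { chromium } from 'playwright';

async function run() {
  // Instead of launching Chromium locally (heavy, slow to cold-start),
  // connect to a browser hosted elsewhere. The endpoint is a placeholder.
  const browser = await chromium.connect('wss://browsers.example.com/playwright');

  const page = await browser.newPage();
  await page.goto('https://example.com');
  console.log(await page.title());

  await browser.close(); // closes the connection, not the remote host
}

run().catch(console.error);
```

The same pattern works whether the remote browser lives in a Docker container you run yourself or in a managed pool.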

Browser automation is valuable

All that said, when you get browser automation right, you can charge an arm and a leg for it. Just look up the pricing for proprietary data sets, LinkedIn automation, or enterprise RPA (Robotic Process Automation). Cha-ching!

The best part is that there's low-hanging fruit for browser automation in every niche. Talk to anyone, in any industry, about the pointing and clicking they have to do for their job, and you'll find a wealth of opportunities.

Remember, most knowledge workers spend the vast majority of their time occupied by repetitive tasks in the browser... and they hate it!

It's just a bit annoying to build, so most people don't.

Browser automation can be easy

I've worked with browser automation for years, from major corporate gigs to indie startups to personal side projects. I've faced pretty much every challenge there is, and I've learned a lot about how to make it easy.

Let's look back at our list of challenges:

  1. Writing resilient code
  2. Avoiding detection
  3. Hosting headless browsers

Writing resilient code

Even if you're only a "decent" dev, these practices aren't out of reach. After all, the web doesn't change that much, or that often.

It boils down to a few simple practices:

  1. Handle all possible errors. You never want your job to fail when it could have succeeded. Always think through ways your code could fail, and handle them preemptively.
  2. Write idempotent code. You may have to re-run your jobs, and you want to ensure this doesn't create more problems down the line.
  3. Support partial successes. In complex jobs, a lot can go wrong. But you've got your users to think about. Ensure that even if a job fails, you track and report the successful portions.
  4. Select elements with fallbacks. For example, you target a button by its text, its ID, and its location on the page, then take the first match. (See the sketch after this list.)
  5. Run interactions with fallbacks. For example, if your first strategy for submitting a form fails, try an alternative method.
  6. Notify yourself of failures. At the end of a run, aggregate data about what failed and why. Your scripts should always have fallbacks, so it's not okay if, due to site changes, only a single path still works. An error-tracking service like Sentry is great for this.
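
To make practices 4-6 concrete, here's a minimal sketch in Playwright. The selectors are hypothetical, and the console.warn stands in for whatever your real error tracker (e.g. Sentry) would be:

```ts
import type { Page } from 'playwright';

// Practice 4: target the same element several ways, in order of preference.
// Practice 6: report when a fallback fires, so you learn about breakage
// while the other paths still work.
async function clickWithFallbacks(page: Page, selectors: string[], label: string): Promise<void> {
  for (let i = 0; i < selectors.length; i++) {
    const locator = page.locator(selectors[i]);
    if ((await locator.count()) > 0) {
      await locator.first().click();
      if (i > 0) {
        // A fallback fired: the preferred selector broke. Send this to your
        // error tracker (e.g. Sentry.captureMessage) instead of console.warn.
        console.warn(`${label}: fell back to selector #${i}`);
      }
      return;
    }
  }
  throw new Error(`${label}: all ${selectors.length} selectors failed`);
}

// Hypothetical usage: target a button by its text, its ID, then its position.
// await clickWithFallbacks(page, [
//   'button:has-text("Submit")',
//   '#submit',
//   'form button >> nth=0',
// ], 'submit button');
```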

Like all things in programming, these practices seem more daunting than they are. Once you've done them a few times, they become second nature.

I've begun writing about them on the BrowserCat blog. Check out my guide to strengthening your selectors for a taste.

Avoiding detection

This task may seem even more daunting, but it's actually much easier than you think. I listed the following recommendations in descending order of effectiveness. Only implement them as you find you actually need them.

  1. Use a headless browser. It's easy to detect that simple fetch() requests are bots, but a real headless browser is much harder to distinguish from a normal browser instance. This is why most scraping APIs charge you 5x the credits for the privilege.
  2. Throttle your requests. Normal humans don't make 100 requests per second. Neither should your jobs. They're too easy to detect, so most adversaries (including Amazon, LinkedIn, and X) rely on this metric as their first or only line of defense.
  3. Use a residential proxy rotation. If you're making a lot of requests from a single commercial IP address, you're going to get detected. But by rotating through numerous residential IPs in your target country, you're going to seem that much more like a real user.
  4. Simulate human behavior. Some sophisticated anti-bot measures will look at mouse movement, keyboard patterns, and "speed-running" through on-site actions. Thankfully, with a headless browser, some helper methods, and a random number generator, you can beat most detection. Open-source libraries will help with the few remaining edge cases. (See the sketch after this list.)
  5. Solve captchas. If you're still getting detected, you can rely on third-party APIs to solve captchas for you. Some will use real humans, while others will use AI. Either way, you'll get a solution in just a few seconds. Nevertheless, it's better never to get to this step. Most won't need it.
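
Here's a minimal sketch of recommendations 2-4 in Playwright. The proxy address and the timing ranges are illustrative assumptions, not tested values:

```ts
import { chromium } from 'playwright';

// Recommendation 2: throttle. A random pause in a human-ish range.
const pause = (min: number, max: number) =>
  new Promise<void>((resolve) => setTimeout(resolve, min + Math.random() * (max - min)));

async function run() {
  const browser = await chromium.launch({
    // Recommendation 3: route traffic through a rotating residential proxy.
    // The address is a placeholder for whatever your proxy provider gives you.
    proxy: { server: 'http://proxy.example.com:8080' },
  });
  const page = await browser.newPage();

  await page.goto('https://example.com/search');
  await pause(800, 2500); // humans read before they act

  // Recommendation 4: simulate human behavior. Move the mouse in steps
  // rather than teleporting it, and type with per-keystroke delays.
  await page.mouse.move(200, 300, { steps: 25 });
  await page.click('input[name="q"]');
  await page.keyboard.type('widgets', { delay: 120 });
  await pause(300, 900);
  await page.keyboard.press('Enter');

  await browser.close();
}

run().catch(console.error);
```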

Hosting headless browsers

This is the most annoying part of browser automation.

As mentioned above, browsers are heavy dependencies, and they must be hosted separately from your code. In serverless environments, they have terrible warm-up times, they're costly to run, and they're limited in many cumbersome ways. But when deployed in Docker, you've got to manage scaling to zero, or you're going to spend a lot of money for nothing.

Meanwhile, if you want to generate videos, download large files, reuse persistent user sessions, or in any other way take full advantage of the browser, there are many other bits of infrastructure for you to develop.

The biggest problem isn't that it's hard to do, but rather that you have to do it instead of building your product. It's a huge, costly, painful distraction. (Even if you're starting from a baseline like Playwright's docker image.)

This is the main reason why browser automation isn't more popular.

PITCH: That's why I'm building BrowserCat. BrowserCat hosts and scales headless browsers so you don't have to. Point your code at our servers, and pay only for what you use. I aim to make it cheap, fast, and easy to work with headless browsers, so reach out with any questions or feedback you have.

Write your first browser automation

If you're interested in building a moat with browser automation, check out my article on writing your first script. It shows how to use the open-source Playwright automation library to control a headless browser on your local machine.

And when you're ready to deploy, jump to production with BrowserCat.

Ask me anything!

If you made it this far, you're clearly interested in browser automation. I'll be hanging out in the comments below, and I want to help. Ask me anything!

posted to Developers on December 20, 2023
  1. 3

    Good luck, mate! I love your website. Did you design it yourself?

    You are definitely onto something. However, it's not a niche without competition, so be prepared for a long journey.

    Also, I will get it featured on SaaSHub's newsletter next week. Cheers.

    1. 2

      Hey, thanks, I did! And much appreciated with SaaShub, I actually just signed up a few days ago. It's my new go-to for finding apps. :)

      1. 1

        Cheers! I'm so happy every time someone shares that they find SaaSHub useful. Happy New Year!

  2. 2

    Recently I've been trying to learn more about browser automation, and this article is just what I needed!
    Thanks for linking your website. I'm definitely going to try it. :)

    1. 1

      Thanks, Rob! If you run into any snags, message me!

  3. 2

    Your upcoming project, BrowserCat, sounds like an exciting solution to the hurdles of hosting and scaling headless browsers. Simplifying this process for developers could indeed open up a realm of possibilities for innovative products. Best of luck with BrowserCat—I'm eager to see how it transforms the landscape of browser automation.

    1. 1

      Thanks! I hope I can be of service. :D

  4. 2

    The concept behind BrowserCat is intriguing and addresses a common pain point for developers working with headless browsers. The promise of hosting and scaling headless browsers without the hassle is certainly appealing. The pay-as-you-go model adds a cost-efficient dimension to the service, making it accessible to a wide range of developers.

    1. 1

      Thanks for the kind words! If you ever need any help, ping me!

  5. 2

    I was facing serious issues trying to run a headless browser with a logged-in account (YouTube, Instagram, etc.). If I can somehow make that work, I'll definitely get back to the idea of browser automation.

    1. 1

      Big websites like that can detect your automation via browser fingerprinting.
      You can try Puppeteer with stealth plugins like puppeteer-extra-plugin-stealth.
      But I don't recommend building browser automation for those kinds of websites.

      ✌️

    2. 1

      When you hit a roadblock with your next attempt, message me. I'll help you work through it.

  6. 2

    Hi Mike. If you don't mind, I'd like to give you a free customer analysis report. From this analysis, you can understand what your potential customers are doing online and change your marketing strategy accordingly.

    Here is the customer analysis: BROWSERCAT CUSTOMER ANALYSIS

    According to the analysis, "web scraping tool" (average Google traffic of 10.31K; Reddit threads with around 10 upvotes and 12 comments) is one of the most frequent subjects your potential customers are searching for and interacting with online. You can adjust your blog and Reddit posts accordingly to reach the right customers and readers.

    Going deeper on the subject, here are the top things your customers are interested in:

    • powerful web scraping tool

    • easy-to-use web scraping tool

    • best web scraping tool for e-commerce sites

    Tailoring your marketing efforts to these topics will get you the right customers.

    Hopefully this helps you gain more customers and eventually buyers. You can also run the analysis yourself using decentool.com

    1. 1

      The report is much appreciated. Decentool seems pretty cool. Based on your report, I think I'll focus on offering some helpful content in the scraping subreddits, and if those topics perform well, I'll expand them on the blog. Cheers!

  7. 2

    Hey Mike! It sounds awesome. I'm also working on web automation, and it's amazing what we can do with it.

    1. 1

      I'd love to hear about what you're building. It doesn't seem like repeat.la is the automation project. Drop a link? :D

  8. 2

    Good luck for the future!

  9. 2

    this is actually super cool! how quickly did you pivot from your author biz to this and what level of conviction did you have?

    1. 3

      It took me about six months to pick my next project. I looked at nearly 200 ideas to varying degrees. By the end of that process I was pretty damn convinced.

      BrowserCat is a bet that the browser is going to become infrastructure in the next decade, as autonomous agents become the web's primary users and as websites become more protective of their data. I think this is pretty much guaranteed. So the question is if I can develop a platform that people prefer for programmatically controlled browsers.

      But you're making a similar bet with VTuberPro. It looks pretty damn cool! How's it going for you?

      1. 2

        I love that story and the part about looking at 200+ ideas resonates pretty well haha

        Thanks! I'm hoping VtuberPro picks up, but it's just the beginning stages right now. Mostly dogfooding my own product and beginning to do some customer discovery. Trying to focus more on finding out who my early adopters would be, beyond myself!

  10. 1

    All niches have competition, and there truly isn't any moat.
    If you can figure out how to do it, so can someone else. Hard does not mean impossible.

    That's where patents come in. There's no real way of defending your IP and creating a moat around your product other than litigation, and that's super expensive, and sometimes impossible to enforce.

    One shouldn't be looking for moats anyway. That's the wrong mindset. Just solve one problem for one person at a time. Make sure it's (somewhat) scalable. Rinse and repeat.

    1. 1

      I don't think of moats as unbeatable. Invading armies were able to cross moats. And I want a world of maximum competition. No artificial forcefields.

      To your latter point: The ultimate moat is serving your users best. Do the hard things first, because your competition may not have the ability or will to do so. Nevertheless, it's a moat. :D

      I don't think we actually disagree!
