56
21 Comments

What I learned analyzing 10,000 Product Hunt Listings to show what products are successful & unsuccessful

Want to know what’s hot and what’s not on ProductHunt? I analyzed user reviews and product descriptions of over 10,000 ProductHunt listings to find out!

ProductHunt can make or break a launch for a business or product. Which is why founders (that's you people!) tend to invest a significant amount of time & money to build up a launch strategy, image assets, video, copy and external marketing campaigns.

Sometimes though, despite your best efforts, you won't achieve great success as some business models just aren’t popular on ProductHunt by design.

Given I have a great interest in understanding why and how things work, I wanted to pinpoint what these factors are.

For me the best way to do this is to perform topic modeling on the product descriptions of 10k+ products on PH - all to find insights and determine which type of products were most popular.

If you would you like to see the ProductHunt Product Clusters in the dashboard, whilst reading along then you can view the Clustering Dashboard here

What makes a successful ProductHunt listing?

** TL;DR**

  • The ProductHunt audience LOVE Web based apps, automation products, email productivity, community focused products and screen recording / screenshotting software.
  • Don’t expect Pomodoro, QR Code, Habit Tracking, VPN, Paywall or Notification products to get high engagement.

The Good

  • Products that achieved the highest number of reviews were based around emails (25 in total) and rèsumes (41 apps).

  • You will face the lowest amount of competition and may prefer to build these type of products below

    • Timezone (9 apps)
    • VPN (10 apps)
  • 1st - As an overall leading type of product, building a webapp is the way to go. This type of product achieved:

    • 1st highest avg upvotes (11.11)
    • 1st highest avg reviews (30.63)
  • 2nd - Automation based products that were focused towards marketing, tech, AI & productivity score quite high.

    • 2nd highest avg upvotes (10.52)
    • 11th highest avg reviews (10)
  • 3rd - Email Productivity products tend to solve an in-demand problem.

    • 3rd highest avg upvotes (10.42)
    • 93rd highest avg reviews (5.74)
    • This was interesting as although the upvotes were high, the average reviews were in the bottom 50% of all products.
  • 4th - Community focused apps relating to community management on social media or within forums scored highly.

    • 5th avg upvotes (9.13)
    • 51st avg reviews (7.02)
  • 5th - Screenshot or screenrecording apps were also in high demand. These types of apps were very popular with both upvotes and reviews.

    • 6th avg upvotes (8.85)
    • 19th avg reviews (9.18)

The Bad

  • Generally you may want to avoid building in these saturated categories as they had the highest most competition amongst

    • Automation (148 apps)
    • Chatbots(118 apps)
    • Design (118 apps)
    • Video (105 apps)
  • Overall, the products that were reviewed the lowest amount were based on web domains, QR Codes & habits / behaviour focused products.

    • My hypothesis here is that both of these apps tend to be used away from your average ProductHunter’s desk, or have long timespans to give an accurate review, which requires a greater motivation than reviewing a Browser extension.

The Ugly

  • Products that achieved the lowest average results and upvotes were as follows.

    • 1st - Paywall & VPN related products
    • 240th lowest / bottom 9% for average reviews (2.1)
    • 208th lowest / bottom 17% for average upvotes (3.5)
  • 2nd - Update and notification based products

    • 239th lowest / bottom 11% for average reviews (2.29)
    • 236th lowest / bottom 7% for average upvotes (2.67)
  • 3rd - Google Suite focused extensions / products

    • 238th lowest / bottom 11% for average reviews (2.41)
    • 234th lowest / bottom 10% for average upvotes (2.91)
  • 4th - QR Code / Barcode orientated products

    • 237th lowest / bottom 12% for average reviews (2.44)
    • 240th lowest / bottom 4% for average upvotes (2.25)
  • 5th - Productivity Apps & Pomodoro Timers

    • My hypothesis here is that these apps are simple and rehashed, with little innovation between them. There’s a low engagement here as new products entering this market aren’t innovative or impressive enough.
    • 236th lowest / bottom 12% for average reviews (2.47)
    • 218th lowest / bottom 15% for average upvotes (3.33)
  • 6th - Habit Tracking & Behaviour based products

    • 239th lowest / bottom 6% for average reviews (2.44)
    • 230th lowest / bottom 15% for average upvotes (2.88)

Technical Write up

Using the all-MiniLM-L6-v2 encoder, 10,000 product descriptions on 4 main product categories, Marketing, Productivity, Tech, and AI, have been converted into 786-dimensional vectors (which are coordinates based on a 3d plane).

A K-Means clustering algorithm was applied to check which the similarity of reviews in groups of 60, 120 and 240 clusters.

These results were then fed into a Relevance AI dashboard.

Due to other features in addition to the product description, such as the number of upvotes and the number of written reviews, we have used aggregations to show the total number of upvotes and reviews per cluster, as well as the keywords extracted from every cluster using a technique called zeroshots.

Solution

We can have two variations of the same dataset, one portraying the description of the products, and the other one showing the logs (static png or gif) of the different products.

Dataset

Standard web scraping tools such as Beautifulsoup did not work on ProductHunt which required a different technique.

By using pyautogui (a python library to control the keyboard and the mouse movements), to automate the page scrolling, we were able to scrape this differently.

Thanks for reading!

If you would you like to see the ProductHunt Product Clusters in the dashboard, feel free to follow along here -
View the Clustering Dashboard here

If you would like to DIY this data experiment, feel free to use the following links:
View Juypiter notebook here
Download the ProductHunt Dataset here

posted to Icon for group Product Hunt
Product Hunt
on March 3, 2022
  1. 8

    Wow that's alot of data crunching!

    I think this shows one of the big issues for products that launch on Product Hunt. Many of these apps are solutions in search of a problem and launch in search of a user base that may or may not exist. There are better places to validate demand for your product than Product Hunt.

  2. 3

    Nice analysis mate! Was looking forward to this review.

  3. 2

    Very well done, thanks for sharing!!

  4. 2

    Informative! We're going to announce a product launch soon on PH and this post provides us with insight into how to make it successful.

    Thank you for the research!

  5. 2

    Adam! this was truly incredible! this data validates my idea on a web app. based on the current remote and growing culture. Thanks for this incredible write-up! i will be using this and the dataset to further my understanding of what works and what doesnt! keep crushing it, bro! amazing article!

  6. 2

    Amazing insight. Thanks for sharing! Definitely our end goal/audience/customers. Can we dig in a bit further to find how they started? Did they targeted a big market from the get go OR they started in a niche and then they organically grew/pivoted based on market needs? Granular data is hard to find but let's keep digging. :)

  7. 2

    Would be interesting to also map the timing of votes. Perhaps not in the past, but for future products. I still believe that the initial dozen of votes (and their timing) matters a lot. Sitting in Europe, one can get a bit of an advantage of pre-warming the upvotes a bit through friends before the US wakes up.

    Another thing that would be interesting is somehow figuring out how "someone's network" effects the popularity of a product. If Pieter Levels launches a Wordle clone (he wouldn't I'd guess), it would still be product of the day, regardless of quality or category of product. So an effective network skews the results quite a bit, one would assume.

    Not sure how and if those kinds of large-network-effect submitters could be filtered out somehow.

  8. 2

    Unfortunately, there were duplicated data. Pyautogui wasn't the right choice of tool for this task.

    df_drop_duplicates

    If you need the full dataset without duplicates let me know.

  9. 1

    I will launch my product on Product Hunt and this article is very useful for me, thanks a lot!

  10. 1

    that's neatly curated data.

    Here are my 2 cents. In the last 14 days,

    • The PH community is most active on Tuesday and Thursday. Posts launched on Tuesday and Thursday got 9000+ votes and 1500+ comments
    • Majority of posts are launched between 8 AM and 9 AM GMT. This makes sense considering Product Hunt is following PST and people will launch products as soon as the day starts in PST
    • Posts launched Productivity got the most number of upvotes, 13340+.

    if these stats sound interesting, do visit statsph.com

  11. 1

    No wonder we failed our ph launch :)))))) #design

  12. 1

    Very informative @AdamJMarsh. It gives an insight about which ideas should be pursued if you have lots of them and don't know which one to select.

  13. 1

    Amazing Job. Thanks for such great reviewing.

  14. 1

    wow, incredible job :). thanks for the informative analysis. May I know how long it took you to make that ?

  15. 1

    Wow, this is great. Thank you! This type of data is exactly what I look for when I am new to a scene. What I would love also love to see is engagement metrics for community involvement.

    Thanks!

  16. 1

    Nice work, I feel like it's somwhat simplistic in some aspects (you could interpret a lot of competition in a space as saturation or sign of market opportunity, matter of perspective I'd say), but I like the fact that you provide both notebook & dataset.

    The download link to GitHub is broken unfortunately and the "Start for Free" Button on your product's pricing page leads to your staging environment, you might want to look into that.

    1. 3

      Thank you for the pick up there Fabio, will look into this today.

      In the meantime, this is the correct and updated download link for the dataset - https://github.com/arditoibryan/datasets/tree/main/220202_producthunt

  17. 1

    That's better reporting than what most newspapers would do these days!
    Great job!

  18. 1

    Thanks for all the research! This is quite helpful.

    Curious where marketing software stacked up.

Trending on Indie Hackers
Meme marketing for startups 🔥 User Avatar 11 comments After 19,314 lines of code, i'm shutting down my project User Avatar 1 comment Need feedback for my product. User Avatar 1 comment 40 open-source gems to replace your SaaS subscriptions 🔥 🚀 User Avatar 1 comment We are live on Product Hunt User Avatar 1 comment Don't be a Jerk. Use this Tip Calculator. User Avatar 1 comment