Want to know what’s hot and what’s not on ProductHunt? I analyzed user reviews and product descriptions of over 10,000 ProductHunt listings to find out!
ProductHunt can make or break a launch for a business or product. Which is why founders (that's you people!) tend to invest a significant amount of time & money to build up a launch strategy, image assets, video, copy and external marketing campaigns.
Sometimes though, despite your best efforts, you won't achieve great success as some business models just aren’t popular on ProductHunt by design.
Given I have a great interest in understanding why and how things work, I wanted to pinpoint what these factors are.
For me the best way to do this is to perform topic modeling on the product descriptions of 10k+ products on PH - all to find insights and determine which type of products were most popular.
If you would you like to see the ProductHunt Product Clusters in the dashboard, whilst reading along then you can view the Clustering Dashboard here
** TL;DR**
Products that achieved the highest number of reviews were based around emails (25 in total) and rèsumes (41 apps).
You will face the lowest amount of competition and may prefer to build these type of products below
1st - As an overall leading type of product, building a webapp is the way to go. This type of product achieved:
2nd - Automation based products that were focused towards marketing, tech, AI & productivity score quite high.
3rd - Email Productivity products tend to solve an in-demand problem.
4th - Community focused apps relating to community management on social media or within forums scored highly.
5th - Screenshot or screenrecording apps were also in high demand. These types of apps were very popular with both upvotes and reviews.
Generally you may want to avoid building in these saturated categories as they had the highest most competition amongst
Overall, the products that were reviewed the lowest amount were based on web domains, QR Codes & habits / behaviour focused products.
Products that achieved the lowest average results and upvotes were as follows.
2nd - Update and notification based products
3rd - Google Suite focused extensions / products
4th - QR Code / Barcode orientated products
5th - Productivity Apps & Pomodoro Timers
6th - Habit Tracking & Behaviour based products
Using the all-MiniLM-L6-v2 encoder, 10,000 product descriptions on 4 main product categories, Marketing, Productivity, Tech, and AI, have been converted into 786-dimensional vectors (which are coordinates based on a 3d plane).
A K-Means clustering algorithm was applied to check which the similarity of reviews in groups of 60, 120 and 240 clusters.
These results were then fed into a Relevance AI dashboard.
Due to other features in addition to the product description, such as the number of upvotes and the number of written reviews, we have used aggregations to show the total number of upvotes and reviews per cluster, as well as the keywords extracted from every cluster using a technique called zeroshots.
We can have two variations of the same dataset, one portraying the description of the products, and the other one showing the logs (static png or gif) of the different products.
Standard web scraping tools such as Beautifulsoup did not work on ProductHunt which required a different technique.
By using pyautogui (a python library to control the keyboard and the mouse movements), to automate the page scrolling, we were able to scrape this differently.
Thanks for reading!
If you would you like to see the ProductHunt Product Clusters in the dashboard, feel free to follow along here -
View the Clustering Dashboard here
If you would like to DIY this data experiment, feel free to use the following links:
View Juypiter notebook here
Download the ProductHunt Dataset here
Wow that's alot of data crunching!
I think this shows one of the big issues for products that launch on Product Hunt. Many of these apps are solutions in search of a problem and launch in search of a user base that may or may not exist. There are better places to validate demand for your product than Product Hunt.
Nice analysis mate! Was looking forward to this review.
Very well done, thanks for sharing!!
Informative! We're going to announce a product launch soon on PH and this post provides us with insight into how to make it successful.
Thank you for the research!
Adam! this was truly incredible! this data validates my idea on a web app. based on the current remote and growing culture. Thanks for this incredible write-up! i will be using this and the dataset to further my understanding of what works and what doesnt! keep crushing it, bro! amazing article!
Amazing insight. Thanks for sharing! Definitely our end goal/audience/customers. Can we dig in a bit further to find how they started? Did they targeted a big market from the get go OR they started in a niche and then they organically grew/pivoted based on market needs? Granular data is hard to find but let's keep digging. :)
Would be interesting to also map the timing of votes. Perhaps not in the past, but for future products. I still believe that the initial dozen of votes (and their timing) matters a lot. Sitting in Europe, one can get a bit of an advantage of pre-warming the upvotes a bit through friends before the US wakes up.
Another thing that would be interesting is somehow figuring out how "someone's network" effects the popularity of a product. If Pieter Levels launches a Wordle clone (he wouldn't I'd guess), it would still be product of the day, regardless of quality or category of product. So an effective network skews the results quite a bit, one would assume.
Not sure how and if those kinds of large-network-effect submitters could be filtered out somehow.
Unfortunately, there were duplicated data. Pyautogui wasn't the right choice of tool for this task.
If you need the full dataset without duplicates let me know.
I will launch my product on Product Hunt and this article is very useful for me, thanks a lot!
that's neatly curated data.
Here are my 2 cents. In the last 14 days,
if these stats sound interesting, do visit statsph.com
No wonder we failed our ph launch :)))))) #design
Very informative @AdamJMarsh. It gives an insight about which ideas should be pursued if you have lots of them and don't know which one to select.
Amazing Job. Thanks for such great reviewing.
Awesome article!
wow, incredible job :). thanks for the informative analysis. May I know how long it took you to make that ?
Wow, this is great. Thank you! This type of data is exactly what I look for when I am new to a scene. What I would love also love to see is engagement metrics for community involvement.
Thanks!
Nice work, I feel like it's somwhat simplistic in some aspects (you could interpret a lot of competition in a space as saturation or sign of market opportunity, matter of perspective I'd say), but I like the fact that you provide both notebook & dataset.
The download link to GitHub is broken unfortunately and the "Start for Free" Button on your product's pricing page leads to your staging environment, you might want to look into that.
Thank you for the pick up there Fabio, will look into this today.
In the meantime, this is the correct and updated download link for the dataset - https://github.com/arditoibryan/datasets/tree/main/220202_producthunt
That's better reporting than what most newspapers would do these days!
Great job!
Thanks for all the research! This is quite helpful.
Curious where marketing software stacked up.
Great analysis!