Non-Personalized Recommender Systems in eCommerce

Author

Vaibhav Sahu

Principal Data Scientist

Published Mar 11 2019

Recommendation techniques have always played a very crucial role in the marketing activities. The recommender systems have been available for a long time, but it is only recently that they have started being identified with the eCommerce businesses. While a physical example of recommendation can be understood when a mannequin displays a designer’s clothing outside the shop, in eCommerce, it becomes a challenge when the products need to be recommended to a wide range of customers online

The objective of any recommendation system is to help the users to find the products or items that they want. In an eCommerce scenario, the system would present the user with a list of items that they are most likely to buy. In such cases, the system must have all the information about the users (like demographics, previously bought items, channels for buying, etc.) and the products (such as product attributes, prices, discounts available, descriptions, etc).

4 Categories of Recommender Systems:

  1. Non-personalized
  2. Content-based
  3. Collaborative filtering
  4. Hybrid systems

In this blog, we will focus on the non-personalized recommender systems.

Non-Personalized Recommendations

While a personalized recommendation system suggests the products to a user based on their previous history, a non-personalized recommender system displays the products that are popular among the people in general during that time frame. By suggesting a product that the customer may potentially buy, the recommendation system also helps a website in creating their first impression of having the products that may be of the customer’s interest. There are various types of non-personalized systems depending on the customer’s usage.

Product Trending

You may have noticed various eCommerce websites showcasing trending products, or “Hot Items” on their home pages, at the side or bottom of their product display pages. The main purpose of displaying these trending products is to keep the visitors engaged and get them converted into customers by constantly updating the products in real-time or near real-time. These products may get updated within the time buckets that may range from 15 minutes to 2 hours or 4 hours. These time buckets are based on the rate of change of the products from a highly-sold in a time window compared to the previous time window.

There are multiple flavors of trending algorithms. Three of those are explained below:

1. Average Change: Average of the difference in total sales within the time windows.
Average [ (total sale window0 – total sale window1), (total sale window1 – total sale window2) + (total sale window3 – total sale window2), ………….]

2. Weighted Average of Change: Weighted average of the recent three-time windows changes in total sales for a given period.
(total sale window0 – total sale window1) + 0.5*(total sale window1 – total sale window2) + 0.25*(total sale window2 – total sale window3)

3. Slope: Slope of the best fitting line representing the data points in degrees.

Fig.1. Comparing all 3 trending algorithms for 3 different time periods

The above figure shows the number of units sold in a time bucket of 4 hours. However, other metrics, such as views, clicks, total sales or number of times the product was added to the cart, can be used for trending. A friendly suggestion, always normalize the product pricing when using sales dollars on deciding metrics, if you don’t want to display only the high-priced items.

Product Ranking or Sequencing

Product sequencing, or ranking, helps in deciding the order that the products must be displayed on your web page. Whether it’s a home page, category page or a result page of any search query, the products that are being showcased must have a logical order. Intuitively, this order should be based on the likelihood of the product to be clicked or bought by the customers.

Aggregate Metrics-Based Ranking

Many features can be used as a proxy for this likelihood. Some of them are listed below:

  1. Number of units sold
  2. Sales in dollars
  3. Number of views
  4. Number of clicks
  5. Number of times products are added to the cart
  6. Click-through rate
  7. Sell-through percentage
  8. Add-to-bag conversion percentage
  9. Number of reviews / ratings, etc.

These metrics are aggregated within a time duration, which is generally taken from 1 day to 2 weeks. General aggregations are a sum or an average. Again, don’t forget to normalize the product prices when using sales dollars as a deciding metric.

Prediction Based Ranking

In this technique, metrics are used to predict the product ranking for the next day with the help of a historical data. The main advantage of this technique is that a trained model can help in ranking the rank according to their changing prices on a daily basis.

Various machine learning algorithms, like Linear Regression, Random Forest Regression, etc., can be used to train models to predict the next day sales using input features, such as historical sales in dollars, views, units sold, available stock, daily price, day of week, month (for modelling seasonality), discount available, rating, number of reviews, number of clicks, etc.

Below are the ranking results from our prediction model using Gradient Boosted Machines (GBM).

Fig.2. Product ranking system using Sales Prediction (categories: Men’s Watches, Women’s Tops)

Explained above is one of the crucial tools of Non-Personalized Recommender systems. Its importance lies in the fact that it helps in creating your brand’s first impression on the new customers while serving the purpose of revenue maximization for the vendor.