Home / Features / Mythbusting The Amazon Algorithm Part II: Amazon Lists, Products, and Sales
SPR AWARDS 2016 OPEN FOR SUBMISSIONS!

Mythbusting The Amazon Algorithm Part II: Amazon Lists, Products, and Sales

There has been a fair outrage from certain authors and bloggers who claim that my last piece about the Amazon Algorithm did not consider the way lists (often referred to by the author community as “poplists” although this is not an Amazon term) work. There has also been some whispering about BookBub’s sales being left out of Amazon results, and a high misunderstanding perpetuated by crowd mentality and myth online.

I called myself a “search expert” in the last post.  Clarification: my entire adult working life has been spent in publishing, marketing, and search for multinational banks, automobiles, telecommunications – and books. I was responsible for multi-million-dollar accounts for multi-billion-dollar companies, who relied on my team to deliver conversions (sell stuff) globally, and I’ve been doing that for twenty years. For the last six of those years I have worked exclusively in editing and book marketing.

So let’s get down to cracking open a few of these nuts.

MYTH#1 – Amazon has many algorithms for the Bestseller Rank and Lists, and they are separate for sales and products

TRUTH  – The A9 algorithm feeds many different variables and sorts these into lists and rank

Semantics are really off in the world of indie.

algorithm

One algorithm feeding many lists that have many different variables, not many “algos.” Sometimes, people, even informed people, refer to these variables as “algorithms” plural. An algorithm is by definition a collective noun used to describe a set of instructions given to quantify data.

 It’s more than one algorithm when it’s running a separate set of steps. So you might say Google and Amazon have different algorithms.

Variables.

What is the Amazon Algorithm?

Slide @tonyverre at Rockfish Digital from SMX 2016

Slide @TonyVerre, Amazon e-tailor expert from SMX 2016

The algorithm, the A9,  spits out results in SERPs (Search Engine Results Pages). These SERPs are actually divided into lists on Amazon, which are Amazon’s way of presenting the data for users in the most enticing way for sales. Products are sorted for optimum sales. Not a separate one for products, and then sales. One algorithm using all sets of data. Because products and sales are intrinsic. Amazon want to use what sells well in its suggesters, and then serve them to users in a palatable list.

Dave Chesson, Kindle ranking expert at SEMRush says, “Like Google, A9 works to find the right pages using specific on-page and off-page data in order to build their own SERPs.”

What has become known as “Poplists” by indie authors on forums is the equivalent to SERPs on Google, the search pages that come up when looking in categories. However, these pages are generated with a different agenda: to not only study your buying behavior from what you browse, click, buy, wish for, and review, but also to forcibly suggest products you are most likely to buy by placing them right in front of your face during searches.

Comparing Google Search Results to Amazon Lists

After Amazon, now with 3 times the search traffic of Google for products, Google is the most well-used platform for search online worldwide. In the years I worked in the media search environment, vertical search came into play on Google.

Google is a different type of algorithm to Amazon in that it has a different objective: to serve you the most relevant results and learn from what you click and read what adverts to serve you to make money for themselves. They are obsequious in that they want to learn from your own decisions and what you look at. It’s an unstructured site, in other words it has no content to begin with, and does not attempt to catalogue information in any finite way.

Google uses lists too, by sorting into Books, Videos, Maps, etc.  This is vertical search: Sorting results into piles for you to find the most relevant items:

google vertical

Or you can turn off personalization in Google to see the overall results not for just you:

personalization

Here’s an article about vertical search from Editor-in-Chief and SMM manager at WebCEO, Nelly Vinnik. She says, “Vertical Search returns specialized results which represent different types of content for a query.  You can search for images in Image Search, for videos in Video Search, for news in News Search, and local shops or restaurants in Maps Search…” Sound a bit familiar if you replace the words for search with Category, Best Seller, Popular…”?

Here’s an Amazon’s category SERP  (the proper name for the word “Poplist” bandied about) when I search for Epic Fantasy. I do buy GOT books, so it’s most relevant to me with “Most Relevant” set on the right, which is another variable, as a sortable list.

Amazon poplist

 

If a user logs into Amazon, their preferences, search history and everything they browsed and bought previously, along with everything people like them in demographic bought is fed into the algorithm. From the search bar input also, which is one part of the algorithm, Amazon then generates lists of products that the user would most likely buy. That is why my lists look different to my husband’s after searching for the same keyword, and also why products may be out of rank order on the page.

Personalization in Amazon

Unlike Google, personalization is set by default. As discussed in Part I, you could turn off some personalization, but you will end up with skewed results that won’t help you shop or rank your book because you are still tied to everyone else’s personalized results thrown in the mix, so you can only turn off 3rd party tracking.

These lists given are used to encourage you to buy products in the order you yourself are most likely to buy them, despite their rank. This is because unlike Google, which is used for research and finding things out, maps, and reading stuff on the web, Amazon is a shop. If you go to Amazon, you are at the very least thinking about making a purchase. This means Amazon has to present results that make you want to buy.

Lists – Vertical SERPs in Amazon

The Amazon algorithm is a set of instructions that sorts the catalogue of products into lists pertinent to that buyer’s personal choices including Rank, popular items, and Hot New Releases. These lists have different parameters and feed directly from the search and product information available at the time of the search. These lists are the exact same thing as vertical search in Google with Books, Maps, Videos etc. in the image above, but they sort data by different criteria.

Lists = Various ways of sorting and presenting products by criteria given for that variable

 serps amazon There are many lists, such as:

  • Best Seller Lists
  • Popularity Lists
  • Recently Tagged Lists
  • Recently Popular Lists
  • New Release Lists
  • Movers and Shakers
  • Hot New Releases
  • 90-day New Releases
  • Categories

The main lists that are caught up in myth-making are:

  1. Categories SERPs (Poplist) – Category-based filtering by sales in category + personalization
    Shown above, filterable in many ways by category, sub-category, relevance etc. and will not be in ranking order necessarily
  2. Popular Items – Sales, ranking, personalization, plus timely Amazon choices (Seasonal etc.)
    Popular Items are shown in a list of items selling well at that moment, and according to your own personalization results so Amazon can make you buy something more quickly. This list feeds off sales and personalization factors as shown in the last post I wrote and is also timely to make room for promotional timely products such as Easter, Xmas, new JK Rowling etc.
  3. Hot New Releases - Sales in recent time against others in your category and Amazon choices
    Hot New Releases shows results by how many books sold just released in the last 30-90 day and coming soons based on both preorder books and trad publisher dates. Also here Amazon adds the products it wants to show you that are going to be released, based or your personal history, because they are a shop, and they want to push products they have deals on – one factor myth-spreaders forget when second-guessing this stuff. We’ve seen authors promoted in a three-day push make this list pretty consistently, but you need to be selling around 30-50 books a day in a lower-volume subcategory to make this work so quickly.
  4. Best Seller Rank – Sales but considers ranking factors also for driving sales
    Rank is a straightforward sales result, but also takes into consideration all the factors I shared in the last post, because newsflash, Amazon wants to sell good products. So you can and will be dropped if your product does not meet quality or content guidelines – see below. Historically how you sold also counts towards keeping ranked, as does review recency and volume of reviews. *This is not “bestseller” as in books, but the best seller in the category, i.e. the seller/item that does best, and is used in all categories across Amazon, not just books.

MYTH#2 – It’s Possible For Authors To Test Amazon As A User To Find Patterns To Help With Ranking

TRUTH – While it’s possible to examine real-time data for immediate promotion, it’s absolutely impossible to use customer data over time to conclude anything that could help authors sell books

False Positive/Negative Results Done by Author Groups

As an author it is impossible – yes, I will say that again for certain critics among you – IMPOSSIBLE to garner the whole picture if you test using customer data. This is because you cannot second-guess the results shown in real time for every user on Amazon. It is akin to guessing everyone’s favorite color by knowing your own, and then assuming that people must like green because they don’t like blue.  This sort of testing in software development is only used when QA managers (Quality Assurance) are trying to break software to prove that further testing is needed, and NOT to draw conclusions.

You can see why an unqualified person running tests may not be doing it right just by looking at the different types of testing qualified software testers use or here for the UK standards documentation. Like any experiment, the premise must be clean and infallible. When developing detection algorithms or tests, a balance must be chosen between risks of false negatives and false positives. Usually there is a threshold of how close a match to a given sample must be achieved before the algorithm reports a match. The higher this threshold, the more false negatives and the fewer false positives.

When data drawn from these sorts of moveable tests is given as proof of a function, false positives appear that in reality do not prove anything concrete. Just to clarify, this was part of my job for many years when project managing software development. These are the issues.

Problems with testing Amazon data from users’ searches and lists

  • Real Time: Because Amazon is a collaborative filtering item to item algorithm, any data is volatile because it relies on customer and item grouping that changes in real time. Amazon builds tables of data that put similar customers and similar items in the same group to identify possible recommendations in advance of the search. Therefore testing over time will prove nothing due to promotions and additions all the time that immediately change the base data.
  • Every user’s Experience is unique: This means any data gathered from testing varies wildly from person to person and item search to item search and becomes absolutely unique to each and every customer in real time. If any test were made and then repeated, data would vary wildly due to hourly updates of ranking, IP, and other factors. For instance, just to work out what Amazon will show in recommendations requires the entire data set, which nobody but Amazon has at their disposal (see diagram). Nobody on the planet has this data to work out anything this way, in a way not even Amazon, because it can only be calculated in real time by their own algorithm. See below for their own ML tool that gives you a chance to find stuff out behind the scenes.
cf

Slide shows calculation made to show recommendations on Amazon in the algorithm, by Roger Chen, CIO at Eternal Sparkle Infocom Limited (HK)

  • Sparsity of data: In reality a customer will buy very few books in relation to Amazon’s catalogue. Bear in mind that 1% of 2 million books is 20,000 books, and the average American reads 2 books a year! Therefore, data cannot be reliable because of its sparsity in relation to the test being carried out – basing data on just a few purchases does not give a reliable predictive set. The algorithm relies on memory-based historical purchases data and grouping customers by demographic to attempt to serve the best matches, but it’s obviously crazy to think any focus group testing would come up with any reliable data if even Amazon cannot.
  • Scalability: As users purchase and join Amazon, the data changes. Data only stays fresh for one hour. After that, any data-based testing is wrong and old, and will change by the time a new test is conducted.

Averages Over Time Do Not Mean Clean Data

If anyone reckons they have tested the algorithm and beat it with focus group tests, they are misguided. Testers on author forums are not making progress because the data is wildly volatile, and the data is huge, inhumanly so. Given the predictive groups formed from less than the bare minimum of data needed to make a prediction is used in tests by authors, it’s impossible to draw an average or trend. Which is why we have to use what Amazon does give us to make sure your book exposure is the best it can be. In their study for the UK Government on quantifying data in instable and real-time algorithms Neil Johnson and Guannan Zhao note, “Computers can trade freely in real-time – but humans cannot.”

In that vein, I thought I would share one of their equations used to figure out averages in online trading algorithms, where just like Amazon, charts and lists are changing in real-time. I hope this demonstrates fully that any claims to have cracked the lists without considering variance and black/grey swan events (see Data Mining and Predictive Analysis by Colleen McCue for more) among all the other factors and their patterns are nothing but Horton Hears A Who results. Unless, of course, you are an expert in predictive data in real-time algorithm calculus, like those who work at Amazon. To think the “University of Life” has prepared indie authors to predict Amazon is laughable!

calculus used for real time algorithms

Example of calculus used for real time algorithms (like Amazon’s) to figure out averages over time (UK Government)

So What Can We Test on Amazon As Authors?

The only way that authors can test data is by looking at real-time sales, keywords, ranking, and marketing factors such as quality, reviews, and content to compete in categories when they are ready to publish, and ready to consider a change in listing of any kind to help sales.

Tools that can be helpful include (non-affiliate links):

  1. Kindle Spy – to figure out competitor keywords in real-time in a category
  2. Kindle’s own browse keywords guide – to make sure all your keywords and categories are the best they can be (bear in mind there are a ton of other non-BISAC categories available by asking for them)
  3. Rank Calculator – to gauge how many books are selling in a category
  4. Keyword tools such as keywordtool.io can give you ideas on what your audience may look for using generic keywords to help with placing your book on Amazon in the correct category for your content
  5. Amazon’s own Machine Learning tool (advanced) so that you can run your own parameters straight from Amazon’s data, “ML algorithms discover patterns in data and construct predictive models using these patterns. Then, you can use the models to make predictions on future data.”

MYTH #3 – Amazon’s Lists have bugs that don’t make sense

TRUTH – No they don’t, they are set like that for a reason

Often the product you just bought will be shown, just in case you want another one. This isn’t a “bug.”

Amazon rightly thinks that if you liked the product enough to search for the same keywords again, maybe you want to buy it again or for a friend. This is way more relevant for everyday items like face creams or panties for example, but they still do it.

Agile development expert Abraham Marín Pérez defines the terminology:

  • The definition of a defect is that the software doesn’t behave as per the specification
  • A change request is that it does what it’s mean to do as per the specification last implemented, but what the customer is asking for is new functionality.

Therefore this would not be a bug or defect. It’s a change request, or a feature requirement (depending on what model of project management Amazon uses) requested by a stakeholder when software or code is adapted at a stage in the future, called an iteration. Because Amazon has never done this change it must be more fruitful for them to leave this list variable alone and that it meets specifications requested. I already have the GOT boxset in the photo above, for example.

MYTH #4 – Amazon pulls books from BookBub campaigns out of ranking

TRUTH – Amazon pulls books that don’t meet their quality guidelines, especially hardcore sex or erotic books masquerading as Romance.

If a romance book is not showing up, it’s likely that the book has been flagged for use of genital or sexual words that are inappropriate to its category. Many authors have tried tricking category ranks by putting their book in a very narrow category that has zero to do with their book. If this happens, Amazon might just drop that book. However, Bookbub book promos are NOT pulled from Amazon ranking. I wrote and asked if they had any clue as to why this myth is out there. Here’s the reply:

bookbub

BookBub markets books to reader lists, the same way we do at SPR. Both companies send book ads to members of the public, who signed up for the newsletters and we do not have contact with the reviewers in any other way, so Amazon does not have an issue with it. Amazon loves books being advertised: it means they sell more books.

Bookbub do monitor books for quality and gatekeep to an extent, within their policy.

However, Amazon has manual procedures to drop books that do not meet guidelines and policy:

Your books and other content (such as book titles, cover art and product descriptions) must adhere to these content guidelines. We reserve the right to make judgments about whether content is appropriate and to choose not to offer it. We may also terminate your participation in the KDP program if you don’t adhere to these content guidelines.

Pornography
We don’t accept pornography or offensive depictions of graphic sexual acts.

Offensive Content
What we deem offensive is probably about what you would expect.

Poor Customer Experience
We don’t accept books that provide a poor customer experience. We reserve the right to determine whether content provides a poor customer experience. See the Guide to Kindle Content Quality for examples of content that’s typically disappointing to customers.

This means all my factors for success listed in my last post are absolutely essential to ranking and listing in Amazon SERPs.  It matters to ranking, folks.

MYTH #5 – We will never know what Amazon does with its algorithm entirely

TRUTH – We will never know what Amazon does with its algorithm entirely

Authors often whine about Amazon’s dishonesty. In ALL of Amazon’s disclaimers, you are reminded that you are a seller using their absolutely free to set up platform to make money and that they are a private shop online with every right to demand you follow the rules they themselves made and asked you to follow when you signed up with them. Like a bratty kid with a wonderful bedroom and a brand new Apple iMac, authors make all sorts of claims about how Amazon are covering up information about how they sell. Well, of course they are! They are a private entity!

But having looked at all of Amazon’s documentation, everything you need to know is at your fingertips. Book success is possible with some effort and energy. Unfortunately it still stands you need a bloody good product and book page to make it. Maybe that’s the pill to swallow for most authors, and judging by some of the loudest offenders on forums, that lesson doesn’t seem to have hit home yet.

MYTH #6 – Reviews Are Not Counted Towards Ranking

TRUTH – Yes they are.

Here’s a screengrab from the A9 internal documentation, shared by CPC Strategy, a professional Amazon marketing company.

A9 Reviews

Phoenix Sullivan. That’s for you.

It is a FACT that you can lose ranking without customer reviews. Firstly, because of social proof as discussed in my last article – more reviews starred higher means more click-throughs to your Book Page, means more conversions, i.e. sales. Secondly, because Amazon will rank you lower than a competitor in a photo finish based on CTRs (click through rates) and review data.

Dave Chesson says, “If the Amazon search engine continues to place a particular product high in the SERPs based on just sales numbers, but customers aren’t leaving reviews or are leaving negative comments, Amazon will respond by lowering that product’s rankings and bring something else up further. While sales are important, Amazon cares about the customer experience as well.”

Not only that, since June 2015 Amazon’s overall star rating uses recency as a factor, so if you don’t spruce up reviews and sales, your book could start dropping in star value. Who wants 2 measly reviews next to their book listing? Not a lot of authors, is the answer. You can look at my article on how to get reviews safely and within Amazon policy here.

Yes, we are in the business of reviews at SPR, because provably time and again reviews, both customer and editorial, are incredibly important to market ANY product. I don’t think anyone with half a brain can argue with that if we’re going to sensibly look at the fundamentals of online product marketing.

In Conclusion

I am very disappointed that indie authors continue to turn to each other for advice instead of professionals, and aggressively naysay us when book professionals and technical advisors give free information that may help in an open and clarifying way.

I hope this post has taken the knowledge level down a notch into simpler terms for understanding, and that seeing Amazon lists and ranking as SERPs will increase author awareness and flag disinformation in the future, however entrenched in myth it may be.

As always, I have linked to verifiable results, sources, and facts of all working professional and scholarly experts cited, and thanks to all for your insights and materials used here.

Part I of this article can be read here.

 

  • Pingback: Mythbusting The Amazon Algorithm – Reviews and Ranking For Authors | Self-Publishing Review()

  • http://amazonproductoptimization.blogspot.com/ Amzranked

    Thanks for your GREAT post.
    i am very much interest about “Amazon Algorithm” . i am searching and trying to learn this types of E commerce sites algorithm.

    I am professional Amazon SEO expert. Work for keyword rank up by keyword search. i am waiting for more post about Amazon Algorithm.

    Keep it up.

    • http://www.selfpublishingreview.com Cate Baum

      I will! It’s time to help authors. I am happy to spread knowledge to search professionals to bolster our industry and create great products for authors so they have successful books instead of ones that don’t sell.

  • Pingback: Writer Wednesday | creative barbwire (or the many lives of a creator)()

  • Charlie Dean

    Fantastic again. Sorry you had to defend your knowledge to morons.

    • http://www.selfpublishingreview.com Cate Baum

      Glad to be of service, Charlie.

  • http://Kindlepreneur.com kindlepreneur

    It still blows my mind when I’m reading a really good article by someone who I don’t know personally and then all-of-a-sudden, I see my name…haha. Virtual High Five Cate. Then later on I saw my KDP Calculator listed – double virtual High Five!

    In all seriousness, amazing post. I’ve been working with the Amazon API for my next software and just getting into the how the data is presented has been a real eye-opener. Like here’s a fun fact: Amazon doesn’t look at the title and subtitle as two separate entities…the way it pulls data it makes them inseparable. Based off of that, it leads me to believe that having your keyword in the subtitle is just as good as having it in the Title – but of course i can’t be sure.

    Also, LOVED how you brought up the “node” aspect of the search engine. I’ve been fighting with people over that for a while. Many state that the words in your description aren’t tracked by A9. They’ll “disprove” it by copying a sentence and pasting it into the search bar. However, the algo doesn’t work in sentences. It works in batch parameters and this is highlighted by looking at the top of the SERP where, in the case of the person’s sentence search, Amazon suggests particular words from the sentence and crosses off the “of” “is” “the” etc…

    Anyways, amazing article, great research and approach and kudos for the guts of writing this.

    • http://www.selfpublishingreview.com Cate Baum

      Thanks, Dave! I’m probably branded for life by the aforesaid “morons” cited above as a “revolutionary” at this point, but never mind. I know I can be trusted! I feel you are one of the professionals in a sea of amateurs. There’s not many around in this industry, so we have to support each other! Maybe one day we can do a super-podcast or something. As Chomsky once pointed out in the New York Review of Books, intellectuals are responsible for the searching of truth and the exposure of lies. I’m fine with that. Hacks are ripping authors off to make money. It’s time to get busy – and noisy about what’s right and what’s not. Books are too important.

  • C.Steven Manley

    This is a great post full of useful info, but I have one question concerning the search autocomplete for keywords. I use this method with a separate browser perpetually set to private mode without being logged in to Amazon. My thinking was that this method would eliminate my personal history influence from the results. I’m not a tech or numbers guy, so I’m wondering if I was way off track. Thanks again for a great post.

    • http://www.selfpublishingreview.com Cate Baum

      I don’t think it would make a difference because you are forgetting the fact that Amazon is still taking into account the real time activity by other customers and those around you in your area. You would still have an IP address etc. It’s not just about your activity, and as soon as you type anything in that box, Amazon starts calculating what your needs are.

  • http://www.bronwenevans.com/ Bronwen Evans

    I’m assuming they can weight certain factors in the algorithm, is that true? For instance, they can give a higher ranking to books in their KU programme for best seller lists. Is that true? It’s certainly looking that way.

    • http://www.selfpublishingreview.com Cate Baum

      If you mean books that sell more and are more recently read, yes, books can get weighed higher.

  • sharon Brownlie

    A fantastic article. Usually when i read about “Amazon Algorithms” my brain dies a death and my eyes gloss over. However, I stayed for the whole article. Clear and concise!

  • Pingback: Wanna know a secret (about Amazon’s algorithm)? – LOVE INDIE ROMANCE()