Why machine learning does not work

Published on — Written by Wonderflow


In this video, our CEO Riccardo Osti explained what machine learning is and what its most common applications are. Today he explains why this acclaimed technology, which, don’t get us wrong, has huge potential for several applications, does not really work well for text analysis.

When we mention machine learning, we are also referring to AI, deep learning and, more generally, statistical models. Many of today’s hi-tech leaders use these types of statistical technologies for the most diverse applications. Examples include Tesla, IBM Watson, Google, and Amazon.

So why are we saying that machine learning doesn’t work if so many of these brands are using it so heavily? Well, it does not work well for text analysis, and we’ll tell you why.

Algorithms

Machine learning algorithms aim to identify statistical patterns and, based on them, continuously improve their decision-making capabilities. Because machine learning is based on statistics, it needs datasets big enough to be statistically relevant in order to work properly.

In fact, the biggest variable that determines the performance of machine learning algorithms is often the amount and quality of data that is available for training purposes.

It goes without saying that the larger the dataset the better it is for these statistical models…and here comes the challenge for text analysis applications.

If you look at the domains where statistical models and machine learning are most successful, you will realize that they have an abundance of data that can be used to easily identify specific trends. Good examples are self-driving cars, financial predictions, and weather forecasting.
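To make the link between dataset size and statistical reliability more concrete, here is a minimal sketch using the standard margin-of-error formula for an observed proportion. The 20% complaint rate and the sample sizes are hypothetical values chosen for illustration; they do not come from Wonderflow's data or methods.

```python
import math


def margin_of_error(p: float, n: int, z: float = 1.96) -> float:
    """Approximate 95% margin of error for a proportion p observed in n samples.

    Uses the normal approximation z * sqrt(p * (1 - p) / n).
    """
    return z * math.sqrt(p * (1 - p) / n)


# Hypothetical scenario: 20% of the reviews in a sample mention a given problem.
observed_rate = 0.20

for sample_size in (50, 5_000, 500_000):
    moe = margin_of_error(observed_rate, sample_size)
    print(f"{sample_size:>7} reviews: 20% +/- {moe * 100:.2f} percentage points")
```

The margin shrinks only with the square root of the sample size, so four times as much data is needed to halve the uncertainty. That is one way to see why a handful of reviews gives a much weaker signal than a large dataset.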

Difficulties when analyzing text

When it comes to text, we rarely find cases where we can count on a decent amount of data. Especially when we want to extract detailed information from unstructured text, it becomes difficult to have sufficient data to train statistical algorithms. For example, if you would like to know why people complain about a specific problem they have with one of your products, most likely you would only have some reviews available on the internet…and that, clearly, won’t be sufficient.

Let me give you a better idea of why the lack of training datasets makes machine learning a poor fit for text analysis. If you put together all the books ever published by mankind, their total size would be around 54 terabytes. I don’t want to say that you could keep all these texts on one hard drive, but that’s almost the case. On the other hand, if we put together all the information recorded by the sensors of the self-driving cars currently running around the world, it would amount to around 400 terabytes each second.
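The gap between these two figures can be worked out with simple arithmetic. The sketch below takes the two numbers quoted above at face value (they are not independently verified here); the function names are illustrative only.

```python
# Figures as quoted in the article, taken at face value.
BOOKS_TOTAL_TB = 54.0           # all books ever published, in terabytes
CAR_SENSORS_TB_PER_SEC = 400.0  # self-driving car sensor data, terabytes per second

SECONDS_PER_DAY = 24 * 60 * 60


def seconds_to_match_books(books_tb: float, rate_tb_per_sec: float) -> float:
    """Seconds of sensor recording needed to equal the size of the book corpus."""
    return books_tb / rate_tb_per_sec


def book_corpora_per_day(books_tb: float, rate_tb_per_sec: float) -> float:
    """How many times the book corpus is matched by one day of sensor data."""
    return rate_tb_per_sec * SECONDS_PER_DAY / books_tb


seconds = seconds_to_match_books(BOOKS_TOTAL_TB, CAR_SENSORS_TB_PER_SEC)
corpora = book_corpora_per_day(BOOKS_TOTAL_TB, CAR_SENSORS_TB_PER_SEC)

print(f"Sensors match all books in {seconds:.3f} seconds")
print(f"One day of sensor data equals {corpora:,.0f} book corpora")
```

Under these figures, the sensors produce the equivalent of every book ever published in well under a second, and hundreds of thousands of times that amount every day.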

What does this mean? It means that, hard as it is to believe, it’s easier to drive a driverless car than to do text analysis.

If you want to learn more about machine learning, read this other article about the definition of machine learning.

About Wonderflow

Wonderflow empowers businesses with quick and impactful decision-making because it helps automate and deliver in-depth consumer and competitor insights. All within one place, results are simplified for professionals across any high-UGC organization, and department to access, understand, and share easily. Compared to hiring more analysts, Wonderflow’s AI eliminates the need for human-led setup and analysis, resulting in thousands of structured and unstructured reviews analyzed within a matter of weeks and with up to 50% or more accurate data. The system sources relevant private and public consumer feedback from over 200 channels, including emails, forums, call center logs, chat rooms, social media, and e-commerce. What’s most unique is that its AI is the first ever to help recommend personalized business actions and predict the impact of those actions on key outcomes. Wonderflow is leveraged by high-grade customers like Philips, DHL, Beko, Lavazza, Colgate-Palmolive, GSK, Delonghi, and more.
