Image by silviarita from Pixabay

Supervised and Unsupervised Learning For Kids

Machine Learning In Layman Terms

In this blog, I am trying to answer an interview question that I was asked during a data science interview.

An Interview question

Here is a very famous machine learning interview question:

What is the difference between supervised learning and unsupervised learning?

You can give an answer like this:

In supervised machine learning algorithms, we use labeled data, for example, classification of X-ray images as pneumonia, or not pneumonia.

In unsupervised machine learning, we don’t need to have labeled data, for example, clustering of customers to segments.

A Different Interview Question

What if the interviewer changes the question a little bit? The new question is not very different than the previous one. But the answer is quite different.

What is the difference between supervised learning and unsupervised learning? Explain to me as though I were a five-year-old.

You may need to change this answer. Because a five-year-old kid doesn’t know what labeled data is.
Let's try to answer this question using layman terms.

Separating Fruits

Imagine that Jack and John are given fruits to separate.

Jack was given bananas, strawberries, and apples. He knows what each fruit is. He classifies them using his previous knowledge and he labels each fruit as apple, banana, and strawberry. He is less likely to make an error.

Image by Beverly Buckley from Pixabay

John was given pomegranates, lychees, rambutans, and mangosteens. He didn’t see these fruits before. He separates them into groups, he makes “fruit segmentations” out of the given fruits. Because he doesn’t know the names of the fruits, he can not label them. He just looks at their shapes, sizes, and colors, maybe textures and smells. He makes groups(clusters) out of given fruits. John is more likely to make a mistake because he doesn’t have information about the classes.

Image by LoggaWiggler, Nicole Köhler, Gun Wallborg, Kerstin Riemer from Pixabay

Actually, in the data science world, Jack is doing supervised machine learning, namely classification, and John is doing unsupervised machine learning, namely segmentation.

Conclusion

You can practice answering similar questions in Layman's terms.

Explain [a data science topic] to me as though I were a five-year-old.

You are going to be asked this kind of questions in the data science interviews.

Kemal Yilmaz, thank you for asking and answering this question.

Thanks for reading.

If you want to get into contact, you can email me at seymatas@gmail.com, or you can find me at https://www.linkedin.com/in/seyma-tas/

Data Scientist, Data Educator, Blogger https://www.linkedin.com/in/seyma-tas/

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store