Offcanvas

The Intelligence of Machine Learning - Part 2

The Intelligence of Machine Learning - Part 2
intcore
By:

A couple of months ago, we started a topic about Machine Learning, we specifically talked about Linear Regression Algorithms and how linear regression can predict the price of the house based on a previous housing data.

You can find more about this in here “The Intelligence of Machine Learning - Part 1”.

Moving to the next thing we’re going to talk about, Detecting Spam emails, something that will tell if an email is spam or not and how we do this. In order to do so, we need previous data (training set). Given for example a hundred email that we looked at already, out of these hundred emails we’ve flagged some of them as spam, and the rest are not.  

Thinking about some features that spam emails may be likely to display and analyze these features. One of those features maybe containing the word “Cheap”. 

Let's assume that we have 25 spam emails and 75 non-spam emails. By analyzing those emails and looking for the word “Cheap” in them, we found that 20 spam emails contains that word, and 5 non-spam emails contains this word also. so, if an email contains the word “Cheap”, the probability of it being a spam will be (number of spam emails contains the word “Cheap” divided by the total number of spam emails). 

Applying this on our situation, we will find that the probability of an email containing the word “Cheap” being a spam email is 80%. So, we can associate this feature (contains “Cheap” word) with a probability of 80% and we can use it to flag future messages  spam or not.

Based on the previous example, We can also look at other features to specify a spam email such as Missing titles, spelling mistakes, etc.

This is known as “Naive Bayes Algorithm” or “Naive Bayes Classifier”.

Recommending Applications:-

Gender

Age

Application

Male

15

Pokemon Go

Female

25

HaYaChat

Male

32

HaYaChat

Female

12

Pokemon Go

Male

17

Pokemon Go

Given the above table, It seems that age is more decisive for predicting what will users download, for simple demonstration, we can use this algorithm, 

If user < 20, then recommend Pokemon Go.

If user > 20, then recommend HaYaChat.

We end up here with what is called “Decision Tree” and the decision are given by the question we asked, and these decision trees are built with data, and now when we have a new user, we can put them to the decision tree and recommend them whatever app the tree suggests is to recommend.

To wrap things up...

We this time we have talked about Naive Bayes Algorithm and decision tree algorithm with a couple of examples. next time, We’ll talk about Logistic regression and its usages. Till then, see you.

People liked
Technical Trends in 2019 that can't be missed!
Technical Trends in...

Some trends have made us feel that the world is really getting smarter everyday; that’s why today we decided to talk about some of the technical trends that took us thinking “Wow, this is huge!”.

How to Build a Relationship with Your Followers
How to Build a Relat...

When it comes to taking your business to another level in this day and age, your followers are pretty much invaluable, to say the least. Your followers are so important for your company that they might as well be referred to as the lifeblood of your business. So it’s crucial to build a steady relationship with them to retain them and make them loyal to you. Because loyal customers, one way or the other, bring more business to your firm.

How to Build an Online Business Identity
How to Build an Onli...

With so many businesses getting started online these days, it is only natural for entrepreneurs to wonder how they can start it as well. So if that’s something you need guidance on, then follow the steps below to build an online business identity.

What does your client expect from you?
What does your clien...

Happy clients are the lifeblood of a business. Not only are they profitable themselves, but they help in bringing more business to the company. Their referrals and positive feedback is pretty much invaluable and it’s perhaps the best way to get more leads

Offcanvas Title
Your content here.