10 Important Algorithms in Machine Learning

The word “Big Data” became popular in 2017 and has become the most popular in the high-tech industry. Machine learning lets computers analyze past data and predict future data. It is now popular in many fields. In fact, even engineers who do not specialize in machine learning can now use it. This article introduces some of the most commonly used machine learning algorithms.

1. Random forest

Random forest is a machine learning technique that you can use for classification and regression. It is a technique that makes many decision trees and merges them together. Although random forest technique operates on a large amount of data, they have a high accuracy for prediction/classification.

Random Forest Diagram — Diagram of Random Forest Algorithm

Let’s look at an example: There are training data: [X1, X2, X3, … X10]. As shown above, a random forest can use bagging (short for bootstrap aggregating) to divide the data set into three subsets and randomly select data from the subsets to create three decision trees I can do it. The final output determines the majority (for classification) or the mean (for regression).

2. Decision tree

Decision trees are a method of dividing and classifying groups by conditional branching. It divides the data into groups as similar as possible. I think that it is easier to understand if you look at the image below.

Decision tree for whether you should go outside — Should you go and play outside?

By repeating the conditional branch in this way, we expand the data more and more like a tree and divide it into the smallest solvable units.
They are one of the simplest machine learning algorithms.

3. Logistic regression

Logistic regression is a type of statistical regression model of variables that follow the Bernoulli distribution. If the probability P is between zero and one, (0 <P <1), it can not be satisfied by a normal linear model. If the domain is not within a certain level, the range will exceed the specified interval.

logistic curve — Optimize your laziness with AI

The following is a form of logistic regression model and linear model. We generally use logistic regression in the following situations. Credit scoring Marketing campaign success rate measurement Forecasting specific products Forecasting whether earthquakes will occur on a specific day

4. Naive Bayesian classifier

A naive Bayesian classifier is a probability-based algorithm that uses Bayesian theorem, assuming strong (naive) independence between features.

Bayesian Theorem Neon Lights — The equation for the Bayesian Theorem

This image represents Bayesian theorem, where P (A | B) is the posterior probability, P (B | A) is the likelihood, P (A) is the prior probability of the classification class, and P (B) is the predictor variable. It is a prior probability. We use Naive Bayesian mainly for text classification, etc. Also, other common uses are detecting spam emails, emotion checks on text, and tagging of articles posted on the Web.

5. Support Vector Machine (SVM)

Support Vector Machine is a pattern recognition model using supervised learning. With this method, we can construct two classes of pattern classifiers using linear input elements.

The problems that properly implemented SVMs solve include display advertisements, human splice site recognition, image-based gender detection, and large-scale image classification.

6. k-nearest neighbor

The k-nearest neighbor method is a classification method on the basis of the nearest training example in feature space. One of the common uses of it is in pattern recognition. The k-nearest neighbor method is a simple algorithm among machine learning algorithms. The reason is that the classification of an instance depends on the majority of objects close to it.

k nearest neighbor diagram — A representation of K-nearest neighbor in action

For example, in the case of the figure above, the flow of class determination is as follows.

Plot the known data (training data) as red triangles and blue squares.
Determine the number of K. K = 1 and so on.
If we obtain a green circle as unknown data, we will acquire one from the near point.
Estimate the class to which the one class belongs by the majority. This time, we suppose that the unknown green circle belongs to Class 1.

*Please note that the result changes depending on the number of K. When K = 3, we judge the green circle as Class 2.

7. k-means

The k-means method is a clustering algorithm. Clustering groups data into similar classifications. k-means is one of the simplest methods of clustering. Here we will explain some of the principles of the k-means method.

Choose k samples to be the “nucleus” of the cluster.
Measure the distance between all samples and k “nucleus”.
Divide each sample into the same cluster as the nearest “nucleus”. (We divide all samples into k types at this point)
Find the centroids of k clusters, and use them as new kernels. (Here, the position of the center of gravity is moving.)
If the position of the center of gravity changes, the process returns to step 2. (Repeat until the center of gravity does not change)
The center of gravity does not change and the process ends.

8. AdaBoost

AdaBoost is a machine learning model that attempts to create strong classifiers by combining weak classifiers that are slightly more accurate than random ones.
The flow of making is to first apply weak classifiers, increase the weight of those that have been misclassified, and then prioritize and classify those that have the weight. Then repeat it.

Ada Boost example — AdaBoost algorithm example

It is easy to understand if you refer to the figure above.
In the above figure, we first use a weak classifier at D1 to classify and increase the ‘+’ 1 and ‘-‘2 weights that are misclassified at D2.
Next, the three misclassified are considered with priority and classified. Here, at the same time as the weights are increased, the weights of others that are correctly classified are decreasing.
In addition, D3 increases the ‘-‘3 weight misclassified in D2 while decreasing the weight at the same time as others.

In this manner, we can make a strong classifier based on the weights of the repeated classifications.

9. Neural networks

A neural network is a combination of mathematically modeled neurons in the human nervous system.

This is a simplified model of neuron behavior. Artificial neural networks differ from biological brains in that data transmission methods are pre-defined in terms of layers, connections, and directions, and can not be transmitted differently.

Neural Network Diagram — Representation of a Neural Network

A neural network consists of a series of layers of neurons where all the neurons in one layer connect to the neurons in the next layer. Thus, the figure above is a two-layer neural network with one hidden layer. In detail, it consists of three input neurons, two neurons in the hidden layer, and two output neurons. The calculation is done in the following order: Starting from the left input layer and passing values to the hidden layer from there, the hidden layer sends the values to the output layer for final output.

10. Markov chain

A Markov chain is a set of random variables X1, X2, X3, … where the past and future states are independent, given the current state.

As a specific example of Markov chain, consider the following model (probability is quite appropriate but it helps to understand Markov chain).

Markov Chain Diagram for weather — A simple Markov chain for two types of weather

Such a diagram is called a state transition diagram. It shows the probability of different events happening after each other.

Summary

Engineers with machine learning skills are in high demands from companies. So naturally, they can have a big advantage, if they can learn these skills. A complete mathematical understanding of machine learning algorithms requires a lot of study. I hope this article helped you as a reference to plan ahead, or even as a reference.

Artificial Intelligence