• ML is the core of much futuristic technological advancement in our world, today we can see various examples of machine learning around us such as Tesla’s self-driving car, Sophia AI robot, etc.
  • As the world is seeing digital expansion and enhancement day by day ML industry is seeing a sudden boom. Global business firms are widely adopting ML and data science for automation, increasing their profits and better management, etc.
  • In Machine Learning we feed data and output to the computer and based on that we get the required program.
  • Unsupervised Learning
  • Reinforcement Learning

Supervised Learning

  • In supervised learning, we train our model on a labelled dataset which means we have both raw input data as well as its results.
  • We split our data into a training dataset and test dataset where the training dataset is used to train our network whereas the test dataset acts as new data for predicting results or to see the accuracy of our model.
  • The algorithm learns the input patterns that generate the expected output and now once the algorithm is trained it can be used to predict the correct output of an unseen input.

We can understand this from an example. Suppose we are feeding raw inputs as an image of tomato to the algorithm. We have a supervisor who keeps on correcting the machine or who keeps on training the machine that yes it is a tomato or no it is not a tomato, things like that. So this process keeps on repeating until we get a final trained model, once the model is ready it can easily predict the correct output of a never-seen input.

  • This model performs fast because the training time taken is less as we already have desired results in our dataset.
  • This model predicts accurate results on unseen data or new data without even knowing a prior target.
  • For each input instance an expected value associates, the value can be discreetly presenting a category or it can be real or continuous value.

Steps involved in supervised ML modeling

Supervised ML modeling consists of the following steps:

  • Pre-processing of data: This involves Cleaning the data(remove duplicates, deal with missing values, normalization, data type conversions, etc.) if required and preparing it for training.
  • Splitting the data into train and validation sets: This involves the splitting of data into training and validation sets.
  • Training the model: The goal of training is to answer a question or make a prediction correctly as often as possible.
  • Evaluating the model: Uses some metric or combination of metrics to “measure” objective performance of the model and test the model against previously unseen data which is meant to be somewhat representative of model performance in the real world, but still helps tune the model (as opposed to testing data, which does not).
  • Improve the model: The sixth step is to improve the model by again going to the training state if we are not getting the required accuracy.
  • Deploy the model: The final step is to deploy the model and monitor real-time.
  • Weather forecasting
  • Financial Portfolio prediction
  • Image classification
  • Spam detection
  • Insurance decisioning

Unsupervised Learning

  • In unsupervised learning, the information used to train is neither classified nor labeled in the dataset.
  • Unsupervised learning studies how systems can infer a function to describe a hidden structure from unlabelled data.
  • The main task of unsupervised learning is to find patterns in the data.
  • Once a model learns to develop patterns, it can easily predict patterns for any new dataset in the form of clusters.
  • The system doesn’t figure out the right output, but it explores the data and can draw inferences from datasets to describe hidden structures from unlabeled data.

As we have already discussed that in unsupervised learning our dataset is not labeled, So if we are feeding apple, avocado, and orange as raw input data then our model will distinguish all three but it cannot tell whether a given cluster is of apple or not as it is unlabelled but any new data will automatically fit into the clusters that are formed.

“Clustering” is the process of grouping similar entities together. The goal of this unsupervised machine learning technique is to find similarities in the data point and group similar data points together.

  • Customer segmentation
  • Insurance fraud detection
  • Delivery store optimization

Reinforcement learning

The third type of machine learning technique is reinforcement learning.

  • Here the algorithms learn to react to an environment on their own.
  • It is rapidly growing and moreover producing a variety of learning algorithms.
  • These algorithms are useful in the field of Robotics, Gaming, etc.
  • The agent travels from one state to another.
  • To reach the end state, there might be a different path.
  • The agent gets the reward(appreciation) for success but will not receive any reward or appreciation for failure.
  • If the dog’s response is close to the desired behavior, the trainer will give some reward like food to it.
  • Now whenever the dog is exposed to the same situation, it executes a similar action even more enthusiastically in expectation of getting more reward(food).
  • In this way, the dog learns “what to do” from positive experiences.
  • At the same time, the dog also learns “what not to do” when faced with negative experiences.

In this case, the dog is an agent that is exposed to the environment. An example of a state could be our dog sitting, and we use a specific word for the dog to walk. Our agent reacts by performing an action transition from one “state” to another “state.” For example, Our dog goes from sitting to walking. After the transition, it may get a reward or penalty in return.

Applications of Reinforcement learning

  • Resource Management
  • Robotics
  • Games
  • Self-driving cars


Machine learning uses algorithms to parse data, learn from that data, and make informative decisions based on what it has learned.

  • In Unsupervised Learning, we find an association between input values and group them.
  • In Reinforcement Learning an agent learn through delayed feedback by interacting with the environment.



Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Bhanu Shahi

Bhanu Shahi

Data Analyst at Decimal Tech | Machine Learning | NLP | Time Series | Python, Tableau & SQL Expert | Storyteller | Blogger