Skip to content

Latest commit

 

History

History

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 

ReadMe.md

Classification Models

1. What is Classification Models ?

  • Classification is a technique where we categorize data into a given number of classes. The main goal of a classification problem is to identify the category/class to which a new data will fall under.
  • Classification can be performed on structured or unstructured data.
  • In statistics and machine learning, classification is the problem of identifying to which of a set of categories (sub-populations) a new observation belongs, on the basis of a training set of data containing observations (or instances) whose category membership is well-known.
  • Classification is a simple example of pattern recognition.
  • In machine learning terminology, classification is considered an instance of supervised learning
    • Ex: learning where a training set of correctly identified observations is available
  • The other hand corresponding unsupervised procedure is known as clustering, and involves grouping data into categories based on some measure of inherent similarity or distance.
  • In classification algorithm, individual observations are analyzed into a set of quantifiable properties, known variously as explanatory variables or features.
  • These properties may variously be as follows:
    • Categorical: [A, B, O, AB] for blood types.
    • real-value: continues measurement of preasure in Steam engine.
    • integer-value: No.of occurance of a word in given paragraph.
    • ordinal: small, medium and large.
  • Classes are sometimes refered as targets or labels or categories.
  • A smiple example of classification problem spam detection in email, which is a binary classification since there are only 2 classes as spam and not spam.

2. Why Classification Models ?

  • As we know from the above definition of classification models, The main goal of a classification problem is to identify the category/class to which a new data will fall under.
  • Classification Models are an essential part of Machine Learning and data mining applications.
  • Approximately 70% of real world problems in Data Science are classification problems.

3. Types of Classification Models