News Updates Monday 25th Nov 2024 :
  • Welcome to INPRESSCO, world's leading publishers, We have served more than 10000+ authors
  • Articles are invited in engineering, science, technology, management, industrial engg, biotechnology etc.
  • Paper submission is open. Submit online or at editor.ijcet@inpressco.com
  • Our journals are indexed in NAAS, University of Regensburg Germany, Google Scholar, Cross Ref etc.
  • DOI is given to all articles

Data Balancing Technique for Multi-Class Imbalanced Problems


Author : Deore Mrunalee C and J.R. Mankar

Pages : 459-463
Download PDF
Abstract

The imbalanced dataset contains skewed distribution of data.  Such data distribution generates difficulties for machine learning algorithms.  These algorithms also fail to generate accurate results in case of data imbalance, overlapping of class boundaries and hybrid datasets. Various techniques proposed in a literature to balance a dataset using oversampling or under sampling methods.  The study of these techniques is done independently. A little work has been done with the combined study of these two techniques. The proposed system focuses on the study and implementation of oversampling and under-sampling together to balance a dataset. The technique is generalized for hybrid datasets. Cluster based under sampling approach is used followed by the Mahalanobis Distancebased Over-sampling technique. The data will be tested on multiple hybrid datasets and classification accuracy using C4.5 algorithm will be evaluated. The accuracy results will be compared with the individual oversampling and under sampling approach.

Keywords: Oversampling, under sampling, hybrid dataset, Mahalanobis distance, cluster based under sampling, Imbalance data, Classification

Call for Papers
  1. IJCET- Current Issue
  2. Issues are published in Feb, April, June, Aug, Oct and Dec
  3. DOI is given to all articles
  • Inpressco Google Scholar
  • Inpressco Science Central
  • Inpressco Global impact factor
  • Inpressco aap

International Press corporation is licensed under a Creative Commons Attribution-Non Commercial NoDerivs 3.0 Unported License
©2010-2023 INPRESSCO® All Rights Reserved