Improving Efficiency of GEO-Distributed Data Sets using Pact

News Updates Tuesday 29th Apr 2025 :

Welcome to INPRESSCO, world's leading publishers, We have served more than 10000+ authors
Articles are invited in engineering, science, technology, management, industrial engg, biotechnology etc.
Paper submission is open. Submit online or at editor.ijcet@inpressco.com
Our journals are indexed in NAAS, University of Regensburg Germany, Google Scholar, Cross Ref etc.
DOI is given to all articles

Improving Efficiency of GEO-Distributed Data Sets using Pact

Author : Kirtimalini N. Kakade and T.A.Chavan

Pages : 1284-1287
Download PDF
Abstract

In an Internet era, a report says every day 2.5 quintillion bytes of data is created. This data is obtained from many sources such as sensors to gather climate information, trajectory information, transaction records, web site usage data etc. This data is known as Big data. Hadoop is only scalable that is it can reliably store and process petabytes. Hadoop plays an important role in processing and handling big data It includes MapReduce – offline computing engine, HDFS – Hadoop Distributed file system, HBase – online data access.Map Reduce functions as dividing input files into chunks and processing these in a series of parallelizable steps., mapping and reducing constitute the essential phases for a Map Reduce job. As this freamework provides solution for large data nodes by providing distributed environment. Moving all input data to a single datacenter before processing the data is expensive. Hence we concentrate on geographical distribution of geo-distributed data for sequential execution of map reduce jobs to optimize the execution time. But it is observed from various results that mapping and reducing function is not sufficient for all type of data processing. The fixed execution strategy of map reduce program is not optimal for many task as it does not know about the behavior of the functions. Thus, to overcome these issues, we are enhancing our proposed work with parallelization contracts. These contracts help to capture a reasonable amount of semantics for executing any type of task with reduced time consumption. The parallelization contracts include input and output contract which includes the constraints and functions of data execution The main aim of this paper is to discuss various known Map reduce technology techniques available for geodistributed data sets by using different techniques. Further, the paper also discloses the implementation of these techniques, their advantages, disadvantages, and the results measured. Future trends including use of query optimizing techniques to improve the results of the query as well as reduce the cost for the computation. To achieve this we use the indexing mechanism to the cache system to preserve the query search results.

Keywords: Geodistributed , MaReduce, PACT, big data

Article published in International Journal of Current Engineering and Technology, Vol.4,No.3 (June- 2014)

Call for Papers

IJCET- Current Issue
Issues are published in Feb, April, June, Aug, Oct and Dec
DOI is given to all articles

Facts and figures

IJCET is NAAS Indexed

INPRESSCO is member of Cross Reference DOI:10.14741

Conferences Proceedings

MECHPGCON, MIT College of Engineering, Pune, India

AMET, MIT College of Engineering, Pune, India

International Conference on Advances in Mechanical Sciences

International Symposium on Engineering and Technology

International Conference on Women in Science and Engineering

Recently Published..

Review on Heart Disease Prediction using Machine Learning

Optimizing Heart Disease Prediction Accuracy using Machine Learning Models

Modeling and Parametric Analysis of Erosion Rate in TiAl Material under Water Droplet Erosion (WDE)

Enhancing Financial Security Based on Machine Learning Techniques for Anomaly Detection in Fraud Transactions

Machine Learning for Predicting Natural Disasters: Techniques and Applications in Disaster Risk Management

For Authors
Author Guidelines

Submit Article

Contact Us

Authors Contribution

Indexing

Call For Papers

IJCET-Current Issue

IJTT- Current Issue

IJAIE- Current Issue

IJCSB- Current Issue

For Authors
Author Guidelines

Submit Article

Contact Us

Authors Contribution

Indexing

Our Journals
IJCET

IJTT

IJAIE

IJCSB

IJAB

About Inpressco
INPRESSCO is an international publisher. IJCET h-index: 27, i10 index: 197, Total Citation: 7000, NAAS Indexed We are publishing since 2011 and till now more than 3500+ articles have been published.

International Press corporation is licensed under a Creative Commons Attribution-Non Commercial NoDerivs 3.0 Unported License
©2010-2023 INPRESSCO^® All Rights Reserved

Back To Top