SlideShare a Scribd company logo
1 of 6
Download to read offline
INTERNATIONAL JOURNAL OF COMPUTER ENGINEERING & 
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), 
ISSN 0976 - 6375(Online), Volume 5, Issue 7, July (2014), pp. 11-16 © IAEME 
TECHNOLOGY (IJCET) 
ISSN 0976 – 6367(Print) 
ISSN 0976 – 6375(Online) 
Volume 5, Issue 7, July (2014), pp. 11-16 
© IAEME: www.iaeme.com/IJCET.asp 
Journal Impact Factor (2014): 8.5328 (Calculated by GISI) 
www.jifactor.com 
11 
 
IJCET 
© I A E M E 
NEPHELE: EFFICIENT DATA PROCESSING USING HADOOP 
Gandhali Upadhye Astt. Prof. Trupti Dange 
 
P.G. Student, Dept. of Computer Engg., Dept. of Computer Engineering, 
RMD Sinhgad School of Engg.,Warje. RMD Sinhgad School of Engg.,Warje. 
University of Pune, University of Pune, 
PUNE, India. PUNE, India. 
ABSTRACT 
Today, Infrastructure-as-a-Service (IaaS) cloud providers have incorporated parallel data 
processing framework in their clouds for performing Many-task computing (MTC) applications. 
Parallel data processing framework reduces time and cost in processing the substantial amount of 
users’ data. Nephele is a dynamic resource allocating parallel data processing framework, which is 
designed for dynamic and heterogeneous cluster environments. The existing framework does not 
support to monitor resource overload or underutilization, during job execution, efficiently. 
Consequently, the allocated compute resources may be inadequate for big parts of the submitted job 
and unnecessarily increase processing time and cost. Nephele’s architecture offers for efficient 
parallel data processing in clouds. It is the first data processing framework for the dynamic resource 
allocation offered by today’s IaaS clouds for both, task scheduling and execution. Particular tasks of 
a processing job can be assigned to different types of virtual machines which are automatically 
instantiated and terminated during the job execution. 
Keywords: Cloud Computing, Resource allocation, IaaS, Nephele, Job Scheduling. 
I. INTRODUCTION 
Cloud Computing is a concept that involves a number of technologies, including server 
virtualization, multitenant application hosting, and a variety of Internet and systems management 
services. The computing power resides on the Internet and people access this computing power as 
they need it via the Internet. Growing organizations have been processing immense amount of data in 
a cost effective manner using cloud computing mechanism. More of the cloud providers like Google, 
Yahoo, Microsoft and Amazon are available for processing these data. Instead of creating large data
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), 
ISSN 0976 - 6375(Online), Volume 5, Issue 7, July (2014), pp. 11-16 © IAEME 
centers which are expensive, these providers move into architectural paradigm with commodity 
servers, to process these huge data [3]. Cloud computing is the delivery of computing and storage 
capacity as a service to a community of end-recipients. The name comes from the use of a cloud-shaped 
12 
 
symbol as an abstraction for the complex infrastructure it contains in system diagrams. Cloud 
computing entrusts services with a user's data, software and computation over a network. It is 
difficult for him to maintain all the software’s and data in local servers which costs a lot. His 
business is constantly running out of storage space and fixing broken servers. Then he came to know 
about cloud computing, which gives a solution to his problem. Instead of maintaining all the data and 
software’s at local machines, he can now depend on another company which provides all the features 
that he requires and are accessible through the Web. “This is called Cloud Computing”. 
1.1 Infrastructure as a service (IaaS) 
Infrastructure as a service (IaaS) controls user and manage the systems in terms of the 
bandwidth, response time, resource expenses, and network connectivity, but they need not control 
the cloud infrastructure. Some of the key features of IaaS such as cloud bursting, resource gathering 
etc. differ according to the cloud environment. The greatest value of IaaS is mainly through a key 
element known as cloud bursting. The process of off-loading tasks to the cloud during times when 
the most compute resources are needed. However, for business IaaS takes an advantage in its 
capacity. IT companies [1], [2] able to develop its own software and implements that can able to 
handles the ability to re-schedule resources in an IaaS cloud. Elasticity is the first critical facet of 
IaaS. IaaS is easy to spot, because it is typically platform-independent. IaaS consists of a 
combination of internal and external resources. IaaS is low-level resource that runs independent of an 
operating system called a hypervisor and is responsible for taking rent of hardware resources based 
on pay as you go basics. This process is referred to as resource gathering [2]. Resource gathering by 
the hypervisor makes virtualization possible, and virtualization makes multiprocessing computing 
that leads to an infrastructure shared by several users with similar resources in regard to their 
requirements. 
Fig. 1: Cloud Environment Infrastructure Architecture 
II. PROBLEM STATEMENT 
The IaaS cloud providers integrate the processing frame-work to reduce the processing time 
and provide simplicity to the users. Reducing process time leads to reduction in the cost for attracting 
the users to use their cloud services. To improve and optimize the scenario of parallel data
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), 
ISSN 0976 - 6375(Online), Volume 5, Issue 7, July (2014), pp. 11-16 © IAEME 
processing. In order to avail utmost client satisfaction, the host server needs to be upgraded with the 
latest technology to fulfill all requirements. This will also make sure that the user gets its entire 
requirement fulfilled in optimal time. This will improve the overall resource utilization and, 
consequently, reduce the processing time. 
13 
III. RELATED WORK 
 
In this section we mainly discuss with some existing frameworks that were already 
implemented for efficient parallel data processing. 
SCOPE ((Structured Computations Optimized for Parallel Execution) 
For Companies providing cloud-scale services of Easy and Efficient Parallel Processing of 
Massive Data Sets have an increasing need to store and analyze massive data sets such as search logs 
and click streams. For cost and performance reasons, processing is typically done on large clusters of 
with a huge amount needed to deploy large servers (I.e. shared-nothing commodity machines). It is 
imperative to develop a programming model that hides the complexity of the underlying system but 
provides flexibility by allowing users to extend functionality to meet a variety of requirements. 
SWIFT 
Swift, a system that combines a novel scripting language called Swift Script with a powerful 
runtime system based on Falkon, and Globus to allow for the concise specification, and reliable and 
efficient execution, of large loosely coupled computations. Swift adopts and adapts ideas first 
explored in the virtual data system, improving on that system in many regards. But this was not at all 
useful as this script was not in usage for the current developers, so that’s the reason this framework is 
not popular in usage. 
FALKON 
Fast and Light-Weight Task Execution Framework. To enable the rapid execution of many 
tasks on compute clusters, we have developed Falkon. Falkon integrates multi-level scheduling to 
separate resource acquisition (via, e.g., requests to batch schedulers) from task dispatch, and a 
streamlined dispatcher. Falkon’s integration of multi-level scheduling and streamlined dispatchers 
delivers performance not provided by any other system. We describe Falkon architecture and 
implementation, and present performance results for both micro benchmarks and applications. 
HADOOP 
Apache Hadoop software library is a framework that allows for the distributed processing of 
large data sets across clusters of computers using a simple programming model. It is designed to 
scale up from single servers to thousands of machines, each offering local computation and storage 
in static mode. Rather than rely on hardware to deliver high-availability, the library itself is designed 
to detect and handle failures at the application layer, so delivering a highly-available service on top 
of a cluster of computers, each of which may be prone to failures. This was not suitable in parallel 
processing as this was not suitable for dynamic nature; hence we came with a proposed new 
framework called Nephele Framework. 
IV. NEPHELE FRAMEWORK 
Nephele is a massively parallel data flow engine dealing with resource management, work 
scheduling, communication, and fault tolerance. Nephele can run on top of a cluster and govern the 
resources itself, or directly connect to an IaaS cloud service to allocate computing resources on
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), 
ISSN 0976 - 6375(Online), Volume 5, Issue 7, July (2014), pp. 11-16 © IAEME 
demand. Before submitting a Nephele compute job, a user must start an instance inside the cloud 
which runs the so called Job Manager. The Job Manager receives the client’s jobs, is responsible for 
scheduling them and coordinates their execution. It can allocate or deallocate virtual machines 
according to the current job execution phase. The actual execution of tasks is carried out by a set of 
instances. Each instance runs a local component of the Nephele framework 
14 
 
(Task Manager). A Task Manager receives one or more tasks from the Job Manager at a time, 
executes them and informs the Job Manager about their completion or possible errors. Unless a job is 
submitted to the Job Manager, we expect the set of instances (and hence the set of Task Managers) to 
be empty. 
Fig. 2: Structural overview of Nephele running in Infrastructure-as-a-Service (IaaS) in cloud 
V. MODULE DESCRIPTION 
Client Module 
The client which sends the request to the job manager for the execution of task the job 
manager will schedule the process and coordinates the task and wait for the completion response. 
The client is the one who initiates the request to the job manager. 
Job Manager Module 
The job manager will wait for the task from client, coordinates the process and it checks the 
availability of the server, if the server is available for the task to be done, it allocates the resource for 
execution and wait for the completion response. 
Cloud Controller Module 
This acts as interface between the job manager and task manager and provides the control and 
initiation of task managers. It is also responsible for coordinating and manages the execution and 
also dispatches the task. It checks for the availability of task managers and allocate the resource for 
the task to be executed. 
Task Manager Module 
The task manager will wait for task to be executed; it then executes it and send the complete 
response to the job manager in turn to the client. The actual execution of the task it done in the task 
manager.
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), 
ISSN 0976 - 6375(Online), Volume 5, Issue 7, July (2014), pp. 11-16 © IAEME 
15 
VI. RESULTS 
 
This section will present the results of job scheduling for resource allocation and Nephele’s 
architecture and process the task and allocate the resource for the executed task. The results are 
compared with Hadoop. 
Fig.3: Comparison graph for CPU utilization 
Fig.4: Comparison graph for job completion 
Fig.5: Comparison graph for processing time
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), 
ISSN 0976 - 6375(Online), Volume 5, Issue 7, July (2014), pp. 11-16 © IAEME 
16 
VII. CONCLUSION AND FUTURE WORK 
 
In this project, Nephele, the first data processing framework to exploit the dynamic resource 
provisioning offered by today’s IaaS clouds. Nephele’s basic architecture and presented a 
performance comparison to the established data processing framework Hadoop. The performance 
evaluation gives a first impression on how the ability to assign specific virtual machine types to 
specific tasks of a processing job, as well as the possibility to automatically allocate/deallocate 
virtual machines in the course of a job execution, can help to improve the overall resource utilization 
and, consequently, reduce the processing cost. In particular, improving Nephele’s ability to adapt to 
resource overload or underutilization during the job execution automatically. IT technicians are 
leading the challenge, while academia is bit slower to react. Several groups have recently been 
formed, such as the Cloud Security Alliance or the Open Cloud Consortium, with the goal of 
exploring the possibilities offered by cloud computing and to establish a common language among 
different providers. Our current profiling approach builds a valuable basis for this; however, at the 
moment the system still requires a reasonable amount of user annotations. In general, an important 
contribution to the growing field of Cloud computing services and points out exciting new 
opportunities in the field of parallel data processing is represented. 
VIII. REFERENCES 
1. W. Zhao, K.Ramamritham, and J.A.Stankovic, “Preemptive scheduling under time and resource 
constraints”, In: IEEE Transactions on Computers C-36 (8) (1987)949–960. 
2. Alex King Yeung Cheung and Hans-Arno Jacobsen, “Green Resource Allocation Algorithms 
for Publish/Subscribe Systems”, In: the 31th IEEE International Conference on Distributed 
Computing Systems (ICDCS), 2011. 
3. R. Chaiken, B. Jenkins, P.-A. Larson, B. Ramsey, D. Shakib, S. Weaver, and J. Zhou, “SCOPE: 
Easy and Efficient Parallel Processing of Massive DataSets,” Proc. Very Large Database. 
4. The Apache Software Foundation “Welcome to Hadoop!” http://hadoop.apache.org/,2012. 
5. D. Warneke and O. Kao, “Nephele: Efficient Parallel Data Processing in the Cloud,” Proc. 
Second Workshop Many-Task Computing on Grids and Supercomputers (MTAGS ’09), 
pp. 1-10, 2009. 
6. I. Raicu, Y. Zhao, C. Dumitrescu, I. Foster, and M. Wilde. Falkon: a Fast and Light-weight 
tasKexecutiON framework. In SC ’07: Proceedings of the 2007 ACM/IEEE conference on 
Supercomputing, pages 1–12, New York, NY, USA, 2007. ACM. 
7. J. Dean and S. Ghemawat. MapReduce: Simplified Data Processing on Large Clusters. In 
OSDI’04: Proceedings of the 6th conference on Symposium on Operating Systems Design  
Implementation, pages 10–10, Berkeley, CA, USA, 2004. USENIX Association. 
8. D. Warneke and O. Kao, “Exploiting dynamic resource allocation for efficient parallel data 
processing in the cloud,” IEEE transactions on parallel and distributed systems, vol. 22, no. 6, 
June 2011. 
9. T. White, Hadoop: The Definitive Guide. O’Reilly Media, 2009 
10. Mr.Abhijit Adhikari, Ms.GandhaliUpadhye, “Efficient Dynamic Resource Allocation in Parallel 
Data Processing for Cloud using PACT”, IJARCSSE Vol.3, 11, Nov.2013. 
11. Ms.GandhaliUpadhye, Prof.Trupti Dange, “Cloud Resource Allocation as Non-Preemptive 
Approach, International Conference on Current Trends in Engineering and Technology 
(ICCTET) 2014. 
12. Priya Deshpande, Brijesh Khundhawala and Prasanna Joeg, “Dynamic Data Replication and Job 
Scheduling Based on Popularity and Category”, International Journal of Computer Engineering 
 Technology (IJCET), Volume 4, Issue 5, 2013, pp. 109 - 114, ISSN Print: 0976 – 6367, 
ISSN Online: 0976 – 6375.

More Related Content

What's hot

Application Architecture for Cloud Computing
Application Architecture for Cloud Computing Application Architecture for Cloud Computing
Application Architecture for Cloud Computing
white paper
 
Hadoop-as-a-Service for Lifecycle Management Simplicity
Hadoop-as-a-Service for Lifecycle Management SimplicityHadoop-as-a-Service for Lifecycle Management Simplicity
Hadoop-as-a-Service for Lifecycle Management Simplicity
DataWorks Summit
 

What's hot (19)

Announcing Symantec & Microsoft’s Azure Cloud Disaster Recovery as a Service ...
Announcing Symantec & Microsoft’s Azure Cloud Disaster Recovery as a Service ...Announcing Symantec & Microsoft’s Azure Cloud Disaster Recovery as a Service ...
Announcing Symantec & Microsoft’s Azure Cloud Disaster Recovery as a Service ...
 
Introduction to Cloud Computing
Introduction to Cloud ComputingIntroduction to Cloud Computing
Introduction to Cloud Computing
 
Cloud Computing 101 Workshop Sample
Cloud Computing 101 Workshop SampleCloud Computing 101 Workshop Sample
Cloud Computing 101 Workshop Sample
 
2017 Cloud Computing Primer
2017 Cloud Computing Primer2017 Cloud Computing Primer
2017 Cloud Computing Primer
 
Microsoft Cloud Computing E-Book
Microsoft Cloud Computing E-BookMicrosoft Cloud Computing E-Book
Microsoft Cloud Computing E-Book
 
What the heck is cloud?
What the heck is cloud?What the heck is cloud?
What the heck is cloud?
 
Cloud Computing Nedc Wp 28 May
Cloud Computing Nedc Wp 28 MayCloud Computing Nedc Wp 28 May
Cloud Computing Nedc Wp 28 May
 
Cloud computing
Cloud computingCloud computing
Cloud computing
 
Hybrid cloud computing explained
Hybrid cloud computing explainedHybrid cloud computing explained
Hybrid cloud computing explained
 
Application Architecture for Cloud Computing
Application Architecture for Cloud Computing Application Architecture for Cloud Computing
Application Architecture for Cloud Computing
 
Microservices Architecture Enables DevOps: Migration to a Cloud-Native Archit...
Microservices Architecture Enables DevOps: Migration to a Cloud-Native Archit...Microservices Architecture Enables DevOps: Migration to a Cloud-Native Archit...
Microservices Architecture Enables DevOps: Migration to a Cloud-Native Archit...
 
Cloud computing
Cloud computingCloud computing
Cloud computing
 
Cloud Computing And Citrix C3
Cloud Computing And Citrix C3Cloud Computing And Citrix C3
Cloud Computing And Citrix C3
 
Hadoop-as-a-Service for Lifecycle Management Simplicity
Hadoop-as-a-Service for Lifecycle Management SimplicityHadoop-as-a-Service for Lifecycle Management Simplicity
Hadoop-as-a-Service for Lifecycle Management Simplicity
 
Cloud Computing System models for Distributed and cloud computing & Performan...
Cloud Computing System models for Distributed and cloud computing & Performan...Cloud Computing System models for Distributed and cloud computing & Performan...
Cloud Computing System models for Distributed and cloud computing & Performan...
 
Cloud 101: The Basics of Cloud Computing
Cloud 101: The Basics of Cloud ComputingCloud 101: The Basics of Cloud Computing
Cloud 101: The Basics of Cloud Computing
 
IRJET- Implementation of Cloud Energy Saving System using Virtual Machine...
IRJET-  	  Implementation of Cloud Energy Saving System using Virtual Machine...IRJET-  	  Implementation of Cloud Energy Saving System using Virtual Machine...
IRJET- Implementation of Cloud Energy Saving System using Virtual Machine...
 
Extending Grids with Cloud Resource Management for Scientific Computing
Extending Grids with Cloud Resource Management for Scientific ComputingExtending Grids with Cloud Resource Management for Scientific Computing
Extending Grids with Cloud Resource Management for Scientific Computing
 
Dynamic Resource Allocation Using Virtual Machines for Cloud Computing Enviro...
Dynamic Resource Allocation Using Virtual Machines for Cloud Computing Enviro...Dynamic Resource Allocation Using Virtual Machines for Cloud Computing Enviro...
Dynamic Resource Allocation Using Virtual Machines for Cloud Computing Enviro...
 

Similar to 50120140507002

Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Science
researchinventy
 
Resource provisioning for video on demand in saas
Resource provisioning for video on demand in saasResource provisioning for video on demand in saas
Resource provisioning for video on demand in saas
IAEME Publication
 
International Conference on Advances in Computing, Communicati.docx
International Conference on Advances in Computing, Communicati.docxInternational Conference on Advances in Computing, Communicati.docx
International Conference on Advances in Computing, Communicati.docx
vrickens
 
Performance Enhancement of Cloud Computing using Clustering
Performance Enhancement of Cloud Computing using ClusteringPerformance Enhancement of Cloud Computing using Clustering
Performance Enhancement of Cloud Computing using Clustering
Editor IJMTER
 

Similar to 50120140507002 (20)

E5 05 ijcite august 2014
E5 05 ijcite august 2014E5 05 ijcite august 2014
E5 05 ijcite august 2014
 
PREDICTIVE RESOURCE MANAGEMENT BY REDUCING COLD START IN SERVERLESS CLOUD
PREDICTIVE RESOURCE MANAGEMENT BY REDUCING COLD START IN SERVERLESS CLOUDPREDICTIVE RESOURCE MANAGEMENT BY REDUCING COLD START IN SERVERLESS CLOUD
PREDICTIVE RESOURCE MANAGEMENT BY REDUCING COLD START IN SERVERLESS CLOUD
 
Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Science
 
Resource provisioning for video on demand in saas
Resource provisioning for video on demand in saasResource provisioning for video on demand in saas
Resource provisioning for video on demand in saas
 
International Conference on Advances in Computing, Communicati.docx
International Conference on Advances in Computing, Communicati.docxInternational Conference on Advances in Computing, Communicati.docx
International Conference on Advances in Computing, Communicati.docx
 
D017212027
D017212027D017212027
D017212027
 
A Novel Approach for Workload Optimization and Improving Security in Cloud Co...
A Novel Approach for Workload Optimization and Improving Security in Cloud Co...A Novel Approach for Workload Optimization and Improving Security in Cloud Co...
A Novel Approach for Workload Optimization and Improving Security in Cloud Co...
 
Improved Utilization of Infrastructure of Clouds by using Upgraded Functional...
Improved Utilization of Infrastructure of Clouds by using Upgraded Functional...Improved Utilization of Infrastructure of Clouds by using Upgraded Functional...
Improved Utilization of Infrastructure of Clouds by using Upgraded Functional...
 
Implementation of the Open Source Virtualization Technologies in Cloud Computing
Implementation of the Open Source Virtualization Technologies in Cloud ComputingImplementation of the Open Source Virtualization Technologies in Cloud Computing
Implementation of the Open Source Virtualization Technologies in Cloud Computing
 
Implementation of the Open Source Virtualization Technologies in Cloud Computing
Implementation of the Open Source Virtualization Technologies in Cloud ComputingImplementation of the Open Source Virtualization Technologies in Cloud Computing
Implementation of the Open Source Virtualization Technologies in Cloud Computing
 
Cloud computing
Cloud computingCloud computing
Cloud computing
 
Ijirsm choudhari-priyanka-backup-and-restore-in-smartphone-using-mobile-cloud...
Ijirsm choudhari-priyanka-backup-and-restore-in-smartphone-using-mobile-cloud...Ijirsm choudhari-priyanka-backup-and-restore-in-smartphone-using-mobile-cloud...
Ijirsm choudhari-priyanka-backup-and-restore-in-smartphone-using-mobile-cloud...
 
Enterprise Cloud Analytics
Enterprise Cloud AnalyticsEnterprise Cloud Analytics
Enterprise Cloud Analytics
 
C017341216
C017341216C017341216
C017341216
 
A PROPOSED MODEL FOR IMPROVING PERFORMANCE AND REDUCING COSTS OF IT THROUGH C...
A PROPOSED MODEL FOR IMPROVING PERFORMANCE AND REDUCING COSTS OF IT THROUGH C...A PROPOSED MODEL FOR IMPROVING PERFORMANCE AND REDUCING COSTS OF IT THROUGH C...
A PROPOSED MODEL FOR IMPROVING PERFORMANCE AND REDUCING COSTS OF IT THROUGH C...
 
A Proposed Model for Improving Performance and Reducing Costs of IT Through C...
A Proposed Model for Improving Performance and Reducing Costs of IT Through C...A Proposed Model for Improving Performance and Reducing Costs of IT Through C...
A Proposed Model for Improving Performance and Reducing Costs of IT Through C...
 
Performance Enhancement of Cloud Computing using Clustering
Performance Enhancement of Cloud Computing using ClusteringPerformance Enhancement of Cloud Computing using Clustering
Performance Enhancement of Cloud Computing using Clustering
 
Review and Classification of Cloud Computing Research
Review and Classification of Cloud Computing ResearchReview and Classification of Cloud Computing Research
Review and Classification of Cloud Computing Research
 
JIT Borawan Cloud computing part 2
JIT Borawan Cloud computing part 2JIT Borawan Cloud computing part 2
JIT Borawan Cloud computing part 2
 
Introduction to Cloud Computing
Introduction to Cloud ComputingIntroduction to Cloud Computing
Introduction to Cloud Computing
 

More from IAEME Publication

A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURS
A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURSA STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURS
A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURS
IAEME Publication
 
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURS
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURSBROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURS
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURS
IAEME Publication
 
GANDHI ON NON-VIOLENT POLICE
GANDHI ON NON-VIOLENT POLICEGANDHI ON NON-VIOLENT POLICE
GANDHI ON NON-VIOLENT POLICE
IAEME Publication
 
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...
IAEME Publication
 
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...
IAEME Publication
 
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...
IAEME Publication
 
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...
IAEME Publication
 
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...
IAEME Publication
 
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...
IAEME Publication
 
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...
IAEME Publication
 

More from IAEME Publication (20)

IAEME_Publication_Call_for_Paper_September_2022.pdf
IAEME_Publication_Call_for_Paper_September_2022.pdfIAEME_Publication_Call_for_Paper_September_2022.pdf
IAEME_Publication_Call_for_Paper_September_2022.pdf
 
MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...
MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...
MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...
 
A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURS
A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURSA STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURS
A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURS
 
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURS
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURSBROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURS
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURS
 
DETERMINANTS AFFECTING THE USER'S INTENTION TO USE MOBILE BANKING APPLICATIONS
DETERMINANTS AFFECTING THE USER'S INTENTION TO USE MOBILE BANKING APPLICATIONSDETERMINANTS AFFECTING THE USER'S INTENTION TO USE MOBILE BANKING APPLICATIONS
DETERMINANTS AFFECTING THE USER'S INTENTION TO USE MOBILE BANKING APPLICATIONS
 
ANALYSE THE USER PREDILECTION ON GPAY AND PHONEPE FOR DIGITAL TRANSACTIONS
ANALYSE THE USER PREDILECTION ON GPAY AND PHONEPE FOR DIGITAL TRANSACTIONSANALYSE THE USER PREDILECTION ON GPAY AND PHONEPE FOR DIGITAL TRANSACTIONS
ANALYSE THE USER PREDILECTION ON GPAY AND PHONEPE FOR DIGITAL TRANSACTIONS
 
VOICE BASED ATM FOR VISUALLY IMPAIRED USING ARDUINO
VOICE BASED ATM FOR VISUALLY IMPAIRED USING ARDUINOVOICE BASED ATM FOR VISUALLY IMPAIRED USING ARDUINO
VOICE BASED ATM FOR VISUALLY IMPAIRED USING ARDUINO
 
IMPACT OF EMOTIONAL INTELLIGENCE ON HUMAN RESOURCE MANAGEMENT PRACTICES AMONG...
IMPACT OF EMOTIONAL INTELLIGENCE ON HUMAN RESOURCE MANAGEMENT PRACTICES AMONG...IMPACT OF EMOTIONAL INTELLIGENCE ON HUMAN RESOURCE MANAGEMENT PRACTICES AMONG...
IMPACT OF EMOTIONAL INTELLIGENCE ON HUMAN RESOURCE MANAGEMENT PRACTICES AMONG...
 
VISUALISING AGING PARENTS & THEIR CLOSE CARERS LIFE JOURNEY IN AGING ECONOMY
VISUALISING AGING PARENTS & THEIR CLOSE CARERS LIFE JOURNEY IN AGING ECONOMYVISUALISING AGING PARENTS & THEIR CLOSE CARERS LIFE JOURNEY IN AGING ECONOMY
VISUALISING AGING PARENTS & THEIR CLOSE CARERS LIFE JOURNEY IN AGING ECONOMY
 
A STUDY ON THE IMPACT OF ORGANIZATIONAL CULTURE ON THE EFFECTIVENESS OF PERFO...
A STUDY ON THE IMPACT OF ORGANIZATIONAL CULTURE ON THE EFFECTIVENESS OF PERFO...A STUDY ON THE IMPACT OF ORGANIZATIONAL CULTURE ON THE EFFECTIVENESS OF PERFO...
A STUDY ON THE IMPACT OF ORGANIZATIONAL CULTURE ON THE EFFECTIVENESS OF PERFO...
 
GANDHI ON NON-VIOLENT POLICE
GANDHI ON NON-VIOLENT POLICEGANDHI ON NON-VIOLENT POLICE
GANDHI ON NON-VIOLENT POLICE
 
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...
 
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...
 
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...
 
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...
 
EXPERIMENTAL STUDY OF MECHANICAL AND TRIBOLOGICAL RELATION OF NYLON/BaSO4 POL...
EXPERIMENTAL STUDY OF MECHANICAL AND TRIBOLOGICAL RELATION OF NYLON/BaSO4 POL...EXPERIMENTAL STUDY OF MECHANICAL AND TRIBOLOGICAL RELATION OF NYLON/BaSO4 POL...
EXPERIMENTAL STUDY OF MECHANICAL AND TRIBOLOGICAL RELATION OF NYLON/BaSO4 POL...
 
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...
 
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...
 
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...
 
A MULTIPLE – CHANNEL QUEUING MODELS ON FUZZY ENVIRONMENT
A MULTIPLE – CHANNEL QUEUING MODELS ON FUZZY ENVIRONMENTA MULTIPLE – CHANNEL QUEUING MODELS ON FUZZY ENVIRONMENT
A MULTIPLE – CHANNEL QUEUING MODELS ON FUZZY ENVIRONMENT
 

Recently uploaded

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Recently uploaded (20)

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 

50120140507002

  • 1. INTERNATIONAL JOURNAL OF COMPUTER ENGINEERING & International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), ISSN 0976 - 6375(Online), Volume 5, Issue 7, July (2014), pp. 11-16 © IAEME TECHNOLOGY (IJCET) ISSN 0976 – 6367(Print) ISSN 0976 – 6375(Online) Volume 5, Issue 7, July (2014), pp. 11-16 © IAEME: www.iaeme.com/IJCET.asp Journal Impact Factor (2014): 8.5328 (Calculated by GISI) www.jifactor.com 11 IJCET © I A E M E NEPHELE: EFFICIENT DATA PROCESSING USING HADOOP Gandhali Upadhye Astt. Prof. Trupti Dange P.G. Student, Dept. of Computer Engg., Dept. of Computer Engineering, RMD Sinhgad School of Engg.,Warje. RMD Sinhgad School of Engg.,Warje. University of Pune, University of Pune, PUNE, India. PUNE, India. ABSTRACT Today, Infrastructure-as-a-Service (IaaS) cloud providers have incorporated parallel data processing framework in their clouds for performing Many-task computing (MTC) applications. Parallel data processing framework reduces time and cost in processing the substantial amount of users’ data. Nephele is a dynamic resource allocating parallel data processing framework, which is designed for dynamic and heterogeneous cluster environments. The existing framework does not support to monitor resource overload or underutilization, during job execution, efficiently. Consequently, the allocated compute resources may be inadequate for big parts of the submitted job and unnecessarily increase processing time and cost. Nephele’s architecture offers for efficient parallel data processing in clouds. It is the first data processing framework for the dynamic resource allocation offered by today’s IaaS clouds for both, task scheduling and execution. Particular tasks of a processing job can be assigned to different types of virtual machines which are automatically instantiated and terminated during the job execution. Keywords: Cloud Computing, Resource allocation, IaaS, Nephele, Job Scheduling. I. INTRODUCTION Cloud Computing is a concept that involves a number of technologies, including server virtualization, multitenant application hosting, and a variety of Internet and systems management services. The computing power resides on the Internet and people access this computing power as they need it via the Internet. Growing organizations have been processing immense amount of data in a cost effective manner using cloud computing mechanism. More of the cloud providers like Google, Yahoo, Microsoft and Amazon are available for processing these data. Instead of creating large data
  • 2. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), ISSN 0976 - 6375(Online), Volume 5, Issue 7, July (2014), pp. 11-16 © IAEME centers which are expensive, these providers move into architectural paradigm with commodity servers, to process these huge data [3]. Cloud computing is the delivery of computing and storage capacity as a service to a community of end-recipients. The name comes from the use of a cloud-shaped 12 symbol as an abstraction for the complex infrastructure it contains in system diagrams. Cloud computing entrusts services with a user's data, software and computation over a network. It is difficult for him to maintain all the software’s and data in local servers which costs a lot. His business is constantly running out of storage space and fixing broken servers. Then he came to know about cloud computing, which gives a solution to his problem. Instead of maintaining all the data and software’s at local machines, he can now depend on another company which provides all the features that he requires and are accessible through the Web. “This is called Cloud Computing”. 1.1 Infrastructure as a service (IaaS) Infrastructure as a service (IaaS) controls user and manage the systems in terms of the bandwidth, response time, resource expenses, and network connectivity, but they need not control the cloud infrastructure. Some of the key features of IaaS such as cloud bursting, resource gathering etc. differ according to the cloud environment. The greatest value of IaaS is mainly through a key element known as cloud bursting. The process of off-loading tasks to the cloud during times when the most compute resources are needed. However, for business IaaS takes an advantage in its capacity. IT companies [1], [2] able to develop its own software and implements that can able to handles the ability to re-schedule resources in an IaaS cloud. Elasticity is the first critical facet of IaaS. IaaS is easy to spot, because it is typically platform-independent. IaaS consists of a combination of internal and external resources. IaaS is low-level resource that runs independent of an operating system called a hypervisor and is responsible for taking rent of hardware resources based on pay as you go basics. This process is referred to as resource gathering [2]. Resource gathering by the hypervisor makes virtualization possible, and virtualization makes multiprocessing computing that leads to an infrastructure shared by several users with similar resources in regard to their requirements. Fig. 1: Cloud Environment Infrastructure Architecture II. PROBLEM STATEMENT The IaaS cloud providers integrate the processing frame-work to reduce the processing time and provide simplicity to the users. Reducing process time leads to reduction in the cost for attracting the users to use their cloud services. To improve and optimize the scenario of parallel data
  • 3. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), ISSN 0976 - 6375(Online), Volume 5, Issue 7, July (2014), pp. 11-16 © IAEME processing. In order to avail utmost client satisfaction, the host server needs to be upgraded with the latest technology to fulfill all requirements. This will also make sure that the user gets its entire requirement fulfilled in optimal time. This will improve the overall resource utilization and, consequently, reduce the processing time. 13 III. RELATED WORK In this section we mainly discuss with some existing frameworks that were already implemented for efficient parallel data processing. SCOPE ((Structured Computations Optimized for Parallel Execution) For Companies providing cloud-scale services of Easy and Efficient Parallel Processing of Massive Data Sets have an increasing need to store and analyze massive data sets such as search logs and click streams. For cost and performance reasons, processing is typically done on large clusters of with a huge amount needed to deploy large servers (I.e. shared-nothing commodity machines). It is imperative to develop a programming model that hides the complexity of the underlying system but provides flexibility by allowing users to extend functionality to meet a variety of requirements. SWIFT Swift, a system that combines a novel scripting language called Swift Script with a powerful runtime system based on Falkon, and Globus to allow for the concise specification, and reliable and efficient execution, of large loosely coupled computations. Swift adopts and adapts ideas first explored in the virtual data system, improving on that system in many regards. But this was not at all useful as this script was not in usage for the current developers, so that’s the reason this framework is not popular in usage. FALKON Fast and Light-Weight Task Execution Framework. To enable the rapid execution of many tasks on compute clusters, we have developed Falkon. Falkon integrates multi-level scheduling to separate resource acquisition (via, e.g., requests to batch schedulers) from task dispatch, and a streamlined dispatcher. Falkon’s integration of multi-level scheduling and streamlined dispatchers delivers performance not provided by any other system. We describe Falkon architecture and implementation, and present performance results for both micro benchmarks and applications. HADOOP Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using a simple programming model. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage in static mode. Rather than rely on hardware to deliver high-availability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-available service on top of a cluster of computers, each of which may be prone to failures. This was not suitable in parallel processing as this was not suitable for dynamic nature; hence we came with a proposed new framework called Nephele Framework. IV. NEPHELE FRAMEWORK Nephele is a massively parallel data flow engine dealing with resource management, work scheduling, communication, and fault tolerance. Nephele can run on top of a cluster and govern the resources itself, or directly connect to an IaaS cloud service to allocate computing resources on
  • 4. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), ISSN 0976 - 6375(Online), Volume 5, Issue 7, July (2014), pp. 11-16 © IAEME demand. Before submitting a Nephele compute job, a user must start an instance inside the cloud which runs the so called Job Manager. The Job Manager receives the client’s jobs, is responsible for scheduling them and coordinates their execution. It can allocate or deallocate virtual machines according to the current job execution phase. The actual execution of tasks is carried out by a set of instances. Each instance runs a local component of the Nephele framework 14 (Task Manager). A Task Manager receives one or more tasks from the Job Manager at a time, executes them and informs the Job Manager about their completion or possible errors. Unless a job is submitted to the Job Manager, we expect the set of instances (and hence the set of Task Managers) to be empty. Fig. 2: Structural overview of Nephele running in Infrastructure-as-a-Service (IaaS) in cloud V. MODULE DESCRIPTION Client Module The client which sends the request to the job manager for the execution of task the job manager will schedule the process and coordinates the task and wait for the completion response. The client is the one who initiates the request to the job manager. Job Manager Module The job manager will wait for the task from client, coordinates the process and it checks the availability of the server, if the server is available for the task to be done, it allocates the resource for execution and wait for the completion response. Cloud Controller Module This acts as interface between the job manager and task manager and provides the control and initiation of task managers. It is also responsible for coordinating and manages the execution and also dispatches the task. It checks for the availability of task managers and allocate the resource for the task to be executed. Task Manager Module The task manager will wait for task to be executed; it then executes it and send the complete response to the job manager in turn to the client. The actual execution of the task it done in the task manager.
  • 5. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), ISSN 0976 - 6375(Online), Volume 5, Issue 7, July (2014), pp. 11-16 © IAEME 15 VI. RESULTS This section will present the results of job scheduling for resource allocation and Nephele’s architecture and process the task and allocate the resource for the executed task. The results are compared with Hadoop. Fig.3: Comparison graph for CPU utilization Fig.4: Comparison graph for job completion Fig.5: Comparison graph for processing time
  • 6. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), ISSN 0976 - 6375(Online), Volume 5, Issue 7, July (2014), pp. 11-16 © IAEME 16 VII. CONCLUSION AND FUTURE WORK In this project, Nephele, the first data processing framework to exploit the dynamic resource provisioning offered by today’s IaaS clouds. Nephele’s basic architecture and presented a performance comparison to the established data processing framework Hadoop. The performance evaluation gives a first impression on how the ability to assign specific virtual machine types to specific tasks of a processing job, as well as the possibility to automatically allocate/deallocate virtual machines in the course of a job execution, can help to improve the overall resource utilization and, consequently, reduce the processing cost. In particular, improving Nephele’s ability to adapt to resource overload or underutilization during the job execution automatically. IT technicians are leading the challenge, while academia is bit slower to react. Several groups have recently been formed, such as the Cloud Security Alliance or the Open Cloud Consortium, with the goal of exploring the possibilities offered by cloud computing and to establish a common language among different providers. Our current profiling approach builds a valuable basis for this; however, at the moment the system still requires a reasonable amount of user annotations. In general, an important contribution to the growing field of Cloud computing services and points out exciting new opportunities in the field of parallel data processing is represented. VIII. REFERENCES 1. W. Zhao, K.Ramamritham, and J.A.Stankovic, “Preemptive scheduling under time and resource constraints”, In: IEEE Transactions on Computers C-36 (8) (1987)949–960. 2. Alex King Yeung Cheung and Hans-Arno Jacobsen, “Green Resource Allocation Algorithms for Publish/Subscribe Systems”, In: the 31th IEEE International Conference on Distributed Computing Systems (ICDCS), 2011. 3. R. Chaiken, B. Jenkins, P.-A. Larson, B. Ramsey, D. Shakib, S. Weaver, and J. Zhou, “SCOPE: Easy and Efficient Parallel Processing of Massive DataSets,” Proc. Very Large Database. 4. The Apache Software Foundation “Welcome to Hadoop!” http://hadoop.apache.org/,2012. 5. D. Warneke and O. Kao, “Nephele: Efficient Parallel Data Processing in the Cloud,” Proc. Second Workshop Many-Task Computing on Grids and Supercomputers (MTAGS ’09), pp. 1-10, 2009. 6. I. Raicu, Y. Zhao, C. Dumitrescu, I. Foster, and M. Wilde. Falkon: a Fast and Light-weight tasKexecutiON framework. In SC ’07: Proceedings of the 2007 ACM/IEEE conference on Supercomputing, pages 1–12, New York, NY, USA, 2007. ACM. 7. J. Dean and S. Ghemawat. MapReduce: Simplified Data Processing on Large Clusters. In OSDI’04: Proceedings of the 6th conference on Symposium on Operating Systems Design Implementation, pages 10–10, Berkeley, CA, USA, 2004. USENIX Association. 8. D. Warneke and O. Kao, “Exploiting dynamic resource allocation for efficient parallel data processing in the cloud,” IEEE transactions on parallel and distributed systems, vol. 22, no. 6, June 2011. 9. T. White, Hadoop: The Definitive Guide. O’Reilly Media, 2009 10. Mr.Abhijit Adhikari, Ms.GandhaliUpadhye, “Efficient Dynamic Resource Allocation in Parallel Data Processing for Cloud using PACT”, IJARCSSE Vol.3, 11, Nov.2013. 11. Ms.GandhaliUpadhye, Prof.Trupti Dange, “Cloud Resource Allocation as Non-Preemptive Approach, International Conference on Current Trends in Engineering and Technology (ICCTET) 2014. 12. Priya Deshpande, Brijesh Khundhawala and Prasanna Joeg, “Dynamic Data Replication and Job Scheduling Based on Popularity and Category”, International Journal of Computer Engineering Technology (IJCET), Volume 4, Issue 5, 2013, pp. 109 - 114, ISSN Print: 0976 – 6367, ISSN Online: 0976 – 6375.