SlideShare a Scribd company logo
1 of 5
Download to read offline
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 
__________________________________________________________________________________________ 
Volume: 03 Special Issue: 03 | May-2014 | NCRIET-2014, Available @ http://www.ijret.org 53 
PERFORMANCE ANALYSIS OF SOBEL EDGE FILTER ON HETEROGENEOUS SYSTEM USING OPENCL Aruna Dore1, Sunitha Lasrado2 14th Sem M.Tech, VLSI Design and Embedded System, NMAMIT, Nitte, Karnataka, India 2Assistant Professor, Electronics and Communication, NMAMIT, Nitte, Karnataka, India Abstract The fundamental task required for any image or Video processing applications like video surveillance, medical imaging is Edge detection. Any of the filters available can be used to detect the edges. In this paper Sobel Edge filter is used for comparing the performance analysis on CPUs and GPUs and from this study it is found that the performance analysis on GPUs is much higher as compared to CPUs. Also it is seen that parallel execution time is much less as compared to sequential execution for the heterogeneous systems. Keywords: OpenCL, GPU, Convolution Filter 
----------------------------------------------------------------------***------------------------------------------------------------------------ 1. INTRODUCTION OpenCL is a parallel framework for heterogeneous computations. It is a standard which is royalty free which is platform independent and is used to work on any kind of computational device. It is a programming language that allows the programmer to write one version of the code that can be executed virtually on any device that has OpenCL drivers .OpenCL consists of an API for coordinating parallel computations in heterogeneous computational environment and a cross–platform programming language based on C99[9]. CUDA (Compute Unified Device Architecture) [8] is a popular development tool for scientific GPU computing provided by the NVIDIA manufacturer for its GPU products. OpenCL programming model [9], supports cross-platform, parallel programming of heterogeneous processing systems while CUDA is not designed for heterogeneous systems. CUDA is a parallel computing framework designed only for NVIDIA‟s GPUs [3], and OpenCL is a standard designed for diverse platforms including CUDA-enabled GPUs, some ATI- GPUs, multi-core CPUs from Intel and AMD, and other processors such as the Cell Broadband Engine. Sobel Edge detection using CUDA gives the performance analysis of different sized images. In this paper OpenCL is used to detect edges for images of different sizes and we see that the performance of a GPU is greater than the performance got through CPU for which the explanation will be given in the next section. 
2. OPENCL STANDARD 
2.1 OpenCL Architecture 
Open Computing Language is a framework for writing programs that execute across heterogeneous platform consisting of Central Processing Units (CPUs), Graphics Processing Units (GPUs), Digital Signal Processor (DSPs), Field Programmable Gate arrays and other computing devices. The OpenCL architecture consists of following programming models: 
 Platform Model 
 Execution Model 
 Memory Model 
 Programming Model 
2.1.1 Platform Model OpenCL platform model consists of host and „n‟ number of compute devices. The compute device consists of compute units which in turn consist of processing elements. This can be viewed as in Figure1.These processing elements execute code as Single Instruction Multiple Data (SIMD) or Single Program Multiple Data (SPMD). Fig -1: OpenCL Platform Model
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 
__________________________________________________________________________________________ 
Volume: 03 Special Issue: 03 | May-2014 | NCRIET-2014, Available @ http://www.ijret.org 54 
2.1.2 Execution Model 
The OpenCL application is split into two parts kernel and 
compute device. The part that runs on the host is named host 
program and the part that runs on the compute devices is 
called a kernel. To facilitate parallel execution an index space 
is created when the kernel is submitted to the device for 
computation. An instance of the kernel is called work-item 
.The work-items are organized in an N-Dimensional Range 
(NDRange)[7] where N can be 1, 2, or 3. Work-items can be 
grouped into work-group shown in Figure.2. The host program 
defines the device context which includes program objects, 
memory objects and kernel function. The interaction between 
host and OpenCL device is controlled by the host program 
using command queues. These commands can be memory 
commands which control all memory transfers, kernel 
execution commands which submit the kernel code for 
execution on the devices and synchronization commands 
which control the order of command execution. 
Fig -2: NDRange Index Space 
2.1.3 Memory Model 
The memory model of OpenCL includes Private memory, 
Local memory, Constant memory, and the Global memory. 
Host processor has host memory this memory is on CPU and 
accessible by the CPU. Compute device has global memory as 
well as constant memory. The compute device used in this 
work is GPU. The GPU architecture is discussed in next 
section. 
Fig -3: Memory Model of OpenCL [1] 
Global Memory: In this region all the work-item and work-group 
has both read and write access on both the compute 
device and the host. 
Constant Memory: This is Read only-Global memory 
accessible to all work-items. 
Local Memory: Each work-group has Local memory. 
Memory is shared within the work-group, i.e. work-items 
within the same work-group can access this memory region 
Private Memory: Memory visible to only work-item. 
2.1.4 Programming Model 
Under the OpenCL [9] programming model, computation can 
be done in data parallel, task parallel, or a hybrid of these two 
models. 
3. GRAPHIC PROCESSOR AND CENTRAL 
PROCESSING UNIT 
Modern GPUs are highly optimized and highly parallel 
computational units can outperform the traditional CPU both 
in terms of arithmetic operations and memory bandwidth.GPU 
uses hundreds [5] of simple cores in parallel to enhance 
performance. The performance of the CPU could hardly be 
increased anymore by increasing the clock frequency so they 
developed a CPU into which they installed several cores in 
order to increase performance. The comparison of the number 
of cores on CPU and GPU is shown in Figure 4. 
CPU Multiple cores 
GPU (hundreds of cores) 
Fig-4: Comparison of the number of cores on a CPU and a 
GPU
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 
__________________________________________________________________________________________ 
Volume: 03 Special Issue: 03 | May-2014 | NCRIET-2014, Available @ http://www.ijret.org 55 
3.1 Graphics Processor Architecture 
GPUs comprise of set of parallel multi processors which are 
called as Streaming multi processors(SM). Each SM contains 
an instruction cache, an instruction queue, a warp-scheduler, 
registers, 32 CUDA cores and special-function units (SFUs) as 
shown in Figure.5. Fermi contains up to 448 general purpose 
arithmetic units known as Streaming Processors (SP) .64 
Special Function Units (SFU) for computing special 
transcendental and algebraic functions not provided by the 
SPs. Groups of 32 SPs, 16 LDSTs, 4 SFUs, and 4 TEXs 
compose a Streaming Multiprocessor (SM). Each CUDA 
cores consists of single Arithmetic Logic Unit (ALU) and 
Floating Point Unit (FPU). 
Fig-5: Each Fermi SM includes 32 cores, 16 load/store units, 
four special-function units, a 32K-word register file, 64K of 
configurable RAM, and thread control logic. Each core has 
both floating-point and integer execution units. (Source: 
NVIDIA) 
3. IMAGE CONVOLUTION 
Edge detection is a common image processing technique used 
in feature detection and extraction. Applying edge detection 
on an image can significantly reduce the amount of data 
needed to be processed at a later phase while maintaining the 
important structure of the image. The idea is to remove 
everything from the image except the pixels that are part of an 
edge. A simple edge detection algorithm is to apply the Sobel 
edge detection algorithm. It involves convolving the image 
using a filter. 
Sobel operator consists of a pair of 3×3 convolution kernels as 
shown in Figure 6. One kernel is simply the other rotated by 
90°. 
Fig-6: Masks used by Sobel Operator [4] 
3.1 Mathematical Representation of Convolution 
Convolution is a simple mathematical operation which is used 
by many common image processing operators. The 
convolution theorem specifies that the applying convolution is 
the same as a per-frequency multiplication in the frequency 
domain i.e. if the basis for both the convolution kernel and the 
image were to be changed to one that consists of simple sine 
and cosine functions by applying a discrete Fourier transform 
then we can take each of these components, multiply them and 
get the same result. This means that Fourier transform of the 
convolution kernel can be taken and the dampened frequencies 
can be seen (those having an amplitude<1), strengthen (>1) or 
leave unchanged (=1).A maximum amplitude value of one 
indicates that each of the different frequencies are 
independently attenuated, i.e. the frequency components in an 
image can be filtered out. The convolution of 2 functions f (t) 
and g (t) is denoted by (f * g) (t) .Convolution theorem gives 
the inverse Laplace transform of a product of two transformed 
functions [2]: 
L-1{F(s) G(s)} = (f * g) (t) (1) 
Let f (t) and g (t) be two functions of t. The convolution of f 
(t) and g (t) is also a function of t, denoted by (f * g) (t).And is 
defined by the relation 
However, if f and g are both causal functions then f (t) and g 
(t) are written as f (t) u (t) and g (t) u (t) respectively, so that 
Because of the properties of the step functions (u (t – x) = 0 if 
x>t and u(x) = 0 if x<0). 
Image convolution can be efficiently implemented on 
massively parallel hardware, since the same operator gets 
executed independently for each image pixel. Figure7 shows 
the convolution using a small 3 x 3 kernel.
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 
__________________________________________________________________________________________ 
Volume: 03 Special Issue: 03 | May-2014 | NCRIET-2014, Available @ http://www.ijret.org 56 
A convolution kernel works by iterating over each pixel in the input image. For each source pixel, the filter is centred over the pixel and the values of the filter multiply the pixel values that they overlay. A sum of the products is then taken to produce a new pixel value. Fig- 7: Convolution using a 3 x 3 kernel Steps of Sobel Edge detection 
 Create Image buffer and Objects 
 Write the Input Data 
 Create Sampler Object 
 Compile and Execute the Kernel 
 Read the Result 
4. EXPERIMENTAL RESULTS The main objective is to develop the OpenCL application that runs on any GPUs like AMD, Intel, and NVIDIA etc with less computation time and implementations of OpenCL are available for the operating systems like Mac OS X 10.6 and higher, Microsoft Windows and Linux. 4.1 Experimental Setup For this experimental set up Intel i7 processor and NVIDIA GTX 470 devices used. Detailed Technical specification is given in the table 1. The code is compiled using OpenCL 1.1 NVIDIA CUDA version Table -1: Device Technical Specification 4.2 Results The size of image is varied from 512 pixels to 2048 pixels. When image size is increased the serial program execution time considerably increases. Execution times in serial and parallel versions over CPU and GPU devices are presented in Table 2, 3 respectively and Graphs are plotted. Speed up for sobel filter is given in the Table 4. Table -2: Execution time of serial and parallel over CPU 
Table -3: Execution time of serial and parallel over GPU 
Table -4: Speed up for Sobel Filter with CPU and GPU 
Chart -1: CPU Speed up for Serial Vs Parallel 
Specifications 
GPU 
CPU 
Vendor 
NVIDIA GeForce GTX 470 
INTEL Core i7 
No. Of cores 
448 
8 
Clock(MHz) 
1215 
3400 
Global Memory(MB) 
1280 
2047 
Local Memory(KB) 
48 
32 
Max.Work-Item size 
1024 x 1024 x 64 
1024 x 1024 x 1024 
Max.Work-Group size 
1024 
1024 
Input Image 
T serial (sequential) (s) 
TCPU (parallel) (s) 
Speedup T serial/ TCPU 
512x512 
3.539 
0.102 
34.696 
1024x1024 
5.568 
0.376 
14.808 
2048x2048 
17.434 
2.788 
6.2532 
Input Image 
T serial (sequential) (s) 
TGPU (parallel) (ms) 
Speedup Tserial/TGPU 
512x512 
3.539 
0.0021 
1685.238 
1024x1024 
5.568 
0.011 
506.181 
2048x2048 
17.434 
0.0843 
206.809 
Input Image 
CPU Processing (ms) 
GPU Processing (ms) 
Speedup 
512x512 
102.001 
2.1 
48.5719 
1024x1024 
376.012 
11.01 
34.1518 
2048x2048 
2788.04 
84.023 
33.1818
IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 
__________________________________________________________________________________________ 
Volume: 03 Special Issue: 03 | May-2014 | NCRIET-2014, Available @ http://www.ijret.org 57 
Chart -2: GPU Speed up for Serial Vs Parallel 
Speed up for Sobel with CPU and GPU 
0 
10 
20 
30 
40 
50 
60 
512x512 1024x1024 2592x3240 
Input Image 
Processing Time (ms) 
Speed UP 
Chart -3: Speed up for Sobel with CPU and GPU 
Efficiency = Speedup/#.of cores 
Chart -4: Efficiency of CPU Vs GPU for Sobel filter 
5. CONCLUSIONS 
In this work parallel programming of OpenCL studied. 
OpenCL is tool used for programming on GPU regardless of 
vendors. OpenCL provides an interface for the interaction of 
hosts with accelerator devices like GPUs, DSPs and FPGAs. 
Graphics Processing Units are highly useful in parallel 
computing. In this paper Sobel edge detection is implemented 
on GPU using OpenCL in C and compared with sequential 
execution and efficiency of CPU and GPU is discussed. As the 
input size is increased the OpenCL gives good performance. 
The technologies advances in FPGA devices offering 
hundreds GFLOPs (Giga Flip Flop per Seconds) with 
maximum power efficiency has made way for parallel 
processing community towards them. As a future expansion 
the use of FPGAs using OpenCL as programming language 
will be considered for comparison of speed ups with the 
CPUs, GPUs, and FPGAs 
REFERENCES 
[1]. Aaftab Munshi, Bendict R. Gaster, Timothy G. Mattson, 
“OpenCL Programming Guide,”Published by Pearson 
Inc.,2012 
[2]. Anirud Pande,Rohit Chandna,”Matrix Convolution using 
Parallel Programming,” International Journal of Science and 
Reasearch (IJSR),India Volume 2 Issue 7, July 2013 
[3]. Jia Tse, “Image Processing with CUDA,” Master‟s Thesis, 
University Of Nevada, Las Vegas, August2012 
[4]. J inu Prabhakar.K.P , Anitha Mary.X,” FPGA Based 
Lane Deviation System Using System Generator” in 
International Journal of Advanced Research in Computer 
Science and Software Engineering ,Volume 3, Issue 2, 
February 2013 
[5]. K. Fatahalian and M. Houston, “A closer look at GPUs,” 
Communications of the ACM, vol.51, No. 10, October 2008. 
[6]. Krishnahari Thouti and S.R.Sathe, “A Methodology for 
translating C-Programs to OpenCL,”, vol. 82, International 
Journal of Computer Applications(0975- 8887), 2013, pp. 11- 
16. 
[7]. Matthew Scarpino,”OpenCL in Action,” by Manning 
Publications Co., 2012 
[8]. Nvidia, OpenCL Programming Guide Version 2.3.pdf 
[9]. OpenCL, “The open standard for parallel programming of 
heterogeneous systems,” [Online]. Available: 
http://www.khronos.org/opencl. 
[10]. Tadeusz Puzniakowski, “Performance of OpenCL” The 
University of Gdansk, 2012

More Related Content

What's hot

MEDIAN BASED PARALLEL STEERING KERNEL REGRESSION FOR IMAGE RECONSTRUCTION
MEDIAN BASED PARALLEL STEERING KERNEL REGRESSION FOR IMAGE RECONSTRUCTIONMEDIAN BASED PARALLEL STEERING KERNEL REGRESSION FOR IMAGE RECONSTRUCTION
MEDIAN BASED PARALLEL STEERING KERNEL REGRESSION FOR IMAGE RECONSTRUCTIONcsandit
 
Median based parallel steering kernel regression for image reconstruction
Median based parallel steering kernel regression for image reconstructionMedian based parallel steering kernel regression for image reconstruction
Median based parallel steering kernel regression for image reconstructioncsandit
 
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONS
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONSA SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONS
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONScseij
 
Digital Implementation of Artificial Neural Network for Function Approximatio...
Digital Implementation of Artificial Neural Network for Function Approximatio...Digital Implementation of Artificial Neural Network for Function Approximatio...
Digital Implementation of Artificial Neural Network for Function Approximatio...IOSR Journals
 
Memory allocation for real time operating system
Memory allocation for real time operating systemMemory allocation for real time operating system
Memory allocation for real time operating systemAsma'a Lafi
 
Real Time Face Detection on GPU Using OPENCL
Real Time Face Detection on GPU Using OPENCLReal Time Face Detection on GPU Using OPENCL
Real Time Face Detection on GPU Using OPENCLcsandit
 
Using Many-Core Processors to Improve the Performance of Space Computing Plat...
Using Many-Core Processors to Improve the Performance of Space Computing Plat...Using Many-Core Processors to Improve the Performance of Space Computing Plat...
Using Many-Core Processors to Improve the Performance of Space Computing Plat...Fisnik Kraja
 
MULTI-CORE PROCESSORS: CONCEPTS AND IMPLEMENTATIONS
MULTI-CORE PROCESSORS: CONCEPTS AND IMPLEMENTATIONSMULTI-CORE PROCESSORS: CONCEPTS AND IMPLEMENTATIONS
MULTI-CORE PROCESSORS: CONCEPTS AND IMPLEMENTATIONSijcsit
 
Design and development of DrawBot using image processing
Design and development of DrawBot using image processing Design and development of DrawBot using image processing
Design and development of DrawBot using image processing IJECEIAES
 
Comparative study to realize an automatic speaker recognition system
Comparative study to realize an automatic speaker recognition system Comparative study to realize an automatic speaker recognition system
Comparative study to realize an automatic speaker recognition system IJECEIAES
 
Real-time traffic sign detection and recognition using Raspberry Pi
Real-time traffic sign detection and recognition using Raspberry Pi Real-time traffic sign detection and recognition using Raspberry Pi
Real-time traffic sign detection and recognition using Raspberry Pi IJECEIAES
 
Implementation of Vedic Multiplier in Image Compression Using Discrete Wavele...
Implementation of Vedic Multiplier in Image Compression Using Discrete Wavele...Implementation of Vedic Multiplier in Image Compression Using Discrete Wavele...
Implementation of Vedic Multiplier in Image Compression Using Discrete Wavele...IJSRD
 
OPTIMIZED TASK ALLOCATION IN SENSOR NETWORKS
OPTIMIZED TASK ALLOCATION IN SENSOR NETWORKSOPTIMIZED TASK ALLOCATION IN SENSOR NETWORKS
OPTIMIZED TASK ALLOCATION IN SENSOR NETWORKSZac Darcy
 

What's hot (17)

MEDIAN BASED PARALLEL STEERING KERNEL REGRESSION FOR IMAGE RECONSTRUCTION
MEDIAN BASED PARALLEL STEERING KERNEL REGRESSION FOR IMAGE RECONSTRUCTIONMEDIAN BASED PARALLEL STEERING KERNEL REGRESSION FOR IMAGE RECONSTRUCTION
MEDIAN BASED PARALLEL STEERING KERNEL REGRESSION FOR IMAGE RECONSTRUCTION
 
Median based parallel steering kernel regression for image reconstruction
Median based parallel steering kernel regression for image reconstructionMedian based parallel steering kernel regression for image reconstruction
Median based parallel steering kernel regression for image reconstruction
 
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONS
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONSA SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONS
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONS
 
Digital Implementation of Artificial Neural Network for Function Approximatio...
Digital Implementation of Artificial Neural Network for Function Approximatio...Digital Implementation of Artificial Neural Network for Function Approximatio...
Digital Implementation of Artificial Neural Network for Function Approximatio...
 
Memory allocation for real time operating system
Memory allocation for real time operating systemMemory allocation for real time operating system
Memory allocation for real time operating system
 
Real Time Face Detection on GPU Using OPENCL
Real Time Face Detection on GPU Using OPENCLReal Time Face Detection on GPU Using OPENCL
Real Time Face Detection on GPU Using OPENCL
 
Co question 2006
Co question 2006Co question 2006
Co question 2006
 
Using Many-Core Processors to Improve the Performance of Space Computing Plat...
Using Many-Core Processors to Improve the Performance of Space Computing Plat...Using Many-Core Processors to Improve the Performance of Space Computing Plat...
Using Many-Core Processors to Improve the Performance of Space Computing Plat...
 
MULTI-CORE PROCESSORS: CONCEPTS AND IMPLEMENTATIONS
MULTI-CORE PROCESSORS: CONCEPTS AND IMPLEMENTATIONSMULTI-CORE PROCESSORS: CONCEPTS AND IMPLEMENTATIONS
MULTI-CORE PROCESSORS: CONCEPTS AND IMPLEMENTATIONS
 
035
035035
035
 
Design and development of DrawBot using image processing
Design and development of DrawBot using image processing Design and development of DrawBot using image processing
Design and development of DrawBot using image processing
 
Comparative study to realize an automatic speaker recognition system
Comparative study to realize an automatic speaker recognition system Comparative study to realize an automatic speaker recognition system
Comparative study to realize an automatic speaker recognition system
 
Real-time traffic sign detection and recognition using Raspberry Pi
Real-time traffic sign detection and recognition using Raspberry Pi Real-time traffic sign detection and recognition using Raspberry Pi
Real-time traffic sign detection and recognition using Raspberry Pi
 
IMQA Poster
IMQA PosterIMQA Poster
IMQA Poster
 
M017419499
M017419499M017419499
M017419499
 
Implementation of Vedic Multiplier in Image Compression Using Discrete Wavele...
Implementation of Vedic Multiplier in Image Compression Using Discrete Wavele...Implementation of Vedic Multiplier in Image Compression Using Discrete Wavele...
Implementation of Vedic Multiplier in Image Compression Using Discrete Wavele...
 
OPTIMIZED TASK ALLOCATION IN SENSOR NETWORKS
OPTIMIZED TASK ALLOCATION IN SENSOR NETWORKSOPTIMIZED TASK ALLOCATION IN SENSOR NETWORKS
OPTIMIZED TASK ALLOCATION IN SENSOR NETWORKS
 

Viewers also liked

Uncertainty modelling and limit state reliability of
Uncertainty modelling and limit state reliability ofUncertainty modelling and limit state reliability of
Uncertainty modelling and limit state reliability ofeSAT Publishing House
 
An improved color image encryption algorithm with
An improved color image encryption algorithm withAn improved color image encryption algorithm with
An improved color image encryption algorithm witheSAT Publishing House
 
Comparative analysis of dynamic programming
Comparative analysis of dynamic programmingComparative analysis of dynamic programming
Comparative analysis of dynamic programmingeSAT Publishing House
 
Formulating a trip production prediction model for
Formulating a trip production prediction model forFormulating a trip production prediction model for
Formulating a trip production prediction model foreSAT Publishing House
 
An analysis of raw materials for concretes as metal sheets for solar radiatio...
An analysis of raw materials for concretes as metal sheets for solar radiatio...An analysis of raw materials for concretes as metal sheets for solar radiatio...
An analysis of raw materials for concretes as metal sheets for solar radiatio...eSAT Publishing House
 
Finite element analysis of dissimilar welding between
Finite element analysis of dissimilar welding betweenFinite element analysis of dissimilar welding between
Finite element analysis of dissimilar welding betweeneSAT Publishing House
 
Minimum shear reinforcement for optimum ductility of
Minimum shear reinforcement for optimum ductility ofMinimum shear reinforcement for optimum ductility of
Minimum shear reinforcement for optimum ductility ofeSAT Publishing House
 
Eye sight determination on tablet based hand held device with image processin...
Eye sight determination on tablet based hand held device with image processin...Eye sight determination on tablet based hand held device with image processin...
Eye sight determination on tablet based hand held device with image processin...eSAT Publishing House
 
Improving the performance parameters of microstrip
Improving the performance parameters of microstripImproving the performance parameters of microstrip
Improving the performance parameters of microstripeSAT Publishing House
 
Smfir technology based transportation system and applicability of mppt
Smfir technology based transportation system and applicability of mpptSmfir technology based transportation system and applicability of mppt
Smfir technology based transportation system and applicability of mppteSAT Publishing House
 
Information security by biometric waterking
Information security by biometric waterkingInformation security by biometric waterking
Information security by biometric waterkingeSAT Publishing House
 
Novel method to find the parameter for noise removal
Novel method to find the parameter for noise removalNovel method to find the parameter for noise removal
Novel method to find the parameter for noise removaleSAT Publishing House
 
New approach for generalised unsharp masking alogorithm
New approach for generalised unsharp masking alogorithmNew approach for generalised unsharp masking alogorithm
New approach for generalised unsharp masking alogorithmeSAT Publishing House
 
Design and development of fall detector using fall
Design and development of fall detector using fallDesign and development of fall detector using fall
Design and development of fall detector using falleSAT Publishing House
 
Stability analysis of orthotropic reinforce concrete shear wall
Stability analysis of orthotropic reinforce concrete shear wallStability analysis of orthotropic reinforce concrete shear wall
Stability analysis of orthotropic reinforce concrete shear walleSAT Publishing House
 
The review on automatic license plate recognition
The review on automatic license plate recognitionThe review on automatic license plate recognition
The review on automatic license plate recognitioneSAT Publishing House
 
Testing of memory using franklin method
Testing of memory using franklin methodTesting of memory using franklin method
Testing of memory using franklin methodeSAT Publishing House
 
Improving quality of service using ofdm technique for 4 th generation network
Improving quality of service using ofdm technique for 4 th generation networkImproving quality of service using ofdm technique for 4 th generation network
Improving quality of service using ofdm technique for 4 th generation networkeSAT Publishing House
 
Treatment of solid waste leachate by
Treatment of solid waste leachate byTreatment of solid waste leachate by
Treatment of solid waste leachate byeSAT Publishing House
 
An experimental study on durability of high strength
An experimental study on durability of high strengthAn experimental study on durability of high strength
An experimental study on durability of high strengtheSAT Publishing House
 

Viewers also liked (20)

Uncertainty modelling and limit state reliability of
Uncertainty modelling and limit state reliability ofUncertainty modelling and limit state reliability of
Uncertainty modelling and limit state reliability of
 
An improved color image encryption algorithm with
An improved color image encryption algorithm withAn improved color image encryption algorithm with
An improved color image encryption algorithm with
 
Comparative analysis of dynamic programming
Comparative analysis of dynamic programmingComparative analysis of dynamic programming
Comparative analysis of dynamic programming
 
Formulating a trip production prediction model for
Formulating a trip production prediction model forFormulating a trip production prediction model for
Formulating a trip production prediction model for
 
An analysis of raw materials for concretes as metal sheets for solar radiatio...
An analysis of raw materials for concretes as metal sheets for solar radiatio...An analysis of raw materials for concretes as metal sheets for solar radiatio...
An analysis of raw materials for concretes as metal sheets for solar radiatio...
 
Finite element analysis of dissimilar welding between
Finite element analysis of dissimilar welding betweenFinite element analysis of dissimilar welding between
Finite element analysis of dissimilar welding between
 
Minimum shear reinforcement for optimum ductility of
Minimum shear reinforcement for optimum ductility ofMinimum shear reinforcement for optimum ductility of
Minimum shear reinforcement for optimum ductility of
 
Eye sight determination on tablet based hand held device with image processin...
Eye sight determination on tablet based hand held device with image processin...Eye sight determination on tablet based hand held device with image processin...
Eye sight determination on tablet based hand held device with image processin...
 
Improving the performance parameters of microstrip
Improving the performance parameters of microstripImproving the performance parameters of microstrip
Improving the performance parameters of microstrip
 
Smfir technology based transportation system and applicability of mppt
Smfir technology based transportation system and applicability of mpptSmfir technology based transportation system and applicability of mppt
Smfir technology based transportation system and applicability of mppt
 
Information security by biometric waterking
Information security by biometric waterkingInformation security by biometric waterking
Information security by biometric waterking
 
Novel method to find the parameter for noise removal
Novel method to find the parameter for noise removalNovel method to find the parameter for noise removal
Novel method to find the parameter for noise removal
 
New approach for generalised unsharp masking alogorithm
New approach for generalised unsharp masking alogorithmNew approach for generalised unsharp masking alogorithm
New approach for generalised unsharp masking alogorithm
 
Design and development of fall detector using fall
Design and development of fall detector using fallDesign and development of fall detector using fall
Design and development of fall detector using fall
 
Stability analysis of orthotropic reinforce concrete shear wall
Stability analysis of orthotropic reinforce concrete shear wallStability analysis of orthotropic reinforce concrete shear wall
Stability analysis of orthotropic reinforce concrete shear wall
 
The review on automatic license plate recognition
The review on automatic license plate recognitionThe review on automatic license plate recognition
The review on automatic license plate recognition
 
Testing of memory using franklin method
Testing of memory using franklin methodTesting of memory using franklin method
Testing of memory using franklin method
 
Improving quality of service using ofdm technique for 4 th generation network
Improving quality of service using ofdm technique for 4 th generation networkImproving quality of service using ofdm technique for 4 th generation network
Improving quality of service using ofdm technique for 4 th generation network
 
Treatment of solid waste leachate by
Treatment of solid waste leachate byTreatment of solid waste leachate by
Treatment of solid waste leachate by
 
An experimental study on durability of high strength
An experimental study on durability of high strengthAn experimental study on durability of high strength
An experimental study on durability of high strength
 

Similar to Performance analysis of sobel edge filter on heterogeneous system using opencl

Accelerating Real Time Applications on Heterogeneous Platforms
Accelerating Real Time Applications on Heterogeneous PlatformsAccelerating Real Time Applications on Heterogeneous Platforms
Accelerating Real Time Applications on Heterogeneous PlatformsIJMER
 
Implementation of linux based uart device driver
Implementation of linux based uart device driverImplementation of linux based uart device driver
Implementation of linux based uart device drivereSAT Publishing House
 
Fpga implementation of a functional microcontroller
Fpga implementation of a functional microcontrollerFpga implementation of a functional microcontroller
Fpga implementation of a functional microcontrollereSAT Publishing House
 
IRJET- Latin Square Computation of Order-3 using Open CL
IRJET- Latin Square Computation of Order-3 using Open CLIRJET- Latin Square Computation of Order-3 using Open CL
IRJET- Latin Square Computation of Order-3 using Open CLIRJET Journal
 
Parallel and Distributed Computing Chapter 8
Parallel and Distributed Computing Chapter 8Parallel and Distributed Computing Chapter 8
Parallel and Distributed Computing Chapter 8AbdullahMunir32
 
Accelerated seam carving using cuda
Accelerated seam carving using cudaAccelerated seam carving using cuda
Accelerated seam carving using cudaeSAT Journals
 
Medical imaging computing based on graphical processing units for high perfor...
Medical imaging computing based on graphical processing units for high perfor...Medical imaging computing based on graphical processing units for high perfor...
Medical imaging computing based on graphical processing units for high perfor...eSAT Publishing House
 
A Parallel Computing-a Paradigm to achieve High Performance
A Parallel Computing-a Paradigm to achieve High PerformanceA Parallel Computing-a Paradigm to achieve High Performance
A Parallel Computing-a Paradigm to achieve High PerformanceAM Publications
 
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD Editor
 
Realization of high performance run time loadable mips soft-core processor
Realization of high performance run time loadable mips soft-core processorRealization of high performance run time loadable mips soft-core processor
Realization of high performance run time loadable mips soft-core processoreSAT Publishing House
 
Performance evaluation of larger matrices over cluster of four nodes using mpi
Performance evaluation of larger matrices over cluster of four nodes using mpiPerformance evaluation of larger matrices over cluster of four nodes using mpi
Performance evaluation of larger matrices over cluster of four nodes using mpieSAT Journals
 
Design and Implementation of Pipelined 8-Bit RISC Processor using Verilog HDL...
Design and Implementation of Pipelined 8-Bit RISC Processor using Verilog HDL...Design and Implementation of Pipelined 8-Bit RISC Processor using Verilog HDL...
Design and Implementation of Pipelined 8-Bit RISC Processor using Verilog HDL...IRJET Journal
 
Driving cycle tracking device big data storing and management
Driving cycle tracking device big data storing and managementDriving cycle tracking device big data storing and management
Driving cycle tracking device big data storing and managementIJECEIAES
 
An area and power efficient on chip communication architectures for image enc...
An area and power efficient on chip communication architectures for image enc...An area and power efficient on chip communication architectures for image enc...
An area and power efficient on chip communication architectures for image enc...eSAT Publishing House
 

Similar to Performance analysis of sobel edge filter on heterogeneous system using opencl (20)

Accelerating Real Time Applications on Heterogeneous Platforms
Accelerating Real Time Applications on Heterogeneous PlatformsAccelerating Real Time Applications on Heterogeneous Platforms
Accelerating Real Time Applications on Heterogeneous Platforms
 
Implementation of linux based uart device driver
Implementation of linux based uart device driverImplementation of linux based uart device driver
Implementation of linux based uart device driver
 
Fpga implementation of a functional microcontroller
Fpga implementation of a functional microcontrollerFpga implementation of a functional microcontroller
Fpga implementation of a functional microcontroller
 
IRJET- Latin Square Computation of Order-3 using Open CL
IRJET- Latin Square Computation of Order-3 using Open CLIRJET- Latin Square Computation of Order-3 using Open CL
IRJET- Latin Square Computation of Order-3 using Open CL
 
H344250
H344250H344250
H344250
 
Parallel and Distributed Computing Chapter 8
Parallel and Distributed Computing Chapter 8Parallel and Distributed Computing Chapter 8
Parallel and Distributed Computing Chapter 8
 
Accelerated seam carving using cuda
Accelerated seam carving using cudaAccelerated seam carving using cuda
Accelerated seam carving using cuda
 
Accelerated seam carving using cuda
Accelerated seam carving using cudaAccelerated seam carving using cuda
Accelerated seam carving using cuda
 
Medical imaging computing based on graphical processing units for high perfor...
Medical imaging computing based on graphical processing units for high perfor...Medical imaging computing based on graphical processing units for high perfor...
Medical imaging computing based on graphical processing units for high perfor...
 
Caps a tool for process scheduling
Caps a tool for process schedulingCaps a tool for process scheduling
Caps a tool for process scheduling
 
A Parallel Computing-a Paradigm to achieve High Performance
A Parallel Computing-a Paradigm to achieve High PerformanceA Parallel Computing-a Paradigm to achieve High Performance
A Parallel Computing-a Paradigm to achieve High Performance
 
20120140502007 2-3
20120140502007 2-320120140502007 2-3
20120140502007 2-3
 
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
 
Realization of high performance run time loadable mips soft-core processor
Realization of high performance run time loadable mips soft-core processorRealization of high performance run time loadable mips soft-core processor
Realization of high performance run time loadable mips soft-core processor
 
A044050107
A044050107A044050107
A044050107
 
Performance evaluation of larger matrices over cluster of four nodes using mpi
Performance evaluation of larger matrices over cluster of four nodes using mpiPerformance evaluation of larger matrices over cluster of four nodes using mpi
Performance evaluation of larger matrices over cluster of four nodes using mpi
 
Design and Implementation of Pipelined 8-Bit RISC Processor using Verilog HDL...
Design and Implementation of Pipelined 8-Bit RISC Processor using Verilog HDL...Design and Implementation of Pipelined 8-Bit RISC Processor using Verilog HDL...
Design and Implementation of Pipelined 8-Bit RISC Processor using Verilog HDL...
 
Driving cycle tracking device big data storing and management
Driving cycle tracking device big data storing and managementDriving cycle tracking device big data storing and management
Driving cycle tracking device big data storing and management
 
GRID COMPUTING
GRID COMPUTINGGRID COMPUTING
GRID COMPUTING
 
An area and power efficient on chip communication architectures for image enc...
An area and power efficient on chip communication architectures for image enc...An area and power efficient on chip communication architectures for image enc...
An area and power efficient on chip communication architectures for image enc...
 

More from eSAT Publishing House

Likely impacts of hudhud on the environment of visakhapatnam
Likely impacts of hudhud on the environment of visakhapatnamLikely impacts of hudhud on the environment of visakhapatnam
Likely impacts of hudhud on the environment of visakhapatnameSAT Publishing House
 
Impact of flood disaster in a drought prone area – case study of alampur vill...
Impact of flood disaster in a drought prone area – case study of alampur vill...Impact of flood disaster in a drought prone area – case study of alampur vill...
Impact of flood disaster in a drought prone area – case study of alampur vill...eSAT Publishing House
 
Hudhud cyclone – a severe disaster in visakhapatnam
Hudhud cyclone – a severe disaster in visakhapatnamHudhud cyclone – a severe disaster in visakhapatnam
Hudhud cyclone – a severe disaster in visakhapatnameSAT Publishing House
 
Groundwater investigation using geophysical methods a case study of pydibhim...
Groundwater investigation using geophysical methods  a case study of pydibhim...Groundwater investigation using geophysical methods  a case study of pydibhim...
Groundwater investigation using geophysical methods a case study of pydibhim...eSAT Publishing House
 
Flood related disasters concerned to urban flooding in bangalore, india
Flood related disasters concerned to urban flooding in bangalore, indiaFlood related disasters concerned to urban flooding in bangalore, india
Flood related disasters concerned to urban flooding in bangalore, indiaeSAT Publishing House
 
Enhancing post disaster recovery by optimal infrastructure capacity building
Enhancing post disaster recovery by optimal infrastructure capacity buildingEnhancing post disaster recovery by optimal infrastructure capacity building
Enhancing post disaster recovery by optimal infrastructure capacity buildingeSAT Publishing House
 
Effect of lintel and lintel band on the global performance of reinforced conc...
Effect of lintel and lintel band on the global performance of reinforced conc...Effect of lintel and lintel band on the global performance of reinforced conc...
Effect of lintel and lintel band on the global performance of reinforced conc...eSAT Publishing House
 
Wind damage to trees in the gitam university campus at visakhapatnam by cyclo...
Wind damage to trees in the gitam university campus at visakhapatnam by cyclo...Wind damage to trees in the gitam university campus at visakhapatnam by cyclo...
Wind damage to trees in the gitam university campus at visakhapatnam by cyclo...eSAT Publishing House
 
Wind damage to buildings, infrastrucuture and landscape elements along the be...
Wind damage to buildings, infrastrucuture and landscape elements along the be...Wind damage to buildings, infrastrucuture and landscape elements along the be...
Wind damage to buildings, infrastrucuture and landscape elements along the be...eSAT Publishing House
 
Shear strength of rc deep beam panels – a review
Shear strength of rc deep beam panels – a reviewShear strength of rc deep beam panels – a review
Shear strength of rc deep beam panels – a revieweSAT Publishing House
 
Role of voluntary teams of professional engineers in dissater management – ex...
Role of voluntary teams of professional engineers in dissater management – ex...Role of voluntary teams of professional engineers in dissater management – ex...
Role of voluntary teams of professional engineers in dissater management – ex...eSAT Publishing House
 
Risk analysis and environmental hazard management
Risk analysis and environmental hazard managementRisk analysis and environmental hazard management
Risk analysis and environmental hazard managementeSAT Publishing House
 
Review study on performance of seismically tested repaired shear walls
Review study on performance of seismically tested repaired shear wallsReview study on performance of seismically tested repaired shear walls
Review study on performance of seismically tested repaired shear wallseSAT Publishing House
 
Monitoring and assessment of air quality with reference to dust particles (pm...
Monitoring and assessment of air quality with reference to dust particles (pm...Monitoring and assessment of air quality with reference to dust particles (pm...
Monitoring and assessment of air quality with reference to dust particles (pm...eSAT Publishing House
 
Low cost wireless sensor networks and smartphone applications for disaster ma...
Low cost wireless sensor networks and smartphone applications for disaster ma...Low cost wireless sensor networks and smartphone applications for disaster ma...
Low cost wireless sensor networks and smartphone applications for disaster ma...eSAT Publishing House
 
Coastal zones – seismic vulnerability an analysis from east coast of india
Coastal zones – seismic vulnerability an analysis from east coast of indiaCoastal zones – seismic vulnerability an analysis from east coast of india
Coastal zones – seismic vulnerability an analysis from east coast of indiaeSAT Publishing House
 
Can fracture mechanics predict damage due disaster of structures
Can fracture mechanics predict damage due disaster of structuresCan fracture mechanics predict damage due disaster of structures
Can fracture mechanics predict damage due disaster of structureseSAT Publishing House
 
Assessment of seismic susceptibility of rc buildings
Assessment of seismic susceptibility of rc buildingsAssessment of seismic susceptibility of rc buildings
Assessment of seismic susceptibility of rc buildingseSAT Publishing House
 
A geophysical insight of earthquake occurred on 21 st may 2014 off paradip, b...
A geophysical insight of earthquake occurred on 21 st may 2014 off paradip, b...A geophysical insight of earthquake occurred on 21 st may 2014 off paradip, b...
A geophysical insight of earthquake occurred on 21 st may 2014 off paradip, b...eSAT Publishing House
 
Effect of hudhud cyclone on the development of visakhapatnam as smart and gre...
Effect of hudhud cyclone on the development of visakhapatnam as smart and gre...Effect of hudhud cyclone on the development of visakhapatnam as smart and gre...
Effect of hudhud cyclone on the development of visakhapatnam as smart and gre...eSAT Publishing House
 

More from eSAT Publishing House (20)

Likely impacts of hudhud on the environment of visakhapatnam
Likely impacts of hudhud on the environment of visakhapatnamLikely impacts of hudhud on the environment of visakhapatnam
Likely impacts of hudhud on the environment of visakhapatnam
 
Impact of flood disaster in a drought prone area – case study of alampur vill...
Impact of flood disaster in a drought prone area – case study of alampur vill...Impact of flood disaster in a drought prone area – case study of alampur vill...
Impact of flood disaster in a drought prone area – case study of alampur vill...
 
Hudhud cyclone – a severe disaster in visakhapatnam
Hudhud cyclone – a severe disaster in visakhapatnamHudhud cyclone – a severe disaster in visakhapatnam
Hudhud cyclone – a severe disaster in visakhapatnam
 
Groundwater investigation using geophysical methods a case study of pydibhim...
Groundwater investigation using geophysical methods  a case study of pydibhim...Groundwater investigation using geophysical methods  a case study of pydibhim...
Groundwater investigation using geophysical methods a case study of pydibhim...
 
Flood related disasters concerned to urban flooding in bangalore, india
Flood related disasters concerned to urban flooding in bangalore, indiaFlood related disasters concerned to urban flooding in bangalore, india
Flood related disasters concerned to urban flooding in bangalore, india
 
Enhancing post disaster recovery by optimal infrastructure capacity building
Enhancing post disaster recovery by optimal infrastructure capacity buildingEnhancing post disaster recovery by optimal infrastructure capacity building
Enhancing post disaster recovery by optimal infrastructure capacity building
 
Effect of lintel and lintel band on the global performance of reinforced conc...
Effect of lintel and lintel band on the global performance of reinforced conc...Effect of lintel and lintel band on the global performance of reinforced conc...
Effect of lintel and lintel band on the global performance of reinforced conc...
 
Wind damage to trees in the gitam university campus at visakhapatnam by cyclo...
Wind damage to trees in the gitam university campus at visakhapatnam by cyclo...Wind damage to trees in the gitam university campus at visakhapatnam by cyclo...
Wind damage to trees in the gitam university campus at visakhapatnam by cyclo...
 
Wind damage to buildings, infrastrucuture and landscape elements along the be...
Wind damage to buildings, infrastrucuture and landscape elements along the be...Wind damage to buildings, infrastrucuture and landscape elements along the be...
Wind damage to buildings, infrastrucuture and landscape elements along the be...
 
Shear strength of rc deep beam panels – a review
Shear strength of rc deep beam panels – a reviewShear strength of rc deep beam panels – a review
Shear strength of rc deep beam panels – a review
 
Role of voluntary teams of professional engineers in dissater management – ex...
Role of voluntary teams of professional engineers in dissater management – ex...Role of voluntary teams of professional engineers in dissater management – ex...
Role of voluntary teams of professional engineers in dissater management – ex...
 
Risk analysis and environmental hazard management
Risk analysis and environmental hazard managementRisk analysis and environmental hazard management
Risk analysis and environmental hazard management
 
Review study on performance of seismically tested repaired shear walls
Review study on performance of seismically tested repaired shear wallsReview study on performance of seismically tested repaired shear walls
Review study on performance of seismically tested repaired shear walls
 
Monitoring and assessment of air quality with reference to dust particles (pm...
Monitoring and assessment of air quality with reference to dust particles (pm...Monitoring and assessment of air quality with reference to dust particles (pm...
Monitoring and assessment of air quality with reference to dust particles (pm...
 
Low cost wireless sensor networks and smartphone applications for disaster ma...
Low cost wireless sensor networks and smartphone applications for disaster ma...Low cost wireless sensor networks and smartphone applications for disaster ma...
Low cost wireless sensor networks and smartphone applications for disaster ma...
 
Coastal zones – seismic vulnerability an analysis from east coast of india
Coastal zones – seismic vulnerability an analysis from east coast of indiaCoastal zones – seismic vulnerability an analysis from east coast of india
Coastal zones – seismic vulnerability an analysis from east coast of india
 
Can fracture mechanics predict damage due disaster of structures
Can fracture mechanics predict damage due disaster of structuresCan fracture mechanics predict damage due disaster of structures
Can fracture mechanics predict damage due disaster of structures
 
Assessment of seismic susceptibility of rc buildings
Assessment of seismic susceptibility of rc buildingsAssessment of seismic susceptibility of rc buildings
Assessment of seismic susceptibility of rc buildings
 
A geophysical insight of earthquake occurred on 21 st may 2014 off paradip, b...
A geophysical insight of earthquake occurred on 21 st may 2014 off paradip, b...A geophysical insight of earthquake occurred on 21 st may 2014 off paradip, b...
A geophysical insight of earthquake occurred on 21 st may 2014 off paradip, b...
 
Effect of hudhud cyclone on the development of visakhapatnam as smart and gre...
Effect of hudhud cyclone on the development of visakhapatnam as smart and gre...Effect of hudhud cyclone on the development of visakhapatnam as smart and gre...
Effect of hudhud cyclone on the development of visakhapatnam as smart and gre...
 

Recently uploaded

OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...roncy bisnoi
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...ranjana rawat
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSRajkumarAkumalla
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Christo Ananth
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)simmis5
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)Suman Mia
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 

Recently uploaded (20)

OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)
 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 

Performance analysis of sobel edge filter on heterogeneous system using opencl

  • 1. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 __________________________________________________________________________________________ Volume: 03 Special Issue: 03 | May-2014 | NCRIET-2014, Available @ http://www.ijret.org 53 PERFORMANCE ANALYSIS OF SOBEL EDGE FILTER ON HETEROGENEOUS SYSTEM USING OPENCL Aruna Dore1, Sunitha Lasrado2 14th Sem M.Tech, VLSI Design and Embedded System, NMAMIT, Nitte, Karnataka, India 2Assistant Professor, Electronics and Communication, NMAMIT, Nitte, Karnataka, India Abstract The fundamental task required for any image or Video processing applications like video surveillance, medical imaging is Edge detection. Any of the filters available can be used to detect the edges. In this paper Sobel Edge filter is used for comparing the performance analysis on CPUs and GPUs and from this study it is found that the performance analysis on GPUs is much higher as compared to CPUs. Also it is seen that parallel execution time is much less as compared to sequential execution for the heterogeneous systems. Keywords: OpenCL, GPU, Convolution Filter ----------------------------------------------------------------------***------------------------------------------------------------------------ 1. INTRODUCTION OpenCL is a parallel framework for heterogeneous computations. It is a standard which is royalty free which is platform independent and is used to work on any kind of computational device. It is a programming language that allows the programmer to write one version of the code that can be executed virtually on any device that has OpenCL drivers .OpenCL consists of an API for coordinating parallel computations in heterogeneous computational environment and a cross–platform programming language based on C99[9]. CUDA (Compute Unified Device Architecture) [8] is a popular development tool for scientific GPU computing provided by the NVIDIA manufacturer for its GPU products. OpenCL programming model [9], supports cross-platform, parallel programming of heterogeneous processing systems while CUDA is not designed for heterogeneous systems. CUDA is a parallel computing framework designed only for NVIDIA‟s GPUs [3], and OpenCL is a standard designed for diverse platforms including CUDA-enabled GPUs, some ATI- GPUs, multi-core CPUs from Intel and AMD, and other processors such as the Cell Broadband Engine. Sobel Edge detection using CUDA gives the performance analysis of different sized images. In this paper OpenCL is used to detect edges for images of different sizes and we see that the performance of a GPU is greater than the performance got through CPU for which the explanation will be given in the next section. 2. OPENCL STANDARD 2.1 OpenCL Architecture Open Computing Language is a framework for writing programs that execute across heterogeneous platform consisting of Central Processing Units (CPUs), Graphics Processing Units (GPUs), Digital Signal Processor (DSPs), Field Programmable Gate arrays and other computing devices. The OpenCL architecture consists of following programming models:  Platform Model  Execution Model  Memory Model  Programming Model 2.1.1 Platform Model OpenCL platform model consists of host and „n‟ number of compute devices. The compute device consists of compute units which in turn consist of processing elements. This can be viewed as in Figure1.These processing elements execute code as Single Instruction Multiple Data (SIMD) or Single Program Multiple Data (SPMD). Fig -1: OpenCL Platform Model
  • 2. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 __________________________________________________________________________________________ Volume: 03 Special Issue: 03 | May-2014 | NCRIET-2014, Available @ http://www.ijret.org 54 2.1.2 Execution Model The OpenCL application is split into two parts kernel and compute device. The part that runs on the host is named host program and the part that runs on the compute devices is called a kernel. To facilitate parallel execution an index space is created when the kernel is submitted to the device for computation. An instance of the kernel is called work-item .The work-items are organized in an N-Dimensional Range (NDRange)[7] where N can be 1, 2, or 3. Work-items can be grouped into work-group shown in Figure.2. The host program defines the device context which includes program objects, memory objects and kernel function. The interaction between host and OpenCL device is controlled by the host program using command queues. These commands can be memory commands which control all memory transfers, kernel execution commands which submit the kernel code for execution on the devices and synchronization commands which control the order of command execution. Fig -2: NDRange Index Space 2.1.3 Memory Model The memory model of OpenCL includes Private memory, Local memory, Constant memory, and the Global memory. Host processor has host memory this memory is on CPU and accessible by the CPU. Compute device has global memory as well as constant memory. The compute device used in this work is GPU. The GPU architecture is discussed in next section. Fig -3: Memory Model of OpenCL [1] Global Memory: In this region all the work-item and work-group has both read and write access on both the compute device and the host. Constant Memory: This is Read only-Global memory accessible to all work-items. Local Memory: Each work-group has Local memory. Memory is shared within the work-group, i.e. work-items within the same work-group can access this memory region Private Memory: Memory visible to only work-item. 2.1.4 Programming Model Under the OpenCL [9] programming model, computation can be done in data parallel, task parallel, or a hybrid of these two models. 3. GRAPHIC PROCESSOR AND CENTRAL PROCESSING UNIT Modern GPUs are highly optimized and highly parallel computational units can outperform the traditional CPU both in terms of arithmetic operations and memory bandwidth.GPU uses hundreds [5] of simple cores in parallel to enhance performance. The performance of the CPU could hardly be increased anymore by increasing the clock frequency so they developed a CPU into which they installed several cores in order to increase performance. The comparison of the number of cores on CPU and GPU is shown in Figure 4. CPU Multiple cores GPU (hundreds of cores) Fig-4: Comparison of the number of cores on a CPU and a GPU
  • 3. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 __________________________________________________________________________________________ Volume: 03 Special Issue: 03 | May-2014 | NCRIET-2014, Available @ http://www.ijret.org 55 3.1 Graphics Processor Architecture GPUs comprise of set of parallel multi processors which are called as Streaming multi processors(SM). Each SM contains an instruction cache, an instruction queue, a warp-scheduler, registers, 32 CUDA cores and special-function units (SFUs) as shown in Figure.5. Fermi contains up to 448 general purpose arithmetic units known as Streaming Processors (SP) .64 Special Function Units (SFU) for computing special transcendental and algebraic functions not provided by the SPs. Groups of 32 SPs, 16 LDSTs, 4 SFUs, and 4 TEXs compose a Streaming Multiprocessor (SM). Each CUDA cores consists of single Arithmetic Logic Unit (ALU) and Floating Point Unit (FPU). Fig-5: Each Fermi SM includes 32 cores, 16 load/store units, four special-function units, a 32K-word register file, 64K of configurable RAM, and thread control logic. Each core has both floating-point and integer execution units. (Source: NVIDIA) 3. IMAGE CONVOLUTION Edge detection is a common image processing technique used in feature detection and extraction. Applying edge detection on an image can significantly reduce the amount of data needed to be processed at a later phase while maintaining the important structure of the image. The idea is to remove everything from the image except the pixels that are part of an edge. A simple edge detection algorithm is to apply the Sobel edge detection algorithm. It involves convolving the image using a filter. Sobel operator consists of a pair of 3×3 convolution kernels as shown in Figure 6. One kernel is simply the other rotated by 90°. Fig-6: Masks used by Sobel Operator [4] 3.1 Mathematical Representation of Convolution Convolution is a simple mathematical operation which is used by many common image processing operators. The convolution theorem specifies that the applying convolution is the same as a per-frequency multiplication in the frequency domain i.e. if the basis for both the convolution kernel and the image were to be changed to one that consists of simple sine and cosine functions by applying a discrete Fourier transform then we can take each of these components, multiply them and get the same result. This means that Fourier transform of the convolution kernel can be taken and the dampened frequencies can be seen (those having an amplitude<1), strengthen (>1) or leave unchanged (=1).A maximum amplitude value of one indicates that each of the different frequencies are independently attenuated, i.e. the frequency components in an image can be filtered out. The convolution of 2 functions f (t) and g (t) is denoted by (f * g) (t) .Convolution theorem gives the inverse Laplace transform of a product of two transformed functions [2]: L-1{F(s) G(s)} = (f * g) (t) (1) Let f (t) and g (t) be two functions of t. The convolution of f (t) and g (t) is also a function of t, denoted by (f * g) (t).And is defined by the relation However, if f and g are both causal functions then f (t) and g (t) are written as f (t) u (t) and g (t) u (t) respectively, so that Because of the properties of the step functions (u (t – x) = 0 if x>t and u(x) = 0 if x<0). Image convolution can be efficiently implemented on massively parallel hardware, since the same operator gets executed independently for each image pixel. Figure7 shows the convolution using a small 3 x 3 kernel.
  • 4. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 __________________________________________________________________________________________ Volume: 03 Special Issue: 03 | May-2014 | NCRIET-2014, Available @ http://www.ijret.org 56 A convolution kernel works by iterating over each pixel in the input image. For each source pixel, the filter is centred over the pixel and the values of the filter multiply the pixel values that they overlay. A sum of the products is then taken to produce a new pixel value. Fig- 7: Convolution using a 3 x 3 kernel Steps of Sobel Edge detection  Create Image buffer and Objects  Write the Input Data  Create Sampler Object  Compile and Execute the Kernel  Read the Result 4. EXPERIMENTAL RESULTS The main objective is to develop the OpenCL application that runs on any GPUs like AMD, Intel, and NVIDIA etc with less computation time and implementations of OpenCL are available for the operating systems like Mac OS X 10.6 and higher, Microsoft Windows and Linux. 4.1 Experimental Setup For this experimental set up Intel i7 processor and NVIDIA GTX 470 devices used. Detailed Technical specification is given in the table 1. The code is compiled using OpenCL 1.1 NVIDIA CUDA version Table -1: Device Technical Specification 4.2 Results The size of image is varied from 512 pixels to 2048 pixels. When image size is increased the serial program execution time considerably increases. Execution times in serial and parallel versions over CPU and GPU devices are presented in Table 2, 3 respectively and Graphs are plotted. Speed up for sobel filter is given in the Table 4. Table -2: Execution time of serial and parallel over CPU Table -3: Execution time of serial and parallel over GPU Table -4: Speed up for Sobel Filter with CPU and GPU Chart -1: CPU Speed up for Serial Vs Parallel Specifications GPU CPU Vendor NVIDIA GeForce GTX 470 INTEL Core i7 No. Of cores 448 8 Clock(MHz) 1215 3400 Global Memory(MB) 1280 2047 Local Memory(KB) 48 32 Max.Work-Item size 1024 x 1024 x 64 1024 x 1024 x 1024 Max.Work-Group size 1024 1024 Input Image T serial (sequential) (s) TCPU (parallel) (s) Speedup T serial/ TCPU 512x512 3.539 0.102 34.696 1024x1024 5.568 0.376 14.808 2048x2048 17.434 2.788 6.2532 Input Image T serial (sequential) (s) TGPU (parallel) (ms) Speedup Tserial/TGPU 512x512 3.539 0.0021 1685.238 1024x1024 5.568 0.011 506.181 2048x2048 17.434 0.0843 206.809 Input Image CPU Processing (ms) GPU Processing (ms) Speedup 512x512 102.001 2.1 48.5719 1024x1024 376.012 11.01 34.1518 2048x2048 2788.04 84.023 33.1818
  • 5. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308 __________________________________________________________________________________________ Volume: 03 Special Issue: 03 | May-2014 | NCRIET-2014, Available @ http://www.ijret.org 57 Chart -2: GPU Speed up for Serial Vs Parallel Speed up for Sobel with CPU and GPU 0 10 20 30 40 50 60 512x512 1024x1024 2592x3240 Input Image Processing Time (ms) Speed UP Chart -3: Speed up for Sobel with CPU and GPU Efficiency = Speedup/#.of cores Chart -4: Efficiency of CPU Vs GPU for Sobel filter 5. CONCLUSIONS In this work parallel programming of OpenCL studied. OpenCL is tool used for programming on GPU regardless of vendors. OpenCL provides an interface for the interaction of hosts with accelerator devices like GPUs, DSPs and FPGAs. Graphics Processing Units are highly useful in parallel computing. In this paper Sobel edge detection is implemented on GPU using OpenCL in C and compared with sequential execution and efficiency of CPU and GPU is discussed. As the input size is increased the OpenCL gives good performance. The technologies advances in FPGA devices offering hundreds GFLOPs (Giga Flip Flop per Seconds) with maximum power efficiency has made way for parallel processing community towards them. As a future expansion the use of FPGAs using OpenCL as programming language will be considered for comparison of speed ups with the CPUs, GPUs, and FPGAs REFERENCES [1]. Aaftab Munshi, Bendict R. Gaster, Timothy G. Mattson, “OpenCL Programming Guide,”Published by Pearson Inc.,2012 [2]. Anirud Pande,Rohit Chandna,”Matrix Convolution using Parallel Programming,” International Journal of Science and Reasearch (IJSR),India Volume 2 Issue 7, July 2013 [3]. Jia Tse, “Image Processing with CUDA,” Master‟s Thesis, University Of Nevada, Las Vegas, August2012 [4]. J inu Prabhakar.K.P , Anitha Mary.X,” FPGA Based Lane Deviation System Using System Generator” in International Journal of Advanced Research in Computer Science and Software Engineering ,Volume 3, Issue 2, February 2013 [5]. K. Fatahalian and M. Houston, “A closer look at GPUs,” Communications of the ACM, vol.51, No. 10, October 2008. [6]. Krishnahari Thouti and S.R.Sathe, “A Methodology for translating C-Programs to OpenCL,”, vol. 82, International Journal of Computer Applications(0975- 8887), 2013, pp. 11- 16. [7]. Matthew Scarpino,”OpenCL in Action,” by Manning Publications Co., 2012 [8]. Nvidia, OpenCL Programming Guide Version 2.3.pdf [9]. OpenCL, “The open standard for parallel programming of heterogeneous systems,” [Online]. Available: http://www.khronos.org/opencl. [10]. Tadeusz Puzniakowski, “Performance of OpenCL” The University of Gdansk, 2012