SlideShare a Scribd company logo
1 of 1
Download to read offline
IEEE 2005 Workshop on Signal Processing Systems (SIPS'05), November 2, Athens, Greece
Flexible Hardware Architecture for 2-D
Separable Convolution-based Scaling
There is not a single scaling technique that suites all kind of images (photo, CAD, Text...) the user is willing to print
or display. Formally, any convolution-based scaling operation can be decomposed in three steps: an anti-aliasing
filter, image reconstruction by continuous convolution and re-sampling to the final grid. Based on this, we propose
a flexible hardware-friendly discrete convolution engine operating a memory that stores a programmable 2D-
separable interpolation kernel. We also state a technique for optimizing the memory size given the kernel and the
scale factor. Finally, we describe a novel flexible filter that overcomes aliasing artifacts regardless of image
frequency content.
Jordi Arnabat and Francisco Cardells
Hewlett-Packard, Large-Format Technology Lab, Barcelona, Spain
{jordi.arnabat, francisco.cardells}@hp.com
Image Scaling
. Not a unique interpolation technique to
achieve good IQ for all types of images:
adaptable HW is the key to survival.
. Formally, scaling can be thought of as:
continuous reconstruction of the discrete
input and re-sampling at the output grid.
. Propose a flexible hardware built from
a classical convolution-based scaler,
where IQ is chosen by means of a
programmable kernel.
Filtering Stage
0 1 2
1 2 3 4
Y1
5 6 7 8

w2
1
w2
2
Y2
A B
1 2 3
A
B
Y1
Digital
interpolator
low pass
filter
low pass
filter
3 4 5
4 5 6
A
B
Y2
Digital
interpolator
low pass
filter
low pass
filter
0
Scaler Data Flow
. Downscaling implies a pre-filtering step
to remove frequencies not
representable in the output grid.
(aliasing)
(a) Moving Average
(b) Frequency-Sharpened CIC
(c) Multistage CIC
. Propose architecture to enable (a) & (b)
pre-filters.
. Wide range of interpolation
techniques: NN, bilinear, bicubic,
gaussian, …, yours!
. Complexity/latency of the hardware
is determined by the interpolation
function support.
. Resampling by means of shift-
variant FIR filter, of length = kernel
support
. Kernel shape can be programmed in
a memory by means of a LUT,
sampled at .
. As a design rule, any kernel shape
needs twice as many samples per
interval as the maximum scale
factor.
. For example, a scaler performing up
to 32x, using a 4 tabs support
kernel, 8-b word precision requires a
2Kb LUT. The datapath for this
interpolator requires 2.2 kgates.
bilinear
nearest neighbor
4
x
Interpolation
Pre-filtering
Conventional Scaling uses a hardwired set
of rules for upscaling and another for
downscaling.
Instead we build any scaling operation as a
flexible prefiltering + interpolation
this flexibility is required as there is not a
single best scaling algorithm for all kind of
images
Programmable Low-pass
FIR filter.
Cut-off frequency given by
downscale factor
Programmable
Continuous
Convolution
up-scaling
down-scaling
nearest neighbor bicubic
Interpolation, Kernel
Sampling
 
w
-1
w
0
w
1
w
2
W
W24W23W22W21
W14W13W12W11
...

Neighbor index ()
Programmable Interpolation Kernel
1/(1-Z -1 ) 1/(1-Z -1 )
1/(1-Z -1 ) R
R
(1-Z -1 ) 2
3
(1-Z -1 ) (1-Z -1 )
1/(1-Z -1 ) 1/(1-Z -1 )
1/(1-Z -1 ) R
R
(1-Z -1 ) 2
3
(1-Z -1 ) (1-Z -1 )
(a)
(b)
(a)
(b)
(c)
Down-scaling by a factor of 1.5 after (a) moving average and (b) frequency sharpened CIC
filter. Artifacts circled and images resized to aid direct comparison.
(a)
(b)
(original)
Frequency response of three different pre-filtering
schemes. (a) & (b) are combined into one flexible
architecture.
(a) Nearest neighbor (b) Bilinear interpolation
(c) B-spline order 3 (d) Keys’ bicubic a=-1/2 Interpolation by continuous convolution. Principles of operation.

k*D2
1 2 3 n
1 2 k
w
1
w
2
o[k]
Shape of the Interpolation kernel is sampled at a
given frequency (). Data (weights) is stored LUT-
wise in a memory.
. In down-scaling the low-pass filter does
not have to be applied to all the
incoming pixels.
. Instead only the base points for the
interpolation are pre-computed to
remove the aliasing frequencies.
. There must be a number of equivalent
serial low-pass filters equal to the
kernel support.

More Related Content

What's hot

Convolutional neural network
Convolutional neural networkConvolutional neural network
Convolutional neural networkItachi SK
 
International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)inventionjournals
 
[Paper] Multiscale Vision Transformers(MVit)
[Paper] Multiscale Vision Transformers(MVit)[Paper] Multiscale Vision Transformers(MVit)
[Paper] Multiscale Vision Transformers(MVit)Susang Kim
 
2019-06-14:7 - Neutral Network Compression
2019-06-14:7 - Neutral Network Compression2019-06-14:7 - Neutral Network Compression
2019-06-14:7 - Neutral Network Compressionuninfoit
 
High Performance Parallel Computing with Clouds and Cloud Technologies
High Performance Parallel Computing with Clouds and Cloud TechnologiesHigh Performance Parallel Computing with Clouds and Cloud Technologies
High Performance Parallel Computing with Clouds and Cloud Technologiesjaliyae
 
Neural network based image compression with lifting scheme and rlc
Neural network based image compression with lifting scheme and rlcNeural network based image compression with lifting scheme and rlc
Neural network based image compression with lifting scheme and rlceSAT Publishing House
 
"Semantic Segmentation for Scene Understanding: Algorithms and Implementation...
"Semantic Segmentation for Scene Understanding: Algorithms and Implementation..."Semantic Segmentation for Scene Understanding: Algorithms and Implementation...
"Semantic Segmentation for Scene Understanding: Algorithms and Implementation...Edge AI and Vision Alliance
 
Deep learning lecture - part 1 (basics, CNN)
Deep learning lecture - part 1 (basics, CNN)Deep learning lecture - part 1 (basics, CNN)
Deep learning lecture - part 1 (basics, CNN)SungminYou
 
Introduction to CNN
Introduction to CNNIntroduction to CNN
Introduction to CNNShuai Zhang
 
Scalable Parallel Computing on Clouds
Scalable Parallel Computing on CloudsScalable Parallel Computing on Clouds
Scalable Parallel Computing on CloudsThilina Gunarathne
 
Machine Learning - Convolutional Neural Network
Machine Learning - Convolutional Neural NetworkMachine Learning - Convolutional Neural Network
Machine Learning - Convolutional Neural NetworkRichard Kuo
 
DeepLab V3+: Encoder-Decoder with Atrous Separable Convolution for Semantic I...
DeepLab V3+: Encoder-Decoder with Atrous Separable Convolution for Semantic I...DeepLab V3+: Encoder-Decoder with Atrous Separable Convolution for Semantic I...
DeepLab V3+: Encoder-Decoder with Atrous Separable Convolution for Semantic I...Joonhyung Lee
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline
 
Image Compression Using Neural Network
 Image Compression Using Neural Network Image Compression Using Neural Network
Image Compression Using Neural NetworkOmkar Lokhande
 
AI On the Edge: Model Compression
AI On the Edge: Model CompressionAI On the Edge: Model Compression
AI On the Edge: Model CompressionApache MXNet
 
Convolutional neural network
Convolutional neural networkConvolutional neural network
Convolutional neural networkMojammilHusain
 
CONVOLUTIONAL NEURAL NETWORK
CONVOLUTIONAL NEURAL NETWORKCONVOLUTIONAL NEURAL NETWORK
CONVOLUTIONAL NEURAL NETWORKMd Rajib Bhuiyan
 

What's hot (20)

Cnn method
Cnn methodCnn method
Cnn method
 
Convolutional neural network
Convolutional neural networkConvolutional neural network
Convolutional neural network
 
International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)
 
[Paper] Multiscale Vision Transformers(MVit)
[Paper] Multiscale Vision Transformers(MVit)[Paper] Multiscale Vision Transformers(MVit)
[Paper] Multiscale Vision Transformers(MVit)
 
2019-06-14:7 - Neutral Network Compression
2019-06-14:7 - Neutral Network Compression2019-06-14:7 - Neutral Network Compression
2019-06-14:7 - Neutral Network Compression
 
High Performance Parallel Computing with Clouds and Cloud Technologies
High Performance Parallel Computing with Clouds and Cloud TechnologiesHigh Performance Parallel Computing with Clouds and Cloud Technologies
High Performance Parallel Computing with Clouds and Cloud Technologies
 
Neural network based image compression with lifting scheme and rlc
Neural network based image compression with lifting scheme and rlcNeural network based image compression with lifting scheme and rlc
Neural network based image compression with lifting scheme and rlc
 
"Semantic Segmentation for Scene Understanding: Algorithms and Implementation...
"Semantic Segmentation for Scene Understanding: Algorithms and Implementation..."Semantic Segmentation for Scene Understanding: Algorithms and Implementation...
"Semantic Segmentation for Scene Understanding: Algorithms and Implementation...
 
Deep learning lecture - part 1 (basics, CNN)
Deep learning lecture - part 1 (basics, CNN)Deep learning lecture - part 1 (basics, CNN)
Deep learning lecture - part 1 (basics, CNN)
 
Introduction to CNN
Introduction to CNNIntroduction to CNN
Introduction to CNN
 
Scalable Parallel Computing on Clouds
Scalable Parallel Computing on CloudsScalable Parallel Computing on Clouds
Scalable Parallel Computing on Clouds
 
Machine Learning - Convolutional Neural Network
Machine Learning - Convolutional Neural NetworkMachine Learning - Convolutional Neural Network
Machine Learning - Convolutional Neural Network
 
DeepLab V3+: Encoder-Decoder with Atrous Separable Convolution for Semantic I...
DeepLab V3+: Encoder-Decoder with Atrous Separable Convolution for Semantic I...DeepLab V3+: Encoder-Decoder with Atrous Separable Convolution for Semantic I...
DeepLab V3+: Encoder-Decoder with Atrous Separable Convolution for Semantic I...
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
Cnn
CnnCnn
Cnn
 
Image Compression Using Neural Network
 Image Compression Using Neural Network Image Compression Using Neural Network
Image Compression Using Neural Network
 
AI On the Edge: Model Compression
AI On the Edge: Model CompressionAI On the Edge: Model Compression
AI On the Edge: Model Compression
 
Convolutional neural network
Convolutional neural networkConvolutional neural network
Convolutional neural network
 
Tldr
TldrTldr
Tldr
 
CONVOLUTIONAL NEURAL NETWORK
CONVOLUTIONAL NEURAL NETWORKCONVOLUTIONAL NEURAL NETWORK
CONVOLUTIONAL NEURAL NETWORK
 

Similar to cbs_sips2005

Accelerating Real Time Applications on Heterogeneous Platforms
Accelerating Real Time Applications on Heterogeneous PlatformsAccelerating Real Time Applications on Heterogeneous Platforms
Accelerating Real Time Applications on Heterogeneous PlatformsIJMER
 
Comparison of different Fingerprint Compression Techniques
Comparison of different Fingerprint Compression TechniquesComparison of different Fingerprint Compression Techniques
Comparison of different Fingerprint Compression Techniquessipij
 
High Performance Medical Reconstruction Using Stream Programming Paradigms
High Performance Medical Reconstruction Using Stream Programming ParadigmsHigh Performance Medical Reconstruction Using Stream Programming Paradigms
High Performance Medical Reconstruction Using Stream Programming ParadigmsQuEST Global (erstwhile NeST Software)
 
Standardising the compressed representation of neural networks
Standardising the compressed representation of neural networksStandardising the compressed representation of neural networks
Standardising the compressed representation of neural networksFörderverein Technische Fakultät
 
Dynamic Texture Coding using Modified Haar Wavelet with CUDA
Dynamic Texture Coding using Modified Haar Wavelet with CUDADynamic Texture Coding using Modified Haar Wavelet with CUDA
Dynamic Texture Coding using Modified Haar Wavelet with CUDAIJERA Editor
 
PIPELINED ARCHITECTURE OF 2D-DCT, QUANTIZATION AND ZIGZAG PROCESS FOR JPEG IM...
PIPELINED ARCHITECTURE OF 2D-DCT, QUANTIZATION AND ZIGZAG PROCESS FOR JPEG IM...PIPELINED ARCHITECTURE OF 2D-DCT, QUANTIZATION AND ZIGZAG PROCESS FOR JPEG IM...
PIPELINED ARCHITECTURE OF 2D-DCT, QUANTIZATION AND ZIGZAG PROCESS FOR JPEG IM...VLSICS Design
 
APSys Presentation Final copy2
APSys Presentation Final copy2APSys Presentation Final copy2
APSys Presentation Final copy2Junli Gu
 
Pipelined Architecture of 2D-DCT, Quantization and ZigZag Process for JPEG Im...
Pipelined Architecture of 2D-DCT, Quantization and ZigZag Process for JPEG Im...Pipelined Architecture of 2D-DCT, Quantization and ZigZag Process for JPEG Im...
Pipelined Architecture of 2D-DCT, Quantization and ZigZag Process for JPEG Im...VLSICS Design
 
Threading Successes 06 Allegorithmic
Threading Successes 06   AllegorithmicThreading Successes 06   Allegorithmic
Threading Successes 06 Allegorithmicguest40fc7cd
 
FPGA Implementation of Multiplier-less CDF-5/3 Wavelet Transform for Image Pr...
FPGA Implementation of Multiplier-less CDF-5/3 Wavelet Transform for Image Pr...FPGA Implementation of Multiplier-less CDF-5/3 Wavelet Transform for Image Pr...
FPGA Implementation of Multiplier-less CDF-5/3 Wavelet Transform for Image Pr...IOSRJVSP
 
Performance analysis of sobel edge filter on heterogeneous system using opencl
Performance analysis of sobel edge filter on heterogeneous system using openclPerformance analysis of sobel edge filter on heterogeneous system using opencl
Performance analysis of sobel edge filter on heterogeneous system using opencleSAT Publishing House
 
Conference Paper: Universal Node: Towards a high-performance NFV environment
Conference Paper: Universal Node: Towards a high-performance NFV environmentConference Paper: Universal Node: Towards a high-performance NFV environment
Conference Paper: Universal Node: Towards a high-performance NFV environmentEricsson
 
DEEP LEARNING BASED BRAIN STROKE DETECTION
DEEP LEARNING BASED BRAIN STROKE DETECTIONDEEP LEARNING BASED BRAIN STROKE DETECTION
DEEP LEARNING BASED BRAIN STROKE DETECTIONIRJET Journal
 
Types Of Window Being Used For The Selected Granule
Types Of Window Being Used For The Selected GranuleTypes Of Window Being Used For The Selected Granule
Types Of Window Being Used For The Selected GranuleLeslie Lee
 
High Speed and Area Efficient 2D DWT Processor Based Image Compression
High Speed and Area Efficient 2D DWT Processor Based Image CompressionHigh Speed and Area Efficient 2D DWT Processor Based Image Compression
High Speed and Area Efficient 2D DWT Processor Based Image Compressionsipij
 
A Coarse-Grained Reconfigurable Wavelet Denoiser Exploiting the Multi-Dataflo...
A Coarse-Grained Reconfigurable Wavelet Denoiser Exploiting the Multi-Dataflo...A Coarse-Grained Reconfigurable Wavelet Denoiser Exploiting the Multi-Dataflo...
A Coarse-Grained Reconfigurable Wavelet Denoiser Exploiting the Multi-Dataflo...MDC_UNICA
 
A Survey on Image Processing using CNN in Deep Learning
A Survey on Image Processing using CNN in Deep LearningA Survey on Image Processing using CNN in Deep Learning
A Survey on Image Processing using CNN in Deep LearningIRJET Journal
 
Multi Processor Architecture for image processing
Multi Processor Architecture for image processingMulti Processor Architecture for image processing
Multi Processor Architecture for image processingideas2ignite
 

Similar to cbs_sips2005 (20)

Accelerating Real Time Applications on Heterogeneous Platforms
Accelerating Real Time Applications on Heterogeneous PlatformsAccelerating Real Time Applications on Heterogeneous Platforms
Accelerating Real Time Applications on Heterogeneous Platforms
 
Comparison of different Fingerprint Compression Techniques
Comparison of different Fingerprint Compression TechniquesComparison of different Fingerprint Compression Techniques
Comparison of different Fingerprint Compression Techniques
 
High Performance Medical Reconstruction Using Stream Programming Paradigms
High Performance Medical Reconstruction Using Stream Programming ParadigmsHigh Performance Medical Reconstruction Using Stream Programming Paradigms
High Performance Medical Reconstruction Using Stream Programming Paradigms
 
Standardising the compressed representation of neural networks
Standardising the compressed representation of neural networksStandardising the compressed representation of neural networks
Standardising the compressed representation of neural networks
 
imagefiltervhdl.pptx
imagefiltervhdl.pptximagefiltervhdl.pptx
imagefiltervhdl.pptx
 
Dynamic Texture Coding using Modified Haar Wavelet with CUDA
Dynamic Texture Coding using Modified Haar Wavelet with CUDADynamic Texture Coding using Modified Haar Wavelet with CUDA
Dynamic Texture Coding using Modified Haar Wavelet with CUDA
 
PIPELINED ARCHITECTURE OF 2D-DCT, QUANTIZATION AND ZIGZAG PROCESS FOR JPEG IM...
PIPELINED ARCHITECTURE OF 2D-DCT, QUANTIZATION AND ZIGZAG PROCESS FOR JPEG IM...PIPELINED ARCHITECTURE OF 2D-DCT, QUANTIZATION AND ZIGZAG PROCESS FOR JPEG IM...
PIPELINED ARCHITECTURE OF 2D-DCT, QUANTIZATION AND ZIGZAG PROCESS FOR JPEG IM...
 
APSys Presentation Final copy2
APSys Presentation Final copy2APSys Presentation Final copy2
APSys Presentation Final copy2
 
Pipelined Architecture of 2D-DCT, Quantization and ZigZag Process for JPEG Im...
Pipelined Architecture of 2D-DCT, Quantization and ZigZag Process for JPEG Im...Pipelined Architecture of 2D-DCT, Quantization and ZigZag Process for JPEG Im...
Pipelined Architecture of 2D-DCT, Quantization and ZigZag Process for JPEG Im...
 
Threading Successes 06 Allegorithmic
Threading Successes 06   AllegorithmicThreading Successes 06   Allegorithmic
Threading Successes 06 Allegorithmic
 
FPGA Implementation of Multiplier-less CDF-5/3 Wavelet Transform for Image Pr...
FPGA Implementation of Multiplier-less CDF-5/3 Wavelet Transform for Image Pr...FPGA Implementation of Multiplier-less CDF-5/3 Wavelet Transform for Image Pr...
FPGA Implementation of Multiplier-less CDF-5/3 Wavelet Transform for Image Pr...
 
B.tech_project_ppt.pptx
B.tech_project_ppt.pptxB.tech_project_ppt.pptx
B.tech_project_ppt.pptx
 
Performance analysis of sobel edge filter on heterogeneous system using opencl
Performance analysis of sobel edge filter on heterogeneous system using openclPerformance analysis of sobel edge filter on heterogeneous system using opencl
Performance analysis of sobel edge filter on heterogeneous system using opencl
 
Conference Paper: Universal Node: Towards a high-performance NFV environment
Conference Paper: Universal Node: Towards a high-performance NFV environmentConference Paper: Universal Node: Towards a high-performance NFV environment
Conference Paper: Universal Node: Towards a high-performance NFV environment
 
DEEP LEARNING BASED BRAIN STROKE DETECTION
DEEP LEARNING BASED BRAIN STROKE DETECTIONDEEP LEARNING BASED BRAIN STROKE DETECTION
DEEP LEARNING BASED BRAIN STROKE DETECTION
 
Types Of Window Being Used For The Selected Granule
Types Of Window Being Used For The Selected GranuleTypes Of Window Being Used For The Selected Granule
Types Of Window Being Used For The Selected Granule
 
High Speed and Area Efficient 2D DWT Processor Based Image Compression
High Speed and Area Efficient 2D DWT Processor Based Image CompressionHigh Speed and Area Efficient 2D DWT Processor Based Image Compression
High Speed and Area Efficient 2D DWT Processor Based Image Compression
 
A Coarse-Grained Reconfigurable Wavelet Denoiser Exploiting the Multi-Dataflo...
A Coarse-Grained Reconfigurable Wavelet Denoiser Exploiting the Multi-Dataflo...A Coarse-Grained Reconfigurable Wavelet Denoiser Exploiting the Multi-Dataflo...
A Coarse-Grained Reconfigurable Wavelet Denoiser Exploiting the Multi-Dataflo...
 
A Survey on Image Processing using CNN in Deep Learning
A Survey on Image Processing using CNN in Deep LearningA Survey on Image Processing using CNN in Deep Learning
A Survey on Image Processing using CNN in Deep Learning
 
Multi Processor Architecture for image processing
Multi Processor Architecture for image processingMulti Processor Architecture for image processing
Multi Processor Architecture for image processing
 

cbs_sips2005

  • 1. IEEE 2005 Workshop on Signal Processing Systems (SIPS'05), November 2, Athens, Greece Flexible Hardware Architecture for 2-D Separable Convolution-based Scaling There is not a single scaling technique that suites all kind of images (photo, CAD, Text...) the user is willing to print or display. Formally, any convolution-based scaling operation can be decomposed in three steps: an anti-aliasing filter, image reconstruction by continuous convolution and re-sampling to the final grid. Based on this, we propose a flexible hardware-friendly discrete convolution engine operating a memory that stores a programmable 2D- separable interpolation kernel. We also state a technique for optimizing the memory size given the kernel and the scale factor. Finally, we describe a novel flexible filter that overcomes aliasing artifacts regardless of image frequency content. Jordi Arnabat and Francisco Cardells Hewlett-Packard, Large-Format Technology Lab, Barcelona, Spain {jordi.arnabat, francisco.cardells}@hp.com Image Scaling . Not a unique interpolation technique to achieve good IQ for all types of images: adaptable HW is the key to survival. . Formally, scaling can be thought of as: continuous reconstruction of the discrete input and re-sampling at the output grid. . Propose a flexible hardware built from a classical convolution-based scaler, where IQ is chosen by means of a programmable kernel. Filtering Stage 0 1 2 1 2 3 4 Y1 5 6 7 8  w2 1 w2 2 Y2 A B 1 2 3 A B Y1 Digital interpolator low pass filter low pass filter 3 4 5 4 5 6 A B Y2 Digital interpolator low pass filter low pass filter 0 Scaler Data Flow . Downscaling implies a pre-filtering step to remove frequencies not representable in the output grid. (aliasing) (a) Moving Average (b) Frequency-Sharpened CIC (c) Multistage CIC . Propose architecture to enable (a) & (b) pre-filters. . Wide range of interpolation techniques: NN, bilinear, bicubic, gaussian, …, yours! . Complexity/latency of the hardware is determined by the interpolation function support. . Resampling by means of shift- variant FIR filter, of length = kernel support . Kernel shape can be programmed in a memory by means of a LUT, sampled at . . As a design rule, any kernel shape needs twice as many samples per interval as the maximum scale factor. . For example, a scaler performing up to 32x, using a 4 tabs support kernel, 8-b word precision requires a 2Kb LUT. The datapath for this interpolator requires 2.2 kgates. bilinear nearest neighbor 4 x Interpolation Pre-filtering Conventional Scaling uses a hardwired set of rules for upscaling and another for downscaling. Instead we build any scaling operation as a flexible prefiltering + interpolation this flexibility is required as there is not a single best scaling algorithm for all kind of images Programmable Low-pass FIR filter. Cut-off frequency given by downscale factor Programmable Continuous Convolution up-scaling down-scaling nearest neighbor bicubic Interpolation, Kernel Sampling   w -1 w 0 w 1 w 2 W W24W23W22W21 W14W13W12W11 ...  Neighbor index () Programmable Interpolation Kernel 1/(1-Z -1 ) 1/(1-Z -1 ) 1/(1-Z -1 ) R R (1-Z -1 ) 2 3 (1-Z -1 ) (1-Z -1 ) 1/(1-Z -1 ) 1/(1-Z -1 ) 1/(1-Z -1 ) R R (1-Z -1 ) 2 3 (1-Z -1 ) (1-Z -1 ) (a) (b) (a) (b) (c) Down-scaling by a factor of 1.5 after (a) moving average and (b) frequency sharpened CIC filter. Artifacts circled and images resized to aid direct comparison. (a) (b) (original) Frequency response of three different pre-filtering schemes. (a) & (b) are combined into one flexible architecture. (a) Nearest neighbor (b) Bilinear interpolation (c) B-spline order 3 (d) Keys’ bicubic a=-1/2 Interpolation by continuous convolution. Principles of operation.  k*D2 1 2 3 n 1 2 k w 1 w 2 o[k] Shape of the Interpolation kernel is sampled at a given frequency (). Data (weights) is stored LUT- wise in a memory. . In down-scaling the low-pass filter does not have to be applied to all the incoming pixels. . Instead only the base points for the interpolation are pre-computed to remove the aliasing frequencies. . There must be a number of equivalent serial low-pass filters equal to the kernel support.