SlideShare a Scribd company logo
1 of 6
Download to read offline
Aparna.M.P. Int. Journal of Engineering Research and Application www.ijera.com
ISSN : 2248-9622, Vol. 6, Issue 10, ( Part -2) October 2016, pp.101-106
www.ijera.com 101|P a g e
Energy Efficient Bit Extension Type Accelerator Chip for
Detection Algorithms
Aparna.M.P, Punitha.V
1
Mtech student, 2
Assistant professor,
Electronics and communication department, AWH Engineering college
ABSTRACT
This paper presents an energy efficient bit extension type accelerator chip that targets decoding tasks of
MIMO(Multiple input multiple output) - orthogonal frequency-division multiplexing (OFDM) systems. The
work is motivated by the adoption of MIMO and OFDM by almost all existing and emerging high-speed
wireless data communication systems. MIMO is an antenna technology for wireless communications in which
multiple antennas are used at both the source (transmitter) and the destination (receiver). MIMO decoder
or sometimes called MIMO equalizer detects or decodes or recovers the transmitted signals from multiple
antennas. MIMO decoding process for a certain application is hard and time consuming. This motivates the need
for a programmable accelerator block to implement the MIMO decoder task as fast and easy application. In this
paper proposing a new pipeline architecture in arithmetic units inside the processing core of accelerator chip.
The proposed architecture can perform with higher frequency with the help of pipeline structure and also
improving the speed of operation of rotation unit with a new arithmetic rotation unit instead of native CORDIC
algorithm. This proposed architecture helps to reduce dynamic power consumption. The accelerator is an ideal
solution for today’s smart phones that implement multiple MIMO-OFDM waveforms on the same platform.
Index Terms: MIMO, OFDM, CORDIC algorithm
I. INTRODUCTION
The main reasons why orthogonal frequency
division multiplexing (OFDM) was adopted in the
wireless communication are its high spectral
efficiency and ability to deal with frequency selective
fading and narrowband interference. The requirement
for wide bandwidth and flexibility imposes the use of
efficient transmission methods especially in wireless
environment where the channel is very challenging. In
wireless environment the signal is propagating from
the transmitter to the receiver along number of
different paths, collectively referred as multipath.
While propagating the signal power drops of due to
three effects: path loss, macroscopic fading and
microscopic fading. Fading of the signal can be
mitigated by different diversity techniques. To
obtain diversity, the signal is transmitted through
multiple (ideally) independent fading paths e.g. in
time, frequency or space and combined
constructively at the receiver. Multiple input-
multiple-output (MIMO) exploits spatial diversity by
having several transmit and receive antennas.
However ―MIMO principles‖ assumed frequency flat
fading MIMO channels. OFDM is modulation
method known for its capability to mitigate multipath.
In OFDM the high speed data stream is divided into
Nc narrowband data streams, Nc corresponding to the
subcarriers or subchannels i.e.one OFDM symbol
consists of N symbols modulated.
In MIMO system multiple antennas are
employed at the transmitter and the receiver. MIMO
transmits and receives two or more data streams
through a single radio channel. Thereby the system
can deliver two or more times the data rate per
channel without additional bandwidth or transmit
Power. In wireless environment the signal is
propagating from transmitter to receiver along number
of different paths collectively referred as multipath.
Propagating the signal power drops due to path loss
and fading. Fading of the signal can be mitigated by
different diversity techniques. To obtain diversity, the
signal is transmitted through ideally independent
fading paths. Example in time, frequency or space and
combined constructively at the receiver.
Fig 1 MIMO antenna system
MIMO operation requires parallel processing
for multiple data streams at the transmitter and, more
importantly, at the receiver where the MIMO decoder
RESEARCH ARTICLE OPEN ACCESS
Aparna.M.P. Int. Journal of Engineering Research and Application www.ijera.com
ISSN : 2248-9622, Vol. 6, Issue 10, ( Part -2) October 2016, pp.101-106
www.ijera.com 102|P a g e
is notorious for being one of the most processing-
intensive blocks. A MIMO decoder is the receiver
component that separates the Nss transmitted data
streams from the signals received on the Nrx receive
antennas. MIMO decoding operation is matrix and
vector intensive. A decoder design uses a single
MIMO decoding algorithm such as zero forcing (ZF),
minimum mean square error (MMSE), maximum
likelihood (ML), or one of the many sphere decoding
(SD) variants. A system designer chooses a single
MIMO decoding algorithm[1]. A hardware engineer
then implements the chosen algorithm with
constraints on complexity, performance, and power
consumption—considering the parallel processing
requirements for OFDM operation. This design cycle
is typically repeated for every new communication
standard. With new wireless communication standards
and new MIMO decoding algorithms emerging every
few years, existing systems need to be redesigned and
upgraded not only to meet the newly defined
standards, but also to allow integration of multiple
standards onto the same platform and improve
performance via more advanced decoding algorithms.
Multimode MIMO decoder can be designed
to target multiple communication standards. A
programmable MIMO decoder design using a
software-driven processor that employs several
general-purpose floating-point processing
units.MIMO decoder or sometimes called MIMO
equalizer detects or decodes or recovers the
transmitted signals from multiple antennas. MIMO
decoding process for a certain application is hard and
time consuming. This motivates the need for a
programmable accelerator block to implement the
MIMO decoder task as fast and easy application.
MIMO accelerator consist of processing
core, data memory, instruction memory, core-input
switch, controller, phase memory and memory input
switch. Processing core is the main element in
accelerator. Processing core consist of adder,
multiplier, rotation, and reciprocal unit. CORDIC is
used as rotation. In existing method, 32 bit processing
core is used. If we have 8,16 and 32 bit of operation,
32 bit data is given to the core and 32 bit operations
are done. Hence it take more dynamic power to
complete the process. The four processing units are
most power hungry blocks in the accelerator because
of the fact that they are constantly performing
complex matrix operations. This meant that for an
instruction, the four unit see, and effectively process,
both operands even though the result of only one of
the processing units is needed. This results in
unnecessary dynamic power consumption. To avoid
this unwanted power dissipation, additional hardware
was introduced.
II. PROPOSED BIT EXTENSION TYPE
ACCELERATOR
In this paper proposing a new pipeline
structure in arithmetic units inside processing core of
MIMO accelerator architecture. The proposed
architecture can perform with higher frequency with
the help of pipeline structure and also improving the
speed of operation of rotation unit with a new
arithmetic rotation unit instead of native CORDIC
algorithm. In the proposed method the 32 bit pc is
divided into 8bit lanes, each lane will perform 8 bit
operations. It can operate 8,16,32 bit or mix of these
according to the instruction selection. Hence
according to the selection, it can disable the unwanted
lanes. And also it can enable the needed part for the
operation. This will reduce the dynamic power which
is a major source of power consumption. CORDIC
algorithm was initially designed to perform a vector
rotation. And it is an iterative algorithm which can be
used for the computation of trigonometric functions,
multiplication and division. CORDIC perform
complex phase rotations, and which have an angle to
rotate the vector. Hence propose a new arithmetic
rotation unit instead of CORDIC. There by improving
the speed of operation of rotation unit and can rotate
vector by using arithmetic rotation unit easily. The
modified architecture is shown below.
Fig 1. Proposed bit extension accelerator
Aparna.M.P. Int. Journal of Engineering Research and Application www.ijera.com
ISSN : 2248-9622, Vol. 6, Issue 10, ( Part -2) October 2016, pp.101-106
www.ijera.com 103|P a g e
Fig 2 Elaboration of one processing core
This architecture uses a pipeline structure in
processing core unit. Processing core unit consist of
multiplier unit, rotation unit, reciprocal unit, and
adder unit. These all arithmetic unit are arranged in
pipeline manner. An extension unit is used in control
unit. For example: a 32 bit processing core was
equally divided into four 8 bit processing core and it
is connected like pipeline structure. According to the
incoming instruction bit, extension unit will decide
which one of the processing core will be operated. By
using this method, at a time all the four operations
have to be done. Processing steps described below.
Fetch instruction from instruction memory,
Then control unit decode the incoming instruction
into commands or control unit works by receiving
input instruction that it converts into control signals,
which are then sent to processor. The processor then
tells the attached hardware what operations to carry
out. Instruction selections from instruction memory
used to decide which of the arithmetic unit is needed
for the operation. And other units are kept disable.
Hence we can save power. Control unit used to give
address location to data memory and instruction
memory. Core-input switch is a two level
multiplexing circuit that select and properly arranges
the complex vectors needed by the processing core
and memory-output switch takes the output of
processing unit and packages them and write all data
into the appropriate memory location. The four units
are described below.
1) Adder Unit: Four lanes
(lane0,lane1,lane2,lane3) consist of adders. In 8bit
addition, each adder inside all the lanes is operated.
And in 16bit addition, two lanes (lane0 and lane2) are
considered at a time. Adder inside lane0 is operated
firstly then carry of first adder is propagated to second
adder inside lane1.finally we get output. And in 32bit
addition, carry of each adder propagates to next adder
inside the lane. Instruction enable is in the order of
(add_en , mul_en , rot, rec). When instruction enable
is 1000, and add enable is high that time adder will
conduct. Input to the adder is coming from data
memory. Input is a complex number. It have a real
and imaginary part. That is in the order of Re_acc0
&Im_acc0 – operand 1 and Re_acc1& Im_acc1 –
operand 2. Instruction selection also coming from
instruction memory. Instruction selection is 00 that
time 8 bit addition will conduct. Instruction selection
is 01, 16 bit is conducted and instruction selection is
10, that time 32 bit is conducted. And 11 is the invalid
case.
Fig 3 Flow chart of adder
2) Multiplier and Reciprocal Unit: Multiplication is done by dot product.
Aparna.M.P. Int. Journal of Engineering Research and Application www.ijera.com
ISSN : 2248-9622, Vol. 6, Issue 10, ( Part -2) October 2016, pp.101-106
www.ijera.com 104|P a g e
Fig 4 flow chart of multiplier and reciprocal unit
Incoming input to the multiplier is coming from data
memory.
When instruction enable is 0101 that time
multiplication and reciprocal are carried out. And
correspondingly mul enable and rec are kept in high.
Bit selection is same as the adder case.
3) Rotation Unit: when instruction enable is 0010, in
that time only rotation is workout. And also rotation
enable is kept high. Bit selection is same as in the
above case. Firstly we consider the real part and
imaginary part input from memory. In 8bit
rotation,8bit value of that input is rotated that means
LSB is rotated to MSB . In 16bit rotation, LSB bit of
the input of lane 1 is move to MSB bit of lane0. In
32bit rotation, LSB bit of the input of lane3 is move to
MSB of lane0.
Fig 5 Flow chart of rotation
Aparna.M.P. Int. Journal of Engineering Research and Application www.ijera.com
ISSN : 2248-9622, Vol. 6, Issue 10, ( Part -2) October 2016, pp.101-106
www.ijera.com 105|P a g e
III. RESULT ANALYSIS
The proposed method is compared with the
result of existing method shows there is better
improvement in power consumption, speed and
throughput. In the proposed method 32 bit processing
core is equally divided into four 8bit lanes. Hence it
can operate 8,16,32bit and mix of these according to
the incoming instruction selection. Hence according
to the selection, it can disable the unwanted lanes. and
also it can enable the needed part for the operation.
This will reduce the dynamic power which is a major
source of power consumption. But in the existing
system only 32bit processor is used. If it have 8bit
operation, 32bit data is given to the module and 32bit
operations are done. Hence there by occurs a huge
dynamic power. For an instruction, the four units see,
and effectively process, both operands even though
the result of only one of the processing unit is needed.
This results in unnecessary power consumption. In the
proposed method pipelining architecture is used inside
the ALU units and lanes too. Hence there by speed of
the operations can be increased. And also arithmetic
rotation unit is used instead of native CORDIC.
Hence also speed can be improved. In the existing
system, in arithmetic units, according to the incoming
instruction, arithmetic operations are done. Hence the
next instruction is coming only after the current
operation in done. Hence it is a time consuming
process and there will be a delay on next instruction,
hence speed of the operation also reduces. Latency is
the time interval between the arrival of input and
output whereas throughput is the time interval
between the arrival of consecutive inputs. Latency of
proposed system shows a better improvement than the
existing system.
In the above table we can observe that the accelerator
used in MMSE(Minimum mean square error ) have
166Mhz clock frequency and 300.9mw average power
consumption. And also by using SVD (singular value
decomposition) will get 166 Mhz clock frequency and
300.9mw average power consumption. And by using
proposed bit extension accelerator will get 924.556
Mhz clock frequency and 42mw average power
consumption. Hence we can analyse that the proposed
system will get reduced dynamic power consumption
and higher clock frequency compared to other system.
Hence speed of the operation also increased compared
to other device. Hence the proposed device is better
than other.
Table 1 Comparison with other ASIC designs
IV. CONCLUSION
This paper focus on the design of a new
accelerator chip for reducing power consumption and
improving speed. The overall architecture consists of
data memory, instruction memory and processing core
unit. Processing core unit consist of adder, rotation,
multiplier and reciprocal units. 32 bit core is equally
divided into four 8bit lanes such as lane0,lane1,lane2
and lane3. Data is coming from data memory that is a
complex value. They have a real part and imaginary
part. According to the incoming instruction from
instruction memory, the operation is done inside the
ALU. By using this method, it can disable the
unwanted parts for the operations. This will reduce
the dynamic power which is a major source of power
consumption. There by speed of the operations also
improved. And pipelining arithmetic units are used
inside the core. There by also speed is improved.
REFERENCES
[1]. Mohamed I. A. Mohamed, Karim Mohammed
and Babak Daneshrad ―Energy Efficient
Programmable MIMO Decoder Accelerator
Chip in 65-nm CMOS‖ IEEE transactions on
very large scale integration (vlsi)
systems,2013
[2]. A.J.Paulraj,D.A.Gore,R.U.Nabar,andH.Bolcskei,
―An overview of MIMO communications—
A key to gigabit wireless,‖ Proc. IEEE,vol.
92, no. 2, pp. 198–218, Feb. 2004
[3]. K. Mohammed and B. Daneshrad, ―A MIMO
decoder accelerator for next generation
wireless communications,‖ IEEE Trans.
Very Large Scale Integr. (VLSI) Syst., vol.
18, no. 11, pp. 1544–1555, Nov. 2010
[4]. C. Studer, P. Blosch, P. Friedli, and A. Burg,
―Matrix decomposition architecture for
MIMO systems: Design and implementation
tradeoffs,‖ in Proc. Conf. Rec. 41st Asilomar
Conf. ACSSC, Nov. 2007, pp. 1986–1990.
[5]. Gordon l. stüber, fellow, John r. barry,Steve w.
mclaughlin, ye (geoffrey) li,mary ann
ingram,and Thomas g. Pratt ―Broadband
MIMO-OFDM Wireless Communications‖
proceedings of the ieee, vol. 92, no. 2,
february 2004
[6]. A. Omri and R. Bouallegue ―New transmission
scheme for mimo-ofdm system‖
International Journal of Next-Generation
Networks (IJNGN) Vol.3, No.1, March 2011
[7]. Soohyun Jang1, Seongjoo Lee 2, and Yunho
Jung ―Low-Complexity and Low-Power
MIMO Symbol Detector for Mobile Devices
with Two TX/RX Antennas‖ journal of
semiconductor technology and science,
vol.15, no.2, april, 2015
[8]. H. S. Kim, W. Zhu, J. Bhatia, K. Mohammed, A.
Shah, and B. Daneshrad, ―A practical,
Aparna.M.P. Int. Journal of Engineering Research and Application www.ijera.com
ISSN : 2248-9622, Vol. 6, Issue 10, ( Part -2) October 2016, pp.101-106
www.ijera.com 106|P a g e
hardware friendly MMSE detector for
MIMO-OFDM based systems,‖ EURASIP J.
Adv. Signal Process.,vol. 2008, p. 94, Jan.
2008.
[9]. C.-J. Huang, C.-W. Yu, and H.-P. Ma, ―A
power-efficient configurable low-complexity
MIMO detector,‖ IEEE Trans. Circuits
Syst.I,Reg. Papers, vol. 56, no. 2, pp. 485–
496, Feb. 2009
[10].M. Ali, K. Mohammed, and B. Daneshrad,
―MIMO accelerator: A design flow for a
programmable MIMO decoder architecture,‖
inProc. Comput. Conf. Rec. 43rd Asilomar
Conf. Signals, Syst., Nov. 2009, pp. 1292–
1296

More Related Content

What's hot

Iaetsd a review on performance analysis of mimo-ofdm system based on dwt and ...
Iaetsd a review on performance analysis of mimo-ofdm system based on dwt and ...Iaetsd a review on performance analysis of mimo-ofdm system based on dwt and ...
Iaetsd a review on performance analysis of mimo-ofdm system based on dwt and ...
Iaetsd Iaetsd
 
Lte ausarbeitung zarei
Lte ausarbeitung zareiLte ausarbeitung zarei
Lte ausarbeitung zarei
mehemed sifaw
 
Development and Application of a Failure Monitoring System by Using the Vibra...
Development and Application of a Failure Monitoring System by Using the Vibra...Development and Application of a Failure Monitoring System by Using the Vibra...
Development and Application of a Failure Monitoring System by Using the Vibra...
inventy
 
An efficient technique for out-of-band power reduction for the eliminated CP...
An efficient technique for out-of-band power reduction for  the eliminated CP...An efficient technique for out-of-band power reduction for  the eliminated CP...
An efficient technique for out-of-band power reduction for the eliminated CP...
IJECEIAES
 

What's hot (19)

Hardware Implementation of OFDM system to reduce PAPR using Selective Level M...
Hardware Implementation of OFDM system to reduce PAPR using Selective Level M...Hardware Implementation of OFDM system to reduce PAPR using Selective Level M...
Hardware Implementation of OFDM system to reduce PAPR using Selective Level M...
 
IRJET-Review of Massive MIMO, Filter Bank Multi Carrier and Orthogonal Freque...
IRJET-Review of Massive MIMO, Filter Bank Multi Carrier and Orthogonal Freque...IRJET-Review of Massive MIMO, Filter Bank Multi Carrier and Orthogonal Freque...
IRJET-Review of Massive MIMO, Filter Bank Multi Carrier and Orthogonal Freque...
 
Iaetsd a review on performance analysis of mimo-ofdm system based on dwt and ...
Iaetsd a review on performance analysis of mimo-ofdm system based on dwt and ...Iaetsd a review on performance analysis of mimo-ofdm system based on dwt and ...
Iaetsd a review on performance analysis of mimo-ofdm system based on dwt and ...
 
Performance Analysis of BER and Throughput of Different MIMO Systems using Di...
Performance Analysis of BER and Throughput of Different MIMO Systems using Di...Performance Analysis of BER and Throughput of Different MIMO Systems using Di...
Performance Analysis of BER and Throughput of Different MIMO Systems using Di...
 
Performance Evaluation and Simulation of OFDM in Optical Communication Systems
Performance Evaluation and Simulation of OFDM in Optical Communication SystemsPerformance Evaluation and Simulation of OFDM in Optical Communication Systems
Performance Evaluation and Simulation of OFDM in Optical Communication Systems
 
Lte ausarbeitung zarei
Lte ausarbeitung zareiLte ausarbeitung zarei
Lte ausarbeitung zarei
 
Design of 4x4 optical MIMO using spatial Modulation
Design of 4x4 optical MIMO using spatial ModulationDesign of 4x4 optical MIMO using spatial Modulation
Design of 4x4 optical MIMO using spatial Modulation
 
Ew31992997
Ew31992997Ew31992997
Ew31992997
 
Performance Improvement of IEEE 802.22 WRAN Physical Layer
Performance Improvement of IEEE 802.22 WRAN Physical LayerPerformance Improvement of IEEE 802.22 WRAN Physical Layer
Performance Improvement of IEEE 802.22 WRAN Physical Layer
 
40120140503001 2-3
40120140503001 2-340120140503001 2-3
40120140503001 2-3
 
Development and Application of a Failure Monitoring System by Using the Vibra...
Development and Application of a Failure Monitoring System by Using the Vibra...Development and Application of a Failure Monitoring System by Using the Vibra...
Development and Application of a Failure Monitoring System by Using the Vibra...
 
IRJET- PAPR Reduction in UF-OFDM and F-OFDM 5G Systems using ZCT Precoding Te...
IRJET- PAPR Reduction in UF-OFDM and F-OFDM 5G Systems using ZCT Precoding Te...IRJET- PAPR Reduction in UF-OFDM and F-OFDM 5G Systems using ZCT Precoding Te...
IRJET- PAPR Reduction in UF-OFDM and F-OFDM 5G Systems using ZCT Precoding Te...
 
An efficient technique for out-of-band power reduction for the eliminated CP...
An efficient technique for out-of-band power reduction for  the eliminated CP...An efficient technique for out-of-band power reduction for  the eliminated CP...
An efficient technique for out-of-band power reduction for the eliminated CP...
 
Pwm technique to overcome the effect of
Pwm technique to overcome the effect ofPwm technique to overcome the effect of
Pwm technique to overcome the effect of
 
B033206014
B033206014B033206014
B033206014
 
analysis on design and implementation of 4×10 gb s wdm tdm pon with disparate...
analysis on design and implementation of 4×10 gb s wdm tdm pon with disparate...analysis on design and implementation of 4×10 gb s wdm tdm pon with disparate...
analysis on design and implementation of 4×10 gb s wdm tdm pon with disparate...
 
Beamforming
BeamformingBeamforming
Beamforming
 
Fp3410401046
Fp3410401046Fp3410401046
Fp3410401046
 
Performance Analysis of OSTBC MIMO Using Precoder with ZF & MMSE Equalizer
Performance Analysis of OSTBC MIMO Using Precoder with ZF & MMSE EqualizerPerformance Analysis of OSTBC MIMO Using Precoder with ZF & MMSE Equalizer
Performance Analysis of OSTBC MIMO Using Precoder with ZF & MMSE Equalizer
 

Viewers also liked

Biomimicry.pdf
Biomimicry.pdfBiomimicry.pdf
Biomimicry.pdf
Jy Chong
 
FINAL LOGO FLOWER 2 NO GRASS
FINAL LOGO FLOWER 2 NO GRASSFINAL LOGO FLOWER 2 NO GRASS
FINAL LOGO FLOWER 2 NO GRASS
Angela Campbell
 
Relacao nominal alunos premiados ee marcílio augusto pinto
Relacao nominal alunos premiados   ee marcílio augusto pintoRelacao nominal alunos premiados   ee marcílio augusto pinto
Relacao nominal alunos premiados ee marcílio augusto pinto
regiani36
 
Edgar allan poe
Edgar allan poeEdgar allan poe
Edgar allan poe
ivvv
 
Sizhunvezh ar brezhoneg - Semaine de la langue Bretonne
Sizhunvezh ar brezhoneg - Semaine de la langue BretonneSizhunvezh ar brezhoneg - Semaine de la langue Bretonne
Sizhunvezh ar brezhoneg - Semaine de la langue Bretonne
tiarvro
 

Viewers also liked (20)

Imagen de evento afrodit naturals
Imagen de evento afrodit naturalsImagen de evento afrodit naturals
Imagen de evento afrodit naturals
 
Yahoo Italia
Yahoo ItaliaYahoo Italia
Yahoo Italia
 
DIPLOMA.PDF
DIPLOMA.PDFDIPLOMA.PDF
DIPLOMA.PDF
 
Fútbol sala fbs
Fútbol  sala   fbsFútbol  sala   fbs
Fútbol sala fbs
 
2008
20082008
2008
 
O santuário e a justiça de jesus cristo
O santuário e a justiça de jesus cristoO santuário e a justiça de jesus cristo
O santuário e a justiça de jesus cristo
 
Biomimicry.pdf
Biomimicry.pdfBiomimicry.pdf
Biomimicry.pdf
 
Verónica.edmodo
Verónica.edmodoVerónica.edmodo
Verónica.edmodo
 
FINAL LOGO FLOWER 2 NO GRASS
FINAL LOGO FLOWER 2 NO GRASSFINAL LOGO FLOWER 2 NO GRASS
FINAL LOGO FLOWER 2 NO GRASS
 
skydrive_word_doc
skydrive_word_docskydrive_word_doc
skydrive_word_doc
 
Relacao nominal alunos premiados ee marcílio augusto pinto
Relacao nominal alunos premiados   ee marcílio augusto pintoRelacao nominal alunos premiados   ee marcílio augusto pinto
Relacao nominal alunos premiados ee marcílio augusto pinto
 
Dieta
DietaDieta
Dieta
 
Festes de la Mercè Almacelles 2012
Festes de la Mercè Almacelles 2012Festes de la Mercè Almacelles 2012
Festes de la Mercè Almacelles 2012
 
Blogs organizacionais: conversação, texto e gestão do conhecimento
Blogs organizacionais: conversação, texto e gestão do conhecimentoBlogs organizacionais: conversação, texto e gestão do conhecimento
Blogs organizacionais: conversação, texto e gestão do conhecimento
 
Edgar allan poe
Edgar allan poeEdgar allan poe
Edgar allan poe
 
Autobiografia calderon
Autobiografia calderonAutobiografia calderon
Autobiografia calderon
 
Sizhunvezh ar brezhoneg - Semaine de la langue Bretonne
Sizhunvezh ar brezhoneg - Semaine de la langue BretonneSizhunvezh ar brezhoneg - Semaine de la langue Bretonne
Sizhunvezh ar brezhoneg - Semaine de la langue Bretonne
 
Taller educació en valors desde el respeto.
Taller educació en valors desde el respeto.Taller educació en valors desde el respeto.
Taller educació en valors desde el respeto.
 
unidad 2
unidad 2unidad 2
unidad 2
 
Presentación natali Acosta. circuitos electricos
Presentación natali Acosta. circuitos electricosPresentación natali Acosta. circuitos electricos
Presentación natali Acosta. circuitos electricos
 

Similar to Energy Efficient Bit Extension Type Accelerator Chip for Detection Algorithms

Design and memory optimization of hybrid gate diffusion input numerical contr...
Design and memory optimization of hybrid gate diffusion input numerical contr...Design and memory optimization of hybrid gate diffusion input numerical contr...
Design and memory optimization of hybrid gate diffusion input numerical contr...
International Journal of Reconfigurable and Embedded Systems
 
Mimo ofdm by abhishek pandey
Mimo ofdm by abhishek pandeyMimo ofdm by abhishek pandey
Mimo ofdm by abhishek pandey
abhi29513
 
Iaetsd iterative mmse-pic detection algorithm for
Iaetsd iterative mmse-pic detection  algorithm forIaetsd iterative mmse-pic detection  algorithm for
Iaetsd iterative mmse-pic detection algorithm for
Iaetsd Iaetsd
 

Similar to Energy Efficient Bit Extension Type Accelerator Chip for Detection Algorithms (20)

VLSI Implementation of OFDM Transceiver for 802.11n systems
VLSI Implementation of OFDM Transceiver for 802.11n systemsVLSI Implementation of OFDM Transceiver for 802.11n systems
VLSI Implementation of OFDM Transceiver for 802.11n systems
 
Wireless Sensor Networks
Wireless Sensor NetworksWireless Sensor Networks
Wireless Sensor Networks
 
Error Control and performance Analysis of MIMO-OFDM Over Fading Channels
Error Control and performance Analysis of MIMO-OFDM Over Fading ChannelsError Control and performance Analysis of MIMO-OFDM Over Fading Channels
Error Control and performance Analysis of MIMO-OFDM Over Fading Channels
 
Performance Analysis of MIMO-LTE for MQAM over Fading Channels
Performance Analysis of MIMO-LTE for MQAM over Fading ChannelsPerformance Analysis of MIMO-LTE for MQAM over Fading Channels
Performance Analysis of MIMO-LTE for MQAM over Fading Channels
 
International Journal of Engineering Inventions (IJEI)
International Journal of Engineering Inventions (IJEI)International Journal of Engineering Inventions (IJEI)
International Journal of Engineering Inventions (IJEI)
 
IRJET- A Novel Technique for the Transmission of Digital Data through Ban...
IRJET-  	  A Novel Technique for the Transmission of Digital Data through Ban...IRJET-  	  A Novel Technique for the Transmission of Digital Data through Ban...
IRJET- A Novel Technique for the Transmission of Digital Data through Ban...
 
MIMO OFDM
MIMO OFDMMIMO OFDM
MIMO OFDM
 
IRJET- Power Line Carrier Communication
IRJET- Power Line Carrier CommunicationIRJET- Power Line Carrier Communication
IRJET- Power Line Carrier Communication
 
Design and memory optimization of hybrid gate diffusion input numerical contr...
Design and memory optimization of hybrid gate diffusion input numerical contr...Design and memory optimization of hybrid gate diffusion input numerical contr...
Design and memory optimization of hybrid gate diffusion input numerical contr...
 
Performance Analysis of 802.lln MIMO OFDM Transceiver
Performance Analysis of 802.lln MIMO OFDM TransceiverPerformance Analysis of 802.lln MIMO OFDM Transceiver
Performance Analysis of 802.lln MIMO OFDM Transceiver
 
Performance Evaluation of MC-CDMA for Fixed WiMAX with Equalization
Performance Evaluation of MC-CDMA for Fixed WiMAX with EqualizationPerformance Evaluation of MC-CDMA for Fixed WiMAX with Equalization
Performance Evaluation of MC-CDMA for Fixed WiMAX with Equalization
 
A New Bit Split and Interleaved Channel Coding for MIMO Decoder
A New Bit Split and Interleaved Channel Coding for MIMO DecoderA New Bit Split and Interleaved Channel Coding for MIMO Decoder
A New Bit Split and Interleaved Channel Coding for MIMO Decoder
 
Mimo [new]
Mimo [new]Mimo [new]
Mimo [new]
 
BARC Report
BARC ReportBARC Report
BARC Report
 
EC8004 WIRELESS NETWORKS UNIT 5 - 4G TECHNOLOGIES
EC8004 WIRELESS NETWORKS UNIT 5 - 4G TECHNOLOGIESEC8004 WIRELESS NETWORKS UNIT 5 - 4G TECHNOLOGIES
EC8004 WIRELESS NETWORKS UNIT 5 - 4G TECHNOLOGIES
 
EC8004 WIRELESS NETWORKS UNIT 5 - 4G TECHNOLOGIES
EC8004 WIRELESS NETWORKS UNIT 5 - 4G TECHNOLOGIESEC8004 WIRELESS NETWORKS UNIT 5 - 4G TECHNOLOGIES
EC8004 WIRELESS NETWORKS UNIT 5 - 4G TECHNOLOGIES
 
Mimo ofdm by abhishek pandey
Mimo ofdm by abhishek pandeyMimo ofdm by abhishek pandey
Mimo ofdm by abhishek pandey
 
Iaetsd iterative mmse-pic detection algorithm for
Iaetsd iterative mmse-pic detection  algorithm forIaetsd iterative mmse-pic detection  algorithm for
Iaetsd iterative mmse-pic detection algorithm for
 
Ijetcas14 396
Ijetcas14 396Ijetcas14 396
Ijetcas14 396
 
A Review on Transmit Antenna Selection for Massive MIMO Systems
A Review on Transmit Antenna Selection for Massive MIMO SystemsA Review on Transmit Antenna Selection for Massive MIMO Systems
A Review on Transmit Antenna Selection for Massive MIMO Systems
 

Recently uploaded

Verification of thevenin's theorem for BEEE Lab (1).pptx
Verification of thevenin's theorem for BEEE Lab (1).pptxVerification of thevenin's theorem for BEEE Lab (1).pptx
Verification of thevenin's theorem for BEEE Lab (1).pptx
chumtiyababu
 
DeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakesDeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakes
MayuraD1
 
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills KuwaitKuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
jaanualu31
 

Recently uploaded (20)

data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 
A Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna MunicipalityA Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna Municipality
 
AIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech studentsAIRCANVAS[1].pdf mini project for btech students
AIRCANVAS[1].pdf mini project for btech students
 
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptxA CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
A CASE STUDY ON CERAMIC INDUSTRY OF BANGLADESH.pptx
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdf
 
Online electricity billing project report..pdf
Online electricity billing project report..pdfOnline electricity billing project report..pdf
Online electricity billing project report..pdf
 
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
 
kiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal loadkiln thermal load.pptx kiln tgermal load
kiln thermal load.pptx kiln tgermal load
 
Verification of thevenin's theorem for BEEE Lab (1).pptx
Verification of thevenin's theorem for BEEE Lab (1).pptxVerification of thevenin's theorem for BEEE Lab (1).pptx
Verification of thevenin's theorem for BEEE Lab (1).pptx
 
DeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakesDeepFakes presentation : brief idea of DeepFakes
DeepFakes presentation : brief idea of DeepFakes
 
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLEGEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
 
Computer Networks Basics of Network Devices
Computer Networks  Basics of Network DevicesComputer Networks  Basics of Network Devices
Computer Networks Basics of Network Devices
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptxHOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
 
Moment Distribution Method For Btech Civil
Moment Distribution Method For Btech CivilMoment Distribution Method For Btech Civil
Moment Distribution Method For Btech Civil
 
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills KuwaitKuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
Kuwait City MTP kit ((+919101817206)) Buy Abortion Pills Kuwait
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPT
 
Introduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaIntroduction to Serverless with AWS Lambda
Introduction to Serverless with AWS Lambda
 
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
 
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxS1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
 

Energy Efficient Bit Extension Type Accelerator Chip for Detection Algorithms

  • 1. Aparna.M.P. Int. Journal of Engineering Research and Application www.ijera.com ISSN : 2248-9622, Vol. 6, Issue 10, ( Part -2) October 2016, pp.101-106 www.ijera.com 101|P a g e Energy Efficient Bit Extension Type Accelerator Chip for Detection Algorithms Aparna.M.P, Punitha.V 1 Mtech student, 2 Assistant professor, Electronics and communication department, AWH Engineering college ABSTRACT This paper presents an energy efficient bit extension type accelerator chip that targets decoding tasks of MIMO(Multiple input multiple output) - orthogonal frequency-division multiplexing (OFDM) systems. The work is motivated by the adoption of MIMO and OFDM by almost all existing and emerging high-speed wireless data communication systems. MIMO is an antenna technology for wireless communications in which multiple antennas are used at both the source (transmitter) and the destination (receiver). MIMO decoder or sometimes called MIMO equalizer detects or decodes or recovers the transmitted signals from multiple antennas. MIMO decoding process for a certain application is hard and time consuming. This motivates the need for a programmable accelerator block to implement the MIMO decoder task as fast and easy application. In this paper proposing a new pipeline architecture in arithmetic units inside the processing core of accelerator chip. The proposed architecture can perform with higher frequency with the help of pipeline structure and also improving the speed of operation of rotation unit with a new arithmetic rotation unit instead of native CORDIC algorithm. This proposed architecture helps to reduce dynamic power consumption. The accelerator is an ideal solution for today’s smart phones that implement multiple MIMO-OFDM waveforms on the same platform. Index Terms: MIMO, OFDM, CORDIC algorithm I. INTRODUCTION The main reasons why orthogonal frequency division multiplexing (OFDM) was adopted in the wireless communication are its high spectral efficiency and ability to deal with frequency selective fading and narrowband interference. The requirement for wide bandwidth and flexibility imposes the use of efficient transmission methods especially in wireless environment where the channel is very challenging. In wireless environment the signal is propagating from the transmitter to the receiver along number of different paths, collectively referred as multipath. While propagating the signal power drops of due to three effects: path loss, macroscopic fading and microscopic fading. Fading of the signal can be mitigated by different diversity techniques. To obtain diversity, the signal is transmitted through multiple (ideally) independent fading paths e.g. in time, frequency or space and combined constructively at the receiver. Multiple input- multiple-output (MIMO) exploits spatial diversity by having several transmit and receive antennas. However ―MIMO principles‖ assumed frequency flat fading MIMO channels. OFDM is modulation method known for its capability to mitigate multipath. In OFDM the high speed data stream is divided into Nc narrowband data streams, Nc corresponding to the subcarriers or subchannels i.e.one OFDM symbol consists of N symbols modulated. In MIMO system multiple antennas are employed at the transmitter and the receiver. MIMO transmits and receives two or more data streams through a single radio channel. Thereby the system can deliver two or more times the data rate per channel without additional bandwidth or transmit Power. In wireless environment the signal is propagating from transmitter to receiver along number of different paths collectively referred as multipath. Propagating the signal power drops due to path loss and fading. Fading of the signal can be mitigated by different diversity techniques. To obtain diversity, the signal is transmitted through ideally independent fading paths. Example in time, frequency or space and combined constructively at the receiver. Fig 1 MIMO antenna system MIMO operation requires parallel processing for multiple data streams at the transmitter and, more importantly, at the receiver where the MIMO decoder RESEARCH ARTICLE OPEN ACCESS
  • 2. Aparna.M.P. Int. Journal of Engineering Research and Application www.ijera.com ISSN : 2248-9622, Vol. 6, Issue 10, ( Part -2) October 2016, pp.101-106 www.ijera.com 102|P a g e is notorious for being one of the most processing- intensive blocks. A MIMO decoder is the receiver component that separates the Nss transmitted data streams from the signals received on the Nrx receive antennas. MIMO decoding operation is matrix and vector intensive. A decoder design uses a single MIMO decoding algorithm such as zero forcing (ZF), minimum mean square error (MMSE), maximum likelihood (ML), or one of the many sphere decoding (SD) variants. A system designer chooses a single MIMO decoding algorithm[1]. A hardware engineer then implements the chosen algorithm with constraints on complexity, performance, and power consumption—considering the parallel processing requirements for OFDM operation. This design cycle is typically repeated for every new communication standard. With new wireless communication standards and new MIMO decoding algorithms emerging every few years, existing systems need to be redesigned and upgraded not only to meet the newly defined standards, but also to allow integration of multiple standards onto the same platform and improve performance via more advanced decoding algorithms. Multimode MIMO decoder can be designed to target multiple communication standards. A programmable MIMO decoder design using a software-driven processor that employs several general-purpose floating-point processing units.MIMO decoder or sometimes called MIMO equalizer detects or decodes or recovers the transmitted signals from multiple antennas. MIMO decoding process for a certain application is hard and time consuming. This motivates the need for a programmable accelerator block to implement the MIMO decoder task as fast and easy application. MIMO accelerator consist of processing core, data memory, instruction memory, core-input switch, controller, phase memory and memory input switch. Processing core is the main element in accelerator. Processing core consist of adder, multiplier, rotation, and reciprocal unit. CORDIC is used as rotation. In existing method, 32 bit processing core is used. If we have 8,16 and 32 bit of operation, 32 bit data is given to the core and 32 bit operations are done. Hence it take more dynamic power to complete the process. The four processing units are most power hungry blocks in the accelerator because of the fact that they are constantly performing complex matrix operations. This meant that for an instruction, the four unit see, and effectively process, both operands even though the result of only one of the processing units is needed. This results in unnecessary dynamic power consumption. To avoid this unwanted power dissipation, additional hardware was introduced. II. PROPOSED BIT EXTENSION TYPE ACCELERATOR In this paper proposing a new pipeline structure in arithmetic units inside processing core of MIMO accelerator architecture. The proposed architecture can perform with higher frequency with the help of pipeline structure and also improving the speed of operation of rotation unit with a new arithmetic rotation unit instead of native CORDIC algorithm. In the proposed method the 32 bit pc is divided into 8bit lanes, each lane will perform 8 bit operations. It can operate 8,16,32 bit or mix of these according to the instruction selection. Hence according to the selection, it can disable the unwanted lanes. And also it can enable the needed part for the operation. This will reduce the dynamic power which is a major source of power consumption. CORDIC algorithm was initially designed to perform a vector rotation. And it is an iterative algorithm which can be used for the computation of trigonometric functions, multiplication and division. CORDIC perform complex phase rotations, and which have an angle to rotate the vector. Hence propose a new arithmetic rotation unit instead of CORDIC. There by improving the speed of operation of rotation unit and can rotate vector by using arithmetic rotation unit easily. The modified architecture is shown below. Fig 1. Proposed bit extension accelerator
  • 3. Aparna.M.P. Int. Journal of Engineering Research and Application www.ijera.com ISSN : 2248-9622, Vol. 6, Issue 10, ( Part -2) October 2016, pp.101-106 www.ijera.com 103|P a g e Fig 2 Elaboration of one processing core This architecture uses a pipeline structure in processing core unit. Processing core unit consist of multiplier unit, rotation unit, reciprocal unit, and adder unit. These all arithmetic unit are arranged in pipeline manner. An extension unit is used in control unit. For example: a 32 bit processing core was equally divided into four 8 bit processing core and it is connected like pipeline structure. According to the incoming instruction bit, extension unit will decide which one of the processing core will be operated. By using this method, at a time all the four operations have to be done. Processing steps described below. Fetch instruction from instruction memory, Then control unit decode the incoming instruction into commands or control unit works by receiving input instruction that it converts into control signals, which are then sent to processor. The processor then tells the attached hardware what operations to carry out. Instruction selections from instruction memory used to decide which of the arithmetic unit is needed for the operation. And other units are kept disable. Hence we can save power. Control unit used to give address location to data memory and instruction memory. Core-input switch is a two level multiplexing circuit that select and properly arranges the complex vectors needed by the processing core and memory-output switch takes the output of processing unit and packages them and write all data into the appropriate memory location. The four units are described below. 1) Adder Unit: Four lanes (lane0,lane1,lane2,lane3) consist of adders. In 8bit addition, each adder inside all the lanes is operated. And in 16bit addition, two lanes (lane0 and lane2) are considered at a time. Adder inside lane0 is operated firstly then carry of first adder is propagated to second adder inside lane1.finally we get output. And in 32bit addition, carry of each adder propagates to next adder inside the lane. Instruction enable is in the order of (add_en , mul_en , rot, rec). When instruction enable is 1000, and add enable is high that time adder will conduct. Input to the adder is coming from data memory. Input is a complex number. It have a real and imaginary part. That is in the order of Re_acc0 &Im_acc0 – operand 1 and Re_acc1& Im_acc1 – operand 2. Instruction selection also coming from instruction memory. Instruction selection is 00 that time 8 bit addition will conduct. Instruction selection is 01, 16 bit is conducted and instruction selection is 10, that time 32 bit is conducted. And 11 is the invalid case. Fig 3 Flow chart of adder 2) Multiplier and Reciprocal Unit: Multiplication is done by dot product.
  • 4. Aparna.M.P. Int. Journal of Engineering Research and Application www.ijera.com ISSN : 2248-9622, Vol. 6, Issue 10, ( Part -2) October 2016, pp.101-106 www.ijera.com 104|P a g e Fig 4 flow chart of multiplier and reciprocal unit Incoming input to the multiplier is coming from data memory. When instruction enable is 0101 that time multiplication and reciprocal are carried out. And correspondingly mul enable and rec are kept in high. Bit selection is same as the adder case. 3) Rotation Unit: when instruction enable is 0010, in that time only rotation is workout. And also rotation enable is kept high. Bit selection is same as in the above case. Firstly we consider the real part and imaginary part input from memory. In 8bit rotation,8bit value of that input is rotated that means LSB is rotated to MSB . In 16bit rotation, LSB bit of the input of lane 1 is move to MSB bit of lane0. In 32bit rotation, LSB bit of the input of lane3 is move to MSB of lane0. Fig 5 Flow chart of rotation
  • 5. Aparna.M.P. Int. Journal of Engineering Research and Application www.ijera.com ISSN : 2248-9622, Vol. 6, Issue 10, ( Part -2) October 2016, pp.101-106 www.ijera.com 105|P a g e III. RESULT ANALYSIS The proposed method is compared with the result of existing method shows there is better improvement in power consumption, speed and throughput. In the proposed method 32 bit processing core is equally divided into four 8bit lanes. Hence it can operate 8,16,32bit and mix of these according to the incoming instruction selection. Hence according to the selection, it can disable the unwanted lanes. and also it can enable the needed part for the operation. This will reduce the dynamic power which is a major source of power consumption. But in the existing system only 32bit processor is used. If it have 8bit operation, 32bit data is given to the module and 32bit operations are done. Hence there by occurs a huge dynamic power. For an instruction, the four units see, and effectively process, both operands even though the result of only one of the processing unit is needed. This results in unnecessary power consumption. In the proposed method pipelining architecture is used inside the ALU units and lanes too. Hence there by speed of the operations can be increased. And also arithmetic rotation unit is used instead of native CORDIC. Hence also speed can be improved. In the existing system, in arithmetic units, according to the incoming instruction, arithmetic operations are done. Hence the next instruction is coming only after the current operation in done. Hence it is a time consuming process and there will be a delay on next instruction, hence speed of the operation also reduces. Latency is the time interval between the arrival of input and output whereas throughput is the time interval between the arrival of consecutive inputs. Latency of proposed system shows a better improvement than the existing system. In the above table we can observe that the accelerator used in MMSE(Minimum mean square error ) have 166Mhz clock frequency and 300.9mw average power consumption. And also by using SVD (singular value decomposition) will get 166 Mhz clock frequency and 300.9mw average power consumption. And by using proposed bit extension accelerator will get 924.556 Mhz clock frequency and 42mw average power consumption. Hence we can analyse that the proposed system will get reduced dynamic power consumption and higher clock frequency compared to other system. Hence speed of the operation also increased compared to other device. Hence the proposed device is better than other. Table 1 Comparison with other ASIC designs IV. CONCLUSION This paper focus on the design of a new accelerator chip for reducing power consumption and improving speed. The overall architecture consists of data memory, instruction memory and processing core unit. Processing core unit consist of adder, rotation, multiplier and reciprocal units. 32 bit core is equally divided into four 8bit lanes such as lane0,lane1,lane2 and lane3. Data is coming from data memory that is a complex value. They have a real part and imaginary part. According to the incoming instruction from instruction memory, the operation is done inside the ALU. By using this method, it can disable the unwanted parts for the operations. This will reduce the dynamic power which is a major source of power consumption. There by speed of the operations also improved. And pipelining arithmetic units are used inside the core. There by also speed is improved. REFERENCES [1]. Mohamed I. A. Mohamed, Karim Mohammed and Babak Daneshrad ―Energy Efficient Programmable MIMO Decoder Accelerator Chip in 65-nm CMOS‖ IEEE transactions on very large scale integration (vlsi) systems,2013 [2]. A.J.Paulraj,D.A.Gore,R.U.Nabar,andH.Bolcskei, ―An overview of MIMO communications— A key to gigabit wireless,‖ Proc. IEEE,vol. 92, no. 2, pp. 198–218, Feb. 2004 [3]. K. Mohammed and B. Daneshrad, ―A MIMO decoder accelerator for next generation wireless communications,‖ IEEE Trans. Very Large Scale Integr. (VLSI) Syst., vol. 18, no. 11, pp. 1544–1555, Nov. 2010 [4]. C. Studer, P. Blosch, P. Friedli, and A. Burg, ―Matrix decomposition architecture for MIMO systems: Design and implementation tradeoffs,‖ in Proc. Conf. Rec. 41st Asilomar Conf. ACSSC, Nov. 2007, pp. 1986–1990. [5]. Gordon l. stüber, fellow, John r. barry,Steve w. mclaughlin, ye (geoffrey) li,mary ann ingram,and Thomas g. Pratt ―Broadband MIMO-OFDM Wireless Communications‖ proceedings of the ieee, vol. 92, no. 2, february 2004 [6]. A. Omri and R. Bouallegue ―New transmission scheme for mimo-ofdm system‖ International Journal of Next-Generation Networks (IJNGN) Vol.3, No.1, March 2011 [7]. Soohyun Jang1, Seongjoo Lee 2, and Yunho Jung ―Low-Complexity and Low-Power MIMO Symbol Detector for Mobile Devices with Two TX/RX Antennas‖ journal of semiconductor technology and science, vol.15, no.2, april, 2015 [8]. H. S. Kim, W. Zhu, J. Bhatia, K. Mohammed, A. Shah, and B. Daneshrad, ―A practical,
  • 6. Aparna.M.P. Int. Journal of Engineering Research and Application www.ijera.com ISSN : 2248-9622, Vol. 6, Issue 10, ( Part -2) October 2016, pp.101-106 www.ijera.com 106|P a g e hardware friendly MMSE detector for MIMO-OFDM based systems,‖ EURASIP J. Adv. Signal Process.,vol. 2008, p. 94, Jan. 2008. [9]. C.-J. Huang, C.-W. Yu, and H.-P. Ma, ―A power-efficient configurable low-complexity MIMO detector,‖ IEEE Trans. Circuits Syst.I,Reg. Papers, vol. 56, no. 2, pp. 485– 496, Feb. 2009 [10].M. Ali, K. Mohammed, and B. Daneshrad, ―MIMO accelerator: A design flow for a programmable MIMO decoder architecture,‖ inProc. Comput. Conf. Rec. 43rd Asilomar Conf. Signals, Syst., Nov. 2009, pp. 1292– 1296