SlideShare a Scribd company logo
1 of 22
A SURVEY ON EXPLORING
MEMORY OPTIMIZATIONS IN
SMARTPHONES
-KARTHIKEYAN RAMKUMAR
ABSTRACT
•

Exploring the applicability of the memory optimizations explored for computer systems, in smartphone
hardware.

•

Memory technologies include Mobile RAM (M-RAM), Power Aware Virtual Memory
(PAVM), Dynamic RAM (DRAM) and On-demand mechanisms such as Immediate Power Down
(IPD) mechanism and Immediate Self Refresh (ISR) mechanism.

•

Newly emerging technologies such as Phase Change Memory (PCM) and a hybrid PCM-M RAM
approach.
INTRODUCTION
•

Additional features and improved user experience, provided by fast processors, copious
memory, resource demanding software, and power-hungry hardware makes energy a precious resource.

•

With hardware continuously improving in performance and price, vendors are able to build systems
with higher-performance and higher power components trying to meet users’ ever increasing demands
and compete for customers.

•

This results in systems that are over-provisioned with components that provide more capacity and as a
result, it is becoming more difficult to maintain long battery life in these devices.

•

While a smartphone contains many energy hungry components, such as CPU, display, and multiple
radios, energy consumed by memory subsystem has been given limited consideration.

•

Therefore, we explore the efficiency of the existing energy management mechanisms on smartphones.
1. DYNAMIC RAM (DRAM)
•

Dynamic random-access memory (DRAM) is a type of random-access memory that stores each bit of
data in a separate capacitor within an integrated circuit.

•

As applications are becoming increasingly data-centric, we expect main memory to remain as a
significant energy consumer because achieving good overall system performance will be more likely to
depend on having higher-performance and larger-capacity DRAM.

•

We use the terminology of the Double-Data Rate (DDR) memory simply because DDR is becoming the
most common type of memory used in today's PC and server systems. This approach is not limited to
only DDR but this technique can also be applied to other memory types, e.g., SDR and RDRAM.
1.1 MEMORY TRAFFIC RESHAPING
•

Make memory access less random and more controllable.

•

Use a 4-rank system and this creates small and medium sized idle periods.

•

The concepts of hot and cold ranks are introduced.

•

As Self Refresh can be more utilized, more valuable opportunities are created in cold ranks.
1.1 MEMORY TRAFFIC RESHAPING

In the experiments conducted, the average
interarrival time was elongated by almost 2
orders of magnitude on cold ranks.

An example showing that if memory traffic
is left unshaped, power management cannot
take full advantage of deeper power-saving
states since most idle periods are too short.
1.2 EFFECT OF RESHAPING ON MEMORY TRAFFIC
•

Compared the results of migrating 1%, 5%, and 10% of pages.

•

Migrating 1 % of pages gives limited benefits in power reduction, migrating 10% of pages does not
give any additional energy benefit beyond migrating 5%.

•

Therefore, migrating 5% of pages gives the best result.
1.2 EFFECT OF RESHAPING ON MEMORY TRAFFIC
•

•

To solve the problem at its root, it calls for an
alternative main memory design, where we
should use high-performance, highly parallel
memory on hot ranks and low-performance lowpower memory on cold ranks.

•
Effects of actively reshaping memory
traffic by migrating 1%, 5%, and 10% of
pages for the low memory intensive
workload (above) and high memoryintensive workload (below).

As we can see from the Figure, migrating 1% as
opposed to 5% of pages does not give much
benefit in reducing performance penalty.

Results shows that a 35.63-38.87% additional
energy can be saved by complementing existing
power management techniques with this
technique.
2. PHASE CHANGE MEMORY (PCM)
•

It is a type of non-volatile random access memory.

•

For a DRAM alternative, we must architect PCM.

•

We examine the following:



Buffer Organization



Partial Writes.
2.1 BUFFER ORGANIZATION
•

We examine PCM buffer organizations that satisfy DRAM imposed area constraints.

•

PCM buffer reorganizations reduce application execution time and memory energy, relative to
DRAM-based systems.

Evaluation:
On optimizing average delay and energy across the workloads, we find four 512B-wide buffers most
effective. Although each PCM array write requires 43.1x more energy than a DRAM array
write, these energy costs are mitigated by narrow buffer widths and additional rows, which reduce
the granularity of buffer evictions and expose opportunities for write coalescing, respectively.
2.2 PARTIAL WRITES
•
•

Partial write are utilized.

•

Scaling improves PCM endurance.

We expect write coalescing and partial writes to deliver a memory module average lifetime of 5.6
years.

Evaluation:

•

In a baseline architecture with a single 2048B-wide buffer, average module lifetime is approximately
525 hours.

•
•

For our memory intensive workloads, we observe 32.8 percent memory bus utilization.
On average, the four 512B-wide buffers coalesce 38.9 percent of writes emerging from the memory
bus, which is 47.0 percent utilized.
PHASE CHANGE MEMORY (PCM)
•

Collectively, these results indicate PCM is a viable DRAM alternative.

•

It consumes more energy to perform I/O operations, particularly write operations.

•

PCM consumes significantly less idle power than M-RAM.

•

Therefore, we should leverage the tradeoffs between performance and energy efficiency to apply PCM
technology in mobile devices.
3. CHARACTERIZING MOBILE SOFTWARE
• Compared to the CPU speed, human interactions are extremely slow.
• Prior study has shown that human perception threshold is between 50ms and 100ms.
• Completing task execution earlier than the perception threshold is meaningless.
• The majority of tasks are very short as more than 90% of all tasks complete within 10ms.
• 95% of all tasks are shorter than 50ms, indicating that these tasks can be extended to the 50ms
perception threshold deadline without any performance penalty.
• Similarly, for the remaining 5% of long tasks, any additional extension less than 50ms will not be
noticed by the user, avoiding performance degradation.
3. CHARACTERIZING MOBILE SOFTWARE
The applications selected for this survey are shown in the table

This table lists 12 popular Android applications selected from the Android market along with their trace
statistics. A T-Mobile G1 smartphone is used to collect the application traces. Each trace consists of task
intervals with the task execution length and the number of memory I/Os
4. MECHANISM COMPARISON
•

We will evaluate the effectiveness of various energy management mechanisms on M-RAM and PCM
under the same execution environment.

•

For this survey, a simulator that models the system configuration of a T-Mobile G1 smartphone is used
 System Configuration of a T-Mobile G1

•

The memory subsystem consists of a memory controller and three 64MB ranks (192MB totally), for
either M-RAM or PCM. The simulator feeds with the traces and conducts task execution under the
current CPU and memory state. The memory controller conducts memory I/O operations, and executes
power state transitions for each rank based on the energy management mechanism
4.1 POWER AWARE VIRTUAL MEMORY (PAVM)
•

Powering down the memory devices during the idle period can help in reducing the energy
consumption.

•

Power-Aware Virtual Memory (PAVM) is a simple and efficient way to provide energy management

 Memory energy consumption with a standard system (ON) and the PAVM mechanism. The left two
bars for each application show the energy of M-RAM and PCM in standard system, while the right
two bars show the energy for the PAVM mechanism.
5. ON-DEMAND MECHANISMS
•

Despite PAVM’s benefits to the standard system, it fails to address the energy efficiency of the active
rank accessed during the process execution.

•

Immediate Power Down (IPD) mechanism and Immediate Self Refresh (ISR) mechanism have been
proposed for RAM

•

As soon as an I/O request arrives at the memory controller, the rank to be accessed is transitioned to the
PRE state, and transitioned back to a low-power immediately after the I/O completes.
5. ON-DEMAND MECHANISMS
 Memory energy consumption for
on-demand mechanisms
normalized to the PAVM
mechanism on M-RAM

•

The two on-demand mechanisms outperform the PAVM mechanism on PCM.

•

PCM’s inferior I/O efficiency cannot offset its energy savings from idle periods, except for lightly
loaded applications Amazon, Music and Twidroid.

•

The PCM OFF mechanism completely eliminates the active idle energy, resulting in 44% energy
reduction over the PAVM mechanism on PCM.

•

Compared to the IPD and ISR mechanisms on M-RAM, the PCM OFF mechanism offers 18% and
22% energy savings respectively.
5. ON-DEMAND MECHANISMS
 The distribution of extended tasks
that expose delays for on-demand
mechanisms

• IPD mechanism achieves the best performance with negligible delays exposed.
• The ISR and PCM OFF mechanisms, on the other hand, incur more evident degradation due to the
141.5ns long transition latency.
• PCM technology with on-demand mechanism surpasses the traditional MRAM when memory energy is
the only concern.
• However, when performance is considered, M-RAM still has the chance to beat PCM.
• We therefore need an approach to balance energy and performance more efficiently than any standalone
memory technology.
6. HYBRID MEMORY ARCHITECTURE

•

A hybrid memory consisting of M-RAM and PCM can improve both the energy efficiency and
performance.

•

When an application is invoked and its image does not reside in M-RAM, it is loaded into M-RAM
either from secondary storage or PCM, and the corresponding process identifier is put at the head of the
LRU list.

•

When an application is closed, its memory image will stay in M-RAM until it is swapped out.

•

The hybrid approach achieves the best energy efficiency.
CONCLUSIONS

•

This paper presents a survey of several memory optimization mechanisms that were originally
proposed for desktops and servers, on a mobile system.

•

The mechanisms include improving energy efficiency by making the DRAM less randomly
accessed, architecting the novel PCM as a scalable DRAM alternative and, designing and
implementing PAVM in smartphones.

•

The PAVM mechanism saves more than 90% energy as compared to the standard system with no
energy management.

•
•

Additional energy savings are provided by the on-demand mechanisms.
The energy efficiency can be improved further by a hybrid approach.
THANK YOU

More Related Content

What's hot

FEATURES OF CNC MACHINE BY G.DINESHPIRAN
FEATURES OF CNC MACHINE BY G.DINESHPIRANFEATURES OF CNC MACHINE BY G.DINESHPIRAN
FEATURES OF CNC MACHINE BY G.DINESHPIRANDinesh Piran-Gdp
 
Dma data transfer
Dma data transferDma data transfer
Dma data transferchemjor83
 
Basic structure of computers
Basic structure of computersBasic structure of computers
Basic structure of computersKumar
 
Migration to IBM POWER8
Migration to IBM POWER8Migration to IBM POWER8
Migration to IBM POWER8Pavel Hampl
 
Chapter1 basic structure of computers
Chapter1  basic structure of computersChapter1  basic structure of computers
Chapter1 basic structure of computersjismymathew
 
Power path viewer_technical_presentation
Power path viewer_technical_presentationPower path viewer_technical_presentation
Power path viewer_technical_presentationxKinAnx
 
Taming Latency: Case Studies in MapReduce Data Analytics
Taming Latency: Case Studies in MapReduce Data AnalyticsTaming Latency: Case Studies in MapReduce Data Analytics
Taming Latency: Case Studies in MapReduce Data AnalyticsEMC
 

What's hot (15)

FEATURES OF CNC MACHINE BY G.DINESHPIRAN
FEATURES OF CNC MACHINE BY G.DINESHPIRANFEATURES OF CNC MACHINE BY G.DINESHPIRAN
FEATURES OF CNC MACHINE BY G.DINESHPIRAN
 
Dma data transfer
Dma data transferDma data transfer
Dma data transfer
 
Cache
CacheCache
Cache
 
Stmm white paper
Stmm white paperStmm white paper
Stmm white paper
 
Basic structure of computers
Basic structure of computersBasic structure of computers
Basic structure of computers
 
eet_NPU_file.PDF
eet_NPU_file.PDFeet_NPU_file.PDF
eet_NPU_file.PDF
 
Migration to IBM POWER8
Migration to IBM POWER8Migration to IBM POWER8
Migration to IBM POWER8
 
CPU and its execution of instruction
CPU and its execution of instructionCPU and its execution of instruction
CPU and its execution of instruction
 
Chapter1 basic structure of computers
Chapter1  basic structure of computersChapter1  basic structure of computers
Chapter1 basic structure of computers
 
Power path viewer_technical_presentation
Power path viewer_technical_presentationPower path viewer_technical_presentation
Power path viewer_technical_presentation
 
Taming Latency: Case Studies in MapReduce Data Analytics
Taming Latency: Case Studies in MapReduce Data AnalyticsTaming Latency: Case Studies in MapReduce Data Analytics
Taming Latency: Case Studies in MapReduce Data Analytics
 
Memory hierarchy
Memory hierarchyMemory hierarchy
Memory hierarchy
 
computer architecture.
computer architecture.computer architecture.
computer architecture.
 
Mileage principles
Mileage principlesMileage principles
Mileage principles
 
Kairos aarohan
Kairos  aarohanKairos  aarohan
Kairos aarohan
 

Similar to Literature survey presentation

CA assignment group.pptx
CA assignment group.pptxCA assignment group.pptx
CA assignment group.pptxHAIDERALICH3
 
Virtualization for Emerging Memory Devices
Virtualization for Emerging Memory DevicesVirtualization for Emerging Memory Devices
Virtualization for Emerging Memory DevicesTakahiro Hirofuchi
 
참여기관_발표자료-국민대학교 201301 정기회의
참여기관_발표자료-국민대학교 201301 정기회의참여기관_발표자료-국민대학교 201301 정기회의
참여기관_발표자료-국민대학교 201301 정기회의DzH QWuynh
 
COMPUTER ARCHITECTURE-2.pptx
COMPUTER ARCHITECTURE-2.pptxCOMPUTER ARCHITECTURE-2.pptx
COMPUTER ARCHITECTURE-2.pptxssusere16bd9
 
High performance operating system controlled memory compression
High performance operating system controlled memory compressionHigh performance operating system controlled memory compression
High performance operating system controlled memory compressionMr. Chanuwan
 
lecture asdkvakm;bk;dv;advvAVHD;KASV;DVKHSVDK
lecture asdkvakm;bk;dv;advvAVHD;KASV;DVKHSVDKlecture asdkvakm;bk;dv;advvAVHD;KASV;DVKHSVDK
lecture asdkvakm;bk;dv;advvAVHD;KASV;DVKHSVDKofficeaiotfab
 
Investigations on Implementation of Ternary Content Addressable Memory Archit...
Investigations on Implementation of Ternary Content Addressable Memory Archit...Investigations on Implementation of Ternary Content Addressable Memory Archit...
Investigations on Implementation of Ternary Content Addressable Memory Archit...IRJET Journal
 
DDR4 Compliance Testing. Its time has come!
DDR4 Compliance Testing.  Its time has come!DDR4 Compliance Testing.  Its time has come!
DDR4 Compliance Testing. Its time has come!Barbara Aichinger
 
computer architecture and organisation
computer architecture and organisationcomputer architecture and organisation
computer architecture and organisationRamjiChaurasiya
 
MAC: A NOVEL SYSTEMATICALLY MULTILEVEL CACHE REPLACEMENT POLICY FOR PCM MEMORY
MAC: A NOVEL SYSTEMATICALLY MULTILEVEL CACHE REPLACEMENT POLICY FOR PCM MEMORYMAC: A NOVEL SYSTEMATICALLY MULTILEVEL CACHE REPLACEMENT POLICY FOR PCM MEMORY
MAC: A NOVEL SYSTEMATICALLY MULTILEVEL CACHE REPLACEMENT POLICY FOR PCM MEMORYcaijjournal
 
Modeling of DDR4 Memory and Advanced Verifications of DDR4 Memory Subsystem
Modeling of DDR4 Memory and Advanced Verifications of DDR4 Memory SubsystemModeling of DDR4 Memory and Advanced Verifications of DDR4 Memory Subsystem
Modeling of DDR4 Memory and Advanced Verifications of DDR4 Memory SubsystemIRJET Journal
 
Csc1401 lecture06 - internal memory
Csc1401   lecture06 - internal memoryCsc1401   lecture06 - internal memory
Csc1401 lecture06 - internal memoryIIUM
 
ARM architcture
ARM architcture ARM architcture
ARM architcture Hossam Adel
 

Similar to Literature survey presentation (20)

CA assignment group.pptx
CA assignment group.pptxCA assignment group.pptx
CA assignment group.pptx
 
RAMinate Invited Talk at NII
RAMinate Invited Talk at NIIRAMinate Invited Talk at NII
RAMinate Invited Talk at NII
 
RAMinate ACM SoCC 2016 Talk
RAMinate ACM SoCC 2016 TalkRAMinate ACM SoCC 2016 Talk
RAMinate ACM SoCC 2016 Talk
 
Improving DRAM performance
Improving DRAM performanceImproving DRAM performance
Improving DRAM performance
 
Aqeel
AqeelAqeel
Aqeel
 
Virtualization for Emerging Memory Devices
Virtualization for Emerging Memory DevicesVirtualization for Emerging Memory Devices
Virtualization for Emerging Memory Devices
 
Bg4103362367
Bg4103362367Bg4103362367
Bg4103362367
 
참여기관_발표자료-국민대학교 201301 정기회의
참여기관_발표자료-국민대학교 201301 정기회의참여기관_발표자료-국민대학교 201301 정기회의
참여기관_발표자료-국민대학교 201301 정기회의
 
COMPUTER ARCHITECTURE-2.pptx
COMPUTER ARCHITECTURE-2.pptxCOMPUTER ARCHITECTURE-2.pptx
COMPUTER ARCHITECTURE-2.pptx
 
High performance operating system controlled memory compression
High performance operating system controlled memory compressionHigh performance operating system controlled memory compression
High performance operating system controlled memory compression
 
lecture asdkvakm;bk;dv;advvAVHD;KASV;DVKHSVDK
lecture asdkvakm;bk;dv;advvAVHD;KASV;DVKHSVDKlecture asdkvakm;bk;dv;advvAVHD;KASV;DVKHSVDK
lecture asdkvakm;bk;dv;advvAVHD;KASV;DVKHSVDK
 
Chap2 slides
Chap2 slidesChap2 slides
Chap2 slides
 
Investigations on Implementation of Ternary Content Addressable Memory Archit...
Investigations on Implementation of Ternary Content Addressable Memory Archit...Investigations on Implementation of Ternary Content Addressable Memory Archit...
Investigations on Implementation of Ternary Content Addressable Memory Archit...
 
DDR4 Compliance Testing. Its time has come!
DDR4 Compliance Testing.  Its time has come!DDR4 Compliance Testing.  Its time has come!
DDR4 Compliance Testing. Its time has come!
 
G1034853
G1034853G1034853
G1034853
 
computer architecture and organisation
computer architecture and organisationcomputer architecture and organisation
computer architecture and organisation
 
MAC: A NOVEL SYSTEMATICALLY MULTILEVEL CACHE REPLACEMENT POLICY FOR PCM MEMORY
MAC: A NOVEL SYSTEMATICALLY MULTILEVEL CACHE REPLACEMENT POLICY FOR PCM MEMORYMAC: A NOVEL SYSTEMATICALLY MULTILEVEL CACHE REPLACEMENT POLICY FOR PCM MEMORY
MAC: A NOVEL SYSTEMATICALLY MULTILEVEL CACHE REPLACEMENT POLICY FOR PCM MEMORY
 
Modeling of DDR4 Memory and Advanced Verifications of DDR4 Memory Subsystem
Modeling of DDR4 Memory and Advanced Verifications of DDR4 Memory SubsystemModeling of DDR4 Memory and Advanced Verifications of DDR4 Memory Subsystem
Modeling of DDR4 Memory and Advanced Verifications of DDR4 Memory Subsystem
 
Csc1401 lecture06 - internal memory
Csc1401   lecture06 - internal memoryCsc1401   lecture06 - internal memory
Csc1401 lecture06 - internal memory
 
ARM architcture
ARM architcture ARM architcture
ARM architcture
 

Recently uploaded

Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 

Recently uploaded (20)

Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 

Literature survey presentation

  • 1. A SURVEY ON EXPLORING MEMORY OPTIMIZATIONS IN SMARTPHONES -KARTHIKEYAN RAMKUMAR
  • 2. ABSTRACT • Exploring the applicability of the memory optimizations explored for computer systems, in smartphone hardware. • Memory technologies include Mobile RAM (M-RAM), Power Aware Virtual Memory (PAVM), Dynamic RAM (DRAM) and On-demand mechanisms such as Immediate Power Down (IPD) mechanism and Immediate Self Refresh (ISR) mechanism. • Newly emerging technologies such as Phase Change Memory (PCM) and a hybrid PCM-M RAM approach.
  • 3. INTRODUCTION • Additional features and improved user experience, provided by fast processors, copious memory, resource demanding software, and power-hungry hardware makes energy a precious resource. • With hardware continuously improving in performance and price, vendors are able to build systems with higher-performance and higher power components trying to meet users’ ever increasing demands and compete for customers. • This results in systems that are over-provisioned with components that provide more capacity and as a result, it is becoming more difficult to maintain long battery life in these devices. • While a smartphone contains many energy hungry components, such as CPU, display, and multiple radios, energy consumed by memory subsystem has been given limited consideration. • Therefore, we explore the efficiency of the existing energy management mechanisms on smartphones.
  • 4. 1. DYNAMIC RAM (DRAM) • Dynamic random-access memory (DRAM) is a type of random-access memory that stores each bit of data in a separate capacitor within an integrated circuit. • As applications are becoming increasingly data-centric, we expect main memory to remain as a significant energy consumer because achieving good overall system performance will be more likely to depend on having higher-performance and larger-capacity DRAM. • We use the terminology of the Double-Data Rate (DDR) memory simply because DDR is becoming the most common type of memory used in today's PC and server systems. This approach is not limited to only DDR but this technique can also be applied to other memory types, e.g., SDR and RDRAM.
  • 5. 1.1 MEMORY TRAFFIC RESHAPING • Make memory access less random and more controllable. • Use a 4-rank system and this creates small and medium sized idle periods. • The concepts of hot and cold ranks are introduced. • As Self Refresh can be more utilized, more valuable opportunities are created in cold ranks.
  • 6. 1.1 MEMORY TRAFFIC RESHAPING In the experiments conducted, the average interarrival time was elongated by almost 2 orders of magnitude on cold ranks. An example showing that if memory traffic is left unshaped, power management cannot take full advantage of deeper power-saving states since most idle periods are too short.
  • 7. 1.2 EFFECT OF RESHAPING ON MEMORY TRAFFIC • Compared the results of migrating 1%, 5%, and 10% of pages. • Migrating 1 % of pages gives limited benefits in power reduction, migrating 10% of pages does not give any additional energy benefit beyond migrating 5%. • Therefore, migrating 5% of pages gives the best result.
  • 8. 1.2 EFFECT OF RESHAPING ON MEMORY TRAFFIC • • To solve the problem at its root, it calls for an alternative main memory design, where we should use high-performance, highly parallel memory on hot ranks and low-performance lowpower memory on cold ranks. • Effects of actively reshaping memory traffic by migrating 1%, 5%, and 10% of pages for the low memory intensive workload (above) and high memoryintensive workload (below). As we can see from the Figure, migrating 1% as opposed to 5% of pages does not give much benefit in reducing performance penalty. Results shows that a 35.63-38.87% additional energy can be saved by complementing existing power management techniques with this technique.
  • 9. 2. PHASE CHANGE MEMORY (PCM) • It is a type of non-volatile random access memory. • For a DRAM alternative, we must architect PCM. • We examine the following:  Buffer Organization  Partial Writes.
  • 10. 2.1 BUFFER ORGANIZATION • We examine PCM buffer organizations that satisfy DRAM imposed area constraints. • PCM buffer reorganizations reduce application execution time and memory energy, relative to DRAM-based systems. Evaluation: On optimizing average delay and energy across the workloads, we find four 512B-wide buffers most effective. Although each PCM array write requires 43.1x more energy than a DRAM array write, these energy costs are mitigated by narrow buffer widths and additional rows, which reduce the granularity of buffer evictions and expose opportunities for write coalescing, respectively.
  • 11. 2.2 PARTIAL WRITES • • Partial write are utilized. • Scaling improves PCM endurance. We expect write coalescing and partial writes to deliver a memory module average lifetime of 5.6 years. Evaluation: • In a baseline architecture with a single 2048B-wide buffer, average module lifetime is approximately 525 hours. • • For our memory intensive workloads, we observe 32.8 percent memory bus utilization. On average, the four 512B-wide buffers coalesce 38.9 percent of writes emerging from the memory bus, which is 47.0 percent utilized.
  • 12. PHASE CHANGE MEMORY (PCM) • Collectively, these results indicate PCM is a viable DRAM alternative. • It consumes more energy to perform I/O operations, particularly write operations. • PCM consumes significantly less idle power than M-RAM. • Therefore, we should leverage the tradeoffs between performance and energy efficiency to apply PCM technology in mobile devices.
  • 13. 3. CHARACTERIZING MOBILE SOFTWARE • Compared to the CPU speed, human interactions are extremely slow. • Prior study has shown that human perception threshold is between 50ms and 100ms. • Completing task execution earlier than the perception threshold is meaningless. • The majority of tasks are very short as more than 90% of all tasks complete within 10ms. • 95% of all tasks are shorter than 50ms, indicating that these tasks can be extended to the 50ms perception threshold deadline without any performance penalty. • Similarly, for the remaining 5% of long tasks, any additional extension less than 50ms will not be noticed by the user, avoiding performance degradation.
  • 14. 3. CHARACTERIZING MOBILE SOFTWARE The applications selected for this survey are shown in the table This table lists 12 popular Android applications selected from the Android market along with their trace statistics. A T-Mobile G1 smartphone is used to collect the application traces. Each trace consists of task intervals with the task execution length and the number of memory I/Os
  • 15. 4. MECHANISM COMPARISON • We will evaluate the effectiveness of various energy management mechanisms on M-RAM and PCM under the same execution environment. • For this survey, a simulator that models the system configuration of a T-Mobile G1 smartphone is used  System Configuration of a T-Mobile G1 • The memory subsystem consists of a memory controller and three 64MB ranks (192MB totally), for either M-RAM or PCM. The simulator feeds with the traces and conducts task execution under the current CPU and memory state. The memory controller conducts memory I/O operations, and executes power state transitions for each rank based on the energy management mechanism
  • 16. 4.1 POWER AWARE VIRTUAL MEMORY (PAVM) • Powering down the memory devices during the idle period can help in reducing the energy consumption. • Power-Aware Virtual Memory (PAVM) is a simple and efficient way to provide energy management  Memory energy consumption with a standard system (ON) and the PAVM mechanism. The left two bars for each application show the energy of M-RAM and PCM in standard system, while the right two bars show the energy for the PAVM mechanism.
  • 17. 5. ON-DEMAND MECHANISMS • Despite PAVM’s benefits to the standard system, it fails to address the energy efficiency of the active rank accessed during the process execution. • Immediate Power Down (IPD) mechanism and Immediate Self Refresh (ISR) mechanism have been proposed for RAM • As soon as an I/O request arrives at the memory controller, the rank to be accessed is transitioned to the PRE state, and transitioned back to a low-power immediately after the I/O completes.
  • 18. 5. ON-DEMAND MECHANISMS  Memory energy consumption for on-demand mechanisms normalized to the PAVM mechanism on M-RAM • The two on-demand mechanisms outperform the PAVM mechanism on PCM. • PCM’s inferior I/O efficiency cannot offset its energy savings from idle periods, except for lightly loaded applications Amazon, Music and Twidroid. • The PCM OFF mechanism completely eliminates the active idle energy, resulting in 44% energy reduction over the PAVM mechanism on PCM. • Compared to the IPD and ISR mechanisms on M-RAM, the PCM OFF mechanism offers 18% and 22% energy savings respectively.
  • 19. 5. ON-DEMAND MECHANISMS  The distribution of extended tasks that expose delays for on-demand mechanisms • IPD mechanism achieves the best performance with negligible delays exposed. • The ISR and PCM OFF mechanisms, on the other hand, incur more evident degradation due to the 141.5ns long transition latency. • PCM technology with on-demand mechanism surpasses the traditional MRAM when memory energy is the only concern. • However, when performance is considered, M-RAM still has the chance to beat PCM. • We therefore need an approach to balance energy and performance more efficiently than any standalone memory technology.
  • 20. 6. HYBRID MEMORY ARCHITECTURE • A hybrid memory consisting of M-RAM and PCM can improve both the energy efficiency and performance. • When an application is invoked and its image does not reside in M-RAM, it is loaded into M-RAM either from secondary storage or PCM, and the corresponding process identifier is put at the head of the LRU list. • When an application is closed, its memory image will stay in M-RAM until it is swapped out. • The hybrid approach achieves the best energy efficiency.
  • 21. CONCLUSIONS • This paper presents a survey of several memory optimization mechanisms that were originally proposed for desktops and servers, on a mobile system. • The mechanisms include improving energy efficiency by making the DRAM less randomly accessed, architecting the novel PCM as a scalable DRAM alternative and, designing and implementing PAVM in smartphones. • The PAVM mechanism saves more than 90% energy as compared to the standard system with no energy management. • • Additional energy savings are provided by the on-demand mechanisms. The energy efficiency can be improved further by a hybrid approach.

Editor's Notes

  1. Many memory optimizations have been explored for computer systems and in this survey we explore their applicability to smartphone hardware
  2. To reshape the memory traffic for our benefit, we must make memory access less random and more controllable. Conventional memory traffic often seems random because (1) the operating system arbitrarily maps virtual pages to physical pages, and (2) different pages are often accessed very differently at run-time. As a result of such randomness, the interarrival characteristic of memory requests observed on each rank might not be favorable for existing techniques to manage power. We use a 4-rank system wherein memory requests are likely to be randomly distributed and this creates a large number of small and medium sized idle periods.To elongate idle periods, the concepts of hot and cold ranks are introduced.More valuable opportunities are created on cold ranks as Self Refresh can be more utilized.
  3. . Hot ranks are used to hold frequently accessed pages, which leaves infrequently used and unmapped pages on cold ranks. Hot ranks are created by migrating frequently accessed pages from cold ranks to hot ranks
  4. To study the effect of memory traffic reshaping in more detail, we compare the results of migrating 1%, 5%, and 10% of pages. Migrating only 1 % of pages gives only limited benefits in power reduction. On the other hand, migrating 10% of pages does not give any additional energy benefit beyond that of migrating 5%. In addition, it also suffers from more performance penalty due to having to migrate more pages. Therefore, migrating 5% of pages gives the best result for the workloads that were used.
  5. Phase change memory is a type of non-volatile random access memory and provides a non-volatile storage mechanism agreeable to process scaling. However, for a DRAM alternative, we must architect PCM for feasibility in main memory within general-purpose systems.Drawn from a rigorous survey of PCM device and circuit prototypes published within the last five years and comparing against modern DRAM memory subsystems, we examine
  6. We examine PCM buffer organizations that satisfy DRAM imposed area constraints. PCM buffer reorganizations reduce application execution time from 1.6x to 1.2x and memory energy from 2.2x to 1.0x, relative to DRAM-based systems.As PCM technology matures, baseline PCM latencies may improve. Moreover, process technology scaling will drive linear reductions in PCM energy.
  7. Collectively, these results indicate PCM is a viable DRAM alternative, with architectural solutions providing competitive performance and comparable energy.On utilizing PCM as a viable alternative to M-RAM, we need to note that it consumes more energy to perform I/O operations, particularly write operations. However, PCM consumes significantly less idle power than M-RAM, especially in the low-power state where the power consumption is reduced to 0. Therefore, we should leverage the tradeoffs between performance and energy efficiency to apply PCM technology in mobile devices.
  8. From the previous analysis, we can see that PCM is superior to M-RAM for its lower idle power consumption, while M-RAM excels PCM for faster I/O speed and lower I/O energy. Therefore, a hybrid approach consisting of M-RAM and PCM can improve both the energy efficiency and performance.When an application is invoked and its image does not reside in M-RAM, it is loaded either from secondary storage or PCM, and the corresponding process identifier is put at the head of the LRU list.When an application is closed, its memory image will stay in M-RAM until it is swapped out. The hybrid approach preserves more than 99% of IPD’s performance and achieves the best energy efficiency among all mechanisms while maintaining almost full memory performance.
  9. Therefore, by adopting an optimization strategy based on these mechanisms, designers will be able to better optimize the memory in smartphones and offer superior energy savings to compensate for the slow pace of battery technology improvements.