The document describes the design and implementation of a 16-bit greatest common divisor (GCD) circuit on a Xilinx Spartan6 FPGA. It presents the GCD circuit at different behavioral and structural levels of abstraction, from a high-level behavioral model to an optimized structural model. Implementation results for each level show tradeoffs in resources used and speed obtained. The optimized structural model achieved the fastest speed of 23.6 delay units using the fewest resources of 33 slices, 110 LUTs, and 16 MUXCYs.
An Open Discussion of RISC-V BitManip, trends, and comparisons _ CuffRISC-V International
Join RISC-V BitManip industry leader Claire Xenia Wolf and Dr. James Cuff for an open and lively discussion with an interactive Q&A on RISC-V and BitManip including trends and comparisons with the existing architecture landscape including x86 and ARM and what specifically makes RISC-V unique.
FPGAs for Supercomputing: The Why and HowDESMOND YUEN
Excellent presentation by Hal Finkel (hfinkel@anl.gov), Kazutomo Yoshii, and Franck Cappello as to why FPGAs are a competitive HPC accelerator technology.
If you like what you read be sure you ♥ it below. Thank you!
An Open Discussion of RISC-V BitManip, trends, and comparisons _ CuffRISC-V International
Join RISC-V BitManip industry leader Claire Xenia Wolf and Dr. James Cuff for an open and lively discussion with an interactive Q&A on RISC-V and BitManip including trends and comparisons with the existing architecture landscape including x86 and ARM and what specifically makes RISC-V unique.
FPGAs for Supercomputing: The Why and HowDESMOND YUEN
Excellent presentation by Hal Finkel (hfinkel@anl.gov), Kazutomo Yoshii, and Franck Cappello as to why FPGAs are a competitive HPC accelerator technology.
If you like what you read be sure you ♥ it below. Thank you!
RISC-V growth and successes in technology and industry - embedded world 2021RISC-V International
RISC-V International has more than 1,000 members across over 50 countries who are working in hardware, software, services, and various industries for a strong and healthy RISC-V ecosystem. It is projected that by 2025 there will be over 62 billion RISC-V CPU cores and the total market for RISC-V IP and software is expected to grow to over $1b by 2025.
In 2020 alone, we saw successes with newly defined RISC-V accelerator architectures, affordable RISC-V open source small-board computers, development boards for personal computers, and an incredibly fast 64-bit RISC-V Core as the community also ratified key specifications and made advances in security.
As we see the growth of RISC-V into industries such as AI, machine learning, blockchain, 5G, medical, and industrial, we will see the ratifications of new extensions that enable this growth.
Join Kim McMahon, Director of Marketing and Stephano Cetola, Technical Program Manager as we take a look at where RISC-V is going in 2021.
A review of the history of digital design throughout the years until the era of programmable logic, and a detailed exploration of the architecture of FPGA chips, followed by an introduction to SoC FPGAs and some of their benefits.
This presentation contains an overview of novelties in ARMv8-A and details on application binary interface (ABI), memory management unit (MMU), caches and interrupts.
This talk was held within GlobalLogic Lviv Embedded TechTalk on November 23d, 2017.
RISC-V growth and successes in technology and industry - embedded world 2021RISC-V International
RISC-V International has more than 1,000 members across over 50 countries who are working in hardware, software, services, and various industries for a strong and healthy RISC-V ecosystem. It is projected that by 2025 there will be over 62 billion RISC-V CPU cores and the total market for RISC-V IP and software is expected to grow to over $1b by 2025.
In 2020 alone, we saw successes with newly defined RISC-V accelerator architectures, affordable RISC-V open source small-board computers, development boards for personal computers, and an incredibly fast 64-bit RISC-V Core as the community also ratified key specifications and made advances in security.
As we see the growth of RISC-V into industries such as AI, machine learning, blockchain, 5G, medical, and industrial, we will see the ratifications of new extensions that enable this growth.
Join Kim McMahon, Director of Marketing and Stephano Cetola, Technical Program Manager as we take a look at where RISC-V is going in 2021.
A review of the history of digital design throughout the years until the era of programmable logic, and a detailed exploration of the architecture of FPGA chips, followed by an introduction to SoC FPGAs and some of their benefits.
This presentation contains an overview of novelties in ARMv8-A and details on application binary interface (ABI), memory management unit (MMU), caches and interrupts.
This talk was held within GlobalLogic Lviv Embedded TechTalk on November 23d, 2017.
Slide deck for talk at IETF#92 (Dallas, March 2015) at the IETF Light-Weight Implementation Guidance (lwig) working group about the performance of cryptographic algorithms on ARM processors.
"p4srv6 (P4-16) design document rev1.0" Overview of Open Source SRv6 Mobile Userplane P4-16 on BMv2 design (planned to be public in March) #enog #enog55 @Nigata http://enog.jp/archives/2014
Slides from my presentation on ARM Shellcode at #44CON 2018, London.
In this talk, we explore ARM egghunting and "Quantum Leap" code - polyglot ARM shellcode. A bonus side effect of this talk will be creating headaches for those who like to defend agaisnt attacks using age old signature based techniques.
Preparing to program Aurora at Exascale - Early experiences and future direct...inside-BigData.com
In this deck from IWOCL / SYCLcon 2020, Hal Finkel from Argonne National Laboratory presents: Preparing to program Aurora at Exascale - Early experiences and future directions.
"Argonne National Laboratory’s Leadership Computing Facility will be home to Aurora, our first exascale supercomputer. Aurora promises to take scientific computing to a whole new level, and scientists and engineers from many different fields will take advantage of Aurora’s unprecedented computational capabilities to push the boundaries of human knowledge. In addition, Aurora’s support for advanced machine-learning and big-data computations will enable scientific workflows incorporating these techniques along with traditional HPC algorithms. Programming the state-of-the-art hardware in Aurora will be accomplished using state-of-the-art programming models. Some of these models, such as OpenMP, are long-established in the HPC ecosystem. Other models, such as Intel’s oneAPI, based on SYCL, are relatively-new models constructed with the benefit of significant experience. Many applications will not use these models directly, but rather, will use C++ abstraction libraries such as Kokkos or RAJA. Python will also be a common entry point to high-performance capabilities. As we look toward the future, features in the C++ standard itself will become increasingly relevant for accessing the extreme parallelism of exascale platforms.
This presentation will summarize the experiences of our team as we prepare for Aurora, exploring how to port applications to Aurora’s architecture and programming models, and distilling the challenges and best practices we’ve developed to date. oneAPI/SYCL and OpenMP are both critical models in these efforts, and while the ecosystem for Aurora has yet to mature, we’ve already had a great deal of success. Importantly, we are not passive recipients of programming models developed by others. Our team works not only with vendor-provided compilers and tools, but also develops improved open-source LLVM-based technologies that feed both open-source and vendor-provided capabilities. In addition, we actively participate in the standardization of OpenMP, SYCL, and C++. To conclude, I’ll share our thoughts on how these models can best develop in the future to support exascale-class systems."
Watch the video: https://wp.me/p3RLHQ-lPT
Learn more: https://www.iwocl.org/iwocl-2020/conference-program/
and
https://www.anl.gov/topic/aurora
Sign up for our insideHPC Newsletter: http://insidehpc.com/newsletter
Make ARM Shellcode Great Again - HITB2018PEKSaumil Shah
Compared to x86, ARM shellcode has made little progress. The x86 hardware is largely homogenous. ARM, however, has several versions and variants across devices today. There are several constraints and subtleties involved in writing production quality ARM shellcode which works on modern ARM hardware, not just on QEMU emulators.
In this talk, we shall explore issues such as overcoming cache coherency, reliable polymorphic shellcode, ARM egghunting and last but not the least, polyglot ARM shellcode. A bonus side effect of this talk will be creating headaches for those who like to defend agaisnt attacks using age old signature based techniques
44CON 2014 - Stupid PCIe Tricks, Joe Fitzpatrick44CON
44CON 2014 - Stupid PCIe Tricks, Joe Fitzpatrick.
Hardware hacks tend to focus on low-speed (jtag, uart) and external (network, usb) interfaces, and PCI Express is typically neither. After a crash course in PCIe Architecture, we’ll demonstrate a handful of hacks showing how pull PCIe outside of your system case and add PCIe slots to systems without them, including embedded platforms. We’ll top it off with a demonstration of SLOTSCREAMER, an inexpensive device that’s part of the NSA Playset which we’ve configured to access memory and IO, cross-platform and transparent to the OS - all by design with no 0-day needed. The open hardware and software framework that we will release will expand your Playset with the ability to tinker with DMA attacks to read memory, bypass software and hardware security measures, and directly attack other hardware devices in the system.
1. FPGA-Based Design and Implementation of the
Greatest Common Divisor using
Presented by: Ibrahim Hazmi
Supervised by: Dr. Mihai Sim
2. iHaz 20142
Design 16-bit GCD (the Greatest Common Divisor) and
Implement it on Xilinx Spartan6 FPGA.
It is challenging is to design such circuit and examine the
area-speed tradeoff in order to get the result at reasonable
speed with optimal utilization of the FPGA resources.
So, the design came in different Levels:
High Behavioural Level
ASM Behavioural Level
Direct Structural Level
Optimized Structural Level
Design Problem & Motives
3. iHaz 20143
Overview of Xilinx Spartan6 FPGA
Overview of the GCD
Euclidean Algorithm
Behavioural Level Design
High Level (While/For Loop)
ASM Level (ASM —> FSM)
Structural Level Design
Direct Structural (Data-path + FSM)
Optimized Structural Design (SAD)
Conclusion
Summary & Future Work
Outline
7. iHaz 20145
Greatest Common Divisor (GCD)
Greatest Common Factors (GCF)
Greatest Common Measure (GCM)
Highest Common Divisor (HCD)
Highest Common Factor (HCF)
What is GCD ?
36, 54
Euclidean Algorithm ?
Actually, It has many names:
45. iHaz 201425
Conclusion & Future Work
The Design of 16-bit GCD and its Implementation it on
Xilinx Spartan6 FPGA was demonstrated.
The area-speed tradeoff has been shown in different
Levels:
High Behavioural Level
ASM Behavioural Level
Direct Structural Level
Optimized Structural Level
Future Work: Primitives + Check new FPGAs where GCD
might be implemented inside a DSP