Get Multicore Differentiation and Great Integrated Graphics Performance

•

0 likes•3,185 views

While developing Total War*: THREE KINGDOMS, Creative Assembly* collaborated with Intel to get the most out of modern, multicore processors. Join us as team members share their tips and tricks for achieving great performance on the latest integrated GPUs.

Technology

Main Thread and Render Thread are running in sync

Game Tick builds game state and passes it to main thread

We calculate intersection with all the frustums at once

Intersection results are stored in a bitfield

We calculate the area of the projected bounding box and cull small models

Then we select the LOD level based on the
approximated average triangle size on screen

Next step is building instance data for the individual parts

Based on the mesh and instance data and the material
we figure out which parts of the pipeline the mesh needs to be rendered to.

Each pipeline stage uses the same instance data.

Every single mesh has a number of instance lists.
One instance list per pipeline stage.
The instance is added to all the relevant instance lists.

The instance lists are duplicated to every worker thread to ensure lockless access.

And double buffered because while the render thread
is rendering the current frame, the main thread (and workers)
Is adding instances to the next frame’s instance lists.

We have one mesh list per mesh type.
Every mesh with at least a single instance in the frame
will get added to exactly one mesh list matching its mesh type.

Just like the instance lists we have one mesh list for every single worker thread.

We start by processing the mesh lists
We run one task per mesh type

Per-thread mesh lists are combined to a single list of meshes

Then we process all the meshes in the list one by one

Each mesh has instance lists per pipeline stage

We start with the first pipeline stage and
combine the instances there into a single list

Then process all subsequent stages sequentially

Instance data is now uploaded to the GPU and prepared for batching

The actual rendering follows the different pipeline stages

Each stage renders a selection of mesh types

The meshes are prepared by the render thread workers

We have to start by waiting for the preparation tasks to be finished

The tasks process all meshes/instances for all pipeline stages
We can use an atomic counter per pipeline stage

No need to further split the tasks
We just increased the granularity of waiting for other sub-tasks

Usually everything is ready after the short initial wait

The basic building blocks of the system are emitters

Emitters are emitting a number of particles over time

Particles are often just sorted per emitter

Problems start happening when the emitters overlap

Sorting is responsible for moving dead particles to one end of the GPU buffer

This make uploading new particles trivial

Running out of particle space stomps over particles farthest from camera

(for Total War:THREE KINGDOMS)
We were confident that order-independent transparency is the solution

Get Multicore Differentiation and Great Integrated Graphics Performance

The .NET Framework provides several threading locking primitives. The Reader Writer Solution is one of them. The Reader Writer Solution class is used to synchronize access to a resource. At any given time, it allows concurrent read access to multiple (essentially unlimited) threads, or it allows write access for a single thread. In situations where a resource is read frequently but updated infrequently, a Reader Writer Solution will provide much better throughput than the exclusive Monitor lock. Course: Operating System

Reader Writer problem

Manthiravalli Krishnan

Reader/writer problem

RinkuMonani

StormPremnath Thimma, CSM/PMP

Ruby basics || updated

datt30

IEEE1588 - Collision avoidance for Delay_Req messages in broadcast media

Augusto Ciuffoletti

Learning Python through Minecraft on the Raspberry Pi - Worksheets

ManchesterBudo

Thread presentation

AAshish Ojha

1-Information sharing 2-Computation speedup 3-Modularity 4-Convenience 5-allows exchanged data and informations Two IPC Models 1. Shared memory- is an OS provided abstraction which allows a memory region to be simultaneously accessed by multiple programs with an intent to provide communication among them. One process will create an area in RAM which other processes can accessed 2. Message passing - is a form of communication used in interprocess communication. Communication is made by the sending of messages to recipients. Each process should be able to name the other processes. The producer typically uses send() system call to send messages, and the consumer uses receive()system call to receive messages Shared memory Faster than message passing After establishing shared memory, treated as routine memory accesses Message passing Useful for exchanging smaller amounts of data Easy to implement, but more time-consuming task of kernel intervention Bounded-Buffer Problem Producer Process do { ... produce an item in nextp ... wait(empty); wait(mutex); ... add nextp to buffer ... signal(mutex); signal(full); } while (true); Bounded-Buffer Problem Consumer Process do { wait(full); wait(mutex); ... remove an item from buffer to nextc ... signal(mutex); signal(empty); ... consume the item in nextc ... } while (true); client-server model, the client sends out requests to the server, and the server does some processing with the request(s) received, and returns a reply (or replies)to the client. Since Socket can be described as end-points for communication. we could imagine the client and server hosts being connected by a pipe through which data-flow takes place. 1-sockets use a client-server while Server waits for incoming client requests by listening to a specified port. 2-After receiving a request, the server accepts a connection from the client socket to complete the connection 3-then Remote procedure call (RPC) abstracts procedure call mechanism for use between systems with network connections 4-and pipes acts as a conduit allowing two processes to communicate A process is different than a program - Program is static code and static data - Process is Dynamic instance of code and data -Program becomes process when executable file loaded into memory No one-to-one mapping between programs and processes -can have multiple processes of the same program -one program can invoke multiple process Execution of program started via GUI mouse clicks and command line entry of its name The process state transition As a process executes, The process is being created, then The process is waiting to be assigned to a processor therefore, Instructions are being executed then The process is waiting for some event to occur,thereafter The process has finished exec ...

Programming Languages Implementation and Design. .docx

aryan532920

/** * Programming Languages: Implementation and Design. * * A Simple Compiler Adapted from Sebesta (2010) by Josh Dehlinger further modified by Adam Conover * (2012-2015) * * A simple compiler used for the simple English grammar in Section 2.2 of Adam Brooks Weber's * "Modern Programming Languages" book. Parts of this code was adapted from Robert Sebesta's * "Concepts of Programming Languages". * * This compiler assumes that the source file containing the sentences to parse is provided as the * first runtime argument. Within the source file, the compiler assumes that each sentence to parse * is provided on its own line. * * NOTE: A "real" compiler would more likely treat an entire file as a single stream of input, * rather than each line being an independent input stream. */ import java.io.BufferedReader; import java.io.FileReader; import java.io.IOException; public class Compiler { /** * It is assumed that the first argument provided is the name of the source file that is to be * "compiled". */ public static void main(String[] args) throws IOException { //args = new String[]{"<some hard coded path for testing>"}; //args = new String[]{"D:\\Version_Controlled\\_SVN_\\Newton\\COS420\\Java\\ParserSample\\input.txt"}; if (args.length < 1) { System.out.println("Need a filename!"); } else { // Java 7 "try-with-resource" to create the file input buffer. try (BufferedReader br = new BufferedReader(new FileReader(args[0]))) { // Create the new lexer. LexicalAnalyzer lexer = new LexicalAnalyzer(); // Start lexing and parsing. processFile(lexer, br); } } } /** * Reads each line of the input file and invokes the lexer and parser for each. */ static void processFile(LexicalAnalyzer lexer, BufferedReader br) throws IOException { String sourceLine; // Read each line in the source file to be compiled as a unique sentence // to check against the grammar. while ((sourceLine = br.readLine()) != null) { // Ignore empty lines and comments. if (sourceLine.trim().length() <= 0) { continue; } if (sourceLine.trim().startsWith("#")) { System.out.println("Comment: " + sourceLine.substring(1).trim()); continue; } // Create a new syntax analyzer over the provided lexer. SyntaxAnalyzer parser = new SyntaxAnalyzer(lexer); // Parse the given sentence against the given grammar. We assume that the // sentence, <S>, production is the start state. try { // Start the lexer... lexer.start(sourceLine); // Start the parser... parser ...

Introto netthreads-090906214344-phpapp01Aravindharamanan S

Operating System Assignment Help

Programming Homework Help

I am Anne L. I am an Operating System Assignment Expert at programminghomeworkhelp.com. I hold a Ph.D. in Programming, Auburn University, USA. I have been helping students with their homework for the past 8 years. I solve assignments related to Operating systems. Visit programminghomeworkhelp.com or email support@programminghomeworkhelp.com. You can also call on +1 678 648 4277 for any assistance with Operating System Assignments.

import java.io.BufferedReader;import java.io.BufferedWriter;.docx

wilcockiris

import java.io.BufferedReader; import java.io.BufferedWriter; import java.io.FileReader; import java.io.FileWriter; import java.io.IOException; import java.io.PrintWriter; import java.util.StringTokenizer; /** * Read a .dat file and reverse it. * CPS 350 */ public class Reverse { public static void main(String[]args) { if (args.length != 3) { System.err.println(" Incorrect number of arguments"); System.err.println(" Usage: "); System.err. println("\tjava Reverse <stack type> <input file> <output file>"); System.exit(1); } boolean useList = true; if (args[0].compareTo("list")==0) useList = true; else if (args[0].compareTo("array")==0) useList = false; else { System.err.println("\tSaw "+args[0]+" instead of list or array as first argument"); System.exit(1); } try { // // Set up the input file to read, and the output file to write to // BufferedReader fileIn = new BufferedReader(new FileReader(args[1])); PrintWriter fileOut = new PrintWriter(new BufferedWriter(new FileWriter(args[2]))); // // Read the first line of the .dat file to get sample rate. // We want to store the sample rate value in a variable, // but we can ignore the "; Sample Rate" part of the line. // Step through the first line one token (word) at a time // using the StringTokenizer. The fourth token is the one // we want (the sample rate). // StringTokenizer str; String oneLine; int sampleRate; String strJunk; oneLine = fileIn.readLine(); str = new StringTokenizer(oneLine); strJunk = str.nextToken(); // Read in semicolon strJunk = str.nextToken(); // Read in "Sample" strJunk = str.nextToken(); // Read in "Rate" // Read in sample rate sampleRate = Integer.parseInt(str.nextToken()); // // Read in the remainder of the file on line at a time. // The values in the first column are thrown away. // Place values from the second column on the stack. // Stop reading if we reach the end of the file. // DStack s; if (useList) s = new ListStack(); else s = new ArrayStack(); String timestep; double data; int count = 0; while ((oneLine = fileIn.readLine()) != null) { if (oneLine.charAt(0) == ';') { continue; } str = new StringTokenizer(oneLine); // Read in time step value from first col.

Intro To .Net Threads

rchakra

Operating System 4 1193308760782240 2mona_hakmy

Operating System 4tech2click

M|18 Architectural Overview: MariaDB MaxScale

MariaDB plc

Article link httpiveybusinessjournal.compublicationmanaging-.docx

fredharris32

Article link: http://iveybusinessjournal.com/publication/managing-global-risk-to-seize-competitive-advantage/ Requirements: Write one summary and study note both no longer than one pages should include all point of article. Then do a PPT and write a presenting paper only for 5 minutes. Groups of students will create and offer two MS PowerPoint presentation summarizing the main points of one of the readings for this course along with a one page handout for the students in the class. The aim of the presentations and the handouts is to provide the audience with the main ideas of the article and study notes. Groups will bring to class enough copies of the handout for each student in the class. The handout should list the name of the author, the title of the article, the title of the journal, and the publication date and page numbers along with a summary of its main points. Please do not exceed one page for this material. import java.io.BufferedReader; import java.io.BufferedWriter; import java.io.FileReader; import java.io.FileWriter; import java.io.IOException; import java.io.PrintWriter; import java.util.StringTokenizer; /** * Read a .dat file and reverse it. */ public class Reverse { public static void main(String[]args) { if (args.length != 3) { System.err.println(" Incorrect number of arguments"); System.err.println(" Usage: "); System.err. println("\tjava Reverse <stack type> <input file> <output file>"); System.exit(1); } boolean useList = true; if (args[0].compareTo("list")==0) useList = true; else if (args[0].compareTo("array")==0) useList = false; else { System.err.println("\tSaw "+args[0]+" instead of list or array as first argument"); System.exit(1); } try { // // Set up the input file to read, and the output file to write to // BufferedReader fileIn = new BufferedReader(new FileReader(args[1])); PrintWriter fileOut = new PrintWriter(new BufferedWriter(new FileWriter(args[2]))); // // Read the first line of the .dat file to get sample rate. // We want to store the sample rate value in a variable, // but we can ignore the "; Sample Rate" part of the line. // Step through the first line one token (word) at a time // using the StringTokenizer. The fourth token is the one // we want (the sample rate). // StringTokenizer str; String oneLine; int sampleRate; String strJunk; oneLine = fileIn.readLine(); str = new StringTokenizer(oneLine); strJunk = str.nextToken(); // Read in semicolon strJunk = str.nextToken(); // Read in "Sample" strJunk = str.nextToken(); // Read in "Rate" // ...

Peer sim (p2p network)

Hein Min Htike

Report_Ines_SwayamSwayam Tibrewal

Multithreadingbackdoor

[Java concurrency]01.thread management

xuehan zhu

TCP Sockets Tutor maXbox starter26

Max Kleiner

Threads Basic : Features, Types & Implementation

Soumen Santra

CHAP7.DOC.docbutest

Parallelizing Conqueror's Blade

Get Multicore Differentiation and Great Integrated Graphics Performance

Recommended

Recommended

More Related Content

Similar to Get Multicore Differentiation and Great Integrated Graphics Performance

Similar to Get Multicore Differentiation and Great Integrated Graphics Performance (20)

More from Intel® Software

More from Intel® Software (20)

Recently uploaded

Recently uploaded (20)

Get Multicore Differentiation and Great Integrated Graphics Performance