A presentation I gave at KGC 2014 in November 2014. I introduce Umbra 3 and go through three customer use cases that show the sort of development we've done with some of our partners.
I cover Witcher 3 (CD Projekt RED), Quantum Break (Remedy Entertainment) and Destiny (Bungie).
31. Case study: Quantum Break
• Third-person action game for Xbox One, from the creators of Max Payne & Alan Wake
• Uses an in-house 3D engine developed by one of the most respected graphics teams in the industry
• Average object count per view is roughly 40,000 without occlusion culling
• Features large-scale destruction and semi-dynamic geometry
• Previously used GPU occlusion queries for visibility
32. Semi-dynamic scene changes
• Large-scale changes in mostly static geometry
– Destruction of props
– Separate versions of a scene shown at different points in time
• Solution
– Visibility data is built and stored per scene data chunk
– Multiple versions of a data chunk are stored, one per dynamic state
– The active visibility data chunks are linked together at runtime
33. Shadow caster culling
• Don't render shadow casters whose shadows are entirely occluded.
• Re-project the occlusion buffer into light space to build a receiver mask.
• Test shadow casters against the receiver mask.
35. Case Study: Destiny
• In-house, cross-platform engine built by Bungie
• Shipped in 2014 on current and previous generation consoles
• Collaboration with Umbra since 2009
• Previous workflow relied on hand-placed portals and BSP scenes
• Umbra visibility data is also used for…
– Gameplay cluster definition
– Spatial connectivity
– Audio occlusion
– Global illumination acceleration
36. Incremental content updates
• Requirements for preprocessing arbitrary polygon soup:
– 3 km x 3 km map
– Full rebuild: 5 minutes
– Smallest incremental update: 10 seconds
• Umbra's computation is organized as a farm of small tasks, expressed as a graph
• Each task's result is cached in shared storage
• Because occlusion data is local in nature, sections can be updated independently
37. Culling with predicted camera
• Visibility is processed in parallel with the camera update
→ the exact camera position is not yet known when the visibility query starts
• Umbra 3 provides a "camera prediction radius" that turns the query into a conservative "from-region" query
• All occluders are shrunk by that amount, so the result remains correct
38. Dynamic changes in visibility
• Closed doors, shutters and the like are excellent occluders, but only while they are closed.
• The visibility graph helps here by allowing links to be enabled and disabled at runtime.
• Umbra 3 supports generic "gate" objects that can be toggled on and off at runtime.
39. Thank you.
For more on Umbra 3, go to umbra3.com
sampo@umbrasoftware.com
Follow us on Twitter @umbrasoftware
Editor's Notes
Fast content creation and smooth frame-rates with Umbra 3
Introduction to Umbra and visibility
Case study: Witcher 3
Case study: Quantum Break
Case study: Destiny
Video Games Powered by Umbra 3
The Witcher 3, developed by CD Projekt RED in Poland and slated for release next year, uses Umbra for visibility on all platforms.
Stunningly beautiful Killzone: Shadow Fall, the PS4 launch title by Guerrilla
Destiny by Bungie, released in September this year, uses Umbra 3 on all platforms – previous and current gen.
Call of Duty: Ghosts by Infinity Ward – PS4, Xbox One and PC
PVS
GPU rendering
Simplified occluder rasterization
Portals and Cells
Umbra
Here you’re looking at top down view of an example scene.
The requirement was that there should be no manual markup or other requirements on the input geometry. So what we take as input is all the level geometry as it is.
So we really don’t have any other information besides the long list of polygons, which are possibly grouped into objects.
Doing geometric operations with polygons directly has all kinds of difficulties related to floating point accuracy. Also, almost all real life game levels contain some small modeling errors such as t-vertices and cracks between objects. We need to be able to work with this kind of input as well.
What we do next is to voxelize all the geometry. Great thing about voxelization is that it removes all the nasty problems with floating point accuracy and automatically removes common modeling errors such as cracks and t-vertices in the geometry.
Voxelization also discretizes the input, making the following processing independent of polygons count. In effect we can choose the resolution of the input data. This is important for the goal of creating a bounded size data structure.
The input could have billions of triangles but after this step we can throw all the original geometry away and work on the voxels instead.
The bad thing about voxelization is that it requires quite a lot of memory. In fact, since we need accurate visibility data, we have to make the voxels quite small, and the number of them might be measured in billions or even hundreds of billions for larger levels. Even compressed, this data can take gigabytes of memory. The memory requirements alone indicate we need to further refine this voxel representation into something else to make it usable in practice.
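As a toy sketch of that discretization step (illustrative only, not Umbra's voxelizer): conservatively voxelize each triangle's bounding box onto a grid. The size of the output depends only on the chosen voxel resolution, not on the polygon count, and duplicated geometry collapses into the same voxels.

```python
def voxelize_aabb(vmin, vmax, voxel_size):
    """Integer voxel coordinates overlapped by an axis-aligned box."""
    lo = tuple(int(c // voxel_size) for c in vmin)
    hi = tuple(int(c // voxel_size) for c in vmax)
    return {(x, y, z)
            for x in range(lo[0], hi[0] + 1)
            for y in range(lo[1], hi[1] + 1)
            for z in range(lo[2], hi[2] + 1)}

def voxelize_scene(triangles, voxel_size):
    """Conservative voxelization of a polygon soup via triangle AABBs.
    (A real voxelizer tests actual triangle/voxel overlap; the bounding
    box is used here only to keep the sketch short.)"""
    voxels = set()
    for tri in triangles:
        vmin = tuple(min(p[i] for p in tri) for i in range(3))
        vmax = tuple(max(p[i] for p in tri) for i in range(3))
        voxels |= voxelize_aabb(vmin, vmax, voxel_size)
    return voxels
```

Feeding the same triangle in twice yields exactly the same voxel set, which is why modeling errors such as duplicated faces simply disappear at this stage.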
The approach we chose is to create a cell-and-portal graph by grouping the voxels together based on proximity and connectivity. Cells are created from groups of voxels. Portals are then created on the boundaries of these groups.
We chose to create portals because in the past they have been proven to be an efficient way to represent visibility, and we solve the issues of manually placed portals by generating them automatically.
In contrast to manually placed portals, we might generate thousands of portals, which allows us to have accurate visibility in outdoor spaces as well.
By controlling the number of output cells and portals we can choose the output resolution of the visibility data so that it meets the memory and performance requirements.
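A 2-D toy version of this grouping step (illustrative only; Umbra's actual cell generation is far more involved): flood-fill empty voxels into cells, one group per fixed-size tile, then record a portal wherever two different cells have adjacent empty voxels.

```python
from collections import deque

def build_cells_and_portals(grid, tile):
    """grid: 2-D list, 0 = empty space, 1 = solid; tile: group size.
    Flood-fills empty voxels within each tile into cells, then records
    a portal wherever two cells share adjacent empty voxels."""
    h, w = len(grid), len(grid[0])
    cell_id = [[-1] * w for _ in range(h)]
    ncells = 0
    for y in range(h):
        for x in range(w):
            if grid[y][x] == 0 and cell_id[y][x] == -1:
                ty, tx = y // tile, x // tile  # fill stays in this tile
                q = deque([(y, x)])
                cell_id[y][x] = ncells
                while q:
                    cy, cx = q.popleft()
                    for dy, dx in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                        ny, nx = cy + dy, cx + dx
                        if (0 <= ny < h and 0 <= nx < w
                                and grid[ny][nx] == 0
                                and cell_id[ny][nx] == -1
                                and ny // tile == ty and nx // tile == tx):
                            cell_id[ny][nx] = ncells
                            q.append((ny, nx))
                ncells += 1
    portals = set()
    for y in range(h):
        for x in range(w):
            if grid[y][x] == 0:
                for dy, dx in ((1, 0), (0, 1)):
                    ny, nx = y + dy, x + dx
                    if (ny < h and nx < w and grid[ny][nx] == 0
                            and cell_id[ny][nx] != cell_id[y][x]):
                        portals.add((cell_id[y][x], cell_id[ny][nx]))
    return cell_id, ncells, portals
```

Raising `tile` yields fewer, larger cells and fewer portals, which is the output-resolution knob described above.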
There are several options on how we could use the portal data. We could do a traditional recursive portal traversal, clipping the view frustum as we go through each portal. Or we could do ray-tracing or cone-tracing in the graph.
The approach we chose is to rasterize the portals using a custom software rasterizer optimized for this purpose. With the rasterizer we need to touch each portal only once, as opposed to recursive portal traversal, which can suffer from exponential blowup if there are a lot of portal intersections in screen space.
(We could also traverse other kind of queries for connectivity.)
Also really useful property of the rasterizer is that it produces a depth buffer as output, which is almost optimal data structure for doing further visibility tests on the query output.
Also with rasterization we can choose the output resolution based on the platform and accuracy requirements.
Since we’re rasterizing portals instead of occluders, it’s trivial to implement conservative rasterization, which is a requirement for getting correct results in lower resolutions.
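A one-dimensional sketch of conservative coverage (illustrative only): a pixel counts as covered if the portal touches any part of it, so lowering the resolution can only grow the covered region, never lose visibility.

```python
import math

def conservative_pixels(x0, x1, res):
    """Conservatively rasterize the interval [x0, x1) of normalized
    screen space [0, 1): mark every pixel the interval touches."""
    first = max(math.floor(x0 * res), 0)
    last = min(math.ceil(x1 * res) - 1, res - 1)
    return set(range(first, last + 1))
```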
SAVE ENGINEERS' AND ARTISTS' TIME
IT’S EASY
PORTABLE
PROVEN
SUPPORT
ENGINEERS DON’T GET TO ROLL THEIR OWN
The game worlds are very large, and in many cases this means that if your occlusion culling system requires any kind of manual work, it is going to be a major burden for your artists.
The worlds are pretty open, so any kind of manual portal placements are pretty much out of the question.
So in this sense, Umbra’s tech suits this use case really really well.
Also a game like the Witcher 3 relies heavily on dynamic streaming of data and LOD’s, both of which were features that we didn’t really support at the time when we started discussing co-operation with the CD Projekt’s team.
So again, here's the process of how Umbra works: polygon soup in, occlusion data out. It works really well in many cases, but in situations where the game worlds are huge there are a couple of problems.
First, on the content authoring side, there might be situations where the artists cannot have the entire world – the source data – in memory at once. So you need to be able to process just a local section of the world individually.
And on the other hand, on the engine runtime the occlusion data for a world that is simply vast might be a bit too much to have in memory at all times.
So we needed to do something about that.
Now I'll tell you about the solution we built; it is pretty simple really. This is obviously a good thing when it comes to design.
So the user is just able to split the game world into these chunks, or tiles. Each of these tiles is just an individual polygon soup.
Then the user can produce individual data sets for each of these tiles as well. This process is up to the user to distribute, so it can run in multiple threads or multiple processes, or even on multiple computers altogether.
Processing...
The end result is that you have a set of streaming tiles and corresponding output occlusion data sets.
And then you can proceed to write these data sets on the disk or do what ever you want with them.
Now in the engine runtime then you have the camera location and typically some sort of a radius inside which the camera will be during the next few frames.
Based on that information you know which ones of your streaming tiles are going to be active, and you can just select the corresponding Tomes.
Once you have the Tomes streamed in, along with the other data you stream in when you select the active streaming tiles, you just combine the Tomes into a Tome Collection.
Then you can use that Tome Collection exactly as you would use an individual Tome. So you perform visibility queries and so forth.
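In pseudocode, the runtime side might look like this (the function names are made up for illustration and are not Umbra's actual API):

```python
def active_tiles(cam_pos, radius, tile_size):
    """Grid coordinates of the streaming tiles overlapped by the circle
    (cam_pos, radius) in which the camera will stay for a few frames."""
    cx, cy = cam_pos
    x0, x1 = int((cx - radius) // tile_size), int((cx + radius) // tile_size)
    y0, y1 = int((cy - radius) // tile_size), int((cy + radius) // tile_size)
    return {(tx, ty) for tx in range(x0, x1 + 1) for ty in range(y0, y1 + 1)}

def build_tome_collection(tomes, tiles):
    """Combine the per-tile occlusion data sets ("Tomes") of the active
    tiles into one collection used for visibility queries."""
    return [tomes[t] for t in sorted(tiles) if t in tomes]
```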
There were a couple of interesting engineering challenges when implementing a system like this, even though it sounds astoundingly simple. I'm not going into full detail on how exactly we did this, just to give you an idea that it's not all PowerPoint animations when you do a system like this.
First of all, the streaming tiles are completely independent from each other. And especially when you are computing the occlusion data for a streaming tile, it would be really nice to access some neighbouring geometry, especially when working near the borders of the tiles. So it would come in handy to know something about the geometry on the other side. Unfortunately this is not possible in a system like this, so we needed a way to circumvent that.
Also, the flipside of this very same issue is that we needed to be able to match those neighbouring tiles together on their borders. That can sometimes be very tricky when you don't know anything about the neighbouring tile. You could for instance have a completely different set of computation parameters used on the other side and still need to be able to combine the data sets.
And obviously, you need to be really, really quick when you do the operation so that it does not hurt the frame rate. In a typical scenario, you don't do this every frame and you get to spend some time over a few frames to do this. Overall, we only had a few milliseconds for the entire operation.
This was quite an interesting engineering operation to undertake.
Right, so LOD’s then. Previously Umbra had no notion about LOD’s. We had polygon soup, some triangles grouped into objects and then there were visibility queries.
Obviously, in any modern 3D engine you need to have support for LOD’s.
It’s not just multiple versions of the same mesh, but you need to support LOD hierarchies and then there is the problem of deciding how the different LOD’s actually contribute to the occlusion. So you don’t want to end up in a situation where different LOD’s of the same mesh occlude each other.
The solution to this one once again sounds pretty simple. So first of all, for occlusion you just use the LOD level that contains most detail.
At this point I should probably make a distinction between an occluder and an occludee. An occluder hides other objects; an occludee is returned as a result from the visibility query.
Then, each LOD level is an occludee in Umbra. And for each of these levels you can specify an active distance range.
So at runtime, when we do the visibility query based on the camera transformation, it's pretty easy to do distance culling based on each occludee's active distance range.
Now, a simple camera distance is not a good criterion in all cases for selecting the active LOD level. For instance, when you do things like zooming, or you look through the scope of a sniper rifle, the distance doesn't change but you still need to use a more detailed LOD. For this purpose we implemented the possibility to scale the LOD distance at runtime. So the user specifies a number between 0 and 1, and all the LOD distances are scaled with that number.
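Putting the distance ranges and the runtime scale factor together, the selection might be sketched like this (illustrative names, not Umbra's API):

```python
import math

def select_lod(cam_pos, obj_pos, lod_ranges, distance_scale=1.0):
    """Pick the LOD whose [near, far) active range contains the scaled
    camera distance. distance_scale < 1.0 (e.g. while zooming) pushes
    the selection toward more detailed LODs without moving the camera."""
    d = math.dist(cam_pos, obj_pos) * distance_scale
    for lod, (near, far) in enumerate(lod_ranges):
        if near <= d < far:
            return lod
    return None  # outside every range: the object is distance-culled
```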
Another similar feature we implemented was the possibility to override the distance reference point entirely.
We considered other criteria for selecting the LOD level as well, such as the proportional screen space area, but so far it seems that the distance to the camera, distance scaling and a modifiable distance reference point are sufficient for all the uses we have encountered. There are currently no plans to change that.
We also considered doing something smarter with the occluder data. Since the occluder generation in Umbra is based on voxelization, we considered doing something like taking an intersection between all the LOD levels and using that as the occluder mesh. But then again, the simpler approach where we use the most detailed mesh seems to be working sufficiently well, so there hasn't been any pressure to change that.
Third person action game for Xbox One from creators of Max Payne and Alan Wake
In-house next generation 3D engine, developed by one of the most respected graphics team in the industry
Average object count per view ~40k without occlusion
Features lots of large scale destruction, semi-dynamic changes of geometry
Previously used GPU occlusion queries for visibility
One aspect of how Remedy is using Umbra that I wanted to talk about is how to deal with dynamically changing geometry.
The first type of changes required in this title are basically transitions from one state to another – I suppose often accompanied by a big collision or explosion.
There’s also cases where a scene is visited at different points in time and where some but not all of the geometry changes from one version to the next.
Both of these are nicely handled by being able to stream the Umbra data chunks in and out. Multiple versions for a given chunk exist to represent different states it is in, and the engine loads the appropriate chunks for the current game state. Parts of the scene that don’t change only have a single version of the Tome data.
While changing the active set of visibility data chunks is not instantaneous, it is something that happens within a couple of frames and therefore poses no problem with these types of transitions.
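The chunk-versioning scheme can be sketched as follows (a conceptual stand-in, not the actual engine code):

```python
class VisibilityStreamer:
    """Visibility data is stored per chunk, with one version of the
    data for each dynamic state the chunk can be in."""
    def __init__(self, versions):
        # versions[chunk_id] = {state_name: visibility_data}
        self.versions = versions

    def active_set(self, chunk_states):
        """Select the data version matching each chunk's current state.
        Chunks that never change carry only a 'default' version."""
        return {chunk: states.get(chunk_states.get(chunk, "default"),
                                  states["default"])
                for chunk, states in self.versions.items()}
```

The engine then links the selected chunks together at runtime to form the visibility data for the current game state.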
One of the harder technical challenges involved in the runtime linking that we had to solve is that it guarantees that there’s never leaks through solid walls even when the wall extends across streaming units. But I’ll save you from the details of that exercise.
So these dynamic scene changes are very similar to what we ended up implementing for the Witcher 3 team for their streaming needs.
Another thing that Remedy does is to cull shadow casters in shadow map rendering with the Umbra Tome data.
You can obviously do occlusion culling in light space just as you would for a normal camera.
But what Remedy is doing is something that turns out to be even more powerful.
Having the Umbra generated occlusion buffer and the visible objects, you can reproject that to light space to create a mask representing the potential shadow receivers for the light source. Shadow casters are then tested against this mask to find if they cast a visible shadow.
In the illustrations here, you see that with occlusion culling only we are still spending a lot of time rendering shadows. This can be avoided by using the visibility data for caster culling as shown on the right.
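In one dimension, the receiver-mask idea can be sketched like this (illustrative only; the real mask is a 2-D light-space buffer built by reprojecting the occlusion buffer):

```python
import math

def receiver_mask(visible_receivers, to_light_x, res):
    """Mark the light-space cells covered by any visible receiver."""
    mask = [False] * res
    for recv in visible_receivers:
        x0, x1 = to_light_x(recv)
        for i in range(max(0, math.floor(x0 * res)),
                       min(res, math.ceil(x1 * res))):
            mask[i] = True
    return mask

def casts_visible_shadow(caster, to_light_x, mask):
    """Render a caster only if its light-space footprint overlaps a
    potential receiver; otherwise its shadow can never be seen."""
    x0, x1 = to_light_x(caster)
    res = len(mask)
    return any(mask[i] for i in range(max(0, math.floor(x0 * res)),
                                      min(res, math.ceil(x1 * res))))
```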
This is a title that needs no introduction.
In case you've been living under a rock, it's a huge production by the creators of Halo, and it launched in September this year for the previous and current gen consoles, using Umbra 3 across the board.
We have worked with Bungie since 2009 and indeed many aspects of Umbra 3 were designed according to Bungie’s requirements.
Bungie has changed the way they build content pretty dramatically from the previous title, they’ve gone from modeling BSPs to be able to arbitrarily splash geometry around.
Their old way of doing visibility by manually placing portals was not going to cut it, so they called us.
As a result of the collaboration the Umbra data is being used not only for visibility but for many other engine systems as well. I’m going to concentrate on visibility on the following slides.
Hard requirements from Bungie
3km x 3km map
Full rebuild: 5 minutes
Smallest incremental update: 10 seconds
In Bungie’s words: ”so much content, so little time”
The enabling feature for rapid content iteration is that the data is local in nature. Bungie distributes the computation task into their build farm, and all intermediate results are shared by everyone in the team.
This also makes sure that you can keep tweaking on content to the very last moment before launch, and even after.
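A minimal sketch of such a shared result cache (conceptual only): key each tile's computation by a hash of its inputs, so unchanged tiles are never recomputed, whether the cache lives on one machine or is shared by the whole build farm.

```python
import hashlib

class TaskCache:
    """Content-addressed cache of per-tile computation results."""
    def __init__(self):
        self.store = {}
        self.computations = 0  # counts actual (non-cached) work

    def compute_tile(self, tile_geometry):
        key = hashlib.sha1(repr(tile_geometry).encode()).hexdigest()
        if key not in self.store:
            self.computations += 1  # expensive preprocessing goes here
            self.store[key] = "occlusion-data-" + key[:8]
        return self.store[key]
```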
I wanted to bring up one cool way that Bungie is optimizing input latency.
The final camera update is sometimes done very late in the frame due to it being dependent on Havok finishing its calculation.
But there are known bounds for how much the camera can move, as we know the current velocity of the camera and its previous location.
Visibility query done with incomplete information.
We built the means of doing a from-region instead of from-point query.
Shrink occluders – grow anti-occluders: still correct.
We call this feature the predicted camera, and it’s pretty much a unique property of our algorithm.
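In one dimension, the conservative trick can be sketched like this (illustrative only): shrink each occluder by the prediction radius, so the query result stays valid from every camera position inside that radius.

```python
def shrink_occluder(interval, prediction_radius):
    """Shrink an occluder interval by the camera prediction radius.
    Occluders that become degenerate are dropped entirely, which is
    conservative: less occlusion can only make more objects visible."""
    x0, x1 = interval
    x0, x1 = x0 + prediction_radius, x1 - prediction_radius
    return (x0, x1) if x0 < x1 else None
```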
Finally, I wanted to highlight another type of support for dynamic changes in occlusion that Umbra 3 supports: the gate objects.
Often there is a need to change the occlusion dramatically, but very locally, for instance by opening a door or a window. As the name implies, gate objects are meant exactly for this purpose.
A gate object is a special input to Umbra that creates a set of portals that can be toggled on and off in runtime.
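Conceptually, a gate just owns a set of portals whose traversability follows the gate's state (names illustrative, not Umbra's API):

```python
class Gate:
    """A toggleable object owning a set of portal ids. While the gate
    is closed, its portals are impassable and the geometry occludes."""
    def __init__(self, portal_ids):
        self.portal_ids = set(portal_ids)
        self.open = True

def traversable_portals(all_portals, gates):
    """Portals the visibility query may walk through this frame."""
    blocked = set()
    for gate in gates:
        if not gate.open:
            blocked |= gate.portal_ids
    return set(all_portals) - blocked
```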
This is mostly used for doors and windows, and there’s limits to what it scales to.
Bungie uses the gates not just for doors, but also to partition space for the other more advanced use cases I mentioned earlier. These uses include audio occlusion, AI activation and other AI operations and broad-phase collision detection.
But I’ll leave these more advanced use cases to another time.