Submit Search
Upload
[153] apache reef
•
27 likes
•
12,162 views
NAVER D2
Follow
DEVIEW 2015 DAY1. apache reef
Read less
Read more
Technology
Slideshow view
Report
Share
Slideshow view
Report
Share
1 of 55
Download now
Download to read offline
Recommended
[152] 웹브라우저 감옥에서 살아남기
[152] 웹브라우저 감옥에서 살아남기
NAVER D2
[164] pinpoint
[164] pinpoint
NAVER D2
[143] 모바일 혈액진단기기 개발 삽질기
[143] 모바일 혈액진단기기 개발 삽질기
NAVER D2
[144]mobile앱에서 효율적인 storage 접근 방법
[144]mobile앱에서 효율적인 storage 접근 방법
NAVER D2
[151] 영상 인식을 통한 오프라인 고객분석 솔루션과 딥러닝
[151] 영상 인식을 통한 오프라인 고객분석 솔루션과 딥러닝
NAVER D2
[161] 데이터사이언스팀 빌딩
[161] 데이터사이언스팀 빌딩
NAVER D2
[142] how riot works
[142] how riot works
NAVER D2
[133] 브라우저는 vsync를 어떻게 활용하고 있을까
[133] 브라우저는 vsync를 어떻게 활용하고 있을까
NAVER D2
Recommended
[152] 웹브라우저 감옥에서 살아남기
[152] 웹브라우저 감옥에서 살아남기
NAVER D2
[164] pinpoint
[164] pinpoint
NAVER D2
[143] 모바일 혈액진단기기 개발 삽질기
[143] 모바일 혈액진단기기 개발 삽질기
NAVER D2
[144]mobile앱에서 효율적인 storage 접근 방법
[144]mobile앱에서 효율적인 storage 접근 방법
NAVER D2
[151] 영상 인식을 통한 오프라인 고객분석 솔루션과 딥러닝
[151] 영상 인식을 통한 오프라인 고객분석 솔루션과 딥러닝
NAVER D2
[161] 데이터사이언스팀 빌딩
[161] 데이터사이언스팀 빌딩
NAVER D2
[142] how riot works
[142] how riot works
NAVER D2
[133] 브라우저는 vsync를 어떻게 활용하고 있을까
[133] 브라우저는 vsync를 어떻게 활용하고 있을까
NAVER D2
[132] rust
[132] rust
NAVER D2
[141] react everywhere
[141] react everywhere
NAVER D2
[134] immersive sound vr
[134] immersive sound vr
NAVER D2
[113] lessons from realm
[113] lessons from realm
NAVER D2
[131] packetbeat과 elasticsearch
[131] packetbeat과 elasticsearch
NAVER D2
[112] 실전 스위프트 프로그래밍
[112] 실전 스위프트 프로그래밍
NAVER D2
[162] jpa와 모던 자바 데이터 저장 기술
[162] jpa와 모던 자바 데이터 저장 기술
NAVER D2
[121]네이버 효과툰 구현 이야기
[121]네이버 효과툰 구현 이야기
NAVER D2
[124] mit cheetah 로봇의 탄생
[124] mit cheetah 로봇의 탄생
NAVER D2
[114] DRC hubo technical review
[114] DRC hubo technical review
NAVER D2
[122] line on apple watch
[122] line on apple watch
NAVER D2
[111] 네이버효과툰어떻게만들어졌나
[111] 네이버효과툰어떻게만들어졌나
NAVER D2
[154] 데이터 센터의 오픈 소스 open compute project (ocp)
[154] 데이터 센터의 오픈 소스 open compute project (ocp)
NAVER D2
[123] quality without qa
[123] quality without qa
NAVER D2
[242] wifi를 이용한 실내 장소 인식하기
[242] wifi를 이용한 실내 장소 인식하기
NAVER D2
[223] h base consistent secondary indexing
[223] h base consistent secondary indexing
NAVER D2
[231] the simplicity of cluster apps with circuit
[231] the simplicity of cluster apps with circuit
NAVER D2
[232] 수퍼컴퓨팅과 데이터 어낼리틱스
[232] 수퍼컴퓨팅과 데이터 어낼리틱스
NAVER D2
[234] 산업 현장을 위한 증강 현실 기기 daqri helmet 개발기
[234] 산업 현장을 위한 증강 현실 기기 daqri helmet 개발기
NAVER D2
[224] 번역 모델 기반_질의_교정_시스템
[224] 번역 모델 기반_질의_교정_시스템
NAVER D2
Systematic Testing for Resource Leaks in Android Applications
Systematic Testing for Resource Leaks in Android Applications
Dacong (Tony) Yan
Resume
Resume
Yuanzhe Cai
More Related Content
Viewers also liked
[132] rust
[132] rust
NAVER D2
[141] react everywhere
[141] react everywhere
NAVER D2
[134] immersive sound vr
[134] immersive sound vr
NAVER D2
[113] lessons from realm
[113] lessons from realm
NAVER D2
[131] packetbeat과 elasticsearch
[131] packetbeat과 elasticsearch
NAVER D2
[112] 실전 스위프트 프로그래밍
[112] 실전 스위프트 프로그래밍
NAVER D2
[162] jpa와 모던 자바 데이터 저장 기술
[162] jpa와 모던 자바 데이터 저장 기술
NAVER D2
[121]네이버 효과툰 구현 이야기
[121]네이버 효과툰 구현 이야기
NAVER D2
[124] mit cheetah 로봇의 탄생
[124] mit cheetah 로봇의 탄생
NAVER D2
[114] DRC hubo technical review
[114] DRC hubo technical review
NAVER D2
[122] line on apple watch
[122] line on apple watch
NAVER D2
[111] 네이버효과툰어떻게만들어졌나
[111] 네이버효과툰어떻게만들어졌나
NAVER D2
[154] 데이터 센터의 오픈 소스 open compute project (ocp)
[154] 데이터 센터의 오픈 소스 open compute project (ocp)
NAVER D2
[123] quality without qa
[123] quality without qa
NAVER D2
[242] wifi를 이용한 실내 장소 인식하기
[242] wifi를 이용한 실내 장소 인식하기
NAVER D2
[223] h base consistent secondary indexing
[223] h base consistent secondary indexing
NAVER D2
[231] the simplicity of cluster apps with circuit
[231] the simplicity of cluster apps with circuit
NAVER D2
[232] 수퍼컴퓨팅과 데이터 어낼리틱스
[232] 수퍼컴퓨팅과 데이터 어낼리틱스
NAVER D2
[234] 산업 현장을 위한 증강 현실 기기 daqri helmet 개발기
[234] 산업 현장을 위한 증강 현실 기기 daqri helmet 개발기
NAVER D2
[224] 번역 모델 기반_질의_교정_시스템
[224] 번역 모델 기반_질의_교정_시스템
NAVER D2
Viewers also liked
(20)
[132] rust
[132] rust
[141] react everywhere
[141] react everywhere
[134] immersive sound vr
[134] immersive sound vr
[113] lessons from realm
[113] lessons from realm
[131] packetbeat과 elasticsearch
[131] packetbeat과 elasticsearch
[112] 실전 스위프트 프로그래밍
[112] 실전 스위프트 프로그래밍
[162] jpa와 모던 자바 데이터 저장 기술
[162] jpa와 모던 자바 데이터 저장 기술
[121]네이버 효과툰 구현 이야기
[121]네이버 효과툰 구현 이야기
[124] mit cheetah 로봇의 탄생
[124] mit cheetah 로봇의 탄생
[114] DRC hubo technical review
[114] DRC hubo technical review
[122] line on apple watch
[122] line on apple watch
[111] 네이버효과툰어떻게만들어졌나
[111] 네이버효과툰어떻게만들어졌나
[154] 데이터 센터의 오픈 소스 open compute project (ocp)
[154] 데이터 센터의 오픈 소스 open compute project (ocp)
[123] quality without qa
[123] quality without qa
[242] wifi를 이용한 실내 장소 인식하기
[242] wifi를 이용한 실내 장소 인식하기
[223] h base consistent secondary indexing
[223] h base consistent secondary indexing
[231] the simplicity of cluster apps with circuit
[231] the simplicity of cluster apps with circuit
[232] 수퍼컴퓨팅과 데이터 어낼리틱스
[232] 수퍼컴퓨팅과 데이터 어낼리틱스
[234] 산업 현장을 위한 증강 현실 기기 daqri helmet 개발기
[234] 산업 현장을 위한 증강 현실 기기 daqri helmet 개발기
[224] 번역 모델 기반_질의_교정_시스템
[224] 번역 모델 기반_질의_교정_시스템
Similar to [153] apache reef
Systematic Testing for Resource Leaks in Android Applications
Systematic Testing for Resource Leaks in Android Applications
Dacong (Tony) Yan
Resume
Resume
Yuanzhe Cai
BDW Chicago 2016 - Jim Scott, Director, Enterprise Strategy & Architecture - ...
BDW Chicago 2016 - Jim Scott, Director, Enterprise Strategy & Architecture - ...
Big Data Week
Webinar slides when open source meets network control planes
Webinar slides when open source meets network control planes
Christian Esteve Rothenberg
The power of polyglot searching
The power of polyglot searching
GraphAware
Power of Polyglot Search
Power of Polyglot Search
Janos Szendi-Varga
MOD2014-Mens-Lecture1
MOD2014-Mens-Lecture1
Tom Mens
From Zero to Lots - ScaleCamp UK 2009
From Zero to Lots - ScaleCamp UK 2009
Josh Devins
A Study of the Evolution Process of PaaS Ecosystem -Abridged Edition-
A Study of the Evolution Process of PaaS Ecosystem -Abridged Edition-
Yuichi Yoda
Current Trends and Challenges in Big Data Benchmarking
Current Trends and Challenges in Big Data Benchmarking
eXascale Infolab
AMP Camp 5 Intro
AMP Camp 5 Intro
jeykottalam
Similar to [153] apache reef
(11)
Systematic Testing for Resource Leaks in Android Applications
Systematic Testing for Resource Leaks in Android Applications
Resume
Resume
BDW Chicago 2016 - Jim Scott, Director, Enterprise Strategy & Architecture - ...
BDW Chicago 2016 - Jim Scott, Director, Enterprise Strategy & Architecture - ...
Webinar slides when open source meets network control planes
Webinar slides when open source meets network control planes
The power of polyglot searching
The power of polyglot searching
Power of Polyglot Search
Power of Polyglot Search
MOD2014-Mens-Lecture1
MOD2014-Mens-Lecture1
From Zero to Lots - ScaleCamp UK 2009
From Zero to Lots - ScaleCamp UK 2009
A Study of the Evolution Process of PaaS Ecosystem -Abridged Edition-
A Study of the Evolution Process of PaaS Ecosystem -Abridged Edition-
Current Trends and Challenges in Big Data Benchmarking
Current Trends and Challenges in Big Data Benchmarking
AMP Camp 5 Intro
AMP Camp 5 Intro
More from NAVER D2
[211] 인공지능이 인공지능 챗봇을 만든다
[211] 인공지능이 인공지능 챗봇을 만든다
NAVER D2
[233] 대형 컨테이너 클러스터에서의 고가용성 Network Load Balancing: Maglev Hashing Scheduler i...
[233] 대형 컨테이너 클러스터에서의 고가용성 Network Load Balancing: Maglev Hashing Scheduler i...
NAVER D2
[215] Druid로 쉽고 빠르게 데이터 분석하기
[215] Druid로 쉽고 빠르게 데이터 분석하기
NAVER D2
[245]Papago Internals: 모델분석과 응용기술 개발
[245]Papago Internals: 모델분석과 응용기술 개발
NAVER D2
[236] 스트림 저장소 최적화 이야기: 아파치 드루이드로부터 얻은 교훈
[236] 스트림 저장소 최적화 이야기: 아파치 드루이드로부터 얻은 교훈
NAVER D2
[235]Wikipedia-scale Q&A
[235]Wikipedia-scale Q&A
NAVER D2
[244]로봇이 현실 세계에 대해 학습하도록 만들기
[244]로봇이 현실 세계에 대해 학습하도록 만들기
NAVER D2
[243] Deep Learning to help student’s Deep Learning
[243] Deep Learning to help student’s Deep Learning
NAVER D2
[234]Fast & Accurate Data Annotation Pipeline for AI applications
[234]Fast & Accurate Data Annotation Pipeline for AI applications
NAVER D2
Old version: [233]대형 컨테이너 클러스터에서의 고가용성 Network Load Balancing
Old version: [233]대형 컨테이너 클러스터에서의 고가용성 Network Load Balancing
NAVER D2
[226]NAVER 광고 deep click prediction: 모델링부터 서빙까지
[226]NAVER 광고 deep click prediction: 모델링부터 서빙까지
NAVER D2
[225]NSML: 머신러닝 플랫폼 서비스하기 & 모델 튜닝 자동화하기
[225]NSML: 머신러닝 플랫폼 서비스하기 & 모델 튜닝 자동화하기
NAVER D2
[224]네이버 검색과 개인화
[224]네이버 검색과 개인화
NAVER D2
[216]Search Reliability Engineering (부제: 지진에도 흔들리지 않는 네이버 검색시스템)
[216]Search Reliability Engineering (부제: 지진에도 흔들리지 않는 네이버 검색시스템)
NAVER D2
[214] Ai Serving Platform: 하루 수 억 건의 인퍼런스를 처리하기 위한 고군분투기
[214] Ai Serving Platform: 하루 수 억 건의 인퍼런스를 처리하기 위한 고군분투기
NAVER D2
[213] Fashion Visual Search
[213] Fashion Visual Search
NAVER D2
[232] TensorRT를 활용한 딥러닝 Inference 최적화
[232] TensorRT를 활용한 딥러닝 Inference 최적화
NAVER D2
[242]컴퓨터 비전을 이용한 실내 지도 자동 업데이트 방법: 딥러닝을 통한 POI 변화 탐지
[242]컴퓨터 비전을 이용한 실내 지도 자동 업데이트 방법: 딥러닝을 통한 POI 변화 탐지
NAVER D2
[212]C3, 데이터 처리에서 서빙까지 가능한 하둡 클러스터
[212]C3, 데이터 처리에서 서빙까지 가능한 하둡 클러스터
NAVER D2
[223]기계독해 QA: 검색인가, NLP인가?
[223]기계독해 QA: 검색인가, NLP인가?
NAVER D2
More from NAVER D2
(20)
[211] 인공지능이 인공지능 챗봇을 만든다
[211] 인공지능이 인공지능 챗봇을 만든다
[233] 대형 컨테이너 클러스터에서의 고가용성 Network Load Balancing: Maglev Hashing Scheduler i...
[233] 대형 컨테이너 클러스터에서의 고가용성 Network Load Balancing: Maglev Hashing Scheduler i...
[215] Druid로 쉽고 빠르게 데이터 분석하기
[215] Druid로 쉽고 빠르게 데이터 분석하기
[245]Papago Internals: 모델분석과 응용기술 개발
[245]Papago Internals: 모델분석과 응용기술 개발
[236] 스트림 저장소 최적화 이야기: 아파치 드루이드로부터 얻은 교훈
[236] 스트림 저장소 최적화 이야기: 아파치 드루이드로부터 얻은 교훈
[235]Wikipedia-scale Q&A
[235]Wikipedia-scale Q&A
[244]로봇이 현실 세계에 대해 학습하도록 만들기
[244]로봇이 현실 세계에 대해 학습하도록 만들기
[243] Deep Learning to help student’s Deep Learning
[243] Deep Learning to help student’s Deep Learning
[234]Fast & Accurate Data Annotation Pipeline for AI applications
[234]Fast & Accurate Data Annotation Pipeline for AI applications
Old version: [233]대형 컨테이너 클러스터에서의 고가용성 Network Load Balancing
Old version: [233]대형 컨테이너 클러스터에서의 고가용성 Network Load Balancing
[226]NAVER 광고 deep click prediction: 모델링부터 서빙까지
[226]NAVER 광고 deep click prediction: 모델링부터 서빙까지
[225]NSML: 머신러닝 플랫폼 서비스하기 & 모델 튜닝 자동화하기
[225]NSML: 머신러닝 플랫폼 서비스하기 & 모델 튜닝 자동화하기
[224]네이버 검색과 개인화
[224]네이버 검색과 개인화
[216]Search Reliability Engineering (부제: 지진에도 흔들리지 않는 네이버 검색시스템)
[216]Search Reliability Engineering (부제: 지진에도 흔들리지 않는 네이버 검색시스템)
[214] Ai Serving Platform: 하루 수 억 건의 인퍼런스를 처리하기 위한 고군분투기
[214] Ai Serving Platform: 하루 수 억 건의 인퍼런스를 처리하기 위한 고군분투기
[213] Fashion Visual Search
[213] Fashion Visual Search
[232] TensorRT를 활용한 딥러닝 Inference 최적화
[232] TensorRT를 활용한 딥러닝 Inference 최적화
[242]컴퓨터 비전을 이용한 실내 지도 자동 업데이트 방법: 딥러닝을 통한 POI 변화 탐지
[242]컴퓨터 비전을 이용한 실내 지도 자동 업데이트 방법: 딥러닝을 통한 POI 변화 탐지
[212]C3, 데이터 처리에서 서빙까지 가능한 하둡 클러스터
[212]C3, 데이터 처리에서 서빙까지 가능한 하둡 클러스터
[223]기계독해 QA: 검색인가, NLP인가?
[223]기계독해 QA: 검색인가, NLP인가?
Recently uploaded
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
UK Journal
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
Principled Technologies
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
The Digital Insurer
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
The Digital Insurer
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
Enterprise Knowledge
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
HampshireHUG
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
Delhi Call girls
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
wesley chun
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
sudhanshuwaghmare1
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
Puma Security, LLC
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
Martijn de Jong
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Drew Madelung
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Miguel Araújo
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
The Digital Insurer
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
apidays
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
Pixlogix Infotech
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
hans926745
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
Enterprise Knowledge
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
Radu Cotescu
Recently uploaded
(20)
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
[153] apache reef
1.
딥러닝에서 스트림처리까지: 빅데이터 분석
메타 프레임워크 Apache(REEF 2015년 9월 14일 전병곤 서울대학교 컴퓨터공학부
2.
REEF • A(meta-framework(that(eases(the(development(of(Big(Data( applications(atop(resource(managers(such(as(YARN(and(Mesos • Apache(Incubator(project(since(August(12,(2014 •
24(committers(– committers(from(SNU,(Microsoft,(Alibaba,( Purestorage,(UCLA,(UC(Berkeley,(UW,(SKT • Release(0.12.0-incubating(on(August(15,(2015
3.
SNU(CMSLab Collaborators • Brian(Cho •
Beom-Yeol Jeon • Joo-Seong Jeong • Dong-Jun(Lee • Yun-SeongLee • Young-Seok Yang • Gyeong-In(Yu • Young-Bin(Bok • Geon-Woo(Kim • Tae-Hun(Kim • Gye-Won(Lee • Woo-Yeon Lee • Tae-Geon(Um
4.
Roadmap • What(is(REEF? • Why(would(you(want(REEF? •
From(REEF(to(CAY
5.
Current(Big(Data(Stack Interactive Query Business Intelligence Stream Processing Machine Learning Batch Processing Distributed File System Resource
Manager (YARN, Mesos, etc.)
6.
Vision:(LegoBstyle(Big(Data(Analytics(Creation
7.
Decomposing(Big(Data(Applications • Control'plane:(coordinates(application(job(tasks,(handles(faults,( provides(heartbeats,(configures(jobs,(etc. • Data'plane:(moves(and(processes(data
8.
Common(Patterns • A(centralized(per-job(scheduler(that(observes(the(runtime(state(and( assigns(tasks(to(resources • A(runtime(for(executing(compute(tasks(and(retaining(state(in(an( organized(fashion •
Communication(channels(for(monitoring(status(and(sending(control( messages • Configuration(management(for(passing(parameters(and(binding( application(interfaces(to(runtime(implementation
9.
REEF Interactive Query Business Intelligence Stream Processing Machine Learning Batch Processing ! Reusable(control(plane(for(coordinating(data(plane(tasks ! Adaptation(layer(for(resource(managers !
Container(and(state(reuse(across(tasks(from(heterogeneous(frameworks ! Simple(and(safe(configuration(management ! Scalable(local,(remote(event(handling ! Java(and(C#((.NET)(support Distributed File System Resource Manager Services, Data Processing Lib REEF
10.
Architectural(Choices(Summary 1. Scalable(master-workers(control(plane 2. Event(processing(framework:(Wake 3.
Configuration(using(dependency(injection:(Tang 4. Container(and(state(reuse(across(heterogeneous(frameworks
11.
REEF(Components Driver Service Task Context REEF(application( component REEF(runtime( infrastructure Service
12.
REEF(Runtime(Infrastructure Driver(Runtime Hosts(the(application(control-flow(logic( implemented(in(the(Driver(module Translates(Driver(resource(actions(to(the( underlying(resource(manager(protocol(
13.
REEF(Runtime(Infrastructure Context A(state(management(environment( within(an(Evaluator,(that(is(accessible(to( any(Task(hosted(on(that(Evaluator Evaluator A(runtime(environment(on(a(container(that( can(retain(state(within(Contexts(and( execute(Tasks [ Task(Runtime(]
14.
REEF(Application(Component Driver Application(code(that(implements(the( resource(allocation(and(task(scheduling( components. 1. Runtime(Events 2. Evaluator(Events 3.
Task(Events Task Application(code(executed(in(an( Evaluator.( Task(has(access(to(its(configuration(parameters(and(the Evaluator(state,(which(is(exposed(as(Contexts
15.
REEF(Runtime(Infrastructure(and( Application(Component Service Factors(out(core(functionalities(that( can(be(re-used(across(a(broad(range(of( applications;(adds(extensibility
16.
REEF(Services Storage Key-value(store DataLoader,(OutputService Network Name-based(messaging Elastic(group(communication State Management Checkpointing Monitoring Profiling,(HTTP(server
17.
REEF(Control(Flow:(Job(Life(Cycle Client public class DistributedShell
{ ... public static void main(String[] args)3{ ... Configuration4driverConf = DriverConfiguration.CONF. ... build(); ... reef.submit(driverConf); } } Resource(Manager Name(Node Data(Node HDFS Node(Manager YARN
18.
Client public class DistributedShell
{ ... public static void main(String[] args)3{ ... Configuration4driverConf = DriverConfiguration.CONF. ... build(); ... reef.submit(driverConf); } } REEF(Control(Flow
19.
public class DistributedShellJobDriver
{ private final EvaluatorRequestor requestor; ... public void onNext(StartTime time) { requestor.submit( EvaluatorRequest.Builder() .setSize(SMALL).setNumber(2) .build() ); } ... } REEF(Control(Flow Client
20.
public class DistributedShellJobDriver
{ private final EvaluatorRequestor requestor; ... public void onNext(AllocatedEvaluator eval) { Configuration contextConf = ...; eval.submitContext(contextConf) } ... } REEF(Control(Flow Client
21.
context config + REEF(Control(Flow Client
22.
REEF(Control(Flow Client
23.
public class DistributedShellJobDriver
{ private final String cmd = “dir”; [...] public void onNext(ActiveContext ctx) { final String taskId = [...]; Configuration taskConf = Task.CONF .set(IDENTIFIER, "ShellTask") .set(TASK, ShellTask.class) .set(COMMAND, this.cmd) .build(); ctx.submitTask(taskConf); } [...] } task config REEF(Control(Flow Client
24.
class ShellTask implements
Task4{ private final String command; @Inject ShellTask(@Parameter(Command.class) String c) { this.command = c; } private String exec(final String command){ ... } @Override public byte[] call(byte[] memento) { String s = exec(this.cmd); return s.getBytes(); } } REEF(Control(Flow Client
25.
REEF(Control(Flow Client
26.
Retains( State! REEF(Control(Flow Client
27.
task config REEF(Control(Flow Client
28.
REEF(Control(Flow Client
29.
REEF(Control(Flow Client
30.
Supported(Execution(Environments • Local(Machine:(testing • YARN:(Hadoop(resource(manager((containers) •
Mesos:(resource(manager(started(at(UC(Berkeley((resource(offers) • HDInsight:(Azure(Hadoop(offering
31.
Wake • Push-based(event(handlers,(observers • User(code:(Wake(event(handlers(that(react(to(various(REEF(events •
Stage(to(execute(event(handlers:(synchronous(or(asynchronous( execution • Thread(management • Scalable(remote(messaging
32.
Tang • Configuration(using(dependency(injection(that(restricts(dynamic(bind( patterns • Tang(specification:(binding(patterns •
Bind(an(implementation(to(an(interface • Bind(a(value(to(a(configuration(parameter( • Tang(configuration(language(semantics( • “Configuration”(are(just(data:(cross-language(support • Configuration(options(can(be(set(at(most(once:(Immutability • A(large(subset(of(Tang’s(public(API(is(commutative
33.
REEF(Implementation C# Java CPP
Total Tang Wake REEF Services 8,951 4,677 13,821 4,527 7,454 5,268 24,281 11,620 0 0 1,957 0 16,405 9,945 40,059 16,147 Total 31,976 48,623 1,957 82,556 Lines(of(code(by(component(and(language((September(11,(2015) Applications(can(mix(and(match(Driver(side(event(handlers( in(Java(and(C#(with(any(number(of(Java(and(C#(Evaluators A(distributed(control(flow(framework(that(provides integration(across(language(boundaries
34.
Get(Started(with(REEF
35.
REEF(Project(Pointers • REEF(web(site:(http://reef.incubator.apache.org • Github
mirrored(site:(https://github.com/apache/incubator-reef • Some(basic(examples(to(start(with • HelloREEF*:(HelloREEF,(HelloREEFYarn,(HelloREEFMesos,(… • TaskScheduler:(job(scheduler(to(schedule(embarrassingly(parallel(jobs(and( scale(out(and(in(resources(to(handle(fluctuating(demands • Tutorials:(https://cwiki.apache.org/confluence/display/REEF/Tutorials
36.
Work(with(Your(Favorite(IDE
37.
Why(Would(You(Want(REEF?
38.
Microsoft(Azure(Services(on(REEF Azure(Stream(Analytics((ASA):( a(fully(managed(stream(processing(service( offered(in(the(Microsoft(Azure(Cloud
39.
Azure(Stream(Analytics(on(REEF • REEF(Driver • Compiles(and(optimizes((taking(user(budget(into(consideration)(a(query(into(a(data- flow(of(processing(stages •
Uses(the(stage(data-flow(to(formulate(a(request(for(resources:(an(Evaluator(per(stage( instance • Driver(restartability • REEF(Task • Executes(the(stage(instance(work(on(an(assigned(partition • REEF(Services(used • Communication(service(built(on(Wake(that(allows(sending(messages(with(logical( identifiers • Checkpointingservice
40.
VMWare(CORFU(on(REEF • CORFU((NSDI(2012):(a(distributed(logging(tool(providing(applications( with(consistency(and(transactional(services(at(extremely(high( throughput • CORFU(master(deployed(in(a(Driver •
A(logging(unit(fails,(the(Driver(is(informed,(and(the(CORFU(master(reacts • REEF(Service(for(triggering(checkpointing and(restarting(a(Driver(from( a(checkpoint( • REEF(decouples(CORFU’s(resource(deployment(from(its(state,(allowing( CORFU(to(be(elastic
41.
Why(Would(You(Want(REEF? • New(data(processing(framework • Cloud(Data(Runtime((CDR) •
Iterative(Map-Reduce-Update((IMRU) • Elastic(ML(in(CAY • New(system(services( • Elastic(“memory”,(“communication”,(and(“cache”(in(CAY • Vortex:(harnessing(transient(cluster(resources(in(CAY
42.
How(Does(REEF(Make(Life(Easier? • REEF(provides(high-level(component(abstraction:( Driver,(Task,(and(Service • REEF(application(developers(simply(write(handlers(that(react(to( various(events •
REEF(runtime(handles(common(data(processing(control-plane( functionalities
43.
What’s(Next? • We’re(ready(to(graduate(from(the(Incubator • Expected(to(be(the(3rd
Apache(TLP(project(initiated(from(Korea( (after(Hama,(Tajo) • More(production(adoption • New(REEF(primitives • More(systems(built(on(top(of(REEF
44.
From(REEF(To(CAY
45.
CAY:(A(Unified(Big(Data(Analytics(Stack • 미래창조과학부 SW스타랩
프로젝트:( (SW(스타랩)(빅데이터(플랫폼(연구실 • Unification:(development,(operation • Big(Data(operating(system(layer • Unified(data(management • Unified(programming • New/enhanced(data(processing(engines
46.
Elastic(Memory • Elastically(change(memory(resources(for(in-memory(big(data(analytics • Solves(the(problem(of(the(static(allocation(of(resources(for(low- latency(interactive(services,(machine(learning,(etc.
47.
Elastic(Communication • Elastically(configure(communication(among(nodes • Elastic(shuffle •
Elastic(group(communication • Scatter,(gather,(broadcast,(reduce,(…
48.
Elastic(Machine(Learning((SNU,(Microsoft) • Goal:(optimize(large-scale(ML(execution(automatically • Built(on(“elastic”(system(services •
Elastic(ML(runtime(&(optimizer • A(new(breed(of(elastic(Machine(Learning(algorithms(
49.
LargeBscale(Deep(Learning((SNU,(SKT) Evaluator Model,replica Evaluator Model,replica Evaluator Model,replica 1 2 k Parameter,server
50.
LargeBscale(Deep(Learning((SNU,(SKT) *(Will(be(open(sourced(under(the(Apache(License(2 EvaluatorEvaluator Sub(ModelSub(Model EvaluatorEvaluator Sub(ModelSub(Model 1 2 1 A 1 B 2 A 2 B Parameter(server
51.
Stream(processing • 초대용량 마이크로
스트림 질의 처리 플랫폼 (삼성미래기술육성사업) • Applications • Personalized(personal(digital(assistant • Real-time(disease(control • Smart(home,(smart(building
52.
Vortex((SNU,(Microsoft(Research) • Harness(volatile(resources(available(in(the(datacenter • Large(idle(resources(in(production(systems •
Massively(parallel(tasklets • Rearchitect distributed(analytics(execution((e.g.,(data(flow(graph( execution)(to(perform(well(in(such(volatile(environments
53.
References • Web(links • http://reef.incubator.apache.org((REEF(project(site) •
http://cmslab.snu.ac.kr (My(research(lab(site) • Publication • REEF:(Retainable(Evaluator(Execution(Framework,(ACM(SIGMOD(2015. • Making(Sense(of(Performance(in(Data(Analytics(Frameworks, USENIX(NSDI(2015 • Elastic(Memory:(Bring(Elasticity(Back(To(In-Memory(Big(Data(Analytics,(HotOS 2015.
54.
We(are(hiring(developers(and( postdoc(researchers! Contact:(Byung-Gon Chun((bgchun@snu.ac.kr) Seoul(National(University http://cmslab.snu.ac.krCMS Lab
55.
딥러닝에서 스트림처리까지: 빅데이터 분석
메타 프레임워크 Apache(REEF 2015년 9월 14일 전병곤 서울대학교 컴퓨터공학부 bgchun@snu.ac.kr
Download now