EEDC                          34330ExecutionEnvironments for                                      Part 0.Distributed      ...
ContentPart 0. Course Introduction   0.1. Welcome   0.2. Course Organization   0.3. Course Content and Motivation   0.4. S...
EEDC                          34330ExecutionEnvironments for                     0.1DistributedComputing                  ...
Professor Background (Who am I?) Who am I?  – Current Teaching Activity:      • System Performance Evaluation (FIB)      •...
Welcome          And you?             5
6
EEDC                          34330ExecutionEnvironments for                                       0.2Distributed         ...
Syllabus Stuff (official course description) “Guia docente oficial” (Spanish) : http://docencia.ac.upc.edu/master//es/cour...
Syllabus Stuff (official course description) METODOLOGÍA:    Esta asignatura es avanzada y los conceptos básicos se    sup...
Syllabus Stuff (official course description) De las 150        horas de trabajo, la distribución será:  –   60 horas (4 ho...
Where to find the information Teacher contact :     Office: Campus Nord, Block C6, office 217     Phone: +34 93 401 7223  ...
Students EEDC web site                 12
EEDC as a network of contacts (LinkedIn)                   13
This course provides an overview of the wide scope of this area and introduces past and current research focusing on conce...
Content paper readings assigned for some of the classes. We will have two main reading parts in the course,  – one for gen...
This year’s special focus Current Execution Environment of Distributed Systems:  Explosion of Cloud Computing, dominated b...
Syllabus Stuff  “state-of-the-field” review    All students taking the course are required to complete a    research proje...
Tentative Grading Policy (*) Paper Readings/Presentations (and homework): 35%  – Delivery: 20 %  – Content: 10 %  – Public...
GROUPS (*) Paper Reading/Presentation: 8 group x                                                 3 members Research Projec...
EEDC                          34330ExecutionEnvironments forDistributedComputing                            0.3Master in C...
Today landscape Today’s applications: – Heterogeneous set of                                        T10         T20   appl...
Today landscape Types of applications  – This set of applications includes:     • Interactive web workload     • Non-inter...
Today landscape Type of applications – The SLA for different applications tend to be based on   different characteristics:...
Example (commercial data centers) Current use:  – Resource provisioning: maximum granularity is a node  – Off peak periods...
Execution Environment      To efficiently execute all these kinds of applications,      a new execution environment is nee...
How do these applications access resources?      Solution: software layer       abstracts the details in a heterogeneous d...
Execution Environment or Middleware? Middleware is a somewhat overloaded term!!!!!! For this course: – Generally speaking,...
Different layers Who is responsible for managing resources? Data Center Infrastructure                      …             ...
EEDC Content (2012) Part 0: Introduction Part 1 : Distributed Computing Scenario Part 2: Current Trends in EEDC: Cloud Com...
EEDC Workshop series                30
EEDC open seminars A series of seminars with the view of IT industry and IT entrepreneurs. (Invited speakers) Aimed to sti...
EEDC open seminars Tentative 2012 EEDC Open Seminar case study areas:  – “Infrastructure as a Service”  – “Software as a S...
Tentative schedule                     33
EEDC                          34330ExecutionEnvironments forDistributed                             0.4ComputingMaster in ...
Procedure for delivering homework1. Read the assigned documentation/paper  –   Each of you have to read the document/paper...
Procedure for presenting homework1. Prepare the presentation and bring it to the class   with your «pendrive» or similar  ...
First homework: Wark-up Project:  – prepare a presentation about an easy topic: «Distributed Systems» Characteristics:  – ...
EEDC                          34330ExecutionEnvironments forDistributed                            0.5ComputingMaster in C...
Practical view Source of research information  –   Patents  –   Journals  –   Technical Magazines  –   Conferences  –   Wo...
Patents          40
Google Scholar                 41
Google Scholar                 42
Conferences vs Journals Conferences have higher status in computer systems  – Note that, in computer systems, the top conf...
Conferences vs Journals In most scientific fields, journals have higher standards than conferences; computer science is a ...
Journal Citation Report                   45
IEEE Journals & Magazines                 46
ACM Journals and Magazines                 47
Conference CFP                 48
Case Study: Middleware 2011                 49
Case Study: Middleware 2011  Call for Papers  Call for Posters  Call for Workshop Proposals  Call For Industrial Track Pap...
The other side: E-Energy conference http://events.networks.imdea.org/content/e-energy-2012/home                           ...
The other side: E-Energy conference                  52
The other side: E-Energy conference                  53
Case study: One big conference                  54
Upcoming SlideShare
Loading in …5
×

Part.0.eedc

481 views
448 views

Published on

Published in: Technology, Education
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
481
On SlideShare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
9
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Part.0.eedc

  1. 1. EEDC 34330ExecutionEnvironments for Part 0.Distributed CourseComputingMaster in Computer Architecture,Networks and Systems - CANS Introduction
  2. 2. ContentPart 0. Course Introduction 0.1. Welcome 0.2. Course Organization 0.3. Course Content and Motivation 0.4. Students presentation warm-up 0.5. Conferences and Journals in Computers Science. How its work? 2
  3. 3. EEDC 34330ExecutionEnvironments for 0.1DistributedComputing WelcomeMaster in Computer Architecture,Networks and Systems - CANS
  4. 4. Professor Background (Who am I?) Who am I? – Current Teaching Activity: • System Performance Evaluation (FIB) • Execution Environments for Distributed Computing (CANS) – Research interests : IT Resource Management • Cloud Computing, Green Computing, Big Data … – Research groups • Professor at High Performance Computing Group at DAC (UPC). • Manager of Autonomic Systems and eBusiness Platforms research group at BSC. – Other interests: • I enjoy talking about technology and mountain biking – For more details : www.JordiTorres.org 4
  5. 5. Welcome And you? 5
  6. 6. 6
  7. 7. EEDC 34330ExecutionEnvironments for 0.2Distributed CourseComputingMaster in Computer Architecture, MotivationNetworks and Systems - CANS & Organization
  8. 8. Syllabus Stuff (official course description) “Guia docente oficial” (Spanish) : http://docencia.ac.upc.edu/master//es/course.24.html 2006 OBJETIVOS/OBJECTIVES (“absolete”): El objetivo del curso es ofrecer una visión de los desarrollos y la investigación en los entornos de ejecución para sistemas paralelos y distribuidos. En el curso se hará especial énfasis en los entornos requeridos en las nuevas aplicaciones ebusiness y grid. Se estudiará la organización interna y la gestión que recursos requerida para garantizar funcionalidades y calidad de servicio. 8
  9. 9. Syllabus Stuff (official course description) METODOLOGÍA: Esta asignatura es avanzada y los conceptos básicos se suponen ya conocidos en asignaturas anteriores y está enfocada principalmente a introducir al estudiante en los conceptos más avanzados. Con clases magistrales se introducirán los temas que se trabajarán sobre artículos recientes publicados en congresos o revistas especializadas y propuestas de proyectos en desarrollo. Los alumnos deberán realizar la lectura y discusión de estos artículos. Los alumos realizaran un trabajo personal y presentaran un tema de investigación relacionado con la asignatura. El estudiante tendrá que desarrollar su capacidad crítica para evaluar la adecuación de las soluciones presentadas o, en su caso, proponer posibles ideas. 9
  10. 10. Syllabus Stuff (official course description) De las 150 horas de trabajo, la distribución será: – 60 horas (4 horas semana, 15 semanas) de clase presencial: se distribuirán en: • clase magistrales, • trabajos, • casos de estudio y debate de artículos de investigación • presentación de trabajos – 90 horas de trabajo no presencial: • lectura de artículos de investigación, • realización de trabajos, • preparación de las presentaciones • y estudio. 10
  11. 11. Where to find the information Teacher contact : Office: Campus Nord, Block C6, office 217 Phone: +34 93 401 7223 Email: torres@ac.upc.edu Course Slides web site http://www.jorditorres.org/teaching/eedc-2011-execution- environments-for-distributed-computing/eedc2012-slides/ Students EEDC web site http://www.jorditorres.org/news/ Official web site for personal and confidential information “RACÓ” at http://www.fib.upc.edu 11
  12. 12. Students EEDC web site 12
  13. 13. EEDC as a network of contacts (LinkedIn) 13
  14. 14. This course provides an overview of the wide scope of this area and introduces past and current research focusing on conceptual and practical aspects. The course also has the objective of introducing the student to research. In general the EEDC course is focused on developing skills rather than content. For this reason each year we focus the course on some of the dichotomies which come to mind when looking at the strategies available to deal with concrete problems in the wider EEDC space and in relation to some problem of the utmost relevance. 14
  15. 15. Content paper readings assigned for some of the classes. We will have two main reading parts in the course, – one for general papers (and homework) and – one for more in depth research in the area (related with the research project). All students taking the course are required to complete a research project (or “state-of-the-field” review). Discussions will be led by one or more students and may include brief presentations. Moreover we will introduce the student to the research information resources. We expect also to have seminars by IT companies. 15
  16. 16. This year’s special focus Current Execution Environment of Distributed Systems: Explosion of Cloud Computing, dominated by the boom of available information (Big Data) and all in a sustainable way (Green Computing). 16
  17. 17. Syllabus Stuff “state-of-the-field” review All students taking the course are required to complete a research project (or “state-of-the-field” review). The project is intended to provide the student with an opportunity to gain experience with research in a topic related to the content of the course. Moreover we will introduce the student to the research information resources 17
  18. 18. Tentative Grading Policy (*) Paper Readings/Presentations (and homework): 35% – Delivery: 20 % – Content: 10 % – Public presentation: 5% Research Project: 35% – Research survey content and writing 20% – Presentation of the workshop paper 10% Participation: 35% – Class/Seminars participation (in paper discussions) 10% – Class attendance: 25% (*) pending to know the final number of students enrolled. Default organization of course have to be changed 18
  19. 19. GROUPS (*) Paper Reading/Presentation: 8 group x 3 members Research Project Group: 1 or 2 students (*) pending to know the final number of students enrolled. Default organization of course have to be changed 19
  20. 20. EEDC 34330ExecutionEnvironments forDistributedComputing 0.3Master in Computer Architecture,Networks and Systems - CANS Course Content
  21. 21. Today landscape Today’s applications: – Heterogeneous set of T10 T20 applications with different T30 T40 characteristics and service T50 T11 T21 goals: T31 T41 • Characteristics: Streaming T51 CPU, I/O, network, Workflow memory… intensive Hardware • Service goals: Transactional Response time, throughput, fault-tolerant, deadlines… Indexing Big Data 21
  22. 22. Today landscape Types of applications – This set of applications includes: • Interactive web workload • Non-interactive workloads such document indexing or data-intensive jobs. • Scientific applications that range from a single thread intensive CPU consuming to multi-threading applications. •… – Current workloads: based on a heterogeneous set of applications that deliver critical services to their customers (SLA). 22
  23. 23. Today landscape Type of applications – The SLA for different applications tend to be based on different characteristics: • Performance goals: – For interactive : response time or … – For non interactive : for example completion time. • Time scale of managing: – Non-interactive workloads typically require computation for an extended period of time. – Interactive workloads present a short duration for individual requests and require short control cycles. • Clients priority: e.g. Gold, Silver, Platinum, Standard. • … and others 23
  24. 24. Example (commercial data centers) Current use: – Resource provisioning: maximum granularity is a node – Off peak periods: wasted computing power – Set of applications: time partition (if possible!) • Transactional on work hours • Batch on nights Example: Web workload Challenges: Workload intensity – Reduce waste of resources – Increase provisioning granularity – Management of mixed workloads – Service level guarantees – Better exploitation of resources – … Day 1 Day 2 Day 3 24
  25. 25. Execution Environment To efficiently execute all these kinds of applications, a new execution environment is needed StreamingT10 T20 T40 T30 Grid T50 T11 T21 Big Data T31 T41 T51 Transactional Indexing 25
  26. 26. How do these applications access resources? Solution: software layer abstracts the details in a heterogeneous distributed computing environment. StreamingT10 T20 T40 T30 Grid T50 T11 T21 Big Data T31 T41 T51 Transactional Indexing 26
  27. 27. Execution Environment or Middleware? Middleware is a somewhat overloaded term!!!!!! For this course: – Generally speaking, middleware provides software services for application programs INCLUDING the basic operating system and networking services above the resources. USER Application Software services Operating System and Networking Services Resources 27
  28. 28. Different layers Who is responsible for managing resources? Data Center Infrastructure … Web Services Aplication Server Applications Web server Java Virtual Machine System Operating System Virtualization layer Hardware … MIDDLEWARE 28
  29. 29. EEDC Content (2012) Part 0: Introduction Part 1 : Distributed Computing Scenario Part 2: Current Trends in EEDC: Cloud Computing as a core part Part 3: Scientific EEDC Part 4: Energy Challenges in today EEDC: Green Computing Part 5: Big Data challenges in EEDC Part 6: Other challenges in today EEDC Part 7: EEDC Open Seminars Part 8: Fourth EEDC Workshop 29
  30. 30. EEDC Workshop series 30
  31. 31. EEDC open seminars A series of seminars with the view of IT industry and IT entrepreneurs. (Invited speakers) Aimed to stimulate intellectual conversations about real cases related with the course as well as to allow an opportunity to learn more about each “case study” presented through Q&A session. The seminars are open to FIB community 31
  32. 32. EEDC open seminars Tentative 2012 EEDC Open Seminar case study areas: – “Infrastructure as a Service” – “Software as a Service” – “Green Computing” – “Open Data” – “Big Data” 32
  33. 33. Tentative schedule 33
  34. 34. EEDC 34330ExecutionEnvironments forDistributed 0.4ComputingMaster in Computer Architecture, StudentsNetworks and Systems - CANS Presentation Warm-up
  35. 35. Procedure for delivering homework1. Read the assigned documentation/paper – Each of you have to read the document/paper – Meet with your team to discuss and do the homework2. Build the presentation – Consider the specifications for the corresponding homework – Be sure to follow the EEDC template – Be sure to include your names on the first slide3. Upload the presentation on to slideshare (or similar) – Create your account if necessary at http://www.slideshare.net4. Link your presentation to EEDC students website – You will have an account at www.JordiTorres.org/news – Find the correct post and edit it to link the presentation5. Check that everything is ok before the deadline 35
  36. 36. Procedure for presenting homework1. Prepare the presentation and bring it to the class with your «pendrive» or similar – Optionally you could use the uploaded presentation2. Only two (*) groups will present the paper/homework – The groups will be chosen at random3. There will be a Q&A session after presentations – This will be considered as class participation and count towards final marks4. After the presentation listeners will choose the best presentation (*)(*) In some cases only 1 group will present 36
  37. 37. First homework: Wark-up Project: – prepare a presentation about an easy topic: «Distributed Systems» Characteristics: – A 15 minute presentation (aprox.). Source of information? – For this homework you could use any information on the Internet (e.g. Wikipedia) – Where can we find more? (next day!) Delivery deadline: – Thursday 23th February at 11:00am Presentation: – Thursday 23th February at 12:00am ( class room) 37
  38. 38. EEDC 34330ExecutionEnvironments forDistributed 0.5ComputingMaster in Computer Architecture, ConferencesNetworks and Systems - CANS & Journals
  39. 39. Practical view Source of research information – Patents – Journals – Technical Magazines – Conferences – Workshops – Others Why to prefer Conferences? Why to prefer Journals? Other important issues Case Studies 39
  40. 40. Patents 40
  41. 41. Google Scholar 41
  42. 42. Google Scholar 42
  43. 43. Conferences vs Journals Conferences have higher status in computer systems – Note that, in computer systems, the top conferences are more important than even the top journals. – (the best researchers want to send their papers to conferences rather than journals) Conferences have higher quality in computer systems – The top conferences use a rigorous review process in which (3-7) program committee members evaluate each submitted paper. – Furthermore, these conferences often "shepherd" the accepted papers, i.e. the program committee members supervise the revision of the accepted papers according to the reviewers comments. – The top conferences in computer systems typically accept 10%- 20% of the submitted papers. (Example EuroSys 27/178 = 15%) 43
  44. 44. Conferences vs Journals In most scientific fields, journals have higher standards than conferences; computer science is a rare exception. Journals may have longer page limits. A journal paper could recap or given an overview of an entire research area. The journal version of a publication will be cited more than the conference version, because the journal version has a later date and thus seems more authoritative. 44
  45. 45. Journal Citation Report 45
  46. 46. IEEE Journals & Magazines 46
  47. 47. ACM Journals and Magazines 47
  48. 48. Conference CFP 48
  49. 49. Case Study: Middleware 2011 49
  50. 50. Case Study: Middleware 2011 Call for Papers Call for Posters Call for Workshop Proposals Call For Industrial Track Papers Program Accepted Papers Accepted Poster Papers Workshops Tutorials Doctoral Symposium Keynotes Important dates Organization Venue 50
  51. 51. The other side: E-Energy conference http://events.networks.imdea.org/content/e-energy-2012/home 51
  52. 52. The other side: E-Energy conference 52
  53. 53. The other side: E-Energy conference 53
  54. 54. Case study: One big conference 54

×