Spark makes it easy to build and deploy complex data processing applications onto shared compute platforms, but tuning them is a skill in itself and can get overlooked. Uncontrolled, this leads to over-specified resource requirements and unnecessary platform load, and increases the chances of resource contention, degrading overall performance. By identifying inefficient jobs, development teams and platform administrators can wrestle back control of system resources, improve efficiency and lessen the effect of contention across the cluster.
Sparklint uses the Spark metrics stream and a custom event listener to analyze individual Spark jobs for over-specified or unbalanced resources, incorrect partitioning and sub-optimal worker locality. It is easily attached to any Spark job and can also run standalone against historical event logs, presenting data for analysis through a web UI and providing a unique resource-focused view of the application runtime.
3. Why Sparklint?
• A successful Spark cluster grows rapidly
• Capacity and capability mismatches arise
• Leads to resource contention
• Tuning process is non-trivial
• Current Spark UI is operations-focused
We wanted to understand application efficiency
4. Sparklint provides:
• Live view of batch & streaming application stats
or
• Event-by-event analysis of historical event logs
• Stats and graphs for:
– Idle time
– Core usage
– Task locality
8. Demo…
• Simulated workload analyzing site access logs:
– read text file as JSON
– convert to Record(ip, verb, status, time)
– countByIp, countByStatus, countByVerb
9. Job took 10m7s to finish
Already pretty good distribution; low idle time indicates good worker usage and minimal driver-node interaction in the job.
But overall utilization is low, which is reflected in the common occurrence of the IDLE state (unused cores).
10. Job took 15m14s to finish
Core usage increased and the job is more efficient; execution time increased, but the app is not CPU bound.
11. Job took 9m24s to finish
Core utilization decreased proportionally, trading efficiency for execution time.
Lots of IDLE state shows we are over-allocating resources.
12. Job took 11m34s to finish
Core utilization remains low; the config settings are not right for this workload.
Dynamic allocation is only effective at app start due to the long executorIdleTimeout setting.
13. Job took 33m5s to finish
Core utilization is up, but execution time is up dramatically due to reclaiming resources before each short-running task.
The IDLE state is reduced to a minimum and looks efficient, but execution is much slower due to dynamic allocation overhead.
14. Job took 7m34s to finish
Core utilization is way up, with lower execution time.
Parallel execution is clearly visible in overlapping stages.
Flat tops show we are becoming CPU bound.
15. Job took 5m6s to finish
Core utilization decreases; again we trade efficiency for execution time.
16. Thanks to dynamic allocation, utilization is high despite this being a bi-modal application.
Data loading and mapping require a large core count to get throughput.
Aggregation and IO of results are optimized for the end file size and therefore require fewer cores.
Spark cluster success
Platform rolls out with a maximum supported load.
Early projects ramp up, usage is fine
Early successes feed back into recommendations to use the platform
New users start loading up the platform just as initial successes are being scaled
Platform limits hit, scaling requirements now begin to be understood and planned for
Rough times while platform operations learns to stay ahead of application usage
◦ the Spark UI provides masses of info, but by default only for recent jobs/stages/tasks while the job is alive
◦ when serving the Spark UI from the history server, there is still little summary information to debug the job config: have I used the right magic numbers (locality wait, cores, numPartitions, job scheduling mode, etc.)?
◦ difficult to compare different executions of the same job due to this missing level of summary (execution time is almost the only metric to compare)
◦ A mechanism to listen to the Spark event log stream and accumulate lifetime stats without losing (too many) details, using constant memory in live mode thanks to the gauges we use
◦ The mechanism also provides convenient replay when serving from a file
◦ A set of stats and graphs to describe job performance uniformly:
1. idle time (the duration when all calculation happens on the driver node; something to avoid)
2. max core usage and core usage percentage (should be neither too high nor too low; we are thinking about supplementing it with the average number of tasks in wait)
3. task execution time for a given stage by locality (which honestly describes the opportunity cost of lower locality and indicates the ideal locality wait config)
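The constant-memory idea above can be sketched as a simple gauge that consumes task start/end events and keeps only running aggregates, so memory does not grow with the number of events. This is an illustrative sketch, not Sparklint's actual internals; the class and field names are hypothetical:

```scala
// Hypothetical constant-memory gauge: folds each event interval into running
// totals instead of storing the event history.
class CoreUsageGauge(totalCores: Int, startTime: Long) {
  private var running = 0               // tasks currently executing
  private var lastEventTime = startTime // timestamp of the previous event, ms
  private var busyCoreMillis = 0L       // accumulated core-milliseconds of work
  private var idleMillis = 0L           // wall time with zero running tasks

  // Fold the interval since the previous event into the aggregates.
  private def advance(now: Long): Unit = {
    val elapsed = now - lastEventTime
    busyCoreMillis += elapsed * math.min(running, totalCores)
    if (running == 0) idleMillis += elapsed
    lastEventTime = now
  }

  def onTaskStart(time: Long): Unit = { advance(time); running += 1 }
  def onTaskEnd(time: Long): Unit = { advance(time); running -= 1 }

  // Average fraction of allocated cores doing work between startTime and now.
  def coreUsagePercent(now: Long): Double = {
    advance(now)
    busyCoreMillis.toDouble / (totalCores.toDouble * (now - startTime))
  }

  def idleTimeMillis: Long = idleMillis
}
```

In live mode such a gauge is fed from the listener callbacks; in replay mode it is fed the same events read back from the event log file.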
We use ReduceByKey.scala from the repo as a sample to demo a series of attempts at optimizing a Spark application. The logs are included as well. The highlights of each run are annotated in the screenshots in the attachment.
The application basically reads a text file, parses each line as JSON and converts it to "Record(ip: String, verb: String, status: Int, time: Long)", then does countByIp, countByStatus and countByVerb, repeated 10 times.
These are three independent map-reduce jobs, each with one map stage (parsing) and one reduce stage (countByXXX).
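The shape of the workload can be sketched with plain Scala collections; in the real ReduceByKey.scala these are RDD operations on a cluster (sc.textFile(...).map(parse) followed by reduceByKey-style counts), and the line format below is a simplification of the JSON input:

```scala
// Sketch of the demo workload's data model and its three independent
// aggregations, using local collections instead of RDDs. The "ip verb status
// time" line format is an assumption; the real job parses JSON.
case class Record(ip: String, verb: String, status: Int, time: Long)

object LogCounts {
  // Parse one simplified log line into a Record.
  def parse(line: String): Record = {
    val Array(ip, verb, status, time) = line.split(" ")
    Record(ip, verb, status.toInt, time.toLong)
  }

  // Each count is an independent map-reduce: map to a key, reduce by counting.
  def countByIp(rs: Seq[Record]): Map[String, Long] =
    rs.groupBy(_.ip).map { case (k, v) => k -> v.size.toLong }
  def countByStatus(rs: Seq[Record]): Map[Int, Long] =
    rs.groupBy(_.status).map { case (k, v) => k -> v.size.toLong }
  def countByVerb(rs: Seq[Record]): Map[String, Long] =
    rs.groupBy(_.verb).map { case (k, v) => k -> v.size.toLong }
}
```

Because the three counts share nothing but the parsed input, Spark is free to schedule them independently, which matters later when we try FAIR scheduling.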
Algorithm-level optimization is out of scope here. The app needs a constant number of CPU seconds, plus a floating but bounded amount of network I/O time (determined by job locality), to finish execution.
We use 16 cores as the baseline standard.
The job takes 10 min to finish.
The annotations in the pic describe what we are running here and how to read the Sparklint graph.
After reading the chart, we decided to decrease the core count to see whether the execution time doubles, to figure out if we are bound by CPU.
Using 8 cores, the job took 15 min to finish, shorter than the expected 20 min, proving that we are not CPU bound.
Actually this saw-tooth pattern alone indicates we are not CPU bound, and can serve as a classic example; an example of a CPU-bound application can be found in the last demo slide.
This leads to another angle of optimization: job scheduling tweaking.
Using 32 cores, the job took 9 min to finish, proving again that throwing more cores at the job doesn't provide commensurate performance gains.
The graph is a classic example of over-allocating resources.
We can assume we need no more than 24 cores to do the work effectively, so now we can look into other ways of tuning the job: dynamic allocation and increased parallelism.
We try to optimize the resource requirement by using dynamic allocation, initially with the default executorIdleTimeout of 1 min.
This also led us to try 1 core per executor.
Since we don't usually have any task longer than 1 minute, this showed that dynamic allocation is not the key to optimizing this kind of app with short tasks.
We then reduced executorIdleTimeout to 10s, which decreased the resource footprint and increased utilization.
However this is a false saving for this job, because job throughput is reduced by the low core supply and the overhead of acquiring executors.
This example proved again that dynamic allocation doesn't solve the optimization challenge when tasks are short.
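For reference, the knobs exercised in these dynamic allocation runs map to standard Spark configuration keys. A submission along these lines would reproduce the setup (the class name and jar path are placeholders):

```shell
# Illustrative spark-submit invocation for the dynamic allocation experiments.
spark-submit \
  --class ReduceByKey \
  --conf spark.dynamicAllocation.enabled=true \
  --conf spark.shuffle.service.enabled=true \
  --conf spark.dynamicAllocation.executorIdleTimeout=10s \
  --conf spark.executor.cores=1 \
  path/to/demo.jar
```

Note that dynamic allocation requires the external shuffle service, and executorIdleTimeout defaults to 60s, which is why the first attempt only released executors at app start.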
So, let’s try parallelism inside the job using FAIR scheduling.
Using 16 cores and the FAIR scheduler, this simple tweak cut the execution time from 10 min to 7.5 min, and our job now becomes CPU bound (see annotation).
Running the three count stages in parallel under FAIR scheduling increases efficiency and reduces runtime, allowing us to become CPU bound.
Using 32 cores and the FAIR scheduler, the execution time becomes 5 min (compared to 9 min in pic 3 using the same resources).
We reduce efficiency in order to gain execution time; this is a trade-off for the team to decide: if there is a hard SLA to hit, it may be worth running with lower utilization.
We can now call the job scheduling optimization done.
This demos the correct scenario for using dynamic allocation, and shows that throwing more CPU at the job helps when it is CPU bound (the flat tops in the usage graph are the clear proof).
In this case the partition count is chosen to optimize file size on HDFS, so the team is comfortable with the runtime.
Sparklint can easily distinguish CPU-bound from job-scheduling-bound applications. (We are working on automating this judgment using the average number of pending tasks.)
It is really easy to spot when a job is bound not by CPU but by job scheduling (leading to low core usage) or driver node operations (leading to idle time). In theory your app will be 2x faster if you throw 2x cores at it, but this is not always true.
The point of Spark-level optimization is to make your job CPU bound, at which point you can decide freely between the $ gained from a faster application and the $ spent on providing more cores.
If your job is CPU bound, simply add cores.
If your job has a lot of idle time, try to decrease it by removing unwanted/unintended driver node operations (it could be something as simple as doing a map on a large array instead of an RDD that someone forgot about).
If your job is job-scheduling bound, you can both reduce waste by using dynamic allocation (which in turn provides high throughput when needed) and submit independent jobs in parallel using Futures and the FAIR scheduler: http://spark.apache.org/docs/latest/configuration.html#scheduling
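The "independent jobs in parallel" pattern boils down to wrapping each action in a Future and awaiting them all. The sketch below uses plain Scala Futures over arbitrary job bodies; in a real Spark app each body would be an RDD action (e.g. one of the three countByXXX jobs) with spark.scheduler.mode=FAIR set in the config:

```scala
import scala.concurrent.{Await, Future}
import scala.concurrent.duration.Duration
import scala.concurrent.ExecutionContext.Implicits.global

object ParallelJobs {
  // Submit every job at once, then wait for all results. With FAIR scheduling
  // enabled, concurrently submitted Spark jobs share the cluster's cores.
  def runInParallel[A](jobs: Seq[() => A]): Seq[A] = {
    val futures = jobs.map(job => Future(job()))
    Await.result(Future.sequence(futures), Duration.Inf)
  }
}
```

With the default FIFO scheduler the concurrently submitted jobs would still largely queue one after another; FAIR scheduling is what lets their stages interleave on the cluster.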