SlideShare a Scribd company logo
1 of 18
Processing large volume of data with
MuleSoft and salesforce
Faridabad MuleSoft Meetup Group
July 15, 2023
Information Classification:
General
Safe Harbour Statement
● Both the speaker and the host are organizing this meet-up in individual capacity only. We are
not representing our companies here.
● This presentation is strictly for learning purposes only. Organizer/Presenter do not hold any
responsibility that same solution will work for your business requirements.
● This presentation is not meant for any promotional activities.
2
Information Classification:
General
A recording of this meetup will be uploaded to events page within 24 hours.
Questions can be submitted/asked at any time in the Chat/Questions & AnswersTab.
Make it more Interactive!!!
Give us feedback! Rate this meetup session by filling feedback form at the end of theday.
We Love Feedbacks!!! Its Bread & Butter for Meetup.
Housekeeping
3
Information Classification:
General
Organizer
s
4
Information Classification:
General
Senior Solution Architect and APIN Competency Head
EPAM Systems
Jitendra Pravin Bafna
Overall 15+ years of experience in the IT industry.
APIN Competency Head at EPAM Systems
Holding various Credentials from MuleSoft
• MuleSoft Ambassador from 2021
• MuleSoft Speaker (Speak in 70+ Meetups and other event).
• MuleSoft Meetup Leader (Mumbai, Surat, Nashik and Engineering Student – Organized
more than 70 MuleSoft Meetups).
• MuleSoft Delivery Champion and Go To Market Champion.
• All MuleSoft Certified (Platform, Integration Architect and Developer Certification).
• Received more than 10 MuleSoft Community Awards and 2 times All-Stars award.
Expertise in setting up MuleSoft Platform (CloudHub, RTF), Customer Hosted Mule), MuleSoft
Security, API Strategy and Governance. Executed more than 20 projects related to the
MuleSoft that includes Catalyst Launch of Platform, API Strategy and Governance, API
Enablement, API Implementation etc.
Published more than 300+ videos related to MuleSoft on YouTube.
Published more than 200 Blogs/Articles on MuleSoft Blogs, Medium and Dzone.
Expertise in the integration between MuleSoft and other systems like Salesforce, NetSuite,
SAP, Databases.
Information Classification:
General
Agenda
● Introduction
● Salesforce Connector
● Bulk API V2
● Batch Processing
● Batch Aggregator
Information Classification:
General
8
Salesforce Connector
Anypoint Connector for Salesforce (Salesforce Connector) enables you
to accelerate your Salesforce integrations across Sales Cloud, Service
Cloud, Salesforce Platform, and Force.com. The connector gives you
access to all Salesforce entities to enable automation of your business
processes to help maximize your investments in services and solutions
like enabling your sales teams, increasing revenue, and serving your
customers better. This connector:
• Provides a powerful solution to implement top Salesforce
integration patterns.
• Supports all create, read, upsert, and delete operations across
Salesforce objects. Additionally, it supports Salesforce's bulk
operations that are optimized for working with large sets of data.
• Enables you to easily leverage custom fields and custom entities to
solve integration needs to your custom Salesforce instance.
Information Classification:
General
Salesforce Bulk API v2
Bulk API v2.0 is available in API version 41.0 and later. Compared to Bulk API v1.0, Salesforce has
dramatically simplified the pre-processing step of Bulk API. In this blog, we will walk you through how to
bulk upload your contact records in CSV format to Salesforce through Bulk API v2.0.
• You can submit up to 15,000 batches per rolling 24-hour
period.
• Maximum number of records uploaded per 24-hour
rolling period - 150,000,000 (15,000 batches x 10,000
records per batch maximum)
• Maximum time that a job can remain open – 24 hours.
• Maximum file size – 150 MB Per Job.
• Maximum number of fields in a record – 5000
• Maximum number of characters in a record - 400,000
• Timeout for retrieving query results – 20 Minutes
More details can be found here https://developer.salesforce.com/docs/atlas.en-
us.244.0.salesforce_app_limits_cheatsheet.meta/salesforce_app_limits_cheatsheet/salesforce_app_limits_platform_bulkapi.
htm
Information Classification:
General
Total Record – 100
Total Number of Record in each block = Total Record/Total Block Size = 100/100 = 1 Record
Block Size - Batch Processing
A lower batch block size will perform more I/O. A higher batch block size will reduce the I/O, but eventually you
will be processing more sequential records whenever you retrieve X amount of records from the queue, and X <
block size.
• If you have millions of records with payload size in KBs, you can process blocks of a high number of records
with no memory issues. Here, setting a larger block size improves the batch job time.
• If you need to process heavy payloads like files of larger size in MBs, then you can consider keeping the block
size smaller to distribute the load and avoid memory issues.
"Max concurrency" configuration of the batch job ( " 2 * CPU cores quantity " by default unless you configure it
on the Batch Job scope)
In this scenario, Each block will have one record and it is not good practice to process one record in one block
and it will end up in queuing the blocks and execution time will be high.
Demonstration 1 – Adding record into
Salesforce one by one with Block Size of
100
Information Classification:
General
Total Record – 100
Total Number of Record in each block = Total Record/Total Block Size = 100/10 = 10 Record
In this case, each block will have 10 records and each block will be processed sequentially. Record within each
block will be executed parallelly.
This execution will be faster, and it will take lesser time as compared to demonstration 1.
Demonstration 2 – Adding record into
Salesforce one by one with Block Size of 10
Information Classification:
General
A Batch Aggregator adds records in a collection until the Nth record equal to the size attribute of the Batch
Aggregator is added. Then the collection contain all records is the payload of the Mule Message that get
processed by the processors inside the Batch Aggregator.
For example, if the size of the Batch Aggregator is set to 10, then the payload will be a collection with 10
records. Every 10 records the processors inside the Batch Aggregator will execute.
Salesforce allows to send 200 records in one batch and not more than otherwise it will throw error.
Total Number of Record in each block = Total Record/Total Block Size = 300/10 = 30
Batch Aggregation Size for 300 records = 2 (One batch of 200 record and other batch of 100 record)
Demonstration 3 – Adding record into
Salesforce with Block Size of 10 and
aggregating the 200 record in one batch
Information Classification:
General
In this scenario, for 300 record it will create one batch in Salesforce, and it will process the records
asynchronously.
For 11000 records, Salesforce will create 2 batches one of 10000 records and other with 1000 records in the
Salesforce Job and it will be automatically taken care by Salesforce.
Finally, we can retrieve the state of Job by passing JobId and there are 4 state maintain for each Job.
• UploadComplete
• JobComplete
• Failed
• InProgress
In this scenario, we don’t have to worry about concurrency, Batch or Block Size.
Demonstration 4 – Processing 300 and
11000 record with Salesforce Bulk API v2
Information Classification:
General
In this scenario, we can submit query to Salesforce using Bulk API V2 and retrieve the results asynchronously.
Whenever you want to retrieve the record that contain thousand to million records, Bulk API V2 is best option to
be used.
Demonstration 5 – Query With Salesforce
Bulk API v2
Q & A
Information Classification:
General
Take a stand !
18
●Nominate yourself for the next meetup speaker and suggest a topic as well.
Information Classification:
General
20
● Share:
○ Tweet using the hashtag #MuleSoftMeetups
○ Invite your network to join: https://meetups.mulesoft.com/faridabad
● Feedback:
○ Fill out the survey feedback and suggest topics for upcoming events
○ Contact MuleSoft at meetups@mulesoft.com for ways to improve the program
What’s next?
Quiz Time
Get ready to WIN a Special Gift from MuleSoft Community
Thank you

More Related Content

Similar to Processing large volume of data with MuleSoft and salesforce.pptx

That conference tfs care and feeding
That conference   tfs care and feedingThat conference   tfs care and feeding
That conference tfs care and feeding
Angela Dugan
 

Similar to Processing large volume of data with MuleSoft and salesforce.pptx (20)

Increase Salesforce Performance using Platform Cache Demo
Increase Salesforce Performance using Platform Cache DemoIncrease Salesforce Performance using Platform Cache Demo
Increase Salesforce Performance using Platform Cache Demo
 
Large Data Volume Salesforce experiences
Large Data Volume Salesforce experiencesLarge Data Volume Salesforce experiences
Large Data Volume Salesforce experiences
 
Amazon Redshift Deep Dive
Amazon Redshift Deep Dive Amazon Redshift Deep Dive
Amazon Redshift Deep Dive
 
Bug Hunting with the Salesforce Developer Console
Bug Hunting with the Salesforce Developer ConsoleBug Hunting with the Salesforce Developer Console
Bug Hunting with the Salesforce Developer Console
 
Oracle apps scm online training
Oracle apps scm online trainingOracle apps scm online training
Oracle apps scm online training
 
Sap security online training
Sap security online trainingSap security online training
Sap security online training
 
Back to FME School - Day 2: Your Data and FME
Back to FME School - Day 2: Your Data and FMEBack to FME School - Day 2: Your Data and FME
Back to FME School - Day 2: Your Data and FME
 
Blazing new trails with salesforce data nov 16, 2021
Blazing new trails with salesforce data   nov 16, 2021Blazing new trails with salesforce data   nov 16, 2021
Blazing new trails with salesforce data nov 16, 2021
 
Blistering fast access to Hadoop with SQL
Blistering fast access to Hadoop with SQLBlistering fast access to Hadoop with SQL
Blistering fast access to Hadoop with SQL
 
SAP Basis Overview
SAP Basis OverviewSAP Basis Overview
SAP Basis Overview
 
Grails Services
Grails ServicesGrails Services
Grails Services
 
Modernize your AS400 - the future proof, low cost solution.
Modernize your AS400 - the future proof, low cost solution.Modernize your AS400 - the future proof, low cost solution.
Modernize your AS400 - the future proof, low cost solution.
 
LDV.pptx
LDV.pptxLDV.pptx
LDV.pptx
 
Technical Presentation - TimeWIzard
Technical Presentation - TimeWIzardTechnical Presentation - TimeWIzard
Technical Presentation - TimeWIzard
 
Anypoint Batch Processing and Polling Scope With Mulesoft
Anypoint Batch Processing and Polling Scope With MulesoftAnypoint Batch Processing and Polling Scope With Mulesoft
Anypoint Batch Processing and Polling Scope With Mulesoft
 
Salesforce automation hour - Rollup Summary Fields with Lookups - Meighan Bro...
Salesforce automation hour - Rollup Summary Fields with Lookups - Meighan Bro...Salesforce automation hour - Rollup Summary Fields with Lookups - Meighan Bro...
Salesforce automation hour - Rollup Summary Fields with Lookups - Meighan Bro...
 
Data warehousing guidelines for bi and BAM solutions
Data warehousing guidelines for bi and BAM solutionsData warehousing guidelines for bi and BAM solutions
Data warehousing guidelines for bi and BAM solutions
 
OpenSource for Enterprise Business Presentation 010
OpenSource for Enterprise Business Presentation 010OpenSource for Enterprise Business Presentation 010
OpenSource for Enterprise Business Presentation 010
 
Ecrire son premier Trigger (et les comprendre)
Ecrire son premier Trigger (et les comprendre)Ecrire son premier Trigger (et les comprendre)
Ecrire son premier Trigger (et les comprendre)
 
That conference tfs care and feeding
That conference   tfs care and feedingThat conference   tfs care and feeding
That conference tfs care and feeding
 

Recently uploaded

Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
panagenda
 

Recently uploaded (20)

Microsoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - QuestionnaireMicrosoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - Questionnaire
 
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdfFrisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
 
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
 
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptxCyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
Cyber Insurance - RalphGilot - Embry-Riddle Aeronautical University.pptx
 
ChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps Productivity
 
How to Check GPS Location with a Live Tracker in Pakistan
How to Check GPS Location with a Live Tracker in PakistanHow to Check GPS Location with a Live Tracker in Pakistan
How to Check GPS Location with a Live Tracker in Pakistan
 
State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!
 
Vector Search @ sw2con for slideshare.pptx
Vector Search @ sw2con for slideshare.pptxVector Search @ sw2con for slideshare.pptx
Vector Search @ sw2con for slideshare.pptx
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 
The Ultimate Prompt Engineering Guide for Generative AI: Get the Most Out of ...
The Ultimate Prompt Engineering Guide for Generative AI: Get the Most Out of ...The Ultimate Prompt Engineering Guide for Generative AI: Get the Most Out of ...
The Ultimate Prompt Engineering Guide for Generative AI: Get the Most Out of ...
 
Working together SRE & Platform Engineering
Working together SRE & Platform EngineeringWorking together SRE & Platform Engineering
Working together SRE & Platform Engineering
 
الأمن السيبراني - ما لا يسع للمستخدم جهله
الأمن السيبراني - ما لا يسع للمستخدم جهلهالأمن السيبراني - ما لا يسع للمستخدم جهله
الأمن السيبراني - ما لا يسع للمستخدم جهله
 
The Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightThe Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and Insight
 
ERP Contender Series: Acumatica vs. Sage Intacct
ERP Contender Series: Acumatica vs. Sage IntacctERP Contender Series: Acumatica vs. Sage Intacct
ERP Contender Series: Acumatica vs. Sage Intacct
 
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
 
Event-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream ProcessingEvent-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream Processing
 
Introduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptxIntroduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptx
 
UiPath manufacturing technology benefits and AI overview
UiPath manufacturing technology benefits and AI overviewUiPath manufacturing technology benefits and AI overview
UiPath manufacturing technology benefits and AI overview
 
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
 

Processing large volume of data with MuleSoft and salesforce.pptx

  • 1. Processing large volume of data with MuleSoft and salesforce Faridabad MuleSoft Meetup Group July 15, 2023
  • 2. Information Classification: General Safe Harbour Statement ● Both the speaker and the host are organizing this meet-up in individual capacity only. We are not representing our companies here. ● This presentation is strictly for learning purposes only. Organizer/Presenter do not hold any responsibility that same solution will work for your business requirements. ● This presentation is not meant for any promotional activities. 2
  • 3. Information Classification: General A recording of this meetup will be uploaded to events page within 24 hours. Questions can be submitted/asked at any time in the Chat/Questions & AnswersTab. Make it more Interactive!!! Give us feedback! Rate this meetup session by filling feedback form at the end of theday. We Love Feedbacks!!! Its Bread & Butter for Meetup. Housekeeping 3
  • 5. Information Classification: General Senior Solution Architect and APIN Competency Head EPAM Systems Jitendra Pravin Bafna Overall 15+ years of experience in the IT industry. APIN Competency Head at EPAM Systems Holding various Credentials from MuleSoft • MuleSoft Ambassador from 2021 • MuleSoft Speaker (Speak in 70+ Meetups and other event). • MuleSoft Meetup Leader (Mumbai, Surat, Nashik and Engineering Student – Organized more than 70 MuleSoft Meetups). • MuleSoft Delivery Champion and Go To Market Champion. • All MuleSoft Certified (Platform, Integration Architect and Developer Certification). • Received more than 10 MuleSoft Community Awards and 2 times All-Stars award. Expertise in setting up MuleSoft Platform (CloudHub, RTF), Customer Hosted Mule), MuleSoft Security, API Strategy and Governance. Executed more than 20 projects related to the MuleSoft that includes Catalyst Launch of Platform, API Strategy and Governance, API Enablement, API Implementation etc. Published more than 300+ videos related to MuleSoft on YouTube. Published more than 200 Blogs/Articles on MuleSoft Blogs, Medium and Dzone. Expertise in the integration between MuleSoft and other systems like Salesforce, NetSuite, SAP, Databases.
  • 6. Information Classification: General Agenda ● Introduction ● Salesforce Connector ● Bulk API V2 ● Batch Processing ● Batch Aggregator
  • 7. Information Classification: General 8 Salesforce Connector Anypoint Connector for Salesforce (Salesforce Connector) enables you to accelerate your Salesforce integrations across Sales Cloud, Service Cloud, Salesforce Platform, and Force.com. The connector gives you access to all Salesforce entities to enable automation of your business processes to help maximize your investments in services and solutions like enabling your sales teams, increasing revenue, and serving your customers better. This connector: • Provides a powerful solution to implement top Salesforce integration patterns. • Supports all create, read, upsert, and delete operations across Salesforce objects. Additionally, it supports Salesforce's bulk operations that are optimized for working with large sets of data. • Enables you to easily leverage custom fields and custom entities to solve integration needs to your custom Salesforce instance.
  • 8. Information Classification: General Salesforce Bulk API v2 Bulk API v2.0 is available in API version 41.0 and later. Compared to Bulk API v1.0, Salesforce has dramatically simplified the pre-processing step of Bulk API. In this blog, we will walk you through how to bulk upload your contact records in CSV format to Salesforce through Bulk API v2.0. • You can submit up to 15,000 batches per rolling 24-hour period. • Maximum number of records uploaded per 24-hour rolling period - 150,000,000 (15,000 batches x 10,000 records per batch maximum) • Maximum time that a job can remain open – 24 hours. • Maximum file size – 150 MB Per Job. • Maximum number of fields in a record – 5000 • Maximum number of characters in a record - 400,000 • Timeout for retrieving query results – 20 Minutes More details can be found here https://developer.salesforce.com/docs/atlas.en- us.244.0.salesforce_app_limits_cheatsheet.meta/salesforce_app_limits_cheatsheet/salesforce_app_limits_platform_bulkapi. htm
  • 9. Information Classification: General Total Record – 100 Total Number of Record in each block = Total Record/Total Block Size = 100/100 = 1 Record Block Size - Batch Processing A lower batch block size will perform more I/O. A higher batch block size will reduce the I/O, but eventually you will be processing more sequential records whenever you retrieve X amount of records from the queue, and X < block size. • If you have millions of records with payload size in KBs, you can process blocks of a high number of records with no memory issues. Here, setting a larger block size improves the batch job time. • If you need to process heavy payloads like files of larger size in MBs, then you can consider keeping the block size smaller to distribute the load and avoid memory issues. "Max concurrency" configuration of the batch job ( " 2 * CPU cores quantity " by default unless you configure it on the Batch Job scope) In this scenario, Each block will have one record and it is not good practice to process one record in one block and it will end up in queuing the blocks and execution time will be high. Demonstration 1 – Adding record into Salesforce one by one with Block Size of 100
  • 10. Information Classification: General Total Record – 100 Total Number of Record in each block = Total Record/Total Block Size = 100/10 = 10 Record In this case, each block will have 10 records and each block will be processed sequentially. Record within each block will be executed parallelly. This execution will be faster, and it will take lesser time as compared to demonstration 1. Demonstration 2 – Adding record into Salesforce one by one with Block Size of 10
  • 11. Information Classification: General A Batch Aggregator adds records in a collection until the Nth record equal to the size attribute of the Batch Aggregator is added. Then the collection contain all records is the payload of the Mule Message that get processed by the processors inside the Batch Aggregator. For example, if the size of the Batch Aggregator is set to 10, then the payload will be a collection with 10 records. Every 10 records the processors inside the Batch Aggregator will execute. Salesforce allows to send 200 records in one batch and not more than otherwise it will throw error. Total Number of Record in each block = Total Record/Total Block Size = 300/10 = 30 Batch Aggregation Size for 300 records = 2 (One batch of 200 record and other batch of 100 record) Demonstration 3 – Adding record into Salesforce with Block Size of 10 and aggregating the 200 record in one batch
  • 12. Information Classification: General In this scenario, for 300 record it will create one batch in Salesforce, and it will process the records asynchronously. For 11000 records, Salesforce will create 2 batches one of 10000 records and other with 1000 records in the Salesforce Job and it will be automatically taken care by Salesforce. Finally, we can retrieve the state of Job by passing JobId and there are 4 state maintain for each Job. • UploadComplete • JobComplete • Failed • InProgress In this scenario, we don’t have to worry about concurrency, Batch or Block Size. Demonstration 4 – Processing 300 and 11000 record with Salesforce Bulk API v2
  • 13. Information Classification: General In this scenario, we can submit query to Salesforce using Bulk API V2 and retrieve the results asynchronously. Whenever you want to retrieve the record that contain thousand to million records, Bulk API V2 is best option to be used. Demonstration 5 – Query With Salesforce Bulk API v2
  • 14. Q & A
  • 15. Information Classification: General Take a stand ! 18 ●Nominate yourself for the next meetup speaker and suggest a topic as well.
  • 16. Information Classification: General 20 ● Share: ○ Tweet using the hashtag #MuleSoftMeetups ○ Invite your network to join: https://meetups.mulesoft.com/faridabad ● Feedback: ○ Fill out the survey feedback and suggest topics for upcoming events ○ Contact MuleSoft at meetups@mulesoft.com for ways to improve the program What’s next?
  • 17. Quiz Time Get ready to WIN a Special Gift from MuleSoft Community