External Objects for External Big Data
Sumit Sarkar
Chief Data Evangelist
Progress
sumit.sarkar@progress.com
@SAsInSumit
Agenda
1. Intro to External Objects for Big Data
• What is Salesforce Connect?
• What is Big Data?
• What is OData?
2. How to access Big Data from Salesforce
3. Live Demo (fingers crossed)
4. Lessons Learned
External Objects for External Big Data
What is Salesforce Connect?
Salesforce Connect maps Salesforce external
objects to data tables in external systems.
Instead of copying the data into your
organization, Salesforce Connect accesses the
data on demand and in real time. The data is
never stale, and we access only what you
need.
Recommended when:
•You have a large amount of data that you don’t want to
copy into your Salesforce organization.
•You need small amounts of data at any one time.
•You want real-time access to the latest data.
What is Salesforce Connect?
What is Big Data?
Salesforce uses Big Data technologies within their
platform (HBase, Pig, Phoenix, etc) to support
technologies such as Einstein or exposed through
different APIs such as BigObjects.
However most organizations have their own Big Data
technologies to analyze and expose data sets similar in
scale or variety…
Don’t ETL Big Data sets into the
platform!
What is Big Data?
How my team uses Big Data Technology
Enhance the Customer
Journey
What support resources were
consumed for each evaluation?
What other product or solution
pages were visited to create
smarter targeted campaigns?
What is the success score for
leads and contacts evaluating
products?
What is OData?
An open protocol to allow the creation and consumption of
queryable and interoperable RESTful APIs in a simple and standard way.
OASIS Standard REST API (“SQL for the web”)
Ratified as an OASIS standard February, 2014
Operations built on REST principles
Uniform URL conventions
Surface metadata in standard way
What is OData?
First member to join OData Technical Committee
OData URL Query Conventions
$SELECT /Products?$select=*,Category/Name
/Suppliers?$filter=Address/City eq 'Redmond’
/Products?$orderby=ReleaseDate asc, Rating desc
/Products?$select=*,Category/Name&$expand=Categor
y
$FILTER
$ORDERBY
$EXPAND
/Products?$top=5&$skip=2$TOP $SKIP
http://services.odata.org/OData/OData.svc/Category(1)/Products?$top=2&$orderby=name
__________________________________/ ________________/ __________________/
service root URI resource path query options
How to access Big Data from Salesforce
How to produce OData from Big Data?
Use Salesforce Connect External Objects (intro’d Winter ‘15)
Open Source TechnologiesDataDirect Cloud Apex Connector Framework
Connect Salesforce to Big Data
Success Scoring
Personalization
Archived Insight
360 Reporting
Corporate Firewall
Demo
Demo [Lookups, Search, Write, Reporting]
Progress Corporate Firewall
Data Lake
OData
D2C On-premises
Connector
HTTPS and UDP firewall traversal
Lessons Learned
Lessons Learned
1. Mapping OData entities to Big Data objects
2. Primary keys for Big Data entities
3. HiveServer1 vs HiverServer2 for concurrency
4. External Objects have limits and 2 minute max timeout
5. Native Reporting support being added in Winter ‘17
6. Search considerations
7. Need agile OData service with Data Lake
8. Data Governance and Masking
9. CRM User Experience (strategies to improve performance)
Accessing external Big Data objects
Design Patterns for external objects
Enable Separate Loading of Related Lists of External Objects
Performance Tuning Tips for
Related Lists in Account
000148978
Decrease latency accessing Big Data over Hive
Stuff Salesforce devs don’t care about
1. Use Apache Tez as execution engine for Hive
2. Use ORCfile, new storage format
3. Use vectorization query execution (Hive 0.13)
4. Performance Tuning (Partitions, Indexes, Buckets, Block Sizes, etc)
5. Consider another query interface (i.e. Apache Hawq)
Resources
Subtitle placeholder
Learn More
Salesforce Connect
https://help.salesforce.com/H
TViewHelpDoc?id=platform_
connect_about.htm
Tutorial for SFDevs
http://bit.ly/getxdata
DataDirect Cloud OData
Service (Hadoop, Oracle,
SQL Server, etc)
https://www.datadirectcloud.
com
Sumit.sarkar@progress.com | www.linkedin.com/in/meetsumit | @SASInSumit
thank y u
Salesforce External Objects for Big Data
Salesforce External Objects for Big Data
Salesforce External Objects for Big Data
Salesforce External Objects for Big Data
Salesforce External Objects for Big Data
Salesforce External Objects for Big Data

Salesforce External Objects for Big Data

  • 1.
    External Objects forExternal Big Data Sumit Sarkar Chief Data Evangelist Progress sumit.sarkar@progress.com @SAsInSumit
  • 2.
    Agenda 1. Intro toExternal Objects for Big Data • What is Salesforce Connect? • What is Big Data? • What is OData? 2. How to access Big Data from Salesforce 3. Live Demo (fingers crossed) 4. Lessons Learned External Objects for External Big Data
  • 3.
  • 4.
    Salesforce Connect mapsSalesforce external objects to data tables in external systems. Instead of copying the data into your organization, Salesforce Connect accesses the data on demand and in real time. The data is never stale, and we access only what you need. Recommended when: •You have a large amount of data that you don’t want to copy into your Salesforce organization. •You need small amounts of data at any one time. •You want real-time access to the latest data. What is Salesforce Connect?
  • 5.
  • 6.
    Salesforce uses BigData technologies within their platform (HBase, Pig, Phoenix, etc) to support technologies such as Einstein or exposed through different APIs such as BigObjects. However most organizations have their own Big Data technologies to analyze and expose data sets similar in scale or variety… Don’t ETL Big Data sets into the platform! What is Big Data?
  • 7.
    How my teamuses Big Data Technology Enhance the Customer Journey What support resources were consumed for each evaluation? What other product or solution pages were visited to create smarter targeted campaigns? What is the success score for leads and contacts evaluating products?
  • 8.
  • 9.
    An open protocolto allow the creation and consumption of queryable and interoperable RESTful APIs in a simple and standard way. OASIS Standard REST API (“SQL for the web”) Ratified as an OASIS standard February, 2014 Operations built on REST principles Uniform URL conventions Surface metadata in standard way What is OData? First member to join OData Technical Committee
  • 10.
    OData URL QueryConventions $SELECT /Products?$select=*,Category/Name /Suppliers?$filter=Address/City eq 'Redmond’ /Products?$orderby=ReleaseDate asc, Rating desc /Products?$select=*,Category/Name&$expand=Categor y $FILTER $ORDERBY $EXPAND /Products?$top=5&$skip=2$TOP $SKIP http://services.odata.org/OData/OData.svc/Category(1)/Products?$top=2&$orderby=name __________________________________/ ________________/ __________________/ service root URI resource path query options
  • 11.
    How to accessBig Data from Salesforce
  • 12.
    How to produceOData from Big Data? Use Salesforce Connect External Objects (intro’d Winter ‘15) Open Source TechnologiesDataDirect Cloud Apex Connector Framework
  • 13.
    Connect Salesforce toBig Data Success Scoring Personalization Archived Insight 360 Reporting Corporate Firewall
  • 14.
  • 15.
    Demo [Lookups, Search,Write, Reporting] Progress Corporate Firewall Data Lake OData D2C On-premises Connector HTTPS and UDP firewall traversal
  • 16.
  • 17.
    Lessons Learned 1. MappingOData entities to Big Data objects 2. Primary keys for Big Data entities 3. HiveServer1 vs HiverServer2 for concurrency 4. External Objects have limits and 2 minute max timeout 5. Native Reporting support being added in Winter ‘17 6. Search considerations 7. Need agile OData service with Data Lake 8. Data Governance and Masking 9. CRM User Experience (strategies to improve performance) Accessing external Big Data objects
  • 18.
    Design Patterns forexternal objects Enable Separate Loading of Related Lists of External Objects Performance Tuning Tips for Related Lists in Account 000148978
  • 19.
    Decrease latency accessingBig Data over Hive Stuff Salesforce devs don’t care about 1. Use Apache Tez as execution engine for Hive 2. Use ORCfile, new storage format 3. Use vectorization query execution (Hive 0.13) 4. Performance Tuning (Partitions, Indexes, Buckets, Block Sizes, etc) 5. Consider another query interface (i.e. Apache Hawq)
  • 20.
  • 21.
    Learn More Salesforce Connect https://help.salesforce.com/H TViewHelpDoc?id=platform_ connect_about.htm Tutorialfor SFDevs http://bit.ly/getxdata DataDirect Cloud OData Service (Hadoop, Oracle, SQL Server, etc) https://www.datadirectcloud. com Sumit.sarkar@progress.com | www.linkedin.com/in/meetsumit | @SASInSumit
  • 22.