INGELA VIKSTROM, ANABEL SILVA, SANDRO PRATO
CSL Bio21 Research Scientists
Australia
INGELA VIKSTROM, ANABEL SILVA, SANDRO PRATO
CSL Bio21 Research Scientists
Australia
MARK BAKER
Head of Big Data Infrastructure
CSL Behring
ANALYZING DATA FROM MULTIPLE
MANUFACTURING SITES USING A CENTRAL
HADOOP DATA LAKE
Outline
• CSL Behring
– Introduction of CSL Behring
• CSL Behring’s products and focus
• Growth and global placement of manufacturing facilities
• Current PACE globalization initiative
• Streamlining global processes to improve efficiency
• Partnership with Hortonworks to create our Big Data
Platform
• HDP for Data lake and analytics using Zeppelin
• HDF for secure data movement from global manufacturing sites to
our central data repository SAP HANA & HDP.
• Q & A
CSL Behring’s Products and Focus
• CSL Behring
– CSL Behring is a global biotherapeutics leader
– Focused on serving patients’ needs by using the latest
technologies
• Deliver innovative therapies that are used to treat rare
and serious conditions.
– One of our “super orphan” therapies treats a condition affecting
approximately 300 patients in the U.S. and only one million
worldwide. To meet growing demand and bring more therapies to
more patients, we continue to invest in the expansion of all our
manufacturing facilities
Business Driver
PACE globalization initiative
• PACE is a global, transformation initiative that fulfills our
promise to patients by aligning our processes and
enhancing collaboration to achieve sustainable business
excellence
• Provide advanced analytics capabilities to exploit
existing and new data assets, support decision-making,
and provide predictive models
• Build user community with the right skills and right tools
Global Manufacturing Facilities
• Manufacturing Sites
• United States
– Kankakee
• Germany
– Marburg
• Switzerland
– Bern
• Australia
– Melbourne
• Historically separated by region and operated
independently
Manufacturing & Analytical Silos
13/06/20176
Future Manufacturing Data Flows
13/06/20177
Challenges
• Each Manufacturing system uses a different backend
databases and schema to log the batch execution steps
– 12 x SCADA and MES systems
• Edge servers must not impact MES system performance
– Sensitive systems required impact assessment prior to direct
data extracts
• Data must be encrypted in motion and at rest
– HIPAA compliance and EU privacy requirements
• Data must be compressed over the WAN
– Due to bandwidth constrictions on intranet
• Multiple time zones and string encodings
NiFi
• Allows the creation of custom processors for each MES
system (python).
• Uses back pressure to eliminate any full database pulls
after network/hardware outages.
• Encrypts data over the wire.
• Compresses data over the wire
• Allows data enrichment for the addition of UTC column.
• ETL functionality allows for special characters to be
transformed into data analytical tools can process ex. ṏ
Thank You
CSL Limited
45 Poplar Road
Parkville, Victoria, 3056
Australia
TEST

Integrating and Analyzing Data from Multiple Manufacturing Sites using Apache NIFI and Apche Zeppelin to a central Hadoop data lake at CSL Behring

  • 1.
    INGELA VIKSTROM, ANABELSILVA, SANDRO PRATO CSL Bio21 Research Scientists Australia INGELA VIKSTROM, ANABEL SILVA, SANDRO PRATO CSL Bio21 Research Scientists Australia MARK BAKER Head of Big Data Infrastructure CSL Behring ANALYZING DATA FROM MULTIPLE MANUFACTURING SITES USING A CENTRAL HADOOP DATA LAKE
  • 2.
    Outline • CSL Behring –Introduction of CSL Behring • CSL Behring’s products and focus • Growth and global placement of manufacturing facilities • Current PACE globalization initiative • Streamlining global processes to improve efficiency • Partnership with Hortonworks to create our Big Data Platform • HDP for Data lake and analytics using Zeppelin • HDF for secure data movement from global manufacturing sites to our central data repository SAP HANA & HDP. • Q & A
  • 3.
    CSL Behring’s Productsand Focus • CSL Behring – CSL Behring is a global biotherapeutics leader – Focused on serving patients’ needs by using the latest technologies • Deliver innovative therapies that are used to treat rare and serious conditions. – One of our “super orphan” therapies treats a condition affecting approximately 300 patients in the U.S. and only one million worldwide. To meet growing demand and bring more therapies to more patients, we continue to invest in the expansion of all our manufacturing facilities
  • 4.
    Business Driver PACE globalizationinitiative • PACE is a global, transformation initiative that fulfills our promise to patients by aligning our processes and enhancing collaboration to achieve sustainable business excellence • Provide advanced analytics capabilities to exploit existing and new data assets, support decision-making, and provide predictive models • Build user community with the right skills and right tools
  • 5.
    Global Manufacturing Facilities •Manufacturing Sites • United States – Kankakee • Germany – Marburg • Switzerland – Bern • Australia – Melbourne • Historically separated by region and operated independently
  • 6.
    Manufacturing & AnalyticalSilos 13/06/20176
  • 7.
    Future Manufacturing DataFlows 13/06/20177
  • 8.
    Challenges • Each Manufacturingsystem uses a different backend databases and schema to log the batch execution steps – 12 x SCADA and MES systems • Edge servers must not impact MES system performance – Sensitive systems required impact assessment prior to direct data extracts • Data must be encrypted in motion and at rest – HIPAA compliance and EU privacy requirements • Data must be compressed over the WAN – Due to bandwidth constrictions on intranet • Multiple time zones and string encodings
  • 9.
    NiFi • Allows thecreation of custom processors for each MES system (python). • Uses back pressure to eliminate any full database pulls after network/hardware outages. • Encrypts data over the wire. • Compresses data over the wire • Allows data enrichment for the addition of UTC column. • ETL functionality allows for special characters to be transformed into data analytical tools can process ex. ṏ
  • 10.
    Thank You CSL Limited 45Poplar Road Parkville, Victoria, 3056 Australia TEST