In this presentation Juan M. Huerta talks about big data adoption process at Citi, realising the technical value of big data and global solutions. Huerta goes on to talk about following a hybrid approach, and the future of analytics, expensive algorithms applied to large datasets. With Citi using these approaches in hopes of getting even wider global recognition.
This is a presentation in a meetup called "Business of Data Science". Data science is being leveraged extensively in the field of Banking and Financial Services and this presentation will give a brief and fundamental highlight to the evergreen field.
Big Data Analytics and its Application in E-CommerceUyoyo Edosio
Abstract-This era unlike any, is faced with explosive
growth in the size of data generated/captured. Data
growth has undergone a renaissance, influenced
primarily by ever cheaper computing power and
the ubiquity of the internet. This has led to a
paradigm shift in the E-commerce sector; as data is
no longer seen as the byproduct of their business
activities, but as their biggest asset providing: key
insights to the needs of their customers, predicting
trends in customer’s behavior, democratizing of
advertisement to suits consumers varied taste, as
well as providing a performance metric to assess the
effectiveness in meeting customers’ needs.
This paper presents an overview of the unique
features that differentiate big data from traditional
datasets. In addition, the application of big data
analytics in the E-commerce and the various
technologies that make analytics of consumer data
possible is discussed.
Further this paper will present some case studies of
how leading Ecommerce vendors like Amazon.com,
Walmart Inc, and Adidas apply Big Data analytics in
their business strategies/activities to improve their
competitive advantage. Lastly we identify some
challenges these E-commerce vendors face while
implementing big data analytic
Big Data presence in the high volume in the data storages can help in various ways to learn more about the need and trends of the current market which will be useful for all type of organizations. Modern information technology used to analyze the relationship between social trends and market insights is a useful way to have indirectly interlinked to customers and their interests from unstructured and semi-structured data. Such analysis will give organizations a broader view towards the practical needs of customers and once banking industry or any industry could know the customers, they can serve better and with more flexibility. In this presentation, team has primarily created the platform and designed the architecture in big data technology for banking industry to maximize the users of credit card.
This is a presentation in a meetup called "Business of Data Science". Data science is being leveraged extensively in the field of Banking and Financial Services and this presentation will give a brief and fundamental highlight to the evergreen field.
Big Data Analytics and its Application in E-CommerceUyoyo Edosio
Abstract-This era unlike any, is faced with explosive
growth in the size of data generated/captured. Data
growth has undergone a renaissance, influenced
primarily by ever cheaper computing power and
the ubiquity of the internet. This has led to a
paradigm shift in the E-commerce sector; as data is
no longer seen as the byproduct of their business
activities, but as their biggest asset providing: key
insights to the needs of their customers, predicting
trends in customer’s behavior, democratizing of
advertisement to suits consumers varied taste, as
well as providing a performance metric to assess the
effectiveness in meeting customers’ needs.
This paper presents an overview of the unique
features that differentiate big data from traditional
datasets. In addition, the application of big data
analytics in the E-commerce and the various
technologies that make analytics of consumer data
possible is discussed.
Further this paper will present some case studies of
how leading Ecommerce vendors like Amazon.com,
Walmart Inc, and Adidas apply Big Data analytics in
their business strategies/activities to improve their
competitive advantage. Lastly we identify some
challenges these E-commerce vendors face while
implementing big data analytic
Big Data presence in the high volume in the data storages can help in various ways to learn more about the need and trends of the current market which will be useful for all type of organizations. Modern information technology used to analyze the relationship between social trends and market insights is a useful way to have indirectly interlinked to customers and their interests from unstructured and semi-structured data. Such analysis will give organizations a broader view towards the practical needs of customers and once banking industry or any industry could know the customers, they can serve better and with more flexibility. In this presentation, team has primarily created the platform and designed the architecture in big data technology for banking industry to maximize the users of credit card.
Overview of Data Analytics in Lending BusinessSanjay Kar
AI/ML use cases
BFSI industry overview
Lending Products
Underwriting Strategy
Customer Lifecycle Management
How to prepare for becoming a banking analyst
Materials to study for statistics
What is fintech?
What is a Credit Bureau?
Books for statistics
Tools for data science
Techniques for data science
FinTech: The revolution is here!
In this session, we will introduce fintech and discuss the eight key innovations in fintech that are revolutionizing how companies are doing business. This session is geared towards fintech enthusiasts and financial industry professionals who are intrigued and fascinated by the innovations in fintech and would like to learn and adapt to the new realities of the 21st century
The banking industry is data-demanding with acknowledged ATM and credit processing data. As banks face increasing pressure to stay successful, understanding customer needs and preferences becomes a critical success factor. Along with Data mining and advanced analytics techniques, banks are furnished to manage market uncertainty, minimize fraud, and control exposure risk.
Microsoft Data Platform - What's includedJames Serra
The pace of Microsoft product innovation is so fast that even though I spend half my days learning, I struggle to keep up. And as I work with customers I find they are often in the dark about many of the products that we have since they are focused on just keeping what they have running and putting out fires. So, let me cover what products you might have missed in the Microsoft data platform world. Be prepared to discover all the various Microsoft technologies and products for collecting data, transforming it, storing it, and visualizing it. My goal is to help you not only understand each product but understand how they all fit together and there proper use case, allowing you to build the appropriate solution that can incorporate any data in the future no matter the size, frequency, or type. Along the way we will touch on technologies covering NoSQL, Hadoop, and open source.
Digital redefinition of banking banking transformationDraup
The increase in the number of digital use cases in the banking and financial services industry has led to the emergence of newer digital hotspots in the US. States such as Minnesota, North Carolina, Texas, and California have a high density of mature talent specializing in these digital cases. These digital use cases have also given rise to new hotspots in neighbouring states such as Iowa, Arizona, and Ohio. Bank of America, Wells Fargo, and JP Morgan Chase have capitalized on this rapid digitalization to create solutions in anti-money laundering, digital wealth management, information security, cloud technology.
Analysing the Digital Maturity of Top US Banks
The digital maturity of banks and financial institutions has been measured by their competency in innovation which includes their competitive intensity and growth potential and assessing their capabilities in terms of talent scalability and maturity of skills in new age technologies. By these parameters, firms such as Bank of America, Wells Fargo, Citi, and Capital One have identified as digital leaders while Union Bank, First Republic Bank, HSBC US have been relatively slower in the digital race.
Case-by-Case Analysis of Banking Transformation
Bank of America:
Bank of America has over 14 digital centres with over 76% of the digital talent based out of centres located in the US. The 4,000+ digital workforce is involved in functions such as app development, analytics, security, and cloud. Bank of America is one of the few leading banks looking to increase the digital capabilities of all its bank branches through interactive systems that need very little human intervention. Some branches are also fully automated equipped with an interactive teller machine and a video conferencing room.
Citi Group:
Citi is taking cues from its innovation labs that are involved in developing cutting-edge solutions such as beacons. The firm’s 3,500+ digital talent pool is predominantly based out of North America. The bank’s smart branches are equipped with interactive media walls that display local weather, stock information, and financial updates. Citi announced their partnership with Nasdaq which was formed to create payment systems that use DLT (Distributed Ledger Technology) to record payments.
Wells Fargo:
The firm’s large 7,500+ digital workforce is largely consolidated in the United States with sporadic distribution in India as well. The firm has 15 digital centres with only 2 of them located outside the US i.e. in Hyderabad, and Bengaluru. Over 28% of digital talent is involved in new-age solutions such as RPA, Blockchain, IoT and AI.
BI & Big data use case for banking - by rully feranataRully Feranata
Big Data and all about its business case in banking industry - how it will change the landscape and how it can be harness in order organization to stay ahead of the game
Big data Analytics is a process to extract meaningful insight from big such as hidden patterns, unknown correlations, market trends and customer preferences
A changing market landscape and open source innovations are having a dramatic impact on the consumability and ease of use of data science tools. Join this session to learn about the impact these trends and changes will have on the future of data science. If you are a data scientist, or if your organization relies on cutting edge analytics, you won't want to miss this!
AI powered Decision Making in Banks - How Banks today are using Advanced analytics in credit Decisioning, enhancing customer life time value, lower operating costs and stronger customer acquisition
Azure Synapse Analytics is Azure SQL Data Warehouse evolved: a limitless analytics service, that brings together enterprise data warehousing and Big Data analytics into a single service. It gives you the freedom to query data on your terms, using either serverless on-demand or provisioned resources, at scale. Azure Synapse brings these two worlds together with a unified experience to ingest, prepare, manage, and serve data for immediate business intelligence and machine learning needs. This is a huge deck with lots of screenshots so you can see exactly how it works.
This presentation is about basics of Big data Analytics along with Characteristics,Challenges,Structures,Differences between Traditional and Big data,How Big data is getting benefited in Healthcare Industry,Big data in Real time
Analytics, Business Intelligence, and Data Science - What's the Progression?DATAVERSITY
Data analysis can include looking back at historical data, understanding what an organization currently has, and even looking forward to predictions of the future. This presentation will talk about the differences between analytics, business intelligence, and data science, as well as the differences in architecture — and possibly even organization maturity — that make each successful.
Learn more about these topics we will explore including:
Defining analytics, business intelligence, and data science
Differences in architecture
When to use analytics, business intelligence, or data science
Whether there has been an evolution between analytics, business intelligence, and data science
The data lake has become extremely popular, but there is still confusion on how it should be used. In this presentation I will cover common big data architectures that use the data lake, the characteristics and benefits of a data lake, and how it works in conjunction with a relational data warehouse. Then I’ll go into details on using Azure Data Lake Store Gen2 as your data lake, and various typical use cases of the data lake. As a bonus I’ll talk about how to organize a data lake and discuss the various products that can be used in a modern data warehouse.
This presentation introduces big data and explains how to generate actionable insights using analytics techniques. The deck explains general steps involved in a typical analytics project and provides a brief overview of the most commonly used predictive analytics methods and their business applications.
Vijay Adamapure is a Data Science Enthusiast with extensive experience in the field of data mining, predictive modeling and machine learning. He has worked on numerous analytics projects ranging from healthcare, business analytics, renewable energy to IoT.
Vijay presented these slides during the Internet of Everything Meetup event 'Predictive Analytics - An Overview' that took place on Jan. 9, 2015 in Mumbai. To join the Meetup group, register here: http://bit.ly/1A7T0A1
In this slidedeck, Infochimps Director of Product, Tim Gasper, discusses how Infochimps tackles business problems for customers by deploying a comprehensive Big Data infrastructure in days; sometimes in just hours. Tim unlocks how Infochimps is now taking that same aggressive approach to deliver faster time to value by helping customers develop analytic applications with impeccable speed.
Overview of Data Analytics in Lending BusinessSanjay Kar
AI/ML use cases
BFSI industry overview
Lending Products
Underwriting Strategy
Customer Lifecycle Management
How to prepare for becoming a banking analyst
Materials to study for statistics
What is fintech?
What is a Credit Bureau?
Books for statistics
Tools for data science
Techniques for data science
FinTech: The revolution is here!
In this session, we will introduce fintech and discuss the eight key innovations in fintech that are revolutionizing how companies are doing business. This session is geared towards fintech enthusiasts and financial industry professionals who are intrigued and fascinated by the innovations in fintech and would like to learn and adapt to the new realities of the 21st century
The banking industry is data-demanding with acknowledged ATM and credit processing data. As banks face increasing pressure to stay successful, understanding customer needs and preferences becomes a critical success factor. Along with Data mining and advanced analytics techniques, banks are furnished to manage market uncertainty, minimize fraud, and control exposure risk.
Microsoft Data Platform - What's includedJames Serra
The pace of Microsoft product innovation is so fast that even though I spend half my days learning, I struggle to keep up. And as I work with customers I find they are often in the dark about many of the products that we have since they are focused on just keeping what they have running and putting out fires. So, let me cover what products you might have missed in the Microsoft data platform world. Be prepared to discover all the various Microsoft technologies and products for collecting data, transforming it, storing it, and visualizing it. My goal is to help you not only understand each product but understand how they all fit together and there proper use case, allowing you to build the appropriate solution that can incorporate any data in the future no matter the size, frequency, or type. Along the way we will touch on technologies covering NoSQL, Hadoop, and open source.
Digital redefinition of banking banking transformationDraup
The increase in the number of digital use cases in the banking and financial services industry has led to the emergence of newer digital hotspots in the US. States such as Minnesota, North Carolina, Texas, and California have a high density of mature talent specializing in these digital cases. These digital use cases have also given rise to new hotspots in neighbouring states such as Iowa, Arizona, and Ohio. Bank of America, Wells Fargo, and JP Morgan Chase have capitalized on this rapid digitalization to create solutions in anti-money laundering, digital wealth management, information security, cloud technology.
Analysing the Digital Maturity of Top US Banks
The digital maturity of banks and financial institutions has been measured by their competency in innovation which includes their competitive intensity and growth potential and assessing their capabilities in terms of talent scalability and maturity of skills in new age technologies. By these parameters, firms such as Bank of America, Wells Fargo, Citi, and Capital One have identified as digital leaders while Union Bank, First Republic Bank, HSBC US have been relatively slower in the digital race.
Case-by-Case Analysis of Banking Transformation
Bank of America:
Bank of America has over 14 digital centres with over 76% of the digital talent based out of centres located in the US. The 4,000+ digital workforce is involved in functions such as app development, analytics, security, and cloud. Bank of America is one of the few leading banks looking to increase the digital capabilities of all its bank branches through interactive systems that need very little human intervention. Some branches are also fully automated equipped with an interactive teller machine and a video conferencing room.
Citi Group:
Citi is taking cues from its innovation labs that are involved in developing cutting-edge solutions such as beacons. The firm’s 3,500+ digital talent pool is predominantly based out of North America. The bank’s smart branches are equipped with interactive media walls that display local weather, stock information, and financial updates. Citi announced their partnership with Nasdaq which was formed to create payment systems that use DLT (Distributed Ledger Technology) to record payments.
Wells Fargo:
The firm’s large 7,500+ digital workforce is largely consolidated in the United States with sporadic distribution in India as well. The firm has 15 digital centres with only 2 of them located outside the US i.e. in Hyderabad, and Bengaluru. Over 28% of digital talent is involved in new-age solutions such as RPA, Blockchain, IoT and AI.
BI & Big data use case for banking - by rully feranataRully Feranata
Big Data and all about its business case in banking industry - how it will change the landscape and how it can be harness in order organization to stay ahead of the game
Big data Analytics is a process to extract meaningful insight from big such as hidden patterns, unknown correlations, market trends and customer preferences
A changing market landscape and open source innovations are having a dramatic impact on the consumability and ease of use of data science tools. Join this session to learn about the impact these trends and changes will have on the future of data science. If you are a data scientist, or if your organization relies on cutting edge analytics, you won't want to miss this!
AI powered Decision Making in Banks - How Banks today are using Advanced analytics in credit Decisioning, enhancing customer life time value, lower operating costs and stronger customer acquisition
Azure Synapse Analytics is Azure SQL Data Warehouse evolved: a limitless analytics service, that brings together enterprise data warehousing and Big Data analytics into a single service. It gives you the freedom to query data on your terms, using either serverless on-demand or provisioned resources, at scale. Azure Synapse brings these two worlds together with a unified experience to ingest, prepare, manage, and serve data for immediate business intelligence and machine learning needs. This is a huge deck with lots of screenshots so you can see exactly how it works.
This presentation is about basics of Big data Analytics along with Characteristics,Challenges,Structures,Differences between Traditional and Big data,How Big data is getting benefited in Healthcare Industry,Big data in Real time
Analytics, Business Intelligence, and Data Science - What's the Progression?DATAVERSITY
Data analysis can include looking back at historical data, understanding what an organization currently has, and even looking forward to predictions of the future. This presentation will talk about the differences between analytics, business intelligence, and data science, as well as the differences in architecture — and possibly even organization maturity — that make each successful.
Learn more about these topics we will explore including:
Defining analytics, business intelligence, and data science
Differences in architecture
When to use analytics, business intelligence, or data science
Whether there has been an evolution between analytics, business intelligence, and data science
The data lake has become extremely popular, but there is still confusion on how it should be used. In this presentation I will cover common big data architectures that use the data lake, the characteristics and benefits of a data lake, and how it works in conjunction with a relational data warehouse. Then I’ll go into details on using Azure Data Lake Store Gen2 as your data lake, and various typical use cases of the data lake. As a bonus I’ll talk about how to organize a data lake and discuss the various products that can be used in a modern data warehouse.
This presentation introduces big data and explains how to generate actionable insights using analytics techniques. The deck explains general steps involved in a typical analytics project and provides a brief overview of the most commonly used predictive analytics methods and their business applications.
Vijay Adamapure is a Data Science Enthusiast with extensive experience in the field of data mining, predictive modeling and machine learning. He has worked on numerous analytics projects ranging from healthcare, business analytics, renewable energy to IoT.
Vijay presented these slides during the Internet of Everything Meetup event 'Predictive Analytics - An Overview' that took place on Jan. 9, 2015 in Mumbai. To join the Meetup group, register here: http://bit.ly/1A7T0A1
In this slidedeck, Infochimps Director of Product, Tim Gasper, discusses how Infochimps tackles business problems for customers by deploying a comprehensive Big Data infrastructure in days; sometimes in just hours. Tim unlocks how Infochimps is now taking that same aggressive approach to deliver faster time to value by helping customers develop analytic applications with impeccable speed.
AI as a Service, Build Shared AI Service Platforms Based on Deep Learning Tec...Databricks
I will share the vision and the production journey of how we build enterprise shared AI As A Service platforms with distributed deep learning technologies. Including those topics:
1) The vision of Enterprise Shared AI As A Service and typical AI services use cases at FinTech industry
2) The high level architecture design principles for AI As A Service
3) The technical evaluation journey to choose an enterprise deep learning framework with comparisons, such as why we choose Deep learning framework based on Spark ecosystem
4) Share some production AI use cases, such as how we implemented new Users-Items Propensity Models with deep learning algorithms with Spark,improve the quality , performance and accuracy of offer and campaigns design, targeting offer matching and linking etc.
5) Share some experiences and tips of using deep learning technologies on top of Spark , such as how we conduct Intel BigDL into a real production.
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...Denodo
Watch full webinar here: https://bit.ly/35FUn32
Presented at CDAO New Zealand
Advanced data science techniques, like machine learning, have proven an extremely useful tool to derive valuable insights from existing data. Platforms like Spark, and complex libraries for R, Python, and Scala put advanced techniques at the fingertips of the data scientists.
However, most architecture laid out to enable data scientists miss two key challenges:
- Data scientists spend most of their time looking for the right data and massaging it into a usable format
- Results and algorithms created by data scientists often stay out of the reach of regular data analysts and business users
Watch this session on-demand to understand how data virtualization offers an alternative to address these issues and can accelerate data acquisition and massaging. And a customer story on the use of Machine Learning with data virtualization.
Introduction of streaming data, difference between batch processing and stream processing, Research issues in streaming data processing, Performance evaluation metrics , tools for stream processing.
Agile Big Data Analytics Development: An Architecture-Centric ApproachSoftServe
Presented at The Hawaii International Conference on System Sciences by Hong-Mei Chen and Rick Kazman (University of Hawaii), Serge Haziyev (SoftServe).
Webinar: Introducing the MongoDB Connector for BI 2.0 with TableauMongoDB
Pairing your real-time operational data stored in a modern database like MongoDB with first-class business intelligence platforms like Tableau enables new insights to be discovered faster than ever before.
Many leading organizations already use MongoDB in conjunction with Tableau including a top American investment bank and the world’s largest airline. With the Connector for BI 2.0, it’s never been easier to streamline the connection process between these two systems.
In this webinar, we will create a live connection from Tableau Desktop to a MongoDB cluster using the Connector for BI. Once we have Tableau Desktop and MongoDB connected, we will demonstrate the visual power of Tableau to explore the agile data storage of MongoDB.
You’ll walk away knowing:
- How to configure MongoDB with Tableau using the updated connector
- Best practices for working with documents in a BI environment
- How leading companies are using big data visualization strategies to transform their businesses
R+Hadoop - Ask Bigger (and New) Questions and Get Better, Faster AnswersRevolution Analytics
The business cases for Hadoop can be made on the tremendous operational cost savings that it affords. But why stop there? The integration of R-powered analytics in Hadoop presents a totally new value proposition. Organizations can write R code and deploy it natively in Hadoop without data movement or the need to write their own MapReduce. Bringing R-powered predictive analytics into Hadoop will accelerate Hadoop’s value to organizations by allowing them to break through performance and scalability challenges and solve new analytic problems. Use all the data in Hadoop to discover more, grow more quickly, and operate more efficiently. Ask bigger questions. Ask new questions. Get better, faster results and share them.
Real-Time Analytics With StarRocks (DWH+DL).pdfAlbert Wong
StarRocks, an open-source distributed columnar engine for real-time analytics, is carving its niche in the big data landscape. Its ability to handle high-velocity data streams and deliver blazing-fast query responses makes it a compelling choice for modern analytics workloads. Let's delve into the intricacies of real-time analytics with StarRocks and explore its capabilities.
Data Ingestion:
The journey begins with ingesting data into StarRocks. It supports a variety of real-time data sources, including Kafka, Pulsar, and custom streaming protocols. These integrations allow seamless data flow from streaming sources to StarRocks, ensuring minimal latency.
Stream Processing and Storage:
StarRocks employs a hybrid architecture for real-time processing. Incoming data streams are first processed by lightweight stream engines like Flink or Spark. These engines perform initial aggregations and transformations, preparing the data for efficient storage in StarRocks' columnar format. This format facilitates rapid data retrieval and filtering, crucial for real-time querying.
Real-time Querying:
The true power of StarRocks lies in its real-time query engine. Once data lands in StarRocks, users can leverage SQL-like queries to analyze it with minimal lag. StarRocks optimizes queries by exploiting its columnar storage and parallel processing capabilities. This enables sub-second response times for even complex queries, empowering users to gain immediate insights from their data.
Advanced Features:
StarRocks packs several features that further enhance its real-time analytics prowess. Materialized views act as pre-computed summaries of data, accelerating frequently used queries. Additionally, StarRocks' automatic tiered storage seamlessly migrates less frequently accessed data to cost-effective storage solutions, optimizing resource utilization.
When and How Data Lakes Fit into a Modern Data ArchitectureDATAVERSITY
Whether to take data ingestion cycles off the ETL tool and the data warehouse or to facilitate competitive Data Science and building algorithms in the organization, the data lake – a place for unmodeled and vast data – will be provisioned widely in 2020.
Though it doesn’t have to be complicated, the data lake has a few key design points that are critical, and it does need to follow some principles for success. Avoid building the data swamp, but not the data lake! The tool ecosystem is building up around the data lake and soon many will have a robust lake and data warehouse. We will discuss policy to keep them straight, send data to its best platform, and keep users’ confidence up in their data platforms.
Data lakes will be built in cloud object storage. We’ll discuss the options there as well.
Get this data point for your data lake journey.
Customer value analysis of big data productsVikas Sardana
Business value analysis through Customer Value Model for software technology choices with a case study from Mobile Advertising industry for Big Data use case.
Enabling Next Gen Analytics with Azure Data Lake and StreamSetsStreamsets Inc.
Big data and the cloud are perfect partners for companies who want to unlock maximum value from all of their unstructured, semi-structured, and structured data. The challenge has been how to create and manage a reliable end-to-end solution that spans data ingestion, storage and analysis in the face of the volume, velocity and variety of big data sources.
In this webinar, we will show you how to achieve big data bliss by combining StreamSets Data Collector, which specializes in creating and running complex any-to-any dataflows, with Microsoft's Azure Data Lake and Azure analytic solutions.
We will walk through an example of how a major bank is using StreamSets to transport their on-premise data to the Azure Cloud Computing Platform and Azure Data Lake to take advantage of analytics tools with unprecedented scale and performance.
The Art of the Pitch: WordPress Relationships and SalesLaura Byrne
Clients don’t know what they don’t know. What web solutions are right for them? How does WordPress come into the picture? How do you make sure you understand scope and timeline? What do you do if sometime changes?
All these questions and more will be explored as we talk about matching clients’ needs with what your agency offers without pulling teeth or pulling your hair out. Practical tips, and strategies for successful relationship building that leads to closing the deal.
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
UiPath Test Automation using UiPath Test Suite series, part 3DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 3. In this session, we will cover desktop automation along with UI automation.
Topics covered:
UI automation Introduction,
UI automation Sample
Desktop automation flow
Pradeep Chinnala, Senior Consultant Automation Developer @WonderBotz and UiPath MVP
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Tobias Schneck
As AI technology is pushing into IT I was wondering myself, as an “infrastructure container kubernetes guy”, how get this fancy AI technology get managed from an infrastructure operational view? Is it possible to apply our lovely cloud native principals as well? What benefit’s both technologies could bring to each other?
Let me take this questions and provide you a short journey through existing deployment models and use cases for AI software. On practical examples, we discuss what cloud/on-premise strategy we may need for applying it to our own infrastructure to get it to work from an enterprise perspective. I want to give an overview about infrastructure requirements and technologies, what could be beneficial or limiting your AI use cases in an enterprise environment. An interactive Demo will give you some insides, what approaches I got already working for real.
Key Trends Shaping the Future of Infrastructure.pdfCheryl Hung
Keynote at DIGIT West Expo, Glasgow on 29 May 2024.
Cheryl Hung, ochery.com
Sr Director, Infrastructure Ecosystem, Arm.
The key trends across hardware, cloud and open-source; exploring how these areas are likely to mature and develop over the short and long-term, and then considering how organisations can position themselves to adapt and thrive.
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Ramesh Iyer
In today's fast-changing business world, Companies that adapt and embrace new ideas often need help to keep up with the competition. However, fostering a culture of innovation takes much work. It takes vision, leadership and willingness to take risks in the right proportion. Sachin Dev Duggal, co-founder of Builder.ai, has perfected the art of this balance, creating a company culture where creativity and growth are nurtured at each stage.
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Jeffrey Haguewood
Sidekick Solutions uses Bonterra Impact Management (fka Social Solutions Apricot) and automation solutions to integrate data for business workflows.
We believe integration and automation are essential to user experience and the promise of efficient work through technology. Automation is the critical ingredient to realizing that full vision. We develop integration products and services for Bonterra Case Management software to support the deployment of automations for a variety of use cases.
This video focuses on the notifications, alerts, and approval requests using Slack for Bonterra Impact Management. The solutions covered in this webinar can also be deployed for Microsoft Teams.
Interested in deploying notification automations for Bonterra Impact Management? Contact us at sales@sidekicksolutionsllc.com to discuss next steps.
4. Citi: A Customer Centered Organization
3
As a customer-centered bank, the goal of our Big Data strategy to shift
the focus from independent vertical silos to Common Horizontal Solutions
focused around Citi’s 200-million customer accounts
5. Big Data Adoption Stakeholders
• Lines of Business
• Strategy & Decision Management Organizations: cross LOB & Geo,
global
• Data innovation Office: Governance & Regulatory
• CitiData – Big Data & Analytics Engineering
4
6. Big Data Adoption Roadmap
5
Adoption will not occur at once. The level of capability maturity across the
organization will vary significantly.
On theory we think in terms of Staged Competencies of a Big Data
Maturity Model.
In practice, a hybrid process, which fits the level of maturity of
participants, is needed.
Common
Data
Common
Analytic
Platform
Common
Tools &
Techniques
Common
Solutions
Common
Focus
Strategy
7. Big Data Adoption Hybrid Participation Model
• Novice: Proof of Concept
• Expert: R&D Environment
• Shadowed
6
8. 7
End-to-end Analytic Process for a POC Project
This is one component of the hybrid model
Ideas and
Hypotheses
Information Asset
Inventory
Navigator
(“IAIN”)
• Pipeline of ideas
to use data for
competitive
advantage
• Robust,
comprehensive
ontology
allowing analysts
and economists
to search, sort,
and select data
for analysis
• Preliminary
assessment
for business
value, data
safekeeping
and
alignment to
business
practices
Data
Transformation &
Provisioning
• Transformation rules
executed to
normalize and
conform production
data
• Conformed data set
made available in
production
environment
Production Model
Development
• Develop scalable,
productizable
analytics
Model
Deployment
• Exploit insights and
analyses across the
enterprise to
maximize value
• Models measured
for quality / usage
• Formal approval
process through
Business
Steering
Committee
based on
understanding
expected use of
production data
R&D process
R&D
Project
Approval
Product
Approval
Engineering / Production process
Analytics
Knowledge
Management
• Robust, compreh
ensive ontology
allowing analysts
and economists
to
search, sort, an
d select data for
analysis
Data Set
Preparation
&
Provisioning
• Basic preparation
of data set (e.g.,
consolidation,
conformation)
• Permission-based
provisioning of
data set into a Big
Data Analytics
environment
Analytics
Execution
• Advanced
analytic tools
mine business
insight from
large volumes of
data
• Data scientist
peers review
model findings
and results
Analytics Peer
Review
Data
Acquisition
• Where
necessary,
acquire new
data sets to
support R&D
project
9. Advanced Global Solutions
• A global solution is a tested algorithm or analytic model that carries
out a particular business analysis and which is leveraged at a global
scale
• A big data global solution enables the interplay of complex algorithms
and large datasets
• When a global solution is built upon big data approaches a delivery
roadmap should be considered
• In the exploratory process a Global Solution is developed in the
Innovation R/D environment and validated through a POC process
• Alignment with Innovation, UAT, PRD environments
8
11. The Boom Driving Big Data is Technological
Heebyung Koh , Christopher L. Magee
A functional approach for studying technological progress:
Extension to energy technology
Technological Forecasting and Social Change, Volume 75, Issue 6,
July 2008, Pages 735–758
12. The Quadrant Of Analytic Opportunity
Run Time is affected by Data Size and Algorithmic Complexity
Algorithmic Complexity
Database
Interaction
Mtg+Cards+
Banking
Accounts Transaction
features
Accounts Transactions
Branches Transactions
Accounts Summary Stats.
Employees Summary Stats.
GL-GOCS GL-Entries
Branches Summary Stats.
10^10
10^9
10^9
10^8
10^7
10^6
10^5
Data Size
Sequence
Mining
Predictive
filtering
Latent
Dirichlet Allocation
HMM Baum-
Welch
O(ns nf nt)
CART
O(nf ns log ns)
Iterative
SVD- CF
K-means
Logistic
Regression
PCAPage
Rank
Self-Org.
Maps
Neural Nets
Collaborative
Filtering
(CF)
Vector based
Approaches
HMM
Machine
Learning
Traditional
Statistical
Big Data/Pattern
Mining
Conditional
Random
Fields
Support Vector
Machines
13. Breaking down the gains of P13n:
A Controlled Incremental Benchmark on a
Workstation grade processor (x500)
Implemented an incremental-SVD (Netflix Cup) predictive model that
runs on midsize of datasets…
X30
• Compiled Code (vs. interpreted)
x4
• In Memory (vs. Disk access)
X3.12
• Multithread (vs. single thread)
X1.3
• Workstation grade processor
14. Basic Map Reduce Benchmarks
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
1 2 3 4 5 6
Series1
Impact of overhead as function
Of input volume:
Relative Map Throughput
as a function of # Mappers
0
5
10
15
20
25
0 5 10 15 20
RelativeMapCPUtimespeedup
Number of Maps
0.003351955
0.032258065
0.319148936 1
2.631578947
21.12676056
Linear (0.003351955
0.032258065
0.319148936 1
2.631578947
21.12676056)
0
200
400
600
800
1000
1200
1400
1600
0 5 10 15 20
TokensperWallClockSecond
Number of Maps
Series1
Linear (Series1)
15. HAMSTER: Hadoop Multi-signature Search
for Text-based Entity Retrieval
• Core algorithm: String Edit Distance O(mnk2)
• Baseline runs at 100 matches per day
• HAMSTER speedup: 33x (5 node speedup) 60x (java speedup) =
2000x faster
Source
Items
Target
Items
Source
items
per
target
Input
Size
MAP
Records
Cluster
Max Map
Tasks
Effective
Map
Tasks
CPU
map
(secs)
Wall time
34k 618k 100 4.40GB 345 33 33 196k 2h 14
secs
34k 618k 50 8.8GB 690 40 66 196k 1h
47min
34k 618k 30 14.6GB 1,149 40 110 199k 1h 39
min
20. On Demand Simulation: Generate Branches’ DNA
• Case Scenario: Unusual number of cash advances by 2 tellers.
Single day fraud Multi day fraudOriginal branch (August)
21. Creating Regions of Interest based on
On-Demand-Simulation
Minimum-Spanning-
Tree based branch
association for region
of interest generation
Multi-day fraud simulation
Original branch
Region of interest
• Numbers shown
are randomized
indices
22. Conclusion: Lessons Learned
• One Size does not fit all
• Follow a Hybrid Approach
• Leverage Analytic patterns: Global Solutions
• Big Data is about Parallelization
• The future: expensive Algorithms applied to large datasets
• Global Solutions are the combination of algorithmic building blocks
applied to specific business problems
21