This document discusses big fast data and perishable insights that must be acted on quickly. It describes streaming analytics as analyzing data in motion from real-time sources to gain opportunities and avoid crises. Examples of use cases provided are real-time market surveillance and an IoT-enabled smart bank that can send targeted offers to customers. Key components discussed are storing high velocity data in HBase and analyzing it using Storm or Spark on Kafka.
[Ai in finance] AI in regulatory compliance, risk management, and auditingNatalino Busa
AI to Improve Regulatory Compliance, Governance & Auditing. How AI identifies and prevents risks, above and beyond traditional methods. Techniques and analytics that protect customers and firms from cyber-attacks and fraud. Using AI to quickly and efficiently provide evidence for auditing requests.
The Emergence of Alt-Data and its ApplicationsPromptCloud
Alternative data is information gathered from non-traditional information sources. Analysis of this data can provide insights beyond that which an industry’s regular data sources are capable of providing. Here is all you should know about alt-data.
Big Data is the process of harnessing massive Data – structured or unstructured via the means of sensors, actuators, embedded software’s, & network grids.
[Ai in finance] AI in regulatory compliance, risk management, and auditingNatalino Busa
AI to Improve Regulatory Compliance, Governance & Auditing. How AI identifies and prevents risks, above and beyond traditional methods. Techniques and analytics that protect customers and firms from cyber-attacks and fraud. Using AI to quickly and efficiently provide evidence for auditing requests.
The Emergence of Alt-Data and its ApplicationsPromptCloud
Alternative data is information gathered from non-traditional information sources. Analysis of this data can provide insights beyond that which an industry’s regular data sources are capable of providing. Here is all you should know about alt-data.
Big Data is the process of harnessing massive Data – structured or unstructured via the means of sensors, actuators, embedded software’s, & network grids.
Mohanbir Sawhney, Robert R. McCormick Tribune Foundation Clinical Professor of Technology Kellogg School of Management, Northwestern University presents at the 2012 Big Analytics Roadshow.
Companies are drinking from a fire hydrant of data that is too big, moving too fast and is too diverse to be analyzed by conventional database systems. Big Data is like a giant gold mine with large quantities of ore that is difficult to extract. To get value out of Big Data, enterprises need a new mindset and a new set of tools. They also need to know how to extract actionable insights from Big Data that can lead to competitive advantage. The Big Story of Big Data is not what Big Data is, but what it means for business value and competitive advantage.... read more: http://www.biganalytics2012.com/sessions.html#mohan_sawhney
Big Data, Big Deal? (A Big Data 101 presentation)Matt Turck
Background: I prepared this slide deck for a couple of “Big Data 101” guest lectures I did in February 2013 at New York University’s Stern School of Business and at The New School. They’re intended for a college level, non technical audience, as a first exposure to Big Data and related concepts. I have re-used a number of stats, graphics, cartoons and other materials freely available on the internet. Thanks to the authors of those materials.
Top industry use cases for streaming analyticsIBM Analytics
Organizations need to get high value from streaming data to gain new clients and capitalize on market opportunities. Discover how IBM Streams is best suited for use cases that has the need for high speed and low latency.
This presentation, by big data guru Bernard Marr, outlines in simple terms what Big Data is and how it is used today. It covers the 5 V's of Big Data as well as a number of high value use cases.
Future of Investment Operations & Technology InnovationStephen Huppert
Presentation at IBRC Superannuation Funds Back Office Innovations 2017 Forum looking at the potential impact of emerging technologies on back-office teams.
Blockchain, artificial intelligence, robotics, automated data entry and data mining are some of the big technological trends set to radically disrupt the way the back-office teams of superannuation funds and their service providers do their jobs in the not-too-distant future.
Big Data: Industry trends and key playersCM Research
Big data is data that cannot be analysed on a traditional database. Companies that develop the database platforms to analyse big data will make a fortune. This report looks at industry trends and the key players in this emerging industry.
Francesco Furiani - Marketing is a serious business, moreover tracking and monetizing the campaign that allows your marketing to flourish is very important: our tool allows anyone to monitor, compare and optimize all those campaigns (delivered via links) in one place and to deliver insights about who's using those links. Making this infrastructure, making it works, deliver results in real-time (when necessary) and keep everyone happy from the customer to the CFO will be the point of this talk, from the design to the final result with an eye on the costs/risks/benefits of having everything in the cloud.
Patents are a good information resource for obtaining IoT (Internet of Things) technology development status. IOT big data analytics is becoming important to process unimaginably large amounts of information and data that are obtained by the sensor embedded interconnected IoT devices. The typical IoT big data analytics is Hadoop, an open-source software framework that supports data-intensive distributed applications, and the running of applications on large clusters of commodity hardware. Hadoop, that is based on the architectural framework MapReduce, collects both structured data and unstructured data, processes the collected data set in a distributed network cluster in parallel, and extracts valuable information from the processed data set within a short time. Followings illustrate some examples of patents that provide current status of the IoT big data analytics technology development.
Big data for beginners. Tried to prove that "Big data is like teenage sex: everyone talks about it, nobody really knows how to do it, everyone thinks everyone else is doing it, so everyone claims they are doing it..." is totally wrong.
The conventional supply chain is plagued with various issues, most pressing among which is transparency or the lack of it thereof. Being a self-auditing distributed ledger that is accessible to all participants in the network and updated in real time, Blockchain brings in unprecedented levels of transparency to the supply chain. Moreover, its decentralized nature ensures that no single participant along the supply chain holds an unfair advantage or any influence on the data pertaining to the shipment.
Managing your Assets with Big Data ToolsMachinePulse
This presentation was given by Karthigai Muthu, Lead Big Data Analyst, at a meetup organized by the group Internet of Everything in March 2015.
Through his presentation, Karthik provided a comprehensive understanding of available ecosystem tools and how they can be used to perform data engineering and data analytics. Karthik covers the following topics in his presentation:
• Establishment of complete data pipeline using big data ecosystem tools.
• Tackling of high velocity streams using various stream processing engines on cloud and performing Real Time analytics.
• Tackling of historical data using big data ecosystem tools and migration of traditional infrastructure to big data environments.
• Integration of big data ecosystem for data analysis using SAMOA , R and Mahout.
• Deployments of big data environments on the cloud.
Over the past decade, cloud computing has acted as a disrupter in several areas of IT business. Soon, it will overhaul one area of technology that has been in rapid growth itself: Data Analytics. Nicky will focus on the recent study of IBM Institute of Business Value which shows that capabilities that enable an organization to consume data faster – to move from raw data to insight-driven actions – are now the key differentiator to creating value using data and analytics. He will also talk about the requirements for the underlying infrastructure as critical component allowing real-time crunching and analysis of high volume of data. Based on real cases like retailers and energy companies, we will look at five predictions in five years, based on:
Analytics, Big data, and Cloud coming together will energize the Speed Advantage.
Mohanbir Sawhney, Robert R. McCormick Tribune Foundation Clinical Professor of Technology Kellogg School of Management, Northwestern University presents at the 2012 Big Analytics Roadshow.
Companies are drinking from a fire hydrant of data that is too big, moving too fast and is too diverse to be analyzed by conventional database systems. Big Data is like a giant gold mine with large quantities of ore that is difficult to extract. To get value out of Big Data, enterprises need a new mindset and a new set of tools. They also need to know how to extract actionable insights from Big Data that can lead to competitive advantage. The Big Story of Big Data is not what Big Data is, but what it means for business value and competitive advantage.... read more: http://www.biganalytics2012.com/sessions.html#mohan_sawhney
Big Data, Big Deal? (A Big Data 101 presentation)Matt Turck
Background: I prepared this slide deck for a couple of “Big Data 101” guest lectures I did in February 2013 at New York University’s Stern School of Business and at The New School. They’re intended for a college level, non technical audience, as a first exposure to Big Data and related concepts. I have re-used a number of stats, graphics, cartoons and other materials freely available on the internet. Thanks to the authors of those materials.
Top industry use cases for streaming analyticsIBM Analytics
Organizations need to get high value from streaming data to gain new clients and capitalize on market opportunities. Discover how IBM Streams is best suited for use cases that has the need for high speed and low latency.
This presentation, by big data guru Bernard Marr, outlines in simple terms what Big Data is and how it is used today. It covers the 5 V's of Big Data as well as a number of high value use cases.
Future of Investment Operations & Technology InnovationStephen Huppert
Presentation at IBRC Superannuation Funds Back Office Innovations 2017 Forum looking at the potential impact of emerging technologies on back-office teams.
Blockchain, artificial intelligence, robotics, automated data entry and data mining are some of the big technological trends set to radically disrupt the way the back-office teams of superannuation funds and their service providers do their jobs in the not-too-distant future.
Big Data: Industry trends and key playersCM Research
Big data is data that cannot be analysed on a traditional database. Companies that develop the database platforms to analyse big data will make a fortune. This report looks at industry trends and the key players in this emerging industry.
Francesco Furiani - Marketing is a serious business, moreover tracking and monetizing the campaign that allows your marketing to flourish is very important: our tool allows anyone to monitor, compare and optimize all those campaigns (delivered via links) in one place and to deliver insights about who's using those links. Making this infrastructure, making it works, deliver results in real-time (when necessary) and keep everyone happy from the customer to the CFO will be the point of this talk, from the design to the final result with an eye on the costs/risks/benefits of having everything in the cloud.
Patents are a good information resource for obtaining IoT (Internet of Things) technology development status. IOT big data analytics is becoming important to process unimaginably large amounts of information and data that are obtained by the sensor embedded interconnected IoT devices. The typical IoT big data analytics is Hadoop, an open-source software framework that supports data-intensive distributed applications, and the running of applications on large clusters of commodity hardware. Hadoop, that is based on the architectural framework MapReduce, collects both structured data and unstructured data, processes the collected data set in a distributed network cluster in parallel, and extracts valuable information from the processed data set within a short time. Followings illustrate some examples of patents that provide current status of the IoT big data analytics technology development.
Big data for beginners. Tried to prove that "Big data is like teenage sex: everyone talks about it, nobody really knows how to do it, everyone thinks everyone else is doing it, so everyone claims they are doing it..." is totally wrong.
The conventional supply chain is plagued with various issues, most pressing among which is transparency or the lack of it thereof. Being a self-auditing distributed ledger that is accessible to all participants in the network and updated in real time, Blockchain brings in unprecedented levels of transparency to the supply chain. Moreover, its decentralized nature ensures that no single participant along the supply chain holds an unfair advantage or any influence on the data pertaining to the shipment.
Managing your Assets with Big Data ToolsMachinePulse
This presentation was given by Karthigai Muthu, Lead Big Data Analyst, at a meetup organized by the group Internet of Everything in March 2015.
Through his presentation, Karthik provided a comprehensive understanding of available ecosystem tools and how they can be used to perform data engineering and data analytics. Karthik covers the following topics in his presentation:
• Establishment of complete data pipeline using big data ecosystem tools.
• Tackling of high velocity streams using various stream processing engines on cloud and performing Real Time analytics.
• Tackling of historical data using big data ecosystem tools and migration of traditional infrastructure to big data environments.
• Integration of big data ecosystem for data analysis using SAMOA , R and Mahout.
• Deployments of big data environments on the cloud.
Over the past decade, cloud computing has acted as a disrupter in several areas of IT business. Soon, it will overhaul one area of technology that has been in rapid growth itself: Data Analytics. Nicky will focus on the recent study of IBM Institute of Business Value which shows that capabilities that enable an organization to consume data faster – to move from raw data to insight-driven actions – are now the key differentiator to creating value using data and analytics. He will also talk about the requirements for the underlying infrastructure as critical component allowing real-time crunching and analysis of high volume of data. Based on real cases like retailers and energy companies, we will look at five predictions in five years, based on:
Analytics, Big data, and Cloud coming together will energize the Speed Advantage.
Big data-analytics-changing-way-organizations-conducting-businessAmit Bhargava
Hi Friends ,
There is an interesting post on how to leveraging Big data analytics in an Integrated GRC Environment in an Organize to have visibility in core enterprises issues on real time basis . This presentation is from Metric stream -an international and Global GRC soloutioning providers in association with Dr. Kirk. D. Borne - Big data consultant and Adviser .Hope you like it and enjoy as well.
Big data is a phenomenon brought about by rapid data growth, complex, new, and changing data types, and parallel technology advancements; it brings huge possibilities. By optimizing these enormous amounts of structured and unstructured data, CSPs are in a unique position to capture these opportunities and create new revenue streams.
Abstract:
Big Data concern large-volume, complex, growing data sets with multiple, autonomous sources. With the fast development of networking, data storage, and the data collection capacity, Big Data are now rapidly expanding in all science and engineering domains, including physical, biological and biomedical sciences. This paper presents a HACE theorem that characterizes the features of the Big Data revolution, and proposes a Big Data processing model, from the data mining perspective. This data-driven model involves demand-driven aggregation of information sources, mining and analysis, user interest modeling, and security and privacy considerations. We analyze the challenging issues in the data-driven model and also in the Big Data revolution.
Big Brother Big Sister Bluemix Architecture from #HackathonCLTDave Callaghan
Big Brother and Big Sisters brought their enterprise system challenge to #HackathonCLT. I wanted to design a system that could support 10x the membership with the same expense. By using Bluemix to provide scale, the Watson APIs for both ingestion and analytics, and their Hyperledger implementation for security paired with HBase, I believe we have a potential solution. More to come!
Discuss building a trust solution for HealthIT or other regulated enterprises with blockchain using Hyperledger with Hbase for off-blockchain storage for scaling prototyped on Bluemix.
Stormwater analytics with MongoDB and PentahoDave Callaghan
Use MongoDB and Pentaho to rapidly evaluate a use case for the City of Charlotte's Stormwater Management System by creating "A Single View of a Raindrop".
There are any number of vendors and publications stating that IT departments need to invest big in Big Data and Big Analytics to meet the challenges of the Internet of Things. Let's swap out marketing and hype for logic and math and separate the signal from the noise. We'll come up with a clear problem definition and come up with an algorithmic approach to the problem. Once we have a framework, we can more intelligently choose an implementation.
2. Perishable Insights
1.Data is perishable if there is only a limited
amount of time to act upon that event.
Market data, clickstream, mobile devices,
sensors, and transactions may contain valuable,
but perishable insights.
1.Sieze opportunities and avoid crises amidst
explosive complexity.
3. Streaming Analytics
“Streaming analytics is anything but a sleepy, rearview mirror analysis of
data. No, it is about knowing and acting on what’s happening in your
business at this very moment — now. Forrester calls these perishable
insights because they occur at a moment’s notice and you must act on
them fast within a narrow window of opportunity before they quickly lose
their value. The high velocity, white-water flow of data from innumerable
real-time data sources such as market data, Internet of Things, mobile,
sensors, clickstream, and even transactions remain largely unnavigated
by most firms. The opportunity to leverage streaming analytics has never
been greater.”
4. Big Fast Data
Big Data
Enable analytics of data at scale
PB's of data
Batch Processing (minutes to hours)
High latency
Fast Data
Enable analytics of data in motion
Millions of events per second
Stream Processing (nanoseconds to seconds)
Low latency
Data
Explosion
Computational
Explosion
5. Big Fast Data
lBatch Processing
lIn a traditional query model, you store data and then
run queries on the data as needed.
lQuery-driven model
lStream Processing
lIn a streaming data model, you store queries and then
continuously run data through the queries.
lEvent-driven model
6. Thinking in Streams
lFiltering
lStreaming data can be filled with irrelevant or invalid
data since data typically does not go through a data
governance step until it is stored in Hadoop.
lAggregation/Correlation
lStreaming data typically comes from multiple sources
and can be combined in multiple ways.
lLocation/Motion
lMobile devices, from phones to wearables, are
ubiquitous.
7. Thinking in Streams
lTime Windows
lBy taking a snapshot of the data stream, time
windows can be used to provide time series analysis
in real time (weighted moving averages, Bollinger
Bands, etc)
lTemporal Patterns
lEvents can often contain interesting patterns relative
to new data coming in, such as data streaming at
different times and different patterns of data at the
same time.
8. Use Case for Fast Data Analytics
lManage Risk
lMarket Surveillance in a high frequency world
lMaximize Reward
lCustomer Satisfaction in the Internet of Things
9. Real-Time Market Surveillance
lConvergent threat system
lSupport for historical, realtime and predictive
modeling
lSupport for Big Fast Data
lSupport for multi- and cross-asset class
monitoring
lSupport for cross-border surveillance
l
10. Real-Time Market Surveillance
lContinuous predictive analysis
lBy leveraging historical data, a continuous
predictive analysis can extrapolate events that
have happened up to the current time and then
predict a future event.
lFor example, the normal operating parameters of
an algorithm (size, frequency, instrument, order-
to-trade ratio) can be established strictly based on
history. Deviation from this norm can trigger
defensive measures. Consider Knight Capital.
11. Real-Time Market Surveillance
lConvergent threat system
lA rogue algorithm can be shut down if its operating
outside of its normal behavior pattern.
lA rogue trader, market manipulator or insider dealer can
be identified based on rules specified by Compliance.
lThe key is a unified framework.
lProvide a unified data ingestion system to ingest data
from the large number of disconnected (and
unconnectable) data sources.
12. Disintermediation
lMobile wallets pose the same threat level to
banks as self-directed equities trading.
lDBS uses its vast amount of client infromation to
categorize customers, statistically predict behavior
and send targeted promotional offers to their
client's cell phones when they are engaged with a
business partner.
13. IoT Smart Bank
lA location-aware promotion application requires
continuous, event-driven queries that continuously
monitoring , analyzing and responding to the
changing status of subscribers and promotions.
lThese queries are multidimensional; matching
location, context, preferences, socioeconomic
factors, and spending habits. These dimensions
are in flux.
14. IoT Smart Bank
lThe numbers
l1,000 people moving once per minute equals 17
events per second
l1 million people would generate 16,667 events
per second
lWells Fargo has 70 million customers.
15. Streaming Use Cases
lNetwork monitoring
lIntelligence and surveillance
lRisk management
lE-commerce
lFraud detection
lSmart order routing
lTransaction cost analysis
lPricing and analytics
lMarket data management
lAlgorithmic trading
lData warehouse augmentation
16. Components of Fast Data Analytics
lTo store data at velocity, Flume into Hbase
lTo analyze data at velocity, Kafka in Storm or
Spark
17. Open Source Landscape
lApache Storm
lProvides massively scalable event collection
lApache Spark
lGeneral framework for large-scale data processing
lApache Samza
20. Commercial Landscape
lCommercial products will differ from open source
products in the following areas:
lDevelopment tools
lBusiness applications and platform integration
lImplementation support
lCompany financials
lLicensing