This document discusses leveraging machine learning and big data analytics. It outlines an analytical pipeline that includes data acquisition, data munging, exploratory data analysis, model building, model improvement, validation, and real-time processing. A case study is presented on using these techniques to predict when to scrap parts in an assembly line to reduce costs. Key takeaways are that machine learning can find hidden insights in historical big data, models derived from this can be applied to real-time event processing without redevelopment, and this enables automated actions based on predictive analytics.
The (very) basics of AI for the Radiology residentPedro Staziaki
The (very) basics of AI for the Radiology resident.
Also on YouTube: https://youtu.be/ia90UKjlmBA
Artificial Intelligence, Machine Learning, Deep Learning, CNN, Convolutional Neural Networks, Support Vector Machine (SVM), GPU. Felipe Kitamura. Pedro Vinícius Staziaki.
The Big Data Scotland 2015 conference brought together business leaders and technologists from across the country to explore the value of Big Data & Analytics. The conference considered technological developments, market trends and business strategy; showcasing innovative examples of analytics being used effectively across a range of practical applications. The event offered a unique opportunity for technologists to come together for knowledge exchange, networking and debate.
The (very) basics of AI for the Radiology residentPedro Staziaki
The (very) basics of AI for the Radiology resident.
Also on YouTube: https://youtu.be/ia90UKjlmBA
Artificial Intelligence, Machine Learning, Deep Learning, CNN, Convolutional Neural Networks, Support Vector Machine (SVM), GPU. Felipe Kitamura. Pedro Vinícius Staziaki.
The Big Data Scotland 2015 conference brought together business leaders and technologists from across the country to explore the value of Big Data & Analytics. The conference considered technological developments, market trends and business strategy; showcasing innovative examples of analytics being used effectively across a range of practical applications. The event offered a unique opportunity for technologists to come together for knowledge exchange, networking and debate.
Take the Big Data Challenge - Take Advantage of ALL of Your Data 16 Sept 2014pietvz
A customer service call can transform internal processes. Information in Tweets and reviews can lead to better products. Structured and unstructured data brought together can reveal patterns and relationships that unlock powerful business opportunities. We will discuss real-world use cases and best practices for building the infrastructure you need to power Big Data analytics solutions. From the latest in Hadoop innovation, cognitive computing, and cloud-based analytical web services, you will learn how organizations large and small can harness the power of unstructured human information to create, deploy, and deliver the next generation of analytics applications.
Powered by HP IDOL, HP Autonomy delivers intelligent applications that allow your organization to understand the concepts and context of all information in real time, mitigating risk and identifying opportunity. Join us at this session to learn how HP Autonomy can unlock the value of your company’s structured and unstructured data for better insight and greater competitive advantage. HP IDOL, the OS for human information, enables you to index, manage, and process all your data, both structured and unstructured. Learn how HP IDOL delivers unprecedented insights into optimized architecture, scalability, performance, mapped security, and connectivity. Find out more about IDOLOnDemand.com and how you can leverage this revolutionary technology in your own organization.
Robert Lecklin - BigData is making a differenceIBM Sverige
Vad kan Big data göra för ditt företag? Låt dig inspireras av Robert Lecklin som har hjälpt flera kunder att implementera sin Big data strategi. Genom detta har de lyckats omvandlat värdelös data till värdefulla insikter. Han kommer i denna session att dela med sig av erfarenheter av kundcase där en strategi för big data gjort avgörande skillnad...
An AI Maturity Roadmap for Becoming a Data-Driven OrganizationDavid Solomon
The initial version of a maturity roadmap to help guide businesses when adopting AI technology into their workflow. IBM Watson Studio is referenced as an example of technology that can help in accelerating the adoption process.
Invited talk at the LTsolutions International Workshop in Donostia-San Sebastián, Spain on 21st May 2015.
More information about the workshop at: http://www.langune.com/home/presentationen/ltsolutions
IBM Watson Jeopardy! white paper which explains Watson’s workload optimised system design based on IBM DeepQA architecture and POWER7® processor-based servers
YouTube Link: https://youtu.be/Vs9k3FThNic
*** Big Data Hadoop Certification Training - https://www.edureka.co/big-data-hadoop-training-certification ***
This Edureka PPT on Big Data Technologies will provide you in-depth knowledge on Big Data Tools. This video will help you understand different types of Big-Data Technologies and trending Big-Data Tools in IT Industries. This PPT covers the following topics:
Agenda
What is Big Data Technology?
Types of Big Data Technologies.
Operational Big Data
Analytical Big Data
Top Big Data Technologies
Big Data Technologies in Storage
Big Data Technologies in Data Mining
Big Data Technologies in Data Analytics
Data Visualization
Emerging Big Data Technologies
Complete Hadoop Playlist: https://goo.gl/ExJdZs
Complete Blog Series: http://bit.ly/2wO8l0y
Big Data Podcast - https://castbox.fm/channel/id1814029
Follow us to never miss an update in the future.
YouTube: https://www.youtube.com/user/edurekaIN
Instagram: https://www.instagram.com/edureka_learning/
Facebook: https://www.facebook.com/edurekaIN/
Twitter: https://twitter.com/edurekain
LinkedIn: https://www.linkedin.com/company/edureka
Castbox: https://castbox.fm/networks/505?country=in
Top 10 ways BigInsights BigIntegrate and BigQuality will improve your lifeIBM Analytics
BigInsights BigIntegrate and BigQuality offer a cost-effective opportunity to fully leverage the scale and promise of Hadoop. Here are 10 ways BigIntegrate and BigQuality are making it easier for organizations to harness the power of their entire data ecosystems. Learn more at ibm.co/datagovernance
Findability Day 2016 - Enterprise Search and Findability Survey 2016Findwise
In April 2016, Findwise opened the fifth annual Enterprise Search and Findability Survey to investigate how search is managed and used globally. Mattias Ellison presents some of the most interesting founds at Findability Day 2016 in Stockholm.
Scania Interim Report January – June 2016Scania Group
Summary of the first six months of 2016
* Operating income amounted to SEK 1,348 m. (4,737), negatively impacted by a provision of SEK 3.8 billion related to the European Commission’s competition investigation
* Operating income excluding items affecting comparability rose by 9 percent to SEK 5,148 m. (4,737), resulting in an operating margin of 10.3 (10.1) percent
* Net sales rose by 7 percent to SEK 50,110 m. (46,798)
* Cash flow amounted to SEK -492 m. (1,106) in Vehicles and Services
Take the Big Data Challenge - Take Advantage of ALL of Your Data 16 Sept 2014pietvz
A customer service call can transform internal processes. Information in Tweets and reviews can lead to better products. Structured and unstructured data brought together can reveal patterns and relationships that unlock powerful business opportunities. We will discuss real-world use cases and best practices for building the infrastructure you need to power Big Data analytics solutions. From the latest in Hadoop innovation, cognitive computing, and cloud-based analytical web services, you will learn how organizations large and small can harness the power of unstructured human information to create, deploy, and deliver the next generation of analytics applications.
Powered by HP IDOL, HP Autonomy delivers intelligent applications that allow your organization to understand the concepts and context of all information in real time, mitigating risk and identifying opportunity. Join us at this session to learn how HP Autonomy can unlock the value of your company’s structured and unstructured data for better insight and greater competitive advantage. HP IDOL, the OS for human information, enables you to index, manage, and process all your data, both structured and unstructured. Learn how HP IDOL delivers unprecedented insights into optimized architecture, scalability, performance, mapped security, and connectivity. Find out more about IDOLOnDemand.com and how you can leverage this revolutionary technology in your own organization.
Robert Lecklin - BigData is making a differenceIBM Sverige
Vad kan Big data göra för ditt företag? Låt dig inspireras av Robert Lecklin som har hjälpt flera kunder att implementera sin Big data strategi. Genom detta har de lyckats omvandlat värdelös data till värdefulla insikter. Han kommer i denna session att dela med sig av erfarenheter av kundcase där en strategi för big data gjort avgörande skillnad...
An AI Maturity Roadmap for Becoming a Data-Driven OrganizationDavid Solomon
The initial version of a maturity roadmap to help guide businesses when adopting AI technology into their workflow. IBM Watson Studio is referenced as an example of technology that can help in accelerating the adoption process.
Invited talk at the LTsolutions International Workshop in Donostia-San Sebastián, Spain on 21st May 2015.
More information about the workshop at: http://www.langune.com/home/presentationen/ltsolutions
IBM Watson Jeopardy! white paper which explains Watson’s workload optimised system design based on IBM DeepQA architecture and POWER7® processor-based servers
YouTube Link: https://youtu.be/Vs9k3FThNic
*** Big Data Hadoop Certification Training - https://www.edureka.co/big-data-hadoop-training-certification ***
This Edureka PPT on Big Data Technologies will provide you in-depth knowledge on Big Data Tools. This video will help you understand different types of Big-Data Technologies and trending Big-Data Tools in IT Industries. This PPT covers the following topics:
Agenda
What is Big Data Technology?
Types of Big Data Technologies.
Operational Big Data
Analytical Big Data
Top Big Data Technologies
Big Data Technologies in Storage
Big Data Technologies in Data Mining
Big Data Technologies in Data Analytics
Data Visualization
Emerging Big Data Technologies
Complete Hadoop Playlist: https://goo.gl/ExJdZs
Complete Blog Series: http://bit.ly/2wO8l0y
Big Data Podcast - https://castbox.fm/channel/id1814029
Follow us to never miss an update in the future.
YouTube: https://www.youtube.com/user/edurekaIN
Instagram: https://www.instagram.com/edureka_learning/
Facebook: https://www.facebook.com/edurekaIN/
Twitter: https://twitter.com/edurekain
LinkedIn: https://www.linkedin.com/company/edureka
Castbox: https://castbox.fm/networks/505?country=in
Top 10 ways BigInsights BigIntegrate and BigQuality will improve your lifeIBM Analytics
BigInsights BigIntegrate and BigQuality offer a cost-effective opportunity to fully leverage the scale and promise of Hadoop. Here are 10 ways BigIntegrate and BigQuality are making it easier for organizations to harness the power of their entire data ecosystems. Learn more at ibm.co/datagovernance
Findability Day 2016 - Enterprise Search and Findability Survey 2016Findwise
In April 2016, Findwise opened the fifth annual Enterprise Search and Findability Survey to investigate how search is managed and used globally. Mattias Ellison presents some of the most interesting founds at Findability Day 2016 in Stockholm.
Scania Interim Report January – June 2016Scania Group
Summary of the first six months of 2016
* Operating income amounted to SEK 1,348 m. (4,737), negatively impacted by a provision of SEK 3.8 billion related to the European Commission’s competition investigation
* Operating income excluding items affecting comparability rose by 9 percent to SEK 5,148 m. (4,737), resulting in an operating margin of 10.3 (10.1) percent
* Net sales rose by 7 percent to SEK 50,110 m. (46,798)
* Cash flow amounted to SEK -492 m. (1,106) in Vehicles and Services
Scania Interim Report, January–March 2016Scania Group
Scania’s sales amounted to SEK 23.1 billion and earnings for the first quarter amounted to SEK 2,295 m. Higher vehicle volume in Europe and higher service volume were partly offset by negative currency rate effects and low capacity utilisation in the production system in Latin America. The high level of investments is starting to impact earnings.
Scania interim report january september 2016Scania Group
Scania’s sales reached SEK 75.2 billion in the first nine months of 2016 and the company’s underlying operational performance was strong. Higher vehicle volume in Europe and increased service revenue was partly offset by negative currency rate effects and lower deliveries in Latin America.
Architecture of Search Systems and Measuring the Search EffectivenessFindwise
Lecture made at the 19th of April 2012, at the Warsaw University of Technology. This is the 9th lecture in the regular course at master grade studies "Introduction to text mining".
Presenterad på Sitevisiondagarna 2012 i Örebro av Anders Häggdahl.
Presentationen bygger på vår långa erfarenhet av att ta fram och implementera sök och Findability-lösningar.
Vi visar exempel på hur några av de mer än 100 organisationer som vi arbetar med, tänker runt Findability och berättar om Findwise metod för att skapa maximal utväxling på våra kunders investeringar. Vidare går vi igenom vilka fördelar vårt samarbete med Sitevision ger användarna när man ska förbättra sin sökfunktionalitet.
Best Practices for Enterprise Search - What Leading Practitioners DoFindwise
Best Practices for Enterprise Search, from the perspective of practitioners. More focused on tasks and processes than technology. Based on data from the Enterprise Search and Findability Survey, other research, empirical evidence and the experience gained by Findwise consultants.
Findability Day 2016 - Enterprise Search and Findability Survey 2016Findwise
In April 2016, Findwise opened the fifth annual Enterprise Search and Findability Survey to investigate how search is managed and used globally. Mattias Ellison presents some of the most interesting founds at Findability Day 2016 in Stockholm.
Scania Year-end Report January-December 2016Scania Group
net sales rose to a record level of nearly SEK 104 billion. Total deliveries of trucks and buses and coaches reached all-time high levels. Service revenue amounted to a record of almost SEK 22 billion, an increase of 5 percent (7 percent in local currency).
The digital transformation is going forward due to Mobile, Cloud and Internet of Things. Disrupting business models leverage Big Data Analytics and Machine Learning.
"Big Data" is currently a big hype. Large amounts of historical data are stored in Hadoop or other platforms. Business Intelligence tools and statistical computing are used to draw new knowledge and to find patterns from this data, for example for promotions, cross-selling or fraud detection. The key challenge is how these findings can be integrated from historical data into new transactions in real time to make customers happy, increase revenue or prevent fraud. "Fast Data" via stream processing is the solution to embed patterns - which were obtained from analyzing historical data - into future transactions in real-time.
This session uses several real world success stories to explain the concepts behind stream processing and its relation to Hadoop and other big data platforms. It discusses how patterns and statistical models of R, Spark MLlib, H2O, and other technologies can be integrated into real-time processing by using several different real world case studies. The session also points out why a Microservices architecture helps solving the agile requirements for these kind of projects.
A brief overview of available open source frameworks and commercial products shows possible options for the implementation of stream processing, such as Apache Storm, Apache Flink, Spark Streaming, IBM InfoSphere Streams, or TIBCO StreamBase.
A live demo shows how to implement stream processing, how to integrate machine learning, and how human operations can be enabled in addition to the automatic processing via a Web UI and push events.
Keywords: Big Data, Fast Data, Machine Learning, Analytics, Analytic Model, Stream Processing, Event Processing, Streaming Analytics, Real Time, Hadoop, Spark, MLlib, Streaming, R, TERR, TIBCO, Spotfire, StreamBase, Live Datamart, H20, Predictive Analytics, Data Discovery, Insights, Patterns
How to Apply Big Data Analytics and Machine Learning to Real Time Processing ...Codemotion
The world gets connected more and more every year due to Mobile, Cloud and Internet of Things. "Big Data" is currently a big hype. Large amounts of historical data are stored in Hadoop to find patterns, e.g. for predictive maintenance or cross-selling. But how to increase revenue or reduce risks in new transactions? "Fast Data" via stream processing is the solution to embed patterns into future actions in real-time. This session discusses how machine learning and analytic models with R, Spark MLlib, H2O, etc. can be integrated into real-time event processing. A live demo concludes the session
Machine Learning Applied to Real Time Scoring in Manufacturing and Energy Uti...Kai Wähner
Kai Wähner (@KaiWaehner) is a Technology Evangelist and Community Director at TIBCO Software - a leading provider of integration and analytics middleware. Kai is an experience guy in broad variety of topics like Big Data, Advanced Analytics & Machine Learning, he loves to write articles and blog about new technologies and make talks. The talk is about 3 different projects where Kai's team built analytic models with technologies R, Apache Spark or H2O.ai which were deployed to real time processing. The use cases include predictive maintenance in manufacturing but also fraud detection in banking and context-specific pricing in insurance. For one of the cases, Kai gonna show detailed steps will be, how it was built and deployed using supervised/unsupervised ML.
Talk was done together with my colleague Ankitaa Bhowmick.
R, Spark, Tensorflow, H20.ai Applied to Streaming AnalyticsKai Wähner
Slides from my talk at Codemotion Rome in March 2017. Development of analytic machine learning / deep learning models with R, Apache Spark ML, Tensorflow, H2O.ai, RapidMinder, KNIME and TIBCO Spotfire. Deployment to real time event processing / stream processing / streaming analytics engines like Apache Spark Streaming, Apache Flink, Kafka Streams, TIBCO StreamBase.
How to Leverage Machine Learning (R, Hadoop, Spark, H2O) for Real Time Proces...Codemotion
Big Data is key for innovation in many industries today. Large amounts of historical data are stored and analyzed in Hadoop, Spark or other clusters to find patterns, e.g. for predictive maintenance or cross-selling. However: How do you increase revenue or reduce risks in new transactions proactively? Stream processing is the solution to embed patterns into future actions in real-time. This session discusses and demos how machine learning and analytic models with R, Spark MLlib, H2O, etc. can be build and integrated into real-time event processing frameworks. The session focuses on live demos
Streaming Analytics Comparison of Open Source Frameworks, Products, Cloud Ser...Kai Wähner
Streaming Analytics Comparison of Open Source Frameworks, Products and Cloud Services. Includes Apache Storm, Flink, Spark, TIBCO, IBM, AWS Kinesis, Striim, Zoomdata, ...
This session discusses the technical concepts of stream processing / streaming analytics and how it is related to big data, mobile, cloud and internet of things. Different use cases such as predictive fault management or fraud detection are used to show and compare alternative frameworks and products for stream processing and streaming analytics.
The focus of the session lies on comparing
- different open source frameworks such as Apache Apex, Apache Flink or Apache Spark Streaming
- engines from software vendors such as IBM InfoSphere Streams, TIBCO StreamBase
- cloud offerings such as AWS Kinesis.
- real time streaming UIs such as Striim, Zoomdata or TIBCO Live Datamart.
Live demos will give the audience a good feeling about how to use these frameworks and tools.
The session will also discuss how stream processing is related to Apache Hadoop frameworks (such as MapReduce, Hive, Pig or Impala) and machine learning (such as R, Spark ML or H2O.ai).
How to Apply Machine Learning with R, H20, Apache Spark MLlib or PMML to Real...Kai Wähner
"Big Data" is currently a big hype. Large amounts of historical data are stored in Hadoop or other platforms. Business Intelligence tools and statistical computing are used to draw new knowledge and to find patterns from this data, for example for promotions, cross-selling or fraud detection. The key challenge is how these findings can be integrated from historical data into new transactions in real time to make customers happy, increase revenue or prevent fraud.
"Fast Data" via stream processing is the solution to embed patterns - which were obtained from analyzing historical data - into future transactions in real-time. This session uses several real world success stories to explain the concepts behind stream processing and its relation to Hadoop and other big data platforms. The session discusses how patterns and statistical models of R, Spark MLlib and other technologies can be integrated into real-time processing using open source frameworks (such as Apache Storm, Spark or Flink) or products (such as IBM InfoSphere Streams or TIBCO StreamBase). A live demo shows the complete development lifecycle combining analytics, machine learning and stream processing.
Smart Manufacturing and Industry 4.0 - Tibco PoVNicola Sandoli
Smart Manufacturing and Industry 4.0: generating new insights and operational intelligence.
Manufacturers are increasingly relying on advanced analytics to understand data, anticipate and take proactive steps to prevent costly downtime and improve operational efficiency. Collecting real-time sensor data and mashups using machine learning techniques allows you to identify hidden insights into the potential equipment failures and operational discrepancies before they happen.
Startup pitch presented by co-founder and CEO Jaco Els. Cubitic offers a predictive analytics platform that allows developers to build custom solutions for analytics and visualisation on top of a machine learning engine.
Information processing and analytics cannot be focused only on “store-first” or batch-based approaches. To provide maximum business value, information must also be analyzed closer to the source, and at the speed in which it is being created. Streaming analytics utilizes various techniques for intelligently processing data as it arrives at the edge or within the data center, with the purpose of proactively identifying threats or opportunities for your business.
Similar to Findability Day 2016 - Big data analytics and machine learning (20)
Elastic presents how The Guardian uses log analysis when creating articles and content on their website.
Findwise talks about Big Data and log analysis and the possibilities it gives.
Opendatabay - Open Data Marketplace.pptxOpendatabay
Opendatabay.com unlocks the power of data for everyone. Open Data Marketplace fosters a collaborative hub for data enthusiasts to explore, share, and contribute to a vast collection of datasets.
First ever open hub for data enthusiasts to collaborate and innovate. A platform to explore, share, and contribute to a vast collection of datasets. Through robust quality control and innovative technologies like blockchain verification, opendatabay ensures the authenticity and reliability of datasets, empowering users to make data-driven decisions with confidence. Leverage cutting-edge AI technologies to enhance the data exploration, analysis, and discovery experience.
From intelligent search and recommendations to automated data productisation and quotation, Opendatabay AI-driven features streamline the data workflow. Finding the data you need shouldn't be a complex. Opendatabay simplifies the data acquisition process with an intuitive interface and robust search tools. Effortlessly explore, discover, and access the data you need, allowing you to focus on extracting valuable insights. Opendatabay breaks new ground with a dedicated, AI-generated, synthetic datasets.
Leverage these privacy-preserving datasets for training and testing AI models without compromising sensitive information. Opendatabay prioritizes transparency by providing detailed metadata, provenance information, and usage guidelines for each dataset, ensuring users have a comprehensive understanding of the data they're working with. By leveraging a powerful combination of distributed ledger technology and rigorous third-party audits Opendatabay ensures the authenticity and reliability of every dataset. Security is at the core of Opendatabay. Marketplace implements stringent security measures, including encryption, access controls, and regular vulnerability assessments, to safeguard your data and protect your privacy.
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Subhajit Sahu
Abstract — Levelwise PageRank is an alternative method of PageRank computation which decomposes the input graph into a directed acyclic block-graph of strongly connected components, and processes them in topological order, one level at a time. This enables calculation for ranks in a distributed fashion without per-iteration communication, unlike the standard method where all vertices are processed in each iteration. It however comes with a precondition of the absence of dead ends in the input graph. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. Slowdown on the GPU is likely caused by a large submission of small workloads, and expected to be non-issue when the computation is performed on massive graphs.
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...John Andrews
SlideShare Description for "Chatty Kathy - UNC Bootcamp Final Project Presentation"
Title: Chatty Kathy: Enhancing Physical Activity Among Older Adults
Description:
Discover how Chatty Kathy, an innovative project developed at the UNC Bootcamp, aims to tackle the challenge of low physical activity among older adults. Our AI-driven solution uses peer interaction to boost and sustain exercise levels, significantly improving health outcomes. This presentation covers our problem statement, the rationale behind Chatty Kathy, synthetic data and persona creation, model performance metrics, a visual demonstration of the project, and potential future developments. Join us for an insightful Q&A session to explore the potential of this groundbreaking project.
Project Team: Jay Requarth, Jana Avery, John Andrews, Dr. Dick Davis II, Nee Buntoum, Nam Yeongjin & Mat Nicholas
Adjusting primitives for graph : SHORT REPORT / NOTESSubhajit Sahu
Graph algorithms, like PageRank Compressed Sparse Row (CSR) is an adjacency-list based graph representation that is
Multiply with different modes (map)
1. Performance of sequential execution based vs OpenMP based vector multiply.
2. Comparing various launch configs for CUDA based vector multiply.
Sum with different storage types (reduce)
1. Performance of vector element sum using float vs bfloat16 as the storage type.
Sum with different modes (reduce)
1. Performance of sequential execution based vs OpenMP based vector element sum.
2. Performance of memcpy vs in-place based CUDA based vector element sum.
3. Comparing various launch configs for CUDA based vector element sum (memcpy).
4. Comparing various launch configs for CUDA based vector element sum (in-place).
Sum with in-place strategies of CUDA mode (reduce)
1. Comparing various launch configs for CUDA based vector element sum (in-place).
Techniques to optimize the pagerank algorithm usually fall in two categories. One is to try reducing the work per iteration, and the other is to try reducing the number of iterations. These goals are often at odds with one another. Skipping computation on vertices which have already converged has the potential to save iteration time. Skipping in-identical vertices, with the same in-links, helps reduce duplicate computations and thus could help reduce iteration time. Road networks often have chains which can be short-circuited before pagerank computation to improve performance. Final ranks of chain nodes can be easily calculated. This could reduce both the iteration time, and the number of iterations. If a graph has no dangling nodes, pagerank of each strongly connected component can be computed in topological order. This could help reduce the iteration time, no. of iterations, and also enable multi-iteration concurrency in pagerank computation. The combination of all of the above methods is the STICD algorithm. [sticd] For dynamic graphs, unchanged components whose ranks are unaffected can be skipped altogether.
8. Machine Learning
…. allows computers to find hidden insights without being
explicitly programmed where to look.
9. Real World Examples of Machine Learning
Spam Detection
Search Results +
Product Recommendation
Picture Detection
(Friends, Locations, Products)
Machine Learning is already present in daily life…
Now, every enterprise is beginning to leverage it!
The Next Disruption:
Google Beats Go Champion
46. Scenario: Predictive Scrapping of Parts in an Assembly Line
Goal: Scrap parts as early as possible automatically to reduce costs in a manufacturing process.
Question: When to scrap a part in Station 1 instead of doing re-work or sending it to Station 2?
Station 1 Station 2
Cost Before
9€
7€ 13€
Total Cost
29€
(or more)
Scrap? Scrap?
47. TIBCO Spotfire with H2O Integration
Data Discovery / Data Mining (“Are parts that repeat a station more likely scrap parts?”)
48. TIBCO Live Datamart
Operational Intelligence (“Monitor the manufacturing process and change rules in real time!”)
Live Dartmart Desktop Client
49. TIBCO Live Datamart
Operational Intelligence (“Monitor the manufacturing process and change rules in real time!”)
Live Dartmart Web API