Data analysis with pandas and scikit-learnGlib Kechyn
Definition and basic features of data analysis with python, pandas and scikit-learn. Brief explanation about most powerful features. Introduction part.
Data analysis with pandas and scikit-learnGlib Kechyn
Definition and basic features of data analysis with python, pandas and scikit-learn. Brief explanation about most powerful features. Introduction part.
Right now in institutions around the world, some of the greatest minds in computer science and statistics are coming up with amazing new algorithms and mathematically beautiful solutions. However it's entirely possible that the solutions they conceive will be impracticable in industry. The reason is simple; "the best answer is useless if it arrives too late to do anything with it". The key principle here is the compromise between 'accuracy' and 'latency'. In this talk I will describe examples where this holds true, and how I am using real-time machine learning models to solve challenges in eCommerce, Financial Services and Media companies.
http://tumra.com/blog/real-time-machine-learning-at-industrial-scale
Machine learning, or predictive analytics have started entering into our daily life. Businesses and enterprises could use predictive analytics to improve efficiency, improve user experience, as well as to create new business opportunities. This talk will present WSO2 Machine Learner, our experiences of predicting Super Bowl winners, and few real life use cases. Furthermore, talk will discuss open challenges and problems people are working on.
Building Real-Time Data Pipeline for Diabetes Medication Recommender System U...Databricks
American Diabetic association states that 29.1 million Americans and 300+ million people all over the world have diabetes. Diabetic medication management is always challenging. Based on doctor’s prescription, patients take insulin dosage one hour before breakfast, lunch or dinner. But the real world scenario insulin intake can be changed based on the blood glucose level, calorie intake on a specific day, etc.
This talk explains how a real-time Big Data pipeline recommendation engine can be used to suggest insulin intake for diabetic patients in near real time. Based on calorie intake and blood glucose level from patients as well as generated dataset, insulin dosage can be recommended which will help patients to avoid over/under dosage. Designing medication recommender system is a need for the Healthcare industry. There is a growing trend for the applications to help doctors by recommending medication based on patient’s historic data. This also helps facilitate a doctor friendly and hospital free atmosphere for all users all over the world.
This talk would delve into a Diabetes medication recommender system using Databricks and Spark. Databricks supports HIPAA compliant deployment for processing PHI data. This talk would cover building a secure pipeline with encrypted data and the end-to-end recommendation system using Structured streaming and IoT data flowing from sensor.
This talk presents you how three scala libraries - Smile, Saddle and Spark ML - satisfy requirements of new Big Data Science projects. Let's see it on example of click-through rate prediction.
Big data, Machine learning and the AuditorBharath Rao
Check an insight as to how an Auditor can leverage Analytics, machine learning, and Technology to achieve absolute assurance and to effectively control the Fraud Risk present in the Enterprise.
Amazon Neptune is a service that allows you to use graph structures and nodes to visualize stored data in an accessible way. You can find more in our blog entry: https://tinyurl.com/y623ff5j
All the sources are linked in the presentation.
Enjoy and don't forget to check out our blog and other social media!
LCloud Blog https://bit.ly/2Vgooz4
Facebook https://bit.ly/2tCqBJS
Twitter https://twitter.com/LCLOUD16
LinkedIn https://bit.ly/2syaQCr
YouTube https://bit.ly/2tGV62b
Questions? Feel free to ask:
kontakt@lcloud.pl
https://lcloud.pl/
Big Data has been around long enough that there are some common issues that occur whenever an organization tries to implement and integrate it into their ecosystem. This presentation covers some of those pitfalls, which also impact traditional data warehouses/business intelligence ecosystems
Frontiers in Alternative Data : Techniques and Use CasesQuantUniversity
QuantUniversity Summer School 2020 (https://qusummerschool.splashthat.com/)
https://quspeakerseries10.splashthat.com/
Lecture 1: Alexander Denev
In this talk, Alexander will introduce Alternative Data and discuss it's uses from his book, The Book of Alternative Data
- What is alternative data?
- Adoption of alternative data
- Information value chain
- Risks associated with alternative data
- Processes required to develop signals
- Valuation of alternative data
Lecture 2: Saeed Amen
In this talk, Saeed will discuss use cases in Alternative Data
-Deciphering Federal Reserve communications
- Using CLS flow data to trade FX
- Geospatial Insight satellite data to estimate retailers' EPS
- Saving "alpha" with transaction cost analysis
- Using Bloomberg News data to trade FX
Right now in institutions around the world, some of the greatest minds in computer science and statistics are coming up with amazing new algorithms and mathematically beautiful solutions. However it's entirely possible that the solutions they conceive will be impracticable in industry. The reason is simple; "the best answer is useless if it arrives too late to do anything with it". The key principle here is the compromise between 'accuracy' and 'latency'. In this talk I will describe examples where this holds true, and how I am using real-time machine learning models to solve challenges in eCommerce, Financial Services and Media companies.
http://tumra.com/blog/real-time-machine-learning-at-industrial-scale
Machine learning, or predictive analytics have started entering into our daily life. Businesses and enterprises could use predictive analytics to improve efficiency, improve user experience, as well as to create new business opportunities. This talk will present WSO2 Machine Learner, our experiences of predicting Super Bowl winners, and few real life use cases. Furthermore, talk will discuss open challenges and problems people are working on.
Building Real-Time Data Pipeline for Diabetes Medication Recommender System U...Databricks
American Diabetic association states that 29.1 million Americans and 300+ million people all over the world have diabetes. Diabetic medication management is always challenging. Based on doctor’s prescription, patients take insulin dosage one hour before breakfast, lunch or dinner. But the real world scenario insulin intake can be changed based on the blood glucose level, calorie intake on a specific day, etc.
This talk explains how a real-time Big Data pipeline recommendation engine can be used to suggest insulin intake for diabetic patients in near real time. Based on calorie intake and blood glucose level from patients as well as generated dataset, insulin dosage can be recommended which will help patients to avoid over/under dosage. Designing medication recommender system is a need for the Healthcare industry. There is a growing trend for the applications to help doctors by recommending medication based on patient’s historic data. This also helps facilitate a doctor friendly and hospital free atmosphere for all users all over the world.
This talk would delve into a Diabetes medication recommender system using Databricks and Spark. Databricks supports HIPAA compliant deployment for processing PHI data. This talk would cover building a secure pipeline with encrypted data and the end-to-end recommendation system using Structured streaming and IoT data flowing from sensor.
This talk presents you how three scala libraries - Smile, Saddle and Spark ML - satisfy requirements of new Big Data Science projects. Let's see it on example of click-through rate prediction.
Big data, Machine learning and the AuditorBharath Rao
Check an insight as to how an Auditor can leverage Analytics, machine learning, and Technology to achieve absolute assurance and to effectively control the Fraud Risk present in the Enterprise.
Amazon Neptune is a service that allows you to use graph structures and nodes to visualize stored data in an accessible way. You can find more in our blog entry: https://tinyurl.com/y623ff5j
All the sources are linked in the presentation.
Enjoy and don't forget to check out our blog and other social media!
LCloud Blog https://bit.ly/2Vgooz4
Facebook https://bit.ly/2tCqBJS
Twitter https://twitter.com/LCLOUD16
LinkedIn https://bit.ly/2syaQCr
YouTube https://bit.ly/2tGV62b
Questions? Feel free to ask:
kontakt@lcloud.pl
https://lcloud.pl/
Big Data has been around long enough that there are some common issues that occur whenever an organization tries to implement and integrate it into their ecosystem. This presentation covers some of those pitfalls, which also impact traditional data warehouses/business intelligence ecosystems
Frontiers in Alternative Data : Techniques and Use CasesQuantUniversity
QuantUniversity Summer School 2020 (https://qusummerschool.splashthat.com/)
https://quspeakerseries10.splashthat.com/
Lecture 1: Alexander Denev
In this talk, Alexander will introduce Alternative Data and discuss it's uses from his book, The Book of Alternative Data
- What is alternative data?
- Adoption of alternative data
- Information value chain
- Risks associated with alternative data
- Processes required to develop signals
- Valuation of alternative data
Lecture 2: Saeed Amen
In this talk, Saeed will discuss use cases in Alternative Data
-Deciphering Federal Reserve communications
- Using CLS flow data to trade FX
- Geospatial Insight satellite data to estimate retailers' EPS
- Saving "alpha" with transaction cost analysis
- Using Bloomberg News data to trade FX
Emerging Prevalence of Data Streaming in Analytics and it's Business Signific...Amazon Web Services
Learning Objectives:
- Get an overview of streaming data and it's application in analytics and big data.
- Understand the factors driving the accelerating transformation of batch processing to real-time.
- Learn how you should plan for incorporating data streaming in your analytics and processing workloads.
Business can now easily perform real-time analytics on data that has been traditionally analyzed using batch processing in data warehouses or using Hadoop frameworks, and react to new information in minutes or seconds instead of hours or days. In this webinar, Forrester analyst Mike Gualtieri and Amazon Kinesis GM Roger Barga will discuss this prevalent trend, it's business significance, and how you should plan for it. You will also learn about the AWS services that can help you get started quickly with real-time, streaming applications fore your analytics and big data workloads.
What is going on? Application Diagnostics on Azure - Copenhagen .NET User GroupMaarten Balliauw
We all like building and deploying cloud applications. But what happens once that’s done? How do we know if our application behaves like we expect it to behave? Of course, logging! But how do we get that data off of our machines? How do we sift through a bunch of seemingly meaningless diagnostics? In this session, we’ll look at how we can keep track of our Azure application using structured logging, AppInsights and AppInsights analytics to make all that data more meaningful.
Big Data Serving with Vespa - Jon Bratseth, Distinguished Architect, OathYahoo Developer Network
Offline and stream processing of big data sets can be done with tools such as Hadoop, Spark, and Storm, but what if you need to process big data at the time a user is making a request? Vespa (http://www.vespa.ai) allows you to search, organize and evaluate machine-learned models from e.g TensorFlow over large, evolving data sets with latencies in the tens of milliseconds. Vespa is behind the recommendation, ad targeting, and search at Yahoo where it handles billions of daily queries over billions of documents.
Take Action: The New Reality of Data-Driven BusinessInside Analysis
The Briefing Room with Dr. Robin Bloor and WebAction
Live Webcast on July 23, 2014
Watch the archive:
https://bloorgroup.webex.com/bloorgroup/lsr.php?RCID=360d371d3a49ad256942f55350aa0a8b
The waiting used to be the hardest part, but not anymore. Today’s cutting-edge enterprises can seize opportunities faster than ever, thanks to an array of technologies that enable real-time responsiveness across the spectrum of business processes. Early adopters are solving critical business challenges by enabling the rapid-fire design, development and production of very specific applications. Functionality can range from improved customer engagement to dynamic machine-to-machine interactions.
Register for this episode of The Briefing Room to learn from veteran Analyst Dr. Robin Bloor, who will tout a new era in data-driven organizations, and why a data flow architecture will soon be critical for industry leaders. He’ll be briefed by Sami Akbay of WebAction, who will showcase his company’s real-time data management platform, which combines all the component parts needed to access, process and leverage data big and small. He’ll explain how this new approach can provide game-changing power to organizations of all types and sizes.
Visit InsideAnlaysis.com for more information.
Introduction to Vespa – The Open Source Big Data Serving Engine, Jon Bratseth...Yahoo Developer Network
Offline and stream processing of big data sets can be done with tools such as Hadoop, Spark, and Storm, but what if you need to process big data at the time a user is making a request?
This presentation introduces Vespa (http://vespa.ai) – the open source big data serving engine.
Vespa allows you to search, organize, and evaluate machine-learned models from e.g TensorFlow over large, evolving data sets with latencies in the tens of milliseconds. Vespa is behind the recommendation, ad targeting, and search at Yahoo where it handles billions of daily queries over billions of documents and was recently open sourced at http://vespa.ai.
The Internet of Analytics – Discovering actionable insights from high-velocity streams of real-time IoT data
IoT devices generate high volume, continuous streams of data that must be analyzed in-memory – before they land on disk – to identify potential outliers/failures or business opportunities. Companies need to build robust yet flexible applications that can instantly act on the information derived from analyzing their IoT data. Attend this session to learn how you can easily handle real-time data acquisition across structured and semi-structured data, as well as windowing, fast in-memory streaming analytics, event correlation, visualization, alerts, workflows and smart data storage.
WebAction Founder and EVP, Sami Akbay
In these slides, we explore the unique challenges that mobile data present. The high cardinality, low signal to noise ratio and realtime needs have significant system implications. We outline how InMobi tackles these challenges. A specific Data Science use case is also presented. We outline our approach to user segmentation. A brief description of the challenges faced and our attempts to address them is also included.
Data Lakes are early in the Gartner hype cycle, but companies are getting value from their cloud-based data lake deployments. Break through the confusion between data lakes and data warehouses and seek out the most appropriate use cases for your big data lakes.
Gain New Insights by Analyzing Machine Logs using Machine Data Analytics and BigInsights.
Half of Fortune 500 companies experience more than 80 hours of system down time annually. Spread evenly over a year, that amounts to approximately 13 minutes every day. As a consumer, the thought of online bank operations being inaccessible so frequently is disturbing. As a business owner, when systems go down, all processes come to a stop. Work in progress is destroyed and failure to meet SLA’s and contractual obligations can result in expensive fees, adverse publicity, and loss of current and potential future customers. Ultimately the inability to provide a reliable and stable system results in loss of $$$’s. While the failure of these systems is inevitable, the ability to timely predict failures and intercept them before they occur is now a requirement.
A possible solution to the problem can be found is in the huge volumes of diagnostic big data generated at hardware, firmware, middleware, application, storage and management layers indicating failures or errors. Machine analysis and understanding of this data is becoming an important part of debugging, performance analysis, root cause analysis and business analysis. In addition to preventing outages, machine data analysis can also provide insights for fraud detection, customer retention and other important use cases.
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
Accelerate your Kubernetes clusters with Varnish CachingThijs Feryn
A presentation about the usage and availability of Varnish on Kubernetes. This talk explores the capabilities of Varnish caching and shows how to use the Varnish Helm chart to deploy it to Kubernetes.
This presentation was delivered at K8SUG Singapore. See https://feryn.eu/presentations/accelerate-your-kubernetes-clusters-with-varnish-caching-k8sug-singapore-28-2024 for more details.
Transcript: Selling digital books in 2024: Insights from industry leaders - T...BookNet Canada
The publishing industry has been selling digital audiobooks and ebooks for over a decade and has found its groove. What’s changed? What has stayed the same? Where do we go from here? Join a group of leading sales peers from across the industry for a conversation about the lessons learned since the popularization of digital books, best practices, digital book supply chain management, and more.
Link to video recording: https://bnctechforum.ca/sessions/selling-digital-books-in-2024-insights-from-industry-leaders/
Presented by BookNet Canada on May 28, 2024, with support from the Department of Canadian Heritage.
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualityInflectra
In this insightful webinar, Inflectra explores how artificial intelligence (AI) is transforming software development and testing. Discover how AI-powered tools are revolutionizing every stage of the software development lifecycle (SDLC), from design and prototyping to testing, deployment, and monitoring.
Learn about:
• The Future of Testing: How AI is shifting testing towards verification, analysis, and higher-level skills, while reducing repetitive tasks.
• Test Automation: How AI-powered test case generation, optimization, and self-healing tests are making testing more efficient and effective.
• Visual Testing: Explore the emerging capabilities of AI in visual testing and how it's set to revolutionize UI verification.
• Inflectra's AI Solutions: See demonstrations of Inflectra's cutting-edge AI tools like the ChatGPT plugin and Azure Open AI platform, designed to streamline your testing process.
Whether you're a developer, tester, or QA professional, this webinar will give you valuable insights into how AI is shaping the future of software delivery.
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
Key Trends Shaping the Future of Infrastructure.pdfCheryl Hung
Keynote at DIGIT West Expo, Glasgow on 29 May 2024.
Cheryl Hung, ochery.com
Sr Director, Infrastructure Ecosystem, Arm.
The key trends across hardware, cloud and open-source; exploring how these areas are likely to mature and develop over the short and long-term, and then considering how organisations can position themselves to adapt and thrive.
UiPath Test Automation using UiPath Test Suite series, part 3DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 3. In this session, we will cover desktop automation along with UI automation.
Topics covered:
UI automation Introduction,
UI automation Sample
Desktop automation flow
Pradeep Chinnala, Senior Consultant Automation Developer @WonderBotz and UiPath MVP
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Tobias Schneck
As AI technology is pushing into IT I was wondering myself, as an “infrastructure container kubernetes guy”, how get this fancy AI technology get managed from an infrastructure operational view? Is it possible to apply our lovely cloud native principals as well? What benefit’s both technologies could bring to each other?
Let me take this questions and provide you a short journey through existing deployment models and use cases for AI software. On practical examples, we discuss what cloud/on-premise strategy we may need for applying it to our own infrastructure to get it to work from an enterprise perspective. I want to give an overview about infrastructure requirements and technologies, what could be beneficial or limiting your AI use cases in an enterprise environment. An interactive Demo will give you some insides, what approaches I got already working for real.
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Internet of Things Chicago - Meetup
1. Capturing & Analyzing
High Velocity High Volume
Machine Data
December 3, 2013
Jason Lobel
CEO
@jasonlobel
2. Internet of Endpoints
“THINGS” (IOT)
Everything (IOE)
Data & Machines
Data is
Primarily sensor-based
50B
12.5B
Machine readable (API)
Accessible on-demand
Possibly even open (Public)
Includes non-machine generated data or
streaming data (catalogs, locations,
historical data, etc.)
4. Capturing Streaming Data – Considerations
Smart storage / backend setup is a key catalyst for downstream analysis
Backend Architecture
NoSQL datastore
Why Important
Long-term scale with data volume
High availability
Ideal for unpredictable demand
No joins for queries in reporting
Auto scaling cloud hosting
(AppEngine, AWS)
Spend less time on server tuning
Enable REST APIs
Enable JavaScript & mobile applications
Writeable and Retrievable
Real-time data
JSON over XML
Power dashboards or visualizations
APIs for history, real-time, query (SQL), Tracking/ How is data consumed
and even predictive
Unify with other sources
OAuth2.0 Security
API management
Multi-party (internet/external) access
Dedicated caching
Faster data retrieval speed
5. APIs Fuel Any Channel & Big Data Analytics
Public vs. Private: Estimate 10x more private APIs
Open: Gartner predicts 75% of the Fortune 500 are predicted to have open APIs by 2014
Competition: By 2015, APIs will be default, like websites in 2000 (Kin Lane, ex White House Fellow)
Growth In Public APIs
9. Make Apps Smarter with Machine Learning
Recommendation:
Analyzes users' preferences and finds items users might like
Frequent Pattern Mining:
Discovers unique frequently co-occurring items in a transaction list
Classification:
Learns from existing categorized data
and assigns a category to
uncategorized data
Clustering:
Organizes items from a large volume of data into groups of similar items
and features
10. Machine Learning Algorithm APIs?
Hard
Eas{ier}
Human
Human
Finding a data scientist
Finding an engineer that can use an API
Training (if needed)
Technical
Database selection
Algorithm(s) selection
Model training & iteration
Embedding predictions into applications
Security
Query speed / caching
Scaling
On-Demand Access
Technical
11. Common ML Applications for Retail
Item Recommendation: observes what the user likes and finds similar items
(“I like the Chicago Bulls, I may like the Chicago Bears”)
User Recommendation: recommend items finding similar users and sees what
they like (e.g., Kin and I are friends. He likes IPAs. I may like IPAs)
else is Y user likely to want based on
Item/Action Affinity: if X user wants X, what
the relationship between X and Y (men who buy diapers, also buy beer)
Predict Inventory: based on history, predict future sales (next 7, 30 days, etc.)
Discover Customer Segments: examine purchasing habits to identify clusters of
shopper segments
Prevent Fraud: identify anomalies in cashier activity, such as voids (is this likely
fraud? yes/no)
12. What We Do with Streaming Data
Focus = at least one massive data source can be transformed into many
insights that were not possible before at a fraction of the cost of legacy tools
Supermarkets: point-of-sale data, product catalog, sensors, etc.
eCommerce: web behavior, point-of-sale data, product catalog, etc.
Supermarket / C-Store
Before SwiftIQ
Unable to store POS order and cashier history
After SwiftIQ
Detailed transaction history available on-demand
Able to pursue real-time supply chain initiatives
Now can analyze product affinity to plan merchandising
strategies, promotions and optimize localization
Capable of visualizing data or generating interactive reports
Able to better predict inventory requirements
Better optimize hiring
Identify cashier fraud
Retail/eCommerce
Before SwiftIQ
Unable to unify disparate data (POS, web, mobile, CRM)
Unlikely to store web behavior
After SwiftIQ
Enable relevant, personalized digital experiences
Know specific customer segments vs. using intuition
Analyze product affinity to plan merchandising strategies,
promotions and optimize localization
Capable of visualizing data or generating interactive reports
Able to better predict inventory requirements