Steve Lucas presentation on #BigData at the SAP & Intel 2013 Forum on Big Data, August 27, 2013. @nstevenlucas
Big data is changing the world. With more information available than ever before, we can gain insights to grow, produce, and advance in real-time.
Do you want to understand the emerging new data-driven jobs? This presentation discusses the emerging roles of Data Science and Data Engineering, and how they are related to Business Intelligence and Big Data. We will talk about skills and background needed for the jobs, and what education and certification is important.
Explains: What is Data Science? What is the difference between Data Science and Data Engineering, and between Data Science and Business Intelligence? What type of work do Data Scientists do, and what types of companies employ them? What is the job outlook for Data Science? What professional education is required?
Steve Lucas presentation on #BigData at the SAP & Intel 2013 Forum on Big Data, August 27, 2013. @nstevenlucas
Big data is changing the world. With more information available than ever before, we can gain insights to grow, produce, and advance in real-time.
Do you want to understand the emerging new data-driven jobs? This presentation discusses the emerging roles of Data Science and Data Engineering, and how they are related to Business Intelligence and Big Data. We will talk about skills and background needed for the jobs, and what education and certification is important.
Explains: What is Data Science? What is the difference between Data Science and Data Engineering, and between Data Science and Business Intelligence? What type of work do Data Scientists do, and what types of companies employ them? What is the job outlook for Data Science? What professional education is required?
Note: Make sure to download the slides to get the high-resolution version!
Also, you can find the webinar recording here (please also download for better quality): https://www.dropbox.com/s/72qi6wjzi61gs3q/H2ODeepLearningArnoCandel052114.mov
Come hear how Deep Learning in H2O is unlocking never before seen performance for prediction!
H2O is google-scale open source machine learning engine for R & Big Data. Enterprises can now use all of their data without sampling and build intelligent applications. This live webinar introduces Distributed Deep Learning concepts, implementation and results from recent developments. Real world classification & regression use cases from eBay text dataset, MNIST handwritten digits and Cancer datasets will present the power of this game changing technology.
- Powered by the open source machine learning software H2O.ai. Contributors welcome at: https://github.com/h2oai
- To view videos on H2O open source machine learning software, go to: https://www.youtube.com/user/0xdata
Better Search Through Query Understanding
Presented as a Data Talk at Intuit on April 22, 2014
Search is a fundamental problem of our time — we use search engines daily to satisfy a variety of personal and professional information needs. But search engine development still feels stuck in an information retrieval paradigm that focuses on result ranking. In this talk, I’ll advocate an emphasis on query understanding. I’ll talk about how we implement query understanding at LinkedIn, and I’ll present examples from the broader web. Hopefully you’ll come out with a different perspective on search and share my appreciation for how we can improve search through query understanding.
About the Speaker
Daniel Tunkelang leads LinkedIn's efforts around query understanding. Before that, he led LinkedIn's product data science team. He previously led a local search quality team at Google and was a founding employee of Endeca (acquired by Oracle in 2011). He has written a textbook on faceted search, and is a recognized advocate of human-computer interaction and information retrieval (HCIR). He has a PhD in Computer Science from CMU, as well as BS and MS degrees from MIT.
What is "deep learning" and why is it suddenly so popular? In this talk I explore how Deep Learning provides a convenient framework for expressing learning problems and using GPUs to solve them efficiently.
What does a data scientist actually do? Here at Good Rebels we wanted to outline a profile of this new profession, with the help of various industry leaders from academia, business and institutions. In short, we concluded that the main tasks of a data scientist are to identify data, transform it when incomplete, categorize it, prepare it for analysis, perform the analysis, visualize the results and communicate them.
Benefiting from Semantic AI along the data life cycleMartin Kaltenböck
Slides of 1 hour session of Martin Kaltenböck (CFO and Managing Partner of Semantic Web Company / PoolParty Software Ltd) on 19 March 2019 in Boston, US at the Enterprise Data World 2019, with its title: Benefiting from Semantic AI along the data life cycle.
As 2017 begins, we are seeing big data and data science communities engage with new tools that specifically cater to data scientists and data engineers who aren’t necessarily experts in these techniques. Given rapid technological advances, the question for companies now is how to integrate new data science capabilities into their operations and strategies—and position themselves in a world where analytics can upend entire industries. Leading companies are using their data science capabilities not only to improve their core operations but also to launch entirely new business models.
Why is Data Science a Popular Career Choice.pdfUSDSI
Do you want to become the backbone of big corporates and giant business groups around the world? Beginning your career trajectory by grabbing the perfect spot in the data science certification courses provided around the world. The US Bureau of Labor Statistics projects 35.8% employment growth for data scientists till 2031, over a decade period beginning 2021. The growing use of machine learning and artificial intelligence technologies is another factor driving the demand for professionals skilled in data science.
Brimming with humungous career opportunities, the data science industry is set in motion to yield multitudinous growth opportunities across diversified industries worldwide. By automating procedures, increasing effectiveness, and allowing predictive capabilities, Artificial intelligence and machine learning algorithms hold the ability to change the entire landscape.
Data Science has become a fascinating career choice that calls for working closely with cutting-edge technology and addressing challenges. If you are someone who wishes to work with humungous data, has a passion for numbers, and has a clear vision of setting their career on a thriving path; data science is the right pick for you!
A diversified array of organizations is actively looking for data-hungry professionals who are coarsely skilled at data science to analyze data and churn out business decisions for the greater good of the company. Today is the ripe time to get started with a data science career, that promises an elevated trajectory and nothing else.
With the rise of technological innovations and industrial evolution, massive datasets become unmanageable. The future of such a massive explosion of data calls for an urgent appointment of qualified data scientists; enabling bigger business moves. This is where getting certified in the field makes sense.
Without wasting any further time, it is an advisable move to get certified in key data science skills that are sure to rage in the industry worldwide. Begin with the most trusted names in the data science certifications providers industry today!
https://www.usdsi.org/data-science-insights/why-is-data-science-a-popular-career-choice
Global Advanced Management Program
All India Management Association
Program Director: Professor Solomon Darwin, UC Berkeley
Expanding Markets by Leveraging Emerging Technologies
Agenda: June 25 – July 01, 2023
Note: Make sure to download the slides to get the high-resolution version!
Also, you can find the webinar recording here (please also download for better quality): https://www.dropbox.com/s/72qi6wjzi61gs3q/H2ODeepLearningArnoCandel052114.mov
Come hear how Deep Learning in H2O is unlocking never before seen performance for prediction!
H2O is google-scale open source machine learning engine for R & Big Data. Enterprises can now use all of their data without sampling and build intelligent applications. This live webinar introduces Distributed Deep Learning concepts, implementation and results from recent developments. Real world classification & regression use cases from eBay text dataset, MNIST handwritten digits and Cancer datasets will present the power of this game changing technology.
- Powered by the open source machine learning software H2O.ai. Contributors welcome at: https://github.com/h2oai
- To view videos on H2O open source machine learning software, go to: https://www.youtube.com/user/0xdata
Better Search Through Query Understanding
Presented as a Data Talk at Intuit on April 22, 2014
Search is a fundamental problem of our time — we use search engines daily to satisfy a variety of personal and professional information needs. But search engine development still feels stuck in an information retrieval paradigm that focuses on result ranking. In this talk, I’ll advocate an emphasis on query understanding. I’ll talk about how we implement query understanding at LinkedIn, and I’ll present examples from the broader web. Hopefully you’ll come out with a different perspective on search and share my appreciation for how we can improve search through query understanding.
About the Speaker
Daniel Tunkelang leads LinkedIn's efforts around query understanding. Before that, he led LinkedIn's product data science team. He previously led a local search quality team at Google and was a founding employee of Endeca (acquired by Oracle in 2011). He has written a textbook on faceted search, and is a recognized advocate of human-computer interaction and information retrieval (HCIR). He has a PhD in Computer Science from CMU, as well as BS and MS degrees from MIT.
What is "deep learning" and why is it suddenly so popular? In this talk I explore how Deep Learning provides a convenient framework for expressing learning problems and using GPUs to solve them efficiently.
What does a data scientist actually do? Here at Good Rebels we wanted to outline a profile of this new profession, with the help of various industry leaders from academia, business and institutions. In short, we concluded that the main tasks of a data scientist are to identify data, transform it when incomplete, categorize it, prepare it for analysis, perform the analysis, visualize the results and communicate them.
Benefiting from Semantic AI along the data life cycleMartin Kaltenböck
Slides of 1 hour session of Martin Kaltenböck (CFO and Managing Partner of Semantic Web Company / PoolParty Software Ltd) on 19 March 2019 in Boston, US at the Enterprise Data World 2019, with its title: Benefiting from Semantic AI along the data life cycle.
As 2017 begins, we are seeing big data and data science communities engage with new tools that specifically cater to data scientists and data engineers who aren’t necessarily experts in these techniques. Given rapid technological advances, the question for companies now is how to integrate new data science capabilities into their operations and strategies—and position themselves in a world where analytics can upend entire industries. Leading companies are using their data science capabilities not only to improve their core operations but also to launch entirely new business models.
Why is Data Science a Popular Career Choice.pdfUSDSI
Do you want to become the backbone of big corporates and giant business groups around the world? Beginning your career trajectory by grabbing the perfect spot in the data science certification courses provided around the world. The US Bureau of Labor Statistics projects 35.8% employment growth for data scientists till 2031, over a decade period beginning 2021. The growing use of machine learning and artificial intelligence technologies is another factor driving the demand for professionals skilled in data science.
Brimming with humungous career opportunities, the data science industry is set in motion to yield multitudinous growth opportunities across diversified industries worldwide. By automating procedures, increasing effectiveness, and allowing predictive capabilities, Artificial intelligence and machine learning algorithms hold the ability to change the entire landscape.
Data Science has become a fascinating career choice that calls for working closely with cutting-edge technology and addressing challenges. If you are someone who wishes to work with humungous data, has a passion for numbers, and has a clear vision of setting their career on a thriving path; data science is the right pick for you!
A diversified array of organizations is actively looking for data-hungry professionals who are coarsely skilled at data science to analyze data and churn out business decisions for the greater good of the company. Today is the ripe time to get started with a data science career, that promises an elevated trajectory and nothing else.
With the rise of technological innovations and industrial evolution, massive datasets become unmanageable. The future of such a massive explosion of data calls for an urgent appointment of qualified data scientists; enabling bigger business moves. This is where getting certified in the field makes sense.
Without wasting any further time, it is an advisable move to get certified in key data science skills that are sure to rage in the industry worldwide. Begin with the most trusted names in the data science certifications providers industry today!
https://www.usdsi.org/data-science-insights/why-is-data-science-a-popular-career-choice
Global Advanced Management Program
All India Management Association
Program Director: Professor Solomon Darwin, UC Berkeley
Expanding Markets by Leveraging Emerging Technologies
Agenda: June 25 – July 01, 2023
A number of recent milestones in AI have rekindled the faith that human-grade computer intelligence can fuel the next technological revolution. In parallel and almost independently, the job role of Data Scientist rose to one of the hottest tickets in the technology sector. Despite the obvious overlap in the domains of Data Science and Artificial Intelligence, the two approaches are sufficiently distinct that choosing the wrong one might trigger a product to fail or a hiring process to go wrong. This presentation will offer some clarity and best practices with regards to understanding what data analysis requirements you really have, as what opposed to what you think you have.
Presentación Ciro Cattuto, ISI Foundation en VI Summit País Digital 2018PAÍS DIGITAL
Exposición “ Data Science for private and public good” de Ciro Cattuto, Scientific Director, ISI Foundation, en el marco del VI Summit País Digital 2018, realizado el 4 y 5 de septiembre en Santiago, Chile.
Vikas Arora - Evolution of Search - Nottingham Digital SummitHallam
Machine Learning and Artificial Intelligence – taking a look changes taking place in the digital space, and what marketers can do to stay ahead of the curve. We will explore what AI means , what opportunities it presents, and how can marketers benefit from this today as well as prepare for the changes that lie ahead. You will learn more about the impact of machine learning and artificial intelligence on Search, and - Microsoft's definition of AI and how marketers can take advantage of the tools today.
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfGetInData
Recently we have observed the rise of open-source Large Language Models (LLMs) that are community-driven or developed by the AI market leaders, such as Meta (Llama3), Databricks (DBRX) and Snowflake (Arctic). On the other hand, there is a growth in interest in specialized, carefully fine-tuned yet relatively small models that can efficiently assist programmers in day-to-day tasks. Finally, Retrieval-Augmented Generation (RAG) architectures have gained a lot of traction as the preferred approach for LLMs context and prompt augmentation for building conversational SQL data copilots, code copilots and chatbots.
In this presentation, we will show how we built upon these three concepts a robust Data Copilot that can help to democratize access to company data assets and boost performance of everyone working with data platforms.
Why do we need yet another (open-source ) Copilot?
How can we build one?
Architecture and evaluation
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Discussion on Vector Databases, Unstructured Data and AI
https://www.meetup.com/unstructured-data-meetup-new-york/
This meetup is for people working in unstructured data. Speakers will come present about related topics such as vector databases, LLMs, and managing data at scale. The intended audience of this group includes roles like machine learning engineers, data scientists, data engineers, software engineers, and PMs.This meetup was formerly Milvus Meetup, and is sponsored by Zilliz maintainers of Milvus.
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdfEnterprise Wired
In this guide, we'll explore the key considerations and features to look for when choosing a Trusted analytics platform that meets your organization's needs and delivers actionable intelligence you can trust.
The Building Blocks of QuestDB, a Time Series Databasejavier ramirez
Talk Delivered at Valencia Codes Meetup 2024-06.
Traditionally, databases have treated timestamps just as another data type. However, when performing real-time analytics, timestamps should be first class citizens and we need rich time semantics to get the most out of our data. We also need to deal with ever growing datasets while keeping performant, which is as fun as it sounds.
It is no wonder time-series databases are now more popular than ever before. Join me in this session to learn about the internal architecture and building blocks of QuestDB, an open source time-series database designed for speed. We will also review a history of some of the changes we have gone over the past two years to deal with late and unordered data, non-blocking writes, read-replicas, or faster batch ingestion.
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeWalaa Eldin Moustafa
Dynamic policy enforcement is becoming an increasingly important topic in today’s world where data privacy and compliance is a top priority for companies, individuals, and regulators alike. In these slides, we discuss how LinkedIn implements a powerful dynamic policy enforcement engine, called ViewShift, and integrates it within its data lake. We show the query engine architecture and how catalog implementations can automatically route table resolutions to compliance-enforcing SQL views. Such views have a set of very interesting properties: (1) They are auto-generated from declarative data annotations. (2) They respect user-level consent and preferences (3) They are context-aware, encoding a different set of transformations for different use cases (4) They are portable; while the SQL logic is only implemented in one SQL dialect, it is accessible in all engines.
#SQL #Views #Privacy #Compliance #DataLake
Analysis insight about a Flyball dog competition team's performanceroli9797
Insight of my analysis about a Flyball dog competition team's last year performance. Find more: https://github.com/rolandnagy-ds/flyball_race_analysis/tree/main
State of Artificial intelligence Report 2023kuntobimo2016
Artificial intelligence (AI) is a multidisciplinary field of science and engineering whose goal is to create intelligent machines.
We believe that AI will be a force multiplier on technological progress in our increasingly digital, data-driven world. This is because everything around us today, ranging from culture to consumer products, is a product of intelligence.
The State of AI Report is now in its sixth year. Consider this report as a compilation of the most interesting things we’ve seen with a goal of triggering an informed conversation about the state of AI and its implication for the future.
We consider the following key dimensions in our report:
Research: Technology breakthroughs and their capabilities.
Industry: Areas of commercial application for AI and its business impact.
Politics: Regulation of AI, its economic implications and the evolving geopolitics of AI.
Safety: Identifying and mitigating catastrophic risks that highly-capable future AI systems could pose to us.
Predictions: What we believe will happen in the next 12 months and a 2022 performance review to keep us honest.
Learn SQL from basic queries to Advance queriesmanishkhaire30
Dive into the world of data analysis with our comprehensive guide on mastering SQL! This presentation offers a practical approach to learning SQL, focusing on real-world applications and hands-on practice. Whether you're a beginner or looking to sharpen your skills, this guide provides the tools you need to extract, analyze, and interpret data effectively.
Key Highlights:
Foundations of SQL: Understand the basics of SQL, including data retrieval, filtering, and aggregation.
Advanced Queries: Learn to craft complex queries to uncover deep insights from your data.
Data Trends and Patterns: Discover how to identify and interpret trends and patterns in your datasets.
Practical Examples: Follow step-by-step examples to apply SQL techniques in real-world scenarios.
Actionable Insights: Gain the skills to derive actionable insights that drive informed decision-making.
Join us on this journey to enhance your data analysis capabilities and unlock the full potential of SQL. Perfect for data enthusiasts, analysts, and anyone eager to harness the power of data!
#DataAnalysis #SQL #LearningSQL #DataInsights #DataScience #Analytics
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Round table discussion of vector databases, unstructured data, ai, big data, real-time, robots and Milvus.
A lively discussion with NJ Gen AI Meetup Lead, Prasad and Procure.FYI's Co-Found
13. In fact, a McKinsey Global Institute report
estimates that by 2018, “the United States
alone could face a shortage of 140,000 to
190,000 people with deep analytical skills as
well as 1.5 million managers and analysts
with the know-how to use the analysis of
big data to make effective decisions.”
!
!
!
14. Between 2010 and 2020, the data
scientist career path is projected to
increase by 18.7 percent, beat only by
video game designers. The big data
industry is expected to be a 53.4 billion
industry by 2016.
15. Anyone with "data science" in his or
her job title on a LinkedIn page is
going to get "100 recruiter emails a
day," said Josh Sullivan, who leads a
500-person data-science group at the
consulting firm Booz Allen Hamilton
Holding
17. First Competition:
Forecast Eurovision Song Contest Voting
!
!
- 1000 dollars prize
- 22 teams
Outperformed prediction markets:
predict 7 countries from Top10, prediction markets
only 5.
18. Short story of success
- 2011 - relocated to San Francisco
- November, 2011 - raise 11M dollars fundings
- July, 2013 - 100,000 data scientists involved
- February, 2014 - more than 140,000 data
scientists
22. Competitions for knowledge
(always open)
!
- Digit recognizer, CIFAR-10, First steps with Julia
- Titanic: Machine Learning for Disaster
- Bike Sharing Demand
- Learning Social Circles in Networks
23. Competitions with prize:
Open:
- American Epilepsy Society Seizure Prediction
Challenge: 25, 000 prize
- Africa Soil Property Prediction Challenge: 8,000 prize
- Tradeshift Text Classification: 5,000 prize
24. Completed competitions (170+)
- Heritage Health Price: 500,000
- GE Flight Quest: 250,000
- GE Hospital Quest: 100,000
- Higgs Boson ML Challenge: 13,000 + invitation to
CERN
- Galaxy Zoo: 16,000
- KDD Author Paper Identification Challenge
- Job Recommendation Challenge
25. Job competitions (completed):
Facebook:
- recommend missing links in social graph (who to follow)
- optimal graph path
- predict text tags
Yelp:
- estimate the number of useful votes a review will receive
Wallmart:
- predict store sales
+ Job Board