Find Your Passion and Make a Difference in Your Career
Albert Y. C. Chen
20180314 at National Taiwan Normal University.
Reflection on my own career, from being inspired to work on CV/ML research during my graduate studies at NTNU, to going abroad to obtain my Ph.D., and later my career in this field. The talk emphasizes the importance of innovation and how to realize one's new ideas within large and small organizations.
Think different, in Finance. An outsider's two cents on how finance majors could rethink their role and value in the rapidly changing AI era, with some FinTech case studies.
AI gold rush, tool vendors and the next big thing
2017/12/27 at Mediatek
- Overview of booming AI applications, from media, entertainment, e-commerce, autonomous driving, surveillance, industrial inspection, medical imaging, bioinformatics, finance, etc., along with expert predictions of their market size and growth.
- Dissect the applications with largest size and growth into their technical components and their unmet demands.
- Among all the unmet demands and uncertainties in this AI gold rush, what should an IC design company do? I’ll briefly cover NVIDIA’s case, which most of us know well already, then supplement case studies of Qualcomm, Intel, Google TPU and other smaller firms.
Even when we have a clear target, it takes years for supporting libraries and software to be properly optimized. I’ll share some thoughts and personal experiences on how to make sequentially-ordered hardware/software/library optimization happen faster and in parallel, and the tools that the IC design house needs to provide in order for it to happen.
Machine Learning Foundations for Professional Managers
Albert Y. C. Chen
20180804@Taiwan AI Academy, Hsinchu
6-hour lecture for those new to machine learning, to grasp the concepts, advantages, and limitations of various classical machine learning methods. More importantly, to learn the skills to break down large, complicated AI projects into manageable pieces, where features and functionality can be added incrementally and annotated data accumulated. Take-home message: machine learning is always a delicate balance between model complexity M and the number of data points N, so that the trained classifier generalizes well and does not overfit.
Tips for would-be founders, technical or non-technical, before rolling up their sleeves and developing their products! Covers various ways of "pretotyping" to accurately gauge target customers' response, the lean method, minimum viable products, feature selection, planning a product with a robust data cycle, coping with delays, and guiding a team of rockstar engineers to build the right product and build the product right. Some personal experiences are shared at the end as case studies.
Jane Hsu is a professor and department chair of Computer Science and Information Engineering at National Taiwan University. Her research interests include multi-agent systems, intelligent data analysis, commonsense knowledge, and context-aware computing. Prof. Hsu is the director of the Intel-NTU Connected Context Computing Center, featuring global research collaboration among NTU, Intel, and the National Science Council of Taiwan. She serves on the editorial board of Journal of Information Science and Engineering (2010-), International Journal of Service Oriented Computing and Applications (Springer, 2007-2009) and Intelligent Data Analysis (Elsevier/IOS Press, 1997-2002). She is actively involved in many key international AI conferences as an organizer and program committee member. In addition to serving as the President of the Taiwanese Association for Artificial Intelligence (2013-2014), Prof. Hsu has been a member of AAAI, IEEE, ACM, Phi Tau Phi, and an executive committee member of the IEEE Technical Committee on E-Commerce (2000) and TAAI (2004-current).
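MixTaiwan 20170222, NTHU Electrical Engineering, Min Sun: AI The Next Big Thing
Mix Taiwan
Speaker bio:
Min Sun, Assistant Professor | Department of Electrical Engineering, National Tsing Hua University
Dr. Min Sun currently teaches in the Department of Electrical Engineering at National Tsing Hua University. After graduating from the Department of Electronics Engineering at National Chiao Tung University, he earned an M.S. in electrical engineering from Stanford and a Ph.D. in Electrical Engineering: Systems from the University of Michigan, Ann Arbor, and did postdoctoral work in computer engineering at the University of Washington, Seattle. His research interests are computer vision, machine learning, and human-computer interaction. Building on recent deep learning breakthroughs in computer vision, he works on systems that span different subfields of AI, such as automatic video captioning (vision x natural language) and intelligent machines that interact with human behavior (vision x control).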
4th International Conference On Recent Advances in Mathematical Sciences and Applications (RAMSA - 21) organized by GVP College of Engineering. This deck is an overview of the trends in ML Engineering which is evolving as a discipline and how Mathematics, Machine Learning and ML Engineering are related to one another.
A very high level introduction to the field of Data Science, Artificial Intelligence. Covers an introduction to Supervised Learning, Unsupervised Learning, Deep Learning and Neural Networks. Given as part of Industry Lectures event at GVP College of Engineering
Data scientist: the sexiest job of the 21st century
Frank Kienle
Invited talk, describing the exciting work at Blue Yonder (www.blue-yonder.com),
'congress smart services - new business models' in Aachen, Germany 2015
Albert Y. C. Chen, Ph.D., VP of R&D at Viscovery--Visual Search, Simply Smarter.
Invited speech at Automatic Optical Inspection Equipment Association (AOIEA) Annual Summit, Taiwan, 2017/06/15, "Deep Learning and Automatic Optical Inspection".
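Dr. Albert Y. C. Chen (陳彥呈), VP of R&D at Viscovery, gave the keynote speech "The AOI Transformation Wave under AI: Breakthroughs and New Opportunities in Image Recognition Technology" at the annual members' meeting of the Automatic Optical Inspection Equipment Association on June 15, 2017.
Practical computer vision: A problem-driven approach towards learning CV/ML/DL
Albert Y. C. Chen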
Albert Chen Ph.D., 20170726 at Academia Sinica, Taiwan
Invited Speech during Academia Sinica's AI month
Chen Sagiv, co-founder and co-CEO of SagivTech, gave an introductory talk on computer vision at the She Codes branch in Google Campus TLV.
The talk gave an overview of what computer vision is, where it is used, some basic notions and algorithms, and the AI revolution.
Deep Learning @ ZHAW Datalab (with Mark Cieliebak & Yves Pauchard)
Thilo Stadelmann
A high-level introduction to the current buzz around "Deep Learning" (that it is famous, successful, and a continuation of neural network research; what is new since the last century, what the basic idea is, and what our outlook on its future is).
Followed by our stake in it and two use cases (face recognition, text analytics).
Computer vision techniques appear in many aspects of our daily life, with tremendous impact. These slides aim to introduce basic concepts and applications of computer vision to the general public.
Download link: https://uofi.box.com/shared/static/24vy7aule67o4g6djr83hzurf5a9lfp6.pptx
Overview of Computer Vision For Footwear Industry
Tanvir Moin
Computer vision is an interdisciplinary field that focuses on enabling computers to interpret and analyze visual data from the world around us. It involves the development of algorithms and techniques that allow machines to understand images and videos, just as humans do.
The main goal of computer vision is to create machines that can "see" and understand the world around them, and then use that information to make decisions or take actions. This can involve tasks such as object recognition, scene reconstruction, facial recognition, and image segmentation.
Computer vision has a wide range of applications in various fields, such as healthcare, entertainment, transportation, robotics, and security. Some examples include medical image analysis, autonomous vehicles, augmented reality, and surveillance systems.
In recent years, the development of deep learning techniques, particularly convolutional neural networks (CNNs), has greatly advanced the field of computer vision, allowing machines to achieve state-of-the-art performance on various visual recognition tasks.
University of Florida 3-D lapidary scanner 110614
Robert Harker
The subject invention pertains to an apparatus and method for collecting 2-D data slices of a specimen. Embodiments can incorporate a lapidary platen and an image recording system to image a specimen. The lapidary wheel platen can provide an imaging plane such that an image can be taken as the lapidary wheel platen abrades a surface of the specimen. A specimen mount can maintain the surface of the specimen properly aligned in the image plane. The imaging system can be a continuous recording system such as a video camera, a discrete recording system such as a flatbed scanner, or combinations of continuous and discrete recording systems to simultaneously collect two distinct data sets. The 2-D data set(s) can then be processed to create intricate 3-D color models.
Final lecture from the COMP 4010 course on Virtual and Augmented Reality. This lecture was about Research Directions in Augmented Reality. Taught by Mark Billinghurst on November 1st 2016 at the University of South Australia
AI for SDGs and International Development - Basics of AI
Atsushi Koshio
This slide deck was prepared for the training seminar on Artificial Intelligence for International Organizations. Introducing AI technologies into international development fields to achieve the SDGs would be a great opportunity to accelerate development. This material explains only the basics of AI and gives some examples of AI applications in this field.
Similar to The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work (20)
Machine Learning Foundations for Professional Managers
Albert Y. C. Chen
20180526@Taiwan AI Academy, Professional Managers Class.
Covering important concepts of classical machine learning, in preparation for deep learning topics to follow. Topics include regression (linear, polynomial, gaussian and sigmoid basis functions), dimension reduction (PCA, LDA, ISOMAP), clustering (K-means, GMM, Mean-Shift, DBSCAN, Spectral Clustering), classification (Naive Bayes, Logistic Regression, SVM, kNN, Decision Tree, Classifier Ensembles, Bagging, Boosting, Adaboost) and Semi-Supervised learning techniques. Emphasis on sampling, probability, curse of dimensionality, decision theory and classifier generalizability.
Covering important topics of Classical Machine Learning in 16 hours, in preparation for the following 10 weeks of Deep Learning courses at Taiwan AI academy from 2018/02-2018/05. Topics include regression (linear, polynomial, gaussian and sigmoid basis functions), dimension reduction (PCA, LDA, ISOMAP), clustering (K-means, GMM, Mean-Shift, DBSCAN, Spectral Clustering), classification (Naive Bayes, Logistic Regression, SVM, kNN, Decision Tree, Classifier Ensembles, Bagging, Boosting, Adaboost) and Semi-Supervised learning techniques. Emphasis on sampling, probability, curse of dimensionality, decision theory and classifier generalizability.
The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work
1. Putting the latest Computer Vision and
Deep Learning algorithms to work
The Opportunities and Challenges
Albert Y. C. Chen, Ph.D.
Vice President, R&D
Viscovery
2. Albert Y. C. Chen, Ph.D.
• Experience
2017-present: Vice President of R&D at Viscovery
2016-2017: Chief Scientist at Viscovery
2015: Principal Scientist at Nervve Technologies
2013-2014: Computer Vision Scientist at Tandent Vision
2011-2012: GE Global Research
• Education
Ph.D. in Computer Science, SUNY-Buffalo
M.S. in Computer Science, NTNU
B.S. in Computer Science, NTHU
• Some random things about me…
SUNY Excellence in Teaching Award, 2010.
Some rapid promotions, some failed startups, some
patents, some papers…
3. 1. W. Wu, A. Y. C. Chen, L. Zhao, and J. J. Corso. Brain tumor detection and segmentation in a CRF framework with pixel-wise affinity and superpixel-level features. International Journal of Computer Assisted Radiology and Surgery, 2015.
2. S. N. Lim, A. Y. C. Chen, and X. Yang. Parameter Inference Engine (PIE) on the Pareto Front. In Proceedings of International Conference of Machine Learning, AutoML Workshop, 2014.
3. A. Y. C. Chen, S. Whitt, C. Xu, and J. J. Corso. Hierarchical supervoxel fusion for robust pixel label propagation in videos. In Submission to ACM Multimedia, 2013.
4. A. Y. C. Chen and J. J. Corso. Temporally consistent multi-class video-object segmentation with the video graph-shifts algorithm. In Proceedings of IEEE Workshop on Applications of Computer Vision, 2011.
5. D. R. Schlegel, A. Y. C. Chen, C. Xiong, J. A. Delmerico, and J. J. Corso. Airtouch: Interacting with computer systems at a distance. In Proceedings of IEEE Workshop on Applications of Computer Vision, 2011.
6. A. Y. C. Chen and J. J. Corso. On the effects of normalization in adaptive MRF Hierarchies. In Proceedings of International Symposium CompIMAGE, 2010.
7. A. Y. C. Chen and J. J. Corso. Propagating multi-class pixel labels throughout video frames. In Proceedings of IEEE Western New York Image Processing Workshop, 2010.
8. A. Y. C. Chen and J. J. Corso. On the effects of normalization in adaptive MRF Hierarchies. Computational Modeling of Objects Represented in Images, pages 275–286, 2010.
9. Y. Tao, L. Lu, M. Dewan, A. Y. C. Chen, J. J. Corso, J. Xuan, M. Salganicoff, and A. Krishnan. Multi-level ground glass nodule detection and segmentation in CT lung images. Medical Image Computing and Computer-Assisted Intervention, 2009.
10. A. Y. C. Chen, J. J. Corso, and L. Wang. Hops: Efficient region labeling using higher order proxy neighborhoods. In Proceedings of IEEE International Conference on Pattern Recognition, 2008.
4. Some work done before I
caught the startup fever
[Figure: AirTouch HCI interface for content-based image retrieval (CBIR). Diagram labels: Start; AirTouch waits in the background for the initialization signal; Initialize; freestyle sketching stage; Terminate; output image; CBIR query; database; results.]
5. Interactive Segmentation & Classification
• Segmentation then classification:
• computationally more efficient,
• results in much higher classification accuracy.
• Pioneered the “pixel label propagation” field.
• First to utilize superpixels and supervoxels for the task.
[Figure: label a subset of pixels as foreground (FG) or background (BG), then fill in the pixel label map by traditional spatial propagation and by spatio-temporal propagation over time.]
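As a rough illustration of the spatial propagation idea on this slide, the sketch below spreads a few seed labels across a tiny grayscale grid by breadth-first region growing. It is a minimal stand-in, not the superpixel or supervoxel method from the talk; the function name `propagate_labels`, the tolerance `tol`, and the toy image are all assumptions made for the example.

```python
from collections import deque

def propagate_labels(image, seeds, tol=10):
    """Spread seed labels to 4-connected neighbours of similar intensity.

    image: 2-D list of grayscale values (toy data, hypothetical).
    seeds: {(row, col): label} for the hand-labelled subset of pixels.
    tol:   maximum intensity difference for a label to spread (assumed value).
    """
    h, w = len(image), len(image[0])
    labels = [[None] * w for _ in range(h)]
    queue = deque()
    for (r, c), lab in seeds.items():
        labels[r][c] = lab
        queue.append((r, c))
    while queue:
        r, c = queue.popleft()
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            if 0 <= nr < h and 0 <= nc < w and labels[nr][nc] is None:
                # Only cross to a neighbour that looks like the current pixel.
                if abs(image[nr][nc] - image[r][c]) <= tol:
                    labels[nr][nc] = labels[r][c]
                    queue.append((nr, nc))
    return labels

# Toy image: dark background on the left, bright foreground on the right.
image = [
    [10, 12, 200, 205],
    [11, 13, 198, 202],
    [9, 14, 201, 199],
]
seeds = {(0, 0): "BG", (0, 3): "FG"}  # label a subset of pixels
label_map = propagate_labels(image, seeds)
for row in label_map:
    print(row)
```

Extending the same idea along the time axis (neighbours in the previous and next frame) would give a crude form of spatio-temporal propagation.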
6. Image/Video Object Recognition
and Content Understanding
[Figure: activity ontology over time. Low-level detections (marked x) feed mid-level activities (approach, carries, gives, receives) involving Person 1, Person 2, and an object, which feed high-level reasoning, e.g. "person approaches, carries, gives, receives".]
7. Learning and Adapting Optimal
Classifier Parameters
[Figure labels: image-level feature space divided into subspaces A, B, and C; priors; patch-level feature space; posterior probability; suggest optimal parameter configuration.]
8. Graphical Models and
Stochastic Optimization
(a) The space-time volume of a video, showing the objects (A-F) and the time span in which each appears.
(b) The temporal relationship graph. An edge between two vertices means that the two objects overlap in time.
(c) The goal is to cover all objects with the smallest number of "ground truth key frames" (key 1 and key 2 in the figure).
(d) This translates to iteratively solving the max clique problem until all vertices belong to a clique.
[Figure: layer n and layer n+1 at frame t-1 and frame t, with "Shift" and "Temporal Shift" arrows.]
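The key-frame selection in (a) to (d) can be sketched as follows. Because the objects are time intervals, every clique in the temporal relationship graph shares at least one common frame, so the largest clique can be found by looking for the frame where the most remaining objects overlap. This is a greedy sketch under that assumption and is not guaranteed to return the minimum number of key frames in every case; the function name `pick_key_frames` and the time spans are hypothetical.

```python
def pick_key_frames(spans):
    """Greedily cover all objects with key frames.

    spans: {object_name: (first_frame, last_frame)}, inclusive (toy data).
    Returns a list of (key_frame, objects_covered_by_it).
    """
    remaining = dict(spans)
    keys = []
    while remaining:
        # The maximum overlap is always reached at some object's first frame.
        candidates = sorted({start for start, _ in remaining.values()})
        best_frame, best_group = None, []
        for t in candidates:
            group = sorted(n for n, (s, e) in remaining.items() if s <= t <= e)
            if len(group) > len(best_group):
                best_frame, best_group = t, group
        keys.append((best_frame, best_group))
        # Remove the covered clique and repeat on what is left.
        for name in best_group:
            del remaining[name]
    return keys

# Hypothetical time spans for the six objects A-F.
spans = {
    "A": (0, 4), "B": (2, 6), "C": (3, 5),
    "D": (8, 12), "E": (9, 11), "F": (10, 13),
}
key_frames = pick_key_frames(spans)
print(key_frames)
```

With these made-up spans, two key frames are enough to cover all six objects, which mirrors the "key 1" and "key 2" labels on the slide.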
9. Medical Imaging and
Geospatial Imaging
• GGN (ground glass nodule) detection and segmentation in lung CT.
• Geospatial imaging: building detection.
• Brain tumor detection and segmentation in MR images.
10. Why Risk to Innovate?
• Good business models NEVER last forever.
• Average “shelf life” on the S&P 500: 20 years.
• 100-year-old companies constantly reinvent themselves every 10-20 years.
• Startups contribute 20% of the USA’s GDP.
11. The Death of a Good
Business Model
• Foxconn: 20-year revenue vs. net profit (now at 5%)
12. What do 100 year old
corporations do?
GE Schenectady, 1896
13. History of change at GE
• 1896: one of the 12 original companies on the Dow Jones Industrial Average (also the only one remaining).
• 1889: lightbulbs
• 1919: radios
• 1927: TV
• 1941: jet engine
• 1960: nuclear power
• 1971: room AC units
• 1995: MRI
14. History of change at IBM
• 1960s: mainframe computer
• 1980s: personal computer
• 2000s: integrated solutions
• 2020s: AI, Watson
24. Now, again, do we want to
do OEM/ODM forever?
Optimizing an old business model
is just delaying its eventual death.
25. Startups
• A company, partnership, or temporary
organization designed to search for a new,
repeatable and scalable business model.
26. Your Idea
• Are you passionate about it?
• Is it disruptive enough?
• What is your business plan?
• What is it?
• Can it make money?
• What is the future of the idea?
• What is your competitive advantage?
• How do you build up your entry barrier?
30. Prototype
• Hack out a prototype
• Spend 2-10 weeks max.
• Investors are much more likely to fund you if
you have a minimal initial version of your idea.
• Hackathons are a good place to start.
• Iteratively improve the prototype
36. Brief History
Marvin Minsky
“In 1966, Minsky hired a first-year undergraduate
student and assigned him a problem to solve over the
summer: connect a television camera to a computer
and get the machine to describe what it sees.”
Gerald Sussman
The student never worked on
Computer Vision problems again.
37. Brief History
• 1960’s: interpretation of synthetic worlds
• 1970’s: some progress on interpreting selected images
• 1980’s: ANNs come and go; shift toward geometry and increased
mathematical rigor
• 1990’s: face recognition; statistical analysis in vogue
• 2000’s: broader recognition; large annotated datasets available; video
processing starts
[Figures: Guzman ’68; Ohta and Kanade ’78; Turk and Pentland ’91]
55. What alg. should I use then?
• How much data do we have?
• What objects are we trying to detect?
• For example, Google’s DNN trained with 11k images
over 20 classes in 2013 doesn’t always beat DPM.
[Figure: per-class detection scores (y-axis 0 to 0.6) for DNN vs. DPM on 20 classes: aero, bike, bird, boat, bottle, bus, car, cat, chair, cow, dog, horse, m-bike, person, plant, sheep, sofa, table, train, TV.]
56. ML alg. and their Applications
• Deep Learning
• Markovian/Bayesian
• Feature Matching
• Other ML methods
57. Meta-Learning
• Different use cases call for different ML algorithms.
• Meta-Learning: learning how to learn.
• Requires plenty of domain-specific know-how.
68. Face Verification and Identification,
Labeled Faces in the Wild (LFW)
Recognition Accuracy:
• 1 to 1: 99%+
• 1 to 100: 90%
• 1 to 10,000: 50%-70%.
• 1 to 1M: 30%.
LFW dataset, common FN↑, FP↓
111. Other Applications in
Business Intelligence
• Measure brand exposure.
• Measure sponsorship effectiveness.
• Loss prevention and retail layout optimization.
118. Issues
• Highly anticipated, highly acclaimed, but small
crowd at $500 a license.
• Adobe Photoshop monopoly and the “not
invented here” syndrome.
• Adobe’s arch-rival, Corel (Corel Draw, Paint
Shop Pro, Ulead PhotoImpact) was DYING and
asked too much from the botched deal.
119. Have fun scribbling out your
shadows in photoshop!
Poor Bob from Adobe wasted 9 minutes removing just 1 shadow
122. Retrospect
• 20 researchers burned 25 million in 8 years;
investors got 50 patents in return, period.
• Overestimated the total addressable market
size, in a market with existing monopoly.
• Many missed opportunities. Counterexample of
the lean startup model.
124. Satellite/Aerial Imagery Analysis
• 40cm resolution at 30fps for 90 sec for any location on earth.
• One LEO satellite revisits any place on Earth every 3 days.
• Need 24 satellites to revisit any place on Earth every 3 hours.
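The satellite count on this slide follows from simple arithmetic, sketched below. It assumes that the revisit interval shrinks in proportion to the number of evenly spaced satellites, which is a simplification made for the example rather than something stated in the slides; the variable names are hypothetical.

```python
# One LEO satellite revisits any place on Earth every 3 days (from the slide).
single_sat_revisit_hours = 3 * 24

# Target: revisit any place on Earth every 3 hours (from the slide).
target_revisit_hours = 3

# Assumption: N evenly spaced satellites cut the revisit interval by a factor of N.
satellites_needed = single_sat_revisit_hours // target_revisit_hours
print(satellites_needed)
```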
125. Challenges for Single satellite depth
estimation and 3D reconstruction
• At 30fps, a LEO satellite travels 250m between two consecutive frames -> theoretically sufficient for cm-level depth estimation.
• Sources of Noise:
• Camera distortions
• Atmospheric Disturbance
• Ground vegetation
• Sub-pixel sampling noise
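The 250m figure can be checked with a short calculation. The speed of roughly 7.5 km/s used below is a typical value for low Earth orbit and is an assumption of this sketch, not a number from the slides; the 30fps frame rate and the 90-second clip length come from the slides.

```python
# Assumed typical LEO speed; not stated in the slides.
orbital_speed_m_per_s = 7500.0

frames_per_second = 30   # from the slide
clip_length_s = 90       # from the previous slide

# Distance travelled between two consecutive frames (the stereo baseline).
baseline_per_frame_m = orbital_speed_m_per_s / frames_per_second

# Distance travelled over the whole 90-second clip.
baseline_over_clip_km = orbital_speed_m_per_s * clip_length_s / 1000

print(baseline_per_frame_m, "m between consecutive frames")
print(baseline_over_clip_km, "km over the full clip")
```

Frames that are further apart in the clip give a much longer baseline than consecutive frames, which matters for depth estimation once the noise sources listed on the slide are taken into account.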
126. What happened?
• B2B customers take too long to strike deals.
• Google ate us alive in just 3 months, while we
were still pitching for VC-funding with our
prototype.
128. Retrospect
• Growth pains expanding from intelligence
community clients to advertisement clients.
• Forming the right team of engineers and
researchers and moving at the right pace.
• For any Computer Vision/Machine Learning
company:
• Researchers that cannot program -> OUT
• Engineers that don’t know math -> OUT
135. Challenges Encountered
Along the Way
• From Product Recognition in Images, to Face,
Logo, Object, Scene recognition in Videos.
• Number of Categories
• Recognition Accuracy
• Recognition Speed
• System Architecture
• Business Model
136. Viscovery’s Edge
• Market: first mover’s advantage in China’s video
streaming market.
• Speed: we built the whole VDS thing in a few months!
• Team: You! Seriously!
• Technology:
• Depth
• Breadth
• Cloud
• Customizability
• Self-Learning
137. Life is not all rosy at startups
• High Risk, High Pressure, High Uncertainty!
• Resources are scarce, but you MUST DELIVER!
• Forming your all-star team is not that easy…
• Focus, and persistence.
145. The Goldilocks zone of innovation
[Figure: chart with axes Business Relevance and Academic Relevance. One end: plentiful resources, hierarchical organization. Other end: lack of resources, responsive organization. Plotted groups: traditional corporations talking “innovation”, corporate research, MSR, startups struggling to survive, academic spinoffs.]
Ben Jai (翟本橋): never worked a single day in my life
example: Tivo disrupts TV market / creates DVR market
example: Facebook, Twitter disrupt online social networking
example: FourSquare creates location-based "check in" ad market