Roberto Muñoz 1
Roberto Muñoz
Astronomer & Data Scientist
Research Officer at MetricArts
@RobertoKPax
Roberto Muñoz 2
My
Journey
M y J o u r n e y i n
A c a d e m i a
Roberto Muñoz 3
2 0 0 9
PhD degree in Astronomy
Ponticifia Universidad Católica de Chile
2 0 1 2 - 2 0 1 5
Postdoctoral researcher
Pontificia Universidad Católica de Chile
2 0 1 0 - 2 0 1 1
Postdoctoral researcher
Strasbourg University, France
THROUGH THE TIME
Worked in Academia for about 6 years
Roberto Muñoz 4
The Next Generation Virgo survey (NGVS) is
a wide, deep and panchromatic survey of
the Virgo cluster. Area 100 deg2.
Combining optical and near-infrared bands
I found a clear separation between Galactic
sources, extragalactic GCs and galaxies.
Virgo cluster and
uiK diagram
A h i d d e n t r e a s u r e
M u ñ o z e t a l . 2 0 1 4
Roberto Muñoz 5
The Next Fornax Survey (NGFS) is a
panchromatic survey of the Fornax
cluster.
Study giant and dwarf galaxies, GCs and
UCDs.
Fornax cluster and
dwarf galaxies
H u n t i n g g h o s t s
M u ñ o z e t a l . 2 0 1 5
Roberto Muñoz 6
Full of dwarfs
643 dwarf galaxies in Fornax
One of the largest catalog of dwarf
galaxies belonging to a structure
Ó r d e n e s - B r i c e ñ o e t a l . 2 0 1 8
Roberto Muñoz 7
Roberto Muñoz 8
Science funding
crisis
Public budget cuts, Excessive
bureaucracy, Poorly defined
research policies
Roberto Muñoz 9
Science by
the numbers
S c i e n c e b y
t h e n u m b e r s
Roberto Muñoz 10
Number of researchers
Number of researchers per 1,000 workers
9.1 scientists per
1K workers
USA
10.1 scientists
per 1K workers
France
9 scientists per
1K workers
Germany
USA
France
Germany
Roberto Muñoz 11
Number of researchers
Number of researchers per 1,000 workers
1 scientist per
1K workers
Chile
2.9 scientists
per 1K workers
Argentina
0.7 scientists
per 1K workers
Brazil
Chile
Argentina
Brazil
Roberto Muñoz 12
R&D funding
Gross domestic spending on R&D
2.7% of GPD
USA
2.3% of GDP
France
2.7% of GDP
Germany
USA
France
Germany
Roberto Muñoz 13
R&D funding
Gross domestic spending on R&D
0.38% of GDP
Chile
0.63% of GDP
Argentina
1.2% of GDP
Brazil
Chile
Argentina
Brazil
Roberto Muñoz 14
What about Chile?
The National Association of Researchers
of Chile (ANIP) conducted a survey
about working conditions.
79%
9%
5%
Working in Universities
Working in Research centers
Working in Industry
Roberto Muñoz 15
Chile vs Germany
Employed doctorate holders by economic sector
65%5% Startups, Companies
I n d u s t r y
19%79% Public and Private
Universities, Professional
institutes
A c a d e m i a
Roberto Muñoz 16
First
approach
F i r s t a p p r o a c h
t o I n d u s t r y
Roberto Muñoz 17
2 0 1 6
Collaboration Academia-Industry
Ministry of Economy funding
2 0 1 5
Invited by a company to
collaborate
TO D A Y
Data scientist and Research Officer
Chilean company MetricArts
THROUGH THE TIME
Decided to move to Industry
Roberto Muñoz 18
The company was looking for new
business opportunities and wanted to
explore data intensive solutions.
Meet the company
MetricArts is a Chilean company funded in
2007. It started as a Business Intelligence
company and nowadays is a technology
solutions company.
Roberto Muñoz 19
Collaboration
2 0 1 5
The company wanted to explore the new
services of the platform Microsoft Azure.
Several clients were interested in buying
new services, such as image and text
analysis.
Cloud computing, Cognitive analytics.
Roberto Muñoz 20
University-Industry
2 0 1 6
We got funding from the Ministry of
Economy for running a project between
the Institute of Astrophysics at PUC
and the company MetricArts.
Still working at the University.
Develop a computer vision system for
processing surveillance cameras.
Roberto Muñoz 21
Industrial R&D
T o d a y
On 2017 I decided to leave the
academic career and joined a company.
The R&D team at MetricArts consists of
3 scientists and 5 engineers.
• Computer Vision
• Machine Learning
• Data Intensive Analytics
• Cloud computing
Roberto Muñoz 22
Industrial R&D
T o d a y
Daily activities
• Read papers
• Code new algorithms
• Train machine learning models
• Develop prototypes
• Meet with sales team and evaluate
technical challenges
• Meet with engineering and BI teams
• Improve deployed systems
• Submit papers and attend
conferences
Roberto Muñoz 23
Data Science
and IT industry
D a t a S c i e n c e
a n d I T i n d u s t r y
Roberto Muñoz 24
S t or e
Databases, Excel,
XML, JSON
R e t r i e v e
SQL, Data Lake,, Data
Warehouse
Tr a ns m i t
Transmission,
Propagation,
Reception
M a ni pu l a t e
Data Analysis, Data
mining
Information
Technology
Application of computers to Store,
Retrieve, Transmit and Manipulate Data
Roberto Muñoz 25
E m p l o y m e n t o f c o m p u t e r
a n d i n f o r m a t i o n t e c h n o l o g y
o c c u p a t i o n s i s p r o j e c t e d t o
g r o w 1 3 % f r o m 2 0 1 6 t o 2 0 2 6 .
* B u r e a u o f L a b o r S t a t i s t i c s , U S A
Job market
C o m p u t e r a n d I n f o r m a t i o n
s y s t e m s a r e e s s e n t i a l p a r t s
o f e v e r y b u s i n e s s t o d a y .
F i n a n c e a n d h e a l t h c a r e
a r e t h e b e s t c l i e n t s .
Business
Roberto Muñoz 26
Roberto Muñoz 27
A new paradigm
Jim Gray, researcher of Microsoft
talked about the Fourth paradigm
The digital era and new technologies have
changed our life style..
The first three paradigms were experimental,
theoretical and, more recently,
computational science.
We are living in the data-driven era.
Roberto Muñoz 28
Data Science
Interdisciplinary field that applies and
develops new methods to extract
information from data.
Pr og r a m m i n g S t a t i s t i c s
M a c hi ne L e a r ni ng D om a i n
Roberto Muñoz 29
T h e D a t a S c i e n c e V e n n D i a g r a m
D r e w C o n w a y ( 2 0 1 0 )
Programming
Exploratory analysis
Analytical thinking
Modeling
Domain knowledge
Business experience
Roberto Muñoz 30
T - s h a p e d v s P i - s h a p e d
A l e x S z a l a y
New breed
Classic PhD program generates T-shaped researchers:
scientists with wide-but-shallow general knowledge, but deep
skill and expertise in one particular area.
New breed of scientific researchers must be Pi-shaped:
maintain the same wide breadth, but push deeper both in their
own subject area and in statistics/computational methods.
Roberto Muñoz 31
Data is growing exponentially and
standard methods and tools are not
enough.
Facebook and Google ingest more than
500 TB of data per day.
Big Data
N e w m e t h o d s a n d t o o l s
Roberto Muñoz 32
Few companies have to deal with Terabytes
and Petabytes datasets.
Companies generate and accumulate lot of
unstructured data. Variety is a challenge.
Sensors are common nowadays and IoT
industry is growing fast. Processing real-time
data is a challenge.
The three Vs
N o t j u s t V o l u m e
Roberto Muñoz 33
Cross-Infrastructure/Analytics
S h o u l d e r o f g i a n t s
Roberto Muñoz 34
Advices
A d v i c e s
Roberto Muñoz 35
There are multiple online courses about
Data Analysis, Data Science,
Visualization, Machine Learning,.
Take online
courses
K n o w l e d g e a n d C e r t i f i c at e s
Roberto Muñoz 36
Python and R are the most used
languages in data analysis. More than 40
million users.
SQL is from 70’s but still very used by
many companies.
Learn modern
languages
D a t a - s c i e nc e l a n g u a g e s
Roberto Muñoz 37
You can start from an already existing
projects and make small changes.
Look for public datasets and try new
ways to analyze and visualize data. Ask
the right questions.
Do independent
projects
T r y n e w i d e a s
Roberto Muñoz 38
Github is a code hosting platform for version
control and collaboration. It allows to create
private and public code repositories.
Start with simple projects and then move to
more complex projects in Github.
Open a Github
account
S h o w y o u r c o d e s
Roberto Muñoz 39
Internships offer students a period of
practical experience in the industry.
Unlike conventional employment,
internships have an emphasis on training.
Do internships
L a n d i n a c o m p a n y
Roberto Muñoz 40
Write a short story about your background
and projects you have been involved
List the software, tools and programming
languages you have experience.
Make emphasis in your soft and technical
skills related to the job you are applying.
Improve your CV
In the industry, the main purpose of a
CV is to get job interviews.
Roberto Muñoz 41
Upload your CV in Linkedin and expand your
network. Let recruiters know you are open.
Subscribe to the Kaggle job listing. Jobs all
around the world.
Register in Getonboard website and look for
jobs in the Data/Analytics and programming
categories.
Job search
L o o k f o r o p p o r t u n i t i e s
Roberto Muñoz 42
Research is exploration and discovery.
It's investigating something that no
one knows or understands. Research is
creating new knowledge.
N e i l A r m s t r o n g
“

From academy to industry

  • 1.
    Roberto Muñoz 1 RobertoMuñoz Astronomer & Data Scientist Research Officer at MetricArts @RobertoKPax
  • 2.
    Roberto Muñoz 2 My Journey My J o u r n e y i n A c a d e m i a
  • 3.
    Roberto Muñoz 3 20 0 9 PhD degree in Astronomy Ponticifia Universidad Católica de Chile 2 0 1 2 - 2 0 1 5 Postdoctoral researcher Pontificia Universidad Católica de Chile 2 0 1 0 - 2 0 1 1 Postdoctoral researcher Strasbourg University, France THROUGH THE TIME Worked in Academia for about 6 years
  • 4.
    Roberto Muñoz 4 TheNext Generation Virgo survey (NGVS) is a wide, deep and panchromatic survey of the Virgo cluster. Area 100 deg2. Combining optical and near-infrared bands I found a clear separation between Galactic sources, extragalactic GCs and galaxies. Virgo cluster and uiK diagram A h i d d e n t r e a s u r e M u ñ o z e t a l . 2 0 1 4
  • 5.
    Roberto Muñoz 5 TheNext Fornax Survey (NGFS) is a panchromatic survey of the Fornax cluster. Study giant and dwarf galaxies, GCs and UCDs. Fornax cluster and dwarf galaxies H u n t i n g g h o s t s M u ñ o z e t a l . 2 0 1 5
  • 6.
    Roberto Muñoz 6 Fullof dwarfs 643 dwarf galaxies in Fornax One of the largest catalog of dwarf galaxies belonging to a structure Ó r d e n e s - B r i c e ñ o e t a l . 2 0 1 8
  • 7.
  • 8.
    Roberto Muñoz 8 Sciencefunding crisis Public budget cuts, Excessive bureaucracy, Poorly defined research policies
  • 9.
    Roberto Muñoz 9 Scienceby the numbers S c i e n c e b y t h e n u m b e r s
  • 10.
    Roberto Muñoz 10 Numberof researchers Number of researchers per 1,000 workers 9.1 scientists per 1K workers USA 10.1 scientists per 1K workers France 9 scientists per 1K workers Germany USA France Germany
  • 11.
    Roberto Muñoz 11 Numberof researchers Number of researchers per 1,000 workers 1 scientist per 1K workers Chile 2.9 scientists per 1K workers Argentina 0.7 scientists per 1K workers Brazil Chile Argentina Brazil
  • 12.
    Roberto Muñoz 12 R&Dfunding Gross domestic spending on R&D 2.7% of GPD USA 2.3% of GDP France 2.7% of GDP Germany USA France Germany
  • 13.
    Roberto Muñoz 13 R&Dfunding Gross domestic spending on R&D 0.38% of GDP Chile 0.63% of GDP Argentina 1.2% of GDP Brazil Chile Argentina Brazil
  • 14.
    Roberto Muñoz 14 Whatabout Chile? The National Association of Researchers of Chile (ANIP) conducted a survey about working conditions. 79% 9% 5% Working in Universities Working in Research centers Working in Industry
  • 15.
    Roberto Muñoz 15 Chilevs Germany Employed doctorate holders by economic sector 65%5% Startups, Companies I n d u s t r y 19%79% Public and Private Universities, Professional institutes A c a d e m i a
  • 16.
    Roberto Muñoz 16 First approach Fi r s t a p p r o a c h t o I n d u s t r y
  • 17.
    Roberto Muñoz 17 20 1 6 Collaboration Academia-Industry Ministry of Economy funding 2 0 1 5 Invited by a company to collaborate TO D A Y Data scientist and Research Officer Chilean company MetricArts THROUGH THE TIME Decided to move to Industry
  • 18.
    Roberto Muñoz 18 Thecompany was looking for new business opportunities and wanted to explore data intensive solutions. Meet the company MetricArts is a Chilean company funded in 2007. It started as a Business Intelligence company and nowadays is a technology solutions company.
  • 19.
    Roberto Muñoz 19 Collaboration 20 1 5 The company wanted to explore the new services of the platform Microsoft Azure. Several clients were interested in buying new services, such as image and text analysis. Cloud computing, Cognitive analytics.
  • 20.
    Roberto Muñoz 20 University-Industry 20 1 6 We got funding from the Ministry of Economy for running a project between the Institute of Astrophysics at PUC and the company MetricArts. Still working at the University. Develop a computer vision system for processing surveillance cameras.
  • 21.
    Roberto Muñoz 21 IndustrialR&D T o d a y On 2017 I decided to leave the academic career and joined a company. The R&D team at MetricArts consists of 3 scientists and 5 engineers. • Computer Vision • Machine Learning • Data Intensive Analytics • Cloud computing
  • 22.
    Roberto Muñoz 22 IndustrialR&D T o d a y Daily activities • Read papers • Code new algorithms • Train machine learning models • Develop prototypes • Meet with sales team and evaluate technical challenges • Meet with engineering and BI teams • Improve deployed systems • Submit papers and attend conferences
  • 23.
    Roberto Muñoz 23 DataScience and IT industry D a t a S c i e n c e a n d I T i n d u s t r y
  • 24.
    Roberto Muñoz 24 St or e Databases, Excel, XML, JSON R e t r i e v e SQL, Data Lake,, Data Warehouse Tr a ns m i t Transmission, Propagation, Reception M a ni pu l a t e Data Analysis, Data mining Information Technology Application of computers to Store, Retrieve, Transmit and Manipulate Data
  • 25.
    Roberto Muñoz 25 Em p l o y m e n t o f c o m p u t e r a n d i n f o r m a t i o n t e c h n o l o g y o c c u p a t i o n s i s p r o j e c t e d t o g r o w 1 3 % f r o m 2 0 1 6 t o 2 0 2 6 . * B u r e a u o f L a b o r S t a t i s t i c s , U S A Job market C o m p u t e r a n d I n f o r m a t i o n s y s t e m s a r e e s s e n t i a l p a r t s o f e v e r y b u s i n e s s t o d a y . F i n a n c e a n d h e a l t h c a r e a r e t h e b e s t c l i e n t s . Business
  • 26.
  • 27.
    Roberto Muñoz 27 Anew paradigm Jim Gray, researcher of Microsoft talked about the Fourth paradigm The digital era and new technologies have changed our life style.. The first three paradigms were experimental, theoretical and, more recently, computational science. We are living in the data-driven era.
  • 28.
    Roberto Muñoz 28 DataScience Interdisciplinary field that applies and develops new methods to extract information from data. Pr og r a m m i n g S t a t i s t i c s M a c hi ne L e a r ni ng D om a i n
  • 29.
    Roberto Muñoz 29 Th e D a t a S c i e n c e V e n n D i a g r a m D r e w C o n w a y ( 2 0 1 0 ) Programming Exploratory analysis Analytical thinking Modeling Domain knowledge Business experience
  • 30.
    Roberto Muñoz 30 T- s h a p e d v s P i - s h a p e d A l e x S z a l a y New breed Classic PhD program generates T-shaped researchers: scientists with wide-but-shallow general knowledge, but deep skill and expertise in one particular area. New breed of scientific researchers must be Pi-shaped: maintain the same wide breadth, but push deeper both in their own subject area and in statistics/computational methods.
  • 31.
    Roberto Muñoz 31 Datais growing exponentially and standard methods and tools are not enough. Facebook and Google ingest more than 500 TB of data per day. Big Data N e w m e t h o d s a n d t o o l s
  • 32.
    Roberto Muñoz 32 Fewcompanies have to deal with Terabytes and Petabytes datasets. Companies generate and accumulate lot of unstructured data. Variety is a challenge. Sensors are common nowadays and IoT industry is growing fast. Processing real-time data is a challenge. The three Vs N o t j u s t V o l u m e
  • 33.
  • 34.
  • 35.
    Roberto Muñoz 35 Thereare multiple online courses about Data Analysis, Data Science, Visualization, Machine Learning,. Take online courses K n o w l e d g e a n d C e r t i f i c at e s
  • 36.
    Roberto Muñoz 36 Pythonand R are the most used languages in data analysis. More than 40 million users. SQL is from 70’s but still very used by many companies. Learn modern languages D a t a - s c i e nc e l a n g u a g e s
  • 37.
    Roberto Muñoz 37 Youcan start from an already existing projects and make small changes. Look for public datasets and try new ways to analyze and visualize data. Ask the right questions. Do independent projects T r y n e w i d e a s
  • 38.
    Roberto Muñoz 38 Githubis a code hosting platform for version control and collaboration. It allows to create private and public code repositories. Start with simple projects and then move to more complex projects in Github. Open a Github account S h o w y o u r c o d e s
  • 39.
    Roberto Muñoz 39 Internshipsoffer students a period of practical experience in the industry. Unlike conventional employment, internships have an emphasis on training. Do internships L a n d i n a c o m p a n y
  • 40.
    Roberto Muñoz 40 Writea short story about your background and projects you have been involved List the software, tools and programming languages you have experience. Make emphasis in your soft and technical skills related to the job you are applying. Improve your CV In the industry, the main purpose of a CV is to get job interviews.
  • 41.
    Roberto Muñoz 41 Uploadyour CV in Linkedin and expand your network. Let recruiters know you are open. Subscribe to the Kaggle job listing. Jobs all around the world. Register in Getonboard website and look for jobs in the Data/Analytics and programming categories. Job search L o o k f o r o p p o r t u n i t i e s
  • 42.
    Roberto Muñoz 42 Researchis exploration and discovery. It's investigating something that no one knows or understands. Research is creating new knowledge. N e i l A r m s t r o n g “