SlideShare a Scribd company logo
BIG DATA
FIRST INTERNSHIP
Sep 2013 – Jan 2014
Organized by Contemi Vietnam
Outcomes
• Contemi
• Big Data presence
• Big Data know-how
• Big Data experience
• Interns
• Linux
• R / Python language
• Machine Learning practice
• Process
• Scrum
• Cross Industry Standard Process
for Data Mining (CRISM-DM)
• Kaggle profile
• Hadoop
Preparation
• Platform: Ubuntu 12.04 LTS
• Process:
• Scrum
• Cross Industry Standard Process for Data Mining (CRISP DM)
• Weekly blog
• http://contanalytics.wordpress.com/
Headstart for Dung
• 16/09 – 30/09
• Learn R / Python
• Try Digit Recognizer competition on Kaggle.com
• Join in Introduction to Recommender System and Web Intelligence and Big
Data on Coursera.com
3 month plan
• 1/10 – 31/10
• Go through all typical Machine Learning algorithms, implement, demo and present to Contemi
• 1/11 – 15/11
• Compete for AMS 2013-2014 Solar Energy Prediction Contest
• URL: http://www.kaggle.com/c/ams-2014-solar-energy-prediction-contest
• 16/11 – 22/11
• Compete for Accelerometer Biometric Competition
• URL: http://www.kaggle.com/c/accelerometer-biometric-competition
• 23/11 – 31/12 (end of internship)
• Deploy Hadoop
• Learn Java
• Run Word counting and Sorting experiments with large data (> 1GB)
• Compete for Facebook Recruting III – Keyword Extraction (personally)
• Re-optimize built model basing on Hadoop
Next plan for next internships
• App using Singapore open datasets
• Stock prediction app for Vietnam market
• Visualization
• GitHub
• R-Bloggers

More Related Content

What's hot

Free Your On-Premises Data
Free Your On-Premises DataFree Your On-Premises Data
Free Your On-Premises Data
Sam Basu
 
Sharing and Deploying Data Science with KNIME Server
Sharing and Deploying Data Science with KNIME ServerSharing and Deploying Data Science with KNIME Server
Sharing and Deploying Data Science with KNIME Server
KNIMESlides
 
Sap open connectors #sitcph
Sap open connectors #sitcphSap open connectors #sitcph
Sap open connectors #sitcph
Daniel Graversen
 
AWS Finland Meetup 2019 April
AWS Finland Meetup 2019 AprilAWS Finland Meetup 2019 April
AWS Finland Meetup 2019 April
Rolf Koski
 
All About Angular and ArcGIS - Developers Forum - AC18
All About Angular and ArcGIS - Developers Forum - AC18All About Angular and ArcGIS - Developers Forum - AC18
All About Angular and ArcGIS - Developers Forum - AC18
Esri UK
 
DevOpsDays Amsterdam 2016 workshop
DevOpsDays Amsterdam 2016 workshopDevOpsDays Amsterdam 2016 workshop
DevOpsDays Amsterdam 2016 workshop
Arnold Van Wijnbergen
 
Cloud Developer Days - BigQuery
Cloud Developer Days - BigQueryCloud Developer Days - BigQuery
Cloud Developer Days - BigQuery
Wlodek Bielski
 
DevTest Labs en Azure (por Iván Cañizares)
DevTest Labs en Azure (por Iván Cañizares)DevTest Labs en Azure (por Iván Cañizares)
DevTest Labs en Azure (por Iván Cañizares)
Jorge Millán Cabrera
 
Intro to Quantum GIS Desktop GIS
Intro to Quantum GIS Desktop GISIntro to Quantum GIS Desktop GIS
Intro to Quantum GIS Desktop GIS
Joanne Cook
 
An Introduction to Cloud Computing
An Introduction to Cloud ComputingAn Introduction to Cloud Computing
An Introduction to Cloud Computing
Mike Frampton
 
Cloud computing
Cloud computingCloud computing
Cloud computing
Yash Patel
 
API Management: La Puerta de enlace (por Francisco Nieto)
API Management: La Puerta de enlace (por Francisco Nieto)API Management: La Puerta de enlace (por Francisco Nieto)
API Management: La Puerta de enlace (por Francisco Nieto)
Jorge Millán Cabrera
 
Power Apps Deep Dive - Munchen 2019
Power Apps Deep Dive - Munchen 2019Power Apps Deep Dive - Munchen 2019
Power Apps Deep Dive - Munchen 2019
Timo Pertilä
 
Open Source Story and what’s new in KNIME Software
Open Source Story and what’s new in KNIME SoftwareOpen Source Story and what’s new in KNIME Software
Open Source Story and what’s new in KNIME Software
KNIMESlides
 
Scaling and Fault-resistance strategies and geography
Scaling and Fault-resistance strategies and geographyScaling and Fault-resistance strategies and geography
Scaling and Fault-resistance strategies and geography
Jason Smith
 
Sitech
SitechSitech
Sitech
BigDataExpo
 
AWS Dev Day 2018
AWS Dev Day 2018AWS Dev Day 2018
AWS Dev Day 2018
Mohit Ajwani
 
AWS Finland meetup 2017 August
AWS Finland meetup 2017 AugustAWS Finland meetup 2017 August
AWS Finland meetup 2017 August
Rolf Koski
 
dotCMS - Move Forward, Build Faster, Get Farther
dotCMS - Move Forward, Build Faster, Get FartherdotCMS - Move Forward, Build Faster, Get Farther
dotCMS - Move Forward, Build Faster, Get Farther
Jason Smith
 
Real time serverless data pipelines on AWS
Real time serverless data pipelines on AWSReal time serverless data pipelines on AWS
Real time serverless data pipelines on AWS
The Incredible Automation Day
 

What's hot (20)

Free Your On-Premises Data
Free Your On-Premises DataFree Your On-Premises Data
Free Your On-Premises Data
 
Sharing and Deploying Data Science with KNIME Server
Sharing and Deploying Data Science with KNIME ServerSharing and Deploying Data Science with KNIME Server
Sharing and Deploying Data Science with KNIME Server
 
Sap open connectors #sitcph
Sap open connectors #sitcphSap open connectors #sitcph
Sap open connectors #sitcph
 
AWS Finland Meetup 2019 April
AWS Finland Meetup 2019 AprilAWS Finland Meetup 2019 April
AWS Finland Meetup 2019 April
 
All About Angular and ArcGIS - Developers Forum - AC18
All About Angular and ArcGIS - Developers Forum - AC18All About Angular and ArcGIS - Developers Forum - AC18
All About Angular and ArcGIS - Developers Forum - AC18
 
DevOpsDays Amsterdam 2016 workshop
DevOpsDays Amsterdam 2016 workshopDevOpsDays Amsterdam 2016 workshop
DevOpsDays Amsterdam 2016 workshop
 
Cloud Developer Days - BigQuery
Cloud Developer Days - BigQueryCloud Developer Days - BigQuery
Cloud Developer Days - BigQuery
 
DevTest Labs en Azure (por Iván Cañizares)
DevTest Labs en Azure (por Iván Cañizares)DevTest Labs en Azure (por Iván Cañizares)
DevTest Labs en Azure (por Iván Cañizares)
 
Intro to Quantum GIS Desktop GIS
Intro to Quantum GIS Desktop GISIntro to Quantum GIS Desktop GIS
Intro to Quantum GIS Desktop GIS
 
An Introduction to Cloud Computing
An Introduction to Cloud ComputingAn Introduction to Cloud Computing
An Introduction to Cloud Computing
 
Cloud computing
Cloud computingCloud computing
Cloud computing
 
API Management: La Puerta de enlace (por Francisco Nieto)
API Management: La Puerta de enlace (por Francisco Nieto)API Management: La Puerta de enlace (por Francisco Nieto)
API Management: La Puerta de enlace (por Francisco Nieto)
 
Power Apps Deep Dive - Munchen 2019
Power Apps Deep Dive - Munchen 2019Power Apps Deep Dive - Munchen 2019
Power Apps Deep Dive - Munchen 2019
 
Open Source Story and what’s new in KNIME Software
Open Source Story and what’s new in KNIME SoftwareOpen Source Story and what’s new in KNIME Software
Open Source Story and what’s new in KNIME Software
 
Scaling and Fault-resistance strategies and geography
Scaling and Fault-resistance strategies and geographyScaling and Fault-resistance strategies and geography
Scaling and Fault-resistance strategies and geography
 
Sitech
SitechSitech
Sitech
 
AWS Dev Day 2018
AWS Dev Day 2018AWS Dev Day 2018
AWS Dev Day 2018
 
AWS Finland meetup 2017 August
AWS Finland meetup 2017 AugustAWS Finland meetup 2017 August
AWS Finland meetup 2017 August
 
dotCMS - Move Forward, Build Faster, Get Farther
dotCMS - Move Forward, Build Faster, Get FartherdotCMS - Move Forward, Build Faster, Get Farther
dotCMS - Move Forward, Build Faster, Get Farther
 
Real time serverless data pipelines on AWS
Real time serverless data pipelines on AWSReal time serverless data pipelines on AWS
Real time serverless data pipelines on AWS
 

Similar to Big data internship plan at Contemi Vietnam

SIC Finale Status Report August 6.pptx
SIC Finale Status Report August 6.pptxSIC Finale Status Report August 6.pptx
SIC Finale Status Report August 6.pptxShaista Ansari
 
Day 00 - Introduction to machine learning with big data
Day 00 - Introduction to machine learning with big dataDay 00 - Introduction to machine learning with big data
Day 00 - Introduction to machine learning with big data
ssusere5ddd6
 
Choosing a Cloud Provider: Public-Private-Hybrid
Choosing a Cloud Provider: Public-Private-HybridChoosing a Cloud Provider: Public-Private-Hybrid
Choosing a Cloud Provider: Public-Private-Hybrid
Robert Starmer
 
SAP Teched 2012 Session Tec3438 Automate IaaS SAP deployments
SAP Teched 2012 Session Tec3438 Automate IaaS SAP deploymentsSAP Teched 2012 Session Tec3438 Automate IaaS SAP deployments
SAP Teched 2012 Session Tec3438 Automate IaaS SAP deployments
Chris Kernaghan
 
Data Science in the Cloud
Data Science in the CloudData Science in the Cloud
Data Science in the Cloud
Margriet Groenendijk
 
Automating Infrastructure as a Service Deployments and monitoring – TEC213
Automating Infrastructure as a Service Deployments and monitoring – TEC213Automating Infrastructure as a Service Deployments and monitoring – TEC213
Automating Infrastructure as a Service Deployments and monitoring – TEC213
Chris Kernaghan
 
Getting started with GCP ( Google Cloud Platform)
Getting started with GCP ( Google  Cloud Platform)Getting started with GCP ( Google  Cloud Platform)
Getting started with GCP ( Google Cloud Platform)
bigdata trunk
 
Connecting DMPs & Repositories
Connecting DMPs & RepositoriesConnecting DMPs & Repositories
Connecting DMPs & Repositories
Sarah Jones
 
Train the Trainers: Cloud Computing & Big Data Workshop
Train the Trainers:  Cloud Computing & Big Data WorkshopTrain the Trainers:  Cloud Computing & Big Data Workshop
Train the Trainers: Cloud Computing & Big Data Workshop
IMC Institute
 
With Automated ML, is Everyone an ML Engineer?
With Automated ML, is Everyone an ML Engineer?With Automated ML, is Everyone an ML Engineer?
With Automated ML, is Everyone an ML Engineer?
Dan Sullivan, Ph.D.
 
Digital Asset Management: Intro & Career Path for Librarians
Digital Asset Management: Intro & Career Path for LibrariansDigital Asset Management: Intro & Career Path for Librarians
Digital Asset Management: Intro & Career Path for Librarians
Laura Fu
 
Google Associate Cloud Engineer Certification Tips
Google Associate Cloud Engineer Certification TipsGoogle Associate Cloud Engineer Certification Tips
Google Associate Cloud Engineer Certification Tips
Daniel Zivkovic
 
Challenges of Operationalising Data Science in Production
Challenges of Operationalising Data Science in ProductionChallenges of Operationalising Data Science in Production
Challenges of Operationalising Data Science in Production
iguazio
 
QuSandbox+NVIDIA Rapids
QuSandbox+NVIDIA RapidsQuSandbox+NVIDIA Rapids
QuSandbox+NVIDIA Rapids
QuantUniversity
 
Google Cloud Machine Learning
 Google Cloud Machine Learning  Google Cloud Machine Learning
Google Cloud Machine Learning
India Quotient
 
Top 7 mistakes
Top 7 mistakesTop 7 mistakes
Top 7 mistakes
Talbott Crowell
 
Serverless Big Data Architecture on Google Cloud Platform at Credit OK
Serverless Big Data Architecture on Google Cloud Platform at Credit OKServerless Big Data Architecture on Google Cloud Platform at Credit OK
Serverless Big Data Architecture on Google Cloud Platform at Credit OK
Kriangkrai Chaonithi
 
20140116 Tim Willoughby and James Fogarty FOSS in Local Government
20140116   Tim Willoughby and James Fogarty FOSS in Local Government20140116   Tim Willoughby and James Fogarty FOSS in Local Government
20140116 Tim Willoughby and James Fogarty FOSS in Local Government
Tim Willoughby
 
Architecting govCMS: Australian Government as a Service -
Architecting govCMS: Australian Government as a Service - Architecting govCMS: Australian Government as a Service -
Architecting govCMS: Australian Government as a Service -
David Peterson
 
22 May 2014 CDE competition: Information processing and sensemaking presentation
22 May 2014 CDE competition: Information processing and sensemaking presentation22 May 2014 CDE competition: Information processing and sensemaking presentation
22 May 2014 CDE competition: Information processing and sensemaking presentationDefence and Security Accelerator
 

Similar to Big data internship plan at Contemi Vietnam (20)

SIC Finale Status Report August 6.pptx
SIC Finale Status Report August 6.pptxSIC Finale Status Report August 6.pptx
SIC Finale Status Report August 6.pptx
 
Day 00 - Introduction to machine learning with big data
Day 00 - Introduction to machine learning with big dataDay 00 - Introduction to machine learning with big data
Day 00 - Introduction to machine learning with big data
 
Choosing a Cloud Provider: Public-Private-Hybrid
Choosing a Cloud Provider: Public-Private-HybridChoosing a Cloud Provider: Public-Private-Hybrid
Choosing a Cloud Provider: Public-Private-Hybrid
 
SAP Teched 2012 Session Tec3438 Automate IaaS SAP deployments
SAP Teched 2012 Session Tec3438 Automate IaaS SAP deploymentsSAP Teched 2012 Session Tec3438 Automate IaaS SAP deployments
SAP Teched 2012 Session Tec3438 Automate IaaS SAP deployments
 
Data Science in the Cloud
Data Science in the CloudData Science in the Cloud
Data Science in the Cloud
 
Automating Infrastructure as a Service Deployments and monitoring – TEC213
Automating Infrastructure as a Service Deployments and monitoring – TEC213Automating Infrastructure as a Service Deployments and monitoring – TEC213
Automating Infrastructure as a Service Deployments and monitoring – TEC213
 
Getting started with GCP ( Google Cloud Platform)
Getting started with GCP ( Google  Cloud Platform)Getting started with GCP ( Google  Cloud Platform)
Getting started with GCP ( Google Cloud Platform)
 
Connecting DMPs & Repositories
Connecting DMPs & RepositoriesConnecting DMPs & Repositories
Connecting DMPs & Repositories
 
Train the Trainers: Cloud Computing & Big Data Workshop
Train the Trainers:  Cloud Computing & Big Data WorkshopTrain the Trainers:  Cloud Computing & Big Data Workshop
Train the Trainers: Cloud Computing & Big Data Workshop
 
With Automated ML, is Everyone an ML Engineer?
With Automated ML, is Everyone an ML Engineer?With Automated ML, is Everyone an ML Engineer?
With Automated ML, is Everyone an ML Engineer?
 
Digital Asset Management: Intro & Career Path for Librarians
Digital Asset Management: Intro & Career Path for LibrariansDigital Asset Management: Intro & Career Path for Librarians
Digital Asset Management: Intro & Career Path for Librarians
 
Google Associate Cloud Engineer Certification Tips
Google Associate Cloud Engineer Certification TipsGoogle Associate Cloud Engineer Certification Tips
Google Associate Cloud Engineer Certification Tips
 
Challenges of Operationalising Data Science in Production
Challenges of Operationalising Data Science in ProductionChallenges of Operationalising Data Science in Production
Challenges of Operationalising Data Science in Production
 
QuSandbox+NVIDIA Rapids
QuSandbox+NVIDIA RapidsQuSandbox+NVIDIA Rapids
QuSandbox+NVIDIA Rapids
 
Google Cloud Machine Learning
 Google Cloud Machine Learning  Google Cloud Machine Learning
Google Cloud Machine Learning
 
Top 7 mistakes
Top 7 mistakesTop 7 mistakes
Top 7 mistakes
 
Serverless Big Data Architecture on Google Cloud Platform at Credit OK
Serverless Big Data Architecture on Google Cloud Platform at Credit OKServerless Big Data Architecture on Google Cloud Platform at Credit OK
Serverless Big Data Architecture on Google Cloud Platform at Credit OK
 
20140116 Tim Willoughby and James Fogarty FOSS in Local Government
20140116   Tim Willoughby and James Fogarty FOSS in Local Government20140116   Tim Willoughby and James Fogarty FOSS in Local Government
20140116 Tim Willoughby and James Fogarty FOSS in Local Government
 
Architecting govCMS: Australian Government as a Service -
Architecting govCMS: Australian Government as a Service - Architecting govCMS: Australian Government as a Service -
Architecting govCMS: Australian Government as a Service -
 
22 May 2014 CDE competition: Information processing and sensemaking presentation
22 May 2014 CDE competition: Information processing and sensemaking presentation22 May 2014 CDE competition: Information processing and sensemaking presentation
22 May 2014 CDE competition: Information processing and sensemaking presentation
 

More from Quang Nguyen

Intro to Hadoop
Intro to HadoopIntro to Hadoop
Intro to Hadoop
Quang Nguyen
 
Big Data Internship Program @Natural Science University
Big Data Internship Program @Natural Science UniversityBig Data Internship Program @Natural Science University
Big Data Internship Program @Natural Science UniversityQuang Nguyen
 
Intro to Big Data
Intro to Big DataIntro to Big Data
Intro to Big Data
Quang Nguyen
 
Hồ sơ mời tài trợ - ĐI MÔ FC
Hồ sơ mời tài trợ - ĐI MÔ FCHồ sơ mời tài trợ - ĐI MÔ FC
Hồ sơ mời tài trợ - ĐI MÔ FC
Quang Nguyen
 
Ho so xin tai tro giai bong da dong hanh den giang duong 2013
Ho so xin tai tro giai bong da dong hanh den giang duong 2013Ho so xin tai tro giai bong da dong hanh den giang duong 2013
Ho so xin tai tro giai bong da dong hanh den giang duong 2013Quang Nguyen
 
Gioi thieu du an dong hanh den giang duong 2013 vn
Gioi thieu du an dong hanh den giang duong 2013 vnGioi thieu du an dong hanh den giang duong 2013 vn
Gioi thieu du an dong hanh den giang duong 2013 vnQuang Nguyen
 
[Giaibongda.com] fi 7 min slides
[Giaibongda.com] fi 7 min slides[Giaibongda.com] fi 7 min slides
[Giaibongda.com] fi 7 min slidesQuang Nguyen
 
[Echelon 2012 vietnam satellite] giaibongda.com pitch deck
[Echelon 2012 vietnam satellite] giaibongda.com pitch deck[Echelon 2012 vietnam satellite] giaibongda.com pitch deck
[Echelon 2012 vietnam satellite] giaibongda.com pitch deckQuang Nguyen
 
[Giaibongda.com] FI 3-min pitch
[Giaibongda.com] FI 3-min pitch[Giaibongda.com] FI 3-min pitch
[Giaibongda.com] FI 3-min pitch
Quang Nguyen
 
giaibongda.com pitch deck
giaibongda.com pitch deckgiaibongda.com pitch deck
giaibongda.com pitch deck
Quang Nguyen
 
Agile development @open consultant offline
Agile development @open consultant offlineAgile development @open consultant offline
Agile development @open consultant offline
Quang Nguyen
 
[4interns.vn] hành trang thực tập
[4interns.vn] hành trang thực tập[4interns.vn] hành trang thực tập
[4interns.vn] hành trang thực tập
Quang Nguyen
 
Lean startup overview @ipl offline
Lean startup overview @ipl offlineLean startup overview @ipl offline
Lean startup overview @ipl offline
Quang Nguyen
 
Lean startup
Lean startupLean startup
Lean startup
Quang Nguyen
 
Keep the eyes beyond competition
Keep the eyes beyond competitionKeep the eyes beyond competition
Keep the eyes beyond competitionQuang Nguyen
 

More from Quang Nguyen (15)

Intro to Hadoop
Intro to HadoopIntro to Hadoop
Intro to Hadoop
 
Big Data Internship Program @Natural Science University
Big Data Internship Program @Natural Science UniversityBig Data Internship Program @Natural Science University
Big Data Internship Program @Natural Science University
 
Intro to Big Data
Intro to Big DataIntro to Big Data
Intro to Big Data
 
Hồ sơ mời tài trợ - ĐI MÔ FC
Hồ sơ mời tài trợ - ĐI MÔ FCHồ sơ mời tài trợ - ĐI MÔ FC
Hồ sơ mời tài trợ - ĐI MÔ FC
 
Ho so xin tai tro giai bong da dong hanh den giang duong 2013
Ho so xin tai tro giai bong da dong hanh den giang duong 2013Ho so xin tai tro giai bong da dong hanh den giang duong 2013
Ho so xin tai tro giai bong da dong hanh den giang duong 2013
 
Gioi thieu du an dong hanh den giang duong 2013 vn
Gioi thieu du an dong hanh den giang duong 2013 vnGioi thieu du an dong hanh den giang duong 2013 vn
Gioi thieu du an dong hanh den giang duong 2013 vn
 
[Giaibongda.com] fi 7 min slides
[Giaibongda.com] fi 7 min slides[Giaibongda.com] fi 7 min slides
[Giaibongda.com] fi 7 min slides
 
[Echelon 2012 vietnam satellite] giaibongda.com pitch deck
[Echelon 2012 vietnam satellite] giaibongda.com pitch deck[Echelon 2012 vietnam satellite] giaibongda.com pitch deck
[Echelon 2012 vietnam satellite] giaibongda.com pitch deck
 
[Giaibongda.com] FI 3-min pitch
[Giaibongda.com] FI 3-min pitch[Giaibongda.com] FI 3-min pitch
[Giaibongda.com] FI 3-min pitch
 
giaibongda.com pitch deck
giaibongda.com pitch deckgiaibongda.com pitch deck
giaibongda.com pitch deck
 
Agile development @open consultant offline
Agile development @open consultant offlineAgile development @open consultant offline
Agile development @open consultant offline
 
[4interns.vn] hành trang thực tập
[4interns.vn] hành trang thực tập[4interns.vn] hành trang thực tập
[4interns.vn] hành trang thực tập
 
Lean startup overview @ipl offline
Lean startup overview @ipl offlineLean startup overview @ipl offline
Lean startup overview @ipl offline
 
Lean startup
Lean startupLean startup
Lean startup
 
Keep the eyes beyond competition
Keep the eyes beyond competitionKeep the eyes beyond competition
Keep the eyes beyond competition
 

Recently uploaded

By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
nkrafacyberclub
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
ThomasParaiso2
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
Neo4j
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
Matthew Sinclair
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
Peter Spielvogel
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 

Recently uploaded (20)

By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 

Big data internship plan at Contemi Vietnam

  • 1. BIG DATA FIRST INTERNSHIP Sep 2013 – Jan 2014 Organized by Contemi Vietnam
  • 2. Outcomes • Contemi • Big Data presence • Big Data know-how • Big Data experience • Interns • Linux • R / Python language • Machine Learning practice • Process • Scrum • Cross Industry Standard Process for Data Mining (CRISM-DM) • Kaggle profile • Hadoop
  • 3. Preparation • Platform: Ubuntu 12.04 LTS • Process: • Scrum • Cross Industry Standard Process for Data Mining (CRISP DM) • Weekly blog • http://contanalytics.wordpress.com/
  • 4. Headstart for Dung • 16/09 – 30/09 • Learn R / Python • Try Digit Recognizer competition on Kaggle.com • Join in Introduction to Recommender System and Web Intelligence and Big Data on Coursera.com
  • 5. 3 month plan • 1/10 – 31/10 • Go through all typical Machine Learning algorithms, implement, demo and present to Contemi • 1/11 – 15/11 • Compete for AMS 2013-2014 Solar Energy Prediction Contest • URL: http://www.kaggle.com/c/ams-2014-solar-energy-prediction-contest • 16/11 – 22/11 • Compete for Accelerometer Biometric Competition • URL: http://www.kaggle.com/c/accelerometer-biometric-competition • 23/11 – 31/12 (end of internship) • Deploy Hadoop • Learn Java • Run Word counting and Sorting experiments with large data (> 1GB) • Compete for Facebook Recruting III – Keyword Extraction (personally) • Re-optimize built model basing on Hadoop
  • 6. Next plan for next internships • App using Singapore open datasets • Stock prediction app for Vietnam market • Visualization • GitHub • R-Bloggers