SlideShare a Scribd company logo
HADOOP PROJECT
Submitted By
Ankush Roy
Apuroop Pulabhatla
Ashitha VS
Krishna Bollojula
Srikanth Mallya
Koushik Rakshit
Roma Agrawal
CONTENTS
Business Problem
Understanding Data
Architecture
Psuedo code
Business Implication
BUSINESS PROBLEM
We need to find the adjusted closing price for
each day a stock has not reported dividend.
We are given two data sets:
1Dividends – A CSV file that gives us information
about the dates a dividend is returned.
2. NYSE_daily_prices – A bunch of CSV files that has
all the information about stock prices.
continued…
• We have to retrieve the dates when
dividends have a value 0, from the
dividends data set and match that with
the NYSE daily prices data set and hence
retrieve the corresponding adjusted
closing price.
.
UNDERSTANDING DATA
 The NYSE Daily Prices File has the following
columns separated by commas(CSV File):
exchange, stock_symbol, date, stock_price_open,
stock_price_high, stock_price_low, stock_price_close,
stock_volume, stock_price_adj_close
 The NYSE Daily Dividends File has the
following columns separated by commas(CSV
File):
exchange, stock_symbol, date, dividends
ARCHITECTURE
Mapper
Reducer
Output
Input Files
<Stock_symbol, Date, X, X, X, X, X
Adjusted_closing_price>
<Stock_symbol | Date, Adjusted_closing_price>
<Stock_symbol | Date, Dividend>
<Stock_symbol | Date, Adjusted_closing_price>
PSEUDOCODE - MAPPER
public static class StockAnalysisMapper extends MapReduceBase
implements Mapper<LongWritable, Text, Text, Text>
{
@Override
public void map(LongWritable key, Text
value,OutputCollector<Text, Text> output, Reporter reporter)
throws IOException
{
// switch case to parse the input lines and store the data
// check for null values in the key
// check the header and send the key value to output collector
}
}
PSEUDOCODE-REDUCER
public static class StockAnalysisReducer extends MapReduceBase implements
Reducer<Text, Text, Text, Text>
{
@Override
public void reduce(Text key, Iterator<Text> values,OutputCollector<Text, Text> output,
Reporter reporter) throws IOException
{
while (values.hasNext())
{
// Parse the inputs which are count,stock adjusted closing price and check
// Store them as required after parsing
//check for null values of stock adjusted closing price
}
//Increment the sum
// write to output if sum is 1
}
}
BUSINESS IMPLICATION
The closing price of a stock is exactly that: the
price of that stock at the close of the trading day.
The adjusted closing price uses the closing price
as a starting point, but it takes into account factors
such as dividends, stock splits and new stock
offerings.
The adjusted closing price represents a more
accurate reflection of a stock's value, since
distributions and new offerings can alter the
closing price.
Continued…
The primary use for the adjusted closing
price is as a means to develop an
accurate track record of a stock's
performance.
The comparison of a stock's historical
adjusted closing price to its current price
shows the true rate of return.
Thank you

More Related Content

Similar to Hadoop

Introduction of ssis
Introduction of ssisIntroduction of ssis
Introduction of ssis
deepakk073
 
Mysql Fun
Mysql FunMysql Fun
Mysql Fun
SHC
 
Understand when to use user defined functions in sql server tech-republic
Understand when to use user defined functions in sql server   tech-republicUnderstand when to use user defined functions in sql server   tech-republic
Understand when to use user defined functions in sql server tech-republic
Kaing Menglieng
 
Educational Objectives After successfully completing this assignmen.pdf
Educational Objectives After successfully completing this assignmen.pdfEducational Objectives After successfully completing this assignmen.pdf
Educational Objectives After successfully completing this assignmen.pdf
rajeshjangid1865
 

Similar to Hadoop (20)

Market Index Forecasting Model
Market Index Forecasting ModelMarket Index Forecasting Model
Market Index Forecasting Model
 
Introduction to-mongo db-execution-plan-optimizer-final
Introduction to-mongo db-execution-plan-optimizer-finalIntroduction to-mongo db-execution-plan-optimizer-final
Introduction to-mongo db-execution-plan-optimizer-final
 
Introduction to Mongodb execution plan and optimizer
Introduction to Mongodb execution plan and optimizerIntroduction to Mongodb execution plan and optimizer
Introduction to Mongodb execution plan and optimizer
 
Introduction of ssis
Introduction of ssisIntroduction of ssis
Introduction of ssis
 
MuleSoft Nashik Virtual Meetup#3 - Deep Dive Into DataWeave and its Module
MuleSoft Nashik Virtual  Meetup#3 - Deep Dive Into DataWeave and its ModuleMuleSoft Nashik Virtual  Meetup#3 - Deep Dive Into DataWeave and its Module
MuleSoft Nashik Virtual Meetup#3 - Deep Dive Into DataWeave and its Module
 
Mysql Fun
Mysql FunMysql Fun
Mysql Fun
 
esProc introduction
esProc introductionesProc introduction
esProc introduction
 
Final Project SQL - Elyada Wigati Pramaresti.pptx
Final Project SQL - Elyada Wigati Pramaresti.pptxFinal Project SQL - Elyada Wigati Pramaresti.pptx
Final Project SQL - Elyada Wigati Pramaresti.pptx
 
Understand when to use user defined functions in sql server tech-republic
Understand when to use user defined functions in sql server   tech-republicUnderstand when to use user defined functions in sql server   tech-republic
Understand when to use user defined functions in sql server tech-republic
 
whats New in axapta 2012
whats New in axapta 2012whats New in axapta 2012
whats New in axapta 2012
 
Educational Objectives After successfully completing this assignmen.pdf
Educational Objectives After successfully completing this assignmen.pdfEducational Objectives After successfully completing this assignmen.pdf
Educational Objectives After successfully completing this assignmen.pdf
 
Oracle_Analytical_function.pdf
Oracle_Analytical_function.pdfOracle_Analytical_function.pdf
Oracle_Analytical_function.pdf
 
Agile Database Development with JSON
Agile Database Development with JSONAgile Database Development with JSON
Agile Database Development with JSON
 
Spring data ii
Spring data iiSpring data ii
Spring data ii
 
4 things maybe you don't know about nasdaq-100 (posted 23th June 2017)
4 things maybe you don't know about nasdaq-100 (posted 23th June 2017)4 things maybe you don't know about nasdaq-100 (posted 23th June 2017)
4 things maybe you don't know about nasdaq-100 (posted 23th June 2017)
 
Realize the potential of sap material ledger
Realize the potential of sap material ledgerRealize the potential of sap material ledger
Realize the potential of sap material ledger
 
7th meeting - Analysis Toolpak - Solver - Forecast Worksheet
7th meeting - Analysis Toolpak - Solver - Forecast Worksheet7th meeting - Analysis Toolpak - Solver - Forecast Worksheet
7th meeting - Analysis Toolpak - Solver - Forecast Worksheet
 
CIS 115 Become Exceptional--cis115.com
CIS 115 Become Exceptional--cis115.comCIS 115 Become Exceptional--cis115.com
CIS 115 Become Exceptional--cis115.com
 
MatPlotLib
MatPlotLibMatPlotLib
MatPlotLib
 
Excel formula
Excel formula Excel formula
Excel formula
 

More from Krishna Bollojula

Krishna Bollojula Analyst Certificate
Krishna Bollojula  Analyst CertificateKrishna Bollojula  Analyst Certificate
Krishna Bollojula Analyst Certificate
Krishna Bollojula
 

More from Krishna Bollojula (18)

Krishna Bollojula Analyst Certificate
Krishna Bollojula  Analyst CertificateKrishna Bollojula  Analyst Certificate
Krishna Bollojula Analyst Certificate
 
Enterprise performance management
Enterprise performance managementEnterprise performance management
Enterprise performance management
 
Multiple Regression using SAS
Multiple Regression using SASMultiple Regression using SAS
Multiple Regression using SAS
 
Wine quality Analysis
Wine quality AnalysisWine quality Analysis
Wine quality Analysis
 
NYE Stock analysis
NYE Stock analysisNYE Stock analysis
NYE Stock analysis
 
Blind product test- Market Research
Blind product test- Market ResearchBlind product test- Market Research
Blind product test- Market Research
 
Adopting Analytics in BFSI
Adopting Analytics in BFSIAdopting Analytics in BFSI
Adopting Analytics in BFSI
 
Efficacy of drugs - SAS Case Study
Efficacy of drugs - SAS Case StudyEfficacy of drugs - SAS Case Study
Efficacy of drugs - SAS Case Study
 
ltu37276_en_certificate
ltu37276_en_certificateltu37276_en_certificate
ltu37276_en_certificate
 
Future groups
Future groupsFuture groups
Future groups
 
Analysis of Reviews on Sony Z3
Analysis of Reviews on Sony Z3Analysis of Reviews on Sony Z3
Analysis of Reviews on Sony Z3
 
Tela sales analysis
Tela sales analysisTela sales analysis
Tela sales analysis
 
Comparison of drugs to prevent post anesthesia
Comparison of drugs to prevent post anesthesiaComparison of drugs to prevent post anesthesia
Comparison of drugs to prevent post anesthesia
 
Blind Product Test - Data Analysis
Blind Product Test - Data AnalysisBlind Product Test - Data Analysis
Blind Product Test - Data Analysis
 
EPM - Hotel Industry
EPM - Hotel IndustryEPM - Hotel Industry
EPM - Hotel Industry
 
Wine ppt template
Wine ppt templateWine ppt template
Wine ppt template
 
Multiple regression
Multiple regressionMultiple regression
Multiple regression
 
Brand dossier on dettol
Brand dossier on dettolBrand dossier on dettol
Brand dossier on dettol
 

Recently uploaded

Computer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sComputer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage s
MAQIB18
 
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Domenico Conte
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
vcaxypu
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
ukgaet
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
nscud
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
StarCompliance.io
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
benishzehra469
 
Introduction-to-Cybersecurit57hhfcbbcxxx
Introduction-to-Cybersecurit57hhfcbbcxxxIntroduction-to-Cybersecurit57hhfcbbcxxx
Introduction-to-Cybersecurit57hhfcbbcxxx
zahraomer517
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
ocavb
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
ewymefz
 

Recently uploaded (20)

Computer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sComputer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage s
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
 
Using PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBUsing PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDB
 
Introduction-to-Cybersecurit57hhfcbbcxxx
Introduction-to-Cybersecurit57hhfcbbcxxxIntroduction-to-Cybersecurit57hhfcbbcxxx
Introduction-to-Cybersecurit57hhfcbbcxxx
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
 

Hadoop

  • 1. HADOOP PROJECT Submitted By Ankush Roy Apuroop Pulabhatla Ashitha VS Krishna Bollojula Srikanth Mallya Koushik Rakshit Roma Agrawal
  • 3. BUSINESS PROBLEM We need to find the adjusted closing price for each day a stock has not reported dividend. We are given two data sets: 1Dividends – A CSV file that gives us information about the dates a dividend is returned. 2. NYSE_daily_prices – A bunch of CSV files that has all the information about stock prices.
  • 4. continued… • We have to retrieve the dates when dividends have a value 0, from the dividends data set and match that with the NYSE daily prices data set and hence retrieve the corresponding adjusted closing price. .
  • 5. UNDERSTANDING DATA  The NYSE Daily Prices File has the following columns separated by commas(CSV File): exchange, stock_symbol, date, stock_price_open, stock_price_high, stock_price_low, stock_price_close, stock_volume, stock_price_adj_close  The NYSE Daily Dividends File has the following columns separated by commas(CSV File): exchange, stock_symbol, date, dividends
  • 6. ARCHITECTURE Mapper Reducer Output Input Files <Stock_symbol, Date, X, X, X, X, X Adjusted_closing_price> <Stock_symbol | Date, Adjusted_closing_price> <Stock_symbol | Date, Dividend> <Stock_symbol | Date, Adjusted_closing_price>
  • 7. PSEUDOCODE - MAPPER public static class StockAnalysisMapper extends MapReduceBase implements Mapper<LongWritable, Text, Text, Text> { @Override public void map(LongWritable key, Text value,OutputCollector<Text, Text> output, Reporter reporter) throws IOException { // switch case to parse the input lines and store the data // check for null values in the key // check the header and send the key value to output collector } }
  • 8. PSEUDOCODE-REDUCER public static class StockAnalysisReducer extends MapReduceBase implements Reducer<Text, Text, Text, Text> { @Override public void reduce(Text key, Iterator<Text> values,OutputCollector<Text, Text> output, Reporter reporter) throws IOException { while (values.hasNext()) { // Parse the inputs which are count,stock adjusted closing price and check // Store them as required after parsing //check for null values of stock adjusted closing price } //Increment the sum // write to output if sum is 1 } }
  • 9. BUSINESS IMPLICATION The closing price of a stock is exactly that: the price of that stock at the close of the trading day. The adjusted closing price uses the closing price as a starting point, but it takes into account factors such as dividends, stock splits and new stock offerings. The adjusted closing price represents a more accurate reflection of a stock's value, since distributions and new offerings can alter the closing price.
  • 10. Continued… The primary use for the adjusted closing price is as a means to develop an accurate track record of a stock's performance. The comparison of a stock's historical adjusted closing price to its current price shows the true rate of return.