SlideShare a Scribd company logo
1 of 12
Download to read offline
TE - A DSBDA MiniProject Assignment
Roll No. Name of the student MiniProject1 MiniProject 2
3101001 ADHAV SOHAM GANESH
Write a case study on Global Innovation Network and Analysis (GINA). Components of
analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning
analytic technique 4. Results and Key findings.
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101002 AMBORKAR BHAVESH SUNIL
Write a case study on Global Innovation Network and Analysis (GINA). Components of
analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning
analytic technique 4. Results and Key findings.
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101003 ANUBHAW MISHRA
Write a case study on Global Innovation Network and Analysis (GINA). Components of
analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning
analytic technique 4. Results and Key findings.
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101004 BADE AASTHA RAMESH
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101005 BENDALE AYUSH YOGESH
Write a case study on Global Innovation Network and Analysis (GINA). Components of
analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning
analytic technique 4. Results and Key findings.
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101006 Bhide Ashwini Vinay
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Use the following covid_vaccine_statewise.csv dataset and perform
following analytics on the
given dataset
https://www.kaggle.com/sudalairajkumar/covid19-in-india?
select=covid_vaccine_statewise.csv
a. Describe the dataset
b. Number of persons state wise vaccinated for first dose in India
c. Number of persons state wise vaccinated for second dose in India
d. Number of Males vaccinated
d. Number of females vaccinated
3101007 BORA MITALI MAHENDRA
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Use the following covid_vaccine_statewise.csv dataset and perform
following analytics on the
given dataset
https://www.kaggle.com/sudalairajkumar/covid19-in-india?
select=covid_vaccine_statewise.csv
a. Describe the dataset
b. Number of persons state wise vaccinated for first dose in India
c. Number of persons state wise vaccinated for second dose in India
d. Number of Males vaccinated
d. Number of females vaccinated
3101008 CHAUDHARI ANIKET RAJENDRA
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Use the following covid_vaccine_statewise.csv dataset and perform
following analytics on the
given dataset
https://www.kaggle.com/sudalairajkumar/covid19-in-india?
select=covid_vaccine_statewise.csv
a. Describe the dataset
b. Number of persons state wise vaccinated for first dose in India
c. Number of persons state wise vaccinated for second dose in India
d. Number of Males vaccinated
d. Number of females vaccinated
3101009 CHAUDHARI DNYANESHWAR JAYANT
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Use the following covid_vaccine_statewise.csv dataset and perform
following analytics on the
given dataset
https://www.kaggle.com/sudalairajkumar/covid19-in-india?
select=covid_vaccine_statewise.csv
a. Describe the dataset
b. Number of persons state wise vaccinated for first dose in India
c. Number of persons state wise vaccinated for second dose in India
d. Number of Males vaccinated
d. Number of females vaccinated
3101010 CHAUDHARI MANISHA KANARAM
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Use the following covid_vaccine_statewise.csv dataset and perform
following analytics on the
given dataset
https://www.kaggle.com/sudalairajkumar/covid19-in-india?
select=covid_vaccine_statewise.csv
a. Describe the dataset
b. Number of persons state wise vaccinated for first dose in India
c. Number of persons state wise vaccinated for second dose in India
d. Number of Males vaccinated
d. Number of females vaccinated
3101011 CHAVAN VEDANG GANESH
"Write a case study to process data driven for Digital Marketing OR Health care systems with
Hadoop Ecosystem components as shown. (Mandatory)
โ— HDFS: Hadoop Distributed File System
โ— YARN: Yet Another Resource Negotiator
โ— MapReduce: Programming based Data Processing
โ— Spark: In-Memory data processing
โ— PIG, HIVE: Query based processing of data services
โ— HBase: NoSQL Database (Provides real-time reads and writes)
โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm
libraries
โ— Solar, Lucene: Searching and Indexing"
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101012 Dadge Krishna Anil
"Write a case study to process data driven for Digital Marketing OR Health care systems with
Hadoop Ecosystem components as shown. (Mandatory)
โ— HDFS: Hadoop Distributed File System
โ— YARN: Yet Another Resource Negotiator
โ— MapReduce: Programming based Data Processing
โ— Spark: In-Memory data processing
โ— PIG, HIVE: Query based processing of data services
โ— HBase: NoSQL Database (Provides real-time reads and writes)
โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm
libraries
โ— Solar, Lucene: Searching and Indexing"
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101013 DALAL PREYASH PARESH
"Write a case study to process data driven for Digital Marketing OR Health care systems with
Hadoop Ecosystem components as shown. (Mandatory)
โ— HDFS: Hadoop Distributed File System
โ— YARN: Yet Another Resource Negotiator
โ— MapReduce: Programming based Data Processing
โ— Spark: In-Memory data processing
โ— PIG, HIVE: Query based processing of data services
โ— HBase: NoSQL Database (Provides real-time reads and writes)
โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm
libraries
โ— Solar, Lucene: Searching and Indexing"
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101014 DESAI NIRANJAN VIKAS
"Write a case study to process data driven for Digital Marketing OR Health care systems with
Hadoop Ecosystem components as shown. (Mandatory)
โ— HDFS: Hadoop Distributed File System
โ— YARN: Yet Another Resource Negotiator
โ— MapReduce: Programming based Data Processing
โ— Spark: In-Memory data processing
โ— PIG, HIVE: Query based processing of data services
โ— HBase: NoSQL Database (Provides real-time reads and writes)
โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm
libraries
โ— Solar, Lucene: Searching and Indexing"
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101015 DESHMUKH OM RAMDAS
"Write a case study to process data driven for Digital Marketing OR Health care systems with
Hadoop Ecosystem components as shown. (Mandatory)
โ— HDFS: Hadoop Distributed File System
โ— YARN: Yet Another Resource Negotiator
โ— MapReduce: Programming based Data Processing
โ— Spark: In-Memory data processing
โ— PIG, HIVE: Query based processing of data services
โ— HBase: NoSQL Database (Provides real-time reads and writes)
โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm
libraries
โ— Solar, Lucene: Searching and Indexing"
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101016 DHAMAL OMKAR DATTATRAY
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Write a case study on Global Innovation Network and Analysis (GINA).
Components of analytic plan are 1. Discovery business problem
framed, 2. Data, 3. Model planning analytic technique 4. Results and
Key findings.
3101017 DHANAWADE SAKSHI HANUMANT
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Write a case study on Global Innovation Network and Analysis (GINA).
Components of analytic plan are 1. Discovery business problem
framed, 2. Data, 3. Model planning analytic technique 4. Results and
Key findings.
3101018 DHANE ANIKET ARUN
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Write a case study on Global Innovation Network and Analysis (GINA).
Components of analytic plan are 1. Discovery business problem
framed, 2. Data, 3. Model planning analytic technique 4. Results and
Key findings.
3101019 Dhole Yash Bandu
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Write a case study on Global Innovation Network and Analysis (GINA).
Components of analytic plan are 1. Discovery business problem
framed, 2. Data, 3. Model planning analytic technique 4. Results and
Key findings.
3101020 DUSHING SHIVANI PRASHANT
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Write a case study on Global Innovation Network and Analysis (GINA).
Components of analytic plan are 1. Discovery business problem
framed, 2. Data, 3. Model planning analytic technique 4. Results and
Key findings.
3101021 Gaikwad Rutuja RAVINDRA
Write a case study on Global Innovation Network and Analysis (GINA). Components of
analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning
analytic technique 4. Results and Key findings.
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101022 GAIKWAD VIVEK SATISH
Write a case study on Global Innovation Network and Analysis (GINA). Components of
analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning
analytic technique 4. Results and Key findings.
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101023 GANJALE SAURABH SANJAY
Write a case study on Global Innovation Network and Analysis (GINA). Components of
analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning
analytic technique 4. Results and Key findings.
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101024 GHATTE PRATIK GAJANAN
Write a case study on Global Innovation Network and Analysis (GINA). Components of
analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning
analytic technique 4. Results and Key findings.
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101025 HADOLE SAKSHI SHIVAJI
Write a case study on Global Innovation Network and Analysis (GINA). Components of
analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning
analytic technique 4. Results and Key findings.
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101026 HEDAU VEDANT RAJESH
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Use the following covid_vaccine_statewise.csv dataset and perform
following analytics on the
given dataset
https://www.kaggle.com/sudalairajkumar/covid19-in-india?
select=covid_vaccine_statewise.csv
a. Describe the dataset
b. Number of persons state wise vaccinated for first dose in India
c. Number of persons state wise vaccinated for second dose in India
d. Number of Males vaccinated
d. Number of females vaccinated
3101027 HOLE SAKSHI VIJAY
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Use the following covid_vaccine_statewise.csv dataset and perform
following analytics on the
given dataset
https://www.kaggle.com/sudalairajkumar/covid19-in-india?
select=covid_vaccine_statewise.csv
a. Describe the dataset
b. Number of persons state wise vaccinated for first dose in India
c. Number of persons state wise vaccinated for second dose in India
d. Number of Males vaccinated
d. Number of females vaccinated
3101028 INGLE YASH RAJABHAU
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Use the following covid_vaccine_statewise.csv dataset and perform
following analytics on the
given dataset
https://www.kaggle.com/sudalairajkumar/covid19-in-india?
select=covid_vaccine_statewise.csv
a. Describe the dataset
b. Number of persons state wise vaccinated for first dose in India
c. Number of persons state wise vaccinated for second dose in India
d. Number of Males vaccinated
d. Number of females vaccinated
3101029 JAGTAP GOURIE BABBAJEEH
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Use the following covid_vaccine_statewise.csv dataset and perform
following analytics on the
given dataset
https://www.kaggle.com/sudalairajkumar/covid19-in-india?
select=covid_vaccine_statewise.csv
a. Describe the dataset
b. Number of persons state wise vaccinated for first dose in India
c. Number of persons state wise vaccinated for second dose in India
d. Number of Males vaccinated
d. Number of females vaccinated
3101030 Kale Vaibhav Bhausaheb
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Use the following covid_vaccine_statewise.csv dataset and perform
following analytics on the
given dataset
https://www.kaggle.com/sudalairajkumar/covid19-in-india?
select=covid_vaccine_statewise.csv
a. Describe the dataset
b. Number of persons state wise vaccinated for first dose in India
c. Number of persons state wise vaccinated for second dose in India
d. Number of Males vaccinated
d. Number of females vaccinated
3101031 KARPE SAMRUDDHI AJIT
"Write a case study to process data driven for Digital Marketing OR Health care systems with
Hadoop Ecosystem components as shown. (Mandatory)
โ— HDFS: Hadoop Distributed File System
โ— YARN: Yet Another Resource Negotiator
โ— MapReduce: Programming based Data Processing
โ— Spark: In-Memory data processing
โ— PIG, HIVE: Query based processing of data services
โ— HBase: NoSQL Database (Provides real-time reads and writes)
โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm
libraries
โ— Solar, Lucene: Searching and Indexing"
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101032 KHAIRE SWAMINI VINOD
"Write a case study to process data driven for Digital Marketing OR Health care systems with
Hadoop Ecosystem components as shown. (Mandatory)
โ— HDFS: Hadoop Distributed File System
โ— YARN: Yet Another Resource Negotiator
โ— MapReduce: Programming based Data Processing
โ— Spark: In-Memory data processing
โ— PIG, HIVE: Query based processing of data services
โ— HBase: NoSQL Database (Provides real-time reads and writes)
โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm
libraries
โ— Solar, Lucene: Searching and Indexing"
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101033 KHARADE ROHIT SHAHAJI
"Write a case study to process data driven for Digital Marketing OR Health care systems with
Hadoop Ecosystem components as shown. (Mandatory)
โ— HDFS: Hadoop Distributed File System
โ— YARN: Yet Another Resource Negotiator
โ— MapReduce: Programming based Data Processing
โ— Spark: In-Memory data processing
โ— PIG, HIVE: Query based processing of data services
โ— HBase: NoSQL Database (Provides real-time reads and writes)
โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm
libraries
โ— Solar, Lucene: Searching and Indexing"
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101034 Kharat Vishwajeet Dipak
"Write a case study to process data driven for Digital Marketing OR Health care systems with
Hadoop Ecosystem components as shown. (Mandatory)
โ— HDFS: Hadoop Distributed File System
โ— YARN: Yet Another Resource Negotiator
โ— MapReduce: Programming based Data Processing
โ— Spark: In-Memory data processing
โ— PIG, HIVE: Query based processing of data services
โ— HBase: NoSQL Database (Provides real-time reads and writes)
โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm
libraries
โ— Solar, Lucene: Searching and Indexing"
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101035 KHIRID SAHIL GANESH
"Write a case study to process data driven for Digital Marketing OR Health care systems with
Hadoop Ecosystem components as shown. (Mandatory)
โ— HDFS: Hadoop Distributed File System
โ— YARN: Yet Another Resource Negotiator
โ— MapReduce: Programming based Data Processing
โ— Spark: In-Memory data processing
โ— PIG, HIVE: Query based processing of data services
โ— HBase: NoSQL Database (Provides real-time reads and writes)
โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm
libraries
โ— Solar, Lucene: Searching and Indexing"
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101036 KHOLGADE CHETAN GAJANAN
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Write a case study on Global Innovation Network and Analysis (GINA).
Components of analytic plan are 1. Discovery business problem
framed, 2. Data, 3. Model planning analytic technique 4. Results and
Key findings.
3101037 KOTHAWADE CHIRAG RAJENDRA
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Write a case study on Global Innovation Network and Analysis (GINA).
Components of analytic plan are 1. Discovery business problem
framed, 2. Data, 3. Model planning analytic technique 4. Results and
Key findings.
3101038 LAVHALE ABHIJIT KADAJI
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Write a case study on Global Innovation Network and Analysis (GINA).
Components of analytic plan are 1. Discovery business problem
framed, 2. Data, 3. Model planning analytic technique 4. Results and
Key findings.
3101039 MADANE TUSHAR BHAGWAN
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Write a case study on Global Innovation Network and Analysis (GINA).
Components of analytic plan are 1. Discovery business problem
framed, 2. Data, 3. Model planning analytic technique 4. Results and
Key findings.
3101040 MAHAJAN PRASHANT RAMKRUSHNA
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Write a case study on Global Innovation Network and Analysis (GINA).
Components of analytic plan are 1. Discovery business problem
framed, 2. Data, 3. Model planning analytic technique 4. Results and
Key findings.
3101041 MALLICK NASIMA YASMIN ILIUS ALI
Write a case study on Global Innovation Network and Analysis (GINA). Components of
analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning
analytic technique 4. Results and Key findings.
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101042 MEHKARKAR PRATHMESH GIRIPRASAD
Write a case study on Global Innovation Network and Analysis (GINA). Components of
analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning
analytic technique 4. Results and Key findings.
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101043 MOHITE PRAJWAL RAGHAVENDRA
Write a case study on Global Innovation Network and Analysis (GINA). Components of
analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning
analytic technique 4. Results and Key findings.
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101044 More Mrunali Sunil
Write a case study on Global Innovation Network and Analysis (GINA). Components of
analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning
analytic technique 4. Results and Key findings.
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101045 MORE VISHAL SANJAY
Write a case study on Global Innovation Network and Analysis (GINA). Components of
analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning
analytic technique 4. Results and Key findings.
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101046 NAGE AKANKSHA KANIPHNATH
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Use the following covid_vaccine_statewise.csv dataset and perform
following analytics on the
given dataset
https://www.kaggle.com/sudalairajkumar/covid19-in-india?
select=covid_vaccine_statewise.csv
a. Describe the dataset
b. Number of persons state wise vaccinated for first dose in India
c. Number of persons state wise vaccinated for second dose in India
d. Number of Males vaccinated
d. Number of females vaccinated
3101047 NAWALKAR ANIKET GAJANAN
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Use the following covid_vaccine_statewise.csv dataset and perform
following analytics on the
given dataset
https://www.kaggle.com/sudalairajkumar/covid19-in-india?
select=covid_vaccine_statewise.csv
a. Describe the dataset
b. Number of persons state wise vaccinated for first dose in India
c. Number of persons state wise vaccinated for second dose in India
d. Number of Males vaccinated
d. Number of females vaccinated
3101048 NIKAM RUTUJA RAMDAS
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Use the following covid_vaccine_statewise.csv dataset and perform
following analytics on the
given dataset
https://www.kaggle.com/sudalairajkumar/covid19-in-india?
select=covid_vaccine_statewise.csv
a. Describe the dataset
b. Number of persons state wise vaccinated for first dose in India
c. Number of persons state wise vaccinated for second dose in India
d. Number of Males vaccinated
d. Number of females vaccinated
3101049 Padavi Harishchandra Ravindra
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Use the following covid_vaccine_statewise.csv dataset and perform
following analytics on the
given dataset
https://www.kaggle.com/sudalairajkumar/covid19-in-india?
select=covid_vaccine_statewise.csv
a. Describe the dataset
b. Number of persons state wise vaccinated for first dose in India
c. Number of persons state wise vaccinated for second dose in India
d. Number of Males vaccinated
d. Number of females vaccinated
3101050 PANSARE NISARG MANOHAR
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Use the following covid_vaccine_statewise.csv dataset and perform
following analytics on the
given dataset
https://www.kaggle.com/sudalairajkumar/covid19-in-india?
select=covid_vaccine_statewise.csv
a. Describe the dataset
b. Number of persons state wise vaccinated for first dose in India
c. Number of persons state wise vaccinated for second dose in India
d. Number of Males vaccinated
d. Number of females vaccinated
3101051 PARDESHI ADITI RAJENDRA
"Write a case study to process data driven for Digital Marketing OR Health care systems with
Hadoop Ecosystem components as shown. (Mandatory)
โ— HDFS: Hadoop Distributed File System
โ— YARN: Yet Another Resource Negotiator
โ— MapReduce: Programming based Data Processing
โ— Spark: In-Memory data processing
โ— PIG, HIVE: Query based processing of data services
โ— HBase: NoSQL Database (Provides real-time reads and writes)
โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm
libraries
โ— Solar, Lucene: Searching and Indexing"
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101052 PARJANE KALYANI RAVINDRA
"Write a case study to process data driven for Digital Marketing OR Health care systems with
Hadoop Ecosystem components as shown. (Mandatory)
โ— HDFS: Hadoop Distributed File System
โ— YARN: Yet Another Resource Negotiator
โ— MapReduce: Programming based Data Processing
โ— Spark: In-Memory data processing
โ— PIG, HIVE: Query based processing of data services
โ— HBase: NoSQL Database (Provides real-time reads and writes)
โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm
libraries
โ— Solar, Lucene: Searching and Indexing"
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101053 PATHAK NEHA MANOJ
"Write a case study to process data driven for Digital Marketing OR Health care systems with
Hadoop Ecosystem components as shown. (Mandatory)
โ— HDFS: Hadoop Distributed File System
โ— YARN: Yet Another Resource Negotiator
โ— MapReduce: Programming based Data Processing
โ— Spark: In-Memory data processing
โ— PIG, HIVE: Query based processing of data services
โ— HBase: NoSQL Database (Provides real-time reads and writes)
โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm
libraries
โ— Solar, Lucene: Searching and Indexing"
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101054 PATIL PRANOTI PANDURANG
"Write a case study to process data driven for Digital Marketing OR Health care systems with
Hadoop Ecosystem components as shown. (Mandatory)
โ— HDFS: Hadoop Distributed File System
โ— YARN: Yet Another Resource Negotiator
โ— MapReduce: Programming based Data Processing
โ— Spark: In-Memory data processing
โ— PIG, HIVE: Query based processing of data services
โ— HBase: NoSQL Database (Provides real-time reads and writes)
โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm
libraries
โ— Solar, Lucene: Searching and Indexing"
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101055 PAWAR PRAJWAL JAGDISH
"Write a case study to process data driven for Digital Marketing OR Health care systems with
Hadoop Ecosystem components as shown. (Mandatory)
โ— HDFS: Hadoop Distributed File System
โ— YARN: Yet Another Resource Negotiator
โ— MapReduce: Programming based Data Processing
โ— Spark: In-Memory data processing
โ— PIG, HIVE: Query based processing of data services
โ— HBase: NoSQL Database (Provides real-time reads and writes)
โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm
libraries
โ— Solar, Lucene: Searching and Indexing"
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101056 PHADTARE VEDANT DILIP
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Write a case study on Global Innovation Network and Analysis (GINA).
Components of analytic plan are 1. Discovery business problem
framed, 2. Data, 3. Model planning analytic technique 4. Results and
Key findings.
3101057 PINGALE PANKAJ CHANGDEV
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Write a case study on Global Innovation Network and Analysis (GINA).
Components of analytic plan are 1. Discovery business problem
framed, 2. Data, 3. Model planning analytic technique 4. Results and
Key findings.
3101058 PRASAD SUNIL CHAWARE
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Write a case study on Global Innovation Network and Analysis (GINA).
Components of analytic plan are 1. Discovery business problem
framed, 2. Data, 3. Model planning analytic technique 4. Results and
Key findings.
3101059 PREET POCHAT
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Write a case study on Global Innovation Network and Analysis (GINA).
Components of analytic plan are 1. Discovery business problem
framed, 2. Data, 3. Model planning analytic technique 4. Results and
Key findings.
3101060 PUTALE KUNAL SHEKHAR
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Write a case study on Global Innovation Network and Analysis (GINA).
Components of analytic plan are 1. Discovery business problem
framed, 2. Data, 3. Model planning analytic technique 4. Results and
Key findings.
3101061 RELUSINGHANI PREET KAUR KULDEEPSINGH
Write a case study on Global Innovation Network and Analysis (GINA). Components of
analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning
analytic technique 4. Results and Key findings.
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101062 RUTUJA MOHAN SATHE
Write a case study on Global Innovation Network and Analysis (GINA). Components of
analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning
analytic technique 4. Results and Key findings.
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101063 SANDBHOR SHREYAS AMOL
Write a case study on Global Innovation Network and Analysis (GINA). Components of
analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning
analytic technique 4. Results and Key findings.
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101064 SATPUTE DIPTI DEVRAM
Write a case study on Global Innovation Network and Analysis (GINA). Components of
analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning
analytic technique 4. Results and Key findings.
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101065 SAWANT ROHIT RAJKUMAR
Write a case study on Global Innovation Network and Analysis (GINA). Components of
analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning
analytic technique 4. Results and Key findings.
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101066 SHEKATKAR HIMANSHU SANJAY
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Use the following covid_vaccine_statewise.csv dataset and perform
following analytics on the
given dataset
https://www.kaggle.com/sudalairajkumar/covid19-in-india?
select=covid_vaccine_statewise.csv
a. Describe the dataset
b. Number of persons state wise vaccinated for first dose in India
c. Number of persons state wise vaccinated for second dose in India
d. Number of Males vaccinated
d. Number of females vaccinated
3101067 SHELAKE TEJAS MILIND
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Use the following covid_vaccine_statewise.csv dataset and perform
following analytics on the
given dataset
https://www.kaggle.com/sudalairajkumar/covid19-in-india?
select=covid_vaccine_statewise.csv
a. Describe the dataset
b. Number of persons state wise vaccinated for first dose in India
c. Number of persons state wise vaccinated for second dose in India
d. Number of Males vaccinated
d. Number of females vaccinated
3101068 Shinde Kunal Bharat
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Use the following covid_vaccine_statewise.csv dataset and perform
following analytics on the
given dataset
https://www.kaggle.com/sudalairajkumar/covid19-in-india?
select=covid_vaccine_statewise.csv
a. Describe the dataset
b. Number of persons state wise vaccinated for first dose in India
c. Number of persons state wise vaccinated for second dose in India
d. Number of Males vaccinated
d. Number of females vaccinated
3101069 SHINDE MITESH RAMESH
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Use the following covid_vaccine_statewise.csv dataset and perform
following analytics on the
given dataset
https://www.kaggle.com/sudalairajkumar/covid19-in-india?
select=covid_vaccine_statewise.csv
a. Describe the dataset
b. Number of persons state wise vaccinated for first dose in India
c. Number of persons state wise vaccinated for second dose in India
d. Number of Males vaccinated
d. Number of females vaccinated
3101070 SHINDE RAJWARDHAN SANJAY
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Use the following covid_vaccine_statewise.csv dataset and perform
following analytics on the
given dataset
https://www.kaggle.com/sudalairajkumar/covid19-in-india?
select=covid_vaccine_statewise.csv
a. Describe the dataset
b. Number of persons state wise vaccinated for first dose in India
c. Number of persons state wise vaccinated for second dose in India
d. Number of Males vaccinated
d. Number of females vaccinated
3101071 SAKSHI SIRGAN
"Write a case study to process data driven for Digital Marketing OR Health care systems with
Hadoop Ecosystem components as shown. (Mandatory)
โ— HDFS: Hadoop Distributed File System
โ— YARN: Yet Another Resource Negotiator
โ— MapReduce: Programming based Data Processing
โ— Spark: In-Memory data processing
โ— PIG, HIVE: Query based processing of data services
โ— HBase: NoSQL Database (Provides real-time reads and writes)
โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm
libraries
โ— Solar, Lucene: Searching and Indexing"
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101072 SONPATKI SOHAN SUNIL
"Write a case study to process data driven for Digital Marketing OR Health care systems with
Hadoop Ecosystem components as shown. (Mandatory)
โ— HDFS: Hadoop Distributed File System
โ— YARN: Yet Another Resource Negotiator
โ— MapReduce: Programming based Data Processing
โ— Spark: In-Memory data processing
โ— PIG, HIVE: Query based processing of data services
โ— HBase: NoSQL Database (Provides real-time reads and writes)
โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm
libraries
โ— Solar, Lucene: Searching and Indexing"
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101073 TEJWANI ANISH SHANKARLAL
"Write a case study to process data driven for Digital Marketing OR Health care systems with
Hadoop Ecosystem components as shown. (Mandatory)
โ— HDFS: Hadoop Distributed File System
โ— YARN: Yet Another Resource Negotiator
โ— MapReduce: Programming based Data Processing
โ— Spark: In-Memory data processing
โ— PIG, HIVE: Query based processing of data services
โ— HBase: NoSQL Database (Provides real-time reads and writes)
โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm
libraries
โ— Solar, Lucene: Searching and Indexing"
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101074 TELI SHUBHANGI RAJENDRA
"Write a case study to process data driven for Digital Marketing OR Health care systems with
Hadoop Ecosystem components as shown. (Mandatory)
โ— HDFS: Hadoop Distributed File System
โ— YARN: Yet Another Resource Negotiator
โ— MapReduce: Programming based Data Processing
โ— Spark: In-Memory data processing
โ— PIG, HIVE: Query based processing of data services
โ— HBase: NoSQL Database (Provides real-time reads and writes)
โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm
libraries
โ— Solar, Lucene: Searching and Indexing"
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101075 THANGE SAKSHI SANTOSHKUMAR
"Write a case study to process data driven for Digital Marketing OR Health care systems with
Hadoop Ecosystem components as shown. (Mandatory)
โ— HDFS: Hadoop Distributed File System
โ— YARN: Yet Another Resource Negotiator
โ— MapReduce: Programming based Data Processing
โ— Spark: In-Memory data processing
โ— PIG, HIVE: Query based processing of data services
โ— HBase: NoSQL Database (Provides real-time reads and writes)
โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm
libraries
โ— Solar, Lucene: Searching and Indexing"
2. Use the following dataset and classify tweets into positive and
negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets
https://www.kaggle.com/ruchi798/data-science-tweets
3101076 THORAT ABHIJIT KISAN
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Write a case study on Global Innovation Network and Analysis (GINA).
Components of analytic plan are 1. Discovery business problem
framed, 2. Data, 3. Model planning analytic technique 4. Results and
Key findings.
3101077 UBALE TEJAS SHAILESH
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Write a case study on Global Innovation Network and Analysis (GINA).
Components of analytic plan are 1. Discovery business problem
framed, 2. Data, 3. Model planning analytic technique 4. Results and
Key findings.
3101078 WALUNJ OMKAR RAMDAS
Develop a movie recommendation model using the scikit-learn library in python.Refer
dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset.
csv
Write a case study on Global Innovation Network and Analysis (GINA).
Components of analytic plan are 1. Discovery business problem
framed, 2. Data, 3. Model planning analytic technique 4. Results and
Key findings.

More Related Content

What's hot

DALLE-2.pptx
DALLE-2.pptxDALLE-2.pptx
DALLE-2.pptx
PIRSALMANSHAH
ย 

What's hot (20)

Brain Tumour Detection.pptx
Brain Tumour Detection.pptxBrain Tumour Detection.pptx
Brain Tumour Detection.pptx
ย 
Anomaly detection
Anomaly detectionAnomaly detection
Anomaly detection
ย 
Image attendance system
Image attendance systemImage attendance system
Image attendance system
ย 
Malware Dectection Using Machine learning
Malware Dectection Using Machine learningMalware Dectection Using Machine learning
Malware Dectection Using Machine learning
ย 
Data Mining Techniques
Data Mining TechniquesData Mining Techniques
Data Mining Techniques
ย 
Disease prediction and doctor recommendation system
Disease prediction and doctor recommendation systemDisease prediction and doctor recommendation system
Disease prediction and doctor recommendation system
ย 
Modelling and evaluation
Modelling and evaluationModelling and evaluation
Modelling and evaluation
ย 
Big Data to avoid weather related flight delays
Big Data to avoid weather related flight delaysBig Data to avoid weather related flight delays
Big Data to avoid weather related flight delays
ย 
PPT4: Frameworks & Libraries of Machine Learning & Deep Learning
PPT4: Frameworks & Libraries of Machine Learning & Deep Learning PPT4: Frameworks & Libraries of Machine Learning & Deep Learning
PPT4: Frameworks & Libraries of Machine Learning & Deep Learning
ย 
Machine Learning with Earth Observation Imagery
Machine Learning with Earth Observation ImageryMachine Learning with Earth Observation Imagery
Machine Learning with Earth Observation Imagery
ย 
Unit 1 defects classes
Unit 1 defects classesUnit 1 defects classes
Unit 1 defects classes
ย 
Software engineering a practitioners approach 8th edition pressman solutions ...
Software engineering a practitioners approach 8th edition pressman solutions ...Software engineering a practitioners approach 8th edition pressman solutions ...
Software engineering a practitioners approach 8th edition pressman solutions ...
ย 
Evaluating Software Architectures
Evaluating Software ArchitecturesEvaluating Software Architectures
Evaluating Software Architectures
ย 
Artifacts
ArtifactsArtifacts
Artifacts
ย 
Learning With Complete Data
Learning With Complete DataLearning With Complete Data
Learning With Complete Data
ย 
Skin Cancer Detection Using Deep Learning Techniques
Skin Cancer Detection Using Deep Learning TechniquesSkin Cancer Detection Using Deep Learning Techniques
Skin Cancer Detection Using Deep Learning Techniques
ย 
DALLE-2.pptx
DALLE-2.pptxDALLE-2.pptx
DALLE-2.pptx
ย 
Lecture 3 general problem solver
Lecture 3 general problem solverLecture 3 general problem solver
Lecture 3 general problem solver
ย 
10-Software Project Management (Object Oriented Software Engineering - BNU Sp...
10-Software Project Management (Object Oriented Software Engineering - BNU Sp...10-Software Project Management (Object Oriented Software Engineering - BNU Sp...
10-Software Project Management (Object Oriented Software Engineering - BNU Sp...
ย 
Step by Step Guide to Learn SDLC
Step by Step Guide to Learn SDLCStep by Step Guide to Learn SDLC
Step by Step Guide to Learn SDLC
ย 

Similar to DSBDA Miniproject Assignment - TE A (1).pdf

Summer Independent Study Report
Summer Independent Study ReportSummer Independent Study Report
Summer Independent Study Report
Shreya Chakrabarti
ย 
Simplified Machine Learning, Text, and Graph Analytics with Pivotal Greenplum
Simplified Machine Learning, Text, and Graph Analytics with Pivotal GreenplumSimplified Machine Learning, Text, and Graph Analytics with Pivotal Greenplum
Simplified Machine Learning, Text, and Graph Analytics with Pivotal Greenplum
VMware Tanzu
ย 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh Platform
Sanjay Padhi, Ph.D
ย 
(R17A0528) BIG DATA ANALYTICS.pdf
(R17A0528) BIG DATA ANALYTICS.pdf(R17A0528) BIG DATA ANALYTICS.pdf
(R17A0528) BIG DATA ANALYTICS.pdf
PoornimaShetty27
ย 

Similar to DSBDA Miniproject Assignment - TE A (1).pdf (20)

Big data analytics in banking sector
Big data analytics in banking sectorBig data analytics in banking sector
Big data analytics in banking sector
ย 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
ย 
Summer Independent Study Report
Summer Independent Study ReportSummer Independent Study Report
Summer Independent Study Report
ย 
Big data analytics
Big data analyticsBig data analytics
Big data analytics
ย 
Simplified Machine Learning, Text, and Graph Analytics with Pivotal Greenplum
Simplified Machine Learning, Text, and Graph Analytics with Pivotal GreenplumSimplified Machine Learning, Text, and Graph Analytics with Pivotal Greenplum
Simplified Machine Learning, Text, and Graph Analytics with Pivotal Greenplum
ย 
Social Media Market Trender with Dache Manager Using Hadoop and Visualization...
Social Media Market Trender with Dache Manager Using Hadoop and Visualization...Social Media Market Trender with Dache Manager Using Hadoop and Visualization...
Social Media Market Trender with Dache Manager Using Hadoop and Visualization...
ย 
Big Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential ToolsBig Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential Tools
ย 
Certified Big Data Science Analyst (CBDSA)
Certified Big Data Science Analyst (CBDSA)Certified Big Data Science Analyst (CBDSA)
Certified Big Data Science Analyst (CBDSA)
ย 
13 pv-do es-18-bigdata-v3
13 pv-do es-18-bigdata-v313 pv-do es-18-bigdata-v3
13 pv-do es-18-bigdata-v3
ย 
Unstructured Datasets Analysis: Thesaurus Model
Unstructured Datasets Analysis: Thesaurus ModelUnstructured Datasets Analysis: Thesaurus Model
Unstructured Datasets Analysis: Thesaurus Model
ย 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh Platform
ย 
Big Data Processing with Hadoop : A Review
Big Data Processing with Hadoop : A ReviewBig Data Processing with Hadoop : A Review
Big Data Processing with Hadoop : A Review
ย 
Information Security Analytics
Information Security AnalyticsInformation Security Analytics
Information Security Analytics
ย 
DATASCIENCE vs BUSINESS INTELLIGENCE.pptx
DATASCIENCE vs BUSINESS INTELLIGENCE.pptxDATASCIENCE vs BUSINESS INTELLIGENCE.pptx
DATASCIENCE vs BUSINESS INTELLIGENCE.pptx
ย 
(R17A0528) BIG DATA ANALYTICS.pdf
(R17A0528) BIG DATA ANALYTICS.pdf(R17A0528) BIG DATA ANALYTICS.pdf
(R17A0528) BIG DATA ANALYTICS.pdf
ย 
(R17A0528) BIG DATA ANALYTICS.pdf
(R17A0528) BIG DATA ANALYTICS.pdf(R17A0528) BIG DATA ANALYTICS.pdf
(R17A0528) BIG DATA ANALYTICS.pdf
ย 
TSE_Pres12.pptx
TSE_Pres12.pptxTSE_Pres12.pptx
TSE_Pres12.pptx
ย 
Accelerate Digital Transformation with an Enterprise Big Data Fabric
Accelerate Digital Transformation with an Enterprise Big Data FabricAccelerate Digital Transformation with an Enterprise Big Data Fabric
Accelerate Digital Transformation with an Enterprise Big Data Fabric
ย 
Big Data Analytics Tutorial | Big Data Analytics for Beginners | Hadoop Tutor...
Big Data Analytics Tutorial | Big Data Analytics for Beginners | Hadoop Tutor...Big Data Analytics Tutorial | Big Data Analytics for Beginners | Hadoop Tutor...
Big Data Analytics Tutorial | Big Data Analytics for Beginners | Hadoop Tutor...
ย 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
ย 

Recently uploaded

Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Christo Ananth
ย 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
ankushspencer015
ย 
Call Girls in Ramesh Nagar Delhi ๐Ÿ’ฏ Call Us ๐Ÿ”9953056974 ๐Ÿ” Escort Service
Call Girls in Ramesh Nagar Delhi ๐Ÿ’ฏ Call Us ๐Ÿ”9953056974 ๐Ÿ” Escort ServiceCall Girls in Ramesh Nagar Delhi ๐Ÿ’ฏ Call Us ๐Ÿ”9953056974 ๐Ÿ” Escort Service
Call Girls in Ramesh Nagar Delhi ๐Ÿ’ฏ Call Us ๐Ÿ”9953056974 ๐Ÿ” Escort Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
ย 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
sivaprakash250
ย 

Recently uploaded (20)

Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
ย 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
ย 
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELLPVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
ย 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
ย 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ย 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
ย 
Top Rated Pune Call Girls Budhwar Peth โŸŸ 6297143586 โŸŸ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth โŸŸ 6297143586 โŸŸ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth โŸŸ 6297143586 โŸŸ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth โŸŸ 6297143586 โŸŸ Call Me For Genuine Se...
ย 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)
ย 
Call Girls in Ramesh Nagar Delhi ๐Ÿ’ฏ Call Us ๐Ÿ”9953056974 ๐Ÿ” Escort Service
Call Girls in Ramesh Nagar Delhi ๐Ÿ’ฏ Call Us ๐Ÿ”9953056974 ๐Ÿ” Escort ServiceCall Girls in Ramesh Nagar Delhi ๐Ÿ’ฏ Call Us ๐Ÿ”9953056974 ๐Ÿ” Escort Service
Call Girls in Ramesh Nagar Delhi ๐Ÿ’ฏ Call Us ๐Ÿ”9953056974 ๐Ÿ” Escort Service
ย 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
ย 
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICSUNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
ย 
Glass Ceramics: Processing and Properties
Glass Ceramics: Processing and PropertiesGlass Ceramics: Processing and Properties
Glass Ceramics: Processing and Properties
ย 
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank  Design by Working Stress - IS Method.pdfIntze Overhead Water Tank  Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
ย 
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxBSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
ย 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
ย 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdf
ย 
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
ย 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
ย 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
ย 
Vivazz, Mieres Social Housing Design Spain
Vivazz, Mieres Social Housing Design SpainVivazz, Mieres Social Housing Design Spain
Vivazz, Mieres Social Housing Design Spain
ย 

DSBDA Miniproject Assignment - TE A (1).pdf

  • 1. TE - A DSBDA MiniProject Assignment Roll No. Name of the student MiniProject1 MiniProject 2 3101001 ADHAV SOHAM GANESH Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101002 AMBORKAR BHAVESH SUNIL Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101003 ANUBHAW MISHRA Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101004 BADE AASTHA RAMESH 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101005 BENDALE AYUSH YOGESH Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101006 Bhide Ashwini Vinay Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Use the following covid_vaccine_statewise.csv dataset and perform following analytics on the given dataset https://www.kaggle.com/sudalairajkumar/covid19-in-india? select=covid_vaccine_statewise.csv a. Describe the dataset b. Number of persons state wise vaccinated for first dose in India c. Number of persons state wise vaccinated for second dose in India d. Number of Males vaccinated d. Number of females vaccinated 3101007 BORA MITALI MAHENDRA Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Use the following covid_vaccine_statewise.csv dataset and perform following analytics on the given dataset https://www.kaggle.com/sudalairajkumar/covid19-in-india? select=covid_vaccine_statewise.csv a. Describe the dataset b. Number of persons state wise vaccinated for first dose in India c. Number of persons state wise vaccinated for second dose in India d. Number of Males vaccinated d. Number of females vaccinated 3101008 CHAUDHARI ANIKET RAJENDRA Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Use the following covid_vaccine_statewise.csv dataset and perform following analytics on the given dataset https://www.kaggle.com/sudalairajkumar/covid19-in-india? select=covid_vaccine_statewise.csv a. Describe the dataset b. Number of persons state wise vaccinated for first dose in India c. Number of persons state wise vaccinated for second dose in India d. Number of Males vaccinated d. Number of females vaccinated
  • 2. 3101009 CHAUDHARI DNYANESHWAR JAYANT Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Use the following covid_vaccine_statewise.csv dataset and perform following analytics on the given dataset https://www.kaggle.com/sudalairajkumar/covid19-in-india? select=covid_vaccine_statewise.csv a. Describe the dataset b. Number of persons state wise vaccinated for first dose in India c. Number of persons state wise vaccinated for second dose in India d. Number of Males vaccinated d. Number of females vaccinated 3101010 CHAUDHARI MANISHA KANARAM Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Use the following covid_vaccine_statewise.csv dataset and perform following analytics on the given dataset https://www.kaggle.com/sudalairajkumar/covid19-in-india? select=covid_vaccine_statewise.csv a. Describe the dataset b. Number of persons state wise vaccinated for first dose in India c. Number of persons state wise vaccinated for second dose in India d. Number of Males vaccinated d. Number of females vaccinated 3101011 CHAVAN VEDANG GANESH "Write a case study to process data driven for Digital Marketing OR Health care systems with Hadoop Ecosystem components as shown. (Mandatory) โ— HDFS: Hadoop Distributed File System โ— YARN: Yet Another Resource Negotiator โ— MapReduce: Programming based Data Processing โ— Spark: In-Memory data processing โ— PIG, HIVE: Query based processing of data services โ— HBase: NoSQL Database (Provides real-time reads and writes) โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm libraries โ— Solar, Lucene: Searching and Indexing" 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101012 Dadge Krishna Anil "Write a case study to process data driven for Digital Marketing OR Health care systems with Hadoop Ecosystem components as shown. (Mandatory) โ— HDFS: Hadoop Distributed File System โ— YARN: Yet Another Resource Negotiator โ— MapReduce: Programming based Data Processing โ— Spark: In-Memory data processing โ— PIG, HIVE: Query based processing of data services โ— HBase: NoSQL Database (Provides real-time reads and writes) โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm libraries โ— Solar, Lucene: Searching and Indexing" 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets
  • 3. 3101013 DALAL PREYASH PARESH "Write a case study to process data driven for Digital Marketing OR Health care systems with Hadoop Ecosystem components as shown. (Mandatory) โ— HDFS: Hadoop Distributed File System โ— YARN: Yet Another Resource Negotiator โ— MapReduce: Programming based Data Processing โ— Spark: In-Memory data processing โ— PIG, HIVE: Query based processing of data services โ— HBase: NoSQL Database (Provides real-time reads and writes) โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm libraries โ— Solar, Lucene: Searching and Indexing" 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101014 DESAI NIRANJAN VIKAS "Write a case study to process data driven for Digital Marketing OR Health care systems with Hadoop Ecosystem components as shown. (Mandatory) โ— HDFS: Hadoop Distributed File System โ— YARN: Yet Another Resource Negotiator โ— MapReduce: Programming based Data Processing โ— Spark: In-Memory data processing โ— PIG, HIVE: Query based processing of data services โ— HBase: NoSQL Database (Provides real-time reads and writes) โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm libraries โ— Solar, Lucene: Searching and Indexing" 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101015 DESHMUKH OM RAMDAS "Write a case study to process data driven for Digital Marketing OR Health care systems with Hadoop Ecosystem components as shown. (Mandatory) โ— HDFS: Hadoop Distributed File System โ— YARN: Yet Another Resource Negotiator โ— MapReduce: Programming based Data Processing โ— Spark: In-Memory data processing โ— PIG, HIVE: Query based processing of data services โ— HBase: NoSQL Database (Provides real-time reads and writes) โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm libraries โ— Solar, Lucene: Searching and Indexing" 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101016 DHAMAL OMKAR DATTATRAY Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 3101017 DHANAWADE SAKSHI HANUMANT Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 3101018 DHANE ANIKET ARUN Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 3101019 Dhole Yash Bandu Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings.
  • 4. 3101020 DUSHING SHIVANI PRASHANT Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 3101021 Gaikwad Rutuja RAVINDRA Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101022 GAIKWAD VIVEK SATISH Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101023 GANJALE SAURABH SANJAY Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101024 GHATTE PRATIK GAJANAN Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101025 HADOLE SAKSHI SHIVAJI Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101026 HEDAU VEDANT RAJESH Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Use the following covid_vaccine_statewise.csv dataset and perform following analytics on the given dataset https://www.kaggle.com/sudalairajkumar/covid19-in-india? select=covid_vaccine_statewise.csv a. Describe the dataset b. Number of persons state wise vaccinated for first dose in India c. Number of persons state wise vaccinated for second dose in India d. Number of Males vaccinated d. Number of females vaccinated 3101027 HOLE SAKSHI VIJAY Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Use the following covid_vaccine_statewise.csv dataset and perform following analytics on the given dataset https://www.kaggle.com/sudalairajkumar/covid19-in-india? select=covid_vaccine_statewise.csv a. Describe the dataset b. Number of persons state wise vaccinated for first dose in India c. Number of persons state wise vaccinated for second dose in India d. Number of Males vaccinated d. Number of females vaccinated 3101028 INGLE YASH RAJABHAU Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Use the following covid_vaccine_statewise.csv dataset and perform following analytics on the given dataset https://www.kaggle.com/sudalairajkumar/covid19-in-india? select=covid_vaccine_statewise.csv a. Describe the dataset b. Number of persons state wise vaccinated for first dose in India c. Number of persons state wise vaccinated for second dose in India d. Number of Males vaccinated d. Number of females vaccinated
  • 5. 3101029 JAGTAP GOURIE BABBAJEEH Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Use the following covid_vaccine_statewise.csv dataset and perform following analytics on the given dataset https://www.kaggle.com/sudalairajkumar/covid19-in-india? select=covid_vaccine_statewise.csv a. Describe the dataset b. Number of persons state wise vaccinated for first dose in India c. Number of persons state wise vaccinated for second dose in India d. Number of Males vaccinated d. Number of females vaccinated 3101030 Kale Vaibhav Bhausaheb Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Use the following covid_vaccine_statewise.csv dataset and perform following analytics on the given dataset https://www.kaggle.com/sudalairajkumar/covid19-in-india? select=covid_vaccine_statewise.csv a. Describe the dataset b. Number of persons state wise vaccinated for first dose in India c. Number of persons state wise vaccinated for second dose in India d. Number of Males vaccinated d. Number of females vaccinated 3101031 KARPE SAMRUDDHI AJIT "Write a case study to process data driven for Digital Marketing OR Health care systems with Hadoop Ecosystem components as shown. (Mandatory) โ— HDFS: Hadoop Distributed File System โ— YARN: Yet Another Resource Negotiator โ— MapReduce: Programming based Data Processing โ— Spark: In-Memory data processing โ— PIG, HIVE: Query based processing of data services โ— HBase: NoSQL Database (Provides real-time reads and writes) โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm libraries โ— Solar, Lucene: Searching and Indexing" 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101032 KHAIRE SWAMINI VINOD "Write a case study to process data driven for Digital Marketing OR Health care systems with Hadoop Ecosystem components as shown. (Mandatory) โ— HDFS: Hadoop Distributed File System โ— YARN: Yet Another Resource Negotiator โ— MapReduce: Programming based Data Processing โ— Spark: In-Memory data processing โ— PIG, HIVE: Query based processing of data services โ— HBase: NoSQL Database (Provides real-time reads and writes) โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm libraries โ— Solar, Lucene: Searching and Indexing" 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets
  • 6. 3101033 KHARADE ROHIT SHAHAJI "Write a case study to process data driven for Digital Marketing OR Health care systems with Hadoop Ecosystem components as shown. (Mandatory) โ— HDFS: Hadoop Distributed File System โ— YARN: Yet Another Resource Negotiator โ— MapReduce: Programming based Data Processing โ— Spark: In-Memory data processing โ— PIG, HIVE: Query based processing of data services โ— HBase: NoSQL Database (Provides real-time reads and writes) โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm libraries โ— Solar, Lucene: Searching and Indexing" 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101034 Kharat Vishwajeet Dipak "Write a case study to process data driven for Digital Marketing OR Health care systems with Hadoop Ecosystem components as shown. (Mandatory) โ— HDFS: Hadoop Distributed File System โ— YARN: Yet Another Resource Negotiator โ— MapReduce: Programming based Data Processing โ— Spark: In-Memory data processing โ— PIG, HIVE: Query based processing of data services โ— HBase: NoSQL Database (Provides real-time reads and writes) โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm libraries โ— Solar, Lucene: Searching and Indexing" 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101035 KHIRID SAHIL GANESH "Write a case study to process data driven for Digital Marketing OR Health care systems with Hadoop Ecosystem components as shown. (Mandatory) โ— HDFS: Hadoop Distributed File System โ— YARN: Yet Another Resource Negotiator โ— MapReduce: Programming based Data Processing โ— Spark: In-Memory data processing โ— PIG, HIVE: Query based processing of data services โ— HBase: NoSQL Database (Provides real-time reads and writes) โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm libraries โ— Solar, Lucene: Searching and Indexing" 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101036 KHOLGADE CHETAN GAJANAN Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 3101037 KOTHAWADE CHIRAG RAJENDRA Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 3101038 LAVHALE ABHIJIT KADAJI Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 3101039 MADANE TUSHAR BHAGWAN Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings.
  • 7. 3101040 MAHAJAN PRASHANT RAMKRUSHNA Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 3101041 MALLICK NASIMA YASMIN ILIUS ALI Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101042 MEHKARKAR PRATHMESH GIRIPRASAD Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101043 MOHITE PRAJWAL RAGHAVENDRA Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101044 More Mrunali Sunil Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101045 MORE VISHAL SANJAY Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101046 NAGE AKANKSHA KANIPHNATH Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Use the following covid_vaccine_statewise.csv dataset and perform following analytics on the given dataset https://www.kaggle.com/sudalairajkumar/covid19-in-india? select=covid_vaccine_statewise.csv a. Describe the dataset b. Number of persons state wise vaccinated for first dose in India c. Number of persons state wise vaccinated for second dose in India d. Number of Males vaccinated d. Number of females vaccinated 3101047 NAWALKAR ANIKET GAJANAN Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Use the following covid_vaccine_statewise.csv dataset and perform following analytics on the given dataset https://www.kaggle.com/sudalairajkumar/covid19-in-india? select=covid_vaccine_statewise.csv a. Describe the dataset b. Number of persons state wise vaccinated for first dose in India c. Number of persons state wise vaccinated for second dose in India d. Number of Males vaccinated d. Number of females vaccinated 3101048 NIKAM RUTUJA RAMDAS Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Use the following covid_vaccine_statewise.csv dataset and perform following analytics on the given dataset https://www.kaggle.com/sudalairajkumar/covid19-in-india? select=covid_vaccine_statewise.csv a. Describe the dataset b. Number of persons state wise vaccinated for first dose in India c. Number of persons state wise vaccinated for second dose in India d. Number of Males vaccinated d. Number of females vaccinated
  • 8. 3101049 Padavi Harishchandra Ravindra Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Use the following covid_vaccine_statewise.csv dataset and perform following analytics on the given dataset https://www.kaggle.com/sudalairajkumar/covid19-in-india? select=covid_vaccine_statewise.csv a. Describe the dataset b. Number of persons state wise vaccinated for first dose in India c. Number of persons state wise vaccinated for second dose in India d. Number of Males vaccinated d. Number of females vaccinated 3101050 PANSARE NISARG MANOHAR Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Use the following covid_vaccine_statewise.csv dataset and perform following analytics on the given dataset https://www.kaggle.com/sudalairajkumar/covid19-in-india? select=covid_vaccine_statewise.csv a. Describe the dataset b. Number of persons state wise vaccinated for first dose in India c. Number of persons state wise vaccinated for second dose in India d. Number of Males vaccinated d. Number of females vaccinated 3101051 PARDESHI ADITI RAJENDRA "Write a case study to process data driven for Digital Marketing OR Health care systems with Hadoop Ecosystem components as shown. (Mandatory) โ— HDFS: Hadoop Distributed File System โ— YARN: Yet Another Resource Negotiator โ— MapReduce: Programming based Data Processing โ— Spark: In-Memory data processing โ— PIG, HIVE: Query based processing of data services โ— HBase: NoSQL Database (Provides real-time reads and writes) โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm libraries โ— Solar, Lucene: Searching and Indexing" 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101052 PARJANE KALYANI RAVINDRA "Write a case study to process data driven for Digital Marketing OR Health care systems with Hadoop Ecosystem components as shown. (Mandatory) โ— HDFS: Hadoop Distributed File System โ— YARN: Yet Another Resource Negotiator โ— MapReduce: Programming based Data Processing โ— Spark: In-Memory data processing โ— PIG, HIVE: Query based processing of data services โ— HBase: NoSQL Database (Provides real-time reads and writes) โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm libraries โ— Solar, Lucene: Searching and Indexing" 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets
  • 9. 3101053 PATHAK NEHA MANOJ "Write a case study to process data driven for Digital Marketing OR Health care systems with Hadoop Ecosystem components as shown. (Mandatory) โ— HDFS: Hadoop Distributed File System โ— YARN: Yet Another Resource Negotiator โ— MapReduce: Programming based Data Processing โ— Spark: In-Memory data processing โ— PIG, HIVE: Query based processing of data services โ— HBase: NoSQL Database (Provides real-time reads and writes) โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm libraries โ— Solar, Lucene: Searching and Indexing" 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101054 PATIL PRANOTI PANDURANG "Write a case study to process data driven for Digital Marketing OR Health care systems with Hadoop Ecosystem components as shown. (Mandatory) โ— HDFS: Hadoop Distributed File System โ— YARN: Yet Another Resource Negotiator โ— MapReduce: Programming based Data Processing โ— Spark: In-Memory data processing โ— PIG, HIVE: Query based processing of data services โ— HBase: NoSQL Database (Provides real-time reads and writes) โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm libraries โ— Solar, Lucene: Searching and Indexing" 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101055 PAWAR PRAJWAL JAGDISH "Write a case study to process data driven for Digital Marketing OR Health care systems with Hadoop Ecosystem components as shown. (Mandatory) โ— HDFS: Hadoop Distributed File System โ— YARN: Yet Another Resource Negotiator โ— MapReduce: Programming based Data Processing โ— Spark: In-Memory data processing โ— PIG, HIVE: Query based processing of data services โ— HBase: NoSQL Database (Provides real-time reads and writes) โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm libraries โ— Solar, Lucene: Searching and Indexing" 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101056 PHADTARE VEDANT DILIP Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 3101057 PINGALE PANKAJ CHANGDEV Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 3101058 PRASAD SUNIL CHAWARE Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 3101059 PREET POCHAT Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings.
  • 10. 3101060 PUTALE KUNAL SHEKHAR Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 3101061 RELUSINGHANI PREET KAUR KULDEEPSINGH Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101062 RUTUJA MOHAN SATHE Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101063 SANDBHOR SHREYAS AMOL Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101064 SATPUTE DIPTI DEVRAM Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101065 SAWANT ROHIT RAJKUMAR Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101066 SHEKATKAR HIMANSHU SANJAY Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Use the following covid_vaccine_statewise.csv dataset and perform following analytics on the given dataset https://www.kaggle.com/sudalairajkumar/covid19-in-india? select=covid_vaccine_statewise.csv a. Describe the dataset b. Number of persons state wise vaccinated for first dose in India c. Number of persons state wise vaccinated for second dose in India d. Number of Males vaccinated d. Number of females vaccinated 3101067 SHELAKE TEJAS MILIND Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Use the following covid_vaccine_statewise.csv dataset and perform following analytics on the given dataset https://www.kaggle.com/sudalairajkumar/covid19-in-india? select=covid_vaccine_statewise.csv a. Describe the dataset b. Number of persons state wise vaccinated for first dose in India c. Number of persons state wise vaccinated for second dose in India d. Number of Males vaccinated d. Number of females vaccinated 3101068 Shinde Kunal Bharat Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Use the following covid_vaccine_statewise.csv dataset and perform following analytics on the given dataset https://www.kaggle.com/sudalairajkumar/covid19-in-india? select=covid_vaccine_statewise.csv a. Describe the dataset b. Number of persons state wise vaccinated for first dose in India c. Number of persons state wise vaccinated for second dose in India d. Number of Males vaccinated d. Number of females vaccinated
  • 11. 3101069 SHINDE MITESH RAMESH Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Use the following covid_vaccine_statewise.csv dataset and perform following analytics on the given dataset https://www.kaggle.com/sudalairajkumar/covid19-in-india? select=covid_vaccine_statewise.csv a. Describe the dataset b. Number of persons state wise vaccinated for first dose in India c. Number of persons state wise vaccinated for second dose in India d. Number of Males vaccinated d. Number of females vaccinated 3101070 SHINDE RAJWARDHAN SANJAY Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Use the following covid_vaccine_statewise.csv dataset and perform following analytics on the given dataset https://www.kaggle.com/sudalairajkumar/covid19-in-india? select=covid_vaccine_statewise.csv a. Describe the dataset b. Number of persons state wise vaccinated for first dose in India c. Number of persons state wise vaccinated for second dose in India d. Number of Males vaccinated d. Number of females vaccinated 3101071 SAKSHI SIRGAN "Write a case study to process data driven for Digital Marketing OR Health care systems with Hadoop Ecosystem components as shown. (Mandatory) โ— HDFS: Hadoop Distributed File System โ— YARN: Yet Another Resource Negotiator โ— MapReduce: Programming based Data Processing โ— Spark: In-Memory data processing โ— PIG, HIVE: Query based processing of data services โ— HBase: NoSQL Database (Provides real-time reads and writes) โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm libraries โ— Solar, Lucene: Searching and Indexing" 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101072 SONPATKI SOHAN SUNIL "Write a case study to process data driven for Digital Marketing OR Health care systems with Hadoop Ecosystem components as shown. (Mandatory) โ— HDFS: Hadoop Distributed File System โ— YARN: Yet Another Resource Negotiator โ— MapReduce: Programming based Data Processing โ— Spark: In-Memory data processing โ— PIG, HIVE: Query based processing of data services โ— HBase: NoSQL Database (Provides real-time reads and writes) โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm libraries โ— Solar, Lucene: Searching and Indexing" 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets
  • 12. 3101073 TEJWANI ANISH SHANKARLAL "Write a case study to process data driven for Digital Marketing OR Health care systems with Hadoop Ecosystem components as shown. (Mandatory) โ— HDFS: Hadoop Distributed File System โ— YARN: Yet Another Resource Negotiator โ— MapReduce: Programming based Data Processing โ— Spark: In-Memory data processing โ— PIG, HIVE: Query based processing of data services โ— HBase: NoSQL Database (Provides real-time reads and writes) โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm libraries โ— Solar, Lucene: Searching and Indexing" 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101074 TELI SHUBHANGI RAJENDRA "Write a case study to process data driven for Digital Marketing OR Health care systems with Hadoop Ecosystem components as shown. (Mandatory) โ— HDFS: Hadoop Distributed File System โ— YARN: Yet Another Resource Negotiator โ— MapReduce: Programming based Data Processing โ— Spark: In-Memory data processing โ— PIG, HIVE: Query based processing of data services โ— HBase: NoSQL Database (Provides real-time reads and writes) โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm libraries โ— Solar, Lucene: Searching and Indexing" 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101075 THANGE SAKSHI SANTOSHKUMAR "Write a case study to process data driven for Digital Marketing OR Health care systems with Hadoop Ecosystem components as shown. (Mandatory) โ— HDFS: Hadoop Distributed File System โ— YARN: Yet Another Resource Negotiator โ— MapReduce: Programming based Data Processing โ— Spark: In-Memory data processing โ— PIG, HIVE: Query based processing of data services โ— HBase: NoSQL Database (Provides real-time reads and writes) โ— Mahout, Spark MLLib: (Provides analytical tools) Machine Learning algorithm libraries โ— Solar, Lucene: Searching and Indexing" 2. Use the following dataset and classify tweets into positive and negative tweets.https://www.kaggle.com/ruchi798/data-science-tweets https://www.kaggle.com/ruchi798/data-science-tweets 3101076 THORAT ABHIJIT KISAN Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 3101077 UBALE TEJAS SHAILESH Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings. 3101078 WALUNJ OMKAR RAMDAS Develop a movie recommendation model using the scikit-learn library in python.Refer dataset https://github.com/rashida048/Some-NLP-Projects/blob/master/movie_dataset. csv Write a case study on Global Innovation Network and Analysis (GINA). Components of analytic plan are 1. Discovery business problem framed, 2. Data, 3. Model planning analytic technique 4. Results and Key findings.