rockinterview.in
rockinterview.in
A FRESHER’S GUIDE TO PREPARING
FOR A BIG DATA INTERVIEW
1. BASIC PROGRAMMING LANGUAGES
YOU SHOULD KNOW:
At least one statistical programming language,
like R or Python (along with Numpy and Pandas Libraries)
And one database querying language like SQL
rockinterview.in
2. STATISTICS:
Statistics is important to crunch data and to pick out the most important
figures out of a huge dataset. This is critical in the decision-making process
and to design experiments.
Here are a few phrases you should definitely be able to explain:
null hypothesis
P-value
maximum likelihood estimators
confidence intervals
rockinterview.in
3. MACHINE LEARNING:
Familiarise yourself with how data science is used in practical
manners.
You should be able to explain K-nearest neighbours, random
forests, and ensemble methods.
These techniques are typically implemented in R or Python.
rockinterview.in
4. DATA WRANGLING:
You should be able to identify corrupt or impure data and
correct them.
This basically means understanding that a negative number
cannot exist in a dataset that describes population, or a grey
and gray are the same colour, etc…
rockinterview.in
5. DATA VISUALISATION:
Learn to use data visualisation tools like ggplot, as they help you
present data and findings in a cohesive manner.
This is an important skill set, as it ensures that Product Managers and
other stakeholders understand your work and incorporate it in the
product.
.
rockinterview.in
6. SOFTWARE ENGINEERING:
Know the use cases and run time of these data structures:
Queues, Arrays, Lists, Stacks, Trees, etc.
These are often necessary in creating efficient algorithms for
machine learning.
rockinterview.in
6. SOFTWARE ENGINEERING:
Know the use cases and run time of these data structures:
Queues, Arrays, Lists, Stacks, Trees, etc.
These are often necessary in creating efficient algorithms for
machine learning.
rockinterview.in
7. PRODUCT MANAGEMENT:
Data Scientists that understand the product are the ones who will know
what metrics are the most important.
Usability Testing
Wireframing
Retention
Conversion Rates
Traffic Analysis
Know what these terms mean:
Customer Feedback
Internal Logs
A/B Testing
rockinterview.in
Take a mock interview with us to find out more.
rockInterview.in
Fresher's guide to Preparing for a Big Data Interview
Fresher's guide to Preparing for a Big Data Interview

Fresher's guide to Preparing for a Big Data Interview

  • 1.
  • 2.
    rockinterview.in A FRESHER’S GUIDETO PREPARING FOR A BIG DATA INTERVIEW
  • 3.
    1. BASIC PROGRAMMINGLANGUAGES YOU SHOULD KNOW: At least one statistical programming language, like R or Python (along with Numpy and Pandas Libraries) And one database querying language like SQL rockinterview.in
  • 4.
    2. STATISTICS: Statistics isimportant to crunch data and to pick out the most important figures out of a huge dataset. This is critical in the decision-making process and to design experiments. Here are a few phrases you should definitely be able to explain: null hypothesis P-value maximum likelihood estimators confidence intervals rockinterview.in
  • 5.
    3. MACHINE LEARNING: Familiariseyourself with how data science is used in practical manners. You should be able to explain K-nearest neighbours, random forests, and ensemble methods. These techniques are typically implemented in R or Python. rockinterview.in
  • 6.
    4. DATA WRANGLING: Youshould be able to identify corrupt or impure data and correct them. This basically means understanding that a negative number cannot exist in a dataset that describes population, or a grey and gray are the same colour, etc… rockinterview.in
  • 7.
    5. DATA VISUALISATION: Learnto use data visualisation tools like ggplot, as they help you present data and findings in a cohesive manner. This is an important skill set, as it ensures that Product Managers and other stakeholders understand your work and incorporate it in the product. . rockinterview.in
  • 8.
    6. SOFTWARE ENGINEERING: Knowthe use cases and run time of these data structures: Queues, Arrays, Lists, Stacks, Trees, etc. These are often necessary in creating efficient algorithms for machine learning. rockinterview.in
  • 9.
    6. SOFTWARE ENGINEERING: Knowthe use cases and run time of these data structures: Queues, Arrays, Lists, Stacks, Trees, etc. These are often necessary in creating efficient algorithms for machine learning. rockinterview.in
  • 10.
    7. PRODUCT MANAGEMENT: DataScientists that understand the product are the ones who will know what metrics are the most important. Usability Testing Wireframing Retention Conversion Rates Traffic Analysis Know what these terms mean: Customer Feedback Internal Logs A/B Testing rockinterview.in
  • 11.
    Take a mockinterview with us to find out more. rockInterview.in