    Statistics Presentation Transcript

    • Statistics
      By Carmelo Establier Sánchez
    • Descriptive Statistics
    • Descriptive vs. Inferential
      Discrete data are whole numbers, and are usually a count of objects
      Measured data are continuous and may take any real value
      Numerical data are numbers of any kind
      Categorical data are made of words (e.g. apple, grapes, bananas…)
    • Means, medians and modes
      Median:
      The median is the middle number of a set of numbers arranged in numerical order.
      Mode:
      The most frequent value in a set.
      Mean:
      The sum of all the values of a set divided by the number of values.
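      As a minimal sketch, the three measures above can be computed with Python's standard `statistics` module. The sample values are hypothetical and chosen only for illustration. Note that with an even number of values, the median is taken as the average of the two middle values.

```python
import statistics

# Hypothetical sample, chosen only for illustration.
values = [2, 3, 3, 5, 7, 10]

mean = statistics.mean(values)      # sum of all values divided by their count
median = statistics.median(values)  # middle of the sorted values (average of 3 and 5 here)
mode = statistics.mode(values)      # most frequent value

print("mean:", mean)
print("median:", median)
print("mode:", mode)
```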
    • Variability
      Range:
      The length of the smallest interval which contains all the data.
      It is calculated by subtracting the smallest observation (sample minimum) from the greatest (sample maximum) and provides an indication of statistical dispersion.
    • Variability
      Variance:
      The variance is a measure of how far items are dispersed about their mean.
      If a random variable X has the expected value (mean) μ = E[X], then the variance of X is given by:
      Var(X) = E[(X − μ)²]
    • Variability
      The standard deviation of a statistical population, a data set, or a probability distribution is the square root of its variance.
    • Variability
      Relative variability
      The relative variability of a set (also known as the coefficient of variation) is its standard deviation divided by its mean.
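      The variability measures above can be sketched in a few lines of Python. This example assumes the data set is the whole population, so it uses the population forms `pvariance` and `pstdev`; for a sample drawn from a larger population, `statistics.variance` and `statistics.stdev` (which divide by n − 1) would be the usual choice. The data values are hypothetical.

```python
import statistics

# Hypothetical data set, treated here as a complete population.
values = [2, 4, 4, 4, 5, 5, 7, 9]

# Range: greatest observation minus smallest observation.
data_range = max(values) - min(values)

# Mean of the data set.
mu = statistics.fmean(values)

# Population variance: the mean of the squared deviations from mu.
variance = statistics.pvariance(values)

# Standard deviation: the square root of the variance.
std_dev = statistics.pstdev(values)

# Relative variability: standard deviation divided by the mean.
relative_variability = std_dev / mu

print(data_range, mu, variance, std_dev, relative_variability)
```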
    • Linear transformations
      A linear transformation of a data set is one where a constant is added to each element, or each element is multiplied by a constant.
      Addition:
      If a constant C is added to each member of a set, the mean will be C more than it was before.
      The standard deviation will not be affected.
      The range will not be affected either.
    • Linear transformation
      Multiplication:
      If each member of a set is multiplied by a constant C, then:
      The mean will be C times its value before the constant was applied.
      The standard deviation and the range will each be |C| times their value before the constant was applied.
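      The addition and multiplication rules can be checked numerically. The sketch below uses a hypothetical data set and a negative constant C, so that the difference between C and |C| is visible; the helper `summarize` is introduced only for this illustration.

```python
import statistics

def summarize(data):
    """Return (mean, population standard deviation, range) of data."""
    return (statistics.fmean(data), statistics.pstdev(data), max(data) - min(data))

# Hypothetical data set and constant, chosen only for illustration.
values = [2, 4, 4, 4, 5, 5, 7, 9]
C = -3

shifted = [x + C for x in values]  # addition of a constant
scaled = [x * C for x in values]   # multiplication by a constant

mean0, sd0, range0 = summarize(values)
mean_add, sd_add, range_add = summarize(shifted)
mean_mul, sd_mul, range_mul = summarize(scaled)

# Addition: mean moves by C, spread is unchanged.
print(mean_add - mean0, sd_add - sd0, range_add - range0)

# Multiplication: mean scales by C, spread scales by |C|.
print(mean_mul / mean0, sd_mul / sd0, range_mul / range0)
```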
    • Inferential Statistics
    • Inferential Statistics
      Inferential Statistics comprises the use of statistics and random sampling to make inferences concerning some unknown aspect of a population. It is distinguished from descriptive statistics.
      Includes:
      Estimation
      Point estimation
      Interval estimation
      Prediction
      Hypothesis testing
    • Estimation
      Point estimation:
      In statistics, point estimation involves the use of sample data to calculate a single value (known as a statistic) which is to serve as a "best guess" for an unknown (fixed or random) population parameter.
    • Estimation
      Interval estimation:
      It is the use of sample data to calculate an interval of possible (or probable) values of an unknown population parameter, in contrast to point estimation, which is a single number.
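      To make the contrast concrete, the sketch below computes a point estimate and an interval estimate of a population mean from one sample. It assumes a normal approximation for the sampling distribution of the mean, which is only one of several possible approaches; for small samples a t-based interval would be wider. The measurements are hypothetical.

```python
import math
import statistics

# Hypothetical sample measurements, chosen only for illustration.
sample = [4.8, 5.1, 5.0, 4.7, 5.3, 5.2, 4.9, 5.0]

n = len(sample)

# Point estimate: a single "best guess" for the population mean.
point_estimate = statistics.fmean(sample)

# Standard error of the mean, using the sample standard deviation (n - 1).
standard_error = statistics.stdev(sample) / math.sqrt(n)

# Interval estimate: a range of plausible values at the chosen confidence level.
confidence = 0.95
z = statistics.NormalDist().inv_cdf(0.5 + confidence / 2)  # about 1.96 for 95%
interval = (point_estimate - z * standard_error,
            point_estimate + z * standard_error)

print("point estimate:", point_estimate)
print("interval estimate:", interval)
```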
    • Hypothesis testing
      While every piece of quantitative research investigates some dilemma, issue or problem, the focus in hypothesis testing is on structuring that problem so that it can be tested effectively. Typically, it is important to:
      1. Define the research hypothesis and set the parameters for the study.
      2. Set out the null and alternative hypotheses (there may be more than one hypothesis to test).
      3. Explain how you are going to measure what you are studying, and set out the variables to be studied.
      4. Set the significance level.
      5. Make a one- or two-tailed prediction.
      6. Determine whether the distribution that you are studying is normal (this has implications for the types of statistical tests that you can run on your data).
      7. Select an appropriate statistical test based on the variables you have defined and whether the distribution is normal or not.
      8. Run the statistical tests on your data and interpret the output.
      9. Reject or fail to reject the null hypothesis.
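      The last few steps can be sketched with a two-tailed one-sample z-test. This is only one possible test, and it assumes the data are approximately normal and that the population standard deviation `sigma` is known. The function name `z_test_two_tailed`, the sample and the parameter values are all hypothetical.

```python
import math
import statistics

def z_test_two_tailed(sample, mu0, sigma, alpha=0.05):
    """Two-tailed one-sample z-test of H0: population mean == mu0.

    Assumes the population standard deviation sigma is known.
    Returns (z statistic, p-value, True if H0 is rejected at level alpha).
    """
    n = len(sample)
    z = (statistics.fmean(sample) - mu0) / (sigma / math.sqrt(n))
    # Probability of a value at least this extreme in either tail, under H0.
    p_value = 2 * (1 - statistics.NormalDist().cdf(abs(z)))
    return z, p_value, p_value < alpha

# Hypothetical sample; H0 says the population mean is 5.0.
sample = [5.6, 5.9, 6.1, 5.8, 6.0, 5.7, 6.2, 5.9, 6.0, 5.8]
z, p_value, reject_null = z_test_two_tailed(sample, mu0=5.0, sigma=0.5)

print("z =", z)
print("p-value =", p_value)
print("reject H0" if reject_null else "fail to reject H0")
```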
    • Prediction
      Prediction or Predictive Inference:
      It is an interpretation of probability that emphasizes the prediction of future observations based on past observations.
    • Regression
    • Regression
      Linear regression refers to any approach to modeling the relationship between one or more variables denoted y and one or more variables denoted X, such that the model depends linearly on the unknown parameters to be estimated from the data. Such a model is called a "linear model."
      Most commonly, linear regression refers to a model in which the conditional mean of y given the value of X is an affine function of X.
      Less commonly, linear regression could refer to a model in which the median, or some other quantile of the conditional distribution of y given X, is expressed as a linear function of X.
      Like all forms of regression analysis, linear regression focuses on the conditional probability distribution of y given X, rather than on the joint probability distribution of y and X, which is the domain of multivariate analysis.
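      As an illustration of the most common case, the sketch below fits a straight line y = intercept + slope · x by ordinary least squares with a single predictor, then uses the fitted line to predict a new observation. The helper `fit_line` and the data are hypothetical; the data lie exactly on a line so the result is easy to check by hand.

```python
import statistics

def fit_line(xs, ys):
    """Ordinary least squares fit of y = intercept + slope * x."""
    x_bar = statistics.fmean(xs)
    y_bar = statistics.fmean(ys)
    # Sum of cross-deviations and sum of squared deviations of x.
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept

# Hypothetical data lying exactly on y = 2x + 1.
xs = [1, 2, 3, 4, 5]
ys = [3, 5, 7, 9, 11]

slope, intercept = fit_line(xs, ys)

# Predict the conditional mean of y at a new x value.
predicted = intercept + slope * 6

print("slope:", slope, "intercept:", intercept, "prediction at x=6:", predicted)
```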