SlideShare a Scribd company logo
1 of 26
DATA
Aggregate Data
• Aggregate Data aggregates groups of cases in the active dataset into single
cases and creates a new, aggregated file or creates new variables in the active
dataset that contain aggregated data. Cases are aggregated based on the
value of zero or more break (grouping) variables. If no break variables are
specified, then the entire dataset is a single break group.
– If you create a new, aggregated data file, the new data file contains one
case for each group defined by the break variables. For example, if there
is one break variable with two values, the new data file will contain only
two cases. If no break variable is specified, the new data file will contain
one case.
– If you add aggregate variables to the active dataset, the data file itself is
not aggregated. Each case with the same value(s) of the break variable(s)
receives the same values for the new aggregate variables. For example,
if gender is the only break variable, all males would receive the same
value for a new aggregate variable that represents average age. If no
break variable is specified, all cases would receive the same value for a
new aggregate variable that represents average age.
Aggregate Data
• Break Variable(s). Cases are grouped together based on the values of the
break variables. Each unique combination of break variable values defines a
group. When creating a new, aggregated data file, all break variables are
saved in the new file with their existing names and dictionary information.
The break variable, if specified, can be either numeric or string.
• Aggregated Variables. Source variables are used with aggregate functions to
create new aggregate variables. The aggregate variable name is followed by
an optional variable label, the name of the aggregate function, and the
source variable name in parentheses.
Aggregate Data
• Aggregate Data: Aggregate Function
• This dialog box specifies the function to use to calculate aggregated data
values for selected variables on the Aggregate Variables list in the Aggregate
Data dialog box. Aggregate functions include:
• Summary functions for numeric variables, including mean, median, standard
deviation, and sum
• Number of cases, including unweighted, weighted, nonmissing, and missing
• Percentage or fraction of values above or below a specified value
• Percentage or fraction of values inside or outside of a specified range
• Aggregate Data: Variable Name and Label
• Aggregate Data assigns default variable names for the aggregated variables in
the new data file. This dialog box enables you to change the variable name
for the selected variable on the Aggregate Variables list and provide a
descriptive variable label. See the topicVariable names for more information.
Aggregate Data
• Data=>Aggregate
SELECT Case
• Select Cases provides several methods for selecting a subgroup of cases
based on criteria that include variables and complex expressions. You can
also select a random sample of cases. The criteria used to define a subgroup
can include:
• • Variable values and ranges
• • Date and time ranges
• • Case (row) numbers
• • Arithmetic expressions
• • Logical expressions
• • Functions
SELECT Case
• Output
• This section controls the treatment of unselected cases. You can choose one
of the following alternatives for the treatment of unselected cases:
• • Filter out unselected cases. Unselected cases are not included in the
analysis but remain in the dataset. You can use the unselected cases later in
the session if you turn filtering off. If you select a random sample or if you
select cases based on a conditional expression, this generates a variable
named filter_$ with a value of 1 for selected cases and a value of 0 for
unselected cases.
• Copy selected cases to a new dataset. Selected cases are copied to a new
dataset, leaving the original dataset unaffected. Unselected cases are not
included in the new dataset and are left in their original state in the original
dataset.
• Delete unselected cases. Unselected cases are deleted from the dataset.
Deleted cases can be recovered only by exiting from the file without saving
any changes and then reopening the file. The deletion of cases is permanent
if you save the changes to the data file.
SELECT Case:if
• This dialog box allows you to select subsets of cases using conditional
expressions. A conditional expression returns a value of true,false,
or missing for each case.
• • If the result of a conditional expression is true, the case is included in the
selected subset.
• • If the result of a conditional expression is false or missing, the case is not
included in the selected subset.
• • Most conditional expressions use one or more of the six relational
operators (<, >, <=, >=, =, and ~=) on the calculator pad.
• • Conditional expressions can include variable names, constants, arithmetic
operators, numeric (and other) functions, logical variables, and relational
operators.
Data=>Select Case…
SELECT Case IF
If=> …=>Continue
Select Cases: Random Sample
• This dialog box allows you to select a random sample based on an
approximate percentage or an exact number of cases. Sampling is performed
without replacement; so, the same case cannot be selected more than once.
• Approximately. Generates a random sample of approximately the specified
percentage of cases. Since this routine makes an independent pseudo-
random decision for each case, the percentage of cases selected can only
approximate the specified percentage. The more cases there are in the data
file, the closer the percentage of cases selected is to the specified
percentage.
• Exactly. A user-specified number of cases. You must also specify the number
of cases from which to generate the sample. This second number should be
less than or equal to the total number of cases in the data file. If the number
exceeds the total number of cases in the data file, the sample will contain
proportionally fewer cases than the requested number.
SELECT Case Random
Select Cases: Range
• This dialog box selects cases based on a range of case numbers or a range of
dates or times.
• • Case ranges are based on row number as displayed in the Data Editor.
• • Date and time ranges are available only for time series data with defined
date variables (Data menu, Define Dates).
Filter Variable
• Select only Missing value and zero value
Weight Cases
• Weight Cases gives cases different weights (by simulated replication) for statistical
analysis.
• • The values of the weighting variable should indicate the number of
observations represented by single cases in your data file.
• • Cases with zero, negative, or missing values for the weighting variable are
excluded from analysis.
• • Fractional values are valid and some procedures, such as Frequencies,
Crosstabs, and Custom Tables, will use fractional weight values. However, most
procedures treat the weighting variable as a replication weight and will simply
round fractional weights to the nearest integer. Some procedures ignore the
weighting variable completely, and this limitation is noted in the procedure-
specific documentation.
• Once you apply a weight variable, it remains in effect until you select another
weight variable or turn off weighting. If you save a weighted data file, weighting
information is saved with the data file. You can turn off weighting at any time,
even after the file has been saved in weighted form.
• Weights in Crosstabs. The Crosstabs procedure has several options for handling
case weights. See the topic Crosstabs cell display for more information.
• Weights in scatterplots and histograms. Scatterplots and histograms have an
option for turning case weights on and off, but this does not affect cases with a
zero, negative, or missing value for the weight variable. These cases remain
excluded from the chart even if you turn weighting off from within the chart.
Sort Case
• Data=>Sort Case
Transpose
• Transpose creates a new data file in which the rows and columns in the
original data file are transposed so that cases (rows) become variables and
variables (columns) become cases. Transpose automatically creates new
variable names and displays a list of the new variable names.
– A new string variable that contains the original variable name, case_lbl,
is automatically created.
– If the active dataset contains an ID or name variable with unique values,
you can use it as the name variable, and its values will be used as variable
names in the transposed data file. If it is a numeric variable, the variable
names start with the letter V, followed by the numeric value.
– User-missing values are converted to the system-missing value in the
transposed data file. To retain any of these values, change the definition
of missing values in the Variable View in the Data Editor.
Example

More Related Content

What's hot

Data mining Basics and complete description
Data mining Basics and complete description Data mining Basics and complete description
Data mining Basics and complete description Sulman Ahmed
 
Cluster analysis using spss
Cluster analysis using spssCluster analysis using spss
Cluster analysis using spssDr Nisha Arora
 
Applied Statistical Methods - Question & Answer on SPSS
Applied Statistical Methods - Question & Answer on SPSSApplied Statistical Methods - Question & Answer on SPSS
Applied Statistical Methods - Question & Answer on SPSSGökhan Ayrancıoğlu
 
Spss series - data entry and coding
Spss series - data entry and codingSpss series - data entry and coding
Spss series - data entry and codingDr. Majdi Al Jasim
 
Ap Power Point Chpt9
Ap Power Point Chpt9Ap Power Point Chpt9
Ap Power Point Chpt9dplunkett
 
Machine Learning (simplified)
Machine Learning (simplified)Machine Learning (simplified)
Machine Learning (simplified)Sreenivasa Etla
 
CART – Classification & Regression Trees
CART – Classification & Regression TreesCART – Classification & Regression Trees
CART – Classification & Regression TreesHemant Chetwani
 
Spss basics tutorial
Spss basics tutorialSpss basics tutorial
Spss basics tutorialJack Rabah
 
Data Creation and Importing in IBM SPSS
Data Creation and Importing in IBM SPSSData Creation and Importing in IBM SPSS
Data Creation and Importing in IBM SPSSThiyagu K
 

What's hot (19)

Spss beginners
Spss beginnersSpss beginners
Spss beginners
 
Data mining Basics and complete description
Data mining Basics and complete description Data mining Basics and complete description
Data mining Basics and complete description
 
Dsa unit 1
Dsa unit 1Dsa unit 1
Dsa unit 1
 
Cluster analysis using spss
Cluster analysis using spssCluster analysis using spss
Cluster analysis using spss
 
Analysis Of Attribute Revelance
Analysis Of Attribute RevelanceAnalysis Of Attribute Revelance
Analysis Of Attribute Revelance
 
XL-MINER: Data Utilities
XL-MINER: Data UtilitiesXL-MINER: Data Utilities
XL-MINER: Data Utilities
 
SPSS-SYNTAX
SPSS-SYNTAXSPSS-SYNTAX
SPSS-SYNTAX
 
Dma unit 1
Dma unit   1Dma unit   1
Dma unit 1
 
Spss
SpssSpss
Spss
 
Ibm SPSS Statistics
Ibm SPSS StatisticsIbm SPSS Statistics
Ibm SPSS Statistics
 
Applied Statistical Methods - Question & Answer on SPSS
Applied Statistical Methods - Question & Answer on SPSSApplied Statistical Methods - Question & Answer on SPSS
Applied Statistical Methods - Question & Answer on SPSS
 
Spss series - data entry and coding
Spss series - data entry and codingSpss series - data entry and coding
Spss series - data entry and coding
 
Ap Power Point Chpt9
Ap Power Point Chpt9Ap Power Point Chpt9
Ap Power Point Chpt9
 
Machine Learning (simplified)
Machine Learning (simplified)Machine Learning (simplified)
Machine Learning (simplified)
 
CART – Classification & Regression Trees
CART – Classification & Regression TreesCART – Classification & Regression Trees
CART – Classification & Regression Trees
 
3 module 2
3 module 23 module 2
3 module 2
 
Spss basics tutorial
Spss basics tutorialSpss basics tutorial
Spss basics tutorial
 
Csrde discriminant analysis final
Csrde discriminant analysis finalCsrde discriminant analysis final
Csrde discriminant analysis final
 
Data Creation and Importing in IBM SPSS
Data Creation and Importing in IBM SPSSData Creation and Importing in IBM SPSS
Data Creation and Importing in IBM SPSS
 

Similar to 3. chapter iii(aggregate data)

4. chapter iv(transform)
4. chapter iv(transform)4. chapter iv(transform)
4. chapter iv(transform)Chhom Karath
 
1. chapter i(pasw)
1. chapter i(pasw)1. chapter i(pasw)
1. chapter i(pasw)Chhom Karath
 
Data exploration validation and sanitization
Data exploration validation and sanitizationData exploration validation and sanitization
Data exploration validation and sanitizationVenkata Reddy Konasani
 
InnerSoft STATS - Analyze
InnerSoft STATS - AnalyzeInnerSoft STATS - Analyze
InnerSoft STATS - AnalyzeInnerSoft
 
04 Classification in Data Mining
04 Classification in Data Mining04 Classification in Data Mining
04 Classification in Data MiningValerii Klymchuk
 
Quantitative analysis using SPSS
Quantitative analysis using SPSSQuantitative analysis using SPSS
Quantitative analysis using SPSSAlaa Sadik
 
Introduction to spss
Introduction to spssIntroduction to spss
Introduction to spssSubodh Khanal
 
Decision tree for data mining and computer
Decision tree for data mining and computerDecision tree for data mining and computer
Decision tree for data mining and computertttiba
 
Lecture 9 - Decision Trees and Ensemble Methods, a lecture in subject module ...
Lecture 9 - Decision Trees and Ensemble Methods, a lecture in subject module ...Lecture 9 - Decision Trees and Ensemble Methods, a lecture in subject module ...
Lecture 9 - Decision Trees and Ensemble Methods, a lecture in subject module ...Maninda Edirisooriya
 
Data wrangling with dplyr
Data wrangling with dplyrData wrangling with dplyr
Data wrangling with dplyrC. Tobin Magle
 
Beginners SPSS.ppt
Beginners SPSS.pptBeginners SPSS.ppt
Beginners SPSS.pptsayahuwaina
 
Basic risk management facilitation methods
Basic risk management facilitation methodsBasic risk management facilitation methods
Basic risk management facilitation methodsRutuja Arlekar
 
An introduction to spss
An introduction to spssAn introduction to spss
An introduction to spsszeeshanwrch
 
Inferential statistics quantitative data - single sample and 2 groups
Inferential statistics   quantitative data - single sample and 2 groupsInferential statistics   quantitative data - single sample and 2 groups
Inferential statistics quantitative data - single sample and 2 groupsDhritiman Chakrabarti
 
Discriminant analysis using spss
Discriminant analysis using spssDiscriminant analysis using spss
Discriminant analysis using spssDr Nisha Arora
 

Similar to 3. chapter iii(aggregate data) (20)

4. chapter iv(transform)
4. chapter iv(transform)4. chapter iv(transform)
4. chapter iv(transform)
 
1. chapter i(pasw)
1. chapter i(pasw)1. chapter i(pasw)
1. chapter i(pasw)
 
Data exploration validation and sanitization
Data exploration validation and sanitizationData exploration validation and sanitization
Data exploration validation and sanitization
 
InnerSoft STATS - Analyze
InnerSoft STATS - AnalyzeInnerSoft STATS - Analyze
InnerSoft STATS - Analyze
 
04 Classification in Data Mining
04 Classification in Data Mining04 Classification in Data Mining
04 Classification in Data Mining
 
Quantitative analysis using SPSS
Quantitative analysis using SPSSQuantitative analysis using SPSS
Quantitative analysis using SPSS
 
Introduction to spss
Introduction to spssIntroduction to spss
Introduction to spss
 
XL-MINER:Data Utilities
XL-MINER:Data UtilitiesXL-MINER:Data Utilities
XL-MINER:Data Utilities
 
Decision tree for data mining and computer
Decision tree for data mining and computerDecision tree for data mining and computer
Decision tree for data mining and computer
 
Lecture 9 - Decision Trees and Ensemble Methods, a lecture in subject module ...
Lecture 9 - Decision Trees and Ensemble Methods, a lecture in subject module ...Lecture 9 - Decision Trees and Ensemble Methods, a lecture in subject module ...
Lecture 9 - Decision Trees and Ensemble Methods, a lecture in subject module ...
 
Data wrangling with dplyr
Data wrangling with dplyrData wrangling with dplyr
Data wrangling with dplyr
 
Beginners SPSS.ppt
Beginners SPSS.pptBeginners SPSS.ppt
Beginners SPSS.ppt
 
Basic risk management facilitation methods
Basic risk management facilitation methodsBasic risk management facilitation methods
Basic risk management facilitation methods
 
An introduction to spss
An introduction to spssAn introduction to spss
An introduction to spss
 
Inferential statistics quantitative data - single sample and 2 groups
Inferential statistics   quantitative data - single sample and 2 groupsInferential statistics   quantitative data - single sample and 2 groups
Inferential statistics quantitative data - single sample and 2 groups
 
Discriminant analysis using spss
Discriminant analysis using spssDiscriminant analysis using spss
Discriminant analysis using spss
 
trs-3.ppt
trs-3.ppttrs-3.ppt
trs-3.ppt
 
trs-3.ppt
trs-3.ppttrs-3.ppt
trs-3.ppt
 
trs-3.ppt
trs-3.ppttrs-3.ppt
trs-3.ppt
 
trs-3.ppt
trs-3.ppttrs-3.ppt
trs-3.ppt
 

More from Chhom Karath

More from Chhom Karath (20)

set1.pdf
set1.pdfset1.pdf
set1.pdf
 
Set1.pptx
Set1.pptxSet1.pptx
Set1.pptx
 
orthodontic patient education.pdf
orthodontic patient education.pdforthodontic patient education.pdf
orthodontic patient education.pdf
 
New ton 3.pdf
New ton 3.pdfNew ton 3.pdf
New ton 3.pdf
 
ច្បាប់ញូតុនទី៣.pptx
ច្បាប់ញូតុនទី៣.pptxច្បាប់ញូតុនទី៣.pptx
ច្បាប់ញូតុនទី៣.pptx
 
Control tipping.pptx
Control tipping.pptxControl tipping.pptx
Control tipping.pptx
 
Bulbous loop.pptx
Bulbous loop.pptxBulbous loop.pptx
Bulbous loop.pptx
 
brush teeth.pptx
brush teeth.pptxbrush teeth.pptx
brush teeth.pptx
 
bracket size.pptx
bracket size.pptxbracket size.pptx
bracket size.pptx
 
arch form KORI copy.pptx
arch form KORI copy.pptxarch form KORI copy.pptx
arch form KORI copy.pptx
 
Bracket size
Bracket sizeBracket size
Bracket size
 
Couple
CoupleCouple
Couple
 
ច្បាប់ញូតុនទី៣
ច្បាប់ញូតុនទី៣ច្បាប់ញូតុនទី៣
ច្បាប់ញូតុនទី៣
 
Game1
Game1Game1
Game1
 
Shoe horn loop
Shoe horn loopShoe horn loop
Shoe horn loop
 
Opus loop
Opus loopOpus loop
Opus loop
 
V bend
V bendV bend
V bend
 
Closing loop
Closing loopClosing loop
Closing loop
 
Maxillary arch form
Maxillary arch formMaxillary arch form
Maxillary arch form
 
Front face analysis
Front face analysisFront face analysis
Front face analysis
 

Recently uploaded

Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Disha Kariya
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...Pooja Nehwal
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
The byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxThe byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxShobhayan Kirtania
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 

Recently uploaded (20)

Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and Actinides
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
The byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxThe byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptx
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 

3. chapter iii(aggregate data)

  • 2. Aggregate Data • Aggregate Data aggregates groups of cases in the active dataset into single cases and creates a new, aggregated file or creates new variables in the active dataset that contain aggregated data. Cases are aggregated based on the value of zero or more break (grouping) variables. If no break variables are specified, then the entire dataset is a single break group. – If you create a new, aggregated data file, the new data file contains one case for each group defined by the break variables. For example, if there is one break variable with two values, the new data file will contain only two cases. If no break variable is specified, the new data file will contain one case. – If you add aggregate variables to the active dataset, the data file itself is not aggregated. Each case with the same value(s) of the break variable(s) receives the same values for the new aggregate variables. For example, if gender is the only break variable, all males would receive the same value for a new aggregate variable that represents average age. If no break variable is specified, all cases would receive the same value for a new aggregate variable that represents average age.
  • 3. Aggregate Data • Break Variable(s). Cases are grouped together based on the values of the break variables. Each unique combination of break variable values defines a group. When creating a new, aggregated data file, all break variables are saved in the new file with their existing names and dictionary information. The break variable, if specified, can be either numeric or string. • Aggregated Variables. Source variables are used with aggregate functions to create new aggregate variables. The aggregate variable name is followed by an optional variable label, the name of the aggregate function, and the source variable name in parentheses.
  • 4. Aggregate Data • Aggregate Data: Aggregate Function • This dialog box specifies the function to use to calculate aggregated data values for selected variables on the Aggregate Variables list in the Aggregate Data dialog box. Aggregate functions include: • Summary functions for numeric variables, including mean, median, standard deviation, and sum • Number of cases, including unweighted, weighted, nonmissing, and missing • Percentage or fraction of values above or below a specified value • Percentage or fraction of values inside or outside of a specified range • Aggregate Data: Variable Name and Label • Aggregate Data assigns default variable names for the aggregated variables in the new data file. This dialog box enables you to change the variable name for the selected variable on the Aggregate Variables list and provide a descriptive variable label. See the topicVariable names for more information.
  • 6.
  • 7.
  • 8. SELECT Case • Select Cases provides several methods for selecting a subgroup of cases based on criteria that include variables and complex expressions. You can also select a random sample of cases. The criteria used to define a subgroup can include: • • Variable values and ranges • • Date and time ranges • • Case (row) numbers • • Arithmetic expressions • • Logical expressions • • Functions
  • 9. SELECT Case • Output • This section controls the treatment of unselected cases. You can choose one of the following alternatives for the treatment of unselected cases: • • Filter out unselected cases. Unselected cases are not included in the analysis but remain in the dataset. You can use the unselected cases later in the session if you turn filtering off. If you select a random sample or if you select cases based on a conditional expression, this generates a variable named filter_$ with a value of 1 for selected cases and a value of 0 for unselected cases. • Copy selected cases to a new dataset. Selected cases are copied to a new dataset, leaving the original dataset unaffected. Unselected cases are not included in the new dataset and are left in their original state in the original dataset. • Delete unselected cases. Unselected cases are deleted from the dataset. Deleted cases can be recovered only by exiting from the file without saving any changes and then reopening the file. The deletion of cases is permanent if you save the changes to the data file.
  • 10. SELECT Case:if • This dialog box allows you to select subsets of cases using conditional expressions. A conditional expression returns a value of true,false, or missing for each case. • • If the result of a conditional expression is true, the case is included in the selected subset. • • If the result of a conditional expression is false or missing, the case is not included in the selected subset. • • Most conditional expressions use one or more of the six relational operators (<, >, <=, >=, =, and ~=) on the calculator pad. • • Conditional expressions can include variable names, constants, arithmetic operators, numeric (and other) functions, logical variables, and relational operators.
  • 13.
  • 14. Select Cases: Random Sample • This dialog box allows you to select a random sample based on an approximate percentage or an exact number of cases. Sampling is performed without replacement; so, the same case cannot be selected more than once. • Approximately. Generates a random sample of approximately the specified percentage of cases. Since this routine makes an independent pseudo- random decision for each case, the percentage of cases selected can only approximate the specified percentage. The more cases there are in the data file, the closer the percentage of cases selected is to the specified percentage. • Exactly. A user-specified number of cases. You must also specify the number of cases from which to generate the sample. This second number should be less than or equal to the total number of cases in the data file. If the number exceeds the total number of cases in the data file, the sample will contain proportionally fewer cases than the requested number.
  • 16.
  • 17. Select Cases: Range • This dialog box selects cases based on a range of case numbers or a range of dates or times. • • Case ranges are based on row number as displayed in the Data Editor. • • Date and time ranges are available only for time series data with defined date variables (Data menu, Define Dates).
  • 18.
  • 19.
  • 20. Filter Variable • Select only Missing value and zero value
  • 21.
  • 22. Weight Cases • Weight Cases gives cases different weights (by simulated replication) for statistical analysis. • • The values of the weighting variable should indicate the number of observations represented by single cases in your data file. • • Cases with zero, negative, or missing values for the weighting variable are excluded from analysis. • • Fractional values are valid and some procedures, such as Frequencies, Crosstabs, and Custom Tables, will use fractional weight values. However, most procedures treat the weighting variable as a replication weight and will simply round fractional weights to the nearest integer. Some procedures ignore the weighting variable completely, and this limitation is noted in the procedure- specific documentation. • Once you apply a weight variable, it remains in effect until you select another weight variable or turn off weighting. If you save a weighted data file, weighting information is saved with the data file. You can turn off weighting at any time, even after the file has been saved in weighted form. • Weights in Crosstabs. The Crosstabs procedure has several options for handling case weights. See the topic Crosstabs cell display for more information. • Weights in scatterplots and histograms. Scatterplots and histograms have an option for turning case weights on and off, but this does not affect cases with a zero, negative, or missing value for the weight variable. These cases remain excluded from the chart even if you turn weighting off from within the chart.
  • 24. Transpose • Transpose creates a new data file in which the rows and columns in the original data file are transposed so that cases (rows) become variables and variables (columns) become cases. Transpose automatically creates new variable names and displays a list of the new variable names. – A new string variable that contains the original variable name, case_lbl, is automatically created. – If the active dataset contains an ID or name variable with unique values, you can use it as the name variable, and its values will be used as variable names in the transposed data file. If it is a numeric variable, the variable names start with the letter V, followed by the numeric value. – User-missing values are converted to the system-missing value in the transposed data file. To retain any of these values, change the definition of missing values in the Variable View in the Data Editor.
  • 25.