- The document discusses conducting a discriminant analysis on a bank loan dataset to predict whether borrowers will default or not based on characteristics like employment, address, and age.
- The discriminant analysis will group borrowers into defaulters and non-defaulters and examine the means, standard deviations, and correlations between the predictor variables for each group.
- The analysis will also produce statistics like univariate Fs, box's M test, discriminant function coefficients, and classification tables to help determine which variables best predict group membership and how accurately groups can be predicted.
This presentation briefly will elaborate how IKEA has adopting Porter's Five Forces and Value Chain Analysis in order to maintain its competitive edges over its rivals in furniture market all over the globe by providing good quality furniture at a lower price tag. Hence by bringing in innovative design, improved functionality, low cost operating expenditures and offering excellent quality at lower prices, IKEA's has proved to be a success.
Wal-Mart has traditionally pursued a strategy of low costs, high volume, and customer satisfaction. It achieves this through activities like efficient distribution centers, strong supplier partnerships, advanced data analytics, a customer-oriented workforce culture, and an everyday low pricing model. These activities translate to competitive advantages like lower operating costs than competitors and higher sales per square foot in stores. While some advantages like data analytics may be replicated over time, Wal-Mart's massive scale of operations makes many of its advantages like distribution networks and supplier relationships very difficult for competitors to match in the United States. However, transferring these advantages to new store formats and international markets may be more challenging due to differences in customer demand, real estate costs, and other factors.
Nestle Lanka PLC manufactures and distributes food and beverage products in Sri Lanka. Their Maggi instant noodle brand holds 45-50% of the noodle market share. Between 2008-2013, Maggi noodle revenue grew by 15% annually on average. The company faces competition from Prima and Alli noodle brands. To increase market share, Nestle will reposition Maggi as a family brand and launch new product lines fortified with vitamins and minerals while utilizing competitive pricing and widespread distribution. The marketing objectives are to increase Maggi's market share to 50-60% and brand awareness by 10% through new product launches, upgrading existing lines, and promotional activities over the next year.
The document discusses the development and marketing plan for Metabical, a new weight loss drug launching in January 2009. It outlines the target consumer as overweight individuals with a BMI of 25-30, and describes a marketing strategy focusing on advertising to consumers and healthcare providers, promotions, public relations events, and a sales force visiting doctors' offices. The plan aims to establish Metabical as a safe and effective short-term treatment option supported by a lifestyle program.
The document discusses Metabical's demand forecasting, packaging, and pricing strategy. It presents three approaches to demand forecasting, with Approach 2 forecasting the highest demand. It recommends packaging Metabical in 12-week blister packs with day-of-the-week labeling. For pricing, it considers three options and selects $125 for a 4-week supply, as this prices Metabical above similar products to signal its value as a prescription drug, while still achieving strong demand and financial returns.
AmazonFresh is a grocery delivery service owned by Amazon that is available in some US states, London, Tokyo, Berlin, Hamburg, and Munich. It offers same-day or next-day delivery of grocery items from its stores. While online grocery faced challenges after the failures of Webvan and HomeGrocer, AmazonFresh aimed to address issues like long lead times, the need for its own logistics infrastructure, and customer density through solutions like owning warehouses and delivery fleets. It focused on expanding conveniently while refining its business model and expanding carefully to a second major city.
Cesim Global Challenge International Business Simulation and Strategy Game Gu...Cesim Business Simulations
Cesim Global Challenge is an international business and strategy management simulation game used in higher education institution level courses and corporate training programs to educate better business decision makers.
Find out more here: https://www.cesim.com/simulations/cesim-global-challenge-international-business-strategy-simulation-game
This presentation briefly will elaborate how IKEA has adopting Porter's Five Forces and Value Chain Analysis in order to maintain its competitive edges over its rivals in furniture market all over the globe by providing good quality furniture at a lower price tag. Hence by bringing in innovative design, improved functionality, low cost operating expenditures and offering excellent quality at lower prices, IKEA's has proved to be a success.
Wal-Mart has traditionally pursued a strategy of low costs, high volume, and customer satisfaction. It achieves this through activities like efficient distribution centers, strong supplier partnerships, advanced data analytics, a customer-oriented workforce culture, and an everyday low pricing model. These activities translate to competitive advantages like lower operating costs than competitors and higher sales per square foot in stores. While some advantages like data analytics may be replicated over time, Wal-Mart's massive scale of operations makes many of its advantages like distribution networks and supplier relationships very difficult for competitors to match in the United States. However, transferring these advantages to new store formats and international markets may be more challenging due to differences in customer demand, real estate costs, and other factors.
Nestle Lanka PLC manufactures and distributes food and beverage products in Sri Lanka. Their Maggi instant noodle brand holds 45-50% of the noodle market share. Between 2008-2013, Maggi noodle revenue grew by 15% annually on average. The company faces competition from Prima and Alli noodle brands. To increase market share, Nestle will reposition Maggi as a family brand and launch new product lines fortified with vitamins and minerals while utilizing competitive pricing and widespread distribution. The marketing objectives are to increase Maggi's market share to 50-60% and brand awareness by 10% through new product launches, upgrading existing lines, and promotional activities over the next year.
The document discusses the development and marketing plan for Metabical, a new weight loss drug launching in January 2009. It outlines the target consumer as overweight individuals with a BMI of 25-30, and describes a marketing strategy focusing on advertising to consumers and healthcare providers, promotions, public relations events, and a sales force visiting doctors' offices. The plan aims to establish Metabical as a safe and effective short-term treatment option supported by a lifestyle program.
The document discusses Metabical's demand forecasting, packaging, and pricing strategy. It presents three approaches to demand forecasting, with Approach 2 forecasting the highest demand. It recommends packaging Metabical in 12-week blister packs with day-of-the-week labeling. For pricing, it considers three options and selects $125 for a 4-week supply, as this prices Metabical above similar products to signal its value as a prescription drug, while still achieving strong demand and financial returns.
AmazonFresh is a grocery delivery service owned by Amazon that is available in some US states, London, Tokyo, Berlin, Hamburg, and Munich. It offers same-day or next-day delivery of grocery items from its stores. While online grocery faced challenges after the failures of Webvan and HomeGrocer, AmazonFresh aimed to address issues like long lead times, the need for its own logistics infrastructure, and customer density through solutions like owning warehouses and delivery fleets. It focused on expanding conveniently while refining its business model and expanding carefully to a second major city.
Cesim Global Challenge International Business Simulation and Strategy Game Gu...Cesim Business Simulations
Cesim Global Challenge is an international business and strategy management simulation game used in higher education institution level courses and corporate training programs to educate better business decision makers.
Find out more here: https://www.cesim.com/simulations/cesim-global-challenge-international-business-strategy-simulation-game
The VP of marketing for PDB is deciding how to position Crescent Pure. There are two options - as an energy drink or sports drink. Market research shows the energy drink market is larger and growing faster. While sports drinks have less competition, their market is smaller. Perceptual maps also indicate Crescent could gain market share by positioning as a healthier energy drink. After analyzing customer segments, differentiation strategies, and perceptual maps, the document recommends positioning Crescent as an energy drink to target the larger, faster growing market.
This document provides an analysis of Apple including external factors like competitors and consumers, internal factors like history and leadership, and marketing theories. It examines Apple's positioning targeting consumers across generations and genders. SWOT and 4P analyses are presented. Theories around innovation diffusion, social judgment, and branding are applied. Apple's strong scores on customer satisfaction metrics are highlighted.
Comparison of Marketing Mix of IKEA in Four CountriesFatima Arshad
Report Contains Marketing Mix of IKEA. In this report there is comparison of 4Ps of IKEA in Four Countries i.e Sweden, UK, China and India.
This report is result of the efforts of four people.
J.C. Penney faced primary issues during the 2007 recession as consumer spending shrank. This was compounded by highly competitive discount retailers. A SWOT analysis showed brand recognition but also declining sales. The recommendation was to reach younger audiences through increased online presence and influencer marketing. Alternatives included returning to sales/markdown pricing or adjusting store locations. The process outlined working with beauty influencers through sponsored product reviews and haul videos.
Pepsi Lipton Brisk - Harvard Business Review CaseFamy
This document summarizes a case study regarding marketing options for PepsiCo's Brisk iced tea brand. It includes a SWOT analysis, segmentation of target markets as millennial and Hispanic males, and positioning of Brisk as an "edgy, cool" brand that "gives you energy." Two advertising options are considered: television advertising or a viral social media campaign. The document recommends a viral campaign titled "That's Brisk, Baby: Rebirth" that would involve creative characters called "The Brisk Brothers" and the tagline "Now That's Brisk."
- Dalda is a cooking oil produced by Lever Brothers Pakistan (now Unilever Pakistan) that has been on the market for 30 years.
- It targets middle to upper-income households and promotes the message "where there is motherhood, there is Dalda".
- Lever Brothers manages both external factors like economic conditions and internal factors like production facilities to influence Dalda's marketing. They gather information, plan strategically, implement plans, and evaluate results.
- Dalda faces competition from other cooking oils but has maintained its market share through effective pricing, promotion, distribution, and product strategies.
Strategic Management project on Johnson & Johnson Shobhita Dayal
This presentation talks about the complete framework of the strategic planning of the Johnson & Johnson company. All the topics of strategic management course is been covered, in this ppt starting from SWOT, Strategic Map, External Factor Analysis, and all other major strategic tools.
Harvard Business School Case Study on Mountain Man Brewing Company by Shashank Srivastava, IET Lucknow under the guidance of Prof. Sameer Mathur, IIM Lucknow.
CSP is considering options for pricing, packaging, and demand forecasting for its new weight-loss drug Metabical. Three demand forecasting models were analyzed estimating the potential market between 4.3-9.8 million customers. Packaging and pricing strategies were evaluated using a matrix to determine ROI under different scenarios. Pricing at $150 targeting the ideal customer profile was estimated to achieve a 5.73% ROI, meeting CSP's objective.
This document discusses Pemberton's launch of a new salty snack brand called Krispy Natural in the cracker market. It provides an overview of the US cracker industry and competitors. A marketing strategy is outlined which includes product, pricing, distribution and promotion plans. Test markets in Columbus and Southeast regions exceeded expectations in Columbus but saw limited growth in Southeast. The success in Columbus is attributed to promotional activities generating buzz, while resistance in Southeast may be due to discounts not being sustainable long term. Recommendations include further marketing in Southeast and tailoring the brand to different consumer needs.
Linear technology case analysis dividend payout policyHimanshu Gulia
it is a presentation on case analysis of the case dividend payout policy of linear technology and about its decision whether it should pay more dividend or keep it constant
Zara is a clothing brand known for fast fashion. It was founded in 1963 in Spain and opened its first store in 1975. Since then, Zara has expanded globally and now has over 2,000 stores in 96 countries. Zara's success is largely due to its ability to design and produce clothing in only two weeks in order to quickly respond to the latest fashion trends. It focuses on rapid production in small quantities, frequent store replenishments, and using its stores as a way to get customer feedback. Zara's core competencies include its vertical integration of design, production, and sales as well as its ability to quickly recreate fashion.
Atlantic Computers: A Bundle of Pricing OptionsJasmineDennis
The document discusses four pricing strategies for Atlantic Computers' new "Atlantic Bundle" product, which consists of their new Tronn server and PESA software. The strategies are: 1) status-quo pricing, 2) competition-based pricing, 3) cost-plus pricing, and 4) value-in-use pricing. After reviewing the strategies and conducting a break-even analysis, it is recommended to use value-in-use pricing of $4,200 per bundle. This captures the savings customers realize and has one of the lowest break-even points. Recommendations are also provided for training Atlantic's sales force to sell based on the bundle's value and savings. Potential reactions from main competitor Zink
Aqualisa Quartz - Simply A Better Shower (HBR Case Study)Arjun Parekh
The document discusses Aqualisa's Quartz shower valve which was intended to improve on existing shower technologies but struggled initially. It provides details on the UK shower market, Aqualisa's distribution channels, and the development of the Quartz valve. While the Quartz valve had technological advantages, plumbers were wary of innovation and it was priced too high. As a result, few units sold in the first few months through trade shops and showrooms.
Black & Decker is a leading power tool manufacturer that dominates the consumer and professional-industrial segments but only has 9% market share in the fast-growing professional-tradesmen segment. While the company is profitable, this presents a growth opportunity and risk to their leadership position if not addressed. A 5C analysis of Black & Decker looks at their strong brand image and product line in multiple segments, technology expertise from decades in the industry, and goal of leadership across all power tool markets, though they need to improve their image among professional tradesmen to surpass competitors like Makita.
Colgate Palmolive developed a new toothbrush called Colgate Precision and the product manager must recommend a marketing strategy. There is a conflict over the advertising budget between Colgate Precision and Colgate Plus. The product manager proposes progressively increasing the total budget and allocating 25% more to Precision each year. Analyzing pricing and sales projections, niche positioning earns $1.79 million in profit Year 1 and $19.96 million Year 2, showing increased volume leads to higher profits.
The Tata Group, an Indian multinational conglomerate, adopted a strategy of international expansion through global acquisitions under the leadership of Ratan Tata. As several Tata companies faced challenges from domestic market saturation and regulations in the 1990s, the group pursued acquisitions to diversify and achieve growth in foreign markets. Major Tata acquisitions included Tetley Tea, Corus Steel, Jaguar Land Rover, and several hotel brands. These global acquisitions transformed the Tata Group into one of the largest and most diverse international business groups in India.
Coda Coffee was founded in 2005 in Colorado with a goal of sustainable and socially responsible coffee wholesaling. Bext 360 was founded separately to use technology like machine vision, AI, blockchain and IoT to determine coffee quality. In a pilot project in Uganda, Bext 360's technology helped Coda Coffee and local coffee farmers by providing instant payment and supply chain transparency. This included quality details and premium payments for farmers accessible throughout the chain.
Magazine Protezione Civile - Anno 4 - n. 14 - gennaio-marzo 2014angerado
Magazine "Protezione Civile"
Il magazine che il Dipartimento della Protezione Civile rivolge a volontariato e istituzioni, componenti e strutture operative del Servizio Nazionale della Protezione Civile. Tra i temi di questo numero: emergenza e disabilità, il nuovo Meccanismo Unionale di Protezione Civile e il convegno "Protezione civile e social media: comunicare il rischio e il rischio di comunicare" a cui è dedicato l'inserto staccabile.
Il magazine è aperto a opinioni, segnalazioni di iniziative e richieste di chiarimenti da parte dei lettori. Per informazioni, richieste o invio di contributi scrivi a: magazine@protezionecivile.it
Este documento presenta el Proyecto Educativo Institucional (PEI) del Instituto Tecnológico Superior "Vicente Fierro" para el periodo 2012-2013. Incluye la visión, misión e ideario de la institución, así como datos informativos. También presenta el informe de autoevaluación institucional realizado entre mayo y julio de 2012, con la participación de la comunidad educativa.
The VP of marketing for PDB is deciding how to position Crescent Pure. There are two options - as an energy drink or sports drink. Market research shows the energy drink market is larger and growing faster. While sports drinks have less competition, their market is smaller. Perceptual maps also indicate Crescent could gain market share by positioning as a healthier energy drink. After analyzing customer segments, differentiation strategies, and perceptual maps, the document recommends positioning Crescent as an energy drink to target the larger, faster growing market.
This document provides an analysis of Apple including external factors like competitors and consumers, internal factors like history and leadership, and marketing theories. It examines Apple's positioning targeting consumers across generations and genders. SWOT and 4P analyses are presented. Theories around innovation diffusion, social judgment, and branding are applied. Apple's strong scores on customer satisfaction metrics are highlighted.
Comparison of Marketing Mix of IKEA in Four CountriesFatima Arshad
Report Contains Marketing Mix of IKEA. In this report there is comparison of 4Ps of IKEA in Four Countries i.e Sweden, UK, China and India.
This report is result of the efforts of four people.
J.C. Penney faced primary issues during the 2007 recession as consumer spending shrank. This was compounded by highly competitive discount retailers. A SWOT analysis showed brand recognition but also declining sales. The recommendation was to reach younger audiences through increased online presence and influencer marketing. Alternatives included returning to sales/markdown pricing or adjusting store locations. The process outlined working with beauty influencers through sponsored product reviews and haul videos.
Pepsi Lipton Brisk - Harvard Business Review CaseFamy
This document summarizes a case study regarding marketing options for PepsiCo's Brisk iced tea brand. It includes a SWOT analysis, segmentation of target markets as millennial and Hispanic males, and positioning of Brisk as an "edgy, cool" brand that "gives you energy." Two advertising options are considered: television advertising or a viral social media campaign. The document recommends a viral campaign titled "That's Brisk, Baby: Rebirth" that would involve creative characters called "The Brisk Brothers" and the tagline "Now That's Brisk."
- Dalda is a cooking oil produced by Lever Brothers Pakistan (now Unilever Pakistan) that has been on the market for 30 years.
- It targets middle to upper-income households and promotes the message "where there is motherhood, there is Dalda".
- Lever Brothers manages both external factors like economic conditions and internal factors like production facilities to influence Dalda's marketing. They gather information, plan strategically, implement plans, and evaluate results.
- Dalda faces competition from other cooking oils but has maintained its market share through effective pricing, promotion, distribution, and product strategies.
Strategic Management project on Johnson & Johnson Shobhita Dayal
This presentation talks about the complete framework of the strategic planning of the Johnson & Johnson company. All the topics of strategic management course is been covered, in this ppt starting from SWOT, Strategic Map, External Factor Analysis, and all other major strategic tools.
Harvard Business School Case Study on Mountain Man Brewing Company by Shashank Srivastava, IET Lucknow under the guidance of Prof. Sameer Mathur, IIM Lucknow.
CSP is considering options for pricing, packaging, and demand forecasting for its new weight-loss drug Metabical. Three demand forecasting models were analyzed estimating the potential market between 4.3-9.8 million customers. Packaging and pricing strategies were evaluated using a matrix to determine ROI under different scenarios. Pricing at $150 targeting the ideal customer profile was estimated to achieve a 5.73% ROI, meeting CSP's objective.
This document discusses Pemberton's launch of a new salty snack brand called Krispy Natural in the cracker market. It provides an overview of the US cracker industry and competitors. A marketing strategy is outlined which includes product, pricing, distribution and promotion plans. Test markets in Columbus and Southeast regions exceeded expectations in Columbus but saw limited growth in Southeast. The success in Columbus is attributed to promotional activities generating buzz, while resistance in Southeast may be due to discounts not being sustainable long term. Recommendations include further marketing in Southeast and tailoring the brand to different consumer needs.
Linear technology case analysis dividend payout policyHimanshu Gulia
it is a presentation on case analysis of the case dividend payout policy of linear technology and about its decision whether it should pay more dividend or keep it constant
Zara is a clothing brand known for fast fashion. It was founded in 1963 in Spain and opened its first store in 1975. Since then, Zara has expanded globally and now has over 2,000 stores in 96 countries. Zara's success is largely due to its ability to design and produce clothing in only two weeks in order to quickly respond to the latest fashion trends. It focuses on rapid production in small quantities, frequent store replenishments, and using its stores as a way to get customer feedback. Zara's core competencies include its vertical integration of design, production, and sales as well as its ability to quickly recreate fashion.
Atlantic Computers: A Bundle of Pricing OptionsJasmineDennis
The document discusses four pricing strategies for Atlantic Computers' new "Atlantic Bundle" product, which consists of their new Tronn server and PESA software. The strategies are: 1) status-quo pricing, 2) competition-based pricing, 3) cost-plus pricing, and 4) value-in-use pricing. After reviewing the strategies and conducting a break-even analysis, it is recommended to use value-in-use pricing of $4,200 per bundle. This captures the savings customers realize and has one of the lowest break-even points. Recommendations are also provided for training Atlantic's sales force to sell based on the bundle's value and savings. Potential reactions from main competitor Zink
Aqualisa Quartz - Simply A Better Shower (HBR Case Study)Arjun Parekh
The document discusses Aqualisa's Quartz shower valve which was intended to improve on existing shower technologies but struggled initially. It provides details on the UK shower market, Aqualisa's distribution channels, and the development of the Quartz valve. While the Quartz valve had technological advantages, plumbers were wary of innovation and it was priced too high. As a result, few units sold in the first few months through trade shops and showrooms.
Black & Decker is a leading power tool manufacturer that dominates the consumer and professional-industrial segments but only has 9% market share in the fast-growing professional-tradesmen segment. While the company is profitable, this presents a growth opportunity and risk to their leadership position if not addressed. A 5C analysis of Black & Decker looks at their strong brand image and product line in multiple segments, technology expertise from decades in the industry, and goal of leadership across all power tool markets, though they need to improve their image among professional tradesmen to surpass competitors like Makita.
Colgate Palmolive developed a new toothbrush called Colgate Precision and the product manager must recommend a marketing strategy. There is a conflict over the advertising budget between Colgate Precision and Colgate Plus. The product manager proposes progressively increasing the total budget and allocating 25% more to Precision each year. Analyzing pricing and sales projections, niche positioning earns $1.79 million in profit Year 1 and $19.96 million Year 2, showing increased volume leads to higher profits.
The Tata Group, an Indian multinational conglomerate, adopted a strategy of international expansion through global acquisitions under the leadership of Ratan Tata. As several Tata companies faced challenges from domestic market saturation and regulations in the 1990s, the group pursued acquisitions to diversify and achieve growth in foreign markets. Major Tata acquisitions included Tetley Tea, Corus Steel, Jaguar Land Rover, and several hotel brands. These global acquisitions transformed the Tata Group into one of the largest and most diverse international business groups in India.
Coda Coffee was founded in 2005 in Colorado with a goal of sustainable and socially responsible coffee wholesaling. Bext 360 was founded separately to use technology like machine vision, AI, blockchain and IoT to determine coffee quality. In a pilot project in Uganda, Bext 360's technology helped Coda Coffee and local coffee farmers by providing instant payment and supply chain transparency. This included quality details and premium payments for farmers accessible throughout the chain.
Magazine Protezione Civile - Anno 4 - n. 14 - gennaio-marzo 2014angerado
Magazine "Protezione Civile"
Il magazine che il Dipartimento della Protezione Civile rivolge a volontariato e istituzioni, componenti e strutture operative del Servizio Nazionale della Protezione Civile. Tra i temi di questo numero: emergenza e disabilità, il nuovo Meccanismo Unionale di Protezione Civile e il convegno "Protezione civile e social media: comunicare il rischio e il rischio di comunicare" a cui è dedicato l'inserto staccabile.
Il magazine è aperto a opinioni, segnalazioni di iniziative e richieste di chiarimenti da parte dei lettori. Per informazioni, richieste o invio di contributi scrivi a: magazine@protezionecivile.it
Este documento presenta el Proyecto Educativo Institucional (PEI) del Instituto Tecnológico Superior "Vicente Fierro" para el periodo 2012-2013. Incluye la visión, misión e ideario de la institución, así como datos informativos. También presenta el informe de autoevaluación institucional realizado entre mayo y julio de 2012, con la participación de la comunidad educativa.
Este documento describe los principales riesgos biológicos en el trabajo y medidas para controlarlos. Explica que los riesgos biológicos incluyen exposición a bacterias, virus, parásitos y hongos que pueden causar enfermedades. También clasifica los agentes biológicos según su nivel de riesgo y describe trabajos con exposición potencial como personal médico, agricultores y mineros. Finalmente, resume las principales medidas de control como controles de ingeniería, equipo de protección, vacunación y buenas prá
Este documento describe el proceso administrativo y sus cuatro etapas fundamentales: planeación, organización, dirección y control. Explica que la planeación implica establecer objetivos y estrategias, la organización implica dividir el trabajo y establecer una estructura, la dirección implica guiar los esfuerzos hacia los objetivos a través de la motivación y supervisión, y el control implica medir el desempeño y realizar correcciones.
Este documento presenta una introducción a un libro sobre feminismo dirigido a personas que no están familiarizadas o que son escépticas sobre el feminismo. La autora argumenta que muchas cosas que hoy se consideran normales, como el derecho al voto de la mujer y recibir el mismo salario por el mismo trabajo, se deben al movimiento feminista. Proporciona una lista de 17 logros concretos del feminismo que hoy en día damos por sentado. El objetivo es mostrar que el feminismo ha traído muchos beneficios tangibles a la vida de las mujeres y que
Este documento contiene artículos relacionados con las medidas cautelares en procesos judiciales civiles. Establece que el juez competente para dictar medidas cautelares es aquel que conocerá de la demanda. Las medidas cautelares tienen la finalidad de garantizar el cumplimiento de la decisión definitiva. También regula aspectos como los requisitos de la solicitud de medidas cautelares, el contenido de las decisiones cautelares, las características y efectos de las medidas cautelares, así como responsabilidades y excepciones relacionadas con este tipo de
Este documento presenta el Reglamento del Código Fiscal de la Federación de México. Establece definiciones de términos como autoridades fiscales. Detalla los procedimientos para presentar declaraciones y avisos ante las autoridades fiscales. Explica también los procedimientos para realizar avalúos de bienes para efectos fiscales y pagos de contribuciones.
Este documento define la prueba en materia penal como la actividad tendente a verificar la exactitud o inexactitud de las afirmaciones realizadas por las partes para establecer si coinciden con la realidad. Explica que la prueba tiene un carácter objetivo al proveer información al juez y un carácter subjetivo al crear convicción en su mente. Además, destaca la importancia de la prueba para tomar decisiones justas y la necesidad de que sea obtenida de forma legal.
Este documento presenta la guía oficial del Premio Jalisco a la Calidad Edición 2009. Explica la filosofía, estructura, proceso de evaluación y modelo de gestión del premio. El objetivo es estimular el desarrollo competitivo y sustentable de las organizaciones en Jalisco a través de la adopción de un modelo de gestión de clase mundial basado en la mejora continua. El proceso evalúa a las organizaciones participantes en 8 criterios a través de reportes, visitas y entrevistas para reconocer a las 20 organizaciones más sob
Este documento presenta un resumen de los conceptos clave de la planificación de medios. Explica los cuatro parámetros de la planificación (producto, mercado, publicidad y medios), así como aspectos relacionados con cada uno. También describe variables importantes para analizar como el grupo objetivo, la ponderación, y tácticas comunes de medios. El objetivo general es proveer una introducción a los fundamentos teóricos y procesos involucrados en la planificación efectiva de una campaña de medios.
Este documento presenta un resumen de los 9 capítulos del libro "Ética para Amador" de Fernando Sabater. Cada capítulo explora conceptos éticos como la libertad individual, la responsabilidad de nuestras acciones, el bienestar propio y de los demás, y la importancia de la comunidad y la solidaridad. El documento enfatiza que para tener una buena vida, debemos tomar decisiones informadas que consideren las consecuencias para nosotros y para los demás.
Este documento describe el proceso de realizar un análisis cualitativo de riesgos, incluyendo su propósito y objetivos, factores críticos de éxito, herramientas y técnicas, y documentación de resultados. Explica cómo priorizar los riesgos según su probabilidad e impacto en los objetivos del proyecto para enfocar los esfuerzos de gestión de riesgos en las áreas de mayor preocupación.
El documento presenta un material didáctico elaborado por la Secretaría de Educación Pública de México para alumnos de tercer grado de primaria. El material contiene una serie de desafíos o actividades matemáticas organizadas en cinco bloques, con el objetivo de que los alumnos construyan procedimientos para resolverlos de manera individual o en equipo con apoyo del profesor. Se proporcionan detalles sobre las instituciones y personas involucradas en la elaboración del material.
Este documento presenta las normas de construcción de la administración pública del Distrito Federal. Incluye información sobre la planeación y anteproyectos de obras, requisitos para la elaboración de anteproyectos como la información básica necesaria, alcances del trabajo y coordinación con especialistas. También cubre la entrega y revisiones del anteproyecto antes de su aprobación como proyecto ejecutivo.
Este documento describe la importancia de la vinculación para el desarrollo de la infraestructura en México. Analiza modelos de vinculación nacionales e internacionales y presenta dos casos exitosos de vinculación en México: 1) el desarrollo rural a través de FIRCO y 2) la formación e investigación en infraestructura a través de la Alianza FiiDEM. El documento concluye que la vinculación entre el gobierno, la industria, las universidades y la sociedad es fundamental para aumentar la competitividad de la ingeniería de
Este documento describe los conceptos clave de la estructura organizacional. Explica que existen cinco mecanismos de coordinación (ajuste mutuo, supervisión directa, estandarización de procesos, estandarización de producción y estandarización de destrezas) y que la estructura se divide en la cumbre estratégica, núcleo operativo, línea media, tecnoestructura y staff de apoyo. También cubre los nueve parámetros de diseño que permiten lograr la división del trabajo y coordinación necesarias, incl
El documento presenta el análisis, diseño y programación de tres módulos (SMACOR, SGA y SADTEC) que forman parte de un Sistema de Gestión Tecnológica Hospitalaria asistido por computadoras. El sistema se prevé que se encuentre insertado en un Sistema de Información Hospitalaria siguiendo la norma internacional CEN/TC 251 12967-1 propuesta por la Comunidad Económica Europea. Los módulos siguen la filosofía cliente/servidor y se espera que el sistema se instale en la Red Nacional
Este documento trata sobre programación cuadrática y describe varios conceptos matemáticos como ecuaciones de circunferencias, elipses, hipérbolas y parábolas. Explica cómo reconocer cada curva a partir de su ecuación y resuelve seis ejercicios de minimización o maximización de funciones cuadráticas sujetas a restricciones lineales, encontrando en cada caso los valores óptimos de las variables.
Este documento presenta una lista de productos de diferentes marcas, incluyendo ropa, accesorios y calzado para niños y adultos. Para cada producto se indica el nombre, la marca, el precio normal y el precio en internet, además de un enlace para ver más detalles. La lista contiene más de 100 artículos de marcas como SanBelle, Green Sprouts, Danielle-Nicole, Monjitas Alamoda, Inspira, Jacinta Tienda, Xtrem, Secret by Saxoline, Estela Kids, Oakley y otras.
1) El proceso de control administrativo consta de 4 pasos: establecimiento de estándares de desempeño, medición del desempeño, comparación del desempeño con los estándares, y toma de medidas correctivas.
2) Los estándares son criterios que establecen un nivel deseado de desempeño y sirven para medir y evaluar el desempeño real.
3) El control efectivo requiere enfocarse en los factores críticos o puntos de control que reflejen el cumplimiento de las metas y objetivos.
A Simple Tutorial on Conjoint and Cluster AnalysisIterative Path
A simple tutorial to show conjoint analysis and cluster analysis. please send your feedback, this version is still rough and I would like to iteratively improve it so it is useful for most.
This document discusses the principles and methods of robust design. It defines robustness as a product or process being minimally affected by sources of variation. Taguchi methods are presented for designing products and processes that are robust, including parameter design to optimize the signal-to-noise ratio. An example applies Taguchi methods to the design of a Nerf missile to maximize firing distance while minimizing the impact of noise factors like user skill level. The results of the experiment are analyzed using Minitab to identify the optimal settings for the design parameters to achieve the most robust performance.
This document describes a student project to benchmark actual bond prices using a dataset from a Kaggle competition. The project aims to improve on an original pricing model by including trade-specific factors like whether a bond is callable, reporting delay, previous trade prices and sizes, and trade type. Initial analyses include descriptive statistics and histograms of key variables. Factor analysis identified two factors - "Lagged Prices" and "Trade Frequency" - that are included in partition tree and neural network models to predict price. Preliminary results found the models outperform the original but improvements are small, requiring a more rigorous model.
1 Exploratory Data Analysis (EDA) by Melvin Ott, PhD.docxhoney725342
1
Exploratory Data Analysis (EDA)
by Melvin Ott, PhD
September, 2017
Introduction
The Masters in Predictive Analytics program at Northwestern University offers
graduate courses that cover predictive modeling using several software products
such as SAS, R and Python. The Predict 410 course is one of the core courses and
this section focuses on using Python.
Predict 410 will follow a sequence in the assignments. The first assignment will ask
you to perform an EDA(See Ratner1 Chapters 1&2) for the Ames Housing Data
dataset to determine the best single variable model. It will be followed by an
assignment to expand to a multivariable model. Python software for boxplots,
scatterplots and more will help you identify the single variable. However, it is easy
to get lost in the programming and lose sight of the objective. Namely, which of
the variable choices best explain the variability in the response variable?
(You will need to be familiar with the data types and level of measurement. This
will be critical in determining the choice of when to use a dummy variable for model
building. If this topic is new to you review the definitions at Types of Data before
reading further.)
This report will help you become familiar with some of the tools for EDA and allow
you to interact with the data by using links to a software product, Shiny, that will
demonstrate and interact with you to produce various plots of the data. Shiny is
located on a cloud server and will allow you to make choices in looking at the plots
for the data. Study the plots carefully. This is your initial EDA tool and leads to
your model building and your overall understanding of predictive analytics.
Single Variable Linear Regression EDA
1. Become Familiar With the Data
2
Identify the variables that are categorical and the variables that are quantitative.
For the Ames Housing Data, you should review the Ames Data Description pdf file.
2. Look at Plots of the Data
For the variables that are quantitative, you should look at scatter plots vs the
response variable saleprice. For the categorical variables, look at boxplots vs
saleprice. You have sample Python code to help with the EDA and below are some
links that will demonstrate the relationships for the a different building_prices
dataset.
For the boxplots with Shiny:
Click here
For the scatterplots with Shiny:
Click here
3. Begin Writing Python Code
Start with the shell code and improve on the model provided.
http://melvin.shinyapps.io/SboxPlot
http://melvin.shinyapps.io/SScatter/
http://melvin.shinyapps.io/SScatter/
3
Single Variable Logistic Regression EDA
1. Become Familiar With the Data
In 411 you will have an introduction to logistic regression and again will ask you to
perform an EDA. See the file credit data for more info. Make sure you recognize
which variables are quantitative and which are catego ...
The document summarizes a workshop on deep learning that covered:
- Leadership is key to avoiding digitalization project failures and cannot be replaced by consultants.
- The workshop introduced deep learning concepts using a simplified credit risk dataset and code example for classification.
- The code example showed how to build and train a basic deep learning model to predict credit risk from sample data.
- Creating deep learning applications is accessible, but large datasets are needed and the internal workings remain difficult to interpret, so machine learning may be better for smaller datasets or needing interpretability.
This document discusses various techniques for data preprocessing, which is an important step in the knowledge discovery process. It describes why data preprocessing is necessary due to real-world data often being inconsistent, incomplete or noisy. Common techniques discussed include data cleaning, integration, transformation and reduction. Data cleaning aims to fill in missing values, smooth noisy data and correct inconsistencies. Normalization and discretization are forms of data transformation that put attribute values into appropriate ranges and reduce continuous attributes to discrete intervals. The goal of data preprocessing is to prepare the raw data into a format suitable for data mining algorithms to effectively discover useful knowledge.
The document discusses data modeling concepts in IBM Cognos Framework Manager including:
1) It introduces Framework Manager and discusses its benefits like creating a star schema for optimizing queries.
2) It outlines common traps to avoid in data modeling like chasm, fan, connection and transitive relationship traps.
3) It describes the Framework Manager workflow of creating a project, preparing metadata from data sources, modeling the business view, publishing packages and setting security controls.
An Interactive Introduction To R (Programming Language For Statistics)Dataspora
This is an interactive introduction to R.
R is an open source language for statistical computing, data analysis, and graphical visualization.
While most commonly used within academia, in fields such as computational biology and applied statistics, it is gaining currency in industry as well – both Facebook and Google use R within their firms.
This document provides an introduction to using ggplot in Python. It discusses installing ggplot and importing necessary packages. It then uses a diamonds dataset to demonstrate how to explore the data, prepare it for analysis, evaluate relationships between variables like price and carat size, and use facets to differentiate results. The goal is to help beginners understand basic ggplot functions and how to visualize and analyze data.
Cost optimization through simulation rev1David Panek
The document provides information from David A. Panek on using cost optimization techniques like simulation and cost analysis to improve negotiations and reduce prices. It discusses gathering industry data on cost elements, analyzing a company's costs as a percentage of sales, and summarizing the results of a simulation to compare a product's gross margin to industry averages. The analysis found potential savings of $550-$650 per unit by lowering the price based on a more accurate cost breakdown.
The document discusses various techniques for data pre-processing. It begins by explaining why pre-processing is important for obtaining clean and consistent data needed for quality data mining results. It then covers topics such as data cleaning, integration, transformation, reduction, and discretization. Data cleaning involves techniques for handling missing values, outliers, and inconsistencies. Data integration combines data from multiple sources. Transformation techniques include normalization, aggregation, and generalization. Data reduction aims to reduce data volume while maintaining analytical quality. Discretization includes binning of continuous variables.
The report describes the results of a Discrete Choice Experiment (a type of Conjoint-Analysis) to explore the potential configuration of a tablet computer from a new entrant to the category.
R markup code to create Regression ModelMohit Rajput
The document discusses analyzing a movie dataset to understand predictors of audience score. It loads necessary packages and the movie data. Exploratory data analysis is performed, including checking data structure, selecting relevant variables for modeling, removing missing data, summarizing variables, and graphically analyzing relationships. Runtime is found to have a weak positive linear relationship with audience score based on a scatterplot, while release year and month show no clear relationships. Correlation analysis will further examine relationships between predictors and the response variable, audience score.
The document discusses software development principles and concepts such as fixtures, coupling, cohesion, connascence, and contraconnascence. It defines these terms and provides examples to illustrate the concepts. The document advocates applying principles like the rule of locality and rule of degree to reduce strong forms of connascence and increase loose coupling between software elements.
Akuntansi Manajemen Edisi 8 oleh Hansen & Mowen Bab 3Dwi Wahyu
Materi Bab 3 Activity Cost and Behavior, Akuntansi Manajemen buku Hansen & Mowen Edisi 8. Presentasi powerpoint oleh Gail B. Wright, Professor Emeritus of Accounting, Bryant University
Informix physical database design for data warehousingKeshav Murthy
1. The document discusses physical database design for data warehousing, including star and snowflake schemas that are commonly used.
2. It provides examples of complex queries against a data warehouse schema that involve joins between large dimension and fact tables to retrieve aggregates and filter on dimension attributes.
3. The execution plan for a sample query joining multiple large tables in a data warehouse is shown, utilizing index scans, sequential scans, hash joins, and filters to efficiently evaluate the query.
This document discusses using an artificial neural network (ANN) to predict real estate prices. [1] It provides background on ANNs and describes collecting and preparing real estate transaction data from Hudson County, NJ to train and test an ANN model. [2] The author experiments with different ANN parameters and model configurations to optimize predictive accuracy, achieving a mean absolute error of 11.3% on test data. [3] The summary outlines the author's process for building and evaluating the ANN real estate price prediction model.
The document discusses relational algebra and tuple relational calculus. It describes relational algebra as a procedural query language that uses unary operations like select and project and binary operations like join and union to derive result relations from existing relations. It then explains tuple relational calculus as a nonprocedural query language where queries return tuples satisfying a predicate. Examples of queries using both languages on banking relations are provided to illustrate their usage.
Similar to Spss analysis conjoint_cluster_regression_pca_discriminant (20)
Arjun Sharma graduated with a B-Tech in biomedical engineering and an MBA in hospital administration. While he finds his job in a multi-specialty hospital exciting and enjoys not having homework, he feels confused and hesitant as he has yet to apply the concepts he learned in school to his real work. He thinks education should focus more on practical applications to help graduates like him connect their learning to their jobs.
This document contains a case study presentation about Harley Davidson India. It discusses Harley Davidson as a company, focusing on its strengths as a premium American brand with high quality products and customizations. It also examines weaknesses like an price-sensitive Indian consumer base and a sales model only through dealers. The document analyzes opportunities in the growing Indian cruiser bike market and luxury goods sector. It identifies threats from established competitors and stricter emission standards. The presentation provides suggestions to address issues like establishing an online customization platform, bolstering brand recognition through celebrity endorsements and events, and collaborating to improve service infrastructure and fuel availability.
Business rerearch survey_analysis__intention to use tablet p_cs among univer...Dev Karan Singh Maletia
This document outlines a study on university students' intention to use tablet PCs. It includes an introduction to the problem statement and objectives of the study, a literature review on innovation diffusion theory, and the methodology used. The methodology section describes the correlation research design, sample of university students, measurement of variables like relative advantage and intention to use, questionnaire design, and statistical analysis plan including reliability testing, descriptive analysis, and regression analysis. The goal is to understand the key factors influencing students' intention to use tablet PCs and the relationships between variables related to adoption of the technology.
This document provides an overview of human resource practices in the consulting industry. It discusses job analysis and design for consultants, which can include generalist or specialist roles. Performance appraisal in consulting aims to motivate employees and ensure their contributions align with business goals, while also helping employees understand their roles. Training is important given the diverse backgrounds of consultants and need to adapt to different client cultures. Job design is challenging with cross-domain, project-based work.
This document provides a detailed job analysis for a security personnel position. It outlines the qualifications required, including a high school diploma and experience as a security guard. It describes the key functions and duties such as patrolling, access control, and reporting crimes. It also lists the skills required, including the ability to provide customer service, handle situations effectively, and communicate clearly. Finally, it reviews compensation structures and common industries that employ security personnel.
Market basket analysis (MBA) is a data mining technique used to identify relationships between groups of products or items that are frequently purchased together. This article describes the advantages of MBA and how its adoption could help address divides between micro and macro research and science and practice in management research. The author explains what MBA is, how it can be applied to both archival and primary data, and its advantages over other analytic methods. However, limitations of MBA in management research are not discussed.
This document discusses how analytics can improve supply chain visibility in the steel industry. It begins by providing background on the global steel industry and some key statistics. It then discusses the need for advanced analytics, including data management, business intelligence, and predictive modeling. The benefits of analytics for the steel industry are outlined, such as margin improvement, better price negotiation, and improved mergers and acquisitions integration. Finally, it proposes a roadmap for implementing analytics that includes identifying high impact areas, developing a strategic approach, building a business case, and operationalizing the solution.
This document discusses how analytics can improve supply chain visibility in the steel industry. It begins by providing background on the global steel industry and production levels. It then discusses the need for advanced analytics, including data management, business intelligence, and predictive modeling. Benefits of analytics include improved cost management, regulatory compliance, and growth opportunities. The document proposes a scorecard method for evaluating analytics solutions and provides a roadmap for implementation. Finally, it discusses case studies of analytics applications in areas like product costs, customer insights, and capital investment decisions.
Best practices for project execution and deliveryCLIVE MINCHIN
A select set of project management best practices to keep your project on-track, on-cost and aligned to scope. Many firms have don't have the necessary skills, diligence, methods and oversight of their projects; this leads to slippage, higher costs and longer timeframes. Often firms have a history of projects that simply failed to move the needle. These best practices will help your firm avoid these pitfalls but they require fortitude to apply.
Storytelling is an incredibly valuable tool to share data and information. To get the most impact from stories there are a number of key ingredients. These are based on science and human nature. Using these elements in a story you can deliver information impactfully, ensure action and drive change.
Industrial Tech SW: Category Renewal and CreationChristian Dahlen
Every industrial revolution has created a new set of categories and a new set of players.
Multiple new technologies have emerged, but Samsara and C3.ai are only two companies which have gone public so far.
Manufacturing startups constitute the largest pipeline share of unicorns and IPO candidates in the SF Bay Area, and software startups dominate in Germany.
The Genesis of BriansClub.cm Famous Dark WEb PlatformSabaaSudozai
BriansClub.cm, a famous platform on the dark web, has become one of the most infamous carding marketplaces, specializing in the sale of stolen credit card data.
NIMA2024 | De toegevoegde waarde van DEI en ESG in campagnes | Nathalie Lam |...BBPMedia1
Nathalie zal delen hoe DEI en ESG een fundamentele rol kunnen spelen in je merkstrategie en je de juiste aansluiting kan creëren met je doelgroep. Door middel van voorbeelden en simpele handvatten toont ze hoe dit in jouw organisatie toegepast kan worden.
Zodiac Signs and Food Preferences_ What Your Sign Says About Your Tastemy Pandit
Know what your zodiac sign says about your taste in food! Explore how the 12 zodiac signs influence your culinary preferences with insights from MyPandit. Dive into astrology and flavors!
Top mailing list providers in the USA.pptxJeremyPeirce1
Discover the top mailing list providers in the USA, offering targeted lists, segmentation, and analytics to optimize your marketing campaigns and drive engagement.
𝐔𝐧𝐯𝐞𝐢𝐥 𝐭𝐡𝐞 𝐅𝐮𝐭𝐮𝐫𝐞 𝐨𝐟 𝐄𝐧𝐞𝐫𝐠𝐲 𝐄𝐟𝐟𝐢𝐜𝐢𝐞𝐧𝐜𝐲 𝐰𝐢𝐭𝐡 𝐍𝐄𝐖𝐍𝐓𝐈𝐃𝐄’𝐬 𝐋𝐚𝐭𝐞𝐬𝐭 𝐎𝐟𝐟𝐞𝐫𝐢𝐧𝐠𝐬
Explore the details in our newly released product manual, which showcases NEWNTIDE's advanced heat pump technologies. Delve into our energy-efficient and eco-friendly solutions tailored for diverse global markets.
Taurus Zodiac Sign: Unveiling the Traits, Dates, and Horoscope Insights of th...my Pandit
Dive into the steadfast world of the Taurus Zodiac Sign. Discover the grounded, stable, and logical nature of Taurus individuals, and explore their key personality traits, important dates, and horoscope insights. Learn how the determination and patience of the Taurus sign make them the rock-steady achievers and anchors of the zodiac.
SATTA MATKA SATTA FAST RESULT KALYAN TOP MATKA RESULT KALYAN SATTA MATKA FAST RESULT MILAN RATAN RAJDHANI MAIN BAZAR MATKA FAST TIPS RESULT MATKA CHART JODI CHART PANEL CHART FREE FIX GAME SATTAMATKA ! MATKA MOBI SATTA 143 spboss.in TOP NO1 RESULT FULL RATE MATKA ONLINE GAME PLAY BY APP SPBOSS
The Steadfast and Reliable Bull: Taurus Zodiac Signmy Pandit
Explore the steadfast and reliable nature of the Taurus Zodiac Sign. Discover the personality traits, key dates, and horoscope insights that define the determined and practical Taurus, and learn how their grounded nature makes them the anchor of the zodiac.
The APCO Geopolitical Radar - Q3 2024 The Global Operating Environment for Bu...APCO
The Radar reflects input from APCO’s teams located around the world. It distils a host of interconnected events and trends into insights to inform operational and strategic decisions. Issues covered in this edition include:
1. Conjoint Analysis :
Conjoint Analysis is a marketing research technique designed to help determine preferences of
customers. It is used to analyse how customers value different attributes of a product ( or service)
and thus gives an insight into the trade-offs they are to make among the various attributes. To put
simply, it tells how much each feature of a product is worth to the consumers.
This study includes surveying people with a certain set of attribute combinations which the survey-
takers rank or provide preferences. Analysis will be done to model the customer preferences for
different combination of attributes. The attributes are termed factors and the different values are
levels.
In the example that we have taken to use Conjoint Analysis through the tool SPSS, we have analysed
data on carpet, taking attributes like Price, Brand, Money-return, Package design and Seal as the
attributes based on which the consumers give prefernces. Using two data sets, we calculate the part
worths and decide on the weightage of each of the attributes that the users have provided.
Variable name Variable label Value label
package package design A*, B*, C*
brand brand name K2R, Glory, Bissell
price price $1.19, $1.39, $1.59
seal Good Housekeeping seal no, yes
money money-back guarantee no, yes
Code to fetch import the data and analyse :
GET
FILE='C:UsersAbhiDesktopcarpet_plan.sav'.
DATASET NAME DataSet1 WINDOW=FRONT.
GET
FILE='C:UsersAbhiDesktopcarpet_prefs.sav'.
DATASET NAME DataSet2 WINDOW=FRONT.
CONJOINT PLAN='C:UsersAbhiDesktopcarpet_plan.sav'
/DATA='C:UsersAbhiDesktopcarpet_prefs.sav'
/SEQUENCE=PREF1 PREF2 PREF3 PREF4 PREF5 PREF6 PREF7 PREF8 PREF9 PREF10 PR
EF11 PREF12 PREF13 PREF14 PREF15 PREF16 PREF17 PREF18 P
REF19 PREF20 PREF21 PREF22
/SUBJECT=ID
/FACTORS=PACKAGE BRAND (DISCRETE)
PRICE (LINEAR LESS)
SEAL (LINEAR MORE) MONEY (LINEAR MORE)
/PRINT=SUMMARYONLY.
2. Model Description
Relation to Ranks
N of Levels or Scores
package 3 Discrete
brand 3 Discrete
price 3 Linear (less)
seal 2 Linear (more)
money 2 Linear (more)
Calculation of the part-worth of each attribute
Utilities
Utility Estimate Std. Error
package A* -2.233 .192
B* 1.867 .192
C* .367 .192
brand K2R .367 .192
Glory -.350 .192
Bissell -.017 .192
price $1.19 -6.595 .988
$1.39 -7.703 1.154
$1.59 -8.811 1.320
seal no 2.000 .287
yes 4.000 .575
money no 1.250 .287
3. yes 2.500 .575
(Constant) 12.870 1.282
This table shows the utility (part-worth) scores and their standard errors for each factor level. Higher
utility values indicate greater preference. We can see that the value of the part worths are such that,
for each attribute if part-worths are added for different levels, it sums up to zero. Thus with respect
to brand Glory and Bisell, K2R is preferred more. As expected, there is an inverse relationship
between price and utility, with higher prices corresponding to lower utility. The presence of a seal of
approval or money-back guarantee corresponds to a higher utility.Also, total utility of a combination
can be calculated as :
If the cleaner had package design C*, brand Bissell, price $1.59, a seal of approval, and a money -back
guarantee, the total utility would be:
0.367 + (−0.017) + (−8.811) + 4.000 + 2.500 + 12.870 = 10.909.
Importance:
Importance Values
package 35.635
brand 14.911
price 29.410
seal 11.172
money 8.872
We can see that attributes package has most importance followed by price. Money return is of least
concern for the consumer. The values are computed by taking the utility range for each factor
separately and dividing by the sum of the utility ranges for al l factors. The values thus represent
percentages and have the property that they sum to 100.
4. Coefficients
B Coefficient
Estimate
price -5.542
seal 2.000
money 1.250
The utility for a particular factor level is determined by multiplying the level by the coefficient. For
example, the predicted utility for a price of $1.19 was listed as −6.595 in the utilities table. This is
simply the value of the price level, 1.19, multiplied by the price coefficient, −5.542.
This table provides measures of the correlation between the observed and estimated preferences.
Preference Scores of
Simulations a
Card
Number ID Score
1 1 10.258
2 2 14.292
5. The real power of conjoint analysis is the ability to predict preference for product profiles that
weren't rated by the subjects. These are referred to as simulation cases.
b
Preference Probabilities of Simulations
Card Bradley-Terry-
a
Number ID Maximum Utility Luce Logit
1 1 30.0% 43.1% 30.9%
2 2 70.0% 56.9% 69.1%
The maximum utility model determines the probability as the number
of respondents predicted to choose the profile divided by the total
number of respondents. For each respondent, the predicted choice is
simply the profile with the largest total utility.
Number of Reversals
Factor price 3
money 2
seal 2
brand 0
package 0
Subject 1 Subject 1 1
2 Subject 2 2
3 Subject 3 0
4 Subject 4 0
5 Subject 5 0
6 Subject 6 1
6. 7 Subject 7 0
8 Subject 8 0
9 Subject 9 1
10 Subject 10 2
This table displays the number of reversals for each factor and for each subject. For example, three
subjects showed a reversal for price. That is, they preferred product profiles with higher prices.
Reversal Summary
N of
Revers
als N of Subjects
1 3
2 2
7. Q. Perform Discriminant Analysis on the given dataset.
The dataset chosen contains statistics on set of people who have been given bank loans & have defaulted or not defaulted with their various characteristics.
Discriminant
Notes
Output Created 04-Apr-2013 18:39:05
p{color:black;font-family:sans-serif;font-size:10pt;font-
Comments weight:normal}
Input Data E:VGSOMSTUDYSECOND Your trial period for SPSS for Windows will expire in 14 da
SEMBRMSPSS16Samplesbanklo ys.p{color:0;font-family:Monospaced;font-size:13pt;font-
style:normal;font-weight:normal;text-decoration:none}
an.sav GET
Active Dataset DataSet1 FILE='E:VGSOMSTUDYSECOND SEMBRMSPSS16Samplesbanklo
an.sav'.
File Label Bank Loan Default DATASET NAME DataSet1 WINDOW=FRONT.
Filter <none> DISCRIMINANT
/GROUPS=default(0 1)
Weight <none> /VARIABLES=employ address age
Split File <none> /ANALYSIS ALL
/PRIORS EQUAL
N of Rows in Working /STATISTICS=MEAN STDDEV UNIVF BOXM COEFF CORR TABLE
850
Data File /PLOT=COMBINED
/PLOT=CASES
Missing Value Handling Definition of Missing User-defined missing values are
treated as missing in the analysis /CLASSIFY=NONMISSING POOLED MEANSUB.
phase.
Cases Used In the analysis phase, cases with no
user- or system-missing values for
any predictor variable are used.
Cases with user-, system-missing, or
out-of-range values for the
grouping variable are always
excluded.
Syntax DISCRIMINANT
/GROUPS=default(0 1)
/VARIABLES=employ address age
/ANALYSIS ALL
/PRIORS EQUAL
/STATISTICS=MEAN STDDEV UNIVF
BOXM COEFF CORR TABLE
/PLOT=COMBINED
/PLOT=CASES
/CLASSIFY=NONMISSING POOLED
MEANSUB.
Resources Processor Time 00:00:00.047
[DataSet1] E:VGSOMSTUDYSECOND SEMBRMSPSS16Samplesbankloan.sav
Elapsed Time 00:00:00.121
8. Warnings
All-Groups Stacked Histogram is no longer displayed.
Analysis Case Processing Summary
Unweighted Cases N Percent
Valid 700 82.4
Excluded Missing or out-of-range
150 17.6
group codes
At least one missing
0 .0
discriminating variable
Both missing or out-of-
range group codes and at
0 .0
least one missing
discriminating variable
Total 150 17.6
Total 850 100.0
Group Statistics
Valid N (listwise)
Previously defaulted Mean Std. Deviation Unweighted Weighted
No Years with current
9.51 6.664 517 517.000
employer
Years at current address 8.95 7.001 517 517.000
Age in years 35.51 7.708 517 517.000
Yes Years with current
5.22 5.543 183 183.000
employer
Years at current address 6.39 5.925 183 183.000
Age in years 33.01 8.518 183 183.000
Total Years with current
8.39 6.658 700 700.000
employer
Years at current address 8.28 6.825 700 700.000
Age in years 34.86 7.997 700 700.000
9. Tests of Equality of Group Means
Wilks' Lambda F df1 df2 Sig.
Years with current
.920 60.759 1 698 .000
employer
Years at current address .973 19.402 1 698 .000
Age in years .981 13.482 1 698 .000
Pooled Within-Groups Matrices
Years with
current Years at
employer current address Age in years This matrix shows correlation between the predictors. The largest
Correlation Years with current correlations occur between Credit card debt in thousands and the
1.000 .292 .524 other variables.
employer
Years at current address .292 1.000 .588
Age in years .524 .588 1.000
Analysis 1
Box's Test of Equality of Covariance Matrices
Log Determinants
Log
Previously defaulted Rank Determinant
No 3 11.012
Yes 3 10.501
Pooled within-groups 3 10.919
The ranks and natural logarithms of determinants
printed are those of the group covariance
matrices.
Test Results
Box's M 28.171
F Approx. 4.665
df1 6
df2 7.335E5
Sig. .000
10. Log Determinants
Log
Previously defaulted Rank Determinant
No 3 11.012
Yes 3 10.501
Pooled within-groups 3 10.919
Tests null hypothesis of
equal population covariance
matrices.
Summary of Canonical Discriminant Functions
Eigenvalues
Functio Canonical
n Eigenvalue % of Variance Cumulative % Correlation
1 .100a 100.0 100.0 .301
a. First 1 canonical discriminant functions were used in the analysis.
Wilks' Lambda
Test of
Functio
n(s) Wilks' Lambda Chi-square df Sig.
1 .909 66.251 3 .000
Standardized Canonical Discriminant
Function Coefficients
Function
1
Years with current
.980
employer
Years at current address .436
Age in years -.330
11. Structure Matrix
Function
1
Years with current
.934
employer
Years at current address .528
Age in years .440
Pooled within-groups correlations
between discriminating variables and
standardized canonical discriminant
functions
Variables ordered by absolute size of
correlation within function.
Functions at Group
Centroids
Previo Function
usly
default
ed 1
No .188
Yes -.530
Unstandardized
canonical
discriminant
functions evaluated
at group means
Classification Statistics
Classification Processing Summary
Processed 850
Excluded Missing or out-of-range
0
group codes
At least one missing
0
discriminating variable
Used in Output 850
12. Prior Probabilities for Groups
Previo Cases Used in Analysis
usly
default
ed Prior Unweighted Weighted
No .500 517 517.000
Yes .500 183 183.000
Total 1.000 700 700.000
Classification Function Coefficients
Previously defaulted
No Yes
Years with current
-.192 -.302
employer
Years at current address -.302 -.348
Age in years .797 .827
(Constant) -12.588 -12.444
Fisher's linear discriminant functions
Classification Resultsa
Previously Predicted Group Membership
defaulted No Yes Total The Discriminant Analysis shows that the persons in the category
Original Count No 300 217 517 who have previously defaulted are predicted likely to default this
Yes 44 139 183 time as well & those who haven’t defaulted earlier are predicted less
Ungrouped cases 81 69 150 likely to default this time.
% No 58.0 42.0 100.0 The conclusion is inferred from the total no. of defaulters being
Yes 24.0 76.0 100.0 more than non defaulters (139>44) similarly (300>217).
Ungrouped cases 54.0 46.0 100.0
a. 62.7% of original grouped cases correctly classified.
13. Q. Perform Factor Analysis on the given dataset.
The dataset chosen contains fictional statistics anxiety questionnaire. It contains response given
by students regarding their ease of use, liking and usage of SPSS in statistics.
By using the Scree Plot I have chosen 5 factors.
Since a student may give related answers depending upon the choices hence I considered the
variables to be inter-related and hence used Oblimin rotation. Say a student gave high points for
variable “I have little experience of computers” is likely to give high points for “All computers
hate me” as the variables are correlated somewhat.
14. Using the options of SPSS the following Pattern Matrix was generated.
Pattern Matrix a
Component
1 2 3 4 5
I have little experience of .903
computers
SPSS always crashes when I .732
try to use it
All computers hate me .684
I worry that I will cause .662
irreparable damage because
of my incompetenece with
computers
Computers have minds of .581
their own and deliberately go
wrong whenever I use them
People try to tell you that .446
SPSS makes statistics easier
to understand but it doesn't
Computers are out to get me .333
My friends are better at SPSS .661
than I am
My friends are better at .655
statistics than me
If I'm good at statistics my .622
friends will think I'm a nerd
My friends will think I'm stupid .504 .330
for not being able to cope
with SPSS
Everybody looks at me when .358 .358
I use SPSS
I can't sleep for thoughts of -.728
eigen vectors
I wake up under my duvet .324 -.543
thinking that I am trapped
under a normal distribtion
15. Computers are useful only for .359 .393 -.366
playing games
Standard deviations excite .301 .356 .315
me
I have never been good at -.855
mathematics
I did badly at mathematics at -.736
school
I slip into a coma whenever I -.722
see an equation
Statiscs makes me cry -.772
I don't understand statistics -.730
I weep openly at the mention -.664
of central tendency
I dream that Pearson is -.564
attacking me with correlation
coefficients
Extraction Method: Principal Component Analysis.
Rotation Method: Oblimin with Kaiser Normalization.
a. Rotation converged in 15 iterations.
The total variance explained by each factor is given below
Total Variance Explained
Rotation Sums of
Squared
Loadings a
Compo
nent Total
1 5.522
2 2.452
3 2.383
4 3.535
5 4.913
16. Extraction Method:
Principal Component
Analysis.
It is calculated by the sum of squared loadings of the factor and dividing the sum of squared loadings by
the number of variables and multiplying by 100.
Hence the factoring would be as follows depending on the loading values.
Factor Variable Nos.
1 1,2,3,4,5,6,7,14
2 8,9,10
3 13
4 17,18,19
5 20,21,22,23
Since variables 11, 12, 15 and 16 have very close loadings in different factors it is not good as this
variable is assessing both constructs.15 has exact same value in both Factor 2 and Factor 3.These are
said to have split loading.
They are hence mentioned in a separately.
Factor Variable No
2 11,16,15
3 12,15
As Split loading is present this is not a simple structure.
Factor 1: Anxiety about the usage of computers accounts for 55.22% of the total variance and loads 8 of
the variables.
Factor 2: View of students regarding their understanding of statistics and SPSS with regard to their peers
accounts for 24.52% of the total variance and loads 3 variables. It also split loads variable 11, 16 and 15.
Factor 3: Anxiety about Eigen vectors corresponds to only 23.83% of the total variance and loads only 1
variable directly while it split loads variable 12 and 15.
Factor 4: Students interest in mathematics accounts for 35.35% of the total variance and loads 3
variable.
Factor 5: Dislike for statistics accounts for 49.13% of the total variance and loads 4 variables.
17. CLUSTER ANALYSIS
Cluster analysis or clustering is the task of grouping a set of objects in such a way that objects in the same
group (called cluster) are more similar (in some sense or another) to each other than to those in other groups
(clusters). It is a main task of exploratory data mining, and a common technique for statistical data analysis
used in many fields, including machine learning, pattern recognition, image analysis, information retrieval, and
bioinformatics.
Proximities
Notes
Output Crea ted 02-Apr-2013 22:00:05
Comments
Input Da ta C:Us ers dev
ma l etiaDownloadsClusterAnonFaculty.sav
Acti ve Da taset Da ta Set3
Fi l ter <none>
Wei ght <none>
Spl it File <none>
N of Rows i n Working Data File 44
Mi s sing Value Handling Defi nition of Missing Us er-defined missing values a re treated as missing.
Ca s es Used Sta ti stics a re based on cases with no missing values
for a ny va ri able used.
Synta x PROXIMITIES Sa l ary FTE Ra nk Arti cles Experience
/MATRIX
OUT('C:Us ersDEVMAL~1AppDataLocalTempspss
6496s pssclus.tmp')
/VIEW=CASE
/MEASURE=SEUCLID
/PRINT NONE
/ID=Name
/STANDARDIZE=VARIABLE Z.
Res ources Proces sor Ti me 00:00:00.078
El a psed Time 00:00:00.082
Works pace Bytes 11152
Fi l es Saved Ma tri x Fi le C:Us ers DEVMAL~1AppDataLocalTempspss6496
s pssclus.tmp
18. The variables are which I have used in the dataset are as follows:
• Name -- Although faculty salaries are public information under North Carolina state law
• Salary – annual salary in dollars, from the university report available in One Stop.
• FTE – Full time equivalent work load for the faculty member.
• Rank – where 1 = adjunct, 2 = visiting, 3 = assistant, 4 = associate, 5 = professor
• Articles – number of published scholarly articles, excluding things like comments in newsletters,
abstracts in proceedings, and the like.
• Experience – Number of years working as a full time faculty member in a Department of Psychology.
• ArticlesAPD – number of published articles as listed in the university’s Academic Publications
• Sex –biological sex from physical appearance.
In the first step SPSS computes for each pair of cases the squared Euclidian distance between the cases. This is
quite simply, the sum across variables (from i = 1 to v) of the squared difference between the score on variable
i for the one case (Xi) and the score on variable i for the other case (Yi). The two cases which are separated by
the smallest Euclidian distance are identified and then classified together into the first cluster. At this point
there is one cluster with two cases in it.
Next SPSS re-computes the squared Euclidian distances between each entity (case or cluster) and each other
entity. When one or both of the compared entities is a cluster, SPSS computes the averaged squared Euclidian
distance between members of the one entity and members of the other entity. The two entities with the
smallest squared Euclidian distance are classified together. SPSS then re-computes the squared Euclidian
distances between each entity and each other entity and the two with the smallest squared Euclidian distance
are classified together. This continues until all of the cases have been clustered into one big cluster.
The output obtained can be seen below:
Case Processing Summary a
Ca s es
Va l i d Mi s s i ng Tota l
N Percent N Percent N Percent
44 100.0% 0 .0% 44 100.0%
a. Squa red Euclidean Distance used
19. On the first step SPSS clustered case 32 with 33. The squared Euclidian distance between these two cases is
0.000. At stages 2-4 SPSS creates three more clusters, each containing two cases. At stage 5 SPSS adds case
39 to the cluster that already contains cases 37 and 38. By the 43rd stage all cases have been clustered into
one entity.
The results can be seen below:
Average Linkage (Between Groups)
Agglomeration Schedule
Cl us ter Combi ned Sta ge Cl us ter Fi rs t Appea rs
Sta ge Cl us ter 1 Cl us ter 2 Coeffi ci ents Cl us ter 1 Cl us ter 2 Next Sta ge
1 32 33 .000 0 0 9
2 41 42 .000 0 0 6
3 43 44 .000 0 0 6
4 37 38 .000 0 0 5
5 37 39 .001 4 0 7
6 41 43 .002 2 3 27
7 36 37 .003 0 5 27
8 20 22 .007 0 0 11
9 30 32 .012 0 1 13
10 21 26 .012 0 0 14
11 20 25 .031 8 0 12
12 16 20 .055 0 11 14
13 29 30 .065 0 9 26
14 16 21 .085 12 10 20
15 11 18 .093 0 0 22
16 8 9 .143 0 0 25
17 17 24 .144 0 0 20
18 13 23 .167 0 0 22
19 14 15 .232 0 0 32
20 16 17 .239 14 17 23
21 7 12 .279 0 0 28
22 11 13 .441 15 18 29
23 16 27 .451 20 0 26
24 3 10 .572 0 0 28
25 6 8 .702 0 16 36
26 16 29 .768 23 13 35
27 36 41 .858 7 6 33
20. 28 3 7 .904 24 21 31
29 11 28 .993 22 0 30
30 5 11 1.414 0 29 34
31 3 4 1.725 28 0 36
32 14 31 1.928 19 0 34
33 36 40 2.168 27 0 40
34 5 14 2.621 30 32 35
35 5 16 2.886 34 26 37
36 3 6 3.089 31 25 38
37 5 19 4.350 35 0 39
38 1 3 4.763 0 36 41
39 5 34 5.593 37 0 42
40 35 36 8.389 0 33 43
41 1 2 8.961 38 0 42
42 1 5 11.055 41 39 43
43 1 35 17.237 42 40 0
Cluster Membership
Ca s e 5 Cl us ters 4 Cl us ters 3 Cl us ters 2 Cl us ters
1:Ros alyn 1 1 1 1
2:La wrence 2 2 1 1
3:Suni la 1 1 1 1
4:Ra ndolph 1 1 1 1
5:Mi ckey 3 3 2 1
6:Loui s 1 1 1 1
7:Tony 1 1 1 1
8:Ra ul 1 1 1 1
9:Ca ta l ina 1 1 1 1
10:Johns on 1 1 1 1
11:Beul ah 3 3 2 1
12:Ma rti na 1 1 1 1
13:Ma ri e 3 3 2 1
14:Ernes t 3 3 2 1
15:Chri s topher 3 3 2 1
16:Erni e 3 3 2 1
17:Chri s ta 3 3 2 1
21. 18:Li nette 3 3 2 1
19:Bo 3 3 2 1
20:Ca rl a 3 3 2 1
21:Al berto 3 3 2 1
22:Chri s ti na 3 3 2 1
23:Jona h 3 3 2 1
24:Tucker 3 3 2 1
25:Sha nta 3 3 2 1
26:Mel i ssa 3 3 2 1
27:Jenna 3 3 2 1
28:Johnny 3 3 2 1
29:Cl ea tus 3 3 2 1
30:Jona s 3 3 2 1
31:Ta d 3 3 2 1
32:Ama ryl l is 3 3 2 1
33:Na tha n 3 3 2 1
34:Dea nna 3 3 2 1
35:Wi l ly 4 4 3 2
36:Dea na 5 4 3 2
37:Dea 5 4 3 2
38:Cl a ude 5 4 3 2
39:Ama nda 5 4 3 2
40:Bori s 5 4 3 2
41:Ga rrett 5 4 3 2
42:Stew 5 4 3 2
43:Bree 5 4 3 2
44:Ka rma 5 4 3 2
Vertical Icicle:
In this document, it is not possible to display the full vertical icicle, but, yet, the results for the same are
described below.
For the two cluster solution you can see that one cluster consists of ten cases (Boris through Willy, followed by
a column with no X’s). These were our adjunct (part-time) faculty (excepting one) and the second cluster
consists of everybody else.
For the three cluster solution you can see the cluster of adjunct faculty and the others split into two. Deanna
through Mickey were our junior faculty and Lawrence through Rosalyn our senior faculty
For the four cluster solution you can see that one case (Lawrence) forms a cluster of his own.
22. Dendrogram
It displays essentially the same information that is found in the agglomeration schedule but in graphic form.
* * * * * * * * * * * H I E R A R C H I C A L C L U S T E R A N A L Y S I S * * * * * * * * * *
Dendrogram using Average Linkage (Between Groups)
Rescaled Distance Cluster Combine
C A S E 0 5 10 15 20 25
Label Num +---------+---------+---------+---------+---------+
Amaryllis 32 ─┐
Nathan 33 ─┤
Jonas 30 ─┼─┐
Cleatus 29 ─┘ │
Alberto 21 ─┐ │
Melissa 26 ─┤ │
Carla 20 ─┤ ├─────┐
Christina 22 ─┤ │ │
Shanta 25 ─┤ │ │
Ernie 16 ─┤ │ │
Christa 17 ─┼─┘ │
Tucker 24 ─┤ │
Jenna 27 ─┘ ├───┐
Beulah 11 ─┐ │ │
Linette 18 ─┼─┐ │ │
Marie 13 ─┤ ├─┐ │ │
Jonah 23 ─┘ │ ├─┐ │ │
Johnny 28 ───┘ │ │ │ ├───┐
Mickey 5 ─────┘ ├─┘ │ │
Ernest 14 ─┬───┐ │ │ │
Christopher 15 ─┘ ├─┘ │ ├───────────────┐
Tad 31 ─────┘ │ │ │
Bo 19 ─────────────┘ │ │
Deanna 34 ─────────────────┘ │
Raul 8 ─┬─┐ │
Catalina 9 ─┘ ├─────┐ ├───────────────┐
Louis 6 ───┘ │ │ │
Tony 7 ─┬─┐ ├───┐ │ │
Martina 12 ─┘ ├─┐ │ │ │ │
Sunila 3 ─┬─┘ ├───┘ ├───────────┐ │ │
Johnson 10 ─┘ │ │ │ │ │
Randolph 4 ─────┘ │ ├───────┘ │
Rosalyn 1 ─────────────┘ │ │
Lawrence 2 ─────────────────────────┘ │
Garrett 41 ─┐ │
Stew 42 ─┼─┐ │
Bree 43 ─┤ │ │
Karma 44 ─┘ ├───┐ │
Dea 37 ─┐ │ │ │
Claude 38 ─┤ │ ├─────────────────┐ │
Amanda 39 ─┼─┘ │ │ │
Deana 36 ─┘ │ ├───────────────────────┘
Boris 40 ───────┘ │
Willy 35 ─────────────────────────┘
23. Multiple Regression Analysis
In this Analysis we are using a data file that was created by randomly sampling 400 elementary
schools from the California Department of Education's API 2000 dataset. This data file contains a
measure of school academic performance as well as other attributes of the elementary schools, such
as, class size, enrolment, poverty, etc.,
Now, performing a regression analysis using api00 as the outcome variable and the
variables acs_k3, meals and full as predictors. These measure the academic performance of the
school (api00), the average class size in kindergarten through 3rd grade (acs_k3), the percentage of
students receiving free meals (meals) - which is an indicator of poverty, and the percentage of
teachers who have full teaching credentials (full). We expect that better academic performance would
be associated with lower class size, fewer students receiving free meals, and a higher percentage of
teachers having full teaching credentials. The output is as follows:
Regression
Notes
Output Created 02-Apr-2013 21:48:19
Comments
Input Data C:UsersDivijDesktopSPSS Dataelemapi.sav
Active Dataset DataSet5
Filter <none>
Weight <none>
Split File <none>
N of Row s in Working Data File 400
Missing Value Handling Definition of Missing User-defined missing values are treated as
missing.
Cases Used Statistics are based on cases with no missing
values for any variable used.
Syntax regression
/dependent api00
/method=enter acs_k3 meals full
.
Resources Processor Time 00:00:00.063
Elapsed Time 00:00:00.026
Memory Required 2284 bytes
Additional Memory Required for
0 bytes
Residual Plots
b
Variables Entered/Removed
24. Variables Variables
Model Entered Removed Method
1 pct full
credential, avg
. Enter
class size k-3,
a
pct free meals
a. All requested variables entered.
b. Dependent Variable: api 2000
Model Summary
Adjusted R Std. Error of the
Model R R Square Square Estimate
a
1 .821 .674 .671 64.153
a. Predictors: (Constant), pct full credential, avg class size k-3, pct
free meals
b
ANOVA
Model Sum of Squares df Mean Square F Sig.
a
1 Regression 2634884.261 3 878294.754 213.407 .000
Residual 1271713.209 309 4115.577
Total 3906597.470 312
a. Predictors: (Constant), pct full credential, avg class size k-3, pct free meals
b. Dependent Variable: api 2000
a
Coefficients
Standardized
Unstandardized Coefficients Coefficients
Model B Std. Error Beta t Sig.
1 (Constant) 906.739 28.265 32.080 .000
avg class size k-3 -2.682 1.394 -.064 -1.924 .055
pct free meals -3.702 .154 -.808 -24.038 .000
pct full credential .109 .091 .041 1.197 .232
a. Dependent Variable: api 2000
25. Let's test the three predictors on whether they are statistically significant and, if so, the direction of the
relationship. The average class size (acs_k3, b=-2.682) is not significant (p=0.055), but only just so,
and the coefficient is negative which would indicate that larger class sizes is related t o lower
academic performance, which is what we would expect. Next, the effect of meals (b=-3.702, p=.000)
is significant and its coefficient is negative indicating that the greater the proportion students receiving
free meals, the lower the academic performance. We cannot say that free meals are causing lower
academic performance. The meals variable is highly related to income level and functions more as a
proxy for poverty. Thus, higher levels of poverty are associated with lower academic performance.
Finally, the percentage of teachers with full credentials (full, b=0.109, p=.2321) seems to be unrelated
to academic performance. This would seem to indicate that the percentage of teachers with full
credentials is not an important factor in predicting academic performance which is unexpected.
From these results, we would conclude that lower class sizes are related to higher performance, that
fewer students receiving free meals is associated with higher performance, and that the percentage of
teachers with full credentials was not related to academic performance in the schools. Before we
write this up as our finding, we should do checks to make sure we can firmly stand behind these
results.
Examining Data
Step 1)
To start examining the data we have a look at the first 10 data points for the variables included in our
regression analysis. We need to lay focus on the number of missing data points in the given data.
api00 acs_k3 meals full
693 16 67 76.00
570 15 92 79.00
546 17 97 68.00
571 20 90 87.00
478 18 89 87.00
858 20 . 100.00
918 19 . 100.00
831 20 . 96.00
860 20 . 100.00
737 21 29 96.00
Number of cases read: 10 Number of cases listed: 10
We see that among the first 10 observations, we have four missing values for meals. Keeping this in
mind, we can use the descriptives command with /var=all to get descriptive statistics for all of the
variables, and pay special attention to the number of valid cases for meals.
Step 2)
Descriptive Statistics
N Minimum Maximum Mean Std. Deviation
school number 400 58 6072 2866.81 1543.811
district number 400 41 796 457.73 184.823
26. api 2000 400 369 940 647.62 142.249
api 1999 400 333 917 610.21 147.136
growth 1999 to 2000 400 -69 134 37.41 25.247
pct free meals 315 6 100 71.99 24.386
english language learners 400 0 91 31.45 24.839
year round school 400 0 1 .23 .421
pct 1st year in school 399 2 47 18.25 7.485
avg class size k-3 398 -21 25 18.55 5.005
avg class size 4-6 397 20 50 29.69 3.841
parent not hsg 400 0 100 21.25 20.676
parent hsg 400 0 100 26.02 16.333
parent some college 400 0 67 19.71 11.337
parent college grad 400 0 100 19.70 16.471
parent grad school 400 0 67 8.64 12.131
avg parent ed 381 1.00 4.62 2.6685 .76379
pct full credential 400 .42 100.00 66.0568 40.29793
pct emer credential 400 0 59 12.66 11.746
number of students 400 130 1570 483.47 226.448
Percentage free meals in
400 1 3 2.02 .819
3 categories
Valid N (listwise) 295
Examining the output for the variables we used in our regression analysis above,
namely api00, acs_k3, meals, full. For api00, we see that the values range from 369 to 940 and
there are 400 valid values. For acs_k3, the average class size ranges from -21 to 25 and there are 2
missing values. An average class size of -21 sounds wrong. The variable meals ranges from 6%
getting free meals to 100% getting free meals, so these values seem reasonable, but there are only
315 valid values for this variable. The percent of teachers being full credentialed ranges from .42 to
100, and all of the values are valid.
This has uncovered a number of peculiarities worthy of further examination. We now obtain a
corrected data set from the same source. This data set has got all the data corrected & is free from
the shortcomings diagnosed above. We run another multiple regression on the new data set.
27. New Multiple regression analysis
For this multiple regression example, we will regress the dependent variable, api00, on all of the
predictor variables in the data set.
Regression
Notes
Output Created 02-Apr-2013 22:54:47
Comments
Input Data C:UsersDivijDesktopSPSS
Dataelemapi2.sav
Active Dataset DataSet8
Filter <none>
Weight <none>
Split File <none>
N of Row s in Working Data File 400
Missing Value Handling Definition of Missing User-defined missing values are treated as
missing.
Cases Used Statistics are based on cases with no missing
values for any variable used.
Syntax regression
/dependent api00
/method=enter ell meals yr_rnd mobility
acs_k3 acs_46 full emer enroll .
Resources Processor Time 00:00:00.031
Elapsed Time 00:00:00.022
Memory Required 4724 bytes
Additional Memory Required for
0 bytes
Residual Plots
b
Variables Entered/Removed
Variables Variables
Model Entered Removed Method
28. 1 number of
students, avg
class size 4-6,
pct 1st year in
school, avg
class size k-3,
pct emer
. Enter
credential,
english language
learners, year
round school,
pct free meals,
pct full
a
credential
a. All requested variables entered.
b. Dependent Variable: api 2000
Model Summary
Adjusted R Std. Error of the
Model R R Square Square Estimate
a
1 .919 .845 .841 56.768
a. Predictors: (Constant), number of students, avg class size 4-6, pct
1st year in school, avg class size k-3, pct emer credential, english
language learners, year round school, pct free meals, pct full
credential
b
ANOVA
Model Sum of Squares df Mean Square F Sig.
a
1 Regression 6740702.006 9 748966.890 232.409 .000
Residual 1240707.781 385 3222.618
Total 7981409.787 394
a. Predictors: (Constant), number of students, avg class size 4-6, pct 1st year in school, avg
class size k-3, pct emer credential, english language learners, year round school, pct free
meals, pct full credential
b. Dependent Variable: api 2000
a
Coefficients
Standardized
Model Unstandardized Coefficients Coefficients t Sig.
29. B Std. Error Beta
1 (Constant) 758.942 62.286 12.185 .000
english language learners -.860 .211 -.150 -4.083 .000
pct free meals -2.948 .170 -.661 -17.307 .000
year round school -19.889 9.258 -.059 -2.148 .032
pct 1st year in school -1.301 .436 -.069 -2.983 .003
avg class size k-3 1.319 2.253 .013 .585 .559
avg class size 4-6 2.032 .798 .055 2.546 .011
pct full credential .610 .476 .064 1.281 .201
pct emer credential -.707 .605 -.058 -1.167 .244
number of students -.012 .017 -.019 -.724 .469
a. Dependent Variable: api 2000
1) Examining the output from this regression analysis. As with the simple regression, we look to
the p-value of the F-test to see if the overall model is significant. With a p-value of zero to
three decimal places, the model is statistically significant. The R-squared is 0.845, meaning
that approximately 85% of the variability of api00 is accounted for by the variables in the
model. In this case, the adjusted R-squared indicates that about 84% of the variability
ofapi00 is accounted for by the model, even after taking into account the number of predictor
variables in the model. The coefficients for each of the variables indicates the amount of
change one could expect in api00 given a one-unit change in the value of that variable, given
that all other variables in the model are held constant. For example, consider the
variable ell. We would expect a decrease of 0.86 in the api00 score for every one unit
increase in ell, assuming that all other variables in the model are held constant.
2) R-Square is the proportion of variance in the dependent variable (api00) which can be
predicted from the independent variables (ell, meals, yr_rnd,
mobility, acs_k3, acs_46, full, emer and enroll). This value indicates that 84% of the
variance in api00 can be predicted from the
variables ell, meals,yr_rnd, mobility, acs_k3, acs_46, full, emer and enroll.
3) The beta coefficients are used by some researchers to compare the relative strength of the
various predictors within the model. Because the beta coefficients are all measured in
standard deviations, instead of the units of the variables, they can be compared to one
another. In other words, the beta coefficients are the coefficients that you would obtain if the
outcome and predictor variables were all transformed to standard scores, also cal led z-
scores, before running the regression. In this example, meals has the largest Beta coefficient,
-0.661, and acs_k3 has the smallest Beta, 0.013. Thus, a one standard deviation increase
in meals leads to a 0.661 standard deviation decrease in predicted api00, with the other
variables held constant. And, a one standard deviation increase in acs_k3, in turn, leads to a
0.013 standard deviation increase api00 with the other variables in the model held constant.
4) The adjusted R-square attempts to yield a more honest value to estimate the R-squared for
the population. The value of R-square was .8446, while the value of Adjusted R-square was
30. .8409. The adjusted R-square attempts to yield a more honest value to estimate the R-
squared for the population.
5) The F Value is the Mean Square Regression (748966.89) divided by the Mean Square
Residual (3222.61761), yielding F=232.41. The p value associated with this F value is very
small (0.0000). These values are used to answer the question "Do the independent variables
reliably predict the dependent variable?". The p value is compared to your alpha level
(typically 0.05) and, if smaller, you can conclude "Yes, the independent variables reliably
predict the dependent variable".
6) These are the degrees of freedom associated with the sources of variance. The Total
variance has N-1 degrees of freedom (DF). In this case, there were N=395 observations, so
the DF for total is 394.