SlideShare a Scribd company logo
1 of 5
Download to read offline
Published on : Feb 7, 2012


                                                Hexaware E-book
                                                on Predictive Analytics
                                                Business Intelligence & Analytics
www.hexaware.com                                Actionable Intelligence Enabled
© Hexaware Technologies. All rights reserved.
Hexaware E-book on                                                                  Can you name some specific uses of data mining?
                                                                                    Some Specific uses of data mining include:
Predictive Analytics                                                                Market segmentation - Identify the common characteristics of customers
                                                                                    who buy the same products from your company.
What is Data mining?
                                                                                    Customer churn - Predict which customers are likely to leave your
Data mining, or knowledge discovery, is the computer-assisted process of
                                                                                    company and go to a competitor.
finding hidden patterns in data. Data mining tools predict behaviors and
future trends, allowing businesses to make proactive, knowledge-driven              Fraud detection - Identify which transactions are most likely to be
decisions. Hence it is also called predictive analytics.                            fraudulent.
Data mining tools can answer business questions that traditionally were             Direct marketing - Identify which prospects should be included in a
time consuming to resolve. They scour databases for hidden patterns,                mailing list to obtain the highest response rate.
finding predictive information that experts may miss because it lies outside        Interactive marketing - Predict what each individual accessing a Web site
their expectations.                                                                 is most likely interested in seeing.
When and where the data mining and predictive analytics could be                    Market basket analysis - Understand what products or services are
useful?                                                                             commonly purchased together; e.g., beer and diapers.
The amount of raw data stored in corporate databases is exploding. From             Trend analysis - Reveal the difference between typical customers this
trillions of point-of-sale transactions and credit card purchases to                month and last.
pixel-by-pixel images of galaxies, databases are now measured in
gigabytes and terabytes. Raw data by itself, however, does not provide              Will sharing data to do data mining raise privacy related issues?
much information.                                                                   What needs to be done in such scenarios?
In today's fiercely competitive business environment, companies need to             There is a way to deal with sensitive data like credit card numbers,
rapidly turn these terabytes of raw data into significant insights into their
                                                                                    insurance policy numbers and account numbers. Data need to be masked
customers and markets to guide their marketing, investment, and
                                                                                    or recoded to maintain the privacy.
management strategies.
In these scenarios data mining would help you unlock the hidden potential
of your data and deliver actionable insights.

www.hexaware.com
© Hexaware Technologies. All rights reserved.
                                                                                1
What are all steps involved in data mining?                                     Name some tools used for Data mining?
Data mining process involves predefined steps starting from                     a. Microsoft BI stack has SSAS as part of SQL Server 2008
•    Business case understanding or Problem understanding,                      b. There is an open source tool called R which offers data mining
•    Data understanding                                                            solution

•    Data extraction                                                            c. Rapid Miner is an another open source tool

•    Pre-processing                                                             d. SAS is of course is a highly sophisticated tool with enormous
                                                                                   computational power
•    Mining model building
                                                                                e. IBM has it’s tool called SPSS
•    Testing and Evaluation
                                                                                f.   Oracle’s ODM – Oracle Data Miner

Is there any maintenance to be done for the mining model once it is             The above list is not exhaustive.
deployed?
                                                                                What are all the industries in which Predictive Analytics is
Yes, the mining model needs to be calibrated at least once in six months.       applicable?
The frequency varies based on the business need and the volume of data
                                                                                We have recently provided predictive analytics solutions for following
flow
                                                                                Industries:
Calibration involves
                                                                                •    Insurance
•    Checking the predicted output with the actual output
                                                                                •    Education
•    Modifying the mining model if required                   Public
                                                                                •    Mining
Can data mining solution be offered on the cloud?                               •    Logistics
Yes, Data mining solution can be offered on the cloud.                          •    Health & Hygiene
Organizations can adopt “Pay per use” method without investing on the           In short, predictive analytics can be deployed for diverse industries.
infrastructure required. Here the challenge is to upload huge amount of
data on the cloud.


www.hexaware.com
© Hexaware Technologies. All rights reserved.
                                                                            2
What are all the risks involved in using predictive analytics solution?               Can you list some of the best practices in Data mining and
•    Wrong understanding of business problems and data will result in a               Predictive analytics?
     prediction model with complex statistical algorithms, but it will be of no       •   Executive Support: Support from the decision makers and middle
     use to the business.                                                                 management would make a world of difference.
•    Wrong interpretation of the results would lead to wrong decisions.               •   Business problem specificity: Identification of correct business problem
•    Poor data quality would result in poor predictions.                                  to apply predictive analytics is vital to the success of the mining model.
•    Absence of maintenance of the mining model would make predictions                •   Availability of historical data: Richer the data, the more robust will be
     obsolete.                                                                            the mining model.
•    Building the predictive analytics solution with resources with less              •   Good quality data: It is the most important factor for an accurate mining
     statistical knowledge will lead to less accurate models.                             model.
                                                                                      •   Pre-processing: While building the mining model, one of the main
What is Text mining?
                                                                                          activities is pre-processing where the data is cleansed, sliced, diced
Text Mining is the process of deriving high quality information from
                                                                                          and categorized to suit to the mining model. Good business
unstructured text data. There are various techniques used to derive high
                                                                                          knowledge and a sound data mining knowledge is required to do this
quality information from textual data, such as computational linguistics,
                                                                                          as this is the base for the predictive model.
information retrieval, statistics, machine learning, etc.
                                                                                      •   Selection of statistical techniques: Experienced data mining resource
Various forms of text mining include categorization, classification,
clustering, concept extraction, summarization, sentiment analysis, etc.                   can choose the correct statistical technique and can compare the
                                                                                          accuracy of other techniques.
Are the open source tools sufficient and robust in providing                          •   Interpretation of output: It is extremely important to interpret the output
answers to tough business cases?
                                                                                          in the correct way and link it back to the business problem stated
Open source tools like R and Rapid miner provide excellent flexibility to                 initially.
build the model. Online R community constantly updates algorithms and
industry specific solutions as packages to R after validating. So far there
are around 3500 packages built in R.
R’s popularity has been increasing over the other predictive analytics tools.
In a recent survey Kdnuggets.com reports that R has 24% of market share
and R is the most sought after statistical programming language.
www.hexaware.com
© Hexaware Technologies. All rights reserved.
                                                                                  3
Thank you for reading our E- Book, in case you have any queries please write back to us at corporatemarketing@hexaware.com

  If you want to keep up with the industry's latest trends, please visit our blog on BI http://blogs.hexaware.com/index/business-intelligence

For more information on our Business Intelligence & Analytics services please visit http://hexaware.com/business-intelligence-analytics.htm




                                                                                    About Hexaware
                                                                                    Hexaware is a leading global provider of IT and BPO services. The
                                                                                    company has achieved leadership position in domains such as
                                                                                    Banking, Financial Services, Insurance, Transportation, Logistics and
                                                                                    HR-IT solutions. Hexaware focuses on delivering business result
                                                                                    leveraging technology solution and specializes in Business
                                                                                    Intelligence & Analytics, Enterprise Applications, Independent Testing
                                                                                    and Legacy Modernization. Hexaware has been providing business
                                                                                    technology solutions for over 20 years and offers world class services
                                                                                    delivery, technology leadership and skilled human capital.
          www.hexaware.com
          © Hexaware Technologies. All rights reserved.




                                                                        4

More Related Content

More from Hazelknight Media & Entertainment Pvt Ltd

“A Practitioner’s View” on the latest trends and information on BI/ DW techno...
“A Practitioner’s View” on the latest trends and information on BI/ DW techno...“A Practitioner’s View” on the latest trends and information on BI/ DW techno...
“A Practitioner’s View” on the latest trends and information on BI/ DW techno...Hazelknight Media & Entertainment Pvt Ltd
 
PeopleSoft HRMS eProfile eBenefits_How a global auto parts retailer got more ...
PeopleSoft HRMS eProfile eBenefits_How a global auto parts retailer got more ...PeopleSoft HRMS eProfile eBenefits_How a global auto parts retailer got more ...
PeopleSoft HRMS eProfile eBenefits_How a global auto parts retailer got more ...Hazelknight Media & Entertainment Pvt Ltd
 

More from Hazelknight Media & Entertainment Pvt Ltd (20)

Insurance Business Analytics Whitepaper
Insurance Business Analytics WhitepaperInsurance Business Analytics Whitepaper
Insurance Business Analytics Whitepaper
 
Predictive analytics - The cure for business myopia
Predictive analytics - The cure for business myopiaPredictive analytics - The cure for business myopia
Predictive analytics - The cure for business myopia
 
Ask for maps – location analytics!
Ask for maps – location analytics!Ask for maps – location analytics!
Ask for maps – location analytics!
 
Top North American Ivy League Leverages Data Masking
Top North American Ivy League Leverages Data MaskingTop North American Ivy League Leverages Data Masking
Top North American Ivy League Leverages Data Masking
 
Top North American Ivy League Leverages Data Masking
Top North American Ivy League Leverages Data MaskingTop North American Ivy League Leverages Data Masking
Top North American Ivy League Leverages Data Masking
 
PeopleSoft HCM Leveraged for Global HR & Payroll Operations
PeopleSoft HCM Leveraged for Global HR & Payroll OperationsPeopleSoft HCM Leveraged for Global HR & Payroll Operations
PeopleSoft HCM Leveraged for Global HR & Payroll Operations
 
PeopleSoft 9.1 Upgrade for a Leading Beer Distributor & Retailer
PeopleSoft 9.1 Upgrade for a Leading Beer Distributor & RetailerPeopleSoft 9.1 Upgrade for a Leading Beer Distributor & Retailer
PeopleSoft 9.1 Upgrade for a Leading Beer Distributor & Retailer
 
PeopleSoft 9.1 Upgrade for a Leading Education Services Company
PeopleSoft 9.1 Upgrade for a Leading Education Services CompanyPeopleSoft 9.1 Upgrade for a Leading Education Services Company
PeopleSoft 9.1 Upgrade for a Leading Education Services Company
 
PeopleSoft 9.1 HRMS Upgrade
PeopleSoft 9.1 HRMS UpgradePeopleSoft 9.1 HRMS Upgrade
PeopleSoft 9.1 HRMS Upgrade
 
PeopleSoft 9.1 Upgrade
PeopleSoft 9.1 UpgradePeopleSoft 9.1 Upgrade
PeopleSoft 9.1 Upgrade
 
PeopleSoft FSCM & HCM 9.1 Upgrade
PeopleSoft FSCM & HCM 9.1 UpgradePeopleSoft FSCM & HCM 9.1 Upgrade
PeopleSoft FSCM & HCM 9.1 Upgrade
 
PeopleSoft HCM 9.1
PeopleSoft HCM 9.1PeopleSoft HCM 9.1
PeopleSoft HCM 9.1
 
PeopleSoft HRMS Upgrade
PeopleSoft HRMS UpgradePeopleSoft HRMS Upgrade
PeopleSoft HRMS Upgrade
 
PeopleSoft Asset Management Implementation
PeopleSoft Asset Management ImplementationPeopleSoft Asset Management Implementation
PeopleSoft Asset Management Implementation
 
BIG DATA – Beyond the Hype
BIG DATA – Beyond the HypeBIG DATA – Beyond the Hype
BIG DATA – Beyond the Hype
 
“A Practitioner’s View” on the latest trends and information on BI/ DW techno...
“A Practitioner’s View” on the latest trends and information on BI/ DW techno...“A Practitioner’s View” on the latest trends and information on BI/ DW techno...
“A Practitioner’s View” on the latest trends and information on BI/ DW techno...
 
Business Analytics for the Airline MRO Industry: An Analytics Master class
Business Analytics for the Airline MRO Industry: An Analytics Master classBusiness Analytics for the Airline MRO Industry: An Analytics Master class
Business Analytics for the Airline MRO Industry: An Analytics Master class
 
Customization and integration of e ticketing solution
Customization and integration of e ticketing solutionCustomization and integration of e ticketing solution
Customization and integration of e ticketing solution
 
PeopleSoft HRMS eProfile eBenefits_How a global auto parts retailer got more ...
PeopleSoft HRMS eProfile eBenefits_How a global auto parts retailer got more ...PeopleSoft HRMS eProfile eBenefits_How a global auto parts retailer got more ...
PeopleSoft HRMS eProfile eBenefits_How a global auto parts retailer got more ...
 
Leading na airline reduces costs through mro applications
Leading na airline reduces costs through mro applicationsLeading na airline reduces costs through mro applications
Leading na airline reduces costs through mro applications
 

Recently uploaded

Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 

Recently uploaded (20)

Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 

E-book on Predictive Analytics

  • 1. Published on : Feb 7, 2012 Hexaware E-book on Predictive Analytics Business Intelligence & Analytics www.hexaware.com Actionable Intelligence Enabled © Hexaware Technologies. All rights reserved.
  • 2. Hexaware E-book on Can you name some specific uses of data mining? Some Specific uses of data mining include: Predictive Analytics Market segmentation - Identify the common characteristics of customers who buy the same products from your company. What is Data mining? Customer churn - Predict which customers are likely to leave your Data mining, or knowledge discovery, is the computer-assisted process of company and go to a competitor. finding hidden patterns in data. Data mining tools predict behaviors and future trends, allowing businesses to make proactive, knowledge-driven Fraud detection - Identify which transactions are most likely to be decisions. Hence it is also called predictive analytics. fraudulent. Data mining tools can answer business questions that traditionally were Direct marketing - Identify which prospects should be included in a time consuming to resolve. They scour databases for hidden patterns, mailing list to obtain the highest response rate. finding predictive information that experts may miss because it lies outside Interactive marketing - Predict what each individual accessing a Web site their expectations. is most likely interested in seeing. When and where the data mining and predictive analytics could be Market basket analysis - Understand what products or services are useful? commonly purchased together; e.g., beer and diapers. The amount of raw data stored in corporate databases is exploding. From Trend analysis - Reveal the difference between typical customers this trillions of point-of-sale transactions and credit card purchases to month and last. pixel-by-pixel images of galaxies, databases are now measured in gigabytes and terabytes. Raw data by itself, however, does not provide Will sharing data to do data mining raise privacy related issues? much information. What needs to be done in such scenarios? In today's fiercely competitive business environment, companies need to There is a way to deal with sensitive data like credit card numbers, rapidly turn these terabytes of raw data into significant insights into their insurance policy numbers and account numbers. Data need to be masked customers and markets to guide their marketing, investment, and or recoded to maintain the privacy. management strategies. In these scenarios data mining would help you unlock the hidden potential of your data and deliver actionable insights. www.hexaware.com © Hexaware Technologies. All rights reserved. 1
  • 3. What are all steps involved in data mining? Name some tools used for Data mining? Data mining process involves predefined steps starting from a. Microsoft BI stack has SSAS as part of SQL Server 2008 • Business case understanding or Problem understanding, b. There is an open source tool called R which offers data mining • Data understanding solution • Data extraction c. Rapid Miner is an another open source tool • Pre-processing d. SAS is of course is a highly sophisticated tool with enormous computational power • Mining model building e. IBM has it’s tool called SPSS • Testing and Evaluation f. Oracle’s ODM – Oracle Data Miner Is there any maintenance to be done for the mining model once it is The above list is not exhaustive. deployed? What are all the industries in which Predictive Analytics is Yes, the mining model needs to be calibrated at least once in six months. applicable? The frequency varies based on the business need and the volume of data We have recently provided predictive analytics solutions for following flow Industries: Calibration involves • Insurance • Checking the predicted output with the actual output • Education • Modifying the mining model if required Public • Mining Can data mining solution be offered on the cloud? • Logistics Yes, Data mining solution can be offered on the cloud. • Health & Hygiene Organizations can adopt “Pay per use” method without investing on the In short, predictive analytics can be deployed for diverse industries. infrastructure required. Here the challenge is to upload huge amount of data on the cloud. www.hexaware.com © Hexaware Technologies. All rights reserved. 2
  • 4. What are all the risks involved in using predictive analytics solution? Can you list some of the best practices in Data mining and • Wrong understanding of business problems and data will result in a Predictive analytics? prediction model with complex statistical algorithms, but it will be of no • Executive Support: Support from the decision makers and middle use to the business. management would make a world of difference. • Wrong interpretation of the results would lead to wrong decisions. • Business problem specificity: Identification of correct business problem • Poor data quality would result in poor predictions. to apply predictive analytics is vital to the success of the mining model. • Absence of maintenance of the mining model would make predictions • Availability of historical data: Richer the data, the more robust will be obsolete. the mining model. • Building the predictive analytics solution with resources with less • Good quality data: It is the most important factor for an accurate mining statistical knowledge will lead to less accurate models. model. • Pre-processing: While building the mining model, one of the main What is Text mining? activities is pre-processing where the data is cleansed, sliced, diced Text Mining is the process of deriving high quality information from and categorized to suit to the mining model. Good business unstructured text data. There are various techniques used to derive high knowledge and a sound data mining knowledge is required to do this quality information from textual data, such as computational linguistics, as this is the base for the predictive model. information retrieval, statistics, machine learning, etc. • Selection of statistical techniques: Experienced data mining resource Various forms of text mining include categorization, classification, clustering, concept extraction, summarization, sentiment analysis, etc. can choose the correct statistical technique and can compare the accuracy of other techniques. Are the open source tools sufficient and robust in providing • Interpretation of output: It is extremely important to interpret the output answers to tough business cases? in the correct way and link it back to the business problem stated Open source tools like R and Rapid miner provide excellent flexibility to initially. build the model. Online R community constantly updates algorithms and industry specific solutions as packages to R after validating. So far there are around 3500 packages built in R. R’s popularity has been increasing over the other predictive analytics tools. In a recent survey Kdnuggets.com reports that R has 24% of market share and R is the most sought after statistical programming language. www.hexaware.com © Hexaware Technologies. All rights reserved. 3
  • 5. Thank you for reading our E- Book, in case you have any queries please write back to us at corporatemarketing@hexaware.com If you want to keep up with the industry's latest trends, please visit our blog on BI http://blogs.hexaware.com/index/business-intelligence For more information on our Business Intelligence & Analytics services please visit http://hexaware.com/business-intelligence-analytics.htm About Hexaware Hexaware is a leading global provider of IT and BPO services. The company has achieved leadership position in domains such as Banking, Financial Services, Insurance, Transportation, Logistics and HR-IT solutions. Hexaware focuses on delivering business result leveraging technology solution and specializes in Business Intelligence & Analytics, Enterprise Applications, Independent Testing and Legacy Modernization. Hexaware has been providing business technology solutions for over 20 years and offers world class services delivery, technology leadership and skilled human capital. www.hexaware.com © Hexaware Technologies. All rights reserved. 4