This document summarizes research conducted at West Virginia University and the Jet Propulsion Laboratory under NASA grants. It discusses an approach called STAR (Simulated Annealing with collars and Rules) for handling uncertainty in software project models. STAR uses simulated annealing to stochastically sample different input settings, scores the outputs, and favors higher scoring settings to generate rules. The document compares STAR to other modeling methods and finds that even without tuning, STAR produces surprisingly accurate results, often within 25% of methods using historical data to learn input relationships. This performance is attributed to key inputs being tightly constrained by a few influential "collar" variables.
Defect, defect, defect: PROMISE 2012 Keynote Sung Kim
Software prediction leveraging repositories has received a tremendous amount of attention within the software engineering community, including PROMISE. In this talk, I will first present great achievements in defect prediction research including new defect prediction features, promising algorithms, and interesting analysis results. However, there are still many challenges in defect prediction. I will talk about them and discuss potential solutions for them leveraging prediction 2.0.
A wiki is free, functional and fabulous. This presentation will reveal how a wiki-centric, Web 2.0 classroom provides a constructivist tool for collaboration, communication, publication, presentation and assessment. This session includes the nuts and bolts of setting up and managing a wiki, ideas for classroom use and best practice use of wikis internationally.
Defect, defect, defect: PROMISE 2012 Keynote Sung Kim
Software prediction leveraging repositories has received a tremendous amount of attention within the software engineering community, including PROMISE. In this talk, I will first present great achievements in defect prediction research including new defect prediction features, promising algorithms, and interesting analysis results. However, there are still many challenges in defect prediction. I will talk about them and discuss potential solutions for them leveraging prediction 2.0.
A wiki is free, functional and fabulous. This presentation will reveal how a wiki-centric, Web 2.0 classroom provides a constructivist tool for collaboration, communication, publication, presentation and assessment. This session includes the nuts and bolts of setting up and managing a wiki, ideas for classroom use and best practice use of wikis internationally.
Fishing or a Z?: Investigating the Effects of Error on Mimetic and Alphabet D...Abdallah El Ali
Slides for the talk I gave at ICMI 2012, held in Santa Monica, CA, USA.
The full paper reference is:
El Ali, A., Kildal, J. & Lantz, V. (2012). Fishing or a Z?: Investigating the Effects of Error on Mimetic and Alphabet Device-based Gesture Interaction. In Proceedings of the 14th international conference on Multimodal Interaction (ICMI '12), 2012, Santa Monica, California.
Science has escaped the lab and is roaming free in the world. People use software to understand the world . What tools are needed to support that work?
GALE: Geometric active learning for Search-Based Software EngineeringCS, NcState
Multi-objective evolutionary algorithms (MOEAs) help software engineers find novel solutions to complex problems. When automatic tools explore too many options, they are slow to use and hard to comprehend. GALE is a near-linear time MOEA that builds a piecewise approximation to the surface of best solutions along the Pareto frontier. For each piece, GALE mutates solutions towards the better end. In numerous case studies, GALE finds comparable solutions to standard methods (NSGA-II, SPEA2) using far fewer evaluations (e.g. 20 evaluations, not 1,000). GALE is recommended when a model is expensive to evaluate, or when some audience needs to browse and understand how an MOEA has made its conclusions.
Three Laws of Trusted Data Sharing:(Building a Better Business Case for Dat...CS, NcState
Discussions about sharing
- Too much fear
- Not enough about benefits
Can we learn more from sharing that hoarding ?
- Yes (results from SE)
Three laws of trusted data sharing:
- For SE quality prediction..
- Better models from shared privatized data that from all raw data
Q: does this work for other kinds of data?
A: don’t know… yet
172529main ken and_tim_software_assurance_research_at_west_virginiaCS, NcState
SA @ WV(software assurance research at West Virginia)
Kenneth McGill
NASA IV&V Facility Research Lead
304.367.8300
Kenneth.McGill@ivv.nasa.gov
Dr. Tim Menzies Ph.D. (WVU)
Software Engineering Research Chair
tim@menzies.us
Fishing or a Z?: Investigating the Effects of Error on Mimetic and Alphabet D...Abdallah El Ali
Slides for the talk I gave at ICMI 2012, held in Santa Monica, CA, USA.
The full paper reference is:
El Ali, A., Kildal, J. & Lantz, V. (2012). Fishing or a Z?: Investigating the Effects of Error on Mimetic and Alphabet Device-based Gesture Interaction. In Proceedings of the 14th international conference on Multimodal Interaction (ICMI '12), 2012, Santa Monica, California.
Similar to If you fix everything you lose fixes for everything else (8)
Science has escaped the lab and is roaming free in the world. People use software to understand the world . What tools are needed to support that work?
GALE: Geometric active learning for Search-Based Software EngineeringCS, NcState
Multi-objective evolutionary algorithms (MOEAs) help software engineers find novel solutions to complex problems. When automatic tools explore too many options, they are slow to use and hard to comprehend. GALE is a near-linear time MOEA that builds a piecewise approximation to the surface of best solutions along the Pareto frontier. For each piece, GALE mutates solutions towards the better end. In numerous case studies, GALE finds comparable solutions to standard methods (NSGA-II, SPEA2) using far fewer evaluations (e.g. 20 evaluations, not 1,000). GALE is recommended when a model is expensive to evaluate, or when some audience needs to browse and understand how an MOEA has made its conclusions.
Three Laws of Trusted Data Sharing:(Building a Better Business Case for Dat...CS, NcState
Discussions about sharing
- Too much fear
- Not enough about benefits
Can we learn more from sharing that hoarding ?
- Yes (results from SE)
Three laws of trusted data sharing:
- For SE quality prediction..
- Better models from shared privatized data that from all raw data
Q: does this work for other kinds of data?
A: don’t know… yet
172529main ken and_tim_software_assurance_research_at_west_virginiaCS, NcState
SA @ WV(software assurance research at West Virginia)
Kenneth McGill
NASA IV&V Facility Research Lead
304.367.8300
Kenneth.McGill@ivv.nasa.gov
Dr. Tim Menzies Ph.D. (WVU)
Software Engineering Research Chair
tim@menzies.us
Next Generation “Treatment Learning” (finding the diamonds in the dust)CS, NcState
Q: How have dummies (like me) managed to gain (some) control over a (seemingly) complex world?
A:The world is simpler than we think.
◆ Models contain clumps
◆ A few collar variables decide which clumps to use.
"𝑩𝑬𝑮𝑼𝑵 𝑾𝑰𝑻𝑯 𝑻𝑱 𝑰𝑺 𝑯𝑨𝑳𝑭 𝑫𝑶𝑵𝑬"
𝐓𝐉 𝐂𝐨𝐦𝐬 (𝐓𝐉 𝐂𝐨𝐦𝐦𝐮𝐧𝐢𝐜𝐚𝐭𝐢𝐨𝐧𝐬) is a professional event agency that includes experts in the event-organizing market in Vietnam, Korea, and ASEAN countries. We provide unlimited types of events from Music concerts, Fan meetings, and Culture festivals to Corporate events, Internal company events, Golf tournaments, MICE events, and Exhibitions.
𝐓𝐉 𝐂𝐨𝐦𝐬 provides unlimited package services including such as Event organizing, Event planning, Event production, Manpower, PR marketing, Design 2D/3D, VIP protocols, Interpreter agency, etc.
Sports events - Golf competitions/billiards competitions/company sports events: dynamic and challenging
⭐ 𝐅𝐞𝐚𝐭𝐮𝐫𝐞𝐝 𝐩𝐫𝐨𝐣𝐞𝐜𝐭𝐬:
➢ 2024 BAEKHYUN [Lonsdaleite] IN HO CHI MINH
➢ SUPER JUNIOR-L.S.S. THE SHOW : Th3ee Guys in HO CHI MINH
➢FreenBecky 1st Fan Meeting in Vietnam
➢CHILDREN ART EXHIBITION 2024: BEYOND BARRIERS
➢ WOW K-Music Festival 2023
➢ Winner [CROSS] Tour in HCM
➢ Super Show 9 in HCM with Super Junior
➢ HCMC - Gyeongsangbuk-do Culture and Tourism Festival
➢ Korean Vietnam Partnership - Fair with LG
➢ Korean President visits Samsung Electronics R&D Center
➢ Vietnam Food Expo with Lotte Wellfood
"𝐄𝐯𝐞𝐫𝐲 𝐞𝐯𝐞𝐧𝐭 𝐢𝐬 𝐚 𝐬𝐭𝐨𝐫𝐲, 𝐚 𝐬𝐩𝐞𝐜𝐢𝐚𝐥 𝐣𝐨𝐮𝐫𝐧𝐞𝐲. 𝐖𝐞 𝐚𝐥𝐰𝐚𝐲𝐬 𝐛𝐞𝐥𝐢𝐞𝐯𝐞 𝐭𝐡𝐚𝐭 𝐬𝐡𝐨𝐫𝐭𝐥𝐲 𝐲𝐨𝐮 𝐰𝐢𝐥𝐥 𝐛𝐞 𝐚 𝐩𝐚𝐫𝐭 𝐨𝐟 𝐨𝐮𝐫 𝐬𝐭𝐨𝐫𝐢𝐞𝐬."
Affordable Stationery Printing Services in Jaipur | Navpack n PrintNavpack & Print
Looking for professional printing services in Jaipur? Navpack n Print offers high-quality and affordable stationery printing for all your business needs. Stand out with custom stationery designs and fast turnaround times. Contact us today for a quote!
Recruiting in the Digital Age: A Social Media MasterclassLuanWise
In this masterclass, presented at the Global HR Summit on 5th June 2024, Luan Wise explored the essential features of social media platforms that support talent acquisition, including LinkedIn, Facebook, Instagram, X (formerly Twitter) and TikTok.
Personal Brand Statement:
As an Army veteran dedicated to lifelong learning, I bring a disciplined, strategic mindset to my pursuits. I am constantly expanding my knowledge to innovate and lead effectively. My journey is driven by a commitment to excellence, and to make a meaningful impact in the world.
Enterprise Excellence is Inclusive Excellence.pdfKaiNexus
Enterprise excellence and inclusive excellence are closely linked, and real-world challenges have shown that both are essential to the success of any organization. To achieve enterprise excellence, organizations must focus on improving their operations and processes while creating an inclusive environment that engages everyone. In this interactive session, the facilitator will highlight commonly established business practices and how they limit our ability to engage everyone every day. More importantly, though, participants will likely gain increased awareness of what we can do differently to maximize enterprise excellence through deliberate inclusion.
What is Enterprise Excellence?
Enterprise Excellence is a holistic approach that's aimed at achieving world-class performance across all aspects of the organization.
What might I learn?
A way to engage all in creating Inclusive Excellence. Lessons from the US military and their parallels to the story of Harry Potter. How belt systems and CI teams can destroy inclusive practices. How leadership language invites people to the party. There are three things leaders can do to engage everyone every day: maximizing psychological safety to create environments where folks learn, contribute, and challenge the status quo.
Who might benefit? Anyone and everyone leading folks from the shop floor to top floor.
Dr. William Harvey is a seasoned Operations Leader with extensive experience in chemical processing, manufacturing, and operations management. At Michelman, he currently oversees multiple sites, leading teams in strategic planning and coaching/practicing continuous improvement. William is set to start his eighth year of teaching at the University of Cincinnati where he teaches marketing, finance, and management. William holds various certifications in change management, quality, leadership, operational excellence, team building, and DiSC, among others.
Cracking the Workplace Discipline Code Main.pptxWorkforce Group
Cultivating and maintaining discipline within teams is a critical differentiator for successful organisations.
Forward-thinking leaders and business managers understand the impact that discipline has on organisational success. A disciplined workforce operates with clarity, focus, and a shared understanding of expectations, ultimately driving better results, optimising productivity, and facilitating seamless collaboration.
Although discipline is not a one-size-fits-all approach, it can help create a work environment that encourages personal growth and accountability rather than solely relying on punitive measures.
In this deck, you will learn the significance of workplace discipline for organisational success. You’ll also learn
• Four (4) workplace discipline methods you should consider
• The best and most practical approach to implementing workplace discipline.
• Three (3) key tips to maintain a disciplined workplace.
B2B payments are rapidly changing. Find out the 5 key questions you need to be asking yourself to be sure you are mastering B2B payments today. Learn more at www.BlueSnap.com.
LA HUG - Video Testimonials with Chynna Morgan - June 2024Lital Barkan
Have you ever heard that user-generated content or video testimonials can take your brand to the next level? We will explore how you can effectively use video testimonials to leverage and boost your sales, content strategy, and increase your CRM data.🤯
We will dig deeper into:
1. How to capture video testimonials that convert from your audience 🎥
2. How to leverage your testimonials to boost your sales 💲
3. How you can capture more CRM data to understand your audience better through video testimonials. 📊
Building Your Employer Brand with Social MediaLuanWise
Presented at The Global HR Summit, 6th June 2024
In this keynote, Luan Wise will provide invaluable insights to elevate your employer brand on social media platforms including LinkedIn, Facebook, Instagram, X (formerly Twitter) and TikTok. You'll learn how compelling content can authentically showcase your company culture, values, and employee experiences to support your talent acquisition and retention objectives. Additionally, you'll understand the power of employee advocacy to amplify reach and engagement – helping to position your organization as an employer of choice in today's competitive talent landscape.
Business Valuation Principles for EntrepreneursBen Wann
This insightful presentation is designed to equip entrepreneurs with the essential knowledge and tools needed to accurately value their businesses. Understanding business valuation is crucial for making informed decisions, whether you're seeking investment, planning to sell, or simply want to gauge your company's worth.
The world of search engine optimization (SEO) is buzzing with discussions after Google confirmed that around 2,500 leaked internal documents related to its Search feature are indeed authentic. The revelation has sparked significant concerns within the SEO community. The leaked documents were initially reported by SEO experts Rand Fishkin and Mike King, igniting widespread analysis and discourse. For More Info:- https://news.arihantwebtech.com/search-disrupted-googles-leaked-documents-rock-the-seo-world/
Putting the SPARK into Virtual Training.pptxCynthia Clay
This 60-minute webinar, sponsored by Adobe, was delivered for the Training Mag Network. It explored the five elements of SPARK: Storytelling, Purpose, Action, Relationships, and Kudos. Knowing how to tell a well-structured story is key to building long-term memory. Stating a clear purpose that doesn't take away from the discovery learning process is critical. Ensuring that people move from theory to practical application is imperative. Creating strong social learning is the key to commitment and engagement. Validating and affirming participants' comments is the way to create a positive learning environment.
Premium MEAN Stack Development Solutions for Modern BusinessesSynapseIndia
Stay ahead of the curve with our premium MEAN Stack Development Solutions. Our expert developers utilize MongoDB, Express.js, AngularJS, and Node.js to create modern and responsive web applications. Trust us for cutting-edge solutions that drive your business growth and success.
Know more: https://www.synapseindia.com/technology/mean-stack-development-company.html
[Note: This is a partial preview. To download this presentation, visit:
https://www.oeconsulting.com.sg/training-presentations]
Sustainability has become an increasingly critical topic as the world recognizes the need to protect our planet and its resources for future generations. Sustainability means meeting our current needs without compromising the ability of future generations to meet theirs. It involves long-term planning and consideration of the consequences of our actions. The goal is to create strategies that ensure the long-term viability of People, Planet, and Profit.
Leading companies such as Nike, Toyota, and Siemens are prioritizing sustainable innovation in their business models, setting an example for others to follow. In this Sustainability training presentation, you will learn key concepts, principles, and practices of sustainability applicable across industries. This training aims to create awareness and educate employees, senior executives, consultants, and other key stakeholders, including investors, policymakers, and supply chain partners, on the importance and implementation of sustainability.
LEARNING OBJECTIVES
1. Develop a comprehensive understanding of the fundamental principles and concepts that form the foundation of sustainability within corporate environments.
2. Explore the sustainability implementation model, focusing on effective measures and reporting strategies to track and communicate sustainability efforts.
3. Identify and define best practices and critical success factors essential for achieving sustainability goals within organizations.
CONTENTS
1. Introduction and Key Concepts of Sustainability
2. Principles and Practices of Sustainability
3. Measures and Reporting in Sustainability
4. Sustainability Implementation & Best Practices
To download the complete presentation, visit: https://www.oeconsulting.com.sg/training-presentations
Sustainability: Balancing the Environment, Equity & Economy
If you fix everything you lose fixes for everything else
1. This work was conducted at West Virginia University and the Jet Propulsion
Laboratory under grants with NASA's Software Assurance Research
Program. Reference herein to any specific commercial product, process, or
service by trademark, manufacturer, or otherwise, does not constitute or
imply its endorsement by the United States Government.
If you fix everything you
lose fixes for everything else
Tim Menzies (WVU)
Jairus Hihn (JPL)
Oussama Elrawas (WVU)
Dan Baker (WVU)
Karen Lum (JPL)
tim@menzies.us
International Workshop on Living with Uncertainty,
IEEE ASE 2007, Atlanta, Georgia, oelrawas@mix.wvu.edu
Nov 5, 2007
2. What does this mean?
A supposedly np-hard task
abduction over first-
order theories
nogood/2
Q: for what models does (a few peeks) = (many hard stares)?
2
3. Grow
A: models with Monte Carlo a model
–
Picking input settings at
“collars” random
For each run
–
Score each output
Add score to each input
“Collar” variables set the other settings
variables Harvest
Rule generation experiments,
Narrows –
–
favoring settings with better
Amarel in the 60s
scores
Minimal environments
–
If “collars”, then
DeKleer ’85
… small rules …
–
Master variables
–
– … learned quickly …
Crawford & Baker ‘94
– … will suffice
Feature subset selection
–
Kohavi & John ‘97
Back doors
–
Williams et al ‘03
Etc
–
Implications for uncertainty?
Feather & Menzies RE’02
3
4. STAR: collars + simulated annealing on
For
example
Boehm’s USC’s software process models
USC software process models for effort, defects, threats
controllable
y[i] = impact[i] * project[i] + b[i] for i ∈ {1,2,3,…}
–
α ≤ project[i] ≤ β : uncertainty in project description
–
χ ≤ impact[i] ≤ δ : uncertainty in model calibration
–
uncontrollable
Random solution
pick project[i] and impact[i] from any α .. β , χ .. δ
–
– α .. β set via domain knowledge;
e.g. process maturity in 3 to 5
– range of χ .. δ known from history;
Score solution by effort (Ef),
defects (De) and Threat (Th)
4
5. Two studies
y[i] = impact[i] * project[i] + b[i]
one two
Certain methods
Methods with more uncertainty
Using much historical data
–
Using no historical data
–
Learn the magnitude of the
–
– Monte Carlo at random across
impact[i] relationship
the project[i] settings and
– With fixed impact[I] Tame
impact[i] settings
Monte Carlo at uncontroll-
andom across the ables via
project[i] settings historical
E.g.
E.g. records
STAR
–
Regression-based tools that
–
– Monte Carlo a model
learn impact[I] from historical
records – Score each output
– 93 records of JPL systems – Sort settings by their “C”,
– SCAT: “C”= cumulative score
JPL’s current methods
Rule generation experiments,
–
2CEE:
–
favoring settings with better “C”.
WVU’s improvement over
SCAT (currently under test)
5
6. Bad
Inside STAR
1. sampling
- simulated annealing Good
2. summarizing
- post-processor
38 not-so- good ideas
for setting ∈ Sx { value[setting] += E }
Sort all settings by their value
Ignore uncontrollables impact[I]
–
Assume the top
–
(1 ≤ i ≤ max) project[I] settings
Randomly select the rest
–
“Policy point” :
smallest I with lowest E
–
Median = 50% percentile
Spread = (75-50)% percentile
–
22 good ideas
6
10. Control impact[I] via
historical data
SCAT vs
2CEE vs
STAR project[i]
Stagger around
superset of possible
impact[I]
Median:
50% point
Spread :
(75 - 50)%
10
11. Control impact[I] via
historical data
SCAT vs
2CEE vs
STAR project[i]
Stagger around
superset of possible
impact[I]
Median:
50% point
Spread :
(75 - 50)%
STAR/2cee= 50/ 800= 6%
STAR/scat= 50/1300= 4%
11
12. Control impact[I] via
historical data
SCAT vs
2CEE vs
STAR project[i]
Stagger around
superset of possible
impact[I]
Median:
50% point
Spread :
(75 - 50)%
STAR/2cee= 400/1600= 25% STAR/2cee= 180/ 400= 45%
STAR/2cee= 30/620= 5%
STAR/2cee= 50/ 800= 6%
STAR/scat= 400/1900= 21% STAR/scat= 180/1900= 60%
STAR/scat= 30/730= 4%
STAR/scat= 50/1300= 4%
12
13. Control impact[I] via
historical data
SCAT vs
2CEE vs
STAR project[i]
Stagger around
superset of possible
impact[I]
Median:
50% point
Spread :
(75 - 50)%
STAR/2cee= 400/1600= 25% STAR/2cee= 180/ 400= 45%
STAR/2cee= 30/620= 5%
STAR/2cee= 50/ 800= 6%
STAR/scat= 400/1900= 21% STAR/scat= 180/1900= 60%
STAR/scat= 30/730= 4%
STAR/scat= 50/1300= 4%
13
14. Control impact[I] via
historical data
SCAT vs
2CEE vs
STAR project[i]
Stagger around
superset of possible
impact[I]
Median:
50% point
Spread :
(75 - 50)%
STAR/2cee= 400/1600= 25% STAR/2cee= 180/ 400= 45%
STAR/2cee= 30/620= 5%
STAR/2cee= 50/ 800= 6%
STAR/scat= 400/1900= 21% STAR/scat= 180/1900= 60%
STAR/scat= 30/730= 4%
STAR/scat= 50/1300= 4%
Ignoring historical data is useful (!!!?) 14
15. Control impact[I] via
historical data
SCAT vs
2CEE vs
STAR project[i]
Stagger around
superset of possible
impact[I]
Median:
50% point
Spread :
(75 - 50)%
STAR/2cee= 400/1600= 25% STAR/2cee= 180/ 400= 45%
STAR/2cee= 30/620= 5%
STAR/2cee= 50/ 800= 6%
STAR/scat= 400/1900= 21% STAR/scat= 180/1900= 60%
STAR/scat= 30/730= 4%
STAR/scat= 50/1300= 4%
Ignoring historical data is useful (!!!?) 15
16. Control impact[I] via
historical data
SCAT vs
2CEE vs
STAR project[i]
Stagger around
superset of possible
impact[I]
Median:
50% point
Spread :
(75 - 50)%
STAR/2cee= 400/1600= 25% STAR/2cee= 180/ 400= 45%
STAR/2cee= 30/620= 5%
STAR/2cee= 50/ 800= 6%
STAR/scat= 400/1900= 21% STAR/scat= 180/1900= 60%
STAR/scat= 30/730= 4%
STAR/scat= 50/1300= 4%
Ignoring historical data is useful (!!!?) If you fix everything, you lose fixes for everything else16
17. Luke, trust the force,
I mean, collars
IEEE Computer, Jan 2007
“The strangest thing about software”
19. Related work
Abduction :
World W = minimal set of
–
assumptions (w.r.t. size) such that
Feather, DDP, treatment learning
T ∪ A => G
Optimization of
Not(T U A => error) –
requirement models
Framework for
–
validation,
XEROC PARC, 1980s, qualitative
diagnosis,
representations (QR)
planning,
monitoring,
not overly-specific,
–
explanation,
Quickly collected in a new
–
tutoring,
domain.
test case generation,
– Used for model diagnosis
prediction,…
and repair
Theoretically slow (NP-hard) but
– – Can found creative solutions in
this should be practical: larger space of possible
Abduction + stochastic sampling
qualitative behaviors,
Find collars
than in the tighter space of precise
Learn constraints on collars quantitative behaviors
19
20. Possible optimizations
(not used here)
STAR, an example of a general BORE (best or rest)
process: n runs
–
Stochastic sampling Best= top 10% scores
– –
– Sort settings by “value” Rest = remaining 90%
–
– Rule generation experiments {a,b} = frequency of
–
discretized range in {best, rest
favoring highly “value”-ed settings
See also, elite sampling in the Sort settings by
– Ask
cross-entropy method -1 * (a/n)2 / (a/n + b/n) me why,
off-line
If SA convergence too slow
Other valuable tricks:
Try moving back select into the SA;
–
Incremental discretization:
–
– Constrain solution mutation to Gama&Pinto’s PID +
prefer highly “value”-ed settings Fayyad&Irani
– Limited discrepancy search:
Harvey&Ginsberg
– Treatment learning: Menzies&Yu
20
22. At the “policy point”, diff
diff
STAR’s random solutions
are surprisingly accurate
diff
diff
LC : learn impact[i] via regression (JPL data)
STAR: no tuning, randomly pick impact[i]
Diff = ∑ mre(lc)/ ∑ mre(star)
Mre = abs(predicted - actual) /actual same
diff
∑ mre(lc) / ∑ mre(star) strategic tactical
ground 66% 63%
all 91% 75%
OSP2 99% 125% ●❍ same
same
OSP 112% ●❍ 111% ●❍
flight 101% ●❍ 121% ●❍
same at {95, 99}% confidence (MWU)
{ “●” “❍”}
same
same
Why so little Diff (median= 75%)?
Most influential inputs tightly constrained
–
22
23. (Model uncertainty = collars) << inputs
In many models, a few “collar” variables set the other variables
Narrows (Amarel in the 60s)
–
Minimal environments (DeKleer ’85)
–
Master variables (Crawford & Baker ‘94)
–
– Feature subset selection (Kohavi & John ‘97)
– Back doors (Williams et al ‘03)
– See “The Strangest Thing About Software (IEEE Computer, Jan’07)”
Collars appear in all execution traces (by definition)
You don’t have to find the collars, they’ll find you
–
So, to handle uncertainty
Write a simulator
–
Stagger over uncertainties
–
This talk: a very simple example of this process
From stagger, find collars
–
Constrain collars
–
23
24. Comparisons
Standard software process modeling
Models written more than run (PROSIM community)
–
Limited sensitivity analysis
Limited trade space
Or, expensive, error-prone, incomplete data collection
–
programs
Point solutions
Here:
No data collection
–
Found stable conclusions
–
within a space of possibilities
– Search : very simple
– Solution, not brittle
With trade-off space
22 good ideas, sorted
24
25. Bad
Summary
Living with uncertainty
Sometimes, simpler than you Good
–
may think
more useful than you might
–
think
Simple:
Here, the smallest change
–
to simulating annealing
Useful:
Sometimes uncertainty can
–
teach you more than certainty
If you fix everything, you lose
–
fixes to everything else
An example you
Collars control certainty
can explain to
Uncertainty plus constrained
–
any business user
collars → more certainty
Also, can drive model to
–
better performance
An example you
can explain to
any business user 22 good ideas, sorted
25