Copilot to Cover: Why AI can't replace developers with robots, but can make life better

Copilot to Cover:
Why AI can’t replace
developers with robots, but
can make life better
Dr Andy Piper
VP Engineering, Diffblue

What You Will Learn
• What is AI-Augmented coding?
• Different approaches to AI-Augmented software
development
• How AI can transform mundane-but-vital coding
tasks

Dr Andy Piper
• PhD @Cambridge - Map-Reduce
• Senior Staff Engineer @BEA –
WebLogic Server
• Manager @Oracle – WebLogic
Event Server
• CTO @Push Technology - Real-
time streaming
• Global head valuations tech
@CBRE
• VP Engineering @Diffblue

AI-Augmented Coding:
Use of AI – principally machine learning –
to help developers write code
Especially boring, repetitive code
that is tedious and error-prone to write

AI-Augmented Coding State of the Art
Coding competitions
Auto-completion Unit Test-Writing
Pre-Trained Transformer-Based
(GPT-2, GPT-3, others)
Reinforcement Learning
CodeWhisperer

Transformers: ML That Iteratively Predicts Output
Tokens from Input Tokens
the
quick
brown
fox
Transformer
le
the
quick
brown
fox
Transformer
le
renard
the
quick
brown
fox
Transformer
brun
le
renard

Generative Pre-Trained Transformers (GPT) from
OpenAI
GPT-3
• Pre-trained model
• Closed source
• 175b parameters
• GPT-2 + Writes new
text
.
Codex
• Closed source
• 12b parameters
• Writes boilerplate,
repetitive code and
foreign API calls
GPT-2
• Open source
• 1.5b parameters
• Translates text
• Summarises text
• Answers questions
about text
Feb 2019 July 2020 July 2021

Training Copilot
54m
Public
GitHub
Repos*
Training set
Test set
Model Training
($$$)
GitHub Codex:
12Bn Parameter Model
* Some controversy here – not covered in this talk

Codex Runs In the Cloud Due to Model Size
Your IDE
Your code
Azure
Codex Model
Your IDE
Your code +
Completion
Code
fragments
Potential
completions

What Is Copilot Good For?
• Quickly completing
• Boilerplate code
• Repetitive code patterns
• “Foreign lands” – patterns for calling external APIs

Calling “Foreign Lands” without Googling /
StackOverflow

AWS CodeWhisperer
• Same concept as Copilot
• Designed for apps using AWS
services & APIs
• Also transformer-based (supervised
learning)
• Training data is unknown
• Supports Python, Java, Javascript
• Currently in open ‘preview’

Test Writing Is Harder Than
Completion
• Needs more context
• The bar for value is much higher
• Best when 100% autonomous
• It has to work and be correct – no approximations
• Determinism is important
• Complex interdependencies & practical difficulties

Set of all code
that looks like
it might be a good
test
Supervised learning
(Copilot)
Programs that
are valid and run
High coverage
tests
Tests that
satisfy developer
taste
Tests that will work
The tests you
actually need
Tests that are effective
What you actually need
What GPT will give you, but
not what you need
Searching for the Right Kind of Code

Diffblue Reinforcement Learning
Coverage, other metrics
Existing Java code
Write Test Results
Predict A
Better Test
Evaluate
Effectiveness
Run test

Software Change Process with Diffblue Cover
Working
code
Engineer
writes code
change
Pull Request
Engineer
updates
code
PR approved
Update
Diffblue
baseline
No
Yes
Regression Unit
Test Suite
Diffblue
writes test
baseline
Is the
change
correct?
Run all unit
tests

What Is Cover Good For?
• 100% autonomous Java unit tests (other languages in
future)
• 100% automated Java unit test maintenance
• Skeleton tests for untestable code
• Dashboard and reporting on coverage, testability, risk

Demos:
GitHib CoPilot
Diffblue Cover

How AlphaCode Writes Code
Clear
unambiguous
description of
what the code
must do
Unit tests to
validate the
solution
AlphaCode
Transformer
Hundreds of
potential
solutions
Filter semantic duplicates
via cluster analysis
Tens of
potential
solutions
Run Unit tests
The winning
solution

Some Similarities To Cover
• Generates many potential code solutions to the
problem
• Picks the best one

What Is AlphaCode Good For?
• Beating 46% of programmers in coding competitions
• A demonstration of future potential vs. a practical solution
• Not available outside DeepMind (today)

Summary
• AI-augmented tools are real today
and help eliminate tedious, error-
prone coding tasks
• All the leading tools have free
editions you can try today
• The players in this space are just
getting started: buckle up

Learn More About Diffblue Cover
• Talk to us at stand 16
• Visit www.diffblue.com
• Try Cover plug-in & CLI
• www.diffblue.com/free-trial

Thank you
Thank you
Any questions?

Copilot to Cover: Why AI can't replace developers with robots, but can make life better

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Copilot to Cover: Why AI can't replace developers with robots, but can make life better

Similar to Copilot to Cover: Why AI can't replace developers with robots, but can make life better (20)

Recently uploaded

Recently uploaded (20)

Copilot to Cover: Why AI can't replace developers with robots, but can make life better

Editor's Notes