Performance testing DataHelix
Ciara Gardiner and Frances Bromley
“To investigate the use of Docker,
continuous integration and cloud
technology to performance test the
Scott Logic DataHelix”
Goals
A data generation tool that creates representative test and
simulation data based on a profile.
It allows the generation of data that both conforms to and
violates the profile, with the goal of aiding testing and
validation.
What is the DataHelix?
What’s in a profile?
Technical decisions
What should
we test in the
DataHelix?
Where did
we want our
tool to live?
How should
we trigger
our CI?
Open source tool to help you package, deploy and run your application.
It uses images that at runtime are containers.
A container contains everything you need to run your application.
What is Docker?
Portable
Open source
Shareable
Isolated
Dependable
Small footprint
Microservice architecture
Why Docker?
How did we use Docker ?
Continuous Integration
YML
Artifacts
“Cloud computing is the on-demand availability of
computer system resources”
Allows us to dictate specifics about the type of
computer the experiment was being run in
Cloud
Metrics
Variation in time baselines for a commit.
Input remains the same.
Output is not verified.
What did our app look like?
What did we learn?
Docker
Circle CI
How to approach performance testing
Going forward
● Remove Docker building from within app.
● Deployment onto Cloud to improve cloud knowledge and define the size of the
computers on which it’s being run.
● Set CI up so it updates any time there’s a change to the DataHelix repo.
Questions?

Performance testing DataHelix | Scott Logic