SlideShare a Scribd company logo
1 of 31
QuantumForce.eu
Search and optimisation algorithms for genomics
on quantum accelerators
04th Apr, 2019
Aritra Sarkar
PhD candidate, Quantum Computer Architecture lab
QuTech (Faculty of Applied Sciences)
Dept. of Q&CE (Faculty of Electrical Engineering, Mathematics and Computer Sciences)
Delft University of Technology
Genomics
Machine
Learning
Quantum
Computing
Application
Platform
Method
access the
presentation
here
QuantumForce.eu
01
Big Picture
Big
Picture
Q
Search
Q
Optimise
Dev
Tools
QuantumForce.eu 3
NISQ acceleration
NISQ
FTQC
QEC
ClassicalSimulationLimit
number of qubits
errorrate
https://arxiv.org/abs/1801.00862 - John Preskill, Quantum Computing in the NISQ era and beyond
 NISQ: Noisy Intermediate-Scale Quantum
map problem to quantum:
do:
run Q Algorithm
assess answer
while (result not satisfactory)
save measurement result/statistics
interpret classical answer
HostCPU
Graphics Processing Unit
Field-Programmable Gate Array
Digital Signal Processor
Neural Processing Unit
QuantumAccelerator
4QuantumForce.eu
Genomical exa-scale data
2-40 EB/year
Genomical Big Data
5QuantumForce.eu
Quantum-accelerated genomicsQuantum-accelerated genomics
https://arxiv.org/abs/1903.09575
QuantumForce.eu 6
Whole Genome Sequencing pipeline
https://software.broadinstitute.org/gatk/best-practices/
QuantumForce.eu 7
• Map-to-reference vs. Variant calling
– Multiple solutions evaluated in superposition, but cannot access results for every state
• Superposition is doesn’t have a classical logic equivalent (e.g. AND/OR)
• Generalization of probability theory for complex amplitudes
– Useful when used to explore large solution space but requires only the min/max/mean answer
Superposition vs. ParallelismIndexedbase-pairs
Ref.
Genome
Target
Genome
Differences Variants
embarrassingly parallel
no interaction
need all answers
Not suitable for Q-Acceleration
Ref.
Genome
Splices
Short
Reads
Differences
Index of
min-diff.
Indexedsplices
parallel evolution
global/local interactions
statistical answer
QuantumForce.eu
02
Q Search
Big
Picture
Q
Search
Q
Optimise
Dev
Tools
9QuantumForce.eu
NP
Searching solutions
𝑦𝑠 = 𝑓(𝑥 𝑠) 𝑦𝑠 = 𝑓(𝑥 𝑠) 𝑦𝑠 = 𝑓(𝑥 𝑠)
𝑥 𝑠 = 𝑓−1
(𝑦𝑠)
𝑦0 = 𝑓 𝑥0
𝑦1 = 𝑓 𝑥1
𝑦2 = 𝑓 𝑥2 = 𝑦𝑠
𝑦3 = 𝑓 𝑥3
⋮
Function Evaluation Inductive Logic, GP, ANN, …Function Inversion
Quantum Superposition
P
Bounded Quantum Polynomial
QuantumForce.eu 10
Sub-sequence index search
RG: Reference Genome (3 × 108 𝑏𝑝)
0
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
SR: Short Read (50𝑏𝑝)
0
1
2
4
3
21
21
22
23
24
25
0
1
2
4
3
𝑄𝑡𝑎𝑔
𝑄 𝑑𝑎𝑡𝑎
QuantumForce.eu 11
Ab initio alignment
Naïve Method
• Substring(/subsequence) matching problem
Exact match
Boyer-Moore
+ Improvements
Knuth-Morris-Pratt
+ Improvements
Suffix Trees
+ Improvements
Exact match
(wildcards)
Needleman-Wunsch
Global Alignment
Smith-Waterman
Local Alignment
Simple Edit Transcript using memoization of Levenshtein Distance
+ Improvements(alphabet/operation weights)
Approximate match
BYP/CL/Myers/hybrid-dynamic methods
Alignment with arbitrary (k bounded) gaps
Approximate match
Approximate match
Burrows-Wheeler-Transform + Smith-Waterman (BWT-SW) All local hits
Burrows-Wheeler-Aligner + super-Maximal Exact Match (BWA-MEM) Heuristics
12QuantumForce.eu
Dissecting a quantum algorithm
Superpose
Soln. Space
Encode
Function
Clever
Process
Measure
Initialize
|0⟩⊗n
Classical
Output
Classical
Input
1. Prepare all-zero state for n-qubits (not so trivial experimentally as it sounds)
2. Full superposition in computational basis (H-gate on all qubits)
– OR, superposition of classical input space
3. Transform superposition to evaluate the function (using 1 & 2 qubit gates)
– OR, evaluate function based on classical input space
4. Somehow* increase the amplitude of the solution space
5. Measure out the state
6. Repeat Steps 1-5 to access the modal classical output
* the quantum magic of interference
13QuantumForce.eu
Evolution
Tight bounds on quantum
searching
… arbitrary initial
amplitude distribution
Quantum Pattern
Matching
Grover Search one solution
full, uniform
database
known Oracle for
solution in database
optimal iterations
multiple (un)known
solutions
full, uniform
database
known Oracle for
solution in database
optimal iterations
multiple known
solutions
arbitrary database
known Oracle for
solution in database
optimal iterations
multiple unknown
solutions
sliding index
database
alphabet based
Oracles
optimal iterations
one solution
sub-string
phonebook
0 Hamming Distance
Oracle
optimal iterations
… Quantum
Bioinformatics
Quantum Associative
Memory
multiple known
solutions
arbitrary database
known Oracle for
solution in database
higher Pmax
iteration
… associative memory
with distributed queries
multiple known
solutions
arbitrary database Binomial Oracle optimal iterations
… improved distributed
queries
multiple unknown
solutions
arbitrary database Binomial Oracle
higher Pmax
iteration
Gen 1
(tested)
QUS
Gen 2
(tested)
QPM
Gen 3
(tested)
QNN
Q Walk / Graph SearchQ Unstructured Search Q Structured Search HSP (abelian/dihedral)
14QuantumForce.eu
QiMAM-dq
• Quantum indexed multi-associative memory (MSc thesis)
– Grover’s Search + Quantum Neural Networks + Content Addressable Storage
15QuantumForce.eu
03
Q Optimise
Big
Picture
Q
Search
Q
Optimise
Dev
Tools
16QuantumForce.eu
Purebreds
• a.k.a. Coherent Protocols
– e.g. Shor’s factorisation, Shor’s discrete-log, QFT, Quantum Phase Estimation, Harrow-Hassidim-Lloyd,
Matrix inversion ...
– Most studied/popular quantum algorithms so far
• Exponential speedup
– Caveats
• Noise tolerance
– Number of qubits for FT
• Circuit depth
• Quantum I/O
– Classical Input: State preparation
– Classical Output: State tomography
– QRAM
O ( f(experimental) x g(no-cloning) x h(algorithm) )
17QuantumForce.eu
Workhorses
• Peter Shor estimates 2048-RSA requires ~5k qubits (times 102-103 physical qubits) & ~107 gates
• Near-term Quantum Algorithms
– low depth circuits without extensive QEC (small-codes)
– enough qubits to just store the problem (hard to do better)
– still solve useful problems with local constraints
– Adaptable optimization algorithms (easy to map to problem)
• Genetic Algorithm / Evolutionary Programs
• Deep Learning
– Quantum Approximate Optimization Algorithm
• NP-Hard combinatorial optimisation problems in Quantum Machine Learning
• Polynomial-time solution for every instance with guaranteed approximation quality bound
• Interesting because of its potential to exhibit near-term quantum supremacy
• Gate-based implementation inspired by Adiabatic QC and Q Annealing
https://www.bcg.com/en-ca/publications/2018/next-decade-quantum-computing-how-play.aspx
18QuantumForce.eu
Genomics optimisation
ALGORITHM
19QuantumForce.eu
De novo assembly
• Eulerian path/cycle [De Bruijn Graph]
– Eulerian path is a trail in a finite graph which visits every edge exactly once.
– Eulerian cycle is an Eulerian trail which starts and ends on the same vertex.
• Hamiltonian path/cycle [Overlap-Layout-Consensus]
– Hamiltonian path is a graph path between two vertices of a graph that visits each vertex exactly once.
– Hamiltonian cycle is a path which starts from one node and ends at the same node covering all the nodes of that graph.
– If a Hamiltonian path exists whose endpoints are adjacent, then the resulting graph cycle is called a Hamiltonian cycle.
• Decision version vs. Function version
• Travelling Salesman Problem
– A cycle that visits all nodes of the graph and such that the sum of the edge weights is minimum.
– Find a Hamiltonian cycle of minimum weight.
• Using Quantum Approximate Optimisation Algorithm
+ “Easy” to solve
- Error-prone
- Bad for super-sampled reads
- Bad for long reads
- NP-Hard to solve
20QuantumForce.eu
QAOA
Genomics optimisation
Bitflip (X) mixers
VQE
Controlled-bit-flip (Λf(X)) mixers
XY mixers
Controlled-XY mixers
Permutation mixers
Maximum cut
Maximum-L-SAT
Minimum-L-SAT
Set Splitting
MaxE3LIN2
Maximum Independent Set
Maximum Clique
Minimum Vertex Cover
Maximum Set Packing
Minimum Set Cover
Maximum-K-Colorable Subgraph
Graph Partitioning (Minimum Bisection)
Maximum Bisection
Maximum Vertex K-Cover
Maximum-K-Colorable Induced Subgraph
Minimum Graph Coloring
Minimum Clique Cover
Traveling Salesperson Problem = minimum cost Hamiltonian Cycle
SMS, minimizing total weighted squared tardiness
SMS, minimizing total weighted tardiness
SMS, with release dates
DNA Sequence Reconstruction
by De novo Assembly
21QuantumForce.eu
QAOA
• Quantum/classical Hybrid algorithm
– Parameterised quantum subroutine is run within a classical optimization loop
– Prepare the quantum state | 𝜓 𝜃 , often called the ansatz
– Measure the expectation value 𝜓 𝜃 ℋ 𝜓 𝜃
• By Variational theorem, expectation value ℋ ⟩|𝑎𝑛𝑠𝑎𝑡𝑧 ≥ λ1 (smallest eigenvalue; lowest energy; ground-state)
– Find an optimal choice of real-valued parameters 𝜃 such that the expectation value is minimised
– Implementation based on Variational Quantum Eigensolver primitive
• Challenges
– Heuristic - no general recipe of Ansatz definition works universally
– Optimiser choice
– Initial Parameter selection is arbitrary
– Convergence not always guaranteed
– High number of Iteration
22QuantumForce.eu
04
Dev Tools
Big
Picture
Q
Search
Q
Optimise
Dev
Tools
23QuantumForce.eu
Quantum HLL
• OpenQL (inspired by OpenCL)
• Programs (Kernels (Operations))
– Decompose: Toffoli, Control, Unitary, Rotation, CX-CZ
– Optimize: Cancel UU†
– Scheduling: ASAP, ALAP, Balanced
– Mapping: Surface-17 connectivity routing
– QASM: cQASM, eQASM (topology, resource constraints)
• Other features:
– Conjugate uncompute
– Classical logical/comparative operations
– Language features
• Recursion, loops, functions
• Libraries like NumPy, Matplotlib
Configure Platform
Create Program
Create Kernels
Populate each Kernel
Add Kernels to Program
Compile Program
24QuantumForce.eu
Testing
1. Shortest superstring (Σ,M)
– (4,3) = AAATTTGTTCTTATGGTGCTGATCGTCCTCATAGTACTAAGGGCGGAGCCGCAGACGAACCCACAA
– (2,2) = 00110
– (4,2) = AATTGTCTAGGCGACCA
2. Random String
– Chargaff's Parity rules
• %A = %T
• %C = %G
– %GC : %AT (40:60)
– Other entropic measures
3. Real Data Segment (GenBank, wgsim)
– part of HBB (hemoglobin subunit beta)
• Chromosome 11 (region p15.4) of Homo sapiens
– Sickle cell anemia
• ATG-GTG-CAT-CTG-ACT-CCT-GAG
• ATG-GTG-CAC-CTG-ACT-CCT-GTG
4. Minimal entropy
– tandem repeats
25QuantumForce.eu
Platforms
• Quantum Infinity
– DiCarlo lab (QuTech)
– Simulator (QX and QuantumSim)
– Superconducting qubits
– Cloud access
• Quantum Inspire
– QuTech (TU Delft + TNO)
– ~ 37 qubits simulator on Cartesius supercomputer with SURFsara
– Semiconducting qubits
• Quantum Learning Machine
– AtoS BullSequana
• Digital Annealers
– Fujitsu
26QuantumForce.eu
Related applications
DNA Fingerprinting Motif FindingAmino-acid Sequencing
Pattern based Trading
Object Recognition
Speech Recognition
18x18 px
17 qubits
~ 50k gates
Exact matching
Traffic OptimisationWarehousing
27QuantumForce.eu
QuantumForce.eu
Experimental Q Chip
(Decoherence and Gate errors)
Superconducting/Semiconducting qubits
Industrial/Societal Q Chip
(no decoherence, no gate errors)
Classical Q simulators
QuTech
Perfect Qubits
Realistic Qubits
QuantumForce
Industry/Society App.
Physical Platform
QCA lab
28QuantumForce.eu
QuantumForce.eu
29QuantumForce.eu
QuantumForce.eu
• Big Data Analytics
• Industry 4.0
• Cyber-Physical Systems
• Artificial Intelligence and Machine Learning
..... and other applications
Contacts for collaboration:
• Koen Bertels, CEO
koen@QuantumForce.eu
• Zaid Al-Ars, CTO
zaid@QuantumForce.eu
30QuantumForce.eu
Koen
Bertels
Carmen
G. Almudever
Razvan
Nane
Imran
Ashraf
Nader
Khammassi
Hans
van Someren
Leo
DiCarlo
Jeroen
van Straten
LingLing
Lao
Savvas
Varsamopoulos
Matthijs
Brobbel
Aritra
Sarkar
Abid
Moueddene
Xiang
Fu
Leon
Riesebos
Daniel
Moreno
Miguel
Serrao
Amitabh
Yadav
Alejandro
Morais
Anneriet
Krol
Yaoling
Yang
Mengyu
Zhang
Bas
van Wee
Diogo
Valada
Search and optimisation algorithms for genomics
on quantum accelerators
Aritra Sarkar
Quantum Computer Architecture Lab
QuTech and Department of Quantum & Computer Engineering
Delft University of Technology

More Related Content

What's hot

Introduction to Monte Carlo Ray Tracing, OpenCL Implementation (CEDEC 2014)
Introduction to Monte Carlo Ray Tracing, OpenCL Implementation (CEDEC 2014)Introduction to Monte Carlo Ray Tracing, OpenCL Implementation (CEDEC 2014)
Introduction to Monte Carlo Ray Tracing, OpenCL Implementation (CEDEC 2014)Takahiro Harada
 
Automatic Features Generation And Model Training On Spark: A Bayesian Approach
Automatic Features Generation And Model Training On Spark: A Bayesian ApproachAutomatic Features Generation And Model Training On Spark: A Bayesian Approach
Automatic Features Generation And Model Training On Spark: A Bayesian ApproachSpark Summit
 
LEGaTO: Software Stack Runtimes
LEGaTO: Software Stack RuntimesLEGaTO: Software Stack Runtimes
LEGaTO: Software Stack RuntimesLEGATO project
 
Veriloggen.Thread & Stream: 最高性能FPGAコンピューティングを 目指したミックスドパラダイム型高位合成 (FPGAX 201...
Veriloggen.Thread & Stream: 最高性能FPGAコンピューティングを 目指したミックスドパラダイム型高位合成 (FPGAX 201...Veriloggen.Thread & Stream: 最高性能FPGAコンピューティングを 目指したミックスドパラダイム型高位合成 (FPGAX 201...
Veriloggen.Thread & Stream: 最高性能FPGAコンピューティングを 目指したミックスドパラダイム型高位合成 (FPGAX 201...Shinya Takamaeda-Y
 
Introduction to Chainer 11 may,2018
Introduction to Chainer 11 may,2018Introduction to Chainer 11 may,2018
Introduction to Chainer 11 may,2018Preferred Networks
 
Advances in the Solution of Navier-Stokes Eqs. in GPGPU Hardware. Modelling F...
Advances in the Solution of Navier-Stokes Eqs. in GPGPU Hardware. Modelling F...Advances in the Solution of Navier-Stokes Eqs. in GPGPU Hardware. Modelling F...
Advances in the Solution of Navier-Stokes Eqs. in GPGPU Hardware. Modelling F...Storti Mario
 
EuroPython 2017 - PyData - Deep Learning your Broadband Network @ HOME
EuroPython 2017 - PyData - Deep Learning your Broadband Network @ HOMEEuroPython 2017 - PyData - Deep Learning your Broadband Network @ HOME
EuroPython 2017 - PyData - Deep Learning your Broadband Network @ HOMEHONGJOO LEE
 
Foveated Ray Tracing for VR on Multiple GPUs
Foveated Ray Tracing for VR on Multiple GPUsFoveated Ray Tracing for VR on Multiple GPUs
Foveated Ray Tracing for VR on Multiple GPUsTakahiro Harada
 
On the Capability and Achievable Performance of FPGAs for HPC Applications
On the Capability and Achievable Performance of FPGAs for HPC ApplicationsOn the Capability and Achievable Performance of FPGAs for HPC Applications
On the Capability and Achievable Performance of FPGAs for HPC ApplicationsWim Vanderbauwhede
 
Pr057 mask rcnn
Pr057 mask rcnnPr057 mask rcnn
Pr057 mask rcnnTaeoh Kim
 
Exploring Optimization in Vowpal Wabbit
Exploring Optimization in Vowpal WabbitExploring Optimization in Vowpal Wabbit
Exploring Optimization in Vowpal WabbitShiladitya Sen
 
第11回 配信講義 計算科学技術特論A(2021)
第11回 配信講義 計算科学技術特論A(2021)第11回 配信講義 計算科学技術特論A(2021)
第11回 配信講義 計算科学技術特論A(2021)RCCSRENKEI
 
A Simple Communication System Design Lab #1 with MATLAB Simulink
A Simple Communication System Design Lab #1 with MATLAB Simulink A Simple Communication System Design Lab #1 with MATLAB Simulink
A Simple Communication System Design Lab #1 with MATLAB Simulink Jaewook. Kang
 
Compilation of COSMO for GPU using LLVM
Compilation of COSMO for GPU using LLVMCompilation of COSMO for GPU using LLVM
Compilation of COSMO for GPU using LLVMLinaro
 
A Simple Communication System Design Lab #2 with MATLAB Simulink
A Simple Communication System Design Lab #2 with MATLAB SimulinkA Simple Communication System Design Lab #2 with MATLAB Simulink
A Simple Communication System Design Lab #2 with MATLAB SimulinkJaewook. Kang
 
Nvidia in bioinformatics
Nvidia in bioinformaticsNvidia in bioinformatics
Nvidia in bioinformaticsShanker Trivedi
 
OXiGen: A tool for automatic acceleration of C functions into dataflow FPGA-b...
OXiGen: A tool for automatic acceleration of C functions into dataflow FPGA-b...OXiGen: A tool for automatic acceleration of C functions into dataflow FPGA-b...
OXiGen: A tool for automatic acceleration of C functions into dataflow FPGA-b...NECST Lab @ Politecnico di Milano
 
Technical Tricks of Vowpal Wabbit
Technical Tricks of Vowpal WabbitTechnical Tricks of Vowpal Wabbit
Technical Tricks of Vowpal Wabbitjakehofman
 

What's hot (20)

Introduction to Monte Carlo Ray Tracing, OpenCL Implementation (CEDEC 2014)
Introduction to Monte Carlo Ray Tracing, OpenCL Implementation (CEDEC 2014)Introduction to Monte Carlo Ray Tracing, OpenCL Implementation (CEDEC 2014)
Introduction to Monte Carlo Ray Tracing, OpenCL Implementation (CEDEC 2014)
 
Automatic Features Generation And Model Training On Spark: A Bayesian Approach
Automatic Features Generation And Model Training On Spark: A Bayesian ApproachAutomatic Features Generation And Model Training On Spark: A Bayesian Approach
Automatic Features Generation And Model Training On Spark: A Bayesian Approach
 
Europy17_dibernardo
Europy17_dibernardoEuropy17_dibernardo
Europy17_dibernardo
 
ECE 565 FInal Project
ECE 565 FInal ProjectECE 565 FInal Project
ECE 565 FInal Project
 
LEGaTO: Software Stack Runtimes
LEGaTO: Software Stack RuntimesLEGaTO: Software Stack Runtimes
LEGaTO: Software Stack Runtimes
 
Veriloggen.Thread & Stream: 最高性能FPGAコンピューティングを 目指したミックスドパラダイム型高位合成 (FPGAX 201...
Veriloggen.Thread & Stream: 最高性能FPGAコンピューティングを 目指したミックスドパラダイム型高位合成 (FPGAX 201...Veriloggen.Thread & Stream: 最高性能FPGAコンピューティングを 目指したミックスドパラダイム型高位合成 (FPGAX 201...
Veriloggen.Thread & Stream: 最高性能FPGAコンピューティングを 目指したミックスドパラダイム型高位合成 (FPGAX 201...
 
Introduction to Chainer 11 may,2018
Introduction to Chainer 11 may,2018Introduction to Chainer 11 may,2018
Introduction to Chainer 11 may,2018
 
Advances in the Solution of Navier-Stokes Eqs. in GPGPU Hardware. Modelling F...
Advances in the Solution of Navier-Stokes Eqs. in GPGPU Hardware. Modelling F...Advances in the Solution of Navier-Stokes Eqs. in GPGPU Hardware. Modelling F...
Advances in the Solution of Navier-Stokes Eqs. in GPGPU Hardware. Modelling F...
 
EuroPython 2017 - PyData - Deep Learning your Broadband Network @ HOME
EuroPython 2017 - PyData - Deep Learning your Broadband Network @ HOMEEuroPython 2017 - PyData - Deep Learning your Broadband Network @ HOME
EuroPython 2017 - PyData - Deep Learning your Broadband Network @ HOME
 
Foveated Ray Tracing for VR on Multiple GPUs
Foveated Ray Tracing for VR on Multiple GPUsFoveated Ray Tracing for VR on Multiple GPUs
Foveated Ray Tracing for VR on Multiple GPUs
 
On the Capability and Achievable Performance of FPGAs for HPC Applications
On the Capability and Achievable Performance of FPGAs for HPC ApplicationsOn the Capability and Achievable Performance of FPGAs for HPC Applications
On the Capability and Achievable Performance of FPGAs for HPC Applications
 
Pr057 mask rcnn
Pr057 mask rcnnPr057 mask rcnn
Pr057 mask rcnn
 
Exploring Optimization in Vowpal Wabbit
Exploring Optimization in Vowpal WabbitExploring Optimization in Vowpal Wabbit
Exploring Optimization in Vowpal Wabbit
 
第11回 配信講義 計算科学技術特論A(2021)
第11回 配信講義 計算科学技術特論A(2021)第11回 配信講義 計算科学技術特論A(2021)
第11回 配信講義 計算科学技術特論A(2021)
 
A Simple Communication System Design Lab #1 with MATLAB Simulink
A Simple Communication System Design Lab #1 with MATLAB Simulink A Simple Communication System Design Lab #1 with MATLAB Simulink
A Simple Communication System Design Lab #1 with MATLAB Simulink
 
Compilation of COSMO for GPU using LLVM
Compilation of COSMO for GPU using LLVMCompilation of COSMO for GPU using LLVM
Compilation of COSMO for GPU using LLVM
 
A Simple Communication System Design Lab #2 with MATLAB Simulink
A Simple Communication System Design Lab #2 with MATLAB SimulinkA Simple Communication System Design Lab #2 with MATLAB Simulink
A Simple Communication System Design Lab #2 with MATLAB Simulink
 
Nvidia in bioinformatics
Nvidia in bioinformaticsNvidia in bioinformatics
Nvidia in bioinformatics
 
OXiGen: A tool for automatic acceleration of C functions into dataflow FPGA-b...
OXiGen: A tool for automatic acceleration of C functions into dataflow FPGA-b...OXiGen: A tool for automatic acceleration of C functions into dataflow FPGA-b...
OXiGen: A tool for automatic acceleration of C functions into dataflow FPGA-b...
 
Technical Tricks of Vowpal Wabbit
Technical Tricks of Vowpal WabbitTechnical Tricks of Vowpal Wabbit
Technical Tricks of Vowpal Wabbit
 

Similar to Aritra Sarkar - Search and Optimisation Algorithms for Genomics on Quantum Accelerators

Virus, Vaccines, Genes and Quantum - 2020-06-18
Virus, Vaccines, Genes and Quantum - 2020-06-18Virus, Vaccines, Genes and Quantum - 2020-06-18
Virus, Vaccines, Genes and Quantum - 2020-06-18Aritra Sarkar
 
Genomics algorithms on digital NISQ accelerators - 2019-01-25
Genomics algorithms on digital NISQ accelerators - 2019-01-25Genomics algorithms on digital NISQ accelerators - 2019-01-25
Genomics algorithms on digital NISQ accelerators - 2019-01-25Aritra Sarkar
 
SW/HW co-design for near-term quantum computing
SW/HW co-design for near-term quantum computingSW/HW co-design for near-term quantum computing
SW/HW co-design for near-term quantum computinginside-BigData.com
 
Quantum algorithms for pattern matching in genomic sequences - 2018-06-22
Quantum algorithms for pattern matching in genomic sequences - 2018-06-22Quantum algorithms for pattern matching in genomic sequences - 2018-06-22
Quantum algorithms for pattern matching in genomic sequences - 2018-06-22Aritra Sarkar
 
EGUE Technikrom Final_8_12_13
EGUE Technikrom Final_8_12_13EGUE Technikrom Final_8_12_13
EGUE Technikrom Final_8_12_13Paul Brodbeck
 
What is Quantum Computing and Why it is Important
What is Quantum Computing and Why it is ImportantWhat is Quantum Computing and Why it is Important
What is Quantum Computing and Why it is ImportantSasha Lazarevic
 
[Q-tangled 22] Deconstructing Quantum Machine Learning Algorithms - Sasha Laz...
[Q-tangled 22] Deconstructing Quantum Machine Learning Algorithms - Sasha Laz...[Q-tangled 22] Deconstructing Quantum Machine Learning Algorithms - Sasha Laz...
[Q-tangled 22] Deconstructing Quantum Machine Learning Algorithms - Sasha Laz...DataScienceConferenc1
 
Reversible Logic Synthesis and RevKit
Reversible Logic Synthesis and RevKitReversible Logic Synthesis and RevKit
Reversible Logic Synthesis and RevKitMathias Soeken
 
Quantum Computing: The Why and How
Quantum Computing: The Why and HowQuantum Computing: The Why and How
Quantum Computing: The Why and Howinside-BigData.com
 
Quantum computation: past-now-future - 2021-06-19
Quantum computation: past-now-future - 2021-06-19Quantum computation: past-now-future - 2021-06-19
Quantum computation: past-now-future - 2021-06-19Aritra Sarkar
 
Fundamentals of Quantum Computing
Fundamentals of Quantum ComputingFundamentals of Quantum Computing
Fundamentals of Quantum Computingachakracu
 
QGATE 0.3: QUANTUM CIRCUIT SIMULATOR
QGATE 0.3: QUANTUM CIRCUIT SIMULATORQGATE 0.3: QUANTUM CIRCUIT SIMULATOR
QGATE 0.3: QUANTUM CIRCUIT SIMULATORNVIDIA Japan
 
開発者が語る NVIDIA cuQuantum SDK
開発者が語る NVIDIA cuQuantum SDK開発者が語る NVIDIA cuQuantum SDK
開発者が語る NVIDIA cuQuantum SDKNVIDIA Japan
 
Quantum Computation simplified.pptx
Quantum Computation simplified.pptxQuantum Computation simplified.pptx
Quantum Computation simplified.pptxSundarappanKathiresa
 
Introduction to Quantum Computing
Introduction to Quantum ComputingIntroduction to Quantum Computing
Introduction to Quantum ComputingNicolo Musmeci
 

Similar to Aritra Sarkar - Search and Optimisation Algorithms for Genomics on Quantum Accelerators (20)

Virus, Vaccines, Genes and Quantum - 2020-06-18
Virus, Vaccines, Genes and Quantum - 2020-06-18Virus, Vaccines, Genes and Quantum - 2020-06-18
Virus, Vaccines, Genes and Quantum - 2020-06-18
 
Genomics algorithms on digital NISQ accelerators - 2019-01-25
Genomics algorithms on digital NISQ accelerators - 2019-01-25Genomics algorithms on digital NISQ accelerators - 2019-01-25
Genomics algorithms on digital NISQ accelerators - 2019-01-25
 
SW/HW co-design for near-term quantum computing
SW/HW co-design for near-term quantum computingSW/HW co-design for near-term quantum computing
SW/HW co-design for near-term quantum computing
 
Quantum algorithms for pattern matching in genomic sequences - 2018-06-22
Quantum algorithms for pattern matching in genomic sequences - 2018-06-22Quantum algorithms for pattern matching in genomic sequences - 2018-06-22
Quantum algorithms for pattern matching in genomic sequences - 2018-06-22
 
EGUE Technikrom Final_8_12_13
EGUE Technikrom Final_8_12_13EGUE Technikrom Final_8_12_13
EGUE Technikrom Final_8_12_13
 
Quantum programming
Quantum programmingQuantum programming
Quantum programming
 
What is Quantum Computing and Why it is Important
What is Quantum Computing and Why it is ImportantWhat is Quantum Computing and Why it is Important
What is Quantum Computing and Why it is Important
 
[Q-tangled 22] Deconstructing Quantum Machine Learning Algorithms - Sasha Laz...
[Q-tangled 22] Deconstructing Quantum Machine Learning Algorithms - Sasha Laz...[Q-tangled 22] Deconstructing Quantum Machine Learning Algorithms - Sasha Laz...
[Q-tangled 22] Deconstructing Quantum Machine Learning Algorithms - Sasha Laz...
 
Reversible Logic Synthesis and RevKit
Reversible Logic Synthesis and RevKitReversible Logic Synthesis and RevKit
Reversible Logic Synthesis and RevKit
 
Des2017 quantum computing_final
Des2017 quantum computing_finalDes2017 quantum computing_final
Des2017 quantum computing_final
 
2017 10 17_quantum_program_v2
2017 10 17_quantum_program_v22017 10 17_quantum_program_v2
2017 10 17_quantum_program_v2
 
Quantum Computing: The Why and How
Quantum Computing: The Why and HowQuantum Computing: The Why and How
Quantum Computing: The Why and How
 
preskill.pptx
preskill.pptxpreskill.pptx
preskill.pptx
 
Quantum computation: past-now-future - 2021-06-19
Quantum computation: past-now-future - 2021-06-19Quantum computation: past-now-future - 2021-06-19
Quantum computation: past-now-future - 2021-06-19
 
Quantum computing
Quantum computingQuantum computing
Quantum computing
 
Fundamentals of Quantum Computing
Fundamentals of Quantum ComputingFundamentals of Quantum Computing
Fundamentals of Quantum Computing
 
QGATE 0.3: QUANTUM CIRCUIT SIMULATOR
QGATE 0.3: QUANTUM CIRCUIT SIMULATORQGATE 0.3: QUANTUM CIRCUIT SIMULATOR
QGATE 0.3: QUANTUM CIRCUIT SIMULATOR
 
開発者が語る NVIDIA cuQuantum SDK
開発者が語る NVIDIA cuQuantum SDK開発者が語る NVIDIA cuQuantum SDK
開発者が語る NVIDIA cuQuantum SDK
 
Quantum Computation simplified.pptx
Quantum Computation simplified.pptxQuantum Computation simplified.pptx
Quantum Computation simplified.pptx
 
Introduction to Quantum Computing
Introduction to Quantum ComputingIntroduction to Quantum Computing
Introduction to Quantum Computing
 

Recently uploaded

Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfngoud9212
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 

Recently uploaded (20)

Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdf
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 

Aritra Sarkar - Search and Optimisation Algorithms for Genomics on Quantum Accelerators

  • 1. QuantumForce.eu Search and optimisation algorithms for genomics on quantum accelerators 04th Apr, 2019 Aritra Sarkar PhD candidate, Quantum Computer Architecture lab QuTech (Faculty of Applied Sciences) Dept. of Q&CE (Faculty of Electrical Engineering, Mathematics and Computer Sciences) Delft University of Technology Genomics Machine Learning Quantum Computing Application Platform Method access the presentation here
  • 3. QuantumForce.eu 3 NISQ acceleration NISQ FTQC QEC ClassicalSimulationLimit number of qubits errorrate https://arxiv.org/abs/1801.00862 - John Preskill, Quantum Computing in the NISQ era and beyond  NISQ: Noisy Intermediate-Scale Quantum map problem to quantum: do: run Q Algorithm assess answer while (result not satisfactory) save measurement result/statistics interpret classical answer HostCPU Graphics Processing Unit Field-Programmable Gate Array Digital Signal Processor Neural Processing Unit QuantumAccelerator
  • 6. QuantumForce.eu 6 Whole Genome Sequencing pipeline https://software.broadinstitute.org/gatk/best-practices/
  • 7. QuantumForce.eu 7 • Map-to-reference vs. Variant calling – Multiple solutions evaluated in superposition, but cannot access results for every state • Superposition is doesn’t have a classical logic equivalent (e.g. AND/OR) • Generalization of probability theory for complex amplitudes – Useful when used to explore large solution space but requires only the min/max/mean answer Superposition vs. ParallelismIndexedbase-pairs Ref. Genome Target Genome Differences Variants embarrassingly parallel no interaction need all answers Not suitable for Q-Acceleration Ref. Genome Splices Short Reads Differences Index of min-diff. Indexedsplices parallel evolution global/local interactions statistical answer
  • 9. 9QuantumForce.eu NP Searching solutions 𝑦𝑠 = 𝑓(𝑥 𝑠) 𝑦𝑠 = 𝑓(𝑥 𝑠) 𝑦𝑠 = 𝑓(𝑥 𝑠) 𝑥 𝑠 = 𝑓−1 (𝑦𝑠) 𝑦0 = 𝑓 𝑥0 𝑦1 = 𝑓 𝑥1 𝑦2 = 𝑓 𝑥2 = 𝑦𝑠 𝑦3 = 𝑓 𝑥3 ⋮ Function Evaluation Inductive Logic, GP, ANN, …Function Inversion Quantum Superposition P Bounded Quantum Polynomial
  • 10. QuantumForce.eu 10 Sub-sequence index search RG: Reference Genome (3 × 108 𝑏𝑝) 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 SR: Short Read (50𝑏𝑝) 0 1 2 4 3 21 21 22 23 24 25 0 1 2 4 3 𝑄𝑡𝑎𝑔 𝑄 𝑑𝑎𝑡𝑎
  • 11. QuantumForce.eu 11 Ab initio alignment Naïve Method • Substring(/subsequence) matching problem Exact match Boyer-Moore + Improvements Knuth-Morris-Pratt + Improvements Suffix Trees + Improvements Exact match (wildcards) Needleman-Wunsch Global Alignment Smith-Waterman Local Alignment Simple Edit Transcript using memoization of Levenshtein Distance + Improvements(alphabet/operation weights) Approximate match BYP/CL/Myers/hybrid-dynamic methods Alignment with arbitrary (k bounded) gaps Approximate match Approximate match Burrows-Wheeler-Transform + Smith-Waterman (BWT-SW) All local hits Burrows-Wheeler-Aligner + super-Maximal Exact Match (BWA-MEM) Heuristics
  • 12. 12QuantumForce.eu Dissecting a quantum algorithm Superpose Soln. Space Encode Function Clever Process Measure Initialize |0⟩⊗n Classical Output Classical Input 1. Prepare all-zero state for n-qubits (not so trivial experimentally as it sounds) 2. Full superposition in computational basis (H-gate on all qubits) – OR, superposition of classical input space 3. Transform superposition to evaluate the function (using 1 & 2 qubit gates) – OR, evaluate function based on classical input space 4. Somehow* increase the amplitude of the solution space 5. Measure out the state 6. Repeat Steps 1-5 to access the modal classical output * the quantum magic of interference
  • 13. 13QuantumForce.eu Evolution Tight bounds on quantum searching … arbitrary initial amplitude distribution Quantum Pattern Matching Grover Search one solution full, uniform database known Oracle for solution in database optimal iterations multiple (un)known solutions full, uniform database known Oracle for solution in database optimal iterations multiple known solutions arbitrary database known Oracle for solution in database optimal iterations multiple unknown solutions sliding index database alphabet based Oracles optimal iterations one solution sub-string phonebook 0 Hamming Distance Oracle optimal iterations … Quantum Bioinformatics Quantum Associative Memory multiple known solutions arbitrary database known Oracle for solution in database higher Pmax iteration … associative memory with distributed queries multiple known solutions arbitrary database Binomial Oracle optimal iterations … improved distributed queries multiple unknown solutions arbitrary database Binomial Oracle higher Pmax iteration Gen 1 (tested) QUS Gen 2 (tested) QPM Gen 3 (tested) QNN Q Walk / Graph SearchQ Unstructured Search Q Structured Search HSP (abelian/dihedral)
  • 14. 14QuantumForce.eu QiMAM-dq • Quantum indexed multi-associative memory (MSc thesis) – Grover’s Search + Quantum Neural Networks + Content Addressable Storage
  • 16. 16QuantumForce.eu Purebreds • a.k.a. Coherent Protocols – e.g. Shor’s factorisation, Shor’s discrete-log, QFT, Quantum Phase Estimation, Harrow-Hassidim-Lloyd, Matrix inversion ... – Most studied/popular quantum algorithms so far • Exponential speedup – Caveats • Noise tolerance – Number of qubits for FT • Circuit depth • Quantum I/O – Classical Input: State preparation – Classical Output: State tomography – QRAM O ( f(experimental) x g(no-cloning) x h(algorithm) )
  • 17. 17QuantumForce.eu Workhorses • Peter Shor estimates 2048-RSA requires ~5k qubits (times 102-103 physical qubits) & ~107 gates • Near-term Quantum Algorithms – low depth circuits without extensive QEC (small-codes) – enough qubits to just store the problem (hard to do better) – still solve useful problems with local constraints – Adaptable optimization algorithms (easy to map to problem) • Genetic Algorithm / Evolutionary Programs • Deep Learning – Quantum Approximate Optimization Algorithm • NP-Hard combinatorial optimisation problems in Quantum Machine Learning • Polynomial-time solution for every instance with guaranteed approximation quality bound • Interesting because of its potential to exhibit near-term quantum supremacy • Gate-based implementation inspired by Adiabatic QC and Q Annealing https://www.bcg.com/en-ca/publications/2018/next-decade-quantum-computing-how-play.aspx
  • 19. 19QuantumForce.eu De novo assembly • Eulerian path/cycle [De Bruijn Graph] – Eulerian path is a trail in a finite graph which visits every edge exactly once. – Eulerian cycle is an Eulerian trail which starts and ends on the same vertex. • Hamiltonian path/cycle [Overlap-Layout-Consensus] – Hamiltonian path is a graph path between two vertices of a graph that visits each vertex exactly once. – Hamiltonian cycle is a path which starts from one node and ends at the same node covering all the nodes of that graph. – If a Hamiltonian path exists whose endpoints are adjacent, then the resulting graph cycle is called a Hamiltonian cycle. • Decision version vs. Function version • Travelling Salesman Problem – A cycle that visits all nodes of the graph and such that the sum of the edge weights is minimum. – Find a Hamiltonian cycle of minimum weight. • Using Quantum Approximate Optimisation Algorithm + “Easy” to solve - Error-prone - Bad for super-sampled reads - Bad for long reads - NP-Hard to solve
  • 20. 20QuantumForce.eu QAOA Genomics optimisation Bitflip (X) mixers VQE Controlled-bit-flip (Λf(X)) mixers XY mixers Controlled-XY mixers Permutation mixers Maximum cut Maximum-L-SAT Minimum-L-SAT Set Splitting MaxE3LIN2 Maximum Independent Set Maximum Clique Minimum Vertex Cover Maximum Set Packing Minimum Set Cover Maximum-K-Colorable Subgraph Graph Partitioning (Minimum Bisection) Maximum Bisection Maximum Vertex K-Cover Maximum-K-Colorable Induced Subgraph Minimum Graph Coloring Minimum Clique Cover Traveling Salesperson Problem = minimum cost Hamiltonian Cycle SMS, minimizing total weighted squared tardiness SMS, minimizing total weighted tardiness SMS, with release dates DNA Sequence Reconstruction by De novo Assembly
  • 21. 21QuantumForce.eu QAOA • Quantum/classical Hybrid algorithm – Parameterised quantum subroutine is run within a classical optimization loop – Prepare the quantum state | 𝜓 𝜃 , often called the ansatz – Measure the expectation value 𝜓 𝜃 ℋ 𝜓 𝜃 • By Variational theorem, expectation value ℋ ⟩|𝑎𝑛𝑠𝑎𝑡𝑧 ≥ λ1 (smallest eigenvalue; lowest energy; ground-state) – Find an optimal choice of real-valued parameters 𝜃 such that the expectation value is minimised – Implementation based on Variational Quantum Eigensolver primitive • Challenges – Heuristic - no general recipe of Ansatz definition works universally – Optimiser choice – Initial Parameter selection is arbitrary – Convergence not always guaranteed – High number of Iteration
  • 23. 23QuantumForce.eu Quantum HLL • OpenQL (inspired by OpenCL) • Programs (Kernels (Operations)) – Decompose: Toffoli, Control, Unitary, Rotation, CX-CZ – Optimize: Cancel UU† – Scheduling: ASAP, ALAP, Balanced – Mapping: Surface-17 connectivity routing – QASM: cQASM, eQASM (topology, resource constraints) • Other features: – Conjugate uncompute – Classical logical/comparative operations – Language features • Recursion, loops, functions • Libraries like NumPy, Matplotlib Configure Platform Create Program Create Kernels Populate each Kernel Add Kernels to Program Compile Program
  • 24. 24QuantumForce.eu Testing 1. Shortest superstring (Σ,M) – (4,3) = AAATTTGTTCTTATGGTGCTGATCGTCCTCATAGTACTAAGGGCGGAGCCGCAGACGAACCCACAA – (2,2) = 00110 – (4,2) = AATTGTCTAGGCGACCA 2. Random String – Chargaff's Parity rules • %A = %T • %C = %G – %GC : %AT (40:60) – Other entropic measures 3. Real Data Segment (GenBank, wgsim) – part of HBB (hemoglobin subunit beta) • Chromosome 11 (region p15.4) of Homo sapiens – Sickle cell anemia • ATG-GTG-CAT-CTG-ACT-CCT-GAG • ATG-GTG-CAC-CTG-ACT-CCT-GTG 4. Minimal entropy – tandem repeats
  • 25. 25QuantumForce.eu Platforms • Quantum Infinity – DiCarlo lab (QuTech) – Simulator (QX and QuantumSim) – Superconducting qubits – Cloud access • Quantum Inspire – QuTech (TU Delft + TNO) – ~ 37 qubits simulator on Cartesius supercomputer with SURFsara – Semiconducting qubits • Quantum Learning Machine – AtoS BullSequana • Digital Annealers – Fujitsu
  • 26. 26QuantumForce.eu Related applications DNA Fingerprinting Motif FindingAmino-acid Sequencing Pattern based Trading Object Recognition Speech Recognition 18x18 px 17 qubits ~ 50k gates Exact matching Traffic OptimisationWarehousing
  • 27. 27QuantumForce.eu QuantumForce.eu Experimental Q Chip (Decoherence and Gate errors) Superconducting/Semiconducting qubits Industrial/Societal Q Chip (no decoherence, no gate errors) Classical Q simulators QuTech Perfect Qubits Realistic Qubits QuantumForce Industry/Society App. Physical Platform QCA lab
  • 29. 29QuantumForce.eu QuantumForce.eu • Big Data Analytics • Industry 4.0 • Cyber-Physical Systems • Artificial Intelligence and Machine Learning ..... and other applications Contacts for collaboration: • Koen Bertels, CEO koen@QuantumForce.eu • Zaid Al-Ars, CTO zaid@QuantumForce.eu
  • 30. 30QuantumForce.eu Koen Bertels Carmen G. Almudever Razvan Nane Imran Ashraf Nader Khammassi Hans van Someren Leo DiCarlo Jeroen van Straten LingLing Lao Savvas Varsamopoulos Matthijs Brobbel Aritra Sarkar Abid Moueddene Xiang Fu Leon Riesebos Daniel Moreno Miguel Serrao Amitabh Yadav Alejandro Morais Anneriet Krol Yaoling Yang Mengyu Zhang Bas van Wee Diogo Valada
  • 31. Search and optimisation algorithms for genomics on quantum accelerators Aritra Sarkar Quantum Computer Architecture Lab QuTech and Department of Quantum & Computer Engineering Delft University of Technology

Editor's Notes

  1. Near-term? Accelerators? Genomics? https://t2m.io/1anKYnJu images wider abb. wider
  2. add stack
  3. What’s the big challenge? We all know about the Moore’s Law of transistor scaling – more and more transistors are getting integrated into chips, but we are no longer making better processors – power, memory and frequency walls One the other hand – the amount of data generated from genomics are also increasing exponentially – enabled by the lower cost of sequencing Within the next decade, the amount of data generated per year would be 2-40 exabytes – which is huge! Our computing clusters are not equipped to handle this kind of data volume – a driver for us to turn to the exascale computing promise of quantum systems
  4. sorry if this is not the latest version of the ever-evolving stack diagram
  5. Putting this project in a bigger perspective – let’s have a look at the Whole Genome Sequencing pipeline Extract DNA from human, crops or microorganisms Sequence the DNA in a wet lab – i.e. read the base pairs of the DNA – problem: it is too long (3 billion bp) and tangled to be read in one go Oversample the DNA (make many copies), cut it off in smaller pieces (shotgun sequencing) and the sequencing machine gives back a bucket of short reads Stitch them back together (like solving a jigsaw puzzle) Use the reconstructed DNA for further analysis like disease diagnosis (personalized medicine) or GM crops We focus on the Data analysis part of this pipeline It too consists of a bunch of algorithmic steps as shown in the GATK best practices from Broad Institute – these are for example, reference mapping, indel alignment, variant calling We are going to focus only on the reference mapping (solving the jigsaw puzzle) as it is one of the most computation intensive part
  6. Application-Platform-Tools Why Genomics? Why Digital? Why NISQ? Why Accelerators?
  7. There are 3 basic ways of playing with functions The first is simple function evaluation. We feed in the input, we get back the output. Another way is when we have a set of inputs and their corresponding outputs; we want to infer what is the transformation function. This is called inductive logic and is the basis of machine learning where we train a predictor to approximate the function. The third way of course is when we have the output and the function, and we want to know what input resulted in the particular output. This is function inversion. And we know, if we have an inverse function, we can evaluate it to get back x. But what if such a function does not exist? Like you cannot ask your sibling the pattern lock of their mobile. Of course there is this other possibility where you try out every combinations possible, till you find the solution. Start out with the ninja star if your sibling is in QuTech! Which method should we choose for our problem? For that we need to have a quick look at computation complexity classes. For our interest, there are there 2 important classes, Polynomial complexity (P) or the easy problems, and Non-deterministic Polynomial time (NP), or simply the intractable ones. Where does quantum computers lie in this venn diagram? Well, no one knows for sure, but the quantum equivalent for feasible class, called BQP looks somewhat like this. For example, Shor’s famous algorithm lie in the middle star region. Whereas, we arer trying to solve a problem in the upper star – where even efficient quantum algorithms don’t exist. So we cannot invert. But the good news is that, unlike the stupid idea of trying out every possible pattern lock in a classical system, in quantum we can use the parallelism of superposition states. Which in a way can be reasoned as, trying out all possible paths at the same time.
  8. So let’s define our problem: sub-sequence index search We have a reference genome, which is 3 billion bp for humans, but for this example, I consider a much shorter one of 32 bp. I have used 4 colours for the 4 DNA alphabet Now take one of the read from the bucket. Again the typical size is much larger, but here, it is a modest 5 character string. Proceeding in a naïve linear search style – we start by matching the read at position 0 and give it a score, then at position 1, and so on What we want from our algorithm is the position where it matches best. Note, it doesn’t completely match – approximate yet optimal Additionally some algorithms also give us back the corrected version of our noisy query
  9. There are many ways of doing quantum pattern matching
  10. Application-Platform-Tools Why Genomics? Why Digital? Why NISQ? Why Accelerators?
  11. Application-Platform-Tools Why Genomics? Why Digital? Why NISQ? Why Accelerators?
  12. The general exploration on pattern matching explored in this thesis can be extended to other domains. Of course we can do amino-acid sequencing by just extending the alphabet size DNA fingerprinting is comparing two large sequences and we can use the concept of Hamming distance and amplify the mismatches. Motif finding might also be trivial as it involves finding the consensus in a stored memory. There are other related domains where similar pattern matching applications are useful. For example in speech signals, or stock market patterns. And definitely on images. For example, this is an output from the same algorithm that was used on DNA tweeked for 2D black and white images. Finding the template took 17 qubits and some 50k gates. MEST: not far before Q App Dev becomes a buzzword and these developers start tinkering with Q Algos to find a killer app for QC
  13. Image copyright: Novikov Aleksey
  14. these are the ones people already tried using
  15. How much wood could a woodchuck chuck  If a woodchuck could chuck wood?  As much wood as a woodchuck could chuck,  If a woodchuck could chuck wood. “Run a shitty circuit shitload times such that there are on average less shit” - Malay