The document introduces algorithms and complexity results for problems on strings that are compressed using Straight Line Programs (SLPs). SLPs provide a mathematical model for compressed string representations that can capture many real-world compression schemes. The document discusses the smallest grammar problem, algorithms for problems like compressed pattern matching and equality checking, and complexity results. It shows that computing the Hamming distance between SLP-compressed strings is #P-complete, and the subsequence problem is PSPACE-complete and PP-hard. The algorithms exploit properties of SLPs like arithmetic progressions to achieve subquadratic time bounds.
Introduction to complexity theory that solves your assignment problem it contains about complexity class,deterministic class,big- O notation ,proof by mathematical induction, L-Space ,N-Space and characteristics functions of set and so on
NP completeness. Classes P and NP are two frequently studied classes of problems in computer science. Class P is the set of all problems that can be solved by a deterministic Turing machine in polynomial time.
Introduction to complexity theory that solves your assignment problem it contains about complexity class,deterministic class,big- O notation ,proof by mathematical induction, L-Space ,N-Space and characteristics functions of set and so on
NP completeness. Classes P and NP are two frequently studied classes of problems in computer science. Class P is the set of all problems that can be solved by a deterministic Turing machine in polynomial time.
International Conference on Monte Carlo techniques
Closing conference of thematic cycle
Paris July 5-8th 2016
Campus les Cordeliers
Slides of Richard Everitt's presentation
International Conference on Monte Carlo techniques
Closing conference of thematic cycle
Paris July 5-8th 2016
Campus les cordeliers
Jere Koskela's slides
I am George P. I am a Chemistry Assignment Expert at eduassignmenthelp.com. I hold a Ph.D. in Chemistry, from Perth, Australia. I have been helping students with their homework for the past 6 years. I solve assignments related to Chemistry.
Visit eduassignmenthelp.com or email info@eduassignmenthelp.com.
You can also call on +1 678 648 4277 for any assistance with Chemistry Assignments.
A gentle introduction to 2 classification techniques, as presented by Kriti Puniyani to the NYC Predictive Analytics group (April 14, 2011). To download the file please go here: http://www.meetup.com/NYC-Predictive-Analytics/files/
Quantum Algorithms and Lower Bounds in Continuous TimeDavid Yonge-Mallo
A poster presented at the Quantum Computing & Quantum Algorithms Program Review, in Buckhead, Atlanta, Georgia, 2008.
Abstract: "Many models of quantum computation, such as the Turing machine model or the circuit model, treat time as a discrete quantity and describe algorithms as discrete sequences of steps. However, this is not the only way to view quantum computational processes, as algorithms based on such ideas as continuous-time quantum walks show. By studying the properties of quantum computation in a continuous-time framework, we hope to discover new algorithms, develop better intuitions into existing algorithms, and gain further insights into the power and limitations of quantum computation."
This talk is going to be centered on two papers that are going to appear in the following months:
Neerja Mhaskar and Michael Soltys, Non-repetitive strings over alphabet lists
to appear in WALCOM, February 2015.
Neerja Mhaskar and Michael Soltys, String Shuffle: Circuits and Graphs
to appear in the Journal of Discrete Algorithms, January 2015.
Visit http://soltys.cs.csuci.edu for more details (these two papers are number 3 and 19 on the page), as well as Python programs that can be used to illustrate the ideas in the papers. We are going to introduce some basic concepts related to computations on string, present some recent results, and propose two open problems.
International Conference on Monte Carlo techniques
Closing conference of thematic cycle
Paris July 5-8th 2016
Campus les Cordeliers
Slides of Richard Everitt's presentation
International Conference on Monte Carlo techniques
Closing conference of thematic cycle
Paris July 5-8th 2016
Campus les cordeliers
Jere Koskela's slides
I am George P. I am a Chemistry Assignment Expert at eduassignmenthelp.com. I hold a Ph.D. in Chemistry, from Perth, Australia. I have been helping students with their homework for the past 6 years. I solve assignments related to Chemistry.
Visit eduassignmenthelp.com or email info@eduassignmenthelp.com.
You can also call on +1 678 648 4277 for any assistance with Chemistry Assignments.
A gentle introduction to 2 classification techniques, as presented by Kriti Puniyani to the NYC Predictive Analytics group (April 14, 2011). To download the file please go here: http://www.meetup.com/NYC-Predictive-Analytics/files/
Quantum Algorithms and Lower Bounds in Continuous TimeDavid Yonge-Mallo
A poster presented at the Quantum Computing & Quantum Algorithms Program Review, in Buckhead, Atlanta, Georgia, 2008.
Abstract: "Many models of quantum computation, such as the Turing machine model or the circuit model, treat time as a discrete quantity and describe algorithms as discrete sequences of steps. However, this is not the only way to view quantum computational processes, as algorithms based on such ideas as continuous-time quantum walks show. By studying the properties of quantum computation in a continuous-time framework, we hope to discover new algorithms, develop better intuitions into existing algorithms, and gain further insights into the power and limitations of quantum computation."
This talk is going to be centered on two papers that are going to appear in the following months:
Neerja Mhaskar and Michael Soltys, Non-repetitive strings over alphabet lists
to appear in WALCOM, February 2015.
Neerja Mhaskar and Michael Soltys, String Shuffle: Circuits and Graphs
to appear in the Journal of Discrete Algorithms, January 2015.
Visit http://soltys.cs.csuci.edu for more details (these two papers are number 3 and 19 on the page), as well as Python programs that can be used to illustrate the ideas in the papers. We are going to introduce some basic concepts related to computations on string, present some recent results, and propose two open problems.
I am Britney P. I love exploring new topics. Academic writing seemed an interesting option for me. After working for many years with progamminghomeworkhelp.com, I have assisted many students with their Design and Analysis of Algorithms Assignments. I can proudly say, each student I have served is happy with the quality of the solution that I have provided. I have acquired my bachelor's from Sunway University, Malaysia.
I am Gabriel C. I love exploring new topics. Academic writing seemed an interesting option for me. After working for many years with programmingexamhelp.com, I have assisted many students with their exams. I can proudly say, each student I have served is happy with the quality of the solution that I have provided. I have acquired Business analyst of Information Technology, Montreal College of Information Technology, Canada.
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Ana Luísa Pinho
Functional Magnetic Resonance Imaging (fMRI) provides means to characterize brain activations in response to behavior. However, cognitive neuroscience has been limited to group-level effects referring to the performance of specific tasks. To obtain the functional profile of elementary cognitive mechanisms, the combination of brain responses to many tasks is required. Yet, to date, both structural atlases and parcellation-based activations do not fully account for cognitive function and still present several limitations. Further, they do not adapt overall to individual characteristics. In this talk, I will give an account of deep-behavioral phenotyping strategies, namely data-driven methods in large task-fMRI datasets, to optimize functional brain-data collection and improve inference of effects-of-interest related to mental processes. Key to this approach is the employment of fast multi-functional paradigms rich on features that can be well parametrized and, consequently, facilitate the creation of psycho-physiological constructs to be modelled with imaging data. Particular emphasis will be given to music stimuli when studying high-order cognitive mechanisms, due to their ecological nature and quality to enable complex behavior compounded by discrete entities. I will also discuss how deep-behavioral phenotyping and individualized models applied to neuroimaging data can better account for the subject-specific organization of domain-general cognitive systems in the human brain. Finally, the accumulation of functional brain signatures brings the possibility to clarify relationships among tasks and create a univocal link between brain systems and mental functions through: (1) the development of ontologies proposing an organization of cognitive processes; and (2) brain-network taxonomies describing functional specialization. To this end, tools to improve commensurability in cognitive science are necessary, such as public repositories, ontology-based platforms and automated meta-analysis tools. I will thus discuss some brain-atlasing resources currently under development, and their applicability in cognitive as well as clinical neuroscience.
A brief information about the SCOP protein database used in bioinformatics.
The Structural Classification of Proteins (SCOP) database is a comprehensive and authoritative resource for the structural and evolutionary relationships of proteins. It provides a detailed and curated classification of protein structures, grouping them into families, superfamilies, and folds based on their structural and sequence similarities.
Professional air quality monitoring systems provide immediate, on-site data for analysis, compliance, and decision-making.
Monitor common gases, weather parameters, particulates.
Introduction:
RNA interference (RNAi) or Post-Transcriptional Gene Silencing (PTGS) is an important biological process for modulating eukaryotic gene expression.
It is highly conserved process of posttranscriptional gene silencing by which double stranded RNA (dsRNA) causes sequence-specific degradation of mRNA sequences.
dsRNA-induced gene silencing (RNAi) is reported in a wide range of eukaryotes ranging from worms, insects, mammals and plants.
This process mediates resistance to both endogenous parasitic and exogenous pathogenic nucleic acids, and regulates the expression of protein-coding genes.
What are small ncRNAs?
micro RNA (miRNA)
short interfering RNA (siRNA)
Properties of small non-coding RNA:
Involved in silencing mRNA transcripts.
Called “small” because they are usually only about 21-24 nucleotides long.
Synthesized by first cutting up longer precursor sequences (like the 61nt one that Lee discovered).
Silence an mRNA by base pairing with some sequence on the mRNA.
Discovery of siRNA?
The first small RNA:
In 1993 Rosalind Lee (Victor Ambros lab) was studying a non- coding gene in C. elegans, lin-4, that was involved in silencing of another gene, lin-14, at the appropriate time in the
development of the worm C. elegans.
Two small transcripts of lin-4 (22nt and 61nt) were found to be complementary to a sequence in the 3' UTR of lin-14.
Because lin-4 encoded no protein, she deduced that it must be these transcripts that are causing the silencing by RNA-RNA interactions.
Types of RNAi ( non coding RNA)
MiRNA
Length (23-25 nt)
Trans acting
Binds with target MRNA in mismatch
Translation inhibition
Si RNA
Length 21 nt.
Cis acting
Bind with target Mrna in perfect complementary sequence
Piwi-RNA
Length ; 25 to 36 nt.
Expressed in Germ Cells
Regulates trnasposomes activity
MECHANISM OF RNAI:
First the double-stranded RNA teams up with a protein complex named Dicer, which cuts the long RNA into short pieces.
Then another protein complex called RISC (RNA-induced silencing complex) discards one of the two RNA strands.
The RISC-docked, single-stranded RNA then pairs with the homologous mRNA and destroys it.
THE RISC COMPLEX:
RISC is large(>500kD) RNA multi- protein Binding complex which triggers MRNA degradation in response to MRNA
Unwinding of double stranded Si RNA by ATP independent Helicase
Active component of RISC is Ago proteins( ENDONUCLEASE) which cleave target MRNA.
DICER: endonuclease (RNase Family III)
Argonaute: Central Component of the RNA-Induced Silencing Complex (RISC)
One strand of the dsRNA produced by Dicer is retained in the RISC complex in association with Argonaute
ARGONAUTE PROTEIN :
1.PAZ(PIWI/Argonaute/ Zwille)- Recognition of target MRNA
2.PIWI (p-element induced wimpy Testis)- breaks Phosphodiester bond of mRNA.)RNAse H activity.
MiRNA:
The Double-stranded RNAs are naturally produced in eukaryotic cells during development, and they have a key role in regulating gene expression .
Nutraceutical market, scope and growth: Herbal drug technologyLokesh Patil
As consumer awareness of health and wellness rises, the nutraceutical market—which includes goods like functional meals, drinks, and dietary supplements that provide health advantages beyond basic nutrition—is growing significantly. As healthcare expenses rise, the population ages, and people want natural and preventative health solutions more and more, this industry is increasing quickly. Further driving market expansion are product formulation innovations and the use of cutting-edge technology for customized nutrition. With its worldwide reach, the nutraceutical industry is expected to keep growing and provide significant chances for research and investment in a number of categories, including vitamins, minerals, probiotics, and herbal supplements.
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Sérgio Sacani
We characterize the earliest galaxy population in the JADES Origins Field (JOF), the deepest
imaging field observed with JWST. We make use of the ancillary Hubble optical images (5 filters
spanning 0.4−0.9µm) and novel JWST images with 14 filters spanning 0.8−5µm, including 7 mediumband filters, and reaching total exposure times of up to 46 hours per filter. We combine all our data
at > 2.3µm to construct an ultradeep image, reaching as deep as ≈ 31.4 AB mag in the stack and
30.3-31.0 AB mag (5σ, r = 0.1” circular aperture) in individual filters. We measure photometric
redshifts and use robust selection criteria to identify a sample of eight galaxy candidates at redshifts
z = 11.5 − 15. These objects show compact half-light radii of R1/2 ∼ 50 − 200pc, stellar masses of
M⋆ ∼ 107−108M⊙, and star-formation rates of SFR ∼ 0.1−1 M⊙ yr−1
. Our search finds no candidates
at 15 < z < 20, placing upper limits at these redshifts. We develop a forward modeling approach to
infer the properties of the evolving luminosity function without binning in redshift or luminosity that
marginalizes over the photometric redshift uncertainty of our candidate galaxies and incorporates the
impact of non-detections. We find a z = 12 luminosity function in good agreement with prior results,
and that the luminosity function normalization and UV luminosity density decline by a factor of ∼ 2.5
from z = 12 to z = 14. We discuss the possible implications of our results in the context of theoretical
models for evolution of the dark matter halo mass function.
This presentation explores a brief idea about the structural and functional attributes of nucleotides, the structure and function of genetic materials along with the impact of UV rays and pH upon them.
Seminar of U.V. Spectroscopy by SAMIR PANDASAMIR PANDA
Spectroscopy is a branch of science dealing the study of interaction of electromagnetic radiation with matter.
Ultraviolet-visible spectroscopy refers to absorption spectroscopy or reflect spectroscopy in the UV-VIS spectral region.
Ultraviolet-visible spectroscopy is an analytical method that can measure the amount of light received by the analyte.
This pdf is about the Schizophrenia.
For more details visit on YouTube; @SELF-EXPLANATORY;
https://www.youtube.com/channel/UCAiarMZDNhe1A3Rnpr_WkzA/videos
Thanks...!
2. .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Introduction The Smallest Grammar Problem Algorithms and Hardness for Compressed Problems
Introduction
Introduction
In many areas, large string data have to be not only stored
in compressed form, but the initial data has to be
processed and analyzed as well.
Design of algorithms that operate directly on the
compressed data.
Decompress-and-solve strategy needs many resources.
3. .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Introduction The Smallest Grammar Problem Algorithms and Hardness for Compressed Problems
Introduction
Introduction
A compressed representation of a string makes
regularities in the string explicit. These regularities may
be exploited in a second phase for speeding up an
algorithm.
So, we need a mathematical model for compressed
representations of strings.
4. .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Introduction The Smallest Grammar Problem Algorithms and Hardness for Compressed Problems
Introduction
Introduction
A compressed representation of a string makes
regularities in the string explicit. These regularities may
be exploited in a second phase for speeding up an
algorithm.
So, we need a mathematical model for compressed
representations of strings.
Such a model should have two properties:
Cover many compression schemes used in practice
Be mathematically easy to handle
5. .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Introduction The Smallest Grammar Problem Algorithms and Hardness for Compressed Problems
Introduction
Straight Line Programs
Definition (Straight Line Programs)
A Straight Line Program (SLP) over the terminal alphabet Σ is
a context-free grammar A = (V, Σ, S, P), P ⊆ V × (V ∪ Σ)∗ such
that:
1 For every A ∈ V there exists exactly one production of the
form (A, α) ∈ P.
2 The relation {(A, B) | (A, α) ∈ P, B ∈ alph(α)} is acyclic.
The size of an SLP is |A| =
∑
(A,α)∈P |α|.
The (singleton) language generated by A is denoted
eval(A).
Let alph(s) be the set of symbols occuring in s.
6. .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Introduction The Smallest Grammar Problem Algorithms and Hardness for Compressed Problems
Introduction
Straight Line Programs
Example (Fibonacci Words)
Let SLP A over the terminal alphabet {a, b} with the following
productions:
A1 → b
A2 → a
Ai → Ai−1Ai−2, for 3 ≤ i ≤ 7
The starting symbol is A7.
Then eval(A) = abaababaabaab, the 7th Fibonacci Word.
7. .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Introduction The Smallest Grammar Problem Algorithms and Hardness for Compressed Problems
Introduction
Straight Line Programs
SLPs can capture all the usual compression methods. For
example:
Theorem
From the LZ77-factorization of a given string w ∈ Σ∗, we can
compute an SLP of size O
(
log |w|
m · m
)
for w in time
O
(
log |w|
m · m
)
, where m is the number of factors in the
LZ77-factorization of w.
Also, we can easily design polynomial-time algorithms to
compute |eval(A)| and eval(A)[i], given an SLP A.
8. .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Introduction The Smallest Grammar Problem Algorithms and Hardness for Compressed Problems
Introduction
The Smallest Grammar Problem
Given a string, what is the smallest SLP for it?
This is a Kolmogorov Complexity (decidable) variant.
Let opt(w) the size of a minimal SLP for w, that is, an SLP
A with eval(A) = w, |A| = opt(w) and for every SLP B with
eval(B) = w, |B| ≥ |A|.
Definition
Given a string w, compute a minimal SLP for w.
Theorem (Approximation of an SLP)
There is a O (log |Σ| · n)-time algorithm that computes for a
given word w ∈ Σ∗ of length n an SLP A such that eval(A) = w
and |A| ≤ O
(
log n
opt(w)
)
· opt(w).
9. .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Introduction The Smallest Grammar Problem Algorithms and Hardness for Compressed Problems
Introduction
The Smallest Grammar Problem
Theorem
Unless P = NP there is no polynomial time algorithm with the
following properties:
The input consists of a string w over some alphabet Σ.
The output is an SLP A such that eval(A) = w and
|A| ≤ 8569
8568 · opt(w).
The proof uses a reduction from the vertex cover problem
for graphs with max degree 3, which is hard to
approximate below a ration of 145
144 , unless P = NP.
10. .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Introduction The Smallest Grammar Problem Algorithms and Hardness for Compressed Problems
Algorithms and Hardness for Compressed Problems
Compressed Equality Checking
Definition (Compressed Equality Checking)
Given two SLPs A and B, is eval(A) = eval(B)?
The algorithms for equality checking use combinatorial
properties of strings, such as the periodicity lemma.
Some results on sequential and parallel models:
Theorem
Compressed Equality Checking can be solved in
O
(
(|A| + |B|)2
)
.
Theorem
Compressed Equality Checking belongs to coRNC2
.
11. .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Introduction The Smallest Grammar Problem Algorithms and Hardness for Compressed Problems
Algorithms and Hardness for Compressed Problems
Compressed Hamming Distance
Let dH(a, b) denote the Hamming Distance of a, b ∈ Σ∗.
(the numbers of symbols that a and b differ).
Theorem
The function dH(eval(A), eval(B)), where A, B are SLPs, is
#P-complete.
Proof:
Let:
G(S, T, y) =
{
“yes” , if Ty ̸= Sy
“no” , otherwise
G ∈ FP.
We will reduce the #P-complete problem #SUBSET SUM
to dH using an 1-Turing Reduction.
12. .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Introduction The Smallest Grammar Problem Algorithms and Hardness for Compressed Problems
Algorithms and Hardness for Compressed Problems
Compressed Hamming Distance
Proof:
#SUBSET SUM asks, given W = {w1, . . . , wn}, t in binary,
the number of W′s subsets with elements summing up to
t, that is, the number of x ∈ {0, 1}n for which
x · (w1, . . . , wn) = x · w = t.
Let s =
∑n
i=1 wi.
Consider the texts:
T = (0t
10s−t
)2n
S = ⃝x∈{0,1}n (0x·w
10s−x·w
)
Notice that dH(T, S) is exactly two times the number of
W’s subsets that are not equal to t.
We can easily construct an SLP B such that eval(B) = T.
13. .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Introduction The Smallest Grammar Problem Algorithms and Hardness for Compressed Problems
Algorithms and Hardness for Compressed Problems
Compressed Hamming Distance
Consider the following SLP A with the following rules:
A1 → 10s+w1
1
Ak+1 → Ak0s−sk+wk+1 Ak (1 ≤ k ≤ n − 1)
Using induction, we can prove that
eval(A) = ⃝x∈{0,1}n (0x·w10s−x·w) = S
The size of A is polynomial in the length of the binary
encoding of w.
Thus we can compute the answer to #SUBSET SUM as
2n − 1
2 dH(T, S).
□
14. .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Introduction The Smallest Grammar Problem Algorithms and Hardness for Compressed Problems
Algorithms and Hardness for Compressed Problems
Fully Compressed Pattern Matching
In its most general form, the Pattern Matching Problem
asks for given strings T and P, if P is a factor of T.
Many linear-time algorithms for the uncompressed case
(e.g. Knuth-Morris-Pratt).
Definition (Fully Compressed Pattern Matching)
Given two SLPs P and T, is eval(P) a factor of eval(T)?
An important observation that implies most algorithms is
that if a pattern p is a factor of eval(T), then there exists a
production X → YZ in eval(T) such that p has an
“occurence” in evalT(X) = evalT(Y)evalT(Z) that touches
the cut between evalT(Y) and evalT(Z).
15. .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Introduction The Smallest Grammar Problem Algorithms and Hardness for Compressed Problems
Algorithms and Hardness for Compressed Problems
Fully Compressed Pattern Matching
It is a consequence of the Periodicity Lemmma (Fine,
Wilf, 1965) that the set of all starting positions of the
occurences of p in evalT(X) that touch the cut of X forms
an arithmetic progression.
Lifshit’s algorithm, for example, computes for every
nonterminal A of the pattern SLP P and each nonterminal
X of the text SLP T the arithmetic progression
corresponding to the occurences of evalP(A) in evalT(X)
that touch the cut of X.
16. .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Introduction The Smallest Grammar Problem Algorithms and Hardness for Compressed Problems
Algorithms and Hardness for Compressed Problems
Fully Compressed Pattern Matching
These arithmetic progression can be computed
bottom-up, resulting in overall time bound O
(
|P|2|T|
)
.
Jez’s algorithm beats that time bound with
O ((|T| + |P|) log(|eval(P)|) log(|P| + |T|)) = O
(
n2 log n
)
For uncompressed patterns, we can achieve a bound of
O (|p| · ||T||).
17. .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Introduction The Smallest Grammar Problem Algorithms and Hardness for Compressed Problems
Algorithms and Hardness for Compressed Problems
Subsequence Problems
In many applications, especially in Computational
Biology, approximate occurences of a pattern are more
relavant than exact matches.
Subsequence Problems consist very useful similarity
measures between sequences.
Definition (Fully Compressed Subsequence Problem)
Given two SLPs P and T, is eval(P) a subsequence of eval(T)?
Theorem
The Fully Compressed Subsequence Problem is in PSPACE and
it is PP-hard.
18. .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Introduction The Smallest Grammar Problem Algorithms and Hardness for Compressed Problems
Algorithms and Hardness for Compressed Problems
Querying Compressed Texts
Definition (Compressed Querying)
Given an SLP A, a binary-coded number i, 1 ≤ i ≤ |eval(A)|
and a symbol a ∈ Σ, does eval(A)[i] = a hold?
Theorem
Compressed Querying is P-complete.
The proof of the aforementioned result is a logspace
reduction from the P-complete problem “super
increasing subset sum”.
19. .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Introduction The Smallest Grammar Problem Algorithms and Hardness for Compressed Problems
Algorithms and Hardness for Compressed Problems
References I
Moses Charikar, Eric Lehman, Ding Liu, Rina Panigrahy,
Manoj Prabhakaran, Amit Sahai, and Abhi Shelat.
The smallest grammar problem.
IEEE Trans. Information Theory, 51(7):2554–2576, 2005.
Dan Gusfield.
Algorithms on Strings, Trees, and Sequences: Computer
Science and Computational Biology.
Cambridge University Press, New York, NY, USA, 1997.
Yury Lifshits.
Processing compressed texts: A tractability border.
In Combinatorial Pattern Matching, 18th Annual
Symposium, CPM 2007, London, Canada, July 9-11, 2007,
Proceedings, pages 228–240, 2007.
20. .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Introduction The Smallest Grammar Problem Algorithms and Hardness for Compressed Problems
Algorithms and Hardness for Compressed Problems
References II
Markus Lohrey.
Word problems and membership problems on
compressed words.
SIAM J. Comput., 35(5):1210–1240, 2006.
Markus Lohrey.
Algorithmics on SLP-compressed strings: A survey.
Groups Complexity Cryptology, 4(2):241–299, 2012.
Markus Lohrey.
Equality testing of compressed strings.
In Combinatorics on Words - 10th International
Conference, WORDS 2015, Kiel, Germany, September
14-17, 2015, Proceedings, pages 14–26, 2015.