Nature Structural & Molecular Biology | Volume 30 | August 2023 | 1172–1182 1172
Article https://doi.org/10.1038/s41594-023-01042-3
Mechanistic and evolutionary insights into a type V-M CRISPR–Cas effector enzyme
Satoshi N. Omura1, Ryoya Nakagawa1, Christian Südfeld2, Ricardo Villegas Warren2, Wen Y. Wu2, Hisato Hirano1, Charlie Laffeber3, Tsukasa Kusakizako1, Yoshiaki Kise1,4, Joyce H. G. Lebbink3,5, Yuzuru Itoh1, John van der Oost2 & Osamu Nureki1
RNA-guided type V CRISPR–Cas12 effectors provide adaptive immunity against mobile genetic elements (MGEs) in bacteria and archaea. Among diverse Cas12 enzymes, the recently identified Cas12m2 (CRISPR–Cas type V-M) is highly compact and has a unique RuvC active site. Although the non-canonical RuvC triad does not permit dsDNA cleavage, Cas12m2 still protects against invading MGEs through transcriptional silencing by strong DNA binding. However, the molecular mechanism of RNA-guided genome inactivation by Cas12m2 remains unknown. Here we report cryo-electron microscopy structures of two states of Cas12m2–CRISPR RNA (crRNA)–target DNA ternary complexes and the Cas12m2–crRNA binary complex, revealing structural dynamics during crRNA–target DNA heteroduplex formation. The structures indicate that the non-target DNA strand is tightly bound to a unique arginine-rich cluster in the recognition (REC) domains and the non-canonical active site in the RuvC domain, ensuring strong DNA-binding affinity of Cas12m2. Furthermore, a structural comparison of Cas12m2 with TnpB, a putative ancestor of Cas12 enzymes, suggests that the interaction of the characteristic coiled-coil REC2 insertion with the protospacer-adjacent motif-distal region of the heteroduplex is crucial for Cas12m2 to engage in adaptive immunity. Collectively, our findings improve mechanistic understanding of diverse type V CRISPR–Cas effectors and provide insights into the evolution of TnpB to Cas12 enzymes.
CRISPR–Cas systems provide adaptive immunity against mobile genetic elements (MGEs) in bacteria and archaea and are divided into two classes (classes 1 and 2) and six types (types I–VI)1,2. The class 2 systems include types II, V and VI, in which Cas9, Cas12 and Cas13, respectively, function as effector enzymes responsible for interference against MGEs. Recent studies have identified a dozen functionally divergent type V Cas12 effector proteins3–11. Cas12 enzymes associate with a CRISPR RNA (crRNA) or dual RNA guides (crRNA and trans-activating crRNA) and cleave double-stranded DNA (dsDNA) targets flanked by a protospacer-adjacent motif (PAM), using a single RuvC nuclease domain. Aside from the RuvC domain, Cas12 effector proteins share little sequence similarity, reflecting their diverse biochemical features. Type V Cas12 effectors reportedly evolved from the IS200/IS605 (IS stands for bacterial insertion sequences) superfamily transposon-associated TnpB protein2,12,13. Although the sequences of relatively large Cas12 proteins, such as
Received: 7 February 2023
Accepted: 22 June 2023
Published online: 17 July 2023
1 Department of Biological Sciences, Graduate School of Science, the University of Tokyo, Tokyo, Japan. 2 Laboratory of Microbiology, Wageningen University and Research, Wageningen, the Netherlands. 3 Department of Molecular Genetics, Oncode Institute, Erasmus MC Cancer Institute, Erasmus University Medical Center, Rotterdam, the Netherlands. 4 Curreio, the University of Tokyo, Tokyo, Japan. 5 Department of Radiotherapy, Erasmus University Medical Center, Rotterdam, the Netherlands. e-mail: john.vanderoost@wur.nl; nureki@bs.s.u-tokyo.ac.jp
Cas12a, have weak similarities to the TnpB sequence, five recently identified small Cas12 variants (V-U1 to V-U5) show greater sequence similarities to TnpB and are thought to represent the early stage of evolution from TnpB to the larger Cas12 enzymes2. Among the type V-U families, the U2–U4 clusters are classified into the Cas12f subtype, in which the effector proteins function as a dimer, while the
Fig. 1 | Cryo-EM structure of the Cas12m2–crRNA–target DNA ternary complex. a, Domain structure of Cas12m2. b,c, Cryo-EM maps (b) and structural models (c) of the Cas12m2–crRNA–target DNA ternary complex. Zinc and magnesium ions in the TNB and RuvC domains are shown as gray spheres. d, Schematic of the crRNA and target DNA. The disordered regions are enclosed by dashed boxes. e, Structure of the crRNA-and-target DNA complex.
Table 1 | Data collection, processing, model refinement and validation
Data collection and processing
Sample Cas12m2–crRNA–DNA (full R-loop) Cas12m2–crRNA–DNA (intermediate) Cas12m2–crRNA
EMDB ID EMD-34803 EMD-34804 EMD-34824
PDB ID 8HHL 8HHM 8HIO
Microscope Titan Krios G3i
Detector Gatan K3 camera
Magnification 105,000
Voltage (kV) 300
Data-collection software EPU
Image-processing package cryoSPARC
Electron exposure (e–/Å2) 55 51
Defocus range (μm) −0.8 to −1.6
Pixel size (Å) 0.83
Symmetry imposed C1
Number of movies 3,570 8,004
Initial particle images (no.) 5,067,096 6,034,518
Final particle images (no.) 182,116 180,921 115,501
Map resolution (Å) 2.9 3.1 3.7
FSC threshold 0.143
Map sharpening B factor (Å2) −70 −80 −170
3DFSC analysis
Global resolution (Å) 2.94 3.14 3.91
Sphericity 0.900 0.722 0.809
Model building and refinement
Model composition
Protein atoms 4,630 4,036 5,660
Nucleic acid atoms 2,337 1,497 1,019
Metal ions 4 2 2
Model refinement
Model-Map CC (CCmask/CCbox/CCpeaks/CCvolume) 0.83/0.77/0.71/0.83 0.83/0.81/0.72/0.83 0.75/0.74/0.60/0.74
Resolution (Å) by model-to-map FSC, threshold 0.50 (masked/unmasked) 3.01/3.10 3.28/3.36 4.15/4.25
Average B factor (Å2) (protein/nucleotide/metal ion) 90.73/107.33/96.40 107.59/129.37/158.47 155.66/115.67/156.69
r.m.s. deviations
Bond lengths (Å) 0.004 0.004 0.005
Bond angles (°) 0.485 0.510 0.622
Validation
MolProbity score 1.64 1.89 1.68
CaBLAM outliers (%) 0.85 1.39 0.86
Clashscore 6.49 6.39 10.33
Rotamer outliers (%) 2.29 3.10 0.0
Cβ outliers (%) 0 0 0
EMRinger score 3.37 2.37 0.93
Ramachandran plot
Favored (%) 98.64 97.06 97.17
Allowed (%) 1.36 2.94 2.83
Outliers (%) 0.00 0.00 0.00
EMDB, Electron Microscopy Data Bank; FSC, Fourier shell correlation; PDB, Protein Data Bank; CaBLAM, C-Alpha Based Low-resolution Annotation Method; EMRinger score, an extension of the
X-ray crystallography validation tool Ringer in cryo-EM.
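As a quick sanity check on the validation block of Table 1, the Ramachandran percentages for each model should add up to 100. The sketch below transcribes the three rows and tests that; the column labels and the helper name `sums_to_100` are illustrative assumptions, and the column order is assumed to follow the Sample row (full R-loop, intermediate, binary complex).

```python
# Ramachandran statistics transcribed from Table 1, in percent:
# (favored, allowed, outliers). Column order is an assumption based on
# the Sample row of the table.
RAMACHANDRAN = {
    "full R-loop": (98.64, 1.36, 0.00),
    "intermediate": (97.06, 2.94, 0.00),
    "binary": (97.17, 2.83, 0.00),
}


def sums_to_100(values, tol=0.01):
    """Return True if the percentages add up to 100 within a tolerance."""
    return abs(sum(values) - 100.0) <= tol


for name, values in RAMACHANDRAN.items():
    print(name, sums_to_100(values))
```

The tolerance absorbs two-decimal rounding in the published values as well as floating-point error.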
U5 cluster is classified as the Cas12k subtype, in which the effector
proteins are catalytically inactive.
A recent study demonstrated that the type V-M (formerly type V-U1, clade 2) Cas12m2 from Mycolicibacterium mucogenicum (hereafter referred to as Cas12m2 for simplicity) exhibits RNA-guided dsDNA interference activity11. Cas12m2 consists of 596 amino acid residues and is much smaller than other Cas12 enzymes, except for Cas12f7. Cas12m2 associates with a single crRNA to recognize dsDNA targets containing a 5′-TTN-3′ PAM. Interestingly, Cas12m2 lacks target dsDNA-cleavage activity, probably due to the non-canonical His–Asp–Asp (HDD) catalytic residues in the RuvC domain, rather than the canonical, DNA-cleavable active site Asp–Glu–Asp (DED). Hence, instead of target cleavage, Cas12m2 interferes with invading MGEs through transcriptional silencing by strong DNA binding11. However, because of the lack of structural information, the molecular mechanism of the miniature Cas12m2 and the evolutionary relationship between TnpB and Cas12m2 remain unknown.
Here, we report multiple cryo-electron microscopy (cryo-EM) structural states of Cas12m2–crRNA–target DNA ternary complexes and the Cas12m2–crRNA binary complex at overall resolutions of 2.9–3.7 Å. The structures reveal a unique target dsDNA-binding mechanism of Cas12m2, clearly explaining why Cas12m2 is capable of tightly binding target DNA but unable to cleave it. Furthermore, a structural comparison of Cas12m2 with TnpB illuminates their great structural similarities and diversities, providing insights into the evolutionary path from TnpB to Cas12 enzymes.
Results
Cryo-EM structure of the Cas12m2–crRNA–target DNA complex
To understand the molecular mechanism of Cas12m2, we analyzed its structure in complex with a 56-nucleotide crRNA (including a 20-nucleotide guide) and its 36-base pair (bp) dsDNA target with a 5′-TTG-3′ PAM by cryo-EM. Using three-dimensional (3D) classification and refinement, we discovered two conformational populations of the Cas12m2 ternary complex at overall resolutions of 2.9 Å (state I) and 3.1 Å (state II) (Fig. 1a–c, Table 1, Extended Data Fig. 1 and Supplementary Table 1). In both states, Cas12m2 binds the crRNA and its dsDNA target at a molar ratio of 1:1:1. Although the full-length crRNA–target DNA heteroduplex is visible in state I, a shorter heteroduplex (approximately 12 bp) is observed in state II. For simplicity, we describe the structural features of state I unless otherwise specified.
The present structures revealed that Cas12m2 adopts a bilobed architecture composed of REC and nuclease (NUC) lobes (Fig. 1a–c and Extended Data Fig. 2a). The REC lobe consists of wedge (WED), REC1 and REC2 domains. The WED domain adopts an oligonucleotide–oligosaccharide-binding fold similar to those of other Cas12 enzymes14–19 (Extended Data Fig. 2b). The REC1 domain consists of three α-helices, and the REC2 domain has a characteristic coiled-coil structure, forming a distinctive architecture resembling a ‘sickle’ (Extended Data Fig. 2b). The NUC lobe contains RuvC and target nucleic acid-binding (TNB) domains. The RuvC domain has an RNase H fold, and the His317, Asp485 and Asp579 residues are located at positions similar to those of the well-conserved catalytic Asp, Glu and Asp residues in most other Cas12 enzymes14–19 (Extended Data Fig. 2c). The TNB domain is inserted into the RuvC domain and contains an HCCC-type zinc finger, in which a zinc ion is coordinated by His549, Cys552, Cys569 and Cys572 (Extended Data Fig. 2d). The guide RNA–target DNA heteroduplex is accommodated within the positively charged central channel formed by the REC and NUC lobes (Fig. 1c and Extended Data Fig. 2a). The displaced non-target DNA strand (NTS) is visible in our density map and is bound to unique pockets located in the REC and RuvC domains (discussed below).
crRNA architecture and recognition
The crRNA consists of the 20-nucleotide guide segment (G1 to C20) and the 36-nucleotide repeat-derived scaffold (G(−36) to C(−1)) (Fig. 1d). Nucleotides G1 to U17 in the crRNA and dA1 to dC17 in the target DNA strand (TS) form a 17-bp guide RNA–target DNA heteroduplex (Fig. 1d,e and Extended Data Fig. 2e). Unexpectedly, nucleotides dG(−3) to dC(−1) in the TS do not form base pairs with G18 to C20 in the crRNA and instead rehybridize with dG18* to dC20* in the NTS, suggesting that the 17-bp guide RNA–target DNA heteroduplex represents the optimal length for Cas12m2-mediated DNA recognition (to differentiate between NTS and TS in this study, an asterisk is used to denote the NTS; Fig. 1d,e and Extended Data Fig. 2e).
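The numbering scheme described above (scaffold positions −36 to −1, guide positions 1 to 20, with only positions 1 to 17 paired to the target strand in the full R-loop state) can be made concrete with a small sketch. The function names `crrna_labels` and `split_guide` are hypothetical helpers introduced only for illustration; the lengths are taken from the text.

```python
def crrna_labels(scaffold_len=36, guide_len=20):
    """Position labels for the crRNA: scaffold -36..-1, then guide 1..20.

    There is no position 0 in this numbering scheme.
    """
    scaffold = list(range(-scaffold_len, 0))
    guide = list(range(1, guide_len + 1))
    return scaffold + guide


def split_guide(guide_len=20, heteroduplex_len=17):
    """Split guide positions into those paired with the target strand
    and those left unpaired in the full R-loop state."""
    paired = list(range(1, heteroduplex_len + 1))
    unpaired = list(range(heteroduplex_len + 1, guide_len + 1))
    return paired, unpaired


labels = crrna_labels()
paired, unpaired = split_guide()
print(len(labels))   # total crRNA length: 36 + 20
print(unpaired)      # guide positions not paired with the TS
```

With the default lengths this yields a 56-nucleotide crRNA whose guide positions 18 to 20 fall outside the 17-bp heteroduplex, matching the description in the text.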
The crRNA scaffold comprises a pseudoknot (PK) and stem and loop regions. The 5′ end of the crRNA (G(−36) to U(−30)) is disordered (Fig. 1d,e and Extended Data Fig. 2f). Specifically, G(−23) forms a non-canonical base pair with A(−2), and C(−22) to C(−17) form canonical Watson–Crick base pairs with G(−8) to G(−3) to compose the PK structure. The PK coaxially stacks with the stem to form a continuous helix. G(−9) and A(−11) in the loop form hydrogen bonds with the ribose moieties of G(−16) and G(−14), respectively, thereby stabilizing the crRNA scaffold (Extended Data Fig. 3).
The crRNA scaffold is accommodated within the groove formed by the WED and RuvC domains and is mainly recognized by these domains (Fig. 2a,b and Extended Data Fig. 3). Notably, the PK and stem are bound by Arg237 and Arg241 through sugar–phosphate backbone interactions, while the nucleobase of the flipped-out A(−24) is sandwiched by His12 and Arg245 and its ribose moiety hydrogen bonds with Arg245 (Figs. 1d and 2c). The G(−12) in the loop forms a hydrogen bond and stacking interactions with Arg447 and Lys448, thereby stabilizing the crRNA loop region (Fig. 1d and Extended Data Fig. 3). Moreover, the first nucleotide C(−1) and the non-canonical G(−23):A(−2) base pair extensively interact with the WED domain in a base-specific manner. The nucleobase C(−1) forms a base-specific hydrogen bond and stacking interactions with the conserved His269 and Arg270, respectively, while G(−23) stacks with His269, stabilizing the non-canonical G(−23):A(−2) base pair (Fig. 2d). Altogether, the present structure reveals the mechanism of crRNA scaffold recognition by the compact Cas12m2.
Target DNA architecture and recognition
The guide RNA–target DNA heteroduplex is accommodated within the positively charged central channel formed by the REC1, REC2 and RuvC domains and is recognized by the Cas12m2 protein through interactions with its sugar–phosphate backbone (Fig. 2a,b and Extended Data Fig. 3). Met4 in the WED domain stacks with the first G1:dC17 base pair in the heteroduplex, while Arg301 in the WED domain interacts with the
Fig. 2 | crRNA and target DNA recognition. a, Recognition sites of the crRNA scaffold and target DNA. b, Electrostatic surface potential of the Cas12m2–crRNA–target DNA ternary complex. The crRNA–target DNA heteroduplex is accommodated within a positively charged central channel formed by WED, REC1, REC2 and RuvC domains, and the PAM duplex is sandwiched by the WED and REC1 domains. c–g, Recognition of the stem (c), the PK (d), the crRNA–target DNA heteroduplex (e), the rehybridized DNA duplex (f) and the PAM duplex (g). Hydrogen bonds are depicted with green dashed lines. h, DNA binding of Cas12m2 mutants in vitro, measured by SPR spectroscopy, relative to wild type. Values shown are the mean ± s.e.m. of three independent experiments. Statistical analysis was carried out using Tukey’s post hoc test in case of a significant two-way ANOVA result. Significant differences relative to wild-type Cas12m2 are indicated by asterisks. ***P < 0.001. Triple, Cas12m2(R111A/R112A/R126A) triple mutant. i, Unique Arg-rich pocket formed by the REC1 and REC2 domains. Hydrogen bonds are depicted with green dashed lines. j, Close-up view of the non-canonical RuvC active site. The magnesium ion coordinated with the NTS is shown as a gray sphere. The cryo-EM density for the RuvC conserved residues, the magnesium ion and the displaced NTS (dG16* and dT17*) is shown as a blue mesh. k, Schematic of the non-canonical RuvC active site.
backbone phosphate group of dC18, thereby facilitating heteroduplex formation as observed in other Cas12 enzymes20 (Fig. 2e). Leu502 and Pro503 in the RuvC domain form hydrophobic interactions with the dG18* and dC(−1) bases, respectively, stabilizing reformation of the DNA duplex downstream of the heteroduplex (Fig. 2f).
The PAM-containing DNA duplex (referred to as the PAM duplex) is distorted into a narrow minor groove and is sandwiched by the REC1 and WED domains (Fig. 2a). The dT(−3*) and dT(−2*) nucleotides form hydrophobic interactions with Trp152 and Tyr141 and base-specific hydrogen bonds with Gln197 and Asn156, respectively (Fig. 2g). Tyr141 also interacts with the phosphate backbone of dT(−3*) (Fig. 2g). The dA19 and dA20 nucleobases, which base pair with dT(−2*) and dT(−3*), form hydrogen bonds with Gln195 and Gln197, respectively (Fig. 2g). Substitution mutagenesis of Cas12m2 confirmed the importance of Tyr141, Trp152, Asn156, Gln195 and Gln197 for DNA binding, as alanine substitution of any of these residues strongly affected the ability of Cas12m2 to interact with its DNA target in vitro (Fig. 2h and Extended Data Fig. 4). These structural and biochemical features explain the requirement of the 5′-TTN-3′ PAM sequence for Cas12m2-mediated target DNA binding.
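To illustrate what the 5′-TTN-3′ PAM requirement means at the sequence level, the following minimal sketch scans a non-target strand, written 5′ to 3′, for TTN motifs followed by a 20-nucleotide protospacer. It is an illustration only and not a tool from this study: the function name `find_ttn_sites`, the single-strand scan and the returned tuple layout are assumptions made for the example.

```python
def find_ttn_sites(nts, guide_len=20):
    """Scan a non-target strand (5'->3') for 5'-TTN-3' PAMs.

    Returns a list of (pam_start, pam, guide_rna) tuples, where guide_rna
    is the protospacer immediately 3' of the PAM, written as RNA (T -> U).
    Simplified sketch: only one strand is scanned and only unambiguous
    A/C/G/T bases are accepted.
    """
    nts = nts.upper()
    sites = []
    # The last valid PAM start leaves room for the PAM (3 nt) plus the guide.
    for i in range(len(nts) - 3 - guide_len + 1):
        if nts[i:i + 2] == "TT" and nts[i + 2] in "ACGT":
            protospacer = nts[i + 3:i + 3 + guide_len]
            if set(protospacer) <= set("ACGT"):
                sites.append((i, nts[i:i + 3], protospacer.replace("T", "U")))
    return sites


# Hypothetical test sequence: 2-nt flank, TTG PAM, 20-nt protospacer, 2-nt flank.
example = "GG" + "TTG" + "ACGTACGTACGTACGTACGT" + "CC"
print(find_ttn_sites(example))
```

A fuller search would also scan the reverse complement, since a target can lie on either strand.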
The displaced NTS passes through the groove formed by the REC1 and REC2 domains toward the groove formed by the RuvC and TNB domains (Fig. 2a,b). Notably, the central region of the negatively charged backbone of the NTS is bound to an Arg-rich pocket in the REC1 and REC2 domains and interacts extensively with Arg111, Arg112, Arg126 and Arg173 (Fig. 2i). Such an Arg-rich pocket has not been observed in any other Cas12 enzymes, even though these Arg residues are highly conserved among Cas12m family enzymes (Extended Data Fig. 5). Cas12m2 mutants with alanine substitutions of Arg111, Arg112 and Arg126 showed reduced binding to target DNA in vitro (Fig. 2h and Extended Data Fig. 4c), suggesting the critical role of the unique Arg-rich pocket for the robust DNA-binding affinity of Cas12m enzymes.
Non-canonical HDD-type RuvC active site
In most Cas12 enzymes, the RuvC active site has the conserved DED catalytic residues that coordinate two magnesium ions on either side of the scissile phosphate (Extended Data Fig. 6a). The first magnesium ion (A) activates the nucleophilic water molecule, while the second magnesium ion (B) stabilizes the transition state, resulting in a two-divalent cation-dependent DNA-hydrolysis mechanism21,22 (Extended Data Fig. 6a,b).
In the Cas12m2 structure, the nucleotides dG14* to dT17* in the NTS are clearly visible, and the phosphate group of dG16* is located at the RuvC active site formed by His317, Asp485 and Asp579 residues (Fig. 2j and Extended Data Fig. 6c). At the position corresponding to magnesium ion (A) in other Cas12 enzymes, we observed a magnesium ion that was coordinated by His317 and Asp579, whereas no density corresponding to magnesium ion (B) was observed (Fig. 2j,k and Extended Data Fig. 6c,d). This is due to the replacement of the first Asp and the second Glu in the active site with His317 and Asp485, respectively, in Cas12m2. The His317 side chain has only one lone pair for metal coordination, which is used to bind magnesium ion (A), and the Asp485 side chain is located too far away to coordinate the second magnesium ion (B) (Fig. 2j,k and Extended Data Fig. 6c,d). Therefore, the non-canonical HDD motif leads to the absence of the second magnesium ion, resulting in loss of cleavage activity for the target DNA. Nonetheless, His317 and Asp579 are bound to the NTS via magnesium ion coordination, and His317 is stabilized by hydrogen bonding with Asp485, suggesting that the HDD motif plays a crucial role in target DNA binding, rather than target DNA cleavage. Consistent with these structural observations, our previous study showed that alanine substitution of Asp485 (HDD > HAD) leads to reduced DNA-binding affinity in vitro and transcriptional silencing activity in vivo11, indicating that Cas12m2 uses the RuvC catalytic center as the DNA-binding pocket for robust DNA-binding affinity.
Conformational changes upon heteroduplex formation lead to robust DNA binding
In state I, the target DNA forms a full R-loop containing a 17-bp heteroduplex, whereas, in state II, the target DNA only forms a 12-bp heteroduplex, and the displaced NTS is disordered (Fig. 3a,b). Thus, we propose that state II represents the intermediate state during formation of the 17-bp heteroduplex. Hereafter, we refer to state I as the full R-loop state and to state II as the intermediate state.
Structural comparisons between the full R-loop and the intermediate states revealed conformational changes in the REC2 domain (Fig. 3a–c and Supplementary Video 1). In the intermediate state, the REC2 domain is disordered due to its flexibility (Fig. 3a). By stark contrast, in the full R-loop, the REC2 domain is ordered and interacts with the sugar–phosphate backbone of the heteroduplex at the PAM-distal region (Fig. 3b). This structural rearrangement of the REC2 domain facilitates formation of the Arg-rich pocket between the REC1 and REC2 domains, allowing the binding of the displaced NTS (Fig. 3b). Structural comparisons also indicated local conformational changes in two short α-helices (Lα1 and Lα2) and the loop connecting them (together referred to as the lid motif) in the RuvC domain (Fig. 3c–f). In the intermediate state, residues 499–506 (part of Lα2 and the loop) are disordered due to the flexibility of this region, and Lα1 occludes the RuvC active site (Fig. 3d,f). By contrast, upon formation of the full R-loop, Thr502 in the lid motif hydrogen bonds with the phosphate backbone of dC2, while Pro503 and Leu504 in Lα2 form hydrophobic interactions with dC(−1) and dG18*, respectively (Fig. 3e). These structural rearrangements facilitate the local structural transition of Lα1, opening the RuvC active site to bind the NTS, as observed in Cas12b and Cas12i23,24 (Fig. 3e,f). Taken together, the conformational changes of the REC2 domain and the lid motif, which enable recognition of the PAM-distal end, provide a binding site for NTS within the Arg-rich pocket and the RuvC DNA-binding site, allowing Cas12m2 to silence transcription of target genes through its robust DNA-binding affinity.
To biochemically validate the importance of the two NTS-binding sites (that is, the Arg-rich pocket and the RuvC active site) in Cas12m2-mediated transcriptional repression, we performed an in vivo transcriptional silencing assay. We generated a target plasmid
Fig. 3 | Conformational changes upon heteroduplex formation enable robust DNA binding of Cas12m2. a,b, Structural models of the Cas12m2–crRNA–target DNA ternary complex, representing the intermediate state (a) and full R-loop state (b). The disordered regions are indicated as dashed lines. c, Superimposition of the intermediate state (light blue) and the full R-loop state (colored as in b). d,e, Close-up views around the lid motif of the intermediate state (d) and the full R-loop state (e). The disordered regions are indicated as the gray circle surrounded by dashed lines. Hydrogen bonds are depicted with green dashed lines. f, Superimposition of the lid motif of the intermediate state (light blue) and the full R-loop state (colored as in b). The displaced NTS (dC15*–dT17*) in the full R-loop state clashes with the Lα2 helix in the intermediate state. g, Schematic of the bicistronic rfp-gfp operon (pTarget-operon), including the crRNA targeting sites in the promoter (A1) and at the end of the operon (3′-UTR; E1) described previously11. h, Normalized RFP and GFP fluorescence of E. coli cultures expressing different Cas12m2 mutants, directed to either the promoter (pCRISPR-A1) or the 3′-UTR (pCRISPR-E1). Fluorescence is shown relative to the average fluorescence of cultures expressing a non-targeting guide (pCRISPR-NT). Values shown are the mean ± s.e.m. of two values of biological duplicates (after averaging technical triplicates). Statistical analysis was carried out using Tukey’s post hoc test in case of a significant two-way ANOVA result. Significant differences relative to Cas12m2 values are indicated by asterisks. *P < 0.05, **P < 0.01, ***P < 0.001. dCas12m2, Cas12m2(D485A) mutant; triple, Cas12m2(R111A/R112A/R126A) triple mutant.
(pTarget-operon) containing a bicistronic operon with two fluorescence reporter genes, rfp and gfp. Escherichia coli cells harboring pTarget-operon and either pCRISPR-A1 (crRNA targeting the promoter region of the operon), pCRISPR-E1 (crRNA targeting the 3′ untranslated region (UTR) of the operon) or pCRISPR-NT (crRNA that does not target the operon) were made chemically competent and transformed with either pCas-Cas12m or plasmids encoding its mutants (including Cas12m2(D485A), Cas12m2(R111A), Cas12m2(R112A) and Cas12m2(R126A) mutants and the Cas12m2(R111A/R112A/R126A) triple mutant) (Fig. 3g). Wild-type Cas12m2 efficiently reduced red fluorescent protein (RFP) and green fluorescent protein (GFP) fluorescence upon crRNA targeting both the A1 and E1 sites, as observed in a previous study (Fig. 3h). By contrast, all mutants exhibited reduced transcriptional silencing capabilities when targeting the E1 site, and the Cas12m2(R111A/R112A/R126A) triple mutant showed
lower transcriptional silencing efficacy when targeting both the A1 and E1 sites (Fig. 3h). These results indicate that the Arg-rich pocket and the RuvC DNA-binding site, which result from conformational changes of the REC2 domain and the lid motif, play critical roles in Cas12m2-mediated transcriptional silencing.
Evolutionary path from TnpB to Cas12m2
The Cas12m family enzymes are thought to represent the early stage of evolution from TnpB to larger Cas12 enzymes2 (Extended Data Fig. 7). Thus, structural comparisons of TnpB and Cas12m2 may provide clues to trace the evolutionary path from TnpB to Cas12 enzymes.
Fig. 4 | Structural comparison of Cas12m2 and TnpB. a, Structural comparison of Cas12m2 and TnpB (from Deinococcus radiodurans ISDra2) (PDB 8H1J). b, Structural comparison of Cas12m2 and TnpB with emphasis on the guide RNA–target DNA heteroduplex. While Cas12m2 shares high sequence similarity with TnpB, Cas12m2 has an inserted REC2 domain (highlighted in blue), which is not present in TnpB, thereby facilitating recognition of the longer guide RNA–target DNA heteroduplex.
[Panel labels: TnpB, residues 1–408 (REC, WED, RuvC, TNB), TAM, 12-bp heteroduplex; Cas12m2, residues 1–596 (REC1, REC2, WED, RuvC, TNB), PAM, 17-bp heteroduplex.]
The structure of Cas12m2 strongly resembles that of the minimal
TnpB containing WED, REC1 (REC domain in TnpB), RuvC and TNB
domains25,26 (Fig. 4a). Both enzymes adopt a bilobed architecture and
accommodate the guide RNA–target DNA heteroduplex in similar
manners (Fig. 4a). The PAM duplex (transposon-associated motif
(TAM) duplex in TnpB) is commonly bound between the WED and
REC1 domains, and the guide RNA scaffold is accommodated within
the groove formed by the WED and RuvC domains. These structural
similarities confirm that the RNA-guided DNA-targeting mechanism
is highly conserved between TnpB and Cas12m2.
In contrast to these conserved structural features, there are
also important structural differences. TnpB only recognizes a 12-bp
guide RNA–target DNA heteroduplex, whereas Cas12m2 recognizes
a longer 17-bp heteroduplex (Fig. 4b). This difference is mainly due
Fig. 5 | Model of evolution from TnpB to Cas12 enzymes. a, Structural comparison of Cas12m2 with TnpB (from D. radiodurans ISDra2) (PDB 8H1J), Cas12f (from an uncultured archaeon) (PDB 7C7L) and Cas12a (from Francisella novicida) (PDB 6I1K). Second Cas12f molecule (mol. 2) and the specific REC2 insertions in Cas12m2 and Cas12a, which are responsible for recognition of the PAM-distal region, are highlighted in blue. The insertions from Cas12m2 to Cas12a are highlighted in orange. The blue circle represents the region where TnpB acquired additional elements to engage in the adaptive immune system, and the orange circles represent the region where domain insertions and enlargements occurred for more efficient target recognition and cleavage. b, Schematic of domain insertions and enlargements of Cas12 enzymes from TnpB. ZF, zinc finger domain.
[Panel labels: TnpB (residues 1–408), Cas12f (1–529), Cas12m (1–596), Cas12a (1–1,300); an evolutionary time axis annotated with 'Dimerization for recognition of the PAM-distal region', 'Domain insertion for recognition of the PAM-distal region' and 'Domain insertion/enlargement for maturation of the adaptive immune system'.]
to the characteristic REC2 insertion into the REC1 domain in Cas12m2 (Fig. 4a,b). TnpB lacks this REC2 domain, and thus the end of the guide RNA–target DNA heteroduplex is not recognized and the heteroduplexes are disordered beyond the 12th base pair. Therefore, these structural findings suggest that Cas12m2 acquired the ability to recognize the PAM-distal region of the guide RNA–target DNA heteroduplex through REC2 insertion into the core structure of TnpB. These features contributed to evolution toward CRISPR–Cas adaptive immunity.
Discussion
In this study, we determined the cryo-EM structure of the Cas12m2–crRNA–target DNA ternary complex, revealing that the non-canonical HDD-type RuvC catalytic triad forms an NTS-binding site via coordination of a single magnesium ion. Intriguingly, when we introduced counter-mutations to convert the HDD motif of Cas12m2 into a canonical DED motif, DNA-cleavage activity was not restored (data not shown). This observation indicates that there could be more reasons why Cas12m2 lost DNA-cleavage activity aside from the HDD-type RuvC triad. In a recent study, we divided the Cas12m family enzymes further into four clades, based on the conserved RuvC catalytic triad11 (clade 1, HEK, HQK; clade 2, HDD; clade 3, DED, NED; clade 4, DED, DND, NED). Because the other Cas12m enzyme clades have different catalytic residues, they could interfere with invading MGEs through mechanisms distinct from that of Cas12m2. Notably, clade 1 (HQK) may directly bind the NTS within the positively charged RuvC active site, while clade 3 and clade 4 may facilitate two-divalent cation-dependent DNA hydrolysis within the canonical DED-type RuvC active site. Biochemical and structural characterizations of the other Cas12m enzyme clades are required to further clarify the molecular mechanisms of Cas12m family enzymes.
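The clade-to-triad assignments listed above can be written as a small lookup, which makes explicit that a triad alone does not always identify a clade (DED and NED occur in both clade 3 and clade 4). This is only a sketch of the classification as stated in the text; the names `CLADE_TRIADS`, `clades_for_triad` and `is_canonical` are hypothetical.

```python
# RuvC catalytic triads per Cas12m clade, as listed in the text.
CLADE_TRIADS = {
    1: {"HEK", "HQK"},
    2: {"HDD"},
    3: {"DED", "NED"},
    4: {"DED", "DND", "NED"},
}


def clades_for_triad(triad):
    """Return the sorted list of clades in which a triad is reported."""
    triad = triad.upper()
    return sorted(c for c, triads in CLADE_TRIADS.items() if triad in triads)


def is_canonical(triad):
    """True for the canonical Asp-Glu-Asp (DED) triad."""
    return triad.upper() == "DED"


for t in ("HDD", "DED", "HQK"):
    print(t, clades_for_triad(t), is_canonical(t))
```

The HDD triad of Cas12m2 maps only to clade 2, whereas the canonical DED triad maps to both clades 3 and 4.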
Most Cas12 enzymes auto-process their own pre-crRNA into
mature crRNA without the involvement of a host ribonuclease.
Cas12a and Cas12i process their pre-crRNA at their WED domain by
metal-independent,acid–basecatalyticmechanisms21,24,27,28
,whereas
Cas12candCas12jprocesstheirpre-crRNAattheRuvCcatalyticsiteby
ametal-dependentmechanism8,20
.Arecentstudyshowedthat,similar
to Cas12a and Cas12i, Cas12m2 cleaves its pre-crRNA upstream of the
crRNAscaffold,andtwoconservedresidues(His269andArg270)inthe
WED domain may participate in crRNA maturation11
. However, unlike
Cas12a and Cas12i, in which the 5′ end of the crRNA is surrounded by
theWEDdomain,the5′endofCas12m2–crRNAisprojectedawayfrom
the WED domain and exposed to the solvent in the present structure
(Extended Data Fig. 8). His269 and Arg270 in the WED domain form
base-specific interactions with the crRNA scaffold, suggesting that
these residues are responsible for crRNA recognition, rather than
crRNA processing (Fig. 2d). Indeed, a processing analysis of Cas12m2
revealed that alanine substitutions of His269 and Arg270 in the WED
domain and Asp485 in the RuvC catalytic site had little or no effect
on the pre-crRNA-processing pattern (Extended Data Fig. 9a). These
structural and biochemical analyses suggest that Cas12m2 may process
its pre-crRNA in a unique manner. Intriguingly, when we analyzed the
cryo-EM structure of Cas12m2 complexed with a 56-nucleotide crRNA,
we observed an additional density close to the 5′ region of the crRNA
(Table 1, Extended Data Fig. 9b–e and Supplementary Table 1). This
density aligns well with part of the distinctive sickle-like architecture of
the REC1 and REC2 domains in Cas12m2, suggesting that Cas12m2 may
facilitate pre-crRNA processing through the formation of a transient
asymmetric homodimer. However, because the present resolution is
not sufficient to unambiguously elucidate the residues responsible for
crRNA maturation, further studies are required to fully understand the
pre-crRNA-processing mechanism of Cas12m2.
A structural comparison of TnpB, the putative ancestor of Cas12
enzymes, with Cas12m2, Cas12f14 and Cas12a19 provides insights into the
evolutionary path of Cas12 enzymes (Fig. 5a,b). Cas12m2 and Cas12a
commonly have REC2 insertions that recognize the PAM-distal region
of the guide RNA–target DNA heteroduplex, whereas Cas12f retains
a TnpB-like REC domain. Nevertheless, Cas12f functions as an asym-
metric homodimer to compensate for recognition of the PAM-distal
region of the heteroduplex, with the second molecule serving as a
replacement for the REC2 domain14,15. These observations suggest
that CRISPR–Cas12 enzymes acquired the ability to recognize the
PAM-distal region of the guide RNA–target DNA heteroduplex to
engage in CRISPR–Cas adaptive immunity. All Cas12 enzymes except
for Cas12f and Cas12k (which are both associated with one or more
additional protein molecules) have the REC2 insertion to recognize
the PAM-distal region14,15,29,30, supporting our suggestion. Structural
comparison between Cas12m2 and Cas12a provides insights into the
maturation of Cas12 enzymes (Fig. 5a,b). Notably, Cas12a has larger
REC domains and a TNB domain19, which are responsible for target DNA
recognition and loading, respectively. In addition, Cas12a contains a
PAM-interacting (PI) domain, which extensively interacts with the PAM
duplex19 (Fig. 5a,b). These domain enlargements and the insertion
may have been acquired to target the dsDNA of invading phage more
efficiently and/or to compete with anti-CRISPR systems.
Online content
Any methods, additional references, Nature Portfolio reporting summaries,
source data, extended data, supplementary information,
acknowledgements, peer review information; details of author contributions
and competing interests; and statements of data and code availability
are available at https://doi.org/10.1038/s41594-023-01042-3.
References
1. van der Oost, J., Westra, E. R., Jackson, R. N. & Wiedenheft, B.
Unravelling the structural and mechanistic basis of CRISPR–Cas
systems. Nat. Rev. Microbiol. 12, 479–492 (2014).
2. Makarova, K. S. et al. Evolutionary classification of CRISPR–
Cas systems: a burst of class 2 and derived variants. Nat. Rev.
Microbiol. 18, 67–83 (2020).
3. Zetsche, B. et al. Cpf1 is a single RNA-guided endonuclease of a
class 2 CRISPR–Cas system. Cell 163, 759–771 (2015).
4. Shmakov, S. et al. Discovery and functional characterization
of diverse class 2 CRISPR–Cas systems. Mol. Cell 60,
385–397 (2015).
5. Yan, W. X. et al. Functionally diverse type V CRISPR–Cas systems.
Science 363, 88–91 (2019).
6. Burstein, D. et al. New CRISPR–Cas systems from uncultivated
microbes. Nature 542, 237–241 (2017).
7. Harrington, L. B. et al. Programmed DNA destruction by miniature
CRISPR–Cas14 enzymes. Science 362, 839–842 (2018).
8. Pausch, P. et al. CRISPR–CasΦ from huge phages is a
hypercompact genome editor. Science 369, 333–337 (2020).
9. Strecker, J. et al. RNA-guided DNA insertion with
CRISPR-associated transposases. Science 365, 48–53 (2019).
10. Sun, A. et al. The compact Casπ (Cas12l) ‘bracelet’ provides a
unique structural platform for DNA manipulation. Cell Res. 33,
229–244 (2023).
11. Wu, W. Y. et al. The miniature CRISPR–Cas12m effector binds DNA
to block transcription. Mol. Cell 82, 4487–4502 (2022).
12. Karvelis, T. et al. Transposon-associated TnpB is a programmable
RNA-guided DNA endonuclease. Nature 599, 692–696 (2021).
13. Altae-Tran, H. et al. The widespread IS200/IS605 transposon
family encodes diverse programmable RNA-guided
endonucleases. Science 374, 57–65 (2021).
14. Takeda, S. N. et al. Structure of the miniature type V-F CRISPR–Cas
effector enzyme. Mol. Cell 81, 558–570 (2021).
15. Xiao, R., Li, Z., Wang, S., Han, R. & Chang, L. Structural
basis for substrate recognition and cleavage by the
dimerization-dependent CRISPR–Cas12f nuclease. Nucleic Acids
Res. 49, 4120–4128 (2021).
16. Yamano, T. et al. Crystal structure of Cpf1 in complex with guide
RNA and target DNA. Cell 165, 949–962 (2016).
17. Stella, S., Alcón, P. & Montoya, G. Structure of the Cpf1
endonuclease R-loop complex after target DNA cleavage. Nature
546, 559–563 (2017).
18. Liu, J.-J. et al. CasX enzymes comprise a distinct family of
RNA-guided genome editors. Nature 566, 218–223 (2019).
19. Swarts, D. C. & Jinek, M. Mechanistic insights into the cis-
and trans-acting DNase activities of Cas12a. Mol. Cell 73,
589–600 (2019).
20. Kurihara, N. et al. Structure of the type V-C CRISPR–Cas effector
enzyme. Mol. Cell 82, 1865–1877 (2022).
21. Huang, X. et al. Structural basis for two metal-ion catalysis of DNA
cleavage by Cas12i2. Nat. Commun. 11, 5241 (2020).
22. Pausch, P. et al. DNA interference states of the hypercompact
CRISPR–CasΦ effector. Nat. Struct. Mol. Biol. 28, 652–661 (2021).
23. Yang, H., Gao, P., Rajashankar, K. R. & Patel, D. J. PAM-dependent
target DNA recognition and cleavage by C2c1 CRISPR–Cas
endonuclease. Cell 167, 1814–1828 (2016).
24. Zhang, H., Li, Z., Xiao, R. & Chang, L. Mechanisms for
target recognition and cleavage by the Cas12i RNA-guided
endonuclease. Nat. Struct. Mol. Biol. 27, 1069–1076 (2020).
25. Nakagawa, R. et al. Cryo-EM structure of the transposon-
associated TnpB enzyme. Nature 616, 390–397 (2023).
26. Sasnauskas, G. et al. TnpB structure reveals minimal functional
core of Cas12 nuclease family. Nature 616, 384–389 (2023).
27. Swarts, D. C., van der Oost, J. & Jinek, M. Structural basis for
guide RNA processing and seed-dependent DNA targeting by
CRISPR–Cas12a. Mol. Cell 66, 221–233 (2017).
28. Zhang, B. et al. Mechanistic insights into the R-loop
formation and cleavage in CRISPR–Cas12i1. Nat. Commun. 12,
3476 (2021).
29. Querques, I., Schmitz, M., Oberli, S., Chanez, C. & Jinek, M. Target
site selection and remodelling by type V CRISPR–transposon
systems. Nature 599, 497–502 (2021).
30. Xiao, R. et al. Structural basis of target DNA recognition by
CRISPR–Cas12k for RNA-guided DNA transposition. Mol. Cell 81,
4457–4466 (2021).
Publisher’s note Springer Nature remains neutral with
regard to jurisdictional claims in published maps and
institutional affiliations.
Open Access This article is licensed under a Creative Commons
Attribution 4.0 International License, which permits use, sharing,
adaptation, distribution and reproduction in any medium or format,
as long as you give appropriate credit to the original author(s) and the
source, provide a link to the Creative Commons license, and indicate
if changes were made. The images or other third party material in this
article are included in the article’s Creative Commons license, unless
indicated otherwise in a credit line to the material. If material is not
included in the article’s Creative Commons license and your intended
use is not permitted by statutory regulation or exceeds the permitted
use, you will need to obtain permission directly from the copyright
holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
© The Author(s) 2023
Methods
Protein and RNA preparation for structural analysis
The Cas12m2 protein for structural analysis was expressed and purified
using the protocol reported previously14,20. Briefly, the N-terminally
His6-tagged Cas12m2 protein was expressed in E. coli Rosetta 2 (DE3).
E. coli cells were cultured at 37 °C until the OD600 reached 0.8, and
protein expression was then induced by the addition of 0.1 mM isopropyl
β-d-thiogalactopyranoside (Nacalai Tesque). E. coli cells were
further cultured at 20 °C overnight and collected by centrifugation.
The cells were then resuspended in buffer A (20 mM HEPES–NaOH,
pH 7.6, 20 mM imidazole and 1 M NaCl), lysed by sonication and centrifuged.
The supernatant was mixed with 3 ml Ni-NTA Superflow resin
(Qiagen), and the mixture was loaded into an Econo-Column (Bio-Rad).
The protein was eluted with buffer B (20 mM HEPES–NaOH, pH 7.6,
0.3 M imidazole, 0.3 M NaCl) and then loaded onto a 5-ml HiTrap SP
HP column (GE Healthcare) equilibrated with buffer C (20 mM HEPES–NaOH,
pH 7.6, and 0.3 M NaCl). The protein was eluted with a linear
gradient of 0.3–2 M NaCl and further purified by chromatography on a
Superdex 200 column (GE Healthcare) equilibrated in buffer D (20 mM
HEPES–NaOH, pH 7.6, 0.5 M NaCl). The purified proteins were stored
at −80 °C until use. The crRNA was transcribed in vitro with T7 RNA
polymerase and purified by 10% denaturing (7 M urea) polyacrylamide
gel electrophoresis.
Electron microscopy sample preparation and data collection
The Cas12m2–crRNA–target DNA complex was reconstituted by mixing
purified Cas12m2, the 56-nucleotide crRNA, the 36-nucleotide
target DNA and the 36-nucleotide non-target DNA at a molar ratio
of 1:1.2:1.5:1.5. The Cas12m2–crRNA binary complex was reconsti-
tuted by mixing purified Cas12m2 and the 56-nucleotide crRNA at a
molar ratio of 1:1.2. The ternary and binary complexes were purified
by size-exclusion chromatography on a Superdex 200 Increase 10/300
column (GE Healthcare) equilibrated with buffer E (20 mM HEPES–NaOH,
pH 7.6, 50 mM NaCl, 2 mM MgCl2 and 10 μM ZnCl2). The purified
complex solution (A260 = 4) was then applied to Au 300-mesh R1.2/1.3
grids (Quantifoil), and, after adding 3 μl amylamine, they were glow
discharged in a Vitrobot Mark IV system (FEI) at 4 °C with a waiting time
of 10 s and a blotting time of 4 s under 100% humidity conditions. The
grids were plunge frozen in liquid ethane and cooled to the temperature
of liquid nitrogen.
Micrographs for all datasets were collected on a Titan Krios G3i
microscope (Thermo Fisher Scientific) running at 300 kV and equipped
with a Gatan Quantum LS Energy Filter (GIF) and a Gatan K3 Summit
direct electron detector in electron-counting mode (University of
Tokyo, Japan). Movies were recorded at a nominal magnification of
105,000×, corresponding to a calibrated pixel size of 0.83 Å, with a
total dose of approximately 50 electrons per Å² per 48 frames using
EPU software (Thermo Fisher Scientific). The dose-fractionated movies
were subjected to beam-induced motion correction and dose weighting
using MotionCor2 (ref. 31) implemented in RELION-3.1 (ref. 32),
and contrast transfer function (CTF) parameters were estimated using
patch-based CTF estimation in cryoSPARC version 3.3.2 (ref. 33).
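The acquisition parameters quoted above imply a per-frame dose and a per-pixel electron count that can be checked with simple arithmetic. The sketch below uses only the stated values (about 50 electrons per Å², 48 frames, 0.83 Å per pixel); the function names are hypothetical, and the even split of dose across frames is an assumption.

```python
def dose_per_frame(total_dose, n_frames):
    """Average dose per frame in electrons per square angstrom,
    assuming the total dose is spread evenly over the frames."""
    return total_dose / n_frames


def electrons_per_pixel(total_dose, pixel_size):
    """Total electrons landing on one detector pixel over the whole movie,
    given the dose (e-/A^2) and the calibrated pixel size (A)."""
    return total_dose * pixel_size ** 2


if __name__ == "__main__":
    total_dose = 50.0   # e-/A^2, approximate value from the text
    frames = 48
    pixel = 0.83        # A per pixel
    print(f"dose per frame: {dose_per_frame(total_dose, frames):.3f} e-/A^2")
    print(f"electrons per pixel: {electrons_per_pixel(total_dose, pixel):.2f}")
```

With these inputs the per-frame dose is roughly 1 electron per Å².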
Single-particle cryo-EM data processing
Data were processed using cryoSPARC33. For the ternary complex,
5,067,096 particles were initially selected from the 3,570 motion-corrected
and dose-weighted micrographs using a two-dimensional
(2D) reference and extracted at a pixel size of 3.32 Å. These particles
were subjected to several rounds of 2D classification to curate particle
sets. The particles were further curated by heterogeneous refinement,
using a map derived from ab initio reconstruction as the template.
The selected 1,030,197 particles were then re-extracted at a pixel size
of 1.16 Å and subjected to 3D variability analysis34. The resulting maps
with different conformations were used for subsequent heterogeneous
refinement. The selected particles after heterogeneous refinement
were refined using non-uniform refinement with optimization of the
CTF value35, yielding maps at resolutions of 2.87 Å (state I) and 3.08 Å
(state II), according to the FSC criterion of 0.143 (ref. 36). The local
resolution was estimated with BlocRes in cryoSPARC.
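The FSC = 0.143 criterion reports the resolution at which the Fourier shell correlation curve first drops below 0.143. The following is a minimal sketch of how such a crossing can be read off a sampled curve by linear interpolation. It is not the cryoSPARC implementation, the function name is hypothetical, and the example curve is invented toy data rather than the curve from this study.

```python
def resolution_at_threshold(spatial_freqs, fsc_values, threshold=0.143):
    """Return the resolution (1 / spatial frequency) at the first point
    where the FSC curve falls below `threshold`, using linear
    interpolation between neighbouring shells. Returns None if the
    curve never crosses the threshold."""
    points = list(zip(spatial_freqs, fsc_values))
    for (f0, c0), (f1, c1) in zip(points, points[1:]):
        if c0 >= threshold > c1:
            fraction = (c0 - threshold) / (c0 - c1)
            crossing = f0 + fraction * (f1 - f0)
            return 1.0 / crossing
    return None


if __name__ == "__main__":
    # Toy curve: spatial frequency in 1/A, correlation between half-maps.
    freqs = [0.10, 0.20, 0.30, 0.40]
    fsc = [1.00, 0.90, 0.243, 0.043]
    print(resolution_at_threshold(freqs, fsc))
```

For the toy curve the crossing lies halfway between the last two shells, at a spatial frequency of 0.35 Å⁻¹.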
For the binary complex, 6,034,518 particles were initially selected
from a total of 8,004 motion-corrected and dose-weighted micrographs
using a 2D reference and extracted at a pixel size of 3.32 Å. The
particles were further curated by heterogeneous refinement using a
map derived from ab initio reconstruction as the template. The selected
215,565 particles were then re-extracted at a pixel size of 1.66 Å and subjected
to 3D classification without alignment, using a mask focused on
the second subunit. After 3D classification, the selected particles were
refined using non-uniform refinement, yielding a map at a resolution
of 3.73 Å, according to the FSC criterion of 0.143. The local resolution
was estimated with BlocRes in cryoSPARC.
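The extraction pixel sizes quoted in this section can be related to the calibrated pixel size of 0.83 Å. The sketch below assumes that the larger pixel sizes arise from down-sampling of the extracted particles, and uses the standard Nyquist relation (best attainable resolution is twice the pixel size). The function names are hypothetical.

```python
CALIBRATED_PIXEL_SIZE = 0.83  # A per pixel, from the data-collection section


def binning_factor(extracted_pixel_size, raw_pixel_size=CALIBRATED_PIXEL_SIZE):
    """Down-sampling factor implied by an extraction pixel size."""
    return extracted_pixel_size / raw_pixel_size


def nyquist_limit(pixel_size):
    """Finest resolution (in A) representable at a given pixel size."""
    return 2.0 * pixel_size


if __name__ == "__main__":
    for label, pixel in (("initial extraction", 3.32),
                         ("ternary re-extraction", 1.16),
                         ("binary re-extraction", 1.66)):
        print(f"{label}: factor {binning_factor(pixel):.2f}, "
              f"Nyquist {nyquist_limit(pixel):.2f} A")
```

Under these assumptions the initial extraction corresponds to fourfold down-sampling, and the Nyquist limits after re-extraction (2.32 Å and 3.32 Å) stay finer than the reported resolutions of 2.87 Å, 3.08 Å and 3.73 Å.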
Model building and validation
The model was built using the predicted model of the Cas12m2 protein
created by AlphaFold2 (the AlphaFold model) as the reference, followed
by manual model building with Coot37,38. The model was refined
using phenix.real_space_refine version 1.20.1 (ref. 39) with secondary
structure and metal coordination restraints. The metal coordination
restraints were generated using ReadySet, as implemented in PHENIX.
Structure validation was performed using MolProbity40. Residues
592–596 and nucleotides −36 to −30 of the crRNA, nucleotides −9 to
−5 of the TS and nucleotides 22–26 of the NTS were not included in
the final full R-loop state model because these regions were not well
resolved on the density map. Residues 53–120, 499–506 and 592–596
and nucleotides −36 to −30 and 13–20 of the crRNA, nucleotides −9 to
5 and 27 of the TS and nucleotides −10 and 13–26 of the NTS were not
included in the final intermediate state model. In the binary complex,
residues 87–90, 488–508 and 592–596 of Cas12m2.1, residues 1–17,
65–94, 189–250 and 262–596 of Cas12m2.2 and nucleotides −36 to
−30 and 16–20 of the crRNA were not included in the final model. The
cryo-EM density maps were calculated with UCSF ChimeraX41, and the
molecular graphics shown in the figures were prepared with CueMol
(http://www.cuemol.org).
Assembly of Cas12m2 PAM and REC mutant plasmids
Plasmids used for protein expression of Cas12m2 PAM and REC mutants
were assembled by PCR mutagenesis (Q5 polymerase, NEB; T4 DNA
ligase, NEB) using the primers listed in Supplementary Table 2, with
pML-1B-Cas12m as the PCR template11. Sequence-verified plasmids
were used as templates for amplification of their inserts for restriction
ligation assembly of pCas-Cas12m2-[x] plasmids, which were used
for in vivo transcriptional silencing assays, with pCas-Cas12m2 as the
template for backbone amplification (Supplementary Table 2).
In vivo transcriptional silencing assay
The transcriptional silencing activity of Cas12m variants in vivo was
assessed using the protocol reported previously11, with minor modifications.
Briefly, E. coli cells harboring pTarget-operon and either
pCRISPR-A1, pCRISPR-E1 or pCRISPR-NT were made chemically com-
petent and transformed with either pCas-Cas12m, pCas-dCas12m or
pCas-Cas12m-[x] (the ten alanine-substitution mutants). After recov-
ery, 5 μl of transformation mix was diluted with 195 μl M9TG medium
in a 96-well 2-ml masterblock (Greiner) in triplicate, sealed with a
gas-permeable membrane and grown at 37 °C and 750 r.p.m. for 20 h.
Cells were diluted 1:1,000 in fresh M9TG medium in a total volume of
200 μl in a 96-well masterblock and grown overnight at 37 °C. Overnight
cultures were diluted 1:10 in 200 μl PBS and analyzed using a Synergy
Mx microplate reader (BioTek) and Gen5 (software version 3.10.06).
The fluorescence of GFP (excitation, 488 nm; emission, 512 nm) and
RFP (excitation, 532 nm; emission, 588 nm) was measured with gains
of 91 and 140, respectively. Fluorescence values were normalized to
blank-corrected OD600 values, and technical triplicates were averaged.
In Fig. 3h, the averages of two biological replicates are shown relative
to the average normalized fluorescence of wells containing E. coli
expressing a non-targeting spacer (pCRISPR-NT). Statistical analysis
was performed using Tukey’s post hoc test in case of significant two-way
ANOVA results.
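The normalization described above (fluorescence divided by blank-corrected OD600, technical replicates averaged, then expressed relative to the non-targeting control) can be sketched as follows. The function names and the example numbers are hypothetical, and the sketch covers a single biological replicate only; it is an illustration of the arithmetic, not the analysis script used in the study.

```python
from statistics import mean


def normalized_fluorescence(fluorescence, od600, blank_od600):
    """Fluorescence divided by blank-corrected OD600 for one well."""
    return fluorescence / (od600 - blank_od600)


def relative_signal(sample_wells, control_wells, blank_od600):
    """Average the normalized technical replicates of the sample and of
    the non-targeting control, then express the sample relative to the
    control. Each well is a (fluorescence, od600) pair."""
    sample = mean(normalized_fluorescence(f, od, blank_od600)
                  for f, od in sample_wells)
    control = mean(normalized_fluorescence(f, od, blank_od600)
                   for f, od in control_wells)
    return sample / control


if __name__ == "__main__":
    blank = 0.05  # hypothetical OD600 of medium-only wells
    targeting = [(100.0, 0.55), (120.0, 0.65), (110.0, 0.60)]
    non_targeting = [(500.0, 0.55), (600.0, 0.65), (550.0, 0.60)]
    print(relative_signal(targeting, non_targeting, blank))
```

A value well below 1 for a targeting spacer would indicate silencing relative to the non-targeting control.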
Cas12m2 purification for surface plasmon resonance analyses
The pML-1B-Cas12m2, pML-1B-dCas12m2 and pML-1B-Cas12m2-[x]
plasmids were transformed into chemically competent E. coli Rosetta
2 (DE3) cells and purified using a modified version of the previously
reported protocol11. Briefly, the transformed E. coli strains were inoculated
into 800 ml LB medium supplemented with chloramphenicol and
kanamycin, cultured overnight and grown to an OD600 of 0.6–0.8 at
37 °C and 150 r.p.m. The cultures were cooled on ice for 1 h before the
addition of 0.4 mM isopropyl 1-thio-β-d-galactopyranoside and then
grown for 16–20 h at 20 °C and 120 r.p.m. Cultured cells were pelleted,
washed and resuspended in lysis buffer (1 M NaCl, 20 mM imidazole,
20 mM HEPES, pH 7.6) freshly supplemented with protease inhibitors
(Roche cOmplete, EDTA-free) and lysed by sonication (Bandelin
Sonopuls). The lysate was cleared (30 min, 30,000g), and the supernatant
was filtered (0.45 μm). The filtrate was applied to a nickel column
(HisTrap HP, GE Life Sciences), which was washed (500 mM NaCl,
20 mM imidazole, 20 mM HEPES, pH 7.6) and eluted (500 mM NaCl,
300 mM imidazole, 20 mM HEPES, pH 7.6) using an ÄKTA FPLC system.
Elution fractions containing the expected band were pooled, concentrated
in SEC buffer (500 mM NaCl, 20 mM HEPES, pH 7.6, 1 mM DTT)
through centrifugal filters (Amicon Ultra-15, MW cutoff 30,000 Da) and
further purified by size-exclusion chromatography (HiLoad 16/600
Superdex 200) by FPLC. Pure fractions were pooled and snap frozen
in liquid nitrogen until analysis by surface plasmon resonance (SPR).
Surface plasmon resonance
SPR spectroscopy was performed at 25 °C on a Biacore T100 system
(Cytiva). A series S CM5 sensor chip surface was modified with 2,500
response units of streptavidin (Invitrogen) using an amine coupling
kit (Cytiva). Flow cell 1 was used as the reference. A 50-nucleotide
biotinylated oligonucleotide containing a DNA target site was annealed
to its complementary strand (Supplementary Table 2; IDT) in 25 mM
HEPES, pH 7.5, containing 150 mM KCl, and 20 response units were
immobilized on flow cell 2. Ribonucleoprotein complexes were
formed by mixing purified protein variants with a twofold molar excess
of PAM-SCNR RNA or a non-target RNA (Supplementary Table 2) to a
final concentration of 500 nM protein in SPR buffer (20 mM HEPES,
pH 7.8, 150 mM KCl, 10% glycerol, 5 mM MgCl2, 1 mM DTT and 0.05%
Tween-20). Dilutions of this complex were injected over the chip surface
at 50 µl min⁻¹. Flow cells were regenerated using three consecutive
injections of 4 M MgCl2, followed by an injection of the complementary
oligonucleotide (1 µM) for 100 s at 5 µl min⁻¹ in flow cell 2. Wild-type
Cas12m2 displays multiphasic association and dissociation kinetics,
which is most prominent at protein concentrations above 100 nM11;
therefore, sensorgrams at lower concentrations (8–63 nM) were fitted
to a two-state model using BIAevaluation software (Cytiva) to obtain
a KD estimate. After double-reference correction, the remaining bulk
effects masked the severely reduced binding capabilities of the mutant
proteins in the association phase in the sensorgrams; therefore, the
binding of Cas12m2 variants at 500 nM to the target site was quantified
from the stable binding levels at the start of the dissociation phase
(average signal of a 10-s window, 20 s after dissociation) relative to that
of wild-type Cas12m2.
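The quantification of mutant binding described above can be sketched as a window average on each sensorgram followed by a ratio to the wild-type value. The sketch assumes that the 10-s window begins 20 s after the start of dissociation (the text does not say whether the window starts or is centred there). The function names and the toy sensorgrams are hypothetical.

```python
def window_mean(times, responses, start, length):
    """Mean response over the half-open time window [start, start + length)."""
    values = [r for t, r in zip(times, responses) if start <= t < start + length]
    if not values:
        raise ValueError("no data points fall inside the averaging window")
    return sum(values) / len(values)


def relative_binding(variant, wild_type, dissociation_start,
                     offset=20.0, length=10.0):
    """Binding level of a variant relative to wild type.

    `variant` and `wild_type` are (times, responses) pairs taken from
    double-referenced sensorgrams; times are in seconds."""
    start = dissociation_start + offset
    return (window_mean(variant[0], variant[1], start, length)
            / window_mean(wild_type[0], wild_type[1], start, length))


if __name__ == "__main__":
    times = [float(t) for t in range(200)]
    wild_type = (times, [50.0] * 200)   # toy response units
    mutant = (times, [5.0] * 200)
    print(relative_binding(mutant, wild_type, dissociation_start=100.0))
```

Averaging a short window early in the dissociation phase avoids the association phase, where the text notes that residual bulk effects mask the weak binding of the mutants.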
Pre-crRNA processing assay
Pre-crRNA processing was achieved using an in vitro transcription and
translation system, as described previously11. A pCas and pCRISPR plasmid
mixture expressing Cas12m2 and a CRISPR array (containing four
spacers) was incubated at 29 °C for 5 h in a thermocycler. Total RNA was
then extracted using a Direct-zol RNA MiniPrep kit, according to the
manufacturer’s instructions (Zymo Research). For northern blotting
analysis, 5 μg of each RNA sample obtained from cell-free transcription–translation
was fractionated on an 8% polyacrylamide gel (7 M
urea) at 300 V for 140 min. RNA was transferred onto Hybond-XL membranes
(Amersham Hybond-XL, GE Healthcare) using an electroblotter
at 50 V for 1 h at 4 °C (Tank-Elektroblotter Web M, PerfectBlue)
and cross-linked with UV light at 0.12 J (UV lamp T8C, 254 nm, 8 W).
Hybridization proceeded overnight in 17 ml of Roti-Hybri-Quick buffer,
with 5 µl of [γ-³²P]ATP end-labeled oligodeoxyribonucleotides at 42 °C.
The membrane was visualized using a Phosphorimager (Typhoon FLA
7000, GE Healthcare).
Reporting summary
Further information on research design is available in the Nature Portfolio
Reporting Summary linked to this article.
Data availability
The structural models have been deposited in the PDB under accession
codes PDB 8HHL (full R-loop state), PDB 8HHM (intermediate state)
and PDB 8HIO (binary complex). The cryo-EM density maps have been
deposited in the EMDB under accession codes EMD-34803 (full R-loop
state), EMD-34804 (intermediate state) and EMD-34824 (binary complex).
Source data are provided with this paper.
References
31. Zheng, S. Q. et al. MotionCor2: anisotropic correction of
beam-induced motion for improved cryo-electron microscopy.
Nat. Methods 14, 331–332 (2017).
32. Zivanov, J. et al. New tools for automated high-resolution cryo-EM
structure determination in RELION-3. eLife 7, e42166 (2018).
33. Punjani, A., Rubinstein, J. L., Fleet, D. J. & Brubaker, M.
cryoSPARC: algorithms for rapid unsupervised cryo-EM structure
determination. Nat. Methods 14, 290–296 (2017).
34. Punjani, A. & Fleet, D. J. 3D variability analysis: resolving
continuous flexibility and discrete heterogeneity from single
particle cryo-EM. J. Struct. Biol. 213, 107702 (2021).
35. Punjani, A., Zhang, H. & Fleet, D. J. Non-uniform refinement:
adaptive regularization improves single-particle cryo-EM
reconstruction. Nat. Methods 17, 1214–1221 (2020).
36. Rosenthal, P. B. & Henderson, R. Optimal determination of particle
orientation, absolute hand, and contrast loss in single-particle
electron cryomicroscopy. J. Mol. Biol. 333, 721–745 (2003).
37. Jumper, J. et al. Highly accurate protein structure prediction with
AlphaFold. Nature 596, 583–589 (2021).
38. Emsley, P. & Cowtan, K. Coot: model-building tools for molecular
graphics. Acta Crystallogr. D 60, 2126–2132 (2004).
39. Afonine, P. V. et al. Real-space refinement in PHENIX for
cryo-EM and crystallography. Acta Crystallogr. D Struct. Biol. 74,
531–544 (2018).
40. Chen, V. B. et al. MolProbity: all-atom structure validation
for macromolecular crystallography. Acta Crystallogr. D 66,
12–21 (2010).
41. Pettersen, E. F. et al. UCSF ChimeraX: structure visualization
for researchers, educators, and developers. Protein Sci. 30,
70–82 (2021).
42. Söding, J. Protein homology detection by HMM–HMM
comparison. Bioinformatics 21, 951–960 (2005).
Acknowledgements
We thank C. Liao, A. Migur and C.L. Beisel (HZI, Würzburg, Germany)
for contributing to the guide-processing analysis. J.H.G.L. and C.L. are
supported by the gravitation program CancerGenomiCs.nl from the
Netherlands Organisation for Scientific Research, part of the Oncode
Institute, which is partly financed by the Dutch Cancer Society.
O.N. was supported by AMED (grant numbers JP19am0401005 and
JP21km0908001), the Platform Project for Supporting Drug Discovery
and Life Science Research (Basis for Supporting Innovative Drug
Discovery and Life Science Research) from AMED under grant number
JP22am0101115 and the Cabinet Office, Government of Japan, Public/
Private R&D Investment Strategic Expansion Program (grant number
JPJ008000).
Author contributions
S.N.O. performed structural analyses with assistance from R.N.,
H.H., T.K., Y.K. and Y.I.; S.N.O., R.N., H.H. and Y.I. built the model
and performed structural refinement; W.Y.W., C.S., R.V.W., C.L. and
J.H.G.L. performed in vitro and in vivo biological experiments; J.v.d.O.
conceived the project; S.N.O., R.N. and O.N. wrote the paper with help
from all authors; O.N. supervised the research.
Competing interests
O.N. is a cofounder, board member and scientific advisor of Curreio.
Two patent applications have been filed related to this work, with
J.v.d.O. and W.Y.W. as inventors. J.v.d.O. is an advisor for NTrans
Technologies, Scope Biosciences and Hudson River Biotechnology.
The other authors declare no competing interests.
Additional information
Extended data is available for this paper at
https://doi.org/10.1038/s41594-023-01042-3.
Supplementary information The online version contains supplementary
material available at https://doi.org/10.1038/s41594-023-01042-3.
Correspondence and requests for materials should be addressed to
John van der Oost or Osamu Nureki.
Peer review information Nature Structural & Molecular Biology
thanks Yong-Sam Kim and the other, anonymous, reviewer(s) for their
contribution to the peer review of this work. Primary Handling Editor:
Dimitris Typas, in collaboration with the Nature Structural & Molecular
Biology team. Peer reviewer reports are available.
Reprints and permissions information is available at
www.nature.com/reprints.
Extended Data Fig. 1 | Single-particle cryo-EM analysis of the Cas12m2–
crRNA–target DNA ternary complex. (a) Size-exclusion chromatography
profile of the Cas12m2–crRNA–target DNA ternary complex. The peak fraction
was used for cryo-EM analysis. (b) A representative cryo-EM image of the
Cas12m2–crRNA–target DNA complex, recorded on a 300-kV Titan Krios
microscope with a K3 camera. (c) Single-particle cryo-EM image processing
workflow. (d) Fourier shell correlation (FSC) curve for the 3D reconstruction,
with the gold-standard cut-off (FSC = 0.143) marked with a black dotted line.
(e and f) Local-resolution cryo-EM density maps of State I (full R-loop state) (e)
and State II (intermediate state) (f).
Extended Data Fig. 2 | Overall, domain, and nucleic acid structures of the
Cas12m2–crRNA–target DNA ternary complex. (a) Surface representation of
the Cas12m2–crRNA–target DNA ternary complex. (b) The REC lobe structure
of Cas12m2. The WED domain comprises an oligonucleotide/oligosaccharide-binding
fold (the conserved β strands (blue) and α-helix (red) are numbered).
The two additional α-helices are identified with green numbers. The REC1 and
REC2 domains form a distinctive sickle-like architecture. (c) The NUC lobe
structure of Cas12m2 and that of Cas12f (from an uncultured archaeon) (PDB ID:
7C7L). The NUC lobes consist of the RuvC and TNB domains. The RuvC domains
comprise the RNase H fold (the conserved β strands (blue) and α-helices (red) are
numbered). (d) Zinc binding site in the TNB domain. (e) The cryo-EM densities
for the RNA–DNA heteroduplex, PAM duplex, and rehybridized DNA duplex are
shown as a blue mesh. (f) Structure of the crRNA scaffold.
Extended Data Fig. 3 | Nucleic acid recognition and in vitro DNA binding profile. Schematic of nucleic acid recognition by Cas12m2. The residues that interact with
nucleic acids through their main chains are shown in parentheses. The disordered region is enclosed in a dashed box.
Extended Data Fig. 4 | In vitro surface plasmon resonance analysis. (a) SEC
chromatograms of the final purification step of the Cas12m2 proteins analyzed
in this study. Arrows indicate the loading of each protein onto the column.
Elution fractions that were pooled for protein analysis are highlighted in orange.
Triple: R111A/R112A/R126A triple mutant. (b) SDS-PAGE analysis of SEC-purified
proteins. Two micrograms of protein were loaded per well. The theoretical
molecular mass of the variants is ~68 kDa. (c) SPR sensorgrams of 500 nM wild
type Cas12m2 and variants bound to PAM-SCNR RNA, binding to a 50-base pair
dsDNA oligonucleotide containing a target site. (d) SPR sensorgram of 500 nM
wild type Cas12m2 binding to the DNA target site with PAM-SCNR RNA (black)
and non-target RNA (gray). With the non-target RNA, Cas12m2 rapidly binds but
is unable to form a stable complex and dissociates quickly. With the target RNA, a
stable complex is formed. (e) Fit of a two-state binding model (black lines) to SPR
sensorgrams of wild type Cas12m with PAM-SCNR RNA binding to the target site
(colored curves) returns an estimated KD of 2.7 nM.
Extended Data Fig. 5 | Multiple sequence alignment of Cas12m orthologs.
Mmu, Cas12m2 from Mycolicibacterium mucogenicum (WP_061006603.1);
Msp, Cas12m2 from Mycobacterium sp. SM1 (WP_213445835.1); Myc, Cas12m2
from Mycobacterium (WP_036473531.1); Mtr, Cas12m2 from Mycobacterium
triplex (CDO91315.1). The secondary structures of Cas12m2 are indicated above
the sequences. Key residues of Cas12m2 are marked below the sequences by
triangles. The figure was prepared using Clustal Omega (http://www.ebi.ac.uk/Tools/msa/clustalo)
and ESPript 3 (http://espript.ibcp.fr/ESPript/ESPript).
Extended Data Fig. 6 | Non-canonical RuvC active site. (a and c) Structures
of the RuvC active site of Cas12i2 (a) (Cas12i2 from an unidentified source)
(PDB ID: 6LTU) and Cas12m2 (c). While Cas12i2 coordinates two magnesium
ions on either side of the scissile phosphate, Cas12m2 coordinates only one
magnesium ion, which corresponds to magnesium ion A in Cas12i2. (b and d)
Schematics of the RuvC active site representing the canonical DED-type (b) and
non-canonical HDD-type (d). In the canonical DED-type RuvC active site, the first
magnesium ion (A) activates the nucleophilic water molecule, while the second
magnesium ion (B) stabilizes the transition state, resulting in two divalent
cation-dependent DNA hydrolysis (b). In the non-canonical HDD-type RuvC active site,
no density corresponding to magnesium ion B was observed, resulting in the loss
of cleavage activity for the target DNA of Cas12m2 (d).
Extended Data Fig. 7 | UPGMA dendrogram showing the similarity between
different families of type V effectors. The dendrogram was built using the
UPGMA (unweighted pair group method with arithmetic mean) method and is
based on HHalign matrix42 scores calculated for all families against all pairwise
alignments with a length coverage of 33% or more. The alignments for the
respective families were taken from a previous report2, except for the Cas12m
family, for which an updated alignment (104 proteins) was used. The dashed
rectangle corresponds to a tree depth (D) between 1.5 and 2 (D = 2 roughly
corresponds to the pairwise HHsearch similarity score of exp(−2D) ≈ 0.02 relative
to the self-score) and reflects the D where the subtype assignment is uncertain
and requires additional consideration.
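The relation between tree depth and similarity score, and the UPGMA clustering named in this caption, can be illustrated with a short sketch. It reads the depth relation as s = exp(−2D), the sign that reproduces the quoted value of about 0.02 at D = 2. How the HHalign scores were converted into distances is not stated in the caption, so the conversion function, the function names and the toy distance matrix below are assumptions for illustration only.

```python
import math
from itertools import combinations


def score_to_depth(relative_score):
    """Invert s = exp(-2 * D): D = -ln(s) / 2 (assumed reading of the caption)."""
    return -0.5 * math.log(relative_score)


def upgma(labels, distance):
    """Run UPGMA and return the merge events as (cluster_a, cluster_b, height).

    `distance` maps frozenset({x, y}) of two labels to their distance.
    Clusters are frozensets of labels; a merge height is half the
    average inter-cluster distance at the time of merging."""
    clusters = [frozenset([label]) for label in labels]
    dist = {frozenset((frozenset([a]), frozenset([b]))): distance[frozenset((a, b))]
            for a, b in combinations(labels, 2)}
    merges = []
    while len(clusters) > 1:
        # Pick the closest pair of current clusters.
        a, b = min(combinations(clusters, 2), key=lambda p: dist[frozenset(p)])
        merged = a | b
        merges.append((a, b, dist[frozenset((a, b))] / 2.0))
        rest = [c for c in clusters if c not in (a, b)]
        for c in rest:
            # Size-weighted average of the distances to the merged members.
            d_ac = dist[frozenset((a, c))]
            d_bc = dist[frozenset((b, c))]
            dist[frozenset((merged, c))] = (len(a) * d_ac + len(b) * d_bc) / len(merged)
        clusters = rest + [merged]
    return merges


if __name__ == "__main__":
    print(round(math.exp(-2 * 2.0), 3), round(score_to_depth(0.02), 2))
    toy = {frozenset("AB"): 2.0, frozenset("AC"): 6.0, frozenset("BC"): 6.0}
    for a, b, height in upgma(["A", "B", "C"], toy):
        print(sorted(a), sorted(b), height)
```

In the toy example A and B merge first at a height of 1.0 and are then joined to C at a height of 3.0.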
Extended Data Fig. 8 | Pre-crRNA processing mechanism of Cas12a, Cas12i,
and Cas12m2. Overall structures (a) and close-up views of the 5′ end of crRNA
(b). Cas12a (from Francisella novicida) (PDB ID: 5NG6) and Cas12i (from
Lachnospiraceae bacterium ND2006) (PDB ID: 7D8C) process their pre-crRNAs at
the WED domain, and the 5′ end of the crRNA is surrounded by the WED domain.
In contrast, the 5′ end of the Cas12m2 crRNA is projected away from the WED
domain and exposed to the solvent. Target DNA and spacer region of crRNA
are omitted for clarity in the Cas12m2–crRNA–target DNA ternary complex
structure.
Extended Data Fig. 9 | Single-particle cryo-EM analysis of the Cas12m2–
crRNA binary complex. (a) Pre-crRNA processing activities of wild type (WT),
H269A/R270A (dWED), and D485A (dRuvC) in TXTL. A plasmid expressing the
Cas12m2 CRISPR array containing four spacers was incubated with a plasmid
expressing WT, dWED, or dRuvC Cas12m2 in TXTL. The cleavage products were
visualized by northern blotting, using a probe that binds to the third spacer of
array A (yellow, shown as an asterisk). Array B is similar to array A; only the order
of spacers is shifted. (b) Size-exclusion chromatography profile of the Cas12m2–
crRNA binary complex. The peak fraction was used for cryo-EM analysis.
(c) Single-particle cryo-EM image processing workflow. (d) FSC curve for the 3D
reconstruction, with the gold-standard cut-off (FSC = 0.143) marked with a black
dotted line. (e) Cryo-EM density map of the Cas12m2–crRNA binary complex.
The additional three-helix-like density is indicated by a circle.
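Panel d reports resolution by the gold-standard FSC = 0.143 criterion. As a rough illustration of how a resolution value is read off such a curve, the sketch below linearly interpolates the spatial frequency at which a curve first drops below the threshold and converts it to ångströms. The function name and the sample curve are hypothetical and are not data from this study; image-processing packages compute this internally and may use different interpolation conventions.

```python
def resolution_at_threshold(freqs, fsc, threshold=0.143):
    """Return the resolution (in Å) where an FSC curve first falls below threshold.

    freqs: spatial frequencies in 1/Å, in increasing order.
    fsc:   FSC values at those frequencies.
    Returns None if the curve never crosses the threshold.
    """
    for i in range(1, len(freqs)):
        above, below = fsc[i - 1], fsc[i]
        if above >= threshold > below:
            # Linear interpolation between the two bracketing points.
            t = (above - threshold) / (above - below)
            f_cross = freqs[i - 1] + t * (freqs[i] - freqs[i - 1])
            if f_cross <= 0:
                return None  # a crossing at zero frequency has no finite resolution
            return 1.0 / f_cross
    return None


# Made-up curve for illustration only.
example_freqs = [0.1, 0.2, 0.3, 0.4]
example_fsc = [1.0, 0.9, 0.243, 0.043]
example_resolution = resolution_at_threshold(example_freqs, example_fsc)
```

In this made-up curve the threshold is crossed halfway between 0.3 and 0.4 Å⁻¹, so the interpolated frequency is 0.35 Å⁻¹ and the reported resolution is its reciprocal.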
  • 1. Nature Structural & Molecular Biology | Volume 30 | August 2023 | 1172–1182 1172 naturestructural&molecularbiology Article https://doi.org/10.1038/s41594-023-01042-3 Mechanisticandevolutionaryinsightsintoa typeV-MCRISPR–Caseffectorenzyme Satoshi N. Omura1 , Ryoya Nakagawa1 , Christian Südfeld2 , Ricardo Villegas Warren 2 , Wen Y. Wu2 , Hisato Hirano1 , Charlie Laffeber3 , Tsukasa Kusakizako 1 , Yoshiaki Kise1,4 , Joyce H. G. Lebbink 3,5 , Yuzuru Itoh1 , John van der Oost 2 & Osamu Nureki 1 RNA-guidedtypeVCRISPR–Cas12effectorsprovideadaptiveimmunity againstmobilegeneticelements(MGEs)inbacteriaandarchaea.Among diverseCas12enzymes,therecentlyidentifiedCas12m2(CRISPR–Castype V-M)ishighlycompactandhasauniqueRuvCactivesite.Althoughthe non-canonicalRuvCtriaddoesnotpermitdsDNAcleavage,Cas12m2still protectsagainstinvadingMGEsthroughtranscriptionalsilencingbystrong DNAbinding.However,themolecularmechanismofRNA-guidedgenome inactivationbyCas12m2remainsunknown.Herewereportcryo-electron microscopystructuresoftwostatesofCas12m2–CRISPRRNA(crRNA)– targetDNAternarycomplexesandtheCas12m2–crRNAbinarycomplex, revealingstructuraldynamicsduringcrRNA–targetDNAheteroduplex formation.Thestructuresindicatethatthenon-targetDNAstrandistightly boundtoauniquearginine-richclusterintherecognition(REC)domains andthenon-canonicalactivesiteintheRuvCdomain,ensuringstrong DNA-bindingaffinityofCas12m2.Furthermore,astructuralcomparisonof Cas12m2withTnpB,aputativeancestorofCas12enzymes,suggeststhat theinteractionofthecharacteristiccoiled-coilREC2insertionwiththe protospacer-adjacentmotif-distalregionoftheheteroduplexiscrucialfor Cas12m2toengageinadaptiveimmunity.Collectively,ourfindingsimprove mechanisticunderstandingofdiversetypeVCRISPR–Caseffectorsand provideinsightsintotheevolutionofTnpBtoCas12enzymes. CRISPR–Cas systems provide adaptive immunity against mobile genetic elements (MGEs) in bacteria and archaea and are divided into two classes (classes 1 and 2) and six types (types I–VI)1,2 . 
The class 2 systems include types II, V and VI, in which Cas9, Cas12 and Cas13, respectively, function as effector enzymes responsible for interference against MGEs. Recent studies have identified a dozen functionally divergent type V Cas12 effector proteins3–11 . Cas12 enzymes associate with a CRISPR RNA (crRNA) or dual RNA guides (crRNA and trans-activating crRNA) and cleave double-stranded DNA (dsDNA) targets flanked by a protospacer-adjacent motif (PAM), using a single RuvC nuclease domain. Aside from the RuvC domain, Cas12 effector proteins share little sequence similarity, reflecting their diverse biochemical features. Type V Cas12 effectors report- edly evolved from the IS200/IS605 (IS stands for bacterial insertion sequences) superfamily transposon-associated TnpB protein2,12,13 . Although the sequences of relatively large Cas12 proteins, such as Received: 7 February 2023 Accepted: 22 June 2023 Published online: 17 July 2023 Check for updates 1 Department of Biological Sciences, Graduate School of Science, the University of Tokyo, Tokyo, Japan. 2 Laboratory of Microbiology, Wageningen University and Research, Wageningen, the Netherlands. 3 Department of Molecular Genetics, Oncode Institute, Erasmus MC Cancer Institute, Erasmus University Medical Center, Rotterdam, the Netherlands. 4 Curreio, the University of Tokyo, Tokyo, Japan. 5 Department of Radiotherapy, Erasmus University Medical Center, Rotterdam, the Netherlands. e-mail: john.vanderoost@wur.nl; nureki@bs.s.u-tokyo.ac.jp
  • 2. Nature Structural & Molecular Biology | Volume 30 | August 2023 | 1172–1182 1173 Article https://doi.org/10.1038/s41594-023-01042-3 evolution from TnpB to the larger Cas12 enzymes2 . Among the type V-U families, the U2–U4 clusters are classified into the Cas12f sub- type, in which the effector proteins function as a dimer, while the Cas12a, have weak similarities to the TnpB sequence, five recently identified small Cas12 variants (V-U1 to V-U5) show greater sequence similarities to TnpB and are thought to represent the early stage of RNA–DNA heteroduplex PAM duplex Rehybridized DNA duplex 5′ 5′ 5′ PAM TS TS 3′ 3′ 3′ NTS NTS NTS PK Guide (spacer) Loop Stem e crRNA Target DNA PAM Disordered region −1* −1 −9 −1 −9 −24 −12 −13 −23 −17 −36 −29 −26 PAM −10* 5′ 3′ 3′ 5′ 17 18 1 27 3′ NTS TS 1 20 17 Guide (spacer) Loop Stem PK RNA–DNA heteroduplex 1* 17* 18* 26* 5′ d 5′ TS Disordered region Disordered region 3′ REC2 REC1 WED NTS crRNA 5′ 5′ 3′ 5′ PAM crRNA TS Disordered region 180° 3′ RuvC REC2 REC1 WED NTS TNB c 5′ 3′ PAM crRNA crRNA 180° RuvC RuvC REC2 REC2 REC1 REC1 WED WED TNB b 19 53 121 190 302 313 541 577 596 REC1 REC1 WED RuvC REC2 TNB 1 REC lobe NUC lobe Linker a Fig.1|Cryo-EMstructureoftheCas12m2–crRNA–targetDNAternary complex.a,DomainstructureofCas12m2.b,c,Cryo-EMmaps(b)andstructural models(c)oftheCas12m2–crRNA–targetDNAternarycomplex.Zincand magnesiumionsintheTNBandRuvCdomainsareshownasgrayspheres. d,SchematicofthecrRNAandtargetDNA.Thedisorderedregionsareenclosed bydashedboxes.e,StructureofthecrRNA-and-targetDNAcomplex.
  • 3. Nature Structural & Molecular Biology | Volume 30 | August 2023 | 1172–1182 1174 Article https://doi.org/10.1038/s41594-023-01042-3 Table 1 | Data collection, processing, model refinement and validation Data collection and processing Sample Cas12m2–crRNA–DNA Full R-loop Cas12m2–crRNA–DNA Intermediate Cas12m2–crRNA EMDB ID EMD-34803 EMD-34804 EMD-34824 PDB ID 8HHL 8HHM 8HIO Microscope Titan Krios G3i Detector Gatan K3 camera Magnification 105,000 Voltage (kV) 300 Data-collection software EPU Image-processing package cryoSPARC Electron exposure (e– /Å2 ) 55 51 Defocus range (μm) −0.8 to −1.6 Pixel size (Å) 0.83 Symmetry imposed C1 Number of movies 3,570 8,004 Initial particle images (no.) 5,067,096 6,034,518 Final particle images (no.) 182,116 180,921 115,501 Map resolution (Å) 2.9 3.1 3.7 FSC threshold 0.143 Map sharpening B factor (Å2 ) −70 −80 −170 3DFSC analysis Global resolution (Å) 2.94 3.14 3.91 Sphericity 0.900 0.722 0.809 Model building and refinement Model composition Protein atoms 4,630 4,036 5,660 Nucleic acid atoms 2,337 1,497 1,019 Metal ions 4 2 2 Model refinement Model-Map CC (CCmask/CCbox/CCpeaks/CCvolume) 0.83/0.77/0.71/0.83 0.83/0.81/0.72/0.83 0.75/0.74/0.60/0.74   Resolution (Å) by model-to-map FSC, threshold 0.50 (masked/unmasked) 3.01/3.10 3.28/3.36 4.15/4.25 Average B factor (Å2 ) (protein/nucleotide/metal ion) 90.73/107.33/96.40 107.59/129.37/158.47 155.66/115.67/156.69 r.m.s. 
deviations Bond lengths (Å) 0.004 0.004 0.005 Bond angles (°) 0.485 0.510 0.622 Validation MolProbity score 1.64 1.89 1.68 CaBLAM outliers (%) 0.85 1.39 0.86 Clashscore 6.49 6.39 10.33 Rotamer outliers (%) 2.29 3.10 0.0 Cβ outliers (%) 0 0 0 EMRinger score 3.37 2.37 0.93 Ramachandran plot Favored (%) 98.64 97.06 97.17 Allowed (%) 1.36 2.94 2.83 Outliers (%) 0.00 0.00 0.00 EMDB, Electron Microscopy Data Bank; FSC, Fourier shell correlation; PDB, Protein Data Bank; CaBLAM, C-Alpha Based Low-resolution Annotation Method; EMRinger score, an extension of the X-ray crystallography validation tool Ringer in cryo-EM.
  • 4. Nature Structural Molecular Biology | Volume 30 | August 2023 | 1172–1182 1175 Article https://doi.org/10.1038/s41594-023-01042-3 U5 cluster is classified as the Cas12k subtype, in which the effector proteins are catalytically inactive. A recent study demonstrated that the type V-M (formerly type V-U1, clade 2) Cas12m2 from Mycolicibacterium mucogenicum (here- after referred to as Cas12m2 for simplicity) exhibits RNA-guided dsDNA interference activity11 . Cas12m2 consists of 596 amino acid residues and is much smaller than other Cas12 enzymes, except for Cas12f7 . Cas12m2 associates with a single crRNA to recognize dsDNA targets containing a 5′-TTN-3′ PAM. Interestingly, Cas12m2 lacks target dsDNA-cleavage activity, probably due to the non-canonical His–Asp–Asp(HDD)catalyticresiduesintheRuvCdomain,ratherthan the canonical, DNA-cleavable active site Asp–Glu–Asp (DED). Hence, instead of target cleavage, Cas12m2 interferes with invading MGEs through transcriptional silencing by strong DNA binding11 . However, becauseofthelackofstructuralinformation,themolecularmechanism of the miniature Cas12m2 and the evolutionary relationship between TnpBandCas12m2remainunknown. Here, we report multiple cryo-electron microscopy (cryo-EM) structural states of Cas12m2–crRNA–target DNA ternary complexes and the Cas12m2–crRNA binary complex at overall resolutions of 2.9–3.7 Å.ThestructuresrevealauniquetargetdsDNA-bindingmecha- nismofCas12m2,clearlyexplainingwhyCas12m2iscapableoftightly binding target DNA but unable to cleave it. Furthermore, a structural comparison of Cas12m2 with TnpB illuminates their great structural similarities and diversities, providing insights into the evolutionary pathfromTnpBtoCas12enzymes. 
Results Cryo-EMstructureoftheCas12m2–crRNA–targetDNAcomplex To understand the molecular mechanism of Cas12m2, we analyzed its structure in complex with a 56-nucleotide crRNA (including a 20-nucleotide guide) and its 36-base pair (bp) dsDNA target with a 5′-TTG-3′PAMbycryo-EM.Usingthree-dimensional(3D)classification andrefinement,wediscoveredtwoconformationalpopulationsofthe Cas12m2 ternary complex at overall resolutions of 2.9 Å (state I) and 3.1 Å(stateII)(Fig.1a–c,Table1,ExtendedDataFig.1andSupplementary Table1).Inbothstates,Cas12m2bindsthecrRNAanditsdsDNAtarget at a molar ratio of 1:1:1. Although the full-length crRNA–target DNA heteroduplexisvisibleinstateI,ashorterheteroduplex(approximately 12 bp)isobservedinstateII.Forsimplicity,wedescribethestructural features of state I unless otherwise specified. The present structures revealed that Cas12m2 adopts a bilobed architecture composed of REC and nuclease (NUC) lobes (Fig. 1a–c and Extended Data Fig. 2a). The REC lobe consists of wedge (WED), REC1 and REC2 domains. The WED domain adopts an oligonucleo- tide–oligosaccharide-binding fold similar to those of other Cas12 enzymes14–19 (Extended Data Fig. 2b). The REC1 domain consists of three α-helices, and the REC2 domain has a characteristic coiled-coil structure, forming a distinctive architecture resembling a ‘sickle’ (ExtendedDataFig.2b).TheNUClobecontainsRuvCandtargetnucleic acid-binding (TNB) domains. The RuvC domain has an RNase H fold, and the His317, Asp485 and Asp579 residues are located at positions similar to those of the well-conserved catalytic Asp, Glu and Asp resi- duesinmostotherCas12enzymes14–19 (ExtendedDataFig.2c).TheTNB domain is inserted into the RuvC domain and contains an HCCC-type zincfinger,inwhichazincioniscoordinatedbyHis549,Cys552,Cys569 and Cys572 (Extended Data Fig. 2d). The guide RNA–target DNA het- eroduplex is accommodated within the positively charged central channelformedbytheRECandNUClobes(Fig.1candExtendedData Fig. 2a). 
The displaced non-target DNA strand (NTS) is visible in our density map and is bound to unique pockets located in the REC and RuvCdomains(discussedbelow). crRNAarchitectureandrecognition The crRNA consists of the 20-nucleotide guide segment (G1 to C20) and the 36-nucleotide repeat-derived scaffold (G(−36) to C(−1)) (Fig.1d).NucleotidesG1toU17inthecrRNAanddA1todC17inthetar- getDNAstrand(TS)forma17-bpguideRNA–targetDNAheteroduplex (Fig.1d,eandExtendedDataFig.2e).Unexpectedly,nucleotidesdG(−3) todC(−1)intheTSdonotformbasepairswithG18toC20inthecrRNA and instead rehybridize with dG18* to dC20* in the NTS, suggesting that the 17-bp guide RNA–target DNA heteroduplex represents the optimallengthforCas12m2-mediatedDNArecognition(todifferenti- atebetweenNTSandTSinthisstudy,anasteriskisusedtodenotethe NTS; Fig. 1d,e and Extended Data Fig. 2e). The crRNA scaffold comprises a pseudoknot (PK) and stem and loopregions.The5′endofthecrRNA(G(−36)toU(−30))isdisordered (Fig. 1d,e and Extended Data Fig. 2f). Specifically, G(−23) forms a non-canonicalbasepairwithA(−2),andC(−22)toC(−17)formcanoni- cal Watson–Crick base pairs with G(−8) to G(−3) to compose the PK structure.ThePKcoaxiallystackswiththestemtoformacontinuous helix.G(−9)andA(−11)intheloopformhydrogenbondswiththeribose moieties of G(−16) and G(−14), respectively, thereby stabilizing the crRNAscaffold(ExtendedDataFig.3). ThecrRNAscaffoldisaccommodatedwithinthegrooveformedby theWEDandRuvCdomainsandismainlyrecognizedbythesedomains (Fig.2a,bandExtendedDataFig.3).Notably,thePKandstemarebound by Arg237 and Arg241 through sugar–phosphate backbone interac- tions,whilethenucleobaseoftheflipped-outA(−24)issandwichedby His12 and Arg245 and its ribose moiety hydrogen bonds with Arg245 (Figs. 1d and 2c). 
The G(−12) in the loop forms a hydrogen bond and stackinginteractionswithArg447andLys448,therebystabilizingthe crRNAloopregion(Fig.1dandExtendedDataFig.3).Moreover,thefirst nucleotideC(−1)andthenon-canonicalG(−23):A(−2)basepairexten- sively interact with the WED domain in a base-specific manner. The nucleobase C(−1) forms a base-specific hydrogen bond and stacking interactionswiththeconservedHis269andArg270,respectively,while G(−23)stackswithHis269,stabilizingthenon-canonicalG(−23):A(−2) basepair(Fig.2d).Altogether,thepresentstructurerevealsthemecha- nismofcrRNAscaffoldrecognitionbythecompactCas12m2. TargetDNAarchitectureandrecognition TheguideRNA–targetDNAheteroduplexisaccommodatedwithinthe positivelychargedcentralchannelformedbytheREC1,REC2andRuvC domains and is recognized by the Cas12m2 protein through interac- tionswithitssugar–phosphatebackbone(Fig.2a,bandExtendedData Fig.3).Met4intheWEDdomainstackswiththefirstG1:dC17basepair intheheteroduplex,whileArg301intheWEDdomaininteractswiththe Fig.2|crRNAandtargetDNArecognition. a,RecognitionsitesofthecrRNA scaffoldandtargetDNA.b,ElectrostaticsurfacepotentialoftheCas12m2– crRNA–targetDNAternarycomplex.ThecrRNA–targetDNAheteroduplexis accommodatedwithinapositivelychargedcentralchannelformedbyWED, REC1,REC2andRuvCdomains,andthePAMduplexissandwichedbytheWED andREC1domains.c–g,Recognitionofthestem(c),thePK(d),thecrRNA–target DNAheteroduplex(e),therehybridizedDNAduplex(f)andthePAMduplex (g).Hydrogenbondsaredepictedwithgreendashedlines.h,DNAbindingof Cas12m2mutantsinvitro,measuredbySPRspectroscopy,relativetowildtype. Valuesshownarethemean ± s.e.m.ofthreeindependentexperiments. StatisticalanalysiswascarriedoutusingTukey’sposthoctestincaseofa significanttwo-wayANOVAresult.Significantdifferencesrelativetowild-type Cas12m2areindicatedbyasterisks.***P 0.001.Triple,Cas12m2R111A/R112A/R126A triplemutant.i,UniqueArg-richpocketformedbytheREC1andREC2domains. 
Hydrogenbondsaredepictedwithgreendashedlines.j,Close-upviewofthe non-canonicalRuvCactivesite.ThemagnesiumioncoordinatedwiththeNTSis shownasagraysphere.Thecryo-EMdensityfortheRuvCconservedresidues, themagnesiumionandthedisplacedNTS(dG16*anddT17*)isshownasablue mesh.k,Schematicofthenon-canonicalRuvCactivesite.
  • 5. Nature Structural Molecular Biology | Volume 30 | August 2023 | 1172–1182 1176 Article https://doi.org/10.1038/s41594-023-01042-3 REC2 REC1 P O O− Mg2+ A O O Base Base O HN N O O O O O O O 3′ 5′ D485 H317 D579 c b a WED RuvC RuvC PAM REC1 REC1 REC1 WED WED WED TNB TNB REC2 REC2 REC2 180° d e c d f j i g e PAM PAM RuvC REC2 WED RuvC REC1 WED RuvC f g h i j k 5′ 5′ 5′ 5′ TS TS TS NTS NTS R126 R173 R111 R112 dT6 dG7 dA8 dG9 R245 R237 R241 A(−24) G(−3) C(−6) C(−7) Helix α3 H12 L502 P503 dG19* dC(−2) dC(−1) dA1 dG18* dG17* H269 R270 C(−1) A(−2) G(−23) R301 M4 G1 G2 dC16 dC17 dC18 H317 D485 D579 Mg2+ dG16* dT17* 5′ NTS 3′ 3′ 3′ 3′ 3′ 3′ crRNA crRNA *** *** *** *** *** *** *** *** *** Y 1 4 1 A W 1 5 2 A C a s 1 2 m 2 N 1 5 6 A R 1 1 1 A Q 1 9 5 A R 1 1 2 A R 1 2 6 A T r i p l e Q 1 9 7 A Relative binding 0 0.2 0.4 0.6 0.8 1.0 1.2 Mg2+ B WED REC1 N156 W152 Q195 Q197 Y141 dA19 dA20 dT(−2*) dT(−3*)
  • 6. Nature Structural Molecular Biology | Volume 30 | August 2023 | 1172–1182 1177 Article https://doi.org/10.1038/s41594-023-01042-3 backbonephosphategroupofdC18,therebyfacilitatingheteroduplex formationasobservedinotherCas12enzymes20 (Fig.2e).Leu502and Pro503 in the RuvC domain form hydrophobic interactions with the dG18* and dC(−1) bases, respectively, stabilizing reformation of the DNA duplex downstream of the heteroduplex (Fig. 2f). ThePAM-containingDNAduplex(referredtoasthePAMduplex) isdistortedintoanarrowminorgrooveandissandwichedbytheREC1 andWEDdomains(Fig.2a).ThedT(−3*)anddT(−2*)nucleotidesform hydrophobic interactions with Trp152 and Tyr141 and base-specific hydrogenbondswithGln197andAsn156,respectively(Fig.2g).Tyr141 also interacts with the phosphate backbone of dT(−3*) (Fig. 2g). The dA19anddA20nucleobases,whichbasepairwithdT(−2*)anddT(−3*), form hydrogen bonds with Gln195 and Gln197, respectively (Fig. 2g). Substitution mutagenesis of Cas12m2 confirmed the importance of Tyr141,Trp152,Asn156,Gln195andGln197forDNAbinding,asalanine substitution of any of these residues strongly affected the ability of Cas12m2 to interact with its DNA target in vitro (Fig. 2h and Extended Data Fig. 4). These structural and biochemical features explain the requirement of the 5′-TTN-3′ PAM sequence for Cas12m2-mediated targetDNAbinding. The displaced NTS passes through the groove formed by the REC1 and REC2 domains toward the groove formed by the RuvC and TNBdomains(Fig.2a,b).Notably,thecentralregionofthenegatively charged backbone of the NTS is bound to an Arg-rich pocket in the REC1andREC2domainsandinteractsextensivelywithArg111,Arg112, Arg126 and Arg173 (Fig. 2i). Such an Arg-rich pocket has not been observed in any other Cas12 enzymes, even though these Arg resi- dues are highly conserved among Cas12m family enzymes (Extended Data Fig. 5). Cas12m2 mutants with alanine substitutions of Arg111, Arg112 and Arg126 showed reduced binding to target DNA in vitro (Fig. 
2h and Extended Data Fig. 4c), suggesting the critical role of the unique Arg-rich pocket for the robust DNA-binding affinity of Cas12m enzymes.
Non-canonical HDD-type RuvC active site
In most Cas12 enzymes, the RuvC active site has the conserved DED catalytic residues that coordinate two magnesium ions on either side of the scissile phosphate (Extended Data Fig. 6a). The first magnesium ion (A) activates the nucleophilic water molecule, while the second magnesium ion (B) stabilizes the transition state, resulting in a two-divalent cation-dependent DNA-hydrolysis mechanism21,22 (Extended Data Fig. 6a,b).
In the Cas12m2 structure, the nucleotides dG14* to dT17* in the NTS are clearly visible, and the phosphate group of dG16* is located at the RuvC active site formed by His317, Asp485 and Asp579 residues (Fig. 2j and Extended Data Fig. 6c). At the position corresponding to magnesium ion (A) in other Cas12 enzymes, we observed a magnesium ion that was coordinated by His317 and Asp579, whereas no density corresponding to magnesium ion (B) was observed (Fig. 2j,k and Extended Data Fig. 6c,d). This is due to the replacement of the first Asp and the second Glu in the active site with His317 and Asp485, respectively, in Cas12m2. The His317 side chain has only one lone pair for metal coordination, which is used to bind magnesium ion (A), and the Asp485 side chain is located too far away to coordinate the second magnesium ion (B) (Fig. 2j,k and Extended Data Fig. 6c,d). Therefore, the non-canonical HDD motif leads to the absence of the second magnesium ion, resulting in loss of cleavage activity for the target DNA. Nonetheless, His317 and Asp579 are bound to the NTS via magnesium ion coordination, and His317 is stabilized by hydrogen bonding with Asp485, suggesting that the HDD motif plays a crucial role in target DNA binding, rather than target DNA cleavage.
Consistent with these structural observations, our previous study showed that alanine substitution of Asp485 (HDD → HAD) leads to reduced DNA-binding affinity in vitro and transcriptional silencing activity in vivo11, indicating that Cas12m2 uses the RuvC catalytic center as the DNA-binding pocket for robust DNA-binding affinity.
Conformational changes upon heteroduplex formation lead to robust DNA binding
In state I, the target DNA forms a full R-loop containing a 17-bp heteroduplex, whereas, in state II, the target DNA only forms a 12-bp heteroduplex, and the displaced NTS is disordered (Fig. 3a,b). Thus, we propose that state II represents the intermediate state during formation of the 17-bp heteroduplex. Hereafter, we refer to state I as the full R-loop state and to state II as the intermediate state.
Structural comparisons between the full R-loop and the intermediate states revealed conformational changes in the REC2 domain (Fig. 3a–c and Supplementary Video 1). In the intermediate state, the REC2 domain is disordered due to its flexibility (Fig. 3a). By stark contrast, in the full R-loop, the REC2 domain is ordered and interacts with the sugar–phosphate backbone of the heteroduplex at the PAM-distal region (Fig. 3b). This structural rearrangement of the REC2 domain facilitates formation of the Arg-rich pocket between the REC1 and REC2 domains, allowing the binding of the displaced NTS (Fig. 3b). Structural comparisons also indicated local conformational changes in two short α-helices (Lα1 and Lα2) and the loop connecting them (together referred to as the lid motif) in the RuvC domain (Fig. 3c–f).
In the intermediate state, residues 499–506 (part of Lα2 and the loop) are disordered due to the flexibility of this region, and Lα1 occludes the RuvC active site (Fig. 3d,f). By contrast, upon formation of the full R-loop, Thr502 in the lid motif hydrogen bonds with the phosphate backbone of dC2, while Pro503 and Leu504 in Lα2 form hydrophobic interactions with dC(−1) and dG18*, respectively (Fig. 3e). These structural rearrangements facilitate the local structural transition of Lα1, opening the RuvC active site to bind the NTS, as observed in Cas12b and Cas12i23,24 (Fig. 3e,f). Taken together, the conformational changes of the REC2 domain and the lid motif, which enable recognition of the PAM-distal end, provide a binding site for NTS within the Arg-rich pocket and the RuvC DNA-binding site, allowing Cas12m2 to silence transcription of target genes through its robust DNA-binding affinity.
To biochemically validate the importance of the two NTS-binding sites (that is, the Arg-rich pocket and the RuvC active site) in Cas12m2-mediated transcriptional repression, we performed an in vivo transcriptional silencing assay. We generated a target plasmid
Fig. 3 | Conformational changes upon heteroduplex formation enable robust DNA binding of Cas12m2. a,b, Structural models of the Cas12m2–crRNA–target DNA ternary complex, representing the intermediate state (a) and full R-loop state (b). The disordered regions are indicated as dashed lines. c, Superimposition of the intermediate state (light blue) and the full R-loop state (colored as in b). d,e, Close-up views around the lid motif of the intermediate state (d) and the full R-loop state (e). The disordered regions are indicated as the gray circle surrounded by dashed lines. Hydrogen bonds are depicted with green dashed lines. f, Superimposition of the lid motif of the intermediate state (light blue) and the full R-loop state (colored as in b). The displaced NTS (dC15*–dT17*) in the full R-loop state clashes with the Lα2 helix in the intermediate state.
g, Schematic of the bicistronic rfp-gfp operon (pTarget-operon), including the crRNA targeting sites in the promoter (A1) and at the end of the operon (3′-UTR; E1) described previously11. h, Normalized RFP and GFP fluorescence of E. coli cultures expressing different Cas12m2 mutants, directed to either the promoter (pCRISPR-A1) or the 3′-UTR (pCRISPR-E1). Fluorescence is shown relative to the average fluorescence of cultures expressing a non-targeting guide (pCRISPR-NT). Values shown are the mean ± s.e.m. of two values of biological duplicates (after averaging technical triplicates). Statistical analysis was carried out using Tukey’s post hoc test in case of a significant two-way ANOVA result. Significant differences relative to Cas12m2 values are indicated by asterisks. *P < 0.05, **P < 0.01, ***P < 0.001. dCas12m2, Cas12m2 D485A mutant; triple, Cas12m2 R111A/R112A/R126A triple mutant.
  • 7. [Fig. 3 image residue. Recoverable labels: panels a–f show the Cas12m2 intermediate and full R-loop states with domains REC1, REC2, WED, RuvC and TNB, the PAM-distal region, the Arg-rich pocket, lid-motif helices Lα1 and Lα2, and residues R81, K87, H317, D485, D579, T502, P503 and L504; panel g shows the rfp–gfp operon with promoter, terminator and the pCRISPR-A1 and pCRISPR-E1 target sites; panel h plots relative fluorescence (%, axis 0–120) for the NT, A1 and E1 guides across Cas12m2, dCas12m2, R111A, R112A, R126A and triple-mutant strains.]
(pTarget-operon) containing a bicistronic operon with two fluorescence reporter genes, rfp and gfp. Escherichia coli cells harboring pTarget-operon and either pCRISPR-A1 (crRNA targeting the promoter region of the operon), pCRISPR-E1 (crRNA targeting the 3′ untranslated region (UTR) of the operon) or pCRISPR-NT (crRNA that does not target the operon) were made chemically competent and transformed with either pCas-Cas12m or plasmids encoding its mutants (including the Cas12m2 D485A, Cas12m2 R111A, Cas12m2 R112A and Cas12m2 R126A mutants and the Cas12m2 R111A/R112A/R126A triple mutant) (Fig. 3g). Wild-type Cas12m2 efficiently reduced red fluorescent protein (RFP) and green fluorescent protein (GFP) fluorescence upon crRNA targeting both the A1 and E1 sites, as observed in a previous study (Fig. 3h). By contrast, all mutants exhibited reduced transcriptional silencing capabilities when targeting the E1 site, and the Cas12m2 R111A/R112A/R126A triple mutant showed
  • 8. lower transcriptional silencing efficacy when targeting both the A1 and E1 sites (Fig. 3h). These results explain why the Arg-rich pocket and the RuvC DNA-binding site, which result from conformational changes of the REC2 domain and the lid motif, play critical roles in Cas12m2-mediated transcriptional silencing.
Evolutionary path from TnpB to Cas12m2
The Cas12m family enzymes are thought to represent the early stage of evolution from TnpB to larger Cas12 enzymes2 (Extended Data Fig. 7). Thus, structural comparisons of TnpB and Cas12m2 may provide clues to trace the evolutionary path from TnpB to Cas12 enzymes.
[Fig. 4 image residue. Recoverable labels: domain maps of TnpB (residues 1–408; REC, WED, RuvC, TNB) and Cas12m2 (residues 1–596; REC1, REC2, WED, RuvC, TNB); panel a labels the guide RNA and the TAM (TnpB) or PAM (Cas12m2) duplex; panel b labels the 12-bp and 17-bp heteroduplexes.]
Fig. 4 | Structural comparison of Cas12m2 and TnpB. a, Structural comparison of Cas12m2 and TnpB (from Deinococcus radiodurans ISDra2) (PDB 8H1J). b, Structural comparison of Cas12m2 and TnpB with emphasis on the guide RNA–target DNA heteroduplex. While Cas12m2 shares high sequence similarity with TnpB, Cas12m2 has an inserted REC2 domain (highlighted in blue), which is not present in TnpB, thereby facilitating recognition of the longer guide RNA–target DNA heteroduplex.
  • 9. The structure of Cas12m2 strongly resembles that of the minimal TnpB containing WED, REC1 (REC domain in TnpB), RuvC and TNB domains25,26 (Fig. 4a). Both enzymes adopt bilobed architecture and accommodate the guide RNA–target DNA heteroduplex in similar manners (Fig. 4a). The PAM duplex (transposon-associated motif (TAM) duplex in TnpB) is commonly bound between the WED and REC1 domains, and the guide RNA scaffold is accommodated within the groove formed by the WED and RuvC domains. These structural similarities confirm that the RNA-guided DNA-targeting mechanism is highly conserved between TnpB and Cas12m2.
In contrast to these conserved structural features, there are also important structural differences. TnpB only recognizes a 12-bp guide RNA–target DNA heteroduplex, whereas Cas12m2 recognizes a longer 17-bp heteroduplex (Fig. 4b). This difference is mainly due
[Fig. 5 image residue. Recoverable labels: domain maps of TnpB (residues 1–408; REC, WED, RuvC, TNB), Cas12f (1–529; REC, ZF, WED, RuvC, TNB), Cas12m (1–596; REC1, REC2, WED, RuvC, TNB) and Cas12a (1–1,300; REC1, REC2, PI, WED, RuvC, TNB) arranged along evolutionary time; annotations read "Domain insertion for recognition of the PAM-distal region", "Dimerization for recognition of the PAM-distal region" and "Domain insertion/enlargement for maturation of the adaptive immune system".]
Fig. 5 | Model of evolution from TnpB to Cas12 enzymes.
a, Structural comparison of Cas12m2 with TnpB (from D. radiodurans ISDra2) (PDB 8H1J), Cas12f (from an uncultured archaeon) (PDB 7C7L) and Cas12a (from Francisella novicida) (PDB 6I1K). The second Cas12f molecule (mol. 2) and the specific REC2 insertions in Cas12m2 and Cas12a, which are responsible for recognition of the PAM-distal region, are highlighted in blue. The insertions from Cas12m2 to Cas12a are highlighted in orange. The blue circle represents the region where TnpB acquired additional elements to engage in the adaptive immune system, and the orange circles represent the region where domain insertions and enlargements occurred for more efficient target recognition and cleavage. b, Schematic of domain insertions and enlargements of Cas12 enzymes from TnpB. ZF, zinc finger domain.
  • 10. to the characteristic REC2 insertion into the REC1 domain in Cas12m2 (Fig. 4a,b). TnpB lacks this REC2 domain, and thus the end of the guide RNA–target DNA heteroduplex is not recognized and the heteroduplexes are disordered beyond the 12th base pair. Therefore, these structural findings suggest that Cas12m2 acquired the ability to recognize the PAM-distal region of the guide RNA–target DNA heteroduplex through REC2 insertion into the core structure of TnpB. These features contributed to evolution toward CRISPR–Cas adaptive immunity.
Discussion
In this study, we determined the cryo-EM structure of the Cas12m2–crRNA–target DNA ternary complex, revealing that the non-canonical HDD-type RuvC catalytic triad forms an NTS-binding site via coordination of a single magnesium ion. Intriguingly, when we introduced counter-mutations to convert the HDD motif of Cas12m2 into a canonical DED motif, DNA-cleavage activity was not restored (data not shown). This observation indicates that there could be more reasons why Cas12m2 lost DNA-cleavage activity aside from the HDD-type RuvC triad. In a recent study, we divided the Cas12m family enzymes further into four clades, based on the conserved RuvC catalytic triad11 (clade 1, HEK, HQK; clade 2, HDD; clade 3, DED, NED; clade 4, DED, DND, NED). Because the other Cas12m enzyme clades have different catalytic residues, they could interfere with invading MGEs through mechanisms distinct from that of Cas12m2. Notably, clade 1 (HQK) may directly bind the NTS within the positively charged RuvC active site, while clade 3 and clade 4 may facilitate two-divalent cation-dependent DNA hydrolysis within the canonical DED-type RuvC active site. Biochemical and structural characterizations of the other Cas12m enzyme clades are required to further clarify the molecular mechanisms of Cas12m family enzymes.
Most Cas12 enzymes auto-process their own pre-crRNA into mature crRNA without the involvement of a host ribonuclease. Cas12a and Cas12i process their pre-crRNA at their WED domain by metal-independent, acid–base catalytic mechanisms21,24,27,28, whereas Cas12c and Cas12j process their pre-crRNA at the RuvC catalytic site by a metal-dependent mechanism8,20. A recent study showed that, similar to Cas12a and Cas12i, Cas12m2 cleaves its pre-crRNA upstream of the crRNA scaffold, and two conserved residues (His269 and Arg270) in the WED domain may participate in crRNA maturation11. However, unlike Cas12a and Cas12i, in which the 5′ end of the crRNA is surrounded by the WED domain, the 5′ end of Cas12m2–crRNA is projected away from the WED domain and exposed to the solvent in the present structure (Extended Data Fig. 8). His269 and Arg270 in the WED domain form base-specific interactions with the crRNA scaffold, suggesting that these residues are responsible for crRNA recognition, rather than crRNA processing (Fig. 2d). Indeed, a processing analysis of Cas12m2 revealed that alanine substitutions of His269 and Arg270 in the WED domain and Asp485 in the RuvC catalytic site had little or no effect on the pre-crRNA-processing pattern (Extended Data Fig. 9a). These structural and biochemical analyses suggest that Cas12m2 may process its pre-crRNA in a unique manner. Intriguingly, when we analyzed the cryo-EM structure of Cas12m2 complexed with a 56-nucleotide crRNA, we observed an additional density close to the 5′ region of the crRNA (Table 1, Extended Data Fig. 9b–e and Supplementary Table 1). This density aligns well with part of the distinctive sickle-like architecture of the REC1 and REC2 domains in Cas12m2, suggesting that Cas12m2 may facilitate pre-crRNA processing through the formation of a transient asymmetric homodimer. However, because the present resolution is not sufficient to unambiguously elucidate the residues responsible for crRNA maturation, further studies are required to fully understand the pre-crRNA-processing mechanism of Cas12m2.
A structural comparison of TnpB, the putative ancestor of Cas12 enzymes, with Cas12m2, Cas12f14 and Cas12a19 provides insights into the evolutionary path of Cas12 enzymes (Fig. 5a,b). Cas12m2 and Cas12a commonly have REC2 insertions that recognize the PAM-distal region of the guide RNA–target DNA heteroduplex, whereas Cas12f retains a TnpB-like REC domain. Nevertheless, Cas12f functions as an asymmetric homodimer to compensate for recognition of the PAM-distal region of the heteroduplex, with the second molecule serving as a replacement for the REC2 domain14,15. These observations suggest that CRISPR–Cas12 enzymes acquired the ability to recognize the PAM-distal region of the guide RNA–target DNA heteroduplex to engage in CRISPR–Cas adaptive immunity. All Cas12 enzymes except for Cas12f and Cas12k (which are both associated with one or more additional protein molecules) have the REC2 insertion to recognize the PAM-distal region14,15,29,30, supporting our suggestion. Structural comparison between Cas12m2 and Cas12a provides insights into the maturation of Cas12 enzymes (Fig. 5a,b). Notably, Cas12a has larger REC domains and a TNB domain19, which are responsible for target DNA recognition and loading, respectively. In addition, Cas12a contains a PAM-interacting (PI) domain, which extensively interacts with the PAM duplex19 (Fig. 5a,b). These domain enlargements and the insertion may have been acquired to target the dsDNA of invading phage more efficiently and/or to compete with anti-CRISPR systems.
Online content
Any methods, additional references, Nature Portfolio reporting summaries, source data, extended data, supplementary information, acknowledgements, peer review information; details of author contributions and competing interests; and statements of data and code availability are available at https://doi.org/10.1038/s41594-023-01042-3.
References
1. van der Oost, J., Westra, E. R., Jackson, R. N. & Wiedenheft, B. Unravelling the structural and mechanistic basis of CRISPR–Cas systems. Nat. Rev. Microbiol.
12, 479–492 (2014).
2. Makarova, K. S. et al. Evolutionary classification of CRISPR–Cas systems: a burst of class 2 and derived variants. Nat. Rev. Microbiol. 18, 67–83 (2020).
3. Zetsche, B. et al. Cpf1 is a single RNA-guided endonuclease of a class 2 CRISPR–Cas system. Cell 163, 759–771 (2015).
4. Shmakov, S. et al. Discovery and functional characterization of diverse class 2 CRISPR–Cas systems. Mol. Cell 60, 385–397 (2015).
5. Yan, W. X. et al. Functionally diverse type V CRISPR–Cas systems. Science 363, 88–91 (2019).
6. Burstein, D. et al. New CRISPR–Cas systems from uncultivated microbes. Nature 542, 237–241 (2017).
7. Harrington, L. B. et al. Programmed DNA destruction by miniature CRISPR–Cas14 enzymes. Science 362, 839–842 (2018).
8. Pausch, P. et al. CRISPR–CasΦ from huge phages is a hypercompact genome editor. Science 369, 333–337 (2020).
9. Strecker, J. et al. RNA-guided DNA insertion with CRISPR-associated transposases. Science 365, 48–53 (2019).
10. Sun, A. et al. The compact Casπ (Cas12l) ‘bracelet’ provides a unique structural platform for DNA manipulation. Cell Res. 33, 229–244 (2023).
11. Wu, W. Y. et al. The miniature CRISPR–Cas12m effector binds DNA to block transcription. Mol. Cell 82, 4487–4502 (2022).
12. Karvelis, T. et al. Transposon-associated TnpB is a programmable RNA-guided DNA endonuclease. Nature 599, 692–696 (2021).
13. Altae-Tran, H. et al. The widespread IS200/IS605 transposon family encodes diverse programmable RNA-guided endonucleases. Science 374, 57–65 (2021).
14. Takeda, S. N. et al. Structure of the miniature type V-F CRISPR–Cas effector enzyme. Mol. Cell 81, 558–570 (2021).
15. Xiao, R., Li, Z., Wang, S., Han, R. & Chang, L. Structural basis for substrate recognition and cleavage by the dimerization-dependent CRISPR–Cas12f nuclease. Nucleic Acids Res. 49, 4120–4128 (2021).
  • 11. 16. Yamano, T. et al. Crystal structure of Cpf1 in complex with guide RNA and target DNA. Cell 165, 949–962 (2016).
17. Stella, S., Alcón, P. & Montoya, G. Structure of the Cpf1 endonuclease R-loop complex after target DNA cleavage. Nature 546, 559–563 (2017).
18. Liu, J.-J. et al. CasX enzymes comprise a distinct family of RNA-guided genome editors. Nature 566, 218–223 (2019).
19. Swarts, D. C. & Jinek, M. Mechanistic insights into the cis- and trans-acting DNase activities of Cas12a. Mol. Cell 73, 589–600 (2019).
20. Kurihara, N. et al. Structure of the type V-C CRISPR–Cas effector enzyme. Mol. Cell 82, 1865–1877 (2022).
21. Huang, X. et al. Structural basis for two metal-ion catalysis of DNA cleavage by Cas12i2. Nat. Commun. 11, 5241 (2020).
22. Pausch, P. et al. DNA interference states of the hypercompact CRISPR–CasΦ effector. Nat. Struct. Mol. Biol. 28, 652–661 (2021).
23. Yang, H., Gao, P., Rajashankar, K. R. & Patel, D. J. PAM-dependent target DNA recognition and cleavage by C2c1 CRISPR–Cas endonuclease. Cell 167, 1814–1828 (2016).
24. Zhang, H., Li, Z., Xiao, R. & Chang, L. Mechanisms for target recognition and cleavage by the Cas12i RNA-guided endonuclease. Nat. Struct. Mol. Biol. 27, 1069–1076 (2020).
25. Nakagawa, R. et al. Cryo-EM structure of the transposon-associated TnpB enzyme. Nature 616, 390–397 (2023).
26. Sasnauskas, G. et al. TnpB structure reveals minimal functional core of Cas12 nuclease family. Nature 616, 384–389 (2023).
27. Swarts, D. C., van der Oost, J. & Jinek, M. Structural basis for guide RNA processing and seed-dependent DNA targeting by CRISPR–Cas12a. Mol. Cell 66, 221–233 (2017).
28. Zhang, B. et al. Mechanistic insights into the R-loop formation and cleavage in CRISPR–Cas12i1. Nat. Commun. 12, 3476 (2021).
29. Querques, I., Schmitz, M., Oberli, S., Chanez, C. & Jinek, M.
Target site selection and remodelling by type V CRISPR–transposon systems. Nature 599, 497–502 (2021).
30. Xiao, R. et al. Structural basis of target DNA recognition by CRISPR–Cas12k for RNA-guided DNA transposition. Mol. Cell 81, 4457–4466 (2021).
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
© The Author(s) 2023
  • 12. Methods
Protein and RNA preparation for structural analysis
The Cas12m2 protein for structural analysis was expressed and purified using the protocol reported previously14,20. Briefly, the N-terminally His6-tagged Cas12m2 protein was expressed in E. coli Rosetta 2 (DE3). E. coli cells were cultured at 37 °C until the OD600 reached 0.8, and protein expression was then induced by the addition of 0.1 mM isopropyl β-d-thiogalactopyranoside (Nacalai Tesque). E. coli cells were further cultured at 20 °C overnight and collected by centrifugation. The cells were then resuspended in buffer A (20 mM HEPES–NaOH, pH 7.6, 20 mM imidazole and 1 M NaCl), lysed by sonication and centrifuged. The supernatant was mixed with 3 ml Ni-NTA Superflow resin (Qiagen), and the mixture was loaded into an Econo-Column (Bio-Rad). The protein was eluted with buffer B (20 mM HEPES–NaOH, pH 7.6, 0.3 M imidazole, 0.3 M NaCl) and then loaded onto a 5-ml HiTrap SP HP column (GE Healthcare) equilibrated with buffer C (20 mM HEPES–NaOH, pH 7.6, and 0.3 M NaCl). The protein was eluted with a linear gradient of 0.3–2 M NaCl and further purified by chromatography on a Superdex 200 column (GE Healthcare) equilibrated in buffer D (20 mM HEPES–NaOH, pH 7.6, 0.5 M NaCl). The purified proteins were stored at −80 °C until use. The crRNA was transcribed in vitro with T7 RNA polymerase and purified by 10% denaturing (7 M urea) polyacrylamide gel electrophoresis.
Electron microscopy sample preparation and data collection
The Cas12m2–crRNA–target DNA complex was reconstituted by mixing purified Cas12m2, the 56-nucleotide crRNA, the 36-nucleotide target DNA and the 36-nucleotide non-target DNA at a molar ratio of 1:1.2:1.5:1.5. The Cas12m2–crRNA binary complex was reconstituted by mixing purified Cas12m2 and the 56-nucleotide crRNA at a molar ratio of 1:1.2.
The ternary and binary complexes were purified by size-exclusion chromatography on a Superdex 200 Increase 10/300 column (GE Healthcare) equilibrated with buffer E (20 mM HEPES–NaOH, pH 7.6, 50 mM NaCl, 2 mM MgCl2 and 10 μM ZnCl2). The purified complex solution (A260 = 4) was then applied to Au 300-mesh R1.2/1.3 grids (Quantifoil), and, after adding 3 μl amylamine, they were glow discharged in a Vitrobot Mark IV system (FEI) at 4 °C with a waiting time of 10 s and a blotting time of 4 s under 100% humidity conditions. The grids were plunge frozen in liquid ethane and cooled to the temperature of liquid nitrogen.
Micrographs for all datasets were collected on a Titan Krios G3i microscope (Thermo Fisher Scientific) running at 300 kV and equipped with a Gatan Quantum LS Energy Filter (GIF) and a Gatan K3 Summit direct electron detector in electron-counting mode (University of Tokyo, Japan). Movies were recorded at a nominal magnification of 105,000×, corresponding to a calibrated pixel size of 0.83 Å, with a total dose of approximately 50 electrons per Å2 per 48 frames using EPU software (Thermo Fisher Scientific). The dose-fractionated movies were subjected to beam-induced motion correction and dose weighting using MotionCor2 (ref. 31) implemented in RELION-3.1 (ref. 32), and contrast transfer function (CTF) parameters were estimated using patch-based CTF estimation in cryoSPARC version 3.3.2 (ref. 33).
Single-particle cryo-EM data processing
Data were processed using cryoSPARC33. For the ternary complex, 5,067,096 particles were initially selected from the 3,570 motion-corrected and dose-weighted micrographs using a two-dimensional (2D) reference and extracted at a pixel size of 3.32 Å. These particles were subjected to several rounds of 2D classification to curate particle sets. The particles were further curated by heterogeneous refinement, using a map derived from ab initio reconstruction as the template.
The selected 1,030,197 particles were then re-extracted at a pixel size of 1.16 Å and subjected to 3D variability analysis34. The resulting maps with different conformations were used for subsequent heterogeneous refinement. The selected particles after heterogeneous refinement were refined using non-uniform refinement with optimization of the CTF value35, yielding maps at resolutions of 2.87 Å (state I) and 3.08 Å (state II), according to the FSC criterion of 0.143 (ref. 36). The local resolution was estimated with BlocRes in cryoSPARC.
For the binary complex, 6,034,518 particles were initially selected from a total of 8,004 motion-corrected and dose-weighted micrographs using a 2D reference and extracted at a pixel size of 3.32 Å. The particles were further curated by heterogeneous refinement using a map derived from ab initio reconstruction as the template. The selected 215,565 particles were then re-extracted at a pixel size of 1.66 Å and subjected to 3D classification without alignment, using a mask focused on the second subunit. After 3D classification, the selected particles were refined using non-uniform refinement, yielding a map at a resolution of 3.73 Å, according to the FSC criterion of 0.143. The local resolution was estimated with BlocRes in cryoSPARC.
Model building and validation
The model was built using the predicted model of the Cas12m2 protein created by AlphaFold2 (the AlphaFold model) as the reference, followed by manual model building with Coot37,38. The model was refined using phenix.real_space_refine version 1.20.1 (ref. 39) with secondary structure and metal coordination restraints. The metal coordination restraints were generated using ReadySet, as implemented in PHENIX. Structure validation was performed using MolProbity40.
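The sampling figures quoted in the data-collection and processing paragraphs above can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the helper names are hypothetical and are not part of cryoSPARC or RELION, and the only inputs are numbers stated in the Methods (0.83 Å calibrated pixel size, about 50 electrons per Å2 over 48 frames, extraction pixel sizes and reported resolutions). It assumes the usual rule that the Nyquist limit is twice the pixel size.

```python
# Illustrative arithmetic only; helper names are hypothetical and do not
# correspond to functions in cryoSPARC, RELION or any other package.

CALIBRATED_PIXEL_A = 0.83   # Å per pixel at 105,000x (from the Methods)
TOTAL_DOSE = 50.0           # approximate electrons per Å^2 per movie
FRAMES = 48                 # frames per movie


def binning_factor(extracted_pixel_a, raw_pixel_a=CALIBRATED_PIXEL_A):
    """Ratio of an extraction pixel size to the calibrated pixel size."""
    return extracted_pixel_a / raw_pixel_a


def nyquist_limit(pixel_a):
    """Finest spacing (Å) representable at a given pixel size: two pixels."""
    return 2.0 * pixel_a


dose_per_frame = TOTAL_DOSE / FRAMES

# (dataset, re-extraction pixel size in Å, reported resolution in Å)
maps = [
    ("state I (full R-loop)", 1.16, 2.87),
    ("state II (intermediate)", 1.16, 3.08),
    ("binary complex", 1.66, 3.73),
]

print(f"dose per frame: {dose_per_frame:.2f} e/Å^2")
print(f"initial extraction binning: {binning_factor(3.32):.1f}x")
for name, pixel, resolution in maps:
    limit = nyquist_limit(pixel)
    # A reported resolution must be coarser (numerically larger) than Nyquist.
    print(f"{name}: Nyquist {limit:.2f} Å, reported {resolution:.2f} Å, "
          f"consistent: {resolution > limit}")
```

Under these assumptions, the initial 3.32 Å extraction corresponds to fourfold binning of the raw pixels, and each reported resolution is coarser than the Nyquist limit of the re-extracted particles.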
Residues 592–596 and nucleotides −36 to −30 of the crRNA, nucleotides −9 to −5 of the TS and nucleotides 22–26 of the NTS were not included in the final full R-loop state model because these regions were not well resolved on the density map. Residues 53–120, 499–506 and 592–596 and nucleotides −36 to −30 and 13–20 of the crRNA, nucleotides −9 to 5 and 27 of the TS and nucleotides −10 and 13–26 of the NTS were not included in the final intermediate state model. In the binary complex, residues 87–90, 488–508 and 592–596 of Cas12m2.1, residues 1–17, 65–94, 189–250 and 262–596 of Cas12m2.2 and nucleotides −36 to −30 and 16–20 of the crRNA were not included in the final model. The cryo-EM density maps were calculated with UCSF ChimeraX41, and the molecular graphics shown in the figures were prepared with CueMol (http://www.cuemol.org).
Assembly of Cas12m2 PAM and REC mutant plasmids
Plasmids used for protein expression of Cas12m2 PAM and REC mutants were assembled by PCR mutagenesis (Q5 polymerase, NEB; T4 DNA ligase, NEB) using the primers listed in Supplementary Table 2, with pML-1B-Cas12m as the PCR template11. Sequence-verified plasmids were used as templates for amplification of their inserts for restriction ligation assembly of pCas-Cas12m2-[x] plasmids, which were used for in vivo transcriptional silencing assays, with pCas-Cas12m2 as the template for backbone amplification (Supplementary Table 2).
In vivo transcriptional silencing assay
The transcriptional silencing activity of Cas12m variants in vivo was assessed using the protocol reported previously11, with minor modifications. Briefly, E. coli cells harboring pTarget-operon and either pCRISPR-A1, pCRISPR-E1 or pCRISPR-NT were made chemically competent and transformed with either pCas-Cas12m, pCas-dCas12m or pCas-Cas12m-[x] (the ten alanine-substitution mutants). After recovery, 5 μl of transformation mix was diluted with 195 μl M9TG medium in a 96-well 2-ml masterblock (Greiner) in triplicate, sealed with a gas-permeable membrane and grown at 37 °C and 750 r.p.m. for 20 h.
Cells were diluted 1:1,000 in fresh M9TG medium in a total volume of 200 μl in a 96-well masterblock and grown overnight at 37 °C. Overnight cultures were diluted 1:10 in 200 μl PBS and analyzed using a Synergy Mx microplate reader (BioTek) and Gen5 (software version 3.10.06). The fluorescence of GFP (excitation, 488 nm; emission, 512 nm) and RFP (excitation, 532 nm; emission, 588 nm) was measured with gains of 91 and 140, respectively. Fluorescence values were normalized to
  • 13. blank-corrected OD600 values, and technical triplicates were averaged. In Fig. 3h, the averages of two biological replicates are shown relative to the average normalized fluorescence of wells containing E. coli expressing a non-targeting spacer (pCRISPR-NT). Statistical analysis was performed using Tukey’s post hoc test in case of significant two-way ANOVA results.
Cas12m2 purification for surface plasmon resonance analyses
The pML-1B-Cas12m2, pML-1B-dCas12m2 and pML-1B-Cas12m2-[x] plasmids were transformed into chemically competent E. coli Rosetta 2 (DE3) cells and purified using a modified version of the previously reported protocol11. Briefly, the transformed E. coli strains were inoculated into 800 ml LB medium supplemented with chloramphenicol and kanamycin, cultured overnight and grown to an OD600 of 0.6–0.8 at 37 °C and 150 r.p.m. The cultures were cooled on ice for 1 h before the addition of 0.4 mM isopropyl 1-thio-β-d-galactopyranoside and then grown for 16–20 h at 20 °C and 120 r.p.m. Cultured cells were pelleted, washed and resuspended in lysis buffer (1 M NaCl, 20 mM imidazole, 20 mM HEPES, pH 7.6) freshly supplemented with protease inhibitors (Roche cOmplete, EDTA-free) and lysed by sonication (Bandelin Sonopuls). The lysate was cleared (30 min, 30,000g), and the supernatant was filtered (0.45 μm). The filtrate was applied to a nickel column (HisTrap HP, GE Life Sciences), which was washed (500 mM NaCl, 20 mM imidazole, 20 mM HEPES, pH 7.6) and eluted (500 mM NaCl, 300 mM imidazole, 20 mM HEPES, pH 7.6) using an ÄKTA FPLC system. Elution fractions containing the expected band were pooled, concentrated in SEC buffer (500 mM NaCl, 20 mM HEPES, pH 7.6, 1 mM DTT) through centrifugal filters (Amicon Ultra-15, MW cutoff 30,000 Da) and further purified by size-exclusion chromatography (HiLoad 16/600 Superdex 200) by FPLC. Pure fractions were pooled and snap frozen in liquid nitrogen until analysis by surface plasmon resonance (SPR).
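The fluorescence normalization described for the in vivo transcriptional silencing assay (blank-corrected OD600, averaging of technical triplicates, expression relative to the non-targeting control) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' analysis script: the function names are hypothetical, the readings are invented, and the choice to compute the s.e.m. across the per-replicate percentages is one plausible reading of "mean ± s.e.m. of two values of biological duplicates".

```python
from math import sqrt
from statistics import mean, stdev

# Hypothetical helpers and made-up plate-reader values, for illustration only.


def normalized(fluorescence, od600, blank_od600):
    """Fluorescence divided by blank-corrected OD600 for a single well."""
    return fluorescence / (od600 - blank_od600)


def replicate_value(wells, blank_od600):
    """Average the technical replicate wells of one biological replicate.

    `wells` is a list of (fluorescence, OD600) pairs.
    """
    return mean(normalized(f, od, blank_od600) for f, od in wells)


def relative_to_nt(sample_reps, nt_reps):
    """Express biological replicates as % of the mean non-targeting control.

    Returns (mean percentage, s.e.m. of the percentages).
    """
    nt_mean = mean(nt_reps)
    percentages = [100.0 * value / nt_mean for value in sample_reps]
    sem = stdev(percentages) / sqrt(len(percentages)) if len(percentages) > 1 else 0.0
    return mean(percentages), sem


BLANK_OD = 0.05  # invented blank reading

# Two biological replicates, three technical wells each (invented numbers).
nt = [
    replicate_value([(1000, 0.55), (1100, 0.60), (900, 0.50)], BLANK_OD),
    replicate_value([(1050, 0.55), (1050, 0.55), (1050, 0.55)], BLANK_OD),
]
sample = [
    replicate_value([(100, 0.55), (100, 0.55), (100, 0.55)], BLANK_OD),
    replicate_value([(105, 0.55), (105, 0.55), (105, 0.55)], BLANK_OD),
]

mean_pct, sem_pct = relative_to_nt(sample, nt)
print(f"relative fluorescence: {mean_pct:.1f} ± {sem_pct:.2f} % of NT")
```

With only two biological replicates the s.e.m. equals half the difference between the two percentages, which is worth keeping in mind when reading the error bars in Fig. 3h.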

Surface plasmon resonance
SPR spectroscopy was performed at 25 °C on a Biacore T100 system (Cytiva). A series S CM5 sensor chip surface was modified with 2,500 response units of streptavidin (Invitrogen) using an amine coupling kit (Cytiva). Flow cell 1 was used as the reference. A 50-nucleotide biotinylated oligonucleotide containing a DNA target site was annealed to its complementary strand (Supplementary Table 2; IDT) in 25 mM HEPES, pH 7.5, containing 150 mM KCl, and 20 response units were immobilized on flow cell 2. Ribonucleoprotein complexes were formed by mixing purified protein variants with a twofold molar excess of PAM-SCNR RNA or a non-target RNA (Supplementary Table 2) to a final concentration of 500 nM protein in SPR buffer (20 mM HEPES, pH 7.8, 150 mM KCl, 10% glycerol, 5 mM MgCl2, 1 mM DTT and 0.05% Tween-20). Dilutions of this complex were injected over the chip surface at 50 µl min−1. Flow cells were regenerated using three consecutive injections of 4 M MgCl2, followed by an injection of the complementary oligonucleotide (1 µM) for 100 s at 5 µl min−1 in flow cell 2. Wild-type Cas12m2 displays multiphasic association and dissociation kinetics, which is most prominent at protein concentrations above 100 nM11; therefore, sensorgrams at lower concentrations (8–63 nM) were fitted to a two-state model using BIAevaluation software (Cytiva) to obtain a KD estimate. After double-reference correction, the remaining bulk effects masked the severely reduced binding capabilities of the mutant proteins in the association phase in the sensorgrams; therefore, the binding of Cas12m2 variants at 500 nM to the target site was quantified from the stable binding levels at the start of the dissociation phase (average signal of a 10-s window, 20 s after dissociation) relative to that of wild-type Cas12m2.
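The relative binding quantification described above can be sketched in Python as follows. This is an illustration under stated assumptions, not the BIAevaluation procedure: the 10-s window is assumed to begin 20 s after the start of the dissociation phase, the sensorgram is assumed to be already double-reference corrected, and all function and variable names are hypothetical.

```python
from statistics import mean


def stable_binding_level(times, responses, dissociation_start,
                         offset=20.0, window=10.0):
    """Mean response in the window [start + offset, start + offset + window].

    times and responses are parallel sequences (seconds, response units).
    The placement of the window relative to the dissociation start is an
    assumption of this sketch.
    """
    lower = dissociation_start + offset
    upper = lower + window
    selected = [r for t, r in zip(times, responses) if lower <= t <= upper]
    if not selected:
        raise ValueError("no data points fall inside the averaging window")
    return mean(selected)


def relative_binding(variant_level, wild_type_level):
    """Binding level of a variant as a fraction of the wild-type level."""
    if wild_type_level == 0:
        raise ValueError("wild-type binding level must be non-zero")
    return variant_level / wild_type_level


# Hypothetical sensorgrams sampled once per second; dissociation starts at 100 s.
times = list(range(0, 201))
wild_type = [50.0] * len(times)
variant = [5.0] * len(times)
wt_level = stable_binding_level(times, wild_type, dissociation_start=100.0)
variant_level = stable_binding_level(times, variant, dissociation_start=100.0)
fraction = relative_binding(variant_level, wt_level)
```

Averaging over a short window rather than reading a single time point reduces the influence of noise on the reported binding level.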

Pre-crRNA processing assay
Pre-crRNA processing was achieved using an in vitro transcription and translation system, as described previously11. A pCas and pCRISPR plasmid mixture expressing Cas12m2 and a CRISPR array (containing four spacers) was incubated at 29 °C for 5 h in a thermocycler. Total RNA was then extracted using a Direct-zol RNA MiniPrep kit, according to the manufacturer's instructions (Zymo Research). For northern blotting analysis, 5 μg of each RNA sample obtained from cell-free transcription–translation was fractionated on an 8% polyacrylamide gel (7 M urea) at 300 V for 140 min. RNA was transferred onto Hybond-XL membranes (Amersham Hybond-XL, GE Healthcare) using an electroblotter at 50 V for 1 h at 4 °C (Tank-Elektroblotter Web M, PerfectBlue) and cross-linked with UV light at 0.12 J (UV lamp T8C, 254 nm, 8 W). Hybridization proceeded overnight in 17 ml of Roti-Hybri-Quick buffer, with 5 µl of [γ-32P]ATP end-labeled oligodeoxyribonucleotides at 42 °C. The membrane was visualized using a Phosphorimager (Typhoon FLA 7000, GE Healthcare).

Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability
The structural models have been deposited in the PDB under accession codes PDB 8HHL (full R-loop state), PDB 8HHM (intermediate state) and PDB 8HIO (binary complex). The cryo-EM density maps have been deposited in the EMDB under accession codes EMD-34803 (full R-loop state), EMD-34804 (intermediate state) and EMD-34824 (binary complex). Source data are provided with this paper.

References
31. Zheng, S. Q. et al. MotionCor2: anisotropic correction of beam-induced motion for improved cryo-electron microscopy. Nat. Methods 14, 331–332 (2017).
32. Zivanov, J. et al. New tools for automated high-resolution cryo-EM structure determination in RELION-3. eLife 7, e42166 (2018).
33. Punjani, A., Rubinstein, J. L., Fleet, D. J. & Brubaker, M. cryoSPARC: algorithms for rapid unsupervised cryo-EM structure determination. Nat. Methods 14, 290–296 (2017).
34. Punjani, A. & Fleet, D. J.
3D variability analysis: resolving continuous flexibility and discrete heterogeneity from single particle cryo-EM. J. Struct. Biol. 213, 107702 (2021).
35. Punjani, A., Zhang, H. & Fleet, D. J. Non-uniform refinement: adaptive regularization improves single-particle cryo-EM reconstruction. Nat. Methods 17, 1214–1221 (2020).
36. Rosenthal, P. B. & Henderson, R. Optimal determination of particle orientation, absolute hand, and contrast loss in single-particle electron cryomicroscopy. J. Mol. Biol. 333, 721–745 (2003).
37. Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589 (2021).
38. Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta Crystallogr. D 60, 2126–2132 (2004).
39. Afonine, P. V. et al. Real-space refinement in PHENIX for cryo-EM and crystallography. Acta Crystallogr. D Struct. Biol. 74, 531–544 (2018).
40. Chen, V. B. et al. MolProbity: all-atom structure validation for macromolecular crystallography. Acta Crystallogr. D 66, 12–21 (2010).
41. Pettersen, E. F. et al. UCSF ChimeraX: structure visualization for researchers, educators, and developers. Protein Sci. 30, 70–82 (2021).
42. Söding, J. Protein homology detection by HMM–HMM comparison. Bioinformatics 21, 951–960 (2005).

Acknowledgements
We thank C. Liao, A. Migur and C.L. Beisel (HZI, Würzburg, Germany) for contributing to the guide-processing analysis. J.H.G.L. and C.L. are supported by the gravitation program CancerGenomiCs.nl from the Netherlands Organisation for Scientific Research, part of the Oncode
Institute, which is partly financed by the Dutch Cancer Society. O.N. was supported by AMED (grant numbers JP19am0401005 and JP21km0908001), the Platform Project for Supporting Drug Discovery and Life Science Research (Basis for Supporting Innovative Drug Discovery and Life Science Research) from AMED under grant number JP22am0101115 and the Cabinet Office, Government of Japan, Public/Private R&D Investment Strategic Expansion Program (grant number JPJ008000).

Author contributions
S.N.O. performed structural analyses with assistance from R.N., H.H., T.K., Y.K. and Y.I.; S.N.O., R.N., H.H. and Y.I. built the model and performed structural refinement; W.Y.W., C.S., R.V.W., C.L. and J.H.G.L. performed in vitro and in vivo biological experiments; J.v.d.O. conceived the project; S.N.O., R.N. and O.N. wrote the paper with help from all authors; O.N. supervised the research.

Competing interests
O.N. is a cofounder, board member and scientific advisor of Curreio. Two patent applications have been filed related to this work, with J.v.d.O. and W.Y.W. as inventors. J.v.d.O. is an advisor for NTrans Technologies, Scope Biosciences and Hudson River Biotechnology. The other authors declare no competing interests.

Additional information
Extended data is available for this paper at https://doi.org/10.1038/s41594-023-01042-3.
Supplementary information The online version contains supplementary material available at https://doi.org/10.1038/s41594-023-01042-3.
Correspondence and requests for materials should be addressed to John van der Oost or Osamu Nureki.
Peer review information Nature Structural & Molecular Biology thanks Yong-Sam Kim and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: Dimitris Typas, in collaboration with the Nature Structural & Molecular Biology team. Peer reviewer reports are available.
Reprints and permissions information is available at www.nature.com/reprints.

Extended Data Fig. 1 | Single-particle cryo-EM analysis of the Cas12m2–crRNA–target DNA ternary complex. (a) Size-exclusion chromatography profile of the Cas12m2–crRNA–target DNA ternary complex. The peak fraction was used for cryo-EM analysis. (b) A representative cryo-EM image of the Cas12m2–crRNA–target DNA complex, recorded on a 300-kV Titan Krios microscope with a K3 camera. (c) Single-particle cryo-EM image processing workflow. (d) Fourier shell correlation (FSC) curve for the 3D reconstruction, with the gold-standard cut-off (FSC = 0.143) marked with a black dotted line. (e and f) Local-resolution cryo-EM density maps of State I (full R-loop state) (e) and State II (intermediate state) (f).
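As an illustration of how a resolution estimate is commonly read from an FSC curve at the gold-standard cut-off (FSC = 0.143), the following Python sketch finds the first crossing of the threshold by linear interpolation and reports the reciprocal of that spatial frequency. It is a simplified example with hypothetical names and values, not the procedure implemented in the processing software used for this work.

```python
def resolution_at_cutoff(spatial_freqs, fsc_values, cutoff=0.143):
    """Resolution (in Å) at which the FSC curve first drops below the cutoff.

    spatial_freqs: increasing, positive spatial frequencies in 1/Å.
    fsc_values: FSC value at each spatial frequency.
    Returns None if the curve never falls below the cutoff.
    """
    for i in range(1, len(fsc_values)):
        previous, current = fsc_values[i - 1], fsc_values[i]
        if previous >= cutoff > current:
            # Linear interpolation between the two neighbouring shells.
            fraction = (previous - cutoff) / (previous - current)
            crossing = spatial_freqs[i - 1] + fraction * (
                spatial_freqs[i] - spatial_freqs[i - 1]
            )
            return 1.0 / crossing
    return None


# Hypothetical FSC curve sampled at four spatial frequencies (1/Å).
freqs = [0.1, 0.2, 0.3, 0.4]
fsc = [1.0, 0.8, 0.243, 0.043]
resolution = resolution_at_cutoff(freqs, fsc)
```

In this hypothetical curve the threshold is crossed midway between 0.3 and 0.4 Å−1, which corresponds to a resolution of roughly 2.9 Å.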

Extended Data Fig. 2 | Overall, domain, and nucleic acid structures of the Cas12m2–crRNA–target DNA ternary complex. (a) Surface representation of the Cas12m2–crRNA–target DNA ternary complex. (b) The REC lobe structure of Cas12m2. The WED domain comprises an oligonucleotide/oligosaccharide-binding fold (the conserved β strands (blue) and α-helix (red) are numbered). The two additional α-helices are identified with green numbers. The REC1 and REC2 domains form a distinctive sickle-like architecture. (c) The NUC lobe structure of Cas12m2 and that of Cas12f (from an uncultured archaeon) (PDB ID: 7C7L). The NUC lobes consist of the RuvC and TNB domains. The RuvC domains comprise the RNase H fold (the conserved β strands (blue) and α-helices (red) are numbered). (d) Zinc binding site in the TNB domain. (e) The cryo-EM densities for the RNA–DNA heteroduplex, PAM duplex, and rehybridized DNA duplex are shown as a blue mesh. (f) Structure of the crRNA scaffold.

Extended Data Fig. 3 | Nucleic acid recognition and in vitro DNA binding profile. Schematic of nucleic acid recognition by Cas12m2. The residues that interact with nucleic acids through their main chains are shown in parentheses. The disordered region is enclosed in a dashed box.

Extended Data Fig. 4 | In vitro surface plasmon resonance analysis. (a) SEC chromatograms of the final purification step of the Cas12m2 proteins analyzed in this study. Arrows indicate the loading of each protein onto the column. Elution fractions that were pooled for protein analysis are highlighted in orange. Triple: R111A/R112A/R126A triple mutant. (b) SDS-PAGE analysis of SEC-purified proteins. Two micrograms of protein were loaded per well. The theoretical molecular mass of the variants is ~68 kDa. (c) SPR sensorgrams of 500 nM wild-type Cas12m2 and variants bound to PAM-SCNR RNA, binding to a 50-base pair dsDNA oligonucleotide containing a target site. (d) SPR sensorgram of 500 nM wild-type Cas12m2 binding to the DNA target site with PAM-SCNR RNA (black) and non-target RNA (gray). With the non-target RNA, Cas12m2 rapidly binds but is unable to form a stable complex and dissociates quickly. With the target RNA, a stable complex is formed. (e) Fit of a two-state binding model (black lines) to SPR sensorgrams of wild-type Cas12m2 with PAM-SCNR RNA binding to the target site (colored curves) returns an estimated KD of 2.7 nM.

Extended Data Fig. 5 | Multiple sequence alignment of Cas12m orthologs. Mmu, Cas12m2 from Mycolicibacterium mucogenicum (WP_061006603.1); Msp, Cas12m2 from Mycobacterium sp. SM1 (WP_213445835.1); Myc, Cas12m2 from Mycobacterium (WP_036473531.1); Mtr, Cas12m2 from Mycobacterium triplex (CDO91315.1). The secondary structures of Cas12m2 are indicated above the sequences. Key residues of Cas12m2 are marked below the sequences by triangles. The figure was prepared using Clustal Omega (http://www.ebi.ac.uk/Tools/msa/clustalo) and ESPript 3 (http://espript.ibcp.fr/ESPript/ESPript).

Extended Data Fig. 6 | Non-canonical RuvC active site. (a and c) Structures of the RuvC active site of Cas12i2 (a) (Cas12i2 from an unidentified source) (PDB ID: 6LTU) and Cas12m2 (c). While Cas12i2 coordinates two magnesium ions on either side of the scissile phosphate, Cas12m2 coordinates only one magnesium ion, which corresponds to magnesium ion A in Cas12i2. (b and d) Schematics of the RuvC active site representing the canonical DED-type (b) and non-canonical HDD-type (d). In the canonical DED-type RuvC active site, the first magnesium ion (A) activates the nucleophilic water molecule, while the second magnesium ion (B) stabilizes the transition state, resulting in two divalent cation-dependent DNA hydrolysis (b). In the non-canonical HDD-type RuvC active site, no density corresponding to magnesium ion B was observed, resulting in the loss of cleavage activity for the target DNA of Cas12m2 (d).

Extended Data Fig. 7 | UPGMA dendrogram showing the similarity between different families of type V effectors. The dendrogram was built using the UPGMA (unweighted pair group method with arithmetic mean) method and is based on HHalign matrix42 scores calculated for all families against all pairwise alignments with a length coverage of 33% or more. The alignments for the respective families were taken from a previous report2, except for the Cas12m family, for which an updated alignment (104 proteins) was used. The dashed rectangle corresponds to a tree depth (D) between 1.5 and 2 (D = 2 roughly corresponds to the pairwise HHsearch similarity score of exp(−2D) ≈ 0.02 relative to the self-score) and reflects the D where the subtype assignment is uncertain and requires additional consideration.
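The relation between tree depth and relative similarity quoted in the caption can be checked with a short Python sketch. It assumes the relation is exp(−2D), with a negative exponent, because only that reading reproduces the stated value of about 0.02 at D = 2 (exp(4) would be about 55). The function names are hypothetical.

```python
import math


def relative_similarity(depth):
    """Pairwise score relative to the self-score for a given tree depth D.

    Assumes the relation score / self-score = exp(-2 * D).
    """
    return math.exp(-2.0 * depth)


def depth_from_similarity(score_ratio):
    """Inverse relation: tree depth D for a given relative score in (0, 1]."""
    if not 0.0 < score_ratio <= 1.0:
        raise ValueError("score ratio must lie in (0, 1]")
    return -math.log(score_ratio) / 2.0


# Bounds of the dashed rectangle described in the caption (D between 1.5 and 2).
upper_similarity = relative_similarity(1.5)
lower_similarity = relative_similarity(2.0)
```

Under this assumption, the uncertain zone between D = 1.5 and D = 2 spans relative similarity scores of roughly 0.05 down to 0.02.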

Extended Data Fig. 8 | Pre-crRNA processing mechanism of Cas12a, Cas12i, and Cas12m2. Overall structures (a) and close-up views of the 5′ end of crRNA (b). Cas12a (from Francisella novicida) (PDB ID: 5NG6) and Cas12i (from Lachnospiraceae bacterium ND2006) (PDB ID: 7D8C) process their pre-crRNAs at the WED domain, and the 5′ end of the crRNA is surrounded by the WED domain. In contrast, the 5′ end of the Cas12m2 crRNA is projected away from the WED domain and exposed to the solvent. Target DNA and spacer region of crRNA are omitted for clarity in the Cas12m2–crRNA–target DNA ternary complex structure.

Extended Data Fig. 9 | Single-particle cryo-EM analysis of the Cas12m2–crRNA binary complex. (a) Pre-crRNA processing activities of wild type (WT), H269A/R270A (dWED), and D485A (dRuvC) in TXTL. A plasmid expressing the Cas12m2 CRISPR array containing four spacers was incubated with a plasmid expressing WT, dWED, or dRuvC Cas12m2 in TXTL. The cleavage products were visualized by northern blotting, using a probe that binds to the third spacer of array A (yellow, shown as an asterisk). Array B is similar to array A; only the order of spacers is shifted. (b) Size-exclusion chromatography profile of the Cas12m2–crRNA binary complex. The peak fraction was used for cryo-EM analysis. (c) Single-particle cryo-EM image processing workflow. (d) FSC curve for the 3D reconstruction, with the gold-standard cut-off (FSC = 0.143) marked with a black dotted line. (e) Cryo-EM density map of the Cas12m2–crRNA binary complex. The additional three-helix-like density is indicated by a circle.