HEADER VIRAL PROTEIN 21-AUG-21 7V7M TITLE CRYSTAL STRUCTURE OF SARS-COV-2 3CL PROTEASE COMPND MOL_ID: 1; COMPND 2 MOLECULE: 3C-LIKE PROTEINASE; COMPND 3 CHAIN: A; COMPND 4 SYNONYM: MAIN PROTEASE; COMPND 5 EC: 3.4.22.69; COMPND 6 ENGINEERED: YES SOURCE MOL_ID: 1; SOURCE 2 ORGANISM_SCIENTIFIC: SEVERE ACUTE RESPIRATORY SYNDROME CORONAVIRUS SOURCE 3 2; SOURCE 4 ORGANISM_COMMON: 2019-NCOV; SOURCE 5 ORGANISM_TAXID: 2697049; SOURCE 6 EXPRESSION_SYSTEM: ESCHERICHIA COLI; SOURCE 7 EXPRESSION_SYSTEM_TAXID: 562 KEYWDS PROTEASE, VIRAL PROTEIN EXPDTA X-RAY DIFFRACTION AUTHOR Y.YI,M.ZHANG,M.YE REVDAT 3 17-JAN-24 7V7M 1 JRNL REVDAT 2 29-NOV-23 7V7M 1 REMARK REVDAT 1 29-JUN-22 7V7M 0 JRNL AUTH Y.YI,M.ZHANG,H.XUE,R.YU,Y.O.BAO,Y.KUANG,Y.CHAI,W.MA,J.WANG, JRNL AUTH 2 X.SHI,W.LI,W.HONG,J.LI,E.MUTURI,H.WEI,J.WLODARZ,S.ROSZAK, JRNL AUTH 3 X.QIAO,H.YANG,M.YE JRNL TITL SCHAFTOSIDE INHIBITS 3CL PRO AND PL PRO OF SARS-COV-2 VIRUS JRNL TITL 2 AND REGULATES IMMUNE RESPONSE AND INFLAMMATION OF HOST CELLS JRNL TITL 3 FOR THE TREATMENT OF COVID-19. JRNL REF ACTA PHARM SIN B V. 12 4154 2022 JRNL REFN ISSN 2211-3835 JRNL PMID 35968270 JRNL DOI 10.1016/J.APSB.2022.07.017 REMARK 2 REMARK 2 RESOLUTION. 2.08 ANGSTROMS. REMARK 3 REMARK 3 REFINEMENT. REMARK 3 PROGRAM : PHENIX 1.16_3549 REMARK 3 AUTHORS : PAUL ADAMS,PAVEL AFONINE,VINCENT CHEN,IAN REMARK 3 : DAVIS,KRESHNA GOPAL,RALF GROSSE-KUNSTLEVE, REMARK 3 : LI-WEI HUNG,ROBERT IMMORMINO,TOM IOERGER, REMARK 3 : AIRLIE MCCOY,ERIK MCKEE,NIGEL MORIARTY, REMARK 3 : REETAL PAI,RANDY READ,JANE RICHARDSON, REMARK 3 : DAVID RICHARDSON,TOD ROMO,JIM SACCHETTINI, REMARK 3 : NICHOLAS SAUTER,JACOB SMITH,LAURENT REMARK 3 : STORONI,TOM TERWILLIGER,PETER ZWART REMARK 3 REMARK 3 REFINEMENT TARGET : ML REMARK 3 REMARK 3 DATA USED IN REFINEMENT. REMARK 3 RESOLUTION RANGE HIGH (ANGSTROMS) : 2.08 REMARK 3 RESOLUTION RANGE LOW (ANGSTROMS) : 13.09 REMARK 3 MIN(FOBS/SIGMA_FOBS) : 1.360 REMARK 3 COMPLETENESS FOR RANGE (%) : 99.6 REMARK 3 NUMBER OF REFLECTIONS : 15800 REMARK 3 REMARK 3 FIT TO DATA USED IN REFINEMENT. REMARK 3 R VALUE (WORKING + TEST SET) : 0.220 REMARK 3 R VALUE (WORKING SET) : 0.212 REMARK 3 FREE R VALUE : 0.292 REMARK 3 FREE R VALUE TEST SET SIZE (%) : 10.000 REMARK 3 FREE R VALUE TEST SET COUNT : 1580 REMARK 3 REMARK 3 FIT TO DATA USED IN REFINEMENT (IN BINS). REMARK 3 BIN RESOLUTION RANGE COMPL. NWORK NFREE RWORK RFREE REMARK 3 1 13.0880 - 4.5657 0.99 1336 149 0.1711 0.2191 REMARK 3 2 4.5657 - 3.6498 0.99 1299 144 0.1647 0.2605 REMARK 3 3 3.6498 - 3.1961 1.00 1289 143 0.1952 0.2783 REMARK 3 4 3.1961 - 2.9074 1.00 1304 145 0.2163 0.3207 REMARK 3 5 2.9074 - 2.7009 1.00 1291 143 0.2349 0.3077 REMARK 3 6 2.7009 - 2.5429 1.00 1296 144 0.2458 0.3099 REMARK 3 7 2.5429 - 2.4164 1.00 1267 141 0.2353 0.3412 REMARK 3 8 2.4164 - 2.3118 1.00 1296 144 0.2565 0.3304 REMARK 3 9 2.3118 - 2.2233 1.00 1288 143 0.2702 0.3950 REMARK 3 10 2.2233 - 2.1469 0.99 1276 142 0.2625 0.3254 REMARK 3 11 2.1469 - 2.0800 0.99 1278 142 0.2585 0.3288 REMARK 3 REMARK 3 BULK SOLVENT MODELLING. REMARK 3 METHOD USED : FLAT BULK SOLVENT MODEL REMARK 3 SOLVENT RADIUS : 1.11 REMARK 3 SHRINKAGE RADIUS : 0.90 REMARK 3 K_SOL : NULL REMARK 3 B_SOL : NULL REMARK 3 REMARK 3 ERROR ESTIMATES. REMARK 3 COORDINATE ERROR (MAXIMUM-LIKELIHOOD BASED) : 0.310 REMARK 3 PHASE ERROR (DEGREES, MAXIMUM-LIKELIHOOD BASED) : 28.520 REMARK 3 REMARK 3 B VALUES. REMARK 3 FROM WILSON PLOT (A**2) : NULL REMARK 3 MEAN B VALUE (OVERALL, A**2) : 21.80 REMARK 3 OVERALL ANISOTROPIC B VALUE. REMARK 3 B11 (A**2) : NULL REMARK 3 B22 (A**2) : NULL REMARK 3 B33 (A**2) : NULL REMARK 3 B12 (A**2) : NULL REMARK 3 B13 (A**2) : NULL REMARK 3 B23 (A**2) : NULL REMARK 3 REMARK 3 TWINNING INFORMATION. REMARK 3 FRACTION: NULL REMARK 3 OPERATOR: NULL REMARK 3 REMARK 3 DEVIATIONS FROM IDEAL VALUES. REMARK 3 RMSD COUNT REMARK 3 BOND : NULL NULL REMARK 3 ANGLE : NULL NULL REMARK 3 CHIRALITY : NULL NULL REMARK 3 PLANARITY : NULL NULL REMARK 3 DIHEDRAL : NULL NULL REMARK 3 REMARK 3 TLS DETAILS REMARK 3 NUMBER OF TLS GROUPS : NULL REMARK 3 REMARK 3 NCS DETAILS REMARK 3 NUMBER OF NCS GROUPS : NULL REMARK 3 REMARK 3 OTHER REFINEMENT REMARKS: NULL REMARK 4 REMARK 4 7V7M COMPLIES WITH FORMAT V. 3.30, 13-JUL-11 REMARK 100 REMARK 100 THIS ENTRY HAS BEEN PROCESSED BY PDBJ ON 23-AUG-21. REMARK 100 THE DEPOSITION ID IS D_1300024244. REMARK 200 REMARK 200 EXPERIMENTAL DETAILS REMARK 200 EXPERIMENT TYPE : X-RAY DIFFRACTION REMARK 200 DATE OF DATA COLLECTION : 19-AUG-21 REMARK 200 TEMPERATURE (KELVIN) : 100 REMARK 200 PH : 6.0 REMARK 200 NUMBER OF CRYSTALS USED : 1 REMARK 200 REMARK 200 SYNCHROTRON (Y/N) : N REMARK 200 RADIATION SOURCE : ROTATING ANODE REMARK 200 BEAMLINE : NULL REMARK 200 X-RAY GENERATOR MODEL : RIGAKU MICROMAX-007 REMARK 200 MONOCHROMATIC OR LAUE (M/L) : M REMARK 200 WAVELENGTH OR RANGE (A) : 1.541838 REMARK 200 MONOCHROMATOR : NULL REMARK 200 OPTICS : NULL REMARK 200 REMARK 200 DETECTOR TYPE : PIXEL REMARK 200 DETECTOR MANUFACTURER : RIGAKU HYPIX-6000HE REMARK 200 INTENSITY-INTEGRATION SOFTWARE : CRYSALISPRO REMARK 200 DATA SCALING SOFTWARE : AIMLESS REMARK 200 REMARK 200 NUMBER OF UNIQUE REFLECTIONS : 15801 REMARK 200 RESOLUTION RANGE HIGH (A) : 2.080 REMARK 200 RESOLUTION RANGE LOW (A) : 13.090 REMARK 200 REJECTION CRITERIA (SIGMA(I)) : NULL REMARK 200 REMARK 200 OVERALL. REMARK 200 COMPLETENESS FOR RANGE (%) : 99.2 REMARK 200 DATA REDUNDANCY : 6.600 REMARK 200 R MERGE (I) : 0.08500 REMARK 200 R SYM (I) : NULL REMARK 200 FOR THE DATA SET : 15.7000 REMARK 200 REMARK 200 IN THE HIGHEST RESOLUTION SHELL. REMARK 200 HIGHEST RESOLUTION SHELL, RANGE HIGH (A) : 2.08 REMARK 200 HIGHEST RESOLUTION SHELL, RANGE LOW (A) : 2.14 REMARK 200 COMPLETENESS FOR SHELL (%) : NULL REMARK 200 DATA REDUNDANCY IN SHELL : NULL REMARK 200 R MERGE FOR SHELL (I) : 0.48200 REMARK 200 R SYM FOR SHELL (I) : NULL REMARK 200 FOR SHELL : NULL REMARK 200 REMARK 200 DIFFRACTION PROTOCOL: SINGLE WAVELENGTH REMARK 200 METHOD USED TO DETERMINE THE STRUCTURE: MOLECULAR REPLACEMENT REMARK 200 SOFTWARE USED: PHASER REMARK 200 STARTING MODEL: 6LZE REMARK 200 REMARK 200 REMARK: NULL REMARK 280 REMARK 280 CRYSTAL REMARK 280 SOLVENT CONTENT, VS (%): 37.43 REMARK 280 MATTHEWS COEFFICIENT, VM (ANGSTROMS**3/DA): 1.97 REMARK 280 REMARK 280 CRYSTALLIZATION CONDITIONS: 0.05 SODIUM CITRATE TRIBASIC REMARK 280 DIHYDRATE, 0.12 M POTASSIUM CHLORIDE, 0.08 M BIS-TRIS, 14% PEG REMARK 280 4000, PH 6.0, VAPOR DIFFUSION, HANGING DROP, TEMPERATURE 289K REMARK 290 REMARK 290 CRYSTALLOGRAPHIC SYMMETRY REMARK 290 SYMMETRY OPERATORS FOR SPACE GROUP: I 1 2 1 REMARK 290 REMARK 290 SYMOP SYMMETRY REMARK 290 NNNMMM OPERATOR REMARK 290 1555 X,Y,Z REMARK 290 2555 -X,Y,-Z REMARK 290 3555 X+1/2,Y+1/2,Z+1/2 REMARK 290 4555 -X+1/2,Y+1/2,-Z+1/2 REMARK 290 REMARK 290 WHERE NNN -> OPERATOR NUMBER REMARK 290 MMM -> TRANSLATION VECTOR REMARK 290 REMARK 290 CRYSTALLOGRAPHIC SYMMETRY TRANSFORMATIONS REMARK 290 THE FOLLOWING TRANSFORMATIONS OPERATE ON THE ATOM/HETATM REMARK 290 RECORDS IN THIS ENTRY TO PRODUCE CRYSTALLOGRAPHICALLY REMARK 290 RELATED MOLECULES. REMARK 290 SMTRY1 1 1.000000 0.000000 0.000000 0.00000 REMARK 290 SMTRY2 1 0.000000 1.000000 0.000000 0.00000 REMARK 290 SMTRY3 1 0.000000 0.000000 1.000000 0.00000 REMARK 290 SMTRY1 2 -1.000000 0.000000 0.000000 0.00000 REMARK 290 SMTRY2 2 0.000000 1.000000 0.000000 0.00000 REMARK 290 SMTRY3 2 0.000000 0.000000 -1.000000 0.00000 REMARK 290 SMTRY1 3 1.000000 0.000000 0.000000 11.32184 REMARK 290 SMTRY2 3 0.000000 1.000000 0.000000 26.73750 REMARK 290 SMTRY3 3 0.000000 0.000000 1.000000 55.68973 REMARK 290 SMTRY1 4 -1.000000 0.000000 0.000000 11.32184 REMARK 290 SMTRY2 4 0.000000 1.000000 0.000000 26.73750 REMARK 290 SMTRY3 4 0.000000 0.000000 -1.000000 55.68973 REMARK 290 REMARK 290 REMARK: NULL REMARK 300 REMARK 300 BIOMOLECULE: 1 REMARK 300 SEE REMARK 350 FOR THE AUTHOR PROVIDED AND/OR PROGRAM REMARK 300 GENERATED ASSEMBLY INFORMATION FOR THE STRUCTURE IN REMARK 300 THIS ENTRY. THE REMARK MAY ALSO PROVIDE INFORMATION ON REMARK 300 BURIED SURFACE AREA. REMARK 350 REMARK 350 COORDINATES FOR A COMPLETE MULTIMER REPRESENTING THE KNOWN REMARK 350 BIOLOGICALLY SIGNIFICANT OLIGOMERIZATION STATE OF THE REMARK 350 MOLECULE CAN BE GENERATED BY APPLYING BIOMT TRANSFORMATIONS REMARK 350 GIVEN BELOW. BOTH NON-CRYSTALLOGRAPHIC AND REMARK 350 CRYSTALLOGRAPHIC OPERATIONS ARE GIVEN. REMARK 350 REMARK 350 BIOMOLECULE: 1 REMARK 350 AUTHOR DETERMINED BIOLOGICAL UNIT: DIMERIC REMARK 350 SOFTWARE DETERMINED QUATERNARY STRUCTURE: DIMERIC REMARK 350 SOFTWARE USED: PISA REMARK 350 TOTAL BURIED SURFACE AREA: 2450 ANGSTROM**2 REMARK 350 SURFACE AREA OF THE COMPLEX: 24270 ANGSTROM**2 REMARK 350 CHANGE IN SOLVENT FREE ENERGY: -10.0 KCAL/MOL REMARK 350 APPLY THE FOLLOWING TO CHAINS: A REMARK 350 BIOMT1 1 1.000000 0.000000 0.000000 0.00000 REMARK 350 BIOMT2 1 0.000000 1.000000 0.000000 0.00000 REMARK 350 BIOMT3 1 0.000000 0.000000 1.000000 0.00000 REMARK 350 BIOMT1 2 -1.000000 0.000000 0.000000 0.00000 REMARK 350 BIOMT2 2 0.000000 1.000000 0.000000 0.00000 REMARK 350 BIOMT3 2 0.000000 0.000000 -1.000000 0.00000 REMARK 375 REMARK 375 SPECIAL POSITION REMARK 375 THE FOLLOWING ATOMS ARE FOUND TO BE WITHIN 0.15 ANGSTROMS REMARK 375 OF A SYMMETRY RELATED ATOM AND ARE ASSUMED TO BE ON SPECIAL REMARK 375 POSITIONS. REMARK 375 REMARK 375 ATOM RES CSSEQI REMARK 375 HOH A 505 LIES ON A SPECIAL POSITION. REMARK 465 REMARK 465 MISSING RESIDUES REMARK 465 THE FOLLOWING RESIDUES WERE NOT LOCATED IN THE REMARK 465 EXPERIMENT. (M=MODEL NUMBER; RES=RESIDUE NAME; C=CHAIN REMARK 465 IDENTIFIER; SSSEQ=SEQUENCE NUMBER; I=INSERTION CODE.) REMARK 465 REMARK 465 M RES C SSSEQI REMARK 465 ASN A 142 REMARK 465 SER A 301 REMARK 465 GLY A 302 REMARK 465 VAL A 303 REMARK 465 THR A 304 REMARK 465 PHE A 305 REMARK 465 GLN A 306 REMARK 470 REMARK 470 MISSING ATOM REMARK 470 THE FOLLOWING RESIDUES HAVE MISSING ATOMS (M=MODEL NUMBER; REMARK 470 RES=RESIDUE NAME; C=CHAIN IDENTIFIER; SSEQ=SEQUENCE NUMBER; REMARK 470 I=INSERTION CODE): REMARK 470 M RES CSSEQI ATOMS REMARK 470 GLU A 47 CG CD OE1 OE2 REMARK 470 LEU A 50 CG CD1 CD2 REMARK 470 ARG A 60 CG CD NE CZ NH1 NH2 REMARK 470 TYR A 154 CG CD1 CD2 CE1 CE2 CZ OH REMARK 470 ARG A 217 CG CD NE CZ NH1 NH2 REMARK 470 ARG A 222 CG CD NE CZ NH1 NH2 REMARK 470 LEU A 232 CG CD1 CD2 REMARK 470 MET A 235 CG SD CE REMARK 470 LYS A 236 CG CD CE NZ REMARK 470 GLN A 256 CG CD OE1 NE2 REMARK 470 ARG A 298 CG CD NE CZ NH1 NH2 REMARK 500 REMARK 500 GEOMETRY AND STEREOCHEMISTRY REMARK 500 SUBTOPIC: TORSION ANGLES REMARK 500 REMARK 500 TORSION ANGLES OUTSIDE THE EXPECTED RAMACHANDRAN REGIONS: REMARK 500 (M=MODEL NUMBER; RES=RESIDUE NAME; C=CHAIN IDENTIFIER; REMARK 500 SSEQ=SEQUENCE NUMBER; I=INSERTION CODE). REMARK 500 REMARK 500 STANDARD TABLE: REMARK 500 FORMAT:(10X,I3,1X,A3,1X,A1,I4,A1,4X,F7.2,3X,F7.2) REMARK 500 REMARK 500 EXPECTED VALUES: GJ KLEYWEGT AND TA JONES (1996). PHI/PSI- REMARK 500 CHOLOGY: RAMACHANDRAN REVISITED. STRUCTURE 4, 1395 - 1400 REMARK 500 REMARK 500 M RES CSSEQI PSI PHI REMARK 500 ASP A 33 -130.05 54.02 REMARK 500 ASN A 51 73.35 -164.75 REMARK 500 ASN A 84 -121.36 53.98 REMARK 500 TYR A 154 -105.63 68.07 REMARK 500 REMARK 500 REMARK: NULL DBREF 7V7M A 1 306 UNP P0DTC1 R1A_SARS2 3264 3569 SEQRES 1 A 306 SER GLY PHE ARG LYS MET ALA PHE PRO SER GLY LYS VAL SEQRES 2 A 306 GLU GLY CYS MET VAL GLN VAL THR CYS GLY THR THR THR SEQRES 3 A 306 LEU ASN GLY LEU TRP LEU ASP ASP VAL VAL TYR CYS PRO SEQRES 4 A 306 ARG HIS VAL ILE CYS THR SER GLU ASP MET LEU ASN PRO SEQRES 5 A 306 ASN TYR GLU ASP LEU LEU ILE ARG LYS SER ASN HIS ASN SEQRES 6 A 306 PHE LEU VAL GLN ALA GLY ASN VAL GLN LEU ARG VAL ILE SEQRES 7 A 306 GLY HIS SER MET GLN ASN CYS VAL LEU LYS LEU LYS VAL SEQRES 8 A 306 ASP THR ALA ASN PRO LYS THR PRO LYS TYR LYS PHE VAL SEQRES 9 A 306 ARG ILE GLN PRO GLY GLN THR PHE SER VAL LEU ALA CYS SEQRES 10 A 306 TYR ASN GLY SER PRO SER GLY VAL TYR GLN CYS ALA MET SEQRES 11 A 306 ARG PRO ASN PHE THR ILE LYS GLY SER PHE LEU ASN GLY SEQRES 12 A 306 SER CYS GLY SER VAL GLY PHE ASN ILE ASP TYR ASP CYS SEQRES 13 A 306 VAL SER PHE CYS TYR MET HIS HIS MET GLU LEU PRO THR SEQRES 14 A 306 GLY VAL HIS ALA GLY THR ASP LEU GLU GLY ASN PHE TYR SEQRES 15 A 306 GLY PRO PHE VAL ASP ARG GLN THR ALA GLN ALA ALA GLY SEQRES 16 A 306 THR ASP THR THR ILE THR VAL ASN VAL LEU ALA TRP LEU SEQRES 17 A 306 TYR ALA ALA VAL ILE ASN GLY ASP ARG TRP PHE LEU ASN SEQRES 18 A 306 ARG PHE THR THR THR LEU ASN ASP PHE ASN LEU VAL ALA SEQRES 19 A 306 MET LYS TYR ASN TYR GLU PRO LEU THR GLN ASP HIS VAL SEQRES 20 A 306 ASP ILE LEU GLY PRO LEU SER ALA GLN THR GLY ILE ALA SEQRES 21 A 306 VAL LEU ASP MET CYS ALA SER LEU LYS GLU LEU LEU GLN SEQRES 22 A 306 ASN GLY MET ASN GLY ARG THR ILE LEU GLY SER ALA LEU SEQRES 23 A 306 LEU GLU ASP GLU PHE THR PRO PHE ASP VAL VAL ARG GLN SEQRES 24 A 306 CYS SER GLY VAL THR PHE GLN FORMUL 2 HOH *137(H2 O) HELIX 1 AA1 SER A 10 GLY A 15 1 6 HELIX 2 AA2 HIS A 41 CYS A 44 5 4 HELIX 3 AA3 THR A 45 LEU A 50 1 6 HELIX 4 AA4 ASN A 53 ARG A 60 1 8 HELIX 5 AA5 SER A 62 HIS A 64 5 3 HELIX 6 AA6 ILE A 200 ASN A 214 1 15 HELIX 7 AA7 THR A 226 TYR A 237 1 12 HELIX 8 AA8 THR A 243 LEU A 250 1 8 HELIX 9 AA9 LEU A 250 GLY A 258 1 9 HELIX 10 AB1 ALA A 260 GLY A 275 1 16 HELIX 11 AB2 THR A 292 CYS A 300 1 9 SHEET 1 AA1 7 VAL A 73 LEU A 75 0 SHEET 2 AA1 7 PHE A 66 ALA A 70 -1 N VAL A 68 O LEU A 75 SHEET 3 AA1 7 MET A 17 CYS A 22 -1 N THR A 21 O LEU A 67 SHEET 4 AA1 7 THR A 25 LEU A 32 -1 O LEU A 27 N VAL A 20 SHEET 5 AA1 7 VAL A 35 PRO A 39 -1 O TYR A 37 N LEU A 30 SHEET 6 AA1 7 VAL A 86 VAL A 91 -1 O LEU A 89 N VAL A 36 SHEET 7 AA1 7 VAL A 77 GLN A 83 -1 N SER A 81 O LYS A 88 SHEET 1 AA2 5 LYS A 100 PHE A 103 0 SHEET 2 AA2 5 CYS A 156 GLU A 166 1 O VAL A 157 N LYS A 100 SHEET 3 AA2 5 VAL A 148 ASP A 153 -1 N GLY A 149 O TYR A 161 SHEET 4 AA2 5 THR A 111 TYR A 118 -1 N SER A 113 O PHE A 150 SHEET 5 AA2 5 SER A 121 ALA A 129 -1 O TYR A 126 N VAL A 114 SHEET 1 AA3 3 LYS A 100 PHE A 103 0 SHEET 2 AA3 3 CYS A 156 GLU A 166 1 O VAL A 157 N LYS A 100 SHEET 3 AA3 3 HIS A 172 THR A 175 -1 O ALA A 173 N MET A 165 CRYST1 44.657 53.475 113.534 90.00 101.18 90.00 I 1 2 1 4 ORIGX1 1.000000 0.000000 0.000000 0.00000 ORIGX2 0.000000 1.000000 0.000000 0.00000 ORIGX3 0.000000 0.000000 1.000000 0.00000 SCALE1 0.022393 0.000000 0.004425 0.00000 SCALE2 0.000000 0.018700 0.000000 0.00000 SCALE3 0.000000 0.000000 0.008978 0.00000