HEADER VIRAL PROTEIN/RNA 26-OCT-20 7KJW TITLE STRUCTURE OF HIV-1 REVERSE TRANSCRIPTASE INITIATION COMPLEX CORE WITH TITLE 2 EFAVIRENZ COMPND MOL_ID: 1; COMPND 2 MOLECULE: REVERSE TRANSCRIPTASE/RIBONUCLEASE H; COMPND 3 CHAIN: A; COMPND 4 SYNONYM: PR160GAG-POL; COMPND 5 EC: 2.7.7.49,2.7.7.7,3.1.26.13; COMPND 6 ENGINEERED: YES; COMPND 7 MUTATION: YES; COMPND 8 MOL_ID: 2; COMPND 9 MOLECULE: REVERSE TRANSCRIPTASE P51 SUBUNIT; COMPND 10 CHAIN: B; COMPND 11 ENGINEERED: YES; COMPND 12 MOL_ID: 3; COMPND 13 MOLECULE: HIV-1 VIRAL RNA FRAGMENT; COMPND 14 CHAIN: C; COMPND 15 ENGINEERED: YES; COMPND 16 MOL_ID: 4; COMPND 17 MOLECULE: TRNA LYSINE 3 FRAGMENT WITH GNRA TETRALOOP; COMPND 18 CHAIN: D; COMPND 19 ENGINEERED: YES SOURCE MOL_ID: 1; SOURCE 2 ORGANISM_SCIENTIFIC: HUMAN IMMUNODEFICIENCY VIRUS TYPE 1 GROUP M SOURCE 3 SUBTYPE B (ISOLATE BH10); SOURCE 4 ORGANISM_COMMON: HIV-1; SOURCE 5 ORGANISM_TAXID: 11678; SOURCE 6 STRAIN: ISOLATE BH10; SOURCE 7 GENE: GAG-POL; SOURCE 8 EXPRESSION_SYSTEM: ESCHERICHIA COLI; SOURCE 9 EXPRESSION_SYSTEM_TAXID: 562; SOURCE 10 MOL_ID: 2; SOURCE 11 ORGANISM_SCIENTIFIC: HUMAN IMMUNODEFICIENCY VIRUS 1; SOURCE 12 ORGANISM_COMMON: HIV-1; SOURCE 13 ORGANISM_TAXID: 11676; SOURCE 14 STRAIN: ISOLATE BH10; SOURCE 15 GENE: GAG-POL; SOURCE 16 EXPRESSION_SYSTEM: ESCHERICHIA COLI; SOURCE 17 EXPRESSION_SYSTEM_TAXID: 562; SOURCE 18 MOL_ID: 3; SOURCE 19 SYNTHETIC: YES; SOURCE 20 ORGANISM_SCIENTIFIC: HUMAN IMMUNODEFICIENCY VIRUS 1; SOURCE 21 ORGANISM_TAXID: 11676; SOURCE 22 MOL_ID: 4; SOURCE 23 SYNTHETIC: YES; SOURCE 24 ORGANISM_SCIENTIFIC: HOMO SAPIENS; SOURCE 25 ORGANISM_TAXID: 9606 KEYWDS REVERSE TRANSCRIPTASE, RNA, PROTEIN-RNA COMPLEX, TRNA, POLYMERASE, KEYWDS 2 VIRAL PROTEIN, VIRAL PROTEIN-RNA COMPLEX EXPDTA ELECTRON MICROSCOPY AUTHOR B.HA,K.P.LARSEN,J.ZHANG,Z.FU,E.MONTABANA,L.N.JACKSON,D.H.CHEN, AUTHOR 2 E.V.PUGLISI REVDAT 2 12-MAY-21 7KJW 1 JRNL REVDAT 1 17-MAR-21 7KJW 0 JRNL AUTH B.HA,K.P.LARSEN,J.ZHANG,Z.FU,E.MONTABANA,L.N.JACKSON, JRNL AUTH 2 D.H.CHEN,E.V.PUGLISI JRNL TITL HIGH-RESOLUTION VIEW OF HIV-1 REVERSE TRANSCRIPTASE JRNL TITL 2 INITIATION COMPLEXES AND INHIBITION BY NNRTI DRUGS. JRNL REF NAT COMMUN V. 12 2500 2021 JRNL REFN ESSN 2041-1723 JRNL PMID 33947853 JRNL DOI 10.1038/S41467-021-22628-9 REMARK 2 REMARK 2 RESOLUTION. 2.90 ANGSTROMS. REMARK 3 REMARK 3 REFINEMENT. REMARK 3 SOFTWARE PACKAGES : SERIALEM, GCTF, PHENIX, RELION REMARK 3 RECONSTRUCTION SCHEMA : NULL REMARK 3 REMARK 3 EM MAP-MODEL FITTING AND REFINEMENT REMARK 3 PDB ENTRY : NULL REMARK 3 REFINEMENT SPACE : REAL REMARK 3 REFINEMENT PROTOCOL : NULL REMARK 3 REFINEMENT TARGET : NULL REMARK 3 OVERALL ANISOTROPIC B VALUE : NULL REMARK 3 REMARK 3 FITTING PROCEDURE : NULL REMARK 3 REMARK 3 EM IMAGE RECONSTRUCTION STATISTICS REMARK 3 NOMINAL PIXEL SIZE (ANGSTROMS) : NULL REMARK 3 ACTUAL PIXEL SIZE (ANGSTROMS) : NULL REMARK 3 EFFECTIVE RESOLUTION (ANGSTROMS) : 2.900 REMARK 3 NUMBER OF PARTICLES : 112355 REMARK 3 CTF CORRECTION METHOD : PHASE FLIPPING AND AMPLITUDE REMARK 3 CORRECTION REMARK 3 REMARK 3 EM RECONSTRUCTION MAGNIFICATION CALIBRATION: NULL REMARK 3 REMARK 3 OTHER DETAILS: NULL REMARK 4 REMARK 4 7KJW COMPLIES WITH FORMAT V. 3.30, 13-JUL-11 REMARK 100 REMARK 100 THIS ENTRY HAS BEEN PROCESSED BY RCSB ON 27-OCT-20. REMARK 100 THE DEPOSITION ID IS D_1000252429. REMARK 245 REMARK 245 EXPERIMENTAL DETAILS REMARK 245 RECONSTRUCTION METHOD : SINGLE PARTICLE REMARK 245 SPECIMEN TYPE : NULL REMARK 245 REMARK 245 ELECTRON MICROSCOPE SAMPLE REMARK 245 SAMPLE TYPE : PARTICLE REMARK 245 PARTICLE TYPE : POINT REMARK 245 NAME OF SAMPLE : HIV-1 REVERSE TRANSCRIPTASE REMARK 245 INITIATION COMPLEX CORE WITH REMARK 245 EFAVIRENZ REMARK 245 SAMPLE CONCENTRATION (MG ML-1) : 10.00 REMARK 245 SAMPLE SUPPORT DETAILS : NULL REMARK 245 SAMPLE VITRIFICATION DETAILS : NULL REMARK 245 SAMPLE BUFFER : NULL REMARK 245 PH : 8.00 REMARK 245 SAMPLE DETAILS : PERIPHERAL TRNA AND ENGINEERED REMARK 245 STEM LOOP WERE MASKED OUT DURING CRYO-EM DATA PROCESSING. REMARK 245 REMARK 245 DATA ACQUISITION REMARK 245 DATE OF EXPERIMENT : NULL REMARK 245 NUMBER OF MICROGRAPHS-IMAGES : NULL REMARK 245 TEMPERATURE (KELVIN) : NULL REMARK 245 MICROSCOPE MODEL : FEI TITAN KRIOS REMARK 245 DETECTOR TYPE : GATAN K2 SUMMIT (4K X 4K) REMARK 245 MINIMUM DEFOCUS (NM) : NULL REMARK 245 MAXIMUM DEFOCUS (NM) : NULL REMARK 245 MINIMUM TILT ANGLE (DEGREES) : NULL REMARK 245 MAXIMUM TILT ANGLE (DEGREES) : NULL REMARK 245 NOMINAL CS : NULL REMARK 245 IMAGING MODE : BRIGHT FIELD REMARK 245 ELECTRON DOSE (ELECTRONS NM**-2) : 10000.00 REMARK 245 ILLUMINATION MODE : FLOOD BEAM REMARK 245 NOMINAL MAGNIFICATION : NULL REMARK 245 CALIBRATED MAGNIFICATION : NULL REMARK 245 SOURCE : FIELD EMISSION GUN REMARK 245 ACCELERATION VOLTAGE (KV) : 300 REMARK 245 IMAGING DETAILS : NULL REMARK 247 REMARK 247 ELECTRON MICROSCOPY REMARK 247 THE COORDINATES IN THIS ENTRY WERE GENERATED FROM ELECTRON REMARK 247 MICROSCOPY DATA. PROTEIN DATA BANK CONVENTIONS REQUIRE REMARK 247 THAT CRYST1 AND SCALE RECORDS BE INCLUDED, BUT THE VALUES REMARK 247 ON THESE RECORDS ARE MEANINGLESS EXCEPT FOR THE CALCULATION REMARK 247 OF THE STRUCTURE FACTORS. REMARK 300 REMARK 300 BIOMOLECULE: 1 REMARK 300 SEE REMARK 350 FOR THE AUTHOR PROVIDED AND/OR PROGRAM REMARK 300 GENERATED ASSEMBLY INFORMATION FOR THE STRUCTURE IN REMARK 300 THIS ENTRY. THE REMARK MAY ALSO PROVIDE INFORMATION ON REMARK 300 BURIED SURFACE AREA. REMARK 350 REMARK 350 COORDINATES FOR A COMPLETE MULTIMER REPRESENTING THE KNOWN REMARK 350 BIOLOGICALLY SIGNIFICANT OLIGOMERIZATION STATE OF THE REMARK 350 MOLECULE CAN BE GENERATED BY APPLYING BIOMT TRANSFORMATIONS REMARK 350 GIVEN BELOW. BOTH NON-CRYSTALLOGRAPHIC AND REMARK 350 CRYSTALLOGRAPHIC OPERATIONS ARE GIVEN. REMARK 350 REMARK 350 BIOMOLECULE: 1 REMARK 350 AUTHOR DETERMINED BIOLOGICAL UNIT: TETRAMERIC REMARK 350 APPLY THE FOLLOWING TO CHAINS: A, B, C, D REMARK 350 BIOMT1 1 1.000000 0.000000 0.000000 0.00000 REMARK 350 BIOMT2 1 0.000000 1.000000 0.000000 0.00000 REMARK 350 BIOMT3 1 0.000000 0.000000 1.000000 0.00000 REMARK 465 REMARK 465 MISSING RESIDUES REMARK 465 THE FOLLOWING RESIDUES WERE NOT LOCATED IN THE REMARK 465 EXPERIMENT. (M=MODEL NUMBER; RES=RESIDUE NAME; C=CHAIN REMARK 465 IDENTIFIER; SSSEQ=SEQUENCE NUMBER; I=INSERTION CODE.) REMARK 465 REMARK 465 M RES C SSSEQI REMARK 465 MET A -1 REMARK 465 VAL A 0 REMARK 465 PRO A 1 REMARK 465 ILE A 2 REMARK 465 SER A 3 REMARK 465 LYS A 64 REMARK 465 LYS A 65 REMARK 465 LYS A 66 REMARK 465 ASP A 67 REMARK 465 SER A 68 REMARK 465 THR A 69 REMARK 465 ILE A 135 REMARK 465 ASN A 136 REMARK 465 ASN A 137 REMARK 465 GLU A 138 REMARK 465 THR A 139 REMARK 465 PRO A 140 REMARK 465 ILE A 559 REMARK 465 LEU A 560 REMARK 465 MET B -1 REMARK 465 VAL B 0 REMARK 465 PRO B 1 REMARK 465 ILE B 2 REMARK 465 SER B 3 REMARK 465 PRO B 4 REMARK 465 PRO B 217 REMARK 465 ASP B 218 REMARK 465 LYS B 219 REMARK 465 LYS B 220 REMARK 465 HIS B 221 REMARK 465 GLN B 222 REMARK 465 LYS B 223 REMARK 465 GLU B 224 REMARK 465 PRO B 225 REMARK 465 PRO B 226 REMARK 465 PHE B 227 REMARK 465 LEU B 228 REMARK 465 TRP B 229 REMARK 465 MET B 230 REMARK 465 ARG B 358 REMARK 465 GLY B 359 REMARK 465 ALA B 360 REMARK 465 HIS B 361 REMARK 465 LEU B 429 REMARK 465 GLU B 430 REMARK 465 LYS B 431 REMARK 465 GLU B 432 REMARK 465 PRO B 433 REMARK 465 ILE B 434 REMARK 465 VAL B 435 REMARK 465 GLY B 436 REMARK 465 ALA B 437 REMARK 465 GLU B 438 REMARK 465 THR B 439 REMARK 465 PHE B 440 REMARK 465 G C 178 REMARK 465 C C 179 REMARK 465 A C 180 REMARK 465 G C 181 REMARK 465 G D 38 REMARK 465 C D 39 REMARK 465 C D 40 REMARK 465 C D 41 REMARK 465 G D 42 REMARK 465 G D 43 REMARK 465 G D 44 REMARK 465 A D 45 REMARK 465 A D 46 REMARK 465 A D 47 REMARK 465 C D 48 REMARK 465 C D 49 REMARK 465 A D 50 REMARK 465 G D 51 REMARK 465 G D 52 REMARK 465 G D 53 REMARK 465 U D 54 REMARK 470 REMARK 470 MISSING ATOM REMARK 470 THE FOLLOWING RESIDUES HAVE MISSING ATOMS (M=MODEL NUMBER; REMARK 470 RES=RESIDUE NAME; C=CHAIN IDENTIFIER; SSEQ=SEQUENCE NUMBER; REMARK 470 I=INSERTION CODE): REMARK 470 M RES CSSEQI ATOMS REMARK 470 GLU A 28 CG CD OE1 OE2 REMARK 470 LYS A 32 CG CD CE NZ REMARK 470 GLU A 36 CG CD OE1 OE2 REMARK 470 GLU A 40 CG CD OE1 OE2 REMARK 470 LYS A 43 CG CD CE NZ REMARK 470 ILE A 50 CG1 CG2 CD1 REMARK 470 GLU A 53 CG CD OE1 OE2 REMARK 470 LYS A 70 CG CD CE NZ REMARK 470 GLU A 79 CG CD OE1 OE2 REMARK 470 ASP A 86 CG OD1 OD2 REMARK 470 LEU A 92 CG CD1 CD2 REMARK 470 SER A 117 OG REMARK 470 ARG A 143 CG CD NE CZ NH1 NH2 REMARK 470 ASP A 177 CG OD1 OD2 REMARK 470 SER A 191 OG REMARK 470 GLU A 194 CG CD OE1 OE2 REMARK 470 ASP A 218 CG OD1 OD2 REMARK 470 LYS A 220 CG CD CE NZ REMARK 470 THR A 286 OG1 CG2 REMARK 470 LYS A 287 CG CD CE NZ REMARK 470 THR A 290 OG1 CG2 REMARK 470 ASP A 324 CG OD1 OD2 REMARK 470 GLN A 334 CG CD OE1 NE2 REMARK 470 GLN A 428 CG CD OE1 NE2 REMARK 470 ARG A 448 CG CD NE CZ NH1 NH2 REMARK 470 GLU A 449 CG CD OE1 OE2 REMARK 470 ASN A 471 CG OD1 ND2 REMARK 470 ARG A 557 CG CD NE CZ NH1 NH2 REMARK 470 LYS A 558 CG CD CE NZ REMARK 470 ILE B 5 CG1 CG2 CD1 REMARK 470 GLU B 6 CG CD OE1 OE2 REMARK 470 LYS B 11 CG CD CE NZ REMARK 470 LYS B 22 CG CD CE NZ REMARK 470 LYS B 66 CG CD CE NZ REMARK 470 ASP B 67 CG OD1 OD2 REMARK 470 ASP B 86 CG OD1 OD2 REMARK 470 SER B 162 OG REMARK 470 LYS B 166 CG CD CE NZ REMARK 470 GLN B 174 CG CD OE1 NE2 REMARK 470 LEU B 214 CG CD1 CD2 REMARK 470 THR B 240 OG1 CG2 REMARK 470 ASP B 250 CG OD1 OD2 REMARK 470 GLU B 297 CG CD OE1 OE2 REMARK 470 GLU B 298 CG CD OE1 OE2 REMARK 470 HIS B 315 CG ND1 CD2 CE1 NE2 REMARK 470 GLN B 334 CG CD OE1 NE2 REMARK 470 ARG B 356 CG CD NE CZ NH1 NH2 REMARK 470 LYS B 388 CG CD CE NZ REMARK 470 GLN B 428 CG CD OE1 NE2 REMARK 500 REMARK 500 GEOMETRY AND STEREOCHEMISTRY REMARK 500 SUBTOPIC: COVALENT BOND LENGTHS REMARK 500 REMARK 500 THE STEREOCHEMICAL PARAMETERS OF THE FOLLOWING RESIDUES REMARK 500 HAVE VALUES WHICH DEVIATE FROM EXPECTED VALUES BY MORE REMARK 500 THAN 6*RMSD (M=MODEL NUMBER; RES=RESIDUE NAME; C=CHAIN REMARK 500 IDENTIFIER; SSEQ=SEQUENCE NUMBER; I=INSERTION CODE). REMARK 500 REMARK 500 STANDARD TABLE: REMARK 500 FORMAT: (10X,I3,1X,2(A3,1X,A1,I4,A1,1X,A4,3X),1X,F6.3) REMARK 500 REMARK 500 EXPECTED VALUES PROTEIN: ENGH AND HUBER, 1999 REMARK 500 EXPECTED VALUES NUCLEIC ACID: CLOWNEY ET AL 1996 REMARK 500 REMARK 500 M RES CSSEQI ATM1 RES CSSEQI ATM2 DEVIATION REMARK 500 G D 70 O3' G47 D 71 P 0.138 REMARK 500 REMARK 500 REMARK: NULL REMARK 500 REMARK 500 GEOMETRY AND STEREOCHEMISTRY REMARK 500 SUBTOPIC: COVALENT BOND ANGLES REMARK 500 REMARK 500 THE STEREOCHEMICAL PARAMETERS OF THE FOLLOWING RESIDUES REMARK 500 HAVE VALUES WHICH DEVIATE FROM EXPECTED VALUES BY MORE REMARK 500 THAN 6*RMSD (M=MODEL NUMBER; RES=RESIDUE NAME; C=CHAIN REMARK 500 IDENTIFIER; SSEQ=SEQUENCE NUMBER; I=INSERTION CODE). REMARK 500 REMARK 500 STANDARD TABLE: REMARK 500 FORMAT: (10X,I3,1X,A3,1X,A1,I4,A1,3(1X,A4,2X),12X,F5.1) REMARK 500 REMARK 500 EXPECTED VALUES PROTEIN: ENGH AND HUBER, 1999 REMARK 500 EXPECTED VALUES NUCLEIC ACID: CLOWNEY ET AL 1996 REMARK 500 REMARK 500 M RES CSSEQI ATM1 ATM2 ATM3 REMARK 500 G D 70 C3' - O3' - P ANGL. DEV. = -9.8 DEGREES REMARK 500 REMARK 500 REMARK: NULL REMARK 500 REMARK 500 GEOMETRY AND STEREOCHEMISTRY REMARK 500 SUBTOPIC: TORSION ANGLES REMARK 500 REMARK 500 TORSION ANGLES OUTSIDE THE EXPECTED RAMACHANDRAN REGIONS: REMARK 500 (M=MODEL NUMBER; RES=RESIDUE NAME; C=CHAIN IDENTIFIER; REMARK 500 SSEQ=SEQUENCE NUMBER; I=INSERTION CODE). REMARK 500 REMARK 500 STANDARD TABLE: REMARK 500 FORMAT:(10X,I3,1X,A3,1X,A1,I4,A1,4X,F7.2,3X,F7.2) REMARK 500 REMARK 500 EXPECTED VALUES: GJ KLEYWEGT AND TA JONES (1996). PHI/PSI- REMARK 500 CHOLOGY: RAMACHANDRAN REVISITED. STRUCTURE 4, 1395 - 1400 REMARK 500 REMARK 500 M RES CSSEQI PSI PHI REMARK 500 PRO A 25 103.51 -52.47 REMARK 500 THR A 128 53.44 -93.18 REMARK 500 PRO A 392 46.42 -86.73 REMARK 500 ALA A 408 52.46 -90.59 REMARK 500 PRO A 412 -178.53 -66.24 REMARK 500 LYS A 451 29.89 47.98 REMARK 500 ILE A 556 -60.90 -122.03 REMARK 500 TYR B 183 -64.98 -93.77 REMARK 500 MET B 184 -55.51 -120.91 REMARK 500 REMARK 500 REMARK: NULL REMARK 900 REMARK 900 RELATED ENTRIES REMARK 900 RELATED ID: EMD-22900 RELATED DB: EMDB REMARK 900 STRUCTURE OF HIV-1 REVERSE TRANSCRIPTASE INITIATION COMPLEX CORE REMARK 900 WITH EFAVIRENZ DBREF 7KJW A 1 560 UNP P03366 POL_HV1B1 600 1159 DBREF 7KJW B 1 440 UNP P03366 POL_HV1B1 600 1039 DBREF 7KJW C 178 203 PDB 7KJW 7KJW 178 203 DBREF 7KJW D 38 76 PDB 7KJW 7KJW 38 76 SEQADV 7KJW MET A -1 UNP P03366 EXPRESSION TAG SEQADV 7KJW VAL A 0 UNP P03366 EXPRESSION TAG SEQADV 7KJW CYS A 258 UNP P03366 GLN 857 ENGINEERED MUTATION SEQADV 7KJW SER A 280 UNP P03366 CYS 879 CONFLICT SEQADV 7KJW GLN A 478 UNP P03366 GLU 1077 ENGINEERED MUTATION SEQADV 7KJW MET B -1 UNP P03366 EXPRESSION TAG SEQADV 7KJW VAL B 0 UNP P03366 EXPRESSION TAG SEQADV 7KJW SER B 280 UNP P03366 CYS 879 ENGINEERED MUTATION SEQRES 1 A 562 MET VAL PRO ILE SER PRO ILE GLU THR VAL PRO VAL LYS SEQRES 2 A 562 LEU LYS PRO GLY MET ASP GLY PRO LYS VAL LYS GLN TRP SEQRES 3 A 562 PRO LEU THR GLU GLU LYS ILE LYS ALA LEU VAL GLU ILE SEQRES 4 A 562 CYS THR GLU MET GLU LYS GLU GLY LYS ILE SER LYS ILE SEQRES 5 A 562 GLY PRO GLU ASN PRO TYR ASN THR PRO VAL PHE ALA ILE SEQRES 6 A 562 LYS LYS LYS ASP SER THR LYS TRP ARG LYS LEU VAL ASP SEQRES 7 A 562 PHE ARG GLU LEU ASN LYS ARG THR GLN ASP PHE TRP GLU SEQRES 8 A 562 VAL GLN LEU GLY ILE PRO HIS PRO ALA GLY LEU LYS LYS SEQRES 9 A 562 LYS LYS SER VAL THR VAL LEU ASP VAL GLY ASP ALA TYR SEQRES 10 A 562 PHE SER VAL PRO LEU ASP GLU ASP PHE ARG LYS TYR THR SEQRES 11 A 562 ALA PHE THR ILE PRO SER ILE ASN ASN GLU THR PRO GLY SEQRES 12 A 562 ILE ARG TYR GLN TYR ASN VAL LEU PRO GLN GLY TRP LYS SEQRES 13 A 562 GLY SER PRO ALA ILE PHE GLN SER SER MET THR LYS ILE SEQRES 14 A 562 LEU GLU PRO PHE LYS LYS GLN ASN PRO ASP ILE VAL ILE SEQRES 15 A 562 TYR GLN TYR MET ASP ASP LEU TYR VAL GLY SER ASP LEU SEQRES 16 A 562 GLU ILE GLY GLN HIS ARG THR LYS ILE GLU GLU LEU ARG SEQRES 17 A 562 GLN HIS LEU LEU ARG TRP GLY LEU THR THR PRO ASP LYS SEQRES 18 A 562 LYS HIS GLN LYS GLU PRO PRO PHE LEU TRP MET GLY TYR SEQRES 19 A 562 GLU LEU HIS PRO ASP LYS TRP THR VAL GLN PRO ILE VAL SEQRES 20 A 562 LEU PRO GLU LYS ASP SER TRP THR VAL ASN ASP ILE CYS SEQRES 21 A 562 LYS LEU VAL GLY LYS LEU ASN TRP ALA SER GLN ILE TYR SEQRES 22 A 562 PRO GLY ILE LYS VAL ARG GLN LEU SER LYS LEU LEU ARG SEQRES 23 A 562 GLY THR LYS ALA LEU THR GLU VAL ILE PRO LEU THR GLU SEQRES 24 A 562 GLU ALA GLU LEU GLU LEU ALA GLU ASN ARG GLU ILE LEU SEQRES 25 A 562 LYS GLU PRO VAL HIS GLY VAL TYR TYR ASP PRO SER LYS SEQRES 26 A 562 ASP LEU ILE ALA GLU ILE GLN LYS GLN GLY GLN GLY GLN SEQRES 27 A 562 TRP THR TYR GLN ILE TYR GLN GLU PRO PHE LYS ASN LEU SEQRES 28 A 562 LYS THR GLY LYS TYR ALA ARG MET ARG GLY ALA HIS THR SEQRES 29 A 562 ASN ASP VAL LYS GLN LEU THR GLU ALA VAL GLN LYS ILE SEQRES 30 A 562 THR THR GLU SER ILE VAL ILE TRP GLY LYS THR PRO LYS SEQRES 31 A 562 PHE LYS LEU PRO ILE GLN LYS GLU THR TRP GLU THR TRP SEQRES 32 A 562 TRP THR GLU TYR TRP GLN ALA THR TRP ILE PRO GLU TRP SEQRES 33 A 562 GLU PHE VAL ASN THR PRO PRO LEU VAL LYS LEU TRP TYR SEQRES 34 A 562 GLN LEU GLU LYS GLU PRO ILE VAL GLY ALA GLU THR PHE SEQRES 35 A 562 TYR VAL ASP GLY ALA ALA ASN ARG GLU THR LYS LEU GLY SEQRES 36 A 562 LYS ALA GLY TYR VAL THR ASN LYS GLY ARG GLN LYS VAL SEQRES 37 A 562 VAL PRO LEU THR ASN THR THR ASN GLN LYS THR GLN LEU SEQRES 38 A 562 GLN ALA ILE TYR LEU ALA LEU GLN ASP SER GLY LEU GLU SEQRES 39 A 562 VAL ASN ILE VAL THR ASP SER GLN TYR ALA LEU GLY ILE SEQRES 40 A 562 ILE GLN ALA GLN PRO ASP LYS SER GLU SER GLU LEU VAL SEQRES 41 A 562 ASN GLN ILE ILE GLU GLN LEU ILE LYS LYS GLU LYS VAL SEQRES 42 A 562 TYR LEU ALA TRP VAL PRO ALA HIS LYS GLY ILE GLY GLY SEQRES 43 A 562 ASN GLU GLN VAL ASP LYS LEU VAL SER ALA GLY ILE ARG SEQRES 44 A 562 LYS ILE LEU SEQRES 1 B 442 MET VAL PRO ILE SER PRO ILE GLU THR VAL PRO VAL LYS SEQRES 2 B 442 LEU LYS PRO GLY MET ASP GLY PRO LYS VAL LYS GLN TRP SEQRES 3 B 442 PRO LEU THR GLU GLU LYS ILE LYS ALA LEU VAL GLU ILE SEQRES 4 B 442 CYS THR GLU MET GLU LYS GLU GLY LYS ILE SER LYS ILE SEQRES 5 B 442 GLY PRO GLU ASN PRO TYR ASN THR PRO VAL PHE ALA ILE SEQRES 6 B 442 LYS LYS LYS ASP SER THR LYS TRP ARG LYS LEU VAL ASP SEQRES 7 B 442 PHE ARG GLU LEU ASN LYS ARG THR GLN ASP PHE TRP GLU SEQRES 8 B 442 VAL GLN LEU GLY ILE PRO HIS PRO ALA GLY LEU LYS LYS SEQRES 9 B 442 LYS LYS SER VAL THR VAL LEU ASP VAL GLY ASP ALA TYR SEQRES 10 B 442 PHE SER VAL PRO LEU ASP GLU ASP PHE ARG LYS TYR THR SEQRES 11 B 442 ALA PHE THR ILE PRO SER ILE ASN ASN GLU THR PRO GLY SEQRES 12 B 442 ILE ARG TYR GLN TYR ASN VAL LEU PRO GLN GLY TRP LYS SEQRES 13 B 442 GLY SER PRO ALA ILE PHE GLN SER SER MET THR LYS ILE SEQRES 14 B 442 LEU GLU PRO PHE LYS LYS GLN ASN PRO ASP ILE VAL ILE SEQRES 15 B 442 TYR GLN TYR MET ASP ASP LEU TYR VAL GLY SER ASP LEU SEQRES 16 B 442 GLU ILE GLY GLN HIS ARG THR LYS ILE GLU GLU LEU ARG SEQRES 17 B 442 GLN HIS LEU LEU ARG TRP GLY LEU THR THR PRO ASP LYS SEQRES 18 B 442 LYS HIS GLN LYS GLU PRO PRO PHE LEU TRP MET GLY TYR SEQRES 19 B 442 GLU LEU HIS PRO ASP LYS TRP THR VAL GLN PRO ILE VAL SEQRES 20 B 442 LEU PRO GLU LYS ASP SER TRP THR VAL ASN ASP ILE GLN SEQRES 21 B 442 LYS LEU VAL GLY LYS LEU ASN TRP ALA SER GLN ILE TYR SEQRES 22 B 442 PRO GLY ILE LYS VAL ARG GLN LEU SER LYS LEU LEU ARG SEQRES 23 B 442 GLY THR LYS ALA LEU THR GLU VAL ILE PRO LEU THR GLU SEQRES 24 B 442 GLU ALA GLU LEU GLU LEU ALA GLU ASN ARG GLU ILE LEU SEQRES 25 B 442 LYS GLU PRO VAL HIS GLY VAL TYR TYR ASP PRO SER LYS SEQRES 26 B 442 ASP LEU ILE ALA GLU ILE GLN LYS GLN GLY GLN GLY GLN SEQRES 27 B 442 TRP THR TYR GLN ILE TYR GLN GLU PRO PHE LYS ASN LEU SEQRES 28 B 442 LYS THR GLY LYS TYR ALA ARG MET ARG GLY ALA HIS THR SEQRES 29 B 442 ASN ASP VAL LYS GLN LEU THR GLU ALA VAL GLN LYS ILE SEQRES 30 B 442 THR THR GLU SER ILE VAL ILE TRP GLY LYS THR PRO LYS SEQRES 31 B 442 PHE LYS LEU PRO ILE GLN LYS GLU THR TRP GLU THR TRP SEQRES 32 B 442 TRP THR GLU TYR TRP GLN ALA THR TRP ILE PRO GLU TRP SEQRES 33 B 442 GLU PHE VAL ASN THR PRO PRO LEU VAL LYS LEU TRP TYR SEQRES 34 B 442 GLN LEU GLU LYS GLU PRO ILE VAL GLY ALA GLU THR PHE SEQRES 1 C 26 G C A G U G G C G C C C G SEQRES 2 C 26 A A C A G G G A C U U G A SEQRES 1 D 39 G C C C G G G A A A C C A SEQRES 2 D 39 G G G U U C A A G U C C C SEQRES 3 D 39 U G U U C G G G47 C G C C A HET G47 D 71 25 HET MG A 601 1 HET EFZ A 602 21 HETNAM G47 N2-ETHANETHIOL-2'-DEOXY-GUANOSINE-5'-MONOPHOSPHATE HETNAM MG MAGNESIUM ION HETNAM EFZ (-)-6-CHLORO-4-CYCLOPROPYLETHYNYL-4-TRIFLUOROMETHYL-1, HETNAM 2 EFZ 4-DIHYDRO-2H-3,1-BENZOXAZIN-2-ONE HETSYN EFZ DMP-266; EFAVIRENZ FORMUL 4 G47 C12 H18 N5 O7 P S FORMUL 5 MG MG 2+ FORMUL 6 EFZ C14 H9 CL F3 N O2 HELIX 1 AA1 THR A 27 GLU A 44 1 18 HELIX 2 AA2 PHE A 77 LYS A 82 1 6 HELIX 3 AA3 HIS A 96 LEU A 100 5 5 HELIX 4 AA4 ALA A 114 VAL A 118 5 5 HELIX 5 AA5 PHE A 124 THR A 128 5 5 HELIX 6 AA6 LYS A 154 LYS A 173 1 20 HELIX 7 AA7 GLU A 194 TRP A 212 1 19 HELIX 8 AA8 VAL A 254 GLN A 269 1 16 HELIX 9 AA9 VAL A 276 LEU A 282 1 7 HELIX 10 AB1 THR A 296 LEU A 310 1 15 HELIX 11 AB2 ASN A 363 GLY A 384 1 22 HELIX 12 AB3 GLN A 394 GLU A 404 1 11 HELIX 13 AB4 THR A 473 SER A 489 1 17 HELIX 14 AB5 SER A 499 ALA A 508 1 10 HELIX 15 AB6 SER A 515 LYS A 528 1 14 HELIX 16 AB7 GLY A 543 VAL A 552 1 10 HELIX 17 AB8 THR B 27 GLY B 45 1 19 HELIX 18 AB9 PHE B 77 THR B 84 1 8 HELIX 19 AC1 GLY B 99 LYS B 103 5 5 HELIX 20 AC2 ALA B 114 VAL B 118 5 5 HELIX 21 AC3 ASP B 121 ARG B 125 5 5 HELIX 22 AC4 SER B 134 GLU B 138 5 5 HELIX 23 AC5 LYS B 154 ASN B 175 1 22 HELIX 24 AC6 GLU B 194 TRP B 212 1 19 HELIX 25 AC7 HIS B 235 TRP B 239 5 5 HELIX 26 AC8 VAL B 254 SER B 268 1 15 HELIX 27 AC9 VAL B 276 LEU B 282 1 7 HELIX 28 AD1 THR B 296 GLU B 312 1 17 HELIX 29 AD2 ASN B 363 GLY B 384 1 22 HELIX 30 AD3 GLN B 394 TRP B 402 1 9 HELIX 31 AD4 THR B 403 TYR B 405 5 3 HELIX 32 AD5 VAL B 423 TYR B 427 5 5 SHEET 1 AA1 2 ILE A 47 SER A 48 0 SHEET 2 AA1 2 GLN A 145 TYR A 146 -1 O GLN A 145 N SER A 48 SHEET 1 AA2 2 VAL A 60 ILE A 63 0 SHEET 2 AA2 2 ARG A 72 VAL A 75 -1 O LEU A 74 N PHE A 61 SHEET 1 AA3 3 SER A 105 ASP A 110 0 SHEET 2 AA3 3 ASP A 186 SER A 191 -1 O LEU A 187 N LEU A 109 SHEET 3 AA3 3 VAL A 179 GLN A 182 -1 N TYR A 181 O TYR A 188 SHEET 1 AA4 2 PHE A 227 TRP A 229 0 SHEET 2 AA4 2 TYR A 232 LEU A 234 -1 O LEU A 234 N PHE A 227 SHEET 1 AA5 2 LYS A 238 TRP A 239 0 SHEET 2 AA5 2 GLY A 316 VAL A 317 -1 O GLY A 316 N TRP A 239 SHEET 1 AA6 2 TRP A 252 THR A 253 0 SHEET 2 AA6 2 VAL A 292 ILE A 293 -1 O ILE A 293 N TRP A 252 SHEET 1 AA7 5 LYS A 347 LYS A 353 0 SHEET 2 AA7 5 GLN A 336 GLU A 344 -1 N ILE A 341 O LYS A 350 SHEET 3 AA7 5 ILE A 326 GLY A 333 -1 N GLN A 330 O THR A 338 SHEET 4 AA7 5 LYS A 388 LEU A 391 1 O LYS A 390 N ALA A 327 SHEET 5 AA7 5 TRP A 414 PHE A 416 1 O GLU A 415 N PHE A 389 SHEET 1 AA8 2 HIS A 361 THR A 362 0 SHEET 2 AA8 2 LYS A 512 SER A 513 -1 O LYS A 512 N THR A 362 SHEET 1 AA9 5 LYS A 465 PRO A 468 0 SHEET 2 AA9 5 LYS A 454 THR A 459 -1 N ALA A 455 O VAL A 467 SHEET 3 AA9 5 THR A 439 ALA A 445 -1 N TYR A 441 O VAL A 458 SHEET 4 AA9 5 GLU A 492 THR A 497 1 O ASN A 494 N PHE A 440 SHEET 5 AA9 5 LYS A 530 ALA A 534 1 O ALA A 534 N ILE A 495 SHEET 1 AB1 2 VAL B 60 ILE B 63 0 SHEET 2 AB1 2 ARG B 72 VAL B 75 -1 O LEU B 74 N PHE B 61 SHEET 1 AB2 4 VAL B 179 TYR B 181 0 SHEET 2 AB2 4 ASP B 186 SER B 191 -1 O TYR B 188 N TYR B 181 SHEET 3 AB2 4 SER B 105 ASP B 110 -1 N THR B 107 O VAL B 189 SHEET 4 AB2 4 GLU B 233 LEU B 234 -1 O LEU B 234 N VAL B 106 SHEET 1 AB3 2 PHE B 130 ILE B 132 0 SHEET 2 AB3 2 ILE B 142 TYR B 144 -1 O ILE B 142 N ILE B 132 SHEET 1 AB4 2 TRP B 252 THR B 253 0 SHEET 2 AB4 2 VAL B 292 ILE B 293 -1 O ILE B 293 N TRP B 252 SHEET 1 AB5 5 LYS B 350 TYR B 354 0 SHEET 2 AB5 5 GLN B 336 TYR B 342 -1 N TRP B 337 O TYR B 354 SHEET 3 AB5 5 ILE B 326 GLY B 333 -1 N ILE B 326 O TYR B 342 SHEET 4 AB5 5 LYS B 388 LEU B 391 1 O LYS B 388 N ALA B 327 SHEET 5 AB5 5 TRP B 414 PHE B 416 1 O GLU B 415 N PHE B 389 LINK SG CYS A 258 SG G47 D 71 1555 1555 2.06 LINK O3' G D 70 P G47 D 71 1555 1555 1.75 LINK O3' G47 D 71 P C D 72 1555 1555 1.62 LINK OD2 ASP A 443 MG MG A 601 1555 1555 2.84 CISPEP 1 PRO A 225 PRO A 226 0 1.84 CRYST1 1.000 1.000 1.000 90.00 90.00 90.00 P 1 ORIGX1 1.000000 0.000000 0.000000 0.00000 ORIGX2 0.000000 1.000000 0.000000 0.00000 ORIGX3 0.000000 0.000000 1.000000 0.00000 SCALE1 1.000000 0.000000 0.000000 0.00000 SCALE2 0.000000 1.000000 0.000000 0.00000 SCALE3 0.000000 0.000000 1.000000 0.00000