HEADER HYDROLASE 07-FEB-03 1OCB TITLE STRUCTURE OF THE WILD-TYPE CELLOBIOHYDROLASE CEL6A FROM HUMICOLAS TITLE 2 INSOLENS IN COMPLEX WITH A FLUORESCENT SUBSTRATE CAVEAT 1OCB GDA C 4 HAS WRONG CHIRALITY AT ATOM C1 GDA E 4 HAS WRONG CAVEAT 2 1OCB CHIRALITY AT ATOM C1 COMPND MOL_ID: 1; COMPND 2 MOLECULE: CELLOBIOHYDROLASE II; COMPND 3 CHAIN: A, B; COMPND 4 FRAGMENT: CATALYTIC CORE DOMAIN, RESIDUES 89-450; COMPND 5 SYNONYM: CELLULASE, CEL6A; COMPND 6 EC: 3.2.1.91; COMPND 7 ENGINEERED: YES; COMPND 8 OTHER_DETAILS: N-LINKED N-ACETYLGLUCOSAMINE ON RESIDUE ASN 141 SOURCE MOL_ID: 1; SOURCE 2 ORGANISM_SCIENTIFIC: HUMICOLA INSOLENS; SOURCE 3 ORGANISM_TAXID: 34413; SOURCE 4 EXPRESSION_SYSTEM: ASPERGILLUS ORYZAE; SOURCE 5 EXPRESSION_SYSTEM_TAXID: 5062; SOURCE 6 EXPRESSION_SYSTEM_PLASMID: UNDER CONTROL OF THE FUNGAL AMYLASE SOURCE 7 PROMOTER AND AMYLOGLUCOSIDASE TERMINATOR KEYWDS HYDROLASE, CELLULOSE DEGRADATION, CELLOBIOHYDROLASE, CELLULASE, KEYWDS 2 GLYCOSIDE HYDROLASE FAMILY 6, PROCESSIVE MECHANISM EXPDTA X-RAY DIFFRACTION AUTHOR A.VARROT,T.P.FRANDSEN,I.VON OSSOWSKI,V.BOYER,H.DRIGUEZ,M.SCHULEIN, AUTHOR 2 G.J.DAVIES REVDAT 6 13-DEC-23 1OCB 1 HETNAM HETSYN REVDAT 5 29-JUL-20 1OCB 1 CAVEAT COMPND REMARK HETNAM REVDAT 5 2 1 LINK SITE ATOM REVDAT 4 24-JUN-20 1OCB 1 LINK ATOM REVDAT 3 13-JUL-11 1OCB 1 VERSN REVDAT 2 24-FEB-09 1OCB 1 VERSN REVDAT 1 10-JUL-03 1OCB 0 JRNL AUTH A.VARROT,T.P.FRANDSEN,I.VON OSSOWSKI,V.BOYER,H.DRIGUEZ, JRNL AUTH 2 M.SCHULEIN,G.J.DAVIES JRNL TITL STRUCTURAL BASIS FOR LIGAND BINDING AND PROCESSIVITY IN JRNL TITL 2 CELLOBIOHYDROLASE CEL6A FROM HUMICOLA INSOLENS JRNL REF STRUCTURE V. 11 855 2003 JRNL REFN ISSN 0969-2126 JRNL PMID 12842048 JRNL DOI 10.1016/S0969-2126(03)00124-2 REMARK 2 REMARK 2 RESOLUTION. 1.75 ANGSTROMS. REMARK 3 REMARK 3 REFINEMENT. REMARK 3 PROGRAM : REFMAC 5.1.24 REMARK 3 AUTHORS : MURSHUDOV,SKUBAK,LEBEDEV,PANNU,STEINER, REMARK 3 : NICHOLLS,WINN,LONG,VAGIN REMARK 3 REMARK 3 REFINEMENT TARGET : MAXIMUM LIKELIHOOD REMARK 3 REMARK 3 DATA USED IN REFINEMENT. REMARK 3 RESOLUTION RANGE HIGH (ANGSTROMS) : 1.75 REMARK 3 RESOLUTION RANGE LOW (ANGSTROMS) : 38.92 REMARK 3 DATA CUTOFF (SIGMA(F)) : NULL REMARK 3 COMPLETENESS FOR RANGE (%) : 97.7 REMARK 3 NUMBER OF REFLECTIONS : 63815 REMARK 3 REMARK 3 FIT TO DATA USED IN REFINEMENT. REMARK 3 CROSS-VALIDATION METHOD : THROUGHOUT REMARK 3 FREE R VALUE TEST SET SELECTION : RANDOM REMARK 3 R VALUE (WORKING + TEST SET) : 0.135 REMARK 3 R VALUE (WORKING SET) : 0.133 REMARK 3 FREE R VALUE : 0.168 REMARK 3 FREE R VALUE TEST SET SIZE (%) : 5.000 REMARK 3 FREE R VALUE TEST SET COUNT : 3394 REMARK 3 REMARK 3 FIT IN THE HIGHEST RESOLUTION BIN. REMARK 3 TOTAL NUMBER OF BINS USED : 20 REMARK 3 BIN RESOLUTION RANGE HIGH (A) : 1.75 REMARK 3 BIN RESOLUTION RANGE LOW (A) : 1.79 REMARK 3 REFLECTION IN BIN (WORKING SET) : 4393 REMARK 3 BIN COMPLETENESS (WORKING+TEST) (%) : NULL REMARK 3 BIN R VALUE (WORKING SET) : 0.1470 REMARK 3 BIN FREE R VALUE SET COUNT : 252 REMARK 3 BIN FREE R VALUE : 0.2000 REMARK 3 REMARK 3 NUMBER OF NON-HYDROGEN ATOMS USED IN REFINEMENT. REMARK 3 PROTEIN ATOMS : 5630 REMARK 3 NUCLEIC ACID ATOMS : 0 REMARK 3 HETEROGEN ATOMS : 230 REMARK 3 SOLVENT ATOMS : 715 REMARK 3 REMARK 3 B VALUES. REMARK 3 B VALUE TYPE : LIKELY RESIDUAL REMARK 3 FROM WILSON PLOT (A**2) : NULL REMARK 3 MEAN B VALUE (OVERALL, A**2) : 9.06 REMARK 3 OVERALL ANISOTROPIC B VALUE. REMARK 3 B11 (A**2) : 1.68000 REMARK 3 B22 (A**2) : -1.26000 REMARK 3 B33 (A**2) : -0.17000 REMARK 3 B12 (A**2) : 0.00000 REMARK 3 B13 (A**2) : 0.25000 REMARK 3 B23 (A**2) : 0.00000 REMARK 3 REMARK 3 ESTIMATED OVERALL COORDINATE ERROR. REMARK 3 ESU BASED ON R VALUE (A): 0.101 REMARK 3 ESU BASED ON FREE R VALUE (A): 0.097 REMARK 3 ESU BASED ON MAXIMUM LIKELIHOOD (A): 0.061 REMARK 3 ESU FOR B VALUES BASED ON MAXIMUM LIKELIHOOD (A**2): 1.877 REMARK 3 REMARK 3 CORRELATION COEFFICIENTS. REMARK 3 CORRELATION COEFFICIENT FO-FC : 0.972 REMARK 3 CORRELATION COEFFICIENT FO-FC FREE : 0.958 REMARK 3 REMARK 3 RMS DEVIATIONS FROM IDEAL VALUES COUNT RMS WEIGHT REMARK 3 BOND LENGTHS REFINED ATOMS (A): 6073 ; 0.016 ; 0.021 REMARK 3 BOND LENGTHS OTHERS (A): 5176 ; 0.002 ; 0.020 REMARK 3 BOND ANGLES REFINED ATOMS (DEGREES): 8320 ; 1.578 ; 1.967 REMARK 3 BOND ANGLES OTHERS (DEGREES): 12044 ; 0.911 ; 3.000 REMARK 3 TORSION ANGLES, PERIOD 1 (DEGREES): 720 ; 5.904 ; 5.000 REMARK 3 TORSION ANGLES, PERIOD 2 (DEGREES): NULL ; NULL ; NULL REMARK 3 TORSION ANGLES, PERIOD 3 (DEGREES): NULL ; NULL ; NULL REMARK 3 TORSION ANGLES, PERIOD 4 (DEGREES): NULL ; NULL ; NULL REMARK 3 CHIRAL-CENTER RESTRAINTS (A**3): 918 ; 0.100 ; 0.200 REMARK 3 GENERAL PLANES REFINED ATOMS (A): 6713 ; 0.009 ; 0.020 REMARK 3 GENERAL PLANES OTHERS (A): 1209 ; 0.006 ; 0.020 REMARK 3 NON-BONDED CONTACTS REFINED ATOMS (A): 1149 ; 0.221 ; 0.200 REMARK 3 NON-BONDED CONTACTS OTHERS (A): 6063 ; 0.255 ; 0.200 REMARK 3 NON-BONDED TORSION REFINED ATOMS (A): NULL ; NULL ; NULL REMARK 3 NON-BONDED TORSION OTHERS (A): 3347 ; 0.083 ; 0.200 REMARK 3 H-BOND (X...Y) REFINED ATOMS (A): 500 ; 0.131 ; 0.200 REMARK 3 H-BOND (X...Y) OTHERS (A): NULL ; NULL ; NULL REMARK 3 POTENTIAL METAL-ION REFINED ATOMS (A): NULL ; NULL ; NULL REMARK 3 POTENTIAL METAL-ION OTHERS (A): NULL ; NULL ; NULL REMARK 3 SYMMETRY VDW REFINED ATOMS (A): 22 ; 0.180 ; 0.200 REMARK 3 SYMMETRY VDW OTHERS (A): 79 ; 0.258 ; 0.200 REMARK 3 SYMMETRY H-BOND REFINED ATOMS (A): 36 ; 0.154 ; 0.200 REMARK 3 SYMMETRY H-BOND OTHERS (A): NULL ; NULL ; NULL REMARK 3 SYMMETRY METAL-ION REFINED ATOMS (A): NULL ; NULL ; NULL REMARK 3 SYMMETRY METAL-ION OTHERS (A): NULL ; NULL ; NULL REMARK 3 REMARK 3 ISOTROPIC THERMAL FACTOR RESTRAINTS. COUNT RMS WEIGHT REMARK 3 MAIN-CHAIN BOND REFINED ATOMS (A**2): 3620 ; 0.796 ; 1.500 REMARK 3 MAIN-CHAIN BOND OTHER ATOMS (A**2): NULL ; NULL ; NULL REMARK 3 MAIN-CHAIN ANGLE REFINED ATOMS (A**2): 5823 ; 1.348 ; 2.000 REMARK 3 MAIN-CHAIN ANGLE OTHER ATOMS (A**2): NULL ; NULL ; NULL REMARK 3 SIDE-CHAIN BOND REFINED ATOMS (A**2): 2453 ; 2.331 ; 3.000 REMARK 3 SIDE-CHAIN BOND OTHER ATOMS (A**2): NULL ; NULL ; NULL REMARK 3 SIDE-CHAIN ANGLE REFINED ATOMS (A**2): 2497 ; 3.500 ; 4.500 REMARK 3 SIDE-CHAIN ANGLE OTHER ATOMS (A**2): NULL ; NULL ; NULL REMARK 3 LONG RANGE B REFINED ATOMS (A**2): NULL ; NULL ; NULL REMARK 3 LONG RANGE B OTHER ATOMS (A**2): NULL ; NULL ; NULL REMARK 3 REMARK 3 ANISOTROPIC THERMAL FACTOR RESTRAINTS. COUNT RMS WEIGHT REMARK 3 RIGID-BOND RESTRAINTS (A**2): NULL ; NULL ; NULL REMARK 3 SPHERICITY; FREE ATOMS (A**2): NULL ; NULL ; NULL REMARK 3 SPHERICITY; BONDED ATOMS (A**2): NULL ; NULL ; NULL REMARK 3 REMARK 3 NCS RESTRAINTS STATISTICS REMARK 3 NUMBER OF DIFFERENT NCS GROUPS : NULL REMARK 3 REMARK 3 TLS DETAILS REMARK 3 NUMBER OF TLS GROUPS : 2 REMARK 3 REMARK 3 TLS GROUP : 1 REMARK 3 NUMBER OF COMPONENTS GROUP : 1 REMARK 3 COMPONENTS C SSSEQI TO C SSSEQI REMARK 3 RESIDUE RANGE : A 90 A 450 REMARK 3 ORIGIN FOR THE GROUP (A): 1.1000 0.1990 3.0400 REMARK 3 T TENSOR REMARK 3 T11: 0.0155 T22: 0.0437 REMARK 3 T33: 0.0229 T12: -0.0066 REMARK 3 T13: -0.0174 T23: -0.0004 REMARK 3 L TENSOR REMARK 3 L11: 0.5618 L22: 0.9599 REMARK 3 L33: 1.0552 L12: 0.0595 REMARK 3 L13: 0.0866 L23: 0.0830 REMARK 3 S TENSOR REMARK 3 S11: 0.0204 S12: 0.0248 S13: -0.0568 REMARK 3 S21: -0.0616 S22: 0.0263 S23: -0.0360 REMARK 3 S31: 0.1350 S32: 0.0103 S33: -0.0468 REMARK 3 REMARK 3 TLS GROUP : 2 REMARK 3 NUMBER OF COMPONENTS GROUP : 1 REMARK 3 COMPONENTS C SSSEQI TO C SSSEQI REMARK 3 RESIDUE RANGE : B 91 B 450 REMARK 3 ORIGIN FOR THE GROUP (A): -33.3550 38.6180 -22.2750 REMARK 3 T TENSOR REMARK 3 T11: 0.0041 T22: 0.0518 REMARK 3 T33: 0.0619 T12: 0.0052 REMARK 3 T13: 0.0065 T23: -0.0016 REMARK 3 L TENSOR REMARK 3 L11: 0.8781 L22: 1.4555 REMARK 3 L33: 0.7922 L12: -0.0815 REMARK 3 L13: -0.1881 L23: -0.1061 REMARK 3 S TENSOR REMARK 3 S11: -0.0046 S12: -0.0365 S13: 0.1401 REMARK 3 S21: 0.1491 S22: 0.0745 S23: 0.0529 REMARK 3 S31: -0.0562 S32: -0.0165 S33: -0.0699 REMARK 3 REMARK 3 BULK SOLVENT MODELLING. REMARK 3 METHOD USED : BABINET MODEL WITH MASK REMARK 3 PARAMETERS FOR MASK CALCULATION REMARK 3 VDW PROBE RADIUS : 1.40 REMARK 3 ION PROBE RADIUS : 0.80 REMARK 3 SHRINKAGE RADIUS : 0.80 REMARK 3 REMARK 3 OTHER REFINEMENT REMARKS: HYDROGENS HAVE BEEN ADDED IN THE RIDING REMARK 3 POSITIONS REMARK 4 REMARK 4 1OCB COMPLIES WITH FORMAT V. 3.30, 13-JUL-11 REMARK 100 REMARK 100 THIS ENTRY HAS BEEN PROCESSED BY PDBE ON 07-FEB-03. REMARK 100 THE DEPOSITION ID IS D_1290009844. REMARK 200 REMARK 200 EXPERIMENTAL DETAILS REMARK 200 EXPERIMENT TYPE : X-RAY DIFFRACTION REMARK 200 DATE OF DATA COLLECTION : 15-NOV-00 REMARK 200 TEMPERATURE (KELVIN) : 100.0 REMARK 200 PH : 4.60 REMARK 200 NUMBER OF CRYSTALS USED : 1 REMARK 200 REMARK 200 SYNCHROTRON (Y/N) : Y REMARK 200 RADIATION SOURCE : ESRF REMARK 200 BEAMLINE : ID14-4 REMARK 200 X-RAY GENERATOR MODEL : NULL REMARK 200 MONOCHROMATIC OR LAUE (M/L) : M REMARK 200 WAVELENGTH OR RANGE (A) : 0.932 REMARK 200 MONOCHROMATOR : NULL REMARK 200 OPTICS : NULL REMARK 200 REMARK 200 DETECTOR TYPE : CCD REMARK 200 DETECTOR MANUFACTURER : ADSC CCD REMARK 200 INTENSITY-INTEGRATION SOFTWARE : DENZO REMARK 200 DATA SCALING SOFTWARE : SCALEPACK REMARK 200 REMARK 200 NUMBER OF UNIQUE REFLECTIONS : 69418 REMARK 200 RESOLUTION RANGE HIGH (A) : 1.750 REMARK 200 RESOLUTION RANGE LOW (A) : 40.000 REMARK 200 REJECTION CRITERIA (SIGMA(I)) : 2.000 REMARK 200 REMARK 200 OVERALL. REMARK 200 COMPLETENESS FOR RANGE (%) : 97.5 REMARK 200 DATA REDUNDANCY : 2.200 REMARK 200 R MERGE (I) : 0.04100 REMARK 200 R SYM (I) : NULL REMARK 200 FOR THE DATA SET : 17.3000 REMARK 200 REMARK 200 IN THE HIGHEST RESOLUTION SHELL. REMARK 200 HIGHEST RESOLUTION SHELL, RANGE HIGH (A) : 1.75 REMARK 200 HIGHEST RESOLUTION SHELL, RANGE LOW (A) : 1.81 REMARK 200 COMPLETENESS FOR SHELL (%) : 91.3 REMARK 200 DATA REDUNDANCY IN SHELL : 2.00 REMARK 200 R MERGE FOR SHELL (I) : 0.12800 REMARK 200 R SYM FOR SHELL (I) : NULL REMARK 200 FOR SHELL : 6.000 REMARK 200 REMARK 200 DIFFRACTION PROTOCOL: SINGLE WAVELENGTH REMARK 200 METHOD USED TO DETERMINE THE STRUCTURE: MOLECULAR REPLACEMENT REMARK 200 SOFTWARE USED: AMORE REMARK 200 STARTING MODEL: PDB ENTRY 2BVW REMARK 200 REMARK 200 REMARK: NULL REMARK 280 REMARK 280 CRYSTAL REMARK 280 SOLVENT CONTENT, VS (%): 41.70 REMARK 280 MATTHEWS COEFFICIENT, VM (ANGSTROMS**3/DA): 2.10 REMARK 280 REMARK 280 CRYSTALLIZATION CONDITIONS: THE PROTEIN WAS CONCENTRATED IN WATER REMARK 280 TO 7 MG/ML IT WAS INCUBATED 1H PRIOR CRYSTALLISATION WITH THE REMARK 280 5MM OF THE SUBSTRATE 16% PEG5KMME AND 200 MM CALCIUM ACETATE REMARK 280 WERE USED AS PRECIPITANT AND 100 MM SODIUM ACETATE PH 4.6 AS REMARK 280 BUFFER., PH 4.60 REMARK 290 REMARK 290 CRYSTALLOGRAPHIC SYMMETRY REMARK 290 SYMMETRY OPERATORS FOR SPACE GROUP: P 1 21 1 REMARK 290 REMARK 290 SYMOP SYMMETRY REMARK 290 NNNMMM OPERATOR REMARK 290 1555 X,Y,Z REMARK 290 2555 -X,Y+1/2,-Z REMARK 290 REMARK 290 WHERE NNN -> OPERATOR NUMBER REMARK 290 MMM -> TRANSLATION VECTOR REMARK 290 REMARK 290 CRYSTALLOGRAPHIC SYMMETRY TRANSFORMATIONS REMARK 290 THE FOLLOWING TRANSFORMATIONS OPERATE ON THE ATOM/HETATM REMARK 290 RECORDS IN THIS ENTRY TO PRODUCE CRYSTALLOGRAPHICALLY REMARK 290 RELATED MOLECULES. REMARK 290 SMTRY1 1 1.000000 0.000000 0.000000 0.00000 REMARK 290 SMTRY2 1 0.000000 1.000000 0.000000 0.00000 REMARK 290 SMTRY3 1 0.000000 0.000000 1.000000 0.00000 REMARK 290 SMTRY1 2 -1.000000 0.000000 0.000000 0.00000 REMARK 290 SMTRY2 2 0.000000 1.000000 0.000000 77.84050 REMARK 290 SMTRY3 2 0.000000 0.000000 -1.000000 0.00000 REMARK 290 REMARK 290 REMARK: NULL REMARK 300 REMARK 300 BIOMOLECULE: 1, 2 REMARK 300 SEE REMARK 350 FOR THE AUTHOR PROVIDED AND/OR PROGRAM REMARK 300 GENERATED ASSEMBLY INFORMATION FOR THE STRUCTURE IN REMARK 300 THIS ENTRY. THE REMARK MAY ALSO PROVIDE INFORMATION ON REMARK 300 BURIED SURFACE AREA. REMARK 350 REMARK 350 COORDINATES FOR A COMPLETE MULTIMER REPRESENTING THE KNOWN REMARK 350 BIOLOGICALLY SIGNIFICANT OLIGOMERIZATION STATE OF THE REMARK 350 MOLECULE CAN BE GENERATED BY APPLYING BIOMT TRANSFORMATIONS REMARK 350 GIVEN BELOW. BOTH NON-CRYSTALLOGRAPHIC AND REMARK 350 CRYSTALLOGRAPHIC OPERATIONS ARE GIVEN. REMARK 350 REMARK 350 BIOMOLECULE: 1 REMARK 350 AUTHOR DETERMINED BIOLOGICAL UNIT: MONOMERIC REMARK 350 SOFTWARE DETERMINED QUATERNARY STRUCTURE: MONOMERIC REMARK 350 SOFTWARE USED: PQS REMARK 350 APPLY THE FOLLOWING TO CHAINS: A, C, D REMARK 350 BIOMT1 1 1.000000 0.000000 0.000000 0.00000 REMARK 350 BIOMT2 1 0.000000 1.000000 0.000000 0.00000 REMARK 350 BIOMT3 1 0.000000 0.000000 1.000000 0.00000 REMARK 350 REMARK 350 BIOMOLECULE: 2 REMARK 350 AUTHOR DETERMINED BIOLOGICAL UNIT: MONOMERIC REMARK 350 SOFTWARE DETERMINED QUATERNARY STRUCTURE: MONOMERIC REMARK 350 SOFTWARE USED: PQS REMARK 350 APPLY THE FOLLOWING TO CHAINS: B, E, F REMARK 350 BIOMT1 1 1.000000 0.000000 0.000000 0.00000 REMARK 350 BIOMT2 1 0.000000 1.000000 0.000000 0.00000 REMARK 350 BIOMT3 1 0.000000 0.000000 1.000000 0.00000 REMARK 465 REMARK 465 MISSING RESIDUES REMARK 465 THE FOLLOWING RESIDUES WERE NOT LOCATED IN THE REMARK 465 EXPERIMENT. (M=MODEL NUMBER; RES=RESIDUE NAME; C=CHAIN REMARK 465 IDENTIFIER; SSSEQ=SEQUENCE NUMBER; I=INSERTION CODE.) REMARK 465 REMARK 465 M RES C SSSEQI REMARK 465 TYR A 89 REMARK 465 TYR B 89 REMARK 500 REMARK 500 GEOMETRY AND STEREOCHEMISTRY REMARK 500 SUBTOPIC: COVALENT BOND ANGLES REMARK 500 REMARK 500 THE STEREOCHEMICAL PARAMETERS OF THE FOLLOWING RESIDUES REMARK 500 HAVE VALUES WHICH DEVIATE FROM EXPECTED VALUES BY MORE REMARK 500 THAN 6*RMSD (M=MODEL NUMBER; RES=RESIDUE NAME; C=CHAIN REMARK 500 IDENTIFIER; SSEQ=SEQUENCE NUMBER; I=INSERTION CODE). REMARK 500 REMARK 500 STANDARD TABLE: REMARK 500 FORMAT: (10X,I3,1X,A3,1X,A1,I4,A1,3(1X,A4,2X),12X,F5.1) REMARK 500 REMARK 500 EXPECTED VALUES PROTEIN: ENGH AND HUBER, 1999 REMARK 500 EXPECTED VALUES NUCLEIC ACID: CLOWNEY ET AL 1996 REMARK 500 REMARK 500 M RES CSSEQI ATM1 ATM2 ATM3 REMARK 500 ASP A 145 CB - CG - OD2 ANGL. DEV. = 5.9 DEGREES REMARK 500 ASP B 268 CB - CG - OD2 ANGL. DEV. = 6.1 DEGREES REMARK 500 ARG B 382 NE - CZ - NH1 ANGL. DEV. = 3.1 DEGREES REMARK 500 ARG B 382 NE - CZ - NH2 ANGL. DEV. = -4.0 DEGREES REMARK 500 REMARK 500 REMARK: NULL REMARK 500 REMARK 500 GEOMETRY AND STEREOCHEMISTRY REMARK 500 SUBTOPIC: TORSION ANGLES REMARK 500 REMARK 500 TORSION ANGLES OUTSIDE THE EXPECTED RAMACHANDRAN REGIONS: REMARK 500 (M=MODEL NUMBER; RES=RESIDUE NAME; C=CHAIN IDENTIFIER; REMARK 500 SSEQ=SEQUENCE NUMBER; I=INSERTION CODE). REMARK 500 REMARK 500 STANDARD TABLE: REMARK 500 FORMAT:(10X,I3,1X,A3,1X,A1,I4,A1,4X,F7.2,3X,F7.2) REMARK 500 REMARK 500 EXPECTED VALUES: GJ KLEYWEGT AND TA JONES (1996). PHI/PSI- REMARK 500 CHOLOGY: RAMACHANDRAN REVISITED. STRUCTURE 4, 1395 - 1400 REMARK 500 REMARK 500 M RES CSSEQI PSI PHI REMARK 500 THR A 146 -79.28 -112.57 REMARK 500 TYR A 174 69.16 -151.97 REMARK 500 ASP A 175 34.88 -151.79 REMARK 500 GLU A 224 73.41 47.64 REMARK 500 ASP A 226 -22.87 83.03 REMARK 500 TRP A 274 -74.40 -119.33 REMARK 500 ASN A 310 -168.75 -123.68 REMARK 500 THR B 146 -80.94 -112.85 REMARK 500 TYR B 174 72.42 -150.81 REMARK 500 ASP B 175 35.47 -154.77 REMARK 500 PHE B 214 41.06 -105.33 REMARK 500 GLU B 224 72.23 49.28 REMARK 500 ASP B 226 -19.58 80.93 REMARK 500 TRP B 274 -71.39 -114.28 REMARK 500 ASN B 310 -166.02 -124.16 REMARK 500 REMARK 500 REMARK: NULL REMARK 525 REMARK 525 SOLVENT REMARK 525 REMARK 525 THE SOLVENT MOLECULES HAVE CHAIN IDENTIFIERS THAT REMARK 525 INDICATE THE POLYMER CHAIN WITH WHICH THEY ARE MOST REMARK 525 CLOSELY ASSOCIATED. THE REMARK LISTS ALL THE SOLVENT REMARK 525 MOLECULES WHICH ARE MORE THAN 5A AWAY FROM THE REMARK 525 NEAREST POLYMER CHAIN (M = MODEL NUMBER; REMARK 525 RES=RESIDUE NAME; C=CHAIN IDENTIFIER; SSEQ=SEQUENCE REMARK 525 NUMBER; I=INSERTION CODE): REMARK 525 REMARK 525 M RES CSSEQI REMARK 525 HOH A2103 DISTANCE = 6.16 ANGSTROMS REMARK 610 REMARK 610 MISSING HETEROATOM REMARK 610 THE FOLLOWING RESIDUES HAVE MISSING ATOMS (M=MODEL NUMBER; REMARK 610 RES=RESIDUE NAME; C=CHAIN IDENTIFIER; SSEQ=SEQUENCE NUMBER; REMARK 610 I=INSERTION CODE): REMARK 610 M RES C SSEQI REMARK 610 FLG B 460 REMARK 700 REMARK 700 SHEET REMARK 700 THE SHEET STRUCTURE OF THIS MOLECULE IS BIFURCATED. IN REMARK 700 ORDER TO REPRESENT THIS FEATURE IN THE SHEET RECORDS BELOW, REMARK 700 TWO SHEETS ARE DEFINED. REMARK 900 REMARK 900 RELATED ENTRIES REMARK 900 RELATED ID: 1BVW RELATED DB: PDB REMARK 900 CELLOBIOHYDROLASE II (CEL6A) FROM HUMICOLA INSOLENS REMARK 900 RELATED ID: 1GZ1 RELATED DB: PDB REMARK 900 MUTANT D416A OF THE CELLOBIOHYDROLASE CEL6A FROM HUMICOLA INSOLENS REMARK 900 IN COMPLEX WITH METHYL-CELLOBIOSYL-4-DEOXY-4-THIO-BETA-D- REMARK 900 CELLOBIOSIDE REMARK 900 RELATED ID: 1HGW RELATED DB: PDB REMARK 900 CEL6A D175A MUTANT REMARK 900 RELATED ID: 1HGY RELATED DB: PDB REMARK 900 CEL6A D221A MUTANT REMARK 900 RELATED ID: 1OC5 RELATED DB: PDB REMARK 900 D405N MUTANT OF THE CELLOBIOHYDROLASE CEL6A FROM HUMICOLA INSOLENS REMARK 900 IN COMPLEX WITH METHYL-CELLOBIOSYL-4-DEOXY-4-THIO-BETA-D- REMARK 900 CELLOBIOSIDE REMARK 900 RELATED ID: 1OC6 RELATED DB: PDB REMARK 900 STRUCTURE NATIVE OF THE D405N MUTANT OF THE CELLOBIOHYDROLASE CEL6A REMARK 900 FROM HUMICOLA INSOLENS AT 1.5 ANGSTROM RESOLUTION REMARK 900 RELATED ID: 1OC7 RELATED DB: PDB REMARK 900 D405N MUTANT OF THE CELLOBIOHYDROLASE CEL6A FROM HUMICOLA INSOLENS REMARK 900 IN COMPLEX WITH METHYL-TETRATHIO-ALPHA-D-CELLOPENTOSIDE AT 1 .1 REMARK 900 ANGSTROM RESOLUTION REMARK 900 RELATED ID: 1OCJ RELATED DB: PDB REMARK 900 MUTANT D416A OF THE CELLOBIOHYDROLASE CEL6A FROM HUMICOLA INSOLENS REMARK 900 IN COMPLEX WITH A THIOPENTASACCHARIDE AT 1.3 ANGSTROM RESOLUTION REMARK 900 RELATED ID: 1OCN RELATED DB: PDB REMARK 900 MUTANT D416A OF THE CELLOBIOHYDROLASE CEL6A FROM HUMICOLA INSOLENS REMARK 900 IN COMPLEX WITH A CELLOBIO-DERIVED ISOFAGOMINE AT 1.3 ANGSTROM REMARK 900 RESOLUTION REMARK 900 RELATED ID: 1QJW RELATED DB: PDB REMARK 900 CEL6A (Y169F) WITH A NON-HYDROLYSABLE CELLOTETRAOSE REMARK 900 RELATED ID: 1QK0 RELATED DB: PDB REMARK 900 CEL6A (Y169F) WITH A NON-HYDROLYSABLE CELLOTETRAOSE REMARK 900 RELATED ID: 1QK2 RELATED DB: PDB REMARK 900 WILD TYPE CEL6A WITH A NON-HYDROLYSABLE CELLOTETRAOSE REMARK 900 RELATED ID: 2BVW RELATED DB: PDB REMARK 900 CELLOBIOHYDROLASE II (CEL6A) FROM HUMICOLA INSOLENS IN COMPLEX WITH REMARK 900 GLUCOSE AND CELLOTETRAOSE REMARK 999 REMARK 999 SEQUENCE REMARK 999 THERE IS A PRO-SEQUENCE OF 23 AMINO-ACIDS WHICH ARE REMARK 999 POST-TRANSLATIONALLY CLEAVED TO YIELD A MATURE PROTEIN OF 450 REMARK 999 RESIDUES. THE NUMBERING OF THE PROTEIN STRUCTURE ASSUMES THAT REMARK 999 RESIDUE 1 IS THE FIRST RESIDUE OF THE MATURE PROTEIN AND HENCE REMARK 999 DOES NOT INCLUDE THE LEADER SEQUENCE IN THE NUMBERING.ALSO, REMARK 999 THIS STRUCTURE IS OF THE CATALYTIC CORE DOMAIN ONLY, WHICH REMARK 999 COMMENCES AT RESIDUE TYR 89. IN THE CHAIN A THE FIRST REMARK 999 RESIDUE OF THE CORE IS DISORDERED AND NOT VISIBLE IN THE REMARK 999 ELECTRON DENSITY,HENCE THIS ENTRY BEGINS AT HENCE THIS REMARK 999 ENTRY BEGINS AT RESIDUE ASN 90. REMARK 999 REMARK 999 THIS PROTEIN IS CLOSELY RELATED TO AVICELASE 2 (SWISS-PROT REMARK 999 ACCESSION ID:Q9C1S9) WITH WHICH IT HAS 96% SEQUENCE IDENTITY. DBREF 1OCB A 89 450 PDB 1OCB 1OCB 89 450 DBREF 1OCB B 89 450 PDB 1OCB 1OCB 89 450 SEQRES 1 A 362 TYR ASN GLY ASN PRO PHE GLU GLY VAL GLN LEU TRP ALA SEQRES 2 A 362 ASN ASN TYR TYR ARG SER GLU VAL HIS THR LEU ALA ILE SEQRES 3 A 362 PRO GLN ILE THR ASP PRO ALA LEU ARG ALA ALA ALA SER SEQRES 4 A 362 ALA VAL ALA GLU VAL PRO SER PHE GLN TRP LEU ASP ARG SEQRES 5 A 362 ASN VAL THR VAL ASP THR LEU LEU VAL GLN THR LEU SER SEQRES 6 A 362 GLU ILE ARG GLU ALA ASN GLN ALA GLY ALA ASN PRO GLN SEQRES 7 A 362 TYR ALA ALA GLN ILE VAL VAL TYR ASP LEU PRO ASP ARG SEQRES 8 A 362 ASP CYS ALA ALA ALA ALA SER ASN GLY GLU TRP ALA ILE SEQRES 9 A 362 ALA ASN ASN GLY VAL ASN ASN TYR LYS ALA TYR ILE ASN SEQRES 10 A 362 ARG ILE ARG GLU ILE LEU ILE SER PHE SER ASP VAL ARG SEQRES 11 A 362 THR ILE LEU VAL ILE GLU PRO ASP SER LEU ALA ASN MET SEQRES 12 A 362 VAL THR ASN MET ASN VAL PRO LYS CYS SER GLY ALA ALA SEQRES 13 A 362 SER THR TYR ARG GLU LEU THR ILE TYR ALA LEU LYS GLN SEQRES 14 A 362 LEU ASP LEU PRO HIS VAL ALA MET TYR MET ASP ALA GLY SEQRES 15 A 362 HIS ALA GLY TRP LEU GLY TRP PRO ALA ASN ILE GLN PRO SEQRES 16 A 362 ALA ALA GLU LEU PHE ALA LYS ILE TYR GLU ASP ALA GLY SEQRES 17 A 362 LYS PRO ARG ALA VAL ARG GLY LEU ALA THR ASN VAL ALA SEQRES 18 A 362 ASN TYR ASN ALA TRP SER VAL SER SER PRO PRO PRO TYR SEQRES 19 A 362 THR SER PRO ASN PRO ASN TYR ASP GLU LYS HIS TYR ILE SEQRES 20 A 362 GLU ALA PHE ARG PRO LEU LEU GLU ALA ARG GLY PHE PRO SEQRES 21 A 362 ALA GLN PHE ILE VAL ASP GLN GLY ARG SER GLY LYS GLN SEQRES 22 A 362 PRO THR GLY GLN LYS GLU TRP GLY HIS TRP CYS ASN ALA SEQRES 23 A 362 ILE GLY THR GLY PHE GLY MET ARG PRO THR ALA ASN THR SEQRES 24 A 362 GLY HIS GLN TYR VAL ASP ALA PHE VAL TRP VAL LYS PRO SEQRES 25 A 362 GLY GLY GLU CYS ASP GLY THR SER ASP THR THR ALA ALA SEQRES 26 A 362 ARG TYR ASP TYR HIS CYS GLY LEU GLU ASP ALA LEU LYS SEQRES 27 A 362 PRO ALA PRO GLU ALA GLY GLN TRP PHE ASN GLU TYR PHE SEQRES 28 A 362 ILE GLN LEU LEU ARG ASN ALA ASN PRO PRO PHE SEQRES 1 B 362 TYR ASN GLY ASN PRO PHE GLU GLY VAL GLN LEU TRP ALA SEQRES 2 B 362 ASN ASN TYR TYR ARG SER GLU VAL HIS THR LEU ALA ILE SEQRES 3 B 362 PRO GLN ILE THR ASP PRO ALA LEU ARG ALA ALA ALA SER SEQRES 4 B 362 ALA VAL ALA GLU VAL PRO SER PHE GLN TRP LEU ASP ARG SEQRES 5 B 362 ASN VAL THR VAL ASP THR LEU LEU VAL GLN THR LEU SER SEQRES 6 B 362 GLU ILE ARG GLU ALA ASN GLN ALA GLY ALA ASN PRO GLN SEQRES 7 B 362 TYR ALA ALA GLN ILE VAL VAL TYR ASP LEU PRO ASP ARG SEQRES 8 B 362 ASP CYS ALA ALA ALA ALA SER ASN GLY GLU TRP ALA ILE SEQRES 9 B 362 ALA ASN ASN GLY VAL ASN ASN TYR LYS ALA TYR ILE ASN SEQRES 10 B 362 ARG ILE ARG GLU ILE LEU ILE SER PHE SER ASP VAL ARG SEQRES 11 B 362 THR ILE LEU VAL ILE GLU PRO ASP SER LEU ALA ASN MET SEQRES 12 B 362 VAL THR ASN MET ASN VAL PRO LYS CYS SER GLY ALA ALA SEQRES 13 B 362 SER THR TYR ARG GLU LEU THR ILE TYR ALA LEU LYS GLN SEQRES 14 B 362 LEU ASP LEU PRO HIS VAL ALA MET TYR MET ASP ALA GLY SEQRES 15 B 362 HIS ALA GLY TRP LEU GLY TRP PRO ALA ASN ILE GLN PRO SEQRES 16 B 362 ALA ALA GLU LEU PHE ALA LYS ILE TYR GLU ASP ALA GLY SEQRES 17 B 362 LYS PRO ARG ALA VAL ARG GLY LEU ALA THR ASN VAL ALA SEQRES 18 B 362 ASN TYR ASN ALA TRP SER VAL SER SER PRO PRO PRO TYR SEQRES 19 B 362 THR SER PRO ASN PRO ASN TYR ASP GLU LYS HIS TYR ILE SEQRES 20 B 362 GLU ALA PHE ARG PRO LEU LEU GLU ALA ARG GLY PHE PRO SEQRES 21 B 362 ALA GLN PHE ILE VAL ASP GLN GLY ARG SER GLY LYS GLN SEQRES 22 B 362 PRO THR GLY GLN LYS GLU TRP GLY HIS TRP CYS ASN ALA SEQRES 23 B 362 ILE GLY THR GLY PHE GLY MET ARG PRO THR ALA ASN THR SEQRES 24 B 362 GLY HIS GLN TYR VAL ASP ALA PHE VAL TRP VAL LYS PRO SEQRES 25 B 362 GLY GLY GLU CYS ASP GLY THR SER ASP THR THR ALA ALA SEQRES 26 B 362 ARG TYR ASP TYR HIS CYS GLY LEU GLU ASP ALA LEU LYS SEQRES 27 B 362 PRO ALA PRO GLU ALA GLY GLN TRP PHE ASN GLU TYR PHE SEQRES 28 B 362 ILE GLN LEU LEU ARG ASN ALA ASN PRO PRO PHE MODRES 1OCB ASN A 141 ASN GLYCOSYLATION SITE MODRES 1OCB ASN B 141 ASN GLYCOSYLATION SITE HET GTM C 1 13 HET BGC C 2 11 HET BGC C 3 11 HET GDA C 4 11 HET GTM D 1 13 HET BGC D 2 11 HET BGC D 3 11 HET GTM E 1 13 HET BGC E 2 11 HET BGC E 3 11 HET GDA E 4 11 HET GTM F 1 13 HET BGC F 2 11 HET BGC F 3 11 HET NAG A 451 14 HET GOL A 459 6 HET NAG B 451 14 HET GOL B 459 6 HET FLG B 460 28 HETNAM GTM METHYL 4-THIO-BETA-D-GLUCOPYRANOSIDE HETNAM BGC BETA-D-GLUCOPYRANOSE HETNAM GDA 4-AMINO-4-DEOXY-BETA-D-GLUCOPYRANOSE HETNAM NAG 2-ACETAMIDO-2-DEOXY-BETA-D-GLUCOPYRANOSE HETNAM GOL GLYCEROL HETNAM FLG FLUORESCEINYLTHIOUREIDO HETSYN GTM O1-METHYL-4-DEOXY-4-THIO-BETA-D-GLUCOSE; METHYL 4-THIO- HETSYN 2 GTM BETA-D-GLUCOSIDE; METHYL 4-THIO-D-GLUCOSIDE; METHYL 4- HETSYN 3 GTM THIO-GLUCOSIDE HETSYN BGC BETA-D-GLUCOSE; D-GLUCOSE; GLUCOSE HETSYN GDA 4-AMINO-4-DEOXY-BETA-D-GLUCOSE; 4-AMINO-4-DEOXY-D- HETSYN 2 GDA GLUCOSE; 4-AMINO-4-DEOXY-GLUCOSE HETSYN NAG N-ACETYL-BETA-D-GLUCOSAMINE; 2-ACETAMIDO-2-DEOXY-BETA- HETSYN 2 NAG D-GLUCOSE; 2-ACETAMIDO-2-DEOXY-D-GLUCOSE; 2-ACETAMIDO- HETSYN 3 NAG 2-DEOXY-GLUCOSE; N-ACETYL-D-GLUCOSAMINE HETSYN GOL GLYCERIN; PROPANE-1,2,3-TRIOL FORMUL 3 GTM 4(C7 H14 O5 S) FORMUL 3 BGC 8(C6 H12 O6) FORMUL 3 GDA 2(C6 H13 N O5) FORMUL 7 NAG 2(C8 H15 N O6) FORMUL 8 GOL 2(C3 H8 O3) FORMUL 11 FLG C23 H18 N2 O5 S FORMUL 12 HOH *715(H2 O) HELIX 1 1 ASN A 102 LEU A 112 1 11 HELIX 2 2 ALA A 113 ILE A 117 5 5 HELIX 3 3 ASP A 119 ALA A 130 1 12 HELIX 4 4 ARG A 140 VAL A 144 5 5 HELIX 5 5 THR A 146 GLY A 162 1 17 HELIX 6 6 ALA A 191 ASN A 194 5 4 HELIX 7 7 ASN A 195 PHE A 214 1 20 HELIX 8 8 ASP A 226 ASN A 234 1 9 HELIX 9 9 VAL A 237 LEU A 258 1 22 HELIX 10 10 TRP A 277 ALA A 295 1 19 HELIX 11 11 PRO A 320 SER A 324 5 5 HELIX 12 12 ASP A 330 ARG A 345 1 16 HELIX 13 13 ASP A 416 LEU A 421 5 6 HELIX 14 14 PHE A 435 ASN A 445 1 11 HELIX 15 15 ASN B 102 LEU B 112 1 11 HELIX 16 16 ALA B 113 ILE B 117 5 5 HELIX 17 17 ASP B 119 ALA B 130 1 12 HELIX 18 18 ARG B 140 VAL B 144 5 5 HELIX 19 19 THR B 146 ALA B 161 1 16 HELIX 20 20 ALA B 191 ASN B 194 5 4 HELIX 21 21 ASN B 195 PHE B 214 1 20 HELIX 22 22 ASP B 226 ASN B 234 1 9 HELIX 23 23 VAL B 237 LEU B 258 1 22 HELIX 24 24 TRP B 277 ALA B 295 1 19 HELIX 25 25 PRO B 320 SER B 324 5 5 HELIX 26 26 ASP B 330 ARG B 345 1 16 HELIX 27 27 ASP B 416 LEU B 421 5 6 HELIX 28 28 PHE B 435 ASN B 445 1 11 SHEET 1 AA 3 GLN A 98 LEU A 99 0 SHEET 2 AA 3 TYR A 167 VAL A 173 1 N ALA A 168 O GLN A 98 SHEET 3 AA 3 GLN A 136 LEU A 138 1 O GLN A 136 N VAL A 172 SHEET 1 AB 7 GLN A 98 LEU A 99 0 SHEET 2 AB 7 TYR A 167 VAL A 173 1 N ALA A 168 O GLN A 98 SHEET 3 AB 7 THR A 219 ILE A 223 1 O ILE A 220 N ILE A 171 SHEET 4 AB 7 VAL A 263 ASP A 268 1 O ALA A 264 N LEU A 221 SHEET 5 AB 7 VAL A 301 THR A 306 1 N ARG A 302 O MET A 265 SHEET 6 AB 7 GLN A 350 ASP A 354 1 O GLN A 350 N LEU A 304 SHEET 7 AB 7 VAL A 392 VAL A 396 1 N ASP A 393 O PHE A 351 SHEET 1 BA 3 GLN B 98 LEU B 99 0 SHEET 2 BA 3 TYR B 167 VAL B 173 1 N ALA B 168 O GLN B 98 SHEET 3 BA 3 GLN B 136 LEU B 138 1 O GLN B 136 N VAL B 172 SHEET 1 BB 7 GLN B 98 LEU B 99 0 SHEET 2 BB 7 TYR B 167 VAL B 173 1 N ALA B 168 O GLN B 98 SHEET 3 BB 7 THR B 219 ILE B 223 1 O ILE B 220 N ILE B 171 SHEET 4 BB 7 VAL B 263 ASP B 268 1 O ALA B 264 N LEU B 221 SHEET 5 BB 7 VAL B 301 THR B 306 1 N ARG B 302 O MET B 265 SHEET 6 BB 7 GLN B 350 ASP B 354 1 O GLN B 350 N LEU B 304 SHEET 7 BB 7 VAL B 392 VAL B 396 1 N ASP B 393 O PHE B 351 SSBOND 1 CYS A 181 CYS A 240 1555 1555 2.10 SSBOND 2 CYS A 372 CYS A 419 1555 1555 2.05 SSBOND 3 CYS B 181 CYS B 240 1555 1555 2.10 SSBOND 4 CYS B 372 CYS B 419 1555 1555 2.05 LINK ND2 ASN A 141 C1 NAG A 451 1555 1555 1.44 LINK NE2AGLN A 150 C6 FLG B 460 1454 1555 1.78 LINK NE2AGLN A 150 O3 FLG B 460 1454 1555 1.70 LINK NE2AGLN A 150 C7 FLG B 460 1454 1555 1.88 LINK ND2 ASN B 141 C1 NAG B 451 1555 1555 1.44 LINK S4 GTM C 1 C1 BGC C 2 1555 1555 1.83 LINK O4 BGC C 2 C1 BGC C 3 1555 1555 1.42 LINK O4 BGC C 3 C1 GDA C 4 1555 1555 1.41 LINK S4 GTM D 1 C1 BGC D 2 1555 1555 1.79 LINK O4 BGC D 2 C1 BGC D 3 1555 1555 1.43 LINK S4 GTM E 1 C1 BGC E 2 1555 1555 1.84 LINK O4 BGC E 2 C1 BGC E 3 1555 1555 1.41 LINK O4 BGC E 3 C1 GDA E 4 1555 1555 1.42 LINK S4 GTM F 1 C1 BGC F 2 1555 1555 1.79 LINK O4 BGC F 2 C1 BGC F 3 1555 1555 1.43 CISPEP 1 ASN A 164 PRO A 165 0 -2.87 CISPEP 2 SER A 324 PRO A 325 0 -1.64 CISPEP 3 GLN A 361 PRO A 362 0 -9.45 CISPEP 4 LYS A 426 PRO A 427 0 -5.38 CISPEP 5 ASN A 447 PRO A 448 0 -0.21 CISPEP 6 ASN B 164 PRO B 165 0 -3.47 CISPEP 7 SER B 324 PRO B 325 0 -0.57 CISPEP 8 GLN B 361 PRO B 362 0 -5.46 CISPEP 9 LYS B 426 PRO B 427 0 -8.46 CISPEP 10 ASN B 447 PRO B 448 0 5.18 CRYST1 49.763 155.681 51.218 90.00 118.42 90.00 P 1 21 1 4 ORIGX1 1.000000 0.000000 0.000000 0.00000 ORIGX2 0.000000 1.000000 0.000000 0.00000 ORIGX3 0.000000 0.000000 1.000000 0.00000 SCALE1 0.020095 0.000000 0.010874 0.00000 SCALE2 0.000000 0.006423 0.000000 0.00000 SCALE3 0.000000 0.000000 0.022200 0.00000