1.000000
0.000000
0.000000
0.000000
1.000000
0.000000
0.000000
0.000000
1.000000
0.00000
0.00000
0.00000
Coltart, D.M.
Williams, L.J.
Glunz, P.W.
Sames, D.
Kuduk, S.D.
Schwarz, J.B.
Chen, X.-T.
Royyuru, A.K.
Danishefsky, S.D.
Live, D.H.
http://mmcif.pdb.org/dictionaries/ascii/mmcif_pdbx.dic
C8 H15 N O6
221.208
2-acetamido-2-deoxy-alpha-D-galactopyranose
D-saccharide, alpha linking
C2 H4 O
44.053
ACETYL GROUP
non-polymer
C3 H7 N O2
89.093
y
ALANINE
L-peptide linking
C3 H7 N O3
105.093
y
SERINE
L-peptide linking
C4 H9 N O3
119.119
y
THREONINE
L-peptide linking
C5 H11 N O2
117.146
y
VALINE
L-peptide linking
US
J.Am.Chem.Soc.
JACSAT
0004
0002-7863
124
9833
9844
10.1021/ja020208f
12175243
Principles of Mucin Architecture: Structural Studies on Synthetic Glycopeptides Bearing Clustered Mono-, Di-, Tri-, and Hexasaccharide Glycodomains
2002
US
Proc.Natl.Acad.Sci.USA
PNASA6
0040
0027-8424
96
3489
3493
10.1073/pnas.96.7.3489
Probing Cell-Surface Architecture Through Synthesis: An NMR Determined Structural Motif for Tumor-Associated Mucins
1999
1.000000
0.000000
0.000000
0.000000
1.000000
0.000000
0.000000
0.000000
1.000000
0.00000
0.00000
0.00000
Ser A 1, Thr A 2, Thr A 3 modified with glycosylation by STF antigen
503.547
Leukosialin (CD43) fragment
N-terminal glycopentapeptide
1
syn
polymer
221.208
2-acetamido-2-deoxy-alpha-D-galactopyranose
3
syn
non-polymer
pentapeptide fragment of sialophorin
no
yes
(ACE)STTAV
XSTTAV
A
polypeptide(L)
n
n
n
n
n
n
atom_site
chem_comp
entity
pdbx_chem_comp_identifier
pdbx_entity_nonpoly
struct_conn
struct_site
struct_site_gen
repository
Initial release
Carbohydrate remediation
repository
Remediation
Version format compliance
Version format compliance
Atomic model
Data collection
Derived calculations
Structure summary
1
0
2002-02-20
1
1
2008-04-28
1
2
2011-07-13
2
0
2020-07-29
_atom_site.auth_atom_id
_atom_site.label_atom_id
_chem_comp.name
_chem_comp.type
_entity.pdbx_description
_pdbx_entity_nonpoly.name
_struct_conn.pdbx_dist_value
_struct_conn.pdbx_leaving_atom_flag
_struct_conn.pdbx_role
_struct_conn.ptnr1_auth_comp_id
_struct_conn.ptnr1_auth_seq_id
_struct_conn.ptnr1_label_atom_id
_struct_conn.ptnr1_label_comp_id
_struct_conn.ptnr1_label_seq_id
_struct_conn.ptnr2_auth_comp_id
_struct_conn.ptnr2_auth_seq_id
_struct_conn.ptnr2_label_asym_id
_struct_conn.ptnr2_label_atom_id
_struct_conn.ptnr2_label_comp_id
_struct_conn.ptnr2_label_seq_id
DGalpNAca
N-acetyl-a-D-galactopyranosamine
a-D-GalpNAc
GalNAc
2002-02-20
SPRSDE
RCSB
Y
RCSB
2002-02-04
REL
REL
A2G
2-acetamido-2-deoxy-alpha-D-galactopyranose
Fragment of human sequence
sample
Coordinates are given only for the experimentally defined glycopeptide
core residues. These are comprised of the amino acid units in the peptide
and the N-acetyl-galactosamine. Coordinates for the terminal galactose and
sialic acid of each of the glycans are not included.
No NOE violations greter than 0.15A, no coupling violation greater than 0.5 Hz,
no angle violation greater than 5 deg.
200
59
2D NOESY
3D_TOCSY_NOESY
4.5
1
atm
291
K
refinement based on 107 NOE's 11 J coupling restraints and
3 dihedral restraints. See citation for full description.
VNMR, NMRPIPE, X-PLOR, Torsion Space Simulated Annealing
1
closest to the average
10 mM glycopeptide
90% H2O/10% D2O
Varian Inc.
collection
VNMR
6.1
Varian Inc.
data analysis
VNMR
6.1
Delaglio
data analysis
NMRPipe
2.1
Brunger/MSI
refinement
X-PLOR
98
600
Varian
INOVA
800
Varian
INOVA
A2G
7
2
A2G
A2G
7
A
A2G
10
2
A2G
A2G
10
A
A2G
13
2
A2G
A2G
13
A
ACE
0
n
1
ACE
0
A
SER
1
n
2
SER
1
A
THR
2
n
3
THR
2
A
THR
3
n
4
THR
3
A
ALA
4
n
5
ALA
4
A
VAL
5
n
6
VAL
5
A
author_defined_assembly
1
monomeric
A
SER
1
GLYCOSYLATION SITE
A
SER
2
SER
A
THR
2
GLYCOSYLATION SITE
A
THR
3
THR
A
THR
3
GLYCOSYLATION SITE
A
THR
4
THR
1.0000000000
0.0000000000
0.0000000000
0.0000000000
1.0000000000
0.0000000000
0.0000000000
0.0000000000
1.0000000000
1_555
x,y,z
identity operation
0.0000000000
0.0000000000
0.0000000000
1
A
O6
A2G
7
B
O6
A2G
1
1
N
1
A
O6
A2G
10
C
O6
A2G
1
1
N
1
A
O6
A2G
13
D
O6
A2G
1
1
N
2
A
O6
A2G
7
B
O6
A2G
1
1
N
2
A
O6
A2G
10
C
O6
A2G
1
1
N
2
A
O6
A2G
13
D
O6
A2G
1
1
N
3
A
O6
A2G
7
B
O6
A2G
1
1
N
3
A
O6
A2G
10
C
O6
A2G
1
1
N
3
A
O6
A2G
13
D
O6
A2G
1
1
N
4
A
O6
A2G
7
B
O6
A2G
1
1
N
4
A
O6
A2G
10
C
O6
A2G
1
1
N
4
A
O6
A2G
13
D
O6
A2G
1
1
N
5
A
O6
A2G
7
B
O6
A2G
1
1
N
5
A
O6
A2G
10
C
O6
A2G
1
1
N
5
A
O6
A2G
13
D
O6
A2G
1
1
N
6
A
O6
A2G
7
B
O6
A2G
1
1
N
6
A
O6
A2G
10
C
O6
A2G
1
1
N
6
A
O6
A2G
13
D
O6
A2G
1
1
N
7
A
O6
A2G
7
B
O6
A2G
1
1
N
7
A
O6
A2G
10
C
O6
A2G
1
1
N
7
A
O6
A2G
13
D
O6
A2G
1
1
N
8
A
O6
A2G
7
B
O6
A2G
1
1
N
8
A
O6
A2G
10
C
O6
A2G
1
1
N
8
A
O6
A2G
13
D
O6
A2G
1
1
N
9
A
O6
A2G
7
B
O6
A2G
1
1
N
9
A
O6
A2G
10
C
O6
A2G
1
1
N
9
A
O6
A2G
13
D
O6
A2G
1
1
N
10
A
O6
A2G
7
B
O6
A2G
1
1
N
10
A
O6
A2G
10
C
O6
A2G
1
1
N
10
A
O6
A2G
13
D
O6
A2G
1
1
N
11
A
O6
A2G
7
B
O6
A2G
1
1
N
11
A
O6
A2G
10
C
O6
A2G
1
1
N
11
A
O6
A2G
13
D
O6
A2G
1
1
N
12
A
O6
A2G
7
B
O6
A2G
1
1
N
12
A
O6
A2G
10
C
O6
A2G
1
1
N
12
A
O6
A2G
13
D
O6
A2G
1
1
N
13
A
O6
A2G
7
B
O6
A2G
1
1
N
13
A
O6
A2G
10
C
O6
A2G
1
1
N
13
A
O6
A2G
13
D
O6
A2G
1
1
N
14
A
O6
A2G
7
B
O6
A2G
1
1
N
14
A
O6
A2G
10
C
O6
A2G
1
1
N
14
A
O6
A2G
13
D
O6
A2G
1
1
N
15
A
O6
A2G
7
B
O6
A2G
1
1
N
15
A
O6
A2G
10
C
O6
A2G
1
1
N
15
A
O6
A2G
13
D
O6
A2G
1
1
N
16
A
O6
A2G
7
B
O6
A2G
1
1
N
16
A
O6
A2G
10
C
O6
A2G
1
1
N
16
A
O6
A2G
13
D
O6
A2G
1
1
N
17
A
O6
A2G
7
B
O6
A2G
1
1
N
17
A
O6
A2G
10
C
O6
A2G
1
1
N
17
A
O6
A2G
13
D
O6
A2G
1
1
N
18
A
O6
A2G
7
B
O6
A2G
1
1
N
18
A
O6
A2G
10
C
O6
A2G
1
1
N
18
A
O6
A2G
13
D
O6
A2G
1
1
N
19
A
O6
A2G
7
B
O6
A2G
1
1
N
19
A
O6
A2G
10
C
O6
A2G
1
1
N
19
A
O6
A2G
13
D
O6
A2G
1
1
N
20
A
O6
A2G
7
B
O6
A2G
1
1
N
20
A
O6
A2G
10
C
O6
A2G
1
1
N
20
A
O6
A2G
13
D
O6
A2G
1
1
N
21
A
O6
A2G
7
B
O6
A2G
1
1
N
21
A
O6
A2G
10
C
O6
A2G
1
1
N
21
A
O6
A2G
13
D
O6
A2G
1
1
N
22
A
O6
A2G
7
B
O6
A2G
1
1
N
22
A
O6
A2G
10
C
O6
A2G
1
1
N
22
A
O6
A2G
13
D
O6
A2G
1
1
N
23
A
O6
A2G
7
B
O6
A2G
1
1
N
23
A
O6
A2G
10
C
O6
A2G
1
1
N
23
A
O6
A2G
13
D
O6
A2G
1
1
N
24
A
O6
A2G
7
B
O6
A2G
1
1
N
24
A
O6
A2G
10
C
O6
A2G
1
1
N
24
A
O6
A2G
13
D
O6
A2G
1
1
N
25
A
O6
A2G
7
B
O6
A2G
1
1
N
25
A
O6
A2G
10
C
O6
A2G
1
1
N
25
A
O6
A2G
13
D
O6
A2G
1
1
N
26
A
O6
A2G
7
B
O6
A2G
1
1
N
26
A
O6
A2G
10
C
O6
A2G
1
1
N
26
A
O6
A2G
13
D
O6
A2G
1
1
N
27
A
O6
A2G
7
B
O6
A2G
1
1
N
27
A
O6
A2G
10
C
O6
A2G
1
1
N
27
A
O6
A2G
13
D
O6
A2G
1
1
N
28
A
O6
A2G
7
B
O6
A2G
1
1
N
28
A
O6
A2G
10
C
O6
A2G
1
1
N
28
A
O6
A2G
13
D
O6
A2G
1
1
N
29
A
O6
A2G
7
B
O6
A2G
1
1
N
29
A
O6
A2G
10
C
O6
A2G
1
1
N
29
A
O6
A2G
13
D
O6
A2G
1
1
N
30
A
O6
A2G
7
B
O6
A2G
1
1
N
30
A
O6
A2G
10
C
O6
A2G
1
1
N
30
A
O6
A2G
13
D
O6
A2G
1
1
N
31
A
O6
A2G
7
B
O6
A2G
1
1
N
31
A
O6
A2G
10
C
O6
A2G
1
1
N
31
A
O6
A2G
13
D
O6
A2G
1
1
N
32
A
O6
A2G
7
B
O6
A2G
1
1
N
32
A
O6
A2G
10
C
O6
A2G
1
1
N
32
A
O6
A2G
13
D
O6
A2G
1
1
N
33
A
O6
A2G
7
B
O6
A2G
1
1
N
33
A
O6
A2G
10
C
O6
A2G
1
1
N
33
A
O6
A2G
13
D
O6
A2G
1
1
N
34
A
O6
A2G
7
B
O6
A2G
1
1
N
34
A
O6
A2G
10
C
O6
A2G
1
1
N
34
A
O6
A2G
13
D
O6
A2G
1
1
N
35
A
O6
A2G
7
B
O6
A2G
1
1
N
35
A
O6
A2G
10
C
O6
A2G
1
1
N
35
A
O6
A2G
13
D
O6
A2G
1
1
N
36
A
O6
A2G
7
B
O6
A2G
1
1
N
36
A
O6
A2G
10
C
O6
A2G
1
1
N
36
A
O6
A2G
13
D
O6
A2G
1
1
N
37
A
O6
A2G
7
B
O6
A2G
1
1
N
37
A
O6
A2G
10
C
O6
A2G
1
1
N
37
A
O6
A2G
13
D
O6
A2G
1
1
N
38
A
O6
A2G
7
B
O6
A2G
1
1
N
38
A
O6
A2G
10
C
O6
A2G
1
1
N
38
A
O6
A2G
13
D
O6
A2G
1
1
N
39
A
O6
A2G
7
B
O6
A2G
1
1
N
39
A
O6
A2G
10
C
O6
A2G
1
1
N
39
A
O6
A2G
13
D
O6
A2G
1
1
N
40
A
O6
A2G
7
B
O6
A2G
1
1
N
40
A
O6
A2G
10
C
O6
A2G
1
1
N
40
A
O6
A2G
13
D
O6
A2G
1
1
N
41
A
O6
A2G
7
B
O6
A2G
1
1
N
41
A
O6
A2G
10
C
O6
A2G
1
1
N
41
A
O6
A2G
13
D
O6
A2G
1
1
N
42
A
O6
A2G
7
B
O6
A2G
1
1
N
42
A
O6
A2G
10
C
O6
A2G
1
1
N
42
A
O6
A2G
13
D
O6
A2G
1
1
N
43
A
O6
A2G
7
B
O6
A2G
1
1
N
43
A
O6
A2G
10
C
O6
A2G
1
1
N
43
A
O6
A2G
13
D
O6
A2G
1
1
N
44
A
O6
A2G
7
B
O6
A2G
1
1
N
44
A
O6
A2G
10
C
O6
A2G
1
1
N
44
A
O6
A2G
13
D
O6
A2G
1
1
N
45
A
O6
A2G
7
B
O6
A2G
1
1
N
45
A
O6
A2G
10
C
O6
A2G
1
1
N
45
A
O6
A2G
13
D
O6
A2G
1
1
N
46
A
O6
A2G
7
B
O6
A2G
1
1
N
46
A
O6
A2G
10
C
O6
A2G
1
1
N
46
A
O6
A2G
13
D
O6
A2G
1
1
N
47
A
O6
A2G
7
B
O6
A2G
1
1
N
47
A
O6
A2G
10
C
O6
A2G
1
1
N
47
A
O6
A2G
13
D
O6
A2G
1
1
N
48
A
O6
A2G
7
B
O6
A2G
1
1
N
48
A
O6
A2G
10
C
O6
A2G
1
1
N
48
A
O6
A2G
13
D
O6
A2G
1
1
N
49
A
O6
A2G
7
B
O6
A2G
1
1
N
49
A
O6
A2G
10
C
O6
A2G
1
1
N
49
A
O6
A2G
13
D
O6
A2G
1
1
N
50
A
O6
A2G
7
B
O6
A2G
1
1
N
50
A
O6
A2G
10
C
O6
A2G
1
1
N
50
A
O6
A2G
13
D
O6
A2G
1
1
N
51
A
O6
A2G
7
B
O6
A2G
1
1
N
51
A
O6
A2G
10
C
O6
A2G
1
1
N
51
A
O6
A2G
13
D
O6
A2G
1
1
N
52
A
O6
A2G
7
B
O6
A2G
1
1
N
52
A
O6
A2G
10
C
O6
A2G
1
1
N
52
A
O6
A2G
13
D
O6
A2G
1
1
N
53
A
O6
A2G
7
B
O6
A2G
1
1
N
53
A
O6
A2G
10
C
O6
A2G
1
1
N
53
A
O6
A2G
13
D
O6
A2G
1
1
N
54
A
O6
A2G
7
B
O6
A2G
1
1
N
54
A
O6
A2G
10
C
O6
A2G
1
1
N
54
A
O6
A2G
13
D
O6
A2G
1
1
N
55
A
O6
A2G
7
B
O6
A2G
1
1
N
55
A
O6
A2G
10
C
O6
A2G
1
1
N
55
A
O6
A2G
13
D
O6
A2G
1
1
N
56
A
O6
A2G
7
B
O6
A2G
1
1
N
56
A
O6
A2G
10
C
O6
A2G
1
1
N
56
A
O6
A2G
13
D
O6
A2G
1
1
N
57
A
O6
A2G
7
B
O6
A2G
1
1
N
57
A
O6
A2G
10
C
O6
A2G
1
1
N
57
A
O6
A2G
13
D
O6
A2G
1
1
N
58
A
O6
A2G
7
B
O6
A2G
1
1
N
58
A
O6
A2G
10
C
O6
A2G
1
1
N
58
A
O6
A2G
13
D
O6
A2G
1
1
N
59
A
O6
A2G
7
B
O6
A2G
1
1
N
59
A
O6
A2G
10
C
O6
A2G
1
1
N
59
A
O6
A2G
13
D
O6
A2G
1
1
N
1
A
ALA
4
-84.99
-142.57
4
A
ALA
4
-85.30
-143.55
6
A
ALA
4
-84.99
-143.59
11
A
ALA
4
-153.51
-147.73
14
A
ALA
4
-84.74
-142.22
15
A
ALA
4
-85.09
-142.39
16
A
ALA
4
-152.37
-149.20
17
A
ALA
4
-84.84
-142.15
18
A
ALA
4
-84.96
-143.21
19
A
ALA
4
-154.78
-146.26
22
A
ALA
4
-84.77
-143.49
23
A
ALA
4
-153.67
-148.45
25
A
THR
2
-95.49
-159.50
25
A
ALA
4
-84.77
-143.59
27
A
THR
2
-94.80
-156.12
27
A
ALA
4
-84.68
-143.88
28
A
ALA
4
-85.20
-142.04
31
A
ALA
4
-85.04
-142.65
34
A
ALA
4
-154.62
-145.77
35
A
THR
2
-95.96
-155.09
37
A
THR
2
-96.65
-153.06
38
A
ALA
4
-84.76
-143.00
39
A
THR
2
-94.96
-158.67
39
A
ALA
4
-84.75
-143.31
40
A
THR
2
-95.59
-155.40
42
A
THR
2
-94.70
-157.10
42
A
ALA
4
-85.03
-141.80
43
A
ALA
4
-153.85
-147.59
44
A
THR
2
-96.14
-158.31
44
A
ALA
4
-84.85
-143.97
45
A
ALA
4
-85.30
-143.59
47
A
ALA
4
-84.47
-142.07
49
A
ALA
4
-84.59
-143.26
50
A
ALA
4
-84.82
-144.03
52
A
ALA
4
-85.08
-142.46
53
A
ALA
4
-154.30
-145.81
54
A
THR
2
-96.69
-158.78
55
A
ALA
4
-85.01
-144.04
56
A
THR
2
-97.36
-156.78
57
A
THR
2
-98.42
-156.69
57
A
ALA
4
-85.19
-142.72
58
A
ALA
4
-154.35
-146.12
Leukosialin (CD43) fragment
Tumor Associated Mucin Motif from CD43 protein
1
N
N
2
N
N
2
N
N
2
N
N
covale
1.334
both
A
ACE
0
A
C
ACE
1
1_555
A
SER
1
A
N
SER
2
1_555
covale
1.410
one
O-Glycosylation
A
SER
1
A
OG
SER
2
1_555
A
A2G
7
B
C1
A2G
1_555
covale
1.418
one
O-Glycosylation
A
THR
2
A
OG1
THR
3
1_555
A
A2G
10
C
C1
A2G
1_555
covale
1.420
one
O-Glycosylation
A
THR
3
A
OG1
THR
4
1_555
A
A2G
13
D
C1
A2G
1_555
glycoprotein, immune system
Leukosialin, CD43, Mucin glycoprotein, glycoprotein, immune system
1KYJ
PDB
1
1KYJ
0
5
1KYJ
0
5
1KYJ
A
1
1
6