Data items in the EM_2D_CRYSTAL_ENTITY category record the symmetry details of a 2D crystal assembly component. Space-group number from International Tables for Crystallography, Vol. A (1987). Unit-cell angle gamma of the reported structure, in degrees. Additional details describing this 2D crystal component Unit-cell length a corresponding to the structure reported, in Angstroms. Unit-cell length b corresponding to the structure reported, in Angstroms. The 17 plane groups are classified as oblique, rectangular, square, and hexagonal. To describe 2D crystals of biological molecules, the plane groups are expanded to their equivalent noncentrosymmetric space groups. The space group setting is chosen such that the 2D crystal plane corresponds to the 'ab' plane of the space group. . Enumerated space group descriptions include the H-M plane group symbol and plane group class. The thickness of the crystal sample in the out-of-plane direction. The value of attribute id in category em_2d_crystal_entity must uniquely identify a set of the crystal parameters for this assembly component. The value of attribute entity_assembly_id in category em_2d_crystal_entity identifies an assembly component with 2d crystal symmetry. This data item is a pointer to attribute id in category em_entity_assembly in the EM_ENTITY_ASSEMBLY category. Data items in the EM_2D_CRYSTAL_SELECTION category record details for the selection of 2D crystals. Any additional details used for selecting 2d crystals. negative monitor contrast facilitated particle picking The value of attribute selection_id in category em_2d_crystal_selection identifies the general set of selection conditions associated with specific filament selection conditions described in this category. The value of attribute selection_id in category em_2d_crystal_selection points to the attribute id in category em_particle_selection in the EM_PARTICLE_SELECTION category. Data items in the 3D_FITTING category record details of the method of fitting atomic coordinates from a PDB file into a 3d-em volume map file Example 1 - EMDB entry EM1078 <mmcif_iims:em_3d_fittingCategory> <mmcif_iims:em_3d_fitting id="1" entry_id="EM1078"> <mmcif_iims:method>AUTOMATIC</mmcif_iims:method> <mmcif_iims:ref_space>REAL</mmcif_iims:ref_space> <mmcif_iims:ref_protocol>RIGID BODY REFINEMENT</mmcif_iims:ref_protocol> </mmcif_iims:em_3d_fitting> <mmcif_iims:em_3d_fitting id="2" entry_id="EM1078"> <mmcif_iims:method>AUTOMATIC</mmcif_iims:method> <mmcif_iims:ref_space>REAL</mmcif_iims:ref_space> <mmcif_iims:ref_protocol>RIGID BODY REFINEMENT</mmcif_iims:ref_protocol> </mmcif_iims:em_3d_fitting> <mmcif_iims:em_3d_fitting id="3" entry_id="EM1078"> <mmcif_iims:method>AUTOMATIC</mmcif_iims:method> <mmcif_iims:ref_space>REAL</mmcif_iims:ref_space> <mmcif_iims:ref_protocol>RIGID BODY REFINEMENT</mmcif_iims:ref_protocol> </mmcif_iims:em_3d_fitting> </mmcif_iims:em_3d_fittingCategory> Example 2 - based on PDB entry 1DYL and laboratory records for the structure corresponding to PDB entry 1DYL <mmcif_iims:em_3d_fittingCategory> <mmcif_iims:em_3d_fitting id="1" entry_id="1DYL"> <mmcif_iims:method>AUTOMATIC</mmcif_iims:method> <mmcif_iims:target_criteria>R-FACTOR</mmcif_iims:target_criteria> <mmcif_iims:ref_space>REAL</mmcif_iims:ref_space> <mmcif_iims:ref_protocol>RIGID BODY REFINEMENT</mmcif_iims:ref_protocol> <mmcif_iims:details> THE CRYSTAL STRUCTURE OF THE CAPSID PROTEIN FROM CHOI ET AL (1997) PROTEINS 3 27:345-359 (SUBUNIT A OF PDB FILE 1VCQ) WAS PLACED INTO THE CRYO-EM DENSITY MAP. THE CAPSID PROTEIN WAS FIRST MANUALLY POSITIONED INTO THE CRYO-EM DENSITY CORRESPONDING TO POSITIONS OF THE FOUR INDEPENDENT MONOMER DENSITIES BETWEEN THE INNER LEAFLET OF THE BILAYER AND THE RNA. THESE POSITIONS WERE THEN REFINED BY RIGID BODY REFINEMENT IN REAL SPACE WITH THE PROGRAM EMFIT (CHENG ET AL. 1995, CELL 80, 621-630). THE QUALITY OF THE FIT CAN BE SEEN FROM THE MAP DENSITY WITHIN THE PROTEIN. ALL 4563 ATOMS ARE IN DENSITY OF AT LEAST 4 SIGMA (96.73) ABOVE THE AVERAGE (512.04), 1167 ATOMS ARE IN DENSITY BETWEEN 4 AND 5 SIGMA, 3174 ATOMS ARE IN DENSITY BETWEEN 5 AND 6 SIGMA, AND 222 ATOMS ARE IN DENSTY OF 6 SIGMA OR ABOVE. THE VARIATION IN DENSITY OVER THE FITTED PROTEIN CAN BE VISUALIZED WITH THE PSEUDO TEMPERATURE FACTOR. THE DENSITY VALUE AT EACH ATOM IS GIVEN IN THE 8TH COLUM (USUALLY THE OCCUPANCY) AS THE NUMBER OF STANDARD DEVIATION ABOVE BACKGROUND. COLUMN NINE (USUALLY THE TEMPERATURE FACTOR) CONTAINS THE VALUE OF THE RELATIVE DENSITY WITHIN THE FITTED PROTEIN SCALED LINEARLY SO THAT THE MINIMUM DENSITY IS 100.0 AND THE MAXIMUM DENSITY IS 1.0. THE ATOMS THAT LIE IN THE LOWER DENSITY REGIONS WILL HAVE THE HIGHEST PSEUDO TEMPERATURE FACTORS. </mmcif_iims:details> </mmcif_iims:em_3d_fitting> </mmcif_iims:em_3d_fittingCategory> Any additional details regarding fitting of atomic coordinates into the 3d-em volume. partial Description of local variance of fit of the atomic coordinates into the 3dem volume map. The method used to fit atomic coordinates into the 3dem reconstructed map. The overall B (temperature factor) value for the 3d-em volume. Description of the quality of fit of the atomic coordinates into the 3dem volume map. The type of protocol used in the refinement. rigid body A flag to indicate whether fitting was carried out in real or reciprocal refinement space. The quality of fit of the atomic coordinates into the 3dem volume map. best visual fit using the program O The value of attribute id in category em_3d_fitting must uniquely identify a fitting procedure of atomic coordinates into 3dem reconstructed volume map. This data item is a pointer to _entry_id in the ENTRY category. Data items in the 3D_FITTING_LIST category lists the methods of fitting atomic coordinates from a PDB file into a 3d-em volume map file Example 1 - based on EM entry 1078 The end sequence ID for the pdb entry chain used in the fitting The EM entry id pointer The chain id for the resulting fitted coordinates The PDB code for the entry produced by the fitting. Description of a particular component pdb entry used in fitting. The chain id for the entry used in fitting. The PDB code for the entry used in fitting. The symmetry required to be applied to the starting PDB entry chain before starting the fitting procedure The start sequence ID for the pdb entry chain used in the fitting The (1,1) element of a 4,4 matrix relating the starting PDB chain to the fitted coordinates in the case of rigid body refinement The (1,2) element of a 4,4 matrix relating the starting PDB chain to the fitted coordinates in the case of rigid body refinement The (1,3) element of a 4,4 matrix relating the starting PDB chain to the fitted coordinates in the case of rigid body refinement The (1,4) element of a 4,4 matrix relating the starting PDB chain to the fitted coordinates in the case of rigid body refinement The (2,1) element of a 4,4 matrix relating the starting PDB chain to the fitted coordinates in the case of rigid body refinement The (2,2) element of a 4,4 matrix relating the starting PDB chain to the fitted coordinates in the case of rigid body refinement The (2,3) element of a 4,4 matrix relating the starting PDB chain to the fitted coordinates in the case of rigid body refinement The (2,4) element of a 4,4 matrix relating the starting PDB chain to the fitted coordinates in the case of rigid body refinement The (3,1) element of a 4,4 matrix relating the starting PDB chain to the fitted coordinates in the case of rigid body refinement The (3,2) element of a 4,4 matrix relating the starting PDB chain to the fitted coordinates in the case of rigid body refinement The (3,3) element of a 4,4 matrix relating the starting PDB chain to the fitted coordinates in the case of rigid body refinement The (3,4) element of a 4,4 matrix relating the starting PDB chain to the fitted coordinates in the case of rigid body refinement The (4,1) element of a 4,4 matrix relating the starting PDB chain to the fitted coordinates in the case of rigid body refinement The (4,2) element of a 4,4 matrix relating the starting PDB chain to the fitted coordinates in the case of rigid body refinement The (4,3) element of a 4,4 matrix relating the starting PDB chain to the fitted coordinates in the case of rigid body refinement The (4,4) element of a 4,4 matrix relating the starting PDB chain to the fitted coordinates in the case of rigid body refinement This data item is a unique identifier. The value of attribute 3d_fitting_id in category em_3d_fitting_list is a pointer to attribute id in category em_3d_fitting in the 3d_fitting category Data items in the EM_3D_RECONSTRUCTION category record details of the 3D reconstruction procedure from 2D projections. Example 1 - based on PDB entry 1DYL and laboratory records for the structure corresponding to PDB entry 1DYL <mmcif_iims:em_3d_reconstructionCategory> <mmcif_iims:em_3d_reconstruction entry_id="1DYL" id="1"> <mmcif_iims:citation_id>1</mmcif_iims:citation_id> <mmcif_iims:resolution>9.</mmcif_iims:resolution> </mmcif_iims:em_3d_reconstruction> </mmcif_iims:em_3d_reconstructionCategory> The Amplitude correction method. Frequency amplitude correction with X-ray scattering data enhances the Fourier amplitudes of a reconstructed cryo-EM volume so they more closely resemble those of experimental low-angle X-ray scattering data. Normal amplitude correction (in which case the SNR weighted averaging of particles will still occur properly) may be applied or without it, in which case the (phase-flipped) data is not corrected during averaging, then the final 3D model is 'fixed'. Details of the B-factor correction method. This data item is a pointer to attribute id in category citation in the CITATION category. The CTF-correction method. The Contrast Transfer Function CTF compensation for low contrast specimens (e.g. frozen-hydrated), for which phase contrast is the only significant mechanism, then higher defocus levels must be used to achieve any significant transfer, and several images at different focus levels must be combined to complete the information lost from the transfer gaps of any one image. The CTF correction can be applied to each extracted particle separately or to the whole micrograph after digitisation. The simplest level of compensation is to reverse phases at the negative lobes of the CTF. CTF correction of each particle General details on the 3d recontruction Orientation determination using the random-conical data collection method. This method uses a defined geometry in the data collection, and is able to find the handedness of the structure unambiguously. Each specimen field is imaged twice, once tilted, once untilted. Particles are selected simultaneously from both untilted- and tilted-specimen fields, using a special interactive particle-selection program that is able to "predict" the location of a particle in the tilted-specimen field when its counterpart has been selected in the untilted field. From the untilted-specimen particle data set, all particles are selected that exhibit the same view. This can be done by using alignment followed by classification. The corresponding tilted-specimen data subset can be used to compute a reconstruction: the orientations of the tilted-particle projections lie on a cone with fixed angle (the tilt angle) and random azimuths (the in-plane angles found in the alignment of the untilted particle set). 1 Orientation determination using common lines (a.k.a. "angular reconstitution"). This method is based on the fact that in Fourier space any two projections intersect along a central line ("the common line"). Hence, in principle, the relative orientations between three projections can be determined - except that the handedness of the constellation is ambiguous. Because of the low signal-to-noise ratio of raw particle images, averages of projections falling into roughly the same orientation must be used. Since the procedure leads to solutions presenting local minima, it must be repeated several times to find solutions that form a cluster, presumably around the global minimum. Such clustering of solutions can be detected by multivariate statistical analysis of the resulting 3D maps. Two clusters are expected, one for each enantiomorph. After initial structure is obtained, it should be further refined using 3D projection matching strategy described next. 2 Orientation determination by 3D projection matching. Here the existing 3D map is projected in many orientations on a regular angular grid, and the resulting projections that are compared, one by one, with each of the experimental projections. This comparison (by cross-correlation ) yields a refined set of Eulerian angles , with which a refined reconstruction can be computed using one of the possible reconstruction techniques. This procedure requires iteration until the angles for each projection stabilize. 3 Details of the envelope function correction method. The average phase residual for the helical assembly. The layer-line resolution. Layer-lines fade out and are only visible to a certain resolution. The number of datasets used in the 3d reconstruction for the helical assembly. Details on how located layer lines are used to choose a selection rule which best fits the data. Based on this selection rule the filament is then reboxed and restraightened using the original digitized image so that the final image contains an integral number of helical repeats. The selection rule define helical lattices which relate the layer- line number l to the order of the Bessel function, n, contributing to the layer-line. (n is the start number, ie number around the circumference, of the contributing helix). The diffraction pattern from a helix consists not of discrete spots but of difraction spots which have been broadened into layer-lines. The order of the Bessel functions allowed to contribute to the diffraction pattern of a helix on a given layer line versus the layer line along the ordinate gives a function which is described by a lattice. Such a plot is analogous to the diffraction pattern from a planar array corresponding to a flattened helix and is called an (n,l) plot. e.g. the n,l plot corresponding to the selection rule l = 5n + 12m where m is an integer and indicates e.g. 12 (ribosomes) per repeat five turns long, whilst the n,l plot corresponding to the selection rule l = 5n + 17m indicates 17(ribosomes) per repeat five turns along. General details describing any local or approximate symmetry used in the single particle reconstruction The algorithm method used for the 3d-reconstruction. e.g. Random-conical reconstruction: a method of data collection and reconstruction used for single particles, typically used initially in a project, to obtain a first low-resolution reconstruction of the macromolecule [Radermacher et al., 1987]. Two images of the same specimen field are collected, one with untilted grid, the other with the grid tilted by 50 to 60 degrees. Any set of particles presenting the same view in the untilted-specimen image form a random-conical projection set in the associated tilted-specimen image. Helical reconstruction Helical reconstruction is used when the protein of interest forms a natural helix. Since the helix is a recurring structure with a very well defined pattern, the repeating pattern of the helix can be exploited to solve the structure. In this case, no alignment of the particles is needed, since the individual positions of subunits within the helix are clearly defined by the shape of the helix. Two common examples of structures solved by helical reconstruction are TMV and microtubules. Icosahedral reconstruction Icosahedral reconstructions also take advantage of internal symmetry and repetition to generate a detailed three-dimensional structure from the data set. In this case, the symmetry is icosahedral (twenty-one sided). Many viruses exhibit icosahedral symmetry in their capsid proteins, and this method has been used to solve their structures. Electron crystallography Electron crystallography is similar to x-ray crystallography in that it exploits the repeating pattern found within a crystal to generate a structure. Just as with x-ray crystallography, difraction patterns are generated and are used to define an electron density map. However, it differs in that the crystal used is a two-dimensional sheet as opposed to three three-dimensional crystals of x-ray crystallography. Common Lines Another reconstruction method searches for the intersection of any two projections in Fourier space. The Fourier transform of the experimental projections all form slices around a common core in Fourier space. Therefore, the intersection of these projections are unique (unless the projections perfectly overlap), and their relative orientation can be found when three or more projections are used. A principal problem with this method is that the handedness of the image is lost. This, however, can later be corrected by visual examination of the model with other known structural information. Back Projection As its name implies, back projection is the inverse function of projection. When an n-dimensional object is projected, each projection is an n-1 dimensional sum of its density along the projection axis. Therefore, a sphere would have circles as its projections. A cube, on the other hand, would produce either squares, diamonds, or other intermediate parallelograms depending on the direction of projection. The actual shape, of course, depends on the orientation from which the projection was made. The reverse function is called back projection and regenerates the original object. cross-common lines The number of asymmetric units used in the single particle reconstruction The number of particles used in the 3d reconstruction The final resolution (in angstroms)of the 3d reconstruction. The method used to determine the final resolution of the 3d reconstruction. The Fourier Shell Correlation criterion as a measure of resolution is based on the concept of splitting the (2D) data set into two halves; averaging each and comparing them using the Fourier Ring Correlation (FRC) technique. FSC at 0.5 cut-off The actual pixel size of projection set of images in x IF only attribute voxel_size_x in category em_3d_reconstruction is given then a cube is assumed. The actual pixel size of projection set of images in y IF only attribute voxel_size_x in category em_3d_reconstruction is given then a cube is assumed. The actual pixel size of projection set of images in z IF only attribute voxel_size_x in category em_3d_reconstruction is given then a cube is assumed. This data item is a pointer to attribute id in category entry in the ENTRY category. The value of attribute id in category em_3d_reconstruction must uniquely identify the 3d reconstruction. Data items in the EM_3D_REFINEMENT category record details about the class/particle refinement. In random conical tilt, images were assigned angular positions through rotational alignment and tilt-angles. From each different class, a three-dimensional preliminary model is constructed. To improve the output, those preliminary models from each class that have a high degree of similarity are merged. In theory, these models corresponded to groups of the same molecule just viewed from different orientations. Once all the good random conical tilt models (and their corresponding particle data sets) have been merged, iterative angular refinement is used to improve the model's resolution. Equidistant projections are first generated from the merged model. The entire particle data set (whether the old random conical tilt experimental particles, or new untilted experimental particles, or both) is then cross correlated to each reference projection. A correlation coefficient is generated between each experimental particle and reference projection. For each individual experimental particle, it is matched to the reference projection that gave the highest correlation coefficient. Therefore, it is assumed that this particle matches the Euler angles of the reference projection. Alignment radius (pixels) used in alignment search the angular_search_step_size used in refinement Convergence criterion fraction e.g. Converges when x16 % of all images move < 1.5 * stepsize This data item is a pointer to attribute id in category entry in the ENTRY category. the max_spatial_frequency used in refinement (1/A) The criterion used to determine the maximum spatial frequency. Description of the 3d refinement method the number of iterations used in refinement The number of particles used in refinement. the Projection radius in pixels the structure_radius in pixels The value of attribute id in category em_3d_refinement must uniquely identify the refinement used in the em experiments. Data items in the EM_ARRAY_FORMATION category record details of growth conditions for the array samples. Example 1 - based on PDB entry 1AT9 and laboratory records for the structure corresponding to PDB entry 1DYL <mmcif_iims:em_array_formationCategory> <mmcif_iims:em_array_formation id="1" type="2D-CRYSTAL"> <mmcif_iims:atmosphere>room air</mmcif_iims:atmosphere> <mmcif_iims:pH>5.2</mmcif_iims:pH> <mmcif_iims:temp>18.</mmcif_iims:temp> <mmcif_iims:buffer_id>2</mmcif_iims:buffer_id> <mmcif_iims:details>on grid</mmcif_iims:details> <mmcif_iims:citation_id>2</mmcif_iims:citation_id> </mmcif_iims:em_array_formation> </mmcif_iims:em_array_formationCategory> The type of the apparatus used for growing the array. Langmuir trough The type of atmosphere in which arrays were grown. room air This data item is a pointer to attribute id in category em_solution_composition. This data item is a pointer to attribute id in category citation in the CITATION category. Any additional items concerning array growth. Two-dimensional Crystallization-- Purified protein (2 mg/ml) was mixed with E. coli lipids solubilized in OTG (mixed micelles stock solution, 4 mg/ml E. coli lipids in 20 mM Mes-NaOH (pH 6), 5% OTG, 0.01% NaN3) to achieve a lipid to protein ratio of 1 (w/w). The final protein concentration was adjusted to 1.33 mg/ml, and the final OTG content was adjusted to 1.93%. The reconstitution mixture (60 µl) was preincubated at room temperature for 30 min and dialyzed against 1.5 liters of 10 mM Mes-NaOH (pH 6), 100 mM NaCl, 100 mM MgCl2, 2 mM dithiothreitol, 0.01% NaN3 for 24 h at room temperature, 24 h at 37 °C, and another 24 h at room temperature. The method used for growing the array. lipid monolayer the pH value used for growing the array. 4.7 This data item is a pointer to attribute id in category em_solution_composition in the EM_SOLUTION_COMPOSITION category. The value of the temperature in degrees Kelvin used for growing the arrays. 293 The length of time required to grow the array. approximately 2 days The value of attribute id in category em_array_formation must uniquely identify the sample. The value of attribute type in category em_array_formation must identifies the type of array studied. Data items in the em_assembly category record basic information about the assembly represented by the EM map. based on PDB entry 1DGI <mmcif_iims:em_assemblyCategory> <mmcif_iims:em_assembly id="1" entry_id="1DGI"> <mmcif_iims:name>Poliovirus-CD155</mmcif_iims:name> <mmcif_iims:composition>virus-receptor complex</mmcif_iims:composition> <mmcif_iims:num_components>2</mmcif_iims:num_components> <mmcif_iims:array>NO</mmcif_iims:array> <mmcif_iims:superstructure>NO</mmcif_iims:superstructure> </mmcif_iims:em_assembly> </mmcif_iims:em_assemblyCategory> based on PDB entry 2BG9 <mmcif_iims:em_assemblyCategory> <mmcif_iims:em_assembly id="1" entry_id="2BG9"> <mmcif_iims:name>Acetylcholine receptor, Torpedo postsynaptic membrane</mmcif_iims:name> <mmcif_iims:composition>integral membrane receptor</mmcif_iims:composition> <mmcif_iims:num_components>1</mmcif_iims:num_components> <mmcif_iims:array>YES</mmcif_iims:array> <mmcif_iims:superstructure>YES</mmcif_iims:superstructure> </mmcif_iims:em_assembly> </mmcif_iims:em_assemblyCategory> A flag to indicate whether the imaged assembly is part of a regular array, e.g, a 2D or helical crystal. The known composition of the sample. Any additional details about the assembly. The name of the biological assembly helical crystals of acetylcholine receptor poliovirus - CD155 receptor complex The number of components of the biological assembly. A flag to indicate whether the imaged assembly is part of a larger structure, e.g., a membrane, virus, or cell. The value of attribute id in category em_assembly must uniquely identify the EM experiment. This data item is a pointer to attribute id in category entry in the ENTRY category. Data items in the EM_CLASSES category record details about the particle classification. Particle classification involves grouping images that are similar, and separating images that are distinct. In practical use, this means that experimental projections that have the same orientation (shape) are placed within the same category for later averaging. In this case, orientation means that the particles are showing the same face to the viewer and the only difference between them is that they can be rotated by some angle in the plane of the image. The experimental projections might also be shifted relative to each other, but the centering of the experimental projections is often done before classification. is this required? E(e1,e2,e3) = E(w,h,i) cos(i)cos(h)cos(w)-sin(i)sin(h) cos(i)cos(h)sin(w)+sin(i)sin(h) -cos(i)sin(h) -sin(i)cos(h)cos(w)-cos(i)sin(h) sin(i)cos(h)sin(w)+cos(i)sin(h) sin(i)sin(h) sin(h)cos(w) sin(h)sin(w) cos(h) The alignment_method used The percentage angular error threshold The average_angular_error in degrees The average_translational_pixel_shift_error The clustering_method used Description of the classes derived in the em experiments. We have used size variation analyses to classify images recorded from preparations of the WT S. cerevisiae PDC to which sufficient E1 was added to occupy its 60 binding sites and the same preparation with about one-third of the E1 binding sites occupied. Two 3D reconstructions representative of images that vary in size by 10-12% (~50 Å in diameter) from these preparations were computed to document the E1 organization about the core and the length of its inner linkers. In this regard, our previous structure of the WT bovine kidney PDC in which ~22 E1s were bound indicated that the outer shell could readily accommodate 60 molecules of E1 without significant crowding. Surprisingly, this study shows that extensive E1 binding favors a more extended inner linker and an altered arrangement of E1 about the core. 1 The focal pair method of orientation determination, refinement, and 3D reconstruction as implemented in the IMIRS software package was used except that an additional step of particle-size evaluation was performed in the current reconstruction. Data sets consisting of 1,500 and 690 particle images of PDC with a molar ratio of 60 E1/E2 core and ~24 E1/E2 core, respectively, were processed. For both data sets, an iterative procedure was implemented to classify the particles according to their sizes by using the SIZEDIFF program with contrast transfer function correction incorporated. A preliminary 3D reconstruction was calculated by combining all of the particles, and this "average" reconstruction was used to classify the images into a 1.0 size group comprising a 3% size variation of the images. For the PDC with ~60 E1/E2 core, the converged structure from 128 images in the 1.0 size group, was then used as a model to classify 45 and 80 images in the 0.95 and 1.05 size groups, respectively. For the WT PDC preparation (24 E1/E2 core) the converged structure from 80 images in the 1.0 size group was used as model to classify 46 and 53 images in the 0.95 and 1.05 size groups, respectively. The image size distribution appears bell-shaped and is consistent with a more extensive data set of the human PDC (Y.G., Z.H.Z., Y. Hiromasa, H. Bao, X. Yan, T. E. Roche, and J.K.S., unpublished results). The finding that 1.0 size groups consist of the larger and smaller reconstructions in the PDC preparations according to their greater or lesser degree of E1 occupancy, respectively, indicates that the extent of E1 binding is related to the variable size of the molecules. 2 A classification was performed using the self-organizing map (SOM) algorithms of the XMIPP package. The entire set was first low pass-filtered to 3.2 nm, and a reference-free alignment was performed using the Spider software package. Transformations in x, y, and in-plane angle were imposed, and the data set was fed to the kernel density SOM procedure using a 10 x 10 grid. The procedure generates a grid of code vectors that represent the assigned images. It was verified that clean looking code vectors represented classes of clean particles, while particles assigned to defect-ridden code vectors were themselves of poor quality. The procedure was repeated several times with different parameters, and in each case a set of roughly 3000 good particles was obtained. Further processing was conducted on a set containing 2943 particles. 3 The picked particles were submitted to a multivariate statistical analysis without alignment and were classified into clusters of particles with similar features. To this end, a program package kindly provided by J. P. Bretaudière was used. The various cluster averages revealed square and round shaped particles at different angular orientations. These averages were taken as references for subsequent angular and translational alignment of the extracted 4096 particles. Aligned particles were classified again, and cluster averages were calculated. 4 This data item is a pointer to attribute id in category entry in the ENTRY category. The euler angle about z-axis The euler angle about y-axis The second euler angle about z-axis The fractional_minimum_amplitude The global_correlation_coefficient The global_real-space_correlation_coefficient flag for method used for internal resolution The number of particles used in the class average The class origin in X The class origin in Y The value of attribute id in category em_classes must uniquely identify the classes used in the em experiments. Data items in the EM_CRYO_STAIN category record details about the staining techniques used. Text describing a reference citation on the staining techniques used If the details given are for a cryogen staining method the name of the cryogen used General details on the staining techniques used The humidity at which the staining technique was used Details on the instrument used in the staining technique used Text describing the protocol for the staining techniques used A pointer to attribute id in category em_sample_preparation in the EM_SAMPLE_PREPARATION category The staining technique temperature used Text giving details on the time factors involved in the staining techniques used The general class or type of the staining technique used The value of attribute id in category em_cryo_stain must uniquely identify set of stain parameters This data item is a pointer to attribute id in category entry in the ENTRY category. Data items in the EM_DETECTOR category record details of the image detector type. Example 1 - based on PDB entry 1DYL and laboratory records for the structure corresponding to PDB entry 1DYL <mmcif_iims:em_detectorCategory> <mmcif_iims:em_detector entry_id="1DYL" id="1"> <mmcif_iims:type>FILM</mmcif_iims:type> </mmcif_iims:em_detector> </mmcif_iims:em_detectorCategory> The detector type used for recording images. Usually film or CCD camera. This data item is a pointer to attribute id in category entry in the ENTRY category. The value of attribute id in category em_detector must uniquely identify the detector used for imaging. Data items in the EM_DETECTOR_CCD category record details of the CCD detector type. Example 1 - <mmcif_iims:em_detector_CCDCategory> <mmcif_iims:em_detector_CCD detector_id="1"> </mmcif_iims:em_detector_CCD> </mmcif_iims:em_detector_CCDCategory> Any additional information about the detection system. The detector dimension in x The detector dimension in y The CCD detector model used for recording images. The detector pixel size The value of attribute detector_id in category em_detector_CCD must uniquely identify the description of the CCD detector. The value of attribute detector_id in category em_detector_CCD is a pointer to attribute id in category em_detector in category EM_DETECTOR. Data items in the EM_DETECTOR_FILM category record details of the image detector type. Example 1 - based on PDB entry 1DYL and laboratory records for the structure corresponding to PDB entry 1DYL <mmcif_iims:em_detector_filmCategory> <mmcif_iims:em_detector_film> </mmcif_iims:em_detector_film> </mmcif_iims:em_detector_filmCategory> Any additional information about the detection system. The detector dimension in x The detector dimension in y Description of film_processing_conditions The film type used for recording images. The value of attribute detector_id in category em_detector_film must uniquely identify the characteristics of the film detector. The value of attribute detector_id in category em_detector_film is a pointer to attribute id in category em_detector in category EM_DETECTOR. The EM_ELECTRON_DIFFRACTION category records basic information about electron diffraction experiment. Example 1 - based on PDB entry 1TUB and laboratory records for the structure corresponding to PDB entry 1TUB <mmcif_iims:em_electron_diffractionCategory> <mmcif_iims:em_electron_diffraction entry_id="1TUB" id="1"> <mmcif_iims:num_unique_reflections>12000</mmcif_iims:num_unique_reflections> <mmcif_iims:num_diff_patterns>94</mmcif_iims:num_diff_patterns> <mmcif_iims:num_images>149</mmcif_iims:num_images> <mmcif_iims:tilt_range_min>0.</mmcif_iims:tilt_range_min> <mmcif_iims:tilt_range_max>55.</mmcif_iims:tilt_range_max> <mmcif_iims:d_res_high>3.7</mmcif_iims:d_res_high> <mmcif_iims:details xsi:nil="true" /> </mmcif_iims:em_electron_diffraction> </mmcif_iims:em_electron_diffractionCategory> the highest resolution d-value for the electron diffraction experiment. 5.0 Details of the electron diffraction experiment THE MODEL WAS DERIVED USING ELECTRON DIFFRACTION AND IMAGE DATA FROM TWO DIMENSIONAL CRYSTALS OF TUBULIN INDUCED BY THE PRESENCE OF ZN++ IONS. WHAT FOLLOWS ARE THE COORDINATES FOR THE AB-TUBULIN DIMER BOUND TO TAXOL AS OBTAINED BY ELECTRON CRYSTALLOGRAPHY OF ZINC-INDUCED SHEETS. THIS IS THE UNREFINED MODEL, BUILT INTO A RAW DENSITY MAP WHERE THE RESOLUTION IN THE PLANE OF THE SHEET WAS 3.7 ANGSTROMS AND THAT PERPENDICULAR TO THE SHEET ABOUT 4.8 ANGSTROMS. THE MODEL DOES NOT CONTAIN MOST OF THE C-TERMINAL RESIDUES OF EITHER MONOMER WHICH WERE DISORDERED IN THE MAP. THE LOOP BETWEEN HELIX H1 AND STRAND S2, AND THAT BETWEEN H2 AND S3 ARE PRESENT FOR COMPLETENESS BUT WERE BUILT INTO VERY WEAK DENSITY. GIVEN THE LIMITED RESOLUTION OF THE MAP, THE CONFORMATION OF THE SIDE CHAINS, ESPECIALLY THOSE CORRESPONDING TO RESIDUES ON THE SURFACE OF THE DIMER, MUST BE TAKEN CAUTIOUSLY. IN ADDITION, BECAUSE THIS IS AN UNREFINED MODEL, CERTAIN GEOMETRY ERRORS MAY STILL BE PRESENT IN THE STRUCTURE. PLEASE TAKE THIS INTO ACCOUNT WHEN INTERPRETING YOUR OWN DATA BASED ON THE PRESENT TUBULIN STRUCTURE. ALTHOUGH THE POSITION OF RESIDUES (WITH THE EXCEPTION OF THOSE IN THE LOOPS MENTIONED ABOVE) SHOULD NOT CHANGE SIGNIFICANTLY UPON REFINEMENT, DRAWING INFORMATION AT THE LEVEL OF SIDE CHAIN CONFORMATION IS CLEARLY NOT ADVISED. FINALLY, PLEASE NOTICE THAT THE TAXOID IN THE MODEL IS THE TAXOL DERIVATIVE TAXOTERE. 1 The number of diffraction patterns collected in the electron diffraction experiment. The number of 2D crystal images collected in the electron diffraction experiment. The total number of structure factors measured in the electron diffraction experiment, before merging to a unique set. 25743 The number of unique structure factors from the electron diffraction experiment. 12000 the overall phase error in degrees. the rejection criteria (phase error) in degrees. The maximum tilt angle used in the electron diffraction experiment. The minimum tilt angle used in the electron diffraction experiment. The value of attribute id in category electron_diffraction must uniquely identify the electron diffraction experiment. This data item is a pointer to attribute id in category entry in the ENTRY category. data items in the em_electron_diffraction_shell category record details about the quality of the phase information within a specified resolution range. based on pdb entry 1TUB <mmcif_iims:em_electron_diffraction_shellCategory> <mmcif_iims:em_electron_diffraction_shell electron_diffraction_id="1" id="1"> <mmcif_iims:d_res_low>5.0</mmcif_iims:d_res_low> <mmcif_iims:d_res_high>4.0</mmcif_iims:d_res_high> <mmcif_iims:residual>36</mmcif_iims:residual> </mmcif_iims:em_electron_diffraction_shell> <mmcif_iims:em_electron_diffraction_shell electron_diffraction_id="1" id="2"> <mmcif_iims:d_res_low>4.0</mmcif_iims:d_res_low> <mmcif_iims:d_res_high>3.7</mmcif_iims:d_res_high> <mmcif_iims:residual>46</mmcif_iims:residual> </mmcif_iims:em_electron_diffraction_shell> </mmcif_iims:em_electron_diffraction_shellCategory> the highest resolution d-value for the resolution range. 5.0 the lowest resolution d-value for the resolution range. 4.0 the phase residual value for the electron diffraction experiment. the value of attribute id in category electron_diffraction_shell must uniquely identify a resolution range of the electron diffraction data. this data item is a pointer to attribute id in category em_electron_diffraction in the em_electron_diffraction category. data items in the em_electron_diffraction_tilt_angle category record details about data collected at a specific tilt angle. based on pdb entry 1TUB <mmcif_iims:em_electron_diffraction_tilt_angleCategory> <mmcif_iims:em_electron_diffraction_tilt_angle electron_diffraction_id="1" id="1"> <mmcif_iims:tilt_angle>0.</mmcif_iims:tilt_angle> <mmcif_iims:num_patterns>18</mmcif_iims:num_patterns> <mmcif_iims:num_images>12</mmcif_iims:num_images> </mmcif_iims:em_electron_diffraction_tilt_angle> <mmcif_iims:em_electron_diffraction_tilt_angle electron_diffraction_id="1" id="2"> <mmcif_iims:tilt_angle>45.</mmcif_iims:tilt_angle> <mmcif_iims:num_patterns>57</mmcif_iims:num_patterns> <mmcif_iims:num_images>51</mmcif_iims:num_images> </mmcif_iims:em_electron_diffraction_tilt_angle> <mmcif_iims:em_electron_diffraction_tilt_angle electron_diffraction_id="1" id="3"> <mmcif_iims:tilt_angle>55.</mmcif_iims:tilt_angle> <mmcif_iims:num_patterns>19</mmcif_iims:num_patterns> <mmcif_iims:num_images>86</mmcif_iims:num_images> </mmcif_iims:em_electron_diffraction_tilt_angle> </mmcif_iims:em_electron_diffraction_tilt_angleCategory> the number of images measured at the specified tilt angle. 51 the number of diffraction patterns measured at the specified tilt angle. 57 the tilt angle at which diffraction data and/or images were obtained. 45.0 the value of attribute id in category electron_diffraction_tilt_angle must uniquely identify the tilt angle. this data item is a pointer to attribute id in category em_electron_diffraction in the EM_ELECTRON_DIFFRACTION category. Data items in the EM_EMBEDDING_AGENT category record details about the type of reagents into which the sample was embedded Details on a reference citation on the embedding agent used General details on the embedding agent used The temperature the embedding agent was used at Details about the effect of time resolution for the embedding agent used The type of embedding agent used The value of attribute id in category em_embedding_agent must uniquely identify set of the embedding agent parameters This data item is a pointer to attribute id in category entry in the ENTRY category. The EM_ENTITY_ASSEMBLY category defines a hierarchy-independent list of assemblies relevant to the EM experiment. The recommended convention is that the imaged assembly, defined in the category EM_ASSEMBLY, is listed first. Components, arrays and superstructures of the assembly are also described. The hierarchy independence enables descriptions of symmetry, sample preparation, particle selection, and map masks at multiple levels. based on PDB entry 1DGI <mmcif_iims:em_entity_assemblyCategory> <mmcif_iims:em_entity_assembly id="1"> <mmcif_iims:assembly_id>1</mmcif_iims:assembly_id> <mmcif_iims:type>COMPLEX ASSEMBLY</mmcif_iims:type> <mmcif_iims:symmetry_type>point symmetry</mmcif_iims:symmetry_type> <mmcif_iims:name>poliovirus-CD155 complex</mmcif_iims:name> </mmcif_iims:em_entity_assembly> <mmcif_iims:em_entity_assembly id="2"> <mmcif_iims:assembly_id>1</mmcif_iims:assembly_id> <mmcif_iims:type>VIRUS</mmcif_iims:type> <mmcif_iims:symmetry_type>point symmetry</mmcif_iims:symmetry_type> <mmcif_iims:name>poliovirus</mmcif_iims:name> </mmcif_iims:em_entity_assembly> <mmcif_iims:em_entity_assembly id="3"> <mmcif_iims:assembly_id>1</mmcif_iims:assembly_id> <mmcif_iims:type>PROTEIN</mmcif_iims:type> <mmcif_iims:symmetry_type>asymmetric</mmcif_iims:symmetry_type> <mmcif_iims:name>CD155 receptor</mmcif_iims:name> </mmcif_iims:em_entity_assembly> </mmcif_iims:em_entity_assemblyCategory> based on PDB entry 2BG9 <mmcif_iims:em_entity_assemblyCategory> <mmcif_iims:em_entity_assembly id="1"> <mmcif_iims:assembly_id>1</mmcif_iims:assembly_id> <mmcif_iims:type>PROTEIN</mmcif_iims:type> <mmcif_iims:symmetry_type>asymmetric</mmcif_iims:symmetry_type> <mmcif_iims:name>acetylcholine receptor</mmcif_iims:name> </mmcif_iims:em_entity_assembly> <mmcif_iims:em_entity_assembly id="2"> <mmcif_iims:assembly_id>1</mmcif_iims:assembly_id> <mmcif_iims:type>MEMBRANE</mmcif_iims:type> <mmcif_iims:name>torpedo post-synaptic membrane</mmcif_iims:name> </mmcif_iims:em_entity_assembly> <mmcif_iims:em_entity_assembly id="3"> <mmcif_iims:assembly_id>1</mmcif_iims:assembly_id> <mmcif_iims:type>ARRAY</mmcif_iims:type> <mmcif_iims:symmetry_type>helical</mmcif_iims:symmetry_type> <mmcif_iims:name>helical crystal</mmcif_iims:name> </mmcif_iims:em_entity_assembly> </mmcif_iims:em_entity_assemblyCategory> This data item is a pointer to attribute id in category em_assembly in the em_assembly category. Additional details about the component. The Gene Ontology (GO) identifier for the component. The GO id is the appropriate identifier used by the Gene Ontology Consortium. Reference: Nature Genetics vol 25:25-29 (2000). GO:0005876 GO:0015630 The InterPro (IPR) identifier for the component. The IPR id is the appropriate identifier used by the Interpro Resource. Reference: Nucleic Acid Research vol 29(1):37-40(2001). 001304 002353 The name of the component of the observed assembly. The cell from which the component was obtained. CHO HELA 3T3 The cellular location of the component. cytoplasm endoplasmic reticulum plasma membrane A flag to indicate whether the component is engineered. The expression system used to produce the component. eschericia coli saccharomyces cerevisiae The plasmid used in the expression system used to produce the component. pBR322 pMB9 The organelle from which the component was obtained. golgi mitochondrion cytoskeleton The common name of the species of the natural organism from which the component was obtained. The species of the natural organism from which the component was obtained. The strain of the natural organism from which the component was obtained, if relevant. DH5a BMH 71-18 The tissue of the natural organism from which the component was obtained. heart liver eye lens The type of symmetry of the assembly, component or superstructure.