Compilation of small ribosomal subunit RNA sequences

1 downloads 0 Views 18MB Size Report
or have become accessible in sequence libraries, since the last compilation (271), plus 8 ...... in both libraries but there can be a delay before a sequence submitted to one library arrives in the other one. ...... DmU*~A1P3O. CU. AA. GAC C.
.::j 1990 Oxford University Press

Nucleic Acids Research, Vol. 18, Supplement 2237

Compilation of small ribosomal subunit RNA

sequences

Jean-Marc Neefs, Yves Van de Peer, Lydia Hendriks and Rupert De Wachter* Departement Biochemie, Universiteit Antwerpen, UIA, Universiteitsplein 1, B-2610 Antwerp, Belgium

INTRODUCTION Table 1 lists 275 small ribosomal subunit RNA (further abbreviated as srRNA) sequences (references 1-270) that have been published, or submitted to the EMBL or GenBank nucleotide sequence libraries, to our knowledge. The previous compilation (271) listed 106 srRNA sequences. There is a tendency towards publication of partial, rather than complete sequences. This is a consequence of the availability of new techniques for sequencing or DNA amplification, that allow to collect results faster, but do not allow to determine the complete sequence up to the termini. One such method uses reverse transcription of the RNA with primers complementary to universally conserved sequence areas (272), in which case the sequence adjacent to the 3'-terminus cannot be determined. In another approach (27), the rDNA is amplified by the polymerase chain reaction (273) rather than by cloning. Since the primers for the PCR bind to conserved sequences close to the termini but within the boundaries of the srRNA gene, neither of the terminal sequences is comprised in the analysis. Both these methods, however, can yield a continuous sequence spanning more than 90% of the molecule, provided a sufficient number of primers is used. Some authors (e.g. reference 274) use a limited set of primers, in which case a discontinuous set of partial sequences is obtained, which can nevertheless be aligned with complete sequences from other species and used for phylogenetic studies. The set of 270 different srRNA sequences listed in Table 1 comprises complete sequences and continuous partial sequences, but no discontinuous partial sequences. In order to limit the space needed for the alignment, we have restricted it to 60 sequences. This comprises all the complete sequences that were published, or have become accessible in sequence libraries, since the last compilation (271), plus 8 previously listed sequences added as references in order to allow a comparison of the present alignment and secondary structure scheme with the previous one.

SEQUENCE ALIGNMENT The 60 sequences are listed in 5 groups, consisting of eukaryotic (cytoplasmic) -, archaebacterial -, eubacterial -, plastidial -and mitochondrial srRNAs. Each group comprises a sequence already aligned in the previous compilation, viz. Homo sapiens for the eukaryotic cytoplasmic srRNAs, Halobacterium cutirubrum for the archaebacterial srRNAs, Escherichia coli for the eubacterial srRNAs, Zea mays for the plastid srRNAs. Because of the extreme variability in length and secondary structure of mitochondrial srRNA sequences, an animal -, a plant -, a fungal -and a flagellate sequence were added as references in this group. The species chosen are H. sapiens, Glycine max, *

To whom

correspondence

should be addressed

Saccharomyces cerevisiae and Leishmania tarentolae. The sequences are identified on each alignment page by a number corresponding with that in Table 1 and with the literature reference, and by the initials of the species name. Alignment positions are numbered at the top and bottom of each page. In addition, E. coli srRNA nucleotide positions counting from its 5'-terminus are indicated above the eubacterial sequences. Regardless of the analytical method used, all sequences are listed using the ribonucleotide symbols U, C, A, and G. Posttranscriptional modifications are known for few sequences and not indicated in the alignment, but if data are available this is mentioned by a footnote in the last column of Table 1. The symbol X is used for unidentified and the symbols Y (pyrimidine nucleotide) and R (purine nucleotide) for incompletely identified nucleotides. These symbols point to uncertainty in the identification of a nucleotide in the case of analyses performed on cloned or amplified DNA, which examine the structure of a single gene. In the case of sequencing by reverse transcription of srRNA, however, they can point either to analytical uncertainty or to sequence heterogeneity, since the srRNA used as template may be a mixture of different sequences. Gaps introduced in order to optimize the alignment are filled with hyphens. On the contrary, dots are used for an interruption of the sequence due to partial sequencing. Hence, a sequence interrupted by 10 dots means that partial sequences are known to the left and to the right of the interruption, but that the length of the separating stretch is unknown. An interruption by 10 X's means that exactly 10 nucleotides are present, but their identity is unknown. A sequence starting or ending with dots means that sequencing has not reached the terminus. Lower case characters at termini are used to indicate length heterogeneity of the RNA molecules. It should be noted that the alignment of the sequences for optimal similarity is straightforward in areas of relatively conserved primary and secondary structure, but is much more arbitrary in the structurally more variable areas. Alignment rests mostly on the periodic occurrence of conserved sequence motifs and on the observation of compensating substitutions in complementary areas of the secondary structure (see below). In -,

the variable areas, conserved sequence motifs are rarer and at the same time the sequences differ in length. Whether compensating substitutions are seen or not then depends on how the gaps are placed, which makes their observation less

meaningful. Sequence segments presumedly involved in base pairing according to the secondary structure model described below are indicated by shading superimposed on the alignment. Shaded

2238 Nucleic Acids Research, Vol. 18, Supplement

V7 41

40

39 42

V6

P41-3

P35-1

V8

18 P17-1

V3

48

V9

17

vi 121

9

11

V2

10

Fig. 1. Secondary structure model for prokaryotic srRNAs. The 5'-terminus is symbolized by a filled circle and the 3'-terminus by an arrowhead. Helices are numbered in the order of occurrence from 5'- to 3'-terminus. Helices bearing a single number are common to the prokaryotic and eukaryotic (Fig. 2) models. A composite number preceded by P points to a prokaryote-specific helix. Relatively conserved areas are drawn in bold lines, areas of sequence-and length variability in thin lines. Eight variable areas, numbered VI to V9, are distinguished, V4 being absent in prokaryotic srRNAs. Helices drawn in broken lines are present in a small number of known structures only. Archaebacterial sequences follow the prokaryotic pattern except for helix 35, which is unbranched as in eukaryotes.

Nucleic Acids Research, Vol. 18, Supplement 2239 areas corresponding to complementary strands of helices 1 to 48 of the secondary structure model are numbered 1 and 1' to 48 and 48'. Helix numbers are listed twice, the uppermost row applying to the eukaryotic sequences, and the lowermost row to the bacterial -, plastidial -, and mitochondrial sequences. Internal loops and bulges in a helix are indicated by interruption of the shading. Bases suspected to belong to pairs other than the WatsonCrick pairs G * C, A * U, or the wobble pair G * U, because they are intercalated between Watson-Crick and/or wobble pairs, are put in parentheses.

SECONDARY STRUCTURE MODEL The secondary structure models adopted for indication of the double stranded areas in the alignment are shown in Fig. 1 and 2. The prokaryotic model of Fig. 1 applies to archaebacterial, eubacterial, plastidial, and mitochondrial srRNAs. The model of Fig. 2 applies to eukaryotic srRNAs. Helices common to both models, further called universal helices, are numbered 1 to 48, in the order of occurrence of the 5'-proximal strand when the sequence is scanned from 5' to 3'-end. Helix numbers change

V7 40

41

39

35 E21-9 II

E43-1

V5 27

44

E21-7

V8

V4

E21-1 48

V3

V9

E21-4 14

vi 121

11

9

V2 E10-2

E10-1

Fig. 2. Secondary structure model for eukaryotic srRNAs. Symbols are as in Fig. 1. Helices bearing a composite number preceded by E are eukaryote-specific. Variable area V6 is missing in eukaryotic srRNAs.

2240 Nucleic Acids Research, Vol. 18, Supplement at branching points and at pseudoknot loops, not at internal- and bulge loops. Helices specific to the prokaryotic model are numbered Pa-b, where a is the number of the preceding universal helix and b is a serial number. Helices specific to the eukaryotic model are similarly numbered Ea-b. Structurally conserved areas are drawn in bold lines, whereas structurally variable areas, labeled VI to V9, are drawn in thin lines. Helices that are present in a limited number of species are drawn in broken lines. Such is the case for e.g. helices P41-1 to P41-3, specific for plant mitochondrial srRNAs, or helix E10-3, hitherto found only in Euglena gracilis cytoplasmic srRNA. Conversely, helices can be absent in certain srRNAs. As an example, the diplomonad Giardia kamblia misses helices E21-1 to E21-4. Even the 'universal' helices do not occur in all srRNAs, the exceptions being found among the mitochondrial srRNAs. Those from animal mitochondria retain only 36 of the universal helices, with most of the missing ones belonging to variable areas. In flagellate mitochondrial srRNAs the structure retains only 25 universal helices, with the entire area consisting of helices 31 to 45 missing. The Fig. 1 model should therefore be regarded, not so much as a general prokaryotic model, but rather as a model for eubacterial and plastidial srRNA secondary structure, from which mitochondrial srRNA structures can be derived by addition (in the case of plant mitochondria) or subtraction of sets of helices. As for the archaebacterial srRNAs, these differ from the eubacterial ones only by lacking the branching point leading to helices P35-1 and P35-2. The models of Fig. 1 and 2 concur with those proposed by Gutell et al. (275) as far as the conserved areas are concerned, and for a fraction of the variable areas. However, we propose a structure for a long insertion in area V7 found in plant mitochondrial srRNAs, and for area V4 in eukaryotic srRNAs, both left undefined in (275). The latter structure is different from that adopted in the previous compilation (271) and comprises a pseudoknot. In addition, for areas V2 and V3 of the eukaryotic model, we propose a base pairing scheme slightly different from that of Gutell et al. (275). Although most of the structures that we propose in the variable areas are supported by observation of compensating substitutions, (unpublished) they should nevertheless be regarded as tentative in view of the difficulties connected with alignment in these areas explained above.

AVAILABILITY OF THE DATA Of the 270 different srRNA sequences listed in Table 1, 197 were in our computer file at the time of writing, in aligned form and with delimitation of secondary structure elements. The remaining ones, which are not listed in the papers reporting them but have been submitted to GenBank, have only recently become available in the file server of this library. They will be aligned as soon as possible. The sequences will be available on floppy disks, readable on microcomputers operating under MS-DOS, in the following three formats: 1) In the form of an alignment with indication of secondary structure elements. Aligned sequences will be listed on 42 pages each containing 100 alignment positions of all the sequences. This format is most useful for those wishing to produce a hardcopy of the complete alignment, rather than the sample of 60 sequences listed here. From this file, the complete alignment can be printed using a wide carriage printer, condensed print and reduced line spacing. Note, however, that this takes about 252 pages of 15 inch wide printer paper.

2) The sequences, listed one by one, without indication of secondary structure elements, but interspersed with the gaps required for alignment, i.e. with homologous nucleotides in the same position in each sequence. 3) The sequences, listed one by one, written continuously without gaps or secondary structure-describing symbols. The number of formatted floppy disks of different types that should be sent in order to obtain each of these files is listed in Table 2. In addition, all these files can be obtained on a TK50 tape, suitable for a MicroVax computer operating under VMS 5.0 or

higher. ACKNOWLEDGEMENTS Our work was supported in part by the Fund for Medical Scientific Research and in part by the Incentive Program for Fundamental Research in the Life Sciences of the Belgian Office for Science Policy Programming (grant BIO/03). J.N. and Y.V.d.P. are holders of a scholarship from the Institute for Scientific Research in Agriculture and Industry. We thank A. Wilmotte, A. Goris, and R. De Baere for their assistance in the correction of the computer files.

REFERENCES 1. McCallum,F.S. and Maden,B.E.H. (1985) Biochem. J., 232, 725-733. 2. Torczynski,R.M., Fuke,M. and Bollon,A.P. (1985) DNA , 4, 283-291. 3. Gonzales,I.L. and Schmickel,R.D. (1986) Am. J. Hum. Genet. 38, 419-427. 4. Raynal,F., Michot,B. and Bachellerie,J.P. (1984) FFBSleu., 167, 263-268. 5. Torczynski,R., Bollon,A.P. and Fuke,M. (1983) Nucleic Acids Res., 11, 4879-4890. 6. Chan,Y.L., Gutell,R., Noller,H.F. and Wool,I.G. (1984) J. Biol. Chem., 259, 224-230. 7. Rairkar,A., Rubino,H.M. and Lockard,R.E. (1988) Nucleic Acids Res., 16, 3113. 8. Salim,M. and Maden,B.E.H. (1981) Nature, 291, 205-208. 9. same reference as 1. 10. Nelles,L., Fang,B.L., Volckaert,G., Vandenberghe,A. and De Wachter,R. (1984) Nucleic Acids Res., 12, 8749-8768. 11. Hendriks,L., Van Broeckhoven,C., Vandenberghe,A., Van de Peer,Y. and De Wachter,R. (1988) Eur. J. Biochem., 177, 15-20. 12. Hendriks,L., De Baere,R., Van Broeckhoven,C. and De Wachter,R. (1988) FEBS Let., 232, 115-120. 13. Tautz,D., Hancock,J.M., Webb,D.A., Tautz,C. and Dover,G.A. (1988) Mol. BioL Evol., 5, 366-376. 14. Ellis,R.E., Sulston,J.E. and Coulson,A.R. (1986) Nucleic Acids Res., 14, 2345-2364. 15. Takaiwa,F., Oono,K. and Sugiura,M. (1984) Nucleic Acids Res., 12, 5441-5448. 16. Kiss,T., SzkukAlek,A. and Solymosy,F. (1989) Nucleic Acids Res., 17, 2127. 17. Messing,J., Carlson,J., Hagen,G., Rubenstein,I. and Oleson,A. (1984) DNA, 3, 31-40. 18. Eckenrode,V.K., Arnold,J. and Meagher,R.B. (1985) J. Mol. Evol., 21, 259-269. 19. Unfried,I., Stocker,U. and Gruendler,P. (1989) Nucleic Acids Res., 17, 7513. 20. Nairn,C.J. and Ferl,R.J. (1988) J. Mol Evol., 27, 133-141. 21. Gunderson,J.H., Elwood,H., Ingold,A., Kindle,K. and Sogin,M.L. (1987) Proc. Natl. Acad. Sci. USA, 84, 5823-5827. 22. Rausch,H., Larsen,N. and Schmitt,R. (1989) J. Mol. Evol., 29, 255-265. 23. Huss,V.A.R. and Sogin,M.L. (1989) Nucleic Acids Res., 17, 1255. 24. Sargent,M.,Zahn,R., Walters,B., Gupta,R. and Kaine,B. (1988) Nucleic Acids Res., 16, 4156. 25. same reference as 21. 26. Bhattacharya,D. and Druehl,L.D. (1988) J. Phycol., 24, 539-543. 27. Medlin,L., Elwood,H.J., Stickel,S. and Sogin,M.L. (1988) Gene, 71, 491-499. 28. Sogin,M.L., Miotto,K. andMiller,L. (1986)NucleicAcidsRes., 14,9540. 29. Manlin,A.S., Sliyabin,K.G. and Rubtsov,P.M. (1986) Gene, 44,143-145.

Nucleic Acids Research, Vol. 18, Supplement 2241 30. Edman,J.C., Kovacs,J.A., Masur,H., Santi,D.V., Elwood,H.J. and Sogin,M.L. (1988) Nature, 334, 519-522. 31. same reference as 21. 32. Mc Carroli,R., Olsen,G.J., Stahl,Y.D., Woese,C.R. and Sogin,M.L. (1983) Biochemistry, 22, 5858-5868. 33. Johansen,T., Johansen,S. and Haugli,F.B. (1988) Curr. Genet., 14, 265-273. 34. Sogin,M.L., Swanton,M.T., Gunderson,J.H. and Elwood,H.J. (1986) J. ProtozooL, 33, 26-29. 35. Elwood,H.J., Olsen,G.J. and Sogin,M.L. (1985) Mol. Biol. Evol., 2, 399-410. 36. same reference as 35. 37. Spangler,E.A. and Blackburn,E.H. (1985) J. Biol. Cem., 260, 6334-6340. 38a. Sogin,M.L., Ingold,A., Karlok,M., Nielsen,H. and Engberg,J. (1986) EMBO J., 5, 3625-3630. 38b-46. same reference as 38a. 47. Sogin,M.L. and Elwood,H.J. (1986) J. Mol. Evol., 23, 53-60. 48. Herzog,M. and Maroteaux,L. (1986) Proc. Natl. Acad. Sci. USA, 83, 8644-8648. 49. Gunderson,J.H., McCutchan,T.F. and Sogin,M.L. (1986) J. Protozool., 33, 525-529. 50. Gunderson,J.H., Sogin,M.L., Woliett,G., Hollingdale,M., de la Cmz,V.F., Waters,A.P. and McCutchan,T.F. (1987) Science, 238, 933-937. 51. McCutchan,T.F., de la Cruz,V.F., Lal,A.A., Gunderson,J.H., Elwood,H.J. and Sogin,M.L. (1988) Mol. Biochem. Parasitol., 28, 63-68. 52. same reference as 51. 53. Waters,A.P., Unnasch,T.R., Wirth,D.F. and McCutchan,T.F. (1989) Nucleic Acids Res., 17, 1763. 54. Waters,A.P. and McCutchan,T.F. (1989) Nucleic Acids Res., 17, 2135. 55. Gunderson,J.H. and Sogin,M.L. (1986) Gene, 44, 63-70. 56. Clark,C.G. and Cross,G.A.M. (1988) MoL. Biol. Evol., 5, 512-518. 57. Sogin,M.L., Elwood,H.J. and Gunderson,J.H. (1986) Proc. Natl. Acad. Sci. USA, 83, 1383-1387. 58. Schnare,M.N., Collings,J.C. and Gray,M.W. (1986) Curr. Genet., 10, 405-410. 59. Looker,D., Miller,L.A., Elwood,H.J., Stickel,S. and Sogin,M.L. (1988) Nucleic Acids Res., 16, 7198. 60. same reference as 57. 61. Vossbrinck,C.R., Maddox,J.V., Friedman,S., DeBrunner-Vossbrinck,B.A. and Woese,C.R. (1987) Nature, 326, 411-414. 62. Sogin,M.L., Gunderson,J.H., Elwood,H.J., Alonso,R.A. and Peattie,D.A. (1989) Science, 243, 75-77. 63. Hui,I. and Dennis,P.P. (1985) J. Biol. Chem., 260, 899-906. 64. Mankin,A.S., Kagramanova,V.K., Teterina,N.L., Rubtsov,P.M., Belova,E.N., Kopylov,A.M., Baratova,L.A. and Bogdanov,A.A. (1985) Gene, 37, 181-189. 65. Gupta,R., Lanter,J.M. and Woese,C.R. (1983) Science, 221, 656-659. 66. Leffers,H. and Garrett,R.A. (1984) EMBO J., 3, 1613-1619. 67. Yang,D., Kaine,B.P. and Woese,C.R. (1985) System. Appl. Microbiol., 6, 251-256. 68. Eggen,R., Hannsen,H., Geerling,A. and deVos,W.M. (1989) Nucleic Acids Res., 17, 9469. 69. Jarsch,M. and Bock,A. (1985) System. Appl. Microbiol., 6, 54-59. 70. Lechner,K., Wich,G. and B6ck,A. (1985) System. AppL MicrobioL, 6, 157-163. 71. Oestergaard,L., Larsen,N., Leffers,H., Kjems,J. and Garrett,R.A. (1987) System. Appl. Microbiol., 9, 199-209. 72. Achenbach-Richter,L., Gupta,R., Zillig,W. and Woese,C.R. (1988) System Appl. Microbiol., 10, 231-240. 73. Ree,H.K., Cao,K., Thurlow,D.L. and Zimmermann,R.A. (1989) Can. J. Microbiol., 35, 124-133. 74. Kjens,J., Garrett,R.A. and Ansorge,W. (1987) System. Appl. Microbiol., 9, 22-28. 75. Kaine,B.P., Schurke,C.M. and Stetter,K.O. (1989) System. Appl. Microbiol., 12, 8-14. 76. Olsen,G.J., Pace,N.R., Nuell,M., Kaine,B.P., Gupta,R. and Woese,C.R. (1985) J. Mol. Evol., 22, 301-307. 77. Leinfelder,W., Jarsch,M. and Bock,A. (1985) System. Appl. Microbiol, 6, 164-170. 78. Achenbach-Richter,L., Stetter,K.O. and Woese,C.R. (1987) Nature, 327, 348-349. 79. Yang,D., Oyaizu,Y., Oyaizu,H., Olsen,G.J. and Woese,C.R. (1985) Proc. Natl. Acad. Sci. USA, 82, 4443-4447. 80. Dorsch,M., Moreno,E. and Stackebrandt,E. (1989) Nucleic Acids Res., 17, 1765. 81. Weisburg,W.G., Dobson,M.E., Samuel,J.E., Dasch,G.A., Mallavia,L.P.,

Baca,O., Mandelco,L., Sechrest,J.E., Weiss,E. and Woese,C.R. (1989) J. Bacteriol., 171, 4202-4206. 82. Stackebrandt,E., Fischer,A., Roggentn,T., Wehmeyer,U., Bomar,D. and Smida,J. (1988) Arch. Microbiol., 149, 547-556. 83-85. same reference as 81. 86. Weisburg,W.G., Woese,C.R., Dobson,M.E. and Weiss,E. (1985) Science, 230, 556-558. 87. Dewhirst,F.E., Paster,B.J. and Bright,P.L. (1989) Int. J. Syst. Baceriol., 39, 258-266. 88-94. same reference as 87. 95. Rossau,R., Heyndrickx,L. and Van Heuverswyn,H. (1988) Nucleic Acids Res., 16, 6227. 96. Toschka,H.Y., Hopfl,P., Ludwig,W., Schleifer,K.H., Ulbrich,N. and Erdmann,V.A. (1988) Nucleic Acids Res., 16, 2348. 97. same reference as 87. 98. same reference as 79. 99. same reference as 87. 100. Unterman,B.M., Baumann,P. and McLean,D.L. (1989) J. Bacteriol., 171, 2970-2974. 101. same reference as 100. 102. Woese,C.R. (1989) unpublished. 103. same reference as 81. 104. same reference as 102. 105. Brosius,J., Dull,T.J., Sleeter,D.D. and Noller,H.F. (1981) J. Mol. BioL, 148, 107-127. 106. Carbon,P., Ebe1,J.P. and Ehresmann,C. (1981) Nucleic Acids Res., 9, 2325-2333. 107. Martens,B., Spiegl,H. and Stackebrandt,E. (1987) System Appl. Microbiol., 9, 224-230. 108. same reference as 81. 109-110. same reference as 102. 111. Oyaizu,H. and Woese,C.R. (1985) System. Appl. Microbiol., 6, 257-263 112. same reference as 102. 113. same reference as 111. 114. Lau,P.P., DeBrunner-Vossbrinck,B., Dunn,B., Miotto,K., MacDonell,M.T., Rollins,D.M., Pillidge,C.J., Hespell,R.B., Colwell,R.R., Sogin,M.L. and Fox,G.E. (1987) System. Appl. Microbiol., 9, 231-238. 115. Weisburg,W.G., Tully,J.G., Rose,D.L., Petzel,J.P., Oyaizu,H., Yang,D., Mandelco,L., Sechrest,J., Lawrence,T.G., Van Etten,J., Maniloff,J. and Woese,C.R. (1989) J. Bacteriol., 171, 6455-6467. 116-123. same reference as 115. 124. Green,C.J., Stewart,G.C., Hollis,M.A., Vold,B.S. and Bott,K.F. (1985) Gene, 37, 261-266. 125-126. same reference as 115. 127. Zhao,H., Yang,D., Woese,C.R., Bryant,M.P. (1989) Int. J. Syst. Bacteiol., submitted. 128-129. same reference as 115. 130. Yang, D and Woese,C.R. (1989) System. AppL Microbiol., 12, 145- 149. 131-134. same reference as 130. 135. same reference as 115. 136. Collins,M.D., Ash,C., Farrow,J.A.E., Wallbanks,S. and Williams,A.M. (1989) J. AppL BacterioL, 67, 453-460. 137-139. samne reference as 136. 140-144. same reference as 130. 145-149. same reference as 115. 150. Iwami,M., Muto,A., Yamao,F. and Osawa,S. (1984) MoL Gen. Genet., 196, 317-322. 151-152. same reference as 115. 153. Woese,C.R. and Gutell,R.R. (1989) Proc. Natl. Acad. Sci. USA, 86, 3119-3122. 154. same reference as 115. 155. Taschke,C., Ruland,K. and Herrmann,R. (1987) Nucleic Acids Res., 15, 3918. 156-170. same reference as 115. 171. Frydenberg,J. and Christiansen,C. (1985) DNA, 4, 127-137. 172. same reference as 115. 173. Lim,P.O. and Sears,B.B. (1989) J. Bacteiol., 171, 5901-5906. 174-185. same reference as 115. 186. same reference as 136. 187. Collins,M.D., Smida,J. and Stackebrandt,E. (1989) Int. J. Syst. Bacteriol., 39, 7-9. 188. same reference as 127. 189. Suzuki,Y., Nagata,A., Ono,Y. and Yamada,T. (1988) J. BacterioL., 170, 2886-2889. 190. Edwards,U., Rogall,T., Blocker,H., Emde,M. and Bbttger,E.C. (1989) Nucleic Acids Res., 17, 7843-7853.

2242 Nucleic Acids Research, Vol. 18, Supplement 191. Collins,M.D., Dorsch,M. and Stackebrandt,E. (1989) Int. J. Syst. Bacteriol., 39, 1-6. 192-194. same reference as 191. 195. Charfreitag,O. and Stackebrandt,E. (1989) J. Gen. Microbiol., 135, 2065-2070. 196-200. same reference as 195. 201. Pernodet,J.L., Boccard,F., Alegre,M.T., Gagnat,J. and Guerineau,M. (1989) Gene, 79, 33-46. 202. Baylis,H.A. and Bibb,M.J. (1987) Nucleic Acids Res., 15, 7176. 203. Suzuki,Y. and Yamada,T. (1988) Nucleic Acids Res., 16, 370. 204. same reference as 191. 205. Collins,M.D., Smida,J., Dorsch,M. and Stackebrandt,E. (1988) Int. J. Sv'st.

Bacteriol., 38, 385-391. 206. Woese,C.R., DeBrunner-Vossbrinck,B.A., Oyaizu,H., Stackebrandt,E. and

Ludwig,W. (1985) Science, 229, 762-765. 207. Murzina,N.V., Vorozheykina,D.P. and Matvienko,N.I. (1988) Nucleic Acids

Res., 16, 8172. 208. Weisburg,W.G., Giovannoni,S.J. and Woese,C.R. (1989) System. Appl.

Microbiol., 11, 128-134. 209. Weisburg,W.G., Oyaizu,Y., Oyaizu,H. and Woese,C.R. (1985) J.

Bacteriol., 164, 230-236. 210. same reference as 209. 211. Weisburg,W.G., Hatch,T.P. and Woese,C.R. (1986) J. Bacteriol., 167, 570-574. 212. Oyaizu,H., DeBrunner-Vossbrinck,B., Mandelco,L., Studier,J.A. and

Woese,C.R. (1987) System. Appl. Microbiol., 9, 47-53. 213-214. same reference as 212. 215. Witt,D., Bergstein-Ben Dan,T. and Stackebrandt,E. (1989) Arch. Microbiol.,

152, 206-208. 216. Tomioka,N. and Sugiura,M. (1983) Mol. Gen. Genet., 191, 46-50. 217. Achenbach-Richter,L., Gupta,R., Stetter,K.O. and Woese,C.R. (1987) System. Appl. Microbiol., 9, 34-39. 218-219. same reference as 102. 220-221. same reference as 127. 222. Schwarz,Z. and Kossel,H. (1980) Nature, 283, 739-742. 223. Hiratsuka,J., Shimada,H., Whittier,R., IshibashiT., Sakamoto,M.,

224. 225. 226.

227.

Mori,M., Kondo,C., Honji,Y., Sun,C.R., Meng,B.Y., Li,Y.Q., Kanno,A., Nishizawa,Y., Hirai,A., Shinozaki,K. and Sugiura,M. (1989) Mol. Gen. Genet., 217, 185-194. Tohdoh,N. and Sugiura,M. (1982) Gene, 17, 213-218. Von Allmen,J.M. and Stutz,E. (1988) Nucleic Acids Res., 16, 1200. Ohyama,K., Fukuzawa,H., Kohchi,T., Shirai,H., Sano,T., Sano,S., Umesono,K., Shiki,Y., Takeuchi,M., Chang,Z., Aota,S.I., Inokuchi,H. and Ozeki,H. (1986) Plant. Mol. Biol. Rep., 4, 148-175. Dron,M., Rahire,M. and Rochaix,J.D. (1982) Nucleic Acids Res., 10,

7609-7620. 228. Durocher,V., Gauthier,A., Bellemare,G. and Lemieux,C. (1989) Curr.

Genet., 15, 277-282. reference as 228. Yamada,T. (1988) Nucleic Acids Res., 16, 9865. Huss,V.A.R. and Giovannoni,S.J. (1989) Nucleic Acids Res., 17, 9487. Witt,D. and Stackebrandt,E. (1988) Arch. Microbiol., 150, 244-248. Graf,L., Roux,E., Stutz,E. and Kossel,H. (1982) Nucleic Acids Res., 10, 6369-6381. 234. Eperon,I.C., Anderson,S. and Nierlich,D.P. (1980) Nature, 286, 460-467. 235. Hixson,J.E. and Brown,W.M. (1986) Mol. Biol. Evol., 3, 1-18. 236-238. same reference as 235. 239. Van Etten,R.A., Walberg,M.W. and Clayton,D.A. (1980) Cell, 22, 157-170. 240. Kobayashi,M., Seki,T., Yaginuma,K. and Koike,K. (1981) Gene, 16, 297-307. 241. Gadaleta,G., Pepe,G., De Candia,G., Quagliariello,C., Sbisa,E. and Saccone,C. (1989) J. Mol. Evol., 28, 497-516. 242. Anderson,S., de Bruijn,M.H.L., Coulson,A.R., Eperon,I.C., Sanger,F. and Young,I.G. (1982) J. Mol. Biol., 156, 683-717. 243. Nagae,Y., Fujii,H., Yoneyama,Y., Goto,Y. and Okazaki,T. (1988) Nucleic Acids Res., 16, 10363. 244. Roe,B.A., Ma,D.P., Wilson,R.K. and Wong,J.F.H. (1985) J. Biol. Chem., 260, 9759-9774. 245. Clary,D.O. and Wolstenholme,D.R. (1985) Nucleic Acids Res., 13, 4029-4045. 246. Clary,D.O. and Wolstenholme,D.R. (1987) J. Mol. Evol., 25, 116- 125. 247. Jacobs,H.T., Elliott,D.J., Math,V.B. and Farquharson,A. (1988) J. Mol. 229. 230. 231. 232. 233.

same

248.

Cantatore,P., Roberti,M., Rainaldi,G., Gadaleta,M.N. and (1989) J. Biol. Chem., 264, 10965-10975.

Biol., 202, 185-217. Saccone,C.

249. Chao,S., Sederoff,R. and Levings,C.S.III. (1984) Nucleic Acids Res., 12, 6629-6644. 250. Dale,R.M.K., Mc Clure,B.A. and Houchins,J.P. (1985) Plasmid, 13, 31-40. 251. Gwynn,B., Dewey,R.E., Sederoff,R.R., Timothy,D.H. and Levings,C.S.III. (1987) Theor. Appl. Genet., 74, 781-788. 252. Spencer,D.F., Schnare,M.N. and Gray,M.W. (1984) Proc. Natl. Acad. Sci. USA, 81, 493-497. 253. Grabau,E.A. (1985) Plant Mol. Biol., 5, 119-124. 254. Brennicke,A., Moller,S. and Blanz,P.A. (1985) Mol. Gen. Genet., 198, 404-410. 255. Boer,P.H. and Gray,M.W. (1988) Cell, 55, 399-411. 256. Sor,F. and Fukuhara,H. (1980) C. R. Acad. Sc. Paris Serie D, 29, 933 -936. 257. Li,M., Tzagoloff,A., Undebrink-Lyon,K. and Martin,N.C. (1982) J. Biol. Chem., 257, 5921-5928. 258. Huttenhofer,A., Sakai,H. and Weiss-Brummer,B. (1988) Nucleic Acids Res., 16, 8665-8674. 259. same reference as 258. 260. Dyson,N.J., Brown,T.A., Wafing,R.B. and Davies,R.W. (1989) Gene, 75, 109-118. 261. Trinkl,H., Lang,B.F. and Wolf,K. (1989) Nucleic Acids Res., 17, 6730. 262. Cummings,D.J., Domenico,J.M., Nelson,J. and Sogin,M.L. (1989) J. Mol. Evol., 28, 232-241. 263. Labriola,J., WeissI., Zapatero,J. and Suyama,Y. (1987) Curr. Genet., 11, 529-536. 264. Schnare,M.N., Heinonen,T.Y.K., Young,P.G. and Gray,M.W. (1986) J. Biol. Chem., 261, 5187-5193. 265. Seilhamer,J.J., Olsen,G.J. and Cummings,D.J. (1984) J. Biol. Chem., 259, 5167-5172. 266. same reference as 265. 267. Eperon,I.C., Janssen,J.W.G., Hoeijmakers,J.H.J. and Borst,P. (1983) Nucleic Acids Res., 11, 105 - 125. 268. Sloof,P., Van den Burg,J., Voogd,A., Benne,R., Agostinelli,M., Borst,P., Gutell,R. and Noller,H. (1985) Nucleic Acids Res., 13, 4171-4190. 269. de la Cruz,V.F., Lake,J.A., Simpson,A.M. and Simpson,L. (1985) Proc. Natl. Acad. Sci. USA, 82, 1401-1405. 270. Lake,J.A., de la Cruz,V.F., Ferreira,P.C.G., Morel,C. and Simpson,L. (1988) Proc. Natl. Acad. Sci. USA, 85, 4779-4783. 271. Dams,E., Hendriks,L., Van de Peer,Y., Neefs,J., Smits,G., Vandenbempt, I. and De Wachter,R. (1988) Nucleic Acids Res., 16, r87-rl73. 272. Lane,D.J., Pace,B., Olsen,G.J., Stahl,D.A., Sogin,M.L. and Pace,N.R. (1985) Proc. Natl. Acad. Sci. USA, 82, 6955-6959. 273. Saiki,R.K., Gelfand,D.H., Stoffel,S., Scharf,S,J., Higuchi,R., Horn,G.T., Mullis,K.B. and Erlich, H.A. (1988) Science, 239, 487-490. 274. Field,K.G., Olsen,G.J., Lane,D.J., Giovannoni,S.J., Ghiselin,M.T., Raff,E.C., Pace,N.R. and Raff,R.A. (1988) Science, 239, 748-753. 275. Gutell,R.R., Weiser,B., Woese,C.R. and Noller,H.F. (1985) Progr. Nucl. Acids Res. Mol. Biol., 32, 155-216. 276. Brosius,J., Palmer,M.L., Kennedy,P.J. and Noller,H.F. (1978) Proc. Natl. Acad. Sci. USA, 75, 4801-4805. 277. Kaestner,A. (1969) In Kaestner,A. (ed.) Lehrbuch der speziellen Zoologie. Gustav Fischer Verlag, Stuttgart, Vol. 1, pp 96-789. 278. Engler,A. (1954,1964) Syllabus der Pflanzenfamilien. GebrUder Bomtraeger, Berlin, Vol. 1 & 2. 279. Fritsch,F.E. (1972) The Structure and Reproduction of the Algae. University Press, Cambridge, Vol. 1 & 2. 280. Ainsworth,G.C., Sparrow,F.K. and Sussman,A.S. (1973) The Fungi: an Advanced Treatrise. Academic Press, New York, Vol. 4A. 281. Wetzel, A. (1969) in Kaestner,A. (ed.) Lehrbuch der speziellen Zoologie. Gustav Fischer Verlag, Stuttgart, Vol. 1, pp 21-95. 282. Corliss,J.O. (1984) BioSystems, 17, 87-126. 283. Woese,C.R. (1987) Microbiol. Rev., 61, 221-271. 284. Stackebrandt,E., Murray,R.G.E. and Truper,H.G. (1988) Int. J. Syst. Bacteriol., 38, 321-325. 285. Wayne,L.G., Brenner,D.J., Colwell,R.R., Grimont,P.A.D., Kandler, O., Krichevsky,M.I., Moore,L.H., Moore,W.E.C., Murray,R.G.E., Stackebrandt,E., Starr,M.P. and TrUper,H.G. (1987) Int. J. Syst. Bacteriol., 37, 463-464.

'14

2-

to C'11-.-mflO'.4la0r-0tfl

t-I

NifO

o00 M 00U.-rN 0.- 0 0 -0 00

XzXXzXXX

^0

0O.-O

co to C( f-coN

lU) .-Mm

.-aO- 4

00I0oO.-

.-O

00.-

O,-OOO.-

.

4,

3

-------o000 -ow0000

Q~

0OC'4M 04flN0 N1-

XXXXXX

XXX

Nucleic Acids Research, Vol. 18, Supplement 2243

XX

1@01h

0%Ov4vm

r.m0

r

-

-

--

-w--

--9

--

HH

__O4)

H)-. *-.-..

.0

00

00

-

EXZZXXX)4

000.-.-D MO N 4N N MNW -4' 4 CVC4NN NN(N NN NN .--

--

U t)--0 * HU.0 0 0 0

00

.----.- 0 0,- 0

Xzz

0004

MOM mm

N%o 0 '4t- 4OO IIrgr%Ow0

wk O

N

0 -.-

X>X ~XXXXE

--------N----__________________M_

1-1--1-I---

O1 00

.

00

0

0 muI0 -.OOWOOIJ *0 0U *00000.o.OOO -VV H 0000 .HH--.-. *0000000fa 04, HO-"4-0 -'00000000000000 -----0_, H 4....4 ~ 00 -.--.-.-.-..-.I0 . -H_ . 00 __.___ ......... '4.H'40 00>. 004)0000 V VI000 OH 00.00.000000000000 VI A 00 00 0 4 0. .0.0.0.0.0.0.0.0.0.0.0.0.0 1.414 00-r- r-4 4j j Li so $4 LI u w ut u Li LI Li LI $4 YAQ to QO.OH 154 -4X 0 01. R VI fa4 O ) 4 t0>.>. >. >4 t>0 H 0'-.44000000000-0 OOHH 0'4tJo 0 1 >.0 v to o 0 0la(a 000e 0 ` V 4,'aH 0 1. 1-. '-' H-* .H 4H '4- `4 -' - `4'-H-- 'H u 4 * A ` .4.4 -*. V4 0 4 1 HH -004 v00v 0>.) 0000 Vv vv . 4 Ao 00 0 000 .4'44, , 4,4,4, 4v > iXOOv > 0HaZ 1=VoQUVVf 000000A4.4 .04 OOOH1.a wa Vu"OO0W W4 00 v fiv 00000000 v fi O 04 4,41.40 000 4O 4v 4 h v00 °0 0 u 00 0 0 -H -H >1 0 0 >.4 Q0 Q4 O>.001.. 0

HHH

-

I~~~~~~~~~~ I

2200U> 9U

f

a:-.40Iu U I0

0a -

I

0

0

0

>J Qti

c!

X O -} _ . . . . . . . . . . . H . . . . . 0_000 -

p pe

-

-

-

-

0>.~~

-9

V

ZX

^ffiffiffi0ffiattv...... :00> 1-. 1.4.0.0.0.00 .000 14 fWo00iii0iii0ffii OUO 0 00.000 .0.0.0.0.0.0.0.0.01-.1-.ZS 4,

4,4,4,4,4,4,4,4,0

00000

$-

H0 O 04 1.4H OO 90 0 '4 0VIO 000 0 0HHHHHHHHHHHHHU H lZhZZ )04h4 4,i 4,4 ,4, Z 00'0 HHHH4040(6 '0' oO p000 0000000000UUUUUU 000000000rt 00 00 0000 4

k

0

I I

"I -i

HHvr

9

I H

U;

aF

3 8 sr CL

m.

0fi

4

H

U)

+

('M D

.-

-,

E'

.4 ) f )4

H1 -i -I

0 0I e 000

4) 0

HI, 1.4

ZEI

0

I

l

0

0)

cn

.4

V

.4.40

>*>H

0o

H

ah_ ' eo 000 ('1-I X). .e044 ID

$4 iw



@NrPk_X O .-1fl _

f 4'0 04 o-0. 4,. O ('444.4

OH

-H

LI A.

4

Io co%l_ H

+

- E

-

.


.40 HZ W L s s 1 .a 00 x 4to0 1 0 ONO !0g 0 V.400i Q t0 0 a @ t *4. 4.4H 001 440LA 014 X $ O ,0.1.. 00 0Z W4 . 40 Z 00.41'. 0 '41N0a004 0$4U .4.0 Z0000

..0

.0

H vel 0

Ui)

4

4

00 4 .00 1 0'44 0H @ ' H0 I U 0U u V.4 H -I.I

V

_. N

00I XI00 000000I

1.4

0

H

0I~Pk

,

O ..

0

0.4-40" 0 00 4) t 0 to

r.

e00 00.

0

$4 0' 0Paa_a0 NID-E' 0 -0 104.4

14.4

r.

to

tn to 4i

01-

u}

ul

rl

tn $

L 0 U) 00 4,1 0OD -..-.1.0.C-00O--I

I

ul 0

0

0

H

U oZ 0r m000 ZOO .> 0. Z

O=e

e

-.-.40

('1

ocoo + -v 0

4, 0'4 )co

0.-

0

lflZO

co o 00uo oo tn fn m

H

4.

44 VI-

VI

40

HN

0

n

-0

GI 41.4

--------ri

-'4

-0

>.z

4.41

z

0 4 00 0

*'4. 1t

U

.-

o>

u Q0.4> V 4'4 00 4r

-

u

x v) H O 4J >

'

0' 44

0 0 0000000 a0 0 0 a

0

0

8

iI

0 H

*,4

!

Ez

° es, ,,>o 4,f 4, 4, 4, 4, 4, ,4 -H 0 I o , Li z 0, uf 4,

l. 0 0 >. 0 0 -000 0 4, 4.4 >0 0. 1.4 0 0 0

a 03 fe v)

-4

M-IH4

$4 >4 0

0 .44

0

0

>4 4,4, 4, 4,

f ---.-.-.-~----

0.0

.

1. 0

NeS

41

>4

o '4

04.4 0

A;

e I~ ~ ~ ~ ~

2244 Nucleic Acids Research, Vol. 18, Supplement 11

0

0

-

4)

-

en N (I co N ai Ln V D _ O O

IV

NDeN

kD

-

.0

r- Ntn

r,

1

Un U1 0

co ON

en as tL %O 0-00 .-

0 %

Ai lh

bE-4 E-4 E-4E4E4E

E-

X41 li

-

-

I.

N

lw

w

IV IV IV

lr

lr

ML A

~

~

~

I

L

O

0

Xn

I

1 0

x

0

o 4 o 0 44044

6

I8

00~~0 tv0

$4 4 4 2OJ 2OJ 0J

4

4

-,

-1

-1

H -m -m

-1

-1

-m -m

-m -m -H

-1

-1

-m -m -H

4

-m -m

-m -H

0 0

0

0~~

4 C 4 - -

-

-

0

4 4 A 4 4 4 A

.

-- -- -- - -- - - --

-

00 t-OHHX www X x x F ss§vUUUU

I0%M i -00-4XX

O O O W O O O W O O HH O SW O W O W) O HO W O O W O W O O $4 O O H4 O O $4HH O W O HH O W O W O O HH iWO SWO O O HO WO O WO HWO 4O HH

.

AJ

n:::x

$J

i i

o

1.H -.

-H .

ON

(a N N.-

0A~

H

,1

_%

v X

uyO

_1

a-

-

o

o:N

v2 Q0

i ~~ . U~~~~~~~ -H

4.

4i 4J

4J

4J 4J 4. 4.

$4J

NL

N v N

-M

-H

P P

tl

r- oL

N r_

en r1

O 00 _ 0r N 0%- -_ 0

UI

4

N

sr o

Om 0n

rn LM

n ~ _ o

0

M

A0

en

MMMr-0%fn Utu v u

N LA MrN MM 0 r- MM M .-_L

tacuut)tS v C 4 z

-0 II

.

tn L IO

00

UCJ

0r U pi-0% .

000

r

-

-H

t)

ON Pk < z P)F EEX8X

s

0 U - IN

'
5>S>SH 000000U 11 0 0>4.1041000 a.4-.4-1.4-.4-.40 0 0 000U0 000000000000000000000000000000 J J 4 JU 4-.-.4.4..4.I4-.4-.-.0 0 000 0 0 0 0.. 0.0.0..0.0.0.0.-'.S~ 0 0 Z0 -

-

l

4SUUUU4

r.

00~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

'rn

0 14-

00.1 A4t.4j onrAl(44.' 8Ak VVV

ouuuu 0 uuuuuuuu ~~~ ~~~~~~~~~~8888ooooo0.0.%

1I

00-

1I

00-

MW

0D

1.N

0D

1.N

0D

('lcalU) O00-

14141414k

zzP4

1I

00-

1I

I

2246 Nucleic Acids Research, Vol. 18, Supplement 4) ;I

4.)4 N (O m korL 7)-tAN

oLor-0)... N

0-

CX

-ZZ-z

10 0

00W.0 0 1.17 e

0'n

0

4)

004.) 4

0.-0000

xxx

wx

----

0

0c0IV - 0 0 OLAco co coN e 0 IV0 N N

L

-

0 0

xx

0.-0.-00

.L4~A LDc4) N N w0 0

A~~

OO Nc p44) .- 0 0.-

xx

0

U)T

4

0

-

4)44 -00 >>L)0>14-

.0.00.E-O4)z.

0

0 M

4

)4)4

r

)4 )4

)4

)

F) c

HH

c

It (

4.-44)

-

C

0 0u0 00000 it)44I) x~HWwzx

0 0*-H 4)000.0 .0 4 0C z zOaOZ> > >OOo).)

ILOE-

-H-

-

0a

.0.0

-

000.-UU-.--

-

ONO

>

.-;-0-0000 x xZXX>X

~OOO

0a)~ SW)-tt -4(aIV4)4)-a--4)40) ----------. *4)4---)4)----4)-q-q 0 H0 04)40004-)-)--I.HO4 . O00 4111.01100 0 w 0 c -4 -4

A).4

0-L LLA -- N

-----.--.----

0

-

)

L -r-r o o r-( r-4) N o or- r LwinN-

.-00,>x>xxxX XP)Zxz

>

-----

00. 00

0 00 00

4.1) H4 fO4)

.H-

NO4 NL NIO .--

~~~~~0

0

t)

co LA- Lnco LAa) w lN 0-ON; t %L OlOr)L

~~~~~~~-

0.0

*

LDolO 10)01010o%O

N

H1-H 0 >

00 00-H-H-H-HI0

ma4X

0. 0 0.4.)

0 11-

4)4)4)4

)4

0.0.

-14

).0

4444

1.41144)4)).)4

4

4)

88::

-H

88

44444

1

4004)

4)

s

t

tn in in

444) 4.) 4) 4)-H tn-H VHV4 4.14414 4) 4) -H -H -H

-H00 4) -HE 4) t4) -Hr4)-Hla

0 0 4) 0 4.4) 4) 4)

0 0 0 4)

4)4)4)4)~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ '3'

4)

U

01.0~~~~~~~~~~~~~~~~~~~~~~~~0 11111111 4)>.>.>.>.>.>.0

.0

4)~~~~4

0

0'00 0 0'>.

.4 . 4-I4)0 0.0.0.0-04)0

.1111 -1V tot 0-4 114)S4)14)4)4)4) 111111 .0k4. xi E-.

4a

------.41------0

VV-----------0

-

0 .H 440 0 0 0 4)0 0 114) 4) 0

0

W

4))4)

0

)

>>.>0.>.>>.444444N 000N

.0000004I.. 0.0 0.0 0.00001

>,,--I

0

0

4C

)0000 ) .

-H-H -H0 0 -HIo000).

4uInl4)

1.1 a aa. 2 00000000C0I00 ICA ALSW

$w A

A

L

0

000

0.

0 4.)

am

0-

-H .0

00

4L4

.0 0'

1-4e $40$ OOlA0n

7asN

OLA

4)0

4444-

4.1.

CO

04)c

UNO-r-

4.)

4.)

0000

4.)4.4444

)(nl H Cn

.00: V0 -H

0-HU

-

0J4)4

4)

4

4)

C-fl

4)

co r- 4)>r- o ~Cn t- LA-ID4 N . 1LAO -M r-ILAO r0NN0. r N N 000r N H-H .- -H .. H -H -H 4) 4)) 4)) 0 toto4to H Le11011011111$ ID 01-

0 >

4).

>0 -HH

S4

4V J4).. cn -HEnE

4)4

4) 4

00u0>

.4)4)4)4 111111)

04

LA LA LA LA

O P14 -i

SW

101 4.-L

HN 4.

-H

SW 4)

NNUt

LI

ELALAE-H

4)

.-

-H4 XI X~ -H

4) 4.)4.4.1 4.14.1 mLALALALA LAU

ALALAc

.H r-

*

04)114114)*H414)44 I

4)4)000H I

4.104).-Int

0110 -H'H41. O

1

00000014

u U.o

i nL ,q

0

I.: 4)

.-.

I

4)1

Jr S

-Ir.f (

,

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~H4.)

4.4

4

H

222

4))).4i.

~HH *H4)00 -H-H H -H -U) .) 4I4) .o 4) U o0 . . > >>j .0

o4H 4)4)' 04.H4)0t 0u. -4001144-H-1mm'4Min0000.111 .H 0to).-4)H

H 4.) 4 4114444)00.0 44 4

$4 4) n

V~ 4)

044)0

)-

:

n4 C U nI 4 4>

('U Ij-H44 dA H- 0~

N

u LA N

UU U

M

0 W U

a :: 5;00-N' 4 $ .q 0A4 ON N N N

UUUU

la

0gIH

4)4)

j

U

.1

r

H4 )u VIN o- : c: o

NNNNN

U U

UU

iE

1

N

n N

V

0 r

*H-H4-'4o OV S

(a0

>>>> o )4 )4 a4 o00$ 4I04 NSWL W 4NNNN4%4Z )Z J-

U UU U) UU

,I.

I

UU I

Nucleic Acids Research, Vol. 18,

Supplement

2247

Table 2. Number of formatted disks to be sent in order to obtain a copy of the database. Disk type Size

Capacity

Format 1

5.25" 5.25"

DSDD (360 Kbytes) DSHD (1.2 Mbytes) DSDD (720 Kbytes) DSHD (1.44 Mbytes)

15 5 8 4

3.5" 3.5"

Database format (see text) Format 3 Format 2 4 2 2 1

2 1 1 1

Footnotes to Table 1 The same number, and the a) This number corresponds with the literature reference and is preceded by an asterisk if the sequence is printed in the alignment. case character, are attributed lower a different followed numbers, Identical each by on the follow or page. alignment sequence initials of the species name, precede and Tetrahymena borealis). canadensis same the that have Tetrahymena same the sequence (e.g. species, to srRNAs from related species, or from different strains of In such cases only one sequence is listed in our computer file. b) This column contains the following data, if specified by the authors: - Strain name for laboratory animals, (cultivated) variety for plants, culture collection and strain number in the case of microorganisms. - Tissue from which the DNA used for cloning or amplification was extracted in the case of differentiated organisms. - Ribosomal RNA operon to which belongs the cloned srRNA gene in the case of bacteria. to 20), 279 c) The taxonomic position is described according to the following references: 277 for the metazoa (No. 1 to 14), 278 for the higher plants (No.to15Wetzel (281) is according described 34-62) of the (No. The taxonomic 28 to protista position 33). for the algae (No. 21 to 27), 280 for the fungi (No. for the species numbered 34-47 and 49-56, but according to Corliss (282) for Prorocentrum micans (No. 48) and for Euglenozoa, Microsporida and Polymastigotes to Stackebrandt et al. (284) for the Proteobacteria, (57-62). The archaebacteria are classified according to Woese (283). The classification of the eubacteria is according no information yet on the taxonomic position We have taxa. the for to Woese remaining and (283) the for al. Firmicuta, et according (285) according to Wayne of species 218-221. Taxon designations corresponding to an established taxonomic level are followed by the abbreviation Ph. (phylum), Sph. (subphylum), Cl. (class), 0. (order). by comparison with structures from d) The srRNA termini are located experimentally (e.g. by Si nuclease mapping) by some authors, but more often deduced denotes uncertainty on the location the length mark following A is listed. variant of the the question longest of related species. In case length heterogeneity length of the sequenced area. No length the length and gives has been determined partially the that means sequence of the termini. A number enclosed in brackets is mentioned for sequences not yet accessible in sequence libraries at the time of writing. The accession number for a sequence is the same in both libraries but there can e) Accession number in the EMBL and Genbank nucleotide sequence libraries.one. other in the arrives to one submitted a library be a delay before sequence f) PCR: the DNA was amplified by the polymerase chain reaction. RT: the srRNA was sequenced by the dideoxynucleotide method using reverse transcriptase. method. In the remaining cases DNA was amplified by cloning and sequencing was performed in most cases by the dideoxynucleotide sequence. this in the be found reporting can modification paper nucleoside on data g) Complete h) Partial data on modified nucleosides are mentioned in this paper or other papers cited therein. and has been corrected in (105). i) Complete data on nucleoside modification can be found in (276), but the sequence listed there misses I innucleotide are interspersed with genes coding for the rDNA j) Chlamydomonas reinhardtii mitochondrial srRNA consists of a set of 4 discontinuous fragments, which with alignment positions 239 to 312, 911 correspond the fragments between The and interruptions RNA subunit proteins. ribosomal fragments, tRNAs, large 2811. to 2643 to 930, and k) In the mature RNA, the nucleotides in alignment positions 431 to 618 are deleted by processing.

Nucleic Acids Research, Vol. 18,

2248

Supplement

40

30

20

10

.

.

.

.

.

.

.

.

.

.

.

.

I

-

-

-

50

60

.

--c

Hs

11 Ec 12 Tm 13 Dm 16 Le 19 At

-

-

-

20 Zp 22 Vc 23 Cv 24 Ne 27 Sc 30 Pc 33 Pp 51 Pf 52 Pf 53 P1 56 Ng 59 Ld 62 Gl

63 68 72 73 75

Hc Ms Tc Ta Po

95 96 100 101 102 105 153 173 188 189 201 203 207 208

Ng Pa Ap Ap

222 223 225 228 229 230 231

Zm Os

234 241 243 247 248 253 255 256 258 260 261 262 269 270

Hs Rn Rc

Cv Ec

Mg

Oh Me Mb Sa Si Tt Dr

-

-

Gmf

Cm Ce Ce Cv

-6

Sp

P1 Gm Cr Sc Sc An

Sp

Pa G U G A U G U U A U U G U A A U U U A Lt Ls

I lu

A U

U U A A U U U A A U U U A U A A U A U C A U U A A U U G A U A A A G U G G U G

~~~~I IIIIIIIII 4u bu bu

Nucleic Acids Research, Vol. 18, Supplement 2249 70

I

.

.

.

.

80

I

.

.

.

.

.

.

.

-

.

I

.

90 .

.

.

.

I

.

.

.

.

100

I

.

.

.

.

.

.

.

.

.

.

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

120

110 .

.

-

I

.

.

.

I

.

.

.

.

.

I. . .. U- G A U- G A U - G A U - G A U - G AU U - G A U - G A U - G A U- G A U- G A U - G A U - G A U- G A U - G A U - G A U - G A U-GALiA U - G A U- G A j C - G A U

--U A C --U A C --U A C --U C C --A U U - - U A C - - U A C - - U A C - - U A C - - A A C - - U A C --A A C --U A C - - U A C --A A C - - A A C - - A A C - - U A C - - G A U --C A U

-1

Hs

Oc Ec Tm Dm Le At

Zp Vc Cv Ne Sc Pc

Pp Pf Pf P1 Ng Ld Gl

1 7 11 12 13 16 19 20 22 23 24 27 30 33 51 52 53 56 59 62

2 ,.........

- - - - - - - -

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

* -_

-

__

-

__

-

-

-

-

-~~~~~~ ~ ~ ~

- - - - - - - - - - - - - - - - - - - - - - -

-

-

A A A A A

U U U U U U C U C U

10 10

1*~

GA

U

........H

-

Ng 95 Pa 96 Ap 100 Ap 101 Cv 102 Ec 105 Mg 153 Oh 173 Me 188 Mb 189 Sa201 SI 203 Tt207 Dr 208

-_

__-

-_

_-

-_

_-

_---

-U

-CUCAU UCUCAU UCUCAU ACC U A U ACCUAU UUUCAU GCCUGC

-~~~~_____________--GC -~~~~~___________--AC .--------------AUCAUACUC--AAAAUA---A .--------------------AAAUU---A ------------- --- - - - ------- - -AAAU U U AUAA .---------------AGUAAA---AAAUUUAUAA ___A _U _- _---GGUAUAGAGUAAUAUCCCCAUUUCCCUAUACUAUGAAAAAUUU--U ------- ---- - ---- - ----------- -AUAAAAA UAU AU AU AAU- UUAAAUAAGGCGUGGCGCAAGCUCGCCUUUACUCACUUUAACCAAAGUU _-

_-

_-

_-

_-

_-

_-

_-

_-

_-

I 70

80

_-

_-

9 90

_-

_-

_-

100

2

G GA .GGAG& G GGA G A U UU G A U GA . 4 U G G A( . i U A G A GA.tY

.________________-------AAU .-----------------------AA------------------------CAA

_-

75

- - - - - - -

-

-

63 68 72 73

- - - - - -

-_

-_-

Ms Tc Ta PO

- 1.

-1-

-

:..

U - G A U-GA.tf. U - G A U . C - G A il:. U - G A U.:

U GA U G AACAUAA G AACUGA A U - G A A A AU UG A A U - G A A AAU U G A A GJU - G A .~~~~~~~~~~~~~~~~~~~~UGU XACUGAA U - G A - - - - - - - - - - - - - - A A A U U G A A U - G A - U U C U G A U - G A - - - - - - - - - - - - - - - - - - A U A C G - A A U - GA - - - - - - - U A U C A U G G A tU - G A GUUUGGAA G A JU -GAUA U -G AI UCACC G G A -_ - - - - - - - - A U UCAC G G AG&%-1JU- GAUA - - - - - - - - - - - - - - - - U U G --UUGGAG GA U-GU UUAUGGAGA U - GAUA

- - - - - - - - - - - - -- - - - - - - - - - - - - - - - - - - - - - - - -

___

-

110

U U U U U U U U U U U U

C G A ieZm 222 CGAUC Os 223 CGA Gm 225 - G A .[OCm 228 : Ce 229 - G A C G AU Ce 230 - G A Cv 231

-

G G G G G G

G G - C A I- C A -

Hs 234 Rn 241 Rc 243 Sp 247 Pl 248

G G

GGU. G G G G

G A Gm 253 G Cr 255 Sc 256 A A Sc 258 A !tG~An 260 A Sp 261 G Pa 262 A - - Lt 269 A - - Ls 270

120

2250 Nucleic Acids Research, Vol. 18, Supplement 140

130

. . . I. . -

*

~

VC

U CA A A

....

U

....C....

~

~

~

..::..:......

-..... U C A A A G - ....*C U C A A A G - 0.U*: U C A A A G - UC....... U C A A A G - U$JCX.U C A A A G - U:.vC U C A A A G - C~Q.) CU C A A A G CU C A A A G - U 8:.U

- - -

AU A U A U A U A U A U A U

U U U U U U U

l.::.

.%

........

ICC A hOC A U:.UcccAG A U..A AU GU(A A A A A A 0CC A U:4 AUG U G U A A VA)QA A AG C£C A . C A U G U G U A AG .( ...A A h,

U CA A A G - - A U U A A

-...... C1 120TpXQ.¢g4Um U 20 Zp 531Hs

180

I

**

A U.C A U G U C U A A Q V AC - - U CA A AAGG -- -- AA UU UU AA AA G.t GCC A U.C A U G U C U A A :UAC --U CA A AAG G --A U U A A ~..C.. A.. . .A.. . A .V.:.. ... .....(t2. -....---...-.. V$.. ..-

FGUC

-

170

160

150

A A G ¢CCt A U t A U G U G U C(A$ GA A. A A A .GCCA.. A U..G.C A U G U C U ...-..-.A A 6 £. $ A ".. C A U G U C U A A AVA AA 24 CU CA A A 0 CC A U GC A U G U C U A A AU A A A 7 Ne k ,, AAC..A...gi; C - - A U U A A G C C A G A U G U G U A A ..G: .....A.. 27 Sc .....,, V......C .CVQ W. * C i 30 Pc CUGAGUA.U ................C..... - - A U U A A G CC A U 4 C A U G U C U A A A U.AA. CAUUg 33Pp ..GC.A....i...-.U .GC - U* C*.tgCU. A A A G A U U A A 0C CA.. Gi A U .CAA U G G ...A..A .......) - .U.VC U C A A A 51 AVQCU 52pf U.U...C.... ......... .C - hG.. C.U C A C A G - - A U U A A 0.C.C A .. ...C A A G U G A A A G ...UVA AU - U.UC U A A A A U C A A A G - - C C U A A 0...CCA U 4:C A A A U G U A A G.AU@)A A U - u:&WA - Kg7U. U C A A G G - - A C U U A GCCCA "... A U G C C U C A Q.AAt 62 Gi t..U A C J.. CG.t.. UC - Y.B..t CCCU C A A G G - - A C G A A .C.AU... - - A U G C C A AG .

.

.~UCI~C .. ... .

U..CA..U.A.....C.

- -

- - - -

..

Pf:X. G..CAS U..CAV ..

...

.........

..

A6 GS CAUGCCUCAr ......

2 -1' 63 Hc 68 Ms

3

4-

A A A "...g :R.. GG -- -- AA CU UU AAU AA |;...................... A U:.ta..u...t ¢t.............................

A ...G..g.. A. ..C....A m.. G

:.

..

63 2c 0 ........ 75 Po ...C....U

MsI-

G

68

....

40

30

I. 95 Ng 96 Pa

A

AP

A

102 Cv

CGGCUA

Ec

A

CA

173 Oh

A

1 88

Me

A

189 Mb

A

201

Sa

I

.

.

203 Si

A

208

A

Dr

*

.

.

.A

...4

50

70

60 .

.

.

.

.

.

.

G G C C U U C A A A A A C.. ..G.C

U

C U

A A

.

.

.

.

.

.

.

.

4 A G C - U U U A C

GA

.£A

A

TI0C

A

A

A

C

GUCC~~A

ACG

A A

U

AUGAA

........... C....... G.. G.

C 4. V...O...C U A A

G

A

207 Tt

.

- - - - C U A A GU.C.8g...G A.G A.G C G G C AC..A. A U...GC.£ A A GCCGCAAC...C A A G..)...,C..G A CG AG C.G >..C. Q.. 4C..:XB....S..£ - - - - C U A A C...A...:C A V..O...C.

A

W

.

.....

A

mg

153

. . .

A

130 Ap AUG

...... .....

*............:.: :. . . . ....................

C

A A G

A

C G G

A

C G

A

G

..UC

UGC..............L

,

2 -1' -

A

A

AG

CG0AC

GC...CU.. ...........'

3

4

6

5 .e.z...@@@@@@*. *. ^........

222

Z7m

223 Os

GCGCA:XXUCBc

G

225 Gm

G .G

228 Cm 229 Ce 230 Ce 231

G

. ..C....... .C..C.. ...

Cv

U U A A U U A A C U U A U U A A U U A A ---- U U A A - - - - U U A A

-

-

-

-

...A

A

..A.U

A

U

..C

.C

G

A

A

A

A

C

A

A

A

A

C G G:

AC4:: :: U:A

A A

'.:

C b...u SX.X...A...s. A - - -.............-.U U.... U:U A fi.GU A A AC - --- U U A - C A C A U.; GC A A G C

234 Hs 241

- - - - - - - - -

Rn

243 Rc 247

Sp

248

P1

C

AC.AS...).V.4)G

CCU

253 Gm

A

255 Cr 256

- A U GC-........... -... A W...W.X AGUG ~~~~~U -A A G C A A-:::-. A U U AU CAA.... ---UGC-C A G:...(.ii .X..A... ---- U UA A C.A.CA UGC.£A A GUCG A A C G U

ACUC CC

C C 0.AGCG.s...V

A

Sc

G

A

2585Sc

G

OVUCA

260 An

U

261

Sp

G

262

Pa

U

269 Lt

U

270 Ls

U

- - G U U U A

V.AC

A V...C. G A G X X X X X X X X X X X X

............ .............. --------. ........ UCCAAGUA CAA C AA UU4C A'CGC -- ------ --- AU UUU GG AAAAC "AC A A CC G... U CA G ACUA(A) GAG AA" CAA AXU A V U:1 AUAC~VUU+ -G C GU AA- AA VC .........A..A A GAU - - - - U U U, , ., A,. ,., .,A,., 0 A UCU. ..,,,...,.,,,..,.,.,.,,.,.,,,....,.,....,.,,..,,.,,,,,,,,, A- A - - -U - A- A-UC- -............... A ............. UC -A

130

140

150

160

170

180

~.

Nucleic Acids Research, Vol. 18, Supplement 2251

190

.

.

.

I

.

.

.

.

.

I.

200 .

.

.

I

.

.

.

.

220

210

I *** I . -

.

.

.

I

.

.

.

.

I

.

.

.

.I

-

-

-

..1j.. ,^ ,.,0

_

0..W

_

U - C-(C)A-*--U--A

230 . -

.

.

- - - - - - - - - - - - - - - - - - - --

.I

.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - ---

.

-

-

.

.I

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - ---

240 .

.

- - - - - - - - - - - - - - - - - - - --

.

.I

.

.

.

.I

G G Hs G G Oc G C Ec A A Tm A C Dm A A Le A A At A U Zp U G Vc U G Cv - - - C U G Ne - - - - A A Sc - - - C A A Pc AG A G C A A Pp - - - U A U Pf - G U A A G Pf - - - U A U P1 - - U U C G Ng - C A C U G Ld - -CCC-Gl

1 7 11 12 13 16 19 20 22 23 24 27 30 33 51 52 53 56 59 62

G Hc A Ms - Tc A Ta A Po

63 68 72 73 75

G C A C G C A C UA C A U - U A C - - A C - - C A - - C G - - C U - - A C - - - C

- - - - - - - - - - - - - - - - - --

6

-_

-_ -_ -_ -_

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

-

_-

U G C G C

U C U U G

A A U A G

80

I

.

.

.

.

I

. .

-CI u UG -CLuPUG -CLuPUG -CLuPUG -UL uJCG C -CL uPUG -G(c"AA -G(c"AA -CI uPUG uJCG uJCG -U' -ULuuJCG PU-ULu CG

Ng Pa Ap Ap Cv Ec Mg Oh Me Mb Sa SI Tt Dr

95 96 100 101 102 105 153 173 188 189 201 203 207 208

Zm Os Gm Cm Ce -GCUU Ce - - C U U Cv

222 223 225 228 229 230 231

A U C C Hs - - - - Rn A G - - Rc C A - - Sp C A -- - PI G U U G Gm A A - - Cr A U A A Sc A U A A Sc U U A - An U U A G Sp U U A.A Pa

234 241 243 247 248 253 255 256 258 260 261

-

-UI

6

- - - G G - - - G G - - - G G - G C A A - G C A A

..

...

.......,.;

U. ,.,u UUUU...... U -. U A AUUfi.U.G8G --__

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

- - -

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

..i............G.. G G A G &M..GuG G C A G A A G G A A A A G A G G C ............. X X X X*X X X X X XX.X X X X X X X X X X X X X X X X X X X X X X X

. -_-__-__ _-_-___ _ _ ___-_ - - - - - - - - - - - - - - - C - - - - - - - - - - - - - - - C - - - - - - - - - - - - - - - C

.C~~~~~~~~~~~~~ -P1-....................

X X X - X X X -

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

UU: -A' -:z - ii

--

:

:

...w.. w.....

C G G

. ---------------...---...---...---A - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A.A C A A U A C A4*.OU.U ..,......M.." _ Q..A.Aik UG C A GJ.AA G U U A U U C U U U C .I U U C U G U A A.C U G G 262 -- - A .U__________________- - - - - - - - - - - - - - - - - - - - - - - - - - - - U U U U Ls 270 ......... .... u^.....

..

....

...

......

190

200

210

220

230

240

2252 Nucleic Acids Research, Vol. 18, Supplement 250

I

.

1 Hs 70c 11 Ec 12 Tm

13 16 19 20 22 23 24 27 30 33 51 52 53 56 59 62

.

.

.

260

I

.

.

.

.

.

.

.

.

270 .

.

.

.

.

I

.

.

.

.

I

280 .

.

.

.

.

.

290

I

.

.

.

.

I

.

.

.

300

I

.

.

.

.

.

.

.

.

.

.

.

CCG-C C G -

-

CCU-C_C C_C

Dm Le At Zp

-

Vc

Cv Ne Sc Pc Pp Pf Pf P1 Ng Ld GI

C C

-

-

-

-

-

_

_

_

_

_

_

-

_

_

_

_

_

___

_

_

_

_

_

G-----_____ GU

-

-

_

_

_

_

_

_

_

_

_

_- ______GU.U CA--------__________

__-_

C

_

_

_

_

_

_

_

_

U A

6' 63 68 72 73 75

95 96 100 101 102 105 153 173 188 189 201

Hc Ms Tc Ta Po C G -_

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

-_

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

-_

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

-_

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

-- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- --C------

Ng Pa Ap Ap Cv Ec Mg Oh Me Mb Sa

----

Cu

-_

_-

-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-_-

-

--

- --

-

------

A-A A ~

~

~

~

-A

_________________________________--____--__________

203 Si 207 Tt 208 Dr

A-

6'

222 223 225 228 229 230 231

234 241 243 247 248 253 255 256 258 260 261 262 269 270

Zm

Os Gm

U

-

-_

_-

_-

_-

_-

_-

_-

_-

_-

U

--

-_

_-

_-

_-

_-

_-

_-

_-

_-

__- _- _-

U- - -

-_

_-

_-

_-

_-

_-

_-

_-

_-

__- _- _-

-_

_-

_-

_-

_-

_-

_-

_-

_-

__- _- _-

Cm Ce Ce AGG .-----Cv G-.-.----_______

Hs Rn Rc

Sp P1 Gm Cr Sc Sc An

CC -_

-_

---_-

__- _- _-

>

.>........

------------------------uA,i:h-

-_

_-

_-

-_

_-

_-

_-

-_

_-

_-

-_

_-

-_

_-

_-

-_

_-

-_

_-

_-

-_

_-

-_

_-

-_

_- _-

CA ...C

.U

...... ...

...

Sp Pa G G At4. A A A U UU __________.. Lt _Ls -_

250

UUAUU U U C U C C U C U A WuV A.U AA G G

U

UAU

C

UGG G U

-_

_-

_-

--------------------------------------

-_

_-

_-

--------------------------------------

260

270

280

290

300

~U

Nucleic Acids Research, Vol. 18, Supplement 2253 310 .

.

.

.

.

I

320 .

.

*I

.

- - - - G4fAiC AG U GA AA - - - - GAX&C AG U GA AA ---1,)I~ AA ~ A AA t" A A A ---- U U(A)A A A A A A

.

.

CU CU C C C C C C

.

G G G G G

.

I

330 .

.

.

I

.

.

.

C GA AUG G

.

AU CA U AU A U A U

C GAAUGG C C G A A UGG C G A A A G G C A A A A G G

A.g.C1,Jf>GUGAAACUGCGAAUGGC*J..CAU

- ~~~~ G U G A A A C U GOC G A A UG G CtCA U - ~frW~(G)AC(~)G U GA A A C U G C GA AUG G CU CA U

-. g.U

ACUGUGAAACUGCGAAUGGCUCAU

- UWJAUACUGUGAAACUGCGAAUGG G AU At.*fiA U G U G A A A C U G C G A A U G G A U _ Al,).ACUU U A U C G A A A C U G C G G A C G G ( A U - - U UU AC AG U G A A A C U G C GA AUG G CU C A U - - - - i*,CU(i.)U. .)GA A U C U G C GA A C G G C G

-1JAL>AUG.1*JA- GAAACUGCGAACGGC*JCAU

--A.U:&U(&)UU -GAAACUGCGAACGGCU AU W.*JUiAUA*x> *x*-GAAACUGCGAACGGG CAU A UoAoA U G AU UG U GG AA G G A U - - A* G C A G G- GA.UC U G C G C A U G GCg.>.A U - - ------A ~ C G G C G GA C G G

6'-

340

I. . . . I

.

.

350

..I

.

U AAAU UAAAU U A A A U U A A A U U A U A U U A A A U U A A A U UAAAU U AAAU U A A A U U A A A U U A U A U U A U A U C A U A C U A A A A U A A A A U A A A A U A U A A U A C A U G A C A A

100

110

.

G A A C G G

G GAC G A A AC G G GAOC G G GAC G G GAOC G GA AC G GA AC G A A AC G

.

.

A G U

QG

Om 13

LAu G.,AAC CVc 22

Cv .:A U %IJA C U Ne UiA U ..iGU A C CSc

23 24 27 AAA YVC U 30 UQt1A~ ACPc A...w....C(A{.Q)AW.GCi'p 33

-Pf 52 CA :7A AA.Ng 56 AA U C

C OiC('U

A A A A A

U-:Id

59 GI 62

- - - - - - -

- - - - - - ----

.

I

130 .

.

I

-

-

-

U A A U

A A A A A A

Hc Ms Tc Ta Po

63 68 72 73 75

140 .

.

.

.

.

.

.

I

.

.

.

A A C

U A A U U U U U U U

.

-

A'

A U AC G A U A CA AC G AC G

Ii.A

-- -

A

-- -

-

A-U A--

A UX

A G U A A C A

GA AC G G.LGAG U A A CA G A AC G GGUG A G U A A C A G GAOCG GG AG U AACOG G C AC G G GUG A G U A A C A

6'

~U U AA

~,G UA UC Le 16 4,GG A A CAt 19 GA CU Zp 20

120 .

G L*AG G GGAG G G. AG G AG G AAG G AG G AGAG G GA G

G A A CG G

I

8

G GA A AG CUAG U A A C GA A C U G AG U A A C G GAC G G ACAG U A A C GA A CA G CAG U A A C G GAC G G CAGAG U A A C .. . . . . . I I. I I. . I * .I

360

......... ..

C U C G Hs 1 uG U .CC UCG0Oc 7 A1U A C CEc 11 ~~~~ ACCT 12

7

90 .

I

.....

-

-

-

- -

Ng 95 Pa 96 Ap 100

Ap 101 Cv 102 Ec 105 Mg 153 Oh 173 Me 188 Mb 189 Sa 201 SI 203 Tt 207 Dr 208

8

7 .................

G G G G G G G

A A C G G GUA G U A A C G G G....G A G U G A C G G GU A G U GACG G C G U G A C G G GX$1G C G U G A C G G A G U G A C G G GG A G U

A A A A A A A

-U------- C C A G GA G U U C C A U A A A C C G *, U A - A CACA C C C AG - A U G A - - A A G C G A A C C A (. - -- - - - - - - - - - - - A U G A C A A A C C A W.U.G A - - A :::..~G:.:::A.:A..::~:.A A G U0 CG A A C GGGU CGUA A.A£GAACGGX......................... - - -.- - - - - - - - A A G U U G C - - - U A G W.U . A C C C ..; ..C..G AUA.U.G..G.G U A AA AG.U.A A A G1, A G GA C4A GG U A A G U A A GQ A GU A UUUUUU IJ U B A U J gAX4 A A G . G U A C A G GIG A G U A U U

-- - - - - - - - - - - - - - -

..

GUG~~~~~~ACGUGUGA

1,j

A A A A A A A

CG CG C 9 C d C C d

A-A -A --

C A

A

I

-

-

-- -

- C A C Cot A A U C CC

- - -

U AGAA U A A A A U GCG. - A l,IfA)C U U U U:A U U UA A A Q.U A A G U A U G:S

- -

A C G>UAAU-U 59LdEc-AAACU ... G A A A A~ - - - - - - - - - - - - - - - - - - - - - - - - - 24 C C G .js. U A A U - U C.a 27 UCA CU UU AA CC U . 622Gl..AA ACACC - - - - - - - - - - - - - - - - - - - - - - - - - U C A C UAG Sc - ---------------------------13~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~nA A C C G 1.. CAAC ~ ~ ~ ~ ~ U~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~.................... u}U. wUAAU - U :.u. l6Le-U A C-U A C U ..A. AA........A....A AU.. .-........GA. A C-U A C U C G~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ A4JA ACCG........................A GA... 2OZp G ACCG.................A.......A........C:.. C-U A C A C#~~~~~~~~~~~~~~~~~~~~~~~~~~ A A.....U.... - - - - - - - - - - C--U U A Pf - 30l9At--U - -_ U U U A A {.^.A.( C A A AA AA E:Ci f G...G A 22V c---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~.......................................... P1 A C~~~~~~~~~~~~~~~~~U A CC U~~~~~~~~~~~~~~~ 53~~~~~~~~~~~~~~~~~~ A A U U 23Cv--U CU A A A G.;|;..as88 ACC.......CG......L1A.:.....:.....U A U AA.-..........C A A-G:.C,i.^. U.8.I..Sl .....G A A C U U A C U C:4~~~~~~~~~~~~~~~~~~~~~~~~~~~A................GA.. CCCG.........................A A U A 5624Ne--U Pf - - - - - - - - - - - - - - - - - - - - - - - - - UX .AA Q..t CU4. G A A A - - .......... A.

Mg

Oh Me Mb Sa

-

-

-

Sl

Tt Dr

-

-

-

-

-

-

-

-

G(GA

A A G CU.

- _ _ _ _ _ _ _ _ _ _ _ tG4 gAC A(A) -_ _ _(A)A _ _ _ _ A AA AA C

G A A A - -

G A A A - C~ G A -A

9

222 223 225 228 229 230 231

Zm Os Gm Cm Ce Ce Cv

t'-GA.A

C A A

'G

G A A A

".GAA

A

G G 4

-------------

91 G A A A G A A

C: -A-: A' CG::

C A A

U

A

G A A A

`

V

G A G G~~9G)A

234 Hs 241 Rn 243 Rc 247 Sp 248 P1 253 Gm 255 Cr 256 Sc 258 Sc 260 An 261 Sp 262 Pa 269 Lt 270 Ls

-

CCK J. GA A A-GCGA

15()

95 96 100 101 102 105 153 173 188 189 201 203 207 208

..

.........

-

G

A

A

A

V--

-G &--

- - - - - - - - - - u~~~..u...........w....XW^AAU- : --

- - - - - - - - - - - - - - -- - - - - - - - - - - - -

-A

-

-l

lo 38(

A

C A A A A AAU ~CU

....CU A A A

UA

-

-

...^ U

............t...-..

- - - - - - U.u.c....1CU A A A U -A U ,..()UUAU------ ... - - - --AUAU -

-- - - - - - - - - - - - - --

- - - - -~~~~~~~~~~~~... -u. -" ........

3j90

400

410

420

~.U ~.

Nucleic Acids Research, Vol. 18, Supplement 2255

430

.

.

.

.

.

I

.

.

.

.

I

.

.

440

.

.

I

460

450 .

.

.

.

I

I

.

.

.

.

.

.

.

.

.

I

470 .

.

.

.

I

.

.

.

.

I

480 .

.

.

-C G A -

.

.

.

-

I -

Hs

7 Ec 11 Tm 12 Dm 13 --Le 16 - - At 19 Zp 20 - -22 Vc -Cv 23 - - Ne 24 - - Sc 27 30 Pc Pp 33 Pf 51 Pf 52 - - P1 53 - - A Ng 56 - - A Ld 59 --Gl 62 Oc

- --

-

- - C A C A:.A. - - - - - U A AA} - - - - U - - - - -

C A C A

-

-

u

C. Q

-

-

-

- _

-

-

-

-

-

-

iX

FAAA ..

c

-

C

A A A CQ~ --C A-u U1I.A

_ - U GA A A &

A - - -- U UU1.Q U3(C)ft - A AC . U

A A

-

A U

- - -

iUaC U G .i. G C _____.

A A G G ..C%.G G C G C

-..

.. ........

.

-

A AC- - - C CAA... AU.*.. ....>

~

I

-

- A - A

C:bI.. A A G~A

.

-

-

-

-

-

-

......... ..............

_9,0

C U A A A U A AU ACO C IG C U A A :-. G C U A A U(UC C C C

A U A

---

-

10 -

ElO-1

G G A ---

A C

A U A - - - G

A U A - - A U A - --

,U CU C.A U U

U AC G G A A U . U C A U U A C A AG C*CU G G A A C -

Hc Ms - - - Tc -- - Ta --Po - - -

U G G A A U -

- - -

-

-

-

63 68 72 73 75

.. ... ..... ..

170

180

..................

.*I**

.AAUCACc u

C GG A - - - - - - - - - - C C UGAA U G .-. G A A A - - - - - - - - - AUAG C A A - - - - - - - - - A U A - - - C G ..*.:. A C G - - - - - - - - - A U A - - - A C g G C A A - - - - - - - - - - A A AU U A A A U A GC A U AGA A U A - - - A. ACAA*A4+.t C A UjG CU U U U A U A C G U - A.. A U(A)C - - - - - - - C C A C G G . A U A --- G A W()C A U A --- C U C G C U U--------A U(A)C C G G A U A - - - C U QG C C..:U C G C - - - - - - - A U C C C C C A U G - - - U . AC * C U U G - - - A U(A). .U G A U G - -- U A U U U C G U G - - - -

C U A :...G C U A t K S C U A *.. C U A . . .G. *.SC U A QA¢ CU A A U

git(C:)i.U

C U A U C U A U CUA U.' C U A :(C)U A .G C U A

190

A U A A U A A U A

- - --

9,

..

G C U A A

1j()

U(k)CCC

)C U A A U(A)CC(.

..

A A

...

..

...........

:G CU A AU(

U ...FA U ...i4 A U A U A UA U

A A A A A

.G ....-...

_

A

A

G U A

- -

-

-

G

G U A

- -

-

-

G

-

-

)CA

U A

.A

U A

-

.-

- -

- - - - -- -

- -

- - ---

-

- - - - -

- - - - - - -

Ng 95 Pa 96 Ap 100 Ap 101 Cv 102 Ec 105 Mg 153 Oh 173 Me 188 Mb 189 Sa 201 S1 203 Tt 207 Dr 208

10

..........; ..:...,

C U A A

-

- - -- -

- - -

%

*.-

CU

A

G G G

............................. A exG. A~~~~~~~~~~~~~~.

GU A

A A A A A A A

G G G G G G G

G G G G G G A

-

-

-

- - -

- - - - -

-

-

.

- - - -

- -

- - - - - - - - - - - - - - - -- - - - - - -- -- - -- - -- - - - - - - - -- - - - --- -- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A - - - - - - - - - - - - - - - - - - - - - - - -

.

-

_ . -_-_-_ _ _ _ -_-_-__ _ _ -_-_ .

- - - GC..U A A A A

-

Zm Os Gm Cm Ce Ce Cv

222 223 225 228 229 230 231

Hs 234 Rn 241 Rc 243 Sp 247 Pl 248 Gm 253 Cr 255

U A __--U::iAUA.U:.:.AA*A AAA U A A-A_ U A -~~~~~~~ A UAAA A -c - _n26 --UUAIJUA AA AA AU _58 A A A A G A A A---...................An 260

G U A A U A A A----

-

41ju

GA.AU..A.GUUA--~~~~~~~~~~--Lt 269 ~...U... .-.s2.0............. ::.Q."A A --.fi..

440

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

4!)U

460

470

480

Ls 270

2256 Nucleic Acids Research, Vol. 18, Supplement 490

I

.

1 Hs

7Oc 11 Ec 12 Tm 13 Dn 16 Le 19 At 20 Zp 22 Vc 23 Cv 24 Ne 27 Sc 30 Pc 33 Pp 51 Pf 52 Pf 53 P1 56 Ng 59 Ld 62 Gl

.

.

.

.

510

500

I

.

.

.

.

I

.

.

.

.

.

.

.

.

I

520 .

.

.

I

.

.

.

.

.

.

.

.

.

.

.

.

I

540

.

.

.

.

I

.

.

.

.

I

- U G C -A U GAU G U G U G.3 G A G,XU - U G - - A U ACC G C G C - - U U U C U G . A)A -C GC --u u GAA G A C O--UUW u UUA A - - - - - - - G G A - uGO - -AU ~AU - - - - - - - G G A ~A-C GO - -A U %A~ A - CG C - - A U C C A . G A - - - - - C U G AQGA -C GU - -A U UA A - - - - - C U Gi - - - - - C U Gi GAAQG A- C GU - -A U'LiA" .U~~~~~~~~~~~ A U U- - - - - - C G G O'..-A; A CC G U - _- _A - - - - U A U . _G_U _A_A G A ... i;:

__________--------

CX

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -_-_

_-_

_-_

_-

_

_-

_

_-

_

_-

-- -- ---

- - - - - - -

--

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

-_

-- -- -- --

- - -

-

- -

-

- - -

-

-

---

-

W-

*..

.u u It

-- -- -- -- - --

U ~'UA~

A U UAAuA. C U G

A GC-

A

VU A

A

... .....

... .... ..

U

G U G U

-

-A UUA -AUUUA

4) ~~-GAGGUAA Gu --A

C0A:$

-

E1-2

E1O-1 ' 63 68 72 73 75

530 .

Hc Ms Tc Ta Po

95 Ng 96 Pa 100 Ap 101 Ap 102 Cv 105 Ec 153 Mg 173 188 189 201

Oh Me Mb Sa

203 S1 207 Tt 208 Dr

222 223 225 228 229 230 231

Zm Os Gm Cm Ce Ce Cv

234 241 243 247 248 253 255 256 258 260 261 262 269 270

Hs Rn Rc

Sp

P1 Gm Cr Sc Sc An

Sp Pa Lt Ls

490

500

510

520

530

540

Nucleic Acids Research, Vol. 18, Supplement 2257 560

550

~

~

~

~

570

..

~

590

580

...

..

..

..

.

..

600

..AA....G .. ..

...~ .#. _________________________C__________________---------------........-..... 19...... ...

..

..

..

..

....

..

..

..

..E11

... C....C... ..A..... ..L e..1 ...

..

.....

. .. p. .2

.. .. .. U..

C G Q&C C

CU

UC---G A-------------------CCAu C

AAAG-~~

AUAG

,R

U C

-----

-

-----

G C

AAAG-~GC

A

GG

CA-

~UGACC

---G

Of5 T1 GU A G A G G A C 12 5 UCC G~~~~~~A~~~Ait1

R

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

P153

AG4

~~A U

-

-

-

-

-

-

-

-

-

d rxt- U C U G A U

-

A

U

L

A U

t

E10-2' -

-

A ~UGA(

ACU

E10-2

-

Pc

GA CCAA

-G AG-A -A-A

-

C u u

-

G A UC A

-AA-A

--------

-------

u CGuA-

--G

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

- -

Hc Ms Tc Ta Po

Ng 95

-_-_-_-_____----_________---------------------------------------------------------------------- - - - -- -- -- -- -- -- - -- -- -- - - - - -- -- -- - -- - - - --

-_____-----_ -__ ____ _______- -____ __ __ __ -_ -_ -_ ____ -__ __

Pa 96 Ap 100 Ap 101

Ec -_ Mg O ---------O-- -- -- -- -- -- -- -- -- -- --- - ---- ----Me - - - - - - - - - - - - - -- - - - -- - - - - - - -Mb - - Sa

- -- - - - - - - - - -- -- -- - -- - -- -- -- -- - -- - -- - -- - -- - - - - - - - - - - - -- - - - - - - - - -

-_

-

-

~

-

-

-

-

-

-

---- - --

-~

-

-

-

-

-

-- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -- - - - -- - -

-

-

570

580

590

600

189 20 1

222 223 225 228 229 230 231

Hs 234 Rn 241 Rc 243 Sp 247

Pl

248

Sp

256 258 260 261 262 269 270

Pa Lt Ls

560

717 I188

Tt 207 Dr 208

Gm Cr Sc Sc An

-

105 1 53

SI 203

Zm Os Gm Cm Ce Ce Cv

-

550

63 68 72 73 75

253 255

2258 Nucleic Acids Research, Vol. 18, Supplement .

1 Hs 70c 11 Ec 12 Tm 13 Dm 16 Le 19 At 20 Zp 22 Vc 23 Cv 24 Ne 27 Sc 30 Pc 33 Pp 51 Pf 52 Pf 53 P1 56 Ng 59 Ld 62 GI

.

.

610

I

.

.

.

.

I

.

.

.

.

I

620 .

I

630 .

.

.

.

I

.

.

.

I

.

~.

640

.

.

.

I

.

.

.

.

.

I

650 .

.

.

.

I

.

V GAA~*JA-C-G G C C G A ~~JAC U ~ G C C G AI iU A

A C A U G C A G A A C G --GA G A CG A U G A A % ~~~~~-AX.---------CUCU(CG--GAA A -----------------CUC A C G --A A: C A~-----------------C -------

-------------C U. C G C G --GA -A C UA C AAC A _-A---cA~AA---------...4AA A U A A G AA A ~~VAVU GOA -CGA A UA A-AA A--G -------------C G -t~~~~~~~C C A G C GA A----U CA GAAA

A

-------A-----G

A|

- 10' -

E1-2

.

I

660 .

.

.

.I

.

.

.I

.

i---CCUCGA----G '----CAUCGU----G. ----CUCUGU----G ,---- U U CU U'---CACGGCG---A U t--UUGC U GU-----CU G C G U -C4 '----AAUAG---: --C UGC G U-C--UUUGCUU ,----CUUGUU----e *--C UUGUU-A----UGA------*----GA-U---

A UGA------AA---CUUCACG--AA..

A

.

----CUUCGU----Q '----UUCGC----A i---CAUCGU----Q --- C U U G U----

___U_____G-C---CG A ~G A&*~&A-----A C G C U G A A

.

11

..............

63 Hc 68Ms 72 Tc 73 Ta 75 Po

................ . ... ....

_ g-.

C. G G

-

............ S.. -..G . A- - V..t..A..#..Z.fi0C G U- AA AA AG..G. A A A

G

G.........

U U

.GtA.- C

t: i

200

....1 95 96 100 101 102 105 153 173 188 189 201 203 207 208

Pa Ap Ap Cv Ec Mg Oh Me Mb Sa

______

S1

253 255 256 258

-

-

-

-

-

-

.. ........

C U - C

...........................

210

1.... 1.

-

,

A...X

_A_-_" ....

-

.AA......--.A.....A. ...........

Hs Rn Rc

-

-

--U C A U G UCU C(> G- - - G 4 .. C A C.... -- C - - C G A A - - - - A..G...S C A UCt - U - :-:- C G A -- - - - C G A A - -

10'

234 241 243 247 248

-

-

-G&G~~~~~~~

Tt Dr

Zm Os. Gm Cm Ce Ce

-

A-.-.-...A.C.C - A CAGAGGG A AG --G CGA A-AAAGU C -CU C UUU U C G GA -----GG -AAC C - ~~~~A ~& - GA C C U U U U G G A A ~ ~ ~ ~ A~~CC-A A A - GA C C U U C G G G --AAA-GG U U C G-C C-A A A AG-...A..G.-. A C C U U C G G G -.......... A%P.A..AA.t.M..UU-AAA...AA.X------AACU-¢---e . C VWV.1i.) X i - A A A A(SA VA-G C A A - G C A U l.4.. *' C' AX''t't'AGCGAA'X''''z'^ G - A A AG G U-U U C G -

Ng

222 223 225 228 229 230 231

CCG---CCG----

C U ----UU C U -

__

UUUA--

11

.A

.t:- A .......... ------g.~~....... _---G

AAA--

-___--G -A AA GGA-P--AAA - -A--PG.CA

-A---_---G .C A - A A A GGGA-...... GA--______A,St.....S.. ....tA ..... -_---G AAA--_---G AAA-A AA - - -G__ GAAU ACU - -A---G ii:.UGA-AA A AU-..-......... -_-__- --G AAA- -

-A---G.U

Cv

-_ -_ -_

Sp P1

Gm Cr Sc Sc 260 An 261 Sp 262 Pa 269 Lt 270 Ls

__-

___-

-----------------AC CA---C A .____________----GC CUA--AC C U A .U----------------

-

-

-

-

-

-~~~~~~~~~~~~~~~~C ------------------- - -U:.AUA-AUA AAA GA U --:A -.-. - A CU UAA---~~......UCU - - - -- A U -i -j -:J A A-I -A ------U - -U --GUU-UG A A-U-----CCC---

U'UA ..U..A.U.A.A.. U A UA.U-C.U A--_ ____u....8.a........... A--_ AA AA GAAGG--...

-_-_- -A-

____-__-_-----------.......i.... ----UUG.t..GA.B AAA. -UA A U U U A-IA:A US::UG U-

..

--

U GAAA-GUCA- GA UAU A GAU---

-- -- -- -

_ _

..u....

A.. :W..:-:

_.

6 blU

620

630

640

650

660

Nucleic Acids Research, Vol. 18, Supplement 2259 670

680

690

700

710 .

A.......AA.....CC..U..A... A.....C.A.U.U....A. ~~~~UC..............C...........

........ Z~~~A C~~~

I

720 .

.

.

.

I

........

....................

-

C~A U U C

...AC.- C... -GU U

AA1U

4"4~~~AU U C AAA*

AU C.A ......U.A..UC..A...U ~ J~ C AA U 4Uf1IB .....r

--C

.......

UQU~~....A..)U UUUA

- ---

~~1B~iJ

A-

(A)

-tCA

C

CC

t* AU....

~*

G c c

-CC -CC

()U

14Ai4

C

12

I~~~~......... U:.~

A U A U GAACCG A A C....

'40

X

-CA G U -- Q(U U .C.G..U.A.....U A~AU A ~k G GAG C-..... UUUC CU QAG AGU WCU ~C.U(U)U G- C GGAJG-~- UGGG A U....CC.C..

-CG U CU C

95 96 100 101 102 105 153 173 188 189 201 203 207 208

Ld 59 Gl 62

8'

-CU 14 G C~

Ng Pa Ap Ap

P1

C

C

- -CU CGUGC U-U

63 68 72 73 75

Pf Pf

c

*1....

Hc Ms Tc Ta Po

Pp

--A C CUAUC --A C C UA1

11X

Ng

11 12 13 16 19 20 22 23 24 27 30 33 51 52 53 56

Zp

-C CUAC A

1

7

Vc Cv Ne Sc Pc

UA

u C AAUCci(A)4~IJUu

~

-C(~~~~ 4~~~~(A)(~~~~~ (AC (aT~~~C(G~~~~UC Af&t~c (u u ~~(A)ACAA~~~~~....... it~

Hs

Oc Ec Tm Dm Le At

- -

G

Cv Ec

Mg

-- -

CA U G U()CGU UCO - G CCt~~~~~- A AGG ~~A UG GA ~).G UCC-G c4++~~~'+ ~G A...U CG1 - ~ AAC G Ut C U +Uc

Oh Me Mb Sa Sl Tt Dr

-

-

12

AU AAG~AG U ~~~~A G C ACUUAAGA

Zm 222 Os 223 Gm 225 Cm 228 Ce 229 Ce 230 Cv 231

-~~~~G :U~. Tc 72 ......Ta73 _- --- G G. Po 75

._ -

..¢..... Hc.

___

- -

- - - U AAG

G". .G G U G G A A G.CC - U GA A G C - GA 4 G A A G A A %G~ - C.C G A A G A A W G 4 ArA AG A U G A G.A).S.. - AAGG A U G ;A-.-A--"~CC .... .........

-

-.4g:Tt

Nucleic Acids Research, Vol. 18, Supplement 2263

A-..-

-

G C C A - A - A U

-

A A A U G G - - U A AC - - - U A A

-_

-_

-_

-_

-_

-_

-_ -_ -_ -_ -_

-_ -_ -_ -_ -_

-_ -_ -_ -_ -_

-_ -_ -_ -_ -_

-_ -_ -_ -_ -_

-_ -_ -_ -_ -_

-

-

-

-

-

-

-

-

-

-

-

..

.....

..

......

-

-

920

930

Rc 243

Sp 247 P1 248

...C.U .. Gm

------ - - ---- - - -

253

255 ....ScrSc 256

U-1IQ...Cr 255

-U

S~ ~.3. . . .u. .t. . .Sc 2

- - - - - - - - - - -

G-- --

-

-_

-_

G_

-

-_

-_

-_

-_

-_

-_

-_

-_

_

- - -

A.UA.

940

An26 An 260

- .. 61......... _ - - - - - - - - - - -Pa ........ 262 A.Of--

-_

-_

UW

27

Lt 269 -.U-.UA--

-Ls 270 -A .UU ..........

v-iu

Hs 234 Rn 241

950

960

2264 Nucleic Acids Research, Vol. 18, Supplement

1 Hs ~~4$~A A *4LA AU 7 Oc A 11 Ec AL Tm 12 'C4&t IGU A Le

1020

1010

1000

990

980

970

--------G A A U G A G U C C A C U

---------4.

4G A A U G AG U C C A C U --------------------------------G A A U G AG U A C A C U C ~ --------------------------------G A A U G AG U A C A C Ul C '~ -

13

-G A

A U G AG U A C A

CA

*4

--------------------------------G A A U G AG U A C A A 4UA 4L-

23 Cv

.U UUUA

--------------------------------AA UG G GA GAWC-

33 Pp

A

52 Pf 4*4AAU-G A A U G A U A G G A A A A UG G G G ~VC

59 Ld

A

U

17'

1

u** UA .......v C G # -U~ A4-U3 C4

63 Hc 68 Ms 72 Tc

73 Ta

43040 GU AAA

95 Ng

"G U AAA g X-L*~ G U AAAGC

101 Ap 102 Cv

153 Mg

1730Oh 188 Me 189 Mb

2035S1 Tt 207 Dr 208

4A

4AAAQU~-U&A

A Au1 #AAAG*4CU* ~1AAAC~-J~

-----...UG----

-----

UAAAC----------CUC ...GGUAAA -

GJAAC---U

17'

-

------------

PI17-1

P7-1 l

A

AA

18

-----------------------------------------UUU A UGAAQUVGUAAAC U--------------------------------------------UL~*i -----------------------------------------UI1 G UAAA (U-----------------------------------------UU*U 4AAA 1CUAAACUCUU*

225 Gm 228 Cm 229 Ce 230 Ce 231 Cv

-

-

-

-

-

-

$

-

-

234Hs ~ ~ ~ 241 Rn.~~~......... c ..R ...4... ....

2347 Hsp

-

2481R G U A A A 253 Gm U 255CSp --V& Sc U -UA G G 256258 P1 S ~~I .~A c ~J U

--.

-

-

-----

-

C C

C-~ A C G G A

-. A A

G A A A

-k k

A A A

Ui~A~......A.U.A.A.A.. An U CACACUAACAGCGGAAA4..A Gi 260~~~~~~~~~~~ ~..... ..

261 Sp 262 Pa 269 Lt 270 Ls

~

:.--C U U A k ~ ~ 1 ------------------------------------A UC G AA U AA -. U)U. A U A

~A U-C G.

-

I

I

970

~~~~~~I ~ 980~ ~

990

1 000

1 010

1020

.

Nucleic Acids Research, Vol. 18, Supplement 2265 1030

I

.

.

.

.

.

.

I

.

.

.

.

.

I

.

.

.

.

I

.

.

.

.

I

.

.

.

I

.

.

I

.

.

.

.

I

.

.

.

.

I

.

.

.

.

I

.

.

.

-

Oc

------------------------ ----

7 11

12 13 16 19 20 .~~~~~~Vc22 . Cv 23 ._______-.- -Ne 24 -~~~~~~Sc27 Pc 30 _--.____ . Pp 33 - - - - Pf 51 - - - - . Pf 52 - - - - - - - - - - - Pi 53 .______---Ng 56 - -. Ld 59 ------------ Gl 62

-

AA U

-

-

AA U

Hs

-

Ec Tm Dm Le .----. _______--- -At .___ -Zp

-

-

I

.

-

A U

-

-

-

iA

U AU AA U AA ...A A C - A A C - -

-

-

-

-

-

-

-

-

_-----

-

-

-

-

-

.AA U - -

-

u - -

-

-

-

-

-

-

A A A - -

-

-

-

-

-

-

C- -

A A

.

- . -_______---

-

AA U

j

.

.

1080

1070

1060

1050

1040 .

.

-

-

-

-

-

-

-

-

-

-----

18-

CUA---G U GCU

-

- - - -

C U-----

.....

4.At$iAG G-

A..

.

-

~

.

.

.

.

I

-

-

-

460

450 .

Hc 63 Ms 68 . .__________-.------------------------ Tc 72 ------------------------------------Ta 73 - - - - - - - Po 75 .

-------------------------------------

......s....S A:

.

.

.

.

I

.

.

.

.

I

.

.

.

.

C U C U C U

C U U U

AA U - - - AAU ---A A U---A A U---U A A U---U AAU ---A G U G G A A----G G A A-U C A A - - - -

Ng Pa Ap Ap Cv Ec Mg

-

-

-

-

-

-

-

-

-

-

-

-

-

Oh

-

-

Me Mb

-

Sa Si

-

-

A---_____ U C G------

Tt Dr

-

95 96 100 101 102 105 153 173 188 189 201 203 207 208

18 G A A G A A G A A G A A G A A G A A G A A G A A GAAGAA GAAGAA G A A G A A

A-- -.-_ __ A - - - - . ____ G .-.----___

Zm 222 Os 223 Gm 225 Cm 228 Ce 229 Ce 230 Cv 231

U. .--___ U. .-___ G----______ G .-.---____ -

-

--_______

.--

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

_----------- Hs 234 -------- Rn 241 _ - - - - - - - - - - - - - - - - - Rc 243

247 .~~~~~~~~~~~~~~~~~~~~~~~~~~~~Sp

Sp 247 Pl 248

A

-.3.A...R

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -.~~~~~~~~~~~~~~~~~~~~~~~~~~~~P1 248 - - -- - - - - - - - - -- - -- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -- - -- - - -- - - -

C UG A... G.C.G;o. - - - -

-! -m.Q -..(#.

iP.A U

A A A

A

U

U

A

A A

-:-:-C

-An 253 Cr 255

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

.~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L C C U Ls 270 AU .U U A 269

1030

10u40

1050

1 060

1070

1080

2266 Nucleic Acids Research, Vol. 18, Supplement

1 7 11 12 13 16 19 20 22

.

.

.

.

I

1110

1100

1090 .

.

.

.

.

.

.

.

.

I

.

.

.

.

.

.

.

.

1130

1120 .

.

.

.

.

.

.

.

I

.

.

.

.

I

.

. . .

1140

. . . .

. . . .

Hs

Oc Ec Tm Dm Le At Zp Vc 23 Cv 24 Ne 27 Sc 30 Pc 33 Pp 51 Pf 52 Pf 53 P1 56 Ng 59 Ld 62 Gl

63 Hc 68 Ms 72 Tc 73 Ta 75 Po

95 96 100 101 102 105 153 173 188 189 201 203 207 208

Ng Pa Ap Ap Cv Ec Mg Oh Me Mb Sa

S1 Tt Dr 18

222 223 225 228 229 230 231

18'

Zm Os Gm Cm Ce Ce Cv

234 Hs 241 Rn 243 Rc

- - - - - - - - - - - - - - - - - - - - -- - -- -- - -- - - - - - - - - - - - - - - - - - - -

- - - - - - - - - - - - - - - - - - - - - -

- - - - - - - - - - - - - - -

_ - - - - - - -

247 Sp

255 Cr

- - - - - - - - - - - - - - - - - - - - - -

256 Sc

A A A

261 Sp 261 SPa 262

- - - -

2

XU A A U GA C C A U A U A .." iA A U G A u U

A__________ _________

Pa

-

- - - - -- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

-

-

-

-

-

-

270 Ls

1 090

1100

1110

1120

1130

1140

-A

Nucleic Acids Research, Vol. 18, Supplement 2267

*1..

-

1150 .

I

1160 .

.

.

.

I

.

.

.

1170

.

.

-_

_-

_-

-_

_-

_-

-_

_-

_-

-_

_-

_-

-_

_-

_-

-_

_-

_-

-_

_-

_-

-_

_-

_-

-_

-

-_

.

_-

_-

_-

__-

-_

_-

-_

_-

_-

-_

_-

_-

-_

_-

_-

-_

_-

_-

-_

_-

_-

-_

_-

_-

.

.

.

.

.

.

I

1180 .

.

.

.

I

.

.

.

G G G G G G G G GG U

U U A U U U

----- -- -A G A U ----- -- -C C A U ------C C C A C A

G

1190 .

.

.

.

.

.

.

.

.

I

A A A A A A A A

- - -

470

G

U A A - - - - C A A - - - - U A A ----U A A - - - - U A A

.

490 .

.

.

.

.

.

.

C C

G A CG U U A U U G A C G U U UU G A C U G U A A A1,)IG CU G AC GG U UALU A U A~GAG U G A A C i A G -

A A A G U G A . CUU~A A A A UUGACG G U G A 18

- - - - - - - - - - - - - - - - - - - - - - - - - - -

-

-

- - -

-

-

- - -

C C C A A

A A A A A

A A A A A

G CG

.

.

Zp

Pp Pf Pf Pl

Ng Ld

Gl 62

m Q

1

G G C A A G A G G C A A G A

Hc Ms Tc Ta Po

:: G G C A A G G GG G C A A G A G G C A AG U

510

Ng Pa Ap Ap

95 96 100 101 Cv 102 Ec 105 Mg 153 Oh 173 Me 188 Mb 189 Sa 201 SI 203 Tt 207 Dr 208

A

A

---UAA - - -U AA - U A A -U AA -U

Zm 222 Os 223 Gm 225 Cm 228 Ce 229 Ce 230 Cv 231

AA

- U A A - U A A

A U C A

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - U - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Hs 234 Rn 241 Rc 243 Sp 247 P1 248 Gm 253 Cr 255 Sc 256 Sc 258 An 260 Sp 261 Pa 262 Lt 269 Ls 270

A

GA AA

-. -.

----------

-...u....X....

.A________________________________________,... U A U A A AA AA~A& Ci¢ A A: U'. A~~~~~~~~~~~~~~~ U

A

U

A

U

A

...........

tA..-s. . . A.U..U..................t

-

A U A A U-U A A U AUeW.A A U U - - - - - -

-

-

-

UU ).QM.X.....U AA U A A - - - -

-

- G A A U A A U-GA U A A U G AC AAU uUU C u A-~-. - &- - -GA -~~~~A ----A A U U U G -A UCUUAAUUUU AU U A -

-

-

-

-

-

-

-------

AUAAU-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~........ AUAAU--A-------------------------------

1150

1160

63 68 72 73 75

I.

C G G U GAA - C G G U AU_;A - -C G G U AU.AA.A. - U U C C G G U AUIU A..AA - C A U UGACGG.U AUCU...... - GA G A G A G A

A

1

7 11 12 13 16 19 20 22 23 24 27 30 33 51 52 53 56 59

19

U GA C G G U

U U U U

.

Vc Cv Ne Sc Pc

18' -

.

-AAU AA -- - U AA - - - - A A - - - - A A - - - - A A - - - - A A - - - U A A - - - U A A - - - - A A - - - - A A - - - - A A - - - - A A - - - - U A - - - - U A

AC----ACUGGACGUU

A. la U

.

Hs Oc Ec Tm :Dm Le At

500

I *

A U GGA U G A C ------A C j $_ G A C G U U ------A A i:>*.*i.....i.*. C G U G A C G U U

1

.

19

A A . A A AKA A A - A A -

480

.

GAUC C ;. A U C C A A UU C U A%. A C A A C C A AK A U C C A U C C A U C C K A U C A A U C A t A U C A A U C A A A C A A A C A, A A C A IC. A A C Ali A G C C A A ACAA - -C A

18' -

1200 .

1170

1180

1190

1 200

.

2268 Nucleic Acids Research, Vol. 18, Supplement 1210

.

.

1 Hs 7 Oc 11 Ec 12 Tm X3 VA G C 13 Dm AG C 16 Le C g C AAGO G C 19 At 20 Zp ..g.. ..........A.G. 22 Vc 1sC AGOC 23 Cv t A G C 24 Ne ACAGOC 27 Sc .B.u....4X....£ A G C 30 Pc ................... 19- AGO 33 Pp 51 Pf A GO 52 Pf .................. AGO 53 Pl s0 GO 56 Ng AAG 59 Ld 62 Gl ...................

I .~C

1230

1220 .

.

.

.

.

.

.

.

.

.

.

.

.

AGO 0 GOC~* AU U 1C AGCCGCAUU"...... A G C C G CGVA;A A U U e A CGC G C A U U Q A G C C G CGA A U U C. A CGC G C A U U. A G C C G C G . A U U " A CGC G C A U U CC A CGC G C A U U C; A G C C G C GiA A U U C A CGC G C A U U C.:t A G C C G C A U U cc AG 0 GOC BU. A G C C G CG1 A G C C GOGC UA CG CC CC GG CC G,S. AAGOC0GOC

A A A A

0 C C GGOC~ A 000 CG

A Uu uU ~ CC

.......

....

.

.

.

.

.

.

.

.

.

.

.

1260

1250 ...

1240 .

.

.

I.

A A G U U G

G G BBfl#M A*.^UAU g ~A A G U U G ...Cx.*.J ...C.A.A.U .A .X.::t.UA.UA.^>.U1J U A A G U U G CUCVCAU S.: :UA. A

A

U

U

A

U

....

A AG U U G A A G U U G e.U ..

..CU.B. .CC ...AA..Ui. AGC~UAUAUU .U A A G U U G C.£X..^...* B.xa 8;C.A A. :"C.U.UA .I .U AA AA GG UU UU GG CU~OCAAU

AGCGUAUA4JUU

A A G U U G K.B.u P0a.t .. Bfi CC CUB AAU. A4.C.UAUAU.U A AG U U G .....AU .4. UAVAV4JA CIJ-CCA AU AAP~~.C~~AUAUAU LI A .....x AA AA G G UU UU GG g.

U U t>. U U CC U U CC ..... UCC ¢... U W.AU UUM

~.

~

.

..C.... --AA

LJ~OUAAU C

GCAUACGU*JAA

AGA

U U G A A A U U G A A U U G

CU.CCx.. .AA.U A....G .U .. .U.... A~~~CAUAUAUUA CU ~~~CCAAU

~~A

.. U - gt .A. A G A .C U. XA.X'U .U gA "X AA UU GA CC UU GG C ~GXC G ASt;SX, ''CG .AC

GC A A~~~~~~ .>. A...AU. A U U CC C C G C i.'.

~ 0A G 0

.

,... -

19'

...........

.....

..............,

.............

3'

.............

...... ........

20 ...........................

...

63 68 72 73 75

.......................................................

Hc Ms Tc Ta Po

OCAUC A1gXC G A GAU0G

.4 &QCIJCt G A;U

OC G 0 0c GG 0 li:G: A A 0 Ai C C

O G 0 0 G 0 G:M 530X A A Ui...AgC

..........

.

............ ........... A

A

.G

U

.

.

XCU~~UA

UC~~~CC00 U A LIU~4~U U A U~~CC~C0 U A

IJAA.CCt G U U AU U A

....A.....U.U.A 540 560........... 550 I GAU.U.A * * * * ..I.*. **.*.I..*.*.*I.*.*..* OC G AGCG.. .G. A...U...................... 5 50 ....40 ..

"Ill

....

..... .........

C*IGGG~CCU

A

570 .

95 Ng 96 Pa 100 Ap 101 Ap 102 Cv 105 Ec 153 Mg 173 Oh 188 Me 189 Mb 201 Sa 203 SI 207 Tt

AGC AGC AGC AGC AGC AGC AGC AGC AGC AGC AGC AGC AGC AGC

208 Dr

AGCCGC AGCCGC AGCCGC AGCCGC AGCCGC AGCCGC AGUCGC AGCCGC AGCCGC AGCCGC AGCCGC AGCCGC AGCCGC AGCCGC

x G

Zm Os Gm Cm Ce Ce Cv

GU

19'

UAUAUAUAUAUAUAUAUAUAUAU AUAUA-

3'

20

.,... ;..................

CCA CA A CCA

GAAUGA GAAUGA GAAUGA GAAUGA GAAUGA GAAUGA GAAUGA

UAUA-AUAUAUAUA-

...

F U-- -- - - -- " .......a AG ¢

.. U~~~~~~~.. ....

2

2 2

Hs

cRn nRc

A

G C

A

G C

CA C CA C

A G C

G

CAA

.GG...U

AC

--

A A- --

--

A

C

AA ..AU.. ACGU-AAGAGAC~~~~~~~~~

Pls

A G

OC

C

G

C

A.A...U... A

:.I4LIU

,

....

CC..CU

A -

:C4

A G- --

GA

AOGG-A. 2

_--

GA

C G C U

....

.............

A

C G C G C

I

.

G G A A U U A G G A U U C A G G A A U C A

G C AGCCGC G a, a A A G A-.v ::A .- A G AU A AG G U G C AGCCGC ~~UA AG AC~~~A UC .. A... C:..VAVO :-" G C GGCCGC C......................................A V A AA GU AACA C ..A...G.....A.....A..G.G............C.G G G C AGCCGC Q UA G AA 1 :U A A G A C A ....A.....A..........A... C ..OU CA G C AGCCGC GG UA G U0 A t:iA C. A G...AGGA..G.AAO.1 IG CC A G C AGCCGC G.L[G.A G C AGCCGC .

.

GGAA U U A G G A A U U A 'GGA A U U A

A A G A

W

222 223 225 228 229 230 231

.

GGAAUUA GGAAUUA AGAAUUA GGAAUUA GGAAUCA GGAAUUA GGA U U U A GGAAUUA

G A A A A A A A

19

.

(A)U(C .gU U U AOG GLIG ..iAA-

A

AOCG ;:A u- .:.......G .......

..............

- U AI C-l4tgU UA -UDU U AA- U -C -U -U -U A-AL(.U}A .gA...... CG.0-. U U U A 4.UK .... .........

A UA A U-

-C

U

A U

A UA A UA

I~~~~~~~~~~~.

..

..

..

..

...

......

A U

..

0~~~~~

A

A

A

U

A

A

A

A

C

U

U

A

A

A

U

- - - -C

A

U

U

U

1250

A.G..GO 4

C U

U A U

A

-

- - -.A....-....

1260

Nucleic Acids Research, Vol. 18, Supplement 2269 1270

I

.

.

.

.

I

1280 .

.

.

.

1290

I

.

.

.

A A A A A A A A A A A A A

U U U U C C C U U U U U U

C C C C U U C C U U U U U

A A A C A C

U U U C C C

U A U U U G - C C -

.

.

.

.

.

I

.

.

.

1300 .

I

.

I.

1310

. . .

I

.I.

.

I

1320 -

I

-

... .. .. ..

AA A A G AA A A G .A A A G A G ~ A...A *..::: A C G A AA A G A A G .A* AA A A G *A A A G A. A A G

C C C C UCC CCC -

-A AAG C AA A C G C -

AAAC -

G C AA;^ A C G C

-

I..4 C-

G G G -A A A C G 4A

20

U-U-U-U-U-U-U-U-U-U-U-U-U-U-U-U-A A-

__ - -Hs 1 __ _Oc 7 Ec 11 .--Tm 12 .--Dn 13 .--Le 16 .--At 19 .--- - - - -Zp 20 - - - - -Vc 22 - -- - -Cv 23 - - - - -Ne 24 - - - - -Sc 27 _ - - - -Pc 30 _ _ _ _ -Pp 33 _ _ _ _ _Pf 51 _ _ _ - -Pf 52 - - - - -P1 53 - - - - -Ng 56 -- - - Ld 59 .Gl 62 .---

-

21 *~~~~~~~~~~. A A

- - -

A

G

G

A G

.

A.AG

A A

G U A

-

G

A

-

U A A

-

. .G

A G

U

A

G U A

590 .

I

.

.

.

.

I

600 .

.

.

.

I

.

.

.

.

I

610 .

.

.

.

I

.

.

.

.

-

-

A

A A GA

G

C

C

A

-

UAA

U

-

A

GA

~

-

CG

A -

-A A G A

A A GAGCU

CG U A

-

G U A

-

A A A

G U A G

G U

C U C A A C U C A A

- C U C A A -

--A

Hc Ms Tc Ta Po

-

A

63 68 72 73 75

Ng 95

Pa 96 Ap 100

C U C A A

C

-

G GCC ---A GA G ---AA A

A A A A

620

I -

-

UA UA C A U A

CU C A A

580

A A

CU CU C U C U

-

>Z

Ap

101

C A A E Cv 102 C A A E Ec 105 UIA A X Mg 153 C A A Oh 173 U A A Me 188 U A A . Mb 189 U A A . Sa 201 U A A Si 203 C U C A A r Tt 207 C U C A A (. Dr 208

C U C U C U C U C U C U CU C U

P21-1

21

AAG C1~~~~....G... ~ 1 U U.AA....AA.U U...-U ~Z A A G C QU CU G U.A....U.... U U U..AA..CC.C.UC.AA C~AIG-CuC AG ~- uC CO 22 U U U t AAUC ---A A G -CUCAA G U A ~GG~lUU U AA...G25 Cm 228 A ---AAG~~~~~...UGUA.W U%AA G U A G A A A U C C U A A C AG .A UGUU A4AAG U&ACUG 229 ---A A G CG UG U A G UGUUUAUUAAG U UAU%UUA A A G A UAGG- C U U A A Ce 230 ---A A G C CUUG U A G-U -(AAAG UCUCU UCA A A G A U AGG- C U U A A CCv 231

~~A~

OnC

---A A G G G~~. CA...C..G..U..A...... CG -AU-C G-----------(A T~ A A AG UCQC-CA- - - A A A A C-~ H 2534 C A aCG1~~~G. CA UGA G1 G - - -I U U A A U G G.CRn 255 --------- -Rc 243 GC U AGA U-A--AAGGAU AU-UA--UA---UAU---------AAGGAUCCGUAGA4........ ........UAU.-..c... UUA A---S --- -UAAAU ---U A A247 ------- -CAAA-------AAAAGUAUG........ U A G A - --GA--- AAAA(-A)A--SPl248 AG%G GtAAA AC ~~~~~ Pa 262 ---U~~G -UUACG)GUGA UU...%UA -.....A UA -UUAUUG.L26 -A U Q ) UAUA.......-A.........U(. s ...

1270

I 1280s

290

l

1310

I 1320

_- _ __-G-C

2270 Nucleic Acids Research, Vol. 18, Supplement 1330

.

.

1 7 11 12

13 16 19 20 22 23 24 27 30 33 51 52 53 56 59 62

Hs Oc Ec Tm Dm Le At Zp Vc Cv Ne Sc Pc Pp Pf Pf P1 Ng Ld Gl

.

-_

1340

.

.

.

_

_

.

.

_

I

.

_ _-

_

.

.

.

.

1350

.

_ _ ___--

--

.

.

.

.

.

.

.

.

.

.

u

-

-_

- - -

-

- - - - - - - - - - - - - - - - - - - - - - - - - - - -

-

_

-

_

-

-

.

§

C A E G L GU U

ua N .______-C -__-

-

-

-

_

-

_

_

_

_-

_

-

_

-

~ ~. 1360

.

.

.4

.

.

.

.

.

'b. L ..i

A

.

.

I

.

X. A

1370 .

.

i .....

1380 .

.

.

.

.

.

.

.

.

u

- U - AAA 3A(A$I C G A VU - C - AA#A- U A ~~~ 14 --.

1A3

A ~~~~~ ..... ..

->

E21-1

63 68 72 73 75

Hc Ms Tc Ta Po

630

640

650

I .I 95 96 100 101 102 105 153 173 188 189 201 203 207 208

I

Ng Pa Ap Ap Cv Ec Mg Oh Me Mb Sa

S1 Tt Dr

P21-l' 222 223 225 228 229 230 231

Zm Os Gm Cm Ce Ce Cv

-

;-

;(-

234 Hs. 241 Rn - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - 243 Rc - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - 247 Sp 248 P1 253 Gm .:X.J'kxG:t A A b. c13(cx U C4.A A - A C.CC9A.A *CA 255 Cr - - - - - - - - - - - - - - - - - - - - - - - - 256 Sc. -. A. .. ... ..

260 An 262 Pa 269 Lt 270 Ls

-

- - - - - - - - - - - - - - -

------

---

--

.

-

1 330

-

----

-.---.U- --C

AC-

--.--

1380__

I"'.'

S-0 A.-1 ?.AA C "A A1340

1350

1

360

1370

Nucleic Acids Research, Vol. 18, Supplement 2271 1400

1390

1430

1420

1410

1440

................GC A-Oc.

C G G IVWTm 12 -------C GU G.---..WA --UAU~1 ---~~~~~~~ -CUA-Le 16c~~~ ~~~-.*~~~~~A~~~)cG~~~~~--CUU~~~~~UGAG U -At------- 19 --- -

~~~~~

-

~~~~~~~ --

----------C UUUA-Z---------p 20L 1

------- -

PSc

30

L C --------------AC UU U UG G---- UAU UCWJ ~ CGC Pf U UA t~~QU ~UE&~~A A &~~C A A A U A U A U U A U A I& tt*G-CuUGUU-A A$ A Pf C .kAu u0uuuC UCUU ;Pl -UG-MUNNAAE. C --A GA GG AC -- ------------- - - Ng UA G---------------------UC GU CC C GU ---------- ---- Ld uCC ---Gi-

51 52 53 56 59

~~~~~

CUUG

UC~~~~~~-- ..A-----AG

--------

-

--

-

626

E21-1'-

E21-1

. Hc ------------------------------------------------------Ms . Tc . Ta . Po

------

-

-

-

-

-

63 68 72 73 75

. Ng 95 . Pa 96 . AP lO . Ap 1Ol . Cv 102 . Ec 1OS . mg 153 . Oh 173 -----------------------------------------------------------Me 188 . Mb 189 . Sa 201 . Si 203 . Tt 207 . Dr 208 -

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

.

-

-

-

-

-

-

-

Zm 222 .0s 223 . Gm 225 . Cm 228 . Ce 229

-

-

-

. .

Ce 230 Cv 231

. . .

-

-

.

. . . . . . . . .

-

-

-

-

-

-

-

-

-

-

Hs 234 Rn 241 Rc 243 Sp 247 P1 248 Gm 253 Cr 255 Sc 256 Sc 258 An 260 Sp 261 Pa 262 Lt 269

.Ls2~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 370 I

1 390

1400

1410

1420

IlI 1 430

I 1440

2272 Nucleic Acids Research, Vol. 18, Supplement 1450

1 7 11 12

13 16 19 20 22 23 24 27 30 33 51 52 53 56 59 62

Hs Oc Ec Tm Dm Le At Zp Vc Cv Ne Sc Pc Pp

Pf Pf PI Ng Ld Gl

~A-G C C

1480

1470

1460

I I ~

U G

::AAm

~

1490

~ ~ UUG CC U C--------~AGC C CA -

A4AC C U Ul U A (#4 4& $A A G C 4~A ~ ~ G ~ .~ ~ ~ ~ ~ ~ ~ ~ ~~~~$U ~~~~~~~~ ~~~~ ---~~~~~4~

G

UG

Ju C u u

CU A ~UGA43A U AG ~ G ~A C

~~ C ~~~~*A A A

UU CC CCCC----------

~G4

G$ UC AA CAA A U Q~~~~~AC ~U C ~U$ AAU *A AA C C

U AU U~*

~ U ~ ~ AG A~ U

A

A

AJ~ ---

A

G

UAC

Zm Os Gm

234 241 243 247 248 253 255 256 258 260 261 262 269 270

Cm Ce Ce Cv

Hs Rn Rc

Sp Pl Gm Cr Sc Sc An

-

Sp Pa Lt Ls

1450

1460

--

E21-2

Hc Ms Tc Ta 75 Po

222 223 225 228 229 230 231

--

1470

--

GC AU------------U ---:

63 68 72 73

Ng Pa Ap Ap Cv Ec Mg Oh Me Mb Sa SI Tt Dr

--

C

E21-1'

95 96 100 101 102 105 153 173 188 189 201 203 207 208

A U UA

U C-------------

C

CU

CAUGU

1500

I~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ . . .

I...I..

1480

1490

_-----1510

I

.

.

.

.

I

.

Nucleic Acids Research, Vol. 18, Supplement 2273

1520

.

.

.

.

I

.

.

.

.

I*.

.

1530

I*.

I

.

1540 .

.

.

I**.

.

I

1550 .

.

.

.

I

.

.

.

.

I

1560 .

.

.

.

I

.

.

.

-

-

U. Ui G __-__-__-__C

_

_

_

_-

-_-

-

-- Tm 12

-GGA.

-

U ---------------------------

_________________-----

--------_ _ -__ ----A A G A A

--

- --

---

- - - - - - - - - - - - - - - - - - - - - - - - - -

.

(f

--

-

-

-

.C UU

-U--

__

.--

G GGG

- - - - - - - - - - - - - - - - - - - - - - - - - - -_

_-

_-

_-

-_-

- _-

E21-2

_-

_-

_-

_-

CU

G G C C CU _-

_-

_-

1 7

-- Ec 11

C UC A

-

uccuU cUA U AA U U U::UjAU:U:JC:C C:C AJ i. u

1

I

____- - Hs --~Oc

ti~~

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - CW I

.

- - --

_-

E21-2-

Le At Zp Vc Cv Ne Sc Pc

Pp

Pf Pf P1 Ng Ld ---- GI

16 19 20 22 23 24 27 30 33 51 52 53 56 59 62

E21-3 ___

_

Hc 63 68

.Hc --___-----____________________________-----------Ms __________________________________-_------------Tc . -

-

_

. .

- - - - - --

72 _ _ _ _ _ _---- Ta 73 Po 775

- - -

. _ _- - - - Ng 95 .------ Pa 96 .------ Ap 100 -

. _- --- - Ap . - - - - - - Cv .___--- Ec . - - - - - - Mg . ~ ~ - - - - Oh Me - - - - - - - Mb .- -- -- - Sa . - - - - - - Si . - - - - - - Tt . - - - - - - Dr

. . . . . . .

-

-

-

-

-

-

101 102 105 153 173 188 189 201 203 207 208

-

-

-

-

-

-

Zm 222 Os 223 Gm 225 Cm 228 Ce 229 Ce 230 Cv 231

Hs 234 Rn 241 _-- - - -- - - - - -- Rc 243 .-c243 - - - - - _ . Sp 247 248 .~~~~~~~~~~~~~~~~~~~~~~~~~P - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -------Gm 2 53 - - - - - - - - - - - -- - - - -- - -- - - - - - - - - - - - - - - - - - - - -- - - - - - - - Cr 255

- - - ---- -- ------

----------------------- - ----------- - -- - - - ----------------------- ------- ---------- - -

SS 256 .Gm

. Sc An .5-Sp . ---- ---- -- -- - _ Pa . - - - - - - - - - - - - - - - - - - - - - - - - - Lt

- - - - - - - - - - - - - - - -- - - -- - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

- --

I ]!)-I u

I 1520

1530

I b4u

1550

1 560

258 260 261 262 269 Ls 270

2274 Nucleic Acids Research, Vol. 18, Supplement 1570 .

.

.

.

I

.

.

.

.

I

1580 .

.

1 Hs

7 11 12 13 16 19 20 22 23 24 27 30 33 51 52 53 56 59 62

.

I.

.

-_

-_

-_

-_

-_

-_

-_

-_

-_

-_

-

-

.

.

U ---U ---U - - - U ---U ---U ---U ---U - - - U

U U U U U U U U U

A G C U A G C U

.

.

- - -

-

Oc Ec Tm Dn Le At Zp Vc Cv Ne Sc Pc Pp Pf Pf P1 Ng Ld A C A A G Gl

1590

I

.

.

C C A A A A C

A G A A A A A

.

I

C C U U AC U U U U C U

.

.

.

-

-

I

1600 .

.

.

.

I

.

.

.

.

I

Hc Ms Tc Ta Po

95 96 100 101 102 105 153 173 188 189 201 203 207 208

Ng Pa Ap Ap Cv Ec

222 223 225 228 229 230 231

234 241 243 247 248 253 255 256 258 260 261 262 269 270

.

.

.

I

.

.

.

.I

1620 .

.

.

.I

.

.

.

.I

-

---UUCGCU

---U U A A G U -

-

--U U UU

- -

-U U U U AU U ----U U A.. - --

-_

-

..

---

..5, ,,,...,,^

-

-41-..AA ,Ak.,.-r AC-~:- - - __.. ..

..

jU G G U U C -_

_-

.* _-

_-

A. _-

_-

_-

A C G C A U G U C A _-

Mg

A

,~~~~~~~~~~~~~~~~~~~~~~.......... +...

_-

E21-3 63 68 72 73 75

1610 .

C U

---UUCACU

-

.

E21-3'

-

Oh Me Mb Sa

Si

Tt Dr

Zm

Os Gm Cm Ce Ce Cv

Hs Rn Rc

Sp P1 Gm Cr Sc Sc An

Sp Pa Lt Ls

1570

1580

1 590

1600

1610

1620

Nucleic Acids Research, Vol. 18, Supplement 2275 1640

1630

I

.

.

.

.

.

I C-

.

.

-

-

.

I

.

.

1650

I-

-

.

.

.

-

-

-

-

-

-

-

-

-

-

-

-

-

I -

-

-

-

1660

I

i

-

-

-

-

-

-

-

-

.

.

.

.

.

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

G-GG

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

G--

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

--

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

_

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

U-------------

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

a> -

. i-

-

-

-

-

-

-

-

-

... -

-

-

-

-

-

-

-

-

-

-

-

-

i-

-

4CA C C. _ UC G 5-----U C C G U.--

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

E21-3-' -

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

1640

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

Hc ms Tc Ta Po

63 68 72 73

Ng Pa Ap Ap Cv Ec Mg Oh Me Mb Sa Si Tt Dr

95 96 100 101 102 105 153 173 188 189 201 203 207 208

Zm Os Gm Cn Ce Ce Cv

222 223 225 228 229 230 231

75

Hs .234 Rn .241

Rc .243 Sp .247 248 .~~~~~~~~~~~~~~~~~~~P1 .Q-uGm 253 Cr .255 Sc .256 Sc .258 An .260 Sp .261 Pa .262 Lt .269 Ls .270 -

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

I

-

-

-

-

-

1630

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

--- ---

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

--

-

-

-

-

-----------------

-

-

-

-

-

-

-

-

-

-

-

-

-

-

- - - - - - - - - - - - - - - - - - - - -

-

-

-

-

-

_ _

-

-

-

-

-

_

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

Pc 30

Pp 33 Pf 51 Pf 52 P1 53 Ng 56 - - - -- ---- -- -- - --- Ld 59 ________--------- GI 62

_ _ _ _ _ _ _ _ _

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

v-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

--

-

-

-

-

-

-

-

-

-

-

-

-

-

--

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

I

-

-

-

-

-

-

-

.

-

-

-

.

E21-4

-

-

.

-

-

-

-

.

_________--------

- - - - - - - - - - - -

I

Hs 1 Oc 7 --- Ec 11 - - - --- Tm 12 -__-----__ - - - Dm 13 -- Le 16 At 19 - - - - - - - - - - - - - - - - - Zp 20 _ - - - - - - - - Vc 22 _ _ _ _ - - - - - - Cv 23 _ _ _ _ - - - - - - - Ne 24 - - - - - - Sc 27 -

-_

-

.

------

-

-

.

-

- - - - - -

.

--- -

- --

1680 .

-

- - -

I

I

-

-

-

1670

I

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

I

-

-

-

-

1650

-

-

-

-

-

-

-

I

1660

1670

-

-

-

-

1680

2276 Nucleic Acids Research, Vol. 18, Supplement 1690 .

1 7 11 12 13 16 19

20 22 23 24 27 30 33 51

52 53 56 59 62

.

.

.

Hs Oc Ec Tm Dm Le At

________

Zp

________

.

1700 .

.

.

.

.

.

.

.

1710 .

.

.

.

.

.

.

1720 .

.

.

.

.

I

.

.

.

.

1730 .

.

.

.

.

.

.

.

1740 .

.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

________

________

__

________

_

.

.

.

.

.

A G A G A- A - A

C G C G CG C G C U

- - - - - -

A G

- - - - - - - - - -

- - - - - - - - - - - - - - - - - - -

________

.

________

Vc Cv Ne Sc Pc Pp Pf Pf P1 Ng - - A C U U - Ld GI

-_

________

G C G C A U U A CC A

-

__.______

- - - - - - - - - - - - - - - - - - - - - - -

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

________

________

________

________

-.tau

.-.u. :-. . .t X -U

________

- - - - - - - - - -

________

- - - - - - - - - - - - - - - - - -- - - - - - - - - - -- - - - - - - - - -

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

_-

GA U U U

-AG

C

E21-4'

63 68 72 73

Hc Ms Tc Ta 75 Po

95 96 100 101 102 105 153 173 188 189 201 203 207 208

Ng Pa Ap Ap Cv Ec Mg Oh

222 223 225 228 229 230 231

Zm Os Gn Cm Ce Ce Cv

234 241 243 247 248 253 255 256 258 260 261

-

Me Mb

Sa

S1

Tt Dr

-

- - -

- -- - -

- - - - -

Hs Rn Rc

Sp P1 Gm Cr Sc Sc An

Sp 262 Pa 269 Lt 270 Ls

-

I 1690

1700

1710

1 720

1730

1 740

Nucleic Acids Research, Vol. 18, Supplement 2277 1750

1760

1770 .

.'.

.

.

.

.

1780 .

.

.

.

.

.

.

.

I

1790 .

.

.

.

.

.

....:...', AAAAA

-

U

A UU A A AA UUJt )44l4 A UtA~ UUA A CC&UA LA A

-

C

-

- -

_____

C C C C C U

U U U U C U

CGUU

I

.

G C C G C C -

- - - - -

- - - - - -

U - - U G A A C U - C-- C U G C - - C - - C - - -

- -

- -

- - -

- - - -

------

G C U C G - - - G A C G G G C A C . --U U U AC A G U U CAU C U A A AA g A G A - - - A C U - - - - - G G - - - - - - - -

-

AAU()A

AL*AA

.A A U A A A C(.).... G A A U AA A A A GA G .C(GIU A G GA A A G A -

--C -C -G C C G C -C - - -

GA '.C. A GU C

E21-5

.

A G C A G C - G C - G C

-

G U AG A A .ACUUG .U4A AA GUA A A GGGU UUU UA UUU CItU U AA AG GU AA AA UUAG AG U ( U GAU U A A G AGU U AA U -U ACUf)G A GA A A U U Atf ~ G U A A

.

.

... UAC....... tHJ A U

UU UtAM U U UWa WU ,AA A A A A A UU G U r1Jl$A A A A A A t$i AX BA GU UUU W IfUA CUU AU4aA A A A A A A A E C AA Aj AA W AU ACA A CUA AA AA UBUA K A U UAIX;S U:AX

1800

.

E21-!5'

Hs Oc Ec Tm Dm Le At

Ld Gl

1 7 11 12 13 16 19 20 22 23 24 27 30 33 51 52 53 56 59 62

Hc Ms Tc Ta Po

63 68 72 73 75

Ng Pa Ap Ap

Dr

95 96 100 101 102 105 153 173 188 189 201 203 207 208

Zm Os Gm Cn Ce Ce Cv

222 223 225 228 229 230 231

Zp

Vc Cv Ne Sc Pc

Pp Pf Pf

P1 Ng

E21-6

Cv Ec

Mg

Oh Me Mb Sa

SiTt

Hs 234 Rn 241 Rc 243

. . . .

Sp 247

248 .~~~~~~~~~~~~~~~~~~P1 Gm 253 .(-i. .

Cr 255 Sc 256 Sc 2258 An 260 Sp 261 Pa 262 Lt 269 Ls 270

. .

. . . .

I 1 750

1760

1770

1780

1 790

1800

-.CAQ- . ~UGA ~. ~. ^C~GA'

2278 Nucleic Acids Research, Vol. 18, Supplement 1820

1810

1 Hs 7 Oc 11 Ec

1840

1830

1850

1860

A .. . ................. A . A.. GCA A. A* .. WG tA ^.G G AJA ........ 4-AC - - 4.0(W } .....C A........ -

....... .....................

AA...G........-A. ---A.....U.A.C... .................

Q --............ -- -

......

-G----------------A BA ; ......................... .B......

-

-

-G $mWtG 12 Tm 13 Dm A ....G)" -" --------A'¢ ................................. - G-A WAG4 G A I)A) 16 Le G--L-(A)QCA AIAAC(A UI ... . .G.fi.W A CA G....."UA..:....uG..tAQ..Sg^A..AA A ... 19 At U().A AAC(CGAJGG-C UI: -L ACAU AA A - AA G - -- - L :::.O Ii U(A U.: ----G(A)Al,LAUAU~~~~~U(A) .AAAUA~~~~~~(A)U A A..A::A A.:..AU AU GA G CA U:G 20 Zp UAUSt§S: AG )~~~~~.....A A A U A-g-t*5 GUAC t ....A......... C -A. -U.G-.-A.C CACG ......A...A....... U C 22 Vc --LK..... 23 Cv E ---2AAUACAVVA...AC 24 Ne 27 Sc .....V A(-A 0 -A G C A)l A t) U A G .~ ~ A A A ~~ C A U ~~ 0 G U G A --A CU. 30 Pc 33 Pp 51 Pf AE21G-8 A2O' A EC1A 52 Pf 53 P1 56 Ng 59 Ld 62 GI

-

..

GAi. AU. :.

...

...................A.........

ALIAG.A-AC A-U::AA ----

63 68 72 73 75

Hc Ms Tc Ta Po

95 Ng 96 Pa 100 Ap 101 Ap

102 105 153 173 188 189 201 203 207 208

Cv Ec Mg

Oh Me Mb

Sa

S1 Tt Dr

222 223 225 228 229 230 231

Zm Os Gm Cm Ce Ce Cv

234 241 243 247 248 253 255 256 258 260 261 262 269 270

Hs

Rn Rc

Sp P1 Gm Cr Sc Sc An Sp Pa Lt Ls

1810

1820

1830

1840

1850

1 860

;iC-U - U G C4A

Nucleic Acids Research, Vol. 18, Supplement 2279

1870 ...............

1880

1890

gL*(t4 G

1910

U U

G UUuuU U U CC GG

-

IJ.GG

1900

-

C C------ Hs

1 7 ..U A .------- Ec 11 U A------- Tm 12 U A------- Dm 13 U A------- Le 16 4 U A------- At 19 ----- Z 4 U A . Zp 20 U A------- Vc 22 4 U A -.----- Cv 23 U A------- Ne 24 4UA------- Sc 27 ..U A ------- Pc 30 UAA------- Pp 33 Pf 51 -__-UI-----Pf 52 --___-- P1 53 UGU A .---- -- Ng 56 C UCUAUUGGA Ld 59 ---------- Gl 62

.CC------- Oc

G-----------UU U C G G-----------UU U C A G G-C C u u C G Gt G G--------------C u U C G G4 G-C C U U C G ---

~~A~~~~~4G-~UUUCUUCA: GGU-U C-U G U AUC GUUG-------------U C U G U A A G.U--------U U U CU C U A G Gf) -U ----U. IGUCu - U.U U U U C U U

KGi

i

U

-----

U

----

UU U C A U U A U U U U U U U U G A U A U U C

.*

UU

U U U

~ ~~ --C U---UA

Cu

C ~GAG CUU-C---------UUGU4 CGGCUU-------------UGCG..UU -CC---------GCG4¢

C

-

E21-7'

E21-8

1920

E21-8' -_

_-

_-

_-

_-

_-

-_

_-

_-

_-

_-

_-

_-

_-

_-

-_

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

-_

-_

-_

-_ -_

-_

-_

-_

-_

_-

-_

_-

_-

-_

Hc 63 .----Ms 68 Tc 72 .-----------------------

Ta 73

_--- -Po 75 .__

-_

-

-

-

Ng Pa Ap Ap Cv Ec Mg Oh Me Mb Sa Si Tt Dr

9S 96 100 101 102 105 153 173 188 189 201 203 207 208

Zm Os Gm Cm Ce Ce Cv

222 223 225 228 229 230 231

Hs 234 Rn 241 Rc243 Sp 247 P1248 Gm 253 Cr 255 Sc 256 Sc 258 An 260 Sp 261 Pa 262 Lt269 Ls 270

1870

1880

1890

1900

1910

1920

2280

Nucleic Acids Research, Vol. 18,

Supplement

1930

1 Hs. 7Oc. 1 1 Ec. 12 Tm. 13 Dm. 16 Le. 19 At. 20 Zp. 22 Vc. 23Cv. 24 Ne. 27 Sc. 30 Pc. 33 Pp. 51 Pf. 52 Pf. 53 P1. 56 Ng. 59 Ld GA U UA .--62 Gl

1940

1950

1960

1970

1980

-

-

-

-

-

-

-

-

-

-

-

-

-

..................................

AMA

C G A C

C

WU(

C C A

A A C C U C G

O."O" C

IG

-------------------------------------------------------

E21-9 63 68 72 73 75

Hc Ms Tc Ta Po

95 96 100 101 102 105 153 173 1 88 189 201 203 207 208

Ng Pa Ap Ap

-

-

-

Cv Ec

-

-

-

-

Mg

Oh Me Mb Sa

Si

Tt Dr

222 223 225 228 229 230 231

Zni Os Gm Cm Ce Ce Cv

234 241 243 247 248 253 255 256 258 260 261 262 269 270

Hs Rn Rc

-

-

-

-

-

-

-

-

Sp Pl

Gm Cr Sc Sc An

Sp Pa Lt Ls

I

1930i

1940

1950

1960b

1970

1980d

Nucleic Acids Research,

1990

2000

2010

2020

Vol.

18, Supplement

2030

2281

2040

Hs

-

-

-

-

-

-

-

-

-

-

-

-

-

Oc

7

Ec

11

Tm

12

Dm

13

Le

16

At

19

Zp

20

Vc

22

Cv

23

Ne

24

Sc

27

Pc

30

62....~~~~~ ~ ~~ AcG~

G C C

C

.-

i~

E21-9'

E21-9

-

-

Hc Ms Tc Ta Po

63 68 72 73 75

.~~~~~Ng95 .~~~~Ap100 .~~~~~Ap101 102 -

-

-

-

.~~~~~Pa96 -

-

-

-

-

-

-

-

-

.~~~~Cv .~~~~~Ec105 -

-

-

-

-

-

-

-

-

-

-

.~~~~Mg153 173

-

-

-

-

-

-

-

-

-

-

-

-

-

-

.~~~~Oh 188 -~~~~Me .~~~~Mb189 .~~~~~Sa201 .~~~~Si203 .~~~~Tt207 .~~~~~Dr208 -

-

-

-

-

-

-

-

-

-

-

-

-

. Zm 222 . Os 223 . Gmn225 .Cm O 228 . Ce 229 . Ce 230 . Cv 231 -

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

Hs 234 Rn 241 Rc 243 Sp 247 P1 248 Gm 253 Cr 255 Sc 256 Sc 258 An 260 Sp 261 Pa 262 Lt 269 Ls 270

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

I

I

-

-

-

I

I 2000

2010u

I

2020

I

I 2030u

2040

__ _ -

2282 Nucleic Acids Research, Vol. 18, Supplement 2050

.

.

1 7 11 12 13 16 19 20 22 23 24 27 30 33 51 52 53 56 59 62

.

.

.

I

.

.

.

.

.

.

I

.

Hc - - Ms - - Tc - - Ta - - Po - -.-

- - - - - - - - - - - - - - - -

- - -

- - -

-

-

-P.UGA:~ .

G C G A GA-

I

2090

. . . .

2100

. . . .

- - - - - - - - - - - - - - - - - - - - - -- - - - - - - - -

. . . . C G G G C G G G U GGG

-'..;;.:.-:-'-.: ---:.:;;.;':';:::: -:- .:-: G G G G U - - - - - - - - - - - - C G G G G U - - - - - - - - - - - - C G G G G U - - - - - - - - - - - - C G G G - - C G G G G U - - - - - - - - - - --GGG GU-C---G U - - - - - - - - - - - - C G G G G U - - - - - - - - - - - - U G G G G U - - - - - - - - - - - - U G G G G A G GU-CGGG~~~~~~~~~~~~.......... G C U - - - - - - - - - - - U G G . - - - - - - - - - - U U G G i X; G U U - - - - - - - - - - - U G G 99

G U

G UG - -

::

-

.-

- - - - - - - - - - - - - - - - -

................... ........ ..........

C G G C G G G

.......

22

UU G U C G U U G U U G U AG

*

660 I . .

.

670

. .

. . . .

UCGG U AG UAG. U AG U AG U UG U AG UAG U AG U AG U AG UAG U AG UUG

Ng Pa Ap Ap Cv Ec Mg Oh Me Mb

Sa S1 Tt Dr

... ................

222 223 225 228 229 230 231

M.

. . . .

. . . .

O)4

.

95 96 100 101 102 105 153 173 188 189 201 203 207 208

I

. . . .

. . . .

- - - - - - - - - - - - A U GAA *J.A G Hs -- - - - - - - - - - - - - - - - A U G A A G Oc Ec Tm _ --~~~--_J ACA U G A ... AAAG A *J -A Dm - - - - - - - - - - - - - - - - - - - - - - A U G A 4 G Le U GA AA C _ AUGA At - - - - - - - - - - - - - - - - - - - - - - A U G A I AC Zp - - - - - A U G A iJ.A G Vc Cv UU A G - - - Ne ----------AA UUU GGG AA V.4$AA0 A Sc ---------A Pc -G U A AA-AGC Pp . A C G A 11..... Pf ---------A C G A Pf ---------A C G A AA4$ P1 Ng -A C A AG G Ld C G C C U U U U -: A - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A G Gl E21-9'

63 68 72 73 75

2080

2070

2060

.

Zm Os Gm Cm Ce Ce Cv

...,.,

-

--U

G GA4A

....

G G UAuGA

--U --U

GUU

A

G

22

>........................ ......... G

-

...........

tAC............................ A G G

..........

.,W.%-

.B ......-..G..-..-

GU A

-_ _ --A; A GA AGOA

G

G

..........................

.................................. ...............

234 241 243 247 248 253 255 256 258 260 261 262 269 270

Hs Rn Rc

- - - - - -

- - - - - - - - - - - - - - - - - - - - - - - - - -

P1 Gm Cr Sc Sc An

- - - - - - - - - - - - -

------

- -

Pa Lt Ls

- -

A

A A AG A:.

U A G A.AA A G A: - - - - - - - - - - - U A G - - - - - - - - - - - U A G A.. .A__ A:

- - - - - - - - - - - - - - - - - - -

Sp

..............

-di

- - - - - - -

Sp

........................

U

-A__

2050

2060

2070

2080

2090

2100

2110

CA

2120

VUCtU(A~.....U.. at................ At.:G

CA QUA

..A... U.. U

CU

2140

Supplement 2283

2150

2160

Hs

..A..

oc 7 GA.... ACA....A.A.C.U.U.A.-.A Ec 11 ACAC........ G.UGA A.....U......U Tm 12 )AU..A.CA.CA.CC.CACG.G..( A.C.GA.AC.A.U.C... GU

GA

AA

U

V

CV ....

c

t

&

a~

C

C

A

G

141.... A...C.* CAC.. G WU:U>$( C .C ...QA. V.. tA...A A CA..G..G.AA.:V.U......UU.A..CA...A..CA

A

.C(

)

CA... ...U....C.....V AU... CA U U CGU

~.

2. . 2130

Nucleic Acids Research, Vol. 18,

ISV

A

.AU(V)C.....

CAV........

U

i A A .G A AA U

U U CA U U C (AJV G U CC U(U)G

C

U

GAG

U

G

GA

A

A

S U...V...AU

Vt)

U+GS

CU

A

G

..

.A

C-

GA CGA.C..A..-.

U(

UU

U

Dm 13 Le 16 C U` X A 19 UA At 20 Zp Vc 22 ACAGG U .AC Cv 23 A G U G A AA:A An Ne 24 "Sc 27 Pc 30 Pp 33 Pf 51 ..Pf 52 :, P1 53 I ~ .:~L.FlU Fl'z ~ .~U G AI,A GU A A C* U U:4a A - G A C C...U.C.A UC U .GU..A. C A c U A A -C Ng 56 A'A C G U GA A A A WV C U -U A~C Q&~(A)CAAG A~A A C U A C -AL 59 CA C&U AC C0 (a-C C~-ACGGG U G A A ACA&GYA AUC (C AGACCCCGGGiC 62

~~ ~ ~ ~ ~ ~ ~ ~ ~~

U

CA C 14(MV U::U U C ~A(9)C'~U

22

C-

23

~t ~~.... ... ~~&U(A)CUUtA&.... (AU..... ...4.. U....

A--:A

A.-G 680 .

.

G

VU

AA

.

.

-A

GA

-

UUVC

U

CO

6

A

.

:

A

23'

24

.......... .............................................................

63

68 W.u.MG .A (.i ......C H GG UU GG AA AA AA UYCU GA'"S GA UU AA ..A.MCee... . .C...C .. A.C A CCCGGU C A G U - GG Ms . CSGUAAAA 72 A... U..G.U.A AU... ..A ACGACCG.G.U.G 73 AGGGC GAAAUCAA AUCUGGAGGACA CC A GUG Ta Po 75 700

690

I

A

I. . . .

710

720

.. .. ............ ...........................

.................

.U---G GG AA AA ----U .t---A GG AA AA ..M---A G A A . ---G .U * ---G G A A ...---G GG AA AAI *.t---G :,-.GGAA . ---G G A AI . ---G G A A C.---G G A A *.---G G A AI . ---G G A A -

-

-

AG

A

G U G A A A

G U G A A A G U G A A A G U G A A A

C4.

C

U

G U GA A A G U G A A A

A

C-

G U G G A A U G U G A A A

G U G A A A G U G A A

A

GA

A

G U G

U..;

Z-4 G G A & 'U

J.

C

'r,

G G A A t-J.':: "..c.

G G A

..............

,

......

G U G A A A WA G G G G G

U U U U U

G G G G G

A A A A A

A A A A A

G G G G G

U.. U.::Ic: G-'-'Aj'-'.'G U .............

:::t................. ..................

U U

......

.:.'A+ t-JA A

.:..

IJ %......... U

k

'U W.:

A A C

G A A C A C C G G U GA A CA CC G U

- G

G A A C G C C G A U G A A C A C C A A U

- G - G

G A A C A C C A A C G A A C A C C A A C G A A C A C C A A C G A A C A C C A G C G A A C A C C A G C G A A C A C C A A U G A A C A C C A A A

G G - G - G - G - G - G

Zm 222 Os 223 Gm 225 Cm 228 Ce 229 Ce 230 Cv 231



:A.G

A

GA A C A C C AG U G A A C A C C A G U

G

U.(A)C¢

A

A...A C..

AS

- A

G A A C A C C G G U

-

-

U U U U U

G U A A A A A U U A U 'A'%' G U G A A A 'A':: U C A U :U U A 0 -A C G G U A U A A U U U G G U A A A A A A G C A G U A G U G A A A U..................... G G A G U C A A A U G U A A'G A: X...U G A A A U A G U A 'IJ: "G A U A U A A G,:...' '.:::G A U C A A A .0U

A............................................ U ............%

A A A A A

G

...............

G G G G

G u u U u G G A::

A A A A G A

.................

IJ

A U C A

..A.:A (

2120

-

-

-

-

IJ

U -1:

G

C A C A A C C U A A G A A C - - A C A G A A

......... v............

u C ...U:...

WA

:'-.G G

U A A U

G U A G A C

U A U A A A U C

A U U

A

...

U

G C A A A G C A A A 1-.::

2130

2140

C A A G G

C A A U A

.......................

2150

-

U Hs 234 A Rn 241

- -

- - -

G U U G A

G U U A C

A A A A U

A C U G G U A A

.............

A U A U A U A U

-

A A C G C C A A

................

...

A A U A G A C G C C C A A U A U C C A C A G A -

C R-M W A G A A A ::t-)::X.X -A G A C kM :.: V. -J.'.-.-A G A C .A..G A G A C U .................. ...............X .................. .-A U G X

A A

........................

..............

...........

U

................ ....................

2110

G

G A A C A C C A G A - G

CCAS ACC

-

...

...................

C U G U A G C

....

- G - G

...

.....

A G A A G

G A A C A C C A G U A. G A A U A C C G G U

A

A U U G

0"'."

A A A A Ij.:. A

.......................

V.

- G

95

Pa 96 Ap 100 Ap 101 Cv 102 Ec 105 Mg 153 Oh 173 Me 188 Mb 189 Sa 201 Sl 203 Tt 207 Dr 208

...

..........

A A A A C A A A A A A A

....

U A G

G A A U A C C C G U G A A U A C C G G U

a...

AU

G C A G A

...-.G

...

G U A A

A

Ng

U

23'

U'u

G G A A

U A A

A

- G

ACA:S

G U A G A A Ut G C A G A U A

G

G A A C A C C A G U G G A.

G U A G A G G U A G A G

- G C A G A

.....

G G A A

:Aj-

C

G

UA GA

23

..............

"G

G U A G

G U A G A

CC^ G

G A A U A C C

A

G U G A A A U.t..-. G U A G A ..VAVA

G U A A A A G U G A A A

.-.-AGAA

G G A A

.

A G A G

G U A G

.............

A - A A - G U A U U A U C - G G - C AG - U G - U G

2160

Rc 243 Sp 247

P1 248 Gm 253 Cr Sc Sc An

Sp Pa Lt Ls

255 256 258 260 261 262 269 270

~.

2284 Nucleic Acids Research, Vol. 18, Supplement 2170

*

.

.

.

.

.

.

.

2180

1Hs MtG-----------------7Oc ----llEc ... ---12Tm QC---------------13Dm 4------------------------------16Le l9At 20Zp ---------------22Vc .---------------- --------------23Cv 24 Ne .------------------ 27Sc 30OPc 33Pp :C ----51Pf -&-C -----_--___---__ -- - - - -_----52Pf 53P1 -C ----____________ ------__________ 56Ng 59Ld __________ .--------------62 Gl ----------

-----------

24 ........

63 68 72 73 75

Hc Ms Tc Ta Po

...n

_

a-"..T.

2210

2200

.......

2220

.

.

.

.

.

W_____

u> E;-____.

| F_____

fi_____

m§E _ _ _ _ _ !..9._____ B.N_____

%n_____

B_____. >>t .x;

-

_

_

_

_

--GA A A-G------t--G U --GA AG------------------ G CA G----------------

A U U cC-

.........2 249

22'

-G --

-

-G--

-G--

-G --

-

-G --

-

R _ _ _ _ _ *m. :X::

24S .....

222 Zm 223 Os 225 Gm 228 Cm 229 Ce 230 Ce 231 Cv

234Hs 241 Rn 243 Rc

B-----,

-

-

i:*s

e - - - - t - - - - H -

-

-

-

-

Ct - - - - -

AC-------

t-----------A

--------- -

247 Sp UA----------248 P1 -A-C -----------C-------253Gm 255Cr g----------256Sc .t----------258Sc .U----------260 An &-----------

261

Sp

740 .

-------GAAG.---------GAAA---.------GAAG.---------GAAG----------GAAG----------GAAG---.------GAAG.---------GAAA----------GAAG----------GAAG----------GAAG---.------GAAG---.------GAAG----

*...>......

-IIV

.

--

--------GAAG.---

M_____.

.

-GAAA.-----CA---.U

730

95Ng 96 Pa 100 Ap 101 Ap 102 Cv 105 Ec 153 Mg 173 Oh 188 Me 189 Mb 201 Sa 203 SI 207 Tt 208 Dr

.

--GA A A-A----AUU -----------GAAA---------CAU U--GA A A--- -------GA A A---------------AUU --GAAA.--------------.--AUU--GA A A----------------U UU--GA A A--------u u-------GA A A------------ - ---U U--GA A A.--------.------U U --G A A A----------------AUu U --G A A A------------------G A.---AUuUU --G U A A.Q A u --G A A A ---------------U U --GA A A.---.-.-.-.-.-.-.---- U-U --GA A A---------------U U

---GAAA ---GAAG -- -GAAG -- -GAAA ---GAAG

- -

.......

2190

I1

I

-_____A -__-

-QC~~~~~~>

G---~ -_____-

G----------------__-_-_-_-_->---_-

--

G-______ --C

----___ _

_-

G

-_____GC .G ---~ G----____--

C--G------..G--_____

--

--GA-

--

A-_____ .-

249

GAAA---GAAA---GAAA---GAAG---GAAG---GAAA---GAAA - - - -

22'

A -------

-~---A--__ A -___2 ----

----;..G--__ A ---|.. --_ GC A -__ ---~

----GAAA.GU-------G.>.-----------GA A A------------------------GA A A----C C CU- --------

- - - - -

A C---

------AA U A A-______________ C -----GA AG-- - - - - - - - - - - - -A----------G A A A-- - - - - --A-------

----GA A A.---------.-.-.-.--A--------GAAA.----------$A.--AG A A G.--------.-.-.-.-. -A .---A -. -A -------A A G U.-A

262 Pa ,----------269Lt ACAUUU UUAU A 270Ls ACAUU AUUAU j U

AG-Q-O------- A----------GA VAAUUAUAUL%U1 . ,AUUBCcAuCA-cU ~ U U A U A U C A U A U CU U ......

I 2170

2180

....,,,,,..*.*. *.....

2190

2200

2210

2220

.k:A

2230 .

.

.

.

.

2240 .

.

.

.

I

.

.

.

-A

Nucleic Acids Research, Vol. 18, Supplement 2285

2250

*

.

.

..

2260

2270

I

. . . . . . . . . . . . . . . . . . . . . . .I. ... . .......... .........I. ......

1..I. .

.

2280 2280

. .. ¢g A ,UCAGAU A C C Hs - AA A G4 U U C GA A . ........G..U.C. G U.UC.GA.A.. A A A..... A ,UCAGAU AA -AACC Oc

A --A A G U U C G A A G U U C G A A -AAJ~A A .. .r.--A ..G U U C G A A Ai - - A A G UUAGAmS8m.G G

U C G A A .

A

U C GA A

- -AA

GGC U CGAA

A A

G

;^--A - - A A B. U

A A - A GAGA3

A --A

22'

.C

GGG C U C G AA A

. u A B UA.W G C U E --A - - AAGI) A GG AA GUU A -CAAA - A AAS..W:E G.A.GAG A--A A G AAGGG AA GU A ~A A A~GG A G C A A A A... GA U

.

A A A A A A

C

CC GG AA CG A CU GA G A

A X AAA ...l A U G A AA ~:~ U G A A C G A A.

GA ~ U C GA A C U A G A A

21'

A

U

A A

,UCAGAU A C r Ec ,UCAGAU A C C Tm ,UCAGAU A C Dm ,UCAGAU A C CLe ,UCAGAU A C C At ,UCAGAU A C C Zp U U A GA U A C C Vc ,UUAGAU A C Z Cv ,UUAGAU AC Ne U UAGA U SC AC ,UCAGAU ACC Pc ,UCAGAU A C Pp ,UCAGAU A C Pf ,UCAGAU A C Pf ,UCAGAU A C CPi ,UUAGAU AC Ng U UAGAG A C C Ld ,UCAGAC A C CGI

7 11 12 13

16 19 20 22 23 24 27 30 33 51 52 53 56 59 62

25

..........s....S.S

A A

A

A A

..A

A A

A

A AC

..

Q.C

A--A A--A A--A A--A A--A

~ G A GCGA CC. G..A -

A A

.....

:

GC A GA-

G

...

750

760

I*

I

*I

G G G G

. .

. .

. I

I

I

*A

GA ....A GA

`GA

C

gC~A AG

.

...

A.

G

A

..

A..A

U:U AL

A

A U

U G C

CA

U

U-...G........

UW G(C)kU.: f$. U. ." AUUU---------GAACGC AU AAGC

U

G

CA C

U

A A A

A C

C A

G A -. .............-.. G A G UU. .aS..^ .A. .A.i.UX GAG G A - C A.; Uf. Ui. G

G U

. . U ...U

AA

A .

. . . . .A.i9 ...

C

G A

A C

U

-

G G

C"

G

A U G A. C.A.U..G G A

-A

AUU AUU AUU AUU AUU AUU AUU AUU AUU AUU AUU AUU AUU AUU

*

AA

AGAUACC AGAUACC

A G A U A C

A A A A A A A A A A A

G G G G G G G G G G G

A A A A A A A A A A A

U U U U U U U U U U U

A A A A A A A A A A A

C C C C C C C C C C C

63

Ms 68

Tc 72

Ta 73 Po 75

Ng

95

Pa

96

Ap 100

Ap

101

Oh

173

Cv 102 Ec 105 Mg 153

188 189 Sa 201 Si 203 Tt 207 Dr 208

Me

Mb

25

AUUAGAGACC U U AG

Zm222

AUU AUU AUU AUU AUU AU U

Hs 234 Rn 241 Rc 243

A A A A A A

A U U A G A U U A G A U U A G A

G A C C Os 223 U A C Gmn 225 U A C C m228 U ACC Ce 229 U U A G A U A C Ce 230 U U A G A U A C Cv 231

..

;; ;;;

C

I1

790

A--AA A--AA A--AA A--AA A--AA A--AA A--AA

C..U....... AA.C.-A- U AUC. A

I... I. . . . I....

A -AA -AA

'-

GA

u

.

I....



G A GA G A

CA

C

...

.GG AA---AA A AA-AA

21

.A CA C GACA

C C C C

780 .

U U U U

A A A A

XA -AA GA--AA G A -AA GA--AA G A -AA G A -AA -AA GA GA--AA

22'-

gCC

.,

-

A AA A-

,.. 2............. ...........2......1.'..

Q C

A A U U A G A A U U A G A A U U A G A

770

. .

GAGAGA GAGAG A GG AG AA -G A-

....

AUUAGAUACM Hc A U U A G

A

W

.~U

G U U U

_

.x

2230

2240

A.... U............. -

.......

AA- UA C GA- U AC GA--CAGC G A- AAUC^:.. GA- AAAC G A- A AQ G.A- A A A

G G G G G G G

A A A A A A A

U U U U U U G

A A A A A A A

C C C C C C C

Sp 247

Pl 248 Gm 253 AUU Cr 255 AUU G A U A U Sc 256 AUU G A U A C Sc 258 AUU G A U A C An 260 AUU G A UA C .Sp 261 AUU G A U A C Pa 262 AU- -GGAA-- Lt269 AU- -GGAA-- Ls270

UA- AAA GA AG~ UA--AAA G A - - A G . G A-- A GG G A--AG... - C A U GACACCAU

I 2250

A A A A A A A C C A A A

2260

I Zz /u

2280

2330

2320

2310

2300

2290

.AU U............A.G....C.....C..A..A..A..GA.UG.CC..A.CC....CG

Hs

....... ............. .... .. ................ ......... ...

.........._ .-J ...- -t

"' -t

t'

(

I 2

2290

2300

2310

2320

2330

2340

Nucleic Acids Research, Vol. 18, Supplement 2287 2360

2350

I

.

-

.

.

.

.

.

I

.

.

.

.

.

.

2370

I

.

.

.

I

.

.

2390

2380

I

.

.

.

I

.

.

.

.

I

.

.

.

.

I

.

.

.

.

2400

I

.

.

.

.

.

I

.

.

.

.

I

CCc

Hs 1 Oc 7 ------------------------------------------CCUCAAA Ec 11 ------------------------------------------CCUCCGA Tm 12 ._______-__-----------CUUUUA Din 13 .---------------------GCUUUUA Le 16 ----------------------GCUUAUA At 19 ______----------------GCUCUAA Zp 20 C U U U U A Vc .---------------------CUUUUGA 22 GU U U U U A .____-----------------UCUUCGA Cv 23 .---------------------UUUUUG- Ne 24 Sc 27 Pc 30 -_______--------------CAUCUC Pp 33 .----------------------UUUCG Pf 51 .------------------UUAG Pf 52 P1 53 ---------------------GCAA . _----------------------AUCCC Ng 56 Ld 59 %>¢¢giCAG:~~~~~~~~~~~~~~~~~~~~~~~J __- ---------------------CGCGCGUCC Gi 62

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

u

-

-

-

-

-

-

-

-

u C

-

-

-

c c

-

-

-

-_

-

-

-

-

-

-

-

-

-

-_

-

-_

-

-

-

-

-

-

-

-_

-_

-

-_

27 -

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

--------------------------ACG- Kc -____________-------------GCG- Ms --------------------------UCG- Tc -------------------------GUUG- Ta --------------------------UGG- Po

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

--CUUGA --CUUGA --CAAGA - - CU U GA --AUUUA --CUUGA --CGAAU --UAAA- - A U CG A -

-

_

_

_

_

_

_

-

-

-

-

-

-

-

-

-

-

-_

-

-

-

-

-

-

-

-___--------------------ACGU

-

-

-

-

-

-

-

-

-

-

-

-

-_

----------------------AUAUU--.__________-----------AUAUU--_A _G .~~~~~------------GUUUA-

-

-

-

-_

-

-

-

-

-

-

-

-

-_

-_

-_

-

-_

-

-_

-_

-_

-_

-_

-_

-

-_

-

-_

-

-_

-

-_

_

_

_

__

-

-

-

-

-

-

__

__

-

__-

__-

__-

__-

_ _-

_ _-

_ _-

_ _-

__

__

_ _-

__

__

_ _-

_ _ -

_ _-

__-

__-

_

_

----------------------ACAAGGGCG

-_

.__

I

I

235,0

2360

I 2370

I

I 2380

75

Oh Me

96 100 101 102 105 153 173 188 189 201 203 207 208

-A- UC GA Zin222 -ACUC GA Os 223 GA Gi 225 - CG A A U- Cn228 -CGAA U- Ce 229 -ACUC A A Ce 230 AGGUU A A Cv 231

-

-

Ec

Mg

-- U C U Tt --GAUG- Dr

-

-

Pa Ap Ap Cv

- - A U U CC Sa --GAUAU Si -

63 68 72 73

Ng 95

U V C C Mb

-C

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

2390

2400

Hs 234 Rn 241 Rc 243 Sp 247

P1 248 Gm 253 Cr 255 Sc 256 Sc 258 An 260 Sp 261 Pa 262 Lt 269 Ls 270

2288 Nucleic Acids Research, Vol. 18, Supplement 2460

2450

2440

2430

2420

2410

lHs----~~~~~

--_______________ 70c 11lEc U----------------------~~~~~~~~ 12 Tm U-----------------------~~~~~~~ 16 Le G-----------------------------l9At G-------------------------------------------_____________________________--_- - -$. 20 51Hs---------------f Zp G-----------------------------22 Vc U 23Cv U _ 24 Ne________________ ________________ 27Sc 30 Pc U _ --______________ 33 Pp-

______

52Pf--_

_________________ 56Ng g.+.tC.; 59Ld GUAUCUUUUCUAUU.---------¢.:C-UC 62G1 --------------________________ 27'63 Hc 68Ms 72 Tc 73 Ta 75 Po

95 Ng 96 Pa 100 Ap 101 Ap 102 Cv A----------------- -- -- -- -- -- - --___-__-__--__-__-__-__-__--__-__-__-__-__-__--__-_-l05 Ec Iv L.%. 153 MgA-173 Oh --188Me C-189 Mb U U 201Sa AC203S1 ACA 207Tt --208 Dr - - -

222 Zm 2230s 225 Gm 228 Cm 229 Ce 230 Ce 231 Cv

234 241 243 247 248 253 255 256 258 260 261

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

__- _-

_-

_-

_-

_-

_-

-_

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

__-

_-

_-

_-

_-

_-

_-

_-

_-

-_

_-

_-

_-

_-

_-

_-

_-

_-

-_

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

__-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

-_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

-_

I.

_-

-_

-

-

C C ---CC---C C --------------

A A A U C

----

-

-

Hs Rn Rc

Sp P1 Gm Cr Sc Sc An

Sp 262 Pa 269 Lt 270 Ls

-_ -_ U A A G C

2

2410

2420

2430

2440

2450

2460

~I

.

2470 .

.

.

.

I

2480 .

.

.

.

.

.

-

-

-

-

.

2490 .

.

.

.

.

I

.

.

.

.

I

.

J-U C U UC

C C

u C UC

C

2500

.

.

C

G

I

.

.

.

2510

I

.

.

A

.

.

I

.

.

.

2520 .

.

.

.

I

.

.

.

.

I Hs

G A A A A-- A A A A OC: --A A

-

Oc Ec Tm Dm Le

-

-

-- AA -A ~A AA - -A A

gUA UA -A ttA

-

GAA

Pc 30

::ij

UGU A GAAAUCAC- GA G UUA * GAAGAAAACG--AA G AAAC--A JUU A A AA

UA

Pp 33

-N

Pf 51 Pf 52 Pl 53 Ng 56 Ld 59 GI 62

-AA

A GAAA

-UAA GAAAG U-UUG C UCA A............... A-:GGAAA uG-GGA AG

IUCAC 27'

-

28

28'

GUA- -q3 GGA GUA- - GI..-G A A G GAA- - ..- G A A G GUA- -. ...- G A A G GCAGGA A G QC-C

-

850

860

~~G

U

AG

20'

G GAAG G U

Hc 63 -C----C2.

-CA

G

Ms

G U U G U U

Ta 73 -----C

G U U

Po 75 -CG

880

1I.

....

U A A C t~VG U G A A~

A-

C~U A A C G CAAU AA*C

Ng 95 Pa 96 Ap 100

A

U A AC G U U A U A A C G C U U A A C ~VG U U A ~ A UA ACAA UU AAA*C U A A C A U U A ~U A $ ~UUA AC A A U AA U A A C GCA U U A A~

|,V

G U AC A

~~~G

G G A

Ap 101

Cv 102 Ec 105

~U

~G C AG A A

G G A G U A

Mg 153

-

A GAU AACt U

U A A C

28'm

28

*&

269

20'

GUAQVA UAACI;CGUUA GUA- -C UAACGCGUUA GUA--CUAACGCGUUA GUA- -C X-UAAC CaGUUA GUA- -.V.. UAAC.G..*GUUA GUA..-UAAC " GUGA GUA- -G.C -U A A C G U G A

I- .- .- -

A --A U C -A A C---A --A C C -U A C---A U U UACACC-- --

"C C A -

-Hs 234 ~ -- Rn 241

A A

AA--

U A A C

G U GAA A A- G A A A )[U.1 GA --.> -G A A A ..I G A A

-

-

-

-

-A.X. ....' it...>

-wlU -i. -t

(A-------)A (AA

.0 00 i 01..0

_0'

UA

x

:B

A

U A C C :..

.. A A A A

.. A A IU U A A A

...

A

G A

CCRc

- -

- -

G>U

-

G U

C A

G A A G G A A A

G G G U

CSc 256 CSc 258

C

-_

_-

_-

_-

_-

-_

_-

_-

_-

_-

2490

2500

--C --U ---- C

U A A A

A

An Pa 262 Lt 269 Ls 270

260 G..C Sp 261

- - - - - - -- - -- - -

2510

243

Sp 247 G A A P1 248 Gm 253 _ __-CfCr 255

__________

G A A A

__________________________

2480

C CA

A A ______

A-U

__________________________-

2470

Cm 228 Ce 229 Ce 230 Cv 231

-

- - - - -

-

Gm 225

-

A A

-

Zm 222 Os 223

-

........

.....

A-

201 203 Tt 207 Dr 208

-

27'

173 188 189

Si

-

A A

-

-

-

G C A-

__C

Oh Me Mb Sa

-

G C A-

GA A-

68

Tc 72 -C~

870

G C AGA A-

A

-26'-

870 *1..

*

12 13 16

Cv 23 Ne 24 Sc 27

--AA ~AA

GAAA GAA AA

1 7 11

At 19 Zp 20 Vc 22

GAAA.G --AA

.

eU A

I

.

.

A--A A A- -AA

G A A A U.... GG A AA A AU - G A A A

.....

U U A -

UUA

.

G A A A G A A AC.(

U UAC U A U JUA

.__

-

.

Nucleic Acids Research, Vol. 18, Supplement 2289

2520

~.

2290 Nucleic Acids Research, Vol. 18, Supplement .

1

.

Hs

.

.

.

2540

2530 .

.

.

.

G-CA-A

G-CA-A G-CA-A G-CA-A G-CA-A G-CA-A

GG

IGG

Om

iGG iGG

G-CA

GG

G-CA-A

G-CA -A G-CA -A G- C A - A G-CA-A G-CA-A G-CA-A G-CA-A G- C A - A G-CA-A G-CA-A G-CA-A

GG

GG

GG

iGG iGG iGC GC

GC

iGG iGG iGG

63Hc

29 GGGGG-

...

.

68 Ms

vmUG

....J...

C.U-x:G

75 Po

I

.

.

105 Ec 153 Mg 173 Oh 188 Me 189 Mb 201 Sa 203 207 Tt 208 Dr

S1

222 Zm

225 Gm

.. m

228 Cm 229

Ce

231

Ce

Hs Rn Rc

V(g)G

A

,

A

256

Sc P

....

261

Sp

270

Ls

-

-

-

-

-

C

A

C

U

-

A

A

C

U

-

A

A

..

...

A

C -C A....

C - A-:" C -CA

U

c -CUA,,. ....

C

U

[.U..

G

C

30

2'

29'

-

A

C

GA.-x

A

C

U

A

C

.

...

C-CA.

C-CA.

A

A A

A

U

UU

X

G

m

A

U

A U

U.. ..

920

910 I

.

.

.

.

A-A A-A A-A A-A A-A A-A A-A A-A A-A A-A A-A

.

.

.

.

.

:AAA

C

C

-A -U -

A

-

-

A

A

C

-

A

:Ct

A

.U - - G..U.

- - - - - - - - U-

-

U A C A

A..

-

-

C

A.

-

- C A

-

-

C A

-

-

CA'

-

-CA0,'. A.

31

--c

A

c

A::

A.e. At:: A:

--c

A

A

A

C

U

.....

>

.

G

-

C

A

-

A

A

A

A--.-.A

C

.

-

-G

-

-

G

-

C A

-

A

- A

G£A

A

A

...

A.

ACC

.A... A A

U U A A A A ACUC A A A A -A AC C- C C AAt^AC A A

U

A

C U

-

A

AXG %A

-

C C

.AA A C U A C

. .U.C

A

GAC

-

U A U C A U C

C A A

C C

A

CA

-C G--

C

2540

2550

2560

U

-

-

GA

-

-

A A

- -C

u CAAU '.AAA --C

2530

C

A

AA AA-:A: U--G~~~~~~~~~~~~~~~~~~~~~~~~~ A G A A .......~~~~~~~~~~~~~~~~~~~:. .......A ..............~G C A

-W-

-

A

:

....

-

A.

(G-

. .A

-

c --c --c --c

A A~A

A

A

-

30

A AA

A

.

- -C

2'

A A

A

.

A.:.:. -A-:-.

u u u u u u u

AA0A

- A

-UC A.:

930 .

.

.. U - A C - A A....A U A A A A C U C A A ....

.-......

G) J.:AAAAiGs AGUA

A

31

.....................

.

Aq C U'A ~ A GA f -

-CAG. -CA^..

C

AA AA ,AA ,AA AA 'AAI AA IAAA AAA AA ,AAA AAA

A

A.

C -C

A U CA

GA A

A -A A -A

CA

A.

C

A U AA UU

A.A.A

A A

A-A

-C

-

C -C A....

............

A

G

GA A A

C

A U

A A

A A A

..... ..........

-

U

-

m

A A

C A":

-

C -C A.

A U

A A

...

...

-

Sc

-

C C C C C C C C C C C C C C

C

A U A UU A U

AmG

A A

q

s

-

247P1 CpU-.-A-..G~-A-253 2 258

-

G G

A

A

C

-

.

A

G ~(A)A... $...

Gm

A A A A A A

S.1.. A.... A A A... A A. ..... . A.A A A A .C...

-

.

A U A A A&.A 4.- A U

U.

-

.

A AG4 A U A A. A U A ANA A U A A U A t A U A A U

29'

..........................

243

A

.

.

A A

..

G

..

241

-

. . .

A A A A A A

29

G

. .. . . .. ...

234

.

2580

2570

2560

.

AA GGJ

G

.

A A A A A

.

.

G G G G G G G G G G G G G G

20'

223 Os

.

G-G-G-G-G-G-G-G-G-G-G-G-G-G--

95 Ng 96 Pa 100 Ap 101 Ap 102 Cv

C C C C C

900

890

.

-A

G-CA-A

iGG iGG

20'

2550 . . .

.

.

GG

Le At Zp Vc Cv Ne Sc Pc Pp Pf Pf P1 Ng Ld Gl

72 Tc 73 Ta

.

GG

7 Oc 11 Ec 12 Tm

13 16 19 20 22 23 24 27 30 33 51 52 53 56 59 62

.

A..

A

2570

2580

.i ; ~. 31 .

.

Nucleic Acids Research, Vol. 18,

2590

.

.

.

.

2600

.

.

G G G G G

G G G G G G G G G G G G G G G

.

.

.

.

.

2610 .

G G

.

.

I

.

.

.

.

I

.

.

.

.

I

2620 .

G

U U A A tYU U 4 A C U C A A C A UA U A A U U:%G A C U C A A C A U.. U A A -............. U C A

G G G G G G

U U U U U U U U U U U

G G G G G

G G

U U U U U U

G G G G

U U U U U U U U U U U

U U U U U C

.

I

.

.

A A 1tUG A C U C A A C A A A U. U. A C U C A A C A A A U.tG A C U C A A C A A A U ..:4. A C U C A A C A A A II.%.G A C U C A A C A A A tflIIIGACUCAA.C....A.C.A y*G. A A UIJ . A C U C A A C A A A $.f....... A C U C A A C A A A t U A C U C A A C A A A 1 A C U C A A C A A A I. A CU C A A C A A A UU.AG A C U C A A C A A A *.... A C U C A A C A A A U.. A C U C A A C A A A U.'.U A C U C A A C A A A . A C U C A A C A A A A C U C A A C G

.

A A A A A

A A A A A

C C C C A A A A A A A A A A A A A A C C

A A A A A

G A A G G G A A A G A G A A G G C

..

2630

.I

.

A A A A A A A A A A A A A

A A A A A A A A A A A A A A

-

2640

C C UU CC A~ A CU U Am CC UU C AA C U UU A C U U A C C U U A . C U U A C U U A ... C U U A C U U A . C U C A ¢. C U C A C U C A f C U C A C U C A C U C A U U U A ¢. C U C A ¢

Hs Oc ~~ G(~~CCG G A Ec C( G G AA Tm Dm Le g A G A At ~~~~C Zp C A G A Vc Cv Ne Sc Pc ~A~G ~)CCG GA Pp ~~~~ G A Pf ~~~~ ~~A G A Pf Pl A

32-

_33

~~A

**A ~~ GW~~C

'

33'

.9. rA

O(A)A ~~~~~ ~ G G A~~~~~~ C(A)A C~~ G A

G

G

A~~

~~

U~

A J U U A A UIJ G~~~~~~~~~~~ G~A U

C~

GU

U

U

A

A

IU

A

G

U

C A A C G

U

C

A

ACG

A C

34

-

A Hc 63 ~~~~~~~~~~~~G

GA ACCUCAAMA6 C A Tc 7

AA CG

U

1 7 11 12 13 16 19 20 22 23 24 27 30 33 51 52 G A Ng 53 56 Ld 59 GG A Gl 62

~~CG G A

gX

:............... ...

C ( A ) A ~ % ~ U G

U C U

A

G

A

A

U C

A

C

UUA

C

C

A

%

C

G

~~~~~~~~~..GG A A U C U U AG A

940

950 .

........... .....

,. .

.

Supplement 2291

.

.

.

960 .

.

.

.

.

970 .

.

.

.

.

.

.

.

980 .

.

.

.

.

.

.

.

.

.

.

I

Ta

Po

7

75

990 .

.

.

.

.............................

..

..i

.

......

,UUAA IUUAA

iUUAA

C(A)A- ~C1~,U G G

.G% UG UG C(A)A -H~UG C(A) * ~-~~)G

G G G G C(A)A~C~U G G A)A~C ~ ~G G C(A)A-~CGQCG G C(A)A-~C~LJG G

IUUAA IUUAA UUAA UUAA UUAA UUAA UUAA UUAA UUAA UUAA

C(A)A- GC%~UG G

31

.C fA:

A-R..

-

AUGCAACG AAGCAACG AUGCAACG AUGCAACG AUGCAACG AUGCAACG ACGGUACA AAGGUACC ACGCAACG AUGCAACG ACGCAACG

iUUAA

X A~t~UGG

ACGCAACGG AAGCAACG AAGCAACG

32

33

U U U U

G G G

G

G G G G

C ( A ) A

A A A A

A A A A

95 Pa 96 UUGA Ap 100 Ap 101 UUUGA ,UUGA Cv 102 ,'UUGA Ec 105

Mg 153 UUUGA UUGA Oh 173 UUGA Me 188

UUGA UUGA UUGA UUGA UUGA

-34 A A

A

C C U U A

GGA A C CU U A

A AUG C A A C G . AGA AAUG C A A C G GA AUG C A A C G G A At.-.AUG C A A C G GGAGA

U A A U A A

iUUGA Ng UUGA

UA UA UA UA UA UA UA CA UA UA UA UA UA UA

33, . AUG C A AG G GAUG C A A AGGCQ A C A A AG AUG

U A A

G G

GAACCU GAACCU AAACCU GAACCU GAACCU GAACCU AAACCU AAACCU GAACCU GAACCU GAACCU GAACCU GAACCU GAACCU

A C C U U A A C C U U Ai A C C

U U A

A CC U U A A C C U U A

Mb 189 Sa 201 SI 203 Tt 207 Dr 208

_ UUGA UUGA UUGA UUGA UUGA UUGA UUGA

Zm 222 Os 223 Gm 225 Cm 228 Ce 229 Ce 230 Cv 231

G.. U ..

,..

U A A U A G

G G A.

G

GG4 u*.2A C(A GG A %XA t . ... .. - C Cc Ccj.. ..W ..... G. AGA .C.

,.A....

A U A

-C

-..U C C

C*.,

.'¢

..1. .,Q

*.

U(A,) -..-..G -.C.- .

v*@ *v

GGA. G A` GGi

U

A

GA

U A A A

U

U.MUAA.(U.Y

GGA^. GGA.

U

G A

U C C C

r..

C

..

.

C

A C C UC A U A C C

U U

A

CAJA U A C C C G A

U U A A

A Ay X4U GG G A

AAiA_

.A

*

U A C G

U G

A U C C G

U A A

U C C G

A U GA C C C A

-_

2600

A U U

C A A A A U C U U A

GAUAAUCCACUAAUCC)AAACUUUA

.A

U U A A

U U A A

C.A

A U A A U C C A

U A

GAA A

2590

A U A A A C C C A U A A A C C C

U G A G8,..,a,',CR A U A A C C C A G-S A .C C A C C U C A U G A A-1.:A U A A U C C A rW AA C A C C U C A U U A A U(,t) A U A C A A C G A A A C C U U A

AG...

_________U

.-...W....... ..K)C

_-

2610

_-

_-

A A C C U U A

C..A.A A _-

2620

A A C C U U A A A U C U

U A

A A C C U U A

U

U G C Hs 234 Rn 241

*C(W.ZQ...Q..W U C G C G C u

UUG

U

~~A(A)~~~~IJU

U G U

P

(~~A~~GC>CC~U

U G A

Gm 253

U U U U G

gJU

UUG

CAC&~~~~*I~U

U G A

2630

248

Cr 255 SC

Sc258 An

260

U U A A

Sp 261

U U G A

Pa

-------------------

__- _- _-

Rc 243

Sp 247

2640

262

Lt 269 Ls 270

2292 Nucleic Acids Research, Vol. 18, Supplement 2650 .

.

.

.

I

.

.

.

.

2670

2660

I

.

.

.

I

.

.

.

.

.I

.

.

.

I.

.

.

2690

2680

I.

.

.

.

.

I.

.

.

.

I

.

.

.

.

.

I

.

.

.

.

2700

I

.

.

.

.

.I

UUGAG----------AGC UUGAG----------AGC UUGAG----------AGC

AUUGAG----------AGU

UUGAG----------AGC CUGAA----------AGA UUAAU----------AGC UUAAU----------AGC UUAAU----------AGC UUAAU----------AGC UUGAG----------UGU 'CGGGC-----------GC

56Ng

59 Ld 62 Gl

35,

35 GrI AA UII - - - GAA---------GAA---------C U GAC---------C U GAC----------

63 Hc 68 Ms 72 Tc 73 Ta 75 Po

-

1000

.I

95 Ng COAU~4 AG$G -A G 96 Pa lOO Ap COAU~1 lOl Ap COA044A

I.

I

. . .. .

COA~ COA* ~4 0 --GAA COAU COA~~U V. --AAA W. 188 Me OA~3 --CUA 189 Mb COAU.~4 G --ACG --AAA 201 Sa COA*.~C

4..

203 Si 0 A 207 Tt COA~1A 208 Dr COA~*C~

I'_

35

222 Zm COA~C 223 Os COA. 225 Gmn OA 228mOAnUt A

- - -

--AAA --A A --A A.

44 AA

- - -A AU

- - - - - - - - - - - - - - - - - - - - - - - - - - -

U U

- - - -

~

260 An

-A

261 Sp

i

-U

-

~

U

A

-

- - - -

. . M- -

t----GUG-

G A

-

-A AA

-

GC------------

UGG-xBCO-------

A;-- - -GGUG - Q--------

GA

A A

-AA-AA

A

- - - - - - - - - - - - - - - -

_-

_-

_-

-_

_-

_-

-

- - - - - - - - - - - - - - - - U

WO

G U A A A A A A A U G C U A U C U A C U A C C A U C A C U

-

-

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

2660

-_

-_

2 2670U

_-

U G'4 A C -~~~A

U U A

G

CU--

-

- - - - - - - - - - - - - - - -

U'. -AA

- - - - - - - - - - - -

-

UUUUG AGA_.. -. -. ............................. ---GUU-.......-

- - - - - - - - -

U

- - - - - - - - - - - - -

P35-2 _- --G

3

-AAGA

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

2650

-

-~~~~ --t-; A GU U u>x.C-u

- - - - -

. A U A

-

U U

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

A.A

258 Sc

. I.

.--GUG-..U--------

G A

GAf~UCAGA

---UUULLCCCU.

- - A U U - - U U A U - - A A C A C - - A A A U 253 Gm A I"k::X..Am. 0.C A A C A A A 255 Or 255 Cr U A - - - - - - - - - - - - - _ ....-U A 256 Sc

-

-

-P35-1'-

AUCCtiCt

- - - A AU

-

----G U G ----G U G _

A

- - -A

-

----GUG-QCU--------GU G-u - -- -- -- -- -- -- ----GUG-CtU------- - - - - - - - - - - - -

-P35-1

-

2290Ce COAU&4& 2300Ce COAU~4C1 2310Cv COALGfAU

U U A

. .

GA--GA GA--GA GA--AA GA--GA GA--GA GA--GA GA--AA GA--AA GA--GA GA--GA GA--GA GA--GA GA--AA GA--GA

--A A --A A C --A A --A A --AAC --A A

1020Cv lOS Ec 153 Mg 1730Oh

-

1020

1010

I.......

262 Pa 269 Lt 270 Ls

I

.

AUUGAG----------AGC

20 Zp 22 Vc 23 Cv 24 Ne 27 Sc 30Pc 33 Pp 51 Pf 52 Pf 53 P1

Hs Rn Rc Sp P1

.

UUGAU ------AGC UUGAU----------AGC UUAAG----------AGC UUGAG----------AGC UUGAU----------AGC OCUGAG----------AGC OCUGAG----------AGC

1 Hs 7Oc 11 Ec 12 Tm 13 Dm 16 Le 19 At

234 241 243 247 248

.

UAUUUAA-AGUAGO AAAGGUAA U U

- - - - - - - -

-___-

-_

_-

_-

-_

_-

_-

-_

_-

_-

-_

_-

_-

-_

_-

-_

_-

-_

_-

G GC GC G GG GC GG C

2680

2690

-_

_-

-_

_-

2700

-

:ags.~ .

Nucleic Acids Research, Vol. 18, Supplement 2293 2720

2710

.

.

.

.

.

.

.

.

.

I

.

.

.

.

I

2730 .

.

.

I

.

.

.

.

.

I

2750

2740 .

.

.

.

I

.

.

.

.

I

.

.

.

.

I

I

.

..

......... ..........

-

-

-

-_

_-

_-

_-

-

-

-

-

-

-

-

-

_-

-

-

-

-

-_

-_

1

Pp Pf Pf P1 Ng Ld Gl

7 11 12 13 16 19 20 22 23 24 27 30 33 51 52 53 56 59 62

Hc Ms Tc Ta Po

63 68 72 73 75

Pc ........................I... _-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

%

......

__-_- _- _- _- _- _- _-

-_ __--_ -

-_

Hs

Oc Ec Tm Dm Le At Zp Vc Cv Ne Sc

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

__-_- _- _- _- _- _- _-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

__-_- _- _- _- _- _- _-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

__-_- _- _- _- _- _- _-

.... *.,R~ ~ -.. '. '. ~ ~ R --.:.' ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~.. -:.R; ...-R R __.........

__~~~~~~~~. ......

....... __.. __......

*...*....*.*...*.

.....

-

-

-

-

-

-

-

- -

-

-

-

-

-

- -

-

-

-

-

-

-

-

-

-

-

-

- -

-

-

-

-

-

-

-

-

-

-

-

-

- - -

-

Ng 95 Pa 96 Ap 100 Ap 101 Cv 102 Ec 105

Oh Me Mb Sa Si Tt Dr

153 173 188 189 201 203 207 208

Zm Os Gm Cm Ce Ce Cv

222 223 225 228 229 230 231

Mg

Hs 234 241 .~~~~~~~~~~~~~~~~~~~~~~~~~~~Rn 243 .~~~~~~~~~~~~~~~~~~~~~~~~~~Rc 247 .~~~~~~~~~~~~~~~~~~~~~~~~~~Sp 1 248 .~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 253 .~~~~~~~~~~~~~~~~~~~~~~~~~~Gn 255 .~~~~~~~~~~~~~~~~~~~~~~~~~~Cr .~~~~~~~~~~~~~~~~~~~~~~~~~~Sc 2S6 258 .~~~~~~~~~~~~~~~~~~~~~~~~~~Sc 260 .~~~~~~~~~~~~~~~~~~~~~~~~~~An 261 .~~~~~~~~~~~~~~~~~~~~~~~~~~~Sp

G A A A G C U C C

U CC UCG C UC UA A AU

UUA A GU GU UA G GC GC A AG CUCU AA G AU

A U AU

A GC U GG

Pa 262

269 .~~~~~~~~~~~~~~~~~~~~~~~~~~Lt 270 .~~~~~~~~~~~~~~~~~~~~~~~~~~Ls

I

2710

2720

2730

2740

2750

2760

.

2294 Nucleic Acids Research, Vol. 18, Supplement

1 Hs 7 Oc 11 Ec 12 Tm 13 Dm 16 Le 19 At

20 22 23 24 27 30 33 51 52 53 56 59 62

Zp Vc

Cv Ne

Sc Pc Pp Pf Pf

P1

Ng Ld

Gl

2770 .

.

.

.

2780

.

.

.

.

.

.

95 96 100 101 102 105 153 173 188 189 201 203 207

Hc Ms Tc Ta Po

Ng

Pa Ap Ap Cv Ec Mg Oh Me Mb Sa Sl Tt 208 Dr

.

2790

.

.

.

.

.

.

.

.

2800 .

.

.

.

.

.

.

2810

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

.

..

................................ ...

..

:.:.: .......

.......... ......

'41 -.-VM

:.-,

-

-

-

-

........... '"l,

GAGGG--GG - - G G--GA-AGG--GA-GAGGA-GAG-

G-__

-

-

-

-

-

-

-

-

-

-

-

-

-

...............

-

A A--G G---

-

.................

..............

.............

35' -

-

-

-

_

- - - - - - - - - - - - - - - - - - - - - - -

A- - - A-- -A----

-A-- --

A-

1030

1040 . ..

-

- -

U

-

-

-

U

C G C G

U

C G

-

-

-

-

-

-

-

-

-

- -

AG-G A A - -

G

C C

G GG A

A

A A

G G

A

-

-

- -

-

Zm Os Gm Cm Ce Ce Cv

-

A

:i

-

-

U

G

A

A A

U

A..

~ ....A

A G G A

U

U

G A A

U U G U :G -C1IG*JmUS U U G U G. - - - U U G U 1 - - - - G A AG-G - C G G. A A A - -

-

-

-

-

-

-

-

-

-

-

-

-

-

-

U C G U C G

35, A A

GG

A

- - - - - - - A A - - - - - - -A A U

A

A -

A

G' - -

G

C G G G

- -

KAG

AG G

G .

P35-2' 222 223 225 228 229 230 231

2820 .

.

GG - - GG - - GG - - GG - - GG - - GG - - -

....................

63 68 72 73 75

.

J

.OG

U

A A

A A U

-

-

- -

A A

U

U............................, - - U --

- - - - - A .A

234 Hs - 241 Rn - 243 Rc - 247 Sp

G - -

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

248P 1

247 Sp - - - -- -- -- -- -- -- -- -- -- -- -- -- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - 253 -l - - - - - - - - - - - - - - - - - - - - - - - - - - - - 248 Pl---c - - - -...-- -.. -.B.g 256 Sc - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -.............. 255 CrGm

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

.

.

.

.

.

-

-

- - - - - - -

-

262

269 Lt 270 Ls

-

- - - - - - - - - - - - - - - - - - - - - - -- - - -- - - - - - - -

2770

2780

2790

-

-

-

-

-

-

-

- - - - - - - - -

2800

- - - - - - - - - - - - - - - -

2810

2820

2830 .

.

.

.

G~.

2840 .

.

Nucleic Acids Research, Vol. 18, Supplement 2295

.

.

.

I

2850

2860

2870

2880 .................

CG..

..........

A G -C G A ............U UG GAG-CGAUU*J~~~~~~~~G-MUGGUU

CT~~WU A ~~UU GG- U ~~-~

~~~GAG C G A U U

A

A

G

UU

U U O C 4 UU UUA4~~~~~~~~~~~~~UU~~........ .....U.'.iG G -UCG A U.-'.-'U U

A~

C G GAU-UG A H A LU GA U UA - U G ..A..U.G..&AA.A-UGA..U. C.....

CUU

...........

UAGLCGLA ........ CA

C A

.

1060 ..

38

...G - U GG A G -U

....

G(

-CAA

-CA~V

R -.

R

()

ACA -CA

--&G()

-ACA C A G(C Y. - C A GC . - C A U.¢*

............B¢G

GRU A 1080

..

19

Zp 20

Ld GI

22 23 24 27 30 33 51 52 53 56 59 62

Hc Ms Tc Ta

63 68 72 73

Vc Cv Ne Sc Pc

Pp

Pf Pf

P1

Ng

.

.

.

.

.

.

.

.

U

uC.G

U :A U

UG::UA.

ul-,a.::A'

uCCG..G.. Po 75

1

.

1.

Zm Os Gm % U U A AG U AAUU ¢ U U A A G U .C U G Cm A U C `.. U U A A G U'. U G Ce Ce GG U U A A G U C Cv GU U A A G U C.C"

222 223 225 228 229 230 231

-U G A -U G A

U G A U G A

-U G A -U G A

- 38' A A A A A A G A G A G A GA

Ng Pa Ap Ap

Oh Me Mb Sa Sl Tt Dr

Gs A

U G A

G- U G- U G -U U - U U-U G- U U- U

AG AG A G A G AG

Mg

G'A

-38

A A A A A

95 96 100 101 102 105 153 173 188 189 201 203 207 208

A

-U G A A U G U

U U U U U

U U AA G U U UAAG U U U AAG U U U A AG U U U AAG U U U AAG U U U AAG U U U AA G U U UAAGU U UAA G U UUAAG U UUAAG U UUAAGU U U A AG U

U G A U G

U U U U U

1090

1

-

U G A

37

ACAG()

7 11 12 13 16

39'

AU

U G A

... .........

39

t- ¢ - .A .

U(A

1070

-CAGU

-

-38'-

C..

*.I... I..

I.

U.U U

Hs Oc Ec Tm Dm Le At

................ ................

* -s a *- iB

1050

U U

U G CU

UCGU G-GA~C 4UCU

C A

I ..

-

O G -U GA

UG

37

GA GA G A GA GA

-

AUA

AUUCUG U U G- UGAUCU U% AU U U

~U

CCA

36

U

A AAUU A A U U QAG A.......... U A U U A A A U U C. ... A A A U U 1:..G... A A A UU U A A U UCCG U A A U U~...U.....U G A U U CCG G G A UU G G G A U U G G A A U U G U A A U U A U A U U C A A A U U A A A U U CCGA A A U U A A A U U :A.. A G A U U U.CU C A U U A .

Cv Ec

39'-

39

G'G~ UU U A A G U

t--C-U

M A-C¢ GAUGA ;C UA A --A A -- - Hs234 ...U.CAGAAA-C AA-AA C U A -AA - Rn 241 - G A A AQ%UUgA -C -(A)UtG - UG AA(CGC U UG ---C A GU ----Rc 243

A C AG X()

C A U CC AA G Cf1 _(_) _C_ G U s )A A C A G(t)U U.

U A . ji .U.Wl U U- A U - -- -- -- - P1 248 ij..C -GUAAG G .A.. AGA. U U.. U C U) U Gm C.. 253 WK.~~*tU U A GUUAA-U e~dtC G - U G.A U QG U CUU UAA AAG 260 U A CQU*A ()GUUAAUUC~~)S A AGAcU A A GI UU iC A- )C - UCG 6 U CG --UC AGA A AGA G~U(U)U -A AAG UG -O C SAnR Cr 256 A UG - C A A A ':G C U(A .Lt269~~~~ A A A URn.VC UU C A A A S 258 ... ..

- -

2830

2840

---U C A

G.. U.GA..........C

.......

...

....

-

-

.~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a.--------

-

- -U

- U

G -U

AA- C--

GU

-

- - - - - - - - - -

I 2850

2860

ls248

2870

-

2880

as 267

.

~A

2296 Nucleic Acids Research, Vol. 18, Supplement

1 7 11 12 13 16 19 20 22 23 24 27

30 33 51 52 53 56 59 62

Hs Oc Ec Tm Dmn Le At Zp Vc Cv Ne Sc Pc Pp Pf Pf P1 Ng Ld Gl

.

.

.

.

.

.

.

A A C G A t. G A G A A C G A...AC. t GAG A A C G AA G A G A A C G AAPF.' G A G A A C GA G A G A A A C G A GAG A A A C G A G AG A A C G AA G AG A A C G A..A..C G A G A A C G A. A G A G - - A A C G A-'-. G A G - - A A C G A.AC GAG U - - A A C G GAG A U - - A A C B.k S.C. GA G

U U U U U U U U U U U U

-

-

U U U C C

-

-

A A A A A

A A A A A

C C U U C

-

-

A A A A A

A A A A A

...

C A... G A G C GRACG A G G A G C A AG C C GA G A G

2920 .

.

.

.

.

.

.

.

2930

.

.

.

.

.

.

.

.

2940 .

.

.

.

.

.

.

-

-

.

.

1110

95 Ng C--AAC 96 Pa U--AAC C--AAC C--AAC U--AAC C--AAC C- -AAC C--AAC C--AAC C--AAC C--AAC S1 C--AAC Tt C--AAC Dr C--AAC

G G G G G G G G G G G G G G

.1 ...

*.

C

A

..

--.-.. ..

..t..t....

A C

-

C .. AGG.

A.

A--....

1120

C C C C C C C C C C C C C C

A A A A A A A A A A A A A A

-

-

-

-

A A A - A - A - A - A - A - A - A -A - A - A - A

- - - - - - - - - -

- - - - - - - - - - - - - - -

- - -

- - - - - - - - - -

UU U

-

C C U C

- - - - -

U C U A

-

-

-

G - - - - - G C U - - - G C - - - - -

1130 .

.

.

.

.

.

.

.

.

U U C G

C C C C C C C C C C C C C C

- - - - - - - - - - U C-----------

.4

U

C-

A. A A U

-

-

-

-

-

-

-

-

-

-

- - - - - -

C -C-

cC

rgG U A

-

-

-

-

-

-

- - - - -

GU A A

-

-

-

-

-

-

-

-

-

-

-

-

-

- - - - - - - - -

.G C C C U U C G G G G--> G C C C U U C G G G G--U U C40

37,

..........

41

440

A

~

. . . . I . . .. Ap Ap Cv Ec Mg Oh Me Mb Sa

.

C AA G A GG C G A G __----A- C G AAC GAG _-_---A- C G " G A G - _---A- G A G _-_---A- C G A

11 100

100 101 102 105 153 173 188 189 201 203 207 208

.

__----A- _---A- _---A- __----A- __----A- -__---A--- - -A- - _---A- -

-

37,

Hc Ms Tc Ta Po

.

.

_---A- _---A- __----A- -

-

.-.... *....-.........

63 68 72 73 75

2910

2900

2890

.

41

*

222 223 225 228 229 230 231

Zm Os Gm Cm Ce Ce Cv

C--AAC GA£ C--AAC mA~C C--AAC C--AAC C--AAC C--AAC &4~ C--AAC

234 241 243 247 248 253 255 256 258 260

Hs Rn Rc

Gm Cr Sc Sc An

261

Sp

262 Pa

-- -- -- -- -G C A A G - - U - A C C C .....AA U........ -------A C A - U U A A U CCG A U A A C AC.t U--------------U U A - - - - C C UA G G - G A A A A U-~~~~~~~ U --A A C .A"G. G A A - - A C C...C....LfiCGS - U . .. . U G A U. G C C - - - - - G.U--------: A_ U _A U A A A C ___A A A - U:W,:.:_A-:-i¢ -h:is :-iLU -:-:: -4 U:: ___A_ G_ _A _U A A A C AA. A A A - U U-- A A CG A A - A A 2e.U C A U U G C - A U U U g-WU~ U U A A G A A- - A G t --U --- U U G G G A A U AA G A A - - A A C G U - A A A

269 Lt 270 Ls

- - - - - - - - - - - -

Sp P1

-

-

-

-

--

-

-

G G G G

C C C C

A

- C A - .....G G C A - G C A

A A . A

-

-

-

-

-

A A A A A A A

U A U G-------

C C C C C C C

CAU U- -

- - - - -

A UA A

A A U U U Ai..

.

....................

- A A - A A G G - GG

-

I.

-

-

_

_

_

-

A

A

C

G

U

- - - - G A

C.

- - - - - - - - - -

C..U.g

A A G G A G

U

*

i

:

4

1

.l.U........ A.

-------

-.

U

AU

U _A U

- - - - - -

A _

_

__-

_

_

_-

U

-

- - - - --- - --

2890

2900

2910

2920

2930

2940

.

G.

.

Nucleic Acids Research, Vol. 18, Supplement 2297 2950

I

.

.

.

.

2960

I

.

.

.

.

.

.

-.

.

%A

U IJ

2970 .

.

-C %,

.

.

.

.

.

.

2990

2980 .

.

.

.

.

I

.

.

.

I

.

.

.

.

I

.

.

.

3000 .

.

.

.

.

.

.

.

- - - %.

J .)G........*.. G ............... (C.)1.. .. :..A..(. A ..X .. A.A.A. ..............

.

.......

U U A U U

-

-_

------------------------------------------

----------------------------------__------

----------------------------------__-----------------------------------------------

..G.)..A..

UC

G

-_

U0A h.A A A A A A

A UA

A

AAAAAAU

UU UU

A.AA AC .A A U AA

1-

-C... G G A A CA -C .. .....IU U U --.. .A.... U____ 1-~~C

G A U A G C A A A

-.%

u A IIu II u A

W--.--W

M..v

----------------

.... ..

Hs Oc Ec Tm Dm Le At

- - -- -

Vc Cv Ne

. ... U U ----

Pf Pf

Sc _- Pc pPp

-_

.%.4A.Af:^{|)#::|::Ui A G U U U

---- ----

-- Zp

------------------------------------------

UA. U AA-AUUQAAUAUA&GUU) OCaU

.

----------------

----------------

P1

Ng

Ld

GI

1 7 11 12 13 16 19 20 22 23 24 27 30 33 51 52 53 56 59 62

41

-

-

- -

.

Hlc 63 Ms Tc Ta Po

- - - - - - - - - ------------------------

Ng 95

-_ __

Ap 100 Ap 101

__

__--_

- - -- - - - - - - - -- - - - - - -

Pa 96

Cv . ---Ec . -------___________ _-.. Mg .---- - -Oh . Me - - - - - - --- - . -._ Mb - ---- - - - - -- - - - - - -- - - - -- Sa . Si - - - - - - - - - - - - - - - - - - - - - - - Tt - - - - - - - - - - - - - - - - - - - - - - -Dr

-

-

68 72 73 75

102 105 153 173 188 189 201 203 207 208

41 -_

_-

_-

_-

__-

-

-.K. .........

Zm 222 Os 223 Gm 225 Cm 228 Ce 229 Ce 230 Cv 231

-

-

-

-

-

- - .......

.......

>...

. .

P41-1

.~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 234 241 .~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

243 .~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 247 .~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ - - -_ S s 2356 .__ _P1248 GCC AA A GW.X..u...UU G G. AACCGA ACCG.... AA... CG A G.G AGA C AA ...U GACC> C G C UA C UA-A -m5 __

-_

- - - - - -- - - - - - - - -- -- - -

Rc 243

247 Sp - - - - - - - - - - - - -.Pa262~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~...... - - - - - - - - - - - - .Sc_________258_____241 Scm269 .Lt~~~~~~~~-: - - - - - - - - - - - - - - - - - - - - - - - - - - - - - G-,CGAGGAGCCGAG'. ........ PLs 248

-_-___-__-__-__-__-__-__-__-_-- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -_

_

......L... .......7 0.:....... _

_

_

_

_

_

_

_

...

2950

261 .~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

2960

2970

2980

2990

3000

Nucleic Acids Research,

2298

1

Hs

O

18, Supplement 3040

3030

3020

3010

7

Vol.

3050

-

c

11

Ec

-

12

Tm

-

13

Dm

-

16

Le

-

19

At

20 Zp

-

2 2 Vc

-

23 Cv

-

24

Ne

-

27

Sc

-

30

Pc

-

33

Pp

-

51

Pf

U

52

Pf

U

53 P

U

~

I

56

Ng

-

59

Ld

-

U

U

A

U

U

-

A

U

U

-

U A U C.--

62 G 1

63 Hc 68 Ms 72 Tc 73 Ta 75 Po

95 96 100 101 102 105 153 173 1 88 189 201 203 207 208

Ng Pa Ap Ap

222 223 225 228 229 230 231

Zm Os

Cv Ec

Mg

Oh Me Mb Sa SI Tt Dr

Gmn

Cm Ce Ce Cv P41-l'

234

Hs.--

241

Rn.--

243 247

Rc.--

Sp.--

248

PI

253

Gm

-A

C

A

G

G

U A G C U G U

Ox. ''O:

..........

C A Q

G C C G G C

255 Cr.--

256

Sc.--

258 Sc.--

260 An.-261

Sp.--

262 269

Pa.-Lt.--

270

Ls.--

3010

3020

3030

~P41-2

3060

3040

3050

3060

Nucleic Acids Research, Vol. 18, Supplement 2299 .

.

.

I

3090

3080

3070 .

.

.

.

.

.

I

.

.

.

.

.

.

.

.

.

.

.

.

I

3110

3100 .

.

.

.

I

.

.

.

.

I

.

.

.

.

I

.

.

.

I

.

3120 .

.

.

.

.

.

.

.

I

Hs 1 Oc 7 Ec 11 Tm 12 Dm 13 Le 16 At 19 Zp 20 Vc 22 Cv 23 Ne 24 Sc 27 Pc 30 Pp 33 Pf 51 Pf 52 P1 53 Ng 56 Ld 59 Gl 62

-

-

Hc Ms Tc Ta Po

63 68 72 73 75

95 96 100 101 Cv 102 Ec 105 Mg 153 Oh 173

Ng Pa Ap Ap -

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-_

Me 188 Mb 189 Sa 201 S1 203 Tt 207 Dr 208

- - - - - - - - - - - - - - - -

-

- - - - - - - - - - - - - - - - - - - - -

Zm 222 Os 223 Gm 225 Cm 228 Ce 229 Ce 230 Cv 231

P41-2 --- - ----------------------- - ---------- Hs 234 - - - - - - - - -- - - - - -- - - - -Rn 241 - - - - - -- -- - -- -- - - - - - - - - - - - - - - - - - - --

_

_~ ~ ~~7G C

Q.W)

C

XA

U C G

- - - - --

Rc 243

- - - - - -

Sp 247 PI 248

- -7 ---....... .. ... : Q W U U U..A U U .U. . A G U G U G C G %.J.Cr~~~~~~~~~~~~~~~~~~~~~~~ A )A:Q W C G 253 255

.__ _Sc

256

_Sc 258 .__ - - - - - - - - - - -- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - --

Cr 255

--

_Sp .__

-_-_-_-_-_-_-_-_-_-_

_-..______________________________

- - - - - - - - - - - - - - - --- - ---- - -- - ----

I

I 3070

3080

3090

3100

3110

3120

261

Lt 269 Ls 270

2300 Nucleic Acids Research, Vol. 18, Supplement 3140

3130

*

.

.

.

.

.

.

3150

3160

3170

3180

.1

.

.

1 Hs 70c 11 Ec 12 Tm 13 Dm 16 Le 19 At 20 Zp 22 Vc 23 Cv 24 Ne 27 Sc 30 Pc ---------______ 33 Pp ------_________ 51 Pf -.-.---________ 52Pf -P-----________ 53P1 --------------56Ng - .-.----_______ 59Ld - .-.-_______ 62G1 ---------------

63 68 72 73 75

Hc Ms Tc Ta Po

95Ng.

96 Pa. 100 Ap. 101 Ap.

102Cv. 105 Ec. 153Mg. 173Oh. 188Me 189 Mb

201Sa. 203S1 207 Tt. 208 Dr.

222 223 225 228 229 230 231

Zm

Os

Gm Cm Ce Ce Cv

P41-2'

P41-2-

234 241 243 247 248 253 255 256 258 260 261 262 269 270

Hs

-_

Rn Rc

-_ -_

Sp

P1 Gm Cr Sc Sc An Sp Pa Lt Ls

A:Q:A:A.:A

__-

__-

_- -_ -_ -_

-_

-C-G--U-G -C-CGAAC-

C

----------- -_ ------ -_ ------ -_ ------ -_ ------ -_ ------ -_ ------ -_

G

G

A

______3130

U

_______-

G

C

-_

G

C

G

A

A

C

_-

-_

_-

_- _- _- _- __- _- _- _- __- _- _- _- _-

-_ -_ -_

_- _- _- __- _- _- __- _- _- _-

_-

_-

_-

_-

......

G_C _G_G_C

-_ -_ -_ -_ -_ -_ _- -_ 3140

_______-

_______-

U_C_G_

_______-

3150

_______-

.....g

.......... Eg.."-...a.''.S..g

_______-

3160

U

_-

_-

_-

,_ A cu_.._ .._ ...._

3170

3180

18., Supplement

Nucleic Acids Research, Vol.

3210

3200

3190

3230

3220

2301

3240

--- ss 1 --O c-c 7 --Ec- 111 -Tm- 121 131 Le 16 . . At 19 . Zp 20 . -Vc 22 Cv 23 . Ne 24 . SC 27 . -----------------------------------------------------------Pc 30 . Pp 33 P 51 .RE P 52 .RE . P1 53 -----------------------------------------------------------Ng 56 Ld 59 . . GI 62 -

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

. . . . . . . .

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

------------------------------------------------------------

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

---

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

63 68 72 73 75

Ng 95 Pa 96 AplOO0 AplOl1 CvlO02 EclOS5 mg 153 Oh 173 Me188

. Mb 189 . Sa 201 . SI 203 . Tt 2 07 .0r 208 -

-

-

Hc Ms Tc Ta Po

P41-2'

Zm Os Gm Cm C. Ca Cv

222 223 225 228 229 230 231

P41-3 -

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

s

-

234

1 U-0 3 .Cr~~~~~~~~~~AGU UCUUGCA25 cr 25 A

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

I

I 3190

I

I 3200

-

-

-

-

-

5

5

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

I

Gm

-

.S -

I

-

-

-

-

-

-

-

-

Sc 25 S 2 n 26

pa262 2 a 26

-

-

-

I

-

-

-

-

-

I

3230

-

-

-

-

-

I

Lt 27

-

3240

9

2302 Nucleic Acids Research, Vol. 18, Supplement 3250 .

1 7 11 12 13 16 19 20 22 23 24 27 30 33 51

52 53 56 59 62

Hs Oc Ec Tm Dm Le At Zp Vc Cv Ne Sc Pc Pp Pf Pf P1 Ng Ld Gl

.

.

.

I

.

.

.

.

I

3260 .

.

.

.

I

-_

_-

_-

_-

_-

_-

-_

_-

_-

_-

_-

_-

.

.

I

.

3270 .

.

.

.

I

.

.

.

.

I

3280 .

.

.

.

I

.

.

.

I

.

3290 .

_-

.

.

.

I

.

.

.

.

_-

_-

__- _-

_-

__-

I

_-

3300 .

.

.

.I

.

.

.I

.

_-

_-

_-

_-

_-

_-

_-

_-

-_ __--

__--_ -_

-_

_-

_-

_-

_-

_-

-_

_-

_-

_-

_-

_-

-_

_-

_-

_-

_-

_-

_-

__- _- _-

_-

_-

_-

_-

_-

_-

_-

_-

__- _- _-

_-

_-

_-

_-

_-

_-

_-

_-

__- _- _-

_-

_-

_-

_-

_-

U G U U C.A

-_

__U

_- _-_-_-_-_-_ _-_-_-_-_ _-_-_-_ __-_-_ __-_-_ __-_-_ --_ -_ _--_ __-_ __-_ __-_ 41'

63 68 72 73 75

Hc Ms Tc Ta Po

95 96 100 101 102 105 153 173 188 189 201 203 207 208

Ng Pa Ap Ap Cv Ec

222 223 225 228 229 230 231

Mg Oh Me Mb

Sa S1 Tt Dr

Zm

Os Gm Cm Ce Ce Cv P41-3

234 241 243 247 248 253 255 256 258 260 261 262 269 270

Hs Rn Rc

-_

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

-_

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

-_

_-

_-

_-

_-

_-

-_

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

-_

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

_-

Sp P1 Gm C GA C G A C G U C Cr Sc Sc An

P41-3'

..Q.

.

_-

_-

_-

G G C G G A G A A A G j>

_-

_-

_-

-_

-_

_-

-_

_-

_-

NS¢A U U C A G G U G A G C C

-_

---------------------------------

-_

---------------------------------

-_

---------------------------------

-_

---------------------------------

Sp

---------------------------------

Pa Lt Ls

---------------------------------

-_ -_ -_ -_ -_ -_

---------------------------------

3250

3260

3270

3280

3290

_-

_-

_-

G__..... U.g..S. G G U G

_______-

---------------------------------

_-

_- _______- _-

_____-

_______-

_______-

3300

.

-

-

*:Ait. -

-

-

-

Nucleic Acids Research, Vol. 18, Supplement 2303 3310 .

.

.

.

I

3320 .

.

.

.

I

.

.

.

.

3330

I

.

.

.

3350

3340 .

.

.

.

.

.

.

.

.

I

3360

. .

I . . . .

I

. .

. . .

.

-- - - - - - - - - - - -- - - - - - - - - CCGA GC -CCGAGCz f A r

Hs Oc Ec Tm Dm Le At

, .I.__ \, MZ\

:-~ ~ ^. .L .' -

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

:

@i: -

5,;- M--. -::->.IU -

-

-

-

-

^

-' .

-1

F-v

-

- - - - - *;.:

-va

-

..;

U G .GM .X.W _.

G

ACU U C OF

---------------------------------------

,,.,it.

C A U G U U - G A - U U - U C

-------------------------------------__ -------------------------------------__ -------------------------------------__

---------------------------------------

...

C U C U G C GC C G.

1

1ii.:.:.iir,

i

U c U c

.i.i.I1r

II rii

Ld GI

Hc Ms Tc Ta Po

63 68 72 73 75

Zp

Q_

Vc Cv Ne Sc Pc

..,,, ,

X.ii.W X.gX .St$Si:i..:r::

1 7 11 12 13 16 19 20 22 23 24 27 30 33 51 52 53 56 59 62

Pp

.. ...UCUC U U t

C,

U U

Pf Pf

G. c~ f. u

U U U A C

P1

U

..Q..

C C C G C~A~ACU~ A UU ¢¢.

*

Ng

41 '

-~~~G .~~~G -~~~A .~~~G

-

-

-

Ng 95 Pa 96

Ap 100 Ap 101 Cv 102

.~~~G Ec 105 Mg 153 .~~AU Oh 173 .~~~G Me 188

Mb Sa Sl -~~~G Tt .~~~A Dr -

189 201 203 207 208

41' - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -- - - -

-

-

-

-

. - - - - - - - -C A U A A .

-

-

-

-

-

-

-

-

- - - - --

4..:.,........... A

-

-

-

A Zm 222 A Os 223 U A Gm 225 - - Cm 228 - Ce 229 A ....... Ce 230 -.- Cv 231

- - - -- - - -

......

-

-

P41-3'

-__

__ -__

__

U G G U A C G U A

{

. ____ --- --- - - -- - ----------- -- -- - - -- - -------Hs 234 __ _----- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Rn 241 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -- - Rc 243 . - - - - - - - _ Sp 247

.~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P

A ...................................... 253 ._Cr255 ------------- - --------- -r - - - - -_ Sc 2 56 . ~ ~ ~ ~ ~ ~ ~ Sc 258 -- - - - -- - - - - - - - - - - - - - - - - - -_- -2.An___________________ An 260 .~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 261 - - -- - - - - - - - - - - - - - - - - - - - - - - - 261 G U A C G C C C C

A A A A C

C A A A C G A A A

~~~~~::~~~~~~~~.A~3tJ1

-

. . .--

3310

3320

I

1

3330

3340

- - - - - - - - - - - - - - - - -- - - --- - - -- --- -

3350

3360

SPaLt 269

Ls 270

260An-

2304 Nucleic Acids Research, Vol. 18, Supplement .

.

.

3370

I.

.

.

.

.

I.

.

I.

.

.

.

59 62

63 68 72 73 75

Pp G U U Pf - A Pf A U A P1 --A Ng Ld -

Gl

-

A A A A

41 '

G G iG ,G IG

* *

Ce

Ge

2 31

234

241 243 247

Gv

I

.

.

.

.

I

-

3410 .

.

.

.I

.

.

.I

3420 .

.

.

.I

.

.

.

.I

_---

-

G G G G G

C C A C U

C U U U C

A A A A

A - A - A - A - C A A -

- - - - - - - - - -

- - - - - - -

1170

1

G-AiA G- A

G A C A -GAC A - - U A U A - - G A U A - - G A U A .------ -- -- -G A U A - - - - - - - - - - - - G - U A - - - - - - - - - - - - G A U A -- - - - - - - - - - - G A C A i- - - - - - - - - - - - G U C A i- - - - - - - - - - - - G U C A ,- - - - - - - - - - - - G U C A ,- - - - - - - - - - - GGA- A - - - - - - - - - - - - %a G Am -A - - - - - - - - - - - - -- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

G-AI G-A G- A G- A G- A G- A G-A G- A G- A G-A G-A G- A

-

A A A AA A A A A A AA A A -

- - - - -

-

- - - - -

- - - - -

-

- - -

. .

- - - - - - - - -

42 _ A

UcA

----G U ----G U ----G A

A

A C AU A AUU C U A -

-A

..

G

UAAUAAUAAGAAA A - -

___-G A

-

----G

-A

A

GAAUAA-

UAA--

Hs.

Rn ---

.

G U U CA--GC G U U CAGCG G U U UA--U U C UA--. - - - - - - - - - - - - - G U C UA--UU UA-._____ - - - - - - - - - - - - - G U U UA-- - - - - - - - - - - - - G U U UA--- - - ---- - - -- - GU U CA - - - - - - - - - - - - - - G A C UA-- - - - - - - - - - - - - G A C UA-- - - - - - - - - - - - C U AC AA-- - - - - - - - - - - A U G AA-- - - - - - - - - - - - - G A U AA - - - - - - - - - - - - - - G U C UA-- - - - - - - - - - - - - A U C UA-- - - - - - - - - - - - - G U C UUA-- - - - - - - - - - G U A A A C 'UA-- - - - - - - - - - - - - A A U IU GC - - - - - - - - - - - - - G C G iA---

U A A A: C. AA& .. -

Rn

- - - - - - - -- - - - - - - - -- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Sp.

253 Gm

255 Cr

u.

.

GA

256 Sc AX .h...L. U A - - UU 258~~~~UU U ScSc. A -.". - - fKA. V." 260 An - - - - - 261 Sp - - - - - - - -UU 262 Pa - - - - - C U A A G U -

Lt

-

- - -

A U

- - - - - - - - - - - - - - - - U --A.uUU U A - - - -

258

26 9

.

- - - - - - - - - - - - - - - - - - - - - - -

...

299 7m

230

.

A A A A - A

1

Sa SI Tt Dr

--

.

1160

*

Mb

-

3400 .

-

1150

I * *

.

Oh Me

229

I

42

Ng Pa Ap Ap Cv Ec Mg

228 C

.

A-A C A G A C A * A A A G AUUC G -A CGGG-

.

G U

.

G - A G -A Cg)~ G -ACu....

-

G U U

.

G - A CVAB.

Hc Ms Tc Ta Po

223 Os 225 Gm

.

G-A C A AG-A C A A- A C A A - A G - AC A - A CUA

G G G G

1140

95 96 100 101 102 105 153 173 188 189 201 203 207 208

I *** I

.

IHs C CC C 7Oc C CC C 11 Ec C. ' 12 Tm A C A A 13 Dm C A C i> 16 Le . G C 19 At : G C 20 Zp GGC C G C A 22 Vc GCG G 23 Cv 24 Ne .G G G A i 27 Sc 30 Pc GC G

33 51 52 53 56

3390

3380 .

-

iN"..

..........

^

- - - - - - - - - - - - - - - -

--G -

- - - - -

-

J

C U U U A U AU----G A U ---U A U A---

-

270OLs.--

3370

3380

3390

3400

3410

-i4ZU

AU

Nucleic Acids Research, Vol. 18, Supplement 2305 3450

3440

430

* IC... IA

.1I

I

.

.

.

.

.

I

.

.

.

.

.

.

I >

-C G A G -C G A G A U G A G 4G A A G G A A G

A A A C

A C A U A U U U%. U.

G G G G

U

.

U A. C A U A

~CAt

.Afi::..

A A G A . UG A A A U 1 A G G A C G UG

-

GA

*t*~G A CA C A AG A C(~tA

U U C

A

A A

~~GA ~G A

A A

C~

AIJ~

1190

1200.........

1180

1190

1200

AAGG AAAGG

A A A A A A

IAAGG

AAGG AAGG AAGG

1AAGG

,AAGG ,AAGG

A AC AC AC AU

'AAGG

jAAGG iAAGG AAGG

1220

I. ..

.

A-A-A-A-A-A-A--

A A A A A A A

AAAGG~ A G Gt

AAGGWJ AA G

:..'.:....S.. s..'s

A

-----------

A

AG

...

.

.

.

.

I Ng 95 Pa 96 Ap 100 Ap 101 Cv 102 Ec 105 Mg 153 Oh 173 Me 188 Mb 189 Sa 201 Si 203 Tt 207 Dr 208

-34'-

36'9

42' AAGGA AAGGA A AG GG A AG G

I

A ----U A -- -- C A -- -- C A -- -- C A -- -- U A -- -- C A -- -- U A -- -- U AA -- -- U A- - --U A -- -- U A -- -- C A -- -- C

A

|AAGG

....

Zm Os Gm Cm Ce Ce Cv

--U --U --U --C

--C

--U -

-C

A

U

A A A A

A..C

O C C A A QGu.:::.::::~:

-UA

------

A

-

_______________________ U _

_ _ _

_

_

_

_

_

_

_

_

_

_

_

_

. .

I

3430

3440

222 223 225 228 229 230 231

...

-U A

U AAC U C CCAA -:G: A UC A A ~ G~AC-UU A-A-AA--A CA. U-C A -----C A A U UAA~ .. AC)UA G G AC G U -C A G C u CAQR -----GC -A -UA () A CAAA -G~ AC CGU -CA C--CUCOC A-GA ~C A U*U A AG U ~CA1 ~~A A G G A UC At&#(U AGU(~~~~) ----CA A A~~6 AU ACA¢UCAG ------ GA IUA - UAA~~~ ~~A~~AcA ~AAU C -AAi U A A~ GC A~~~~ A -A--U AA ~~~~~ ~~~A U C - - ------A A G A A A: A U U A A--U O A ~1AG U C --A ~ $ -A - - U A?K~~~~~ A &W~~~~~W~~4AAU A AA -------t G U U A U --A-U .AU GQGA - -U &A.~~~ A -4G*~~~~- -~ CU CA A.A A &A........GA.U.A.

----AC

--

63 68 72 73 75

34,-

1210 .

.

Hc Ms Tc Ta Po

P1

Ng Ld

- - -

1180

GI

24 27 30 33 51 52 53 56 59 62

Ne Sc Pc Pp Pf Pf

G A A - U ---GAA-U - - - G A A - A - - - G A A - U - - - G A A - U

AAGGA A A A G G A A .C GA A AGG A AG GA CuU CAX..: A.-. . A.. A A G G A QQ&GG C C A

....

Cv 23

-~

36'

42'

Hs 1 Oc 7 Ec 11 Tm 12 Dm 13 Le 16 At 19 Zp 20 Vc 22

U U

~G A J**G A

At

3480

I.... I

U

U U U U U U U U U U

)..G

mJQG A A G U A A AG A..

A A QGG GA A A A A A G G A A

GA

A

A tfl¢ C(C)AGA A A G A U G g Ag UGAA G A A A GA ~C C~~G A ~G A A ~~ GA & ~G A

o.0

A(W)

>:.>:.........> -...:. ..:.>>.. ...>.>.>. ...... :.>>>>

...>

.i.A

C C C C C

C - CG A G A U

3470

3460 .

.

3450

3460

3470

3480

Hs 234 Rn 241 Rc 243 Sp 247

P1 248 Gm 253 Cr 255

Sc 256 Sc 258 An 260 Sp 261 Pa 262 Lt 269 Ls 270

.~

2306 Nucleic Acids Research, Vol. 18, Supplement .

.

3490

.

.

3500 .

.

.

.

.

1 Hs :..4 G C U

7Oc 11 Ec 12 Tm 13 Dm 16 Le 19 At 20 Zp 22 Vc 23Cv 24 Ne 27 Sc 30 Pc 33 Pp 51 Pf 52 Pf 53 Pl 56 Ng 59 Ld 62 GI

G 4G 4G . G 4 G 4 G 4 G

C C C C C C C G C G C G C G C G C G C G C G C G C G C G C

4 4 4 4

4 4

C C C

C C C C C U U

U U

G

....

63 Hc

A A A A A

A

72 Tc 4GCU A 73 Ta 4G C U VA C4. 75 Po A UCG C U

.... 95 96 100 101 102 105

Ng Pa Ap Ap Cv Ec Mg Oh Me Mb Sa Sl Tt

G G G G G 4.G G G G G G G G G

153 173 188 189 201 203 207 208 Dr

.. ...A

...

CZm

G

GC

229 Ce

G

G

22

..

..

A

4G C GACe(

Cv

4G C

234 241 243 247

Hs 4-C Rn . A Rc G Sp..- G

248

p1i

G AA

253 Gm 255 Cr

G C C G C UA

258 260 261 262 269 270

Sc An

~4G UAA 4

Sp . Pa Lt Ls

G G G G

U C C

C C C C C

A A A A A

.

.

.

.

*

.

3540

.

U

A~

G A A

AA C

A A A A A

A A A A G

U U U U C

-

- - -

- - - - - - - - - -

- - - -

1250 *

.

ACAGA---ACAAA---ACAAA---ACAAA---ACAGA---ACAAA---ACAAA---ACAAA---AAUAG---ACAAA---ACAAU---AC A A U

-

-

-

-

ACAAA---ACAAC----

..

..

A C A A U

A C A A C A A C A A U A C A A C A A U l. .¢.2A C A A C A A U A C A A C A A U A C A A ...,x .................A...............

.U44C444 .

230 Ce

Sc

.

..

.. ..

2 31

256

.

E43-1

A C A AU

AA AA AA GA

.

.

43

.. A~~... ~

.

3 _ _ _

AC AC AC AC AC AC AC AC AC AC AC AC AC AC

U U U G U

~

.

U U U

1240

U U U U

~

3530

3520 .

UUGUUA ____AC

32'

~

.

.

....I.. .I

34'

~

.

C C C C C

1230

I...

C U C U C U C U C U

C C C C C C C C C

.

43

A'A~A

4G C

3510 .

.

......

4GUG C UCA

68 Ms

.

32'

..........

. ..

.

ACAC ACAC ACAC ACAC ACAA ACAC ACAC ACAC ACAC ACAC ACAC ACAC ACAC ACAA ACAC A C A A AC AC ACAA ACAA ACAC

U C C U

G C C

34'

.

A A A A A A A

A A A -

- - - - - - - - - - - - -

- - - - -

U A

- - - - - - -

AUQ44C U AC - A ........... U AGO . ..... -A..C-. AC-AAUU......-......... B. -----. ...f...-......A C AA U UA .......................

-

-

A AAAUG. U CU

A 4 ~~J 4WflICA ~A UC 4>4:.|A4i

--AAU

-

-

-

-

-

A ACA UAG - - - A A AU U % - A AU COCA A ~~ ~E ~~cc~tj AA...uX - ... U A~~A -U -U A - C- ~$A .. ~ C. A ~A AX .. U AAUG U A-

A U

A

A A

U

C Ui

.A.(:4LI4.gC GAOUAA ().i~

...Ct

A

UA A A A

- - - -

-

-

- - - - -

3490

3500

3510

3520

3530

3540

~

Nucleic Acids Research, Vol. 18, Supplement 2307 3550

3570

3560

3580

3590

*

3600

*~**

~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~. . . . .

-- U GO C 1J A -U GO C U A ~ ~ ~ ~ ~ ~-~~~~~~~~~...... ~ ~ ~ ~ ~ ~ ~ ~ ~C~ ~ ~ ~ ~ ~ ~ ~ ~ ~ G-00 0 ~ ~ ~ ~ ~ ~u~ ~ ~ ~ ~-~ ~ ~ ~ ~ ~~~ ~~ ~~~ ~~ ~~~ ~~c~~~ ~~~~~ ~~~ ~~c~~~ ~~ ~~ ~~~~~~~. ~. . U A C -U G G-u -~~~~~~~~~~~~~~~~~~~~~~~~~~~AU~~~~~~~~~~~~~~~~~~~~~~~~ .......CC... A G G-CU U A~ ~ ~ ~ ~ ~ ~U~ ~ ~ ~ ~ ~ ~ A~ ~ ~ ~ ~ ~ ~ ~ ~. ---------------------------------------U CA CA-CC - CG A--UG .U CU A U A A ~~~~~~~~~~~~~~~~~~~~ U A C GAC-U.'A U -----------------------------------------UGC ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~. . . . . . . . . -CC -~ ~ ~ ~U~ ~ ~ ~ ~A~ ~ ~ CC.. u G GA ...........

---------

U U

U~

~fl.t&~~GGGAA-

Hs

Cc Ec Tm Dm Le At

Zp

Vc U1~~.44.~ A- Cv *G A- Ne SC c -G~~~~~~~~~~~~~~~~~~~~~~~~ G G A- PC C~CG A- Pp 41---' G A- Pf U &~~JAC~GA- Pf

~ GO UU. U A U U U UGO--UAU

- -

t A U A CU V

G

*~(C~A~ UG A- P1

~~AAU - ~AG- Ng Ld j.0. Gl GO

44

G A G A G A G AG A-

Hc Ms Tc Ta Po

7 11 12 13 16 19 20 22 23 24 27 30 33 51 52 53 56 59 62

63 68 72 73 75

1260

Ic 1c 1c -c 1c

3 Ic

3 1u 1c

3 -

A

%c

3 Ic

3 1c

3 1c

3 1c

3 1u

3

-

-

-

-

-

-

-

-

-

-

-

-

Ng 95 Pa 96 Ap 100 Ap 101

Cv 102 Ec 105 Mg 153 Oh 173 Me 188 Mb 189 Sa 201 Si 203 Tt 207 Dr 208

44

GU C -& G G ~~~UC ~U C ~ ~ ~ ~ ~ ---- ~~~~ U A --- ~~ A ~ U A -& A G A

G C G A U -~-C~~G C C1CG C G C G AU GO G GOC G A U ~G C G C U U A G C G C U U

~~~UG C A ........C.

Zm 222

Os 223 Gm 225

Cmi

228

Ce 229 Ce 230

G C

Cv

231

Hs

234

Rn

241

Rc

243

Sp 247

a. A..... A A G C -

-

A A

A -

-A A--A -

G

AU

A

U

G-

-UAUAUUUAU

3560

3570U

3 580

3590U

248 253

Cr

255

Sc

256

Sc

258

An

260

Sp 261

I

I 3550

P1 Gm

3600U

Pa

262

Lt

269

Ls

270

2308 Nucleic Acids Research, Vol. 18, Supplement 3610 .

.

I

.

.

.

.

.

.

I

3620 .

.

1 Hs -CA70c -CA-UA-GO-GA-CA-CA-GA-GA- A A A: -GA-GAA A - GA -AA-AA-

11 Ec 12 Tm 13 Dm 16 Le 19 At 20 Zp 22 Vc 23 Cv 24 Ne 27 Sc 30 Pc 33 Pp 51 Pf 52 Pf 53 P1

.

I

.

.

.

3630

I

.

.

.

.

.

.

I

.

.

.

.

I

3640 .

.

.

.

I

.

.

.

I

.

AAC-

---

---C.

.

.

UAAC---C IAAAU-------...;

UGAUGGGGAU UGAUGGGGAU UGAUAGGGAU UGCUAGGGAU UGCUUGGGAU UGAUGGGGAU UGAUGGGGAU UGAUGGGGAU UGAUGGGGAU UGAUGGGGAU UGACGGGGAU UGAUAGGGAU UGCUGGGGAU UGACUGGGAC UGAUGGGGAU UGAUGGGGAU UGACAGGGAU CACUUGGGAC U

AA C A

A

U A UAAU--------------C..u~... A U -C-------U USUA A A .....

.QAAU-----UQ--%gCG-AAAK UAAU-OU------aUUG-AAU VAAU--------QVUVIAAC UAAU ----UU---- LJGGAAA UAAC.-------CCUUAGUC

-AAis

-U

----------

UA

U

----

-

-

-

-

-

-

-

-

-

-CU

AC----------

U

U

U U AC A : GAU CAA:

U.tAAUUGGGAU

U U U U C A A

AC CCCCGGAA AGOC-----------------------

AA-----------

GGCCGGGAC

433'

*... -GA - AOG -C GOUAA(4C--C U A A A -AA - ... A G G G A A -C UCLi--C U A A A A CC.W G-OCGCA AoCe-AA - G G GA A $#O~~~~~~ -A- As V C UCAU A A AA --- AA - ...GCGC GA SC CGA AtICCC-.AGCAtA ". -AA - S ax"G C A G-~~~~~~A -U U A A A ...........

;

1270

1280 I. . . .

95 Ng -GA96 Pa -GA 100 Ap -AA101 Ap -GA102 Cv -GA105 Ec -GA153 Mg -AA173 Oh -AA188 Me -GA189 Mb - GA 201 Sa - GA 203 SI -A A 207 Tt -AA208 Dr -GA-

I

. . ..

C.GCAGUUGGGAU U U~1300 A G U C C G G A U

I

*1..

. . ..

**

I.

CGU A G U C C G G A U

--------ACAAA --------AUAAA --------AUAAA .-------ACAAA --------ACAAA --------AUAAA U

G U COGGA U

CQ

AG U COGGAO A G U CC G GA U C4C AG U CCG GA U

454

UCCAGUUCGGAU

-GU-------AAA .--------AAAAA .--------AAAAA -U-------UAAAA ---------AAAAA ---------AAAAA A AA A ---------------

gUCAGUUCGGAU

CC.gCAGUUCGGAU .CUCAGUUCGGAU CgUCAGUUCGGAU CUCAGUUCGGAU CCCAGUUCGGAU

-U-------GAAAA

CtCCAGUUCAGAU

44' 222 Zm -GA223 Os -GA225 G(n 228 Cm 229 Ce -GA230 Ce 231 Cv -GA

-

CtUUAGUUCGGAU C..CCAGUUCGGAU C.*tGUAGUCAGGAC

1290

I....

G U U C G G A U

.

WtVA

-GA

.. .1

3660

.I

.

CA AC

44,

234 Hs 241 Rn 243 Rc 247 Sp 248 P1 253 Gn 255 Cr 256 Sc 258 Sc 260 An 261 Sp 262 Pa 269 Lt 270 Ls

.I

.

--AAC -CC---#UO---O AAC--------------C CAAC

56Ng

Hc Ms Tc Ta Po

.

-A A C

59 Ld -AA62 Gl - G A -

63 68 72 73 75

3650 .

43, -------AAAAA

GCUG-AGCUAACIYCC

UCAG U UCGGA U

....UG-AGCUAACSe::cc-------AAAAA

UCAGUUCGGAU U CA G U U CGG A U UCAGUUCGGAU UCAGUUCGGAU UCAGUUCGGAU UCAGUUCGGAU

---------AAAAA

-------A--GAAA

A#lU-GGCAAAUCV#A--------GAAA A-AGCUAACCUC.t---------AAAAA CCAAC -U-------------AAAAA

----CC---

------CUACGAUA -CGCAC A

~~~~~~~~~~~~~~~~~o-

-U

U

U A C G A A A

C A A A C G A A A A G U U G GA A

A.

~~~~~~~~~~~~~~~AAACC

-~~~~~

U~~~~~~~~%C AGU CG GAU CGGG UAC AG AIittt.C~~~~~ tA 'Wr A,1 -4I--A u*A A-4-4 AAG A A --

-A ~ ~ ~ G V~~ UG

A A

A A G A U ;~~~~~~~ C U A

4

A

-A

UU

4--$ -A-

A

A

C A

G G A

A

A U AL

U

A

AU

U

AG

U

A

U

C

A U G

U

U

G A

U

A

U

G A A ____________________________________________________________

.A_ __

1 3610u

3620

3630

3640

5

3650

3

360t

bC..~GU~-

Nucleic Acids Research, Vol. 18, Supplement 2309 3680

3670

C 00.i.. U C i . A C:....CU C U U 5.gAA AA A GG UU C G U A A C-- - - - - A A G G UU^4W ^5.W*: A Ai. A G.(AA..U A A A G U C G U A A C- -----A A G G M g S AcAG G A A G U CC GG UU AA AA CA G G b....... utXM. ....+..X CA A G GGG "XA k.A ... 5-..,8.A A G U C G U A A CC G U A A CA G G b _ . . eA QGA)A A A G U CC GG UU AA AA C-- -----A - - - - A A G G C- -----A A G U A AA G GGG - - - -- A A X ..A:*A . C A A G A U A G U G UA AC4-A GA A G U C A A G G AG A A A G G A A G U CC GG UU AA AA CU. CA A Q.A) GA^ A A G U C G U A A CA A G G A. A A)A.A..A A G U C G U A A CC G U A A CAA~~~~~AA -AA AA GG GG C G U A A C- C G U A A CA A G G A A A G U C G U A A C- -- A A G G - - - - -

C-

-

-

-

-

-

-

G~

47'

48

.

63 Hc +. A.A

AU.

A

A

A.fl..

A AG U C C GA 'U ......U. GA-A A AGU C 73 TAca GA AA G C A , ,G ..., .. . .A C ""A" AA ..

72 Tc 7

T

U

1470

1480

*

.

I

.

.

.

.

G U A A G U A A G U A A G U A A G U A A

1490 .

.

.

.

.

C - --A A G G ~ C - - - - - A A G G U C - - - - - A A G G

C

J

w

---A A G G U...

- -

C - -

G4

---A A G G UACG

1500 .

.

.

.

.

.

.

1510

.

*

95 96 100 101 102 105 153 173 188 189 201 203 207 208

Ng Pa Ap Ap Cv Ec Mg

AAGUC GUAAC-- AAGUC GUAAC-- AAG U C GUAAC-- AAGUC GUAAC-- -

Oh Me

AAGUC GUAAC--

xxxxx GUAAC--

-

-

A A G G

*.......

...........

...

AAGG A AG GXAA C AAGG

AAGUC GUAAC-AAGUC GUAAC--

xxxxx GUAAC-- - - - A A G X X X X X X X X

Mb

AAGUC AAGUC AAGUC AAGUC AAGUC

Sa Si Tt Dr

GUAAC-GUAAC-- - - - A GUAAC- - ---A GUAAC-- ---A GUAAC-- ---A

AA GG GG 5 UA CG A G G UA~~~ AG G ACJ

47, 222 223 225 228 229 230 231

Zm Os Gm Cm Ce Ce Cv

234 241 243 247

Hs ...iA C G -C A Rn C U A A C -A A Rc ....i...... - - - - -.-- - - - Sp

48 AAGU AAGU AAGU AAGU AAGU AAGU AAGU

A C A A U A - C A A C A .A A ( A U A G A A t. A IJU A A A C A i..A

-

248 P1 - - - - - - - - - 253 Gm -AC1 255 Cr -UAUf*4

256 Sc 258 260 261 262 269 270

CA

U C G C AU Sp.. C Pa Lt - - - Ls - - - - -

Sc An

- -

- - - - -

3970

V

3980

A A A A A A A A

G G G G G G G G

AAG

A A A A A

G G G C C

3990

U U U U U U U U U U U U G G

C G C G CG C G C G C G C G

U U U U U U U

A A A A AA A A A A A A A A

C C CC C C C -

-

-

-

A A G G AA GG A A G G -A A G G - A AGG - A A G G - A A GG

-

C G U A A C - - - - - A U G G C G U A A C- - - - - A A G G C G U A A C - - - - - A U G G C G U A A C - - - - - - - - - - - A C C G U A A C - - - - - - - - - - - A C C G U A A C - - - - - A A G G C G U A A C- - - - - - A G G ...f> U G A A - U -A C AGG .AtN UGAAAU ----- ACAG C G A A A U - - - - - A U G G gi 4 C G A A A U - - - - - A A G G 1 C G A A A U - - - - - A C G G VWC. U G C A G U - - - - - A A U U.> U G C A G U - - - - -A A U U -W

,.-.-..-. 4000

4010

4020

Nucleic Acids Research, Vol. 18, Supplement 2315

I

AU

.

.

.

.

I

.

G A A

.

.

.

I

I

I

.

I

.

.

.

.

GAUCA---U UA----_______ GAUCA---U UA----------GAUCA---U UA--_________ GA U CA ---U UA----------GAUCA---U UA.-.---_____ GA U CA ---U UG---GAUCA---U UG----------GAUCA---U UG----------GAUCA---U UG----------GAUCA---A GAUCA---U UG---------GAUCA---A GAUCA---U UA---------GAUCA---U GA UCA--- U GAUCA---U GAUCA---U U A GAUCA-UUU GAUCA--UU GA UCC--U C ,UA.-____

:GU G A A

*$U G A A :GU G A A SU G A A .GUGAA SU G A A ...GUGAA UGUGAA GSUGAA ..GUGAA EU GGAA .G.UGAA U GGA A GUGAA UGUGAA U G A A U GGA A

-

-

-

-

-

-

-

-

-

-

-

UA.----IUA.----eUA.--___

S UGAA

-

GGGAA UGG A A S.GUGAA G.. UGA US G A AA .....UGAA G A .8GA ..G.GA A

-

.

.

I

.

GAUCAGA UCA GAUCAGAUCAGA U C A -

. . .

.

I. *

I

.

.

.

.

I

.

.

.

.

I Hs

1 7 Ec 11 ________________-------Tm 12 ____Dm______----------- 13 ----------------------- Le 16 -----------------------At 19 --------------------Zp 20 Vc 22 Cv 23 24 -Ne _________-_-------Sc 27 _______-_------------- -Pc 30 ______________-------- -Pp 33 Pf 51 _____________---- --Pf 52 ___________-----P1 53 _________-_---------M----------------------g 56 --- Ld 59 ---------------Gl 62 -

_____O________------c -

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-----

-

------------

I.

GGA U C A

.

.

.

I

---CCUCCUUUCU-

SG G A A ctsscaGAUC A ---CCUCCUUA.

CGGA A G AkA

-

-

-

-

-

-

GA U CA ---CCUCCUUA.----

-

4P5t.0EG* GGGAA UU CC AA ---CCUCCUUUCU.-----CCUCCUUUCU.----C U CC U U U GAUCA ---C U C CU U

G.G.....G A U C A 48 48'

-

-

-

-

-

-

-

-

-

-

95 .Mg______Pa 96 100 ._____-----101 ._____-----

----------

----

-

---

---------

--Cv 102 EclOS

153 .Mg______--------------Oh 173 --------------

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

U-------U--------

#GCtLGGA U C A ---CC

-

-

GA U CA ---CCUCCUUA.----

GGA A

GGAA UGGAA

-

-

XXXXXXXXXXXXXXXGAUCA ---CCUCCU-----GG A A ........... G A U C A ---CCUCCUUA. AGAA G A U U A ---CC UCC U U UCU------VG G A A G~4AtGA U C A ---C U C CU U U-------XXGAAGXXXXXXXXXGAUCA ---CCUCCUUUCU.--CG GA A G4~4$40 GA U C A ---CCUCCUUUCU.--." CGGAA

--------Hc 63 -------- Ms 68 --------Tc 72 --------Ta 73 _Po 75 ._

----------

1540

..I I

SG G A A

UG G A A UM 6 G A A

.

--CCUCCU--CCUCCU- .-----------CCUCCU- .----------CC UCC --CCUCCU- .----------

1530

1520

CG G A A GG G A A

.

'48

~iG G A A

.... I

.

4080

4070

4060

4050

4040

4030 .

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

--------------Mb 189

Sa201 S 203 Tt 207 -Dr208 .___------

-------------- ------------------

Zm Os Gm Cm Ce Ce Cv

U CC UU G A U C A ---CC UCC UU.-----------.------GA UCA ,---CCUCCUUU .------,---CCUCCUUC ------------

~ &CJGAUCA ,---CCUCCUUC

.-------

#,SUSG A U C A ---CCUCCUUA-------GGS A A S UG G A A G&51~UGA U C A ---CCUCCUUG.-------

4GACGA---AC.----------------------------------- Hs 234

U..GGAA

G GA A

4GAAUA---AU.-----------------

Rn 241

Rc 243 Sp 247 P1 248 GD 253 255 4WAGUCA---AAUUUAUAUAA.------------- Cr Sc 256 .GGCUUA---UAAAUAUCUUAAAUAUUCUUACA-------Sc 258 5GCUUA---UAAAUAUCUUAAAUAUUCUUACA.------An 260 U*.AUGAA--UUAACCACUUAGCAAUAAAUAAAUGCAUAUAUAUACCA U U A A C C C C C C C C C------------- Sp 261 AACU A - - - A U U A A U U C A A ACU CA A A A U A U UU U A U G A A A A U C C U C A Pa 262 GS AC U U A --- U A UAU U Lt 269

*JGGAA

GGUUU----AUA.-----------------

-GAAAAU-------------------UGGAAAAU.------------------G..,GAUGG---AAUCC-----------------

...SCACA CACA SGGAA

CGGAA GSGAA

*:GGAA AU G G A A

UGSB6AA U-.GGAA

A :.'I G.G.A. *J..AUAA A. !.

Ls 270

UU

.-AUAA

I

222 223 225 228 229 230 231

I 4030

I

I

4040

I

4050

I

I

4060

I

I 4070

408

2316 Nucleic Acids Research, Vol. 18, Supplement 4090 .

I

.

.

.

.

I

4100 .

.

.

.

.

.

.

4110 .

.

.

.

.

.

.

.

.

4120 .

.

.

.

I

.

.

.

.

4130 .

.

.

.

.

.

.

4140 .

.

.

.

.

.

.

.

.

1 Hs 70c 11 Ec 12 Tm 13 Dm 16 Le 19 At 20 Zp 22 Vc 23 Cv 24 Ne 27 Sc 30 Pc 33 Pp 51 Pf 52 Pf 53 P1

56 Ng 59 Ld 62 Gl

63 Hc 68 Ms 72 Tc 73 Ta 75 Po

95 Ng 96 Pa 100 Ap 101 Ap 102 Cv 105 Ec 153Mg 173 Oh 188 Me 189 Mb 201 Sa 203 S1 207 Tt 208 Dr

222 223 225 228 229 230 231

-

-

Zm

Os

Gm

Cn Ce Ce Cv

234Hs. 241 Rn -.-.---- ---------------------.----243 Rc -.-.---------------------------.---247 Sp. 248P1. 253 Gm. -.----------------------------------_-_-___ CACUUGUUA 255Cr --------------------------------------------256 Sc --------------258Sc --------------260 An A A U U U U A U A U A C U U U U G U A U A U A A A A G A U G G A A C A A A U A U A U A U A ______4140 261 Sp. 262 Pa AAUCAAAUUUUGAUUGAUUUUUAUAUAUAUAUAGGAAUAU A U U C U A A A U A U 269 Lt ---------------------------------------------

270Ls

-_

_-

_-

-_

_-

_-

-_

_-

_-

-_

_-

_-

-_

_-

_-

-_

_-

_-

-_

_-

_-

--.------------------------------------------

I 4090

I

1

4100U

4110

4120

41.30

Nucleic Acids Research, Vol. 18, Supplement 2317 4160

4150

Hs Oc Ec Tm Dm Le At Zp Vc Cv Ne Sc Pc Pp Pf Pf P1 Ng Ld GI

1 7 11 12 13 16 19 20 22 23 24 27 30 33 51 52 53 56 59 62

--------Hc -____----Ms --------- Tc --------- Ta .____---- Po

63 68 72 73 75

--------------------

_-.-----.______ --------------------

.__--------------------------------------------------------------------

-

--

--------

-

.-----------------

_______-_------------------------------

-

-

-

-

-

-

-

-

-

-

-

-

-

-

._______-----------.___________-------.___________----------- - -------.___________-----------------------------------------------------------------

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

.__________--------- Ng 95 Pa 96 .__________--------- Ap100

--------------------

-------------------- -----------------------------------

AplOl

Cv 102 . EclO5 ________------------ Mg153 ---------- Oh173 .Oh-------

--------------------

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

Me188

-----Mb 189 . Sa201 . Si 203 . Tt207 . Dr 208 -

-

-

-

-

-

-

-

-

-

Zm 222 Os 223 Gn 225 Cn 228 Ce 229 Ce 230 Cv 231

Hs 234 Rn 241 Rc 243

Sp 247 P1 248 Gm 253 Cr 255 Sc 256 Sc 258 An 260 Sp 261 U AU UU AU UA UA UA CU AU AU A Pa 262

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

_

_

-

_

_

-

-

-

-

-

-_ -_

-

-

-

-_

-

I

_ _-

_-

_-

_-

_-

_

_

_-

__-

_-

_-

_

_

_-

_-

_ _-

_

__

_

_

I 4150

_-

__-

-

__-

-

_

_

Lt 269 Ls 270

_-_-

I 4160