Suppl. Figures

2 downloads 0 Views 2MB Size Report
GT TTTT.GCT.. .AA.CAGC.. A.AAAGCG.. GTCA..GG.. T.GAGCC.CA TTCAAA.G.. ...... G.G.G.G .TTG...GCG. At1g34650 HDG10. C.TA..G.GT .T.T.A.C.. AA. . AC...C.. ..... AAAA. At3903260_HDG8. GCCT.GC.CG ATGCTCA.TC CAAGGT.AT. TGG.
1

Suppl. Figure 1. Bayesian phylogram of streptophyte C4HDZ genes. Numbers at nodes indicate posterior probability values. Tree was constructed using amino acid alignment in Suppl. data (Suppl.dataC4strepBAYES.txt). Taxa are color coded as in Figure 2 with the additon of red for charophycean algae (purple, bryophytes; light blue, lycophytes; dark blue, ferns). Taxon abbreviations: AT = Arabidopsis thaliana; Aev = Angiopteris evecta; Apl = Asplenium platyneuron; Co = Coleochaete orbicularis; Cr = Ceratopteris richardii; Cru = Cycas rumphii; Edi = Equisetum diffusum; Ga = Gossypium arboreum; Gb = Ginkgo biloba; Gh = Gossypium hirsutum; Ha = Helianthus annuus; Mp = Marchantia polymorpha; Os = Oryza sativa; Pa = Picea abies; Pc = Phaeoceros carolinianus; Pgl = Picea glauca; Phsp = Phalaenopsis sp; Pm = Pseudotsuga menziesii; Pn = Psilotum nudum; Pp = Physcomitrella patens; Pta = Pinus taeda; Sk = Selaginella kraussiana; Sm = Selaginella moellendorffii; Solyc = Solanum lycopersicum; Sppr = Spirogyra pratensis; Ssp = Sphagnum sp; Vpl = Vanilla planifolia; Vvi = Vitis vinifera; Zm = Zea mays.

2

At4g26920 MRWVTIFPSL At4g26290_annotated .......... A_lyrata_XM_002869541 V...S..... Eutrema_parvulum_AFAN01000032 EM.PK....I Capsella_rubella VN.QK....I A_lyrata_5g07260 ?T.KK....I At5g07260 VS.QK....I Brassica_napus_DQ166821 VK.GEL..CI Brassica_napus_DQ182489 VK.GEL..CI Brassica_napus_DQ182490 E..AEM...M At3g61150_HDG1 E..AEM...M A_lyrata_XM_002876550 E..AEM...M Thellungiella_halophila_AK353253 N..TEM..CN Thellungiella_halophila_AK352868 NQ.SCV.SGI Brassica_rapa_EU826522 EQ.KEM.AC. At4g25530_FWA GK..NV.API Turritis_glabra_AB367817 GK..NV.A.I Pt_XM_002320719 N..AEM..CV Pt_XM_002301295 N..AEM..CM Pt_XM_002318195 NG..EM.... Pt_XM_002322424 NG..EM.... Pt_XM_002320422 NQ.S.L.SGI Pt_XM_002305566 NQ.S...CGI Pt_XM_002302844 NQ.S.L.SGI At5g52170_HDG7 NK.AEM.ECI At4g00730_ANL2 N..TEM..CN At1g05230_HDG2 NQ.S...AGM At4g04890_PDF2 NQ.SCV.SGI At4g21750_ML1 NQ.SSV.CGI At1g79840_GL2 GQ.KET.AC. At2g32370_HDG3 NL.S.M.AGI At1g34650_HDG10 EK.ARL..TI At1g17920_HDG12 VKLTEL...I At1g73360_HDG11 VK.TEL...I At3g03260_HDG8 EK.KEL..TI At5g17320_HDG9 EK.ARL..TI At5g46880_HDG5 DK.SEM.C.I At4g17710_HDG4 DK.SEM.FPI

1 11 21 31 41 51 | | | | | | VVLACIVNEI IALATLESPL WRR-FVHEAS RASEVIHVDA SWLLTKLKNP .......... .......... .......... .......... .......... G......... .....P.... .S...F.... ...A..R... ...VR..E.. AMT.LSLQ.V VS..RQGG.M .T-....... ...AL.PC.. LS.V.T.ID. .MTSLSLK.V VS.SRQGT.M .TS.....V. ...ALVPF.. .L.VEN.T.H .MMSLSLK.V VS..RQRT.M .?S....... ...AFVPC.. .S..AN.M.H .MTSLSLK.V VF..RQRT.M .TS.....V. ...AFVPC.. .S.VAN.M.H DIALTAME.L LR.FNTNE.. .T..VRT... .S.GIVFMN. MT.VDMFMDG DIALTAME.L LR.FNTNE.. .T..VRT... .S.GIVFMN. MT.VDMFMDG DLALAAME.L VKM.QRHE.. .V....S... KETGNVIINS LA.VET.MDS DLALAAMD.L VKM.QTRE.. .V....S... KEAGTVIINS LA.VET.MDS DLALASMD.L VKM.QTRD.. .V..Y.S... KEAGTVIINS LA.VET.MDS ELALTAMD.L VK..HS.E.. .VK.L.T... KI.GMVIINS LA.VET.MDS ELAVAAME.L VRM.QAVD.. .VS.LRS... .E.A.VIMNH IN.VEI.MDV EIANRATL.L QKM..SGE.. .L..KTI... .DVGIVFM.. HK.AQSFMDV NLAITALR.L .T.GEVDC.F .MI.QIV... ..KGLVPMTC VT.VKT.MDT DLANTAMK.L .V.GEPDC.F .TI.C.V... .DTGLVPMTS .T.VKT.MDT ELALAAMD.L VKM.QTDE.. .I....S... .ETGMVIINS LA.VET.MDS ELALAAMD.L VKMVQTDE.. .IG...S... .ETGMVIINS LA.VET.MDS DLALAAMD.L .KM.QI...I .IK...T..T .V.G.VL.NI .A.VET.MDV DLALAAMD.L .KI.QV...I .IK...I..T .E.G.VLANS LD.VET.MDV ELAVAAME.L VRM.QMDE.. .MG..KC... .E.A.VIMNH IN.VEY.MDV EIAVAAME.L MRI.QAGE.. .IQ.MRS... .E.A.VIMNH VN.VEI.MDA ELAVAAME.L .RM.QMDE.. .MN..KC... .E.A.VIMNH IN.VEY.MDV DLAMEAMD.L LK..E..TS. .SS.NHFPG. .ETGLVLINS LA.VET.MDT ELALTAMD.L VK..QS.E.. .VK.LAT... .T.GMVIINS LA.VET.MDS DLSVAAME.L MRMVQVDE.. .KS.YRS... .E.A.VIMNH VNIVEI.MDV ELAVAAME.L VRM.QTGD.. .LS.LRS... .Q.A.VIMNH IN.VEI.MDV ELAVAAME.L VRM.QTGD.. .VS.LRS... .E.T.VIMNH IN.IEI.MDV EISNRATL.L QKM..SGE.M .L..KTI... .DAGIVFM.. HK.AQSFMDV ELAFGAME.L LVM.QVAE.. .MG..RT... .ETALVAMCP TGIVEM.MQE EIAKNA.A.V MS.IQM.HSM .IK.SH..S. MEVV.VQM.. RN.VDMFL.T NIAVTAME.L LR.LQTNE.. .IK.LGM... .S.G.VFTN. IT.VDM.M.S GIALTAME.L LR.LQTNE.. .T...RV... .S.GIVFMN. MA.VDMFMDC EIA.SA.E.L KR.FLA.EQF .VK.AHV.S. K.VT.V..E. IN.IQMFLD. EAAEKA.S.V LS.IQMDDTM .KK.SSKD-- --VV.VQM.. GN.IDIFLTA EFAVSC.Q.L TKMCDT.E.. .IK..LR... K.NA.VIMNS IT.VDAFL.A LAVS.ARELA KMCDIN.PLW NKK..RR... ..NA..MLNC IT.VKAFLDA

3

MpC4HDZ1 V..TEM.LCM PcC4HDZ1 S..MEM.SCI SmC4HDZ01 G..MDM.SCI SmC4HDZ03 G...DM.SNI PpC4HDZ123 SQ.AEM..CV

At4g26920 LRFPSGFIIK At4g26290_annotated .......... A_lyrata_XM_002869541 .........Q Eutrema_parvulum_AFAN01000032 M.....YLVQ Capsella_rubella M.....YLVQ A_lyrata_5g07260 M.....YL.Q At5g07260 M.....YL.Q Brassica_napus_DQ166821 YK....CL.Q Brassica_napus_DQ182489 YK....CL.Q Brassica_napus_DQ182490 R.L...CLVQ At3g61150_HDG1 R.L...CLVQ A_lyrata_XM_002876550 R.L...CLVQ Thellungiella_halophila_AK353253 R.L...CVVQ Thellungiella_halophila_AK352868 R.R...CL.Q Brassica_rapa_EU826522 RKR...C..E At4g25530_FWA K.L...L..D Turritis_glabra_AB367817 N.L...L..E Pt_XM_002320719 R.L...CVVQ Pt_XM_002301295 R.L...CVVQ Pt_XM_002318195 K.L...C..Q Pt_XM_002322424 R.L...C..Q Pt_XM_002320422 R.R...CL.Q Pt_XM_002305566 R.R...CL.Q Pt_XM_002302844 R.R...CL.Q At5g52170_HDG7 KM....C..Q At4g00730_ANL2 R.L...CVVQ At1g05230_HDG2 R.RA..CL.Q At4g04890_PDF2 R.R...CL.Q At4g21750_ML1 R.R...CL.Q At1g79840_GL2 RKL...C..E At2g32370_HDG3 R.R...CL.Q At1g34650_HDG10 TKR...VL.Q At1g17920_HDG12 Y.....CL.Q

ELAVAAMD.L VR..QSDE.. .IP.LKT..T .ETGLVMMNG VT.VET.MDH DLALSSME.L LR..QGGD.. .VQ.MHT..T .E.GLVM.NG AN.VNMFMDA ELAIIAME.L L...QSRE.. .IL.LKA.VT .DTGLVMMNG AA.VDTIMDA ELAIVAME.L LLV.AETDGA LWS.MET... .ETGLVMMN. AG.IDTIM.V ELAVAAME.L VRMVQA.E.. .V..LRT... .ETALVMMNG VN.VET.LDA 61 71 81 91 101 111 | | | | | | VGNVS-IDME FLTLITPVIP TRKVKVLRYC HR-IANDTWI IADISM-PEF .......... .......... .......... .......... .......... .......... .......... ...I...... ....G..... .......... .AEA...I.D .MP..F.L.Q ..H.EL..CF RQ..ETG..V .....V..N. IAD...LNVD .MPQ.S.L.Q ..N..L..RS MH..E....V ..Y.....N. IAD....NVD .MPQ.S.L.Q ..N..L..RS RH..E..A.V .....V.... IAD....NVN .MPQ.S.L.Q ..N..L..RS RH..ED...A ..E......Y .ASSK.LMY. EMAVLS.LVA ..EFCE.... QM..EQGS.. VVNV.Y.SHS .ASSK.LMY. EMAVLS.LVA ..EFCE.... QM..EQGS.. VVNV.Y.SHS ISRT..LMHA E.Q.LS.LV. V.Q.SF..F. KQ.H.EGV.A VV.V.I.SSC .SRT..LMHA E.Q.LS.LV. V.Q.SF..F. KQ.H.EGV.A VV.V.I.SSC ISRT..LMHA E.Q.LS.LV. V.Q.SF..F. KQ.H.EGV.A VV.V.I.SSC .ARAA.LMNA E.QVLS.LV. V.N.NF..F. KQ.H.EGV.A AV.V.I.VII .SRAL.VMTA EFQVPS.LV. ..ENYFV... KQ.HSDGS.A VV.V.L.SRT ISKAV.LMFG EMQ.L...V. ..E.YFV.S. RQ.LSPEK.A .V.V.V.LRC .PVA..QIQA EFQV.S.LV. K...TFI... KE..RQGL.V VV.VTP.GCS .PVS..QIQA EFQV.S.LV. K.Q.TF.... KE.LKHGL.V VV.VTP.GAS IART..LMHA E.QVLS.LV. V.E.NF..F. KQ.H.EGV.A VV.V.V.VNC IART..LMQA E.HVLS.LV. V.E.NF..F. KQ.H.EGV.A VV.V.I.VNC IARAA.MIHA EFQ..S.FV. V.Q..F..L. KQ.LTEGV.A VV.V.I.VTC IARAA.MIHA EFQV.S.FV. V.Q..F..L. KQ.L.EGV.A V..V.V.VTC .SRAL.VMTA EFQ.P..LV. ..ESYFV... KQ.H.DG..A VV.V.L.ARC .SRAM.VMTA EFQVPS.IV. ..ENYFV... KQ.HTDG..A VV.V.L.SKC .SRAL.VMTA EFQ.P..LV. ..ESYYV... KQ.H.DG..A VV.V.L.ARC .AVA..LMQA EFQVMS.LV. IKQK.F.... KQ.HGDGL.A VV.V.Y.GGS .ARAT.LMNA E.QVLS.LV. V.N.NF..F. KQ.H.EGV.A VV.V.I..VI .SRAM.VMSA EFQVPS.LV. ..ETYFA... KQ.QGDGS.A VV...L.ARC .SRAL.VMTA EFQVPS.LV. ..ENYFV... KQ.HSDGS.A VV.V.L.LRT .SRAL.VMTA EFQVPS.LV. ..ENYFV... KQ.HSDGI.A VV.V.L.TRS ISKAA.LMFG EMQ.L...V. ..E.YFV.S. RQ.LSPEK.A .V.V.V.LKC ..RAR..MSA EYQVLS.LVT ..ESYFV... KQ.QGEGL.A VV...I.LKC .TEAK.VVY. Q.HILS.LVL P.EFII..T. QQ.MKE.L.L ...V.C..IC .ASSK.LMI. E.QVLS.LVT ..EFC..... QQ..EHG..A .VNV.Y.SRS

4

At1g73360_HDG11 Y.....CL.Q At3g03260_HDG8 YKR...CL.Q At5g17320_HDG9 TKR...VL.Q At5g46880_HDG5 K.K...C..Q At4g17710_HDG4 R.K...C..Q MpC4HDZ1 R.R...CL.Q PcC4HDZ1 Q.R...CV.Q SmC4HDZ01 RLR.P..L.Q SmC4HDZ03 RMR....F.E PpC4HDZ123 R.R...VL.Q

At4g26920 RLVHIFCSGT At4g26290_annotated .......... A_lyrata_XM_002869541 HM..L..... Eutrema_parvulum_AFAN01000032 M.NL..T.V Capsella_rubella M.YA..T.V A_lyrata_5g07260 M.NA..T.V At5g07260 M.NV....V Brassica_napus_DQ166821 .M.SNY.ISV Brassica_napus_DQ182489 .M.SNY.LSV Brassica_napus_DQ182490 .MTDN..G.V At3g61150_HDG1 .MTDN..G.V A_lyrata_XM_002876550 .MTDN..G.V Thellungiella_halophila_AK353253 .MTFN....I Thellungiella_halophila_AK352868 .M.MS....V Brassica_rapa_EU826522 .MTQS.YRAI At4g25530_FWA .MTLNYYR.I Turritis_glabra_AB367817 .MTLKYYT.I Pt_XM_002320719 .MTAN..A.V Pt_XM_002301295 .MTDN..A.V Pt_XM_002318195 .M.DN....V Pt_XM_002322424 .M.DS....V Pt_XM_002320422 .M.IS..A.V Pt_XM_002305566 .M.MS..T.V Pt_XM_002302844 .M.IS..A.V At5g52170_HDG7 .MKLN.Y..I At4g00730_ANL2 .MTFN....I At1g05230_HDG2 .M.IS..A.V At4g04890_PDF2 .M.MS....V

IAASK.LLY. EMEVLS.LVA ..EFCE.... QQ.TEQGS.. VVNV.Y.SQS .NKAN.VMW. Q.HILS.LV. A.EFM.V.C. QE..EKGI.. ...V.H.AAC .NEAK.VIY. Q.HILS.LV. P.EFMI..T. QQ..EDNV.M ...V.C..IC .ARAK.LMFA E.QVLS.LV. ..EAYF...V EQ.AETGN.A .V.FPI.H.Y .SSAK.LMFA E.QVVS.LV. ..EAYF...V EQ.AEEGK.M VV.FPI.DQY .SRAL.LMYA E.QVLS.LV. A.ECYF..F. KQ.HGEGI.A VV.V.V.MRS .SKAL.-MYA E.QVMS.LV. ..DMYF..F. KQ.P.EGV.A VV.V.V.VRC ISRAL.LMYA EFQVLS.LV. ..EAYF.... KQ.H.EGV.A .V.V.V.LRN .SRAF.LLYA E.QILS.LV. ..EFFF.... KQ.HSERV.A .V.V.I.LRC .SRAV.LMYA E.QVLS.LV. ..E.YF.... KQ.H.EGV.G VV.V.V.MRC 121 131 141 151 161 171 | | | | | | HVARIFRVQS VKRMYVGNYV -EALPLFWFR IWCQEMA-SA GKNNLLQASK ........-- ---------- .--------- -------... .......... .LPNGYSKVT ILEHW.YKED .GFGAKK.LV ALQRYCS..T .RD...EV.R .ISNGVSRVT .LDHW.YKEE .GFGAHR.LS ALQRHFF.I. RT.L.NLSTR R..NGISKVT .LDHW.YKEE .AFGAYR.LA ALQKHCY.IC HT.L.NLSTS .I.NGISKVT ILDHW.YKEQ ..FEAHR.LA ALQRHCY.IC LT.L.NLS.F .I.NGISKVT ILDHW.YKEE ..FGAQR.LT ALQKHYY.IC R..L.NLS.F DMPNGYSKVT WVEHVETEEK .AFGAER.VT TLQRMCE.PE ..RSMMRLAH DMPSGYSKVT WVEHVETEEK .AFGAER.VT TLQRMCE.PE ..RSMMRLAH DM.NGYSKVT WIEHTEYDET .AFGAQR.MA ALQRQCE.CN .RKSM.KLA. DM.NGYSKVT WIEHTEYDEN .AFGAHR.MA ALQRQCE.CN .RKSM.KLA. DM.NGCSKVT WIEHTEYDEN .AFGAHR.MA ALQRQCE.CN .RKSM.KLA. DMSNGYSKVT WVEHAEYDEN .GFGSQR.VA TLQRQCE.PG .RKSM.KLAQ ELPNGYSKVT WIEHMEVDDR .AFGAKR.VS TLERQCE.PE .RKSM.KLAE DTSNGHSKVT WVEHLDLSAS .AFGAKH.VA TLQLHCE.L. .RKSV.KMAQ DLSNGYSQVT WIEQAEY.ES .GLGAKR.LA TLQRHCE.AK .ATEIVKLAQ DI.NGYSKVT WIEQAEY.ES .GLGAKR..K TLQRYCG.AK .ATE.VKLAQ DMPNGYSKVT WIEHAEYDES .GFGAQR.IA TLQRQSE.AS .RRSM.KLAQ DMPNGYSKVT WVEHAQYDER .GFGAQR.IA TLQRQCE.TS .RRSM.KLAQ DMNNGCSKVT WVEHSEYDES .GFGAQR.LA ALQRYYE.LG ..KSM.KLAR DMNNGCCKVT WVEHSEYDES .GFGAQR.IA ALQRHYE.LG ..KSM.KLAR EMLNGYSKVT WVEHVEVDDR .AFGAKR.VA TLDRQCE.QE .RKSMMKLAE ELPNGYSKVV WVEHIEVDDR .AFGAKR.VG TLDRQCE.AE .RKSM.KLAE EMPNGYSKVT WVEHVEVDDR .AFGAKR.VA TLNRQCE.QE .RKSMMKLAE DIGNGCSKVT WIEHSEYEES .GLGATK.LA TLQRQCE.H. .TKSI.KLAQ D.SNGYSKVT WVEHAEYDEN .GFGSQR.LA TLQRQCE.PG .RKSM.KLAQ ELPNGYSKVT WVEHVEVDDR .AFGAKR.VA .LDRQCE.QE .RRSM.KLAE ELPNGYSKVT WIEHMEVDDR .AFGAKR.VA TLERQCE.PE .RKSM.KLAE

5

At4g21750_ML1 .M.MS..T.V At1g79840_GL2 .MTQS.YRAI At2g32370_HDG3 .IART.FA.M At1g34650_HDG10 .MLKN.AWIM At1g17920_HDG12 .M.SN..LSV At1g73360_HDG11 .MISNY.LSV At3g03260_HDG8 .M.KN.NEML At5g17320_HDG9 .MLRN.AWMM At5g46880_HDG5 ...KT..VNI At4g17710_HDG4 .M.KT..LNI MpC4HDZ1 .MTNN..A.V PcC4HDZ1 .MTNN..A.V SmC4HDZ01 .MTNN..A.V SmC4HDZ03 .MTSNY.A.V PpC4HDZ123 .MTSN..A.V

At4g26920 MVIHSIL At4g26290_annotated .......... A_lyrata_XM_002869541 .......... Eutrema_parvulum_AFAN01000032 AG..I...FV Capsella_rubella GA..I.YACI A_lyrata_5g07260 GA..I..AC. At5g07260 GA..I..TCV Brassica_napus_DQ166821 GA.L.VY.PV Brassica_napus_DQ182489 GA.L.VY.PV Brassica_napus_DQ182490 GA.V.VYAPV At3g61150_HDG1 GA.V.VYAPV A_lyrata_XM_002876550 GA.L.VYAPV Thellungiella_halophila_AK353253 GA.L.VYAPV Thellungiella_halophila_AK352868 G..Y..YAPV Brassica_rapa_EU826522 E..V.VYAPV At4g25530_FWA GA...VYAPV Turritis_glabra_AB367817 GA.V.VYAPV Pt_XM_002320719 G..L.VYAPV Pt_XM_002301295 G..L.VYAPV Pt_XM_002318195 G..Q.VYAPV 'Pt_XM_002322424 ' G..L.VYAPV Pt_XM_002320422 A..F..YAPV Pt_XM_002305566 G..Y..YAPV

ELQNGYSKVT WVEHIEVDDR .AFGAKR.VA TLDRQCE.PE .RKSM.KLAE DTSNGHSKVT WVEHLDVSAS .AFGARH.VA TLQLHCE.L. .RKSV.KMAQ EMHSGYSKVT WVEHVEVDDA .AFAANR.VG TLVRQCE.NH ..MSM.KIAE ALPHGRSKVT WIEHVEVTDK .GYGARR.TA TLQRMCE.IE .RRSVMSLGE DMSNGYSKVT WVEHGEFEEQ .AFGAER.IA TLQRMCE.PE ..RSIMRLAH DMPNGYSKVT WVEHIETEEK .AFGADR.VT TLQRMCE.PE ..RSMMRLAQ ALPDAHSKVM WIEHVEVDHK .GYGAKR.IV TLERMCE.GE ARRSVMKLGE ALPHG.SKVT WIEHV.V.DN .GYGARR.TV TLERTCE.IR .R.SVMHLGE DMPNGYSQVK WVEHVEVDEK .AFGANR.LD VLQRQCE.AE ARR.IMRL.Q AMRNGYSQVT WVEHVEVEEK .AFGAER.LS VLKRQCE.VE ARK..MKL.Q DMPNGYSKVT .LEHMDYDDR .AFGAQR.LA TLQRQCE.AN .RRSM.KLAQ EASNGCCKVT LVEHLEHDNR .AYGAQR.LA TLQRQCE.ES .RRSM..LAQ DMPNGYSKVT ILQHMEYDDR .AFGAKR.LA TLQRQCE.AT .RRSM.KLAQ DLQNGYSKVT AVQHIEADHR .AFGAKR.LA .LQRQCE.AN .RRSM.FLAQ DTPNGYARVT CVEHAEYDDR .AFGAQR.VA TLERQCE.AS .RRSM.KLAQ 181 191 201 211 221 231 | | | | | | CGVIGNRGRW L-PYG-IISA SGLTKIHAKP EILFPLI-VF LLQEAYNEAS SS.......... .......... .......... .......... .......... .....YQW.R ...C...... ...A...... .M...F.... .......... Y..S.EEWKR ....V.L..S T.SAEMQ... .TI.GV..WC VI..T.Y... ..TPRQKWKR ...CV.F... T.M.RMQ.T. .VI.G...WY MI..T.LD.. ..IT.QSWNR F..CV.LVT. T..ARMQ.R. .VI.G...WY .I..T.YDET ..IT.Q.WNR ...CV.LV.. T..ARM.T.. .VM.G...WY .I..T.YDE. SRSNNTHSTV V..N.T.LC. ATTVWLPNS. QNV.NFL.ML I...TSIDS. SRSNNTHSTV V..N.T.LC. ATTFWLPNS. QSV.NFL.ML I...TSIDS. .ASSLQKWSK ...P.IVLN. ATSVWMPVS. KR..DFL.ML I...TSID.V .ASSLQKWSK ...P.I.LN. ATSVWMPVS. RR..DFL.ML I...TSID.A .ASSLQKWSK ...P.I.LN. ATSVWMPIS. RR..DFL.ML I...TSID.A SAPSVHSWSK ..DS.I.L.. ATSVWLP.S. QR..DFL.ML I...TCID.. GASTAHAWTT M..P.IVL.. ATSFW.PVA. KRV.DFL.ML I...SCTD.. AASSYHQWNK I..T.V.VC. .SSLWLPVS. TL..DFF.TW V..DSCTNSY TSPSVDKWQK I.LT.IVL.. .TSVWLPVNQ HT..AF..ML V...IW.D.. T.SSTDKWEI I.YT.IVL.. ATSVWFPVNQ QT..AFL.ML V...VW.D.. .ASTVHKWNK ...P.IVL.. ATSVWLPVS. QR..DFL.ML I...TCID.A .ASTVHKWNK ...P.IVL.. ATSVWLPVS. QR..DFL.ML I...TCID.A .ASSLHNWGN ...D.IVL.. ATSVWLPVSR QR..DFL.ML I...TR.DV. .ASTLHNWGN ...D.IVL.V .TSVWLPVSQ QR..DFL.ML I...TW.DV. SASTAHTWTT ...P.IVL.. ATSFWLPVP. KRV.DFL.ML I...SCADQT GASTAHAWTT ...P.IVL.. ATSFW.PVQS KRM.DFL.ML I...SCTDST

6

'Pt_XM_002302844 ' A..F..YAPV At5g52170_HDG7 GA.L.VYAPV At4g00730_ANL2 GA.L.VYAPV At1g05230_HDG2 A..F..YAPV At4g04890_PDF2 G..Y..YAPV At4g21750_ML1 G..Y..YAPV At1g79840_GL2 E..V.VYAPV At2g32370_HDG3 A..F.LYAPV At1g34650_HDG10 GG...VYAPM At1g17920_HDG12 .AAL..YTPV At1g73360_HDG11 GA.F.VY.PV At3g03260_HDG8 GG..IVYAPM At5g17320_HDG9 GG...AYAPM At5g46880_HDG5 G..LIVY.TV At4g17710_HDG4 G..LLVY.TV MpC4HDZ1 G..L..YAPV PcC4HDZ1 G..L..YAPV SmC4HDZ01 G..LIVYAPV SmC4HDZ03 G..LIVYAPV PpC4HDZ123 G..L..YAPG

SASTAHTWTT ...P.IVL.. ATSFWLPVP. KRV.DFL.ML I...SCTDQT TASCIHKWEK ...S.IVL.. ATSLWLPVTQ QR..EFL.ML I...TW.DV. SAPSVHNWSK ...P.IVL.. ATSVWLP.A. QR.YDFL.ML I...TCID.. SASTAHTWTT ...P.IVL.. ATSFW.PVP. KRV.DFL.ML I...SCTDPT GASTAHAWTT M..P.IVL.. ATSFW.PVA. KRV.DFL.ML I...SCTD.. GASTAHAWTT ...P.IVL.. ATSFW.PVA. KRV.DFL.ML I...SCTD.. AASSYHQWTK I..T.V.VC. .SSLWLPVS. AL..DFF.IW V..DSSTNSY TNAT.STIFS G..P.V..C. ATSFWLP.P. NTV.DFL.MM IV..TSTDPT KMSDKLDLPQ Q..P.L.VC. GSSLSLPLP. LQVYDFL.LM I..DGFID.L GTSNNT.STV V..N.MVLC. ATSFWLPIS. QNV.NFL.ML I...SCIDS. SRSNNT.STV V..N.TVLC. ATTFWLPNS. QNV.NFL.ML I...SSTDS. TMSGKIDFPQ Q..P.IVV.. .SSLA.PLT. LQV.AFL.ML M..DC.MD.L KM.NKLDFSP Q..P.L.VC. GSSLSLPLP. VQVYDFL.LM I..DSFKD.L STAY.QSWTA ...T.VVLC. VST.WLPFSH HQV.D...EL M...SCIDN. INSH.QAPTK D.CG.LVPC. VSV.LLPYSH QQV.D.L.EL M...TCTDN. SASTVHTWTT ...H.IVL.. ATSLWLPVS. QRV.EFL.ML I...SCTDV. SASSTHAWTT ...H.TVL.. ATSIWLPVL. QRV.NSL.ML I...SCTDV. SASTVHTWTT ...P.IAL.. ATSLWMPVS. QRV.EFL.ML I...SSTDE. SAS.VHTWTT ...P.IVL.. ATSLWVPVNS QR..EFL.ML I...SCTD.. SASTAHTWTT ...Q.IVL.. ATSMWLAVSA ARV.EFL.ML I...SCTDV.

241 251 261 | | | At4g26920 DVSSLAKIIN G-DRSYSFTY PCGFTIMP At4g26290_annotated .......... .......... ........ A_lyrata_XM_002869541 .E...R.... ...S.F.I.. ........ Eutrema_parvulum_AFAN01000032 QA...QAV.. ...L.RLRIF ...I..V. Capsella_rubella EAPYFTAV.. ...L.DIKLL .....M.. A_lyrata_5g07260 EAPYF.AV.. D..L.GIELL ........ At5g07260 EAPYF.AA.. ...L.GVELL .S....I. Brassica_napus_DQ166821 .LPA.NIAMS ...T..IPLL SS..A.S. Brassica_napus_DQ182489 .LPA.NIAMS ...T..IPLL SS..A.S. Brassica_napus_DQ182490 .IPAMQAVM. ...SA.VALL .S..A.L. At3g61150_HDG1 .IPAMQAVM. ...SA.VALL .S..A.L. A_lyrata_XM_002876550 .IPAMQAVM. ...SA.VALL .S..A.L. Thellungiella_halophila_AK353253 .IPAMNVVM. ..ES..VALL .S..A.L. Thellungiella_halophila_AK352868 .IVAMNVVLS ...PD.VALL .S..A.L. Brassica_rapa_EU826522 .INTTQMV.A ...P.NIQIL ....S.I. At4g25530_FWA ETN.IELVKR ..NSDSVKFL .S..S.V. Turritis_glabra_AB367817 ET..IEPVKR ..NSDSVQLL .S..S.L. Pt_XM_002320719 .IPAMHVVM. ...SA.VALL .S..A.V. Pt_XM_002301295 .TPAMHVVM. ...SA.VALL .S..A.V. Pt_XM_002318195 .IQ.MSVVTS ...ST.VALL .S..V.L. Pt_XM_002322424 ..Q.VSVVM. ...ST.VALL .S..V.L. Pt_XM_002320422 .IVAMNVVL. ...PD.VALL .S..AVL. Pt_XM_002305566 .I.AMNIVLS ...PD.VALL .S..A.L. Pt_XM_002302844 .IVAMNVVL. ...PD.VALL .S..A.F. At5g52170_HDG7 .IP.MNTVMS ...SA.VALL .S..S.L. At4g00730_ANL2 .IPAMHVVM. ...S..VALL .S..AVL. At1g05230_HDG2 .IVAMNIVL. ...PD.VALL .S..A.L. At4g04890_PDF2 .IVAMNVVLS ...PD.VALL .S..A.L. At4g21750_ML1 .IIAMNVVLS ...PD.VALL .S..A.L. At1g79840_GL2 .INTTQLVLA ...P.NIQIL .S..S.I. At2g32370_HDG3 .MT.MDITLH ...PDFVVIL .S..A.F. At1g34650_HDG10 NLNTAYSA.S ...P.TIPIL .S..I.SR At1g17920_HDG12 .LPA.NIAMS ...T..IPIL .S..A.S. At1g73360_HDG11 .LAA.NIAMS ...P..IPLL SS....S. At3g03260_HDG8 .MATMHFAVS ...P.HIPIL .S..V.SS At5g17320_HDG9 .LNTACAA.S ...PTTIPIL .S..M.SR

7

At5g46880_HDG5 At4g17710_HDG4 MpC4HDZ1 PcC4HDZ1 SmC4HDZ01 SmC4HDZ03 PpC4HDZ123

At4g26920 GTCTCCTTTG At4g26290_annotated .......... A_lyrata_XM_002869541 .......... Eutrema_parvulum_AFAN01000032 CGG...GA.. Capsella_rubella AA....GA.. A_lyrata_5g07260 AA....GA.. At5g07260 AA....GA.. Brassica_napus_DQ166821 CGAA...C.C Brassica_napus_DQ182489 CGAA...C.C Brassica_napus_DQ182490 CGAG..GC.T At3g61150_HDG1 TGAG..GC.C A_lyrata_XM_002876550 TGAC..GC.C Thellungiella_halophila_AK353253 AGAA..G..A Thellungiella_halophila_AK352868 .GA......A Brassica_rapa_EU826522 TGAA...C.. At4g25530_FWA C.G...A..T Turritis_glabra_AB367817 C.G...G..C Pt_XM_002320719 TGAG...C.T Pt_XM_002301295 TGAG...C.T Pt_XM_002318195 AAG...CA.T Pt_XM_002322424 AAG...CA.T Pt_XM_002320422 TGAA..C..A Pt_XM_002305566 AGAA..C... Pt_XM_002302844 TGAA.....A At5g52170_HDG7 AA..T....A At4g00730_ANL2 AGAA..G..A At1g05230_HDG2 CGAG...C.. At4g04890_PDF2 AGA......A At4g21750_ML1 TGA...C..A At1g79840_GL2 CGAA...A.. At2g32370_HDG3 TGAA..AC..

..D.IQQAM. .PVAVQLAM. .IPAMTLVLQ .IPAMQMVMQ .IPAMNLVMQ .IPAMNLVMQ .IPAMNLVLQ

...S.NIPIL ...P.EIPLL ...PA.VALL ..NPELVALL ...PA.VALL ...PATVALL ...PA.VALL

.L..S.V. .V..SVV. .S..A.L. .S..A.L. .S..A.L. .S..A.S. .S..A.L.

1 11 21 31 41 51 | | | | | | GTGGTCTTGG CATGTATAGT CAACGAAATT ATTGCTTTAG CAACGCTCGA .......... .......... .......... .......... .......... .G........ .......... .......... ........G. ......C... .CTA.GACC. ..CTATCGC. .C.A...G.. G..T..C.T. .G.G..AG.G ..TA.GACTT ..CT.TC.C. ...A...G.. G..T..C.TT .G.GA.AA.G ..TA.GA..T ..CT.TCGC. ...A...G.. G..T..C.T. .G.G..AAAG ..TA.GAC.T ..CT.TCGC. G..A...G.. G..TT.C.T. .G.G..AAAG .ACA.TGCTT TGAC.GCGA. GG.G...T.G C..AGGC.TT TC.ACACAA. .ACA.TGCTT TGAC.GCGA. GG.A..GC.G C..AGGC.TT TC.ACACAA. .ACT.GGCTC TTGCGGCTA. GG.G..GC.. G.GAAGA.G. .TCA.AGAC. .ATT.AGCTC TTGCGGCTA. GG.T..GC.. G.GAAGA.G. .TCA.ACACG .ATT.AGCTC TTGCGTCTA. GG.T..GC.. G.GAAGA.G. .TCA.ACACG .A.C.TGCTC T.ACGGCCA. GG.T..GC.. G.GAAGC.C. .TCACAGT.. .A.C.AGC.. TTGCAGCTA. GG.G...C.C G.GAGAA.G. .TCAAGC..T .A.A.TGCCA ACC.AGCCAC .CTT...C.. CAGAAGA.G. .C..CTC..G AATC.TGCCA TTACGGCTT. G.GA..GT.G ...A.A..G. G.GAAG.G.. .ATC.AGCCA ATACGGCTA. G..G..GT.G ..A.TG..G. G.GAA.CG.. .A.T.GGCTT TGGC.GCCA. GG.T...T.G G..AAGA.G. .TCA.ACT.. .A.T.GGCTT T.GC.GCCA. GG.T...T.G G.GAAAA.G. TTCA.ACG.. .ATC.TGCAC TTGC.GCTA. GG.T...T.G ..AAAGA.G. .TCA.A.T.. .ATC.TGCAC TGGC.GCTA. GG.T...T.G ...AAGA.T. .TCA.G.T.. .A.C.TGCA. TTGCAGCCA. GG.G..GC.A G..AGAA.G. .CCAAA.G.. .AAA..GC.. TTGCAGC.A. GG.G...C.A ..GAGAA... .TCA.GCT.G .A.C.TGCA. TGGCAGCCA. GG.G...C.A ...AGAA.G. .TCAAA.G.. .ATC.TGCCA TGGAAGCCA. GG.T..GT.G T.GAAG.... ..GAAT.G.. .A.C.GGCTT T.AC.GCTA. GG.T..GT.A G.GAAGC.T. .TCA.AGT.. .ACT.A.CC. TGGC.GC.A. GG.A..GC.C ..GAGGA.G. TTCAAG.A.. .A.C.AGC.. TTGCAGCTA. GG.G...C.C G.GAGAA.G. .TCAAACT.G .A.T.AGCT. TTGCAGC.A. GG.A..GC.. G.GAGAA.G. .TCAAACT.G .A.A.T.CTA ACC.AGCCAC .CTT...C.C CAGAAGA.G. .C..CTCA.G .A.T.GGCAT TTG.AGCCA. GG.G..GC.C T.G.TGA.G. .TCAAG.G.C

8

At1g34650_HDG10 ACACT.AA.. At1g17920_HDG12 TGAG...C.. At1g73360_HDG11 TGAA...C.A At3g03260_HDG8 .GAA.AA..T At5g17320_HDG9 TGA.A.AA.. At5g46880_HDG5 AGAG...... At4g17710_HDG4 .C.CTTG.G. MpC4HDZ1 CGAG..GC.A PcC4HDZ1 AGA....C.T SmC4HDZ01 AGAG..AC.. SmC4HDZ03 AGACGG.GCA PpC4HDZ123 .GAG..GC..

At4g26920 CGTTGATGCT At4g26290_annotated .......... A_lyrata_XM_002869541 .......... Eutrema_parvulum_AFAN01000032 .TG....... Capsella_rubella .T.C...... A_lyrata_5g07260 ATG....... At5g07260 ATG....... Brassica_napus_DQ166821 .A.GA..... Brassica_napus_DQ182489 .A.GA..... Brassica_napus_DQ182490 .A.CA..AGC At3g61150_HDG1 .A.CA..AGC A_lyrata_XM_002876550 .A.CA..AGC Thellungiella_halophila_AK353253 .A.CA..AGC Thellungiella_halophila_AK352868 AA.GA..CAC Brassica_rapa_EU826522 TA.G..C..A At4g25530_FWA .A.GAC.TGC Turritis_glabra_AB367817 TA.GAC.AGC Pt_XM_002320719 .A.CA.CAG. Pt_XM_002301295 .A.AA.CAG. Pt_XM_002318195 ...CA.CATC Pt_XM_002322424 ..CCA.CAGC Pt_XM_002320422 .A.GA.CCAC Pt_XM_002305566 .A.GA..CA. Pt_XM_002302844 .A.GA.CCAC At5g52170_HDG7 TA.CA..AGC At4g00730_ANL2 .A.CA..AGC

.AAA.AGC.A A.AACGCG.. GGCA..GG.. ..GAG.C.CA TTCAAA.G.. AACA.TGCT. TGACCGCTA. GG.A...T.G C..AGGC.TC TTCAAACAA. .GCA.TGCTT TGAC.GC.A. GG.A...T.G C.CAGGC.TC TTCA.ACAA. .AAA..GC.. ..A..GCG.. TG.A..GC.C .AGCGGC.GT TTTT.GCT.. .AA.CAGC.. A.AAAGCG.. GTCA..GG.. T.GAGCC.CA TTCAAA.G.. .AAT..GCT. TC.C.TGT.. TC.A...C.. .C.AAAA.GT GTGACACA.. C.T.CAG.TT .T...GCTCG AG.ATT.GCA .AGATG.GT. AC.TAAAT.. .A.C.TGCT. TCGCAGCTA. GG.....C.. G..CGCC.T. ..CAATCA.. .ACC.TGCTC TGA.CTC.A. GG.A..GC.C C.ACG..... .TCAAGGT.G .A.C..GC.A TCATCGC.A. GG.G..GC.G C.G..AC.G. .TCAATCGAG .A.C.TGCAA T.GT.GCGA. GG.A..GC.G C.GCTGG.T. ..G.TGAAAC .A.C.GGC.. TGGCGGCGA. GG.G..GC.G G.GCGAA.G. TGCA.GCG.. 61 71 81 91 101 111 | | | | | | TGGAGGAGA- --TTTGTCCA TGAGGCGTCA AGAGCTTCTG AAGTAATCCA .......... .......... .......... .......... .......... .....T.... .....T.... .......... .......... C........G ....CA---. .......A.. ...A..T... .......... C.C......C ....CA..T. .......A.. .....TC... .......... C.T.GG...C ...-CA..C. .......A.. ......A... .......... CGT.TG...C ....CA..T. .......A.. .....TA... .......... CTT.TG...C ....CT.... ..G..AGAAC C........T ..GT...... GTA.TG..TT ....CT.... ..G..AGAAC C........T ..GT...... GTA.TG.TTT ...GTTC.G. .......GTC ...A..T..T .A..AAA... G.AACG.TAT ...GTTC... ........TC A..A..T..T .A..AAG... G.ACTG.TAT ...GTT..G. ...A....TC A..A..T..T .A..AAG... G.ACTG.TAT ...GTT.A.. ....A..TAC C.....T..T .A.ATC..C. GTA.GG..AT ...GTTTC.. ....GAGATC A..A...... ....AA..C. CT..TG.TAT ...CTCC.C. ..AAAAC.AT ...A..C..C C...A.GTG. GTA.CG.GTT ....T..T.. ..CAAA.TGT G.....T... ......AAA. GTT..G.T.C ....CT.TC. ...G...TGT G.....T... ....A.A.A. GTT..G.T.C ....TC..G. .......TAG ......T..G ....AGA... GGA.GG.AAT ....TCG.G. .......TAG ......T..T ....AGA... GTA.GG.AAT ....TC.A.. .......TAC ......TA.. C.G.TGAG.. GT...G.A.T ....TC.A.. .......TAT C.....AA.T C.G.AGAG.. GT...G.A.T ....T.G.T. .....AAATG ......T... ....AG.... CT..TG.TAT ....TTCA.. ..A.GAGATC ...A..T... ..G.AA..C. C...TG.TAT ....T..AC. .....AAATG ......T... ....AG.... CT..TG.TAT .....TTC.. ..AACCATTT CCCT.GT... ....AAA... GTT..G.A.T ...GTC.A.. ....A.CTAC ...A..T..T ...A.C.... GTA.GG..AT

9

At1g05230_HDG2 .A.GA..CA. At4g04890_PDF2 AA.GA..CAC At4g21750_ML1 .A.GA..CA. At1g79840_GL2 TA.G..C..A At2g32370_HDG3 AA.GTG.C.. At1g34650_HDG10 AA.G.....A At1g17920_HDG12 .AC.A..... At1g73360_HDG11 .A.GA..... At3g03260_HDG8 T..A..A..G At5g17320_HDG9 AA.G...... At5g46880_HDG5 .A.GA.CAGC At4g17710_HDG4 GT.GA.CTGC MpC4HDZ1 GA.GA.C.G. PcC4HDZ1 G..AA...GA SmC4HDZ01 GA.GA...GA SmC4HDZ03 GA.GA....A PpC4HDZ123 GA.GA.C.GG

At4g26920 CCCATCACTC At4g26290_annotated .......... A_lyrata_XM_002869541 .......... Eutrema_parvulum_AFAN01000032 ......GA.T Capsella_rubella T......A.. A_lyrata_5g07260 ...G...A.. At5g07260 T..G...A.. Brassica_napus_DQ166821 T..T.GTA.A Brassica_napus_DQ182489 T..T.GTA.A Brassica_napus_DQ182490 T.....TA.G At3g61150_HDG1 T..TAGTA.G A_lyrata_XM_002876550 T...AGTA.G Thellungiella_halophila_AK353253 T..G.GTAA. Thellungiella_halophila_AK352868 .T.TGGGA.T Brassica_rapa_EU826522 TG...GTT.G At4g25530_FWA TG..C.TA.A Turritis_glabra_AB367817 TG....TA.T Pt_XM_002320719 T..T.GCG.G Pt_XM_002301295 T..T.GTA.G Pt_XM_002318195 T..CAGCT.G Pt_XM_002322424 T..CAGCT.G

....A...T. ...A.AGATC A..A..T..G C...AAAGC. CG..TG.GAT ...CTTTC.. ....AAGATC A......... ...CAA.... C...TG.TAT ...GTTTC.. ....GAGATC A..A..T... ....AG...A CT..TG.TAT ...CTCC.C. ..AAAAC.AT C..A..A..T ....A.G.G. GGA.TG.GTT ....T.G... .....CGAAC C.....A..C ..G.AAA... C.C.CG.GGC ....TT.AG. ...C.CAT.. ....T.T..T .TG.AAGTC. TG..GG.T.. ....TC.A.. ..C...GAAT G..A..A..T ...T...... GT..TG.TTT ....CA.... .....CGAGT C..A..A... ..GT...... GTA.TG..TT ...GTT.AG. ..GC.CATGT C...T....C .A....GTCA CG..GG.T.. ....AA.AG. ...C.TCTA. A..T------ ------GTC. TG..GG.T.. ....TT.AG. .....C.TAG A..A..T... .AG..GAA.. C...CG.TAT AAT.A..AG. .....CG.AG A..A..T... ......AA.. C...C...AT ....TTCCT. ....GAAGAC G.....CA.T C...AGA... GTC.TG..AT ...GT.CA.. ..A.GCA.AC ......TA.. ..G.AG.... GTT.GG..AT ....TCCTG. ..C.CAAGGC C..A.TTA.T C.C.ACA... GTC.CG.GAT CT.T....T. ..A.G.AAAC A..A..A... ....AGA... GTC.GG..AT ...GT.C.G. ..C.GAGGAC G......AGC C.C.AAA... CGC.GG.TAT 121 131 141 151 161 171 | | | | | | TCATGGCTTT TGACAAAACT TAAAAACCCT ATGCGTTGGG TGACTATTTT .......... .......... .......... .......... .......... .........G ...G...... .G....T... G......... ...G...... .T..C...GG .C....CT.. ..TCG.T... GAAATG...C CA.AA..C.. ..G.T....G ..GA...T.. ..CG..T.A. G..AA....C AA.AA..C.. ....C..... ..G....T.. ..TG..T.A. -.AAC....A AA.AA..C.. ....C...CG ..G....T.. ..TG..T.A. G.AA.....C AA.AA..C.. ATGAC....G .CGAC.TGT. C.TGG.TGG. G.CAAG.... GAGAGC.C.. ATGAC....G .CGAC.TGT. C.TGG.TGG. G.CAAG.... GAGAGC.C.. .T.GCA..CG .TGA..CCT. G.TGG..T.G GAA..A.... C.GAG..G.. .T.GC...CG .TGA..CCT. A.TGG..T.G GAA..G.... C.GAA..G.. .T.GCC..CG .TGA..CCT. G.TGG..T.G GAA..G.... C.GAA..G.. .T.GCT..CG .CGAG.CCT. A.TGG..T.C .AT..A...A C.GAG..G.. ATCAAC..CG .TGAG.TT.. C.TGG.TGTG .AT.AA...T CTTG.G.A.. CATAAA...G CCCA..GTT. C.TGG..GTG GA..AA...A AAGAG..G.. GTGACT..GG .C.AG.CT.. ..TGG..A.A GGCAAA.... .C.ACG.G.. ..GACT..GG .C.AG.CT.. ..TGG..A.G GGCAAA.... .C.ACG.G.. .TGGCC..AG .TGAG.C... G.TGG.TT.A .AC..A.... CAGAA..G.. .TGGCCT.AG .TGAG.C... G.TGG.TT.G .AC..A.... CAGAA..G.. ..GGCT..AG .TGAG.C.T. G.TGG..GTG .ATG.A.... ..GAG..G.. .TGGAT..GG .TGAG.C.T. G.TGG..GTG .ATG.A.... ..GAA..G..

10

Pt_XM_002320422 TT.CGGCA.A Pt_XM_002305566 .TGTGGTA.T Pt_XM_002302844 TT.TGGCA.T At5g52170_HDG7 TGAG.GCA.A At4g00730_ANL2 ...G.GTAA. At1g05230_HDG2 .G.GGGGA.G At4g04890_PDF2 .T.TGGGA.T At4g21750_ML1 .TGCGGGA.T At1g79840_GL2 TG...GCT.G At2g32370_HDG3 TG.CGG.A.T At1g34650_HDG10 T...A..A.T At1g17920_HDG12 T..C..GA.. At1g73360_HDG11 T..C..TA.. At3g03260_HDG8 T...A.GA.T At5g17320_HDG9 T...A..A.T At5g46880_HDG5 .TGT...A.. At4g17710_HDG4 .TTCC.CA.A MpC4HDZ1 T.TC.GCA.G PcC4HDZ1 TT...GCA.. SmC4HDZ01 .T...GTA.T SmC4HDZ03 TT.GAACA.A PpC4HDZ123 ...G.GCG.G

At4g26920 GGTCATTCCG At4g26290_annotated .......... A_lyrata_XM_002869541 ...A...... Eutrema_parvulum_AFAN01000032 AT.A....AA Capsella_rubella .T.A....AA A_lyrata_5g07260 TT.A....AA At5g07260 TC......AA Brassica_napus_DQ166821 AT.AG.AG.A Brassica_napus_DQ182489 AT.AG.AG.A Brassica_napus_DQ182490 .C..G.G..T At3g61150_HDG1 AC..G.G... A_lyrata_XM_002876550 AC..G.G..T Thellungiella_halophila_AK353253 .T.GG..... Thellungiella_halophila_AK352868 .T.GG.C... Brassica_rapa_EU826522 A..TG.C..A At4g25530_FWA .C.GG.A..A

ATCAAC...G .TGAGT.T.. C.TGG.TGTG .AT.AG...T CT..AT.G.. GTTAAT...G .TGAG.TT.. ..TGG..G.G .AT.AA...T CA........ ATCAAC...G .CGAGT.T.. G.TGG.TGTG .AT.AG...T CT..AT.G.. .TGGCT..AG .CGAG.CT.. ..TGG..A.G .ACAAA.... CAGAA..G.. .T.GCT..CG .CGAG.CTT. A.TGG..T.C .AT..G...A C.GAG..G.. GTTAACA.CG .TGAG.TT.. C.TGG.TGTG .AT.AA...T C...G..... ATCAAT..CG .TGAG.TT.. C.TGG.TGTG .AT.AA...T CTTG.G.... ATCAAT..CA .TGAG.TT.. A.TGG.TGTG .AT.AA...T CT.G.G.G.. CATAAA...G CCCAG.GTT. C.TGG..GTG GGA.AA...A AAGAG.CA.. A.TG.CA..G .TGA..TG.. C.TGC.AGAG .AT.TG...T CA..A..G.. AG.AACT.GG .CGAC.TGT. CTT...TA.G GA.AAA.... C..GGC.... ATTACA...G ..GAC.TG.. ..TG...T.. G.CAAA.TAA CAGAGC.... ATGGCA...G .CGAC.TGT. C.TGG.TTG. G.CAAG...A CAGAAC.C.. AT.AACT.GA .CCA..TGT. CTT.G.T..G GAAAAA...A AAGAGC.... GG.AACT.GA .CGAC.TCT. CTT..CTG.G GA.AAA.... C..GGC.... AT.ACA..AG .TGACGCCT. .CT....G.. GATAAA...T CAGAG..G.. AT.ACC..CG .C.A.GC.T. CCTTG.TG.. GATAAA...T C.GAA..G.. GTTACTT.GG ..GA..CT.. G.TGG.T.A. G.T..A...A CCGAA..G.. G.GAACT.GG ...AT.TGT. C.TGG.TG.G .GC..G...A ..GAG..G.. G.TGCCT.GG .TGAC.CTA. C.TGG..G.G GGAA.A...A ..GA...G.. G.GG....GA .TGAT.CCA. C.TG...GT. GGCA.A.... .CGAC..G.. GTGAAC..GG ..GAG.CG.. GCTGG..G.G TC..AG.... C.GAG..G.. 181 191 201 211 221 231 | | | | | | GTTGGAAATG TGTCA---AT TGATATGGAG TTTCTGACAT TAATTACCCC .......... .......... .......... .......... .......... .......... .......... .......... .....A.... ....C..... ....C.G.A. C...T..... .ATA.....T ...A..C..C ....CTT... A.C.CCG... ....T...C. .A..G....T ...A..C..C A...CT.T.. A.C.C.G... ....G..... .A..G....T ...A..C..C A...CT.... A.C.CGG... ....T..... .A..G..A.T ...A..C..C A...CT.T.. ....C.TCGT CTAA....T. GATGTAT..A GAAA..G..G .GC..T.T.. ....C.TCAT CTAA....T. GATGTAT..A GAAA..G..G .GC..T.T.. A.CTC..GAA CA......C. GATGCAC.CA GAG..TCA.C .GT.GT.T.. ..CTC..GAA CT......C. GATGCAC.CA GAG..TCA.C .GC.GT.T.. A.CTC..GAA CT......C. GATGCAC.CA GAG..TCA.. .GC.GT.T.. ....C..GA. CCG.....T. GATG.AT.C. GAG..ACAGG .TT.GT.T.. ..GTC..GA. CC.TG...G. GATG.CT.CT GAGT.TCA.G .TCC.T.T.. A.CTC...G. C.GTT...T. GATGT.T.G. GAGA..CAGC .GC.C..T.. ..CCCTGTG. CA......CA GAT.CAA.CA GAAT.TCA.G .....T.T..

11

Turritis_glabra_AB367817 .C.GG.A..A Pt_XM_002320719 TT.GG....T Pt_XM_002301295 AT.GG..... Pt_XM_002318195 TT.TG.C..T Pt_XM_002322424 TT.TG.C..T Pt_XM_002320422 AC.TG....T Pt_XM_002305566 AA.TG....T Pt_XM_002302844 AC.TG....T At5g52170_HDG7 AT.AG....A At4g00730_ANL2 AC.GG..... At1g05230_HDG2 AT.AG.C..A At4g04890_PDF2 CC.AG.C..A At4g21750_ML1 .C.TG.C..T At1g79840_GL2 ....G.C..C At2g32370_HDG3 .C.AG.CA.A At1g34650_HDG10 AT.GG.G.TA At1g17920_HDG12 AT.GG.AA.A At1g73360_HDG11 TT.AG.AG.A At3g03260_HDG8 AC.AG.G..A At5g17320_HDG9 AT.GG.G..A At5g46880_HDG5 TC.GG....A At4g17710_HDG4 AT.GG.A..A MpC4HDZ1 CC.TG....C PcC4HDZ1 TC.TG.G..T SmC4HDZ01 .C.GG.G... SmC4HDZ03 TT.GG....T PpC4HDZ123 .C.GG.G...

At4g26920 TACATGGATT At4g26290_annotated .......... A_lyrata_XM_002869541 .......... Eutrema_parvulum_AFAN01000032 .......G.. Capsella_rubella .......G.. A_lyrata_5g07260 .G.....G.. At5g07260 .......GC. Brassica_napus_DQ166821 A.GC...... Brassica_napus_DQ182489 A.GC...... Brassica_napus_DQ182490 .GTT...GCA At3g61150_HDG1 .GTT...GCA

..GCCTGTGT CA......CA GAT.CAA.CA GAAT.TCA.G .C...T.T.. A...CC.GGA CC..T...T. GATGCAC.C. GAG..CCA.G .CT.AT.... A...C..GGA CC..T...T. GATGCAA.CC GAG..CCATG .TT.AT.... A...CT.GA. CTG.C..... GATACAC.CC GAGT.TCA.. .G...T.G.. A...CT.GA. CTG.G..... GATACAT.CT GAAT.TCA.G .G...T.T.. ..CTC..GG. CT.TG...G. GATG.C..CT GAAT.TCA.C .TCCA..T.. ..ATC..GA. CAATG...G. GATG.CA.CT GAGT.TCA.G ..CC.T.... ..CTC..GA. CT.TG...G. GATG.CA.CT GAAT.TCA.C .TCCA..T.. ..C.CGGT.. CA......T. GATGCAA.CA GAGT.TCA.G .G..GT.T.. ..C.C..GA. CTA.....C. GATG.AT.CA GAG..ACA.G .TT.GT.T.. ...TCT.GA. CAATG...G. GATG.GT.CA GAGT.TCA.G .TCCAT.T.. ..GTC..GA. CC.TG...G. GATG.CA.CT GAGT.TCA.G .TCCAT.A.. ..ATC..GA. CA.TG...G. GATG.CA.CA GAGT.CCA.G .CCCAT.G.. A.CTC...G. CTG.....C. GATGT.C.GA GAGA..CAGC .GC.C..T.. .....T.GA. CCAGG..... AATG.GT.CT GAGTACCA.G .GC..T.... ..GACCG.A. CTAA....G. G.TATAT... CAA...CACA ..C.GT.A.. ....C.TCAT CTAA....T. GATG..T..A GAG..TCA.G .GC..T.A.. A...C.GC.T CTAA....T. GTTGTAT..A GAAA..GA.G .GC..T.G.. ..GAAC..G. CCAAC...G. GATGTG.... CAA..ACACA .TC.GT.G.. ..GAACG.A. CTAA....G. GAT.TAT... CAA...CACA ..C.GT.A.. ..G.CT.GG. C.AA....C. GATGT.T.CA GAGT.ACAGG ..C.AT.T.. ..CTC..GC. CCAA....C. GATGT.T.CA GAGT.ACA.G ..G.GT.T.. ..CTCT.GA. CACTC...C. GATGTAC.CT GAG..TCAGG .CT.AT.T.. ..GTCC..G. CCCTG...-- -ATGTAT.CC GAGT.ACA.G .C..GT.A.. A..TCT.GA. CT.TG...C. .ATGTAT.CC GAGT.CCA.G .GT.AT.A.. ..GTC..GA. CT.TT...C. GTTGTAT.CA GAAT..CA.A .CC..T.G.. ..GTCGCGG. C.GTG...C. GATGTAC.C. GAG...CAGG .GC.GT.G.. 241 251 261 271 281 291 | | | | | | ACCCGAAAAG TAAAAGTTCT CCGATATTGT CACCGT---A TAGCAAATGA .......... .......... .......... .......... .......... .........A .......... .......... .......... ...G...... .....GC.T. .GG..C.... T....GC.T. AGG.AA.... ...AG.CC.G ..T.....T. .G..GC.... T...CGC.CA ATG.A..... .T.AG..... ..T.....T. .G...C.... T...CGC.CG AGG.A..... ...AG..... ..T..G..T. .G...C.... T...CGC.CG AGG.A..... ...AGG.... ..A..CG.GT .CTGC.AG.. A.....C... ..GATG.... .T.A.C.A.G ..A..CG.GT .CTGC.AG.. A..C..C... ..GATG.... .T.A.C.A.G GTG...C... .CTCGT.CT. GA.G.TC... A.G.AG...C AC..GG.A.G GTTA..C... .GTCGT.CT. G..G.TC... A.A.AG...C AC..GG.A.G

12

A_lyrata_XM_002876550 .GTT...GCA Thellungiella_halophila_AK353253 GGT....GCA Thellungiella_halophila_AK352868 .T.T...GC. Brassica_rapa_EU826522 G.A....GCC At4g25530_FWA CTT....G.G Turritis_glabra_AB367817 CTT....G.G Pt_XM_002320719 GGTT...GC. Pt_XM_002301295 GGTT...GC. Pt_XM_002318195 .GTG...GC. Pt_XM_002322424 .GTG...GC. Pt_XM_002320422 ...T...GCA Pt_XM_002305566 G..T...GCA Pt_XM_002302844 G..T...GC. At5g52170_HDG7 .TT....GCG At4g00730_ANL2 .GTG...GCG At1g05230_HDG2 .T.G...GCG At4g04890_PDF2 CT.T...GC. At4g21750_ML1 ..TT...GCG At1g79840_GL2 G.A....GCA At2g32370_HDG3 .TTG...GCG At1g34650_HDG10 .CTC...T.G At1g17920_HDG12 A..T...GCA At1g73360_HDG11 A.GC.....A At3g03260_HDG8 G.TC...... At5g17320_HDG9 .GTC.....G At5g46880_HDG5 A.AT...GCA At4g17710_HDG4 A.A......G MpC4HDZ1 A.TT...GC. PcC4HDZ1 .GTG...GCA SmC4HDZ01 AGTG...GCG SmC4HDZ03 CGTG...GCG PpC4HDZ123 GGTG...GGG

At4g26920 TATTATTAAA At4g26290_annotated .......... A_lyrata_XM_002869541 .......C.. Eutrema_parvulum_AFAN01000032 .C..G.GC.G Capsella_rubella .C..G.CC.G A_lyrata_5g07260 .C....CC..

GTT...C... .GTCGT.CT. G..G.TC... A.A.AG...C AC...G.A.G GTT.....T. .C..TT.C.. ...G.TC... A.G.AG...C AC..GG.A.G ..A..TG.GA ACT.CT..G. GA....C... A.A.AG...C ATAGCG.C.G ..AA..G... .CT.CT..G. GA..AG...C .G..AG...C .GAGCCC... .AGA...... ...CGT..A. TA....C..C A.AGAG.... .CAG.C.G.G .AGA..C... ...CGT.... TA....C..C A.AGAG...C .CAA.C.C.G GTT..TG.G. .C..TT.... .....T...C A.G.AG...C AC..CG.G.G GTG..TG.G. .C..TT.... .....T...C A.G.AG...C AC...G.G.G GTG..TC... .G..GT.C.. T..G.TA..C A.G.AA...C .CA.GG.G.G GTG..TC... .G..GT.C.. T..G.TA..C A.G.AA...C .....G.G.G ......G..A GTT.CT..G. GA.G..C... A.A.AA...C AT..CG...G ..A...G..A ATT.CT..G. GA.G..C..C A.A.AG...C ATA.CG...G ..T..TG..A GTT.CTA.G. .A.G..C... A.A.AA...C AT..TG...G .T.AA.C..A AG..GT.... TA.......C A.G.AA...C AT.G.G...G GTT..T..T. .T..TT.C.. ...G.TC..C A.A.AG...C AC..GG.G.G ..A..TG..A CCT.TT.CGC A..T..C... A.A.AA...C A..G.G...G ..G..TG.GA ACT.CT..G. GA....C..C A.A.AA...C ACAGTG.C.G ..T..TG.GA ACT.CT..G. AA.G..C... A.A.AG...C ACAGTG.C.G ..AA..G... .GT.CT.CG. GA..AGC..C .GG.AG...C .GAGCCC... .....CG..A GCT.CT.CG. ...C..C... A.G.AA...C A..G.G.G.G C.TA..G.GT .T.T.A.C.. AA..AC...C ..G.AA.... .GAA.G.A.. ..G..TG.GT .CTGT..G.. AA........ ..G.AA.... .C.A.C...G ..A..CG..T .CTGC.AG.. A..C...... ..A.AG.... CT.A.C.A.G G.AA.GG..T .C.TG..CG. .A.G.GC..C ..AGAG.... .C.AG..A.G C.GA.GG..T .T.TGA.C.. AA.GAC...C ..A.AA.... .T.A.G.CA. ..TA..G... CTT.TT.... T..G...GTG G.A.AA...G C..A..CC.G ..AA..G... CTT.TT.... T..G...GTG G.G.AA...G CT.A.G.A.G G.A..TG.GT GCT.CT.CT. G....T...C A.G.AG...C AT.GCG.A.G ..T..TG.CA .GT.TT..T. GA.G.T...C A.G.AA...C CT...G.G.G ..GA..G.G. CTT.CT..T. G..C..C..C A.G.AA...C AC..GG.A.G ..A..GG.GT .CTTCT.... .A.G..C..C A.G.AG...C ATT.CG.ACG ..G..GG.G. .GT.CT.C.. G..G..C..C A.G.AG...C AT..GG.G.G 301 311 321 331 341 351 | | | | | | ATAGCGGATA TCTCCATG-- -CCTGAGTTT CTCAGATTTC CTTCTGGATT .......... .......... .......... .......... .......... .....A.... .......... .......... .......... .......... ..C....... ....GG.... ...CA.T... A.G....... .........A ..C..AT... ....T..... ...CA.T..C A.G....... .........A ..C..C.... .A..TG.... ...C...... A.G....... .........A

13

At5g07260 .C....CC.. Brassica_napus_DQ166821 CT.G...C.G Brassica_napus_DQ182489 CT.G...C.G Brassica_napus_DQ182490 CC.CG.AC.. At3g61150_HDG1 CC.CG.AC.. A_lyrata_XM_002876550 CC.CG.AC.. Thellungiella_halophila_AK353253 .G.AG.GC.. Thellungiella_halophila_AK352868 .C.C...C.. Brassica_rapa_EU826522 C..C..CG.G At4g25530_FWA ...C..AG.C Turritis_glabra_AB367817 ...C..CG.G Pt_XM_002320719 .G.GG.GC.. Pt_XM_002301295 .G.GG.GC.. Pt_XM_002318195 C.....AC.. Pt_XM_002322424 C..A..AC.. Pt_XM_002320422 .T.G...C.. Pt_XM_002305566 CT.G..CC.. Pt_XM_002302844 .T.A...C.. At5g52170_HDG7 C.....AC.. At4g00730_ANL2 .G.GG.GC.G At1g05230_HDG2 .T.G...C.. At4g04890_PDF2 .C.G...C.. At4g21750_ML1 .C.G...C.. At1g79840_GL2 C..C..CG.G At2g32370_HDG3 .C.G...C.. At1g34650_HDG10 .C.A..CC.. At1g17920_HDG12 CT.G..CC.. At1g73360_HDG11 CT.G...C.G At3g03260_HDG8 CC.C..CC.. At5g17320_HDG9 GC.C...C.. At5g46880_HDG5 C..C...C.G At4g17710_HDG4 C..C...C.. MpC4HDZ1 CC.....C.G PcC4HDZ1 CG....CC.G SmC4HDZ01 .T.G..CC.. SmC4HDZ03 CT.C..CG.. PpC4HDZ123 GT.G..CC.G

At4g26920 TAACTATGTT

..T..A..A. .T..T..... ......A.A. A.G....... .........A G.T.TAA..G ....ATAT.. .T..C.C.CC TAT.AG.... .........G G.T.TAA..G ....ATAT.. .T..C.C.CC TAT.AG.... .........G G.G.TA...G ....A..T.. .T.AAGC.G. AGA..G..A. .A..G..T.G G.C.T....G ....A..T.. .T.GAGC.G. AGA.....A. .A..C..T.G G.G.TC...G ....A..T.. .T.CAGC.G. AGA..G..A. .G.....T.G GCG.T...CG .T..A..T.. .GTCATAA.C .GGC..C... .A..G..C.G G.G.TT...G ....TT.... .T.AAGAACC AGA...AGG. ....A..T.G ..C.T....G .A..GG.C.. ..TGAGA.G. .GG.AGCGC. ....A..G.G G.C.TC..CG .TA.TCCT.. .GG.TGT.C. AAG..GC.A. .C..A..CC. G.C.TC..CG .TA.TCCT.. .GG..CT.C. AAT..GC.A. .C..A..TC. G.T.TT...G .G...G.C.. .GTGA.C.G. AGG..GC... .......C.G G.T.TT...G .A.....T.. .GTGA.C.GC AGG..GC... .......T.G G...TT...G .T.....T.. .GTAACC.GC AAG..GC... .......C.G G.G..T...G .T..TG.T.. .GTAACC.GC AGA..GC... .......C.G G.G.TT...G .T..AT.... .G..AG..G. .GA..GAGG. .A.......G G.G.TA...G .T...T.... .T.GA.A.G. AGA...AGG. .A..A..T.G G.G.TT...G .T..AT.... .G.AAGA.G. .GA...AGG. .A.......G G.C.TC...G .T..TTAT.. .GG..GC.C. AAA.TG.... ....A....G G.G.T....G .A..A..T.. ...G.TAA.C .GGC..C.A. .A..A....G G.T.TC.... .T..GT.... .G..AGA.GC AGGC.GCGAG ....A....G G.G.TT...G ....TT.... .TTAAGAAC. AGA...AGG. ....A..T.G G.T.T....G ....TT.... .A..AGAAGC AGA...AGA. .C.....T.G ....T...CG ....GG.C.. ..TGA.A.G. .GA.A.C.C. .C..C..T.G G.G.TC.... .T.....C.. ..TAA.A.G. .G.C.CCGA. .C.......G .....T..CG .T...TGT.. ....ATT.GC AC..A.CG.. .C..C..TG. ....TAA..G ....ATAT.. .T..CG..CA TAT....... .......T.G G.T.TAA.CG ....ATAT.. .T..C...CC TAT....... .A.......G ..T..T..CG .G..ACAC.. .G...CT.G. TA..A.CG.. .......T.G ..T..T...G .G..GTGT.. ...CATT.GC AC..A.CG.. .C..A..TG. ..C.T....T ..C.G..C.. ..A.....A. AAG...AAA. ....A..C.G G.T.TA...T ..C.G..C.. .GA.C.A.AC .GG...AAA. .......T.G G...TC...G .G..TG.C.. .ATGCGT.C. .GA...CGG. .........G G.T.T....G .T..TG.... .GTGAGA.GC .AG...CGG. ....C....G ....TC...G .T..AG.... ..TCCG.AAC AGGCTTCG.. .CC.G..G.. ..T.TT..CG ....T..C.. .TTGCG..GC AGG.TGCG.. .......T.. G.G.T....G .G..GG.... .ATGAG..G. .GGC.GCGG. ....G..GG. 361 371 381 391 401 411 | | | | | | CATGTAGCAA GGATATTCAG AGTTCAATCG GTTAAACGTA TGTACGTAGG

14

At4g26290_annotated A_lyrata_XM_002869541 C..GG.G.A. Eutrema_parvulum_AFAN01000032 ...GG.A.AA Capsella_rubella ...GG.A.AA A_lyrata_5g07260 ...GG.ACAA At5g07260 C..GG.A.AA Brassica_napus_DQ166821 .G.AG.AAAA Brassica_napus_DQ182489 .G.AG.AAAA Brassica_napus_DQ182490 CG..G.GACC At3g61150_HDG1 .G..G.AAAC A_lyrata_XM_002876550 .G..G.AAAC Thellungiella_halophila_AK353253 CG..G.GAAC Thellungiella_halophila_AK352868 GG.TG..AGA Brassica_rapa_EU826522 GTCTGC.TCC At4g25530_FWA ...TG.GAG. Turritis_glabra_AB367817 ...TG.GAG. Pt_XM_002320719 .G.TG.AAGC Pt_XM_002301295 .G.TG.AAGG Pt_XM_002318195 .G.TG.GAG. Pt_XM_002322424 .G.TG.GAG. Pt_XM_002320422 GG.TG..AGA Pt_XM_002305566 GG.TG..AGA Pt_XM_002302844 GG.TG..AGA At5g52170_HDG7 CG.AG.GAG. At4g00730_ANL2 CG..G.GAAC At1g05230_HDG2 .G.TG.CAGA At4g04890_PDF2 AG.TG..AGA At4g21750_ML1 GG.TG..AGA At1g79840_GL2 GTCTGCATCC At2g32370_HDG3 AG.TG...CA At1g34650_HDG10 G.CTG..AAA At1g17920_HDG12 CG.GG.GCAA At1g73360_HDG11 .G.AG.AAAA At3g03260_HDG8 GG..C..AAA At5g17320_HDG9 G..TG..AA. At5g46880_HDG5 AG..G.GAAA At4g17710_HDG4 AG.AG.GAAG MpC4HDZ1 CG..G.CAGA PcC4HDZ1 .G.TA..CGA

.......... .......... ...A------ ---------- ---------- --------...C.TC... ATGG..ATTC .AAGGT.A.A A..TTGGAAC AC.GG..TTA ..CA..T.C. ATGGTG..TC GAGGGT.A.C ...CTTGACC AT.GG..TTA .G......C. ACGGTA..TC CAAGGTCA.T ...CTCGACC AC.GG..TTA ...A....C. ATGGTA..TC CAAGGT.A.C A..CTCGACC AT.GG..TTA ...A....C. ATGGTA..TC TAAGGT.A.C A..CTCGACC AT.GG..TTA G.CA.GC.T. ATGG..A.TC CAAGGTTA.T TGGGTTGAAC ATGTT.A.AC G.CA.GC.T. .TGG..A.TC CAAGGTTA.T TGGGTTGAAC ATGTT.A.AC G.CA.G..C. ATGG..A.TC TAAGGT.A.. TGG.TCGAGC ACACA.AGTA G.CA.G..C. ATGGC.A.TC .AAGGT.A.A TGG.TCGAGC ACACG.AGTA G.CA.G..C. ATGGC.G.TC TAAGGTCA.A TGG.TCGAGC ACACG.AGTA G..A.GT.T. ATGG..A.TC .AAGGTCA.. TGGGTGGAGC ATGCA.A.TA G.AT.GC.T. ATGGT.ATTC TAAGGTTA.A TGG.TTGAAC ATATG.AG.T G.CACCT.C. ACGGCCA.TC CAAGGTCA.T TGGGTGGAGC ATCTT.ACTT G.CC.GT.C. ATGGG.A.TC CCAGGTTA.A TGG.TTGAAC AAGCG.A.TA G.CA....T. ATGGG.A.TC CAAGGTTA.A TGG.TTGAAC AAGCG.A.TA G..A.GC.C. ATGGG.A.TC TAAGGTTA.A TGG.TTGAGC ATGCA.A.TA G..A.GC.C. ATGGG.A.TC CAAGGTTA.A TGGGTTGAGC ATGCACA.TA G..A.GAAC. ATGGT.G.TC TAAGGTTA.A TGGGTGGAAC AT.C..A.TA G.CA.GAAC. ATGGT.G.T. CAAGGTTA.A TGGGTGGAAC AT.CT.A.TA G.AA.GCTT. ATGG..A.TC CAAGGTTA.A TGGGTTGAAC ATGTA.AG.T G.AT.GC... ATGGT.ATTC .AAGGTCGTC TGGGTTGAAC ACATT.A..T G.AA.GC.C. ATGGT.A.TC GAAGGTTA.A TGGGTTGAAC ATGTA.A..T G.CA...GC. ACGGC.G.TC CAAGGTGA.A TGG.T.GAAC AT.CA.AGTA G....GT.T. ATGG..A.TC TAAGGTCA.. TGGGTGGAGC ATGCA.A.TA G.AT.GC... ATGG..ATTC TAAGGTGA.T TGGGTGGAGC ATGTG.A..T G.AT.GC.T. ATGGT.ATTC TAAGGTTA.A TGG.T.GAGC ATATG.AG.T G.AT.GCAG. ATGGT.A.TC CAAGGTGA.A TGGGT.GAGC ATATT.AG.T G.CACCT.C. ACGGTCA.TC CAAGGTCA.C TGGGTGGAGC ACCT..AC.T G.AA.GCAT. .TGGT.A.TC CAAGGTTA.A TGGGTGGAAC ATGTG.A..T GCCT.GC.CC ATGGTCGTTC TAAGGT.A.A TGG.T.GAGC ATGTG.A..T G..A.GT.C. ATGGC.ATTC .AAGGTTA.T TGGGTGGAAC ATGGT.A.TT G..A.GC.C. ATGG..ATTC CAAGGTTA.T TGGGTTGAAC ATATT.A.AC GCCT.GC.CG ATGCTCA.TC CAAGGT.AT. TGG.T.GAGC ATGTG.A..T GCCT.GC.CC ACGGC...TC TAAGGTGA.. TGG.T.GAGC ATGTG....T G.CA.GC.T. ATGG..A.TC .CAAGTCAA. TGGGTGGAGC ATGTT.A..T GCAA.GCGT. ACGG..A.TC TCAGGTCA.A TGGGT.GAGC ACGTA.A..T G.SA.GC.G. ATGGT.A.TC GAAGGTGA.T ..CCTCGAGC ACATG.ATTA G.A.CCT... ATGGG.G.T. TAAGGTGA.. C.GGT.GAAC ATCTT.AGCA

15

SmC4HDZ01 CG.TG..CGC SmC4HDZ03 CG.TC.CCGC PpC4HDZ123 CG..G.CCG.

At4g26920 TCCGCC At4g26290_annotated ......... A_lyrata_XM_002869541 C.....GA.. Eutrema_parvulum_AFAN01000032 C...AT...T Capsella_rubella C...ATATGT A_lyrata_5g07260 C...AT.TGT At5g07260 C...AT.TGT Brassica_napus_DQ166821 A...C.G.AA Brassica_napus_DQ182489 A...C.G.AA Brassica_napus_DQ182490 A....GTAAT At3g61150_HDG1 A....G.AAT A_lyrata_XM_002876550 A....G.AAT Thellungiella_halophila_AK353253 A...C.T.GT Thellungiella_halophila_AK352868 G...C.A.AA Brassica_rapa_EU826522 A...CTG... At4g25530_FWA A...G.AAAA Turritis_glabra_AB367817 A...G.AAAG Pt_XM_002320719 G...G.AAGT Pt_XM_002301295 G...A.AAGT Pt_XM_002318195 G...CTG.GA Pt_XM_002322424 G...CTG.GA Pt_XM_002320422 G...CAA.AA Pt_XM_002305566 A...G.T.AA Pt_XM_002302844 G...CAA.AA At5g52170_HDG7 G...CAT..T At4g00730_ANL2 G...C.T.GT At1g05230_HDG2 G...CAA.AA At4g04890_PDF2 G...C.T.AA At4g21750_ML1 G...C.T.AG At1g79840_GL2 A...CTT... At2g32370_HDG3 G...AA.CAT At1g34650_HDG10 G...ATA.AA At1g17920_HDG12 G...C.A.AA At1g73360_HDG11 A...C.G.AA At3g03260_HDG8 G...GGA.AA

G.CA.GC... ACGG..A... CAAAGTGA.C A..CTT.AGC ATATG.AGTA G.CT.GCA.. ATGG..A.TC .AAGGTGA.. .CAGTG.AAC ACATA.A..C G.CACGC.G. ACGGG.A.GC GAGGGTGA.. TGCGTGGAGC ACGCG.AGTA 421 431 441 451 461 471 | | | | | | ---GAGGCCT TACCTCTCTT CTGGTTTAGG ATTTGGTGCC AAGAAATGGC T--...------- ---------- ---------- ---------- ---------- ....GATTTG GTG.AAAGAA A...C.CGTC GC.CTACAA. G.T.CTGCT. ....GATTTG GTG...A.AG A...C.CTCT GC.CTCCAAA GGC.TT.CTT ....C.TTTG GTG.ATATAG A...C..GCT GC.CTTCAAA .GC.TTGCTA .....ATT.G AGG.A.ATAG A...C.CGCT GC.CTCCAAA GGC.TTGCTA .....ATTTG GTG.A.AGAG A...C.C.CT GC.CTCCAAA .GC.TTACTA ....CTTTTG G.G..GAAAG A...G...CT .C.CTACAGA G.ATGTGT.A ....CTTTTG G.G..GAAAG A...G...CT .C.CTACAGA G.ATGTGT.A ....C.TTTG GTG.A.AGCG T...A.GGCT GCG.TACAA. GCC..TGT.A ....C.TTTG GTG.G.ATCG A...A.GGCT GCA.TACAA. GTC..TGC.A ....C.TTTG GTG.G.ATCG A...A.GGCT GCG.TACAA. GTC..TGC.A ....GTTTTG GGT.G.AAAG A...G.CGCT .CACTTCAGA G.C.GTGC.A ....CTTTTG G.G.GAAACG T...G.GTCT .CACTCGAA. GCC.GTGC.A ....CCTTTG GTG..AAACA T...G.CGCC .CCCTTCAG. TCC.CTGC.A ....G.CTAG GTG.AAAGAG A...C.CGC. .CGCT.CAGA G.C.CTGC.A ....G.CTGG GCG.GAAGAG A.....C.A. .CGCT.CAGA G.T.CTGC.G ....GCTTTG GGG.C.AACG A...A.AGCC .C.CTCCAA. GCC...GC.A ....GCTT.G GTG.C.AACG A...A.AGCT .CCCTTCAA. GTC..TGC.A ....GCTTTG GTG.A.AGAG G...C..GCT GCCCTTCAA. GGT.TTAC.A ....GCTTTG GTG.A.AGAG G...A..GCC GCCCTTCAA. GGC.CTAC.A ....CATT.G GGG.AAAACG A...G.GGCA .CCCTAGAT. GGC..TGC.A ....CTTTTG G.G.AAAACG T...G.GG.A .CACTTGAT. G.C..TGC.A ....CATT.G GGG.AAAACG T...G.CGCA .CCCTAAA.. GGC..TGC.A ....G.TTAG GCG.AAC.AA A...C..GC. .C.CT.CAGA GGC..TGC.A ....GTTTTG GCT...AAAG A...C.CGCT .CACTTCAGA G.C.GTGC.A ....CCTTTG GTG..AAACG ....G.AGCC ...CTTGA.. GCC..TGC.A ....CTTT.G GTG.GAAACG T...G.GGCT .CACTCGAA. G.C..TGC.A ....CTTT.G GTG.AAAACG T...G.GGCT .CACTTGA.. GCC..TGT.A ....CCTTTG GGG...GACA ....G.CGCC .CCCTTCAG. TCC.TTGC.A ....CTTTTG CTG..AA.CG ....G..G.T .CA.T.GTA. GCC.GTGT.A ....GCTATG G.G...GACG T...ACCGCT .C.CTTCAGA GGATGTGT.A ....CTTTTG G.G..GAACG T...A..GCT .C.CTCCAAA G.ATGTGT.A ....CTTTTG GGG..GATCG T...G...CC .C.CTCCAGA G.ATGTGT.A ....GCTATG G.G..AAGCG T...A.CGTC .CACTTGAGA G.ATGTGT.A

16

At5g17320_HDG9 G...ATACGA At5g46880_HDG5 G...G.A.AA At4g17710_HDG4 A...GTA.AA MpC4HDZ1 G...G..AAT PcC4HDZ1 G...GAAAGT SmC4HDZ01 G...G.YA.. SmC4HDZ03 G...G.AAAT PpC4HDZ123 G...G.GAG.

At4g26920 CTCCGGTACG At4g26290_annotated .......... A_lyrata_XM_002869541 .......... Eutrema_parvulum_AFAN01000032 TA....AGTA Capsella_rubella TA.G..AGTT A_lyrata_5g07260 .A.T..AGTT At5g07260 T..G..AGTT Brassica_napus_DQ166821 TATAA..GTC Brassica_napus_DQ182489 T.TAA..GTC Brassica_napus_DQ182490 .GGG..CGTT At3g61150_HDG1 TGGA..GGTT A_lyrata_XM_002876550 TGGA..CGTT Thellungiella_halophila_AK353253 T..G..C.TA Thellungiella_halophila_AK352868 .AG...CGTA Brassica_rapa_EU826522 .CG..CC.TT At4g25530_FWA .AGA....TT Turritis_glabra_AB367817 .A.A....TC Pt_XM_002320719 TG....GGTT Pt_XM_002301295 TG.T..GGTT Pt_XM_002318195 T..T..GGTT Pt_XM_002322424 T..T..GGTC Pt_XM_002320422 TG.T..AGT. Pt_XM_002305566 .A.T..AGTT Pt_XM_002302844 TG.T..CGT. At5g52170_HDG7 ...G..A.TA At4g00730_ANL2 T..G..A.TC At1g05230_HDG2 TG.A..AGT. At4g04890_PDF2 .AGT...GTT At4g21750_ML1 TA....AGTC At1g79840_GL2 .CG..CC.TT

....GCTA.G G.G...GACG T...ACCGTT .C.CTTGAGA GGACGTGT.A ....CTTTTG G.G..AA.CG T...C.CGAC G.G.T.CAGA G.C..TGC.A ....CCTTTG GCG..GAACG G...C.ATCT G.G.T.AAGA G.C..TGT.A ....CTTT.G G.G.A.AACG A.....AGCC .C..TACAAA GGC..TGT.A ....CTTATG GTG.A.AAAG G...C..GCC .CACT.CAG. GGC.GTGT.A ....CATT.G GTG.CAAGCG A...C.CGCA .C..T.CAA. GCC.GTGC.A ....CCTT.G GCG.AAAGAG A...C.CGCC ..C.T.CAG. GTC.GTGC.A ....CATT.G G.G.G.AGCG G...G.GGC. .CG.T.GAGA GGC..TGC.A 481 491 501 511 521 531 | | | | | | GGTAAGAACA ACTTGCTCCA AGCATCAAAA CGTTTGGTTC ACATATTTTG .......... .......... .......... .......... .......... ...CGTG... ........G. ..T.....GG .A.A...... ..T....... C...CC...T TGC.TAA..T .T..A.GCGT ---A.....A ..T....C.. CA..C....T TGC.TAA..T .T.TA..TCT ---A.....T ..GC...... CT..CC...T TGC.TAA..T .T.T...TTT ---A.....A ..GC...... C.C......T TGC.TAA..T .T.T...TTT ---A.....A ..G....... ..G....GA. G.A..A.GAG .CTTG.TC.T A.AA.....A G..AC.AC.. ..G....GA. G.A..A.GAG .CTTG.TC.T A.AA.....A G..AC.AC.. ..G.G...G. G.A..T.GA. GCT.G....G A.GA..AC.G .T.AT..C.. ..G.GA..A. G.A....AA. GCT.G.G..G A.GA..AC.G .T.AT..C.. ..G.GA..A. G.A....AA. GCT.G.G..G A.GA..AC.G .T.AT..C.. ...CG...A. G.A.....A. GTT.G.TC.. ...A..ACGT TT.AC..C.. ..G.G...G. G.A..T.GA. GCT.G.GG.G A.GA....GA TG.GC..... ..G.GA..G. G.G.A...A. GATGG..C.G A.GA..ACA. .A.GT..C.A ...GCA.CTG .AA.AG.GA. GCT.G..C.G ..AA..AC.. T..AC.AC.A ...GCA.CTG .A..AG.GA. GCT.G.GC.G ..AA..AC.. T..A..AC.A ...CGCCGA. G.A....GA. GCTGG.GC.. A.AA..ACGG CT.AC..C.. ...CGCCGA. G.A..T.GA. GCTGG.GC.. A.AA..ACCG .T.AC..C.. ........A. G.A..T.AA. GTT.G..CG. ..AA....GG .T.AC..C.. ........A. G.A..T.AA. GTTGG..CG. ..AA....GG .T.GC..C.. ..G.GA..G. GTA..A.GA. GCTGG..G.G A.AA....GA TA.GC..C.. ..G.G...A. GTA....GA. GCTGG.CG.G A.AA....GA TG.GT..... ..A.GA..G. GTA..A.GA. GCTGG..G.G A.AA....GA TA.GC..C.. ..A.CA..G. GTA.A..AA. GCT.G.GC.G ...A..AAG. T..AC....A ...CGA..A. G.A....GA. GTT.G.TC.. ..CA..ACGT T..AC..C.. ..G.G..GG. GTA....GA. .TTGG..G.G ..GA.....A TA.GC..... ..A.G...G. GTA..T.GA. GCT.G.TG.G A.AA.....A TG.GT..C.. ..G.GA..G. G.A....GA. .CT.G.GG.G A.AA....GA TG.GC..C.. ..G.GA..G. GTG....GA. GATGG.TC.G A.AA..ACA. .A.GC..C.A

17

At2g32370_HDG3 TG.T..A.T. At1g34650_HDG10 A.GGATA.T. At1g17920_HDG12 ..TGA..GTT At1g73360_HDG11 T.TAA..GTC At3g03260_HDG8 .GAAATGCT. At5g17320_HDG9 A.GGATG.T. At5g46880_HDG5 TGTGAA..TA At4g17710_HDG4 TCTGAAC.TA MpC4HDZ1 .G....AGTT PcC4HDZ1 TG.G..AGT. SmC4HDZ01 .G.G...GT. SmC4HDZ03 .G.S...GTT PpC4HDZ123 .G.G..GGT.

At4g26920 TATTAGCGCT At4g26290_annotated .......... A_lyrata_XM_002869541 .......... Eutrema_parvulum_AFAN01000032 G..C...T.G Capsella_rubella C.....T... A_lyrata_5g07260 GG...C.... At5g07260 GG........ Brassica_napus_DQ166821 CT.AT.T..A Brassica_napus_DQ182489 CT.AT....A Brassica_napus_DQ182490 .T.A.A.... At3g61150_HDG1 .C.A.AT..G A_lyrata_XM_002876550 .C.C.A.... Thellungiella_halophila_AK353253 .C.G..T... Thellungiella_halophila_AK352868 .C.......C Brassica_rapa_EU826522 AG..T..... At4g25530_FWA GC.G..T..A Turritis_glabra_AB367817 GC.G..T..A Pt_XM_002320719 .T.G...... Pt_XM_002301295 .T.G...... Pt_XM_002318195 GT.G..T... Pt_XM_002322424 GC.G..T.T. Pt_XM_002320422 GC....T... Pt_XM_002305566 GC....T... Pt_XM_002302844 GC....T..A At5g52170_HDG7 CC.G......

..A....TG. G.A....GA. GAT.G.TG.G ..GA.T.CGA GA.CC..C.T ..A.GA.GG. G.G.CA.GAG TTTGGG.G.. A.AA..T.GA .G.AC...GC ..G....GA. GTA.AA.GAG .CTTG.TC.C A.AA....AA G..AC..C.. ..G....GA. G.A..A.GAG .CTTG.TC.G A.GA..A.CA G..AC.AC.. .CA.GA.GG. GTG.AA.GA. .TT.GGTG.G A.AA....GA .G.AT..CAA ..C.GA..T. G.G.AA.G.. TTTGGG.G.. A.AA..T.GA GG.AC...GC .CA.G..GG. ..A.AA.GAG GTT....C.G A.AC.T..GA .A.C...C.. .CG.G...G. .....A.GA. GCTG...C.G A.AA....GA .A.CT..C.. ....GACGA. GTA.....A. GCTTG.TC.. A.AA..ACAA ...AC..C.. ..A.GACGA. G.A....G.. GCTTG..C.G ..AA..ACCA .T.AC..... ..ACGTCG.. G.A....GA. GCTGG.GC.G ..AA..ACCA .T.AC..C.. ...CGRCGG. G.A....GTT CCTYG.CC.G ..CA..ACGA GT.AC.AC.. ..GCG.CGG. G.A....GA. GCTGG.GC.G A.GA..ACGA GT.AC..C.. 541 551 561 571 581 591 | | | | | | TGTGGGGTAA TCGGGAACCG AGGGAGGTGG CTC---CCTT ATGGT---AT .......... .......... .......... .......... .......... .......... .....T...A .T.....C.. .......... G......... .AC..T..T. G...AG.AGA GT...A.C.. .......... ...T....T. ..C...AC.C CAA.AC.AAA GT...A.C.C .......... GC.T....T. ..C...A... C...AC.GA. TT...ATC.C T......... GC.T....T. ..C...A... CT..AC.AA. GT...ATC.. T.G....... GC.T....T. A.CA.ATCT. A.AAC.CT.A CTCA.CAGTA G.T.....AA .C..CACT.. A.CA.ATCT. A.AAC.C..A CTCA.CTGTT G.T.....AA .C..CACT.. ....CATCGT CGCT.C.GAA .T....CAAA ........GC C...AATCG. ....CTTCGT CGTTAC.GAA .T....CAAA ........GC C...GATA.. ....CTTCTT CGTTAC.AAA .T....CAAA ..T.....GC C...GATC.. .C..C.CCGT CA.TCC..A. TT....CAA. ......GA.. CG...ATT.. G.C.C.TCG. CT.CAC..GC GT...CTACA A.G......C CG...ATCG. GC..CTTCT. G.TACC.T.A GT...ACAA. A........A CC..AGTC.. ACGA.TCCTT CG.T.G..AA GT..CA.AAA A.T....TAA C...GATTG. ACC..TTCTT CGAC.G..AA GT..GA.ATA A.T...TA.A C...GATTG. ....CCTC.. CA.T.C..AA .T...ACAA. ..A.....AC CA..CATTG. ....CCTCC. CA.T.C..AA .T...ACAA. ..G.....AC CA..CATAG. ....CTTCTT CTTT.C..AA TT..G.CAAC ..T.....AG .....ATTG. ....CTTCT. CTTT.C.TAA CT..G.CAAC ..T.....AG .....ATTG. A.C.CCTCT. CT.CTC.TAC TT...CTACA ..A......C C...GATTG. G...CTTCT. CT.CAC.TGC .T...C.ACA T.G......C C....ATTG. A.C.CCTCT. C..CCC..AC .T...CTACA ..A......C C...GATTG. AC..CTTCGT G.ATTC..AA .T..GA.AA. ..T.....G. C....ATTG.

18

At4g00730_ANL2 .T.G..T... At1g05230_HDG2 .C....T..A At4g04890_PDF2 .C....T..A At4g21750_ML1 .C.C.....C At1g79840_GL2 .G.CT..... At2g32370_HDG3 ....T.T..A At1g34650_HDG10 CG.CT....C At1g17920_HDG12 .C.AT.T..A At1g73360_HDG11 CC.AT.T..A At3g03260_HDG8 .G.C..T... At5g17320_HDG9 .G.CT.T... At5g46880_HDG5 .C..T.T... At4g17710_HDG4 .CC.T.T..C MpC4HDZ1 .C.C...... PcC4HDZ1 GC.G..T..A SmC4HDZ01 .C.A.....C SmC4HDZ03 .C.C...... PpC4HDZ123 GC.G......

At4g26920 GTGTTT At4g26290_annotated .......... A_lyrata_XM_002869541 .......... Eutrema_parvulum_AFAN01000032 ....TG..G. Capsella_rubella ....TG..A. A_lyrata_5g07260 ....TG..A. At5g07260 ....TG..A. Brassica_napus_DQ166821 ....A....G Brassica_napus_DQ182489 G...A....G Brassica_napus_DQ182490 A...A..C.G At3g61150_HDG1 T...A..C.G A_lyrata_XM_002876550 T...A..C.G Thellungiella_halophila_AK353253 G...A..C.A Thellungiella_halophila_AK352868 ....A....G Brassica_rapa_EU826522 T...ACA.GG At4g25530_FWA T...A..C.G Turritis_glabra_AB367817 T...A..C.A Pt_XM_002320719 ....A....G Pt_XM_002301295 A...A..C.A Pt_XM_002318195 G...A....A

.CG.C.CCGT CG.TCC..AA TT....CAA. .........C CG..GATTG. A...CTTC.. C..CTC..AC GT...CTACA T.G......C C....ATTG. G.C.C.TCG. CT.CAC..GC TT...CAACA A.G......C CG...ATTG. G.C.C.TC.. CT.C.C.TGC CT...CTACA T.G......C CA..CATCG. GC..CATC.. G.TACC.T.A .T...CCAAA A.......CA CG..AGTC.. ACCAAT.CG. CG...TCTAC .ATATTT.CT GGT......C CC...GTC.. AAAAT.TCTG A.AAACT.GA TCTCCC.CAA .AG.....GC C....CTC.. G.CACATCT. A.AAC.CT.. CTCA.C.GTT G.......AA ....AATGG. A.CA.ATCC. A.AAC.CA.. CTCA.CCGTT G.T.....AA .C..CACAG. ACCAT.TCCG GAAAA.TTGA CTTTCC.CA. .AG.....AC CC...ATAG. AAAAT...T. A.AAACT.GA CTTCTC.CCA .AG.....GC CC...CTC.. A.CACT.C.T AT..AC.GTC TT...CCGCT T.G.....AA CA..AGTTG. ATCAATTC.C A...AC.AGC .CCA.CAAA. GA....TGCG G....CTTG. A.C.CTTCT. CT.TCC..AC GT...CTACA T.G.....GC ....CATCG. A...C.TC.T C.ACAC.TGC .T...CAACA T.G......C ....AACTG. A.C.CATCG. CG.T.C..AC CT...C.ACT ........GC CA..AATCGC A...CCTCG. .T.TAC..AC .T...CCAC. .........C CC..CATTG. A.C.C.TCG. CG.C.C..AC GT...C.AC. ..G.....GC .G..GATCG. 601 611 621 631 641 651 | | | | | | AGCGGTTTGA CTAAAATACA TGCTAAACCG GAAATACTAT TTCCTTTAAT C--.......... .......... .......... .......... .......... .........G .......... .......... .....G.... .......C.. .C.....C.G ..G....G.. A........A ....C.A.C. ..GG.G.... .CT...A... .A.G...G.. A....C...A ...G.CA.T. ..GGG..... .CG.....AG ...G...G.. A....GG... ...G.TA... ..GGA..... .C......AG ...G...G.. CA.......A ...G..A.G. ..GGA..... GCAACCACCG TCTGGC.C.C AAACTCT..T C...ATG.T. .CAAC..CC. GCTACCACCT TCTGGC.C.C AAACTCT..T C...GCG.C. ..AA...CC. GCGAC..CCG T.TGG..G.C C.T.TCT..A A.GCGG..G. ..GA...CT. GCGACG.CAG T.TGG..G.C G.TATCT... AGGCGGT.G. ..GAC..CC. GCGACG.C.G T.TGG..G.C GAT.TCT..A CGG.GGT.G. ..GAC..CC. GC.ACG.CTG TGTGGC.T.C G...TCG..A C.GCGT..G. ..GAC..CT. GCGACG.C.T T.TGG..C.C ..TGGCT... A..CGTG.T. .CGA...CC. TCTTCC.CCC TCTGGT...C ..T.TCT..T ACTC.T..C. ..GA...CT. TC.AC..CTG T.TGGC.C.C A.TG..C.A. C.T.C...C. ..G....C.. GC.AC..CTG T.TGGT.T.C A.TG..C.A. C...C...C. ..G....CC. GC.ACC.CTG TCTGGC...C ..T.TCT..A C.G.G...C. ..GA...CC. GCGACC.CTG T.TGGC...C ..T.TCT..A C...GG..C. ..GA...CC. GCGAC..CTG T.TGGC.G.C A.T.TC..GA C...GGT... ..GAC..CT.

19

Pt_XM_002322424 G...A....G Pt_XM_002320422 T...A..C.G Pt_XM_002305566 T...A..C.G Pt_XM_002302844 A...A..C.G At5g52170_HDG7 G...A..C.C At4g00730_ANL2 G...A..C.A At1g05230_HDG2 ....A..C.G At4g04890_PDF2 ....A....G At4g21750_ML1 ....A....G At1g79840_GL2 T...A.A.GG At2g32370_HDG3 ....A..A.G At1g34650_HDG10 T...T..A.G At1g17920_HDG12 ....A....G At1g73360_HDG11 ....A..C.. At3g03260_HDG8 T...A....G At5g17320_HDG9 T...C.AA.G At5g46880_HDG5 .....A.C.G At4g17710_HDG4 .....A...G MpC4HDZ1 G...A..C.C PcC4HDZ1 ....A..C.C SmC4HDZ01 ....A....G SmC4HDZ03 ....A..C.C PpC4HDZ123 G...A..C.A

At4g26920 CTCCATACTT At4g26290_annotated .......... A_lyrata_XM_002869541 ...G..T... Eutrema_parvulum_AFAN01000032 ...TT.CG.A Capsella_rubella .G.TTGCA.A A_lyrata_5g07260 TG.TTGCT.A At5g07260 .A.TTGCG.A Brassica_napus_DQ166821 .AGTCC.G.G Brassica_napus_DQ182489 .AGTCC.G.G Brassica_napus_DQ182490 .G.GCCTG.. At3g61150_HDG1 .G.GCCTG.. A_lyrata_XM_002876550 .G.GCCTG.G Thellungiella_halophila_AK353253 .G.GCCTG.. Thellungiella_halophila_AK352868 .G.TCCGG.G Brassica_rapa_EU826522 .G.TCCCG.A

TCAAC..CTG T.TGGC.G.C G.T.TC..AA C...GGT... ..GAC..TT. GCAAC..CCT TCTGGC.T.C A.T.CCT..A A...GGG.T. ..GA...TC. GC.AC..CCT TCTGG..T.C A.T.C..T.C A.G.GGA.G. ..GA...CC. GCAAC..CCT TCTGGC.T.C A.T.CC...A A.G.GGG.T. ..GA...TC. GCGACA.CTC TGTGGC...C A.TG.CT.A. C.G.G...C. ..GAG..TC. GC.AC..CTG T.TGGC...C ....GC...A C.GCGT..G. A.GA...CT. GC.AC..CTT T.TGG..C.C ..T.CCT..A A.GCG.G.C. ..GAC..CC. GCTAC..CAT TCTGG..C.C A.T.GCT..C A..CGTG.T. ..GA...CC. GCTAC..CTT TCTGG..C.C ..TAGCT..A A..CG.G.G. .CGA...TC. TCTTC..C.C TGTGGT...C ..T.TCT..A .CTC.T..C. .CGA...CT. GC.AC..CCT T.TGGC.T.C ....CCT..T A.C.CTG.C. ..GAC..CC. G.TTCA.CTT TATCTC.C.C .CTCCCT..T CTCCA.G.C. ACGA...CC. GC.ACAAGTT TCTGGC.C.C .AT.TCT..A C...ACG.C. .CAA...CC. GC.ACCACTT TCTGGC.T.C CAA.TCT..T C...ATG.C. .CAA...CC. TCGTC..CTC TCGC...C.C .CTC.CT..T TTGCA.G.C. .CG....CC. G.TTCA.CTT TATCCC.C.C .CTCCCT..T .TCCA.G.G. ACGA...CC. GT.TCAACC. ..TGGC.T.C ATT.TCT.AC C.TCA.G.C. ..GA.C.T.. GTGTCAG.C. .GCTTC.C.C .TA.TCT.AT C..CA.G.C. ..GA.C.TC. GCTAC..C.C TATGGC...C C.TGTCC..T C.GCG.G.T. ..GAG..TC. GCTACA.C.. TCTGGC...C ..TATTG..A C.GCGTG.G. .CAA..CTC. GCTAC..CCC TCTGG..G.C ..TG.GC..C C.G.G.G.T. ..GAG..CC. GCGACG.CCC TGTGGG.G.C G.TA..CT.. C.GCGC..G. .CGAG..CC. GCGACG.C.. TGTGGC.GGC G.TGTCGG.A .CGCGGG.G. ..GAG..TT. 661 671 681 691 701 711 | | | | | | TTGCTTCAAG AAGCATACAA TGAGGCATCA TCGTCG---A TGGTGATCCA .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... G..A...... ..A.....T. .......... G.AGG..... ..A.A..... A..A...... ..A.....TT G..C...... GG.G.A.... ..A.A...T. C..A...... ..A.....T. C..T.A.A.. GGAG.A.... ..A.A..... C..A...... ..A.....T. C..T.A.... GGAG.A.... ..A.A..... A.T..A.... ..A...CA.T A..CT..... GGAG.A...C .A...G.GT. A.T..A.... ..A...CA.T A..CT..... GGAG.A...C .A...G.GT. A.A..A..G. .GA..AG..T A..C..TGT. GGAG.C...G .T...G.GT. A.AT.A.... .GA..AG..T A..C..TG.. GGAG.....G .T..CG.GT. A.AT.A.... .GA..AG..T A..C..TG.. GGAG.....C .T...G..T. A.T.....G. .GA...G..T A..C.....G GGAG.....C .T..AG..T. A.C..A.... .GAGT.GT.C G..C..T..T GGA......T AT.....AT. G....G.... .CAGC.G..C AA.CT...AT GA.......G ....TG.AT.

20

At4g25530_FWA TG.ACC.G.G Turritis_glabra_AB367817 .G.ACCTG.G Pt_XM_002320719 .G.TCCCG.. Pt_XM_002301295 .G.TCCCG.. Pt_XM_002318195 TG.ACC.G.C Pt_XM_002322424 .G.ACC.G.C Pt_XM_002320422 TG.ACCGG.. Pt_XM_002305566 TG.ACC.G.G Pt_XM_002302844 TG.TCCTG.. At5g52170_HDG7 .G.GCCTG.. At4g00730_ANL2 .G.GCCTG.C At1g05230_HDG2 TG.TCC.G.C At4g04890_PDF2 .G.GCC.G.G At4g21750_ML1 .G.ACC.G.. At1g79840_GL2 .G.TCCCG.A At2g32370_HDG3 TG.GCCTG.. At1g34650_HDG10 .G.TCC.A.G At1g17920_HDG12 .A.TCC.G.G At1g73360_HDG11 .AGTCC.G.G At3g03260_HDG8 .G.TCCTA.G At5g17320_HDG9 .G.TCC.A.G At5g46880_HDG5 .....CCG.C At4g17710_HDG4 ...T.CCG.C MpC4HDZ1 .G.ACCTG.. PcC4HDZ1 TG.ACCTG.G SmC4HDZ01 .G.TCCGG.A SmC4HDZ03 .G.TCCCG.A PpC4HDZ123 .G.GCCGGGG

At4g26920 CTTTACCTAT At4g26290_annotated .......... A_lyrata_XM_002869541 .A........ Eutrema_parvulum_AFAN01000032 .CGA.TT.T. Capsella_rubella .AAACTT.TG A_lyrata_5g07260 .GAACTT.TA At5g07260 .GAACTG.TA Brassica_napus_DQ166821 TCCCCT..TG Brassica_napus_DQ182489 TCCCCT..TG Brassica_napus_DQ182490 GGCACTACT.

G.T..G.... .GATT.GG.. ...T...... GGTG.A.... .....G.GT. G.T..G.... .G.TT.GG.. ...T.....G GGTG.A...G .....G.GT. A.A..G.... .GA.T.G..T A..T...G.. GG...A...C .T..AG.GT. A.A..G.... .GA...G..T A..T...G.. GGC..T...C .T..AG.GT. A.AT.A.... ..A.CAGG.. ...T.TC... GGT......C AA..AG.GT. A.AT.G.... ..A.C.GG.. ...T.TC..G GGT......T .A..AG.GT. A.TT.A.... .GAGT.G.GC ...TCA.A.. G.C..T...T .C..A...T. A.AT.G.... .GAGC.G..C ...TT.TA.. GGC..A...T AT..T..TT. A.TT.G.... .GAGT.G..C ...TCA.A.G G.C..T...T .C..A...T. A.CT.G.... .GA...GG.. ...C.TT..T GGTG.A...C .....G.TT. A.T.....G. .GA...G..T ...C.....T GGAG.....C .T..AG.AT. A.C..A.... .GAGC.G..C ...TC.TA.. G.T..C...T .T......T. A.T..A.... .GAGC.GT.C A..T...... GGA......T AT.....TT. A.CT.A.... ..AGT.GT.C G..C...... GG...C...T AT.....AT. G....G.... .CAGCAG..C .A.CT.G.AT GA.......G .....G.AT. A.AG...... .GA.T.CT.C ...CC..A.. G.T..A...T .T...C.TT. A.A..A.... .T.GC.T..T A..T....T. GGAGGT.... .....G.AT. A.T..A.... ..AGC.G..T A..CT..... AGTG.AGCGC .T......T. A.T..G.... ..AGC.CA.C A..CT..... GGAG.A...T .T...G..T. A....G.... .CTGC....T G..C..C.TG GGAGGC.... ..A.CG.GT. A.A..C.... .TAGC.T... A..T....TG GGAGGA.... .....GC.T. A......... ..AGC.G..T ...TAAC..C GGTAGC...T .AA.CG..T. A......... ..A...G..C C..TAAC..C GG.AGT...C .CC..G.GT. A.Y..C..G. .GAGT.G..C C..C.TG..T GGC..T...C .A......T. A.T..A..G. .GAGT.G..C A..T.TG..T GG...C...C .C..T...T. A.C..C.... .GAGCAG..C C..C.AG..C GGC..T...C .CA.CG..T. A.CT.G..G. .GAGC.GT.C C..T.....C GGC......C .CA.AG.TT. A.C..G..G. .GAGC.G..C G..C.TG..G GG...C...C ........T. 721 731 741 751 761 771 | | | | | | GACGTGTCAT CTCTTGCGAA GATTATAAAC GGT---GACC GTTCCTACTC .......... .......... .......... .......... .......... ....A..... .....CG... A......... ........TA ....G.T... C.A.C..... .....CA.GC TG.A..C... ........TT TG..TCG.CT ..A.CAC... A.T..A..GC TG.A...... .........T TG...G..AT ..A.C.C... A.T....AGC TG.A...... .A......TT TG..TGG.AT ..A.C.C... A.T....AGC TGCA...... ........TT TG..TGG.GT ..TT.AC..G .A..GAAC.T CGCA..G.G. ........TA C...T...AT ..TT.AC..G .AT.GAAC.T CGCA..G.G. ........TA C...T...AT ..TA.AC..G .GA.GCAAGC TG.G..G... ........TT C.G.T...GT

21

At3g61150_HDG1 GGCACT.CT. A_lyrata_XM_002876550 GGCACTACT. Thellungiella_halophila_AK353253 GGC.CT.CT. Thellungiella_halophila_AK352868 GGCGTTGCTG Brassica_rapa_EU826522 .CAA.T.CTC At4g25530_FWA GAAGTTTCT. Turritis_glabra_AB367817 GCAGCTTCT. Pt_XM_002320719 AGCGCTTCT. Pt_XM_002301295 GGCGCTTCT. Pt_XM_002318195 GGC.CT.CTG Pt_XM_002322424 GGC.CT..TG Pt_XM_002320422 AGCACTTCT. Pt_XM_002305566 TGCCCTGCT. Pt_XM_002302844 GGCACTTCT. At5g52170_HDG7 GGCGCTTCT. At4g00730_ANL2 GGC.CT.CT. At1g05230_HDG2 GGC.CTGCT. At4g04890_PDF2 GGCGTTG.TG At4g21750_ML1 .GC.TTG.TA At1g79840_GL2 .CAA.T.CTC At2g32370_HDG3 GG.G.T.CTG At1g34650_HDG10 TCCA.TTCTC At1g17920_HDG12 TCCG.TA.TA At1g73360_HDG11 TCC.CT..TG At3g03260_HDG8 .CCA.T.CT. At5g17320_HDG9 TCCA.T.CTC At5g46880_HDG5 TCCG.TT.TA At4g17710_HDG4 ACCACTT.TG MpC4HDZ1 AGCACTGCTA PcC4HDZ1 TGCGCTGCTA SmC4HDZ01 TGC.CTACT. SmC4HDZ03 GGCGCTG.TA PpC4HDZ123 GGCGCTGCTG

At4g26920 At4g26290_annotated A_lyrata_XM_002869541 Eutrema_parvulum_AFAN01000032 Capsella_rubella A_lyrata_5g07260 At5g07260 Brassica_napus_DQ166821 Brassica_napus_DQ182489 Brassica_napus_DQ182490

..TA.CC..G .AA.GCAAGC TG.G..G... ........TT C.G.T...GT ..TA.CC..G .AA.GCAAGC TG.G..G... ........TT C.G.T...GT ..TA.TC..G .CA.GAATGT TG.G..G... ........AT CA......GT ..TA.AGTGG .GA.GAATGT TG..C.G.G. ..A.....T. C.GAT...GT ..TA.AAACA .GACACA..T .G.G..CGCA ..A.....T. CAAG.A..AT ..AACCAAT. ..A...A.CT .G.C.AG.GA ......A.TT CAGAT.CTGT ..AACCAGT. .AA...A.CC .G.C.AG.GA ......A.TT CAGAT.CTGT ...A.TC.GG .CA.GCACGT .G.G..G..T ........TT CAG.....GT ...ACTC.GG .CA.GCACGT AG.G..G... ........TT CGG.T...GT ...A.ACA.. .GA.GAGTGT AG.G.CGTCT .........T C.A....TGT .....ACA.. .AG.GAGTGT AG.G..G..T .........T C.A....TGT ..TA.TGTTG .AA.GAACGT CG.AC.T..T ..A.....T. CAGA...TGT ...A.T..GG .CA.GAAC.T AG.CC...G. .......... CAGAT..TGT ..TA.TGTTG .AA.GAACGT .G.GC.T... ..C.....T. CAGAT..TGT ..TA.TC... ..A.GAAT.C TG.A..G.G. ........TT CAG.T..TGT ...A.TC.CG .CA.GCATGT .G.G..G... ........TT CA......GT ..TA.TGT.G ..A.GAAC.T AG.GC.T..T ..A.....T. CAGA...TGT ..TA.AGTGG .GA.GAATGT .G..C...G. ........T. C.GAT...GT ..TA.AAT.G ..A.GAACGT TG.CC.G.GT ........T. CGGAT..TGT ..TA.AAACA .GACACA.CT .G.GC.CGCG ..A.....T. CAAG.A..AT ..TA..A... .AA.G.AT.T T.C.C.CC.T ..A.....T. C.GA..TTGT A.TC.AAACA .AGCATACTC CGCC..TTCT ..C.....T. C....AC.AT ..TC.AC..G .GT.GAAC.T AGCA..G.GT .........A CA..T..TAT ..TT.AG..G .AT.GAAC.T CGCA..G.G. ........T. C...T..TAT ...A..G.TA .AA.GCATTT CGC.G.CTC. ..C....... C....C.TAT ..TC.AAACA .CGCCTGCGC TGCC..TTCA ..C.....T. C.A..AC.AT .....TGAC. .AA..CA.C. .GCG..G... ........TT C....A.TAT ...CCCGTCG ..G..CA.CT CGCC..G... ..C.....T. C...TG.AAT ..TA.TC.CG .GA.GA.CTT .G.AT.GC.A ..C.....T. CCG.....GT ...A.CC.TG ..A.GCA..T .G.C..GC.A ..G...A.T. C.GAA.TGGT ...A.AC..G .CA.GAACTT AG.A..GC.A ..C.....T. CAG.....GT ..TA.TC.GG .GA.GAATCT .G.G..GC.A ..A.....T. CAG..AC.GT ...A.CC.GG .GA.GAACCT .G.GC.GC.G ..G....... CGG.G...GT 781 791 801 | | | CCGTGTGGAT TCACCATAAT GCCG .......... .......... .... .......... .......... ...C .....C...A ....G...G. ...A ..T..C.... ....G..G.. ...A .......... ....A..... ...A ..CA...... .......... A..A T.C.CA..C. ..G.A..CTC A..A T.C.CA..C. ..G.A..CTC A..A ..C.CG.... ..G....CC. C..T

22

At3g61150_HDG1 A_lyrata_XM_002876550 Thellungiella_halophila_AK353253 Thellungiella_halophila_AK352868 Brassica_rapa_EU826522 At4g25530_FWA Turritis_glabra_AB367817 Pt_XM_002320719 Pt_XM_002301295 Pt_XM_002318195 Pt_XM_002322424 Pt_XM_002320422 Pt_XM_002305566 Pt_XM_002302844 At5g52170_HDG7 At4g00730_ANL2 At1g05230_HDG2 At4g04890_PDF2 At4g21750_ML1 At1g79840_GL2 At2g32370_HDG3 At1g34650_HDG10 At1g17920_HDG12 At1g73360_HDG11 At3g03260_HDG8 At5g17320_HDG9 At5g46880_HDG5 At4g17710_HDG4 MpC4HDZ1 PcC4HDZ1 SmC4HDZ01 SmC4HDZ03 PpC4HDZ123

..T.CA.... ..C.CA.... ..C.CC..T. ....C...T. ..T....... ..T.CG.... ..T.CG.... ....CA..T. ....CA..G. ..A.CA..G. ..A.CA.... ..T.CA..G. ..A.CA..T. ..A.CA..G. ..A.CG..G. ....CC..T. ..A.CA..T. ....C...T. ..A.CC.... ..C.C..... ..T.C...T. ..T.CC..T. ..C.CA..T. T.C.CA..T. ..A.C...T. ..T.CC..T. ..CCTC.... ..CGT..... ..T.CA..C. ..C.CC..T. ..T.CC.... ....CA.... ....CG..G.

..G....TC. ..G....TC. ..G.T..TC. .TG.T..TT. ..T.A..C.. .TT.G...G. .TT.G...T. ..G.T..CG. .TG.T..CG. .TGTG..TC. .TGTG..TC. .TG.TG.TC. .TG.T...C. .TG.T..TT. ..T.G...T. ..G.TG.TC. .TG.T..TC. .TG.T..TT. ..G.T..TT. ..T.A..C.. .TG.T..TT. ...T...CTC .TG....TTC ....A..CTC ..GT...CTC .T.TG..CTC ..T....TG. ..T.TG.TG. ..G.G..CC. .TG.A..CC. .TG.....C. .TG....CTC ..G.T..CC.

C..T C... C..T A... A..T ...A ...A A..A A... T..A T..A T..C C..T T..T ...T T..T T..T A... .... A..T T..A T.GT A..A A..A AAGC C.GT C..C T..A A..T T..A C... A... ....

Suppl. Figure 2. Alignment of the Brassicaceae specific BZG genes, with other land plant C4HDZ genes. The alignment was constructed using amino acids (a) and the corresponding nucleotide (b) sequences were used to generate the phylograms in Figure 3 and Suppl. Figure 4. Genbank numbers are provided for all genes not previously described in Figure 2, with the exception of A_lyrata_5g07260, which was deduced from scaffold_6 of the A. lyrata genome sequence (http://www.phytozome.net/alyrata)

23

Suppl. Figure 3. Bayesian phylogram of Brassicaceae specific BZG genes with other land plant C4HDZ genes.

24

Tree was constructed using amino acid alignment in Suppl. Figure 3. Numbers at branches indicate posterior probability values. The tree was rooted with the liverwort gene, MpC4HDZ1. Additional C4HDZ genes from Brassicaceae species and Populus were included (Genbank numbers provided) for better resolution of the clades of interest.

25

Suppl. Figure 4. Tree was constructed using amino acid alignment in Suppl. data (Suppl.dataC4spBAYES.txt). Numbers at branches indicate posterior probability values. See Figure 2 for taxon abbreviations. Solanum lycopersicum genes labeled 'ps' are not detectably expressed (http://solgenomics.net/organism/Solanum_lycopersicum/genome). Additional sequences in the S. lycopersicum genome annotated as homologous to C4HDZ genes include: Solyc00g099580.1.1, Solyc04g025490.1.1, Solyc04g025740.1.1, Solyc05g015030.1.1, Solyc07g026560.1.1, Solyc07g026570.1.1, Solyc07g037910.1.1, Solyc08g062520.2.1, Solyc09g057510.1.1 , Solyc09g060130.1.1, Solyc09g060170.1.1, Solyc09g066060.1.1, Solyc10g018020.1.1 , Solyc11g022420.1.1. 26

1 11 21 31 | | | | MpC4HDZ1 ---------- ---------- ---------PcC4HDZ1 .......... .......... .......... PpC4HDZ1 .......... .......... .......... PpC4HDZ2 .......... .......... .......... PpC4HDZ3 .......... .......... .......... PpC4HDZ4 .......... .......... .......... SmC4HDZ1 .......... .......... .......... SmC4HDZ2 .......... .......... .......... SmC4HDZ4 TGGAAAGCTG GGAAGTGATC AGGAAAGTTG SmC4HDZ3 .......... .......... .......... Pn_0003807 .......... .......... .......... Edi_1007259 .......... .......... .......... Edi_1001922 .......... .......... .......... Apl_0023502 .......... .......... .......... CruC4HDZ2 .......... .......... .......... Zv_FD774992_1 .......... .......... .......... GbC3HDZ1 .......... .......... .......... GbC4HDZ3 .......... .......... .......... gb_deg2_UTR .......... .......... .......... PmC4HDZ1 .......... .......... .......... PmC3HDZ4 .......... .......... .......... Vpl_0009732 .......... .......... .......... At4g04890_PDF2 .......... .......... .......... At4g21750_ML1 .......... .......... .......... At1g73360_HDG11 .......... .......... .......... At1g17920_HDG12 .......... .......... .......... At4g17710_HDG4 .......... .......... .......... At5g46880_HDG5 .......... .......... .......... At3g61150_HDG1 .......... .......... .......... At4g00730_ANL2 .......... .......... .......... At2g32370_HDG3 .......... .......... .......... At1g05230_HDG2 .......... .......... .......... At5g52170_HDG7 .......... .......... .......... At4g25530_FWA .......... .......... .......... AT3G03260_HDG8 .......... .......... .......... Coleochaete_orbicularis_2036 .......... .......... ..........

41 | ---------.......... .......... .......... .......... .......... .......... .......... GGTGGGATGG .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... ..........

51 | ---------.......... .......... .......... .......... .......... .......... .......... ATGGATGATG .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... ..........

---------.......... .......... .......... .......... .......... .......... .......... GATGGTTTTT .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... ..........

61 71 81 91 | | | | MpC4HDZ1 ---------- ---------- ---------PcC4HDZ1 .......... .......... .......... PpC4HDZ1 .......... .......... .......... PpC4HDZ2 .......... .......... .......... PpC4HDZ3 .......... .......... .......... PpC4HDZ4 .......... .......... .......... SmC4HDZ1 .......... .......... .......... SmC4HDZ2 .......... .......... .......... SmC4HDZ4 TGGGGGACAG GATCACAGGA CAGGTGGTGG SmC4HDZ3 .......... .......... .......... Pn_0003807 .......... .......... .......... Edi_1007259 .......... .......... .......... Edi_1001922 .......... .......... .......... Apl_0023502 .......... .......... .......... CruC4HDZ2 .......... .......... .......... Zv_FD774992_1 .......... .......... .......... GbC3HDZ1 .......... .......... .......... GbC4HDZ3 .......... .......... .......... gb_deg2_UTR .......... .......... .......... PmC4HDZ1 .......... .......... .......... PmC3HDZ4 .......... .......... .......... Vpl_0009732 .......... .......... .......... At4g04890_PDF2 .......... .......... .......... At4g21750_ML1 .......... .......... .......... At1g73360_HDG11 .......... .......... .......... At1g17920_HDG12 .......... .......... .......... At4g17710_HDG4 .......... .......... .......... At5g46880_HDG5 .......... .......... .......... At3g61150_HDG1 .......... .......... .......... At4g00730_ANL2 .......... .......... .......... At2g32370_HDG3 .......... .......... .......... At1g05230_HDG2 .......... .......... .......... At5g52170_HDG7 .......... .......... .......... At4g25530_FWA .......... .......... .......... AT3G03260_HDG8 .......... .......... .......... Coleochaete_orbicularis_2036 .......... .......... ..........

101 | ---------.......... .......... .......... .......... .......... .......... .......... AACTTCTCGT .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... ..........

111 | ---------.......... .......... .......... .......... .......... .......... .......... TCTTTTCGTA .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... ..........

---------.......... .......... .......... .......... .......... .......... .......... GGGGATGACT .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... ..........

121

131

141

27

151

161

171

| ---------.......... .......... .......... .......... .......... .......... .......... TTCCTCCGCC .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... ..........

| --------GG ........-T ....GGAG.. .......... ........TA .......A.A ........T. ........CT AGCTTTCCA. ........CC ........-. ........-.......AAA .......G.T .......... .......... ........AC .........A ........-.......ACC .......ACC .........A .......CCT ........CC ........-.........A ........AC ........-........CC ........CC ........CC ........CC ........CC ........CC ........-........--

AA-------.G........ TTATGCATGG .GCCACGGAT .GATCGGTCG GGGGGGCGGG GGATTTAGCA CGCTGGATTC GGGCTCCAAG GCCAAGCTAA CG........ --........ GCCCATCCTG .CCCTGTCTA G......... C......... C.GGTTTCTA G.ATCAGGCA CG........ CTACTGGTAA TCCTACGAGT GTCGACGGCC .CAGCTAAAC T.CCGCTAAA G.TAACGGTT CGTCATTAAA C.CCGAAAGA GTCACCACCA T.CGGCTAAG ..CGGCAAAA TGAGGCTAGG T.CGGCTAAG G.CTGCAAAA ..CTGCTGCA --........ --........

181 191 201 211 221 | | | | | MpC4HDZ1 ---------- ---------- ---------- ---------PcC4HDZ1 .......... .......... .......... .......... PpC4HDZ1 TCAGCAGCAG CGCGGTCTTT TCCCTTTTGG AGGTCATGGA PpC4HDZ2 GGTCAGTATC AGCGCGGTCT TTTCCCTCTC GGAGGTGATG PpC4HDZ3 GTATCCATAA GAACAGCGGC TCTCGGAATA AAATTGGCAG PpC4HDZ4 GTGGGTGGGG GATAGCATTT TTGTGTGGAA TTTACGGGCG SmC4HDZ1 GGGGTTTTAG AGGGACAAAC AAATGAAGAA CAGCAACCAA SmC4HDZ2 CATYGTCGCC ATCAACACCC TCATCTCCAA CACGGTCCAG SmC4HDZ4 GTCTGGTTTT TAAAGATGGA TCGAGACAGG GGCGGTATGT SmC4HDZ3 ACTTGGAGTC CGTGGCCACT GTCAACAGCC TCATCTCCAG Pn_0003807 .......... .......... .......... .......... Edi_1007259 .......... .......... .......... .......... Edi_1001922 GCAGTATTTT GGCAGCTGCT GTTCTGCTCT CCTTTTAGAC Apl_0023502 TGGAAAACCA GATTTTGAGC AGCATAGCAG CATCTGTAAC CruC4HDZ2 .......... .......... .......... .......... Zv_FD774992_1 .......... .......... .......... .......... GbC3HDZ1 TTAGCGGGCT AGAATTTCCT GTTACTTTGG ACATGTTTCA GbC4HDZ3 GCCCTTTTAG TATGAGCCTC TTCTTCAATC TAATGTAGCT gb_deg2_UTR .......... .......... .......... .......... PmC4HDZ1 TGGTGTTGGA GTCAACAGCC CGCGTGTGGG AGGATCACTT PmC3HDZ4 TCACTTCTTG AAGGGGCTAA CAGCAGCAGC AACAGCAGCA Vpl_0009732 AATATTGTGG GT........ .......... .......... At4g04890_PDF2 TCTCACTTGG CTCGGTGGCT ACGGTTAATA GTCTGATCAA At4g21750_ML1 CTCTCTCTCG GTTCAGTTGC TACAGTCAAT AGTCTGATCA At1g73360_HDG11 GGGTTTCAGA TAATGGTAAG CAATTTACCG ACGGCAAAAC At1g17920_HDG12 AGCTATTTCC CCATCTGATT TCTGAGAAGG GTCTTCAATG At4g17710_HDG4 CTCGATCTTT CCACCGTCTC CGTCATCAAT CACCGTATTT At5g46880_HDG5 TCAATAACCA TCTCTGCGCC ACCGTTAATC AGATCACTTC At3g61150_HDG1 CTAACGGTGG AATCTGTGGA GACGGTTAAT AATTTGATAT At4g00730_ANL2 CTCACGGTTG AGTCGGTGGA GACTGTCAAC AACCTAATAT At2g32370_HDG3 CTGAGTGTTA GCTCTGTTGC AACTACTGAG AATCTGATTC At1g05230_HDG2 CTGTCTCTTG GCTCTGTTGC AACTGTCAAC AATCTAATAG At5g52170_HDG7 CTCAACGTGG AATCTGTTGA AACTGTTAAC AACCTCATCG At4g25530_FWA CTCATTCAAG GTACTGTCAA AAGTGTCGAG ACACTCATGG AT3G03260_HDG8 .......... .......... .......... .......... Coleochaete_orbicularis_2036 .......... .......... .......... ..........

231 | ---------.......... GGGGTGGAGG TATGGGTGGG CACGTTCTGT TGGGTGTGGA GAACACAAAA AAGGTGAAAG TGCATGCAAG CACCGTGCAA .......... .......... TCTTAGCGGA ACATACTCAT .......... .......... TTACATTTAA GTGCTTCTCA .......... TTAACTGTTG GCAGCAGTGG .......... ATGTACGGTG AATGCACTGT TGAATATGGA AGCAGAGAAG GCGCCACTGT TGCCCTCAGC CGTGCACCGT CATGCACCGT GTACAACCGT CTTGCACTGT CTT....... CTCATACTAT .......... ..........

---------.......... TTCTGCTCGT CGCCTCTGCT TCTCTTTGGG GGTCCTGCCG GGGGGA.... GAGCACTGAC TTGGGGCTTG CGCATCAAGA .......... .......... AAGCAAACTA TATTAGCATC .......... .......... CGCCATGGTG TCTGCAATGG .......... CATTCCAAAT ATTGGATAGC .......... GAGAGGATTA CGAGCGGATT GTCGGTGGAA TGAAAGTTAT CAACCGTATC AACACCATTA TCAGAAGATT TCAGAAGATT GCGGAGGATC TGAGAGAATC .......... TGTCAAGATC .......... ..........

MpC4HDZ1 PcC4HDZ1 PpC4HDZ1 PpC4HDZ2 PpC4HDZ3 PpC4HDZ4 SmC4HDZ1 SmC4HDZ2 SmC4HDZ4 SmC4HDZ3 Pn_0003807 Edi_1007259 Edi_1001922 Apl_0023502 CruC4HDZ2 Zv_FD774992_1 GbC3HDZ1 GbC4HDZ3 gb_deg2_UTR PmC4HDZ1 PmC3HDZ4 Vpl_0009732 At4g04890_PDF2 At4g21750_ML1 At1g73360_HDG11 At1g17920_HDG12 At4g17710_HDG4 At5g46880_HDG5 At3g61150_HDG1 At4g00730_ANL2 At2g32370_HDG3 At1g05230_HDG2 At5g52170_HDG7 At4g25530_FWA AT3G03260_HDG8 Coleochaete_orbicularis_2036

| | | ---------- ---------.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... TCTCAGTTCT TTCTCTCTTT .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... ..........

241 |

251 |

| ---------.......... .......... .......... .......... .......... .......... .......... CGGTACGATC .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... ..........

261 |

28

271 |

281 |

291 |

MpC4HDZ1 PcC4HDZ1 PpC4HDZ1 PpC4HDZ2 PpC4HDZ3 PpC4HDZ4 SmC4HDZ1 SmC4HDZ2 SmC4HDZ4 SmC4HDZ3 Pn_0003807 Edi_1007259 Edi_1001922 Apl_0023502 CruC4HDZ2 Zv_FD774992_1 GbC3HDZ1 GbC4HDZ3 gb_deg2_UTR PmC4HDZ1 PmC3HDZ4 Vpl_0009732 At4g04890_PDF2 At4g21750_ML1 At1g73360_HDG11 At1g17920_HDG12 At4g17710_HDG4 At5g46880_HDG5 At3g61150_HDG1 At4g00730_ANL2 At2g32370_HDG3 At1g05230_HDG2 At5g52170_HDG7 At4g25530_FWA AT3G03260_HDG8 Coleochaete_orbicularis_2036

---------.......... CGAGTCCCGC CTGTGGGTCT TGACATCTGG CCCGGTGGTG .......... TCGAAGATCG TCGGGGAGCT CGAGCATGGA .......... .......... AGTTAACCCA CTTTATTATC .......... .......... GATACAGAGA CTTGTCTTAG .......... CACAGTGGAG GTGGCCTTTC .......... GAGCGTAG.. ..TC..GATG CATCAAATTA GAGTGTAAGA TAA....... CCAATCAAGA .......... .......... .......... .......... .......... .......... .......... ..........

---------.......... TAGAATGAGA GCTGGAAATG ACGTCACGGT GGCGGTGCAG .......... CTTCCCTTCT TTTTTCCTTG TTGGACAGAG .......... .......... ACTAGTTTTA ACTTAGTGCT .......... .......... GATCAACATA GTTTCTCTGG .......... TCAGTGGAGA AAGTCCTTGT .......... .......... TTTTCGGAAG AAACCGCCTT AATCATTATT .......... AGTCAGCTAG .......... .......... .......... .......... .......... .......... .......... ..........

---------.......... GAATTATGGT AGACGACGAT CTCACTTCCG GGAGGAGCAA .......... TTCAGCTTCT ATCTCTCTCC AAAGAGCTAC .......... .......... AAGTTAGCTA ACTCTTTAGA .......... .......... TGACTAGAAT TAAACTTCCA .......... CAGTGAATAA CAGCCATTTA .......... .......... GTAAGAGTGA GAGCGGTCCT CTTCTGGGTA .......... TTGACCGAAT .......... .......... .......... .......... .......... .......... .......... ..........

301 311 321 331 341 | | | | | MpC4HDZ1 ---------- ---------- ---------- ---------PcC4HDZ1 .......... .......... .......... .......... PpC4HDZ1 AAATCAGAGC GGTCTTTCAC TTGTTGCTAT GGGCGAGGTA PpC4HDZ2 GGCGGATCAT CAGTATTTCA CTTGGGTGAT GGACGAAGCA PpC4HDZ3 GTGGTGGGCG CAGCAGGGAA TAGTAGTACA AATGGTCAGC PpC4HDZ4 AGAGGGTGGG CATTGAGGAG CGTTCAAGAT TGTGTGTCGT SmC4HDZ1 ........T. .......... .......... .......... SmC4HDZ2 CTCGTTTCCG TAAGTTTGTT TCTCGTCCTT CCCGTGATCA SmC4HDZ4 TTTTTGGCAA GCTCGGTTTT GGTTATACAC TGGACAGAAG SmC4HDZ3 AGTGGTTTCG ATATAGCTTT CGGATTGGAC AGAGAAAGAG Pn_0003807 .......... .......... .......... .......... Edi_1007259 .......... .......... .......... .......... Edi_1001922 GTTAGTTAGT AGTTGAGGGA AATCAACAAG CCAGCTGTTT Apl_0023502 ACACTCACAC CTGCTGCGTT GTACATGAGT GGGGCATCAA CruC4HDZ2 .......... .......... .......... .......... Zv_FD774992_1 .......... .......... .......... .......... GbC3HDZ1 CATTATAATA GTTTCT.... .......... .......... GbC4HDZ3 ATCCTAACTT ATAAACTGAC GTACAGGAAG ATGGCCAGCC gb_deg2_UTR .......... .......... .......... .......... PmC4HDZ1 CCTCATATCC TGTACTGTTC AGAAAATCAA AGCAGCTCTG PmC3HDZ4 CCGACAGCCA AACTGGGTTT AGATTCTGTT ACAACCATCA Vpl_0009732 .......... .......... .......... .......... At4g04890_PDF2 .......... .......... .......... .......... At4g21750_ML1 AAGGGGAGGT TTAGGGAGTT TATGATAATG TTTGTGGTCT At1g73360_HDG11 ACAGCTTCAA CTACAGCTTG ACACACACCA TTAAAAGCTC At1g17920_HDG12 TCACTCTCTC TCTCCTCTTT CTTGTTCTTT ATTTTTGAGT At4g17710_HDG4 .......... .......... .......... .......... At5g46880_HDG5 TACCCCTAAA CCTCCTTTTG CTTTTTCTAT CTCAGAGTCA At3g61150_HDG1 .......... .......... .......... .......... At4g00730_ANL2 .......... .......... .......... .......... At2g32370_HDG3 .......... .......... .......... .......... At1g05230_HDG2 .......... .......... .......... .......... At5g52170_HDG7 .......... .......... .......... .......... At4g25530_FWA .......... .......... .......... .......... AT3G03260_HDG8 .......... .......... .......... .......... Coleochaete_orbicularis_2036 .......... .......... .......... ..........

351 | ---------.......... GATCTGAGTA AGGCAGATAG GGGTGGGCAT CTCACATTGA .......... CGCCCCTCGT TTTGGGACTC CTAGTACAGG .......... .......... ACCTGTGTTG CACTCCAATG .......... .......... .......... TTGGCTGTCA .......... CACTGTGAAG ACAATCTTAT .......... .......... TTTGTTTTTT TCTTCCCTCT TTCTTTTTTA ........GA GAGCAGTCCC .......... .......... .......... .......... .......... .......... .......... ..........

---------.......... CAAACGGCAG TGCGAACGGC TGAGGAGCCT GCGTGGTCTT .......... TTTGTCTCTT GCTTGAGGGA CATTACAGGC .......... .......... TTGATTCATC GTGTTGATGT .......... .......... .......... CAAAGAGAAT .......... ACGCTTGACA ATGCAATACA .......... .......... TAAGTCTTTT TTCTTCTGGG AGCTTAGTGG GAAGATTTGG ATGTCAGGGG .......... .......... .......... .......... .......... .......... .......... ..........

MpC4HDZ1

---------.......... CCGCCACGCG GGGCCACCAC GAAGAACCTC GGTCACCACA .......... GCCGAGCAAC AGCAAGCCGA CCGCTCTCGG .......... .......... GGGCGGCGGC AACTGCATGA .......... .......... GTATTGTTGA GTGTCATCCA .......... ATTAGTCAAC CCGCTCACAG .......... AAGCTGCTGT AAAGCCGCTC ACGGTTAATA TATTAGTGTT ACTTCCGCGC CGCCGGTGAT AAAGCTGCTC CGTGCCGCTT AAAGATTTGT AAAGCTTCAA .......... AAATCCGCGT .......... ..........

---------.......... CAGTGAGATA GCGTAGTGAG TGCCGGTCCA TGGATTGAGA .......... GAGCTCTAAA GGAGTGGTCA CTGCTGCTAA .......... .......... GTACTCTCCA CAAATATCTA .......... .......... CCTCGGGTTA AAATGTCTCC .......... AGCTTGCCAA GAGGGGGTTC .......... TTCTTGTGAT TGGCCTGCGA ACCTGATAGG TTGTCTTCAA TCGTCAACGA TGCATCTTCC TTCATTGTGA TACAATGCGA TTCCTTGTCA TGTCTTGTGA .......... TAGATTTACA .......... ..........

---------.......... CGAATTGCAG AATCGAATTG CCACATGTGG CGATCTCAGT .......... GTCTCAACCT AACAAGGTTC GTTTCAGTCT .......... .......... TAGTCAGTTT ACCGTAGCTG .......... .......... CTGTTTCTTG CTATGGAGTT .......... CAGCCAAACT ATTACTCACA .......... GTTGGAGGAG CGGAGCCTAA AACAACTGTA TGCCTGCAAA CGTCGGTAAT GCTGACGTGT TAGCACCTGA AAGCTGA... GACTGCTTGA GACTGCTTGA .......... GACGTAA... .......... ..........

361 371 381 391 401 411 | | | | | | ---------- ---------- ---------- ---------- ---------- ----------

29

PcC4HDZ1 PpC4HDZ1 PpC4HDZ2 PpC4HDZ3 PpC4HDZ4 SmC4HDZ1 SmC4HDZ2 SmC4HDZ4 SmC4HDZ3 Pn_0003807 Edi_1007259 Edi_1001922 Apl_0023502 CruC4HDZ2 Zv_FD774992_1 GbC3HDZ1 GbC4HDZ3 gb_deg2_UTR PmC4HDZ1 PmC3HDZ4 Vpl_0009732 At4g04890_PDF2 At4g21750_ML1 At1g73360_HDG11 At1g17920_HDG12 At4g17710_HDG4 At5g46880_HDG5 At3g61150_HDG1 At4g00730_ANL2 At2g32370_HDG3 At1g05230_HDG2 At5g52170_HDG7 At4g25530_FWA AT3G03260_HDG8 Coleochaete_orbicularis_2036

.......... ATTGTGTGTA CAAGACTGTG TGGTCTTGAG CAGGAGCAGG .......... ACCATCCTGC GTGAAGAAGG AGAGGAGGAG .......... .......... TTGTCTCTAT ATTATCAGCT .......... .......... .......... GAGCTTGCAT .......... TCTAGTTTTT GCAGATGTCT .......... AGAAGTAAGA .......... AAGGGAATTA .......... AAAACTTATT ATAGGTAATT .......... TTTCTTTTTC .......... CATCCATTA. .......... .......... .......... ..........

.......... CACTCACATT TGTAGACTCA CGTTCCTCAG ACCAGGAGCA .......... TCGTCCTCTG CAATGATTCT AAAACTCACA .......... .......... TCTAAAACCT GCTACATATA .......... .......... .......... GGTGTTGATT .......... TGTGTCCCTA GAATCGCAGT .......... ATAAGAGAAA .......... ACT.C..CTT .......... ATTTGTTGAG CGTTTATGGC .......... TTAATGAATT .......... .GGAAATAAC .......... .......... .......... ..........

.......... GAGCGTGGTC CATTGAGCGT GAGGGGGAAA GGTGAGGAGT .......... GAGCACTGCG TTTGTTCTGT GGGAGAGCTC .......... .......... CACTGTGAAA TAGTACATGC .......... .......... .......... TCTTTCTCAG .......... TCAGCTTTTT GTAATCATCA .......... AGGAAGGATT .......... TTGCCTTCAA .......... GTAAATTCGA TTTTAGGTTG .......... GTGAATTTTT .......... AAAATGGTGA .......... .......... .......... ..........

421 431 441 451 461 | | | | | MpC4HDZ1 ---------- ---------- ---------- ---------PcC4HDZ1 .......... .......... .......... .......... PpC4HDZ1 TTGAGCGCTC CTCAGGAGGG AGAAGGAGGT GAGGAGCAGG PpC4HDZ2 GGTCTTGAGC GCTCCTCAGG GGGGGAGAAG GAGGTGAGCA PpC4HDZ3 AAAGGGGGAA AAAAAAAAGA AAAAAAGAAG AAGAAGAAAA PpC4HDZ4 AGAGTGGCGG AGGAAGACGA CGGAGACGGA GGCGGACGCT SmC4HDZ1 .......... .......... .......... .......... SmC4HDZ2 TTGATTTCTC CCTCAGCAAC AGGCAGAAAC CTCCATAGCC SmC4HDZ4 CTCGACTACA AACACCAACA CACAAGGCAG TGAGCTCGCA SmC4HDZ3 ACAAACATCA CACGGGAAAG AAAAAGGAC. .......... Pn_0003807 .......... .......... .......... .......... Edi_1007259 .......... .......... .......... .......... Edi_1001922 CGCCGTAGAG GTTGACTTCG ACTCTGGCTT GTTGCAAACC Apl_0023502 ATATCCCAAA GCTTCCTTTA CTCGGTAACC TATGAACATC CruC4HDZ2 .......... .......... .......... .......... Zv_FD774992_1 .......... .......... .......... .......... GbC3HDZ1 .......... .......... .......... .......... GbC4HDZ3 CAGCAAGATT TCCTGGACCT GAAACAAAAC TTTTCTTAGA gb_deg2_UTR .......... .......... .......... .......... PmC4HDZ1 GTTTTTGTTT TTGTTTAGTC CATGCCAATC AATGTTTAAC PmC3HDZ4 GACCGGGGGT GGAGGGGGTG TAATCAACAG CATTCCATTG Vpl_0009732 .......... .......... .......... .......... At4g04890_PDF2 AAGAGCGGTT ATGGTTTCTC ATTTTGTGTT TTTTGAAATC At4g21750_ML1 .......... .......... .......... .......... At1g73360_HDG11 TGCCAGAGAG AGAGAAGAGA AGCTCATTAA CTCCAATTGT At1g17920_HDG12 .......... .......... .......... .......... At4g17710_HDG4 TCCGATACAA AGGGTAAAAA CTGGAATTTA CCACTAAATC At5g46880_HDG5 AGGTTTGGTA AATAACGTAG CATCTAAATG GATAAGCCTT At3g61150_HDG1 .......... .........T CATCGAGATC AGACGATGGA At4g00730_ANL2 ATATAACTTG TTTTTTTGGT CTTGTACTAT GGGAGGGAGG At2g32370_HDG3 ......ACCA CCCCCATTTC GGTAATCTAT TGGAGAGCTT At1g05230_HDG2 TGATGGAAAA AAGAGAGAGA TTTCAGTTTG .........A At5g52170_HDG7 .......... .......... .......... .......... At4g25530_FWA .......... .......... .......... .......... AT3G03260_HDG8 .......... .......... .......... .......... Coleochaete_orbicularis_2036 .......... .......... .......... ..........

471 | ---------.......... TCAACAACAA GCAGGTCGAC CTAACAGGTG ACGGGGGGGA .......... CCGTCCTTTT TTGAGCAGGG .......... .......... .......... CGGTTTGACT ACATACCTCA .......... .......... .......... CAAGCACAGC .......... GTGCAAGGTA ATTCTTCACA .......... CAA....... .......... TCTGCTGCTT .......... TTTT...... TTAGGTTATT GGAG...... GGGT...... CAAAAGGGGA GAAAAGCGGA .......... .......... .......... ..........

---------.......... CTACAACAAC ACCGAATTGG GTGAGCAGCA CGTCAGGGTG .......... TGCCTGTGTT TGGTGATTTT .......... .......... .......... TTTGGCCCTA CATATCTGAA .......... .......... .......... AATAAGCCCT .......... GGGGGGTTTG CCCCGACGGT .......... .......... .......... CTGTTGGTGC .......... .......... TTCGGTTTAT .......... .......... .......... .......... .......... .......... .......... ..........

MpC4HDZ1 PcC4HDZ1

.......... GTGGAATGGG AGGATTCGGT TCAAGACTGT GAGCGTTCCT .......... GAAGAAGTTT AAATCACCAC ATGGTGAAGA .......... .......... CCTCAACTCC CTTCCTCATC .......... .......... .......... GCATATATCA .......... TGAATGGCCC GTGCAGCAGA .......... .......... GAGATTCTCC TGTCTCTTTC GAAAGTG... AGGGTATTTT CAAAATAGTA .......... TCTTGATCAA .......... .......... .......... .......... .......... ..........

.......... TTGATTGAGG GGGTACAATT GTGTTATCTC CAGGAGGGAA .......... GTTGCCTTGA TCTTTTGACA TCGCAGCATT .......... .......... AGCTGAACTA ATGCTTAGGA .......... .......... .......... GTATCTTTTT .......... TCTCTTAATC TCAAATCTGC .......... ........TT AAAG...... TCTTCAATGA .......... GGTAAATAAT AATGGAAAAG .......... GACTGGCG.. .......... .......... .......... .......... .......... ..........

.......... AGCCTTCAAG GAGGAGCCTT ACATTGAGCG GGGCGGCGAT .......... GGGGATATCA ACAGTGGCTA CAAGACAAAC .......... .......... GTCTTCGTCC TAGCGTTGAA .......... .......... .......... GGCTCATGTT .......... TTCATTGCTG ATTGCACTGT .......... TTTTTTCAGA .......... TGGAACAGAG .......... CGCAGCTAAA AGGGAGGTTC .......... CGTGCGTGTG .......... ......AAAC .......... .......... .......... ..........

481 491 501 511 521 531 | | | | | | ---------- ---------- ---------- ---------- ---------- ---------.......... .......... .......... .......... .......... ..........

30

PpC4HDZ1 PpC4HDZ2 PpC4HDZ3 PpC4HDZ4 SmC4HDZ1 SmC4HDZ2 SmC4HDZ4 SmC4HDZ3 Pn_0003807 Edi_1007259 Edi_1001922 Apl_0023502 CruC4HDZ2 Zv_FD774992_1 GbC3HDZ1 GbC4HDZ3 gb_deg2_UTR PmC4HDZ1 PmC3HDZ4 Vpl_0009732 At4g04890_PDF2 At4g21750_ML1 At1g73360_HDG11 At1g17920_HDG12 At4g17710_HDG4 At5g46880_HDG5 At3g61150_HDG1 At4g00730_ANL2 At2g32370_HDG3 At1g05230_HDG2 At5g52170_HDG7 At4g25530_FWA AT3G03260_HDG8 Coleochaete_orbicularis_2036

ACGCCATTTT ACGCCATTTT AAGATGGTGA .......... .......... ATCCAGCAGG TCCGTGCCAA .......... .......... CAAGCTTTTA AACGACTGCA TAGTGGCAAC .......... .......... .......... CAAGAGAGTC .......... CCATTTTGCT GTTGTTTTGT .......... .......... .......... AACTTCTGTA .......... .......... ATGTTTTTTT .......... .......... .......... .......... .......... .......... .......... ..........

AGCGATTCAT AGCGATTCAT CGAGACAAGA .......... .......... G......... CATAGCAGGG .......... .......... GGAGTTCTTT TCAGTTTTGA ATCAAAGCCC .......... .......... .......... ATAAAGCCAT .......... CGCATTGAGC GTAGGTTTCT .......... .......... .......... ARAAAARAAA .......... .......... T..GTTGCTG .......... .......... .......... .......... .......... .......... .......... ..........

TGGCACATA. TGGCACACA. CAAGATCAGA .......... .......... .......... AAAGCAAAGA .......... .......... GTTACTTTGG ATGATGGCGA AACTTCAAAG .......... .......... .......... AATATGCTAC .......... TGGGTGTTGA CTGCCGCGGT .......... .......... .......... AAAACAGMTG .......... .......... CTTTTGTGAG .......... .......... .......... .......... .......... .......... .......... ..........

541 551 561 571 581 | | | | | MpC4HDZ1 ---------- ---------- ---------- ---------PcC4HDZ1 .......... .......... .......... .......... PpC4HDZ1 .......... .......... .......... .......... PpC4HDZ2 .......... .......... .......... .......... PpC4HDZ3 CAAGACAAAC TACGCCATTA AATTACATTA GTGTTGGC.. PpC4HDZ4 .......... .......... .......... .......... SmC4HDZ1 .......... .......... .......... .......... SmC4HDZ2 .......... .......... .......... .......... SmC4HDZ4 AAACCGGAGA AGAATAAGCA CAGGGAAGGC TGGGAACTTG SmC4HDZ3 .......... .......... .......... .......... Pn_0003807 .......... .......... .......... .......... Edi_1007259 TCTTCATGTT TTGTAGTAGA AAGTAAGTTT GTTCTCTACA Edi_1001922 TGATGCGGTC ATGATAAATG ATAATGCTAA CGATGATGGT Apl_0023502 ATAGAACCGA ACACAGAACG ACATA..... .......... CruC4HDZ2 .......... .......... .......... .......... Zv_FD774992_1 .......... .......... .......... .......... GbC3HDZ1 .......... .......... .......... .......... GbC4HDZ3 TGGTATGTGT TTGTG..... .......... .......... gb_deg2_UTR .......... .......... .......... .......... PmC4HDZ1 TTTGTCCCTC ACAGCAACAG CCACAGCCAC TTCCTCCTGC PmC3HDZ4 CAATTTAGTA AGTGTAATGT AGAAGAAGCT ACTAA..... Vpl_0009732 .......... .......... .......... .......... At4g04890_PDF2 .......... .......... .......... .......... At4g21750_ML1 .......... .......... .......... .......... At1g73360_HDG11 GATTCTTCTC TCTTCATTTT TTTTGGGTTT TTTTTTTTTT At1g17920_HDG12 .......... .......... .......... .......... At4g17710_HDG4 .......... .......... .......... .......... At5g46880_HDG5 GGAGGGTCAT AATGGTAATT .......... .......... At3g61150_HDG1 .......... .......... .......... .......... At4g00730_ANL2 .......... .......... .......... .......... At2g32370_HDG3 .......... .......... .......... .......... At1g05230_HDG2 .......... .......... .......... .......... At5g52170_HDG7 .......... .......... .......... .......... At4g25530_FWA .......... .......... .......... .......... AT3G03260_HDG8 .......... .......... .......... .......... Coleochaete_orbicularis_2036 .......... .......... .......... ..........

591 | ---------.......... .......... .......... .......... .......... .......... .......... TGGAGGAGGA .......... .......... AAAAACATCC GTATGTGATG .......... .......... ...ATTTGCT ........AA .......... .......... CTGCGTTTTC .......... .......... .......... .......... TTTTGGTTGG .......... .......... .......... .......... .......... .......... .......... .......... ......CCAT .......... ..........

---------.......... .......... .......... .......... .......... .......... .......... AAAGCAAACT .......... .......... ATAACTTTGT GTTTATGGTG .......... .......... GGTTGCCATT ATCACACAGT .......... .......... ACCTGGACGG .......... .......... .......... .......... TTATGAGTAT .......... .......... .......... .......... .......... .......... .......... .......... CAGCAGCTCC ....GTGAGG ..........

MpC4HDZ1 PcC4HDZ1 PpC4HDZ1

ACCGAAATGG ACGAAACGGA GGATTGCATG TGGGATTTGG .......... TTGTCTTTGA TTCCCTCAGG .......... .......... ......GGGA CTCCCTTTCC GGATAGTGTT .......... .......... .......... ATCAACAGCA .......... GTACTGGGCA AGTTAAATAA .......... .......... .......... TGTTCTGTTC .......... .......... TATGGTAAGG .......... .......... .......... .......... .......... .......... .......... ..........

GCGAATCGAA ACAACAAAAA AAAGTGATCG GGGACGAACA .......... CTTTTCGTTA AAGGATTTGC .......... .......... TGAAAAAAGC GCTAAAGTTC CTTAAGTAGA .......... .......... .......... GATGCTACTT .......... GTTATGAGGG GAAATGATCT .......... .......... .......... TTGGACAAAC .......... .......... GTAAAATGAG .......... .......... .......... .......... .......... .......... .......... ..........

GAAAGACCGG ACACGACTGG AAGAAATACC GGG....... .......... TTTACTTACT TCTTGGGACT .......... .......... CGGGTTGAAC GATTCTCAAC AGTAAACCTT .......... .......... .......... ACTTGAGCTG .......... GATCTCAACA ATGTTCGGTG .......... .......... .......... CTCATTAAAC .......... .......... AAACTAGTTT .......... .......... .......... .......... .......... .......... .......... ..........

601 611 621 631 641 651 | | | | | | ---------- ---------- ---------- ---------- ---------- ---------.......... .......... .......... .......... .......... .......... .......... .........C GGATATACAG ACACGGATTA GAGGTAGGAA A.........

31

PpC4HDZ2 PpC4HDZ3 PpC4HDZ4 SmC4HDZ1 SmC4HDZ2 SmC4HDZ4 SmC4HDZ3 Pn_0003807 Edi_1007259 Edi_1001922 Apl_0023502 CruC4HDZ2 Zv_FD774992_1 GbC3HDZ1 GbC4HDZ3 gb_deg2_UTR PmC4HDZ1 PmC3HDZ4 Vpl_0009732 At4g04890_PDF2 At4g21750_ML1 At1g73360_HDG11 At1g17920_HDG12 At4g17710_HDG4 At5g46880_HDG5 At3g61150_HDG1 At4g00730_ANL2 At2g32370_HDG3 At1g05230_HDG2 At5g52170_HDG7 At4g25530_FWA AT3G03260_HDG8 Coleochaete_orbicularis_2036

ACACGGCATA TAAGTTGCTA AGAATCGGGA .......... .......... AAAGGTGAGA .......... .......... TATATGAAAG GTGTGTATGC .......... .......... CGAAATCCGA TTTCGCTACG .......... .......... CTGAAAATAT .......... .......... .......... .......... TACTAGAAGA .......... .......... .......... .......... .......... .......... .......... .......... TTGGCTTGTG TTGTTTTGTT CCTTGTGACC

GGGGTAGGAA GGGGTAGGAA GGGAGAGGAA .......... ........G. TATGGAGTAT .......... .......... GAGAC..... GTGGAGAGTT .......... .......... GAAGGGCCGT AGAAGTTCTT .........T .......... TCATGAAGGA .......... .......... ......AG.. .......... AACTG..... .......... .......GG. .....AGAG. .......AT. .......GT. .......... .......... .......... CCCTAGTTTG TTGAGATATT GTCGCATTCC

A......... A......... G......... .......... .......... GGGAG..... .......... .......... .......... GGGATTGGAT .......... .......... GGCC...... TGAG...... AGGAA..... .......... TGATGCTTCC ..GGG..... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... GCACC..... CTTTT..... TCTGT..... AGATC.....

661 671 681 691 701 | | | | | MpC4HDZ1 -ATAGTCAAG AACGCACCAC AG-------- --------AT PcC4HDZ1 .GA....... .......... CA........ ........-PpC4HDZ1 .TG....... .G..A..... CA........ ......CCGA PpC4HDZ2 .TG....... .G..A..... CA........ ......CTGA PpC4HDZ3 .TG....... .G..A..... CA........ ......CTGA PpC4HDZ4 .CG....... .G..A..... CA........ ......GTGA SmC4HDZ1 .GG....... .......... .........A CCAGATTTTA SmC4HDZ2 ..A....... ........T. G......... ....AATTTC SmC4HDZ4 .GG....... .........T CA........ .......A.A SmC4HDZ3 ..G....... .G........ .......... ........-A Pn_0003807 .......... .G........ .C........ ........-A Edi_1007259 ..G....... .......... .A........ .....GGCT. Edi_1001922 .G........ .........T .A........ ........-Apl_0023502 ..G.T..... ..T......T .C........ ........-. CruC4HDZ2 .GG....... .......... .T........ ........-A Zv_FD774992_1 ..A....... .......... .C........ ........-A GbC3HDZ1 ..A....... .......... .......... ........-A GbC4HDZ3 .GG....... .......... .T........ ........-A gb_deg2_UTR .........C .G....G.T. .C........ ........-A PmC4HDZ1 TGG....... .......... .T........ ........-PmC3HDZ4 .GA....... .........T ...TCGATGG GTTTCTGTCA Vpl_0009732 .TG....... ....A...G. GT........ ....GGTG.A At4g04890_PDF2 .GA....... ........TT TT........ ........-At4g21750_ML1 .GA....... ........TT TT........ .TGCGTTT.A At1g73360_HDG11 .GA....... .........T GC........ ........-At1g17920_HDG12 .GG....... .........T GC........ ........-A At4g17710_HDG4 .GG....... .........A G......... ........-C At5g46880_HDG5 .GA....... .G.......G GT........ ........-At3g61150_HDG1 .TG....... ....A...T. .T........ ........-At4g00730_ANL2 .CG....... ....A...G. GC........ .GAGTGGGCG At2g32370_HDG3 .GG....... ....A...T. .A........ ........-At1g05230_HDG2 .GG....... .T..A...T. .C........ ........-At5g52170_HDG7 ...TCA.... .T.AG.G.CG CT........ .....CTTCG At4g25530_FWA .CC.AC.... .CTCTT.TGG .T........ ....TAACT. AT3G03260_HDG8 .CGG.GG.TT TTATG...T. GA........ .....TTGG. Coleochaete_orbicularis_2036 .T.G....GC ...CTT.... .......... ...GCAAAGC

711 | GCCTCCTTGC .AGA.....G TGGC...CT. TGGC...CT. TGGC...CA. TGGC...G.T .AA....... CTG-....T. ..TC...... CGA.....C. TG........ T..C.....G ---....C.. CT.C...... TA-C...... TG........ TG........ AA-....... TG...G.... ATA....... AAT....... .AAA....TG ---------T.TCAT..C. ---------CAAATG..TT ATGATT.GTT ------------------AGA...AACT ------------------TATA...GCT TTT.ATGC-TATGATGG-TGTCT..G.A

TA-------C......... .G........ .G........ .G........ .G........ CT........ A-.......C AG........ .CGCC..... CC........ G......... ..A....... A......... C......... CT........ CT........ C......... CC........ C......... .G........ CT.......A --........ GCGTTTGTT. --........ .TTG...... G.A......C --........ --........ CCT......T --....GGAG --........ ..GCTAA... --........ --.....TTT GTCAGTGAGC

721 731 741 751 761 | | | | | AGTGGTAAGG CATTATGCAC CCGAGT---- ----GTGCCT ........A. TG.C..AAGT .GT..A.... ....-----C.....TCAT GG.GTA.TC. TT..ACTTTG TAGCTA---C.....TCAT GG.GTA.TC. TTAGCCTTTG GTGCTA----

771 | GTGCGCCATC ----------------------------

TCTAGCTCAA ----------------------------

MpC4HDZ1 PcC4HDZ1 PpC4HDZ1 PpC4HDZ2

.......... ..AATTTACG .......... .......... .......... TTTAGAGGCT .......... .......... TTAAACTTTA CATGGTGCAT .......... .......... AAGGAGCAGC ATTGTCTTTG .......... .......... GACGGCAATG .......... .......... .......... .......... TTTTTTAGGG .......... .......... .......... .......... .......... .......... .......... .......... AGTTTCCGTT GAATATGCTG ..........

........CG GATTATACAG .......ACA .......... .......... CTCTTTCTAA .......... .......... GATGTTAATT GGTTTTATCT .......... .......... AGGTGCAACT TATGATCAAA .......... .......... GGATTACATG .......... .......... .......... .......... TATTTTTATT .......... .......... .......... .......... .......... .......... .......... .......... GTTCCGCAAG TGGAAGATTC GGTGTATCGG

32

GAATATACAG ACACGACGTT GAGACGAGGT .......... .......... CATATAAGAA .......... .......... TTAATATATA GTGGTACATG .......... .......... CTTCTGCATG TAAAATCAAT .......... .......... ATATATGTAA .......... .......... .......... .......... GGGTTTTTAT .......... .......... .......... .......... .......... .......... .......... .......... AAATTCGACA TCTTTTGTTT GAAGAGGTTC

PpC4HDZ3 PpC4HDZ4 SmC4HDZ1 SmC4HDZ2 SmC4HDZ4 SmC4HDZ3 Pn_0003807 Edi_1007259 Edi_1001922 Apl_0023502 CruC4HDZ2 Zv_FD774992_1 GbC3HDZ1 GbC4HDZ3 gb_deg2_UTR PmC4HDZ1 PmC3HDZ4 Vpl_0009732 At4g04890_PDF2 At4g21750_ML1 At1g73360_HDG11 At1g17920_HDG12 At4g17710_HDG4 At5g46880_HDG5 At3g61150_HDG1 At4g00730_ANL2 At2g32370_HDG3 At1g05230_HDG2 At5g52170_HDG7 At4g25530_FWA AT3G03260_HDG8 Coleochaete_orbicularis_2036

GTGT.CAAGA GTAC.A.G.G ....-----....-----CCAGTGTGAG ....-----GTTGCAT.TGCATT.C.A....-----TGGA.CCGAC AGAGAA.GTG AATGC...A. ATCTACTG.ACAAACCT.GTTGCAT.TAGACAGA.AA TCAT.C..TTGGCTG.G.G ....-----AAGAAAAGG. ....-----....-----....-----CCGGA.TATC ....-----GGT.-----....-----....-----....-----....-----....-----GCACTGT..A

TC.ATG.C-.ATTC.GT-------------------TGTA------------------------------.CA -----------------AA -----------------C-----------------------------------A---------..TTATT-----------AA----------------------------------.GT.TTA.A. ---------------------------------------------------------------.C..ATTCGG

---------------------------------------------------------------CAAGT.AGG. ---------ATCCT..... ---------------------------------------------------------------------------------------------------------------------CGATTT------------------------------------------------------------------G..GC.CTTG

781 791 801 811 821 | | | | | MpC4HDZ1 CCCGGGCTTA TTTTGATGCG CGCCTTGCAC ACTGATGAGG PcC4HDZ1 ---------- ---------- ---------- ---------PpC4HDZ1 ---------- ---------- ---------- ---------PpC4HDZ2 ---------- ---------- ---------- ---------PpC4HDZ3 ---------- ---------- ---------- ---------PpC4HDZ4 ---------- ---------- ---------- ---------SmC4HDZ1 ---------- ---------- ---------- ---------SmC4HDZ2 ---------- ---------- ---------- ---------SmC4HDZ4 ---------- ---------- ---------- ---------SmC4HDZ3 ---------- ---------- ---------- ---------Pn_0003807 ---------- ---------- ---------- ---------Edi_1007259 TTGCC.A.AT .C.CTCATT. .CTT.GTATA CT.T.CC.CT Edi_1001922 ---------- ---------- ---------- ---------Apl_0023502 .TACACAGGG CAG.T.CA.T TCTGAA.AT. .TGTG.AGT. CruC4HDZ2 ---------- ---------- ---------- ---------Zv_FD774992_1 ---------- ---------- ---------- ---------GbC3HDZ1 ---------- ---------- ---------- ---------GbC4HDZ3 ---------- ---------- ---------- ---------gb_deg2_UTR ---------- ---------- ---------- ---------PmC4HDZ1 ---------- ---------- ---------- ---------PmC3HDZ4 ---------- ---------- ---------- ---------Vpl_0009732 ---------- ---------- ---------- ---------At4g04890_PDF2 ---------- ---------- ---------- ---------At4g21750_ML1 ---------- ---------- ---------- ---------At1g73360_HDG11 ---------- ---------- ---------- ---------At1g17920_HDG12 ---------- ---------- ---------- ---------At4g17710_HDG4 ---------- ---------- ---------- ---------At5g46880_HDG5 ---------- ---------- ---------- ---------At3g61150_HDG1 ---------- ---------- ---------- ---------At4g00730_ANL2 ---------- ---------- ---------- ---------At2g32370_HDG3 ---------- ---------- ---------- ---------At1g05230_HDG2 ---------- ---------- ---------- ---------At5g52170_HDG7 ---------- ---------- ---------- ---------At4g25530_FWA ---------- ---------- ---------- ---------AT3G03260_HDG8 ---------- ---------- ---------- ---------Coleochaete_orbicularis_2036 T.TAACGCAT AGGA.GA.GA A.TACACTTG ...ACA.TAA

831 | GGCATGT---------... -------... -------... -------... -------... -------... -------... -------... -------... -------... TTTCAAGGGC -------... C.TGG.GTAA -------... -------... -------... -------... -------... -------... -------... -------... -------... -------... -------... -------... -------... -------... -------... -------... -------... -------... -------... -------... -------... ACTTCTCCCT

---------.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... TGCGCGCACA .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... CTGGGTACGG

841 851 861 871 881 | | | | | ---------- ---------- ---------- ---------.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... ..........

891 | ---------.......... .......... .......... ..........

---------.......... .......... .......... ..........

MpC4HDZ1 PcC4HDZ1 PpC4HDZ1 PpC4HDZ2 PpC4HDZ3

C.....TTA. C....CGT.. C.......A. C...T.GTCT C.......A. TC...CC..A T.......A. ........A. TT....T.AA GA...CTGTA ........AA T......T.. T.......A. ........A. T.......A. ........AA T.......A. ......G... ---------..C..CGG.C ------------------C.G...TTAT --.......T ---------T..CACCGT. .T....TTTT ---------...T...G.A -..TT----TA...AC.T.C..TG----

TGGG..CAGT TGGG...AGT ....GGAG.. GC.GGCA.G. TC.GGATTCG ATA.T.C..T T.AG.A..CA ..AAGAATGA A.AGTC..TA T.GA.G..G. .TCG.GAG.G T.AG.AC.CA T.AG.A.ATG ACGA.A.TTA T.AG.A.ACA G.AAC.A..A .GGAGGAA.T A.GCTGC.T. ---------.GAA.AAA.G ------------------.CGGT.TAC. TGACC.TTTG ---------ATGGGACA.. T.----------------.C.A..CTTG ----------------------.ACAGT

33

G.ACTCACTC G.ACCAGGCG GAAGAAGAC. ATTGCA.... TA..TGCACT .GA---.... TT.T.ACAAT TA.CA.TTTG AAT---.... TGTGCAAGCT AGAGAGCGCA TT.TAGCGAC GATTCAAGAG AAC.ACACCG TT.T.ACAAT GTCTAGAAGC .ATCCGTAGA .G.CCGAAAA ------.... AG.CT.GAGA ------.... ------.... ------.... GTTTTCTGAA ------.... GACGTAAAAG ------.... ------.... TTTCTAT... ------.... ------.... .TCGTCTCCT

PpC4HDZ4 SmC4HDZ1 SmC4HDZ2 SmC4HDZ4 SmC4HDZ3 Pn_0003807 Edi_1007259 Edi_1001922 Apl_0023502 CruC4HDZ2 Zv_FD774992_1 GbC3HDZ1 GbC4HDZ3 gb_deg2_UTR PmC4HDZ1 PmC3HDZ4 Vpl_0009732 At4g04890_PDF2 At4g21750_ML1 At1g73360_HDG11 At1g17920_HDG12 At4g17710_HDG4 At5g46880_HDG5 At3g61150_HDG1 At4g00730_ANL2 At2g32370_HDG3 At1g05230_HDG2 At5g52170_HDG7 At4g25530_FWA AT3G03260_HDG8 Coleochaete_orbicularis_2036

.......... .......... .......... .......... .......... .......... .......... .......... AAAAGAGTAC .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... GAAGGAAGGA

.......... .......... .......... .......... .......... .......... .......... .......... ATGTATGTGC .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .........G .......... .......... GTGAGAGCTT

.......... .......... .......... .......... .......... .......... .......... .......... AATATCCAGG .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... TCTGAAACAA .......... .......... GAGGTGGAGA

901 911 921 931 941 | | | | | MpC4HDZ1 ---------- ---------- ---------- ---------PcC4HDZ1 .......... .......... .......... .......... PpC4HDZ1 .......... .......... .......... .......... PpC4HDZ2 .......... .......... .......... .......... PpC4HDZ3 .......... .......... .......... .......... PpC4HDZ4 .......... .......... .......... .......... SmC4HDZ1 .......... .......... .......... .......... SmC4HDZ2 .......... .......... .......... .......... SmC4HDZ4 .......... .........G ATCACTGGCG GTGGAACAAC SmC4HDZ3 .......... .......... .......... .......... Pn_0003807 .......... .......... .......... .......... Edi_1007259 .......... .......... .......... .......... Edi_1001922 .......... .......... .......... .......... Apl_0023502 CACTTGTTTG ATCTTTCGTA TATGTATAGC CTTTGTGAAG CruC4HDZ2 .......... .......... .......... .......... Zv_FD774992_1 .......... .......... .......... .......... GbC3HDZ1 .......... .......... .......... .......... GbC4HDZ3 .......... .......... .......... .......... gb_deg2_UTR .......... .......... .......... .......... PmC4HDZ1 .......... .......... .......... .......... PmC3HDZ4 .......... .......... .......... .......... Vpl_0009732 .......... .......... .......... .......... At4g04890_PDF2 .......... .......... .......... .......... At4g21750_ML1 .......... .......... .......... .......... At1g73360_HDG11 .......... .......... .......... .......... At1g17920_HDG12 .......... .......... .......... .......... At4g17710_HDG4 .......... .......... .......... .......... At5g46880_HDG5 .......... .......... .......... .......... At3g61150_HDG1 .......... .......... .......... .......... At4g00730_ANL2 .......... .......... .......... .......... At2g32370_HDG3 .......... .......... .......... .......... At1g05230_HDG2 .......... .......... .......... .......... At5g52170_HDG7 TTTGTGAGAG TGTAGCCAAA TTTGAGACTG AGTAGCTAGT At4g25530_FWA .......... .......... .......... .......... AT3G03260_HDG8 .......... .......... .......... .......... Coleochaete_orbicularis_2036 GAGTCGCTGC CAAACGGTGG TGGATTGCTG GGGAGGTAAT

951 | ---------.......... .......... .......... .......... .......... .......... .......... ACTGGTGATT .......... .......... .......... .......... ATCGAATGTG .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... AGAACAAACA .......... .......... GGACTCTGTG

---------.......... .......... .......... .......... .......... .......... .......... TCGCTCACAC .......... .......... .......... .......... TGTGCTCGTG .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... AATAAACAAA .......... .......... GATTCAGAAA

961 971 981 991 1001 1011 | | | | | | ---------- ---------- ---------- ---------- ---------.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... ..........

---------.......... .......... .......... .......... ..........

MpC4HDZ1 PcC4HDZ1 PpC4HDZ1 PpC4HDZ2 PpC4HDZ3 PpC4HDZ4

.......... .......... .......... .......... .......... .......... .......... .......... CATTTGATCT .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... GAATTGTACC

.......... .......... .......... .......... .......... .......... .......... .......... TGAGTAGGTA .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... GATCTGATAA

34

.......... .......... .......... .......... .......... .......... .......... .......... CTCTCGACAG .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... GAAGCGCGCA

SmC4HDZ1 SmC4HDZ2 SmC4HDZ4 SmC4HDZ3 Pn_0003807 Edi_1007259 Edi_1001922 Apl_0023502 CruC4HDZ2 Zv_FD774992_1 GbC3HDZ1 GbC4HDZ3 gb_deg2_UTR PmC4HDZ1 PmC3HDZ4 Vpl_0009732 At4g04890_PDF2 At4g21750_ML1 At1g73360_HDG11 At1g17920_HDG12 At4g17710_HDG4 At5g46880_HDG5 At3g61150_HDG1 At4g00730_ANL2 At2g32370_HDG3 At1g05230_HDG2 At5g52170_HDG7 At4g25530_FWA AT3G03260_HDG8 Coleochaete_orbicularis_2036

.......... .......... .......... .......... .......... .......... .......... ATATGTATTT .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... TTTTTGT... ..GTTGA... .......... GGGGATCCCT

.......... .......... .......... .......... .......... .......... .......... GCACATGCAG .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... GAGTGCCACA

1021 1031 1041 1051 1061 1071 | | | | | | MpC4HDZ1 ---------- ---------- ---------- ---------- ---------PcC4HDZ1 .......... .......... .......... .......... .......... PpC4HDZ1 .......... .......... .......... .......... .TAAAGAAAG PpC4HDZ2 .......... .......... .......... .......... ..CTGCGAAG PpC4HDZ3 .......... .......... .......... .......... .......... PpC4HDZ4 .......... .......... .......... .......... .......... SmC4HDZ1 .......... .........A TCAACGACGA CGAAAAGACG TGGTTTGCTA SmC4HDZ2 .......... .......... .......... .......... .......... SmC4HDZ4 .......... .......... .......... .......... .......... SmC4HDZ3 .......... .......... .......... .......... .......... Pn_0003807 .......... .......... .......... .......... .......... Edi_1007259 .......... .......... .......... .......... .......... Edi_1001922 .......... .......... .......... .......... .......... Apl_0023502 TCGCAAATGG TCCTTCCTAC TTCGTCTCCT CTTTGAAGGA TGATCTTGTG CruC4HDZ2 .......... .......... .......... .......... .......... Zv_FD774992_1 .......... .......... .......... .......... .......... GbC3HDZ1 .......... .......... .......... .......... .......... GbC4HDZ3 .......... .......... .......... .......... .......... gb_deg2_UTR .......... .......... .......... .......... .......... PmC4HDZ1 .......... .......... .......... .......... .......... PmC3HDZ4 .......... .......... .......... .......... .......... Vpl_0009732 .......... .......... .......... .......... .......... At4g04890_PDF2 .......... .......... .......... .......... .......... At4g21750_ML1 .......... .......... .......... .......... .......... At1g73360_HDG11 .......... .......... .......... .......... .......... At1g17920_HDG12 .......... .......... .......... .......... .......... At4g17710_HDG4 .......... .......... .......... .......... .......... At5g46880_HDG5 .......... .......... .......... .......... .......... At3g61150_HDG1 .......... .......... .......... .......... .......... At4g00730_ANL2 .......... .......... .......... .......... .......... At2g32370_HDG3 .......... .......... .......... .......... .......... At1g05230_HDG2 .......... .......... ...AAGAGAA TACCATTGAG TGTTTGTTAG At5g52170_HDG7 .......... .......... .......... .......... .......... At4g25530_FWA .......... .......... .......... .......... .......... AT3G03260_HDG8 .......... .......... .......... .......... .......... Coleochaete_orbicularis_2036 CCGGGGCTTT GATTGTCGCT CTCAGCCTCT CTTTTGCCTT TGCCGTATTC

---------.......... CCTGAACTGG AAAAAGCCTG .......... .......... ATTTTTATTC .......... .......... .......... .......... .......... .......... GCACGTTTAA .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... TGTTAAGTTT .......... .......... .......... TTGTCTCTGG

1081 1091 1101 1111 1121 1131 | | | | | | ---------- ---------- ---------- ---------- ---------.......... .......... .......... .......... .......... CGATCTTTTC TTTGAAGCCA TGATTCTTTT AAAGTTGTTC AGGATTAAGA AAATGGCTCT CTCTTCATGG AAGAAAAATG TCATTTTGAA GTAGCTCAAG .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... GAGAGGAGAG TGAGAGAAGA GGTTTTTTTT TTTTGGATTC CTTCTTCTTC

---------.......... GCTTGAA... GATTAATAGC .......... .......... TTCCTTCTTC

MpC4HDZ1 PcC4HDZ1 PpC4HDZ1 PpC4HDZ2 PpC4HDZ3 PpC4HDZ4 SmC4HDZ1

.......... .......... TGGCGCAAAC .......... .......... .......... .......... CATGCGAGGA .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... CAAATGGACC .......... .......... GTGGGAGGGT

.......... .......... TCT....... .......... .......... .......... .......... CACGTCTCGT .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... AAAGTTTACA .......... .......... GAGCCATCGG

35

.......... .......... .......... .......... .......... .......... .......... CTTTATGGCT .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... ATTTGCAAAA .......... .......... ATAAGTGG..

.......... .......... .......... .......... .......... .......... .......... ATAGAAGATC .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... CTGATTTGTT .......... .......... ..GGAGCAGG

SmC4HDZ2 SmC4HDZ4 SmC4HDZ3 Pn_0003807 Edi_1007259 Edi_1001922 Apl_0023502 CruC4HDZ2 Zv_FD774992_1 GbC3HDZ1 GbC4HDZ3 gb_deg2_UTR PmC4HDZ1 PmC3HDZ4 Vpl_0009732 At4g04890_PDF2 At4g21750_ML1 At1g73360_HDG11 At1g17920_HDG12 At4g17710_HDG4 At5g46880_HDG5 At3g61150_HDG1 At4g00730_ANL2 At2g32370_HDG3 At1g05230_HDG2 At5g52170_HDG7 At4g25530_FWA AT3G03260_HDG8 Coleochaete_orbicularis_2036

.......... .......... .......... .......... .......... .......... CCTCATGTAG .......... .......... .......... .....ATCAT .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... GCTTTTGGAT

.......... .......... .......... .......... .......... .......... CCTCTGTTCT .......... .......... .......... TTCATGACCC .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... ....TATCTT

1141 1151 1161 1171 1181 1191 | | | | | | MpC4HDZ1 ---------- ---------- ---------- ---------- ---------PcC4HDZ1 .......... .......... .......... .......... TGCACTTCAG PpC4HDZ1 .......... .......... .......... ......TGCG CGGTCGTGGT PpC4HDZ2 CTAGACACGC GCGCGCGCGG TCCTCTCGAG GCCCCGCTCC TGCAGTTGGC PpC4HDZ3 .......... .......... .......... .....GGTGC CCCAGCCCGA PpC4HDZ4 .......... .......... .......... .......... ....GGAGGG SmC4HDZ1 ATGCATACTT TCTAGCGCAG CGGATGGATT GAGAGGCCTT TTTCTGGTTC SmC4HDZ2 .......... .......... .......... .......... .......... SmC4HDZ4 .......... .......... .......... .......... .......... SmC4HDZ3 .......... .......... .......... .......... .......... Pn_0003807 .......... .......... .......... .......... .......... Edi_1007259 .......... .......... .......... .......... .......... Edi_1001922 .......... .......... .......... .......... .......... Apl_0023502 CGT....... .......... .......... .......... .......... CruC4HDZ2 .......... .......... .......... .......... .......... Zv_FD774992_1 .......... .......... .......... .......... .......... GbC3HDZ1 .......... .......... .......... .......... .......... GbC4HDZ3 AGGGAAATGT GCAATTTGTT GGTTTTCAGC AAAAGCAAGG CATTTTTCTA gb_deg2_UTR .......... .......... .......... .......... .......... PmC4HDZ1 .......... .......... .......... .......GCT TCTTTCCTTC PmC3HDZ4 .......... .......... .......... .CTAGTGTGT GTGTTTTTCC Vpl_0009732 .......... .......... .......... .......... .......... At4g04890_PDF2 .......... .......... .......... .......... .......... At4g21750_ML1 .......... .......... .......... .......... .......... At1g73360_HDG11 .......... .......... .......... .......... .......... At1g17920_HDG12 .......... .......... .......... .......... .......... At4g17710_HDG4 .......... .......... .......... .......... .......... At5g46880_HDG5 .......... .......... .......... .......... .......... At3g61150_HDG1 .......... .......... .......... .......... .......... At4g00730_ANL2 .......... .......... .......... .......... .......... At2g32370_HDG3 .......... .......... .......... .......... .......... At1g05230_HDG2 .......... .......... .......... .......... .......... At5g52170_HDG7 .......... .......... .......... .......... .......... At4g25530_FWA .......... .......... .......... .......... .......... AT3G03260_HDG8 .......... .......... .......... .......... .......... Coleochaete_orbicularis_2036 CACATTCTTT AATGAACATT TTCTCATGCA TTGGTCCTGG TGGAGATACT

---------GACAAAGAGT CAGACCTTTC GCTGGC.ATC CCTTCTTCTT GCGGGATGGT TTCCTCCTCG .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... CTTGGGATGA .......... TTCTCAATGC ACCGTCTCTG .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... CTTTGAAATC

1201 1211 1221 1231 1241 1251 | | | | | | ---------- ---------- ---------- -GTCTCACCA CTC---TGCA TAGTCTGATG TTAAGTGTCC CGTGACACTA C.CT...... .--.TTG... ATGAACC.TG CACACTAGCC TCAGTACATG T.GTG..... .GTATAGC.. ATCAACT.TG CATAGCTTCC TCCGTGCAGG T.C.G..... .G-.TAG-.C GCCCACTTGC CCTAATTATA TTGGTGCATG T.CT...... .G-.CAG-.. CGCACAGTGG AGGGAGCAGA GCGGTGCATG T.CTA..... .G-.CAG..ATTCTTCCTT GCGCTAGAAG CTGTGCATGG T.CT...... ..-.CGG-.. .......... .......... .......... ..AGA..... .--.CT..T.

AGGCT--AGT ...G...... ...G...... ...G...... ...G...... ...G...... ...AG..C.. ....G.....

MpC4HDZ1 PcC4HDZ1 PpC4HDZ1 PpC4HDZ2 PpC4HDZ3 PpC4HDZ4 SmC4HDZ1 SmC4HDZ2

.......... .......... .......... .......... .......... .......... CTCCCTTCTC .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... TGGTCTGCTT .......... .......... .......... TTATTATCTG

.......... .......... .......... .......... .......... .......... ATATCATCTT .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... ATTTGATGAA .......... .......... .......... TCGTGGAATG

36

.......... .......... .......... .......... .......... .......... GCTAAAAAGA .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... ACTAAGCAGT .......... .......... .......... GTTTACGCGC

.......... .......... .......... .......... .......... .......... CCTTGTAAGT .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... GAAAAACTTT .......... .......... .......... CGAGATGAGA

SmC4HDZ4 SmC4HDZ3 Pn_0003807 Edi_1007259 Edi_1001922 Apl_0023502 CruC4HDZ2 Zv_FD774992_1 GbC3HDZ1 GbC4HDZ3 gb_deg2_UTR PmC4HDZ1 PmC3HDZ4 Vpl_0009732 At4g04890_PDF2 At4g21750_ML1 At1g73360_HDG11 At1g17920_HDG12 At4g17710_HDG4 At5g46880_HDG5 At3g61150_HDG1 At4g00730_ANL2 At2g32370_HDG3 At1g05230_HDG2 At5g52170_HDG7 At4g25530_FWA AT3G03260_HDG8 Coleochaete_orbicularis_2036

.--.GG.... GA..CAG... .--.TA.... ---...---A--.GTC... .--.GA.... .--.CA.... .--.AA.... .--.TC.... .--.TG.... .--.TA.... .--.CA.... .--.TCA... .--.CGA... G-ATGAAAA. ---...---T.TGAC.TTG ---...------...-TGC ---AGAAAAG ---TCCGTTT ---GGTG.AG ---...---A.GTAGATGG --GATAC-AT ---...------TCCGATC .--..A..A.

....A...A. ........A. ...G....T. -----..T.. T..G....A. G..TA..T.. ...G...... ...G...... ...A...... ...G...... G..G....T. ...G...... ...A...... .T.G...G.. ...TAA.... -----....A TTTG-..TA. -----..TA. TT.--...T. C.TA-..GC. TT.T-..GA. GA.T...G.. -----..C.A TTTTA..C.A .AA--...AA -----..TC. G.TAA..GA. C.CTG..TTG

1261 1271 1281 1291 1301 1311 | | | | | | MpC4HDZ1 GGTTCGGGTA -TTGACTT-- -----TCTT- ---------- ---------PcC4HDZ1 .......... .......... .....A..-. .......... .......... PpC4HDZ1 .........G .......... .....A.C.T CGACCTCCCC TCCCTCCCTT PpC4HDZ2 .......... .......... .....A.C.T CAACCCTCCT CTTTCATTTC PpC4HDZ3 .......... .......... .....A...T CAACCATCCC TCTTTCATTT PpC4HDZ4 ........C. ......CG.. .....AG..C CGGGGGTCCC CCGTTGGATT SmC4HDZ1 .......... .......... ......G..T ACCAACTTGG TCCTGTTCTT SmC4HDZ2 .......... .......... .......G.C CTCTCGTGTG TGCTCTTAAG SmC4HDZ4 ........C. .......... .....CAC.T CCACTCCAGA TTTCCATGTA SmC4HDZ3 .......... .......C.. .......C.T TTTCTGATCC ATCGTTCCTT Pn_0003807 .......... .......... ......A--. .......... .......... Edi_1007259 ........C. ......A... .....CTGCT GTTTCTTCTC TTTTCACAGA Edi_1001922 ........A. .....T.A.. .....CGCCC TTTCCCAAAC TCTTTTTTTT Apl_0023502 ....A..... .......C.. ......A.CC TCGTACCTCA GCTTCCAGTA CruC4HDZ2 .......... .......C.. .....C..A. .......... .......... Zv_FD774992_1 .......... .......... .....G.CAT CAGAAGCGTT TTGCAACAAT GbC3HDZ1 .......... .......... .......CAC CAGGCGTTTT CCTACTGGAA GbC4HDZ3 .......... .......C.. .....----. .......... .......... gb_deg2_UTR .......... .......... ......ACAT .......... .......... PmC4HDZ1 .......... .......C.. .....C..GC AAAATGTTAA TCGCTTGGTA PmC3HDZ4 ........AT .......... .....CACAT GCATCTCTCT TTTAATCGTT Vpl_0009732 .......... .......C.. .....CGCCT AATCTATCTG TTGTTAATTG At4g04890_PDF2 .......... .......... .....CTGCT GGAAACCACG GTTTTAAATT At4g21750_ML1 .......... .......... .....CTGCT GGAACCAAAA AAAAAAGAAT At1g73360_HDG11 ........C. .......... .....CTA.T GCAAAGTTTT TTTGAAAGCT At1g17920_HDG12 ........C. .......... .....CTG.T TGTTTGTTTT TTACTTCTTT At4g17710_HDG4 .......... .......... .....C.AAC TTGGTTAAAC GAAAACGTTT At5g46880_HDG5 .......... .......... ........AC TTGGTTTGAC TCCTTTGAAA At3g61150_HDG1 .......... .......C.. .....GA.GA AATTT..... .......... At4g00730_ANL2 .......... .......G.. .....GGG.T TGGAAACCCC CTCTTTACTC At2g32370_HDG3 ........-. T......A.. .....G.CCT GGAATAAACC CATTCAAGAT At1g05230_HDG2 ........A. T......... .....C.CCT GTCACATACT GAATTAGACA At5g52170_HDG7 C...-...A. AAA.CTAC.. .....AGACT TCTCAAGAAA CATACAAGAC At4g25530_FWA .T..GCT.G. TA.TT..... .....G...C CTTTCTTCTT TCCTTTTCTG AT3G03260_HDG8 TACAGACCG. .....TGA.. .....AAG.A ATAAAACAAC CCATCCTATC Coleochaete_orbicularis_2036 .TGG...... ...C..AA.. .....A.CAG AGATCTCCTT GTTTCTTCTC

---------.......... CTCCTCTTTC TCACTGCAAC CTCACGGCAT TGTACCAGCA CATTTTTCTG TTATGGAAAA TCAGTTGTTT TTTTTCTTTT .......... ATATTATGAT ACTGTTTTAC GATCATCTAC .......... ATTTAAATAT AGATATATTT .......... .......... CTTCATCTTT TGTAATGTTT CTGTTATGTT GTCTATTTTG CGGGTTTGTT TTAAACTTTG TTGTCTAATT TATATTATGG TGTTTTGGCT .......... TTTTTGTTTG GGAAAAAACA AAAACAAAAA ACCTCGACGA AAACTCTCAA ATGGGTTTTT TAGTCCC...

1321 1331 1341 1351 1361 1371 | | | | | | ---------- ---------- ---------- ---------- ---------.......... .......... .......... .......... .......... ATTTCTTACA GCATTCAACA TTGC...... .......... .......... ATTCAACGTT GCTCATCTGA TCCTGTCATC GACAGCTCTA TGGATTTTGG TTGAACATTG CTCAGCCAAT CCTGTCTTTG ACAGCTTTGT .......... TGTGGAAGGA TGCGGGGGCA AATCTTGTGT TTGACAGCTT TGCGGATTTT TGTTCTAGTA GTCTTTTTTT C......... .......... .......... TGAAAAACAA AAGACCCA.. .......... .......... .......... GCTTTCGATT GATCTAAAAT CTAAGAAAAA TTGTGCTCTT GACTTCGTGT

---------.......... .......... ACCCTGTAAT .......... GGAAGCCTGT .......... .......... CCAAAGTTCT

MpC4HDZ1 PcC4HDZ1 PpC4HDZ1 PpC4HDZ2 PpC4HDZ3 PpC4HDZ4 SmC4HDZ1 SmC4HDZ2 SmC4HDZ4

.......... .......... .......... .......... .......... .......... .......... .......... .......... TGGCAATCAG .......... AATCATCAAA GCTGGGGCGC .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... TGATGTTGTA

.....GTGCA .......... .......... .......... ........TT .......... .......... .......... .......... TTGTGCATGA .......... CAACACCTGA CCACACTACG .......... ........GC .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .........G TGCAATGTTG

37

CGACGAGTCT ..CAAACTTC .......... .......... TGAATGTGGG ........AC .....CATGA .......... .......... GA........ .......... TATTTGCATA CCTGTGCATG ......TGTA GTTTAATTTC .......... .......... .......... .......... .........T ........TT .....TCGGG .......... ....TTACTT GTTTATGCCA .......... CTTTAAGTCT AGCTACA...

C.CT...... C..T.TT... .ACT...... .--------CT.T...... CT..CT..TG GC.T.T.... TACT...... .--T.....T .--T.T.... .ACT...... A.CT.T.... T.CT...... ATC....G.. ATCTCA.G.T .--------.-ATR..A..--------.-----.A.C.GT.TGA.T..GATGAATCCGATCA..--------GAAAGTGAAT TT.TC...T.--------TAAGAT.TTCC.GC.TG..

SmC4HDZ3 Pn_0003807 Edi_1007259 Edi_1001922 Apl_0023502 CruC4HDZ2 Zv_FD774992_1 GbC3HDZ1 GbC4HDZ3 gb_deg2_UTR PmC4HDZ1 PmC3HDZ4 Vpl_0009732 At4g04890_PDF2 At4g21750_ML1 At1g73360_HDG11 At1g17920_HDG12 At4g17710_HDG4 At5g46880_HDG5 At3g61150_HDG1 At4g00730_ANL2 At2g32370_HDG3 At1g05230_HDG2 At5g52170_HDG7 At4g25530_FWA AT3G03260_HDG8 Coleochaete_orbicularis_2036

AGCACATTTT .......... .......... TTACTTTCTT CCTTGTATCA .......... TGTAATTCAC GTGTCGTCGA .......... .......... TGACGGCATT ATGTTATTTT GGACTCGTAT CTTTTTGCGT ATTTATCATT .......... CCATAGTTTC ATCTACTTAT AAAAAAAAAC .......... CGTGGGGTGT ATTGTTTGGA CAGTTACTGT AAAGAACCCA .......... .......... GAGATCGGAG

TC........ .......... .......... ACCCTCTCTC ACTCTGCTGC .......... AGTTTGTCAA GGAATTGTCA .......... .......... GA........ AGGCACATCC GGGGTGTTTT TCTCCGACCC GACTAGTGAA .......... TC........ GGTTTATTAT TGAGAC.TAG .......... GTTGTGGTTA CTTGGAACCT TTTTCTTTCC CCAAGACATC .......... .......... GAAACCCAAC

1381 1391 1401 1411 1421 1431 | | | | | | MpC4HDZ1 ---------- ---------- ---------- ---------- ---------PcC4HDZ1 .......... .......... .......... .......... .......... PpC4HDZ1 .......... .......... .......... .......... .......... PpC4HDZ2 CACCTCTTCC ATGCAATGTA TTTAACTTCC C......... .......... PpC4HDZ3 .......... .......... .......... .......... .......... PpC4HDZ4 GATGAGTACA ACACGGCTCA TCATTGCACG TTGTATAACC TTCGTGTTCC SmC4HDZ1 .......... .......... .......... .......... .......... SmC4HDZ2 .......... .......... .......... .......... .......... SmC4HDZ4 CGTTCGAATG GGAGAAAGCG AGTGAGACTA GTTCTTGACG ATCTTTTCTT SmC4HDZ3 .......... .......... .......... .......... .......... Pn_0003807 .......... .......... .......... .......... .......... Edi_1007259 .......... .......... .......... .......... .......... Edi_1001922 CTTTCAGACG GAAGAAAAAA AATGCAGCAA ATTATTTTCC TGCTTTTTTT Apl_0023502 AACAGTCTGT TCAAGAAAAG GCTGGTAAAT AATGCCTAAA ATTTCAACAC CruC4HDZ2 .......... .......... .......... .......... .......... Zv_FD774992_1 CTTCTGTCTG ATATTGTTGA TAAAAAAGAT TCTTCTCTTA CGATGCWCTG GbC3HDZ1 TGTAATTTGG CTAATGTCTG AATTGGAAGT TTAATAGTTC CTTGCCTATA GbC4HDZ3 .......... .......... .......... .......... .......... gb_deg2_UTR .......... .......... .......... .......... .......... PmC4HDZ1 .......... .......... .......... .......... .......... PmC3HDZ4 CTCTTTTATG CGTGCCTCTG GATAAGTTTT .......... .......... Vpl_0009732 CTCTATGTTG TTTTCTCGCG ATATTTACTT TTTAAATGTT TATTTAGTTG At4g04890_PDF2 AAGTTGTCAG TTTCTTTGTT GTTTTGTATT GTAAGCCCTT TTGTTGCACT At4g21750_ML1 CAGTTTAGCG TTCTGCTTTT CGCGTCTACT GTGAAACTCC TTGTTATTAA At1g73360_HDG11 .......... .......... .......... .......... .......... At1g17920_HDG12 .......... .......... .......... .......... .......... At4g17710_HDG4 TCGTATTGAA TCCTTGTAAA ATAATGTACA ACTAGGAAAA CCACCTCACA At5g46880_HDG5 TTCTCTCTCT CTCTCTCTCT CAAGGGCGA. .......... .......... At3g61150_HDG1 .......... .......... .......... .......... .......... At4g00730_ANL2 GAAAACGCGA ACTGTGAGAT GATGATGTTA TGTTGGATGG GATCTGTATC At2g32370_HDG3 TTTGCTATTT CAGTTGAGTC TTTGGTTTCC TTGTCATTGT TGGACTTGGA At1g05230_HDG2 TTCTTGTGGT TAGATGGACC ATCATCAGGA ATTTGGAGTT TGTCTTTCTT At5g52170_HDG7 AACAAACCAA CTTGTTTAAT GAAATCGCTA CAAGGCAAAG AACCACATAC At4g25530_FWA .......... .......... .......... .......... .......... AT3G03260_HDG8 .......... .......... .......... .......... .......... Coleochaete_orbicularis_2036 TGCGCATTTT TTCTCTCTTT TCTTTGTTGT TGTGCCGGCC AGCACCTGCG

---------.......... .......... .......... .......... CTTTGTCTGC .......... .......... CGTTGGATG. .......... .......... .......... AAACCTCCCT AGATTCACGT .......... TTTCATTCGT TCTTATC... .......... .......... .......... .......... AATGAATGAA TGACAGATAT GCCACTCTAG .......... .......... AATTAATTGC .......... .......... GTTTCAAGTT ACCTTTTGTT TTGTATAAAT CAGTTGAGAT .......... .......... GACTTTTGTA

1441 1451 1461 1471 1481 1491 | | | | | | ---------- ---------- ---------- ---------- ---------.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... GGAGCCGTCT GATAGTCCTG TTTGGTTGGT GGCCTGATTT CGTGACTGTG .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... ..........

---------.......... .......... .......... .......... TATGTGGTAT .......... .......... .......... ..........

MpC4HDZ1 PcC4HDZ1 PpC4HDZ1 PpC4HDZ2 PpC4HDZ3 PpC4HDZ4 SmC4HDZ1 SmC4HDZ2 SmC4HDZ4 SmC4HDZ3

TTCTTTTTTT .......... ATGAAAATTA TTTAACCTTT CTTTTGTATC .......... TGGAATCTTG ATGCAGAGAA .......... .......... CAGCTGAACT TAACTTTAAA GCTGTTCAAC CGTTTCGTTG GTGTTTCGGC TTCATCTCTT ACAATGACCA TTTTGTTCAA TATTTGCAAT .......... TGTTTGTTAT ACTATCTTTT CTAGGTTAGA TAGAAAACAA ATTGTTACAA CTT....... .ATGGCCGGG

CATTTTTATC .......... CACATTGCAA CATTCAACGC CTCCCTCTTG .......... TCCATGATTG TGATCTGTAC .......... .......... CAATCATCCA TGCTCGGACT TTATTTAGGT ACAGTTTTTT GGTTTAGCGT CATTAATGGT AATTACTATT AACAAAGCTA TAATATGCTA .......... TTTTATTATG CCTTTCATTT AAGAATGCTT GACTCGCTGA ACCAGAATTA .......... ACTTGTGCAT

38

TGAGATATTA .......... GTTTTTT... ATCGCCGTTG TGGGTTGTTC .......... TAAATTTGAG ATTGAAGGAT .......... .......... TTCATGTAAA TGCTGTTACT TCTGCGTTGT GAACACCTTT TTTGCGTTTT CT........ TCTTCTTGGT GTTTAATGCT CCTTCTTTAT .......... TAGCGCGAGT GAGTCTTTTG TCGGATTTCT AGTAAAACCC ATAGAGAGTT .......... GCTGGGACGT

CGGACTCATC .......... .......... ACCATTGTAC GTTGTTCTCG .......... GATATCCTTC TCTACTTTTT .......... .......... TGATATGACA ATTATAATCT TTTTCCTCTC TGGGTTTCCC CTTTGTTATT .......... TTAATTTATT AATTACGAAA GAAAAAAAAA .......... AAGCCACACG TTTCCTTGTC TTTTGTGTTA GACCAAACAA AG........ .......... GGATGTGAGA

Pn_0003807 Edi_1007259 Edi_1001922 Apl_0023502 CruC4HDZ2 Zv_FD774992_1 GbC3HDZ1 GbC4HDZ3 gb_deg2_UTR PmC4HDZ1 PmC3HDZ4 Vpl_0009732 At4g04890_PDF2 At4g21750_ML1 At1g73360_HDG11 At1g17920_HDG12 At4g17710_HDG4 At5g46880_HDG5 At3g61150_HDG1 At4g00730_ANL2 At2g32370_HDG3 At1g05230_HDG2 At5g52170_HDG7 At4g25530_FWA AT3G03260_HDG8 Coleochaete_orbicularis_2036

.......... .......... .......... CATG...... .......... GGTTAGCCTT .......... .......... .......... .......... .......... .......... TTAGCTCTCT .......... .......... .......... TTAAACTTTG .......... .......... .......... TGGAAACCTT .......... AAAGAATCAC .......... .......... TGGGGACAAG

.......... .......... .......... .......... .......... GATTAATTTT .......... .......... .......... .......... .......... .......... CTTATCCCAT .......... .......... .......... TCGAGCATAC .......... .......... .......... TTGTTATTTC .......... TAGCACCGAA .......... .......... GATTGAGCAA

1501 1511 1521 1531 1541 1551 | | | | | | MpC4HDZ1 ---------- ---------- ---------- ---------- ---------PcC4HDZ1 .......... .......... .......... .......... .......... PpC4HDZ1 .......... .......... .......... .......... .......... PpC4HDZ2 .......... .......... .......... .......... .......... PpC4HDZ3 .......... .......... .......... .......... .......... PpC4HDZ4 CGTTTTGCAG TTGTTTCCGA CACGTGCGCT GTACATTTTT GTGAGATTTA SmC4HDZ1 .......... .......... .......... .......... .......... SmC4HDZ2 .......... .......... .......... .......... .......... SmC4HDZ4 .......... .......... .......... .......... .......... SmC4HDZ3 .......... .......... .......... .......... .......... Pn_0003807 .......... .......... .......... .......... .......... Edi_1007259 .......... .......... .......... .......... .......... Edi_1001922 .......... .......... .......... .......... .......... Apl_0023502 .......... .......... .......... .......... .......... CruC4HDZ2 .......... .......... .......... .......... .......... Zv_FD774992_1 GGGGTTTAGT CGCCTAAACA TTACACAGCT GCACTACTTT TATTTAGGCA GbC3HDZ1 .......... .......... .......... .......... .......... GbC4HDZ3 .......... .......... .......... .......... .......... gb_deg2_UTR .......... .......... .......... .......... .......... PmC4HDZ1 .......... .......... .......... .......... .......... PmC3HDZ4 .......... .......... .......... .......... .......... Vpl_0009732 .......... .......... .......... .......... .......... At4g04890_PDF2 TCAACCAACG GTCCAATTTT CTAAC..... .......... .......... At4g21750_ML1 .......... .......... .......... .......... .......... At1g73360_HDG11 .......... .......... .......... .......... .......... At1g17920_HDG12 .......... .......... .......... .......... .......... At4g17710_HDG4 ATAATACATT ATGTGATCCA CATTTCCAGA TGGATCTGAT CTTATGAGAA At5g46880_HDG5 .......... .......... .......... .......... .......... At3g61150_HDG1 .......... .......... .......... .......... .......... At4g00730_ANL2 .......... .......... .......... .......... .......... At2g32370_HDG3 AGTTGAGTCT GTTTGTTTCC TTGTCATTGT TGGACATGGA ACCTTTCGTT At1g05230_HDG2 .......... .......... .......... .......... .......... At5g52170_HDG7 GGCGCCAAAA ACTGCAGAGC AACAAGGCAT AGGTAATGGT TGTGAAGAAA At4g25530_FWA .......... .......... .......... .......... .......... AT3G03260_HDG8 .......... .......... .......... .......... .......... Coleochaete_orbicularis_2036 CTAGAGGTGG ACAGGAGGAG CAGTTGAGCA CCGCCAGATG AGTGAGAGCT

---------.......... .......... .......... .......... TGATCATGTT .......... .......... .......... .......... .......... .......... .......... .......... .......... AATTTTGGGG .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... GCTTCGTAAT .......... .......... .......... GTTTCAGTTG .......... CATCTTTGCA .......... .......... TACATTTGTG

1561 1571 1581 1591 1601 1611 | | | | | | ---------- ---------- ---------- ---------- ---------.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... TGTGTCGTGG TAGTGTGTGC TG........ .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... ..........

---------.......... .......... .......... .......... .......... .......... .......... .......... .......... ..........

MpC4HDZ1 PcC4HDZ1 PpC4HDZ1 PpC4HDZ2 PpC4HDZ3 PpC4HDZ4 SmC4HDZ1 SmC4HDZ2 SmC4HDZ4 SmC4HDZ3 Pn_0003807

.......... .......... TGTCCTTTTT GCTTAGTTCC .......... AGTTGGGTTG .......... .......... .......... .......... .......... CTGGT..... GGAGTTAAAA TGGTACTGTC .......... .......... TCAGAAACCA .......... .......... .......... ATTTCATTCG ATCTTATACA ACAGCAAAAG .......... .......... GACGAGCATC

.......... .......... T......... CGAATTGTCG .......... CTGGGCGGTC .......... .......... .......... .......... .......... .......... TCTTTGTTAT ATTATATATT .......... .......... AACAATCCAC .......... .......... .......... AGTCTTTGGT AGTATTTTGG AGCTCAAGTC .......... .......... CTGTCATTTG

39

.......... .......... .......... TTAATTTCAA .......... TGTTGTGTTT .......... .......... .......... .......... .......... .......... TAGTACACTA ATGAATCTAT .......... .......... TTAACAATGC .......... .......... .......... TTCCTTGTCA TACTTTTGT. AATCTGAGAC .......... .......... TTTGTTTTTT

.......... .......... .......... TACAAATGTA .......... TCATTGCGTT .......... .......... .......... .......... .......... .......... CTAGTTTCTG GAAACTGTG. .......... .......... AAACAACCAA .......... .......... .......... TTGTTGGACT .......... AAGCCGAGGA .......... .......... GAAGTGTGCT

Edi_1007259 Edi_1001922 Apl_0023502 CruC4HDZ2 Zv_FD774992_1 GbC3HDZ1 GbC4HDZ3 gb_deg2_UTR PmC4HDZ1 PmC3HDZ4 Vpl_0009732 At4g04890_PDF2 At4g21750_ML1 At1g73360_HDG11 At1g17920_HDG12 At4g17710_HDG4 At5g46880_HDG5 At3g61150_HDG1 At4g00730_ANL2 At2g32370_HDG3 At1g05230_HDG2 At5g52170_HDG7 At4g25530_FWA AT3G03260_HDG8 Coleochaete_orbicularis_2036

.......... .......... .......... .......... AATTTAGGTA .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... GCCGTGGTGC .......... .......... .......... TTATGAAGTT .......... TCTGATACCA .......... .......... TCCTATGTAT

.......... .......... .......... .......... AATTTCATGC .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... TTTTGGACGT .......... .......... .......... ATTGAGTACT .......... CGAAAGGACT .......... .......... GTAAATTACA

1621 1631 1641 1651 1661 1671 | | | | | | MpC4HDZ1 ---------- ---------- ---------- ---------- ---------PcC4HDZ1 .......... .......... .......... .......... .......... PpC4HDZ1 .......... .......... .......... .......... .......... PpC4HDZ2 .......... .......... .......... .......... .......... PpC4HDZ3 .......... .......... .......... .......... .......... PpC4HDZ4 .......... .......... .......... .......... .......... SmC4HDZ1 .......... .......... .......... .......... .......... SmC4HDZ2 .......... .......... .......... .......... .......... SmC4HDZ4 .......... .......... .......... .......... .......... SmC4HDZ3 .......... .......... .......... .......... .......... Pn_0003807 .......... .......... .......... .......... .......... Edi_1007259 .......... .......... .......... .......... .......... Edi_1001922 .......... .......... .......... .......... .......... Apl_0023502 .......... .......... .......... .......... .......... CruC4HDZ2 .......... .......... .......... .......... .......... Zv_FD774992_1 TAAGTTAACG GTATAGCTGA GTGATGTTTT TAACCAAGTA AATGAA.... GbC3HDZ1 .......... .......... .......... .......... .......... GbC4HDZ3 .......... .......... .......... .......... .......... gb_deg2_UTR .......... .......... .......... .......... .......... PmC4HDZ1 .......... .......... .......... .......... .......... PmC3HDZ4 .......... .......... .......... .......... .......... Vpl_0009732 .......... .......... .......... .......... .......... At4g04890_PDF2 .......... .......... .......... .......... .......... At4g21750_ML1 .......... .......... .......... .......... .......... At1g73360_HDG11 .......... .......... .......... .......... .......... At1g17920_HDG12 .......... .......... .......... .......... .......... At4g17710_HDG4 TGATGCCTAG TTATCGCCTA AACCACTTTT TAGAAAAGTT TCTTTATAGT At5g46880_HDG5 .......... .......... .......... .......... .......... At3g61150_HDG1 .......... .......... .......... .......... .......... At4g00730_ANL2 .......... .......... .......... .......... .......... At2g32370_HDG3 CTTTTTCTCT .......... .......... .......... .......... At1g05230_HDG2 .......... .......... .......... .......... .......... At5g52170_HDG7 GCTTCGTTCA AGAACATCTG AAGGTTAGGA CGAACCGCCA TCGCTTGTAG At4g25530_FWA .......... .......... .......... .......... .......... AT3G03260_HDG8 .......... .......... .......... .......... .......... Coleochaete_orbicularis_2036 ACATGGACGC AAGGGTAGCA ATCAGGTCTG TTGCTACTGT CAGACAGAAA

---------.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... ATTGGCTCTG .......... .......... .......... .......... .......... TAAACCCGAA .......... .......... CGGGCAACTA

1681 1691 1701 1711 1721 1731 | | | | | | ---------- ---------- ---------- ---------- ---------.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... ..........

---------.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... ..........

MpC4HDZ1 PcC4HDZ1 PpC4HDZ1 PpC4HDZ2 PpC4HDZ3 PpC4HDZ4 SmC4HDZ1 SmC4HDZ2 SmC4HDZ4 SmC4HDZ3 Pn_0003807 Edi_1007259

.......... .......... .......... .......... TTTGTTCGCC .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... TAGTAATTAG .......... .......... .......... AGTCTGTTTG .......... CGACTAAAAT .......... .......... CTCTCTGTTG

.......... .......... .......... .......... ATTGCACAGC .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... AGTAGGACTC .......... .......... .......... TTTCCTTGTC .......... CCAAATCAGG .......... .......... GCCTCATCCG

40

.......... .......... .......... .......... TGCACTACTT .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... GAACCGTACG .......... .......... .......... ATTGTTGGAC .......... AGTTTTACTT .......... .......... CAACAATCTG

.......... .......... .......... .......... TTATTTAGGT .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... AGCGACGACG .......... .......... .......... TTGGAAACCT .......... CCATGTAATC .......... .......... ACTGGGAGTT

Edi_1001922 Apl_0023502 CruC4HDZ2 Zv_FD774992_1 GbC3HDZ1 GbC4HDZ3 gb_deg2_UTR PmC4HDZ1 PmC3HDZ4 Vpl_0009732 At4g04890_PDF2 At4g21750_ML1 At1g73360_HDG11 At1g17920_HDG12 At4g17710_HDG4 At5g46880_HDG5 At3g61150_HDG1 At4g00730_ANL2 At2g32370_HDG3 At1g05230_HDG2 At5g52170_HDG7 At4g25530_FWA AT3G03260_HDG8 Coleochaete_orbicularis_2036

.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... TCTTCTTACT .......... .......... .......... .......... .......... TCAACATGCC .......... .......... CTGTCGTTCT

.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... ATAGCTATGC .......... .......... .......... .......... .......... TACATTACCA .......... .......... AAAGGAAGAG

1741 1751 1761 1771 1781 1791 | | | | | | MpC4HDZ1 ---------- ---------- ---------- ---------- ---------PcC4HDZ1 .......... .......... .......... .......... .......... PpC4HDZ1 .......... .......... .......... .......... .......... PpC4HDZ2 .......... .......... .......... .......... .......... PpC4HDZ3 .......... .......... .......... .......... .......... PpC4HDZ4 .......... .......... .......... .......... .......... SmC4HDZ1 .......... .......... .......... .......... .......... SmC4HDZ2 .......... .......... .......... .......... .......... SmC4HDZ4 .......... .......... .......... .......... .......... SmC4HDZ3 .......... .......... .......... .......... .......... Pn_0003807 .......... .......... .......... .......... .......... Edi_1007259 .......... .......... .......... .......... .......... Edi_1001922 .......... .......... .......... .......... .......... Apl_0023502 .......... .......... .......... .......... .......... CruC4HDZ2 .......... .......... .......... .......... .......... Zv_FD774992_1 .......... .......... .......... .......... .......... GbC3HDZ1 .......... .......... .......... .......... .......... GbC4HDZ3 .......... .......... .......... .......... .......... gb_deg2_UTR .......... .......... .......... .......... .......... PmC4HDZ1 .......... .......... .......... .......... .......... PmC3HDZ4 .......... .......... .......... .......... .......... Vpl_0009732 .......... .......... .......... .......... .......... At4g04890_PDF2 .......... .......... .......... .......... .......... At4g21750_ML1 .......... .......... .......... .......... .......... At1g73360_HDG11 .......... .......... .......... .......... .......... At1g17920_HDG12 .......... .......... .......... .......... .......... At4g17710_HDG4 GATCAGGAGT CTTATGACCA AGACGACGCT TAGGACCCCG TCCTTTATTC At5g46880_HDG5 .......... .......... .......... .......... .......... At3g61150_HDG1 .......... .......... .......... .......... .......... At4g00730_ANL2 .......... .......... .......... .......... .......... At2g32370_HDG3 .......... .......... .......... .......... .......... At1g05230_HDG2 .......... .......... .......... .......... .......... At5g52170_HDG7 ACCAACCCAC TCTGACTCTG ACTACTACCA CTAGTCCCAG GATCACCACT At4g25530_FWA .......... .......... .......... .......... .......... AT3G03260_HDG8 .......... .......... .......... .......... .......... Coleochaete_orbicularis_2036 AAAAGAGTAT CTGTATCTAC CCCTCAAACT CTAGTTGTTC ACTTGTGAAA

---------.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... GCGTGGTACT .......... .......... .......... .......... .......... TCTCTTCACT .......... .......... ATTATCCTTT

1801 1811 1821 1831 1841 1851 | | | | | | ---------- ---------- ---------- ---------- ---------.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... ..........

---------.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... ..........

MpC4HDZ1 PcC4HDZ1 PpC4HDZ1 PpC4HDZ2 PpC4HDZ3 PpC4HDZ4 SmC4HDZ1 SmC4HDZ2 SmC4HDZ4 SmC4HDZ3 Pn_0003807 Edi_1007259 Edi_1001922

.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... GGAACTCGAG .......... .......... .......... .......... .......... AGTAACAAAC .......... .......... GGCAACTGTG

.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... AGTGAGGTAT .......... .......... .......... .......... .......... ACAAAACCCG .......... .......... GGTTTTGCCT

41

.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... CCTGTATTCT .......... .......... .......... .......... .......... AAACTTCGCA .......... .......... TGTCCATGAC

.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... TGTCTAAGGT .......... .......... .......... .......... .......... AAATCATCAC .......... .......... TGATGAGCTT

Apl_0023502 CruC4HDZ2 Zv_FD774992_1 GbC3HDZ1 GbC4HDZ3 gb_deg2_UTR PmC4HDZ1 PmC3HDZ4 Vpl_0009732 At4g04890_PDF2 At4g21750_ML1 At1g73360_HDG11 At1g17920_HDG12 At4g17710_HDG4 At5g46880_HDG5 At3g61150_HDG1 At4g00730_ANL2 At2g32370_HDG3 At1g05230_HDG2 At5g52170_HDG7 At4g25530_FWA AT3G03260_HDG8 Coleochaete_orbicularis_2036

.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... CTCACTTGGC .......... .......... .......... .......... .......... AAACCGATAC .......... .......... ..........

.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... GGCTCCATAT .......... .......... .......... .......... .......... GATATAATTC .......... .......... ..........

1861 1871 1881 1891 1901 1911 | | | | | | MpC4HDZ1 ---------- ---------- ---------- ---------- ---------PcC4HDZ1 .......... .......... .......... .......... .......... PpC4HDZ1 .......... .......... .......... .......... .......... PpC4HDZ2 .......... .......... .......... .......... .......... PpC4HDZ3 .......... .......... .......... .......... .......... PpC4HDZ4 .......... .......... .......... .......... .......... SmC4HDZ1 .......... .......... .......... .......... .......... SmC4HDZ2 .......... .......... .......... .......... .......... SmC4HDZ4 .......... .......... .......... .......... .......... SmC4HDZ3 .......... .......... .......... .......... .......... Pn_0003807 .......... .......... .......... .......... .......... Edi_1007259 .......... .......... .......... .......... .......... Edi_1001922 .......... .......... .......... .......... .......... Apl_0023502 .......... .......... .......... .......... .......... CruC4HDZ2 .......... .......... .......... .......... .......... Zv_FD774992_1 .......... .......... .......... .......... .......... GbC3HDZ1 .......... .......... .......... .......... .......... GbC4HDZ3 .......... .......... .......... .......... .......... gb_deg2_UTR .......... .......... .......... .......... .......... PmC4HDZ1 .......... .......... .......... .......... .......... PmC3HDZ4 .......... .......... .......... .......... .......... Vpl_0009732 .......... .......... .......... .......... .......... At4g04890_PDF2 .......... .......... .......... .......... .......... At4g21750_ML1 .......... .......... .......... .......... .......... At1g73360_HDG11 .......... .......... .......... .......... .......... At1g17920_HDG12 .......... .......... .......... .......... .......... At4g17710_HDG4 TATCTTTAGT TTCTATCGCT ACCATGACAA TCACCAACTT AGATAGATTC At5g46880_HDG5 .......... .......... .......... .......... .......... At3g61150_HDG1 .......... .......... .......... .......... .......... At4g00730_ANL2 .......... .......... .......... .......... .......... At2g32370_HDG3 .......... .......... .......... .......... .......... At1g05230_HDG2 .......... .......... .......... .......... .......... At5g52170_HDG7 GCATATAGAA TCATCCTACC AAACCATCCA CAAGAAATGA TCGACAAATT At4g25530_FWA .......... .......... .......... .......... .......... AT3G03260_HDG8 .......... .......... .......... .......... .......... Coleochaete_orbicularis_2036 .......... .......... .......... .......... ..........

---------.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... GGCTGCAACA .......... .......... .......... .......... .......... ACTCCTAATC .......... .......... ..........

1921 1931 1941 1951 1961 1971 | | | | | | ---------- ---------- ---------- ---------- ---------.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... ..........

---------.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... ..........

MpC4HDZ1 PcC4HDZ1 PpC4HDZ1 PpC4HDZ2 PpC4HDZ3 PpC4HDZ4 SmC4HDZ1 SmC4HDZ2 SmC4HDZ4 SmC4HDZ3 Pn_0003807 Edi_1007259 Edi_1001922 Apl_0023502

.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... TGTACTTTAT .......... .......... .......... .......... .......... AAAACCTCTG .......... .......... CGGTTCGCTT

.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... GTAGAAAAAT .......... .......... .......... .......... .......... CCAACGGATG .......... .......... CCTCTTGT..

42

.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... GCATTCCTGG .......... .......... .......... .......... .......... AATCCAAAGC .......... .......... ..........

.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... CCTTGAGTCT .......... .......... .......... .......... .......... AACGAAGTGA .......... .......... ..........

CruC4HDZ2 Zv_FD774992_1 GbC3HDZ1 GbC4HDZ3 gb_deg2_UTR PmC4HDZ1 PmC3HDZ4 Vpl_0009732 At4g04890_PDF2 At4g21750_ML1 At1g73360_HDG11 At1g17920_HDG12 At4g17710_HDG4 At5g46880_HDG5 At3g61150_HDG1 At4g00730_ANL2 At2g32370_HDG3 At1g05230_HDG2 At5g52170_HDG7 At4g25530_FWA AT3G03260_HDG8 Coleochaete_orbicularis_2036

.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... TACCAAAAGT .......... .......... .......... .......... .......... CAGGCATATA .......... .......... ..........

.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... CATTGAAAAG .......... .......... .......... .......... .......... AAGGAAACCA .......... .......... ..........

1981 1991 2001 2011 2021 2031 | | | | | | MpC4HDZ1 ---------- ---------- ---------- ---------- ---------PcC4HDZ1 .......... .......... .......... .......... .......... PpC4HDZ1 .......... .......... .......... .......... .......... PpC4HDZ2 .......... .......... .......... .......... .......... PpC4HDZ3 .......... .......... .......... .......... .......... PpC4HDZ4 .......... .......... .......... .......... .......... SmC4HDZ1 .......... .......... .......... .......... .......... SmC4HDZ2 .......... .......... .......... .......... .......... SmC4HDZ4 .......... .......... .......... .......... .......... SmC4HDZ3 .......... .......... .......... .......... .......... Pn_0003807 .......... .......... .......... .......... .......... Edi_1007259 .......... .......... .......... .......... .......... Edi_1001922 .......... .......... .......... .......... .......... Apl_0023502 .......... .......... .......... .......... .......... CruC4HDZ2 .......... .......... .......... .......... .......... Zv_FD774992_1 .......... .......... .......... .......... .......... GbC3HDZ1 .......... .......... .......... .......... .......... GbC4HDZ3 .......... .......... .......... .......... .......... gb_deg2_UTR .......... .......... .......... .......... .......... PmC4HDZ1 .......... .......... .......... .......... .......... PmC3HDZ4 .......... .......... .......... .......... .......... Vpl_0009732 .......... .......... .......... .......... .......... At4g04890_PDF2 .......... .......... .......... .......... .......... At4g21750_ML1 .......... .......... .......... .......... .......... At1g73360_HDG11 .......... .......... .......... .......... .......... At1g17920_HDG12 .......... .......... .......... .......... .......... At4g17710_HDG4 GAAAATAAAA ATCAAAATTG TACATTTTTA GGCGATCTCA AATGCGCTTT At5g46880_HDG5 .......... .......... .......... .......... .......... At3g61150_HDG1 .......... .......... .......... .......... .......... At4g00730_ANL2 .......... .......... .......... .......... .......... At2g32370_HDG3 .......... .......... .......... .......... .......... At1g05230_HDG2 .......... .......... .......... .......... .......... At5g52170_HDG7 CCAATACAAG CCATCGAAAA CG........ .......... .......... At4g25530_FWA .......... .......... .......... .......... .......... AT3G03260_HDG8 .......... .......... .......... .......... .......... Coleochaete_orbicularis_2036 .......... .......... .......... .......... ..........

---------.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... TAGGAAAGCA .......... .......... .......... .......... .......... .......... .......... .......... ..........

MpC4HDZ1 PcC4HDZ1 PpC4HDZ1 PpC4HDZ2 PpC4HDZ3 PpC4HDZ4 SmC4HDZ1 SmC4HDZ2 SmC4HDZ4 SmC4HDZ3 Pn_0003807 Edi_1007259 Edi_1001922 Apl_0023502 CruC4HDZ2

.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... AATAGTTTTA .......... .......... .......... .......... .......... TGATCAGTCC .......... .......... ..........

.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... AAAAAAAATA .......... .......... .......... .......... .......... CAACCCAAAA .......... .......... ..........

.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... CACTATTGTA .......... .......... .......... .......... .......... CGACCTTGCA .......... .......... ..........

.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... TTGTTATTAA .......... .......... .......... .......... .......... CTTCTTCCAG .......... .......... ..........

2041 2051 2061 2071 2081 2091 | | | | | | ---------- ---------- ---------- ---------- ---------.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... ..........

43

------....... ....... ....... ....... ....... ....... ....... ....... ....... ....... ....... ....... ....... .......

Zv_FD774992_1 GbC3HDZ1 GbC4HDZ3 gb_deg2_UTR PmC4HDZ1 PmC3HDZ4 Vpl_0009732 At4g04890_PDF2 At4g21750_ML1 At1g73360_HDG11 At1g17920_HDG12 At4g17710_HDG4 At5g46880_HDG5 At3g61150_HDG1 At4g00730_ANL2 At2g32370_HDG3 At1g05230_HDG2 At5g52170_HDG7 At4g25530_FWA AT3G03260_HDG8 Coleochaete_orbicularis_2036

.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... AGTTCCATGT .......... .......... .......... .......... .......... .......... .......... .......... ..........

.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... ATAAACGAAT .......... .......... .......... .......... .......... .......... .......... .......... ..........

.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... CCAGTCATGT .......... .......... .......... .......... .......... .......... .......... .......... ..........

.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... GTTAGCAGCT .......... .......... .......... .......... .......... .......... .......... .......... ..........

.......... .......... .......... .......... .......... .......... .......... .......... .......... .......... .......... CTGATTTTTT .......... .......... .......... .......... .......... .......... .......... .......... ..........

....... ....... ....... ....... ....... ....... ....... ....... ....... ....... ....... CACCTGC ....... ....... ....... ....... ....... ....... ....... ....... .......

Suppl. Figure 5. Alignment of the conserved 17 and 18 nucleotide motifs in the 3'-UTRs Streptophyte C4HDZ genes. Sequences of 3'-UTRs of selected C4HDZ genes were aligned manually. In addition to the two highly conserved motifs, additional sequence similarities can be seen flanking the highly conserved motifs.

44

Structure 1 Folding bases 1 to 183 of MpC4HDZ1 Initial ΔG = -81.40 10 20 30 40 50 60 70 80 90 -| AU GAA- C - AUGCCU UA A U A G---CUC UCU--AA GGAA AGUCAA CG ACCAC AG CCUUGC AGUGGU AGGCAU AUGC CCC AGUGUGC GUGCGC AUC AGCUC \ UCUU UCAGUU GC UGGUG UC GGAACG UCACCA UCUGUG UACG GGG UCACACG CGCGCG UAG UCGGG C U^ -AUGG U A -----UC C AGUAG UUC UUUUAU CC 180 170 160 150 140 130 120 110 100

Structure 1 Folding bases 1 to 157 of PcC4HDZ1 Initial ΔG = -56.70 10

20 30 40 50 60 70 80 GAA-| C CAGAGA G A CAUAA UAGAU CAG AGU AG AAGUCAA CG ACCAC CCUUG CAAGUGGU AAGUGU GUCG GCACUU GACA AGA \ UC UUCAGUU GC UGGUG GGAAC GUUCACCA UUCGCA CAGU UGUGAA UUGU UCU U - A AUGG^ U AUG--G C UCA-GCCC--AG GA 150 140 130 120 110 100 90 U

G

Structure 1 Folding bases 1 to 119 of Pn_0003807 Initial ΔG = -41.80 10

20 30 40 50 60 GAG-| C CAC U CCU A A A -U AGUCAA CG ACCA AUGC CCUUGC GUGGU AAGUA GA GC CAUUG \ UCAGUU GC UGGU UAUG GGAACG CACCA UUCAU CU CG GUAAC G AUU-AUGG^ U --UAU C - A UU A 110 100 90 80 70 GCGAU

Structure 1 Folding bases 1 to 128 of CruC4HDZ2 Initial ΔG = -58.40 10

50 .-A| G GG GGAGUCAA CG ACCAC UACCCUUGC GUGGUAAA CUC A UC CCUCAGUU GC UGGUG AUGGGAACG CACCAUUU GAG G A AUGG U --UAC \ -^ A 120 110 100 90 G

A

20

GAA-

C

30

AUA

40

CAA

60 AGAGA--

70 AAG GCGC A CGUG G UCGAGUA GAA 80

Structure 4 Folding bases 1 to 507 of SmC4HDZ4 Initial ΔG = -173.90

220 230 240 250 260 270 280 290 A AU - GG GAA- C CAAAAGCUC G A CU AGA U--------GAGAU UGGAGU GGG AG AGUCAA CG ACCAU CCUUGCA CGUGGU AAGU GGAUUCGU UGCAC CUUUA ACCUCA UUC UC UCAGUU GC UGGUA GGAACGU GCACCA UUCG UCUGAGCA ACGUG G CC A UU ACGG U AAC-----G C CGCUCUCAAACGC 440 430 420 410 400 390 380 370 360

.-U

Structure 1 Folding bases 1 to 507 of AtML1 Initial ΔG = -142.90 200 210 220 230 240 AA GAA- C G AA-.-UUU GUU UCCA GGAAGUCAA CG ACCUUUUUGC UUU UCUCA CCGC U AGGU UCUUCAGUU GC UGGAGAAAUG AAA AGAGU GGCG G CCA CG AUGG U G AGAA \ --AUU 320 310 300 290 280 250 C--

45

Suppl. Figure 6. Potential secondary structures predicted in the 3'-UTRs of C4HDZ genes spanning land plant phylogeny. Secondary structures were predicted using the Mfold software (Zuker, 2003). Many C4HDZ 3'UTR sequences have the potential to fold into similar structures, with the conserved 17 and 18 nucleotide long motifs, red and blue, respectively, base pairing towards the base of a stem-loop structure. Other sequence similarities can be found in many, but not all stem-loop structures (e.g. example the motif highlighted in turquoise).

46

Suppl. Figure 7. Semi-quantitative reverse transcription polymerase chain reaction (RT-PCR) of PpC4HDZ1, PpC4HDZ2, PpC4HDZ3 and PpC4HDZ4. All reactions were run for 35 cycles. The cytosolic glyceraldehyde-3-phosphate dehydrogenase gene (PpGapC1) was used as a loading control. P, protonemata; G, gametophores; S, sporophytes within archegonial tissues.

47

Suppl. Figure 8. Schematic summarizing our outlined GUS fusion strategy for Physcomitrella C4HDZ genes. ~1 Kb of coding region (shaded) and 3’ flanking sequence (unshaded, 3’ UTR plus downstream sequence) were amplified from genomic sequence of each PpC4HDZ gene via PCR and cloned into pTN85. The stop codon was removed during amplification of the 5’ fragment for each gene to maintain an open reading frame. Upon sequence confirmation of both fragments, we transformed wild-type lines with pTN85PpC4HDZ1-4 and screened for candidates. Successful transformants, PpC4HDZx-GUS, integrated the uidA coding region in frame with the endogenous PpC4HDZx gene and a downstream G418 selection cassette (indicated by the double breaks in pTN85). uidA – betaglucuronidase (GUS). Hashed TAG codon represents a removed stop codon.

48