PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cz02g00230.t1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Sphaeropleales; Chromochloridaceae; Chromochloris
Family MYB
Protein Properties Length: 841 aa    MW: 88297.8 Da    pI: 6.0165
Description MYB family protein
Gene Model
Gene Model ID  Type    Source     Coding Sequence
Cz02g00230.t1  genome  Phytozome  View CDS
Signature Domain
No.  Domain           Score  E-value  Start  End  HMM Start  HMM End
1    Myb_DNA-binding  41.8   2.4e-13  10     65   1          48
                     TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHT........TTS-HHHHHHHHHHHT CS
  Myb_DNA-binding  1 rgrWTteEdellvdavkqlGggtWktIartmg........kgRtlkqcksrwqkyl 48
                     +g+W +eEd+ l++av+ +G ++W+ +ar+++         gRt+kqc+ r++ +l
    Cz02g00230.t1 10 KGAWRQEEDDALKQAVAVHGIKNWTNVARCFNdlmgrepnVGRTAKQCRGRYLHHL 65
                     79**************************************************9775 PP

2    Myb_DNA-binding  48.6   1.8e-15  72     113  2          45
                      SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
  Myb_DNA-binding   2 grWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45 
                      g+WT eE+e+ ++++k +G++ W++Ia++++ gRt+  +k++w+
    Cz02g00230.t1  72 GAWTVEEEEKMIQGHKCYGNH-WSKIAHMLP-GRTETAIKNHWN 113
                      89*******************.*********.***********8 PP
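
The Start/End and HMM Start/HMM End columns can be cross-checked against the alignment rows themselves. The sketch below counts columns in the two Myb_DNA-binding alignments shown above. It assumes HMMER-style conventions that this page does not state explicitly: a `.` in the model row marks an insertion in the query, and a `-` in the query row marks a deletion relative to the model. The function name `alignment_stats` is hypothetical.

```python
def alignment_stats(hmm_row: str, target_row: str) -> dict:
    """Count model positions, query residues, insertions and deletions.

    Assumed convention (HMMER-style, not stated on the page):
    '.' in the model row = insertion in the query,
    '-' in the query row = deletion relative to the model.
    """
    if len(hmm_row) != len(target_row):
        raise ValueError("alignment rows must have the same number of columns")
    stats = {"hmm_positions": 0, "target_residues": 0,
             "insertions": 0, "deletions": 0}
    for h, t in zip(hmm_row, target_row):
        if h == ".":
            stats["insertions"] += 1
        else:
            stats["hmm_positions"] += 1
        if t == "-":
            stats["deletions"] += 1
        else:
            stats["target_residues"] += 1
    return stats


# Rows copied from the two alignment blocks above.
d1 = alignment_stats(
    "rgrWTteEdellvdavkqlGggtWktIartmg........kgRtlkqcksrwqkyl",
    "KGAWRQEEDDALKQAVAVHGIKNWTNVARCFNdlmgrepnVGRTAKQCRGRYLHHL",
)
d2 = alignment_stats(
    "grWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq",
    "GAWTVEEEEKMIQGHKCYGNH-WSKIAHMLP-GRTETAIKNHWN",
)
print(d1)
print(d2)
```

Under these assumptions, the model-position count should equal HMM End - HMM Start + 1 and the query-residue count should equal End - Start + 1 for each table row.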

Sequence
Protein Sequence    Length: 841 aa
MSSMDAVEIK GAWRQEEDDA LKQAVAVHGI KNWTNVARCF NDLMGREPNV GRTAKQCRGR  60
YLHHLKPDLK TGAWTVEEEE KMIQGHKCYG NHWSKIAHML PGRTETAIKN HWNCTLRRKV  120
GADERRRTVL QNYILSLGLQ QDAQQRAIAA AAPDTACLAG MLGFPANMLA FALQSQPMAH  180
LASHANFTPS PPRTRGNINP KRSESEATAE SAATTGTTAT LAEAVAAGSS QDLLRRCALF  240
YLELAMQQAK QNGHEQGKLQ ELGSSSGDPT LDAVHLQVGF KRAASGSTAK HPPGRHGNTT  300
KRSRMHQESR AGGGGSRARH RASALRALTA VAAAQGACDD GFDDDKQDMG AAWEHGDRQR  360
LDRMECEPYD LVDPETDTAA VLQAQQSYPG EVPGSIAPVT PPLASDAVQA AMQTLLAAAA  420
TAADRLKAEK SRPADADAAD SQIPAGGKLL GSQGGSVAHN TPDSSPKATG HGDESQTGLP  480
PCSAIETLAS VAVNQSKPTA PATKVGSKRV VAVDADPTGH PDEPDDVFDA EDIQHSCREL  540
VQQICHGALT GTRHGGRALL NPSGGQHLTH HATGYEENQQ GVWCLPAVSS IRHRQLNHKG  600
AGARWVGAAM SHHEVNDHVA DDDGDNDDDD DDDDNPQGYD DDDVDDGEEE DADDPQHGPA  660
SDVQANGAIG GVAVMQQENL VGQQGAGGKS PDGHVLGVHL GTPQPQKGND HHFGDDALLV  720
QGALPAACMS VSHMGMMPVL RCSSKHANGL VASEVQAGCH NHSGPAVGGC NGGLAGVNMD  780
LVHLSQQLTA AVESAVASAA SSLVNSNGDS KLQLVVLPII LNAALPRPTV CPAISTAKSI  840
*
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    117    127  RRKVGADERRR
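
The domain and NLS coordinates read as 1-based, inclusive positions on the protein sequence. A minimal sketch of how to recover those regions from the displayed sequence block follows. It uses only the first three lines of the block (residues 1 to 180), and the helper names `clean_sequence` and `region` are hypothetical.

```python
import re

# First three lines of the Protein Sequence block, copied as displayed:
# 10-residue groups followed by a running position counter.
RAW_BLOCK = """
MSSMDAVEIK GAWRQEEDDA LKQAVAVHGI KNWTNVARCF NDLMGREPNV GRTAKQCRGR  60
YLHHLKPDLK TGAWTVEEEE KMIQGHKCYG NHWSKIAHML PGRTETAIKN HWNCTLRRKV  120
GADERRRTVL QNYILSLGLQ QDAQQRAIAA AAPDTACLAG MLGFPANMLA FALQSQPMAH  180
"""


def clean_sequence(block: str) -> str:
    """Drop whitespace, position counters and any '*' terminator."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return the 1-based, inclusive region start..end."""
    return seq[start - 1:end]


seq = clean_sequence(RAW_BLOCK)
domain_1 = region(seq, 10, 65)    # Signature Domain row 1
domain_2 = region(seq, 72, 113)   # Signature Domain row 2
nls = region(seq, 117, 127)       # NLS row 1

print(domain_1)
print(domain_2)
print(nls)
```

The extracted NLS region can be compared directly with the Sequence column of the NLS table, and the two domain regions with the ungapped query rows of the alignments.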
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G27785.1  2e-29    MYB family protein