PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cse_sc001941.1_g020.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae; Artemisiinae; Chrysanthemum
Family MYB_related
Protein Properties Length: 1522aa    MW: 170664 Da    PI: 6.3523
Description MYB_related family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cse_sc001941.1_g020.1genomeKazusaView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding301.2e-0913171356446
                             S-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
        Myb_DNA-binding    4 WTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46  
                             +T+eE + l+++ k++G++ Wk+Ia +mg  +   ++k+ w+ 
  Cse_sc001941.1_g020.1 1317 FTPEELKTLKELYKKHGND-WKRIADEMG--KYMVHVKDAWRR 1356
                             8******************.********9..678888888875 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1522 aa     Download sequence    
MKKINMKNDV VLCALMAVLL LVMLPFTTSL ADEGKALMSI KASFSNVVNV LLDWDVDQND  60
DLCSWRGVLC DNISTTVVAL NLSNLNLGGE ISPAIGDLRN LQSIDLQGNK LTGQIPDEIG  120
SCVSLILLDL SDNMLYGDIP FSISKLKQLE LLNLKNNQIT GPVPSTLTQI PNLKTLDLAQ  180
NQLTGEIPRL IYWNEVLQYL GLRGNSLTGT LSADMCQLTG LWYFDVRGNN LTGTIPDSIG  240
NCTSFEILDV SYNQITGEIP YNIGFLQVAT LSLQGNKLTG KIPEVIGLMQ ALAVLDLSEN  300
ELVGPIPPIF GNLSFTGKLY LHGNRLTGPI PPELGNMTKL SYLQLNNNQF TGGIPAELGN  360
LDQLFELNLA LNNLEGPIPE RISSCTALNQ LNVHGNFLNG SIPSGFRNLE SLTYLNLSSN  420
KFKGTIPFQL GRIINLDTLD LSSNQFSGPI PASIGDLEHL LTLNLSRNHL GGPIPQEFGN  480
LRSVQIIDMS FNKLRDAIPV EMGQLQNIIS LILNDNNLNG EIPNQLSNCF SLTNLNISYN  540
NISGVVPPSK TFSRFPSDSF LGNPLLCGNW LGSICDPYSP KRRALFSRTT VVCMTLGFVV  600
LVAMVVLTIL RSSKPRQYMK EPSKGIQGPP KLVVLHMDLA IHTYDDILRI TENFSEKYII  660
GYGSSSTVYK CALKNSRPIA IKRLYNQYQH NFQEFETELT TIGSIRHRNL VSLHGYSLSP  720
TGNLLFYDYM PNGSLWDLLH GPSKKVKLDW ETRHKIAVGA AQGLAYLHHD CNPRIIHRDV  780
KSSNILLDEN FEAHLADFGI AKSLPTTKTY ASTYVLGTIG YIDPEYARTS RLTEKSDVYS  840
FGIVLLELLT GKKAVDNESN LHQLILSKAD DNTVMEAVDP EVSVTCMDIS HVKKTFQLAL  900
LCTRRLPSER PTMHEVAGIL QSLLPAAPAL KTSLGPEKTK EYKQYVVGDE ANHPQKQQEE  960
TENSSDAQLH TLIIEDAETA DLDHSELLKR RVQSKNSIKS SSPQAPQIFV EAAGSLNENQ  1020
EFNSPISKFR SGSIKQSVAQ IEMNNQLKGE VEAQEMHKAG EYADASPVTM ETKSKNKKSE  1080
ITGGNDDNRH VEIDSSLPKE KKGHKRREKK KKKIELNEDG VEDEDRPHKG NLVNDDDVSM  1140
KDKKKKRKRK ISEKIQIEEN ENNRNDEPEL SREDSESDNV LNEIMTEEST PETQYDTEDN  1200
KGTKEKKKAN VKGSKVKNKT TESVENNKRK ATAKGDGLIR GKRFTQEEDE IIKEAVLNYM  1260
TAHDLGDDGL KMVLNCRSYP GMKQCWKEIA NCLPYRPITS VYHRAHVLFE RAETPGFTPE  1320
ELKTLKELYK KHGNDWKRIA DEMGKYMVHV KDAWRRIKLE NLKSGKWSQE EYQNLYDLVN  1380
LDLQMKLNSE KKSQHGMLRD NIPWTAISDK LSTRNDATCC TKWYRQLTSS LVAEKKWCDA  1440
DDYRMIGKLY ELDAACVDDV DWDSLLEHRT GDISRKRWDQ MVKHIGDYSS KLFAEQVDIL  1500
AKRYSPDLAE TREAWDNKPI VS
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
111051112KRREKKKK
211061148RREKKKKKIELNEDGVEDEDRPHKGNLVNDDDVSMKDKKKKRK
311091151RREKKKKKIELNEDGVEDEDRPHKGNLVNDDDVSMKDKKKKRK
411101152RREKKKKKIELNEDGVEDEDRPHKGNLVNDDDVSMKDKKKKRK
511431148KKKKRK
611431149KKKKRKR
711441150KKKKRKR
811451150KKRKRK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G41020.11e-112MYB_related family protein