PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG78378.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family MYB_related
Protein Properties Length: 2073aa    MW: 214965 Da    PI: 7.7431
Description MYB_related family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG78378.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding23.11.8e-07150191346
                      SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                      +WT  E+e + +a ++ G++ +++I ++++  +   q++++++ 
       GBG78378.1 150 AWTHAEEESFFHALRRCGKN-FEKITNRVP-SKNKDQVRHYYYR 191
                      6*****************99.*********.***********97 PP

Sequence ? help Back to Top
Protein Sequence    Length: 2073 aa     Download sequence    
MHGACPLVVL ELVKLKSIFG CSMRINILFD QEFLECGYWP AADAAAAAAT APAASSAADG  60
ADAAPAAPVA AAAAYTMAGA ACDQEEFLGK DKCSEYKQAE MADEEEAVAG EMLRELADQI  120
AESPPAGSSI EDTAAPATRD EKKNTRHWAA WTHAEEESFF HALRRCGKNF EKITNRVPSK  180
NKDQVRHYYY RVIKRMNKLL GPGLVLDARN PHDVNAAMLR WWALMEKQGC SAAKLRHKPK  240
RRKMFVTALE QQLLADRRKA KRKQASGTQA SAVQASAAPA TASPVSAAPA PAAVATGIQD  300
APVCELPSTT GASLSSPACV NDACVPCPSA TGVVSAASGS RGSPGSGRDN MPVAAGPGGE  360
GKGGAGEEQK EGHEAKRGGG EEGTGGGDGK GSGFAEGKGT GEKRGGGVGE GKGGGGEVKT  420
SGGEKKGAAS GEGKGSIDRK AGGLGDSKGV VERKGGLSGE GKAGGGRKVG GLGSKKEGAS  480
PKNSGHKSKK GAAVVIGNIC NVSEVSGAKS TTKVARNAPR QPARRPRPKK DHIEIMTGSQ  540
EPSSINMWEN VMCGVRLVAE AAAQLEQQAM VDQVSELDEE RRISGQMESV HHKPVTQTAL  600
LNQKGGTINQ TSDHLLQEGV HTLQDGVSRS EHEDGIGTSH AVIASSSMGS EELGLLQQPG  660
NRPSSAATIR ELPLCECDNA ASGRVDITKD SAANGLGSAN ADLGAEKVKL QLYPIDDITR  720
RSLAKEGYNP YLELTLKGRK SISAVVSHLL RKWVRKGNDA VATRIDIRRA SPPTAESTGR  780
IIVSSWQDGE LRLFPYGVSP STLQSAKFSW GKEHGSVTAS DVGKVVGSLK PQTFRLRYAW  840
VSYLSDVQVS GQSSAPAERL GSVSEDIQAN AAMKPPGAQP AELSTQMTAM QRQPVTQKQS  900
GQHITILQLS ALSTIQTTRE GFQGASIPTS PRGMDGINVS APKESCPHCG CSGCSKDESQ  960
AVTGINLAPE ERHNNSTGSA VQATAKEVDS TNTTSVPSRR EAPVMLGRAS EQTELEQTGG  1020
LVHGGKVEGS DGEKVTGCQE IGTAKDKAAC RDAQAIGDAA NGRLTFGKST EASRMAIDQA  1080
MRPKFSAKRF CSAGQAEENQ ILKMSTGVQE ASHVAATTPA ATIEEMPAAR CAWAYEQSND  1140
GFGRRSWDVA ENLGNRGYMV EGGLSSLGLD SALSLPSLDT TQTTPSPSCQ PKGVLGASST  1200
SATACTASSG AVPSAHVGTV ERGCFVVPSS IGNSALGKTP PSDADPGLQG SVHSENIPGN  1260
SMSWIDSLSD MSLGDLLTDT LCGNGPGSVE GPSGNSLVVT PTGMATAALS VADLHSPHYI  1320
QHDRKFAYSR QISQISGPLP QRRLFAGGVM TGNTECDRSK GSRGSDPLPS SGKQSLAEEG  1380
IPRQSKEKSS NASLPSLLRE EGTCYGLGFD TPAGSAVAVE PIAYGTCDGG LDFSTPARPA  1440
VPSSSQSVSN RGELAADGVL TEGHNSGDVT AKTNHSVPSV RAEGAPETIE AGAGKQFDSG  1500
ENWSSSKQSE KRQLAAGAQA GAGLELHSNV LPGDSLVIGE DVGFPSTGSL GNELLFGSDF  1560
SLDGSSGLSL LRNIIGTTST GMETPTHPLC PLPTPVSQPF PMTSLTSPGV NMLSSGKSSA  1620
VAAVSVAMGA QGMHRAGTSM SAGDHDVTPK DTGKCTLSSG AICPQQKRTA ATEKDVHTDS  1680
EGLVSMGGGF SEHPPEIAAH IGIKSIDNNN NNNNHNSARE ERSNVSKVRI SHEVIGRNKS  1740
TSTSGPCFPN DSNLQDRSAI GAVTAHKLPS VRDVGVGMTP DHPGVALESE QRVRHEEEPN  1800
QPPPNKSGHQ DGQIIGRRTA AGLSEDGALN PTKGEASKLG QTGKVGLRVS GTESPSKCKM  1860
KDVKVIAGRA KQPPVKADPN VWVPERPFQS LFASLPTNKQ KDEMEFEKGS CRTKSRVATA  1920
DKGGKERGVK AVPSNKGEAS RGGGSSAPKT SKAGPKLESK RVNKKEMSKS SDAVKLKGAS  1980
PEKMGCTGWG EGVKRQRTGM QTMRKIPAGT KDGSTVNDHV DGEDGEAKEA ASAVGCGATS  2040
AGKCQFPTAL IEPPRVGCNP PLWSCSIPAP ASP
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1240263KRRKMFVTALEQQLLADRRKAKRK
2257263RRKAKRK
3257264RRKAKRKQ
4524531RRPRPKKD
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G36960.34e-55MYB_related family protein