PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG70066.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family MYB_related
Protein Properties Length: 5224aa    MW: 550465 Da    PI: 4.9329
Description MYB_related family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG70066.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding23.31.5e-071446336
                     SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS- CS
  Myb_DNA-binding  3 rWTteEdellvdavkqlGggtWktIartmgkgRt 36
                     rWTt E++ll dav+++G+ +W+ I+ ++  gR 
       GBG70066.1 14 RWTTLEELLLTDAVRRYGSANWSVISMELR-GRA 46
                     8****************************9.774 PP

Sequence ? help Back to Top
Protein Sequence    Length: 5224 aa     Download sequence    
MEGRSNCDNG AQWRWTTLEE LLLTDAVRRY GSANWSVISM ELRGRARQMG GRSESSLNRL  60
SPEVCCTKFD ELKRCFKKKK KKKKKKKKKE NETATKIHRV KSDNSRNTNP ENNNSNNNNN  120
NNNNNNNNNN NKSNNNDNNS NCNTSKINKR KSRSRCDRSE ETGGRVLSSS PTEVTGGNED  180
EEREREGLQN GIKEGGEEEE EEDDGDDDGD GDCEEEEEKT EKEGEEEEGE GEGGSWDKNL  240
VELIEQELRQ KRCAELKREL ERRDGWIDDL KMKIMRLKRD RKVDCPSRGA RSMVADKQEK  300
GKDCLKSGVG GGEGLSERLL KSRAAGTGLA SDGQRRAEKD KIGEKQQRGD EWEGERQEVG  360
EGGEGGGGGG GVRSKSPQRP RGKSSGLGSS STGHDRYGFI QRDREGEGGG GGGGGDGEGE  420
RERDEGSKEA GGGDEGGSGV GERATGTEEG GGGGGGGKGG QTGGHEGGGG GGGGGKGGQT  480
GGHEGGGGGG GDGLAIYGGV LIGGEGQGGG EGAGEGGGGG RRGDYAAAEG FANDVDGRSV  540
RRAAVEGVRP RSDERDGVVQ LPVGREIKRA EQLASGTGGR ASPLCAESKA ELRGNGLRLP  600
VVTSAASQSP PGRIGDPDGS SQGTPVVSVS CKESVPRGEG VEEEGDGRSG GGGRSGPTPA  660
TGSASSFQRK LRLRLRRGNA GAICAGHYHG RDISSSCADG AKNSSPDGPD TTDRGRLAGA  720
DLADDWDPAG GLRPAPHDLA SAAGRDSRPP LFGGEESARP MNEIAGAARR GVIGSGAAGP  780
SDIITLDSAT CHSAAVSGER LGDGAGGQPE RKADLTGCGD DGRKVNDDEA ATNYRPLDHP  840
HRLTEIVHVA DHVTARPLVS SHQVLARRTK GTTTMPAPDR YPVLASDSII GCLPPCPTAC  900
AKKHVGGGDV ANRCADVDTC GRDVTCRRAD VAGCHEIAGR ADVAVGFAAE GAGSADVADR  960
IDYIVTDRAD VAVRRSVLAS EVSGADEVGR DDGVNVSDDR DADVARRRPN VSDRRVVVSS  1020
WRRGGGPGPR RADVAAGGHR RELVSGSRRG DGEYHRSHDV AGRGPDVSAR EGEGEGDDVQ  1080
AAAAANHSGN QSGGIADVDA SGPGIAIFGI NSTDESDYSE SGPVATAIAA EDPPRQHRNC  1140
GGSKAKTAAR RAISNRTRGS STWRDSDAAA DVAAAERGPD DTAAGALAMG RGFSDADVAD  1200
CPSAAAGLST PLTAATSVSD ADVEKDVPSS NRTLVDRCAF ADDARDSRIV VHVATSCSRG  1260
RSADGHVDVS DGDRFIRSMD IVPSGRLRSC SRCPDTDKVV VVVPGQNGEG GGSEAVGTDD  1320
LPRIGHVSIP PDCRVSSPVC ERADVDVSRR MLSERASRED VTCPSAGDAH VTGDDRVAEA  1380
GGADAALVVS GSRREADKPA DALMAAAGWD PGADDLLAAT VCHVDDHASG RMKSPCCVMT  1440
GTSGAEDGKI ELVPRPRRNH CADKLCNAGR AYRCDPDRYR AIPIPIPNVD GSSGVRPPSS  1500
DDRGSKIVVA MDKESDKEKI ENRWGASINS EKQRGGGGGG GRSKDVVLMR SEGVSREEES  1560
YVDAVRQLYD VYNSGESSAK CATDRHVSSS DDEEGLSVHA QQCTTARHVS SCSDGEALSR  1620
LPGAEVLAAS TSSRLTISRV EVCGGERRKE YDKLRRCCFN GVISEARERE EGEVVEKFTM  1680
DDSKKAVLEN GNEVGGSGGR SCCEDSSDQM RGGGRGAECL VPDIRASVMT RTTDVRQEVH  1740
QEVHQEVHQE VQQQEVQQEV QQEEVWQEED RDGKEQEGGS KGVGVDVNEK SWEGANGNDG  1800
GRRREEEDVD MDMDTMKHGD KDEWADKTTD ADVVRKNCDR IGGKWKEKDL AGKGKDLAGK  1860
GKDLVGKGSA SEHLESSGRD ESARRAEDDR SRGDFLRGGR EKDDKDGGGE EGSSPLRGGE  1920
EGRVTGRRIA LLREEGNDDI SDGGGGGGLE TYVNRQRIGG KVEEEGRKGE CVEGDRASAC  1980
HSRTSRLTRG REEEEGRSRR QQDVEEVVEY AVGDGVEIQQ TLRDLPTGIP IGVTIATCPL  2040
VSEGTSRMAC ADVRIPSTAE ICTIADAAPR EGGEHSVQKG SGSRVGIGRR RERSSSLSCA  2100
AMDFSLPGGE GMGIEGRDIE EEKEGEGVDD LPPVERMETK GGKGEDHHMQ KKCSAGQTGI  2160
IVAGEEKQQE EKPSAVEPRG GGNPGVSERG LDLGEIGGAA CSFVGDGEIG REREGEGLPE  2220
AVGYGSHVGT ELTRGERGRG GGGGGGGGGT SQDLRMEREG RLQRCRRGGG DDEAIKVDDG  2280
ERKAGGRSVG DQLRAHEGGL SSDDGDRSSR DRLRMIVPGS GSREVEGYSG GNEDCSDGFV  2340
DQGGESLARP AEEREEREER EGREGISVNG RVKSGLVLDR EERRGQLEKK REVEGNGRSG  2400
SRDESSSKGS MEAGKEEKMA GEVGGRRLTL SGLIEWSRKG EGEGHGGKEG KMERASSALL  2460
PETSPEIKAK ESAAAAAAEE EEEEEGKKAR CVEEGVVQLD DMMDDKKDDD VEVELSEGEN  2520
GAAIGRSGWL VLEAEETEET EECMGVEEMG NEEEDEQGVS RDGEEHARMA KEIEIGMRKE  2580
VVSLQQTEEG GSRKKTKDMA IEAGGGENGE TCKKEEEEEE EYTIPPPEFR ASLSTLDVGK  2640
KRLEEESVGG RRRFQRVLAE EEEGEGEGEG KREERKVWRV KGEQPDLAND KRVAMAMAMI  2700
HQRETGVEVV DCEKTLHAHD DEEEGGLDTA HEADEQGGGG VSKAAGGGRE KEGVVVLRAH  2760
SSSSQVLTSR GIVMCEFGEE DDRDRDVGVK GDGNSGGEEC LLEWMKGDGD FGGKRDSESA  2820
ADDDERFGGK RDSKSAADHD VRFGGKRDSE SAVDDDERFG GKSVSKAKRG EGVGGKGVVV  2880
RLGDEAGSFG GKEADDSVNH RALRMGKELA DEGMVASRDQ GGANRDRDDG VRGLVGGASL  2940
EEGERSDLDN ASKCNAPKEN LKDEKGEEEE EEEEEEEEGR PRERCSGLVK MYEEKRGRGV  3000
FDGREQEDAE GIAANSSDEK DDNGKEKTNV GAKVVGRRGV EGGGEQIDIT VDDPDPSSAM  3060
GFVAGTSGEE DDQRSAVSEA AGVVTEEMKI GGGGGVKGME GDVLGGEMDL EGWVDGCFGG  3120
KNVSSAKNGG LVDGGKESLD MSRRVMVEEE GGVDDYQRKA LIRVKGRKTG ARKMVMGKDS  3180
EGKEVVVIEE REKIVGKKVK EEEQGAKAGV KAGKVEGKEE EEEEEEEQGR TVGAKAAADG  3240
EGEEGRRARR MGETNMKEEE EERERESVAG RGQEKEESGK RVGTSRKEEE RWMDEENAME  3300
KGTDWENGGV GRNVSRGGVT TGKRGTGQLL VNAEEGEEEG GGREKKGVEC GNEEAMEFCA  3360
SHHDGDDDKV KDGEEQDGCA DRSMEERGSV KGAQSEILNS NATMMKDGCA NRSMEERGAA  3420
KGGQSEILNS NATMIKDGGP CAMELQLAEG GGGGGEEGGG SEVTDRQKVV CTTMIADGGG  3480
SREDGMSIFG NERATTRAAG SWTGEGEASA GSNGGKVLKK RKASGYGRLG SRRGKKVREN  3540
RKKGGDGGRD QGGDGERGGG GRRRGGAGLS WDKGQQPGES ERGSEVWART EGEENQSPPP  3600
KRSRREPKVS GRLLPLVEVL RGISSHKFAS LFRHHPDSRE HPKYDDLVRQ HVDLTMVRSR  3660
LEGGIYKNSM EFYRDLLLIF NNALVYYPPR SQQYVGALTL RDRATREMEV VFKAESLLSA  3720
DHPMTRRARQ SSSNTTQRAG VDASAAAGGG AGGTSTSDAA ARGGGGGEGG GGGGGVAGIA  3780
DGQSVGNDGG SSVAMVDSSC AEKRPALASL SVRDNDGKGV GNRDKGVYTS QPLTMLEKQA  3840
LLPSAAELKL KMTNTNKGGG GGGGRGKRGG PPVVKKSPLD TVPPSSTDDG LRKRVGKQRG  3900
SGLVIGAVQS DAADDSRSSE NLLEQKRVKR RKTTATAVSS KSNIATAGGE PKTSAPGEEL  3960
GACKLAVEAS DRALKEVDRR PTKVSIPLPK RTASGFAVKA QAAMAKKTTV VAKIAVRVVT  4020
GRAQEALRKG RAASPIGSGY QQHSAHNMEW PELTADGGVG VMEEGGGEQS CANEREGREG  4080
DEGKREEEEG KREEEEEEEE EEEEGEMATT GQVNSLPVNL KVPYHAAIGE GLGGRMHTKR  4140
GEGTSSTCTE VCVVGDLKTG GGRTSKGKAR VENRMGSYVM ASNTPGSAAD GRQSRRGAKR  4200
PHEADVMLTE SKNRGSSVNQ AAIRETEWVK DAVPLVEDQR ELGSRVLRGG RGAGRCGGAG  4260
TKEARQSCDF IPHSVDWGGW QDGRQSEVKP HGRKQVMVPG GSRLGEGGKN QIAATSENQR  4320
PLGRAEGGSG VAVIESGASG GVLRLSVKMP PNSKGPGERV NLRCDHPPWV ERTRQATAAM  4380
TPTVGKGDGA RGGGTAAVGG GGISTRRGAG RGGGGRDGTG EGSEGGVSSQ GISGGGGSSG  4440
KGGNASGGVS GGRKMSVAKG PASNPESSTE GYAEKEPSLP SRLPMKPIHA VKVADDARNR  4500
REGGRKLPGT KGCTRTPKVH PGNEAAGAEE SAGNGVSVLA KEVKRGAKQN RKPKEAAAGS  4560
GVRKRAAADR MGGRKSGAES VAPPRKTPPE TPPAKVEPRE SAEDLVTVAS LLKTKVALAR  4620
SAKVVQPFLD LADGKEGAAA AKWCMTYLAA AKPTVILGFP GTGGGRNGNG AKVQSQISKI  4680
LKVPKTPRLH APQDPMDAVG MGGAGKTKKE AEPTWVGKGG TQGAQRKWED GKTGQSCWEA  4740
KARQLESESL AVAMAAKGAR SKRKVGSGYQ VIDDDGADGE EVERELAFEE RVDHHPPIVR  4800
GRGSKGKDGW KNDGGQGRAV KASAVAVVLL EFMLEAIVLE FSMHRQKGEK VQVKDLGIRM  4860
RDAFEQVVFG PSRDGGYYLI GMTEVHDVLF KDIPWSTPEV LDLSIKLAQG NGISVLPTKM  4920
MPILQDIDTV EDLEAWLMEL DADRRCKGND RIDSPELGTD RRDHGNGPVD STDLDADQRR  4980
RGNDPSDAVG RMDGKAVETA EVAVDECDLA EYYPVGDLES RTLSAIDRSN EAAGAAVRQR  5040
GEACLGPEGG FALTLLLGND GRAIYRDDDV DGCRLDDPCI GSRGDSEAFA TAGDKISANE  5100
MSVESRKEER GSDLVEENVA VGVEEMVVEK NVAASGGGGK EVVSNSANRN AVRLTAEIAS  5160
SSLTEEGRVA ASGGGKEVVS NSVNRNAVRL TAEMASSSST EEGRVVASGG GKEVLSNSTN  5220
RNAI
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
17289LKRCFKKKKKKKKKKKKK
27783KKKKKKK
37785KKKKKKKKK
47789KKKKKKKKKKKKK
57884KKKKKKK
67886KKKKKKKKK
77890KKKKKKKKKKKKK
87985KKKKKKK
97987KKKKKKKKK
107991KKKKKKKKKKKKK
118086KKKKKKK
128088KKKKKKKKK
138092KKKKKKKKKKKKK
148187KKKKKKK
158189KKKKKKKKK
168288KKKKKKK
178389KKKKKKK
1815341542RGGGGGGGR
1926402653KKRLEEESVGGRRR
2035163522KVLKKRK
2135283537RLGSRRGKKV
2238573865RGGGGGGGR
2339263932KVLKKRK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G60110.14e-17MYB_related family protein