![]() |
PlantRegMap/PlantTFDB v5.0
Plant Transcription
Factor Database
|
| Home TFext BLAST Prediction Download Help About Links PlantRegMap |
Transcription Factor Information
| Basic Information? help Back to Top | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| TF ID | GBG70066.1 | ||||||||
| Organism | |||||||||
| Taxonomic ID | |||||||||
| Taxonomic Lineage |
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
|
||||||||
| Family | MYB_related | ||||||||
| Protein Properties | Length: 5224aa MW: 550465 Da PI: 4.9329 | ||||||||
| Description | MYB_related family protein | ||||||||
| Gene Model |
|
||||||||
| Signature Domain? help Back to Top | |||||||
|---|---|---|---|---|---|---|---|
| No. | Domain | Score | E-value | Start | End | HMM Start | HMM End |
| 1 | Myb_DNA-binding | 23.3 | 1.5e-07 | 14 | 46 | 3 | 36 |
SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS- CS
Myb_DNA-binding 3 rWTteEdellvdavkqlGggtWktIartmgkgRt 36
rWTt E++ll dav+++G+ +W+ I+ ++ gR
GBG70066.1 14 RWTTLEELLLTDAVRRYGSANWSVISMELR-GRA 46
8****************************9.774 PP
| |||||||
| Sequence ? help Back to Top |
|---|
| Protein Sequence Length: 5224 aa Download sequence |
MEGRSNCDNG AQWRWTTLEE LLLTDAVRRY GSANWSVISM ELRGRARQMG GRSESSLNRL 60 SPEVCCTKFD ELKRCFKKKK KKKKKKKKKE NETATKIHRV KSDNSRNTNP ENNNSNNNNN 120 NNNNNNNNNN NKSNNNDNNS NCNTSKINKR KSRSRCDRSE ETGGRVLSSS PTEVTGGNED 180 EEREREGLQN GIKEGGEEEE EEDDGDDDGD GDCEEEEEKT EKEGEEEEGE GEGGSWDKNL 240 VELIEQELRQ KRCAELKREL ERRDGWIDDL KMKIMRLKRD RKVDCPSRGA RSMVADKQEK 300 GKDCLKSGVG GGEGLSERLL KSRAAGTGLA SDGQRRAEKD KIGEKQQRGD EWEGERQEVG 360 EGGEGGGGGG GVRSKSPQRP RGKSSGLGSS STGHDRYGFI QRDREGEGGG GGGGGDGEGE 420 RERDEGSKEA GGGDEGGSGV GERATGTEEG GGGGGGGKGG QTGGHEGGGG GGGGGKGGQT 480 GGHEGGGGGG GDGLAIYGGV LIGGEGQGGG EGAGEGGGGG RRGDYAAAEG FANDVDGRSV 540 RRAAVEGVRP RSDERDGVVQ LPVGREIKRA EQLASGTGGR ASPLCAESKA ELRGNGLRLP 600 VVTSAASQSP PGRIGDPDGS SQGTPVVSVS CKESVPRGEG VEEEGDGRSG GGGRSGPTPA 660 TGSASSFQRK LRLRLRRGNA GAICAGHYHG RDISSSCADG AKNSSPDGPD TTDRGRLAGA 720 DLADDWDPAG GLRPAPHDLA SAAGRDSRPP LFGGEESARP MNEIAGAARR GVIGSGAAGP 780 SDIITLDSAT CHSAAVSGER LGDGAGGQPE RKADLTGCGD DGRKVNDDEA ATNYRPLDHP 840 HRLTEIVHVA DHVTARPLVS SHQVLARRTK GTTTMPAPDR YPVLASDSII GCLPPCPTAC 900 AKKHVGGGDV ANRCADVDTC GRDVTCRRAD VAGCHEIAGR ADVAVGFAAE GAGSADVADR 960 IDYIVTDRAD VAVRRSVLAS EVSGADEVGR DDGVNVSDDR DADVARRRPN VSDRRVVVSS 1020 WRRGGGPGPR RADVAAGGHR RELVSGSRRG DGEYHRSHDV AGRGPDVSAR EGEGEGDDVQ 1080 AAAAANHSGN QSGGIADVDA SGPGIAIFGI NSTDESDYSE SGPVATAIAA EDPPRQHRNC 1140 GGSKAKTAAR RAISNRTRGS STWRDSDAAA DVAAAERGPD DTAAGALAMG RGFSDADVAD 1200 CPSAAAGLST PLTAATSVSD ADVEKDVPSS NRTLVDRCAF ADDARDSRIV VHVATSCSRG 1260 RSADGHVDVS DGDRFIRSMD IVPSGRLRSC SRCPDTDKVV VVVPGQNGEG GGSEAVGTDD 1320 LPRIGHVSIP PDCRVSSPVC ERADVDVSRR MLSERASRED VTCPSAGDAH VTGDDRVAEA 1380 GGADAALVVS GSRREADKPA DALMAAAGWD PGADDLLAAT VCHVDDHASG RMKSPCCVMT 1440 GTSGAEDGKI ELVPRPRRNH CADKLCNAGR AYRCDPDRYR AIPIPIPNVD GSSGVRPPSS 1500 DDRGSKIVVA MDKESDKEKI ENRWGASINS EKQRGGGGGG GRSKDVVLMR SEGVSREEES 1560 YVDAVRQLYD VYNSGESSAK CATDRHVSSS DDEEGLSVHA QQCTTARHVS SCSDGEALSR 1620 LPGAEVLAAS TSSRLTISRV EVCGGERRKE YDKLRRCCFN GVISEARERE EGEVVEKFTM 1680 DDSKKAVLEN GNEVGGSGGR SCCEDSSDQM RGGGRGAECL VPDIRASVMT RTTDVRQEVH 1740 QEVHQEVHQE VQQQEVQQEV QQEEVWQEED RDGKEQEGGS KGVGVDVNEK SWEGANGNDG 1800 GRRREEEDVD MDMDTMKHGD KDEWADKTTD ADVVRKNCDR IGGKWKEKDL AGKGKDLAGK 1860 GKDLVGKGSA SEHLESSGRD ESARRAEDDR SRGDFLRGGR EKDDKDGGGE EGSSPLRGGE 1920 EGRVTGRRIA LLREEGNDDI SDGGGGGGLE TYVNRQRIGG KVEEEGRKGE CVEGDRASAC 1980 HSRTSRLTRG REEEEGRSRR QQDVEEVVEY AVGDGVEIQQ TLRDLPTGIP IGVTIATCPL 2040 VSEGTSRMAC ADVRIPSTAE ICTIADAAPR EGGEHSVQKG SGSRVGIGRR RERSSSLSCA 2100 AMDFSLPGGE GMGIEGRDIE EEKEGEGVDD LPPVERMETK GGKGEDHHMQ KKCSAGQTGI 2160 IVAGEEKQQE EKPSAVEPRG GGNPGVSERG LDLGEIGGAA CSFVGDGEIG REREGEGLPE 2220 AVGYGSHVGT ELTRGERGRG GGGGGGGGGT SQDLRMEREG RLQRCRRGGG DDEAIKVDDG 2280 ERKAGGRSVG DQLRAHEGGL SSDDGDRSSR DRLRMIVPGS GSREVEGYSG GNEDCSDGFV 2340 DQGGESLARP AEEREEREER EGREGISVNG RVKSGLVLDR EERRGQLEKK REVEGNGRSG 2400 SRDESSSKGS MEAGKEEKMA GEVGGRRLTL SGLIEWSRKG EGEGHGGKEG KMERASSALL 2460 PETSPEIKAK ESAAAAAAEE EEEEEGKKAR CVEEGVVQLD DMMDDKKDDD VEVELSEGEN 2520 GAAIGRSGWL VLEAEETEET EECMGVEEMG NEEEDEQGVS RDGEEHARMA KEIEIGMRKE 2580 VVSLQQTEEG GSRKKTKDMA IEAGGGENGE TCKKEEEEEE EYTIPPPEFR ASLSTLDVGK 2640 KRLEEESVGG RRRFQRVLAE EEEGEGEGEG KREERKVWRV KGEQPDLAND KRVAMAMAMI 2700 HQRETGVEVV DCEKTLHAHD DEEEGGLDTA HEADEQGGGG VSKAAGGGRE KEGVVVLRAH 2760 SSSSQVLTSR GIVMCEFGEE DDRDRDVGVK GDGNSGGEEC LLEWMKGDGD FGGKRDSESA 2820 ADDDERFGGK RDSKSAADHD VRFGGKRDSE SAVDDDERFG GKSVSKAKRG EGVGGKGVVV 2880 RLGDEAGSFG GKEADDSVNH RALRMGKELA DEGMVASRDQ GGANRDRDDG VRGLVGGASL 2940 EEGERSDLDN ASKCNAPKEN LKDEKGEEEE EEEEEEEEGR PRERCSGLVK MYEEKRGRGV 3000 FDGREQEDAE GIAANSSDEK DDNGKEKTNV GAKVVGRRGV EGGGEQIDIT VDDPDPSSAM 3060 GFVAGTSGEE DDQRSAVSEA AGVVTEEMKI GGGGGVKGME GDVLGGEMDL EGWVDGCFGG 3120 KNVSSAKNGG LVDGGKESLD MSRRVMVEEE GGVDDYQRKA LIRVKGRKTG ARKMVMGKDS 3180 EGKEVVVIEE REKIVGKKVK EEEQGAKAGV KAGKVEGKEE EEEEEEEQGR TVGAKAAADG 3240 EGEEGRRARR MGETNMKEEE EERERESVAG RGQEKEESGK RVGTSRKEEE RWMDEENAME 3300 KGTDWENGGV GRNVSRGGVT TGKRGTGQLL VNAEEGEEEG GGREKKGVEC GNEEAMEFCA 3360 SHHDGDDDKV KDGEEQDGCA DRSMEERGSV KGAQSEILNS NATMMKDGCA NRSMEERGAA 3420 KGGQSEILNS NATMIKDGGP CAMELQLAEG GGGGGEEGGG SEVTDRQKVV CTTMIADGGG 3480 SREDGMSIFG NERATTRAAG SWTGEGEASA GSNGGKVLKK RKASGYGRLG SRRGKKVREN 3540 RKKGGDGGRD QGGDGERGGG GRRRGGAGLS WDKGQQPGES ERGSEVWART EGEENQSPPP 3600 KRSRREPKVS GRLLPLVEVL RGISSHKFAS LFRHHPDSRE HPKYDDLVRQ HVDLTMVRSR 3660 LEGGIYKNSM EFYRDLLLIF NNALVYYPPR SQQYVGALTL RDRATREMEV VFKAESLLSA 3720 DHPMTRRARQ SSSNTTQRAG VDASAAAGGG AGGTSTSDAA ARGGGGGEGG GGGGGVAGIA 3780 DGQSVGNDGG SSVAMVDSSC AEKRPALASL SVRDNDGKGV GNRDKGVYTS QPLTMLEKQA 3840 LLPSAAELKL KMTNTNKGGG GGGGRGKRGG PPVVKKSPLD TVPPSSTDDG LRKRVGKQRG 3900 SGLVIGAVQS DAADDSRSSE NLLEQKRVKR RKTTATAVSS KSNIATAGGE PKTSAPGEEL 3960 GACKLAVEAS DRALKEVDRR PTKVSIPLPK RTASGFAVKA QAAMAKKTTV VAKIAVRVVT 4020 GRAQEALRKG RAASPIGSGY QQHSAHNMEW PELTADGGVG VMEEGGGEQS CANEREGREG 4080 DEGKREEEEG KREEEEEEEE EEEEGEMATT GQVNSLPVNL KVPYHAAIGE GLGGRMHTKR 4140 GEGTSSTCTE VCVVGDLKTG GGRTSKGKAR VENRMGSYVM ASNTPGSAAD GRQSRRGAKR 4200 PHEADVMLTE SKNRGSSVNQ AAIRETEWVK DAVPLVEDQR ELGSRVLRGG RGAGRCGGAG 4260 TKEARQSCDF IPHSVDWGGW QDGRQSEVKP HGRKQVMVPG GSRLGEGGKN QIAATSENQR 4320 PLGRAEGGSG VAVIESGASG GVLRLSVKMP PNSKGPGERV NLRCDHPPWV ERTRQATAAM 4380 TPTVGKGDGA RGGGTAAVGG GGISTRRGAG RGGGGRDGTG EGSEGGVSSQ GISGGGGSSG 4440 KGGNASGGVS GGRKMSVAKG PASNPESSTE GYAEKEPSLP SRLPMKPIHA VKVADDARNR 4500 REGGRKLPGT KGCTRTPKVH PGNEAAGAEE SAGNGVSVLA KEVKRGAKQN RKPKEAAAGS 4560 GVRKRAAADR MGGRKSGAES VAPPRKTPPE TPPAKVEPRE SAEDLVTVAS LLKTKVALAR 4620 SAKVVQPFLD LADGKEGAAA AKWCMTYLAA AKPTVILGFP GTGGGRNGNG AKVQSQISKI 4680 LKVPKTPRLH APQDPMDAVG MGGAGKTKKE AEPTWVGKGG TQGAQRKWED GKTGQSCWEA 4740 KARQLESESL AVAMAAKGAR SKRKVGSGYQ VIDDDGADGE EVERELAFEE RVDHHPPIVR 4800 GRGSKGKDGW KNDGGQGRAV KASAVAVVLL EFMLEAIVLE FSMHRQKGEK VQVKDLGIRM 4860 RDAFEQVVFG PSRDGGYYLI GMTEVHDVLF KDIPWSTPEV LDLSIKLAQG NGISVLPTKM 4920 MPILQDIDTV EDLEAWLMEL DADRRCKGND RIDSPELGTD RRDHGNGPVD STDLDADQRR 4980 RGNDPSDAVG RMDGKAVETA EVAVDECDLA EYYPVGDLES RTLSAIDRSN EAAGAAVRQR 5040 GEACLGPEGG FALTLLLGND GRAIYRDDDV DGCRLDDPCI GSRGDSEAFA TAGDKISANE 5100 MSVESRKEER GSDLVEENVA VGVEEMVVEK NVAASGGGGK EVVSNSANRN AVRLTAEIAS 5160 SSLTEEGRVA ASGGGKEVVS NSVNRNAVRL TAEMASSSST EEGRVVASGG GKEVLSNSTN 5220 RNAI |
| Nucleic Localization Signal ? help Back to Top | |||
|---|---|---|---|
| No. | Start | End | Sequence |
| 1 | 72 | 89 | LKRCFKKKKKKKKKKKKK |
| 2 | 77 | 83 | KKKKKKK |
| 3 | 77 | 85 | KKKKKKKKK |
| 4 | 77 | 89 | KKKKKKKKKKKKK |
| 5 | 78 | 84 | KKKKKKK |
| 6 | 78 | 86 | KKKKKKKKK |
| 7 | 78 | 90 | KKKKKKKKKKKKK |
| 8 | 79 | 85 | KKKKKKK |
| 9 | 79 | 87 | KKKKKKKKK |
| 10 | 79 | 91 | KKKKKKKKKKKKK |
| 11 | 80 | 86 | KKKKKKK |
| 12 | 80 | 88 | KKKKKKKKK |
| 13 | 80 | 92 | KKKKKKKKKKKKK |
| 14 | 81 | 87 | KKKKKKK |
| 15 | 81 | 89 | KKKKKKKKK |
| 16 | 82 | 88 | KKKKKKK |
| 17 | 83 | 89 | KKKKKKK |
| 18 | 1534 | 1542 | RGGGGGGGR |
| 19 | 2640 | 2653 | KKRLEEESVGGRRR |
| 20 | 3516 | 3522 | KVLKKRK |
| 21 | 3528 | 3537 | RLGSRRGKKV |
| 22 | 3857 | 3865 | RGGGGGGGR |
| 23 | 3926 | 3932 | KVLKKRK |
| Best hit in Arabidopsis thaliana ? help Back to Top | ||||||
|---|---|---|---|---|---|---|
| Hit ID | E-value | Description | ||||
| AT3G60110.1 | 4e-17 | MYB_related family protein | ||||




