![]() |
PlantRegMap/PlantTFDB v5.0
Plant Transcription
Factor Database
|
| Home TFext BLAST Prediction Download Help About Links PlantRegMap |
Transcription Factor Information
| Basic Information? help Back to Top | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| TF ID | GBG71018.1 | ||||||||
| Organism | |||||||||
| Taxonomic ID | |||||||||
| Taxonomic Lineage |
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
|
||||||||
| Family | Trihelix | ||||||||
| Protein Properties | Length: 1473aa MW: 160490 Da PI: 6.3417 | ||||||||
| Description | Trihelix family protein | ||||||||
| Gene Model |
|
||||||||
| Signature Domain? help Back to Top | |||||||
|---|---|---|---|---|---|---|---|
| No. | Domain | Score | E-value | Start | End | HMM Start | HMM End |
| 1 | trihelix | 34.2 | 6.8e-11 | 899 | 974 | 2 | 70 |
trihelix 2 WtkqevlaLiearr.......emeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekk 70
W+ ++ aL+ a+r m++++ r k ++ +W++v + +++ g+ rs + C +kw+nl +++kk+ + +++
GBG71018.1 899 WSVDHMVALVSAKRdqdahleGMGHAFARMKPREWKWQDVHETLKKVGVARSGENCGKKWDNLMQQFKKVHRFMQE 974
999999********77777777777888999**************************************9887665 PP
| |||||||
| Sequence ? help Back to Top |
|---|
| Protein Sequence Length: 1473 aa Download sequence |
MVCMRTLLSR GGTVPGDVIN DWERGISAAR TMSHQAGGVP LVVPCSVPWV VCFHDAKAAK 60 LDLHTATWSL NALTDRIAGS CVLNDTRRRG CLARRERSIA AGGPTADAVW NPSEKDHTLA 120 EKSSCSCLHT TSDLAVNWNA DESRGNGSHS KSNQASESLA FSRTSVKDIM AVHLAAEEQL 180 SADSARGYSS EDGPEPSMQT TTVRDDSGEG HLQARQMEER GLQFWIGDAF YRAESALSRD 240 LSTLAAAVYR DKHGRLTVID AMAGCGVRAA RYLSQAGADF VWANDASIAN HLPLVQNLSR 300 CSQLEACNGS ARTLGSGPEV EWGMEAATEG VAPGGDSTWR VSHDDVVRVL SSRRSGPSRE 360 DINGAANVKD DHFDIVDIDS FGSDAWMFSS ALQAVRYGGM IYVTCTDGLT AGGHQPLTSL 420 ASYGAMVLRL PYANEIGIRM LIGGLVRQAA MWGLHAWPVF SHYSYHGPVF RAMMRIQKIK 480 GSHSWPSEWY AFIGYCHSCG ESRVVQWEDL GNVFCACRQD AGRPMQLTGP LWVGPLHTEE 540 DVREMESKAE SWGWITSGQT KEAGGSRSKG KSLDLRTLLE IMLEECQPGL SPYYTSLDLI 600 SRFGRIQTPR RDALVQALRQ EGYIAGRSHV ERSSVKTNAP MAECIRNCGH VLITVVVVVM 660 CRSARGVPAM VPDEEDAHAD VNLLFGLCSG SSTAMTRKLI INPHPDGDWG DAALVGRSGG 720 GSSIGSPATK KTRDRHCLQS KSAPATRPEV VRSQRMQLPS PLSGGQVSGR RSGGGGVDAR 780 FAVDVDERTG RRVWAEHRQQ LRGDREDAIT RGVRQLHVGG SDTDKEAAVE VDDFEDMDEE 840 DEDNEEAAVD IRPVGRTSMG GWGRSKKAYA AHGRQTKKTA SGGSDGEGDI DVDSGRNFWS 900 VDHMVALVSA KRDQDAHLEG MGHAFARMKP REWKWQDVHE TLKKVGVARS GENCGKKWDN 960 LMQQFKKVHR FMQESGKPDY FQLTGKERRS HGFNFVMERA VYEEIKGSTT KNHTIHERNV 1020 ADTGAPGGVE MSSGSWGGRG DGGGVGDVGG EPQEEEDGST KVKETESSTS LTPRRQAVQD 1080 EAVSGAFQRA VGSCGTGGAG GVVAAAGAGG AVLTRAAGGA AHAVNNTDDE DEEPLANRVW 1140 RGNGPIVLAE QTRLWVDDNR FWNDTEDNRM YKIVNGTRNY LVSVVRGVNP LSRSVVLPHS 1200 SIAQGTITDE SQLREATERA AKVQRVVMRV IHGWIFKSTS RSQGYTAAYG YLLQHVATDI 1260 TRAMWCAEEW SVCVSSVVCH VTLELGMQVP LWFVSTHIED RPEDDELAIY QEETLQRLVA 1320 GFTSAVVVAE LQDGGRLSHD RLKNVAEGMK VFVVVVMWLM RMSGHDLRSH YAASFFVQLA 1380 AKPTLLASMH CSFDARRHIT QAANVVTERL CKPPMVLADS PHYILDWASC GVKFGHDANL 1440 PSPDDAKRLD WLGTGPLEEE DDDDGEQQKV GGR |
| Nucleic Localization Signal ? help Back to Top | |||
|---|---|---|---|
| No. | Start | End | Sequence |
| 1 | 1037 | 1044 | GGRGDGGG |
| Best hit in Arabidopsis thaliana ? help Back to Top | ||||||
|---|---|---|---|---|---|---|
| Hit ID | E-value | Description | ||||
| AT2G35640.1 | 2e-07 | Trihelix family protein | ||||




