PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG71436.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 865aa    MW: 91735.8 Da    PI: 5.3383
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG71436.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix29.71.7e-0979151267
    trihelix   2 WtkqevlaLiearr.......emeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikeg 67 
                 W+ +   +L++ +r        ++++++r + k+ +W++++k+m   g  + +  C +kwenl + ykki++ 
  GBG71436.1  79 WSLEDQVLLVRCKReqemhlaGLGHNYGRMRTKEWKWDDIAKRMTNAGRPKDADDCMKKWENLFQHYKKIQRF 151
                 5555556666666622222223345555788**************************************9875 PP

Sequence ? help Back to Top
Protein Sequence    Length: 865 aa     Download sequence    
MQAHSDGGDD YGGGGDDADE GFREDVEARD DDEDIPIWPL GKTGGRGKGR SRGVVRGRSV  60
GCGGRRGVND DGGKSATYWS LEDQVLLVRC KREQEMHLAG LGHNYGRMRT KEWKWDDIAK  120
RMTNAGRPKD ADDCMKKWEN LFQHYKKIQR FKNVSGQADF FRLSNEERKE HNFKFRMERV  180
LYNETHAGML GNHTIFPPNV ADTGSPDVVQ LPRRGAVGGE SVSNMEEEEE EGRRRRKEDG  240
EEEEEEERRR GGGGGGKKTK GGGGGGGGGK KTKGGGGGGG GGGGSEEEEK MKKKGGGRGG  300
EKKKMKAGGG GGGGGPAGVE IGTRRRRKKK KMKGGGGGGD NKKANGGGGG DNKKANGGGG  360
GGGAAGDEVG TFGQAADNDI DAIMPTVRVP DCEGVNHGEE FLLVGGVIHL RGEELLACEG  420
DGVFARWSLG DREDLAKVLK VGLEGGAEDK DVIKVDDDAD FEEVAEDVVH GRLEGGGGID  480
ESEWHYEELV VPEPRAEGGF VGVLLADTDL VEATAKVDLG EVLGSTETIK ELGNARPALQ  540
THASVMKLPV APMSRRARNL VVDAPRVAVM NLRWKRDELA VKMMVLRRRC SLPKRTSRSC  600
RACSSTTSEK VTVDGFGRRL EEHGARGGVC DAWESSGEGG VTGGGEVEEL SMPDRCPGDC  660
VHDEGPVLVK GGDAEVVGRV EAEWRVWFEE VMEVVSSGVV WWKSGCGRGR ARSGGGVESI  720
VVKHAREEVS KGHVGFVGEG GCKVFVAYSF DAGDERKVGN DGVGEVVAQG ADVVDEAVRE  780
TGLAEVAELF KVVVNGFLGA EGGSEKVGPL EEGVTWSSGG STVFDFSHPP FGYIAKEAGG  840
GNGEPVGTGH VVEVKRVQED RPRGQ
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1233238RRRRKE
2233260RRRRKEDGEEEEEEERRRGGGGGGKKTK
3233273RRRRKEDGEEEEEEERRRGGGGGGKKTKGGGGGGGGGKKTK
4234261RRRRKEDGEEEEEEERRRGGGGGGKKTK
5234274RRRRKEDGEEEEEEERRRGGGGGGKKTKGGGGGGGGGKKTK
6253263GGGGKKTKGGG
7254264GGGGKKTKGGG
8255265GGGGKKTKGGG
9260270KGGGGGGGGGK
10266276GGGGKKTKGGG
11267277GGGGKKTKGGG
12268278GGGGKKTKGGG
13324330RRRRKKK
14325331RRRKKKK
15325331RRRRKKK
16326332RRRKKKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.16e-06Trihelix family protein