PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG80203.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 735aa    MW: 80601.8 Da    PI: 7.004
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG80203.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix38.43.2e-1242113266
    trihelix   2 WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergferspkqCkekwenlnkrykkike 66 
                 W+ +++ aLi+a+ + + +++       r k  + +W++v+++++  g+ r++++C +kw+nl +++kk+ +
  GBG80203.1  42 WSVEHIIALIRAKCDQDAHMQgmghayaRMKPWEWKWQDVAQRLKNVGVDRNAEKCGKKWDNLMQQFKKVHH 113
                 9*************777777744433325678899**********************************976 PP

Sequence ? help Back to Top
Protein Sequence    Length: 735 aa     Download sequence    
MAGKGGKSKP SGRNARPRAK KGQGKGSGGE GDGDAEEKRN FWSVEHIIAL IRAKCDQDAH  60
MQGMGHAYAR MKPWEWKWQD VAQRLKNVGV DRNAEKCGKK WDNLMQQFKK VHHFQSPSGG  120
ADFFQLTSKE RMSRGFNFTM DRVVYNEIEG STGMNHTIHP KNVADTGASG GMRTPSTSYV  180
DPESVADGEG GAGREDEEEG STRGSSQTXG TPXGXGKRKS XRQQTFEALT ECMEKHGELM  240
ASTMESASKR QCSIXVRQCE AXEAEVEVQR KHYAASDEVX KLMCHALLEI AKAILFVDVA  300
RDVARDVARR DVGGRAKEGA ASHETPQCVL APLNRPRTPA ADVAGSSQAA VEGGTLQSPA  360
VAARGGAVAV PGEAVEVPKG GDGVAAGEDD EALVHRLRGQ RAATHAMDAA AKLWEDDNRF  420
WNDTQGSAIV RIIQEVRTYL VAVTRGGQPP AIRRSISLPH NSIPQHKIED ESELNAAKER  480
ALKVQTISLR AIHGWVFKSE SRQRGYHLAY QYALKHAATD IARAMWSAED WRSLVSPMLF  540
RATLDVDMKL PLWFVGVNIV DRHEDDECAA YQEACVQRLV QDFTSAVGTT EAMDGGRVSY  600
ERLKGMAEAI RYLLAATMWI MRMAGDDPRS HYDAWVFVQL TAKTTLLASM NRQFDARRHI  660
TQSAQVMTDK LGRLPLTFAP PPAYIPDWAS KCGVTFSHDA TLASPMEAKR LDWLGTGPPE  720
DDDDDAKGDD KGEGG
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
11523ARPRAKKGQ
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.15e-08Trihelix family protein