PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GBG58912.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 1027 aa    MW: 114657 Da    pI: 7.8542
Description Trihelix family protein
Gene Model
Gene Model ID    Type      Source
GBG58912.1       genome    NCBI
Signature Domain
No.    Domain      Score    E-value    Start    End    HMM Start    HMM End
1      trihelix    35.3     3e-11      386      462    2            71
    trihelix   2 WtkqevlaLiearr.......emeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkr 71 
                 W  +e l+L++ +r       ++++++ r k ++ +W ++s++mr++g+ r    C ++wen+ + ykk+ e+ek+ 
  GBG58912.1 386 WDLNESLVLVQCKRdqedylaNVGSNFARLKTEEWKWTDISNRMRQQGVMRDWDSCMKRWENVIGHYKKVLEREKES 462
                 899999********4444444444455556899***************************************99973 PP

Sequence
Protein Sequence    Length: 1027 aa
MRRCCVSLHD RCCAEEQTGR LQRPDAVLCV LRHGSSESRG QHKVHDHGGC VHCPLPCCRV  60
LTTHVHMADL LRKTALLALL LFALLTVLVV QLWQLGRRAA VQHQPKSSKK KVRFVGAEKS  120
MGDYRSGAGR EHGMEGAGFT NGTDVREHRF DPALYSHLPS WQQPLPDDNH WEPPSSTLPL  180
GWGLTLTSYD TVGIPHRRAS SLGPMSALLG KVREGGGHPL DLHLSPSCSM HVSRTLIVDH  240
SSDGRRPMRV ATDVPTKGLS SASVNPQQEV AMKNIGAAAA DSVQRGVTGG TTAAESLPVI  300
ERIIARVSNM RATGQGEQGG VGNAGCEEEG EVDDDDEDDS DTDEEEEDGV QEVTARGGKK  360
KGSNGHGKGK AKKGGEGGGG NRRGLWDLNE SLVLVQCKRD QEDYLANVGS NFARLKTEEW  420
KWTDISNRMR QQGVMRDWDS CMKRWENVIG HYKKVLEREK ESGVQSFFTL TAKQSKQLGY  480
KFTMDQMVYD AIDGMQQGNQ AIHLPNLADS GIPQPQQSQQ GEQSHVHGPP ATGETSASEN  540
GEGEGGDGCG TRSSGSAPHG KRKNARQLAF DSVTDVMKTH STVVADSVDR ASKRPCEVIK  600
RQCDIMEREA MMQERQCEVL DAGQRMLCDA LLKIASAFVF RESMDVVVAL RGRIAGRTVA  660
FYEMRHHRDK DCGRAFDFVH TVTRYEVRHF EVDDNERACL RLTVGYGFAR GLLTHVVAFA  720
TKVGDAIPDR WDFGVDVLAD LVDLLVSNVA MEFNDRLDKA AANCWAAGNT RPQQQQFQQQ  780
TSQKHDDDET SEMKAYFRKK IHKQKLEEEK REKEEEERRR RQNEERKEAD RLRELEAREA  840
RLEAKLLRLI SQHTKTVSIP APHIEKKKSP RTKARMLREI RSYLDESKNE SDEVKEEAGR  900
LVDAIEKRKG KHKINGGEAW LSNMKMRASK FTPIDIDDLP DELWTPPTRK RDGRNDAREE  960
NILEFALDLH QKWSAIKAPE LKKICNKESI KWTKKETAIT ELVRCRTKLV YGEGGTGNQG  1020
ATSLSGK
Nuclear Localization Signal
NLS
No.    Start    End    Sequence
1      368      374    KGKAKKG
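
The Start and End values in the Signature Domain and NLS tables can be checked against the protein sequence by treating them as 1-based, inclusive positions. The sketch below is one way to do that, not part of PlantTFDB itself. The helper names `parse_numbered_block` and `subseq` are hypothetical, and the parser assumes the layout shown in this record: groups of residues separated by spaces, with an optional trailing counter giving the position of the last residue on the line. Only the two sequence lines covering residues 361 to 480 are copied in.

```python
def parse_numbered_block(lines):
    """Parse sequence lines such as 'AAAAAAAAAA BBBBBBBBBB  420'.

    Returns (first_position, residues). The trailing counter, when present,
    is taken as the 1-based position of the last residue on that line.
    """
    first_pos = None
    expected_next = None
    parts = []
    for line in lines:
        tokens = line.split()
        if not tokens:
            continue
        end_pos = int(tokens[-1]) if tokens[-1].isdigit() else None
        chunk = "".join(t for t in tokens if not t.isdigit())
        if end_pos is not None:
            start_pos = end_pos - len(chunk) + 1
        else:
            # No counter on this line: continue from the previous line, or start at 1.
            start_pos = expected_next if expected_next is not None else 1
        if first_pos is None:
            first_pos = start_pos
        elif start_pos != expected_next:
            raise ValueError(f"line does not continue at position {expected_next}")
        parts.append(chunk)
        expected_next = start_pos + len(chunk)
    return first_pos, "".join(parts)


def subseq(first_pos, residues, start, end):
    """Return residues start..end (1-based, inclusive) from a parsed block."""
    lo = start - first_pos
    hi = end - first_pos + 1
    if start > end or lo < 0 or hi > len(residues):
        raise ValueError("requested range is outside the parsed block")
    return residues[lo:hi]


# Lines covering residues 361-480, copied from the Protein Sequence section.
block = [
    "KGSNGHGKGK AKKGGEGGGG NRRGLWDLNE SLVLVQCKRD QEDYLANVGS NFARLKTEEW  420",
    "KWTDISNRMR QQGVMRDWDS CMKRWENVIG HYKKVLEREK ESGVQSFFTL TAKQSKQLGY  480",
]
first_pos, residues = parse_numbered_block(block)
nls = subseq(first_pos, residues, 368, 374)      # NLS row: Start 368, End 374
domain = subseq(first_pos, residues, 386, 462)   # trihelix row: Start 386, End 462
print(first_pos, nls, len(domain))
```

Deriving each line's start from its trailing counter lets a partial excerpt be parsed without the full 1027-residue sequence, and a counter that disagrees with the residue count raises an error instead of silently shifting coordinates.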
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT2G33550.1    1e-07      Trihelix family protein