PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GBG62291.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 846 aa    MW: 89365.8 Da    pI: 6.6394
Description Trihelix family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
GBG62291.1     genome  NCBI    View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  64.5   2.4e-20  2      71   18         86
    trihelix 18 eerlrrgk..lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86
                e ++++++  +k++lWeev++km++ g++rs+k+Ckekwen+ k+y+k+ke++k++   + ++++y+ +le
  GBG62291.1  2 EPAFQSANslQKGHLWEEVGRKMSDLGYQRSAKKCKEKWENVTKYYRKTKESSKSG-GADARNYRYYAELE 71
                56666543349********************************************9.67778*******98 PP

2    trihelix  88.1   1e-27    366    449  1          86
    trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                 rW+kq+v+aLi++r++me ++++   k++lWe +s++m++ gf+rs+k+Ckekwen+nk++++ k   +   ++++++cpyf  l+
  GBG62291.1 366 RWPKQQVQALIQFRSSMEGSFQEPGPKQHLWEVISARMARLGFSRSAKRCKEKWENINKYFRRSKSKGRV--DGKKKKCPYFALLD 449
                 8***************************************************************988876..688899****9887 PP

Sequence
Protein Sequence    Length: 846 aa
MEPAFQSANS LQKGHLWEEV GRKMSDLGYQ RSAKKCKEKW ENVTKYYRKT KESSKSGGAD  60
ARNYRYYAEL ERFYREGGES ALAAAAAARS SQGGDGNGGA SEESEEEEHE GLGGQEVEEE  120
EDELVESGGK RKRKRRGDVE EALLSMERMM KRILYCHHQQ QQRMFETIDK REKDRLAKEE  180
EWRRQEQARL AHEQKLRESE HVSAANRESA LVIILQKLAA GMAPGCPQPL SLPLLAASLS  240
SPTKPSVNAC ALPTPALGAE AFPPQQAHGV AGMTMAMGGG TPATVFAPAG TAIARGSAAT  300
PASTLLAGGG AKGSGPVATV GSEGGADAGA ESPATAAIPV AKRTPSVTPV LAHDPEIVQD  360
RQAEKRWPKQ QVQALIQFRS SMEGSFQEPG PKQHLWEVIS ARMARLGFSR SAKRCKEKWE  420
NINKYFRRSK SKGRVDGKKK KCPYFALLDE LYNDKGRGGM LLAACSAGEE GTDLQAVSDG  480
LEPEVGQGRT EGITSGGRQA AGGGDGEAAL TGEASDSGGM CVEMVSGGGG STGGGGGFCS  540
SEEGLRGELS ACLLDAVEGR VKEQEMAMAT ANAAMGLGAE GVESQRRTFS KAWHVVLHDH  600
QTTGGEHRGE GHCARDSWGG AVPASEGVGQ VGVAADKGMR EEERGGNNAS TADEGSGEDA  660
EGGGLRVSGM IVQGEAQCDR EMGGTGVGLV DEENQAVVVK KKKKKKQKGA LVGPGVGKVL  720
GNKFVTAGGG GGGAGAAAAA VGMVSKSTRI QQLEGMVKGL VNSQQEQQKQ LMEYLSRSEQ  780
ERLKRDEERR QQEEARLARE QARAAERESA LMALVQKLTC NPAELPTQGT AGALPSLSTP  840
SENGGT
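
The grouped, position-numbered layout above is convenient for reading, but most tools expect the residues as one continuous string. A minimal Python sketch, assuming the block is laid out as shown (groups of ten residues followed by a running position count); the function name `parse_sequence_block` is hypothetical and not part of PlantTFDB:

```python
import re

def parse_sequence_block(text: str) -> str:
    """Join a grouped, position-numbered sequence block into one string."""
    # Keep only letters: this drops spaces, newlines, and the running counts.
    return re.sub(r"[^A-Za-z]", "", text).upper()

# First line of the protein sequence shown above.
first_line = "MEPAFQSANS LQKGHLWEEV GRKMSDLGYQ RSAKKCKEKW ENVTKYYRKT KESSKSGGAD  60"
residues = parse_sequence_block(first_line)
print(len(residues))  # 60
```

Passing the whole block instead of one line should yield a string whose length can be compared with the stated 846 aa.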
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    130    136  KRKRKRR
2    131    136  RKRKRR
3    496    504  GGRQAAGGG
4    700    706  KRKRKRR
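
The Start and End columns appear to be 1-based and inclusive: residues 130 to 136 of the protein sequence read KRKRKRR, as listed for entry 1. A small sketch for extracting a region by such coordinates follows; `region` and its `offset` parameter are hypothetical helpers introduced here for illustration, and the fragment is residues 121 to 140 of the protein sequence.

```python
def region(seq: str, start: int, end: int, offset: int = 1) -> str:
    """Return residues start..end (1-based, inclusive).

    offset is the position of seq[0] in the full protein, so a fragment
    can be addressed with full-length coordinates.
    """
    if start < offset or end < start or end - offset >= len(seq):
        raise ValueError("coordinates fall outside the given sequence")
    return seq[start - offset : end - offset + 1]

fragment = "EDELVESGGKRKRKRRGDVE"  # residues 121-140 of the protein
print(region(fragment, 130, 136, offset=121))  # KRKRKRR
print(region(fragment, 131, 136, offset=121))  # RKRKRR
```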
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G76890.2  7e-26    Trihelix family protein