PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG71984.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 861aa    MW: 93883.1 Da    PI: 7.9726
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG71984.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix36.81e-11147221269
    trihelix   2 WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegek 69 
                 W+ + + aLi+a+r+ + +l+       r k ++ +W +v  ++++ g+ r++ +C +kw+nl +++kk+ + ++
  GBG71984.1 147 WSVDDIIALIRAKRDQDAHLQgmghayaRMKPREWKWLDVVTRLKKVGVDREADRCGKKWDNLMQQFKKVHHFQG 221
                 99************777776643333336789999999*******************************988666 PP

Sequence ? help Back to Top
Protein Sequence    Length: 861 aa     Download sequence    
MQSPSPLSAG SSAGRRLGEC RETARGVADV GDAHDGREVW AEQRRLMRSG RKESITRGVQ  60
RLHVGEDGHD GEEAVGDAHD PDWNDNGAEG GEDNAGYISP SKQAAAMGGR GGKKKLCAGN  120
GRRGERTAGK GTDAEVDVDG EGGRHFWSVD DIIALIRAKR DQDAHLQGMG HAYARMKPRE  180
WKWLDVVTRL KKVGVDREAD RCGKKWDNLM QQFKKVHHFQ GLSGKKDFFQ LSGKDRMSKG  240
FNFNMDRAVY DEILGTTAKN HTINPKNVAD TGAQGGVRLP SASSADPESV GDGDGGAEHD  300
DDGDGSTKAS TSFRPSSSPP SSHFRRLSPS SLGLSSSPRT SSNTSPRLPS QLAVGFDSAG  360
HVIEAKRANK VVVGVSCAMS SRGSGRGKAA VNLAQQPPTR EKKGRHMASK KRKILQGAPT  420
HGGYVVDEEW VPEDVATQEG TKFENSDDMP LQRKSSRWGS GGLRIDDAGD RRLAGGRAVP  480
EDVLDVDATM AARKGGGNRA PLPRVKAGNV RGEGDDDEAS VNRVRQRNTR EGMEAAAKLW  540
VDDLRFWNER EGFAIVKLIV EARGYLVAVA RGEQPPPIHR SIVLPHNNIP QQKIADESEF  600
NAAKERAVKV QGIALRVIHG WVFKSQNRQR GYHAAYQYAL NHVATDIARA MWLGEDWRYC  660
VSPMVIHHTL DMDMKLPLWF VGGDVEDRHE DDDLAAYQEA SIQRLVGAFT SAVIIAEATN  720
GGRVSHERLK TMADAMRMML KVAMWLMRMA GDDHRAHYGA WVFVQLTMKS TLVASMHCCF  780
DARRHIVQAA TVITDKLASP PITLIDPPMY VPDWASIGVK FSHDATLSSS MEAKKVDWLG  840
TGPPEDEDDG KGDEEGSGGG R
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.12e-08Trihelix family protein