PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG83377.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 2096aa    MW: 233499 Da    PI: 7.0715
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG83377.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix30.97.1e-1011621234267
    trihelix    2 WtkqevlaLiearr.......emeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikeg 67  
                  W+   + aL++a+r        m++++ r k ++ +We+v +++++ g++r  + C +kw+nl +++kk+ + 
  GBG83377.1 1162 WSVGDTIALVRAKRdqdlyiiGMGTSFPRMKTREWKWEDVRARLQSMGVTRDVVDCGKKWDNLMQQFKKVHKF 1234
                  8889999*******999999999**********************************************9875 PP

Sequence ? help Back to Top
Protein Sequence    Length: 2096 aa     Download sequence    
MADPDVELSL KFQERDIEEG KELLDVHHHL PWIVLGRHDD GNYNLQAWNY ENDTRVDWEL  60
PKGIKPWGVK FVGQEVRVMV VLKTQLVEYE IPPDSSSSTE SWTFALDTTD FSNWAVHSSR  120
RRVVLAVHPP GSSRYNIYFW DLDQNLHEER GTNLPEMTKM CFHPLESDIF VTVHGNGTIN  180
IWNAKTMTVL RTLEAEREVT SVRFCTKPQT SLLITGHKSG KLQVWDYPRK KCLTTFDAHE  240
HAVEHAFFHP QLPYILSASE DGAIKVWSDS NYQFKMLFSG LNGLSGSLLC GNDNTLVRLC  300
KKTLRVITIV GEHDSAAAPK HNFSSLKIKQ LETGLSTLPL KIKQLESVLS TLSLKIKQLE  360
TVLSTLSPKI KPLESVLSRL AAIKIELDVV DTGPIWSEHS YDNNARQEWI GWKNKVSNMF  420
KNVRVDIMDI GSQPSKENIY SERVKLLETE FSSSNRDGYP HQRVNDARKW RTVVNDITKE  480
LETKLQELQT KLLESQAELA KLQAELAKLQ AELAKLQAEL AKLQAELSNL RLERQAFEKE  540
LERLQSEYDT ERGMHSERVK QIGTELSTLR AERRALEEGF EKKRQTKHQR VKELDTELSN  600
VRMQRQALGE KVERFQSMHD REKGMESERV KQLETELSSA RAERMVVEQG FEEERQIHAK  660
GMKELETELS DVRTEMQGFG ERFERFQSEH ETEKGMHLES IMKLENEISN LLAEREELEE  720
SFEKPTVGAQ GLECRSEMDG GASEILKRCD GVPKGVSGGD MHPFREFSVD ELKNATNSFH  780
DDCKLEQRHY GCMYKGKITP VVVKRLGYEK STAQNQHTRL TMAILDLLKS LQHPHLQTLL  840
GVCYKGNCLV YEHMAHGNVK DWISSAKVSQ SGFLPWYIRL RIIAQVAQAV SFLHSHQSPG  900
GGPIVHCSIT LDNIFLDNSF VAKIANVDMA VLAPVFAKDG ETPDMNRSDV QYLAPEFFQT  960
EVFTPETDTY AFGITLLEML TGKFKNALGI MKDAVEDATT FRSTLDPNAG SWDIDLALEA  1020
ARVGLRCASL NKHHRPSITT GEGAVLPALE SIAHKVELAD SLEEERRISL CGGNRNESTS  1080
TVGGGVGARE RPEWMRLTVV TCVAKRIRRS SWTAAAGGFA SGRKGTNGGA TARKGHATKT  1140
RRSKKMDDGT GRSDGEGGRN FWSVGDTIAL VRAKRDQDLY IIGMGTSFPR MKTREWKWED  1200
VRARLQSMGV TRDVVDCGKK WDNLMQQFKK VHKFHNLSGG RDYFKLASKA RRSEGFIFVM  1260
DRSVYDEMEA MTKGDHMIHP KNLANTGAVG GVQMPAGAGA ARDTMATEGG GEAADEEPGS  1320
TKDSTFSAGS GSGYRKRKNM RQQTFEVVAE VMDKHGALMA STMDSVSKRQ CSMMLRQCEI  1380
LESEVEVQRK HYAAADEANG MISEGRRTRC TATTSTRDAL CLFSSSSHNT ATTKSPVSSL  1440
SSSSWISSMS SSIIIVDLVA LLLAADLYRP FGGPASFSHE DLLLAAEQCP QGSSPSRLGL  1500
LQCRCRSSLW ILSLSFSPRI LLAADPYRPF GGPASLSRED LLVAAEKCPQ GSSPSRVRPQ  1560
TAIDGGGKHV CARRRLWEEA GNIPADNQGR ERGRRHVPKA KRLRSEEASA SLPLRRGRSW  1620
TLTNVEEDDD VFTTEEEAAE DNVLAPRGSS LQSSSDQSGA RRLVTPPPEA QQVSAHNTQK  1680
AKEVVVDVGG EDDEPLESRR QRNVTQGATA TAVSIRAATE ERPPQGDLPS TPSQPRPRNT  1740
AAEGGSMERG GGEGALQEAC VASGGAITAA AAGSSGNDGV VARAREEVPV VEREAMRSDN  1800
KGERENEDPL LNRVRRGGMA RDLADRARLW VDDKAFWTTG EGRRLHNIVH ESLEYFVAIA  1860
SGLQTPVMPR SVIMPKSSTT LIRIADPAQL QQVIARVTAA ENIALRVLHG WVFKSGNRPR  1920
GFNVAFQYVL ELVATDIARV MWYGKEWSNV VSAVVHVHTI DLNMDLPLWF AGANIEDRLE  1980
DDDMTAHQEA TVICITHAFR AAVQMGGIVD GGFISHNRLS RIADCFRLLL AACMWLMCME  2040
GDNPRSHYQA FYFAKLVAKP TLVASMHRAF DHRRSVIRAT NVVTERLGKA NATFGE
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.13e-07Trihelix family protein