PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG76579.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 948aa    MW: 102007 Da    PI: 7.8001
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG76579.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix37.56.4e-12267342270
    trihelix   2 WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekk 70 
                 W+ +   aLi+a+r+ + +l+       r k ++ +W +v +++++ g+er++++C +kw+nl +++kk+ + ++ 
  GBG76579.1 267 WSVDDMVALIRAKRDQDAHLQgmgtayaRMKPREWKWLDVEQRLKKVGVERKAERCGKKWDNLMQQFKKVHHFQGL 342
                 999999********8888777444333356899999*********************************9887665 PP

Sequence ? help Back to Top
Protein Sequence    Length: 948 aa     Download sequence    
MDGRQAVCKS VGGEGGGQRA GQSPSRGSRA VERSEYAHLP PHLQPLPDTS DEEEDDRRSR  60
AVPLGSGSTQ EWTATELCGS REPSYGQSYT QLLQQGLSGD EGDGASSVRV ASGNNKDPRT  120
QQYRASSASR GASPRPSWML SPSPLSGNSS AARNRGECGE HDCEIEERGD ERDGREVWQE  180
QRRMLHPRRQ ESITRGVQRL RVVDDENDGD APDVGGNDQD LNDDDCGGGE DDAGQASVRN  240
GGRGKKAMGK GSDVEADGDA EGGRHFWSVD DMVALIRAKR DQDAHLQGMG TAYARMKPRE  300
WKWLDVEQRL KKVGVERKAE RCGKKWDNLM QQFKKVHHFQ GLSGKQDFFP FTGKERLSKG  360
FSFNMDRAVY DEILGSTAKS YTINPKNVAD TGAPGGVRLP STNSGDPESV GDGDAAVGLD  420
DDDDGSTRGS SQTTGGPAGF GKRKSTRQQT FDVLSECMEK HRALVASTME SNSKRQCSIQ  480
IQQCEALEAE VEVQKKHYAA SDEVSKLMCH ALLEIAKAIR ERSGRGRSAG KQAVDASLRD  540
KRGRHVANKQ RKLVQGASAI AIHADAEDWV AEGDDSHVES DFAESEEIPM KRKSSKRQTG  600
ALRIDDIGER RSAGCRGANE QDVNVAARPA TGRAVGSAAT QQNRAPLPRV NEPPASQEVF  660
VRGRTPSTPR QTTATEVGVA AQGIAQGITS RSPARDERRA STVVAARAAD VAKAGAATAG  720
GASQAAEGGR SVGAATVRAS HAVEGAREGG GNAGEGSRAR AVAGVGEDDE ALTNRLPLWF  780
VGAYIEDRHE DDELVAYQEA SVQRLVGAFT SAVSTAEGVD GGRVSHERLK SVAEAMQVML  840
AATMWLMRMG GGDRRAHYNT WVFVQLTAKP TLIASMHRSF DARRHIVQAA TAITDKLAKP  900
PITLLAPPLY IPDWVSIGVK FLARRHFVFP NGGSKAPLVG HRATGRRR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.12e-09Trihelix family protein