PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GBG74820.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 1808 aa    MW: 197160 Da    pI: 5.7288
Description Trihelix family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
GBG74820.1     genome  NCBI    View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End   HMM Start  HMM End
1    trihelix  33     1.5e-10  1542   1614  2          67
    trihelix    2 WtkqevlaLiearremeerlrrgk.......lkkplWeevskkmrergferspkqCkekwenlnkrykkikeg 67  
                  W+ ++  aLi+a+r+ + +l+++        +++ +W +v +++ + g+ r + +C +kw+nl +++kki + 
  GBG74820.1 1542 WSVEHMVALIRAKRDQDAQLQAAGhafawmrSREWKWLDVRERLLKVGVDRPADKCGKKWDNLMQQFKKIHTF 1614
                  99*******************9733444555**************************************9875 PP
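
The domain coordinates can be cross-checked against the alignment itself. The sketch below assumes the HMMER-style convention that "." in the model line and "-" in the sequence line mark gaps. The helper name ungapped_length is illustrative and is not part of PlantTFDB or HMMER.

```python
def ungapped_length(aligned: str) -> int:
    """Count alignment columns that hold a residue rather than a gap."""
    return sum(1 for ch in aligned if ch not in ".-")


# Alignment rows copied from this record (model consensus and query).
hmm_line = "WtkqevlaLiearremeerlrrgk.......lkkplWeevskkmrergferspkqCkekwenlnkrykkikeg"
seq_line = "WSVEHMVALIRAKRDQDAQLQAAGhafawmrSREWKWLDVRERLLKVGVDRPADKCGKKWDNLMQQFKKIHTF"

# Coordinates as reported for the domain hit (1-based, inclusive).
hmm_start, hmm_end = 2, 67
seq_start, seq_end = 1542, 1614

# Both rows must span the same number of alignment columns.
same_width = len(hmm_line) == len(seq_line)

# Each row's residue count should equal the span of its coordinates.
hmm_ok = ungapped_length(hmm_line) == hmm_end - hmm_start + 1
seq_ok = ungapped_length(seq_line) == seq_end - seq_start + 1

print(same_width, hmm_ok, seq_ok)
```

The seven lowercase residues in the query row line up with the seven "." columns in the model row, which is why the query span (73 residues) is longer than the model span (66 positions).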

Sequence
Protein Sequence    Length: 1808 aa
MEGVEAGVAA AARRVHNARC AAITRGRAGR TSRRSWRGGD VWTDGDEVQC DVEEDDNSDV  60
LPLHAPNMGG RGGKGRASTN AGRRRKRSSA TSADAEEKGF NLNMDRVVYE EIKGAKQRSH  120
IISPSNVADT GGPEGVQLAS AHSATPESVG DGDATGDGDD EYDNSARGGS QTTGSPTDFG  180
KRKNVRQQTF EALSEWMEKH GALMASTMES ASRRQCEAME SASKRQCSIQ IRQCEAIEVD  240
VEVQKQHCAA SNEVSKLMCH ALLEITKAIR ERLPQGIFRL WIRVVVVSSS SPSSLSVFSV  300
MTNTRNSGKK KRGRNSQKME AAGVVKRGRH QAKKPKASSG GALVIGSSAQ EEWVTEASSG  360
QDDDFKSEAD TFHGRKRTLR ELSNARMVSG EEGVDTFHER KRTLRELSNA RRGHGVARGK  420
GEAGGDVGGG VAAREERGQP TTPGMRGAVP SKDVQAAQDI GKHPPMQAPS NPATPTAGQA  480
RRDEGARMNA AVRGQLATHI PAKEAAAVGA VVGTSATGAG GGGGDGRDNA XDNDPLINRQ  540
RRSGNLAALE AKAKLWVDDL RFWNETEGHG MFKIIQETRL HLISIAKGVK PSEIRKSIVL  600
PNATILQQKL EDISELGAAK ERASRVQTIA LRIIHGWIFK SPNRARGYHC SYGYVLNHIA  660
TDLARAIWCG EDWRVGVSPA VVHNTLELHM XLPIWYVGGV IHDRHEDDEM ASYQESTTQR  720
LVGAFTNVVD MGEGVDGGRI SYERLRNVAD CMRLLLSAAM WIMRMAGDNL RSHYEASHLV  780
ELIAKPTLIA SLHASFDARC HVLQCVNVVT EKMGKPPMTL VDPPVYIPEW VSCGVSFHND  840
ATLTSPELAR RLDWLGPGPA EIDDEDDEKD GGGESDVDDD DVDDDDDNGD DDDGDDNDDD  900
DDDRDDDDCD DDGDDDDGDD EDDDDDDDED DDDDVHDQDD DDDDDDDDDD DEVPFQIRHP  960
YAVDQVQPDG HQTKDHRPNT RPAAAYNIIR LPAASPRSIL GDRPEGRVER MRENPRAAPR  1020
ISPRSPRTPR GVSNEPLRGS LDTPWALKNP EELIVQSVLG ECSLEIDVKT FPRGVVEASP  1080
GYRHLAVFPK VPREVPRSGS FRENRDGDGA LRRRHNFVVW YIAGPAKVRA DISGRQHSTP  1140
AGTIATVRLG GGRGPRPFPV YPSPSPAPAG NGKASNGQPI RKSQRRTPVN ARSGPQCKEA  1200
EGVASRAHAR LLFVVVWVDV CILGRLPSRT RRKRGVRKSM AMDERRPNLA TLTDAYPSHR  1260
PAICTGVDRR AAGPSPYEGL APHMQPLPDS DEGEADVDVS NTVPLGSGST QEWTGSQLYD  1320
RCGAKYKQSF TSLLHEGAQD EERLPPVDLT FGLRSGNPSS ATRTVLVNPH PNDDAGQVTL  1380
VGRGTRGGSA PQGTSEKTRE RPRLGGDNVP RPVEQGVTEE DFPFEGVHGD GRRVWKESRQ  1440
ELRRQQEESI TQGVQRLHVG ERAGQADAVG GGGDVWTDGD EVQCDVEEDD SGDVFPLHAP  1500
NMGGRGGRGR ASTNVGRRRK RPSATSADAE GEGDREGGHN FWSVEHMVAL IRAKRDQDAQ  1560
LQAAGHAFAW MRSREWKWLD VRERLLKVGV DRPADKCGKK WDNLMQQFKK IHTFMGMSGK  1620
EDFFQLTSQQ RAEKRFNFNM DRAVYEEIEG AKERSHTISP SNVADTGGAG GVQLPSAQSA  1680
ALKSMGDRDA FGDGNDEDNS SACGGSQTTG SPAGFGKRKN VGQQTFEALS ECMEKHGALM  1740
ASTMESTSRR QCEAMESASK RQCSIQIRQC EAIEAEVEVQ KQHRAASNEV SKLMCHALLE  1800
IAKAIRER
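
The listing above is formatted in blocks of ten residues with a running count at the end of each line, so it has to be flattened before coordinates can be applied to it. The following is a minimal sketch of that step; the function names parse_numbered_sequence and slice_1based, and the offset parameter, are hypothetical helpers introduced here for illustration. It uses only the line covering residues 301-360 and extracts the span 309-336, which this record reports as the localization signal.

```python
import re


def parse_numbered_sequence(text: str) -> str:
    """Flatten a block-formatted listing into one residue string.

    Everything that is not a letter (spaces, newlines, running counts)
    is dropped. Ambiguity codes such as X are letters and are kept.
    """
    return re.sub(r"[^A-Za-z]", "", text).upper()


def slice_1based(seq: str, start: int, end: int, offset: int = 0) -> str:
    """Return residues start..end (1-based, inclusive).

    offset is the number of residues that precede seq in the full
    protein, so an excerpt can be addressed with full-length coordinates.
    """
    if not (offset < start <= end <= offset + len(seq)):
        raise ValueError("coordinates fall outside the supplied sequence")
    return seq[start - 1 - offset : end - offset]


# Residues 301-360 of GBG74820.1, copied from the listing above.
excerpt = "MTNTRNSGKK KRGRNSQKME AAGVVKRGRH QAKKPKASSG GALVIGSSAQ EEWVTEASSG  360"

residues = parse_numbered_sequence(excerpt)
nls = slice_1based(residues, 309, 336, offset=300)
print(nls)  # KKKRGRNSQKMEAAGVVKRGRHQAKKPK
```

Passing the whole listing to parse_numbered_sequence with offset left at 0 would allow the same call to retrieve the trihelix domain at 1542-1614.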
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    309    336  KKKRGRNSQKMEAAGVVKRGRHQAKKPK
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G35640.1  8e-07    Trihelix family protein