PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG76385.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 886aa    MW: 97829.4 Da    PI: 8.0507
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG76385.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix25.34e-08603676470
    trihelix   4 kqevlaLiearremeerlrrgk.......lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekk 70 
                 ++e+++Li+   e++ +  +++       lk+ lW +v k +re  + rs ++Ck+ w  + + y+++k+++k 
  GBG76385.1 603 EEETMKLIRCWYEVKATHDDDSeawgvakLKQRLWPDVEKLVREASYDRSDEECKNWWHFVLNNYRAVKDHDKW 676
                 78999999998877666665422333333*****************************************9986 PP

Sequence ? help Back to Top
Protein Sequence    Length: 886 aa     Download sequence    
MAGYPPRDNG CYQCGAPNNW IRDYPQCAFW PVATGANAIP TSTPILALLP VNNSSAASSA  60
ANRAPASSNQ HQSSSVAASN HGQNRPNWWT RNQEKPDLCY SKVMEDCEKE AKNREEERLR  120
IEREEEGKRL KWKGELEKFE ADMGLCLEKK LDELGAFIKG KSTSEVPLTN GSEDELTKLR  180
RENGDLKAKI NSFLNKSGDD KVMYLPQEIM ELRKQVTSKQ ASEDAIFALK LEVNEMKRLS  240
NSKLDLEKEV ANLKKEISSL QGRKDRVTAE ANQWKDEALR SGNKRGSVAV CTPDGPARGT  300
PKPRWTDTMR DADKWREEYR NLRNLHQLAN VEAEALKKKR AEAEAKRMEV ELQVKKLEEK  360
MSKLTASGEK GGKGGGTNLK DRMEEVALRS ARKGVKVTPG RLAGRSPSSG ASQEVAEVND  420
HASFVDGEKN KLRLLRKAGL EPLCKEAGIK LGRFEDTICE LAEYRAKIRL GSSFGEGGDA  480
KEACSFVEVD DDSSKEVCEH VTAVTAGGTA QAGARSHDHE LQAPPCRPTT GGDAAVAAEV  540
QALECSPQTM GESGGTSTLA PLARGGGVRS GHEAGRGQTS RGQPARGRAG RGRGKRPHPR  600
FGEEETMKLI RCWYEVKATH DDDSEAWGVA KLKQRLWPDV EKLVREASYD RSDEECKNWW  660
HFVLNNYRAV KDHDKWSGKQ KYFTMNETER KRWVLDFLMR REWYDFIDQH EKDKDAINMN  720
DITDLGAETE NVGVGDGGAD KSAEAGKGGD DGAGSGTSSQ PQDNRTNTAG PSDCARSNAV  780
QGNMAAKRRR GGSNAREVAM GAICGAMRDH TTAQVRSDKE HHAIMREICE KKIAAQERIA  840
KMTCEAMEKD TAARDLRAER MATETRAGYS LLADVIKSLA DRNATQ
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1337360KKKRAEAEAKRMEVELQVKKLEEK