PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cse_sc005590.1_g020.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae; Artemisiinae; Chrysanthemum
Family HSF
Protein Properties Length: 330aa    MW: 38249.6 Da    PI: 4.8766
Description HSF family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cse_sc005590.1_g020.1genomeKazusaView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1HSF_DNA-bind114.56.8e-36401312102
                            HHHHHHHHHCTGGGTTTSEESSSSSEEEES-HHHHHHHTHHHHSTT--HHHHHHHHHHTTEEE---SSBTTTTXTTSEEEEESXXXXX CS
           HSF_DNA-bind   2 FlkklyeiledeelkeliswsengnsfvvldeeefakkvLpkyFkhsnfaSFvRQLnmYgFkkvkdeekkskskekiweFkhksFkkg 89 
                            Fl+k+y++++d+++++++sw++ g+sfvv+d++ f++++Lp+yFkh+nf+SF+RQLn+YgFkk++++          weF+++ F kg
  Cse_sc005590.1_g020.1  40 FLTKVYDMVDDSNYDHILSWNRGGQSFVVWDPQAFSTNLLPRYFKHNNFSSFIRQLNTYGFKKIDSDI---------WEFANEAFLKG 118
                            9***************************************************************9876.........*********** PP

                            XXXXXXXXXXXXX CS
           HSF_DNA-bind  90 kkellekikrkks 102
                            +++ l++ikr+k 
  Cse_sc005590.1_g020.1 119 QRHILKNIKRRKA 131
                            **********986 PP

Sequence ? help Back to Top
Protein Sequence    Length: 330 aa     Download sequence    
MDNKFSIVKE EYPSAGTSGG SGGQLPQPME GLHEAGPPPF LTKVYDMVDD SNYDHILSWN  60
RGGQSFVVWD PQAFSTNLLP RYFKHNNFSS FIRQLNTYGF KKIDSDIWEF ANEAFLKGQR  120
HILKNIKRRK APSNSPPQQQ ANNTCVEVGR FGLDGELERL QRDKQVLMME LVKLRQQQQN  180
TRAHLQEMEL RLQGTEKKQQ KTMSFLAKAL QNPEFVQKLS RQYERKELEA MIKKRPRPID  240
QGPSQPYPGE SSRIDENFED LPGFQVSELE ELAREMQGIG RSKRIQEEEN KEDNDFDEEF  300
WEELFDEQFG TTGNEGDEAD RLDFLGSNEK
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1123129LKNIKRR
2234238KRPRP
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G22830.11e-107HSF family protein