PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cse_sc020573.1_g030.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae; Artemisiinae; Chrysanthemum
Family HSF
Protein Properties Length: 358aa    MW: 41397.1 Da    PI: 5.0697
Description HSF family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cse_sc020573.1_g030.1genomeKazusaView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1HSF_DNA-bind110.31.4e-34141052102
                            HHHHHHHHHCTGGGTTTSEESSSSSEEEES-HHHHHHHTHHHHSTT--HHHHHHHHHHTTEEE---SSBTTTTXTTSEEEEESXXXXX CS
           HSF_DNA-bind   2 FlkklyeiledeelkeliswsengnsfvvldeeefakkvLpkyFkhsnfaSFvRQLnmYgFkkvkdeekkskskekiweFkhksFkkg 89 
                            F+ k+ye+++d+ +++++sws+n++sfvv+++ ef+ ++Lp++Fkh+nf+SF+RQLn+YgF+k++ e+         weF+++ F kg
  Cse_sc020573.1_g030.1  14 FIVKTYEMVDDPLTDHIVSWSHNERSFVVWNPPEFSGELLPRFFKHNNFSSFIRQLNTYGFRKIDPEQ---------WEFANEDFVKG 92 
                            9***************************************************************9998.........*********** PP

                            XXXXXXXXXXXXX CS
           HSF_DNA-bind  90 kkellekikrkks 102
                            + +ll++i+rkk 
  Cse_sc020573.1_g030.1  93 QPHLLRNIHRKKP 105
                            **********985 PP

Sequence ? help Back to Top
Protein Sequence    Length: 358 aa     Download sequence    
MNDVQGNVNN LPPFIVKTYE MVDDPLTDHI VSWSHNERSF VVWNPPEFSG ELLPRFFKHN  60
NFSSFIRQLN TYGFRKIDPE QWEFANEDFV KGQPHLLRNI HRKKPVHSHS MHNINMNGAS  120
SSSPLTESER IQYKKDLYRL RYEKESLSLE FQKVQQVQEE IDLAARALTE RLKSARKQQI  180
DFLCALDNTL QKPAQFLETN DRKRRLLAET SDQASSFNVP LSDAFSTDTL LDFDTELVEK  240
LESSVTFWED ILTEARGAFV QQVPQPVVYH EPPISPHMDS EPSKTVETNE VRESGNVGHV  300
QAGVNDGFWE QFFTETPGGS MTDNDRKGFD QYGKFWWNMR SVNSLADQMG QLAQSERT
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1201206DRKRRL
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G18880.12e-85HSF family protein