PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID RcHm_v2.0_Chr6g0290041
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Rosoideae; Rosoideae incertae sedis; Rosa
Family HSF
Protein Properties Length: 308 aa    MW: 35192.1 Da    pI: 9.6541
Description HSF family protein
Gene Model
Gene Model ID             Type      Source    Coding Sequence
RcHm_v2.0_Chr6g0290041    genome    GDR       View CDS
Signature Domain
No.    Domain          Score    E-value    Start    End    HMM Start    HMM End
1      HSF_DNA-bind    114.5    7e-36      45       136    2            102
                             HHHHHHHHHCTGGGTTTSEESSSSSEEEES-HHHHHHHTHHHHSTT--HHHHHHHHHHTTEEE---SSBTTTTXTTSEEEEESXXXX CS
            HSF_DNA-bind   2 FlkklyeiledeelkeliswsengnsfvvldeeefakkvLpkyFkhsnfaSFvRQLnmYgFkkvkdeekkskskekiweFkhksFkk 88 
                             Fl+k+y++++d+++++++sws+ g sfvv+d+++f  ++Lp+yFkh+nf+SFvRQLn+YgF+kv+ ++         weF+++ F +
  RcHm_v2.0_Chr6g0290041  45 FLNKTYDMVDDPSTNRIVSWSRGGGSFVVWDPHSFVMNLLPRYFKHNNFSSFVRQLNTYGFRKVDPDR---------WEFANEGFLR 122
                             9****************************************************************999.........********** PP

                             XXXXXXXXXXXXXX CS
            HSF_DNA-bind  89 gkkellekikrkks 102
                             g+k+ll++i+r+k+
  RcHm_v2.0_Chr6g0290041 123 GQKHLLKNIRRRKT 136
                             ************86 PP

Sequence
Protein Sequence    Length: 308 aa
MNYLYPVKEE YPGGSSSSQR SGDPLMTVAP PQPMEGLNDT GPPPFLNKTY DMVDDPSTNR  60
IVSWSRGGGS FVVWDPHSFV MNLLPRYFKH NNFSSFVRQL NTYGFRKVDP DRWEFANEGF  120
LRGQKHLLKN IRRRKTPSQP IPAHQALGPC VEVGRFGLDG EVDRLRRDKQ VLMMELVKLR  180
QQQQGTRACL QAMEQRLKAT EMKQQQMMAF LARALQNPAF LQQLVQHKEK RKELEEAVTK  240
KRRRPIDQGP SGFGGGETNQ RGKGINPVKA EPLEFGDGEY EMSELEALAM EMQGFHKARK  300
EPEEEILG
Nuclear Localization Signal
NLS
No.    Start    End    Sequence
1      230      244    KRKELEEAVTKKRRR
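
The Start and End values in this record read as 1-based, inclusive positions in the protein sequence (domain 45 to 136, NLS 230 to 244). The following is a minimal sketch, not part of PlantTFDB itself, that slices the sequence by those coordinates to check them. The helper name `region` is hypothetical and introduced here only for illustration.

```python
# Protein sequence copied from this record; whitespace and position
# counters from the page layout are omitted.
RAW_SEQUENCE = """
MNYLYPVKEE YPGGSSSSQR SGDPLMTVAP PQPMEGLNDT GPPPFLNKTY DMVDDPSTNR
IVSWSRGGGS FVVWDPHSFV MNLLPRYFKH NNFSSFVRQL NTYGFRKVDP DRWEFANEGF
LRGQKHLLKN IRRRKTPSQP IPAHQALGPC VEVGRFGLDG EVDRLRRDKQ VLMMELVKLR
QQQQGTRACL QAMEQRLKAT EMKQQQMMAF LARALQNPAF LQQLVQHKEK RKELEEAVTK
KRRRPIDQGP SGFGGGETNQ RGKGINPVKA EPLEFGDGEY EMSELEALAM EMQGFHKARK
EPEEEILG
"""

# Join the 10-residue blocks into one contiguous string.
SEQUENCE = "".join(RAW_SEQUENCE.split())


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError(f"coordinates {start}-{end} outside 1-{len(seq)}")
    return seq[start - 1:end]


if __name__ == "__main__":
    print(len(SEQUENCE))               # sequence length
    print(region(SEQUENCE, 230, 244))  # NLS listed in the record
    print(region(SEQUENCE, 45, 136))   # HSF_DNA-bind domain span
```

Under that coordinate assumption, the 230 to 244 slice reproduces the NLS sequence listed in the table, and the 45 to 136 slice begins and ends with the same residues as the query rows of the domain alignment.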
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT3G22830.1    1e-117     HSF family protein