PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID RcHm_v2.0_Chr2g0105601
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Rosoideae; Rosoideae incertae sedis; Rosa
Family HSF
Protein Properties Length: 297aa    MW: 32701.4 Da    PI: 6.5391
Description HSF family protein
Gene Model
Gene Model ID Type Source Coding Sequence
RcHm_v2.0_Chr2g0105601genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1HSF_DNA-bind113.91.1e-35151062102
                             HHHHHHHHHCTGGGTTTSEESSSSSEEEES-HHHHHHHTHHHHSTT--HHHHHHHHHHTTEEE---SSBTTTTXTTSEEEEESXXXX CS
            HSF_DNA-bind   2 FlkklyeiledeelkeliswsengnsfvvldeeefakkvLpkyFkhsnfaSFvRQLnmYgFkkvkdeekkskskekiweFkhksFkk 88 
                             Fl+k++++++d++ +++isw+e+g++fvv+++ +fa+++Lp+yFkh+nf+SFvRQLn+YgF+k+  ++         weF++++F++
  RcHm_v2.0_Chr2g0105601  15 FLMKTFQLVDDPASDDVISWNESGTAFVVWKTVDFARDMLPNYFKHNNFSSFVRQLNTYGFRKTAPDK---------WEFANDNFRR 92 
                             9***************************************************************9999.........********** PP

                             XXXXXXXXXXXXXX CS
            HSF_DNA-bind  89 gkkellekikrkks 102
                             g+kell++i+r+ks
  RcHm_v2.0_Chr2g0105601  93 GEKELLSEIRRRKS 106
                             ************97 PP

Sequence ? help Back to Top
Protein Sequence    Length: 297 aa     Download sequence    
MQMIKMAQRS VPAPFLMKTF QLVDDPASDD VISWNESGTA FVVWKTVDFA RDMLPNYFKH  60
NNFSSFVRQL NTYGFRKTAP DKWEFANDNF RRGEKELLSE IRRRKSVSSS PAQANAAEKS  120
GGDAPSTPSN SGEDLASTST SSPDSKNPGS VETAAAVCQA VDLSGENEKL KKDNETLSSE  180
LAQTKKQCDE LVAFLMEYMK VGPDQINQIM RQGSLGSSCD VDDHGRGHSE NDNDEKVVAG  240
GEEGLKLFGV WLKGNENEEK KKRREEKSGG GGGGPQAKKM KMAEFHAPLR KSGKVYN
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1260264KKKRR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G36990.13e-89HSF family protein