PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Rmu_sc0002888.1_g000038.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Rosoideae; Rosoideae incertae sedis; Rosa
Family Trihelix
Protein Properties Length: 563 aa    MW: 64207.5 Da    pI: 6.3234
Description Trihelix family protein
Gene Model
Gene Model ID              Type    Source  Coding Sequence
Rmu_sc0002888.1_g000038.1  genome  GDR     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  100.1  1.8e-31  375    459  1          86
                   trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdq 84 
                                rW+++ev aLi +r+++e+++++  lk+plWe+vs+ m++ g++rs+k+Ckekwen+nk+++k+k++ kkr +++s+tc+yf++
  Rmu_sc0002888.1_g000038.1 375 RWPQSEVEALILVRSNIESKFQEPGLKGPLWEQVSASMASLGYQRSAKRCKEKWENINKYFRKTKDSPKKR-PQQSKTCSYFNK 457
                                8*********************************************************************8.99999******* PP

                   trihelix  85 le 86 
                                l+
  Rmu_sc0002888.1_g000038.1 458 LD 459
                                98 PP

Sequence
Protein Sequence    Length: 563 aa
MQSNFEIADL PSQQHFMDTN SSSPVLPIFF NPQNQNHPSF HRQLEHQQLS HHHLVHPTPI  60
THELFQGLHP QPEQEQSGTL HWQMSPINFK LGLNENSASG EVALLDENSP LFLQSRPQNL  120
GFKSWQPQEF CTRKEPFWKP HETQGMKQKQ ATEGEICKEL ESKYRLYGEL EAIYSLGKIG  180
EVNNKQTGSG SAVTNENSPK NVDLVPVPFG VSHGLNGGPA ATDGVDDGSE ASIDEEVSYR  240
KVQKRKRKRR MKEQLSSSTT RFFEGLVKRV MDHQESLHHK YLALIEKMDR ERREREAAWR  300
LQEAEKHKRK AIAQLHEQAL ASKRESLIVL YIEKITGQKV NIPSREIPSL LQCDYSNGAM  360
EEELTPVKVD QTNSRWPQSE VEALILVRSN IESKFQEPGL KGPLWEQVSA SMASLGYQRS  420
AKRCKEKWEN INKYFRKTKD SPKKRPQQSK TCSYFNKLDQ LYSRTPVTNP SSSSYCSSNP  480
VVSIERQGYS ELLQAVLDGS ETQNLSSGNF EILSEIGSNR LDFDGLTNGK VEHLREDHGK  540
ERENHEDDGS MEEEDIDGID SDE
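
The listing above groups residues in blocks of 10 with a running count at the end of most lines, so it cannot be pasted directly into most tools. The following is a minimal sketch, not part of the database record, of one way to flatten it into a plain string and confirm it matches the stated length of 563 aa. The names `RAW_BLOCK` and `parse_sequence_block` are hypothetical and chosen only for illustration.

```python
import re

# Sequence block copied verbatim from the record above
# (60 residues per line, groups of 10, running count at line end).
RAW_BLOCK = """\
MQSNFEIADL PSQQHFMDTN SSSPVLPIFF NPQNQNHPSF HRQLEHQQLS HHHLVHPTPI  60
THELFQGLHP QPEQEQSGTL HWQMSPINFK LGLNENSASG EVALLDENSP LFLQSRPQNL  120
GFKSWQPQEF CTRKEPFWKP HETQGMKQKQ ATEGEICKEL ESKYRLYGEL EAIYSLGKIG  180
EVNNKQTGSG SAVTNENSPK NVDLVPVPFG VSHGLNGGPA ATDGVDDGSE ASIDEEVSYR  240
KVQKRKRKRR MKEQLSSSTT RFFEGLVKRV MDHQESLHHK YLALIEKMDR ERREREAAWR  300
LQEAEKHKRK AIAQLHEQAL ASKRESLIVL YIEKITGQKV NIPSREIPSL LQCDYSNGAM  360
EEELTPVKVD QTNSRWPQSE VEALILVRSN IESKFQEPGL KGPLWEQVSA SMASLGYQRS  420
AKRCKEKWEN INKYFRKTKD SPKKRPQQSK TCSYFNKLDQ LYSRTPVTNP SSSSYCSSNP  480
VVSIERQGYS ELLQAVLDGS ETQNLSSGNF EILSEIGSNR LDFDGLTNGK VEHLREDHGK  540
ERENHEDDGS MEEEDIDGID SDE
"""


def parse_sequence_block(block: str) -> str:
    """Keep only runs of letters, dropping spaces, newlines and counts."""
    return "".join(re.findall(r"[A-Za-z]+", block)).upper()


sequence = parse_sequence_block(RAW_BLOCK)

# Compare against the length stated in the record.
print(len(sequence))

# Domain coordinates in the record are 1-based and inclusive, so the
# trihelix domain at 375-459 corresponds to the Python slice [374:459].
trihelix_domain = sequence[374:459]
print(trihelix_domain[:7], trihelix_domain[-2:])
```

The slice uses the Start and End values from the Signature Domain table, and its first and last residues can be compared by eye against the alignment rows for this protein.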
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    244    250  KRKRKRR
2    244    251  KRKRKRRM
3    245    250  RKRKRR
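
The three NLS entries overlap on the same basic stretch of the protein. As a hedged illustration, not something the database itself provides, the sketch below shows how the 1-based inclusive Start and End values map onto the sequence. To stay short it uses only residues 241-300 from the sequence listing. The names `SEGMENT`, `SEGMENT_START` and `slice_1based` are hypothetical.

```python
# Residues 241-300 of the protein, copied from the sequence listing.
SEGMENT = "KVQKRKRKRRMKEQLSSSTTRFFEGLVKRVMDHQESLHHKYLALIEKMDRERREREAAWR"
SEGMENT_START = 241  # 1-based position of SEGMENT[0] in the full protein


def slice_1based(segment: str, segment_start: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) from a segment."""
    lo = start - segment_start
    hi = end - segment_start + 1
    if lo < 0 or hi > len(segment) or lo >= hi:
        raise ValueError("requested range falls outside the segment")
    return segment[lo:hi]


# (start, end, sequence) rows as listed in the NLS table.
nls_rows = [
    (244, 250, "KRKRKRR"),
    (244, 251, "KRKRKRRM"),
    (245, 250, "RKRKRR"),
]

# True for a row means the listed peptide sits at the listed coordinates.
checks = [
    slice_1based(SEGMENT, SEGMENT_START, start, end) == listed
    for start, end, listed in nls_rows
]
print(checks)
```

All three rows resolve to the same lysine- and arginine-rich run, which lies well upstream of the trihelix domain at 375-459.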
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G76890.2  2e-40    Trihelix family protein