PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_021672615.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Malpighiales; Euphorbiaceae; Crotonoideae; Micrandreae; Hevea
Family Trihelix
Protein Properties Length: 624 aa    MW: 71873.8 Da    pI: 6.5771
Description Trihelix family protein
Gene Model
Gene Model ID     Type      Source    Coding Sequence
XP_021672615.1    genome    NCBI      View CDS
Signature Domain
No.    Domain      Score    E-value    Start    End    HMM Start    HMM End
1      trihelix    101.5    6.7e-32    433      517    1            86
        trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                     rW++ ev aLi++r+++e+++++  lk+plWeevs+ m++ g++rs+k+Ckekwen+nk+++k ke+ kkr s++s+tc+yfdql+
  XP_021672615.1 433 RWPEAEVEALIQVRSSVETKFQEPGLKGPLWEEVSSLMASMGYQRSAKRCKEKWENINKYFRKAKESTKKR-SQQSKTCSYFDQLD 517
                     8*********************************************************************8.78889********8 PP

Sequence
Protein Sequence    Length: 624 aa
MQPSYGVPDI HRQQQNHHHL QQFIENDDCC SSVLPISNPS QNLNHPYQPH LPQQKQPEYI  60
FLQQQQNSIP ILHQLFEHQH QPQQQDFRQF QSQEERLYSQ IRHQHQQVQA QPPHPPFFSV  120
KFKLGLDQNG GNKECALNQR EATDFLNGNE HNPPHVPPVM PHCWHPQEDS TSIKEPFWKP  180
LSRSKNKQQD ENGEQEVQRK KNEHCKFLDT QQIDERNERC KDLENKYRLF GELEAIYSLA  240
KVGEANQTGS GSALTGETSP TNAGLSVPFN AVHGQNVGAG NAGNGIDHGS ENSIGEEASL  300
RKSQKRIRKR KMKKKLSTMA EFFENLVKQV MDHQEILHRN FLEVIERMDK ERTKREEAWR  360
CQEAEKYNRE AVSRAHEQAL ASRREAQIVS YVEKITGQSI DLPARKTPLL LQPEIPEEPT  420
KRLTPIITDS HSRWPEAEVE ALIQVRSSVE TKFQEPGLKG PLWEEVSSLM ASMGYQRSAK  480
RCKEKWENIN KYFRKAKEST KKRSQQSKTC SYFDQLDQLY SRTFINSPFN NSSSNGIEVE  540
KQGHSELLEA FIAGKDIATS INPSSGNVII ADMGSSRLEF GGIINEKVER GSHEQEKENH  600
DDYYDEKGEE DRSIDSDEEI GNFR
Nuclear Localization Signal
NLS
No.    Start    End    Sequence
1      301      314    RKSQKRIRKRKMKK
2      301      316    RKSQKRIRKRKMKKKL
3      305      312    KRIRKRKM
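
The Start and End columns in the NLS and Signature Domain tables can be checked against the protein sequence listed in this record. The following is a minimal sketch, not part of PlantTFDB itself. It assumes the coordinates are 1-based and inclusive, and the helper names parse_sequence_block and region are hypothetical, introduced only for illustration.

```python
# Sequence block copied from this record. The trailing numbers are
# position counters, not residues, and are stripped by the parser below.
RAW_BLOCK = """
MQPSYGVPDI HRQQQNHHHL QQFIENDDCC SSVLPISNPS QNLNHPYQPH LPQQKQPEYI  60
FLQQQQNSIP ILHQLFEHQH QPQQQDFRQF QSQEERLYSQ IRHQHQQVQA QPPHPPFFSV  120
KFKLGLDQNG GNKECALNQR EATDFLNGNE HNPPHVPPVM PHCWHPQEDS TSIKEPFWKP  180
LSRSKNKQQD ENGEQEVQRK KNEHCKFLDT QQIDERNERC KDLENKYRLF GELEAIYSLA  240
KVGEANQTGS GSALTGETSP TNAGLSVPFN AVHGQNVGAG NAGNGIDHGS ENSIGEEASL  300
RKSQKRIRKR KMKKKLSTMA EFFENLVKQV MDHQEILHRN FLEVIERMDK ERTKREEAWR  360
CQEAEKYNRE AVSRAHEQAL ASRREAQIVS YVEKITGQSI DLPARKTPLL LQPEIPEEPT  420
KRLTPIITDS HSRWPEAEVE ALIQVRSSVE TKFQEPGLKG PLWEEVSSLM ASMGYQRSAK  480
RCKEKWENIN KYFRKAKEST KKRSQQSKTC SYFDQLDQLY SRTFINSPFN NSSSNGIEVE  540
KQGHSELLEA FIAGKDIATS INPSSGNVII ADMGSSRLEF GGIINEKVER GSHEQEKENH  600
DDYYDEKGEE DRSIDSDEEI GNFR
"""


def parse_sequence_block(block: str) -> str:
    """Join the residue groups, dropping purely numeric position counters."""
    return "".join(tok for tok in block.split() if not tok.isdigit())


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end (assumed 1-based, inclusive coordinates)."""
    return seq[start - 1:end]


sequence = parse_sequence_block(RAW_BLOCK)

# (start, end, expected sequence) rows taken from the NLS table.
nls_rows = [
    (301, 314, "RKSQKRIRKRKMKK"),
    (301, 316, "RKSQKRIRKRKMKKKL"),
    (305, 312, "KRIRKRKM"),
]

print("length:", len(sequence))
for start, end, expected in nls_rows:
    print(start, end, region(sequence, start, end) == expected)

# Signature domain span from the domain table (433 to 517).
print("trihelix domain:", region(sequence, 433, 517))
```

Under the 1-based inclusive assumption, each NLS row should compare equal to the slice of the sequence, and the domain slice should match the XP_021672615.1 row of the alignment.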
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT1G76890.2    1e-50      Trihelix family protein