PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_021641660.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Malpighiales; Euphorbiaceae; Crotonoideae; Micrandreae; Hevea
Family Trihelix
Protein Properties Length: 485 aa    MW: 55315.9 Da    pI: 5.8076
Description Trihelix family protein
Gene Model
Gene Model ID   Type    Source  Coding Sequence
XP_021641660.1  genome  NCBI    View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  77.1   2.7e-24  107    206  1          86
        trihelix   1 rWtkqevlaLiearremeerlrrgk.............lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkr.tsessstcpy 81 
                     +Wt+++v++Li a+ +++++ +++              +kk++W++vs++m+e+gf +sp+qC++k+++lnkryk+++++ +k+ +++++++ ++
  XP_021641660.1 107 KWTDSMVRLLIMAVFYIGDEAGSEGndptgkkkagglsQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGtACKVVENQSL 201
                     7**************99888875422556667777777**********************************************559*******9 PP

        trihelix  82 fdqle 86 
                     ++ ++
  XP_021641660.1 202 LETMD 206
                     99997 PP
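The alignment can be checked against the Start and End columns of the domain table. The sketch below assumes HMMER-style conventions (lowercase letters in the target row are residues inserted relative to the model, and `.` or `-` mark gaps); the helper name `ungapped` is hypothetical and not part of PlantTFDB.

```python
# Target rows copied from the XP_021641660.1 lines of the alignment.
target_rows = [
    "KWTDSMVRLLIMAVFYIGDEAGSEGndptgkkkagglsQKKGKWKSVSRAMMEKGFYVSPQQCEDKFNDLNKRYKRVNDILGKGtACKVVENQSL",
    "LETMD",
]


def ungapped(rows):
    """Join alignment rows, drop gap characters, and uppercase inserted residues."""
    joined = "".join(rows)
    return "".join(ch.upper() for ch in joined if ch not in "-.")


domain = ungapped(target_rows)

# Start and End are 1-based, inclusive coordinates from the domain table.
start, end = 107, 206
span = end - start + 1

print(len(domain), span)
```

If the two printed numbers agree, the aligned residues account for the full reported domain span with no gaps in the target row.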

Sequence
Protein Sequence    Length: 485 aa
MEPNGLPGGI FSDMGSGMLG LEMSLQQQNT QNPQNSPNLH HPQMVAYAHR ESDHHPQQTM  60
KHAYPYASST TRQKPQSTAS DEDEPGFTGD DSTADGKKKV SPWQRMKWTD SMVRLLIMAV  120
FYIGDEAGSE GNDPTGKKKA GGLSQKKGKW KSVSRAMMEK GFYVSPQQCE DKFNDLNKRY  180
KRVNDILGKG TACKVVENQS LLETMDLSTK MKEEVKKLLN SKHLFFREMC AYHNSCGHGS  240
SGVASGNIHS PEVGTDQSHA QHPQSSHAQQ QRCSHSTENA QFVTNSRTET EGSKLAKRVS  300
NEEDDEEDDD DDDESEEDED DYDEEVDEAI EGNSRGQNSH HGHDDEDEHE EKGSRKRRRT  360
EVFSLSSSLM QQLNNELASV IQDGAKSTWE KKHWMKLRLM QLEEQQVSYQ CQALELEKQR  420
LKWVKFSSKK EREMERAKLE NERRRLESER MVLLIRQKEL EFLDLHQQQQ QLSSNKRSEP  480
SSTTG
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    355    359  RKRRR
2    355    360  RKRRRT
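The coordinates in this record are 1-based and inclusive, which is easy to get wrong when slicing in a 0-based language. As a minimal sketch, the numbered sequence block can be flattened and the NLS and domain coordinates checked against it. The function names `parse_numbered_sequence` and `region` are hypothetical helpers, not part of any PlantTFDB tooling.

```python
import re

# Protein sequence block as shown in the record, including spaces and position counters.
RAW = """
MEPNGLPGGI FSDMGSGMLG LEMSLQQQNT QNPQNSPNLH HPQMVAYAHR ESDHHPQQTM  60
KHAYPYASST TRQKPQSTAS DEDEPGFTGD DSTADGKKKV SPWQRMKWTD SMVRLLIMAV  120
FYIGDEAGSE GNDPTGKKKA GGLSQKKGKW KSVSRAMMEK GFYVSPQQCE DKFNDLNKRY  180
KRVNDILGKG TACKVVENQS LLETMDLSTK MKEEVKKLLN SKHLFFREMC AYHNSCGHGS  240
SGVASGNIHS PEVGTDQSHA QHPQSSHAQQ QRCSHSTENA QFVTNSRTET EGSKLAKRVS  300
NEEDDEEDDD DDDESEEDED DYDEEVDEAI EGNSRGQNSH HGHDDEDEHE EKGSRKRRRT  360
EVFSLSSSLM QQLNNELASV IQDGAKSTWE KKHWMKLRLM QLEEQQVSYQ CQALELEKQR  420
LKWVKFSSKK EREMERAKLE NERRRLESER MVLLIRQKEL EFLDLHQQQQ QLSSNKRSEP  480
SSTTG
"""


def parse_numbered_sequence(text):
    """Strip whitespace and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", text).upper()


def region(seq, start, end):
    """Return residues start..end using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


protein = parse_numbered_sequence(RAW)

print(len(protein))
print(region(protein, 355, 359))  # NLS no. 1
print(region(protein, 355, 360))  # NLS no. 2
print(region(protein, 107, 206))  # trihelix signature domain
```

Converting only the start coordinate (`start - 1`) while leaving `end` unchanged is what maps a 1-based inclusive range onto a Python half-open slice.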
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G10040.1  1e-76    Trihelix family protein