PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID AUR62020649-RA
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; Caryophyllales; Chenopodiaceae; Chenopodioideae; Atripliceae; Chenopodium
Family Trihelix
Protein Properties Length: 274aa    MW: 30358.3 Da    PI: 4.8725
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
AUR62020649-RAgenomePhytozomeView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix37.56e-1229104279
        trihelix   2 WtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstc 79 
                     Wt q  l L++    +e + +++ +  ++W  +s+ + + g+ ++++qC+ kw++l  +y  +ke++ +  s+s s +
  AUR62020649-RA  29 WTAQDMLILVNEIAAVEGDWKNSLSTHQKWTIISETCTALGVGKTANQCRRKWQSLLADYNLVKEWKLN--SGSDSYW 104
                     ******************************************************************987..3444455 PP

2Myb_DNA-binding21.65.1e-072985448
                     S-HHHHHHHHHHHHHTTTT.........-HHHHHHHHT...TTS-HHHHHHHHHHHT CS
  Myb_DNA-binding  4 WTteEdellvdavkqlGgg.........tWktIartmg...kgRtlkqcksrwqkyl 48
                     WT+++ ++lv+ ++   g+          W+ I++t      g+t++qc+ +wq++l
   AUR62020649-RA 29 WTAQDMLILVNEIAAVEGDwknslsthqKWTIISETCTalgVGKTANQCRRKWQSLL 85
                     *****************99*****************99999*************986 PP

Sequence ? help Back to Top
Protein Sequence    Length: 274 aa     Download sequence    
MREADVGQLA ESISGSRCTR SHLQAAAVWT AQDMLILVNE IAAVEGDWKN SLSTHQKWTI  60
ISETCTALGV GKTANQCRRK WQSLLADYNL VKEWKLNSGS DSYWSLGCDR RKEAGLPLEF  120
DKDLYKAMED FMKASDRADT DPECDPDAGE VDVLDAEESG PKPKKRRRRS IPKKRLSEGR  180
KQPATEVEHV IPEPPVAEDM EQILAAKLME NTELIHAILA GNVTENVDED LTSVRNEESL  240
KTENVIRKAD KLVACLGDLA KNLEHLCGAV EKG*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1161167PKPKKRR
2164169KKRRRR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G31270.11e-44Trihelix family protein