PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Araha.11451s0008.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis
Family Trihelix
Protein Properties Length: 552aa    MW: 62483.6 Da    PI: 7.6527
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Araha.11451s0008.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix91.31e-281094187
              trihelix  1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87
                          rW++qe+laL+++r++m+ ++r+++ k+plWeevs+km+e g+ r++k+Ckek+en+ k++k++keg+ ++++++  t+++fdqlea
  Araha.11451s0008.1.p 10 RWPRQETLALLKIRSDMGIAFRDASVKGPLWEEVSRKMAELGYIRNAKKCKEKFENVYKYHKRTKEGRTGKSEGK--TYRFFDQLEA 94
                          8********************************************************************975544..6*******85 PP

2trihelix105.43.9e-33352437187
              trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                           rW+k e+ aLi++r++++++++++  k+plWee+s+ mr+ gf+r++k+Ckekwen+nk++kk+ke++kkr +e+s+tcpyf+ql+a
  Araha.11451s0008.1.p 352 RWPKVEIEALIKLRTNLDSKYQENGPKGPLWEEISAGMRRLGFNRNSKRCKEKWENINKYFKKVKESNKKR-PEDSKTCPYFHQLDA 437
                           8*********************************************************************8.99***********85 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM007170.12769IPR001005SANT/Myb domain
PfamPF138375.2E-18995No hitNo description
CDDcd122033.88E-22974No hitNo description
PROSITE profilePS500906.957967IPR017877Myb-like domain
PROSITE profilePS500907.201345409IPR017877Myb-like domain
Gene3DG3DSA:1.10.10.604.6E-4349408IPR009057Homeodomain-like
SMARTSM007170.0017349411IPR001005SANT/Myb domain
PfamPF138372.9E-22351438No hitNo description
CDDcd122035.20E-27351416No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0010192Biological Processmucilage biosynthetic process
GO:0044212Molecular Functiontranscription regulatory region DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 552 aa     Download sequence    Send to blast
MNDRGFGGNR WPRQETLALL KIRSDMGIAF RDASVKGPLW EEVSRKMAEL GYIRNAKKCK  60
EKFENVYKYH KRTKEGRTGK SEGKTYRFFD QLEALESQST TSLHHPQPLQ PRPPQNNNSI  120
FSTPPPVTTV MPAVANMSTL PSSSIPPYTQ QINVPSFPNI SGDFLSDNST SSSSSYSTSS  180
DMEIGGGTTT TRKKRKRKWK EFFERLMKQV VDKQEELQRK FLEAVEKREH ERLVREESWR  240
VQEIARINRE HEILAQERSM SAAKDAAVMA FLQKLSEKQP NQPTAAQPQP QQVRPQMQLN  300
NNNNQQQMPQ PSPPPPPPPL PPAIQAVVPT LDTTKTDNGD QNMTPASASS SRWPKVEIEA  360
LIKLRTNLDS KYQENGPKGP LWEEISAGMR RLGFNRNSKR CKEKWENINK YFKKVKESNK  420
KRPEDSKTCP YFHQLDALYR ERNKFHSNNV NIAAASSSAS GLVKPDNSVP LMVQPEQQWP  480
PAVTTATTTA AVAAAQPDQH PQPSDQNFDD EEGTDEEYDD GEEDEENEEE EGGEFELVPS  540
NNNNNKTTNN L*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1191196RKKRKR
2191197RKKRKRK
3192197KKRKRK
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor that binds specific DNA sequence. {ECO:0000250}.
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00243DAPTransfer from AT1G76880Download
Motif logo
Cis-element ? help Back to Top
SourceLink
PlantRegMapAraha.11451s0008.1.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAC0792830.0AC079283.4 Arabidopsis thaliana chromosome 1 BAC F7O12 genomic sequence, complete sequence.
GenBankCP0026840.0CP002684.1 Arabidopsis thaliana chromosome 1 sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_002887660.10.0trihelix transcription factor GT-2
SwissprotQ391171e-150TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBLD7KTX90.0D7KTX9_ARALL; Uncharacterized protein
STRINGscaffold_202643.10.0(Arabidopsis lyrata)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM48492553
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76880.10.0Trihelix family protein