PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Csa09g082580.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family Trihelix
Protein Properties Length: 617 aa    MW: 69123.4 Da    pI: 6.3256
Description Trihelix family protein
Gene Model
Gene Model ID     Type      Source    Coding Sequence
Csa09g082580.1    genome    CSGP      View CDS
Signature Domain
No.    Domain      Score    E-value    Start    End    HMM Start    HMM End
1      trihelix    90.5     1.8e-28    71       155    1            87
        trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                     rW++qe+laL+++r++m+ ++r+++ k+plWeevs+km e g+ r++k+Ckek+en+ k++k++keg+ +++  + +t+++fdqlea
  Csa09g082580.1  71 RWPRQETLALLKIRSDMGIAFRDASVKGPLWEEVSRKMGELGYIRNAKKCKEKFENVYKYHKRTKEGRTGKS--DGKTYRFFDQLEA 155
                     8********************************************************************964..5557*******85 PP

2      trihelix    105.2    4.6e-33    417      502    1            87
        trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                     rW+k e+ aLi++r++++++++++  k+plWee+s+ mr+ gf+r++k+Ckekwen+nk++kk+ke++kkr +e+s+tcpyf+ql+a
  Csa09g082580.1 417 RWPKVEIEALIKLRTNLDSKYQENGPKGPLWEEISAGMRRLGFNRNSKRCKEKWENINKYFKKVKESNKKR-PEDSKTCPYFHQLDA 502
                     8*********************************************************************8.99***********85 PP

Protein Features
3D Structure
Database           Entry ID            E-value     Start    End    InterPro ID    Description
SMART              SM00717             0.032       68       130    IPR001005      SANT/Myb domain
Pfam               PF13837             3.9E-17     70       156    No hit         No description
PROSITE profile    PS50090             6.957       70       128    IPR017877      Myb-like domain
CDD                cd12203             7.95E-21    70       135    No hit         No description
PROSITE profile    PS50090             7.271       410      474    IPR017877      Myb-like domain
SMART              SM00717             0.0017      414      476    IPR001005      SANT/Myb domain
Gene3D             G3DSA:1.10.10.60    6.6E-4      415      473    IPR009057      Homeodomain-like
Pfam               PF13837             3.5E-22     416      503    No hit         No description
CDD                cd12203             1.32E-25    417      481    No hit         No description
Gene Ontology
GO Term       GO Category           GO Description
GO:0010192    Biological Process    mucilage biosynthetic process
GO:0044212    Molecular Function    transcription regulatory region DNA binding
Sequence
Protein Sequence    Length: 617 aa
MMQLGGGTPS TTTTTTAAAS TATPPPPPPQ PQPQPQPQSN DSAATEAAAA AAVGAFEVSE  60
EMNDRGFGGN RWPRQETLAL LKIRSDMGIA FRDASVKGPL WEEVSRKMGE LGYIRNAKKC  120
KEKFENVYKY HKRTKEGRTG KSDGKTYRFF DQLEALETHH HHHHQHQPPL RPHHHQTNNV  180
NSIFSTPPPI TTVMPPSAPS SSVPPYTQQL NVPSSFPTDF LSDNSTSSSS SSYSTSSDMD  240
TGGGGTTTTT TNRKRKRKWK EFFERLMKQV VDKQEELQRK FLEAVEKREH ERLVREESWR  300
VQEIARINRD QEILAQERSM SAAKDAAVMA FLQKLSEKQQ PNQPSATTVQ PQPQPQQVRP  360
QMQQTPQPSP PAPPPLLQPI QAVLPSSSET RKTDNGDQSM MMTPASASAS GSASSSRWPK  420
VEIEALIKLR TNLDSKYQEN GPKGPLWEEI SAGMRRLGFN RNSKRCKEKW ENINKYFKKV  480
KESNKKRPED SKTCPYFHQL DALYRERNKL HHNNSIASSS SASGLVKPEN SVPLMVQPEQ  540
QWPPAPVTTT ETTTAVAVVQ QQHPQPPSDQ NYDDEEGTDE EYDDDEDEED EENEEEEEGG  600
EFELVPSNNN KTTNDL*
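
The Start and End values in the Signature Domain table can be checked against the sequence block above. The following is a minimal Python sketch, assuming those coordinates are 1-based and inclusive; the helper names (`parse_numbered_sequence`, `slice_1based`) are illustrative and not part of PlantTFDB. As printed, the block holds 616 residue letters plus a terminal `*`, which together give the 617 reported as the length.

```python
import re

# Protein sequence block copied from this record. The numbers at the end of
# each line are running position counters, not part of the sequence.
RAW = """\
MMQLGGGTPS TTTTTTAAAS TATPPPPPPQ PQPQPQPQSN DSAATEAAAA AAVGAFEVSE  60
EMNDRGFGGN RWPRQETLAL LKIRSDMGIA FRDASVKGPL WEEVSRKMGE LGYIRNAKKC  120
KEKFENVYKY HKRTKEGRTG KSDGKTYRFF DQLEALETHH HHHHQHQPPL RPHHHQTNNV  180
NSIFSTPPPI TTVMPPSAPS SSVPPYTQQL NVPSSFPTDF LSDNSTSSSS SSYSTSSDMD  240
TGGGGTTTTT TNRKRKRKWK EFFERLMKQV VDKQEELQRK FLEAVEKREH ERLVREESWR  300
VQEIARINRD QEILAQERSM SAAKDAAVMA FLQKLSEKQQ PNQPSATTVQ PQPQPQQVRP  360
QMQQTPQPSP PAPPPLLQPI QAVLPSSSET RKTDNGDQSM MMTPASASAS GSASSSRWPK  420
VEIEALIKLR TNLDSKYQEN GPKGPLWEEI SAGMRRLGFN RNSKRCKEKW ENINKYFKKV  480
KESNKKRPED SKTCPYFHQL DALYRERNKL HHNNSIASSS SASGLVKPEN SVPLMVQPEQ  540
QWPPAPVTTT ETTTAVAVVQ QQHPQPPSDQ NYDDEEGTDE EYDDDEDEED EENEEEEEGG  600
EFELVPSNNN KTTNDL*
"""


def parse_numbered_sequence(raw: str) -> str:
    """Drop whitespace and position counters; keep letters and the '*' terminator."""
    return re.sub(r"[\s\d]", "", raw)


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based inclusive (assumed)."""
    return seq[start - 1:end]


seq = parse_numbered_sequence(RAW)   # includes the trailing '*'
residues = seq.rstrip("*")           # amino-acid letters only

# Coordinates taken from the Signature Domain table of this record.
domain1 = slice_1based(residues, 71, 155)
domain2 = slice_1based(residues, 417, 502)

print(len(seq), len(residues))
print(domain1)
print(domain2)
```

Both slices begin and end with the same residues as the `Csa09g082580.1` rows of the two domain alignments once the gap characters (`-`) are removed, which supports the 1-based reading of that table.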
Nuclear Localization Signal
NLS
No.    Start    End    Sequence
1      252      257    RKRKRK
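
The position of the NLS motif can be located in the protein sequence directly. The sketch below is a hedged illustration in Python: it uses only residues 241 to 300 as printed in the Protein Sequence block, and the function name `locate` is hypothetical. Counting from 1, as the Signature Domain coordinates appear to do, the first `RKRKRK` match spans 253 to 258, one residue later than the Start and End listed here. The listed values would agree if this table counts from 0, but that is an assumption the record does not confirm.

```python
# Residues 241-300, copied from the Protein Sequence block of this record.
SEGMENT = "TGGGGTTTTT TNRKRKRKWK EFFERLMKQV VDKQEELQRK FLEAVEKREH ERLVREESWR"
SEGMENT_START = 241  # 1-based protein position of the first residue in SEGMENT


def locate(motif: str, segment: str, segment_start: int):
    """Return (start, end) of the first match as 1-based inclusive positions, or None."""
    residues = segment.replace(" ", "")
    idx = residues.find(motif)
    if idx < 0:
        return None
    start = segment_start + idx
    return start, start + len(motif) - 1


hit = locate("RKRKRK", SEGMENT, SEGMENT_START)
print(hit)  # (253, 258) when counting from 1

# Same match expressed with 0-based inclusive indices (assumed convention).
zero_based = (hit[0] - 1, hit[1] - 1)
print(zero_based)  # (252, 257), the values listed in the NLS table
```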
Functional Description
Source     Description
UniProt    Probable transcription factor that binds specific DNA sequence. {ECO:0000250}.
Binding Motif
Motif ID    Method    Source                     Motif file
MP00243     DAP       Transfer from AT1G76880    Download
Cis-element
Source         Link
PlantRegMap    Csa09g082580.1
Regulation -- PlantRegMap
Source         Upstream Regulator    Target Gene
PlantRegMap    Retrieve              Retrieve
Annotation -- Nucleotide
Source     Hit ID      E-value    Description
GenBank    AC079283    1e-167     AC079283.4 Arabidopsis thaliana chromosome 1 BAC F7O12 genomic sequence, complete sequence.
GenBank    CP002684    1e-167     CP002684.1 Arabidopsis thaliana chromosome 1 sequence.
Annotation -- Protein
Source       Hit ID            E-value    Description
Refseq       XP_010428752.1    0.0        PREDICTED: trihelix transcription factor GT-2-like
Swissprot    Q39117            1e-141     TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBL       D7KTX9            0.0        D7KTX9_ARALL; Uncharacterized protein
STRING       XP_010428752.1    0.0        (Camelina sativa)
Orthologous Group
Lineage    Orthologous Group ID    Taxa Number    Gene Number
Malvids    OGEM4849                25             53
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT1G76880.1    0.0        Trihelix family protein