PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Csa07g048450.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family Trihelix
Protein Properties Length: 709aa    MW: 79985.8 Da    PI: 7.945
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Csa07g048450.1genomeCSGPView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix88.95.7e-2870154187
        trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                     rW++qe+laL+++r++m+ ++r+++ k+plWeevs+km e g+ r++k+Ckek+en+ k++k++keg+ +++  + +t+++fdqlea
  Csa07g048450.1  70 RWPRQETLALLKLRSDMGIAFRDASVKGPLWEEVSRKMGELGYIRNAKKCKEKFENVYKYHKRTKEGRTGKS--DGKTYRFFDQLEA 154
                     8********************************************************************964..5557*******85 PP

2trihelix104.95.7e-33419504187
        trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                     rW+k e+ aLi++r++++++++++  k+plWee+s+ mr+ gf+r++k+Ckekwen+nk++kk+ke++kkr +e+s+tcpyf+ql+a
  Csa07g048450.1 419 RWPKVEIEALIKLRTNLDSKYQENGPKGPLWEEISAGMRRLGFNRNSKRCKEKWENINKYFKKVKESNKKR-PEDSKTCPYFHQLDA 504
                     8*********************************************************************8.99***********85 PP

3trihelix104.29.4e-33511596187
        trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                     +W+k e+ aLi++r++++++++++  k+plWee+s+ mr+ gf+r++k+Ckekwen+nk++kk+ke++kkr +e+s+tcpyf+ql+a
  Csa07g048450.1 511 KWPKVEIEALIKLRTNLDSKYQENGPKGPLWEEISAGMRRLGFNRNSKRCKEKWENINKYFKKVKESNKKR-PEDSKTCPYFHQLDA 596
                     7*********************************************************************8.99***********85 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM007170.004267129IPR001005SANT/Myb domain
CDDcd122031.80E-2069134No hitNo description
PROSITE profilePS500907.08569127IPR017877Myb-like domain
PfamPF138377.8E-1769155No hitNo description
PROSITE profilePS500907.271412476IPR017877Myb-like domain
SMARTSM007170.0017416478IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.608.6E-4417475IPR009057Homeodomain-like
PfamPF138374.2E-22418505No hitNo description
CDDcd122032.64E-25419483No hitNo description
SMARTSM007175.0E-4508570IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.607.9E-4508567IPR009057Homeodomain-like
PROSITE profilePS500907.457509568IPR017877Myb-like domain
PfamPF138371.8E-22509597No hitNo description
CDDcd122034.02E-25510575No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 709 aa     Download sequence    Send to blast
MMQLGGGTPS TTTTTTAAAA STATPPPPPP QPQPQPQSND SAATEAAAAA AVGAFEVSEE  60
MNDRGFGGNR WPRQETLALL KLRSDMGIAF RDASVKGPLW EEVSRKMGEL GYIRNAKKCK  120
EKFENVYKYH KRTKEGRTGK SDGKTYRFFD QLEALETHDH HQHQPLRPHH HQNNNSIFST  180
PPPITTVMPP SVPSSSFPPY TQQLNVPSSF PTDFLSDNST SSSSSSYSTS SDMDIGGGSG  240
GGGTTTNRKK RKRKWKEFFE RLMKQVVDKQ EELQRKFLEA VEKREHERLV REESWRVQEI  300
ARINRDQEIL AQERSMSAAK DAAVMAFLQK LSEKQPNQPP TTVQPQQVRP QMQLLNNNNN  360
QQQTPQPSPP APPPLLQPIQ AVLPSSSETR KTDNGDQSMM TPASASASAS GSGSASSSRW  420
PKVEIEALIK LRTNLDSKYQ ENGPKGPLWE EISAGMRRLG FNRNSKRCKE KWENINKYFK  480
KVKESNKKRP EDSKTCPYFH QLDALYRERN KWPKVEIEAL IKLRTNLDSK YQENGPKGPL  540
WEEISAGMRR LGFNRNSKRC KEKWENINKY FKKVKESNKK RPEDSKTCPY FHQLDALYRE  600
RNKFHNNSIA SSSSASGLVK PDNSVPLMVQ PEQQWPPAPV TTTATTTAVA VVQQQHPQPS  660
DQNYDDEEGT DEEYDDEEDE EDEENEEEEE GGEFELVPSN NNKTTNDL*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1247252RKKRKR
2247253RKKRKRK
3248253KKRKRK
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor that binds specific DNA sequence. {ECO:0000250}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapCsa07g048450.1
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAC0792830.0AC079283.4 Arabidopsis thaliana chromosome 1 BAC F7O12 genomic sequence, complete sequence.
GenBankCP0026840.0CP002684.1 Arabidopsis thaliana chromosome 1 sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_010416611.10.0PREDICTED: trihelix transcription factor GT-2-like isoform X1
RefseqXP_019083408.10.0PREDICTED: trihelix transcription factor GT-2-like isoform X2
SwissprotQ391171e-136TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBLQ9C6K30.0Q9C6K3_ARATH; Duplicated homeodomain-like superfamily protein
STRINGXP_010416611.10.0(Camelina sativa)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM48492553
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76880.11e-174Trihelix family protein