PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Csa16g041070.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family Trihelix
Protein Properties Length: 579aa    MW: 66125.5 Da    PI: 7.0473
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Csa16g041070.1genomeCSGPView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix956.8e-3041125187
        trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                     rW++ e+laL+++r+em++++r++ lk+plWee+s+km+e g++rs+k+Ckek+en+ k++k++keg+ ++++++  t+++f++lea
  Csa16g041070.1  41 RWPRPETLALLRLRSEMDKAFRDSTLKAPLWEEISRKMMELGYKRSAKKCKEKFENVYKYHKRTKEGRTGKSEGK--TYRFFEELEA 125
                     8********************************************************************975544..6******985 PP

2trihelix108.83.5e-34398483187
        trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                     rW+k ev aLi++r+++e +++++  k+plWee+s+ mr+ g++rs+k+Ckekwen+nk++kk+ke++kkr + +s+tcpyf+qlea
  Csa16g041070.1 398 RWPKTEVEALIRIRKNLEANYQENGTKGPLWEEISAGMRRLGYNRSAKRCKEKWENINKYFKKVKESNKKR-PLDSKTCPYFHQLEA 483
                     8*********************************************************************8.9************85 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM007170.009338100IPR001005SANT/Myb domain
PfamPF138374.8E-2040125No hitNo description
CDDcd122035.32E-2540105No hitNo description
PROSITE profilePS500907.1434098IPR017877Myb-like domain
PROSITE profilePS500907.526391455IPR017877Myb-like domain
SMARTSM007179.2E-4395457IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.603.0E-4397454IPR009057Homeodomain-like
CDDcd122033.11E-26397462No hitNo description
PfamPF138377.2E-24397484No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 579 aa     Download sequence    Send to blast
MSGNSSGPLE SSGGGVGGSG EEEKDMKMEE TGEGAGDGGS RWPRPETLAL LRLRSEMDKA  60
FRDSTLKAPL WEEISRKMME LGYKRSAKKC KEKFENVYKY HKRTKEGRTG KSEGKTYRFF  120
EELEAFETLN SYQHEPESQL AKSSAAVATA AITTSLIPCI SSNNPSTEKS SLPLKHQHQV  180
SVQPITTNPT FHAKQPSATM PFPFYSNNNT TTVSQPPSIS NDLMNNVSSL HLFSSSTSSS  240
TASDEEEDHH QGKRSRKRRK YWKGFFTKLT KELMDKQEKM QKRFLETLEN REKERISREE  300
AWRVQEIARI NREHETLIHE RSNAAAKDAA IISFLHKISG GQQQQPQQQN HKPAQRKQYQ  360
SDHSITFESK EPRPVLLDTT MKMGNYDTNQ SISPSSSRWP KTEVEALIRI RKNLEANYQE  420
NGTKGPLWEE ISAGMRRLGY NRSAKRCKEK WENINKYFKK VKESNKKRPL DSKTCPYFHQ  480
LEALYNERNK SGAMPLPLPS PLMVTPQRQL LLSQENQTEF ETDQRDKVGD KEDDEEGESE  540
EDEYDDEEEG EGDNETSEFE TVLKKTSSPM DINNNLFT*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
18391KRSAKKCKE
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor that binds specific DNA sequence. {ECO:0000250}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapCsa16g041070.1
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAY0871170.0AY087117.1 Arabidopsis thaliana clone 3190 mRNA, complete sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_010471850.10.0PREDICTED: trihelix transcription factor GT-2
SwissprotQ391170.0TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBLA0A178W7Z20.0A0A178W7Z2_ARATH; GT2
STRINGXP_010471850.10.0(Camelina sativa)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM59952847
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76890.20.0Trihelix family protein