PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cucsa.102050.10
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Benincaseae; Cucumis
Family B3
Protein Properties Length: 1013aa    MW: 114880 Da    PI: 5.1184
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cucsa.102050.10genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B351.81.5e-1615105199
                      EEEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EE CS
               B3   1 ffkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvv 94 
                      ffkv++p + + +  +++p  f+++ +g    + + t++d +g+sW ++l  +k ++  ++++GWk Fv+ + Lk gDf+vF+++g+  f   v
  Cucsa.102050.10  15 FFKVFLPGSCTLH--MSIPPAFMKHLNGT--FPEKATMQDHTGNSWCITL--EKLDDLLYFKNGWKAFVDYHSLKYGDFLVFQYHGHCLF--DV 100
                      7777755555544..8*******666666..6778***************..9**999**************************997777..88 PP

                      EEE-S CS
               B3  95 kvfrk 99 
                      k+f k
  Cucsa.102050.10 101 KIFGK 105
                      88876 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF1019362.75E-2311108IPR015300DNA-binding pseudobarrel domain
Gene3DG3DSA:2.40.330.101.1E-2012108IPR015300DNA-binding pseudobarrel domain
PROSITE profilePS5086312.61913106IPR003340B3 DNA binding domain
SMARTSM010191.2E-1615106IPR003340B3 DNA binding domain
CDDcd100172.71E-1815103No hitNo description
PfamPF023621.4E-1315104IPR003340B3 DNA binding domain
PfamPF093311.3E-29690822IPR015410Domain of unknown function DUF1985
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1013 aa     Download sequence    Send to blast
MMSHSGIEPA SNLEFFKVFL PGSCTLHMSI PPAFMKHLNG TFPEKATMQD HTGNSWCITL  60
EKLDDLLYFK NGWKAFVDYH SLKYGDFLVF QYHGHCLFDV KIFGKNGCKK AAAAKPASSI  120
PVLETEIAEA GNSVSDSEAK VADAGNSIAN LEAMSADAGN SDSKLEVVEA KTGNAVPTLK  180
VKEEPVVEEE DVKPSISHKR KRLQDGSELD HQSKSVVPLN RGRPDNVSNS VEQAIPRGPF  240
FERTMKRWSG QILVEEPLHY MPFFGRKNFR IEPVKIFPVR TNPEVAKYFE ECNQFQEYSW  300
EMLMSHVHSL SANQELKYIQ FEHEEVDSQQ NYQYFQDDDV QDNQSEGLDM IMTDEQPISQ  360
SEEILYLEYQ PLQTDIEDNR KSANMDSTEL VENSYDIEQN NMDDTSEVGG NLCDNIKENN  420
VDTTELSGNS YDNIKENNMD TTEPGGNSCD DIKENNVDST ELGGNLCDNI KENNVDTFEL  480
SGNSYDDIEE NNMDTAELGV NSCDDIEENN VDSTEHCGNL CDNIKDNNVD TTEEKQSPAS  540
VEPSRKKKKR KSASFEVQEQ KEETSEIDTD QDSRRGVETR QRKKIAEQSK GEDGKRKKRG  600
KRGKKSGISG TSSEHDDEVD VHKEYPLLLP RSSWATTQRI NLYSKLDVIS IIKNTLNERQ  660
LKKFKKSCFG NFLDLKISKF SSQLFYHLIR RQCCSKNRNE LWFNLEGRIH KFGMKDFALI  720
TGLNCGELPA IDMSKIQKGK FNKRYFGGEK TIRRAKLHKV FTEMDKGRNK DVVKMAKLYI  780
LEMFILGKQI RTGINHEYTL LIDDKKQFDS YPWGRISYEI TVDFVKKSIK SNDASAIGVG  840
GFPYALLVWA YETIPLLALN SNFLAMRISF GTPRMNNWAA GVHPEWKDLS EKVFQSEAFD  900
VQPLIATTTE MEMPYMIPFG GVKPSNEKNI SPVDQEHNSD ARTSYNKDHC NWKGSQSVSK  960
DGVENFLFTK IVNIEGILGS LVHDIDNLKS FFHKMCGTAN EAADSEKMRK PL*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1544549RKKKKR
2544550RKKKKRK
3545550KKKKRK
4581598RKKIAEQSKGEDGKRKKR
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankLN6818750.0LN681875.1 Cucumis melo genomic scaffold, anchoredscaffold00007.
GenBankLN7132620.0LN713262.1 Cucumis melo genomic chromosome, chr_8.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_011657831.10.0PREDICTED: uncharacterized protein LOC101221625 isoform X8
TrEMBLA0A1S3B1760.0A0A1S3B176_CUCME; uncharacterized protein LOC103484737 isoform X1
STRINGXP_008440207.10.0(Cucumis melo)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G66980.13e-20B3 family protein