PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thhalv10020260m
Common NameEUTSA_v10020260mg
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Eutremeae; Eutrema
Family MYB
Protein Properties Length: 650aa    MW: 73903.1 Da    PI: 8.7082
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thhalv10020260mgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding22.82.2e-07462504140
                      TSSS-HHHHHHHHHHHHHTTTT...-HHHHHHHHTTTS-HHHH CS
  Myb_DNA-binding   1 rgrWTteEdellvdavkqlGgg...tWktIartmgkgRtlkqc 40 
                      +++W++eE + l ++  ++++g   +W+ I++++g+gR+ +++
  Thhalv10020260m 462 KQPWSKEEIDMLRKGMIKYPKGtsrRWEVISEYIGTGRSVEEI 504
                      79************************************99876 PP

2Myb_DNA-binding23.71.1e-07593636445
                      S-HHHHHHHHHHHHHTTTT...-HHHHHHHHTTTS-HHHHHHHHH CS
  Myb_DNA-binding   4 WTteEdellvdavkqlGgg...tWktIartmgkgRtlkqcksrwq 45 
                      W++  +  lv+a k ++++   +W+++a+ ++ g+t  qck ++ 
  Thhalv10020260m 593 WSAVQERALVQALKTFPKEtsqRWERVAAAVP-GKTMIQCKKKFA 636
                      ********************************.********9986 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.287.1101.6E-2095192IPR001623DnaJ domain
SuperFamilySSF465651.31E-2096177IPR001623DnaJ domain
SMARTSM002718.9E-1698173IPR001623DnaJ domain
CDDcd062575.57E-1399170IPR001623DnaJ domain
PfamPF002263.4E-1799178IPR001623DnaJ domain
PROSITE profilePS5007618.0399181IPR001623DnaJ domain
PRINTSPR006254.6E-8104122IPR001623DnaJ domain
PRINTSPR006254.6E-8122137IPR001623DnaJ domain
PRINTSPR006254.6E-8153173IPR001623DnaJ domain
PROSITE patternPS006360158177IPR018253DnaJ domain, conserved site
PROSITE profilePS500906.226457512IPR017877Myb-like domain
SuperFamilySSF466891.35E-7458505IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.603.2E-5461504IPR009057Homeodomain-like
SMARTSM007179.3E-7461514IPR001005SANT/Myb domain
PfamPF002493.9E-6462504IPR001005SANT/Myb domain
CDDcd001672.79E-4464504No hitNo description
SMARTSM007179.9E-8589641IPR001005SANT/Myb domain
SuperFamilySSF466891.2E-8590637IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.601.3E-4592637IPR009057Homeodomain-like
PfamPF002497.3E-6593637IPR001005SANT/Myb domain
PROSITE profilePS500906.841593639IPR017877Myb-like domain
CDDcd001671.19E-5593639No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 650 aa     Download sequence    Send to blast
MPSWRSDSAV KLITYSEELV DGKPFYAFSN CLPVKALNRE PAGHAFHSAA LKLHGCAEEP  60
TDNEGSDKKV GDDKEKEYVP SFNSYANKGK KKSGTQQQDH YALLGLSNLR YLATEDQIRK  120
SYREAALKHH PDKLATLLLA EETEEAKEAK KDEIESHFKA IQEAYEVLMD PTRRRIFDST  180
DEFDDEVPTD CLPQDFFKVF GPAFKRNARW SVNQRIPDLG DENTKLKDVD KFYNFWYAFK  240
SWREFPDEEE HDLEQADSRE ERRWMEKENA KKTAKARKEE HARIRTLVDN AYRKDPRIVK  300
RKEEEKAEKQ QKKEAKLMAK KKQEEEAAIA AEEEKRRKEE EEKRAAESAQ QQKKAKEKEK  360
KLLRKERNRL RTLSAPLVAQ RLLDISEEDI ENLCMSLSIE QLQNLCDNMG NKEGLELAKV  420
IKDGCNSSRN DEANSKEKES EKTNGGAEPT NRVSQLGSTQ KKQPWSKEEI DMLRKGMIKY  480
PKGTSRRWEV ISEYIGTGRS VEEILKATKT VLLQKPDSAK AFDSFLEKRK PTVSISSPLS  540
TREELGESLP ATATATTTTK AKPAKETVVG KSSSSQSSDS NGEASGSSDT DGWSAVQERA  600
LVQALKTFPK ETSQRWERVA AAVPGKTMIQ CKKKFAELKE ILRSKKTGV*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5dje_A9e-1519529820123Zuotin
5dje_B9e-1519529820123Zuotin
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1319337KKKQEEEAAIAAEEEKRRK
Cis-element ? help Back to Top
SourceLink
PlantRegMapThhalv10020260m
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAC0081530.0AC008153.5 Arabidopsis thaliana chromosome 3 BAC F24K9 genomic sequence, complete sequence.
GenBankCP0026860.0CP002686.1 Arabidopsis thaliana chromosome 3, complete sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006407446.10.0dnaJ homolog subfamily C member 2
TrEMBLV4LYG40.0V4LYG4_EUTSA; Uncharacterized protein
STRINGXP_006407446.10.0(Eutrema salsugineum)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM23612755
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G11450.10.0DnaJ domain ;Myb-like DNA-binding domain