PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gorai.013G026300.3
Common NameB456_013G026300
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family MYB
Protein Properties Length: 1546aa    MW: 168839 Da    PI: 6.7595
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gorai.013G026300.3genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding27.19.6e-09767808346
                         SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
     Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                         +WT eE e++ d  + +G++ +++Ia  +  ++t  +c+++++k
  Gorai.013G026300.3 767 PWTSEEKEIFMDKLAAFGKD-FRKIATFLD-HKTTADCVEFYYK 808
                         8*****************99.*********.***********98 PP

2Myb_DNA-binding30.49.2e-109861025445
                          S-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
     Myb_DNA-binding    4 WTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45  
                          WT eE   +++av  +G++ ++ I+r++  +R++ qck ++ 
  Gorai.013G026300.3  986 WTDEEKSVFIQAVSSYGKD-FAMISRCVR-TRSRDQCKVFFS 1025
                          *****************99.*********.********8776 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466891.05E-13751812IPR009057Homeodomain-like
PROSITE profilePS5129314.53763814IPR017884SANT domain
SMARTSM007171.6E-7764812IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.603.0E-5766812IPR009057Homeodomain-like
PfamPF002498.0E-6766808IPR001005SANT/Myb domain
PROSITE profilePS5129311.1749811032IPR017884SANT domain
SMARTSM007171.0E-79821030IPR001005SANT/Myb domain
SuperFamilySSF466892.96E-109841032IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.605.3E-69851026IPR009057Homeodomain-like
PfamPF002493.5E-79861025IPR001005SANT/Myb domain
CDDcd001677.37E-79861024No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1546 aa     Download sequence    Send to blast
MPQEPLPWDR KDIYKDRKHE RAELQPPPLL AARWREASSM SSYQHGSFRE FARWGSADFH  60
RPPGHGKQGN WHLFPEDIGG HGYVPWRSSD KILDGETYRQ SVSRGDGKYG RSYSRDNNRG  120
SYNQRDWRGH SLETSNGSPN TSVRPHDVNN EQRSVDDMFT YPSRTHSDFV NTWNQLQKDH  180
HDNRTCGVNG LGTGQRCERE NSLGSVDWKP LKWSRSGSLS SWGSGFSHSS SSKSLGGVDS  240
GEAKLELHQK NLAPVQSPSG DAAACVTSAP PSDETTSRKK PRLGWDKSPR VLGFSDCSSP  300
ATPSSVACSS SPGVEEKSFG KAANIDNDVN NLCGSPSFGS QNQLEGSSFS LEKLDINSII  360
NMGSSLIDLL QSDEPSTMDS SFVQSTAINK LLLWKGDILK ALEMTESEIE SLETELKSSK  420
DDPGRRCQCP ATSSSLPVRE NGKSCEEQEA ASSMIPQPAP LKIDPSNDVL EVLQEANADI  480
KDGVIDSPGT ATSDFMLSSS LEKAESLCDV VKAQDCSGNS SSAQLKTMEE VILATDSCNE  540
EAAAVISGEG SVLVKIDNEA HVPESSNSDA GGENMTCDVI LTTNKELANR SSLVFKKLFP  600
KDQYSIEISE ISNAVRGQIS SLIREKIAMR KRHLRFKERV LTLKFKAFQY AWKEDMLSPA  660
MRKYWAKSQK KYELSLRSTY GGYQKHRSSS RSRVASSAGN LVLEPTAEMI NFTSKLLLDS  720
HVKLYRNALK MPALILDEQE QLSRFISSNG LVEDPCAIEK ERALINPWTS EEKEIFMDKL  780
AAFGKDFRKI ATFLDHKTTA DCVEFYYKNH KSECFKKTKK KLDLTKQGKS SANTYLLTSG  840
KKWSKEFNAA SIDVLGSASV IATHAESGMQ KHQTSSSRIF FGGRYSKISR ADDRIADRLS  900
SFDIIGNDRE TAAADVLAGI CGSLSSEAMS SCITSSLDPG ESFHRDWKCH KVDSLLKRRS  960
TSNVAQNVDD GTCSDESCGE MDPADWTDEE KSVFIQAVSS YGKDFAMISR CVRTRSRDQC  1020
KVFFSKARKC LGLDLIDPRT RNLGTPMSDD ANGGGSDAED ACVLERLVVS SDKLGSKPED  1080
LPSNIVCTNM DERNPTSKPI LPTDLNVPDE NNRKLVDHRD SEAVQTVDSD AGLAELISEC  1140
SVDMNIDSKA GSLQVQKSFV ALGNLNAGRD VTEQGVSVAV SASLGAAAHP CTPSLDSVAV  1200
SKPATSLYEN DTKCSAETSS QSICRIDSNK ASDGSVGKNS CSGFSLSAKG LHQIPPDLDS  1260
AKKPSVSNNS SANGSALHDS DGLRCEKICN LGRLSSTLDY KENEAKQAQK SVREDESGRL  1320
SGKTSVNVTE PHRILRGYPL QVSTLKEMNG DVKCLATSKR GSAGPCLAQE CYLQKCNSSK  1380
SAAELPLLVE NLEQAKDRPK SHCRISDTEN PGRNGNVKLF GQILNSSSRD DKVSHFSKQN  1440
TEPSNSKPIG NNVDGNSKFD ANNHVVENVP KRSYGFSDGK RIQTGLSSLP DSSILMAKYP  1500
SAFANYPPTS SSQMEQQALQ TVVHGTDRTL NGVSFPLKGN KQQQR*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4a69_C2e-17726816494NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D2e-17726816494NUCLEAR RECEPTOR COREPRESSOR 2
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankJX5788050.0JX578805.1 Gossypium hirsutum clone NBRI_GE10901 microsatellite sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017619121.10.0PREDICTED: uncharacterized protein LOC108463731 isoform X1
RefseqXP_017619122.10.0PREDICTED: uncharacterized protein LOC108463731 isoform X1
TrEMBLA0A0D2W2970.0A0A0D2W297_GOSRA; Uncharacterized protein
STRINGGorai.013G026300.10.0(Gossypium raimondii)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G52250.11e-178MYB family protein
Publications ? help Back to Top
  1. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]