PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cagra.1889s0015.2.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Capsella
Family MYB
Protein Properties Length: 1528aa    MW: 166676 Da    PI: 6.4303
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cagra.1889s0015.2.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding33.59.5e-11742783346
                          SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
      Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                          +WT eE e+++   + +G++ +k+Ia+++   +t  +c+++++k
  Cagra.1889s0015.2.p 742 PWTSEEKEIFLSMLAIHGKD-FKKIASYLT-EKTTADCIDYYYK 783
                          8*****************99.********9.9**********98 PP

2Myb_DNA-binding28.24.5e-09957998447
                          S-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHH CS
      Myb_DNA-binding   4 WTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqky 47 
                          WT +E   +++++ ++G++ +++I+r++g +R++ qc+ ++ k+
  Cagra.1889s0015.2.p 957 WTDDERSAFLQGFSLFGKN-FASISRYVG-TRSPDQCRVFFSKV 998
                          *****************99.*********.********998776 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466892.02E-14725786IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.608.2E-7734785IPR009057Homeodomain-like
PROSITE profilePS5129314.857738789IPR017884SANT domain
SMARTSM007174.2E-10739787IPR001005SANT/Myb domain
PfamPF002491.4E-8741783IPR001005SANT/Myb domain
CDDcd001671.66E-8742784No hitNo description
PROSITE profilePS5129312.899521003IPR017884SANT domain
SMARTSM007175.1E-89531001IPR001005SANT/Myb domain
SuperFamilySSF466891.17E-99561003IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.601.3E-4956997IPR009057Homeodomain-like
CDDcd001675.79E-7957998No hitNo description
PfamPF002491.3E-7957997IPR001005SANT/Myb domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1528 aa     Download sequence    Send to blast
MSNGSSRSFE RPFGIRNGRR SVDERPLHAS DTHTTMVNSL DPTNSAHQPD TEICTPVRSL  60
RFKNEQKFSD QRLSLPSDPH SDCVRLFEQA SSENNYGNKI CSPAKQCNDL MYGRRIANDN  120
SLDPPILNAE LEGTWEQLHM KDPQEDNKLH GITDLDDARK CAKESSLGAI GKLPLWNSSG  180
SFASQSSGFS HSSSLKSVGA VDSTDRKTEV LPKIATVTQS SSGDATPCAT TTHLFEEMSS  240
RKKQRLGWGE GLAKYEKKKV DVNTNEDGTT LLENGLDEQH SLNKNIADKS PTAAILPDYG  300
SPTTPSSVAC SSSPGFADKS SAKAAIAASD VSNMCRSPSP VSSIHLERFP VNIDELDNIS  360
MERFGCLLNE LLCTDEPGTG DSSSVQLTSM NRLLAWKSEI LKAVEMTESE IDLLENKHRT  420
LKLEGGRHCH VGSSSYFCEG DADVPKEQEA SCILGPKAAA TPVAEALVRS PVHQSSLAKV  480
SVDVCEDNNE EVKFLSQSFA TVDSNEDILP KLSMKAVTSS KEISTPAFVN QETVELSSAD  540
DSMASNEDIL CAKLLSSNKK YACESSGVFN ELLPRDCSFD ESRYFGICQM QFDSHVKEKL  600
ADRVELLRAR EKILLLQFKA FQLSWKKDLD QLALTKYQSK SSRKSDLYPN AKNGGYLKLP  660
QPVRLRFSSS APRRDSVVPT TELVSYMEKL LPGTNLKPYR DILRMPAMIL DERERVMSRF  720
ISSNGLVEDP CDVEKERTMI NPWTSEEKEI FLSMLAIHGK DFKKIASYLT EKTTADCIDY  780
YYKNHKSDCF GKIKKQRAYG KEGKHTYMLA PRKKWKREMG AASLDILGAV SIIAANAGKV  840
ASTRQISSKR ITLRGCSSSN SLQHDGNNSE GCSYSFDFPR KRTVGADVLA VGPLSSEQIN  900
SCLRTSVSSR ERCMDHLKFN PVVKKPRISH TLHNENSNEE DDSCSEESCG ETGPIHWTDD  960
ERSAFLQGFS LFGKNFASIS RYVGTRSPDQ CRVFFSKVRK CLGLEFIQSG SGNLSTSVSV  1020
DNGNEGGGSD LEDPCPMESN SGICNNGVCA KMDINSPTSP FNMNQNGANH SGSANVKADL  1080
SRSEQENGLT YIHLKDGRNL VSNAYIKGDL PGLVSESCRD LVDINTVENQ SQAAGISKSS  1140
DLLSMEIDEG VLTSVAVSSE PLYCGLSVLS NVIVETPTES SQMGSGDQGA ATMLKLNSKN  1200
QDGVMQAANR TKNPGLDPES APSGFKYPEC LHHVPIEVCT ENPIGVSVPR GNPNCHTEAK  1260
SGNSLVGQAV ETHDLGWQFS KENLELNGRL QVIGHVNPEQ NGQLNSINAE SCQIPQRSVT  1320
QDPSRISRSK SDLIVKTQRT GEGFSLNKCT SSAPNSLTVS HKEGKSGHSR SHSFSLSDTE  1380
RLDKNGDVKL FGTVLTADEN GIKQKHNPGG SVRSSSTLSR DHDTRHHYIN QQHLQNVPIT  1440
SYGFWDGNRI QTGLTSLPES AKLLASCPEA FSTHLKQQVG SNKEIRRDVN GGGILSFGKH  1500
NEDRAEASSA KDGGNIGGVN GVAEAAT*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4a69_C3e-15700790493NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D3e-15700790493NUCLEAR RECEPTOR COREPRESSOR 2
Search in ModeBase
Cis-element ? help Back to Top
SourceLink
PlantRegMapCagra.1889s0015.2.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006290487.10.0uncharacterized protein LOC17885771
TrEMBLR0HEB90.0R0HEB9_9BRAS; Uncharacterized protein
STRINGCagra.1889s0015.1.p0.0(Capsella grandiflora)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G52250.10.0MYB family protein