PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Araip.E4ENU
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Dalbergieae; Arachis
Family MYB
Protein Properties Length: 1666aa    MW: 181065 Da    PI: 6.1837
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Araip.E4ENUgenomeNCGR_PGCView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding26.51.5e-08799840346
                      SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                      +WT eE e++++ ++ +G++ +++Ia+ +  ++t  +c+++++k
      Araip.E4ENU 799 PWTSEEREIFLEKFAVFGKD-FRKIASFLH-HKTTADCVEFYYK 840
                      8*****************99.*********.***********98 PP

2Myb_DNA-binding28.34.2e-099941034345
                       SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
  Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45  
                        WT +E   +++av  +G + + + ar++g +R+++qck ++ 
      Araip.E4ENU  994 DWTDDEKAAFIQAVSSFGRD-FVKLARCIG-TRSPEQCKVFFS 1034
                       5*****************87.*********.********8876 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466894.93E-14783843IPR009057Homeodomain-like
PROSITE profilePS5129315.926795846IPR017884SANT domain
SMARTSM007171.3E-8796844IPR001005SANT/Myb domain
PfamPF002493.7E-6798840IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.601.6E-5798843IPR009057Homeodomain-like
PROSITE profilePS5129311.3439901041IPR017884SANT domain
SMARTSM007178.3E-99911039IPR001005SANT/Myb domain
SuperFamilySSF466895.46E-99921041IPR009057Homeodomain-like
PfamPF002495.3E-79941034IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.603.5E-59941035IPR009057Homeodomain-like
CDDcd001672.00E-79951033No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1666 aa     Download sequence    Send to blast
MPPEPLPWDR KDFFKERKHE RSESLGSVAR WRDSSHHRDF HRWGSAEFRR PPGHGKQGGW  60
HVFSEDSGHG YGISRSSSEK MLDEDCRPSV SRGDGKYGRG SRENRGPFGQ RDWRGQSWET  120
TNGSMNLPRR PPDVNNDHRS VDDNLTYSTH PHSDFVNTWD PHHLKDQHDK IGGANGFGTG  180
ARSDRENSLA SIDWKPLKWT RSGSLSSRGS GFSHSSSSRS AGGADSHEAK AELHPKNATV  240
NESHSGEAAV CVTSSAPCED TTSRKKPRLN WGEGLAKYEK KKVEVPDGSA NKDGPVLSNG  300
SIEPCAFPGS SLVDKSPKVT GFSDCACASP ATPSSVACSS SPGVDDKLFG KPANVDNDVS  360
NLTCSPVPGS QDHFQRFSFN LEKLDIESLN SLNSSIIELI QSDDTSYVNS GPMRSTAMNK  420
LLIWKADISK VLETTESEID SLENELKSLR SASGDRGSYP AVLGSQMVGN NENPFEVPVG  480
VSDEVTRPEP LKILSSDDPD AEKLPLSTNL NSIHENGKEE DIDSPGSATS KLSEPPPLVK  540
AVSSSDTRRY DTFLEDANAG QSNGMKCLIP CTTRKYPSNS ACSDVNASSE VPDSIITASG  600
ASLWSSTEDS LYKKIISSNR ELAKSACGVF AKLLPQGYTK IDKVGASSDL CSQTSIMEKF  660
AEKKQFARFK ERVITLKFKA LHHLWKEDMR LLSIKKCRPK YHKKHELSVR STFNGNQKNR  720
FSIRSRFPLP AGNHLSLVPT AEVINFTRKL LSEPQVKIHR DALKMPALVL DEKIPKFISS  780
NGLVEDPLAI EKEKALINPW TSEEREIFLE KFAVFGKDFR KIASFLHHKT TADCVEFYYK  840
NHKSDCFEKL KKQQKLGKSF LAKTDLVASG KKWNHEANTA SLDILSAASV MADGFACNKK  900
MRPGNFLMGG YVNVKASRVD DSIRERSSSF DILGDEREAF ADVMASSEAM SFCGTSSVEP  960
VEGSRDSRLM PDTAENVDDE TCSDESCGEM DPTDWTDDEK AAFIQAVSSF GRDFVKLARC  1020
IGTRSPEQCK VFFSKARKCL GLDLMRPMPE NVGSPANDGA NGGGSDTDDA CAVETGSVVG  1080
TDKSGTKTDE DLPSSVINTY HDESDPVEVR NLAAELNEPK EEDDTVVDHE DANLVSDGVV  1140
LYNSDKSGSV NGQAPIVMTD STTVGKDKAI KFGGADLVSI SALDTTEPCE RSLAGQDNVV  1200
TEVSSGVLGS GLERQSVPST QCPDDRGDKL VAVTAVGVEL KSSVQDSCTT TVNASVSSVG  1260
NSCSGLSFDT ESKHMALGKP VSALYVEDLH ATANSLSQNT SVSAAVQCEK TATQDQLSCT  1320
TETPGGRNLQ CHNPISNGDH QLPVPGNRVD RANSILHGYP LQMAIKKEVN GDIKCSSSAN  1380
ELPLLSRKDE QDDHFKARLS YSSDSEKTSR NGDVKLFGKI LTNPSSTQKP NLTTKSCEEN  1440
GIHHPKSSRL SSLKYADGNF KMLKFERDDC SEYLGLENVP LRSYGYWDGN RIQTGLTSLP  1500
DSAILLAKYP AAFSNYPSSS AKLEQQSFHA FGKNNERHLS GSPAFTARDM NGSNAVIDYQ  1560
MLRSRDGSVV DVKHCQDVFS EMPRRNGFEA ISSLHQQQQG RGVVGMSSGG VGGTGIVVGS  1620
CSGVTDPVAA IKMHYPNSDK YGGQTGNNIS SREDESWAGG KGDLGR
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4a69_C2e-157718481794NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D2e-157718481794NUCLEAR RECEPTOR COREPRESSOR 2
Search in ModeBase
Cis-element ? help Back to Top
SourceLink
PlantRegMapAraip.E4ENU
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_020966324.10.0uncharacterized protein LOC107617278 isoform X2
RefseqXP_025675909.10.0uncharacterized protein LOC112776118 isoform X2
TrEMBLA0A444XTP90.0A0A444XTP9_ARAHY; Uncharacterized protein
STRINGGLYMA20G31871.10.0(Glycine max)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF49863352
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G52250.10.0MYB family protein