PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID 462853135
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Chloridoideae; Eragrostideae; Eragrostidinae; Eragrostis
Family MYB_related
Protein Properties Length: 1384aa    MW: 155509 Da    PI: 9.9642
Description MYB_related family protein
Gene Model
Gene Model ID Type Source Coding Sequence
462853135genomeTefView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding22.13.5e-07153189341
                      SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHH CS
  Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqck 41 
                       W+++E e+++ a +++G++ W+++a  +   Rt +++ 
        462853135 153 QWSKDELERFYGAYRKYGKD-WRKVAGAIR-DRTSEMVE 189
                      6*****************99.*********.***98875 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM007172677127IPR001005SANT/Myb domain
SuperFamilySSF466895.62E-579122IPR009057Homeodomain-like
SuperFamilySSF466892.84E-9149197IPR009057Homeodomain-like
PROSITE profilePS512939.479149200IPR017884SANT domain
SMARTSM007170.0056150198IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.601.7E-4153193IPR009057Homeodomain-like
PfamPF002491.1E-5153193IPR001005SANT/Myb domain
CDDcd001673.37E-4154196No hitNo description
PfamPF065841.0E-319111011IPR033471DIRP domain
SMARTSM011352.6E-559111012IPR033471DIRP domain
PfamPF065841.0E-3112511351IPR033471DIRP domain
SMARTSM011352.6E-5512511352IPR033471DIRP domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1384 aa     Download sequence    Send to blast
MDSRVVCGVG YSNSLLLELF VDRGLRVKLV VRFSMASSRK VRNANKRYAK INEDWQDKDD  60
TNVHKSKVRK KKISDMLGSQ WSKDELERFY GAYRKYGKDW RKVVRFSMAS SRKVRNANKR  120
YAKINEDWQD KDDTNVHKSK VRKKKISDML GSQWSKDELE RFYGAYRKYG KDWRKVAGAI  180
RDRTSEMVEA LYNMNKDGSN SDRESNDSPK ASRKPQKRGR AKFQSVSKTS DTCYPDLLQS  240
QPASSSYGCL SLLKKKRSGG NRPRAVGKRT PRVPVSSMYY RDDRGVAERR AKADANNGDD  300
EGAHVAALAL AEVCQRGGSP QVSETHGRSG DRMFLSPVKS SDRKNADSEM GSSKLHGFHL  360
DADYPEGSLG SREAETGDYT KGASFFMTNE GSASGKPQKK VKKSQKRRKK AARKTGDQFE  420
DDREACSGTE EGHSARKAKE ESDMEALGWP STSNKRSRQL FFGGNRPRAV GKRTPRVPVS  480
SMYYRDDRGV ADRRAKADAN NGDDEGAHVA ALALAEVCQR GGSPQVSETH GRSGDRMFLS  540
PVKSSDRKNA DSEMGSSKLH GFHLDADYPE GSLGSREAET GDYTKGASFF MTNEGSASGK  600
PQKKVKKSQK RRKKAARKTG DQFEDDREAC SGTEEGHSAR KAKEESDMEA LGWPSTSNKR  660
SRQLFFGDES SALDALHTLA DLSVNILQPS SVVESESSAQ IKDENKDDDS DEKPSMPAAV  720
SVYEQKIGSK STARKAKRQS ETANTEMVTR KKAKLVKDPR HDGSSTDVKQ QACTCGVKAE  780
KKKRKSSTAK ISKDERNILK DVEKTEVSAE EGKVSSNKGT LLILEKAVDI AETTTQGETT  840
PQADLSSKGK SRRKLGIQQA LTEECKPTKG TDDTGSDKFS YSVNNVVDLK DKLSHCLSSR  900
LLRRWCMFEW FYSAIEYPWF AKSEFVEYLN HVKLGHVPRL TRVEWGVIRS SLGKPRRLSK  960
QFLHEEREKL SQYRDSVRQH YAELRSGVRE GLPTDLARPL AVGQRVIACH PRTRELHDGN  1020
VLTVDNNCCR VQFDRPELGV EFVMDIDCMP LHPMENFPES LRQQNIDNKY LSEVKLEDQM  1080
KELGSGGAAR FTSNVNVNGA DATFHIPSGH PIDTLMKQAK GDTINSIAQA KATVNEVAVA  1140
AQQAMYNQPC SHSQIQEREA DIKALAELSR ALDKKATTRF QPLDLSSKGK SRRKLGIQQA  1200
LTEECKPTKG TDDTGSDKFS YSVNNVVDLK DKLSHCLSSR LLRRWCMFEW FYSAIEYPWF  1260
AKSEFVEYLN HVKLGHVPRL TRVEWGVIRS SLGKPRRLSK QFLHEEREKL SQYRDSVRQH  1320
YAELRSGVRE GLPTDLARPL AVGQRVIACH PRTRELHDGN VLTVDNNCCR VQFDRPELGV  1380
EFVM
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1601617QKKVKKSQKRRKKAARK
2604610VKKSQKR
Cis-element ? help Back to Top
SourceLink
PlantRegMap462853135
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_012703938.10.0protein ALWAYS EARLY 2 isoform X1
TrEMBLA0A3L6PQS60.0A0A3L6PQS6_PANMI; Protein ALWAYS EARLY 2 isoform X2
STRINGSi025839m0.0(Setaria italica)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP62302331
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G21430.21e-125DNA binding