PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cz15g16100.t1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Sphaeropleales; Chromochloridaceae; Chromochloris
Family MYB_related
Protein Properties Length: 2354aa    MW: 247954 Da    PI: 6.4379
Description MYB_related family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cz15g16100.t1genomePhytozomeView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding33.21.2e-1016531697248
                       SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHHT CS
  Myb_DNA-binding    2 grWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqkyl 48  
                       + WT++E e+++d ++++G + W + a  ++  +tl q+k ++q+y+
    Cz15g16100.t1 1653 QVWTADERERFLDTFRRYGRD-WGRLADAIP-SKTLVQIKTFYQNYK 1697
                       68*****************77.*********.**************7 PP

Sequence ? help Back to Top
Protein Sequence    Length: 2354 aa     Download sequence    
MLQKREWPPP IDKPSFVGRG GPFGGRRGRL DHPGGGFLDR PPLPLPNRPP FYGGGGRGPP  60
APPGPPRRAP PGPYSSGPDW PRPGPPLGDG WMGPGGYPGS REGFPVGRSN SPHEYSPRPA  120
WDKRSSDGMS APGPPRRDRD PGCVDMLASP CDWDRDKDWG RERDRDRERE REPRARSWER  180
DRDRGGRERD RDRERDRERE LERPGARLPV VDSRPPGIGA PSTSSSRAPR WSDELEAGEL  240
PAEQAVSPPT RSYRLDRADM SIKSPRMAGG REVPLHVSPR ASRDVPARPS HMPVSSLDPS  300
PATAGSSALS PAAAVQPSNM PSLPPGFRAQ RAAGSPSAAV KVAASPEELD HLYSPPTSDD  360
TPAGSPAELA AAAATTPVRA VAPHDPPRAA SVMLQPRLNG TLEQPKPLDR PVPLERPVPS  420
MQRWASNPMP SQPQQAASSA MPSHARNALP SRGSNGTVPP QSSSTLQHTP TSIISPQKQQ  480
QQDTTAAMHK SASLSSNLPI KLEPRRSMSG IPTLEIIDSK PTTTSIPSPF SVPHIKVTDI  540
GTAPSDLPSV GTSEDPAPKR RRIGFGMGLA RLGTAKLTTE PSKPLDSITP SDTAETPVAL  600
QSESSQQQQQ QQQQHGDAPD SVTQSMGIAQ PVSAVKDTVE APFEATTAAA MGGSTSGSIQ  660
PCSSTELQSP LDAAGQMRPL VDAASIPDVR LQPALGPVEP QANAASFETA QHMPAEQQHQ  720
HEDAVHAAFE PHAVAPVQGD FQADAAETPA ASADGQAAAA AQAAIGNDEH GMVIDSRPAE  780
TSFSAQPSSE VAQAHDQPAL SYAENAMNAS TSTPAGATKQ QQLTAAVTQG HTSTAGDIST  840
LAAASAAPAQ EALPPAAHSI THGTLDDRQI SAQAGPANLV PNTPLAMQNH LLQAVGFGDT  900
LLPVVLPSLG KDEISDAIER LEEQAAGLEA ELAALQQQQR QLVQDITAVD EALKDTDRDE  960
FTQAASAGSP RQQQQEDVLQ LPPAKRIKLD ISIRQQQQRQ QQQQQQEQEV ASDNEGSSGV  1020
RQDPEEFSEV VDTVVKEELA DQPSGELEQE QEQQQQQQQQ QPAKLRKARR KVQVGATAVP  1080
ADLWAFAEAF QAKDLQEQVQ FLDGNQAIAV AARTEFLKLL PDSMQMEAGE SIPTAKPLKP  1140
LYSGPSDWPR YADMQQQHEQ TAPAAKAYLQ QRHTLLRIKQ RTLAEHYKAG VASFEKYMRE  1200
IRSNQAKEAP PALPTAPLRT SRSGMHPSNA VRTEYEEVQL MSVFKAIEDL KNMTACPDMV  1260
LDPWVRRWQR FVSVNGLVAD PERDLQEDKT GKDWSTKERR IFMDRFLQHP KDFRAISSYL  1320
QNRSQAECIV FFYKHQKLDE FANVRRKQQL KKRRLQTETK RTLYGPHRIL PASARPEPTP  1380
VAAVGNRARG RSRGRARGRN RVTNDSVALD DGDEDGVYAG GEGFCSRGGD EVDVASLGIV  1440
GSHGFSDQQF MEAVRACGKD MGAISAHLGH VKGAGYVKWW FSRNKNRLGL EQIVKNREKH  1500
GYMAMARGVG LKAADDEDDA DREDMGMEGE DVAGVVQALA GMQGRLGEGL DLSRGRHGGI  1560
IGLEHVLHGL AHGSEGVAMN AASLLGIDPH APAAQALLAN LPMLPTLLSG GPSLFQSAME  1620
RARTSSPMGD GCQDNAAAGQ EEMLAAFSKR SGQVWTADER ERFLDTFRRY GRDWGRLADA  1680
IPSKTLVQIK TFYQNYKSKL GLDKLEAAAG VTHPRNRGGR PRLQLPRGSS GAMTDFGEDS  1740
LGPGGLPSGR LDTASSLKSL PGFRGALDDV ADGADYATSN AAGEGSFTGG SGPPGQQGGA  1800
DSTIEEQLRA AAAAAAAGVD QSLHGSLASR SAGDDRANMD LQALAAAAAA SLSQAHGREG  1860
INIMSLLQDG IFNTMPSGVD SERQLPGEEI KLFAGQKASG PSSSLPSLND LPAADRLLLG  1920
GAAYKPHQMA AARAAAAAAV AASAGFPLLS GYNAASLAHA GLPVRGLMGK LPGAGLLAAP  1980
DQGYLHPLAA QLGMYGLNQQ GPSADPVTAA VLSQLSNAAA LHAAGLNAHD PATVAALNPN  2040
LALQLAVAGA GAMPLDAAMA MALLAAQNAA HGGHHPVSAS MGHGGGSDTP AGAEAASHQH  2100
DDEAGNVHFS RGVNTAGLSA LPAAEAAAAA ASASDKAATS KAMARVPHSN AMGSQPMLTY  2160
AIEGFPHAMD IQQALKDAVA SINAHDISRA GGIASSMGPA LAWPPPAALQ GAAGALGTGF  2220
AVPAHAAAAL GAQAKQMPLL GAQLARLPHQ PGLTDALALL QANQMTGIPL HPSLLQVSSS  2280
QVLAGADVLP PQMLGIAVRP NLAAAGAPAW AGGQLNVPLE LLQQLDANAA RLAALQQLHQ  2340
AQQPPKEGPA GSK*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1161171RERDRDRERER
2187197RERDRDRERER
3189199RERDRDRERER
4559563KRRRI
5982990PPAKRIKLD
613871397RERDRDRERER
713891399RERDRDRERER
813911401RERDRDRERER
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G52250.12e-15MYB family protein