PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID KXZ47433.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Chlamydomonadales; Volvocaceae; Gonium
Family MYB_related
Protein Properties Length: 2635aa    MW: 255157 Da    PI: 6.5805
Description MYB_related family protein
Gene Model
Gene Model ID Type Source Coding Sequence
KXZ47433.1genomeGPGRPView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding22.33e-076341747
                     HHTTTT-HHHHHHHHTTTS-HHHHHHHHHHH CS
  Myb_DNA-binding 17 kqlGggtWktIartmgkgRtlkqcksrwqky 47
                     +++G   W++I +++g ++t+ q++s+ qky
       KXZ47433.1  6 RLYGRQ-WRKIEEHVG-TKTAVQIRSHAQKY 34
                     678866.*********.*************9 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5129416.041139IPR017930Myb domain
CDDcd001670.00325435No hitNo description
SuperFamilySSF466893.49E-6440IPR009057Homeodomain-like
PfamPF002491.0E-5534IPR001005SANT/Myb domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 2635 aa     Download sequence    Send to blast
MTTGLRLYGR QWRKIEEHVG TKTAVQIRSH AQKYFNKLEK GGGGEEAIEV PPPRPKRKSA  60
PRADGAGAGA GAGAAAAGAG AGTDGGEFGS DGEAGMQAPD DCAPAAELQA LQLQQLKAES  120
PSLPPLGLAA LPALAGQLSG QLQPPLGAAA GTQDLLQRMQ APGAGQQQSQ PPQHGVLGLP  180
PLPRAPSLPL QGQHPPQQQQ LLLQAPGLTG LLPVQPHQQQ HQHQQQQQQQ QPMLPPQPQL  240
HQQQQQSGGG PVPTLADSIA RNIAMNMAQW AHAQVAGHAQ EGPGGPMGQP SAAGPHGLTP  300
DRQTVEAVAA AAAAAAAAAA TAVISAAGEA IQRRIQEKAA AGFLPFLVHP LAELIPPTLA  360
ASASAATTSD QSAPPWAVPP PQQSLMRVLA ASSSGRGGAG TGPAAGYPQS QSQQHSQQRT  420
QPFSLDQGAP SGLGGLTTGS DPLTREETTA GQVTATPGSS QQQQAVSGPH NSPQHMSPSF  480
AICAAQGGAG LGSGGGGGGC ATGSGSVGLG VNSTSLLRLA SLGAPSISLA ATGGSLGGGA  540
TAAGSADGGQ QGSQPMAWEM AQQAGMELGS MGARGHPGSM TGMGAAAATL RVLGGLPAPW  600
DDQLQQLQLQ QLQQQQQQLH HQQQHLQQQQ EALGRFMGAY GDGSNIFSFD AAQQMATAAA  660
AVGAEGLQAS AGGGGRDAQR QLSPPPPPQR RDSRLGMLPA AQPQPMSAVQ PPLTAFPLLR  720
PPGAALASPY TTPGAMAASG PALFDPRVRA RELQQRQAEE PQAMQLDAPE AAATDPRVAG  780
RVSHATQPRV QWDDAPDVGG KREPVEEEAR ELQPGRGCRR AVPAQDGVDA AAAPDGSRRA  840
AGAGQQRPAG MAIGTGSGSG LCLALMDSGW ADEFGAAASG DGGYQLGQLQ YLQQQSGPGL  900
SGTTGSALNS LQEVLAGPDL TAMVPLLVPS SGATSAQPDA GGGSGGGAAA ASARATVAAA  960
AAHAARLHAS TSGDQTTQNN QHHHQPHAQQ QHGGDGQHGR TAAVRWSQIQ VPGGPQKAQQ  1020
QGTDAGSGGN TTTDVNAYIR DFSLVSPSGG QQPPLSSSGP SSLPSTQQLR NGGDAAAPPL  1080
QPPAAAAAAG RSGSDAVAPG TAAAAAARGT EAGASNFGSG SGAAAEGMVV DGRQSSGPTG  1140
LLARQYGNRQ ETPPVGVSGG TRQGVSPSPG PCGPSGYQAS GPAATGHGSG AGDDTGTGPA  1200
AAGGGLVSGS GSGSDGAGRG GGDGNGDRPA GQGASGIGGS TLAPLNKQQK RHQQQRTVGE  1260
SEREQSAEAG PQDAPAPLAA SGGAGGKGEG GGAAAADEEA DAQGAARRRQ GCADPESNLA  1320
AAAAAVGGGA SAAADAAGLQ QRQGSQEPRP GGGPRQSPDS PIKRQPSQQR GPQPPPPASA  1380
PSQQQLRSAA VPQPTPAAAG AETLALLQRM LMLQQLQAQS DQTVSPQPPS LMSTLAGLSA  1440
GAQAVANMLA LQQQQQQQQR QLEAVLSEPV PRQPLALPPT AAGPETMSTD AGGSAADADG  1500
GGGPMAAARL ELLQSTAEAA AAAPEGLAAL LPYLCDRGAV GSEQIAQLEE MARALQYASS  1560
LAGPLAAVLQ RPALQRAFGA GVASQGSGGA LPPAGQCDED GGADARTAAQ QCPPAVTSDA  1620
EAPEAEGPLA SGDGPLGAAH AAGRAPSARG AAAAGESRQP PQGAPPGLFP GASSDPGLAS  1680
LLLHWHAAAA ASYGYGLHPL VRPQQQPQLE ESMLAAAAVA AAPSFLPHPP HPLLMQQHYM  1740
KQHLAAAALG LSLGGGGGGQ ASRGQSTEQE RKAARRAATT AEISGRTSGG ADGSAGNAAA  1800
AVAAANAANQ AAAPQRGRVQ GPGSGQPQAA AFDAGLGSGG GAAGGHWRGA SAGVPSDPRL  1860
SSGGPRRSSG DGGGGAGAGG FRQPPRPVPN FADAAAAAAA AAVRVMSGGA DGGAGSDKGG  1920
AAGAGGSGKR SLGDVYGNAH AAAAAQRPSR HKAARRRHHR GCSPQRSVGS GASSSGFHGG  1980
GDKSGGEEAH HCPRPSREYG NQDGGALAGG AAAQNAAADQ QDTATSSGGK DGHGSGAGGG  2040
AGCGVHGGGA GGAAGLMLPP KKHMRSGHDP RAPGGGGAAG SGGAAAAGGG GAAAAAGRQA  2100
HLQKDATPSG AGSGGSGGGS IGGNKSGTSN DAYDGSMNAS DEGSNEPGSA PADGPMGQPL  2160
PPSDGEGRLG MGAGRDGDGV NVQGMVVAGA AIIAGGARSF SVAQPDPLSG GAAAAPGGGA  2220
SGPAAGAGHK RQRFELPAGQ AAPPEAAPVG TSSEEAAGGK QAGGGSADGP PPHKVHRTGP  2280
ASGPHGAAGV DVNTTSAGAA ADAAAAAGGD SGGDVDMDDA ADAADATRAV GREPQPQARP  2340
AGSLPSQLAY VLEQGSSKQG GDTGHNNSGT GPRSGVGSNE GALGGNGGGR CRSNEGGNGN  2400
GSQGNGSQGN GNGSHGNGSN GNGSHGQGSN GNNGNGSNGN GASTNLPGSG HHAYFRWPGG  2460
GALHRGGGGA GGAGAGGGSR ADAGGGGAGG SGAADGSGGG ADGRIAAAGS GGGVDGASGG  2520
DRLSPYDDGG AAAGAATPPE DHHHHHSHHG RGERRNQEGT SMRTQTPQDS DQDPERRLGD  2580
ASWQQHALPY RHHQPDAVGE GALGRGAAGH GAQAEGGAAG GGGLGPDEAA DNGGQ
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
122572264GGKQAGGG
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
TrEMBLA0A150GCD70.0A0A150GCD7_GONPE; Uncharacterized protein
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
ChlorophytaeOGCP4541427
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G01060.34e-14MYB_related family protein