PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sobic.004G204000.2.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family B3
Protein Properties Length: 1000aa    MW: 112416 Da    PI: 9.0187
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sobic.004G204000.2.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B330.75.7e-103284111099
                           HHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE- CS
                    B3  10 vlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvfr 98 
                           +++++ lv+p + a + ++ +++   + l+d++g+  +v +  +k  ++ v+ +GW  Fv+ + +k g+f++F+++ +s+f   v+vfr
  Sobic.004G204000.2.p 328 NNSEKFLVIPPTVAPRLEYLTNQ--LVYLKDSEGKCSKVLV--SKVAETLVFHQGWDIFVSNHLIKWGEFLLFEYIAESTF--SVRVFR 410
                           4566679*******999888544..8***************..*********************************98999..999998 PP

                           S CS
                    B3  99 k 99 
                           +
  Sobic.004G204000.2.p 411 T 411
                           6 PP

2B331.14.2e-109359933596
                           EEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SS.SEE..EEEE CS
                    B3  35 tltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgr.sefelvvkv 96 
                            ++l+d+  r W + +    ++  + +t+GWk+Fv an+L++gD++ +    + ++ e+v +v
  Sobic.004G204000.2.p 935 VVMLKDPMKRLWPIIY--HDNPIFVGFTAGWKHFVAANNLQTGDVCELIK--EsEDDEPVYSV 993
                           5899************..****************************8883..33666666555 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF1019361.1E-14317414IPR015300DNA-binding pseudobarrel domain
Gene3DG3DSA:2.40.330.101.4E-14317413IPR015300DNA-binding pseudobarrel domain
PROSITE profilePS5086310.362319412IPR003340B3 DNA binding domain
SMARTSM010194.0E-4322412IPR003340B3 DNA binding domain
CDDcd100171.11E-7322410No hitNo description
PfamPF023623.7E-8327411IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.104.1E-11900992IPR015300DNA-binding pseudobarrel domain
SMARTSM010192.7E-4905999IPR003340B3 DNA binding domain
CDDcd100175.71E-9911994No hitNo description
SuperFamilySSF1019363.34E-10913986IPR015300DNA-binding pseudobarrel domain
PfamPF023622.4E-7935992IPR003340B3 DNA binding domain
PROSITE profilePS508638.98936997IPR003340B3 DNA binding domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1000 aa     Download sequence    Send to blast
MIGSAPDSAR RKPSHDRAND EVRHGDTVVK NTKRSADAFT EKRKKKDHLY NANHGKDRDI  60
RNGDQRIKKS RVPAASSEKG REKGDHDLNK TSSKKKMMGV DLDKKRTVTS RRKKLERERK  120
KKLLINASIR KNMQRDDGEE GRIMSNYYET KVKSKKVSTT LSDKERNKEK LNKTHREKKM  180
QAADSKMRNH DSGVNKGKVS TTFCDKEKKR KRPSNTNSEK ETAPITYAVK EKKMRTTESV  240
EIKMRHDRQN RRNVSLDVSN EKMDTSSGSN YKIRKRKLAH TLLKEKKRMR YNDSDKIHSG  300
RVKEMEKISG GKEKNNQAPI AFLKFIRNNS EKFLVIPPTV APRLEYLTNQ LVYLKDSEGK  360
CSKVLVSKVA ETLVFHQGWD IFVSNHLIKW GEFLLFEYIA ESTFSVRVFR TDSCERVDFN  420
PESTNKGGRK KQAWSNMPPD DLVITDGSSQ NIDDGYYVSG ECPRTKVPQT CHVTCNTKND  480
PKQVEHVVGS GVMAQDNNGK SIDPQCKTKG TSPLCSKGKT LITLIDSEDS EPLEHENGDT  540
MKLATSVADS DTSLVAVNTN EGPIRAQSGI GNGPSVVLGD EKGSSPEIEC GTKSISTTCS  600
EGKTRSQIII TSTALLDLHD SDEDLGRKQR TNVVPLDSIT PVIDYHNHSK TDIIQNLYRK  660
YEAPGGFRCL EKWRKDVVNN QASLDCTVPI KPENPQKNDS MLVDGYGSIE LNPVDEYICS  720
EGNHECVQPL FTMPIKEPSS ADRVTNCGHD GTEIDYSINE KDGGASVLLE AKGERLEPMG  780
SIVHSQSNNA PLCANPVVPG PIEKTSSPDE ISKCSSSMIE IEHNVNEKGT PVQFETQMDQ  840
VEPVRSSVRS KSRNIVVRAN ESEHCFSKQE GRMPSNTEVP EPLLPMKDKI LELDYHSPPE  900
INSQLCIPDT TQKWLGLSKS LSSAVIRQRR HHWDVVMLKD PMKRLWPIIY HDNPIFVGFT  960
AGWKHFVAAN NLQTGDVCEL IKESEDDEPV YSVRMCGKI*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1103121KKRTVTSRRKKLERERKKK
Cis-element ? help Back to Top
SourceLink
PlantRegMapSobic.004G204000.2.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_021315763.10.0B3 domain-containing protein Os02g0598200
TrEMBLA0A1Z5RNC30.0A0A1Z5RNC3_SORBI; Uncharacterized protein
STRINGSb04g025030.10.0(Sorghum bicolor)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G18960.18e-11B3 family protein