PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sobic.001G166100.10.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family B3
Protein Properties Length: 1089aa    MW: 122713 Da    PI: 9.2159
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sobic.001G166100.10.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B350.63.4e-1634122599
                            -..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE.. CS
                     B3   5 ltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefel 92 
                            l+p  +++++ +v+pk+f++++ gk   s  ++le+++  s++v +    + + +v++ GW +Fv+++++ke+D ++F++++ s+f  
  Sobic.001G166100.10.p  34 LSPMTASSKHSMVVPKRFLKHFAGK--LSGIIKLESPNRGSYDVGI--IEHCNNVVFRHGWGQFVESHHIKENDYLLFRHVEGSCF-- 115
                            56677888999********888777..6679***************..9*******************************998999.. PP

                            EEEEE-S CS
                     B3  93 vvkvfrk 99 
                            +v +f++
  Sobic.001G166100.10.p 116 KVLIFDS 122
                            9999875 PP

2B338.91.5e-122563351696
                            EE--HHH.HTT---..--SEEEEEE.TTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEE..E-SS.SEE..EEEE CS
                     B3  16 lvlpkkfaeehggkkeesktltled.esgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFk..ldgr.sefelvvkv 96 
                            l ++k +a +h   +++s+ +tl+   ++++W  k++ ++k+ +++l++ W +Fv++n+++egD+++F   + gr s+f  +v +
  Sobic.001G166100.10.p 256 LAISKGYALAHF--PRKSMNVTLQRpGKSKKWHPKFC-KRKDAQMLLKGQWMDFVRDNHVQEGDICIFLptMAGRrSTF--TVYL 335
                            789999999997..56899******5566*******4.444445899999******************94333443444..6655 PP

3B357.52.4e-18431523597
                            -..-HHHHT.T-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SS.SEE CS
                     B3   5 ltpsdvlks.grlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgr.sef 90 
                            +++s+v+ + + lv+ k++a+eh  +  es+ +tle + gr+W+ +l +r  +++++lt+ W++Fv++n+ +  D+++F+ + + ++f
  Sobic.001G166100.10.p 431 MKKSNVNHLrSDLVICKDYAAEHFPQ--ESQFITLERPGGRKWRTRLYVRPDGRAFMLTTRWQNFVHDNHFQKDDICLFQPMPNeKGF 516
                            444555444456***********855..6678***************88999999*************************99666999 PP

                            ..EEEEE CS
                     B3  91 elvvkvf 97 
                            + +v+++
  Sobic.001G166100.10.p 517 RVMVHLL 523
                            9999876 PP

4B353.93.2e-17803890998
                            HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SS.SEE..EEE CS
                     B3   9 dvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgr.sefelvvk 95 
                              l+s +lv+ k +a++h  +  es+ +tle + g++W  kl +r  +++y+l++ Wk+Fv++n+L+e D+++F+ + + ++f+ + +
  Sobic.001G166100.10.p 803 KRLNS-NLVICKGYAAQHFPQ--ESQFITLECPGGKRWHPKLHVRPDGRGYMLSTQWKNFVRDNRLREDDICLFQPMPSeKGFRVMAH 887
                            34444.5***********855..6678***************99*******************************8844499988887 PP

                            EE- CS
                     B3  96 vfr 98 
                            ++r
  Sobic.001G166100.10.p 888 LLR 890
                            776 PP

5B331.23.9e-10101210842199
                             HH.HTT..---..--SEEEEEE.TTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE-S CS
                     B3   21 kfaeeh..ggkkeesktltled.esgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvfrk 99  
                             + a+e+  +gk  +   ltl+   +gr W+  l   ++   ++ t+ W+eFv++ gL++gD+++F+ ++ ++  ++v+++r+
  Sobic.001G166100.10.p 1012 SDAAEYlpDGK--Q--SLTLRWqGQGRAWRTDL--HNRL--MLATGEWREFVRDSGLEDGDICLFEPMK-ERLAMLVHIIRS 1084
                             55666654333..4..4555554699*******..4443..556677*******************887.788899999885 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF1019366.08E-2228130IPR015300DNA-binding pseudobarrel domain
Gene3DG3DSA:2.40.330.104.0E-2028125IPR015300DNA-binding pseudobarrel domain
PROSITE profilePS5086311.98430123IPR003340B3 DNA binding domain
CDDcd100175.26E-2031121No hitNo description
SMARTSM010194.3E-1533123IPR003340B3 DNA binding domain
PfamPF023622.9E-1434122IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.101.2E-18232338IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019362.75E-17233337IPR015300DNA-binding pseudobarrel domain
CDDcd100172.50E-17240336No hitNo description
SMARTSM010192.2E-4240339IPR003340B3 DNA binding domain
PROSITE profilePS5086311.194241339IPR003340B3 DNA binding domain
PfamPF023624.4E-10256336IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.105.8E-21419521IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019365.1E-21421525IPR015300DNA-binding pseudobarrel domain
CDDcd100176.30E-19425521No hitNo description
SMARTSM010199.3E-7427526IPR003340B3 DNA binding domain
PROSITE profilePS5086311.476428526IPR003340B3 DNA binding domain
PfamPF023627.1E-17430523IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.104.4E-20785889IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019363.73E-20787891IPR015300DNA-binding pseudobarrel domain
CDDcd100172.25E-17791884No hitNo description
SMARTSM010190.0043793885IPR003340B3 DNA binding domain
PROSITE profilePS5086311.265794892IPR003340B3 DNA binding domain
PfamPF023626.2E-16805890IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.101.0E-139801085IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019361.8E-119851085IPR015300DNA-binding pseudobarrel domain
SMARTSM010190.00379911086IPR003340B3 DNA binding domain
PROSITE profilePS5086312.0559921085IPR003340B3 DNA binding domain
CDDcd100173.39E-910141083No hitNo description
PfamPF023624.5E-810211084IPR003340B3 DNA binding domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1089 aa     Download sequence    Send to blast
MAGSGASSIQ KPCDACKRYL DHLDGKNQNV RSFLSPMTAS SKHSMVVPKR FLKHFAGKLS  60
GIIKLESPNR GSYDVGIIEH CNNVVFRHGW GQFVESHHIK ENDYLLFRHV EGSCFKVLIF  120
DSDGCEKVFP CAGIRSVEYV DISSSSHHET TESLASERFV RCQKGSSCHR GKTAKMAAAF  180
SSSEESGENI PSKNKSSELD DLQTPLRQHY VLSQRSYLSE AQEERVIALI QEIQPESTAF  240
IAVMCKSHVQ PPCPYLAISK GYALAHFPRK SMNVTLQRPG KSKKWHPKFC KRKDAQMLLK  300
GQWMDFVRDN HVQEGDICIF LPTMAGRRST FTVYLIQATT TCSRGGSGKR GSLSRQKETA  360
KKAATSSLYE DSGGEDSLSG YESIQLDHFK AFSKRNYVLS AWCHLTAEQE EKIVALVKKV  420
QPEIPFLVVQ MKKSNVNHLR SDLVICKDYA AEHFPQESQF ITLERPGGRK WRTRLYVRPD  480
GRAFMLTTRW QNFVHDNHFQ KDDICLFQPM PNEKGFRVMV HLLHEPSTRS SSLCRHVHGL  540
NSHINRGVTP TAHVHEKSGS ERDSLSCQKE TTKKAGTSSL HEESGEDSLS GHESIQSDHV  600
KAFSERNYVL SARCHLTAEQ EEEIITLVKK VQPAIPFLVI QMKKSNVNRL SSNLLRKDDI  660
CLFQPMPSEK GFRVMVHLLC EPRTRSSSLG GHAHGLNSHI KRVTSTAHVH EKSGSERGSL  720
SCQKETANKA RTSSLYEESE EGTLSGYEST QLDHVKAFSE RYYVLSARCH LTAEQKEKIV  780
ALVKKVQPEI PVLVVKMKKI NVKRLNSNLV ICKGYAAQHF PQESQFITLE CPGGKRWHPK  840
LHVRPDGRGY MLSTQWKNFV RDNRLREDDI CLFQPMPSEK GFRVMAHLLR ERSTRSSSSD  900
GHVHGLHSHI ERGLASTAHV HEKSGSENSG LLDLHKRQPV QQGHQVLNDC GGASSSKPPL  960
YVVLGGTCLT PAQDKVVQEK AMAIKAEVSI FVATMNKKIL GYNNEAFILD FSDAAEYLPD  1020
GKQSLTLRWQ GQGRAWRTDL HNRLMLATGE WREFVRDSGL EDGDICLFEP MKERLAMLVH  1080
IIRSKQYS*
Cis-element ? help Back to Top
SourceLink
PlantRegMapSobic.001G166100.10.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_021313447.10.0uncharacterized protein LOC8062507 isoform X2
TrEMBLA0A1Z5S5Y70.0A0A1Z5S5Y7_SORBI; Uncharacterized protein
STRINGSb01g014595.10.0(Sorghum bicolor)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G33280.15e-19B3 family protein