PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sobic.001G166100.9.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family B3
Protein Properties Length: 1089aa    MW: 122713 Da    PI: 9.2159
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sobic.001G166100.9.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B350.63.4e-1634122599
                           -..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..E CS
                    B3   5 ltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelv 93 
                           l+p  +++++ +v+pk+f++++ gk   s  ++le+++  s++v +    + + +v++ GW +Fv+++++ke+D ++F++++ s+f  +
  Sobic.001G166100.9.p  34 LSPMTASSKHSMVVPKRFLKHFAGK--LSGIIKLESPNRGSYDVGI--IEHCNNVVFRHGWGQFVESHHIKENDYLLFRHVEGSCF--K 116
                           56677888999********888777..6679***************..9*******************************998999..9 PP

                           EEEE-S CS
                    B3  94 vkvfrk 99 
                           v +f++
  Sobic.001G166100.9.p 117 VLIFDS 122
                           999875 PP

2B338.91.5e-122563351696
                           EE--HHH.HTT---..--SEEEEEE.TTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEE..E-SS.SEE..EEEE CS
                    B3  16 lvlpkkfaeehggkkeesktltled.esgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFk..ldgr.sefelvvkv 96 
                           l ++k +a +h   +++s+ +tl+   ++++W  k++ ++k+ +++l++ W +Fv++n+++egD+++F   + gr s+f  +v +
  Sobic.001G166100.9.p 256 LAISKGYALAHF--PRKSMNVTLQRpGKSKKWHPKFC-KRKDAQMLLKGQWMDFVRDNHVQEGDICIFLptMAGRrSTF--TVYL 335
                           789999999997..56899******5566*******4.444445899999******************94333443444..6655 PP

3B357.52.4e-18431523597
                           -..-HHHHT.T-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SS.SEE. CS
                    B3   5 ltpsdvlks.grlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgr.sefe 91 
                           +++s+v+ + + lv+ k++a+eh  +  es+ +tle + gr+W+ +l +r  +++++lt+ W++Fv++n+ +  D+++F+ + + ++f+
  Sobic.001G166100.9.p 431 MKKSNVNHLrSDLVICKDYAAEHFPQ--ESQFITLERPGGRKWRTRLYVRPDGRAFMLTTRWQNFVHDNHFQKDDICLFQPMPNeKGFR 517
                           444555444456***********855..6678***************88999999*************************996669999 PP

                           .EEEEE CS
                    B3  92 lvvkvf 97 
                            +v+++
  Sobic.001G166100.9.p 518 VMVHLL 523
                           999876 PP

4B353.93.2e-17803890998
                           HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SS.SEE..EEEE CS
                    B3   9 dvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgr.sefelvvkv 96 
                             l+s +lv+ k +a++h  +  es+ +tle + g++W  kl +r  +++y+l++ Wk+Fv++n+L+e D+++F+ + + ++f+ + ++
  Sobic.001G166100.9.p 803 KRLNS-NLVICKGYAAQHFPQ--ESQFITLECPGGKRWHPKLHVRPDGRGYMLSTQWKNFVRDNRLREDDICLFQPMPSeKGFRVMAHL 888
                           34444.5***********855..6678***************99*******************************88444999888877 PP

                           E- CS
                    B3  97 fr 98 
                           +r
  Sobic.001G166100.9.p 889 LR 890
                           76 PP

5B331.23.9e-10101210842199
                            HH.HTT..---..--SEEEEEE.TTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE-S CS
                    B3   21 kfaeeh..ggkkeesktltled.esgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvfrk 99  
                            + a+e+  +gk  +   ltl+   +gr W+  l   ++   ++ t+ W+eFv++ gL++gD+++F+ ++ ++  ++v+++r+
  Sobic.001G166100.9.p 1012 SDAAEYlpDGK--Q--SLTLRWqGQGRAWRTDL--HNRL--MLATGEWREFVRDSGLEDGDICLFEPMK-ERLAMLVHIIRS 1084
                            55666654333..4..4555554699*******..4443..556677*******************887.788899999885 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF1019366.08E-2228130IPR015300DNA-binding pseudobarrel domain
Gene3DG3DSA:2.40.330.104.0E-2028125IPR015300DNA-binding pseudobarrel domain
PROSITE profilePS5086311.98430123IPR003340B3 DNA binding domain
CDDcd100175.26E-2031121No hitNo description
SMARTSM010194.3E-1533123IPR003340B3 DNA binding domain
PfamPF023622.9E-1434122IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.101.2E-18232338IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019362.75E-17233337IPR015300DNA-binding pseudobarrel domain
CDDcd100172.50E-17240336No hitNo description
SMARTSM010192.2E-4240339IPR003340B3 DNA binding domain
PROSITE profilePS5086311.194241339IPR003340B3 DNA binding domain
PfamPF023624.4E-10256336IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.105.8E-21419521IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019365.1E-21421525IPR015300DNA-binding pseudobarrel domain
CDDcd100176.30E-19425521No hitNo description
SMARTSM010199.3E-7427526IPR003340B3 DNA binding domain
PROSITE profilePS5086311.476428526IPR003340B3 DNA binding domain
PfamPF023627.1E-17430523IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.104.4E-20785889IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019363.73E-20787891IPR015300DNA-binding pseudobarrel domain
CDDcd100172.25E-17791884No hitNo description
SMARTSM010190.0043793885IPR003340B3 DNA binding domain
PROSITE profilePS5086311.265794892IPR003340B3 DNA binding domain
PfamPF023626.2E-16805890IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.101.0E-139801085IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019361.8E-119851085IPR015300DNA-binding pseudobarrel domain
SMARTSM010190.00379911086IPR003340B3 DNA binding domain
PROSITE profilePS5086312.0559921085IPR003340B3 DNA binding domain
CDDcd100173.39E-910141083No hitNo description
PfamPF023624.5E-810211084IPR003340B3 DNA binding domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1089 aa     Download sequence    Send to blast
MAGSGASSIQ KPCDACKRYL DHLDGKNQNV RSFLSPMTAS SKHSMVVPKR FLKHFAGKLS  60
GIIKLESPNR GSYDVGIIEH CNNVVFRHGW GQFVESHHIK ENDYLLFRHV EGSCFKVLIF  120
DSDGCEKVFP CAGIRSVEYV DISSSSHHET TESLASERFV RCQKGSSCHR GKTAKMAAAF  180
SSSEESGENI PSKNKSSELD DLQTPLRQHY VLSQRSYLSE AQEERVIALI QEIQPESTAF  240
IAVMCKSHVQ PPCPYLAISK GYALAHFPRK SMNVTLQRPG KSKKWHPKFC KRKDAQMLLK  300
GQWMDFVRDN HVQEGDICIF LPTMAGRRST FTVYLIQATT TCSRGGSGKR GSLSRQKETA  360
KKAATSSLYE DSGGEDSLSG YESIQLDHFK AFSKRNYVLS AWCHLTAEQE EKIVALVKKV  420
QPEIPFLVVQ MKKSNVNHLR SDLVICKDYA AEHFPQESQF ITLERPGGRK WRTRLYVRPD  480
GRAFMLTTRW QNFVHDNHFQ KDDICLFQPM PNEKGFRVMV HLLHEPSTRS SSLCRHVHGL  540
NSHINRGVTP TAHVHEKSGS ERDSLSCQKE TTKKAGTSSL HEESGEDSLS GHESIQSDHV  600
KAFSERNYVL SARCHLTAEQ EEEIITLVKK VQPAIPFLVI QMKKSNVNRL SSNLLRKDDI  660
CLFQPMPSEK GFRVMVHLLC EPRTRSSSLG GHAHGLNSHI KRVTSTAHVH EKSGSERGSL  720
SCQKETANKA RTSSLYEESE EGTLSGYEST QLDHVKAFSE RYYVLSARCH LTAEQKEKIV  780
ALVKKVQPEI PVLVVKMKKI NVKRLNSNLV ICKGYAAQHF PQESQFITLE CPGGKRWHPK  840
LHVRPDGRGY MLSTQWKNFV RDNRLREDDI CLFQPMPSEK GFRVMAHLLR ERSTRSSSSD  900
GHVHGLHSHI ERGLASTAHV HEKSGSENSG LLDLHKRQPV QQGHQVLNDC GGASSSKPPL  960
YVVLGGTCLT PAQDKVVQEK AMAIKAEVSI FVATMNKKIL GYNNEAFILD FSDAAEYLPD  1020
GKQSLTLRWQ GQGRAWRTDL HNRLMLATGE WREFVRDSGL EDGDICLFEP MKERLAMLVH  1080
IIRSKQYS*
Cis-element ? help Back to Top
SourceLink
PlantRegMapSobic.001G166100.9.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_021313447.10.0uncharacterized protein LOC8062507 isoform X2
TrEMBLA0A1Z5S5Y70.0A0A1Z5S5Y7_SORBI; Uncharacterized protein
STRINGSb01g014595.10.0(Sorghum bicolor)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G33280.15e-19B3 family protein