PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sphfalx0018s0244.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Bryophyta; Sphagnophytina; Sphagnopsida; Sphagnales; Sphagnaceae; Sphagnum
Family B3
Protein Properties Length: 1076aa    MW: 121985 Da    PI: 8.0824
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sphfalx0018s0244.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B347.33.8e-15247336597
                           -..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..E CS
                    B3   5 ltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelv 93 
                           l p +v+ ++ l++p +f ++++  +++++ ++l d sg++W +    r +  ++ +t+GW+eF  a +L +gD++vF+l ++++  ++
  Sphfalx0018s0244.1.p 247 LRPFHVHGRAFLRFPVTFGKHFM--PTKTVPMRLIDASGNQWPMVW-LRDNETHMGFTRGWREFSLAEDLSDGDVCVFELLDTTDLMFR 332
                           56678889999********6666..457788***************.466666788***********************9998999888 PP

                           EEEE CS
                    B3  94 vkvf 97 
                           v+vf
  Sphfalx0018s0244.1.p 333 VHVF 336
                           8887 PP

2B335.41.9e-11695784398
                           EE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-.SS.SE CS
                    B3   3 kvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkld.gr.se 89 
                            v+++s+v ++ ++ +p  f ++h   + e ++l+l+d  g++    l     sg+ vlt+GW++F   + L+egD++vF+l+ ++ ++
  Sphfalx0018s0244.1.p 695 VVMKKSNVYRNFIVKIPPAFRQAHI--PCEATDLKLQDAVGQQTHAFL-----SGE-VLTAGWQQFSLHHLLEEGDTCVFELItNKtED 775
                           57899***************98884..345679**********99999.....333.7***********************99655799 PP

                           E..EEEEE- CS
                    B3  90 felvvkvfr 98 
                            +++v++fr
  Sphfalx0018s0244.1.p 776 LTFMVHIFR 784
                           999999998 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF1019363.73E-21236338IPR015300DNA-binding pseudobarrel domain
Gene3DG3DSA:2.40.330.104.5E-19237338IPR015300DNA-binding pseudobarrel domain
PROSITE profilePS5086312.731243339IPR003340B3 DNA binding domain
SMARTSM010192.4E-11243339IPR003340B3 DNA binding domain
CDDcd100174.22E-20246337No hitNo description
PfamPF023621.9E-12246336IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.106.3E-16686784IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019361.79E-16687784IPR015300DNA-binding pseudobarrel domain
CDDcd100173.64E-15691784No hitNo description
SMARTSM010190.0026693786IPR003340B3 DNA binding domain
PROSITE profilePS508639.939693786IPR003340B3 DNA binding domain
PfamPF023626.8E-9695784IPR003340B3 DNA binding domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1076 aa     Download sequence    Send to blast
MELQLHMKKL CFDIAEGGIM DETPPLQAFE SLCSSESKHV QDSSCGKAEA LDSPGAHWNL  60
NEPYLHEPSD HLCPANDEYG ETITATQEIQ LLAHDQLLQV KETDSDDVLQ CQTLLGKLPH  120
DAAGEDLNTN KEHLKASNSN ANEVKQDYSG VGPATRFASQ RRPVVETLRD IMHTDVAYRQ  180
SFQACSTSNA QNVQPMINQA GELHREHDKV ARHLIPYPGL VMEREKAMEL GRFTVLKSDK  240
PCHLVVLRPF HVHGRAFLRF PVTFGKHFMP TKTVPMRLID ASGNQWPMVW LRDNETHMGF  300
TRGWREFSLA EDLSDGDVCV FELLDTTDLM FRVHVFCTER KSPRSQKKDS SNQHDHHHRA  360
NARAKRLQLQ DDNHCNNPYN EVLLYSSGKK RELSGRINMH VGAEQDKETT ASQLSHTFSP  420
SLKTLYNAFL RLPPPTDASK KDAMNSRERE GNLPEEMTCE KASTTSLVLV MEDKVKTSES  480
LQVHSSSKSE RCCKCSKRHK LEPGTECETY MEPGIPQDNT TRNLECDGEL LFTACNTGIA  540
KQDDRLGKLE PHHWGATKVD PVFQKEQKLQ ECHQSITMQR AKPASNRSTA EVKKDSEISA  600
QWWTRTCKPD KQCLRWQSDM SSLRCRKKPG ASWRRLTASN TNLSRNRQAS SLLQARVRKQ  660
LLFVGSKRLP VTQLQREAAR DAACALKTMN PSVVVVMKKS NVYRNFIVKI PPAFRQAHIP  720
CEATDLKLQD AVGQQTHAFL SGEVLTAGWQ QFSLHHLLEE GDTCVFELIT NKTEDLTFMV  780
HIFRVVEVDH SKVQWQDHYM ILSRGKCRNN EMHLITCDKK TEQDSSLAFS EKKNLEDEDA  840
TGCVFPGLGS TNALCIAVQK CMVRHNKPQV AKSSVLRERL AYRLWSSSSH PQHTAKTNDA  900
SSPAKETTTT TTTICDSRVR GTLEELSKVN QAARSSNHLL TRITIPSYSR RIHEQQKKKK  960
KIILPEKKKK EKTSNQISKQ SRKGNNTNLR AAERMVMPLP ADDDDDDDGK GEGIYYRVVR  1020
IIKKRFFENE KHYLTELDGP VLQSNEDGVL TEYNGILWWV PERAFSAGFS SCYLD*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1956968KKKKKIILPEKKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G18990.11e-12B3 family protein