PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Glyma.20G162100.1.p
Common Name GLYMA_20G162100, LOC100800097
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Phaseoleae; Glycine; Soja
Family GRAS
Protein Properties Length: 595aa    MW: 65434.9 Da    PI: 8.1689
Description GRAS family protein
Gene Model
Gene Model ID        Type    Source  Coding Sequence
Glyma.20G162100.1.p  genome  JGI     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    GRAS    306.8  5e-94    238    593  2          373
                 GRAS   2 velLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetsekn.sseelaalklfsevsPi 90 
                          ++ L+e+A a+s+g +  a+++L+rl ++      + qR + +++ AL++r+++        +++ +   +  s+e++ +++l++e s +
  Glyma.20G162100.1.p 238 KQSLTEAAIAISEGRFDTATEILTRLLQN------SDQRFVNCMVSALKSRMNH--------VECPPPVAELfSIEHAESTQLLFEHSLF 313
                          578***********************886......579***************9........333333322349**************** PP

                 GRAS  91 lkfshltaNqaIleavege.ervHiiDfdisqGlQWpaLlqaLasRpegpp.slRiTgvgspesgskeeleetgerLakfAeelgvpfef 178
                          +k++ ++aN aIle++ +e  ++ ++Dfdi+ G Q+++Ll++L++R++g+p  ++i +v++  +g  e+l+++g  L + Ae+lg+ fef
  Glyma.20G162100.1.p 314 FKVARMVANIAILESALTEnGKLCVLDFDIGDGNQYVSLLHELSARRKGAPsAVKIVAVAE--NGADERLNSVGLLLGRHAEKLGIGFEF 401
                          **************7666559***********************999887769********..779************************ PP

                 GRAS 179 nvlvakrledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalf 268
                          +v + +r+++l+ e+L+++ +EalaVn++++l+r++desvs+e++rde+L+ vk l P+vv++veqea+ n+++F++r++e + yy alf
  Glyma.20G162100.1.p 402 KV-LIRRIAELTRESLDCDADEALAVNFAYKLYRMPDESVSTENPRDELLRRVKALAPRVVTLVEQEANANTAPFVARVSELCAYYGALF 490
                          **.699************************************************************************************ PP

                 GRAS 269 dsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveeesgsl 358
                          dsle++++re+++r+ +E+  l+r++ n vaceg++r+er+e ++kWr+r+++aGF+  pls+++a+++k+ l   +++   v+ e+g +
  Glyma.20G162100.1.p 491 DSLESTMARENSARVRIEEG-LSRKVGNSVACEGRNRVERCEVFGKWRARMSMAGFRLKPLSQRVAESIKARLGGAGNR-VAVKVENGGI 578
                          ********************.********************************************************77.9********* PP

                 GRAS 359 vlgWkdrpLvsvSaW 373
                          ++gW++r+L+++SaW
  Glyma.20G162100.1.p 579 CFGWMGRTLTVASAW 593
                          *************** PP

Protein Features
Database         Entry ID  E-value  Start  End  InterPro ID  Description
PROSITE profile  PS50985   40.643   211    575  IPR005202    Transcription factor GRAS
Pfam             PF03514   1.7E-91  238    593  IPR005202    Transcription factor GRAS
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 595 aa
MSSPRFPGGG ASDFYGGAGG FPSQFTAVQP TMNNHSATTP RPLYRSQPSI LLNPSSHIAQ  60
HQPSPLIGKR TLAEFQTHNL NSSNNNNNNP LLSNYLLRSV KPRTFQHTEL STFPSNRYGL  120
PLLHHLRPNA VNAQQQQPVT NSILPNTNYF PPVRSRLTAP HELEKNSIDR RLQELEKQLL  180
EDNEDEQGDA VSVITNTTTT SEWSHTIQNL ITPQKPTSSS PTSSTTSSNS SVESTSSKQS  240
LTEAAIAISE GRFDTATEIL TRLLQNSDQR FVNCMVSALK SRMNHVECPP PVAELFSIEH  300
AESTQLLFEH SLFFKVARMV ANIAILESAL TENGKLCVLD FDIGDGNQYV SLLHELSARR  360
KGAPSAVKIV AVAENGADER LNSVGLLLGR HAEKLGIGFE FKVLIRRIAE LTRESLDCDA  420
DEALAVNFAY KLYRMPDESV STENPRDELL RRVKALAPRV VTLVEQEANA NTAPFVARVS  480
ELCAYYGALF DSLESTMARE NSARVRIEEG LSRKVGNSVA CEGRNRVERC EVFGKWRARM  540
SMAGFRLKPL SQRVAESIKA RLGGAGNRVA VKVENGGICF GWMGRTLTVA SAWC*
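The Signature Domain table gives the GRAS domain as residues 238 to 593 of this protein. As a minimal sketch of how those 1-based, inclusive coordinates map onto the sequence block above, the following Python strips the grouping spaces, the running position numbers and the trailing "*" stop marker, then slices out the domain. The helper names (clean_sequence, subsequence) are illustrative and not part of PlantTFDB. The cleaned string has 594 residues; the listed length of 595 may count the stop marker, but that is an assumption rather than something the record states.

```python
import re

# Sequence block copied verbatim from the record above: residues grouped in
# tens, a running position number at the end of each full line, and a
# trailing "*" stop marker.
RAW = """
MSSPRFPGGG ASDFYGGAGG FPSQFTAVQP TMNNHSATTP RPLYRSQPSI LLNPSSHIAQ  60
HQPSPLIGKR TLAEFQTHNL NSSNNNNNNP LLSNYLLRSV KPRTFQHTEL STFPSNRYGL  120
PLLHHLRPNA VNAQQQQPVT NSILPNTNYF PPVRSRLTAP HELEKNSIDR RLQELEKQLL  180
EDNEDEQGDA VSVITNTTTT SEWSHTIQNL ITPQKPTSSS PTSSTTSSNS SVESTSSKQS  240
LTEAAIAISE GRFDTATEIL TRLLQNSDQR FVNCMVSALK SRMNHVECPP PVAELFSIEH  300
AESTQLLFEH SLFFKVARMV ANIAILESAL TENGKLCVLD FDIGDGNQYV SLLHELSARR  360
KGAPSAVKIV AVAENGADER LNSVGLLLGR HAEKLGIGFE FKVLIRRIAE LTRESLDCDA  420
DEALAVNFAY KLYRMPDESV STENPRDELL RRVKALAPRV VTLVEQEANA NTAPFVARVS  480
ELCAYYGALF DSLESTMARE NSARVRIEEG LSRKVGNSVA CEGRNRVERC EVFGKWRARM  540
SMAGFRLKPL SQRVAESIKA RLGGAGNRVA VKVENGGICF GWMGRTLTVA SAWC*
"""


def clean_sequence(raw: str) -> str:
    """Keep letters only: drops whitespace, position numbers and '*'."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def subsequence(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError(f"coordinates {start}-{end} outside 1-{len(seq)}")
    return seq[start - 1:end]


SEQ = clean_sequence(RAW)

# Start and End columns of the GRAS row in the Signature Domain table.
GRAS_START, GRAS_END = 238, 593
GRAS_DOMAIN = subsequence(SEQ, GRAS_START, GRAS_END)

if __name__ == "__main__":
    print("protein residues:", len(SEQ))
    print("domain residues: ", len(GRAS_DOMAIN))
    print("domain begins:   ", GRAS_DOMAIN[:10])
    print("domain ends:     ", GRAS_DOMAIN[-10:])
```

The first and last residues of the slice can be checked by eye against the query rows of the HMM alignment, which begin at position 238 and end at position 593.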
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
5hyz_A  7e-35   266          593        38         374      GRAS family transcription factor containing protein, expressed
Expression -- UniGene
UniGene ID  E-value  Expressed in
Gma.30369   0.0      leaf, root, seed coat
Expression -- Description
Source   Description
UniProt  TISSUE SPECIFICITY: Expressed in seedlings, roots, leaves and siliques. {ECO:0000269|PubMed:10341448, ECO:0000269|PubMed:18500650}.
Functional Description
Source   Description
UniProt  Probable transcription factor involved in plant development. {ECO:0000250}.
Cis-element
Source       Link
PlantRegMap  Glyma.20G162100.1.p
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID           E-value  Description
Refseq     XP_003556120.1   0.0      scarecrow-like protein 8
Swissprot  Q9FYR7           1e-145   SCL8_ARATH; Scarecrow-like protein 8
TrEMBL     I1NGX2           0.0      I1NGX2_SOYBN; Uncharacterized protein
STRING     GLYMA20G30150.1  0.0      (Glycine max)
Orthologous Group
Lineage               Orthologous Group ID  Taxa Number  Gene Number
Fabids                OGEF62663151
Representative plant  OGRP7599918
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G52510.1  1e-135   SCARECROW-like 8