PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Peinf101Scf00061g04007.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Petunioideae; Petunia; Petunia integrifolia
Family GRAS
Protein Properties Length: 524 aa    MW: 57355.9 Da    pI: 5.9752
Description GRAS family protein
Gene Model
Gene Model ID               Type      Source    Coding Sequence
Peinf101Scf00061g04007.1    genome    SGN       View CDS
Signature Domain
No.    Domain    Score    E-value     Start    End    HMM Start    HMM End
1      GRAS      350      3.7e-107    152      524    3            374
                      GRAS   3 elLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsev 87 
                               + ++e+A+a+s+g++  a ++L+rls+ a+ +g++ qRl+ay++ AL++r++     +    p+ e    +s e++ +++ ++ev
  Peinf101Scf00061g04007.1 152 QSIIEAATAISEGKKDVAVEILTRLSQVANVRGSSDQRLTAYMVSALRSRVNP----TDYPPPVME---LHSREHVDSTQNLYEV 229
                               6789************************************************9....222233333...35899999999***** PP

                      GRAS  88 sPilkfshltaNqaIleavege...ervHiiDfdisqGlQWpaLlqaLasRp.....egppslRiTgvgspesgskeeleetger 164
                               sP++k+++++aN aIleav+++   ++vH+iDfdi+qG+Q++ Ll+aLa+       ++pp l+iT++ ++  g  ++l+++g  
  Peinf101Scf00061g04007.1 230 SPCFKLGFMAANLAILEAVAEQplnNKVHVIDFDIGQGGQYLHLLHALAAAMksdsnKPPPVLKITAFTDQVGGVDNRLNSIGVE 314
                               *********************9999***********************9544347777899*********8888*********** PP

                      GRAS 165 LakfAeelgvpfefnvlvakrledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhn 249
                               L+ +A+++gv + fnv ++ ++++++ ++L ++ +EalaVn++++l+rl+desv++e+ rde+L+ vk+lsPkvv++veqe++ n
  Peinf101Scf00061g04007.1 315 LKALASKIGVCLFFNV-MSCNITEMSRDKLGIEADEALAVNFAFKLYRLPDESVTTENLRDELLRRVKGLSPKVVTMVEQELNGN 398
                               ***************9.7889**************************************************************** PP

                      GRAS 250 sesFlerflealeyysalfdsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaa 334
                               +++Fl+r+ ea+ yy alfdsl+a+++r++ er+k+E   l+r+  n vaceg++r+er+e ++kWr+r+++aGF + p+s+ +a
  Peinf101Scf00061g04007.1 399 TAPFLARVNEACGYYGALFDSLDATVARDNMERVKIESG-LSRKMANSVACEGRDRVERCEVFGKWRARMSMAGFVSRPMSQLVA 482
                               ***************************************.********************************************* PP

                      GRAS 335 kqaklllrkvk..sdgyrveeesgslvlgWkdrpLvsvSaWr 374
                               ++ ++ l+     + g++v+e+sg +++gW++r+L+++SaWr
  Peinf101Scf00061g04007.1 483 NSLRSKLNSGTrgNPGFTVNEQSGGICFGWMGRTLTVASAWR 524
                               ***9999876556899*************************8 PP

Protein Features
Database           Entry ID    E-value     Start    End    InterPro ID    Description
PROSITE profile    PS50985     47.283      124      502    IPR005202      Transcription factor GRAS
Pfam               PF03514     1.3E-104    152      524    IPR005202      Transcription factor GRAS
Gene Ontology
GO Term       GO Category           GO Description
GO:0005634    Cellular Component    nucleus
GO:0005737    Cellular Component    cytoplasm
Sequence
Protein Sequence    Length: 524 aa
MSSGFSGDFY GGINTGRSSM IPMNNNNNNT LRPQVQQLPY GTQISGMLPD HVSSLNPIHN  60
KAAVQESEKK MMNQLQELEK QLLEDEEEDG DTVSVVTNNE WSDTIQNLIT PCSQNQNQNQ  120
NQNQKLASLS PSSSTSSCAS STESPPITCP KQSIIEAATA ISEGKKDVAV EILTRLSQVA  180
NVRGSSDQRL TAYMVSALRS RVNPTDYPPP VMELHSREHV DSTQNLYEVS PCFKLGFMAA  240
NLAILEAVAE QPLNNKVHVI DFDIGQGGQY LHLLHALAAA MKSDSNKPPP VLKITAFTDQ  300
VGGVDNRLNS IGVELKALAS KIGVCLFFNV MSCNITEMSR DKLGIEADEA LAVNFAFKLY  360
RLPDESVTTE NLRDELLRRV KGLSPKVVTM VEQELNGNTA PFLARVNEAC GYYGALFDSL  420
DATVARDNME RVKIESGLSR KMANSVACEG RDRVERCEVF GKWRARMSMA GFVSRPMSQL  480
VANSLRSKLN SGTRGNPGFT VNEQSGGICF GWMGRTLTVA SAWR
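
The sequence display groups residues in blocks of ten and ends each full line with a running position counter. As a minimal sketch (not part of the PlantTFDB record; the function name clean_sequence is hypothetical), the following Python strips that formatting so a residue count can be checked against the counters. Only the last two display lines are used here as an excerpt.

```python
import re


def clean_sequence(block: str) -> str:
    """Remove whitespace and position counters from a display-formatted
    protein sequence, keeping only the residue letters (upper-cased)."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


# Last two display lines of the record (residues 421-524).
excerpt = """
DATVARDNME RVKIESGLSR KMANSVACEG RDRVERCEVF GKWRARMSMA GFVSRPMSQL  480
VANSLRSKLN SGTRGNPGFT VNEQSGGICF GWMGRTLTVA SAWR
"""

tail = clean_sequence(excerpt)
print(len(tail))   # 104 residues, i.e. positions 421 through 524
print(tail[-4:])   # SAWR, the C-terminal residues of the GRAS domain hit
```

Applying the same function to all nine display lines should give a string whose length matches the stated 524 aa.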
3D Structure
PDB ID    E-value    Query Start    Query End    Hit Start    Hit End    Description
5b3g_A    2e-37      156            523          25           378        Protein SCARECROW
5b3h_A    2e-37      156            523          24           377        Protein SCARECROW
5b3h_D    2e-37      156            523          24           377        Protein SCARECROW
Functional Description
Source     Description
UniProt    Probable transcription factor involved in plant development. {ECO:0000250}.
Regulation -- PlantRegMap
Source         Upstream Regulator    Target Gene
PlantRegMap    Retrieve              -
Annotation -- Protein
Source       Hit ID                  E-value    Description
Refseq       XP_016564536.1          0.0        PREDICTED: scarecrow-like protein 8
Swissprot    Q9FYR7                  1e-153     SCL8_ARATH; Scarecrow-like protein 8
TrEMBL       A0A2G3CWM8              0.0        A0A2G3CWM8_CAPCH; Uncharacterized protein
STRING       PGSC0003DMT400026240    0.0        (Solanum tuberosum)
Orthologous Group
Lineage     Orthologous Group ID    Taxa Number    Gene Number
Asterids    OGEA4023                24             43
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT5G52510.1    1e-139     SCARECROW-like 8