PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID NNU_012101-RA
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; stem eudicotyledons; Proteales; Nelumbonaceae; Nelumbo
Family GRAS
Protein Properties    Length: 783 aa    MW: 84714.7 Da    pI: 6.2332
Description GRAS family protein
Gene Model
Gene Model ID    Type      Source
NNU_012101-RA    genome    CAS
Signature Domain
No.  Domain  Score  E-value   Start  End  HMM Start  HMM End
1    GRAS    341.8  1.1e-104  421    782  1          374
           GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkfshl 96 
                    l+++L+++Ae v++g+   a+ +Larl+++ sp g+p+qR+a+yf+eAL+  l++ +++++ + pp+++++ +++ ++ a+k+fse+sP+l+f+++
  NNU_012101-RA 421 LIDQLFKAAELVEAGNSVHARGILARLNHQLSPVGKPLQRAAFYFKEALQLLLLS-SNNMATSPPPRNSTHFDVVLKIGAYKAFSEISPLLQFANF 515
                    5789**************************************************9.88888899999999999*********************** PP

           GRAS  97 taNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesgskeeleetgerLakfAeelgvpfefnvlvakrledl..e 190
                    t+Nqa+l+ + g +r+Hi Dfdi+ G+QW +++q+LasR  g+psl+iT+++s++s+++ el  t+e+L++fA+ lg+ fe +++  ++++    +
  NNU_012101-RA 516 TCNQALLDVLLGFDRIHIMDFDIGIGAQWSSFMQELASR--GAPSLKITAFASQASHDALELVLTRENLTHFANDLGIAFELDIVNLDSFDPAswS 609
                    **************************************9..99************9****************************988877654348 PP

           GRAS 191 leeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpreseerikvE 286
                    l +L+v ++Ea+aVnl +   + +  + s++    ++L++vk+lsPk+vv v++ +d+++ +F ++fl+al+ +s+l+dsl+a    +s++ +k+E
  NNU_012101-RA 610 LAQLHVAENEAVAVNLPVGSSSAH--PSSVP----SLLRFVKQLSPKIVVSVDRGCDRSDLPFSHHFLHALQSFSVLLDSLDAV-NVNSDAVHKIE 698
                    99****************877776..55555....5*********************************************665.5799******* PP

           GRAS 287 rellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveeesgslvlgWkdrpLvsvSaWr 374
                    ++ll+++i+++v  ++++     e++  Wr+ + +aGF+p+p++++a++qa++l+++ +++g++ve++++sl+l+W++r+Lvs+SaW+
  NNU_012101-RA 699 KFLLQPRIESIVLGRQRA----PEKMPPWRNLFASAGFSPLPFTNFAETQAEYLVKRLQVRGFHVEKRQASLILYWQRRELVSASAWK 782
                    *************99998....*****************************************************************8 PP
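The Start/End and HMM Start/HMM End columns can be turned into lengths and a coverage figure with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the alignment rows above (residues 421 to 515 on the first row span 95 residues). The function names `span_length` and `coverage` are illustrative only and are not part of PlantTFDB.

```python
def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive residue range."""
    if start < 1 or end < start:
        raise ValueError(f"invalid range: {start}-{end}")
    return end - start + 1


def coverage(start: int, end: int, total_length: int) -> float:
    """Fraction of a sequence of total_length covered by start..end."""
    return span_length(start, end) / total_length


# Values copied from this record.
PROTEIN_LENGTH = 783
GRAS_START, GRAS_END = 421, 782
HMM_START, HMM_END = 1, 374

domain_len = span_length(GRAS_START, GRAS_END)   # residues of the protein in the hit
hmm_len = span_length(HMM_START, HMM_END)        # HMM positions matched by the hit
domain_cov = coverage(GRAS_START, GRAS_END, PROTEIN_LENGTH)

print(domain_len, hmm_len, round(100 * domain_cov, 1))
```

Under these assumptions the GRAS hit spans 362 residues, a little under half of the 783 aa protein, and is aligned against 374 HMM positions.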

Protein Features
Database         Entry ID  E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50985   49.296    395    762  IPR005202    Transcription factor GRAS
Pfam             PF03514   4.0E-102  421    782  IPR005202    Transcription factor GRAS
Sequence
Protein Sequence    Length: 783 aa
MRGMPFNLQG KGVLEVAEIS PISGGKWKDS SCLGSEPTSV LDTRRSPSPP TSTSTLSSSL  60
GGGGSTDTAG VAAVSDNPTQ KWPPTQQQED SSAAVAEPGS CVGGGGGSRK DEWASELQPI  120
PTALEIVNGG ATGVEKCVLG MEDWESMLSE SASSPSQEQS LLRWIMGDVD DPSSGLKHLL  180
QGGGSSEFEG NAGGFGIVDQ GFALESVGGG ASVSGNVMGT INPSLAFPGS ICAPNNLNGR  240
AGSVPNTSAL PNYKVPCFGL NNNSNPPNPI NLPLPISFPP GMFFQQSQQQ QPQLEPADEK  300
PQLFNPPQVP INQQQAHHPQ NPTFFMPLPY TQQEQHLLPP QPKRYHATVD PSCQIPKVPF  360
SDSGQELFLR RQQQQQQQGF PPQLQLLSPH LPQRPTTMAT KPKVVGAGDE VAHQHQQQQA  420
LIDQLFKAAE LVEAGNSVHA RGILARLNHQ LSPVGKPLQR AAFYFKEALQ LLLLSSNNMA  480
TSPPPRNSTH FDVVLKIGAY KAFSEISPLL QFANFTCNQA LLDVLLGFDR IHIMDFDIGI  540
GAQWSSFMQE LASRGAPSLK ITAFASQASH DALELVLTRE NLTHFANDLG IAFELDIVNL  600
DSFDPASWSL AQLHVAENEA VAVNLPVGSS SAHPSSVPSL LRFVKQLSPK IVVSVDRGCD  660
RSDLPFSHHF LHALQSFSVL LDSLDAVNVN SDAVHKIEKF LLQPRIESIV LGRQRAPEKM  720
PPWRNLFASA GFSPLPFTNF AETQAEYLVK RLQVRGFHVE KRQASLILYW QRRELVSASA  780
WKC
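The sequence above is laid out in blocks of 10 residues with a running count at the end of each full line, so it has to be flattened before it can be indexed by the domain coordinates. The following is a minimal sketch of one way to do that. The helper names `parse_blocked_sequence` and `subsequence` are hypothetical, and only the last two lines of the record (residues 721 to 783) are used as sample input to keep the example short.

```python
import re


def parse_blocked_sequence(text: str) -> str:
    """Join letter blocks into one string, dropping spaces and running counts."""
    return "".join(re.findall(r"[A-Za-z]+", text)).upper()


def subsequence(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


# Last two lines of the protein sequence block in this record.
tail = """
PPWRNLFASA GFSPLPFTNF AETQAEYLVK RLQVRGFHVE KRQASLILYW QRRELVSASA  780
WKC
"""

tail_seq = parse_blocked_sequence(tail)
print(len(tail_seq))                  # number of residues recovered from the excerpt
print(subsequence(tail_seq, 61, 63))  # final three residues of the excerpt
```

Applied to the full block, the same parser should yield a string whose length matches the stated 783 aa, and `subsequence(seq, 421, 782)` would then extract the region reported for the GRAS domain.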
3D Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
5b3h_A  9e-37    406          781        1          377      Protein SCARECROW
5b3h_D  9e-37    406          781        1          377      Protein SCARECROW
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  KM660717  4e-38    KM660717.1 Buxus sempervirens clone VV_contig_1686 microsatellite sequence.
Annotation -- Protein
Source  Hit ID          E-value  Description
RefSeq  XP_019054166.1  0.0      PREDICTED: scarecrow-like protein 22
TrEMBL  A0A1U8Q7Z6      0.0      A0A1U8Q7Z6_NELNU; scarecrow-like protein 22
STRING  XP_010264405.1  0.0      (Nelumbo nucifera)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G00150.1  1e-117   GRAS family protein