PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Gh_A03G0365
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties Length: 760 aa    MW: 85754.9 Da    pI: 4.8833
Description GRAS family protein
Gene Model
Gene Model ID  Type    Source   Coding Sequence
Gh_A03G0365    genome  NAU-NBI  View CDS
Signature Domain
Signature Domain
No.  Domain  Score  E-value   Start  End  HMM Start  HMM End
1    GRAS    355.2  9.7e-109  383    757  1          373
         GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkfshlta 98 
                  l++lL+ cA+avs++d + a++lL++++e++sp gd++qRla  f+ +L+arl +s+  +    ++ +++ ++ ++ l+a+k++   +P+ k++ l a
  Gh_A03G0365 383 LRTLLILCAQAVSADDRRTASELLKQIKEHSSPLGDANQRLAYIFADGLEARLDGSGALIHVFYASLASKMTTAADILKAYKAYLCSCPFTKLAILFA 480
                  5789***************************************************777776666777777777************************* PP

         GRAS  99 NqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetgerLakfAeelgvpfefnvlvakrledleleeL 194
                  N+ I+  +e+ + +Hi+Df+i +G+QWp L+q L++Rp+gpp+lRiTg++ p+ g   +e++eetg+rLak++e+++vpfe+n+++++++e++++e++
  Gh_A03G0365 481 NKSIYHMAEKTSVLHIVDFGILYGFQWPILIQHLSTRPGGPPKLRITGIEIPQRGfrPAERIEETGRRLAKYCERFNVPFEYNPIAVEHWETIQIEDI 578
                  *****************************************************99******************************************* PP

         GRAS 195 rvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpreseerikvErellgr 292
                  +++++E+laVn+ ++ h+llde+++++ +r+++Lkl+++++P+++v++  +  +n++ F++rf e l + sa+fd +e++lpre+ +r + Ere+ gr
  Gh_A03G0365 579 KIDSNEMLAVNSLFRFHNLLDETADVDCPRNAMLKLIRKMKPDIFVHSIVNGAYNAPFFVTRFKEVLFHISAVFDVFENTLPREELARLMFEREFYGR 676
                  ************************************************************************************************** PP

         GRAS 293 eivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveeesgslvlgWkdrpLvsvSaW 373
                  e++nv+aceg++r++r et+++W+ r  + GFkp+pl+++ +k ++  l+  + + + ++e+++++++gWk+r L+  S+W
  Gh_A03G0365 677 EAMNVIACEGSARVQRPETYKQWQIRTLREGFKPLPLDQELMKIIRDKLKAWYHKDFVIDEDNHWMLQGWKGRILYGSSCW 757
                  *****************************************************888************************* PP

Protein Features
3D Structure
Database         Entry ID  E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50985   64.667    357    738  IPR005202    Transcription factor GRAS
Pfam             PF03514   3.4E-106  383    757  IPR005202    Transcription factor GRAS
Sequence
Protein Sequence    Length: 760 aa
MTMDPNSIEI SDYLNCFKVE DHTFHNGFEF NVPSPDLNFM NMNVPFIPLD SDPGINVPSI  60
TASSDGSPFS ASTGWSPLGE SYSPPSDSDS TDPVLKYISQ MLMEENMEDK PYMFNDYLAL  120
EDTEKSLYDA LVSNIIQPVK VESPDSNLFG TNGHSDASIS SRSGTSDHIN PRGIGEVGGP  180
DPSLLRAPYS LQPDLQQSSS QFSVDSVNSL SNIGNGLMES SVSELLVKNI FSDKESVLQF  240
QRGFEEASKF IPSSEQLVID LESSTFAVGK KVDVPKVVVK VEKDEREISS NGLTGRKNHE  300
RDDWELEDER SNKQSATYTE ESDLSEVFDK VLLCTEGKTM CGIDQTVQHG ETDSSQHEEQ  360
LDGSIVGRNR SKRQGKKKEV VDLRTLLILC AQAVSADDRR TASELLKQIK EHSSPLGDAN  420
QRLAYIFADG LEARLDGSGA LIHVFYASLA SKMTTAADIL KAYKAYLCSC PFTKLAILFA  480
NKSIYHMAEK TSVLHIVDFG ILYGFQWPIL IQHLSTRPGG PPKLRITGIE IPQRGFRPAE  540
RIEETGRRLA KYCERFNVPF EYNPIAVEHW ETIQIEDIKI DSNEMLAVNS LFRFHNLLDE  600
TADVDCPRNA MLKLIRKMKP DIFVHSIVNG AYNAPFFVTR FKEVLFHISA VFDVFENTLP  660
REELARLMFE REFYGREAMN VIACEGSARV QRPETYKQWQ IRTLREGFKP LPLDQELMKI  720
IRDKLKAWYH KDFVIDEDNH WMLQGWKGRI LYGSSCWVPA
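
The sequence block above is laid out in groups of 10 residues with a running position count at the end of each full row. As a minimal sketch (the helper name `parse_sequence_block` is hypothetical and not part of PlantTFDB), the following Python snippet shows one way to collapse such a block into a plain residue string, assuming every letter in the block is a residue. Only the first two rows are included here for brevity.

```python
import re


def parse_sequence_block(text: str) -> str:
    """Join a grouped, numbered sequence block into one residue string.

    Assumption: every letter in the block is a residue; spaces, newlines
    and the running position counts at the end of each row are dropped.
    """
    return re.sub(r"[^A-Za-z]", "", text).upper()


# First two rows of the protein sequence block, copied from the record.
block = """\
MTMDPNSIEI SDYLNCFKVE DHTFHNGFEF NVPSPDLNFM NMNVPFIPLD SDPGINVPSI  60
TASSDGSPFS ASTGWSPLGE SYSPPSDSDS TDPVLKYISQ MLMEENMEDK PYMFNDYLAL  120
"""

seq = parse_sequence_block(block)
print(len(seq))   # number of residues recovered from the two rows
print(seq[:10])   # first group of ten residues
```

Running the same helper on the full block should give a string whose length matches the stated protein length, which is a quick check that no row was lost when copying the record.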
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
5b3h_A  2e-45   372          759        7          379      Protein SCARECROW
5b3h_D  2e-45   372          759        7          379      Protein SCARECROW
Nucleic Localization Signal
NLS
No.  Start  End  Sequence
1    367    372  RNRSKR
Expression -- Description
Source   Description
UniProt  TISSUE SPECIFICITY: Expressed in roots, shoots, flowers and siliques. {ECO:0000269|PubMed:10341448, ECO:0000269|PubMed:18500650}.
Functional Description
Source   Description
UniProt  Probable transcription factor involved in plant development. {ECO:0000250}.
Annotation -- Protein
Source     Hit ID              E-value  Description
Refseq     XP_017639189.1      0.0      PREDICTED: scarecrow-like protein 33
Swissprot  Q9XE58              0.0      SCL14_ARATH; Scarecrow-like protein 14
TrEMBL     A0A2P5W8M7          0.0      A0A2P5W8M7_GOSBA; Uncharacterized protein
STRING     Gorai.003G130500.1  0.0      (Gossypium raimondii)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM35827189
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G07530.1  0.0      SCARECROW-like 14
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]