PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Carubv10004949m
Common Name CARUB_v10004949mg
Organism Capsella rubella
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Capsella
Family GRAS
Protein Properties Length: 407 aa    MW: 45038.2 Da    pI: 6.3586
Description GRAS family protein
Gene Model
Gene Model ID    Type    Source  Coding Sequence
Carubv10004949m  genome  JGI     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    GRAS    376.7  3e-115   42     400  2          373
             GRAS   2 velLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlar.svselykalppsetseknsseelaalklfsevsPilkfs 94 
                       +lLl+cAe vs+++l +a++lL+++se++sp g++ +R++ayf++AL++r+ +   s+ + +l+ ++ s ++s + ++al++f+ vsP++kfs
  Carubv10004949m  42 LSLLLQCAEYVSTDHLPEASTLLSEISEICSPFGSSPERVVAYFAQALQTRVISsYLSGACVSLSEKPLSVSQSRKIFSALQTFNSVSPLIKFS 135
                      689************************************************999788999999*******99********************** PP

             GRAS  95 hltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesgskeeleetgerLakfAeelgvpfefnvlvakrled 188
                      h+taNqaI +a++ge+ vHiiD+d++qGl+WpaL++ LasRp++  s+RiTg+gs    s++ l +tg+rLa+fA++l++pfef+++  k  + 
  Carubv10004949m 136 HFTANQAIFQALDGEDSVHIIDLDVMQGLHWPALFHILASRPRKLRSIRITGFGS----SSDLLASTGRRLADFASSLNLPFEFHPIEGKIGNL 225
                      *******************************************************....9**************************98888889 PP

             GRAS 189 leleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadh.nsesFlerflealeyysalfdsleaklpresee 281
                      +++++L +++gEa++V+++   hrl+d +++  +    +L+++++l+P++++vveqe+++ +++sFl rf+eal+yysalfd+l  kl++es e
  Carubv10004949m 226 IDPSQLGTRQGEAVVVHWMQ--HRLYDVTGNDLE----TLEILRRLKPNLITVVEQELSYdDGGSFLGRFVEALHYYSALFDALGDKLGEESGE 313
                      9******************9..****88888777....**********************8999****************************** PP

             GRAS 282 rikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveeesgslvlgWkdrpLvsvSaW 373
                      r +vE+ +l+ ei+n+va+ g + r       kW+e l+++GF+pv+l+ + a+qa lll + +++gy++ ee+g+l lgWkd +L+++SaW
  Carubv10004949m 314 RFTVEQLVLATEIRNIVAHGGGR-RR----RVKWKEELNRVGFRPVSLRGNPATQAGLLLGMLPWNGYTLVEENGTLRLGWKDLSLLTASAW 400
                      ********************997.33....357*********************************************************** PP

Protein Features
Database         Entry ID  E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50985   56.537    15     381  IPR005202    Transcription factor GRAS
Pfam             PF03514   1.0E-112  42     400  IPR005202    Transcription factor GRAS
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0048366  Biological Process  leaf development
GO:0090610  Biological Process  bundle sheath cell fate specification
GO:0005634  Cellular Component  nucleus
Sequence
Protein Sequence    Length: 407 aa
MTSKRFDRDF QPSDDPSAAK RRIEEFSDEA LRSGGAAAIK LLSLLLQCAE YVSTDHLPEA  60
STLLSEISEI CSPFGSSPER VVAYFAQALQ TRVISSYLSG ACVSLSEKPL SVSQSRKIFS  120
ALQTFNSVSP LIKFSHFTAN QAIFQALDGE DSVHIIDLDV MQGLHWPALF HILASRPRKL  180
RSIRITGFGS SSDLLASTGR RLADFASSLN LPFEFHPIEG KIGNLIDPSQ LGTRQGEAVV  240
VHWMQHRLYD VTGNDLETLE ILRRLKPNLI TVVEQELSYD DGGSFLGRFV EALHYYSALF  300
DALGDKLGEE SGERFTVEQL VLATEIRNIV AHGGGRRRRV KWKEELNRVG FRPVSLRGNP  360
ATQAGLLLGM LPWNGYTLVE ENGTLRLGWK DLSLLTASAW ISQPFD*
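
To reuse the sequence above, the numbered layout can be collapsed into a plain string. The following is a minimal Python sketch and is not part of PlantTFDB; the helper name `parse_numbered_sequence` is hypothetical. It assumes that tokens made only of digits are position counters and that the trailing `*` marks the stop codon. Counted this way, the block holds 406 residues plus the `*`, so the listed length of 407 appears to include the stop symbol. The slice uses the Signature Domain coordinates from this record (42 to 400, 1-based and inclusive).

```python
def parse_numbered_sequence(block: str) -> str:
    """Join a space-grouped, position-numbered sequence block into one string.

    Assumption: any token consisting only of digits is a running
    position counter (e.g. "60") and not part of the sequence.
    """
    parts = []
    for line in block.splitlines():
        for token in line.split():
            if token.isdigit():
                continue  # skip the position counter at the end of a line
            parts.append(token)
    return "".join(parts)


# Sequence block copied from this record.
raw = """
MTSKRFDRDF QPSDDPSAAK RRIEEFSDEA LRSGGAAAIK LLSLLLQCAE YVSTDHLPEA  60
STLLSEISEI CSPFGSSPER VVAYFAQALQ TRVISSYLSG ACVSLSEKPL SVSQSRKIFS  120
ALQTFNSVSP LIKFSHFTAN QAIFQALDGE DSVHIIDLDV MQGLHWPALF HILASRPRKL  180
RSIRITGFGS SSDLLASTGR RLADFASSLN LPFEFHPIEG KIGNLIDPSQ LGTRQGEAVV  240
VHWMQHRLYD VTGNDLETLE ILRRLKPNLI TVVEQELSYD DGGSFLGRFV EALHYYSALF  300
DALGDKLGEE SGERFTVEQL VLATEIRNIV AHGGGRRRRV KWKEELNRVG FRPVSLRGNP  360
ATQAGLLLGM LPWNGYTLVE ENGTLRLGWK DLSLLTASAW ISQPFD*
"""

full = parse_numbered_sequence(raw)   # includes the trailing "*"
protein = full.rstrip("*")            # residues only

# GRAS domain, positions 42..400 (1-based, inclusive) -> Python slice [41:400]
domain = protein[41:400]

print(len(full), len(protein), len(domain))
print(domain[:9], domain[-3:])
```

The first residues of `domain` can be compared against the query row of the alignment shown under Signature Domain as a quick check that the coordinates were applied with the right offset.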
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
5b3g_A  1e-146  47           400        25         378      Protein SCARECROW
5b3h_A  1e-146  47           400        24         377      Protein SCARECROW
5b3h_D  1e-146  47           400        24         377      Protein SCARECROW
Functional Description
Source   Description
UniProt  Probable transcription factor involved in plant development. {ECO:0000250}.
Cis-element
Source       Link
PlantRegMap  Carubv10004949m
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  AB017067  0.0      AB017067.1 Arabidopsis thaliana genomic DNA, chromosome 5, P1 clone:MJC20.
GenBank  BT029760  0.0      BT029760.1 Arabidopsis thaliana At5g41920 mRNA, complete cds.
GenBank  CP002688  0.0      CP002688.1 Arabidopsis thaliana chromosome 5 sequence.
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_006282634.1  0.0      scarecrow-like protein 23
Swissprot  Q9FHZ1          0.0      SCL23_ARATH; Scarecrow-like protein 23
TrEMBL     R0F1R5          0.0      R0F1R5_9BRAS; Uncharacterized protein
STRING     XP_006282634.1  0.0      (Capsella rubella)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids OGEM125122631
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G41920.1  0.0      GRAS family protein
Publications
  1. Ding Y, et al.
    Four distinct types of dehydration stress memory genes in Arabidopsis thaliana.
    BMC Plant Biol., 2013. 13: p. 229
    [PMID:24377444]
  2. Yoon EK, et al.
    Conservation and Diversification of the SHR-SCR-SCL23 Regulatory Network in the Development of the Functional Endodermis in Arabidopsis Shoots.
    Mol Plant, 2016. 9(8): p. 1197-1209
    [PMID:27353361]