PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Csa19g020620.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family GRAS
Protein Properties Length: 512aa    MW: 57652.7 Da    PI: 4.9452
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Csa19g020620.1genomeCSGPView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS270.74.7e-831455112374
            GRAS   2 velLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlar.svselykalppsetseknsseelaalklfsevsPilkfsh 95 
                     ++lL  cA  ++s++ +++q++L+ lselas++gd+++Rla + ++AL+++l++ s ++++++ p+ + ++ +++   + l  f+evsP++ + +
  Csa19g020620.1 145 EQLLNPCALTITSRNSSRVQHYLCVLSELASSSGDANRRLADFGLRALQHHLSSsSLPSISSSSPVVAFASAEVKMFQKTLLKFYEVSPWFALPN 239
                     689**************************************************989999999999999995555555555555************ PP

            GRAS  96 ltaNqaIleavege....ervHiiDfdisqGlQWpaLlqaLasRp.egppslRiTgvgspesg....skeeleetgerLakfAeelgvpfefnvl 181
                      +aN+aIl+ +++e    + +H++D+++s+G+QWp+Ll+aL++R+ ++pp++RiT+v++ +++       +  ++ ++L  fA++l+++++++v+
  Csa19g020620.1 240 NMANSAILQILAQEpidkQDLHVLDIGVSHGMQWPTLLEALSCRSeGPPPHVRITVVSDLTADipfsVGPSAYSYDSQLLGFARSLKIHLQISVI 334
                     *************99998999************************55667*********99999998889999********************96 PP

            GRAS 182 vakrledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadh.nsesFlerflealeyysalfdsleakl 275
                             +++ +++ p+E+l+++ +++lh+l     s+++er+e L+++++l+Pk v+++e++ +  +s++F++ f+++ley ++++ds+++++
  Csa19g020620.1 335 -------DKFQLIDTAPHETLIICAQFRLHHLK---YSIPEERSEALRALRRLRPKGVILCENNGEDnTSGDFAAAFSRKLEYLWKFLDSTSSGF 419
                     .......566889*******************8...88999999********************9995778********************9997 PP

            GRAS 276 preseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvk.sdgyrveeesgslvlgWkdrpLvs 369
                         +e+++ Er+l++ e+++v++++g++     e + kW er++eaGF + +++e+a + ak+llrk++ +++ r+e+ +  + l+Wk++++ +
  Csa19g020620.1 420 ----KEENSEERKLIEGEATKVLMNAGEM----DEGKDKWYERMREAGFAAEAFGEDAIDGAKSLLRKYDkNWEIRMEDGDTFAGLTWKGEAVSF 506
                     ....8888899999**************9....999**********************************666666666666666********** PP

            GRAS 370 vSaWr 374
                     +S W+
  Csa19g020620.1 507 CSLWK 511
                     ****8 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098538.374118490IPR005202Transcription factor GRAS
PfamPF035141.6E-80145511IPR005202Transcription factor GRAS
Sequence ? help Back to Top
Protein Sequence    Length: 512 aa     Download sequence    Send to blast
MSLEETEPPN QTLDHVLSWL EDSVSLSPLP GFDDSFLLHE FDGSQTWEWD QTQDPENGFI  60
QSYSQDLSAY VGYEATNLEV LTEAPFIYLD PLPELQQPND QSRKRSSEKV IEAQHVKRSE  120
RRKKKSNKSS EKSCKDGNKE ERWAEQLLNP CALTITSRNS SRVQHYLCVL SELASSSGDA  180
NRRLADFGLR ALQHHLSSSS LPSISSSSPV VAFASAEVKM FQKTLLKFYE VSPWFALPNN  240
MANSAILQIL AQEPIDKQDL HVLDIGVSHG MQWPTLLEAL SCRSEGPPPH VRITVVSDLT  300
ADIPFSVGPS AYSYDSQLLG FARSLKIHLQ ISVIDKFQLI DTAPHETLII CAQFRLHHLK  360
YSIPEERSEA LRALRRLRPK GVILCENNGE DNTSGDFAAA FSRKLEYLWK FLDSTSSGFK  420
EENSEERKLI EGEATKVLMN AGEMDEGKDK WYERMREAGF AAEAFGEDAI DGAKSLLRKY  480
DKNWEIRMED GDTFAGLTWK GEAVSFCSLW K*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5b3g_B5e-3513651179473Protein SHORT-ROOT
5b3h_B3e-3513651125419Protein SHORT-ROOT
5b3h_E3e-3513651125419Protein SHORT-ROOT
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1102123RKRSSEKVIEAQHVKRSERRKK
2102124RKRSSEKVIEAQHVKRSERRKKK
3116123KRSERRKK
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in plant development. {ECO:0000250}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapCsa19g020620.1
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAC2373280.0AC237328.1 Arabidopsis lyrata clone JGIFAFI-41L7, complete sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_010487189.10.0PREDICTED: scarecrow-like protein 29
SwissprotQ9LRW30.0SCL29_ARATH; Scarecrow-like protein 29
TrEMBLD7L2900.0D7L290_ARALL; Scarecrow transcription factor family protein
STRINGXP_010487189.10.0(Camelina sativa)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM107472834
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G13840.10.0GRAS family protein