PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cotton_A_19599_BGI-A2_v1.0
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties Length: 624aa    MW: 69444.8 Da    PI: 5.0291
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cotton_A_19599_BGI-A2_v1.0genomeBGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS445.43.8e-1362566233373
                        GRAS   3 elLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfs 85 
                                 + L+ecA  +s+g+ e+a+a++++l++ +s +gdp qR+aay++e+Laar+a s++ lykal+++e +   ss++laa+++++
  Cotton_A_19599_BGI-A2_v1.0 256 QMLIECAAILSEGHIEKASAIINELRQKVSIQGDPPQRIAAYMVEGLAARMAASGKYLYKALRCKEPP---SSDRLAAMQILF 335
                                 79*****************************************************************9...9*********** PP

                        GRAS  86 evsPilkfshltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetgerLa 166
                                 ev+P++kf++++aN aI ea++ge+rvHiiDfdisqG Q+++L+q++a+ p++pp+lR+Tgv++pes+   +  le +g rL+
  Cotton_A_19599_BGI-A2_v1.0 336 EVCPCFKFGFMAANGAIIEAFKGEKRVHIIDFDISQGSQYITLIQTIAKLPGKPPHLRLTGVDDPESVqrLNGGLEIVGLRLE 418
                                 ******************************************************************99777889********* PP

                        GRAS 167 kfAeelgvpfefnvlvakrledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhn 249
                                 k+Ae lgvpfef++ v +r++ + +++L++kpgEal+Vn+++qlh+++desvs+ ++rd++L++vks++Pk+v+vveq++++n
  Cotton_A_19599_BGI-A2_v1.0 419 KLAEILGVPFEFRA-VPSRTSLVAPSMLDCKPGEALIVNFAFQLHHMPDESVSTINQRDQLLRMVKSMNPKLVTVVEQDVNTN 500
                                 **************.7******************************************************************* PP

                        GRAS 250 sesFlerflealeyysalfdsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsek 332
                                 +++F+ rf+ea +yysa+fdsl+a+lpres++r++vEr++l+r+ivn++aceg+er+er e ++kWr+r+ +aGFk+ p+s++
  Cotton_A_19599_BGI-A2_v1.0 501 TSPFFPRFIEAYSYYSAVFDSLDATLPRESQDRMNVERQCLARDIVNIIACEGEERIERYEVAGKWRARMIMAGFKSCPMSSN 583
                                 *********************************************************************************** PP

                        GRAS 333 aakqaklllrkvksdgyrveeesgslvlgWkdrpLvsvSaW 373
                                 + ++++ l++++ ++ y+++e+ g+l +gW+d++L+++SaW
  Cotton_A_19599_BGI-A2_v1.0 584 VIDTIQKLIKEYCDR-YKLKEDVGALHFGWEDKSLIVASAW 623
                                 *************66.************************* PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098565.59228605IPR005202Transcription factor GRAS
PfamPF035141.3E-133256623IPR005202Transcription factor GRAS
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
Sequence ? help Back to Top
Protein Sequence    Length: 624 aa     Download sequence    Send to blast
MATFLRSFWS ENAETQTLSP YSQVIVQPDL TKKGSQLPGF RNLNTAVTWS ISYGPLEPFY  60
NSTVQRVGTM SLVRSAEPAT ASCRNTKLYS IQDSSDSTGM AIRMFGSDKH KSVYVMDSYS  120
SESYEKYFLD SPTDELIHSS SSGISGSSVR LQDVSSCQIR DYSEIQSPDT LDSDSDKMKL  180
KLQELERALL ADNDVDGDDD MFGTGLSMEV DGEWSDPIRM GSHHDSPKES SSSGSYLDCV  240
SGDKEVSHVS SQTPKQMLIE CAAILSEGHI EKASAIINEL RQKVSIQGDP PQRIAAYMVE  300
GLAARMAASG KYLYKALRCK EPPSSDRLAA MQILFEVCPC FKFGFMAANG AIIEAFKGEK  360
RVHIIDFDIS QGSQYITLIQ TIAKLPGKPP HLRLTGVDDP ESVQRLNGGL EIVGLRLEKL  420
AEILGVPFEF RAVPSRTSLV APSMLDCKPG EALIVNFAFQ LHHMPDESVS TINQRDQLLR  480
MVKSMNPKLV TVVEQDVNTN TSPFFPRFIE AYSYYSAVFD SLDATLPRES QDRMNVERQC  540
LARDIVNIIA CEGEERIERY EVAGKWRARM IMAGFKSCPM SSNVIDTIQK LIKEYCDRYK  600
LKEDVGALHF GWEDKSLIVA SAWS
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5b3g_A1e-6426062425379Protein SCARECROW
5b3h_A1e-6426062424378Protein SCARECROW
5b3h_D1e-6426062424378Protein SCARECROW
Search in ModeBase
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in plant development. {ECO:0000250}.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_016742756.10.0PREDICTED: scarecrow-like protein 1 isoform X1
RefseqXP_017633559.10.0PREDICTED: scarecrow-like protein 1 isoform X1
SwissprotQ9SDQ30.0SCL1_ARATH; Scarecrow-like protein 1
TrEMBLA0A2P5Y4F20.0A0A2P5Y4F2_GOSBA; Uncharacterized protein
STRINGGorai.005G083700.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM69532744
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G21450.10.0SCARECROW-like 1
Publications ? help Back to Top
  1. Ding Y, et al.
    Four distinct types of dehydration stress memory genes in Arabidopsis thaliana.
    BMC Plant Biol., 2013. 13: p. 229
    [PMID:24377444]