PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Gh_Sca004802G04
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties Length: 1776 aa    MW: 204888 Da    pI: 6.4531
Description GRAS family protein
Gene Model
Gene Model ID    Type    Source
Gh_Sca004802G04  genome  NAU-NBI
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    GRAS    165.6  4.2e-51  15     370  3          374
             GRAS   3 elLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkfshl 96 
                      e L++cA+a+++g+l+ a+++L ++ ++a+ + d + +l+ yf+eAL +r ++         pp  t+++ ++ +  ++ +++  + i    + 
  Gh_Sca004802G04  15 EILVSCAHAIEDGNLKTADSFLHQIWNTAAVELDLISKLVRYFAEALVRRAYGLH-------PPYYTHSNLQIPHPLYYYYYYSRFDI----NE 97 
                      689**************************************************32.......44444444455555555555554443....56 PP

             GRAS  97 taNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesgskeeleetgerLakfAeelgvpfefnvl...vakrle 187
                      +  +aI  a+ g++  H iDf i +      L+++L +R+++p s+RiT+v +   +++   +e  e L++  + l+++++ + l   +a++l 
  Gh_Sca004802G04  98 MVGEAIESATTGKKGFHLIDFHIPHLYGRGYLFKTLPNRSSDPLSVRITVVLPTFLKNTVDFQEEMEYLTEAGKLLKIELKKEDLrvvYANSLG 191
                      6789********************9999999**********************7777999********************86555444999*** PP

             GRAS 188 dleleeLrvkp...gEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpre 278
                      +++ ++L++++   +Eal+V   ++ h+ll+e  ++++     L  +++++P++v++ eq a+ n+++F +r+  +++yys +f  +++ + + 
  Gh_Sca004802G04 192 EVDESTLDLRRtndDEALVVYYNFKFHTLLAEAEAMKK----ELIKLRQINPEIVIMQEQYANDNDGNFIKRLEYSFRYYSNFFQYYSNLFKSG 281
                      *******9999999****************99999999....78889*******************************************9999 PP

             GRAS 279 seerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveeesgslvlgWkdrpLvsvSa 372
                      +  + +  +    r+i n+vaceg++r++rh++l +Wr+ l +aGF ++p+++++    + l + + ++  +++ee+g+lvl  kd p+++vS+
  Gh_Sca004802G04 282 KPLDYNTAKY-YMRQIHNIVACEGRDRIMRHQSLDEWRDLLLTAGFLQIPFQKDV----ENLHALYWVE--EIKEEKGCLVLSHKDCPILFVSC 368
                      9999998888.679************************************98765....5666666666..89********************* PP

             GRAS 373 Wr 374
                      Wr
  Gh_Sca004802G04 369 WR 370
                      *8 PP

Protein Features
Database         Entry ID            E-value   Start  End   InterPro ID  Description
PROSITE profile  PS50985             29.299    1      356   IPR005202    Transcription factor GRAS
Pfam             PF03514             1.5E-48   15     370   IPR005202    Transcription factor GRAS
SMART            SM00614             6.2E-12   698    748   IPR003656    Zinc finger, BED-type
PROSITE profile  PS50808             10.491    698    753   IPR003656    Zinc finger, BED-type
Pfam             PF02892             2.2E-8    701    745   IPR003656    Zinc finger, BED-type
SuperFamily      SSF57667            2.86E-7   701    751   No hit       No description
SuperFamily      SSF53098            4.55E-38  851    1291  IPR012337    Ribonuclease H-like domain
Pfam             PF14372             7.7E-19   1064   1158  IPR025525    hAT-like transposase, RNase-H fold
Pfam             PF05699             9.4E-13   1209   1289  IPR008906    HAT, C-terminal dimerisation domain
PROSITE profile  PS50600             14.719    1451   1628  IPR003653    Ulp1 protease family, C-terminal catalytic domain
SuperFamily      SSF54001            1.88E-28  1472   1642  No hit       No description
Pfam             PF02902             4.6E-14   1474   1641  IPR003653    Ulp1 protease family, C-terminal catalytic domain
Gene3D           G3DSA:3.30.310.130  3.7E-11   1502   1623  No hit       No description
Pfam             PF01535             0.58      1644   1661  IPR002885    Pentatricopeptide repeat
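The Pfam rows above place six different domain types along the 1776 aa protein. A minimal sketch, assuming the start and end coordinates as read from those rows, orders the hits by position and checks that they stay within the protein and do not overlap. The `Feature` type and the `domain_architecture` function are hypothetical helpers for illustration and are not part of PlantTFDB.

```python
from typing import List, NamedTuple


class Feature(NamedTuple):
    """One Pfam hit; coordinates are 1-based and inclusive (assumed)."""
    pfam_id: str
    start: int
    end: int
    description: str


PROTEIN_LENGTH = 1776  # from the Protein Properties line of this record

# Pfam rows of the Protein Features table, deliberately left unsorted.
PFAM_HITS = [
    Feature("PF01535", 1644, 1661, "Pentatricopeptide repeat"),
    Feature("PF03514", 15, 370, "Transcription factor GRAS"),
    Feature("PF02902", 1474, 1641, "Ulp1 protease family, C-terminal catalytic domain"),
    Feature("PF02892", 701, 745, "Zinc finger, BED-type"),
    Feature("PF05699", 1209, 1289, "HAT, C-terminal dimerisation domain"),
    Feature("PF14372", 1064, 1158, "hAT-like transposase, RNase-H fold"),
]


def domain_architecture(hits: List[Feature], length: int) -> List[Feature]:
    """Return hits ordered N- to C-terminal, rejecting bad or overlapping spans."""
    ordered = sorted(hits, key=lambda f: f.start)
    for f in ordered:
        if not (1 <= f.start <= f.end <= length):
            raise ValueError(f"{f.pfam_id} lies outside 1..{length}")
    for prev, nxt in zip(ordered, ordered[1:]):
        if nxt.start <= prev.end:
            raise ValueError(f"{prev.pfam_id} overlaps {nxt.pfam_id}")
    return ordered


if __name__ == "__main__":
    for f in domain_architecture(PFAM_HITS, PROTEIN_LENGTH):
        print(f"{f.start:>5}-{f.end:<5} {f.pfam_id}  {f.description}")
```

Sorting by start position makes the layout easier to read: the GRAS domain sits at the N-terminus, followed by the BED zinc finger, the two hAT-related domains, the Ulp1 protease domain, and the pentatricopeptide repeat.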
Gene Ontology
GO Term     GO Category         GO Description
GO:0006508  Biological Process  proteolysis
GO:0003677  Molecular Function  DNA binding
GO:0008234  Molecular Function  cysteine-type peptidase activity
GO:0046983  Molecular Function  protein dimerization activity
Sequence
Protein Sequence    Length: 1776 aa
MASSSSFSSA DAALEILVSC AHAIEDGNLK TADSFLHQIW NTAAVELDLI SKLVRYFAEA  60
LVRRAYGLHP PYYTHSNLQI PHPLYYYYYY SRFDINEMVG EAIESATTGK KGFHLIDFHI  120
PHLYGRGYLF KTLPNRSSDP LSVRITVVLP TFLKNTVDFQ EEMEYLTEAG KLLKIELKKE  180
DLRVVYANSL GEVDESTLDL RRTNDDEALV VYYNFKFHTL LAEAEAMKKE LIKLRQINPE  240
IVIMQEQYAN DNDGNFIKRL EYSFRYYSNF FQYYSNLFKS GKPLDYNTAK YYMRQIHNIV  300
ACEGRDRIMR HQSLDEWRDL LLTAGFLQIP FQKDVENLHA LYWVEEIKEE KGCLVLSHKD  360
CPILFVSCWR PRAGEEHFKF NLNSNKFGQG FNPRPFQPFP EGFILNRLAT FAEIYDMLED  420
VCFRYELPVA LTWACEATTD KIMLDGKKHT LFMERTSCYA SNEGSQCFME ACAKHHIQEG  480
QAIAGKAFQS SANFHFEPSI TKLMKSDYPL FNAAQLFGSH AVVAICLQNH YIIGDVYVVE  540
FYWPEIESEK SESLALDIFN DLKNMKKKFV TIRVGGNEVG FEREAISTTL QGTMHMRNAQ  600
PASSTNDLLS SNTTWSLNTV QPCDVHEMER HGLVEQVESA PFSTPNPMSH GGVLQTQGPH  660
KQEIGEKDFI SQTVSIGDYE IVKASMETCK VPRTKRRKYS SKVWLDFDKF EVNGKQVAKC  720
KHCNKDFTGS SKSGTTHLKN HLERCQSKKI KNQERQLITS EIGDLITRDS DESNFTFDQE  780
RSRLDFAKMI IKHQCPLDMA EQEFFKIFVK NLQPMFEFQS KDILLSDIHR IYKEETEKLQ  840
LYFDHLACNF NLTISLCKNN HGKTAYCCLI AHFIDDNWEP RMKIIACKPL EHIYDTKALN  900
EIIQSSVLEW NISKKVFSIT MDNPYLNDDM FQKIKETCFS DQGSFPSTHW FIGCTFIEDG  960
FREMDLILLK LRKSIEYVSE IAEGKLKFEE VVNQVKLQGG KSWDDLSLRL DSDFGVLHSA  1020
LESREIFCQL EKIDGNFKLN PSVEEWEMVL AFHSCLKCFD DIEGTQSLTA NLYFPKLCNI  1080
CKKFLHLEKS NYPIVTLMKR KFDYYWSLCN SAFAVATILD PRLKFKFVEF SYTEIYGHDS  1140
KMHLNRFHKV LTDVYYEYAN EARNLSKSTS DLDDSNYSTT EIVNDCILES FSKFASANNF  1200
NEVASWKSEL DCYLDEPLLP LDGAFDLLYW WCINNKRFPT LAKMARDFLA MPIPILAPCL  1260
NFNAMITNPT YNNLNTESME ALVCSQNWLK IPKENDGENH GPMQNMYKRK RKMEDHSNVV  1320
KVSKNWNREE ANSSGDIAKG SIKNENIEAS VCNQNRLEIS TGKPNHGRNI TALIEIPEDD  1380
SPFSNNKSGQ FQSLSSESDN ETTLKEEGSW CKEDVRAYLV SRFTGKENKR LNRWQTNELI  1440
GKLIGRDKEF FLMGDKLAPL LMVPHGDETR KEYYIDDSVV NTFFKLLKKR SDRFPKAYIN  1500
HYSFDSQIAT SLIQGSRSEH EVLAWFKAEK LRGAHKLFLP LCLSAHWVLF YVNTKEKKIS  1560
WLDSNPSTRI MSNNVEKQTI LQWFTTFLLP EFGYYDANEW PFLVRTDIPV QKNWVDCGVF  1620
VMKYGDCLTH GDFFPFTQND MVQTSLVHMY CRCGRIDKAE VLGGVLYKDL AVWISKVNGY  1680
AIHGMGHQMQ ITEPCSLHHV VFMSILLACS NSRLVEDGLK YFKSMKDDFG IKPGIEHYTC  1740
LVDLLGRVGH FNLALKIENH PRDAYASTSS SLGSIA
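
The sequence above is displayed in groups of ten residues, with a running residue count at the end of each full line. As a minimal sketch, assuming exactly that layout, the block can be collapsed back into a plain residue string while the running counts are checked along the way. The function name `parse_numbered_sequence` is hypothetical, and only the first two lines of the record are used here for brevity.

```python
def parse_numbered_sequence(block: str) -> str:
    """Join a grouped, count-annotated sequence block into one residue string.

    A trailing all-digit token on a line is treated as the running residue
    count and is checked against the number of residues read so far.
    """
    groups = []
    total = 0
    for line in block.strip().splitlines():
        tokens = line.split()
        expected = None
        if tokens and tokens[-1].isdigit():
            expected = int(tokens.pop())  # running count, not sequence
        for token in tokens:
            groups.append(token)
            total += len(token)
        if expected is not None and total != expected:
            raise ValueError(f"running count {expected} but {total} residues read")
    return "".join(groups)


# First two lines of the protein sequence block in this record.
EXAMPLE_BLOCK = """
MASSSSFSSA DAALEILVSC AHAIEDGNLK TADSFLHQIW NTAAVELDLI SKLVRYFAEA  60
LVRRAYGLHP PYYTHSNLQI PHPLYYYYYY SRFDINEMVG EAIESATTGK KGFHLIDFHI  120
"""

if __name__ == "__main__":
    sequence = parse_numbered_sequence(EXAMPLE_BLOCK)
    print(len(sequence), sequence[:10])
```

Applied to the full block, the returned string should agree with the stated length of 1776 aa. The final line of the block carries no running count, so it is accepted without a check.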
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
5b3g_A  8e-12   20           373        26         382      Protein SCARECROW
5b3h_A  8e-12   20           373        25         381      Protein SCARECROW
5b3h_D  8e-12   20           373        25         381      Protein SCARECROW
Annotation -- Protein
Source  Hit ID              E-value  Description
Refseq  XP_016714039.1      0.0      PREDICTED: uncharacterized protein LOC107927484 isoform X1
TrEMBL  A0A1U8LLH7          0.0      A0A1U8LLH7_GOSHI; uncharacterized protein LOC107927484 isoform X1
STRING  Gorai.005G014600.1  0.0      (Gossypium raimondii)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MalvidsOGEM8809420
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G01570.1  7e-39    GRAS family protein