PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gorai.005G031000.1
Common NameB456_005G031000
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family GRAS
Protein Properties Length: 1710aa    MW: 196834 Da    PI: 6.9005
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gorai.005G031000.1genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS67.23.2e-21113593374
                GRAS   3 elLlecAeavssgdlelaqalLarlselaspdg...dpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPi 90 
                         +lLl cAea++++dl+ a+ +L  +  la ++    +    +++yf+ AL +r ++        l+p ++          ++++ ++  P+
  Gorai.005G031000.1  11 RLLLFCAEAIENRDLKSADVFLHVILILADKRHywfRDDSIVVKYFAYALVRRAYG--------LHPASS----------YFTFPVDPAPY 83 
                         78999****************9999999777663344667899***********99........322222..........22233455555 PP

                GRAS  91 lkfshltaN....qaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesgskeeleetgerLakfAeelgvpfe 177
                         ++      N    + I +a+ g++r+H iDf+i + +   + l +L +  +    + +  + +p  ++  e ++  e L+k A+e++v++e
  Gorai.005G031000.1  84 YQYNSCHINgvikKVINDALMGNRRLHLIDFNIPYYGFEGSVLSTLPNFVGYCLPVCVSYILPPFLKEYVEFSRQMEFLTKDAKEVNVKLE 174
                         55555555511115678899999***********999999*******999999999*******99999999999999*************6 PP

                GRAS 178 ..fnvlvakrledleleeLrvkp...gEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealey 263
                            +v++ ++l++++  e+++k+   +E++++   ++l++l+++  +++      L  +k+++P++v++ +  ++h++++Fl+ f ++++y
  Gorai.005G031000.1 175 deLKVVYGNSLAEVDECEIDFKRrrdDEMVVIYYKFKLEKLVRDAKAMKR----ELVRLKEINPTIVIILDFYSNHSDSDFLTCFKDSFQY 261
                         337788999************99999****************88888877....8999********************************* PP

                GRAS 264 ysalfdsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveee 354
                           + +d +++    +   + k       ++  n+ a eg++ + rh tl +W++ +++aGF+ +pl+++  +   l ++  +     + ee
  Gorai.005G031000.1 262 SLKTLDYWQEL---DRYLDGKK------EWEFNIEAGEGNNIIRRHPTLPEWQHLFSSAGFSRIPLNHRKDN---LSVEDNS-SLKIMREE 339
                         ********777...22222233......333578899*****************************988664...3344444.32356699 PP

                GRAS 355 sgslvlgWkdrpLvsvSaWr 374
                         +++l+lg k+ p++++SaW+
  Gorai.005G031000.1 340 EECLILGYKGCPMFFLSAWK 359
                         9******************8 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098513.5611343IPR005202Transcription factor GRAS
PfamPF035141.1E-1811359IPR005202Transcription factor GRAS
PROSITE profilePS508089.431620674IPR003656Zinc finger, BED-type
SMARTSM006144.2E-14620670IPR003656Zinc finger, BED-type
PfamPF028925.0E-5623663IPR003656Zinc finger, BED-type
SuperFamilySSF576675.36E-5623672No hitNo description
SuperFamilySSF530982.28E-387641198IPR012337Ribonuclease H-like domain
PfamPF143721.9E-169791073IPR025525hAT-like transposase, RNase-H fold
PfamPF056992.1E-1511161197IPR008906HAT, C-terminal dimerisation domain
PROSITE profilePS5060014.46913911569IPR003653Ulp1 protease family, C-terminal catalytic domain
SuperFamilySSF540013.73E-2514121595No hitNo description
PfamPF029025.8E-1514161595IPR003653Ulp1 protease family, C-terminal catalytic domain
Gene3DG3DSA:3.30.310.1301.5E-714431564No hitNo description
SuperFamilySSF578296.04E-1316651708IPR011332Zinc-binding ribosomal protein
Gene3DG3DSA:2.20.25.304.3E-1816651708IPR011331Ribosomal protein L37ae/L37e
PfamPF019072.5E-1616661708IPR001569Ribosomal protein L37e
PROSITE patternPS01077016671686IPR018267Ribosomal protein L37e, conserved site
ProDomPD0051324.0E-1416721709IPR001569Ribosomal protein L37e
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0006412Biological Processtranslation
GO:0006508Biological Processproteolysis
GO:0005840Cellular Componentribosome
GO:0003677Molecular FunctionDNA binding
GO:0003735Molecular Functionstructural constituent of ribosome
GO:0008234Molecular Functioncysteine-type peptidase activity
GO:0046983Molecular Functionprotein dimerization activity
Sequence ? help Back to Top
Protein Sequence    Length: 1710 aa     Download sequence    Send to blast
MDSSFSDTAL RLLLFCAEAI ENRDLKSADV FLHVILILAD KRHYWFRDDS IVVKYFAYAL  60
VRRAYGLHPA SSYFTFPVDP APYYQYNSCH INGVIKKVIN DALMGNRRLH LIDFNIPYYG  120
FEGSVLSTLP NFVGYCLPVC VSYILPPFLK EYVEFSRQME FLTKDAKEVN VKLEDELKVV  180
YGNSLAEVDE CEIDFKRRRD DEMVVIYYKF KLEKLVRDAK AMKRELVRLK EINPTIVIIL  240
DFYSNHSDSD FLTCFKDSFQ YSLKTLDYWQ ELDRYLDGKK EWEFNIEAGE GNNIIRRHPT  300
LPEWQHLFSS AGFSRIPLNH RKDNLSVEDN SSLKIMREEE ECLILGYKGC PMFFLSAWKP  360
KVEDGHFNSN STNHKFEQGF NPNPLPLQPL QPFLEGLILN RLAALAEIHN ISKDLCCKYK  420
LSLALTWASK VNNMNGTISD PNKKHTFFIQ SNYCYVKDRK SYDFMFGFER MISVPFFEKA  480
FESRDGYHFE PSLTEVEDFK YFMLKDCNID VALAICLQNR HTSDEVYVVE FYWPPTESEI  540
SKSVALRIFD DLKHMKTTFV TVKVQGPEIK FQEEAISSIP TSSNTAMPLK IAEEARGIHA  600
KEINAHIEQI VETKRNKQRK LRSKVWVDFH KSEEEGKQVA KCEHCPKVLT GSSKSGTTHL  660
NNHSKVCPGK KKQNQESQLI LPVDTNERSS TFDQERSHLA LVKMVIRQQY QLDLAGQEAF  720
KNFVKGLQPM YEFQSRDKLL SGIHRIYNEE REKLQLYFDQ LACKLNLTVS LSKNNHGKTA  780
YCCLIAHFID DSWELKMKTL GLRTLEHIND TKAVGGIIQS LVSEWNIGNK VRSITVDNSF  840
LDDSMVQQIK ENCLSNLVSL SSTHWFINCT LLEDGFREMD DLLFKLKKSI EYVTETKHGR  900
LKFQEAVDQV KLQDGKSWDD LSLKLESDFG ILDSALRSRE ILCKLEQIDG NFKLNPSMEE  960
WENAAALQSC LRCFDDIKGT QSLTVSLYLP KLCDIYKKFL QLERSNPSFV TLMKRRFDHY  1020
WRLCNSALAV ASVLDPRLKF KVVEFSYIVI YGHDSKVQLN TFREVLTNVY NEYANETKNQ  1080
TTSASVLDDI NWLGNNSIWD SFSKFVTASE ASSKSELEIY LDEHLIPMDG AIFDILGWWS  1140
DKSQKFPILA KMARDFLAIP VSIFIPCSNI KATINNPAYN ILNPESMEAL VCSENWLETP  1200
KGNDGENHEP TQTMDKGKRK LDEDTCVRKK PKPSNCEKAI STEDIARDSN NNDEPAGEIS  1260
IGKLQTENSS KNGCYGETSS GNKSKASNKM MGTISLRDIH QEKSSSELNH GRNVEDVSSG  1320
DSSSDNDQSD QLQSSSSESD VEITLKEQGS WSEQDIKAYL LSEFTEKENE LIDTWQKNEL  1380
KGKMIGRDKY FKIQGEKLAP LLMMPQGDET REEYYIEDLV VNTFFELLKK RSDKFPNIYI  1440
NHYSFGSQIA TQLIEGPRTE QEVLAWVKVD ELRGAHKMFL PMSLSKHWVL FYVDTKEKKI  1500
SWLDPIASSR IRSYNVEKDI ILQWFTTLLL PKLGYVDAKE WPFLVRNDIP EQKNLVDCAV  1560
FVMKYGDCLT HGDCFPFKQE DMVHFRRGIF VDIYRGIIHK KNKQRLMHCL GTQSSRIGYW  1620
PITSIACNSY YGPSARIRDS ACFSSSFPFF CPHQASESPT FPFQGKGTGS FGKRRNKTHT  1680
LCVRCGRRGF HLQNSRCSAC AFPAARKRT*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4v3p_Ll3e-2016651708245Ribosomal protein L37
4v7e_Cj3e-201665170824560S ribosomal protein L37E
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
112171231KRKLDEDTCVRKKPK
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankJX5891383e-53JX589138.1 Gossypium hirsutum clone NBRI_GE23769 microsatellite sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_012481703.10.0PREDICTED: uncharacterized protein LOC105796516 isoform X3
TrEMBLA0A0D2R8F40.0A0A0D2R8F4_GOSRA; Uncharacterized protein
STRINGGorai.005G031000.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM3620522
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G01570.14e-24GRAS family protein
Publications ? help Back to Top
  1. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]