PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Glyma.15G119000.1.p
Common NameGLYMA_15G119000, LOC100815105
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Phaseoleae; Glycine; Soja
Family Trihelix
Protein Properties Length: 339aa    MW: 38594.4 Da    PI: 6.7441
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Glyma.15G119000.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix92.44.5e-29223307186
             trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                          rW++ ev+ Li++r+++e+++r   +k++ Wee+s++m   g++rs+k+Ckekwen+nk+yk++  + kkr  ++s+tcpyfd+l+
  Glyma.15G119000.1.p 223 RWPDVEVQSLITVRTSLEHKFRLMGSKGTIWEEISEAMNGMGYNRSAKKCKEKWENINKYYKRTIGSGKKR-RQNSKTCPYFDELD 307
                          8*****************************************************************99998.78889*******97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS500907.062216280IPR017877Myb-like domain
CDDcd122034.21E-25222286No hitNo description
PfamPF138375.7E-22222307No hitNo description
Sequence ? help Back to Top
Protein Sequence    Length: 339 aa     Download sequence    Send to blast
MDLFTGDHFP VPDHVAPFPD SGDLLFAADL LSHRHNPQKL RPIRSVPTAP IANPSPPPLS  60
HDPIPSGSGH APCHESSLSF DAEDEDEDDD NSSASTKGHG PRKKRRKMVR KLEDFAKDLV  120
VKVMEKQEQM HKQLLEIIEN NERERIKREA AWKNEEMERI RKDEEARAQE NSRNLALISF  180
IQNLLGHEIQ IPQQPAKPCS KREEDEVEAS ARKELNNDPG DNRWPDVEVQ SLITVRTSLE  240
HKFRLMGSKG TIWEEISEAM NGMGYNRSAK KCKEKWENIN KYYKRTIGSG KKRRQNSKTC  300
PYFDELDILY RKGLLSIGNA LSNTCGVPQI EVKELNET*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1102106KKRRK
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Gma.456255e-45somatic embryo
Cis-element ? help Back to Top
SourceLink
PlantRegMapGlyma.15G119000.1.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_003547291.10.0trihelix transcription factor GTL1
TrEMBLK7MAW90.0K7MAW9_SOYBN; Uncharacterized protein
STRINGGLYMA15G12591.10.0(Glycine max)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF111123338
Representative plantOGRP1130157
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G47660.13e-46Trihelix family protein