PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG83731.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family bHLH
Protein Properties Length: 853aa    MW: 90167.3 Da    PI: 7.7311
Description bHLH family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG83731.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1HLH44.33.1e-14576622354
                 HHHHHHHHHHHHHHHHHHHHHHCTSCCC...TTS-STCHHHHHHHHHHHHHH CS
         HLH   3 rahnerErrRRdriNsafeeLrellPkaskapskKlsKaeiLekAveYIksL 54 
                 r+h+e+Er+RR+riN  +++Lr+llP++     +K +Ka+ L  +++Y+k+L
  GBG83731.1 576 RTHSEAERKRRERINGHLSTLRQLLPNS-----TKTDKASLLGDVINYLKDL 622
                 68*************************9.....9****************99 PP

Sequence ? help Back to Top
Protein Sequence    Length: 853 aa     Download sequence    
MVIGLSTAVT TSVTTAVTTA VTTAVTTAVT AAVEIAGTRV RTQSGTLSTE VDASASENHY  60
DHGEDEQWSE HRPDSPAAVS SYVSSIPEND KLSEDLTMAE SSPHHHQHHQ HHHQHHHHHH  120
SPLSPVRTTG SLPSLSHPCT HAGITASCNK KIRAMKEGQH NDNQPCNNGS RHLQLPQHHN  180
HRNDFHPRER QQAHHGDEGR TAARWSMASS QHLSHPQQQQ QQQQQQQRYR CTGASSGAAM  240
APRSLFPADM PTSHSRYLHF DAGDGGHDSR HRHYHPSSSS PPPPPPLPAT ASSLSPAHVH  300
PGHPLHQRQQ EQHPSQAVCE GSRPLRAAPS GVAVCQQEIG EFGRYNNGGH AHLFVSDQRR  360
GGGGGLREKG GGSCSGACHL SIEKQSDVQQ TALLGHLAPG SRGSGSSRAG GGRRGGGGGG  420
GGGGENGISS QTCNRTGHVA SDDEKIFHHD NLLRHVSLRS KGGSRSGGGG GGGGGGGGEE  480
DDLIDLEEDR KAAGSAGGSE EEREVMAMVD EGGSGNGDAR GVYDGREGGV GGVGVGGGGG  540
GGGGGGGGRA GQIRVAAHQV MPHQPLLDSK SLSALRTHSE AERKRRERIN GHLSTLRQLL  600
PNSTKTDKAS LLGDVINYLK DLKRQVSVVA SPSTNQTATV SSSICHRRSL PATDVEDNIH  660
VTTITLSPME HRSAILRATI CCEDCPDLLP NILKTFQDLR LKIIKAEIVA IDGGRLRIDL  720
FLTETTDDSC SAATKVTVAD SMSTFLVDSN SNLLYHGDSP RGSVGISSCC HLQQHEEGDG  780
ASRTPKVLSS RNPQPRIALF CPRRIKEALQ AVIKASEGRG MTNQCNITCA KRQRLCSSCG  840
TGNIAAMSTP SVT
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1410419GGGRRGGGGG
2411420GGGRRGGGGG
3582587ERKRRE
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G68810.12e-30bHLH family protein