PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GBG83973.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 943 aa    MW: 103217 Da    pI: 6.7581
Description Trihelix family protein
Gene Model
Gene Model ID    Type      Source
GBG83973.1       genome    NCBI
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  29.5   1.9e-09  195    271  2          71
    trihelix   2 WtkqevlaLiearr.......emeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkr 71 
                 W+ +e ++L++ +r        ++++++r + k+ +W++++k+m+  g  +    C +kw+nl + ykki++ ++  
  GBG83973.1 195 WSPEEQMLLVRCKReqemhlvGLGHNYERMRTKEWKWDDIAKRMANAGRPKDVDDCLKKWDNLFQNYKKIQRFQNAS 271
                 999***********55444444555556789***************************************9988764 PP

Sequence
Protein Sequence    Length: 943 aa
MYTHLESWRT SLPPPDEEPE TEELPTLPLA SRSTQSLSQT VRAGGSASNE GGEFTSLLHQ  60
GLGDDDNGGL DLRFGLCSGG AREASRTFII NADPSPRGLQ HAGRPTVENI TRGVSNMRAQ  120
SDGGNNDGDG GDHADEGLRE DVEAGDDDGD IPIRPLGKTG GRGKGCSRGV VRGLSVGRGG  180
RGGVSDDGGK SATYWSPEEQ MLLVRCKREQ EMHLVGLGHN YERMRTKEWK WDDIAKRMAN  240
AGRPKDVDDC LKKWDNLFQN YKKIQRFQNA SSQTDFFRLS NEERKEHNFK FQMQRVLCPG  300
EEQLVRSLLV VRPVVTAARR GRSSARDSDN NAGSGAGGGK RKNARQQVLE SLADVMDRHG  360
KLMSATIDSS SKRQCSIFTR QYDILEQEVA FQKAHYAASD ETQRMMCHAL MEIVAAIRGR  420
SPIMSTRGSA RGKKRDDVPD ESQGQGRGRR HVPKAKTARL DDASASVPPR RAQGWAASAE  480
GDYDNDSTRE EEQAEVTASA VQESARQRSP DQTASKRMLT PPPEAQQLRA RDTRTEKVVV  540
VDLGGDDDEP LEKRRLRTLA QGTAAQATTA HVAVDERRLA VRGKPAGAAG AGASGVVAPV  600
GTAREEAAVV AMAREEARGE NKIDRDGGKP GSSWTTGEGR RLYNIVHEMR EYFVAIASGL  660
PVPVVLRSVV LPKSSTKEAR ITDPSQLQQA ISRAAAVENI ALCILHGCVF KSGNRPRGYN  720
LAFQYSLESV ATAIACAMWY GEEWCNVVSG VVCAHTINLS MDLPPWFADT NIEDKPEDND  780
MAAYQESTII CIVHAFRAAV QMGAHIDDDF ISYDRLCRVV DCFRLLFAAC MWIMRMAGDD  840
PCSHYEAFYF ANLVAKPTLV ASMHRSFDHR RSVIRAAKII TERLGKANAT FGEYPDYIPG  900
WAPCGIGFRR NMSITGLEDA KKLDWLGSGA PADDNNDDGK DDA
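
The listing above is blocked into groups of ten residues with a running residue count at the end of each row, so it cannot be sliced directly by coordinate. The following is a minimal sketch, not part of PlantTFDB, of how the rows might be joined into one string and how 1-based inclusive coordinates such as the 195 to 271 span shown in the trihelix alignment could be extracted. The function names `parse_blocked_sequence` and `subsequence` are hypothetical, and the example uses only two rows copied from the listing (residues 181 to 300) with a manual offset.

```python
import re


def parse_blocked_sequence(text: str) -> str:
    """Join a blocked protein listing into one contiguous sequence.

    Assumes the layout used above: space-separated groups of residues,
    optionally followed by a running residue count on each row.
    """
    # Keep letters only; this drops spaces, newlines, and the row counters.
    return re.sub(r"[^A-Za-z]", "", text).upper()


def subsequence(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]


# Two rows copied from the listing above, covering residues 181-300.
rows = """
RGGVSDDGGK SATYWSPEEQ MLLVRCKREQ EMHLVGLGHN YERMRTKEWK WDDIAKRMAN  240
AGRPKDVDDC LKKWDNLFQN YKKIQRFQNA SSQTDFFRLS NEERKEHNFK FQMQRVLCPG  300
"""
fragment = parse_blocked_sequence(rows)

# Assumption for this example: 180 residues precede the fragment.
offset = 180
domain = subsequence(fragment, 195 - offset, 271 - offset)
print(len(domain), domain)
```

With the full 943-residue listing as input, the offset would be 0 and the same call would take the coordinates unchanged.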
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT2G35640.1    6e-06      Trihelix family protein