PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG69956.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 1042aa    MW: 114048 Da    PI: 6.5448
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG69956.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix36.81e-11414486267
    trihelix   2 WtkqevlaLiearremeerlrrg.......klkkplWeevskkmrergferspkqCkekwenlnkrykkikeg 67 
                 W+ +++ aLi+a+r+ + +++         k ++ +W++v++++    + r++++C +kw++l +++kk+ + 
  GBG69956.1 414 WSVEHIIALIRAKRDQDAHMQGMghayawmKPREWKWQDVAQRLNNVAVDRNAEKCGKKWDSLMQQFKKVHHF 486
                 9*************9999999632334444799************************************9875 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1042 aa     Download sequence    
MSQRSADTNI SGRTPAPNYA LTVGDRRRPR GFVAKSGRPF AALRSAAAIA APRCPVAAPR  60
GSLPRVHPWE RAATSTQKKV FDEGRQDGGT TGDPPVQWGA GRREAVRGEP VDRWQAPQRV  120
GYDALPPHLQ PLPGSSDEEE EVERRPQTVS LGSGSTQEWT ATELCGTGGD VYEQSFTELL  180
RPGLGEDEGD GHVNLSFGLS TGRSTTPSRT VLVRPHPGDE GGQLTVVDRS ARTRALVSET  240
AGANQNSSTA QPRAASPSKG AQGRPEWMQL PSHLSAASEV ARGRGVGVDG GTDFLDVGDG  300
RDGRKVWRDL WRDHRLRREE YITRGVERLH VGDRENENET DDPPAEADDD DDNDVECGEG  360
GGGHASPSLQ SDMAGKGGKS KPSGHNARPR AKKGQGKGSG GEGDGDAEEK RNFWSVEHII  420
ALIRAKRDQD AHMQGMGHAY AWMKPREWKW QDVAQRLNNV AVDRNAEKCG KKWDSLMQQF  480
KKVHHFQSPS GSADFFQLTL KERASRGFNF TMDRAVYDEI EGSTGMNHTI HPKNAADTGR  540
EDDEEGSTRG SSQTTGTPGG SGKRKSTRQQ TFEALTECME KHGELMASTM EIASKRQCSI  600
QVRQCEALEV EVEVQRKHYV ASDEEGAASH ETAQRVLAPV NRPRTPAAHV AGSSQAAVEG  660
GTLRSPVVAA RGGAVAVPGE VVEVLKEGDG AAVGEDNEAL VHRLRGQRAT THAMDAAAKF  720
WEDDNRFWND TQGSAIVRII QAARAYLVAV ARGVQPPAIR RSISLPHNSI PQHKIEDESE  780
LNAAKERALK VQTISLRAIH GWVFKSESRQ KGYHLAYQYA LNHTATDIAR AMWSAEDWRS  840
LLSPMLFRTT LDVDMKLPLW FVGVNIMDRH KDDECATYQE ACVQRLVRDF TSAVGTTEAM  900
DGGRVSYERL KRMAEAMRYL LTATMWIMLM AGDDPRSHYD AWVFVQLTAK TTLLASMNRQ  960
FDARRHITQS AQVMTDKLGR PPPTFAPPPV CIPDWASKCG VTFSHDATLA SPMEAKRLDW  1020
LGTGPPEDDD DDAESDDKGE GG
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1387395ARPRAKKGQ
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.11e-06Trihelix family protein