PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GBG69845.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 1974 aa    MW: 221762 Da    pI: 6.0426
Description Trihelix family protein
Gene Model
Gene Model ID  Type    Source
GBG69845.1     genome  NCBI
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  28.7   3.4e-09  199    275  2          70
    trihelix   2 WtkqevlaLiearremeerlrrg.......klkkplWeevskkmrergf.erspkqCkekwenlnkrykkikegekk 70 
                 W  +e l+L++ +r++++ + ++       k k  +W e++k+m+++g+ +r    C ++wen+ + y k  ++ek+
  GBG69845.1 199 WHLDESLVLVRCKRDLDDYMASQgsnfarmKTKTWKWNEIAKRMAQQGVtNRDGDSCMKRWENIFGWYIKFWDREKN 275
                 99************777666643222222279999***********999899****************999998887 PP

Sequence
Protein Sequence    Length: 1974 aa
MGPMSTLLTG GRGAGGDPLQ LHLSSTCTMD ASRTVLVEHN SRSQGGVSVT MQEAREDRLQ  60
TQWPPRSSSA CTSSHSRPPT FSTPVSNGVS HTVLNGRVTT AIAAESPRVI ERIIARYSRM  120
RTDVEREDGG DANADGEDVA EECMDGNDES EEDDEPAVKE VTSKGGKKKA SKSRGRSKAR  180
EEGGGAYGES GGGSMRQNWH LDESLVLVRC KRDLDDYMAS QGSNFARMKT KTWKWNEIAK  240
RMAQQGVTNR DGDSCMKRWE NIFGWYIKFW DREKNSGVQS FFLLTSKKCE ELEFTFAMDQ  300
QLYDAIHATT PNNHAIHPPN LLDTDGLPPQ QPNEGEADGQ ARGSAVVGET SASDNIDSAV  360
GDGYGNKSSG RPTPLPRLTS FPSRWKMNNR NEKIKFQVEL PKIKAMDKLM VFEQGTLPSV  420
DWIAEYQCLT SVPDIEMGFK DIKHYFISRS CPALGNALTH IEDTLTTPTE LFDKAAEIIV  480
TNKEAKKLHR SSATGPIRDQ HRPKVVVVVA AMPIDQTSET VSTNEGDRLA AARDGGRPDK  540
GRGRGKTKTN TTSSPGPGAT APAPWSQYGI SEQAYKARTR FRYCLWCNSD LHETMGCPPK  600
GKGKSENVAP PGEEVVARMK EEFREGGLVL GAPNDDATET RPFIVETDAG PTSLGGVLIQ  660
GDKEGKERPL RFESRTLCTT ERNYSQFKRE TLAVLHCLRI FRNYIFGRRF ILRVDPTALA  720
YSLRNYVSSD PTVVRWLTYI WIFNFELERI PGNKNRADGL SRINWDGKEG EAVENTPPVD  780
EFLDQEEDVR LHINKWSPRV PSCVRHPIRQ APSGYERKAE LVLKPFEEED PWGGKDVQWM  840
MKLALAGSHS LVEDMRAIEE GSGQVERHEE LMGGMYLLVN TLLQENLDQA DSLNPAGNED  900
EVPESQDDEF EESEIKEVFR AEEYEGIYLE LGLLLSCEVR LRDASDRAQR MMQRYLVRDG  960
HLFVRREVGN PRRVVCGRNR QIDVIAALHD GIAGGHRGIQ ATYAKISELY YWDGMMDMIG  1020
KFCRSCVPCQ ERSCLRQGEP LHPKLEREVG AIVHLDLLFT PLGDQGYNYI FDARDNLSGF  1080
VDGRAIRTKT GLVLVSCIEE YYLRYPFIRE FVIDRGSEFT CQEVQELLSR SDFTKTAMPR  1140
GGKGTRPPQR PLGASGGYER HGPRHRESTP VYDDGDIELF LDSFWDHARR MGWTVAQAIE  1200
RLGGAGRFEE LIARIRREAM TRPEVEMRMQ ELRPSPVGPD ARPIRLEIGN AADFILAFEW  1260
FMQGQGIPRD DWARTLPLLT RKAERPLARQ IRDMARDWES CRAHLREAFR RPESPRPRVE  1320
RRQRIRRQRD PEPSEARPSR GGRKALARRE EESEPEGEER GAYAKCGLGP VEFHHFTEGG  1380
LRRSPVRTQE EAPASEGPLR ELEAHLDVSR WGTSPRGEER KEVIEVGEDT PPQTPAVGFK  1440
LGDAPGSTRQ AGEGLQREEA PLPSSEKAPS PERRTESKRE IAVVRREALA LIDRHLAAHT  1500
LEHPDLEEPA AAEPRQESCQ PEKEVEAKIP KRVSHRTRER APAGETAEEK RARRSRRIEE  1560
IWQERQRLAA VGALPEQQQE RQGLEAAGTL PDQPPSAPPK APEIPEMRRD FWEQRAEGLP  1620
SPTRAGFEVA RKAEKRLDRK IKFLAKTSFN RYLLVESDLV GKKMKEQGHG LRLETMEVEI  1680
QELRALVASQ ATIIEDLRQQ RQGGADKAES SRQGEQRRPG QGLPGQPSAA EPRQDPPMGR  1740
VILEPEEARA QREAKREAFE FRAPTELATL PIIAAGPVVP LAVEEGQPPS SSEPAQGSAE  1800
GSMDGLLEAV HTMQEEASLF WPEQRIEEPL EREMGIETKG VIEGRPQKLD TPEYGPEGVG  1860
DRPGPSTQEM ETGEEPLDMP QSHELSREAN EAPSSPGSQR GKKRSRKWFD TSCFFCTKEG  1920
HRALQCPKFL KDKAEGKVTK EDGRMYDRHG RRVERSADGG KAQLYRQNQE EMSE
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    1635   1642  KRLDRKIK
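
The Start and End values in the Signature Domain and NLS tables read as 1-based, inclusive positions in the protein sequence above. The following is a minimal Python sketch of how such coordinates could be checked against the numbered sequence listing. The helper names (parse_block, slice_1based) are hypothetical and are not part of PlantTFDB; the sketch assumes the listing layout used on this page (groups of ten residues followed by a running position counter), and only the listing rows that cover each feature are copied in.

```python
def parse_block(lines):
    """Join residue groups from numbered listing rows, dropping the counters."""
    residues = []
    for line in lines:
        for token in line.split():
            if token.isalpha():  # position counters such as "240" are skipped
                residues.append(token)
    return "".join(residues)


def slice_1based(fragment, fragment_start, start, end):
    """Return residues start..end (1-based, inclusive) from a fragment
    whose first residue sits at position fragment_start."""
    lo = start - fragment_start
    hi = end - fragment_start + 1
    if lo < 0 or hi > len(fragment) or lo >= hi:
        raise ValueError("coordinates fall outside the fragment")
    return fragment[lo:hi]


# Listing rows covering positions 181-300 (copied from the sequence above).
DOMAIN_ROWS = [
    "EEGGGAYGES GGGSMRQNWH LDESLVLVRC KRDLDDYMAS QGSNFARMKT KTWKWNEIAK  240",
    "RMAQQGVTNR DGDSCMKRWE NIFGWYIKFW DREKNSGVQS FFLLTSKKCE ELEFTFAMDQ  300",
]
# Listing row covering positions 1621-1680.
NLS_ROWS = [
    "SPTRAGFEVA RKAEKRLDRK IKFLAKTSFN RYLLVESDLV GKKMKEQGHG LRLETMEVEI  1680",
]

domain_fragment = parse_block(DOMAIN_ROWS)
nls_fragment = parse_block(NLS_ROWS)

trihelix = slice_1based(domain_fragment, 181, 199, 275)
nls = slice_1based(nls_fragment, 1621, 1635, 1642)

print(len(trihelix), trihelix[:8])  # 77 WHLDESLV
print(nls)                          # KRLDRKIK
```

The extracted NLS matches the Sequence column of the NLS table, and the trihelix slice begins with the same residues as the GBG69845.1 row of the domain alignment, which supports the 1-based inclusive reading of the coordinates.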