PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG59572.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 2395aa    MW: 264546 Da    PI: 6.4963
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG59572.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix40.95.4e-13362433266
    trihelix   2 WtkqevlaLiearr.......emeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkike 66 
                 W+ +++ aLi+a+r        m++++ r k ++ +W++v+++++  g+ r++ +C +kw+nl +++kk+ +
  GBG59572.1 362 WSVEHIIALIRAKRdqdvhmeGMGHAYARMKPREWKWQDVAQRLKNMGVDREADKCGKKWDNLMQQFKKVHH 433
                 9*************66665555666666889**************************************976 PP

Sequence ? help Back to Top
Protein Sequence    Length: 2395 aa     Download sequence    
MSIPDANGGY DSGSSTTAKG EQQSPTNRVV YLKQGLMLHN ACASDRRTVL ENKSNGMKTS  60
ERSKFATNGK MACWVHQCPT IMRLDCFHSV RPTQFPFVAD RAHGHKWAAT ELGETGGGLC  120
EQSFTDLLRS GLSGDEGDGS VNLSFGLSTG RSSTPSRTLL VNPHPDNDGG QLTVVDRPSK  180
TRALAWEATG ANPNPSMQAA RAPSLSRGAS GRPQWVQLPS PLSAASSVAR RRDDGVDTET  240
AFFDVGDDRD GREVWAEQRR DVRAGREESI RWGVEWLPVA DQEKDTDTTH AEGDDNNDND  300
NDGEGGKGES GYVSPFRHKD MAGRGGKTKA SCRNARPWGK KAQGKGSNGD GDGGAEEKRN  360
FWSVEHIIAL IRAKRDQDVH MEGMGHAYAR MKPREWKWQD VAQRLKNMGV DREADKCGKK  420
WDNLMQQFKK VHHFQTPSGG QDFFQLNGKK RANRGFNFNM DRAVYDEIEG STGKNHTIYP  480
KNVADTGASS GVRMPTAASP DPESVVDGDD GVGRDDDEEG SMRGSSQTTG SPDGFGKRTS  540
TRQPTFEAMT ECMEKHGALM ASRMESASSN APSRMGCVLT LLSPSPRVPF SSSTLLVVLS  600
PSRHIVAFSP STLLAILSPS RCVPFSPSTL LAVLLPSRHT LAFSSSTLLA ILSASRRVPL  660
SSSTLVPVLS PSRRLPFSSS SCLQDEWVGK EPRNNEDDDF EEEEESLKRK PRGKTAGVVK  720
INEGGKETPG QQHGGGGSAG AEADVILVKR DVACREIAGC PKDGAVAQDT AQRPQTPVNR  780
RNPPPANVVG SSQVPSQGNA VRSSAYEARG GGVEAAQDAG EGVRAGDGGP IGEDDEALVH  840
RLRAPRESSH AMQGAAKLWE DDMCFWNETQ GNAIVKIIKE ARTHLVGVAR GVQPPAVWSS  900
IALPHNTIPQ HKIEDESELN AAKERAVKVQ TIALRVIHGW IFKSESRQRG RGGCLSAGGR  960
GTTAYGARKE PRRRLKASRS MAWKKLILVG PMEVNVWTTG SNGLTEMCGV IPEINDAKKK  1020
LTFDRANITE FLIDYENLAT LLKWTEEEKM EHLGQHVSLS LGRDIMAIVA SSGSWKESRN  1080
EMMRKYLKAE KMATEAELAA VKRKNYATYN DFLMTFTLKA LRIPRVTDRI MSKYFLRQFS  1140
EFDNDKIMSA YQQTSKFEHK RDMDFNTVTD LAEKTVVTET LALLKEGEVI DLTGKTGDKV  1200
KKGIESLHEW VHGVDSKMDR MENALLVMQA QVSRPALPPQ EAVVPAAVAN RGFGRRDPAN  1260
EQCKYCTMIG HFVRACPRLN HDIERQRSSR SLKGEILGPR GERVNWNSPG GMRRAVILLN  1320
NLEIVAVEAE PIADIVWDQP RGRGPQANFI LEGNDQDRVN ITTRRAGAEN KLIQDTVMEE  1380
LVGTSTGQQE TEAAEQEKVY GKPREDEPVD KTTTAKKKFR YQISILTSPE IDGTLSKLLG  1440
TMVSVSFQTM LQASPRLLKG LRQLLTRKHV EVEEAPELQE QDTEEAEAPQ GVPNLKSIPG  1500
GLGELEKAFA DIRLSLPDHE GGEVMRAPLG TKLSFHALPV GKLKVQIGTH HTNALVDGGA  1560
EITLIRWDFA TVTGCTVNKE VVGSIRGAGG KIRFTGYVTK CAVRAGIRES IWSFQRMTVM  1620
EEMDYDVILG RPWCANVEMI GMHLHDDTYM VDIKDPVTGR GELLRLLGTG GDPPKGKLAT  1680
WFPTFEARKG AFARMEGMRE RVEIMIEEAF SKKEWIKMGL PIKKRRPEDE SLGVMVAEKE  1740
QEVELGASLP KPKEGRNETP ELALEIPDLL HKVGVDPTTL AKFEDEVRKG YCLNGKIVSI  1800
QPQGSVRKYK PVGKKTKLVS ILVETSKEEA MEKEEEILKV IRERRATEGD RIPNEVADTM  1860
KIGVEGFLTA EETQLIRKAC QEFHLAFAFN DHQKRRLDAK LVPPVRIHTV QHECWNDKGP  1920
AYEFGIAAEV TDLLRVKIDS FVAKPTASPY ANKWFVFRKP NKTLRWIQNL QKLNAVTIRD  1980
AGSLPQIDLL AESHAGRSIY SLVDLYSGYD QLPLDVRDRP YTAMHTPVGQ LQMQVTSMGF  2040
TNAVAEAQRR MLAVAGDIFS KKCEPYIDDN PVKGARYKDE TEVEPGVKRI AGLRNRADGL  2100
SRVCITLEGV EDAEPIDAFL EYEGGTLVVD NEMADPAITT GQLPIQTLGK GTPAVVAELR  2160
EGPVTTVRRK EGKDSWGAEV GAREELMPMT VEGGRDAVMM LAETWAQKEC QYLVNLTREE  2220
QGTDKNEQEF VLIQMLDDDP HLSTEELITA RRQQVARNEE ALEEVVNRVT DNRMRDKARW  2280
DQAGRGTIVK ASWRVKLQEK KLYVGFDHNL VVNRGGRKQG TRGPSPLKEG EPIELPSDGD  2340
HSSKEEGNPE EGQRPGESEP VEFIDLTNDE EEDEVTTPTR EDRGREEDPG EEKDP
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
117231753KKRRPEDESLGVMVAEKEQEVELGASLPKPK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.19e-08Trihelix family protein