PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG70604.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 1916aa    MW: 214483 Da    PI: 5.6898
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG70604.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix33.69.8e-1111861260269
    trihelix    2 WtkqevlaLiearr.......emeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegek 69  
                  W+   + aL++a+r       +m++++ r k ++ +We+v ++++  g++r+ + C +kw+nl +++kk+ + ++
  GBG70604.1 1186 WSVGDTTALVKAKRdqdlyiaSMGTSFARMKTREWKWEDVRARLQTMGVTREVVDCGKKWDNLMQQFKKVHKFQN 1260
                  899999********99999999***********************************************987665 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1916 aa     Download sequence    
MANVAFRSGK VHALGDVVVL DVNTYDVLFG LPALVALRAN LDFERRSVIL RNTGGKPYVI  60
PMRLTLRTSV KRIREFEANG WIEPATGPWS FPVVLVPKKN GSVRICIDYR KLNDITIKDV  120
YPLPRIDDLL DVIGCANYFS KFDIRHGFHH ILMKEEDRPK TAFVLFEGTW QWVRCPMGIC  180
NAPATFQRAM NVTFQNFVNK TRLTQGMINF CVIVYMDDIL VYSDTFRGHA QHIEWTLGAL  240
RDAGFKIALE KSEFFLLEIS FFGDVVTRGG LRPDSRKVAA VKEAPVPTSL AQVRAFLGLA  300
SYYRRFIKGF TAIARPLKNL LRKEQPLHWD DECDQAFGAL KDALVTAPIL IRPDPSRQFI  360
LITDWQPEAI SGILAQKGND GKEHIIEYAS WTVPDERRND SAPQGECYAV VWGIQHFHPY  420
LYGQKFLLVT DHEPLLALKK LTNYTGMIGR WAVWLQEYEF DIVHRKTERH RNADGLTRLH  480
RPVHPGCRRL LTQELTVPIA QLADDLDVNI VSQVDPRLVP HVTSRTLSPY LQWSACVEGF  540
PSRIPPSRLD YLDPRDIVDP AFYRPPYMDE LEEIIREELA EESSKEEEES LNEDEGEPAQ  600
QQEGDEEELL QTESEEEAEE EDNEQGSGDD NDKEHADEDP QLEIAPVADL PISNDPTLDP  660
EPPQPDDGHV AQMAGPSARR PPSPPRCRRR SSSPSASLSL ATHEFGLKIA TRTVPWQDIC  720
DGITLEGRVA IQEEDAQMMA TVFSWRSDHL FSSAPPPDVA KQARMKQIDV RIWDQLFELH  780
VPQYVPDEIH RLILDILTEY QGAISVTDTD IGLSLVRRTG VEGDLLGFLF GSVRPNHRQP  840
ITQELTVPLA QLADDLPLEI VSQSDNSPVL HVLACTLTPY LRWSACLEAP GSSRNPPSQR  900
GYLDPHEIVD LAFFQDRTAS ENEEVEIEAE EESSEEEEEA EEEADEEETP KEGSYSEHSE  960
GEQSEEEKEE EEEEQDGEEE EEEDQEELEE SEWEGFEEEV RDEAWAQAQA QKREEIPAGK  1020
RQLEFASVAG HEMTILLNRF VKVVVIGMSR RRSSEEESVR VNVRSGCACH LRREADPALL  1080
LDASGRRICV REGLHQRGTE TITRGVQRLH VDEGDEAAVE EAQGCDDVDG DDDCNSDDLP  1140
DIRPLGRKVT NGGASVKKGP ATKTRRSKKM DDDTGRSDGE GGRNFWSVGD TTALVKAKRD  1200
QDLYIASMGT SFARMKTREW KWEDVRARLQ TMGVTREVVD CGKKWDNLMQ QFKKVHKFQN  1260
LSGGKDYFKL ASKARRSEGF NFVTDRSVYD EIEAMTKGDH TIHQKNLADT RAAGEVQMPA  1320
GAGAGGDTMA SEGGGEAADE GQGSTKDSTF SAGSGGGYRK RKNIQQQTFE AVVEVMDKHG  1380
ALMASTMDSA SKRQCSMMLR QCEILESEVE VQRKHYAAAD EANRMIGVEG SAGASRKVGV  1440
ATDEGRVHEG GLESCCHTYH CPRRRQAKNR GRGLSVCSML LMRLRLLQHK GRPLAASPAL  1500
KVGAHRAQRR HPCGTPYAFS RRRTAQRSRG RPFLRSPARL GLLQCRRRSS LWILSLSFSP  1560
RILLATDPYR PFGGPASFTR EDLLLEAEQC PQGSSPSRVR IDVELVCVAR AGPSLFVAAD  1620
LSRHAGVRRR LCRLLLAANQ SRRCRPSTSV HVDALLPRAA KALVRRRCEG RCSHRWPLTK  1680
AGNMFARGGA RGKKRENITA DTQGRERGRR HVPKAKRLRS EEASASLPLR RGRSWAAANE  1740
EEDNDVFTTE EEAAEDNVFA PRGSNLQRSS DQSCARRLLT PPPEAQQVHA HNTPKAKEVV  1800
VDVGGEDDEP LESRRQRNVI QGATATTVRI RGATEERPPQ GGLPSRRPSP VRATRLQRET  1860
PWNAGEGRGP ARSARGEWGA DRWCSHGVFG QCCCRGEGER GAPCCGAEGD ARLQQG
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
115221528RRTAQRS
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.13e-06Trihelix family protein