PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GBG71207.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 2501 aa    MW: 281364 Da    pI: 5.9348
Description Trihelix family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
GBG71207.1     genome  NCBI    View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  30.4   1e-09    312    385  2          68
    trihelix   2 WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergferspkqCkekwenlnkrykkikege 68 
                 W+ +  l+L++ ++e + +l        r k k  +We+++k+m+  g  + + +C +kw+nl + ykki++ +
  GBG71207.1 312 WSTEDQLLLVRCKQEQDMHLAglahnygRMKTKDWKWEDIAKRMANAGRPKDADECMKKWDNLFQNYKKIQRFQ 385
                 77788888888888555554433332227899*************************************98755 PP
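The Start and End values for the trihelix hit (312 and 385) appear to be 1-based, inclusive positions in the protein sequence, which gives 74 residues and matches the query row of the alignment. A minimal Python sketch under that assumption follows; `extract_region` is a hypothetical helper, not part of PlantTFDB, and the fragment is residues 301-390 copied from this record's sequence listing.

```python
def extract_region(seq: str, start: int, end: int, offset: int = 1) -> str:
    """Return residues start..end (1-based, inclusive) from seq.

    offset is the protein position of seq[0]; set it when seq is only
    a fragment of the full protein.
    """
    if start < offset or end < start or end - offset >= len(seq):
        raise ValueError("region falls outside the supplied sequence")
    return seq[start - offset : end - offset + 1]


# Residues 301-390 of GBG71207.1, copied from the sequence listing.
fragment = (
    "VSDDDGKSAT" "YWSTEDQLLL" "VRCKQEQDMH" "LAGLAHNYGR" "MKTKDWKWED"
    "IAKRMANAGR" "PKDADECMKK" "WDNLFQNYKK" "IQRFQNASGG"
)

# Assumed convention: Start/End are 1-based and inclusive.
domain = extract_region(fragment, 312, 385, offset=301)
print(len(domain))
print(domain)
```

The extracted string should equal the query row of the alignment once its lowercase insert-state residues are uppercased.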

Sequence
Protein Sequence    Length: 2501 aa
MPWWTSRYPE RFRSAEDNQI ASEEEGEEEG GGGGGGGGGE EEGGGEGGGE EEEAEEEAEE  60
AEEEGGGGGG GGEGRGEEDE RRRMAYEKLG MENRRDPGGS EEMENMEEEK EEDRRCVRRK  120
VSAICMPLVV MRRLPLLVLV VVMFLIVIFL HVCLSWRAWK QHRKRRASTE RAAGERSMAG  180
MQARAAGSPL FGGGRSSMAN RTTPTQADAR DEPACRTPVR PGPTVENITH GVSNMRAHSD  240
GGDDGTDGGD DADEGLRADV EAGDDNEDIQ IRPLGKMAGR AKGRSRGVRG RSVGRGGRGG  300
VSDDDGKSAT YWSTEDQLLL VRCKQEQDMH LAGLAHNYGR MKTKDWKWED IAKRMANAGR  360
PKDADECMKK WDNLFQNYKK IQRFQNASGG ADFFRLSNEE EGEQLQVPDG QGAVQRDPWR  420
HAGEPHNFPS QRRRHRQSGP GAAAEARSGW RGVCGQGYTL AFQYVLESVA TDLARAMWHG  480
EEWSNVVTPA VCAHTLELGM DLPLWYAGAH IEDRPKDEDM AAYQEATVMR LIAAFSTTVE  540
TGQSIDGGCI THERLWRVAE SFKILLAASM WIMRMSGDDQ RSHYEAFYFS NLVPKPFLLA  600
SMHRAFDHRH HILAAANAVT DRLGKAQVTL GRRCRTWAAR DDDDNDDDDD DDDDDDDNDD  660
DDDDDDGDDD DDDDDEDDND DDDDDDHHGE KCRDDDNDDY ADWAKLWPTW LFHVPLSREC  720
GSVHHCLIAA MSVVTWHITN VRTSEAMHDL AMSYQRQLFR RSTDEFTRSF QRKVRLLNIF  780
LTLVQTDAIL MPNQSFAMFL RKVQATSWNI LKLSPELLSI GMQSMDGPGY LHRREEIIPT  840
YGAASSVSSQ TEASGIDTNG STGLGYMDNM GPEKWQVLSC LLYNAADDVN SSPKYEQISA  900
NAARFKHQCM TLLTRSQTKA MDRKVGESLL AYEERLAAFI QKANDRAAAA AKERNRLEQE  960
EEAKRQKEEQ DRLRQEEADL QAAAEHRSRH RERLFTRETV IGDEAAHWVE VTSADGAPET  1020
EKGLSALAQV SHDLVATCAL QQEEILHLQQ TVDQMLARLQ ALEKQPAAVA AAGPSTLATR  1080
VQVLEDDVSN IKRVQAAAIV TPAAASSRRP PKLDDIPVFC DQSKTEPIPW WRQFTLRLDM  1140
YHVPNNDRHP CLYHRSGGAC QAWLDNILTS HGVAVSELHT KLTWQELTDA WHKRFQVEPP  1200
DLQAMDKLNK LHQNTLLSQD WITEFQGLAS TPDLPLAFRA IKHLFIMRSC PALQNALTQI  1260
AETLDTSDKL FSTASRIILT NIEPHNAGRS FVTTNGTQPK PKVACITSTE AATPNEGEVL  1320
AAARDDEVAV GDMLGYVAKV AHEFVSQWYE ENNAPLLYVR IQIGQRGCNA LLDCGATRNF  1380
ISQSFMTRAG LGAQVRRKSH PTTVALADGK TKQLLDRYIS AVPVYFAPHA CEPVTFDVLD  1440
TDFDVILGMP WLSSADYTVN FHRRTLTIRD ATGAEVPYKF SDVFETPSGI VPDRPISHEI  1500
ILEAGAVPPK GCIYRMSEEE LTVLRAQFDD LLDKGWIRPS SSPYGAPVLF VRKKNKDLRL  1560
YIDYRRLNAQ TIKNAGPLPR IDDLLERLSG AKYFSKLDLK SGYHQISIRP QDRYKTAFKT  1620
RYGHFEWVVM PFGLTNAPAT FQATMTTEFR AMLDRFVLVY LDDILVCSRT LEEHLEHLRR  1680
VLETLRSARY KANRDKCEFV RQELEYLGHF VSPEGIRPLA DKIQAIQEWP EPKNVTDVRS  1740
FLGLAGYYQR FIKGYSKIAA NLSKLQSEER PFDFDDDARH SFQTLKAALL SAEVLHIYDP  1800
LLATRVTTDA SGYGIGVVLE QHDGTDWHPV EYFSKKVPPV HSIDDARKKE LLAFVHALKR  1860
WRHFLLGRKA FRWVTDNNPL VFYKSQDTIN STIARWMAFI DQFDFFPDHI PGKSNRFVDA  1920
LSRRPDLCTA VYSTFEIDVD LRESFIRGYQ ADPHFRDKYA NCSSPNPVPS HYRIQEGYLF  1980
VHSRARPVIC LGVGFKRRTG EMAVAVLRET LATLSSFIDQ YDLEGGRMFV IDVGGRLVAT  2040
QGGNTYRVDP DTNTVIHFKG TESPDELVRE GARYVEKRYK RQYLLENCTS SIDVKIKGNK  2100
YYIHTRPFRF NNLRLTAIIM QPRAAILGAV DRNRYMTIIA PIVMAVGGLG FSCLLISLFA  2160
GIKLSHLQLR MRLAKELVAK QKVEAENEAK TTFLSNMSHE LRTPVACIIG MLELLQADAL  2220
TEDQRSGVEQ INSCAVRLLQ ILNSILDIAK EEKGKLVLEN EKFNLWKDLE SLVDMFAVKC  2280
ESKGLEISLE LADDVPEIVR GDQARVLQIF TNLVGNAVKF TSSGYVVVRG WVHNVKRSRV  2340
DVKVANKGRH KVTHADDHAL TTGTDYQPGC LCEQFSWSEM FRCSQRHLFS AVRRLFKYFV  2400
SRLKRKVQSV MMCIHKDGPG DADLGSDHST KEHRQDHKGT AEGTGPYTVV LVFEVEDTGP  2460
GIPLEMREKI FEDFVQGDAS TTRRATHLTT GKQQDTANAM G
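The listing is printed in blocks of 10 residues with a running position counter at the end of each full line. A minimal Python sketch for turning such a listing back into a plain sequence string follows; `parse_blocked_sequence` is a hypothetical helper, and it assumes that every letter in the text is a residue and that digits are only counters.

```python
import re


def parse_blocked_sequence(text: str) -> str:
    """Join a blocked protein listing into one uppercase string.

    Whitespace and the running position counters are dropped; only
    runs of letters are kept.
    """
    return "".join(re.findall(r"[A-Za-z]+", text)).upper()


# Last two lines of the listing, copied from the record.
tail = """
SRLKRKVQSV MMCIHKDGPG DADLGSDHST KEHRQDHKGT AEGTGPYTVV LVFEVEDTGP  2460
GIPLEMREKI FEDFVQGDAS TTRRATHLTT GKQQDTANAM G
"""

residues = parse_blocked_sequence(tail)
# The line before these two ends at position 2400.
print(2400 + len(residues))
```

Run on the full listing, the same function gives a string whose length can be checked against the stated 2501 aa.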