PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GBG82004.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 2266 aa    MW: 249291 Da    pI: 5.4439
Description Trihelix family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
GBG82004.1     genome  NCBI    View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End   HMM Start  HMM End
1    trihelix  33.3   1.2e-10  1197   1269  2          67
    trihelix    2 WtkqevlaLiearremeerlrrg.......klkkplWeevskkmrergferspkqCkekwenlnkrykkikeg 67  
                  W+ ++  aLi+a+r+ + +l+++       ++++ +W +v +++ + g+ r + +C +kw+nl +++kki + 
  GBG82004.1 1197 WSVDHMVALIRAKRDQDAQLQAAghafarmRSREWKWLDVRERLLKVGVDRPADKCGKKWDNLMQQFKKIHTF 1269
                  99************9999999753333333699************************************9865 PP

Sequence
Protein Sequence    Length: 2266 aa
MAAAEEMERK FAANFLEGED YPRDGDLTEL ARARAELAAL QNKFENMGFV SGHVTYMEQI  60
GPKRVILSKT PSISAPMLPP PVRALPPPPV EAPVRSNESV AIFELTKTLI SLIQESKKEQ  120
REHNARLEAM MRNMRPIAVR APPMPQGLAP APALGGVANN MNVCYACHQP RHLARDCPHR  180
PPQQRTGAAP MVAAPIANGS GRVGALMEEV EGGGSALTTS EEAAELIALD QYVSLGYPGL  240
GMIGTIRPAP EPKEVADWRP SLGEMESGGP TFVTGEIDVL NIIRTLDHRI PVPIGHLLSI  300
SEQANERMLQ HCKANPKRFA LARTPNAKAK SPAPEENPAS TPDPIRRAMN MTFQNFVNKT  360
RLTQGMINFC VIVYMDDILV YSETYHGHAQ HIEWTLGALR DAGFKIALEK SEFFLSKISF  420
LGYVVTRGGL RPDSRKVAAV KEAPVPTSLT QVRAFLGLVS YYRRFIKGFA AIARPLTNLL  480
RKDQPLSWDA ECQQAFATLK DALATTPILI RPDPSKQFIL ITDWQPDAIS AILAQKGNNG  540
REHVIEYASR TVPDERRNDS APQGECYAVV WGIQHFHPYL YGHKFRLVMD HEPLLALKKL  600
TNYTGTIGRW AVRLQEYDFD IVHRKTERHG NADRLTRLHR PAKVPKGEEI TPWKEPDLTN  660
EPKYGQVAVL PREEAEEESE GEFRREAGEA AARLAEVEEE ESEAEEGSAE AKEEEEEEAE  720
EETSEEEEEA YNEYSEGKQS EEEDEGDEEV ESGDDQEPTH DKLEWVQGPD RREENLEAAA  780
QRKKEIEEGK WLAIDPAPND PFKDPEPPKP EHGDLAATTS GAATTRHRSR SRSTSPSASG  840
RPPIRQRVCI LGRLPSRIRR KRGVRKSVAM DGQRPNLATL SDTYPSHRPA ICTGVDKRAA  900
GPSTYEGLPP HMQPLPDSDE GEADVDVSNT VPLGSGSTQE WTGSQLFDGR GAKYRQSFTS  960
LLHEGAQEEE RLPPVDLTFG LRSGSPSSAT RTVLMNPHPD DDAGQVTLVG RGTRGGSTRQ  1020
ETSEKTRERP RVQATSGPSI PRTTAGRPEW MNPPPAFSRG RDSVPRPVER GVSAEDFPFE  1080
GARGDGRRVW KESRQVLRRQ QEESITQGVQ RLRVGERAGQ ADAVGGGGDV WTDGDEVECD  1140
AEEDDSGDVF PLRASNMGGR GSRGRASTNA GRRRKRPSAA SGDAEGEGDG EGGRNYWSVD  1200
HMVALIRAKR DQDAQLQAAG HAFARMRSRE WKWLDVRERL LKVGVDRPAD KCGKKWDNLM  1260
QQFKKIHTFM GMSGKEDFFQ LTSQQRAEKG FSFNMDRTVY EEIKGAKERS HTISPSNVAD  1320
TGGAGGVQLP SAQSATPESV GGGDAARDGN DEDDSSARGA SQTTGSPAGF GKRKNVRQQT  1380
FEALSECMEK HGALMASTME SASMRQCEAM ESASNRQCSI QIWQCEAIEA EVEVQKQHCA  1440
ASNEVSKLMC QALMEIAKAI RERRWADDIF AFGSSSRTAF VAAIAQSQCH LHTTTTFLFA  1500
QCVSRRHLSR QTSSSRWDSC SRTVRLAAIY LDRRHLHVGA PVRALFVSPP SISPDVIFTL  1560
GLLFAHCSSR RHLSRQTSYP RWDSCSRTVC VAAIYLARRH LHVGTSVRAV CVSPPSLSPD  1620
AVFARRRKLS STSPAQAFPR RHRSRPSFFS SSQSVFSTMT NTRNSGKGKR ERDSQKTEAA  1680
GVVKRGRHQA KKLKASSGGA LVIRRSAREE WVEEASSGQD DDFESEADTF HGRKGTLRDV  1740
SNARMVSGEE DVDGGKDGGH GVARGKDEAA GEVGGGVATR EERGQPTTPA MRGAVPSKDV  1800
QAAQDVGKHP PMRAPSNPAT PTAGQARRDE GAGVNAAVRG QLVTRIPAKE AAAVGAVVGT  1860
STAGARGGGG DGRDDADDDD PLINRQRRSG NMAALEAKAK LWVDDLRFWN ETEGHGMFKI  1920
IQETRLHLIA IAKGVKPPEI RKSVVLPNAT IPQQKLEDSS ELGAAKERAS RVQTIALRII  1980
HGWIFKSPNR ARGYHCSYGY VLNHVATDLA RAIWFGEDWR VCVSPAVVYN TLELHMKLPI  2040
WYVGGVIHDR HEDDEMASYQ ESTTQRLVGA FTNAVDTGEG VDGGRISYER LRNVADCMRL  2100
LLSAAMWIMR MAGDNLRSHY EASHLVELIA KPTLIASLHR SFDARRHVLQ CVNAVTEKMG  2160
KPPMTLVDPP VYIPDWKMGK PPAAAADDDD DVDDEDVHDD DDDRDDDDDG DDNDDDVDDV  2220
DDDADDRDDD GDDDDGDDDD GDDDDSDDDD GDDDNDDDDD DDDVHD
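
The domain coordinates in the Signature Domain table (Start 1197, End 1269) can be checked against the numbered sequence block. The following is a minimal sketch, not part of the database record: the helper names `parse_sequence_block` and `slice_domain` and the `first_pos` parameter are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive. It uses only the three sequence lines covering residues 1141 to 1320.

```python
import re

# Three lines copied from the protein sequence above (residues 1141-1320).
BLOCK = """\
AEEDDSGDVF PLRASNMGGR GSRGRASTNA GRRRKRPSAA SGDAEGEGDG EGGRNYWSVD  1200
HMVALIRAKR DQDAQLQAAG HAFARMRSRE WKWLDVRERL LKVGVDRPAD KCGKKWDNLM  1260
QQFKKIHTFM GMSGKEDFFQ LTSQQRAEKG FSFNMDRTVY EEIKGAKERS HTISPSNVAD  1320
"""


def parse_sequence_block(text: str) -> str:
    """Join a numbered, space-grouped protein block into one string.

    Everything that is not a letter (group spaces, trailing position
    counters) is discarded.
    """
    return "".join(re.sub(r"[^A-Za-z]", "", line) for line in text.splitlines()).upper()


def slice_domain(seq: str, start: int, end: int, first_pos: int = 1) -> str:
    """Return residues start..end (1-based, inclusive).

    first_pos is the sequence position of seq[0]; it is 1 for a full-length
    sequence and larger when only an excerpt was parsed.
    """
    return seq[start - first_pos : end - first_pos + 1]


excerpt = parse_sequence_block(BLOCK)
domain = slice_domain(excerpt, 1197, 1269, first_pos=1141)
print(len(domain), domain)
```

Under these assumptions the slice is 73 residues long and begins with WSVDHMVALI, which agrees with the GBG82004.1 row of the alignment once its lowercase insert residues are uppercased.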
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G35640.1  1e-07    Trihelix family protein