PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG66236.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 1050aa    MW: 114934 Da    PI: 6.7922
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG66236.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix27.39.3e-09569656285
    trihelix   2 WtkqevlaLiearremeerlrrg.....klkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdql 85 
                 W   e+laL++a++e  ++  ++     + +  +W+ +  k+++ g++r    C  +wenl++ ++ki ++e  r + ss + p f ++
  GBG66236.1 569 WGLWETLALVRAKQEEANERANQhgglvRPARERWALIVMKLKSWGIHRDVSNCARRWENLSRCFRKIYRHEVYRLNCSS-ERPSFWRM 656
                 9999**********98888877523344699*****************************************99955554.67777776 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1050 aa     Download sequence    
MPTSILPPRT GTGGERRLTA AESSSTIPTS NPGRCCHHLH RHRHSHRHNH SHRYGADWKH  60
RSSPIFLVAS TSNKKRGIAA ITDTPRAGSS RGALPCGISA LGSRSRCGQA SRKIDCDALL  120
LQVRQVDETR RGGCWMQLPK APSSVGLNVP FDKDPNGRGW RRRWLSVLQA AAARDDERGA  180
ILGRSEGATS LRSSSGIEDR SFVQGESLQV EENPQGLDKP PAPKRQLHLD VESIDFVDVS  240
TTILFLGPIV CRDATCIRIV NILYIQMQYV PSVPAGAIAN VHLTVMVNAP QSSWPAIDMP  300
SAVHASEMKV DADHHVRGAT DGNIKLEEFP SSILRLAVLD GNECTNQWVC EETAAKALYS  360
DEEEDWSLHE SKGGVCNAEL GGSPEAACVL NSVRRREVQG NAGAAKEAVL SGSRNDCGRY  420
NDKIDCATTS AVEDQVDHGE DAVKEKPGLE RQTLILEMIM NNPGTRGRRL RRRNTCCDSD  480
CVRDFDLPCN DTIQCTKPGS EKLFDTDHQE LDATLIFSKS CPEASPEANP REAGQCVTPI  540
SWTSLDKASL GPVRRRHGLS RDVLRSTNWG LWETLALVRA KQEEANERAN QHGGLVRPAR  600
ERWALIVMKL KSWGIHRDVS NCARRWENLS RCFRKIYRHE VYRLNCSSER PSFWRMSFSE  660
RKELNMPFQM EPRVYKAMEE FFTSVDECGL KAETTLALCA GEGSPGDTTD DFVKGETEDC  720
GRIKVEVAPM DLIPSVPAAV EGQGCCSSSP NAGDEGDGPN SGTRPAVDAG SNCPSSIATA  780
ISDLTKAVLG GGEAFTSAYA MAAQRETDLV ADSQDVVLEQ MQVVREADSL VDAGHQMLLS  840
TLRGINTTLD SILSNMARTD PQGALCNVTR SVRACDKLQT IHSCQWSARA LKSVMDELIV  900
VPVHGVTETQ LVNLFYRAMP KPLRGYFFEK SKESTITYDT PSREVVAVEA QSMPVSTFWH  960
KDLDKGKKWK GHTISGQDNN DHQDSDLLYF AKFAGICFVA AATIKYGSVA LPSLAKPNLL  1020
SALIMVMVPM GVSAIALVSA SAEEKGDRQM
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.17e-15Trihelix family protein