PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GBG67301.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 850 aa    MW: 92467.5 Da    pI: 5.9273
Description Trihelix family protein
Gene Model
Gene Model ID  Type    Source
GBG67301.1     genome  NCBI
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  34.1   6.9e-11  89     165  2          70
    trihelix   2 WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergf.erspkqCkekwenlnkrykkikegekk 70 
                 Wt ++  aLi+a+r+ ++++        r + k  +W++v k++ + g+ +r+++ C +kw+nl +++k + + +++
  GBG67301.1  89 WTVEHMIALIQAKRDQDSHFAglahttgRMETKTWKWDDVEKRLVHMGVtSRKAVDCGKKWDNLYQQFKTVHKFMGE 165
                 **************999999966555444679999*********99999799*****************99876665 PP

Sequence
Protein Sequence    Length: 850 aa
MRGVAKISVG ADDILRDEDG GVAEDCEADG GAGNDDEVDD ENMEIRPLGR KWGGSTAAKK  60
LPETRTGRRG KKGVEDALAG EGSKSRDFWT VEHMIALIQA KRDQDSHFAG LAHTTGRMET  120
KTWKWDDVEK RLVHMGVTSR KAVDCGKKWD NLYQQFKTVH KFMGESGKPN FFTLTPGERK  180
ERGFDFRMDE RVYSEMAAMT RSDHTIHPTN LADTGATGGV QMPGPHGGRN ESGGSEGGGD  240
GQNDDQGSTK DSISGGGVGG GNKEKNVRQQ TFDTIADVMK EHGSLMASTV DSASKRQCSI  300
LTRQCDILER EVEVQKEHYF KADQANLMMC EALMKIAKAI RERSGHIPKS KRQRIDDASS  360
SQTEDFLADE VAMVDAQGTT GVVRLGFGRD GVSREQLQAA RRSVVGGAAG TVPRSPNTAA  420
VVVTAARVAG QISAAAGQGQ PRQALPLQHV LATGGGHTAT AQKGDATATA SRTVAADHTA  480
KGSFVEGAER VGVDDVGCDD GRRDRKREGN DGDDRPLVPT GKGAPKEDEL QEKAKLWVDC  540
DAFWGQGRGK PLREAVGECT DYFVAVANGD AGAEPPSMLI MPPNDVPRFK IDDPAQREPA  600
LRRARSVERV VLRTIHGWIF KSQSRWTGFS RAESYITVDF ATDLARAVWQ ALEWSRVVSP  660
ALVYHTVAMK MDVPLWFAGV KIEDRLEDDD MVARQEATVL LLAECWTDAL WCGQWADGGC  720
VKQERLSRLA DCLRALLCAV MWIMRMGGDD DRSHYEAWSY ASMVAKPTMI AAGSYIFNWR  780
RHVVDSANLV LDRLGKAHLT LGDYPQCIPE WCDCGLAFGH NASLKNAAEA AKHGWIDSGP  840
AADDDGDDGK
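
The sequence above is displayed in blocks of ten residues with a running position counter at the end of each line. A minimal sketch of turning that display format back into a contiguous string follows. The helper name `join_sequence_blocks` is hypothetical and not part of PlantTFDB; only the first two display lines are used, as an excerpt.

```python
import re

def join_sequence_blocks(text: str) -> str:
    """Keep only residue letters, dropping spaces, line breaks and position counters."""
    return "".join(re.findall(r"[A-Za-z]+", text)).upper()

# First two display lines of this record, copied as shown above (excerpt only).
excerpt = """
MRGVAKISVG ADDILRDEDG GVAEDCEADG GAGNDDEVDD ENMEIRPLGR KWGGSTAAKK  60
LPETRTGRRG KKGVEDALAG EGSKSRDFWT VEHMIALIQA KRDQDSHFAG LAHTTGRMET  120
"""

seq = join_sequence_blocks(excerpt)
print(len(seq))      # 120, matching the counter on the second line
print(seq[88:98])    # WTVEHMIALI, the start of the trihelix domain at residue 89
```

Python slices are 0-based, so residue 89 of the record sits at index 88. Applying the same helper to all fifteen display lines should give a string of 850 residues, the length stated in the record.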
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    349    354  KSKRQR
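
The NLS coordinates are 1-based and inclusive, so they can be checked against the protein sequence directly. The sketch below does this with only the display line covering residues 301 to 360, copied from the Sequence section. The helper `slice_1based` and its `offset` parameter are assumptions made for illustration.

```python
def slice_1based(seq: str, start: int, end: int, offset: int = 1) -> str:
    """Return residues start..end (1-based, inclusive) from a sequence
    whose first residue is numbered `offset`."""
    return seq[start - offset : end - offset + 1]

# Residues 301-360 of GBG67301.1, taken from the Sequence section (spaces removed).
window = "LTRQCDILEREVEVQKEHYFKADQANLMMCEALMKIAKAIRERSGHIPKSKRQRIDDASS"

print(slice_1based(window, 349, 354, offset=301))  # KSKRQR
```

The result matches the Sequence column of the NLS table, which confirms that the flattened row reads as start 349, end 354.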
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G35640.1  9e-06    Trihelix family protein