PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_022731221.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Helicteroideae; Durio
Family Trihelix
Protein Properties Length: 614aa    MW: 70545.5 Da    PI: 6.9985
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_022731221.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix102.14.3e-32424508186
        trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                     rW++ ev aLi++r ++e+++r+  lk+plWeevs++m++ g++rs+k+Ckekwen+nk+++k ke+ kkr s++s+tcpyfdql+
  XP_022731221.1 424 RWPRAEVEALIQVRCNLESKFREPGLKGPLWEEVSSFMASLGYQRSAKRCKEKWENINKYFRKSKENVKKR-SQQSKTCPYFDQLD 508
                     8********************************************************************98.78889********8 PP

Sequence ? help Back to Top
Protein Sequence    Length: 614 aa     Download sequence    
MHPVKYGAHH LQQQLLMDDD GSSSSVFSIS NPYHQQQQSV LHPNYPLHFQ QQQKQHQTLF  60
QQHSLPVTHQ LFQHHHQHQF QPFQEQAETV HHHHVHQQPF LAVNFKLGLN ENSRKKEVAL  120
ALNHQTNDAT FLHGNEQHVP ENRRPQQHSL LTPHCWHPQE DSPIKEPFWK PLNRCEDRQC  180
SGDEAREIEW NKYNKVLQQP DQCTSERSKN LENRYRLFGE LEAIYGLAKG GETTRAGSGS  240
ALTGENSPAN VGLSMLLTEF QGHNVGANVG FGNVATGVDH GSEASIGEEA SLRKIQKKKR  300
KKKMKEQLSS MVGFFESLVK QVMDHQEGLH RRFLEVIERM DKERSVKEES WSRQEAEKRN  360
REAIARAHEQ ALASSREAFI VSYLEKITGQ SINLPARAPL LMQRESAIEP YNESTPVKVD  420
NNSRWPRAEV EALIQVRCNL ESKFREPGLK GPLWEEVSSF MASLGYQRSA KRCKEKWENI  480
NKYFRKSKEN VKKRSQQSKT CPYFDQLDQL YSRIPITCPT SPTPLINSYI EMQQQDDSNF  540
LEAYMPKRDL GTAQVNGSGN LKVSEMNFPK LDFDGAVCEN IVQGSNRKDN ESHGNYLNNE  600
GERVDDDNES DGEE
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1292301LRKIQKKKRK
2296303QKKKRKKK
3297303KKKRKKK
4298302KKRKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76890.22e-53Trihelix family protein