PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID OMO94198
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Grewioideae; Apeibeae; Corchorus
Family C2H2
Protein Properties Length: 1762aa    MW: 202248 Da    PI: 8.3852
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
OMO94198genomeEnsemblPlantsView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H213.30.0002513871409123
                EEETTTTEEESSHHHHHHHHHHT CS
   zf-C2H2    1 ykCpdCgksFsrksnLkrHirtH 23  
                y C+ C+k F r  nL+ H r H
  OMO94198 1387 YICEVCHKGFQRDQNLQLHRRGH 1409
                89******************988 PP

2zf-C2H211.10.001214631485123
                EEETTTTEEESSHHHHHHHHHHT CS
   zf-C2H2    1 ykCpdCgksFsrksnLkrHirtH 23  
                +kC +C+k++  +s+ k H + +
  OMO94198 1463 WKCDKCSKRYAVQSDWKAHTKIC 1485
                58*****************9987 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1762 aa     Download sequence    
MTSGSGSEAG LPKPVDDALR ALEQRWDQRF HEQDVRQRRF ETLLETILER LDTLGIDANN  60
RYDYDRENSD FRLKVDIPNF NGCLGIEEFM DWIAEVDRFF NYLDTPAEKR VKLVACRLKG  120
GASAWWDRLQ SRRVRKGKNL VRTWYRMKFL LQQEFLPPDY EQILFQAYQN LKQGAKSVHE  180
YTADFMRGRI GVQVVRTLSE VKNLVLKAEL MQQERSRRNY DNYRDDGSDD KDKKELTDRK  240
FNNNNRQNYR EEKAVGKRVV AEGGDNKKGD NNPYAKFHGV TCYRYEDDEE DDYDEDDKQI  300
YVVRRMMLTP KSETQTQRHQ LFKTRCTING KVFRLIIDSG SYENIISKEA VRKLNLPVEK  360
HPTPYSLWWI KSESRQLDVT KCCKVPFSIG KYKDEVYCDV VDMDVCQFLL GRPWQYDLDV  420
LHSGKSNQYR FEKNGEKFLL LPMQKSDKSK EQKTFLSVAK DFGSEMKESN ELYALVVKQQ  480
VELQVEEHPD LVQPLLLEFK DIMPDDIPDG LPPMRDIQHH INLIPGARLP NMPHYRMNPK  540
ESEILKEKVE ELLKKGLIRE SMSPCAVPAL LTPKKDGSWR MCVDSSAINR ITVGYKFPIP  600
RPDDMLDRLT GSAWFSKFDL KNGYHQIRIR PGDEWKTAFK TKEGLFEWLV MSFGLSNAPS  660
TFMRLMNQVL KPFPGQFVVV YFDDILIYSN SEEEHLAHLR QVLEVLQQNK LYVNLKKCNF  720
LTRQLLFLGF VVSADGVQVD ESKILAIKDW PEPKNVSELR SFHGLATFYR SFVTLKEKLY  780
TAPVLALPDF DKVFKVDCDA SGVGIGAVLS QEKRPVAFFS EKLSDSTRKW STYDKEFYYV  840
VRALKTWEHY LVGKEFVLYS DHEALKYLNS KKRISSDMHA RWCQYLQKFP FRLQHKSGVQ  900
NKVADSLSRR VTLLTTLSSS ILGFEQLKGE YEDDEDFSEI WSKVSNKQPS AGHVGRDKTL  960
EVVKERFYWS HLRRDVCKFV EKCYTCQTSK GQSKNTGLYM PLPVPENIWE DLSMDFVLGL  1020
PRTQRGVGSI FVVVDRFLKM AHFIPCKKTS DAVAIARLFF KEVVPLHGVP KTITSDRDNK  1080
FLSHFWRTLW KIFDSSLNFS STAHPQTDGQ TEAVKRTLGN LVRCLYGEKP KQWDIALPQA  1140
EFAYNNAVHS ATDRTPFSIM YIKAPSQTLD MVKLPKGNGL NVSAKHLAEQ VVEVQQAVKQ  1200
KLEAANQKYK QANDKYRRHE TFEVGDQVRV FLRKERLPVG TYSKLKPKKY GPYTILQKIN  1260
DNAYIVDLPD NMGISKTFNV SDLSKFHDSS VPLYPNSRTS FSQVEETDAD ELGISFMDLW  1320
TIRSPFERAC NVLDFWLEAP VDFEDDEQFQ STVISQSSNQ SPNPVEHDPD AEVVALSPRT  1380
LMATNRYICE VCHKGFQRDQ NLQLHRRGHN LPWKLKQRTT TQVKKRVYVC PEPNCVHHDP  1440
SRALGDLTGI KKHFCRKHGE KKWKCDKCSK RYAVQSDWKA HTKICGTREY RCDCGTIFSR  1500
KDSFVTHRAF CDALTEENYR VNHNLGASGG ILQSQAQELF TCDTGSNANT MMNLSISNEN  1560
MDNSPRPLSL TSAGVMISSN LDSIFNPRTS PLAMGSAYTS ATALLQKAAE MGAKISDNTI  1620
APILLRGFTG YSTSGLNSSG AVQEGSSMVG SNMATHATST NNFYVGEETY EKNLEPGDPR  1680
SLNTVPPALF DSHFLHSEEN GNTANLLGEV YMGGSEKMTV DFLGVEPTGH QSISKKRSYD  1740
GNIVNLGYSN GQKSLNHLRS NW
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1132138RRVRKGK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G50700.13e-98C2H2 family protein