PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID OMP02914
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Grewioideae; Apeibeae; Corchorus
Family C2H2
Protein Properties Length: 1653 aa    MW: 186094 Da    pI: 8.5957
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
OMP02914 genome EnsemblPlants View CDS
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1 zf-C2H2 11.4 0.00097 1537 1562 1 23
                EEET..TTTEEESSHHHHHHHHHH.T CS
   zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt.H 23  
                y+C    C++sF +k +L  H r+ +
  OMP02914 1537 YQCDmeGCTMSFGSKQELAMHKRNvC 1562
                99********************9877 PP

2 zf-C2H2 13.2 0.00027 1561 1584 2 23
                EET..TTTEEESSHHHHHHHHHHT CS
   zf-C2H2    2 kCp..dCgksFsrksnLkrHirtH 23  
                +Cp   Cgk F ++ +L++H r+H
  OMP02914 1561 VCPvkGCGKKFFSHKYLVQHRRVH 1584
                69999*****************99 PP

3 zf-C2H2 11.6 0.00086 1620 1646 1 23
                EEET..TTTEEESSHHHHHHHHHH..T CS
   zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt..H 23  
                y+C    Cg++F+  s++ rH r+  H
  OMP02914 1620 YVCAeeGCGQTFRFVSDFSRHKRKtgH 1646
                89********************99666 PP

Sequence
Protein Sequence    Length: 1653 aa
MAASSLSPEP SQEVFPWLKS LPLAPEYRPT FAEFQDPIAY IFKIEKEASQ YGICKIIPPV  60
PPAPKKTAIG NLNRSLLARA AANTSSDSKP APTFTTRQQQ IGFCPRKPRP VQKPVWQSGE  120
HYTFQEFEAK AKNFERNYLK RYSKKGSLSA LEIETLYWKA TVDKPFSVEY ANDMPGSAFV  180
PLSSKKSSGG GRETGEGVSV GETPWNMRVV SRAKGSLLRF MKEEIPGVTS PMVYIAMLFS  240
WFAWHVEDHD LHSLNYLHMG AGKTWYGVPR DAAVAFEEVV RIDGYAGEFN PLVTFSTLGE  300
KTTVMSPEVF VRAGIPCCRL VQNAGEFVVT FPRAYHSGFS HGFNFGEAAN IATPEWLRVA  360
RDAAIRRASI NYPPMVSHFQ LLYDLAHELC SGVPVSINAK PKSSRLKNKT KSEGETLVKE  420
LFVQNLIQNN ELLHILGKGS SVVLLPKSSC DISLGSDLRV ASQLRINPRM PLGLSNYKEV  480
VKSSKDLASD EMLGGHEEIK GVKGFYPVKG KFASIYEGNR DSSYSGNDYL SRLSSNTLNI  540
STERENAVQG DVLSDQRLFS CVTCGIVCFS CVAVLQPTEQ ASRYLMSADC SFFNDWTVGS  600
GATRDGFTVA HVDAITSEQN LSTRWINKRD PNSVCDVPVQ SIHCKLQSTD QSNQLVEDTE  660
KGGGISALGL LASAYGNSSD SEDHNEPNAT DETNSANVSP QRNFQYTGSS PGDANGSHNL  720
SLSRVDSEEE APAHVMDCYS DPGSRRGDTK TRSPQTSDHA VEFETDNLAS RRSNGLEDKF  780
RDPVTASHAN PGYSPVTHGT ENMRFSKAIG LIENADMPFA PRSDEDSSRM HVFCLEHAAE  840
VEQQLCQIGG VHVFLLCHPE YPKIEAEAKL VAEELGIDYA WNDIIFGDAT KEDEERIRSA  900
LDSEDAIPGN GDWAVKLGIN LFYSANLSRS SLYSKQMPYN CVIYNAFGRN SPASSPTKLN  960
VYGRRSKQKK VVAGKWCGKV WMSNQVHPFL TQRDPQEQEQ ERNFHAWATS DENLDRKPEN  1020
VRKAETTKVA KKFNRKRKIR AGIASSKKVK CIETEDANSD DSLGGSSLRQ QQRFIRGKQP  1080
RLIEKEEAVS YDSQEDDSLL SQRILSRKKQ TEFVDREDAE SEDAEEEFTH QQPWRKLKGK  1140
QGKYVEEDDA VSGDSLDEGS LQQYRRVKRS WQAKCVERED AVSDDDFEET SHQMRRRIPK  1200
GRQIKSFERN DAISDDSQVD NSLKQYRRMP KGRQSKFVER DDAMSDDASE GDSQHDHHGR  1260
ISRGKQMKCM ETDDAFSDDS LEDNPQQHRG IPRSKGSKFT DREDVGSFDS LKRNSLQQHR  1320
RVCRSQLTKF IDRDDAVSSD SPDDSSLQQP RRILKSKQTK ILEREDAVSE DSLDDTSQQQ  1380
LRKTPRSRQG KFIEREDAAS YDSLEEDYQP SRTLRSRNKK APTPRQIKQE TPRNVKQGKR  1440
RTTKQVSSQQ MKQATPRNKN IKIEQSARQC NSYGEDELEG GPSTRLRKRV RKPLKESEPK  1500
PKEKKKQASK KKVKNASNVK TLSGHSTTKV RDEEAEYQCD MEGCTMSFGS KQELAMHKRN  1560
VCPVKGCGKK FFSHKYLVQH RRVHMDDRPL KCPWKGCKMT FKWAWARTEH IRVHTGARPY  1620
VCAEEGCGQT FRFVSDFSRH KRKTGHSVKK GRG
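The sequence block above is laid out in groups of ten residues, with a running position count at the end of each full line. The following is a minimal sketch, assuming that layout, of how such lines could be joined into one plain residue string in Python. The function name `join_sequence_lines` is hypothetical and not part of any PlantTFDB tooling.

```python
def join_sequence_lines(lines):
    """Join sequence lines made of space-separated residue groups,
    dropping any trailing position count, into a single string."""
    residues = []
    for line in lines:
        for token in line.split():
            # Skip the running position count (e.g. "60", "1620").
            if token.isdigit():
                continue
            residues.append(token)
    return "".join(residues)


# Last two lines of the record above (positions 1561 to 1653).
tail = [
    "VCPVKGCGKK FFSHKYLVQH RRVHMDDRPL KCPWKGCKMT FKWAWARTEH IRVHTGARPY  1620",
    "VCAEEGCGQT FRFVSDFSRH KRKTGHSVKK GRG",
]
seq_tail = join_sequence_lines(tail)
print(len(seq_tail))  # 93
```

Applied to all lines of the block, the same function should return a string whose length matches the stated 1653 aa.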
Nuclear Localization Signal
NLS
No. Start End Sequence
1 964 970 RRSKQKK
2 1485 1491 RLRKRVR
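The Start and End values in the table above read as 1-based, inclusive residue positions in the protein sequence. A short Python sketch of that convention follows, using only residues 961 to 1020 copied from the sequence in this record. The helper name `extract_region` and its parameters are hypothetical.

```python
def extract_region(fragment, fragment_start, start, end):
    """Return residues start..end (1-based, inclusive) from a fragment
    whose first residue sits at position fragment_start."""
    lo = start - fragment_start
    hi = end - fragment_start + 1
    if lo < 0 or hi > len(fragment):
        raise ValueError("region lies outside the fragment")
    return fragment[lo:hi]


# Residues 961 to 1020 of the protein sequence listed in this record.
fragment = "VYGRRSKQKKVVAGKWCGKVWMSNQVHPFLTQRDPQEQEQERNFHAWATSDENLDRKPEN"
nls1 = extract_region(fragment, 961, 964, 970)
print(nls1)  # RRSKQKK
```

The extracted string matches the first NLS entry, which is consistent with the coordinates being inclusive at both ends.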
Best hit in Arabidopsis thaliana
Hit ID E-value Description
AT3G48430.1 0.0 C2H2 family protein