PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_022134319.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Momordiceae; Momordica
Family C2H2
Protein Properties Length: 1544aa    MW: 171566 Da    PI: 6.6453
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_022134319.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H210.60.001714531476223
                      EET..TTTEEESSHHHHHHHHHHT CS
         zf-C2H2    2 kCp..dCgksFsrksnLkrHirtH 23  
                      +Cp   Cgk+Fs++ + + H+r+H
  XP_022134319.1 1453 QCPheGCGKRFSSHKYAMLHQRVH 1476
                      69999*****************99 PP

2zf-C2H211.10.001215121538123
                      EEET..TTTEEESSHHHHHHHHHH..T CS
         zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt..H 23  
                      ykC+   Cg sF+  s++ rH r+  H
  XP_022134319.1 1512 YKCKveGCGLSFRFVSDYSRHRRKtgH 1538
                      99*********************9777 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1544 aa     Download sequence    
MGGVEIPKWL KGLPFAPEFR PTDTEFADPI AYISKIEKEA SAFGICKIIP PFPKPSKKYV  60
ISNLNKSLSR STELSPPNVC PSSKLGSADG ANEGEVRAVF TTRHQELGQS VKKTKGVVQN  120
PQFGVHKQVW QSGEKYTLEK FESKSKVFAR SVLSGIKEPS PLVVESLFWK AASGKPIYIE  180
YANDVPGSAF GEPRGKFRYF HRRRRKRNYY HRSKERSSEL RTGEMGTLTD SLSLDSAGTS  240
PRNDLNTSSE ILKASTSTVP SEDTSHNSRG KSSDSCINME GTAGWRLSNS PWNLQVIARS  300
PGSLTRYMPD DIPGVTSPMV YIGMLFSWFA WHVEDHELHS MNFLHVGSPK TWYSIPGDHA  360
FAFEEVVRTQ AYGGSVDHLA ALTLLGEKTT LLSPETVIAS GIPCCRLIQN PGEFVVTFPR  420
AYHVGFSHGF NCGEAANFGT PQWLSVAKDA AVRRAAMNYL PMLSHQQLLY LLTMSFVSRV  480
PRSLLPGVRS SRLRDRQKEE REFMVKKGFV EDILRENNML SVLLEKESSC RAVLWNPDML  540
PYSSNSQVAT NSAVATSRKE NISCNHTESI DGNDKNMQNF MDEMTLDLDT VNDIYLESDD  600
LSCDFQVDSG TLACVACGIL GFPFMSVVQP SEKAARELSA DNLSIHKRGG VFGPKDAHDS  660
PDFGGTHPED STSVPDVNCL SKNLSVASIP KFDKGWSTFG KFLRPRSFCL QHAVDIIELL  720
KNKGGANILV ICHSDYHKIK ANAVAIAEEI GNHFVYNEVR LDIASEEDLR LIDLAVDVER  780
NECREDWTSR LGINLRHCVK VRKSSPTKQV QHALELGGLF LNRNHGFDLS PINWPSKKSR  840
SKKISRPRYY KPFQSMPLKD EVLGKRSDCK IAKREEKVFQ YYRRNKKSGN SKGVGSATQP  900
VSSGDSIDLC NMRTFRSNTS ELAIPGPIGT TNQQNAVLQD RGNTNSDPAS SMVADSICAV  960
VGRMTEPRIE NCTPEVVDVN GESCHLPVDT SGMQQKIMTT SDTSEPNEKA VLPSFTCPHV  1020
NAINESEMHK EQEIVGSCNN TNQVCDIASE GQSHALADVG LDETSSIHFE SSKVMMDNAD  1080
VRNLNCEACD GTTKDDDAEQ EIEIANRLKD VEEDSCSLIP IKQQHCVATE CDSQLGHLED  1140
RIEQEMEPTC RSNESEPILV NTGTASAATS HSRDENSEVP GVGCEAPNLC NAVTSVDLVN  1200
NCQIDADVET QSVSGVVVQS KTQQSSCLAD ERSFENLGSQ EDKEHLSDIE MRTEPRSLVN  1260
EPGSNSCILG EGRPMDVEAS GKEACDRENL TGGMTPDDAM ECANMSGNQH VDDPSPITLE  1320
THDVAEICSS KHNEQGKNTR NLKSNPSSDV EKRRKRKREE ELIIENGFSS CDFIRSPCEG  1380
LRPRVGKNLT SRTGADVVSV QEKPERERVR KLPDALSPKR KKEIRKGSFK CDLEGCRMSF  1440
ETRAELALHK RNQCPHEGCG KRFSSHKYAM LHQRVHDDDR PLKCPWKGCS MSFKWAWART  1500
EHIRVHTGER PYKCKVEGCG LSFRFVSDYS RHRRKTGHYI DQPV
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
113521358KRRKRKR
213531358RRKRKR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G04240.10.0C2H2 family protein