PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Dr03885.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; Dioscoreales; Dioscoreaceae; Dioscorea
Family C2H2
Protein Properties Length: 1475aa    MW: 163927 Da    PI: 6.3547
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Dr03885.1genomeEnsemblPlantsView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H210.80.001513891407523
                 TTTEEESSHHHHHHHHHHT CS
    zf-C2H2    5 dCgksFsrksnLkrHirtH 23  
                  Cgk+Fs++ + +rH+r H
  Dr03885.1 1389 GCGKMFSSHALAMRHQRAH 1407
                 7****************98 PP

2zf-C2H211.20.001214431469123
                 EEET..TTTEEESSHHHHHHHHHH..T CS
    zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt..H 23  
                 ykC+   Cg +F+  s++ rH r+  H
  Dr03885.1 1443 YKCKvaGCGLTFRFVSDFSRHRRKtgH 1469
                 9**********************9777 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1475 aa     Download sequence    
MVSSDPPTIT IPGWLETLPH APVYRPTISD FSDPISFISR IEHEASRFGI CKVIPPLPRA  60
SKSTTLQHLH SSLRIPFSTR CQELGSKKPS KTKQVWQSGD SYTIEQFEAK SKAFAKSQLH  120
GLKEVTPLLV ESLFWNAVAE KPITVEYAND VPGSAFAPRR KKRKREEGTP RSLSDSPWNL  180
QGVARSPGSL TRFMPDDIPG VTSPMVYIGM LFSWFAWHVE DHELHSLNFL HMGAPKTWYA  240
VPGEYAASLE EVVRVKGYGG NVDRLAAFMM LGEKTTLLLP EVLVEAGIPC CRLVQYPGEF  300
VVTFPRAYHV GFSHGFTCGE AANFATPQWL KVAKEAAVRR TAMNHLPMLS HQQLLYMLTM  360
SFISRKIVHL IPAQVFLKNL NLKLVVPRPV LSGVRSSRLR DREKERELLI KEAFLNDMMD  420
ENRKLHSLLE KESIPTVVLW EPELLPSASN VSQLNSSSTI HDTQLSVIDA EQGSDVNCKF  480
KTMCDTVSXN RQLKEILCEE TMGLTSIATH DISSGMTVKD VNTCCRSSNE ETVGKMDDDD  540
DEEEEEEEAE EEEVLPFGLN VDSGTLACVA CGVLGFPFMA IVQPELFSLS NDESYQKSEK  600
SGWLKPCLPS YVQRTXKLGS DTKDSCLESE EQPNQESGPD SSSQLCGHIP AAECQGNACG  660
SKLLKNVSQZ STSCVTKLVK VTCGRFGDXD SEYIKARIFC LQHAIEIADL FRCKGGARIL  720
IICHSDYLKI KALAIAVAEE IGMQFNFKDF PIEDASATDL DLINFSIDGG EEHEDWTSKL  780
GINLRYCIKV RKQSSSNQEP LPLSLSKLFA DSPHLSVVST LKWLSRKSRT PYKVVGKSYS  840
KTHIVKDTVN DEALKVCQNH KRRPSFITAK HHGQHSKGQL EESHGXRGTK SVDDGNDHSR  900
TSGFLLCKDN LELFCTDSLV TVPVLAAERL HMHQDSWPTQ KTNPLCIACD SSPSDKRDVH  960
VNSAVKGCER QQVLNSNEEA SQTLLCDSVN SGFCQSSSDL SMFENAEAHQ SGLVADAVPT  1020
GCKTGDFEES ESEINIVVQE VESLEVLKEI IVAEDSYPLK FGRTQAPANK PTPEXETIAH  1080
EKLEGDDLVH NTGLESSXLK FALKVDQSIP QLVSHNNSVP LENLEVSASE LVNAELDVCS  1140
KLNNNESDID DSQPQQNHIL GSKESSLQCN GEDQXRQLGC EYVIDKPELQ QNIKASADXE  1200
INGSMASEAE TSEIPLDASX VDCINLLQDI SVKLPHSVGR AKGLVMPQLS NGXIDVNSVP  1260
EDVSVQNGVI RNNVRPIIVY KRAERPKKKQ KSEAEPMSNV HLSSNEFIRS PCEGLRPRTG  1320
RRSLDETADV GAAEKGEGYK TKKRDRPAGQ SIXQKAEGTY ICDIDGCLIS FRTKRELDRH  1380
RDNRCIFKGC GKMFSSHALA MRHQRAHEDE RPLKCPWKGC GMSFKWAWAR TEHVRLHTGE  1440
RPYKCKVAGC GLTFRFVSDF SRHRRKTGHS GNSKT
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1159165RRKKRKR
2159166RRKKRKRE
3160165RKKRKR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G04240.11e-155C2H2 family protein