PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Kaladp0032s0404.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; Saxifragales; Crassulaceae; Kalanchoe
Family C3H
Protein Properties Length: 1670aa    MW: 181419 Da    PI: 4.7791
Description C3H family protein
Gene Model
Gene Model ID          Type     Source
Kaladp0032s0404.1.p    genome   Phytozome
Signature Domain
No.   Domain    Score   E-value   Start   End    HMM Start   HMM End
1     zf-CCCH   17.7    6.5e-06   1644    1667   3           27
                           S---SGGGGTS--TTTTT-SS-SSS CS
              zf-CCCH    3 telCrffartGtCkyGdrCkFaHgp 27  
                           t+ C++++  G Ck G++C+  H++
  Kaladp0032s0404.1.p 1644 TRVCDYYMS-GRCKKGASCNWLHPQ 1667
                           899**7777.*************97 PP
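
The aligned segment can be checked against the commonly cited CCCH zinc-finger spacing, C-X6-14-C-X4-5-C-X3-H. The following is a minimal sketch and not part of the PlantTFDB record: the regular expression and all variable names are assumptions chosen for illustration, and the hit string is the query row of the alignment above with the gap character removed.

```python
import re

# Query row of the zf-CCCH alignment (residues 1644-1667), gap character removed.
hit_start = 1644
hit = "TRVCDYYMS-GRCKKGASCNWLHPQ".replace("-", "")

# Assumed pattern for the commonly cited spacing C-X(6-14)-C-X(4-5)-C-X(3)-H.
ccch = re.compile(r"C(.{6,14})C(.{4,5})C(.{3})H")

m = ccch.search(hit)
if m:
    # Lengths of the three spacer segments between the Cys/Cys/Cys/His residues.
    spacers = [len(g) for g in m.groups()]
    # Convert 0-based match offsets to 1-based protein coordinates.
    motif_start = hit_start + m.start()
    motif_end = hit_start + m.end() - 1
    print(m.group(0), spacers, motif_start, motif_end)
```

Under these assumptions the hit carries a C-X7-C-X5-C-X3-H arrangement, which falls inside the spacing ranges above.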

Sequence
Protein Sequence    Length: 1670 aa
MADDDEVAAG DHSPRRFQEA QGPAPEQLTN QERRPVSEEI VVEELEAVAG SEVSEFGGSE  60
VPAELGAVCA ADGQAGAEIG GSVGKDEAAV ESNESGVAME VEPCNAGEGG VGSNCCPSEL  120
EGAEPEKYDQ AEDVKSTAFG IGTESQNEPD SEKLHLDVKS EGVVAAEDYH MNDAFEEKTE  180
ESEKNVSVEQ CEASTVCGGM EQVDDSKKTV KEEGVEENVA GISDLIGGDP TGDVGYEHKG  240
ELPVAVPEIV IGEQETTLLS TSRPIVESEQ VALAEDDLME DASTSSDEDN GDHHVTELVE  300
DIVHSELMND QESPSSREVN SVDSAAQGKV DDILSVAEQQ KHVALSVGLD HIDQSDKPKD  360
EGGDEDAEMR TEEENIENEE KVKEDLSAVP DLDFSLQNVE LDPTIGEDEI DEEFVGGEDP  420
SIADEVETQI DITDAEKGNA PRKRGRKPVK AASTKKTEED VCFICFDGGD LVVCDRRGCP  480
KVYHPSCVNR DEAFFRSKGR WNCGWHLCSI CEKNAQYMCF TCTFSLCKAC IKSTVFYSVR  540
GNKGFCDNCM KTVNLIESNE QGNNEAKFDF DDKNCWEYLF KDYWIDVKGK LQLTSEELAQ  600
AKNPLKGSEA VTSKQESADE TYNIDNNADS GSNNSSGNAV ASTPKKKKSK KRSKSVGKGE  660
DSASGSDSSG NGNKSKRKKT KKQVKSLRKR GSSGNTSEVG IDSGSKWASS ELLKFVMHMR  720
NGDASVLSQF EVQELLLEYI KTNKLRDPRR KSQIICDAML KGLFRKQRVG HFEMLKLLEL  780
HFLMREDQQN DFHGSSVDNE SSPLDETRKA DVSAKTSKDK KRKMHKKRFG RGTPTNIEDY  840
AAIDIHNVSL VFLRRNLMEI LLEDVEKFQE KVVGAFVRIR ITGINQIQDM YRLVQIVGTT  900
KAPVPYKVGK RTTDINLEIQ NLNKKEVVSM DIISNQEFTE DECNRLRQSI KCGLIPPLTV  960
GYILEKAMDL QAARVNDWLE TEMVRLSHLC DRASDMGRRK ELRENVEKLQ NLKKPEERQR  1020
RLEQIPEVHS DPKMDPSYES EEDEVEAEDR KQENFSRPMG SSFSRQGKET LSPRKGSSVS  1080
SDSWSGAKGH SVSGMNRELG RSLSVKGEDT SLTGELNRDI MSLKRSNEHE SITRDRQKGP  1140
YSSPLVGGNS YSAVASEPVS GVVSDTQTAA TVNESEKIWH YKDPSGKVQG PFSLVQLKKW  1200
NTTGFFPKDL TIWRTSENED DGILLLDALA GKFQKETKFN ENQSNKSHNV QIGHSSLPHS  1260
ARSALNGSMG NSASPSQIST VGRTSLSVDV PMANAAARGS GFDSRNDSTN LPSPTPNQTP  1320
KGSTEGQAFV DRVSSFSRSA IGSLSADSTQ LDRLTATSVA NAFRSTYNQQ PTSTGYHLQQ  1380
PDPSAASVNS GVESRNSAAA VPSIMQSVAL QQPGHSGSNN QIPFSASNTY NQWGNPTAVS  1440
TENPGGSFPN QGFLGMPVST AWRPPTLPVN QPGAQAVVLP DQSWRAVPGN PNMGWGGVLQ  1500
ANANVSWTPT GQVAPQVNPN WAVQGMGPVT GSATAGWMAA QGHATPGFVA PGNNQAGAIM  1560
NPSWAGGAVV NANSNQAWAA AGHGSMPNSA APNNNSQGSW VNGQNGSERY ANDRTGGGSS  1620
GRQWNNRRGE GSYHQRGGSG AGPTRVCDYY MSGRCKKGAS CNWLHPQRQ*
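
The sequence is printed in blocks of 10 residues with a running position counter at the end of each full line, so it has to be joined before coordinates such as the domain boundaries can be applied. The sketch below shows one way to do that; `parse_blocks` and `region` are hypothetical helper names, not part of any PlantTFDB tooling, and only the last two lines of the record (positions 1561 onward) are included to keep the example short.

```python
def parse_blocks(lines):
    """Join 10-residue blocks into one string, dropping position counters."""
    residues = []
    for line in lines:
        for token in line.split():
            if token.isdigit():  # running position counter at the end of a line
                continue
            residues.append(token.rstrip("*"))  # '*' marks the stop codon
    return "".join(residues)


def region(seq, start, end, offset=1):
    """Return residues start..end (1-based, inclusive).

    offset is the protein coordinate of seq[0], so partial sequences work too.
    """
    return seq[start - offset : end - offset + 1]


# Last two lines of the record, covering positions 1561 to the C-terminus.
tail_lines = [
    "NPSWAGGAVV NANSNQAWAA AGHGSMPNSA APNNNSQGSW VNGQNGSERY ANDRTGGGSS  1620",
    "GRQWNNRRGE GSYHQRGGSG AGPTRVCDYY MSGRCKKGAS CNWLHPQRQ*",
]
tail = parse_blocks(tail_lines)

# zf-CCCH domain coordinates taken from the Signature Domain table.
domain = region(tail, 1644, 1667, offset=1561)
print(domain)  # TRVCDYYMSGRCKKGASCNWLHPQ
```

The extracted segment matches the query row of the zf-CCCH alignment once its gap character is removed.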
Nuclear Localization Signal
NLS
No.   Start   End   Sequence
1     645     685   KKKKSKKRSKSVGKGEDSASGSDSSGNGNKSKRKKTKKQVK
2     647     687   KKKKSKKRSKSVGKGEDSASGSDSSGNGNKSKRKKTKKQVK
Best hit in Arabidopsis thaliana
Hit ID        E-value   Description
AT2G16485.1   1e-174    C3H family protein