PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_024995619.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; campanulids; Asterales; Asteraceae; Carduoideae; Cardueae; Carduinae; Cynara; Cynara cardunculus; Cynara cardunculus subsp. cardunculus
Family C2H2
Protein Properties Length: 479aa    MW: 51672.4 Da    PI: 9.1225
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_024995619.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H217.98.2e-0688110123
                     EEETTTTEEESSHHHHHHHHHHT CS
         zf-C2H2   1 ykCpdCgksFsrksnLkrHirtH 23 
                     y+C++C+k F r  nL+ H r H
  XP_024995619.1  88 YVCEICNKGFQRDQNLQLHRRGH 110
                     9********************88 PP

Sequence ? help Back to Top
Protein Sequence    Length: 479 aa     Download sequence    
MSNISGDGGG GSFSSGGLED DGQQRQPPMV NSHTHTHTHT TPLDTGSISQ QLLPDSSKKK  60
KRSLPGTPDP NAQVIALSPT SLMAKNKYVC EICNKGFQRD QNLQLHRRGH NLPWKLRQRT  120
STEIIKRVYI CPEPSCIHHN PARALGDLTG IKKHFSRKHG EKKWKCEKCS KRYAVQCDWK  180
AHSKICGTKE YKCDCGTIFS RRDSFITHRA FCDALAEENS KLTQTLQQNH HSTVQNINPT  240
TSITSPEFSH GGMPDSKNPS ELLPLNIMQG SRGSLFSGSP RNGSPSSLQL GGTTLSSHLT  300
SATALLQKAA QMGATASNGN NNGNNGGMNN YVTTMAPPSY GGGGGGGAYH GADQNLVDQY  360
HTHHSQISGI IGGGFSSQFQ ETSGISRFFN PSINGGGNGD GIGGYSGFMN PSKEVMIMNN  420
NNNNGGHDGN PGMSDSSNPL LRFKRDGNGD NLTVDFMGVG GMRLRSFNEQ HQQQQGMEI
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
15887KKKKRSLPGTPDPNAQVIALSPTSLMAKNK
25988KKKKRSLPGTPDPNAQVIALSPTSLMAKNK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G50700.11e-108C2H2 family protein