PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cse_sc001023.1_g110.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae; Artemisiinae; Chrysanthemum
Family C2H2
Protein Properties Length: 1152aa    MW: 129177 Da    PI: 7.7639
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cse_sc001023.1_g110.1genomeKazusaView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H211.80.0007710371062123
                             EEET..TTTEEESSHHHHHHHHHH.T CS
                zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt.H 23  
                             +kC    C+ sF++k +L+ H ++ +
  Cse_sc001023.1_g110.1 1037 HKCDieGCKLSFKTKTELRLHRKNrC 1062
                             789999***************99866 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1152 aa     Download sequence    
MRDVSIPDWL KELPLAPVFY PTDTEFADPI AYISKIEKQA SAFGICKVIP PLPKPSKKYV  60
VNNLNKSLLK SPELGRNVNV GKDNRAVFTT RHQELGCDNG KGGKEVNKQV WQSGEIYTLE  120
QYELKSKVFA KSQLGVVKDV TPLDVESRFW KAASEKPIYI EYANDVPGSG FGEPVSASRF  180
MRKRRRRRRN FKRYGNDSDE RKGERESVKC SGSSDVDVLK KDTSLVDDDR EVEGTAGWKL  240
SNCPWNLQVI ARSPGSLTRF MPDDIPGVTS PMVYLGMLFS WFAWHVEDHE LHSLNFLHIG  300
APKTWYSVPG DYAFTFEEVI RSKAYGGIDR LAALTLLGEK TTLLSPETVV ASGIPCCRLV  360
QYPGEFVVTF PRAYHIGFSH GFNCGEAANF GTPKWLSVAK GAAVRRAAMN FLPMLSHQQL  420
LYLLTMSFIP RVPKSLLPGK RSSRSKDRQK EERELLVKKE FIEDILKENK LINHILQRNS  480
AYHAVLWDLE SLSPSVITES DLKNIDDIHV KNDNNLDTEI VVNIEDEDMS SDFQIDSGSL  540
PCVACGVLGY PFMSVIQPTT KMLIDEMPVT GHGVELVNAE DRSLEASTNS CEMNVEKGWN  600
ISNGFLRPQI FCLKHASKVI ELLDSMGGAK LLIICHSDFL KIKIQVSAIA EQIGNTFRYN  660
EVRLDEATQD DLDLINLAID NEQDNDSVED WTLKLNVNLR QSVKLRPKLS PDKIHNALII  720
DALFADTPTS SVDSHAMIFR WNATKSRSNR KLNSSIKKFS KTVDVEVIDN ISEAEILKKG  780
GSSIQYSRRK FKSKRQDSVA ILSKNRNEDN LLVSGDLSTP GNPDTQQEKV DSVLSQVGVD  840
IIAPPVAEDT EPHTSMEMEG GSESCATTLE KSGTENGEES DLQNEISFVA TDDNGSLEQG  900
SEDGLMQEDT ELTKDGGSSA DEPVIDSNDK VKKDHTTDND ENNHVSNSKS GSKRKREVEL  960
LQTEEKSDFD GFVKSPCEGL RPRGGKESLK GGINISTKTV PEKPTKKSKP SNVTEKHAKK  1020
TKASNVKTPE KTEHKSHKCD IEGCKLSFKT KTELRLHRKN RCPHVGCGKK FKSHRYAVIH  1080
VRVHEDHRPL KCTWKGCKMT FKWAWARTEH IRVHTGERPY KCSVEGCGLN FRFVSDFSRH  1140
RRKTGHNVHV KS
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1182186RKRRR
2182188RKRRRRR
3182189RKRRRRRR
4183188KRRRRR
5183189RKRRRRR
6184189RRRRRR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G04240.10.0C2H2 family protein