PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID PON91568.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Cannabaceae; Trema
Family C3H
Protein Properties Length: 2222aa    MW: 247318 Da    PI: 7.9768
Description C3H family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PON91568.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-CCCH18.43.8e-0620922113627
                  -SGGGGTS--TTTTT-SS-SSS CS
     zf-CCCH    6 CrffartGtCkyGdrCkFaHgp 27  
                  C++f +tGtC+ G++Ck +H++
  PON91568.1 2092 CSTFEATGTCPQGSKCKLHHPK 2113
                  *****************99985 PP

Sequence ? help Back to Top
Protein Sequence    Length: 2222 aa     Download sequence    
MDLPNLLHHH HHQPRYVPLP NSSRPHHPDN PNLHNNPRFQ PQYHHHNHHV PQPQPTLPPP  60
PPKTPPFQPL PPPPPPPHSS SSSSSSAHHP PPPIRPYNPG QSQFAFGGNP NLNRPPFEDD  120
HLRSSHPHRD FVLSGNVSHR VPLDDGRPRH RLPQFEPEIR PEIWDQPRVL PEHQSERLQR  180
PLDFDCGSQK FQFDHKPVSP YKVLRHDSAG SSRFRGEYVD GFESKHRDEL LRGGDDENYS  240
RRGSFLSSPG TSLRDTGFAS NPSLSVKDLE LETDSYNGQY SSAYDNEVFR SSREDGVYDN  300
QRWLRDKKFP RDAHKSSFER GSTEISNGDD ENYSRRGSFL SSPGTSFRDM GFVSNPSLKV  360
RDLESETDSY NGRCGSTYDT EVFRGGRRDG LYDNQRRVRD RKFPRDAHNS SFERGSTEIT  420
NGDGAHMGSV KREYYGPDLG RYSSRGSRED NNEYNRTPRK QLQKKSALLR IQMVKPNHRN  480
AESEHLHYPG YHDNSNSSFF RRKDQYVYEE EREGSPIELD VSFKSNSLVA KAIMAPSSSF  540
DVTDADLTLR DMKIWEDSLS DKDCSNAELA KLSDSTANVD SSMYVGKKTS NHDDDTIPSE  600
ENGVKNMCDT TSQASVSVTN QSVEKSEVKR SPRDADLDKD CAKVAPDKVS LKIPKKKKVV  660
KKVIKRVLRP RSHKLSSQQK KKHGRLLTAD ASIPTPSESC ATDKVVADLS CCCPNEVDML  720
HERKKVEGSP VAIDSEEHGI DINSSLKCVD DIEKDGNSLN NCSGSSSRKD EGPENEECSV  780
QGLPPILNCN KDLVKSLNVS TLPDIAHVAY IEQSCQNGDS LLLDSGLKKE SLPFTLPPVG  840
KVDSDLLNLG KTKMHDNLMN TYRSDNCTLD IVNGTNLSKE ESRVCDVGTF DALSKLQCSI  900
QVTSLQEKDL KEEVSKAHLS AESSAPDGLS SVRETTMDVD SSHHDRTTFL GYNNECNFEE  960
DNVISSRCTI DRITKPPSTD GATEFAGNFA TISFPNNKIL FGGNEGDTPL TGRKRTFRNQ  1020
LDFLRTNDID LDAVDVSNPA TAVDTTLSLS FEDPNLLLHG NGSSDGFPDA NLIARDDVNS  1080
GFLVTPSPCK KRRKVVASHL AIPSPTISET DEKAADRSTS CAEAPLTSDG VLTELKTDCD  1140
RPTADTFCAA TNLMHSESGI TVLYENKLSE GSSVTVGAVK SFFNDENVKS EHQLDCCPNH  1200
EKSIFPTVES LCSKNDQQVE AFNIGSGEEM LGLPSSREQV IAQDETPQCM VPSDIQPLDI  1260
DRRFSFAGME SDNHLVKDGF PNLSNYLSLP DDHSGVSTTT SNDEAMELVP DKLTVMGSPQ  1320
SSFNVSNVHI SDAISVSQLS SETCREGGKL VEKSVDEAGS DVSAQKSFLQ CTDANVKSDC  1380
ATESDQAIEG KTVSLPSQDS RSVSHGLNIN SAECTGQKNQ LGHAIPRTFP SRSSFRFTTL  1440
KKKSTSIHAN PRTWHRNSAS SASPLPGSKP SSRTVPSQSQ LPERDENIHS TSYVRKGNSL  1500
VRKPSPAASS LPQGSPGFSP SVYRLKSPVL DESKRSGEPD YRVDSGNSPG LLRMGETKSS  1560
CDKPRTPLIR SGTKLPNCVT ISSGDCTSFP SAGPLLNGCC ETTSDPISSL TNNDTTKFVE  1620
DSLTSENQNG QPKSLDNQTE LNDGKLAYLN TKRIVYVKRK SNQLVATSNS TDLAAPNAHK  1680
MQASSFDGYY KRRKNQLIRT SLESHTRRPI MLDENLNPGV KMALKAISSI RLSKRRSQKA  1740
VAKTFKRSAN SLVWTLCSSQ SSENDSGSVT HQKVYPYLFP WKRTAYWRST LQNWNLISKC  1800
NSLAISGKLL LTRKRDTVYT RSINGFSLRK SKVLSVGGSS LKWSKSIDRH SKKANEEATL  1860
AVAAVERKKR EQKGATHISS GSKDRNHSSR ERIFRIGLFR YKMDPSRRTL QRISDDESSF  1920
SATNPDKDAK RSYIPRRLVI GHKEYIRIGN GNQLIRDPKK RTRLLASEKV RWSLHTARLR  1980
LARKRKYCQF FTRFGKCNKD NGRCPYIHDP SKIAVCTKFL NGLCSNANCK LTHKVIPERM  2040
PDCSYFLQGL CTNQSCPYRH VNVNPKATTC EGFLRGYCAD GNECQKKHSY VCSTFEATGT  2100
CPQGSKCKLH HPKNRIQDKK RKRSREVRNA RGRYFGSPDL RVFESKKPLF EKNFAQDYND  2160
IISDGNLSDF IAIDVSDDDF GEYNDAASEQ TTFCDSDTSD LQQEDLDELI KPIRLMDASM  2220
TS
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1655682KKKKVVKKVIKRVLRPRSHKLSSQQKKK
210901094KKRRK
321192124KKRKRS