PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID RcHm_v2.0_Chr6g0296771
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Rosoideae; Rosoideae incertae sedis; Rosa
Family C3H
Protein Properties Length: 2107aa    MW: 232203 Da    PI: 9.3534
Description C3H family protein
Gene Model
Gene Model ID Type Source Coding Sequence
RcHm_v2.0_Chr6g0296771genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-CCCH19.41.9e-0619812002627
                              -SGGGGTS--TTTTT-SS-SSS CS
                 zf-CCCH    6 CrffartGtCkyGdrCkFaHgp 27  
                              C++f +tGtC+ G++Ck +H++
  RcHm_v2.0_Chr6g0296771 1981 CQTFEETGTCPQGPKCKLHHPK 2002
                              *****************99985 PP

Sequence ? help Back to Top
Protein Sequence    Length: 2107 aa     Download sequence    
MVVEIDLSFL DHEKLTVYPG LRWESETAGQ DKTKLRKLRT KTKKKIRRRR RRKKTKKSRE  60
PNQPEIPDLS LSSWTSLSTF TTIQGTLLPP PPLTLISQTT LTAPPTTTTT ATLRLLLPQP  120
SPHSSAHNPP PPSNPYPLLR RHHIPKGPPS PPPHRFRSFQ NLLPRTTHRP LPLDSDRPRH  180
RLYLESGPDY RDRSRNYSPR DYDRELHKPY RRDPDGGGAR FRSEYEEQRD RRVDDGYRRR  240
SSNHETTTFR EVELQDSGSG CYDESAYEEL LKSGRRYEAY EIQRWGLERA QSEEDLYDPF  300
ELDEHDEEDR IVSDKRDYYG SESGRNSSTS RWKQIQKKSA LLRLQTPKAR DRNYSAYYDS  360
SGSGGSFRGK EQYEYEYSGH GRKDAVEEER RGGGGEERGQ QQRGGSSPLD LDVSFKSNSL  420
VAKPVRTRDE KRGSVSDMDC SESPPAKPFC ADVVNLDSLA LVSNKTSSPG KEAMKLKGQV  480
TTSGMNSQPC SVVTDDAFVK SEVETAPKGK ILHKDGKDVG SSQKSSPNVS KKTKVVKKIV  540
AKKTVKKVTN PPTSQPESKI DEPGTSTVHS LFAGTGKGET SSISDPCPND LHVLPVDKEV  600
EGSQPNVLSD EYGTVLNSCS KNIGNNSSAN LGSLSPAEIN TDQKHPLNVD TPVKGLLTIS  660
NFNSNATDSI KVASCPETGG VVDVSKSICQ SGNILLVDKV VQKESSQAML AVEGNLSSGF  720
LNSDKMHELT NVKGSEHGAE TLEKLSHQEI IVPDVGTVDA VSEQPCRYQL PTSLQNGGVV  780
EELPKAISSA ENNMAVGLSS SGETKAVRSK IGRRTRWDSD EVYVKKDKSI TVSNGGNTNS  840
FGKQSSPDCD SRSFEICATE RSPNIPKSGG DTHSVTPMIQ KKRKVKTQLD FSRISDTCVD  900
PVNVSPRKDA PDTTVSSFLK DPSHAEVSVS GVQKLDMGSQ PGNDWVSVLN GKSSVNGFSD  960
TKLSATIDAN YDTNETSPEY LKRRKVSGTH LVLTSAQTNG GPANKSTSYI QESSTHNDVP  1020
TYQADKGASS SIGSQCATSN LIPSPEEINV YLEDIMAAVS SDTVAAARDS FTNDDMKIKH  1080
QGIDSSSVSE GSGIPHMQIL CPLQSQNEDK EDGTEVMVVN NHHLDILDID GSHEKDFDVC  1140
ATNEHIMVQG ETAPCTIHSE LQPADLGINS FSTNIQSDYL CVKDKLPFVP SCLLSIANGN  1200
EVTSTNSIDE GMKSVSDTLS DTGTPETSTS ITDVHMLICN PSVIKSFDEK VCADDQKFEV  1260
KSEVASAGNL FSETKTNLTL DNATEGHQSV TEKAVPLKLQ DSKKTSHGLH LISAESALKN  1320
QLGQATHRIV PGRPFSAFTT SQKATSSTHI SKPRTWHRNA NSSVSPLPVS TLPPQRQLPQ  1380
RNGKLESNSY VRKGNTLVRR RATIAAVPQI SQGLSSSVHQ LNFSGIDGLK KNAGSDSRVD  1440
IKNPPRTGGL NASSDRPPAP LPSGVKMSAS AAVSSGIPTS SPLAEPLLSD ISGTKSDPMN  1500
CSETKDAEGS VKDSLATSDT QEYHSGPVNN LHDGNLASSN MKKVIYVKRK LNQLVASSSN  1560
PSDLSVHNAD KNQPSDGYYK RRKHQLIRSS LECNGKDTVL LPTDNINSGE QKARKVIPSR  1620
TFNKKRSLKA VARMSKKNSL VWTPSGSQSS NNNGSSYDHQ KVLPQLFPWK RARYWRTVMQ  1680
SQASNFNYSS SSTISKKLLL SRMRDTVYTR STHGFSLRKY KVLSVGGSSL KWSKSIESRS  1740
KKVNEEATRA VAEVEKKKRE HNGAACTSSG SKIRNSPGKR IFRIGSVRYK MDPSRRTLKR  1800
ISDDESSKSV VLNPETNAKR SYVPRRLVIG NDEYVRIGNG NQLIRDPKKR TRILASERVR  1860
WSLHTARQRL AKKRKYCQFF TRFGKCNKDD GKCPYIHDSS KIAVCTKYLN GLCSNPNCRL  1920
THKVIPERMP DCSFFLQGLC TNKNCSYRHV NVNPKASTCE GFLKGYCADG NECRKKHSYV  1980
CQTFEETGTC PQGPKCKLHH PKKRMKGKIK KRSREHRNGW GRYFVSKDVG VSEPVTASAK  2040
HCAQNGDDIF GNDFVSINVS DEEAGESNNP PEQTTFYDSD PSELDLDDLD ELIKPVRLLN  2100
KMKTNIY
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
14154KTKKKIRRRRRRKK
24356KTKKKIRRRRRRKK
34452KKIRRRRRR
44752RRRRRR
54753RRRRRRK
64854RRRRRRK
74957KKIRRRRRR