PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID RcHm_v2.0_Chr7g0191461
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Rosoideae; Rosoideae incertae sedis; Rosa
Family C2H2
Protein Properties Length: 1493aa    MW: 166184 Da    PI: 9.0831
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
RcHm_v2.0_Chr7g0191461genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H213.40.0002313981421223
                              EET..TTTEEESSHHHHHHHHHHT CS
                 zf-C2H2    2 kCp..dCgksFsrksnLkrHirtH 23  
                              +Cp   Cgk F ++ +L++H r+H
  RcHm_v2.0_Chr7g0191461 1398 VCPvkGCGKKFFSHKYLVQHRRVH 1421
                              69999*****************99 PP

2zf-C2H211.80.0007314571483123
                              EEET..TTTEEESSHHHHHHHHHH..T CS
                 zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt..H 23  
                              y+C    Cg++F+  s++ rH r+  H
  RcHm_v2.0_Chr7g0191461 1457 YVCAepGCGQTFRFVSDFSRHKRKtgH 1483
                              899999****************99666 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1493 aa     Download sequence    
MAASEQPQEV LSWLRALPVA PEYHPTWAEF QDPIAYIFKI EKEASKYGIC KIVPPVPPAP  60
KKTAIANLNK SLVARGGPSV GKGPKALPTF TTRQQQIGFC PRKARPVQRP VWQSGEHYTF  120
NQFEAKAKSF EKSYLKKRRK KGGLNPLDVE TLYWRATVDK PFSVEYANDM PGSAFVPLSS  180
KKSGASTSRE AGDGVTLGET AWNMRGVSRS RGSLLRFMKE EIPGVTCPMV YVAMMFSWFA  240
WHVEDHDLHS LNYLHLGAGK TWYGVPREAA VAFEEVVRVQ GYGGEINPLV TFATLGEKTT  300
VMSPEVFISS GIPCCRLVQN AGEFVVTFPR AYHTGFSHGF NCGEAANIAT PEWLRVANDA  360
AVRRASINYP PMVSHFQLLY DLALALCSRT PVRNSAEPRS SRLKDKKKGE GETVVKGLFV  420
KNVIQNNELL HVLGKGSSIV LLPQSSSDIS VCSKLRVGSQ LRVNPDDLMI DGKQGIKQVK  480
GLYSVKGKLA SLCESSQHPS LNANGNASTP SKMLNMSAKR ESNVEGEGLS DQRLFSCVTC  540
GILSFSCVAI IQPREAAARY LMSADCSFFN DWAVDCEPIQ VANGDPNSSK KGPCTETGLM  600
QKSTHDGLYD VPVQSADYRN QITDPINEVD SNTEMQRDTN ALGLLALTYG DSSDSEEDQA  660
EPDAPVCGDE TNLSDCSLEG RYEYKSASPP LRDSYGGTAG VRSPTSPGFD CGNELPTVDG  720
NVENRREATN FKDNGHQYID CSVDLDTLTK TNGLGGTSID PVKVSYSGSP DALDIQPTRF  780
GQATLQKEST GTSFFPGCDQ DSSRMHVFCL EHAVEVEQQL RSFGGAHILL LCHPDYPRIV  840
DEAKEIAEEL GVNYPWNDMV FRDATREDEE RIQSALDSEE AIAGNGDWAV KMGINLFYSA  900
SLSRSHLYSK QMPYNSVIYN AFGRSSPASS PAGPEVCGRR PAKQKKIVVG KWCGKVWMSN  960
QVHPFLIKRD HEEKKVELEQ RRFHDSEMPD EKLDGKSEST RKTEKTMVTK QYSRKRKMTV  1020
EGGTTKKAKC PDAVSAHSVD DNSHQQQKRF LKNKQAKYIE SGPTKKAKFT ETEDAVSGDS  1080
MEDDFRQQNR RTLRSEQAKY IEGDDDVSDD SMGVDSHQQQ RRIAKSKQAK YIARDFSMLS  1140
DDSVGVNSDH QQRRVAESNA REFSAVSDDS LEDNIHQLHR RSLRRNKGKC IGRGNLTSQN  1200
LHGVSSPQQQ RRTSKSKQAK TVEREDAALD DTPDDNAALQ LKSFRGRQIK PETVQQKKQE  1260
TPRRVKQGSR RLQETQQKTP RIQNIQSEQN TVDVNAEEPE GGPSTRLRKR PPKEQPETGR  1320
KKAKGQPETG RKKAREQQQT GRIKVNTALG VKTKNASARK GKSASAVREE EAEFLCDVEG  1380
CTMSFGTKQE LNLHKKNVCP VKGCGKKFFS HKYLVQHRRV HEDDRPLRCP WKGCKMTFKW  1440
AWARTEHIRV HTGARPYVCA EPGCGQTFRF VSDFSRHKRK TGHSVKKGKG RSR
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1136140KKRRK
2136141KKRRKK
310131018SRKRKM
413201333RKKAKGQPETGRKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G48430.10.0C2H2 family protein