PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID RcHm_v2.0_Chr4g0445361
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Rosoideae; Rosoideae incertae sedis; Rosa
Family C2H2
Protein Properties Length: 470 aa    MW: 51771.7 Da    pI: 8.6524
Description C2H2 family protein
Gene Model
Gene Model ID           Type    Source  Coding Sequence
RcHm_v2.0_Chr4g0445361  genome  GDR     View CDS
Signature Domain
No.  Domain   Score  E-value  Start  End  HMM Start  HMM End
1    zf-C2H2  18.5   5.6e-06  71     93   1          23
                            EEETTTTEEESSHHHHHHHHHHT CS
                 zf-C2H2  1 ykCpdCgksFsrksnLkrHirtH 23
                            y+C++C++ F r  nL+ H r+H
  RcHm_v2.0_Chr4g0445361 71 YVCEICNQGFQRDQNLQMHRRRH 93
                            9*********************9 PP

2    zf-C2H2  15.3   5.5e-05  149    171  1          23
                             EEETTTTEEESSHHHHHHHHHHT CS
                 zf-C2H2   1 ykCpdCgksFsrksnLkrHirtH 23 
                             ++C +C+k +  +s++k H++t+
  RcHm_v2.0_Chr4g0445361 149 WVCDKCSKGYAVQSDYKAHLKTC 171
                             58*******************98 PP

3    zf-C2H2  10.9   0.0015   179    197  5          23
                             TTTEEESSHHHHHHHHHHT CS
                 zf-C2H2   5 dCgksFsrksnLkrHirtH 23 
                             dCg++Fsr +++  H+ t+
  RcHm_v2.0_Chr4g0445361 179 DCGRVFSRVESFIEHQDTC 197
                             9***************987 PP
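
The alignment blocks above follow the usual HMMER layout: a consensus-structure row (CS), the model consensus, a match row, the query sequence, and a posterior-probability row (PP). In the match row a letter marks a column where the query residue equals the consensus residue and "+" marks a conservative substitution. As a minimal sketch, assuming a gap-free alignment such as domain 1, the letters in the match row can be recomputed from the two sequence rows. The function name identical_positions is a hypothetical helper and not part of PlantTFDB or HMMER.

```python
# Consensus and query rows copied from the domain 1 alignment above.
HMM_CONSENSUS = "ykCpdCgksFsrksnLkrHirtH"  # zf-C2H2 model, HMM positions 1-23
QUERY = "YVCEICNQGFQRDQNLQMHRRRH"          # RcHm_v2.0_Chr4g0445361, residues 71-93


def identical_positions(consensus: str, query: str) -> list[int]:
    """Return 1-based alignment columns where the query equals the consensus.

    Assumes both rows are gap-free and of equal length (illustrative helper).
    """
    if len(consensus) != len(query):
        raise ValueError("aligned rows must have equal length")
    return [
        column
        for column, (c, q) in enumerate(zip(consensus, query), start=1)
        if c.upper() == q.upper()
    ]


matches = identical_positions(HMM_CONSENSUS, QUERY)
identity = len(matches) / len(QUERY)
print(matches)
print(f"{len(matches)}/{len(QUERY)} identical columns ({identity:.1%})")
```

The columns returned correspond to the letters shown in the match row of domain 1 (y, C, C, F, r, n, L, H, r, H). Conservative substitutions marked "+" depend on the model's emission scores and are not reproduced by this comparison.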

Sequence
Protein Sequence    Length: 470 aa
MLANNNHTNT SASSAAPSSS SDHHHAFTPL ENGAAAAAAT NKRKRRPAGT PDPDAEVVSL  60
SPKTLLESDR YVCEICNQGF QRDQNLQMHR RRHKVPWKLL KREIQDQVKK RVFVCPEPSC  120
LHHDPCHALG DLVGIKKHFR RKHSNHKQWV CDKCSKGYAV QSDYKAHLKT CGTRGHSCDC  180
GRVFSRVESF IEHQDTCSVR HVVRPELQAL QPAAACSSRT ASSTSPSSDG HFSINNNNNA  240
APPLPGLPVM PKPNDQPVVF ISSHEGHDGN PSTSNHQHQQ PREQVLELKL LPSSDTQTSP  300
RNEDENYATH LKLSIGGSSR GGDQKNDSPR SSSREMNTNI GFGGRASREV ARLKDFASEE  360
LKLAMAEKAY AEDARREAKR QIEMAELEFA NAKRIRQQAQ AELQKAQLLK EQSTKKISSA  420
IMQITCPSCK QHFHASNPSN AVGVGPSVDE TTSLGMSYMS SATTEGEGEQ
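
The sequence block above is printed in groups of ten residues with a running position counter at the end of each full row. A short sketch of how the block might be cleaned and cross-checked against the Signature Domain coordinates follows. It assumes the Start and End columns are 1-based and inclusive, which is consistent with the alignment blocks in this record. The helper names clean_sequence and region are hypothetical.

```python
import re

# Sequence block copied from the record above, including group spacing and
# the running position counters at the end of each full row.
RAW_BLOCK = """
MLANNNHTNT SASSAAPSSS SDHHHAFTPL ENGAAAAAAT NKRKRRPAGT PDPDAEVVSL  60
SPKTLLESDR YVCEICNQGF QRDQNLQMHR RRHKVPWKLL KREIQDQVKK RVFVCPEPSC  120
LHHDPCHALG DLVGIKKHFR RKHSNHKQWV CDKCSKGYAV QSDYKAHLKT CGTRGHSCDC  180
GRVFSRVESF IEHQDTCSVR HVVRPELQAL QPAAACSSRT ASSTSPSSDG HFSINNNNNA  240
APPLPGLPVM PKPNDQPVVF ISSHEGHDGN PSTSNHQHQQ PREQVLELKL LPSSDTQTSP  300
RNEDENYATH LKLSIGGSSR GGDQKNDSPR SSSREMNTNI GFGGRASREV ARLKDFASEE  360
LKLAMAEKAY AEDARREAKR QIEMAELEFA NAKRIRQQAQ AELQKAQLLK EQSTKKISSA  420
IMQITCPSCK QHFHASNPSN AVGVGPSVDE TTSLGMSYMS SATTEGEGEQ
"""


def clean_sequence(block: str) -> str:
    """Drop whitespace and position counters, keeping residue letters only."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based inclusive coordinates."""
    return seq[start - 1:end]


# (Start, End) pairs taken from the Signature Domain table in this record.
DOMAINS = {1: (71, 93), 2: (149, 171), 3: (179, 197)}

seq = clean_sequence(RAW_BLOCK)
print(len(seq))
for number, (start, end) in DOMAINS.items():
    print(number, start, end, region(seq, start, end))
```

The cleaned length can be compared with the stated 470 aa, and each extracted region with the query row of the corresponding alignment block.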
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    42     47   KRKRRP
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G68130.1  1e-137   C2H2 family protein