PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GAY58469.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Sapindales; Rutaceae; Aurantioideae; Citrus
Family C3H
Protein Properties Length: 954aa    MW: 105842 Da    PI: 8.832
Description C3H family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GAY58469.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-CCCH19.41.8e-06255275526
                 --SGGGGTS--TTTTT-SS-SS CS
     zf-CCCH   5 lCrffartGtCkyGdrCkFaHg 26 
                 lC++fa+ G C++G+ C+F+H 
  GAY58469.1 255 LCKDFAA-GKCRRGSHCHFSHH 275
                 9******.*************5 PP

Sequence ? help Back to Top
Protein Sequence    Length: 954 aa     Download sequence    
FSFVFAPCLV SLLLDTICKN LHQLQSQILL TASVYLSKTQ YFNKVNAWRD CQIVLVLVVN  60
LSLCFYDMSG TRRKHNSKWD LKEESQLSHE KVRDSARPGK AGISFYERES RSGRFSPRAA  120
GYNSGHNWSA READDIQSSR HDMQFSSREP LPGSRSSRKD DRIDDYRENF KATATWDADG  180
NYDMKMSPGL DDWRQQIRRR SPRKDWNGHR RSRSRSRSRS RSRSWNRSRS PVRELRRESG  240
VYGRNRGRPG VSAQLCKDFA AGKCRRGSHC HFSHHSSQSY EDNWDSRHKQ AGAPRFSTPH  300
ESREYPIRSG RNREGSLEIV DIPCKFFAAG NCRNGKYCKF FHSSQALASP VRRSRDDSLV  360
RGQNSDEREK LWHGSKWTDA TTISDAARLS EDKNERMGAK KSRDDGLVRN HNSDDVEKLW  420
NGSTWNGTDI STDAAKLSEN QNVGMGAPGP RFSGWSTDDR LPHTLDENAT HSKITAVTLG  480
GDEINKMEAS QGSIKIAGAV MGAPESGGTE NWLGDMEMSP EWNYPVKPCS RVMNEDHGQI  540
TRSSQSLPIC DTSVLHEQGI IQETSGLLCD EAATMEPMMD KSYLKRDINQ RDVGGVRLPG  600
ADKVAIGETA IPHIDLNFSA NVLPTQGLEQ NGQSSSALPF LNLNSIGQSQ GAINSESSRG  660
GNINNPQNHA VFQVEKSINK PGTGDGSALQ FSSAIQPTQN MVSSEQLTQL TNLSASLVQI  720
LGNGQQLPQL YAALNSHNVM QVPSSVKSEG PIAPDSAVAS QTSEAIRSQN QNQSQYDPLS  780
DSIDPKQLEL VSPPGFSVNP SGQKSNADGK PNGGLENHKV SEINGEVEAE EGRKAQAENK  840
VPQENGEVQK TDGDDKDDKA DEGKKSKDTK GLRAFKFALA EFVKELLKPT WKEGQINKDA  900
YKNIVKKVVD KVIGTMQGAA NIPQTQEKID QYLSFSKSKL TKLVQAYVEK SRKG
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1211219RSRSRSRSR
2211221RSRSRSRSRSR
3213221RSRSRSRSR
4213223RSRSRSRSRSR
5215223RSRSRSRSR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G33835.14e-13C3H family protein