PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GAY58470.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Sapindales; Rutaceae; Aurantioideae; Citrus
Family C3H
Protein Properties Length: 942 aa    MW: 104317 Da    pI: 8.8069
Description C3H family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
GAY58470.1     genome  NCBI    View CDS
Signature Domain
No.  Domain   Score  E-value  Start  End  HMM Start  HMM End
1    zf-CCCH  19.5   1.8e-06  243    263  5          26
                 --SGGGGTS--TTTTT-SS-SS CS
     zf-CCCH   5 lCrffartGtCkyGdrCkFaHg 26 
                 lC++fa+ G C++G+ C+F+H 
  GAY58470.1 243 LCKDFAA-GKCRRGSHCHFSHH 263
                 9******.*************5 PP
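The alignment above places the three cysteines and one histidine of the zinc finger at fixed spacings. A minimal sketch of scanning a protein sequence for that spacing with a regular expression follows. The consensus used here, C-x(6..14)-C-x(4..5)-C-x(3)-H, is an assumption chosen for illustration and is not taken from this record; the function name `find_ccch_motifs` is likewise hypothetical. A regex scan is a much cruder technique than the profile HMM search that produced the score and E-value in the table.

```python
import re

# Assumed consensus for CCCH-type zinc fingers: C-x(6..14)-C-x(4..5)-C-x(3)-H.
# The spacing ranges are an illustrative assumption, not part of this record.
# The lookahead lets the scan report motifs that overlap each other.
CCCH_PATTERN = re.compile(r"(?=(C.{6,14}C.{4,5}C.{3}H))")


def find_ccch_motifs(sequence):
    """Return (start, end, motif) tuples with 1-based inclusive coordinates."""
    hits = []
    for match in CCCH_PATTERN.finditer(sequence):
        motif = match.group(1)
        start = match.start(1) + 1
        hits.append((start, start + len(motif) - 1, motif))
    return hits


# Residues 243-263 of GAY58470.1, copied from the alignment above.
fragment = "LCKDFAAGKCRRGSHCHFSHH"
for start, end, motif in find_ccch_motifs(fragment):
    # Add 242 to convert fragment coordinates to protein coordinates.
    print(start + 242, end + 242, motif)
```

On this fragment the scan reports one motif, CKDFAAGKCRRGSHCHFSH, at protein positions 244 to 262, which lies inside the 243 to 263 span reported for the domain.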

Sequence
Protein Sequence    Length: 942 aa
FSFVFAPCLV SLLLDTICKN LHQLQSQILL TASVYLSKTQ IVLVLVVNLS LCFYDMSGTR  60
RKHNSKWDLK EESQLSHEKV RDSARPGKAG ISFYERESRS GRFSPRAAGY NSGHNWSARE  120
ADDIQSSRHD MQFSSREPLP GSRSSRKDDR IDDYRENFKA TATWDADGNY DMKMSPGLDD  180
WRQQIRRRSP RKDWNGHRRS RSRSRSRSRS RSWNRSRSPV RELRRESGVY GRNRGRPGVS  240
AQLCKDFAAG KCRRGSHCHF SHHSSQSYED NWDSRHKQAG APRFSTPHES REYPIRSGRN  300
REGSLEIVDI PCKFFAAGNC RNGKYCKFFH SSQALASPVR RSRDDSLVRG QNSDEREKLW  360
HGSKWTDATT ISDAARLSED KNERMGAKKS RDDGLVRNHN SDDVEKLWNG STWNGTDIST  420
DAAKLSENQN VGMGAPGPRF SGWSTDDRLP HTLDENATHS KITAVTLGGD EINKMEASQG  480
SIKIAGAVMG APESGGTENW LGDMEMSPEW NYPVKPCSRV MNEDHGQITR SSQSLPICDT  540
SVLHEQGIIQ ETSGLLCDEA ATMEPMMDKS YLKRDINQRD VGGVRLPGAD KVAIGETAIP  600
HIDLNFSANV LPTQGLEQNG QSSSALPFLN LNSIGQSQGA INSESSRGGN INNPQNHAVF  660
QVEKSINKPG TGDGSALQFS SAIQPTQNMV SSEQLTQLTN LSASLVQILG NGQQLPQLYA  720
ALNSHNVMQV PSSVKSEGPI APDSAVASQT SEAIRSQNQN QSQYDPLSDS IDPKQLELVS  780
PPGFSVNPSG QKSNADGKPN GGLENHKVSE INGEVEAEEG RKAQAENKVP QENGEVQKTD  840
GDDKDDKADE GKKSKDTKGL RAFKFALAEF VKELLKPTWK EGQINKDAYK NIVKKVVDKV  900
IGTMQGAANI PQTQEKIDQY LSFSKSKLTK LVQAYVEKSR KG
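The listing above is split into blocks of ten residues with a running position count at the end of each full line. A minimal sketch of turning such a listing back into one contiguous string is shown here, so that its length can be compared with the stated 942 aa. The helper name `parse_sequence_block` is hypothetical, and the demonstration uses only the first and last lines of the listing.

```python
def parse_sequence_block(text):
    """Join a blocked protein listing into one contiguous string.

    Whitespace and the running residue counts at the line ends are dropped.
    """
    residues = []
    for line in text.splitlines():
        for token in line.split():
            if token.isdigit():
                continue  # running position counter, not sequence
            residues.append(token)
    return "".join(residues)


# First and last lines of the listing above, used as a short demonstration.
block = """FSFVFAPCLV SLLLDTICKN LHQLQSQILL TASVYLSKTQ IVLVLVVNLS LCFYDMSGTR  60
IGTMQGAANI PQTQEKIDQY LSFSKSKLTK LVQAYVEKSR KG"""
sequence = parse_sequence_block(block)
print(len(sequence))  # 60 residues from the first line plus 42 from the last
```

Passing all sixteen lines of the listing to the same function gives the full protein sequence, whose length can then be checked against the header.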
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    199    207  RSRSRSRSR
2    199    209  RSRSRSRSRSR
3    201    209  RSRSRSRSR
4    201    211  RSRSRSRSRSR
5    203    211  RSRSRSRSR
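The five predicted signals overlap one another within the same RS-repeat stretch. A minimal sketch of collapsing overlapping hits into distinct regions follows, assuming 1-based inclusive coordinates as used in the table. The function name `merge_intervals` is hypothetical and is not part of any PlantTFDB tooling.

```python
def merge_intervals(intervals):
    """Merge overlapping 1-based inclusive (start, end) intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if this hit ends later.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# Start and end positions of the five hits in the NLS table.
nls_hits = [(199, 207), (199, 209), (201, 209), (201, 211), (203, 211)]
print(merge_intervals(nls_hits))  # [(199, 211)]
```

All five hits merge into a single region spanning residues 199 to 211, which corresponds to the 13-residue stretch RSRSRSRSRSRSR in the protein sequence.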
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G33835.1  3e-13    C3H family protein