PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cotton_A_19983_BGI-A2_v1.0
Common NameF383_16436
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family C3H
Protein Properties Length: 424aa    MW: 47055.8 Da    PI: 9.3783
Description C3H family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cotton_A_19983_BGI-A2_v1.0genomeBGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-CCCH22.52.1e-07101122426
                                 ---SGGGGTS--TTTTT-SS-SS CS
                     zf-CCCH   4 elCrffartGtCkyGdrCkFaHg 26 
                                 +lC+++++ G+C++Gd+C + H+
  Cotton_A_19983_BGI-A2_v1.0 101 KLCKYWMA-GHCHRGDKCWYLHS 122
                                 69******.*********99996 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5010314.63397124IPR000571Zinc finger, CCCH-type
SuperFamilySSF902292.75E-798122IPR000571Zinc finger, CCCH-type
SMARTSM003566.5E-599123IPR000571Zinc finger, CCCH-type
PfamPF006421.0E-4101122IPR000571Zinc finger, CCCH-type
Gene3DG3DSA:4.10.1000.104.9E-6101121IPR000571Zinc finger, CCCH-type
SuperFamilySSF509783.57E-44120176IPR017986WD40-repeat-containing domain
Gene3DG3DSA:2.130.10.101.2E-44122416IPR015943WD40/YVTN repeat-like-containing domain
SMARTSM003207.7E-6128167IPR001680WD40 repeat
PfamPF004006.3E-5132167IPR001680WD40 repeat
PROSITE profilePS5008213.148135176IPR001680WD40 repeat
PROSITE profilePS5029417.456135337IPR017986WD40-repeat-containing domain
PRINTSPR003201.4E-5154168IPR020472G-protein beta WD-40 repeat
PROSITE patternPS006780154168IPR019775WD40 repeat, conserved site
SMARTSM0032077207244IPR001680WD40 repeat
SuperFamilySSF509783.57E-44214418IPR017986WD40-repeat-containing domain
SMARTSM003201.9E-7251288IPR001680WD40 repeat
PfamPF004007.6E-5255288IPR001680WD40 repeat
PROSITE profilePS5008214.184258297IPR001680WD40 repeat
PROSITE patternPS006780275289IPR019775WD40 repeat, conserved site
PRINTSPR003201.4E-5275289IPR020472G-protein beta WD-40 repeat
SMARTSM003200.0043291328IPR001680WD40 repeat
PfamPF004000.0025293326IPR001680WD40 repeat
SMARTSM0032030334375IPR001680WD40 repeat
PRINTSPR003201.4E-5362376IPR020472G-protein beta WD-40 repeat
SMARTSM00320160378415IPR001680WD40 repeat
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005515Molecular Functionprotein binding
GO:0046872Molecular Functionmetal ion binding
Sequence ? help Back to Top
Protein Sequence    Length: 424 aa     Download sequence    Send to blast
MARFFTLPIQ ITRVRKKFGR KSGRLAPTAI YISSPTSTFS LVTRCAFNLF FQRMNVHGRE  60
RKRAGNNGVF SYSKANDIFR RNPNPPPNSS PSTLKSPDQN KLCKYWMAGH CHRGDKCWYL  120
HSWSLGDGFT MLAKLEGHKK TIRGIALPSG HDKLYSGSSD GTARIWDGHT GKCLHFTNLG  180
DEAGSLITEG AWVFIGMKNV VKALNIHSDL ELNLKGPVGQ VYAMIVAGDM LFAGAQNGGI  240
IAWRASFETD SFQLAASLEG HNGAVSCLAV GDKMLFSGSL DKTIRVWDID TFQCIKTFSG  300
HADVVTSLVH CNGYLFSSSL DCTIKVLFAT EGQNWVVLYT HKEENGVLTL CGMNDAETKP  360
VLFCSYNDDT IRLYDLPSFC ERGRIFSKRE VRVIERGPKN LFFTGDASGS LTVWKWRQNP  420
QGSS
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
6g6o_A1e-191164146321Ika8
6g6o_B1e-191164146321Ika8
6g6o_C1e-191164146321Ika8
6g6p_A1e-191164146321Ika8
Search in ModeBase
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017648259.10.0PREDICTED: zinc finger CCCH domain-containing protein 48-like isoform X2
RefseqXP_017648260.10.0PREDICTED: zinc finger CCCH domain-containing protein 48-like isoform X2
SwissprotQ0DYP51e-126C3H17_ORYSJ; Zinc finger CCCH domain-containing protein 17
TrEMBLA0A0B0NDR80.0A0A0B0NDR8_GOSAR; Uncharacterized protein
STRINGEOX929121e-177(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM18392869
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G51980.11e-129C3H family protein
Publications ? help Back to Top
  1. Kikuchi S, et al.
    Collection, mapping, and annotation of over 28,000 cDNA clones from japonica rice.
    Science, 2003. 301(5631): p. 376-9
    [PMID:12869764]