PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gh_A02G1109
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family CPP
Protein Properties Length: 935aa    MW: 101322 Da    PI: 7.3108
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gh_A02G1109genomeNAU-NBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR48.41.9e-15521559341
          TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkee 41 
                  +k+CnCkk+kClk+YC+Cfaag +C+e C+C++C+N+ e
  Gh_A02G1109 521 CKRCNCKKTKCLKLYCDCFAAGIYCAEPCSCQGCFNRPE 559
                  79**********************************876 PP

2TCR47.82.8e-15607645139
          TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                  ++k+gCnCk+s ClkkYCeC++a++ Cs  C+Ce+CkN 
  Gh_A02G1109 607 RHKRGCNCKRSMCLKKYCECYQANVGCSIGCRCEGCKNV 645
                  589***********************************6 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM011144.8E-16519560IPR033467Tesmin/TSO1-like CXC domain
PROSITE profilePS5163434.815520647IPR005172CRC domain
PfamPF036382.9E-11522557IPR005172CRC domain
SMARTSM011143.8E-17607648IPR033467Tesmin/TSO1-like CXC domain
PfamPF036385.2E-11609644IPR005172CRC domain
Sequence ? help Back to Top
Protein Sequence    Length: 935 aa     Download sequence    Send to blast
MDSPKLSKSP IPSASAFASA SASISSSSSP VQESPFSNYI SSLSPIKHDK AAHVAQGFVG  60
LSSPPLVFTS PRINTLGRPQ SSSVEISQKC EGDKKIINES CILERSVTES QGLVTDIKNE  120
DNKDDVAVQL GSSSECVDEY LADPVETDCA KSACSVKLNL KQSNNVLQSS VDGLLDLKNI  180
KFGSKNNVGR EVDAAQFLSG QSEESIERKL TSDEKLLKIE NEQGSAQGIS DGFQKFESDR  240
FDLSSKEKEC KNFGPQKDGR GDGCSNFLQQ LPGSLLGVQS YEGFAKNIGG DADVPVHSMT  300
HEASELQRSM SRRCLQFGGA QPEATATCSI STNRANNIIS STSLATNSET ESLSSSHLDL  360
SAKSRTRQLV NLSQLAMNMI PQCYGNSSLT VLKPSGIGLH LNSIVNATSM GQGGTASMKL  420
AQAIKSTSTT SCQTTENIDN CSDAFEKVST PQEGALEQKV CTIAGSASES LFAEESVGFH  480
MTPNTKRKFS SEDGDGNDMF DQQSPIKKRQ KLSNSTDGDG CKRCNCKKTK CLKLYCDCFA  540
AGIYCAEPCS CQGCFNRPEY EETVLETRKQ IESRNPLAFA PKIVQPVTEF PLSNREDGNR  600
KTPSSARHKR GCNCKRSMCL KKYCECYQAN VGCSIGCRCE GCKNVYGKKK DYCVTEEMVN  660
RSGEISESRV AAKPKKEIFH SELCDPYHLT PLTPSVQCSD HGKNASISRL LSRRCLPSPE  720
SDLTVLSYES PRSRRTSDSN DILLETSKGN LDIDSFCEGI SYNNAVTLAD EFHHTPLPNH  780
PSVIIGSSSS KARELTSLSR VQLDPRSRSL TSGGSLHWHS SPIKPMSPLN DNKKLQGLDS  840
ADGGLYDILE DDMPEILKYT SMPIKPVKAG SPNGKRVSPP HNLHQLGSSS SGPLRSGRKF  900
ILKAVPSFPP LTPCIDAKGS CNQSRNNFQE NRSND
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5fd3_A3e-1852264512121Protein lin-54 homolog
5fd3_B3e-1852264512121Protein lin-54 homolog
Search in ModeBase
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankDQ9082322e-49DQ908232.1 Gossypium hirsutum clone LIB5327-014-A1-N1-A9_Gh292 microsatellite sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_016742466.10.0PREDICTED: protein tesmin/TSO1-like CXC 2 isoform X1
TrEMBLA0A1U8NU050.0A0A1U8NU05_GOSHI; protein tesmin/TSO1-like CXC 2 isoform X1
STRINGGorai.013G110900.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM151961015
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G22780.13e-63Tesmin/TSO1-like CXC domain-containing protein