PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID TRIDC4BG053560.2
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Pooideae; Triticodae; Triticeae; Triticinae; Triticum
Family C2H2
Protein Properties Length: 1429aa    MW: 158757 Da    PI: 5.6644
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
TRIDC4BG053560.2genomeEnsemblPlantsView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H213.30.0002413191344123
                        EEET..TTTEEESSHHHHHHHHHH.T CS
           zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt.H 23  
                        ++C+   C+++F ++ +L+ H r+ +
  TRIDC4BG053560.2 1319 FQCEidFCDMTFESRADLRAHERNiC 1344
                        89********************9877 PP

2zf-C2H2110.001414021428123
                        EEET..TTTEEESSHHHHHHHHHH..T CS
           zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt..H 23  
                        y+C    Cg++F+  s++ rH r+  H
  TRIDC4BG053560.2 1402 YECLveGCGQTFRYVSDYSRHRRKfnH 1428
                        889999*****************9777 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1429 aa     Download sequence    
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXALFTTRHQ  60
ELGTARRGRP PPQVLKQVWQ SGERYTLDQF EAKSRAFAKA HLAGLRDPTP LAVESLFWKA  120
SADRPIYIEY ANDVPGSGFA ASAQSHRRPH KKRRREGAPV DEGEKATGWK LSCSPWNLQA  180
IARAPGSLTR FMPDDVPGVT SPMVYIGMLF SWFAWHIEDH ELHSLNFLHT GAPKTWYAVP  240
GDRAAELEEV IRVHGYGGNP DRLASLAVLG EKTTLMSPEV IVASGLPCCR LVQHPGEFVV  300
TFPRAYHVGF SHGFNCGEAA NFATPQWLKF AKEAAVRRAV MNYLPMLSHQ QLLYLLAVSF  360
ISRTPRELLY GIRTSRLRDR RKEERELLKK LIDNAVLWEP DLLPSSTALH SCSSGPKAPL  420
KVDDVHSIES VPKENCSSDD IASRAGIQPK CMSMDSKSSD AMSTSEAQKL DTDTDDDGDL  480
PFDLSIDSGS LTCVACGILG FPFMAILQPS KKALEDMSLV DIERFKLNCE KENHSNAIPC  540
SPDDGNSGHP VIAKRPSSPV AESNFSHQNA ESDKDGVGLD GPLLPHNNSS HSCSSENTLN  600
PCINTETTET KIPSARFGIE FSKQTGRGDI DAQATESCGN TVDWNITSAF VRPRIFCLQH  660
ALEIEELLEG RGGVHALIIC HADYTKLKAL AISIAEEIEF QFDCKDVPLV NASKSDLHLI  720
NISIDDEGYK EDERDWTTQM GLNMKYFAKL RKETPGCQEQ PPLSFWKRLD ISDKPLPISV  780
VPNLKWLCRR ARTPYRVVGY AANRNATVGP DVVSPAVTKA EMGTSGNAYE NAKEQRTAEQ  840
DALLEPSRLQ EADDVADMHT CSEDIDQDMH CLIGSKRQRT AEQDAPLQPS RLQEADDVVD  900
MHTCSVDNDQ DMHRLIGIPV AVAEYPMVHQ VCEGTVSVST CELDDLVSAS TSDDSVCSAY  960
SQDSPGVSDD FTTEQKCVQS DELTSSVAMS VQQFLLDESM TAEDSSNQEK LGSYNVTSEC  1020
KDKQLQVQQE QENIELCNNA GRNMATVVQV DSSHFPDKAV NLKSAIPTES QHEYPKRDAI  1080
VLEGMQAALT TVVSGENRNS VHTELDSLGI LLGALAEESI LADVPGKDEV DDASLTLMTL  1140
ASIDQSAGDV AHNEVIETSS SSIGASLSCR GRTLTNLASD GSLRIQNAEI QNKQENAEEV  1200
DAWNCQGWKS SRGVLDSSAN SLSETGKSSG TPNTYQPDIL SRSIGSSKRT SIICYVRRKR  1260
KQKRKRESQS VGSFARAPCE RLRPRTKRAV IEEPAEQIET AKPSAAATKG KRSKVVELFQ  1320
CEIDFCDMTF ESRADLRAHE RNICTDESCG KRFQSHKYLK RHQCVHRDER PFKCPWEGCG  1380
MTFKWLWAQT EHIRVHTGER PYECLVEGCG QTFRYVSDYS RHRRKFNHY
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
112561263VRRKRKQK
212581267RKRKQKRKRE
312601269RKRKQKRKRE
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G04240.11e-144C2H2 family protein