PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID TRIDC4BG053560.4
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Pooideae; Triticodae; Triticeae; Triticinae; Triticum
Family C2H2
Protein Properties Length: 1291aa    MW: 142987 Da    PI: 5.0701
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
TRIDC4BG053560.4genomeEnsemblPlantsView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H213.50.0002211811206123
                        EEET..TTTEEESSHHHHHHHHHH.T CS
           zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt.H 23  
                        ++C+   C+++F ++ +L+ H r+ +
  TRIDC4BG053560.4 1181 FQCEidFCDMTFESRADLRAHERNiC 1206
                        89********************9877 PP

2zf-C2H210.70.001612341258123
                        EEET..TTTEEESSHHHHHHHHHHT CS
           zf-C2H2    1 ykCp..dCgksFsrksnLkrHirtH 23  
                        +kCp   Cg++F+     + Hir+H
  TRIDC4BG053560.4 1234 FKCPweGCGMTFKWLWAQTEHIRVH 1258
                        89*********************99 PP

3zf-C2H211.10.001212641290123
                        EEET..TTTEEESSHHHHHHHHHH..T CS
           zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt..H 23  
                        y+C    Cg++F+  s++ rH r+  H
  TRIDC4BG053560.4 1264 YECLveGCGQTFRYVSDYSRHRRKfnH 1290
                        889999*****************9777 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1291 aa     Download sequence    
GAPVDEGEKA TGWKLSCSPW NLQAIARAPG SLTRFMPDDV PGVTSPMVYI GMLFSWFAWH  60
IEDHELHSLN FLHTGAPKTW YAVPGDRAAE LEEVIRVHGY GGNPDRLASL AVLGEKTTLM  120
SPEVIVASGL PCCRLVQHPG EFVVTFPRAY HVGFSHGFNC GEAANFATPQ WLKFAKEAAV  180
RRAVMNYLPM LSHQQLLYLL AVSFISRTPR ELLYGIRTSR LRDRRKEERE LLVKREFLQD  240
MISENELLCS FLKKKLIDNA VLWEPDLLPS STALHSCSSG PKAPLKVDDV HSIESVPKEN  300
CSSDDIASRA GIQPKCMSMD SKSSDAMSTS EAQKLDTDTD DDGDLPFDLS IDSGSLTCVA  360
CGILGFPFMA ILQPSKKALE DMSLVDIERF KLNCEKENHS NAIPCSPDDG NSVIAKRPSS  420
PVAESNFSHQ NAESDKDGVG LDGPLLPHNN SSHSCSSENT LNPCINTETT ETKIPSARFG  480
IEFSKQTGRG DIDAQATESC GNTVDWNITS AFVRPRIFCL QHALEIEELL EGRGGVHALI  540
ICHADYTKLK ALAISIAEEI EFQFDCKDVP LVNASKSDLH LINISIDDEG YKEDERDWTT  600
QMGLNMKYFA KLRKETPGCQ EQPPLSFWKR LDISDKPLPI SVVPNLKWLC RRARTPYRVV  660
GYAANRNATV GPDVVSPAVT KAEMGTSGNA YENAKEQRTA EQDALLEPSR LQEADDVADM  720
HTCSEDIDQD MHCLIGSKRQ RTAEQDAPLQ PSRLQEADDV VDMHTCSVDN DQDMHRLIGI  780
PVAVAEYPMV HQVCEGTVSV STCELDDLVS ASTSDDSVCS AYSQDSPGVS DDFTTEQKCV  840
QSDELTSSVA MSVQQFLLDE SMTAEDSSNQ EKLGSYNVTS ECKDKQLQVQ QEQENIELCN  900
NAGRNMATVV QVDSSHFPDK AVNLKSAIPT ESQHEYPKRD AIVLEGMQAA LTTVVSGENR  960
NSVHTELDSL GILLGALAEE SILADVPGKD EVDDASLTLM TLASIDQSAG DVAHNEVIET  1020
SSSSIGASLS CRGRTLTNLA SDGSLRIQNA EIQNKQENAE EVDAWNCQGW KSSRGVLDSS  1080
ANSLSETGKS SGTPNTYQPD ILSRSIGSSK RTSIICYVRR KRKQKRKRES QSVGSFARAP  1140
CERLRPRTKR AVIEEPAEQI ETAKPSAAAT KGKRSKVVEL FQCEIDFCDM TFESRADLRA  1200
HERNICTDES CGKRFQSHKY LKRHQCVHRD ERPFKCPWEG CGMTFKWLWA QTEHIRVHTG  1260
ERPYECLVEG CGQTFRYVSD YSRHRRKFNH Y
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
111181125VRRKRKQK
211201129RKRKQKRKRE
311221131RKRKQKRKRE
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G04240.11e-119C2H2 family protein