PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID TRIDC4AG055050.3
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Pooideae; Triticodae; Triticeae; Triticinae; Triticum
Family C2H2
Protein Properties Length: 1173aa    MW: 129560 Da    PI: 5.065
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
TRIDC4AG055050.3genomeEnsemblPlantsView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H213.40.0002310631088123
                        EEET..TTTEEESSHHHHHHHHHH.T CS
           zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt.H 23  
                        ++C+   C+++F ++ +L+ H r+ +
  TRIDC4AG055050.3 1063 FQCEidFCDMTFESRAELRAHERNiC 1088
                        89********************9877 PP

2zf-C2H210.90.001511161140123
                        EEET..TTTEEESSHHHHHHHHHHT CS
           zf-C2H2    1 ykCp..dCgksFsrksnLkrHirtH 23  
                        +kCp   Cg++F+     + Hir+H
  TRIDC4AG055050.3 1116 FKCPwdGCGMTFKWLWAQTEHIRVH 1140
                        89*********************99 PP

3zf-C2H214.10.0001411461172123
                        EEET..TTTEEESSHHHHHHHHHH..T CS
           zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt..H 23  
                        y+C+  dCg++F+  s++ rH r+  H
  TRIDC4AG055050.3 1146 YECSvpDCGQTFRYVSDYSRHRRKfnH 1172
                        89999******************9777 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1173 aa     Download sequence    
YSVETEDLHL CILIVLCSSS KWSDQYITGF NCGEAANFAT PQWLKFAKEA AVRRAVMNYL  60
PMLSHQQLLY LLAVSFISRT PRELLYGIRT SRLRDRRKEE RELLVKREFL QDMISENELL  120
CAFLKKKLIE NAVLWEPDLL PSSTALHSCS SGPKAPLKVD DVHSIESVPK ENSSSDDIAS  180
RAGIQPKCMS MDSKSSDAMS AAEAQKLDTD TDDDGDLPFD LSIDSGSLTC VACGILGFPF  240
MAILQPSKKA LEDMSLVDIE RFKLNCEKEN HSNAIPCSPD DSISGHPVIA KRPSSPVAQS  300
NFSHQNAESD KDGVGLDGPL LPHNNSAHSC NSENTLNPGI NTETTETKIP SARFGIEFSK  360
QTGRGDIDAQ ATESCGNTVD WNITSAFVRP RIFCLQHALE IEELLEGKGG AHALIICHAD  420
YTKLKALAIS IAEEIEFQFD CKDVPLANAS KSDLHLINIS IDDEGYKEDE RDWTTQMGLN  480
MKYFAKLRKE TPGCQEQPPL SFWKRLDISD KPSPISVVPN LKWLCRRART PYRVVGYAAS  540
RNATVGPDVV SPAVTKAEMG TSGNAYENAK EQRTGEQDAP LEPSRLQEAD DVADMHTCSE  600
DIDQDMHCLI GSKRQRTAEQ NAPLQPSRLQ EADDVVDMHM CSVDNDQDMH RLIGIPVAAA  660
EYPMTHQVCE GTVSVSTCEL DDLVSASTSD DPICSAHSQD SPGVSDDFTT EQQCVQSDEL  720
TSSVAMSAQQ FLVDGSMTAE DSSNHENLGS YNVTSECKDK QLQVQQEQEN IELCNNAGRN  780
LAAAVQVNSG HFGDKAVNLK SAIPTESQHE YPKRDAIVLE GMQAALTTVV SGENRNSVNT  840
ELDSLGILLG ALAEESILAD VPGKDEVDDA SLTLMTLASI DQSAGDVAHN EVIETSSSSV  900
GASISCKGRT LSNLASDGSL RIQNAEIQNK QENAEEVGAW NCQGLKNSRG ILDSSANSLS  960
DTGKSSGTPK AYQPDILSRS IGSSKRRSII CYVRRKRKQK RKRESELSTS NSQSFGSFAR  1020
APCERLRPRR KPAVIEEPAE QIETAKPSAA ATKGKRSKVV ELFQCEIDFC DMTFESRAEL  1080
RAHERNICTD ESCGKRFQSH KYLKRHQCVH RDERPFKCPW DGCGMTFKWL WAQTEHIRVH  1140
TGERPYECSV PDCGQTFRYV SDYSRHRRKF NHY
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
19851001KRRSIICYVRRKRKQKR
29851003KRRSIICYVRRKRKQKRKR
39931000VRRKRKQK
49951004RKRKQKRKRE
59971006RKRKQKRKRE
610271031RPRRK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G04240.12e-50C2H2 family protein