PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID OGLUM03G03870.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Oryzoideae; Oryzeae; Oryzinae; Oryza
Family C2H2
Protein Properties Length: 1489 aa    MW: 163187 Da    pI: 5.9875
Description C2H2 family protein
Gene Model
Gene Model ID      Type     Source          Coding Sequence
OGLUM03G03870.1    genome   EnsemblPlants   View CDS
Signature Domain
No.  Domain   Score  E-value  Start  End   HMM Start  HMM End
1    zf-C2H2  15.7   4.4e-05  1379   1404  1          23
                       EEET..TTTEEESSHHHHHHHHHH.T CS
          zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt.H 23  
                       ++C    C+++F +k +L+ H+r+ +
  OGLUM03G03870.1 1379 FQCDreFCDMTFETKAELRAHQRNiC 1404
                       899999*****************877 PP

2    zf-C2H2  12.5   0.00043  1404   1426  3          23
                       ET..TTTEEESSHHHHHHHHHHT CS
          zf-C2H2    3 Cp..dCgksFsrksnLkrHirtH 23  
                       C+   Cgk+Fs++ +LkrH+ +H
  OGLUM03G03870.1 1404 CTdeSCGKRFSSHKYLKRHQCVH 1426
                       77778***************998 PP

3    zf-C2H2  13.7   0.00019  1462   1488  1          23
                       EEET..TTTEEESSHHHHHHHHHH..T CS
          zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt..H 23  
                       ykC+  dCg+sF+  s++ rH ++  H
  OGLUM03G03870.1 1462 YKCSapDCGQSFRYVSDYSRHRKKfnH 1488
                       99********************98666 PP
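
The Start and End columns of the domain table are 1-based and inclusive, so they need a small adjustment before they can be used as Python slice bounds. The sketch below illustrates that conversion on residues 1371-1489 copied from the protein sequence on this page. The names `FRAGMENT`, `FRAGMENT_START`, `DOMAINS`, and `slice_region` are hypothetical helpers introduced here for illustration; they are not part of PlantTFDB.

```python
# Residues 1371-1489 of OGLUM03G03870.1, copied from the protein sequence on this page.
FRAGMENT_START = 1371  # 1-based position of the first residue in FRAGMENT (assumed layout)
FRAGMENT = (
    "RKKAKVEAFQ"
    "CDREFCDMTFETKAELRAHQRNICTDESCGKRFSSHKYLKRHQCVHRDERPFKCPWDGCP"
    "MTFKWLWAQTEHIRVHTGERPYKCSAPDCGQSFRYVSDYSRHRKKFNHY"
)

# (start, end) pairs from the Signature Domain table; both ends are inclusive.
DOMAINS = [(1379, 1404), (1404, 1426), (1462, 1488)]


def slice_region(start, end, fragment=FRAGMENT, offset=FRAGMENT_START):
    """Return residues start..end (1-based, inclusive) from the fragment."""
    if start < offset or end > offset + len(fragment) - 1 or start > end:
        raise ValueError(f"region {start}-{end} is outside the fragment")
    return fragment[start - offset : end - offset + 1]


domain_sequences = [slice_region(s, e) for s, e in DOMAINS]
for (s, e), seq in zip(DOMAINS, domain_sequences):
    print(f"{s}-{e}\t{len(seq)} aa\t{seq}")
```

Each extracted string should match the query row of the corresponding alignment once that row is converted to upper case. Domains 1 and 2 share residue 1404, which is why that position appears in both slices.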

Sequence
Protein Sequence    Length: 1489 aa
MPPQPPPAAS ASASAPDPAV PAWLRGLPRA PEYRPTESEF ADPIAFLSRV EREAAAYGIC  60
KVIPPHPRPS RRFVFAHLNR SLASSYDAPA PSPAAASDSS IPPSSSSSPP PASAAVFTTR  120
HQELGNPRRG RPTPQVLKQV WQSGERYTLD QFVSKSRAFS KTHLAGLHEP TALAVESLFW  180
KASADRPIYI EYANDVPGSG FAAPVQLQRK KKRKRETAPM DGWEKSSGWR LSNSPWNLQA  240
IARAPGSLTR FMPDDVPGVT SPMVYIGMLF SWFAWHVEDH DLHSLNFLHT GAPKTWYAVP  300
GDRAVELEEV IRVHGYGGNT DRIGICLILC FFIQILCSCD ASLAVLGEKT TLMSPEVLID  360
NGVPCCRLVQ YPGEFVVTFP RAYHVGFSHG FNCGEAANFA TPQWLKFAKE AAVRRAVMNY  420
LPMLSHQQLL YLLAVSFISR NPRELLSGIR TSRLRDRKKE DRELLVKQEF LQDMISENEL  480
ICSFLGKKSV DNVVLWEPDL LPSLTALHPC SSCSKAPEKK GEDGPRIGST QSSSKDDSSS  540
DGTACMTGTQ SKGLSMDSKQ APEGEKLDTD DGDDLPFDLS IDSGSLTCVA CGILGYPFMA  600
ILQPSRKALE EISLVDKERY KLSCEKEICS NVLPCSPNDG SSGCPLIANR SSSPVENANL  660
SHQDVKPIRS DISLMGKEFN GTLGKHIGTS CSCSSENTID PYGDTETPEK KIPSDCPGSE  720
LSKQSGRGDV NVPDVEGSDE TISWNTGCAF ARPRIFCLQH ALEIEELLAS KGGVHALIIC  780
HADYVKLKAL AISIAEEIEF QFDYKDVALA NASKSDLHLI NISIDDEGYE EEGTDWTSRM  840
GLNLKHSSKI RKETSESQEQ PPLSFWGLFS KPSPISVVSN LKWLCRKART PYKVIGYASS  900
PDVVATPDKV KPAVTETQID TSGNAHENIG SEQTLQQDCV LQESNDVADM CKRPKVNDQD  960
GHSLINIPIA VAEYPMMHQV CERPDSPTTV AVSAGKPTRE QCGAESTELS TVQQFLDNGL  1020
IAEGGSMNFI SNHEHLESDN ATSVCKDEQL QVQQDQLAMV LCNNPNTELV AGELHGGAAS  1080
STLENEDSCG NTSYYSDTVL KNSKPDTDDQ PETCDRSVVL VTPKSSCDQM ISSSDRSCSL  1140
TLDCPVSTDA AFSSEKLSMA HDLMGSELQA VHNSKAEVAA SLTDVKGAKL NSIHTAQLPH  1200
ESPSSDFIIS EGSQSASATA IPRKNGTSMH TESNSIDILL GVLADESKVS SGKDEVGKAS  1260
LTLMTLAGND QSADDVTQDE VAEITDTSHG FCASDIVSRS IGSSNRTNII CYARRKHKRK  1320
SGSEFNINSP QSLGSFVRSP CESLRPRTRP AIVEDMTNET KTAEASTANK RKKAKVEAFQ  1380
CDREFCDMTF ETKAELRAHQ RNICTDESCG KRFSSHKYLK RHQCVHRDER PFKCPWDGCP  1440
MTFKWLWAQT EHIRVHTGER PYKCSAPDCG QSFRYVSDYS RHRKKFNHY
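
The sequence above is printed in blocks of ten residues with a running position counter at the end of each full line. A minimal sketch for turning such a block back into one contiguous string is shown below, using only the first two lines as sample input. `SAMPLE_BLOCK` and `join_sequence_block` are hypothetical names chosen for this example, and the approach assumes the block contains nothing but residue letters, digits, and whitespace.

```python
import re

# First two lines of the sequence block above, including the trailing position counters.
SAMPLE_BLOCK = """\
MPPQPPPAAS ASASAPDPAV PAWLRGLPRA PEYRPTESEF ADPIAFLSRV EREAAAYGIC  60
KVIPPHPRPS RRFVFAHLNR SLASSYDAPA PSPAAASDSS IPPSSSSSPP PASAAVFTTR  120
"""


def join_sequence_block(block):
    """Drop digits and whitespace, leaving one contiguous amino-acid string."""
    return re.sub(r"[\d\s]+", "", block)


sequence = join_sequence_block(SAMPLE_BLOCK)
print(len(sequence))
```

Applied to the full block, the joined length can be compared against the stated length of 1489 aa as a quick check that no line was lost during copying.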
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    208    215   QRKKKRKR
2    209    215   RKKKRKR
3    209    216   RKKKRKRE
4    1314   1320  RRKHKRK
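
Three of the four NLS entries overlap around positions 208-216, so the table describes two distinct regions of the protein. The sketch below merges overlapping hits into non-redundant intervals and checks that each listed sequence length agrees with its coordinates. It is an illustrative post-processing step, not the method PlantTFDB uses to predict the signals; `NLS_HITS` and `merge_overlapping` are hypothetical names.

```python
# (start, end, sequence) rows from the NLS table; coordinates are 1-based and inclusive.
NLS_HITS = [
    (208, 215, "QRKKKRKR"),
    (209, 215, "RKKKRKR"),
    (209, 216, "RKKKRKRE"),
    (1314, 1320, "RRKHKRK"),
]


def merge_overlapping(hits):
    """Merge overlapping (start, end) intervals into non-redundant regions."""
    merged = []
    for start, end, _ in sorted(hits):
        if merged and start <= merged[-1][1]:
            # Extend the previous region to cover this hit.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# Sanity check: each listed sequence has exactly end - start + 1 residues.
assert all(len(seq) == end - start + 1 for start, end, seq in NLS_HITS)

nls_regions = merge_overlapping(NLS_HITS)
print(nls_regions)  # [(208, 216), (1314, 1320)]
```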
Best hit in Arabidopsis thaliana
Hit ID        E-value  Description
AT5G04240.1   0.0      C2H2 family protein