PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG74597.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family NF-X1
Protein Properties Length: 1987aa    MW: 208603 Da    PI: 7.5684
Description NF-X1 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG74597.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-NF-X121.74.2e-0711761194119
    zf-NF-X1    1 CGkHkCqklCHeGpCppCp 19  
                  CG+H+C+++CH GpC+ C+
  GBG74597.1 1176 CGNHRCERPCHSGPCGACR 1194
                  ******************8 PP

2zf-NF-X119.42.3e-0612391256118
    zf-NF-X1    1 CGkHkCqklCHeGpCppC 18  
                  CG+H C++ CH+GpCppC
  GBG74597.1 1239 CGEHACEQVCHTGPCPPC 1256
                  ****************** PP

3zf-NF-X128.14.2e-0913601378119
    zf-NF-X1    1 CGkHkCqklCHeGpCppCp 19  
                  CG+H+Cq+lCHeGpCppC 
  GBG74597.1 1360 CGRHTCQELCHEGPCPPCT 1378
                  ******************5 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1987 aa     Download sequence    
MDGRSGDGST GVEHDREGGR FPARRDHMMT MDAIGGNGGG GGGVHHHRNG SNSDEADLLS  60
HHAGRISGAS SQRVRPQLQI AFSRHDGNEK VGSGGGGGVR DDNALTPGAG GGSIRRTQLR  120
GHRRSVSTSA VDLVVHERTY RSQAGSGGGS IRRQLPPQGP LSPAPHVVQR YQSPPPSSSS  180
SGEAEEGSSL SHQQQQYRQP NASREQLYHQ AQQQLQRPQS QAQQQQQHLQ LFSLFQMSSP  240
LSAGRVTDER GKVLVEIAAP ISSSRSNLGR SAETGGAGAL GGLCRGVGAS GSGGEKQSRG  300
GGMVAVGGET NVRGQQKRVS RGSATRSRHA RAATWDGQNN RLMDMINSRG IESGDRDSPR  360
CSPRDSPRAS PRSSPPRDWT RSRDVNFAGS AAGAGGRAGA VGGVAGLGAA ADRAPVDGVA  420
GHGGEAGGGV PGLPPASSTP RGGGHYRRSS WGHDRPPQSQ RERRDVRPND WKVRQDDELS  480
VSSDKSDDKS HTRFATIGAT SVEGGGRDFP PSTTGSQKKQ QPEEGPGAAN VGESGKQQPA  540
LLPSLTADHH QEAAAFCGDE RTGVPDAGED GTATPTPRSW GEGQGGGAVG KYQAPRGRDS  600
DSCPWRSAKG RSSEGERGGG GAPGVDERED PSVGAAKLQE LWRSGSGKQL TGLSKNRGAG  660
RDDEGEGCRD GTEDSVGRSV AGLVADESAE EKGAGETGRG RGREVDIGIQ RTLSGRGAAV  720
GGGGGGIGAP PGAIGMSRVN GSRRSRSRSE SFDWGGSRHR SDMSSNWRSG AGDDYGHGEG  780
GEEHWDAKER KRAREEREER RDRDEIRGGE RDWRDRESAW PPHRAPIFAG GEAEWRSGGG  840
GGGGGGGGGG GGGGSIRKKE TSGSSSSASR PSRLMFGSRA LPSLVQELED KLWKCLLDCP  900
VCSEPVSRSA TIWYCTACYG IFHLDCVRKW AAETIALFPP FSLPSSSMSG GLSAKPDVSC  960
SWHCPACKSF QTTSPAELAC KCFCGKKEEP TNNPFIVPHS CGETCHHPLD RSITGVGGAG  1020
AGGAKDGGGG AILTGAAASA SAEGGGYRCP HLCTLRCHPG PHITCTAMAP TVFCYCRKSE  1080
ITRKCAEYQK HGRSCGATCL RQMSCGRHLC PRVCHEGPCG LCEVTIKARC FCGRKEELQT  1140
CGRLNVKGDL SIPQPGAGAP LGVFSCGEKC GKMLACGNHR CERPCHSGPC GACRLLPSVL  1200
IRCPCGKSLI RDLLGGKERK SCLESVPTCE GRCRKKLSCG EHACEQVCHT GPCPPCEVLV  1260
DQKCRCGSSS RQVPCFQVTP SNAAVVASAA DTSATVTTCR ERSSGWFACN TKCGRVKNCG  1320
RHTCAAVCCS AMNRPTEEED GSAANESHRC MLVCGRKLRC GRHTCQELCH EGPCPPCTDY  1380
LSVDKGLSCA CGRAFIPPPV PCGTPQPACP FPCSKPQNCG HATTHLCHFG ACPPCTEPVE  1440
KECEGGHVVL RGVPCGSRDI RCNAICGKMR ACGRPCTRTC HRPPCDSPDV AAGEDREDGV  1500
RDRDRGGGED GRRQCSAGPI DLANCYNNDD AAEQSCVLTM VRRDPHWVAE VEARLRYLLT  1560
LESRWREVCS AGAIEVDSGG LRVHVFAYLE KERRDVIKAI AREWDMEALS VGWEPKRFVV  1620
VFATGRSQAP LRLPLSAALA FSSHANAVIE PVLDKDLDMD PDRVVTFFDW PEDLDVAAEL  1680
AKFDEECELV FLSERNAIAV FGQSAETAAA VLKRVDGAST YSGAVAPVAA AAAAATSSSA  1740
YLSATPSGGA TAWGRGGRAL WGRGPAIAEN RGVAASSEAQ GSSCGNVRDP VRAGLIGNGI  1800
KSGSFGNELN LAGGNTCLLW EGGVVSASSP ALSWMDTGGW RLPEWDNPPP PSSSSSAEAQ  1860
GFRVVRGEDA DQSSSRFNTP AEKSHSQGGA IMSTSEPARQ NAPSSSDDAG GIGRSREGVV  1920
GKGTSSAGCA SSRDESPRST SSVIVRNDDV AEDGRWETLA GLTLGEDSSK DDDEDCWEDA  1980
VGEEFIM
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G10170.11e-156NF-X1 family protein