PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID PCP009910.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Maleae; Pyrus
Family NF-X1
Protein Properties Length: 1895 aa    MW: 208809 Da    pI: 8.2151
Description NF-X1 family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
PCP009910.1    genome  GDR     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    zf-NF-X1  19.2   2.6e-06  392    409  1          18
     zf-NF-X1   1 CGkHkCqklCHeGpCppC 18 
                  CG+H+C++ CH+GpC+ C
  PCP009910.1 392 CGNHSCSEVCHPGPCGDC 409
                  ****************99 PP

2    zf-NF-X1  16.5   1.8e-05  546    563  1          18
     zf-NF-X1   1 CGkHkCqklCHeGpCppC 18 
                  CG+H+C++lCH G CppC
  PCP009910.1 546 CGQHSCESLCHSGHCPPC 563
                  ****************** PP
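
The two alignments above share the conserved Cys/His/Gly positions shown in uppercase in the zf-NF-X1 consensus line. As a rough illustration only, the sketch below turns that consensus into a regular expression by keeping the uppercase positions and wildcarding the rest, then checks both hits against it. This is a simplification chosen for this example: it is not how the database scores domains (the scores and E-values in the table come from a profile HMM search), and the names `CONSENSUS`, `pattern`, `hits`, and `results` are hypothetical.

```python
import re

# Consensus line copied from the alignments above.
# Uppercase = strongly conserved position, lowercase = weakly conserved.
CONSENSUS = "CGkHkCqklCHeGpCppC"

# Assumption for illustration: keep uppercase positions, wildcard everything else.
pattern = re.compile("".join(c if c.isupper() else "." for c in CONSENSUS))

# Domain hits from the record, keyed by their start coordinate.
hits = {
    392: "CGNHSCSEVCHPGPCGDC",
    546: "CGQHSCESLCHSGHCPPC",
}

# True where the hit carries every strongly conserved residue in place.
results = {start: bool(pattern.fullmatch(seq)) for start, seq in hits.items()}

for start, ok in results.items():
    print(f"hit at {start}: conserved positions present = {ok}")
```

A strict pattern like this will miss divergent copies of the motif that a profile search can still detect, so it is useful only as a quick sanity check on reported hits.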

Sequence
Protein Sequence    Length: 1895 aa
MSFHVQNERR DRSRFLAQPA TQPTQPARRE WVLRGTNPTT ATAAVNPPPV YPNPTGNVSQ  60
PNPRFNPNNL NGNLSQPNPR FNPNNLNGNA GLPNHSSVPS EIRPHRGGNN GVIKGHMGQS  120
ANHRRERGRS ENQEEKGLKD SNLPQLVQEI QDKLTKGTVE CMICYEMVRR SAPVWSCSSC  180
YSIFHLNCIK KWARAPTSID MSAEKNQGFN WRCPGCQSVQ LTSSKEIQYV CFCGKRTDPP  240
SDLYLTPHSC GEPCGKQLER EVPGKGVSKD DLCPHVCVLQ CHPGPCPPCK AFAPPRLCPC  300
GKKTITTRCS DRASVLTCGQ HCNKLLDCWR HRCERTCHVG PCDPCQVLVD ASCFCKKKVE  360
VVLCGDMTVK GEVKAEDGVF SCSSPCGKML SCGNHSCSEV CHPGPCGDCN LMPIKIKTCH  420
CGKTSLQEER RSCLDPIPTC SQLCSKSLPC EMHQCQEVCH TGDCPPCLVE VTQKCRCGST  480
SRSAECFKTT MENEKFTCDK PCGRKKNCGR HRCSERCCPL SNLNNALLGD WDPHFCSMSC  540
GKKLRCGQHS CESLCHSGHC PPCLDTIFTD LTCACGRTSI PPPLPCGTPP PSCQLPCSVP  600
QPCGHTSSHS CHFGDCPPCS VPVAKECIGG HVVLRNIPCG SRDIKCNKLC GKTRQCGMHA  660
CGRTCHPPPC DTSCLAEQGS KTSCGQICGA PRRDCRHTCT SLCHPYASCP DSRCDFPVTI  720
TCSCGRMTAS VPCDSGGSNA SFKADIVYEA SIVQRLPAPL QPIESTCKNI LLGQRKLMCD  780
DECAKRERKR VLADAFDITP PNLDALHFGE SSAVSELLSD LLRRDPKWVL SVEERCKYLV  840
LGKSRRATSG LKVHVFCPML KEKRDVVRMI AERWKLAVQA AGWEPKRFIV VHVTPKSKAP  900
ARILGVKGTI TVSAPQPPAY EHLVDMDPRL VVSFPDLPRD ADISALVLRF GGECEVVWLN  960
DKNALAVFND PARAATARRR LDNGALYHGA IAVHSNGSAS MAASGSNAWG GLGTSREGGA  1020
SAVLMGNPWK KTVTQESGWR EDSWGKEEWP GSSTDAPANV WNKKAPIAAS VNRWSVLDGD  1080
TALGSSASSP RVEESRKQSL GPLNSALDLK ASGSSSSSTL EGRPVGVIAE TPEVVDDWEK  1140
TSMEPQLTAP NQFKETWKHT LLLSFQSLGV IYGQMSTAPL YVFGTLTAGD IPSEDTVYEL  1200
FSFIFWTINF ISLLKYAFIV LTADDNGEGG TFALYSLLCR HAKVGLLPNE SSANDVMHYE  1260
TGSPFKIKAE SRARRAIEKL RCSHYLMLFL ALFGCCMTIC IGVLTPTLSV YSVSSGIQRS  1320
LSDISHRIST SEKRREAISR ALEKYVPVST ASAILVCLFT LQHYGTRKIG FIFAPIVVVW  1380
LFFIGGGGVY NIFHWNKEIL LATSPMYMYR FFRNIHIKSW RSLGSIALCV AGSEAMFTSL  1440
GHFRRKSIKM TFVCFIYPVL VLSYAGQAAY ISKNLDVKDF NHLSQSIPKQ MRHWFVAFSL  1500
LASVVGSQAT ITACFSIINQ CLALGCFPRV KVIHTSDKIH GQVYIPDINW LLMVLSLLIT  1560
IGFHDIIKIG SATGLAVSSG MLVTTCLMSL VIALYWEKSL FESVCCLIFF GSIEVMYVSA  1620
CMLNFHKGAW YFVVLLALSL TIMLSWHYGT KKKLEFDIQN KVSAEWLTGI SPGLGVTRVP  1680
GIGFIYSDIV TGIPAFFSHF ITNLPAFHQV LIFVSFKSLP MPYVPPSRRY LVGRVGPSNF  1740
KIYRCVVRYG YCDPIRDTDN FEEQIISSIG EFICMEENDF DSQNSSEGRM VVIGKPPADG  1800
TGSALIPLND TNSYEMDCVS MASNETRSPL ESLQLHSIFL MLLLLRLAWS VAYELYCQTP  1860
CGQGIEQRVV QRARGEKEED RIWDRRRYKK PVLLV
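
The sequence block is printed in groups of ten residues with a running position counter at the end of each full row. A minimal sketch of how such a row could be turned back into a contiguous string and sliced with the 1-based, inclusive coordinates used elsewhere in this record is given below. The helper names `parse_numbered_block` and `subsequence` are hypothetical, and the example uses only the row covering residues 361 to 420.

```python
import re

def parse_numbered_block(text: str) -> str:
    """Join space-grouped rows into one sequence string.

    Everything that is not a letter (spaces, trailing position counters)
    is discarded.
    """
    return "".join(re.sub(r"[^A-Za-z]", "", line) for line in text.splitlines()).upper()

def subsequence(seq: str, start: int, end: int, offset: int = 1) -> str:
    """Return residues start..end (1-based, inclusive).

    `offset` is the sequence position of seq[0], so a partial fragment
    can be sliced with full-length coordinates.
    """
    return seq[start - offset : end - offset + 1]

# Row covering residues 361-420, copied from the record above.
row = "VVLCGDMTVK GEVKAEDGVF SCSSPCGKML SCGNHSCSEV CHPGPCGDCN LMPIKIKTCH  420"

fragment = parse_numbered_block(row)
domain_1 = subsequence(fragment, 392, 409, offset=361)

print(len(fragment))  # number of residues in this row
print(domain_1)       # residues at the coordinates of the first zf-NF-X1 hit
```

Applied to the whole block with the default `offset=1`, the same two helpers would recover any coordinate range listed in the record.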
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    1884   1891  DRRRYKKP
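
The NLS coordinates can be checked against the last row of the sequence block, which begins at residue 1861 (the preceding row ends at 1860). The sketch below extracts residues 1884 to 1891 from that row and counts the basic residues (K and R) in the window. The variable names are hypothetical, and the count is only a descriptive check: the record does not state how the signal was predicted.

```python
# Final row of the sequence block; its first residue is position 1861.
last_row = "CGQGIEQRVV QRARGEKEED RIWDRRRYKK PVLLV"
row_start = 1861
tail = last_row.replace(" ", "")

# Coordinates from the NLS table (1-based, inclusive).
nls_start, nls_end = 1884, 1891
nls = tail[nls_start - row_start : nls_end - row_start + 1]

# Count lysine and arginine residues in the extracted window.
basic = sum(residue in "KR" for residue in nls)

print(nls, f"{basic}/{len(nls)} basic residues")
```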
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G10170.1  0.0      NF-X1 family protein