PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Sopen02g006770.1
Organism Solanum pennellii
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family NF-YA
Protein Properties Length: 1989aa    MW: 225101 Da    PI: 9.3952
Description NF-YA family protein
Gene Model
Gene Model ID       Type     Source   Coding Sequence
Sopen02g006770.1    genome   spenn    View CDS
Signature Domain
No.  Domain      Score  E-value   Start  End   HMM Start  HMM End
1    CBFB_NFYA   34.3   7.6e-11   1838   1872  1          36
         CBFB_NFYA    1 deplYVNaKQyqrIlkRRqkRakleeekkldeksrk 36  
                        +ep++VNaKQy++Il+ Rq+ ak+e+++kl  ksr+
  Sopen02g006770.1 1838 EEPVFVNAKQYHGILRLRQSCAKAESDNKL-LKSRR 1872
                        69***************************9.55554 PP

2    CBFB_NFYA   44.2   6.3e-14   1943   1977  1          36
         CBFB_NFYA    1 deplYVNaKQyqrIlkRRqkRakleeekkldeksrk 36  
                        +ep++VNaKQy++Il+RRq+Rak+e ++kl  ksrk
  Sopen02g006770.1 1943 EEPVFVNAKQYHGILRRRQSRAKAEPNTKL-LKSRK 1977
                        69***************************9.88887 PP
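
The Start and End columns of the two domain hits can be cross-checked against the alignment rows: each gap character in the query row consumes no residue, so the end coordinate is the start plus the number of ungapped residues minus one. The following is a minimal sketch of that check; the function name `alignment_end` and the `hits` dictionary are illustrative, not part of PlantTFDB or HMMER.

```python
def alignment_end(start: int, aligned: str) -> int:
    """Return the 1-based end coordinate of an aligned query row.

    Gap characters ('-' and '.') are skipped because they do not
    correspond to residues in the query sequence.
    """
    residues = sum(1 for ch in aligned if ch not in "-.")
    return start + residues - 1


# Query rows copied from the two CBFB_NFYA alignments above
# (hit number -> (start coordinate, aligned query row)).
hits = {
    1: (1838, "EEPVFVNAKQYHGILRLRQSCAKAESDNKL-LKSRR"),
    2: (1943, "EEPVFVNAKQYHGILRRRQSRAKAEPNTKL-LKSRK"),
}

for number, (start, aligned) in hits.items():
    print(number, start, alignment_end(start, aligned))
# prints:
# 1 1838 1872
# 2 1943 1977
```

Both computed end coordinates agree with the values reported for the hits (1872 and 1977).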

Protein Features
3D Structure
Database         Entry ID           E-value    Start  End   InterPro ID  Description
SuperFamily      SSF57756           4.19E-7    139    177   IPR001878    Zinc finger, CCHC-type
Gene3D           G3DSA:4.10.60.10   1.4E-11    149    172   IPR001878    Zinc finger, CCHC-type
SMART            SM00343            3.8E-5     156    172   IPR001878    Zinc finger, CCHC-type
Pfam             PF00098            7.0E-6     156    172   IPR001878    Zinc finger, CCHC-type
PROSITE profile  PS50158            11.054     157    172   IPR001878    Zinc finger, CCHC-type
SuperFamily      SSF56672           1.19E-34   253    415   No hit       No description
Gene3D           G3DSA:3.10.10.10   1.9E-18    263    329   No hit       No description
CDD              cd09274            1.41E-42   322    423   No hit       No description
Pfam             PF03732            7.5E-17    616    709   IPR005162    Retrotransposon gag domain
Gene3D           G3DSA:4.10.60.10   1.4E-11    797    817   IPR001878    Zinc finger, CCHC-type
SMART            SM00343            0.013      801    817   IPR001878    Zinc finger, CCHC-type
PROSITE profile  PS50158            8.796      802    817   IPR001878    Zinc finger, CCHC-type
SuperFamily      SSF50630           2.64E-8    890    986   IPR021109    Aspartic peptidase domain
CDD              cd00303            4.66E-16   892    979   No hit       No description
Pfam             PF13650            1.4E-8     892    979   No hit       No description
SuperFamily      SSF56672           6.45E-133  1059   1431  No hit       No description
PROSITE profile  PS50878            9.655      1067   1266  IPR000477    Reverse transcriptase domain
Gene3D           G3DSA:3.10.10.10   1.1E-11    1085   1143  No hit       No description
CDD              cd01647            1.72E-62   1118   1266  No hit       No description
Pfam             PF00078            1.2E-25    1140   1265  IPR000477    Reverse transcriptase domain
Gene3D           G3DSA:3.10.10.10   8.3E-7     1144   1187  No hit       No description
Gene3D           G3DSA:3.30.70.270  1.2E-9     1188   1267  No hit       No description
CDD              cd09274            1.15E-58   1331   1446  No hit       No description
PROSITE profile  PS50994            11.505     1409   1625  IPR001584    Integrase, catalytic core
Gene3D           G3DSA:3.30.420.10  3.0E-8     1459   1498  IPR012337    Ribonuclease H-like domain
Gene3D           G3DSA:3.30.420.10  3.0E-8     1547   1629  IPR012337    Ribonuclease H-like domain
SuperFamily      SSF53098           7.4E-8     1561   1629  IPR012337    Ribonuclease H-like domain
SuperFamily      SSF54160           6.68E-11   1702   1785  IPR016197    Chromo domain-like
PROSITE profile  PS50013            9.481      1742   1780  IPR000953    Chromo/chromo shadow domain
Gene3D           G3DSA:2.40.50.40   3.2E-7     1743   1784  No hit       No description
Pfam             PF00385            6.6E-8     1743   1786  IPR023780    Chromo domain
CDD              cd00024            3.32E-4    1744   1780  No hit       No description
SMART            SM00521            2.6E-5     1836   1893  IPR001289    Nuclear transcription factor Y subunit A
PROSITE profile  PS51152            14.417     1837   1872  IPR001289    Nuclear transcription factor Y subunit A
Pfam             PF02045            1.2E-6     1839   1872  IPR001289    Nuclear transcription factor Y subunit A
SMART            SM00521            2.9E-6     1941   1987  IPR001289    Nuclear transcription factor Y subunit A
PROSITE profile  PS51152            17.886     1942   1989  IPR001289    Nuclear transcription factor Y subunit A
Pfam             PF02045            5.4E-10    1944   1979  IPR001289    Nuclear transcription factor Y subunit A
PROSITE pattern  PS00686            0          1947   1967  IPR018362    CCAAT-binding factor, conserved site
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0015074  Biological Process  DNA integration
GO:0016602  Cellular Component  CCAAT-binding factor complex
GO:0003677  Molecular Function  DNA binding
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0008270  Molecular Function  zinc ion binding
Sequence
Protein Sequence    Length: 1989 aa
MLSQVVTNQV RQESEARQEG ADTSRVREFL RMNLPSFTES STTEDPENFV EELKKVSEDS  60
MSAHDYGLKF TQLSRYAPEM VKNMRSRNNL FVAGLGRASS KEGRDTMLIG HMDISRFMKS  120
KGHAPSSSSA PAPRNRGENN GQNSQNFKAR PAESQGCFKC GQKGHFMREC PKNKQGSGNS  180
GNRAQSSSVT PLDKAAPRGA TFGIGGGANR LYAITSRQEQ ENSPDVVTSV PIVKEFSEVF  240
PDDLPGVSPK REIDFGIDII LDTRPISITS YRMEPAQLKE LKEQLKDLLD KGFIRSNVSP  300
WGAPVLFVRK KDGSLRICID YRQLNKNGKV ITYASRQLKV HEKNYPTHEL ELDAVEFALK  360
IWRHYLYGVH VDIFTGHKSL QYVFTQKELN LRQRRWLELL KDYDISIIYH PGKANVVVDA  420
LSRSLRNEEA VQDKEPEVRK EPSKKRDKRK VREPSCNPSL GGRPAVEEEE GADNSQLDER  480
IVGVEVGLSA LNRRIVVVEN NFSSLESIEG LDEVKNNLVE LEDVQNEGLS TLELKLNEAI  540
SSLQREVEAL RRQVDEAAEV GVPTPVTVCE TRIEAPKPKD FRGERSAQDV ENFLWQMDAY  600
FEHVDMHNEA AKIRTATMYL TDKAMLWWRR KKADMERGVC QIDDWEQFKV ELKRQFYPQN  660
VVHEARRRLR ELKQTSSIRD YVKEFTKLTL QIPSLTSEDL LFYFLEGLQN WAKQELQRRQ  720
VSDVDEAIVV AESLNDLRTD AAKGRDNRSK TIPPKVDNNN RGRSRPNPNR GSDTRSNTRD  780
QPSNFRKNYE DRKRGAPHRE GCYICGETTH AARYCPSLRK LSAMVAAKKQ QEKAATQTRS  840
SAGEQRGQSS GSDKGKNVAV GMFNHMALIN HISTAALAAK PASVKPRESL FVDTKLNGKN  900
VRIMVDNGVT HNFVTEQKAK ELGLSYVASN TKLKTVNATP TTVHGISPKV PIDLGEWTGQ  960
TDFTIAPMYV FDVILGLDFW YEVNAFISPR HNQLHISDTG GSCVVPLIRV PQNGMQLSAM  1020
QLIKGFKRGE PIFLATLVGG AESCTEAVQL PHCIEKVLNS NKDVTPEELP QRLPPRREVD  1080
HQIELVPGAK PPAMTPYRMA PPELEELRKQ LKELLDAGHI RPSKAPFGAP VLFQKKKDGT  1140
LRLGQAKVFT KMDLRKGYHQ VRIAEGDEPK TACVTRYGAF DWLVMPFGLT NAPATFCTLM  1200
NKLFHSYLDQ FVVIYLDDIV VYSNNMEDHV EHLCKVFKVL RDNELYVKRE KCSFAQPTVQ  1260
FLGHSISHGE IRMDSNKVDA IKNWEAPTKV PELRSFLGLA NYYRRFIFNY SAIAAPLTDL  1320
LKKDHFSKAF EVHTDASDFA IGGVLMQEGH PIAYESRKLN EAERRYSAHE KEMTAVVHYL  1380
RTWRHYLLGA PFVVKTDNVA TTYFQSQKKL SAKQARWQDF LVEFNFTLEY KPGKANVVAD  1440
ALSRKASLAA VVSSSCSSIV GEIREGLQHD PVAKQLFALA QQGKTKKFWV DDGLLYTTAR  1500
RVYVPKWANL RRTLIKEGHD SAWAGNPGQK RTLALIEASY YWPRMRDDVE AYLVGVGVEL  1560
LDQLSPQTDG QTERINALLE CYLRLYVSAN QKDWAGLLDT AQFSYNLQRS EATGRSPFEL  1620
ATGQQPNTPQ SLPVNAGFKS PGAYQMAKAW EEQVDLARSY LDKAARKMKK FADRKRRPVD  1680
YQIGDRVMAG KISYRVDMPH HLKIHPVFHA SQLKPYFEDR EDKERGQAGR ARIFITPPTV  1740
DKQIEAIIDH QLVRGKGWNN SSSQFLVHWK GTAPEEATWE NTKTCGSFAT KSTSTCSCVA  1800
SGSSPIQARE RVPPRKAVRL QLMGIQQACV PLPSDVAEEP VFVNAKQYHG ILRLRQSCAK  1860
AESDNKLLKS RREEGFTNAM ISAQEIGLEM NVEPVFHPYY MSIFEPYDAQ PYPAQPYPVP  1920
LMVQLQLIRI QQAGVLLPVN VAEEPVFVNA KQYHGILRRR QSRAKAEPNT KLLKSRKIFC  1980
KMHFCHHFT
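
The sequence block is printed in groups of ten residues with a running counter at the end of each full line, so it has to be stripped of spaces and digits before coordinates from the domain tables can be applied to it. The sketch below does this for the last two lines of the block only (to stay short); `parse_sequence_block`, `tail`, and `OFFSET` are illustrative names, and the offset of 1920 is taken from the counter on the line preceding the excerpt.

```python
import re


def parse_sequence_block(block: str) -> str:
    """Join a numbered, space-grouped sequence block into one string."""
    # Remove everything that is not a letter: spaces, newlines, counters.
    return re.sub(r"[^A-Za-z]", "", block).upper()


# Last two lines of the protein sequence block above.
tail = """
LMVQLQLIRI QQAGVLLPVN VAEEPVFVNA KQYHGILRRR QSRAKAEPNT KLLKSRKIFC  1980
KMHFCHHFT
"""

OFFSET = 1920  # residues that precede this excerpt in the full sequence

seq_tail = parse_sequence_block(tail)
total_length = OFFSET + len(seq_tail)

# Convert 1-based inclusive coordinates (1943..1977, signature domain
# hit 2) into a 0-based Python slice relative to the excerpt.
start, end = 1943, 1977
domain_2 = seq_tail[start - OFFSET - 1 : end - OFFSET]

print(len(seq_tail), total_length)  # prints: 69 1989
print(domain_2)  # prints: EEPVFVNAKQYHGILRRRQSRAKAEPNTKLLKSRK
```

The recovered total matches the stated length of 1989 aa, and the slice reproduces the query row of the second CBFB_NFYA hit with its gap removed.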
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
4ol8_A  4e-71    311          1444       39         477      Reverse transcriptase/ribonuclease H
4ol8_B  4e-71    311          1444       39         477      Reverse transcriptase/ribonuclease H
4ol8_E  4e-71    311          1444       39         477      Reverse transcriptase/ribonuclease H
4ol8_F  4e-71    311          1444       39         477      Reverse transcriptase/ribonuclease H
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    1665   1677  RKMKKFADRKRRP
2    1672   1677  DRKRRP
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  HG975441  0.0      HG975441.1 Solanum pennellii chromosome ch02, complete genome.
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_027769952.1  0.0      LOW QUALITY PROTEIN: uncharacterized protein LOC114075973
TrEMBL  A0A1U8FPD1      0.0      A0A1U8FPD1_CAPAN; uncharacterized protein LOC107857977
STRING  XP_009612101.1  0.0      (Nicotiana tomentosiformis)
Orthologous Group
Lineage   Orthologous Group ID  Taxa Number  Gene Number
Asterids  OGEA127
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G30500.2  7e-22    nuclear factor Y, subunit A7