PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cotton_A_28157_BGI-A2_v1.0
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family NZZ/SPL
Protein Properties Length: 325aa    MW: 35150.6 Da    PI: 8.646
Description NZZ/SPL family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cotton_A_28157_BGI-A2_v1.0genomeBGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1NOZZLE58.12.7e-183132339314
                      NOZZLE  39 tkksrgrkpgsktaqqkqkkptlrgmgvaklerfiieeekkkl.vvatvgdtssvaaisntatrlpvpvdrgvvlqgfps... 117
                                  k  +grkp  k   qk  k   rgmgva+ler +   e +   ++   g  +        a+ +  pv +gv   g ps   
  Cotton_A_28157_BGI-A2_v1.0  31 VKSNKGRKPSGKGPYQK--KQPQRGMGVAQLERLRKMSETRTPtTINQFGCEA------MGASNV--PVLHGVANYGVPSmmi 103
                                 45669*****9988764..4457***********9888877652333334333......344444..5899*********999 PP

                      NOZZLE 118 ..........................slgs..srilcggvgsgqvmidpvispwgfvetsatthelssisnpqmynassnnrc 172
                                                            +g+  s++l+g  g  qv     ++    ve+s    elss+++ q   +  ++ c
  Cotton_A_28157_BGI-A2_v1.0 104 nggsgglwgwgadtdglmmqrvvgngGFGGfnSQVLVGNPGNVQV----SCAAASVVEASK---ELSSMPKFQH--CKPEH-C 176
                                 999999999988888544444443333333334444444444433....356666777776...9**9987654..44555.* PP

                      NOZZLE 173 dtcfkkkrldgdqnnvvrsngggfskytmi.pppmngydeyllqsdhhqrsqgfl...ydqriaraasvsaasasinpyfnea 251
                                 d cfkkkr + ++    + ngg f ++    p     +  + l+++  q ++  +   +  r ar+a   aa+  +n  +ne+
  Cotton_A_28157_BGI-A2_v1.0 177 DLCFKKKRCNMEN---GKFNGGLFNQFGQTfPNKGTDFHGWNLENN--QNTNEEMmkgFRARAARSAYAYAAAGQMN--INET 252
                                 *********9988...789********9873555555666666655..43333221124678888887777777766..6887 PP

                      NOZZLE 252 tnltgsreefgsvlegnprngsrgvkeyeffpgkydervs.k.......vakvaslvg......dcspntidlslkl 314
                                  +    + +      gn       v eyeffpgk +  ++ k       +   a +vg      + +pn +dlslkl
  Cotton_A_28157_BGI-A2_v1.0 253 VDVVAIHRK------GNSFGTGSYVMEYEFFPGKNGRNTAsKewefpeeASSSAITVGgeasyaNNAPNCVDLSLKL 323
                                 777665544......44445556799*******98754331211111112222223332221114568999999997 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF087441.8E-1929323IPR014855Plant transcription factor NOZZLE
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0048653Biological Processanther development
GO:0005634Cellular Componentnucleus
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 325 aa     Download sequence    Send to blast
MATSLTLLTP NTDNPTTKPM EDEAKPAMEF VKSNKGRKPS GKGPYQKKQP QRGMGVAQLE  60
RLRKMSETRT PTTINQFGCE AMGASNVPVL HGVANYGVPS MMINGGSGGL WGWGADTDGL  120
MMQRVVGNGG FGGFNSQVLV GNPGNVQVSC AAASVVEASK ELSSMPKFQH CKPEHCDLCF  180
KKKRCNMENG KFNGGLFNQF GQTFPNKGTD FHGWNLENNQ NTNEEMMKGF RARAARSAYA  240
YAAAGQMNIN ETVDVVAIHR KGNSFGTGSY VMEYEFFPGK NGRNTASKEW EFPEEASSSA  300
ITVGGEASYA NNAPNCVDLS LKLSY
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_016733178.10.0PREDICTED: protein SPOROCYTELESS-like isoform X1
TrEMBLA0A1U8N5Q20.0A0A1U8N5Q2_GOSHI; protein SPOROCYTELESS-like isoform X1
TrEMBLA0A2P5WHU30.0A0A2P5WHU3_GOSBA; Uncharacterized protein
STRINGGorai.002G077500.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM1863967
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G27330.19e-06sporocyteless (SPL)