PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Neem_4111_f_1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Sapindales; Meliaceae; Azadirachta
Family Trihelix
Protein Properties Length: 504aa    MW: 57279.2 Da    PI: 9.9111
Description Trihelix family protein
Gene Model
Gene Model ID    Type      Source    Coding Sequence
Neem_4111_f_1    genome    NGD       View Nucleic Acid
Signature Domain
No.    Domain      Score    E-value    Start    End    HMM Start    HMM End
1      trihelix    92.1     5.6e-29    49       133    1            87
       trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                    rW++qe+laL+++r++m  ++r+++ k+plW+evs+k+ e g++rs+k+Ckek+en+ k++k++keg++   +++s+t+++fdqlea
  Neem_4111_f_1  49 RWPRQETLALLKIRSDMAVAFRDASVKGPLWDEVSRKLGELGYNRSAKKCKEKFENVYKYHKRTKEGRST--KGQSKTYRFFDQLEA 133
                    8******************************************************************997..69999********85 PP

2      trihelix    105.5    3.8e-33    356      441    1            87
       trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                    rW+k ev+aLi++r+ ++++++++  k+plWee+s+ mr+ g++r++k+Ckekwen+nk++kk+ke++kkr +e+s+tcpyf+ql+a
  Neem_4111_f_1 356 RWPKVEVQALIKLRTHLDSKYQENGPKGPLWEEISAGMRKLGYNRNAKRCKEKWENINKYFKKVKESNKKR-PEDSKTCPYFHQLDA 441
                    8*********************************************************************8.99***********85 PP

Protein Features
3D Structure
Database           Entry ID            E-value     Start    End    InterPro ID    Description
SMART              SM00717             7.2E-4      46       108    IPR001005      SANT/Myb domain
Pfam               PF13837             6.0E-18     48       133    No hit         No description
CDD                cd12203             3.72E-23    48       113    No hit         No description
PROSITE profile    PS50090             7.073       48       106    IPR017877      Myb-like domain
PROSITE profile    PS50090             7.375       349      413    IPR017877      Myb-like domain
Gene3D             G3DSA:1.10.10.60    4.5E-4      353      412    IPR009057      Homeodomain-like
SMART              SM00717             8.1E-4      353      415    IPR001005      SANT/Myb domain
CDD                cd12203             5.04E-28    355      420    No hit         No description
Pfam               PF13837             1.4E-22     355      442    No hit         No description
Gene Ontology
GO Term       GO Category           GO Description
GO:0010192    Biological Process    mucilage biosynthetic process
GO:0044212    Molecular Function    transcription regulatory region DNA binding
Sequence
Protein Sequence    Length: 504 aa
MLVSGEPTVS GDVATTTTTA VATAGSGEAR EDDKSRVLDE ADRGFGGNRW PRQETLALLK  60
IRSDMAVAFR DASVKGPLWD EVSRKLGELG YNRSAKKCKE KFENVYKYHK RTKEGRSTKG  120
QSKTYRFFDQ LEAFENNHPS LSSPTPKPAQ AVAAPAPVSV AMPAGNPPYN INTVPSTTTH  180
NFVTIPGATI HSFPSTNPTN LPPQQGTNPT NFPSQSTKPS SFHNIPVDLL SNSTSSSTSS  240
DLELEGRRKR KRKWKDFFER LMKEVIEKQE ELQKKFLEAI EKREHERAVR EEAWRMQEMT  300
RINREREILA QERSISAAKD AAQPAVSSSQ QVINLETMKT DNGGSQFNNS TTSSSRWPKV  360
EVQALIKLRT HLDSKYQENG PKGPLWEEIS AGMRKLGYNR NAKRCKEKWE NINKYFKKVK  420
ESNKKRPEDS KTCPYFHQLD ALYKERNKFD HNSSTNNNQF FKPENTVPLM VRPEQQWPPQ  480
PQQEDSAMED VESEHNIIMK FIPL
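As a cross-check on the record, the numbered sequence blocks above can be joined and compared with the stated length and with the two trihelix domain ranges from the Signature Domain table. The following is a minimal sketch, assuming the domain Start/End values are 1-based and inclusive (which is what the alignment rows suggest); the helper names `parse_blocks` and `region` are hypothetical and not part of PlantTFDB.

```python
import re

# Sequence blocks copied from the record, including the trailing position counters.
RAW = """
MLVSGEPTVS GDVATTTTTA VATAGSGEAR EDDKSRVLDE ADRGFGGNRW PRQETLALLK  60
IRSDMAVAFR DASVKGPLWD EVSRKLGELG YNRSAKKCKE KFENVYKYHK RTKEGRSTKG  120
QSKTYRFFDQ LEAFENNHPS LSSPTPKPAQ AVAAPAPVSV AMPAGNPPYN INTVPSTTTH  180
NFVTIPGATI HSFPSTNPTN LPPQQGTNPT NFPSQSTKPS SFHNIPVDLL SNSTSSSTSS  240
DLELEGRRKR KRKWKDFFER LMKEVIEKQE ELQKKFLEAI EKREHERAVR EEAWRMQEMT  300
RINREREILA QERSISAAKD AAQPAVSSSQ QVINLETMKT DNGGSQFNNS TTSSSRWPKV  360
EVQALIKLRT HLDSKYQENG PKGPLWEEIS AGMRKLGYNR NAKRCKEKWE NINKYFKKVK  420
ESNKKRPEDS KTCPYFHQLD ALYKERNKFD HNSSTNNNQF FKPENTVPLM VRPEQQWPPQ  480
PQQEDSAMED VESEHNIIMK FIPL
"""


def parse_blocks(raw: str) -> str:
    """Drop spaces, newlines, and position counters; keep residue letters only."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating both bounds as 1-based and inclusive (assumption)."""
    return seq[start - 1:end]


seq = parse_blocks(RAW)
domain1 = region(seq, 49, 133)   # first trihelix hit in the Signature Domain table
domain2 = region(seq, 356, 441)  # second trihelix hit

print(len(seq))
print(domain1)
print(domain2)
```

Under that assumption, both extracted segments begin with the `RW` residues shown at the start of the corresponding alignment rows.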
Nuclear Localization Signal
NLS
No.    Start    End    Sequence
1      246      251    RRKRKR
2      246      252    RRKRKRK
3      247      252    RKRKRK
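The NLS coordinates can be checked against the protein sequence. The sketch below uses only the first 20 residues of the sequence row covering positions 241-300 and locates each motif by plain substring search. It suggests that the NLS Start and End values are 0-based (with End inclusive), unlike the 1-based domain coordinates; this is an observation from this record only, not a documented PlantTFDB convention. The names `FRAGMENT`, `FRAGMENT_START`, and `locate` are hypothetical.

```python
# First 20 residues of the sequence row that covers positions 241-300.
FRAGMENT = "DLELEGRRKRKRKWKDFFER"
FRAGMENT_START = 241  # 1-based position of FRAGMENT[0] in the full protein

# Rows copied from the NLS table: (start, end, motif).
NLS_TABLE = [
    (246, 251, "RRKRKR"),
    (246, 252, "RRKRKRK"),
    (247, 252, "RKRKRK"),
]


def locate(motif: str):
    """Return (start, end) of the first match as 0-based, end-inclusive
    coordinates in the full protein, or None if the motif is absent."""
    i = FRAGMENT.find(motif)
    if i < 0:
        return None
    start0 = (FRAGMENT_START - 1) + i
    return start0, start0 + len(motif) - 1


for start, end, motif in NLS_TABLE:
    # Compare the computed 0-based span with the values listed in the table.
    print(motif, locate(motif), (start, end))
```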
Functional Description
Source     Description
UniProt    Probable transcription factor that binds specific DNA sequence. {ECO:0000250}.
Binding Motif
Motif ID    Method    Source                     Motif file
MP00243     DAP       Transfer from AT1G76880    Download
Annotation -- Protein
Source       Hit ID            E-value    Description
Refseq       XP_006473053.1    0.0        trihelix transcription factor GT-2
Swissprot    Q39117            1e-136     TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBL       A0A067H7R9        0.0        A0A067H7R9_CITSI; Uncharacterized protein
STRING       XP_006473053.1    0.0        (Citrus sinensis)
Orthologous Group
Lineage    Orthologous Group ID    Taxa Number    Gene Number
Malvids    OGEM4849                25             53
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT1G76880.1    1e-150     Trihelix family protein