PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Neem_9967_f_1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Sapindales; Meliaceae; Azadirachta
Family Trihelix
Protein Properties Length: 423aa    MW: 47518.1 Da    PI: 10.0843
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Neem_9967_f_1genomeNGDView Nucleic Acid
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix87.21.9e-2769140172
       trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrt 72 
                    rW++qe+laL+++r++m++ +r+++lk+plWeevs+k++e g++rs+k+Ckek+en+ k++k++keg+ ++ 
  Neem_9967_f_1  69 RWPRQETLALLKIRSDMDQVFRDSSLKGPLWEEVSRKLAELGYNRSAKKCKEKFENVYKYHKRTKEGRTGKP 140
                    8******************************************************************99863 PP

2trihelix86.23.9e-273193911487
       trihelix  14 rremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                    r+++ +++ ++  k+plWee+s++mr+ g++rs+k+Ckekwen+nk++kk+ke++k+r se+s+tcpy++ql+a
  Neem_9967_f_1 319 RTDLANKYHENGPKGPLWEEISAAMRSIGYNRSAKRCKEKWENINKYFKKVKESNKRR-SEDSKTCPYYNQLDA 391
                    78899999************************************************98.89999********85 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM007170.003366128IPR001005SANT/Myb domain
PROSITE profilePS500907.07368126IPR017877Myb-like domain
PfamPF138374.5E-1868141No hitNo description
CDDcd122036.94E-2468133No hitNo description
SMARTSM007170.13269365IPR001005SANT/Myb domain
PfamPF138373.7E-18318392No hitNo description
CDDcd122035.95E-18318370No hitNo description
PROSITE profilePS500905.889322363IPR017877Myb-like domain
Sequence ? help Back to Top
Protein Sequence    Length: 423 aa     Download sequence    Send to blast
MLQQRDTPAL VAASSSGDAA ATTAAATVKA AHGGAGGGGA GAEVNNNSAG EEDNKGRGDE  60
GDRSFGGNRW PRQETLALLK IRSDMDQVFR DSSLKGPLWE EVSRKLAELG YNRSAKKCKE  120
KFENVYKYHK RTKEGRTGKP EGKHYNVNPP NPVSHINVSS AANPITIVPQ NVVVPPVVNP  180
TVTPLTQAVS SYQSSLPSSF QNVHGNLFSS STSSSTASDE EYEEQRGTRK RKWKEFFKRL  240
TKEVIKKQEQ LQNKFLEEID RRERERISRE EAWKVQEMAR INREHEILIQ ERSTAAAKDA  300
AVIAFLQKIS GQQQQNPFRT DLANKYHENG PKGPLWEEIS AAMRSIGYNR SAKRCKEKWE  360
NINKYFKKVK ESNKRRSEDS KTCPYYNQLD ALYKEKSKNE NASGYGSVKA VNHNMAPLMH  420
EIK
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor that binds specific DNA sequence. {ECO:0000250}.
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankJN3157825e-38JN315782.1 Populus tomentosa SANT DNA-binding domain-containing protein (GT2) gene, complete cds.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_016698354.11e-136PREDICTED: LOW QUALITY PROTEIN: trihelix transcription factor GT-2-like
SwissprotQ391171e-101TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBLA0A1R3FZ121e-145A0A1R3FZ12_9ROSI; Uncharacterized protein
STRINGevm.model.supercontig_2611.11e-126(Carica papaya)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM48492553
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76890.29e-46Trihelix family protein