PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID EcC001020.10
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Myrtales; Myrtaceae; Myrtoideae; Eucalypteae; Eucalyptus
Family Trihelix
Protein Properties Length: 754aa    MW: 81400.5 Da    PI: 5.4505
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
EcC001020.10genomeECGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix95.54.9e-3052136187
      trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                   rW++qe+l+L+++r+em+ ++r++ lk+plWe+vs+k+ e g++rs+k+Ckek+en++k+yk++keg+ +r  ++ +t+++f++lea
  EcC001020.10  52 RWPRQETLELLRIRSEMDAAFRDATLKGPLWEDVSRKLGELGYRRSAKKCKEKFENVHKYYKRTKEGRAGR--QDGKTYKFFSELEA 136
                   8********************************************************************98..56667******985 PP

2trihelix105.63.5e-33470555187
      trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                   rW+k ev aLi++r+ +e+r++++  k+plWee+s+ m++ g++rs+k+Ckekwen+nk++kk+ke++kkr +e+ +tcpyf++l+a
  EcC001020.10 470 RWPKVEVIALIKLRSGLESRYQEAGPKGPLWEEISAGMARMGYKRSAKRCKEKWENINKYFKKVKESNKKR-PEDAKTCPYFHELDA 555
                   8*********************************************************************8.99999********85 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM007170.001949111IPR001005SANT/Myb domain
CDDcd122036.97E-2851116No hitNo description
PROSITE profilePS500906.6951109IPR017877Myb-like domain
PfamPF138372.7E-2051137No hitNo description
PROSITE profilePS500906.69463527IPR017877Myb-like domain
SMARTSM007170.011467529IPR001005SANT/Myb domain
PfamPF138374.0E-23469556No hitNo description
CDDcd122038.79E-30470534No hitNo description
SuperFamilySSF1014477.06E-5606616No hitNo description
Sequence ? help Back to Top
Protein Sequence    Length: 754 aa     Download sequence    Send to blast
MVVEEASPIS SRPPAPSGLD EFLPLSGGDM DALAAAAAAD VGGSSGGGGG NRWPRQETLE  60
LLRIRSEMDA AFRDATLKGP LWEDVSRKLG ELGYRRSAKK CKEKFENVHK YYKRTKEGRA  120
GRQDGKTYKF FSELEALHNT AAGATVGMSA ASSGGGAASG TAALGGLSVP PVSIGISFAN  180
PVPISTVRGQ PPVLAHPPPH THHIFAPTTA PGLATRPAAT AAMSASPMGM IFSSNTSSSS  240
GRSEEDDDDE EDEEEGEPST TGSSRKRKRG PGETSGSGGG SRRKMMELFE GLMRQVMQKQ  300
EEMQKRFFEA IEKREQDRLI REEAWRRQEM NRLARENEIA AQERAASASR DAAIVSFLQK  360
ITGQTIHLPT MPATMLATVT TSVPPLSQTP LPKTQVRPTH AAPHPQPPMP PPPTYQEQPQ  420
QQRQNIEVVT HPLPASTGII MAVPEQQVPP QSQELILSGG SGGEPSSSSR WPKVEVIALI  480
KLRSGLESRY QEAGPKGPLW EEISAGMARM GYKRSAKRCK EKWENINKYF KKVKESNKKR  540
PEDAKTCPYF HELDALYRKK ELGGGGGGGA TSGAAATSGS MGFRAEPSPS TTNLEARPVI  600
AQDQPPPPPP PPPPPPAAPA QAGEPERKDG GDNSGAQERT VEEGNGGSLK KPEDTVKELI  660
GQGQHQNQHL DSDQQHLRVN SSNYNDKKME EGDSDNIEDQ EENELEEDDE ENDEDEELEE  720
ERKMAYKIEF QRPNASAPPN GGGNGAPSFL AMVQ
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor that binds specific DNA sequence. {ECO:0000250}.
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00244DAPTransfer from AT1G76890Download
Motif logo
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_010054910.20.0PREDICTED: trihelix transcription factor GTL1, partial
SwissprotQ391176e-72TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBLA0A059CIU70.0A0A059CIU7_EUCGR; Uncharacterized protein
STRINGXP_010054910.10.0(Eucalyptus grandis)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM62262746
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76880.12e-46Trihelix family protein