PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thhalv10024265m
Common Name EUTSA_v10024265mg
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Eutremeae; Eutrema
Family HD-ZIP
Protein Properties Length: 1119aa    MW: 123975 Da    PI: 4.4001
Description HD-ZIP family protein
Gene Model
Gene Model ID    Type    Source  Coding Sequence
Thhalv10024265m  genome  JGI     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  53.4   4.4e-17  46     101  2          57
                      T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHHC CS
         Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakekk 57 
                      rk t++t++q++eLe++F++n+ p++e++eeL + l+L+++q+++WFq rR + kk
  Thhalv10024265m  46 RKNTRHTRQQIQELENFFNENPLPTEEQKEELGRMLNLETKQIRIWFQTRRVQAKK 101
                      7889************************************************9997 PP

2    START     132.5  4.7e-42  642    860  2          205
                      HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS.....SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEE CS
            START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv....dsgealrasgvvdmvlallveellddkeqWdetla....kaetle 83 
                      +a  a++el+ + + +++ W+ +     e++n++ + +kf+ + v    + +ea+r++g+v m++  lv++l  dk  W + +a     a+t+ 
  Thhalv10024265m 642 CATMAMTELLMLGQTNSAYWTIDLssnrENLNYEQYQSKFKNGSVtplgYVMEASRETGLVLMDSLALVKILTTDK--WVNVFApivsVASTHR 733
                      677899****************9999**************99988******************************9..*********9****** PP

                      EECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--..-TTSEE-EESSEEEEEEEECTCEEEEEEEE- CS
            START  84 vissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppe.sssvvRaellpSgiliepksnghskvtwveh 169
                      vi sg      g l l  ae+q++splvp R + f+Ry++ l+++ wvivd  v+++ +     +    +++lpSg +ie+++ng skvtw+e+
  Thhalv10024265m 734 VIPSGsggprnGSLKLVEAEFQVISPLVPkRQVKFLRYCKMLRDDLWVIVD--VTPRMQDLRfLPD-GGSKRLPSGVIIEDLKNGSSKVTWIEQ 824
                      ***************************************************..6666655444333.44579********************** PP

                      EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXX CS
            START 170 vdlkgrlphwllrslvksglaegaktwvatlqrqce 205
                      ++++++ ++ ++++lv+sg+  gak+w+ tlqr+ce
  Thhalv10024265m 825 AEYDESEIPLIYQPLVGSGIGLGAKRWLTTLQRYCE 860
                      ***********************************8 PP
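
Each alignment block above follows the HMMER layout: a CS line (consensus structure annotation), the HMM consensus, a match line, the query sequence, and a PP line (posterior probability per aligned column). In the match line, a letter marks a residue identical to the consensus, `+` marks a conservative substitution, and a space marks any other column. As a rough sketch, assuming the match line has been copied without its leading indentation, the column types can be tallied in Python. The function name `tally_match_line` is hypothetical and not part of PlantTFDB or HMMER.

```python
from collections import Counter


def tally_match_line(match: str) -> Counter:
    """Count identical, conservative, and other columns in a match line."""
    counts = Counter()
    for ch in match:
        if ch.isalpha():
            counts["identical"] += 1      # letter: same residue as the consensus
        elif ch == "+":
            counts["conservative"] += 1   # plus sign: positive-scoring substitution
        else:
            counts["other"] += 1          # space: neither of the above
    return counts


# Match line of the Homeobox alignment, copied without its indentation.
homeobox_match = "rk t++t++q++eLe++F++n+ p++e++eeL + l+L+++q+++WFq rR + kk"
counts = tally_match_line(homeobox_match)
print(dict(counts), sum(counts.values()))
```

The total printed at the end equals the number of aligned columns, so it can be compared against the HMM Start and HMM End values in the table as a sanity check on the copy.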

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End   InterPro ID  Description
Gene3D           G3DSA:1.10.10.60  4.9E-19   28     104   IPR009057    Homeodomain-like
SuperFamily      SSF46689          3.59E-18  36     102   IPR009057    Homeodomain-like
PROSITE profile  PS50071           17.864    42     102   IPR001356    Homeobox domain
SMART            SM00389           5.4E-16   44     106   IPR001356    Homeobox domain
CDD              cd00086           4.05E-18  46     103   No hit       No description
Pfam             PF00046           1.2E-14   46     101   IPR001356    Homeobox domain
PROSITE profile  PS50848           31.072    632    864   IPR002913    START domain
CDD              cd08875           1.24E-80  636    860   No hit       No description
SuperFamily      SSF55961          9.49E-19  640    863   No hit       No description
SMART            SM00234           1.2E-35   641    861   IPR002913    START domain
Pfam             PF01852           1.0E-36   645    860   IPR002913    START domain
SuperFamily      SSF55961          2.45E-8   884    1079  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
GO:0008289  Molecular Function  lipid binding
Sequence
Protein Sequence    Length: 1119 aa
MNGQEDDFDV DQYLNPNYIG GAEDHEGDDN IGAMTGGDDQ AGGATRKNTR HTRQQIQELE  60
NFFNENPLPT EEQKEELGRM LNLETKQIRI WFQTRRVQAK KRHENEVLRQ EHDMLQGEHN  120
HLKRKLIDCS CNLCDGPADF GTVNYVVQQL MHANARIKQD RNKAKRIAVR ERTPFWSGQP  180
LALPSSPLPL LNKNSTFLVG ESSSSLVFSN EGRNALPECE SVGESSFMNE RSPFLVGSSN  240
VVQPPSSLMI STQARNALPE LTWSGETSIM NGSSTFLASP PVEHPSLMIH DQERNAVQEL  300
DSGGDTLMMD ERSKFFDRPP VDQPSTSSLM ISDPGRNIVQ ELDSGRETMM NERSTFFDRP  360
PVEEPSSSLW ISSPGTNAAP ELESECDTSI MNNKRSIFSI GSPVEEPSSS LWISSPGRNA  420
PPELDSGPEN SMVNESLIFS MDAIVEQPSP LMISSPERDA PPELDSARET SMVNERLIFS  480
TGALVEHTSS PLIISSPERD APPELDSGGE PAMMNERSIF SIGALVEHPY SPLLISSPER  540
DAPPELDSGP ETWMMNESLI FSIGSPVENH SFPLVISSPE SDASPELNLG GEPARTNERS  600
IFSIGSPVEE PSSALVIYSP GRNAPQELSE TWTINKETLV ECATMAMTEL LMLGQTNSAY  660
WTIDLSSNRE NLNYEQYQSK FKNGSVTPLG YVMEASRETG LVLMDSLALV KILTTDKWVN  720
VFAPIVSVAS THRVIPSGSG GPRNGSLKLV EAEFQVISPL VPKRQVKFLR YCKMLRDDLW  780
VIVDVTPRMQ DLRFLPDGGS KRLPSGVIIE DLKNGSSKVT WIEQAEYDES EIPLIYQPLV  840
GSGIGLGAKR WLTTLQRYCE NLKTLSSVNV AQVYQGLSAD AATEIVKLAQ RMTVNYYSGI  900
TDSSPRKWRI INVENDEEGQ EKNVRLMSRK NVCVRGEHTG LVLNVATSVW FPVNQQTMFD  960
FLTNPNLRDK WDLLATETYF EEKIKIQKSR SREHYVSLLQ FAEGVEDGPA VLQETWNDAS  1020
GALLVFAPLE EQSVKGLMRG GDSYSMPILP SGFSILPDGG DTEDADELGE GCLLTIGYQF  1080
LFTKNLTTEE VDQEFLNAAK ELIAFTIDNI KSAFNIPA*
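
The numbered, space-separated blocks above can be turned back into a plain residue string, and the 1-based, inclusive Start and End coordinates from the Signature Domain table can then be used to cut out a domain. The following is a minimal sketch in Python. The helper names `parse_blocked_sequence` and `slice_domain` are hypothetical and not part of PlantTFDB, and only the first two lines of the sequence are used as sample input.

```python
def parse_blocked_sequence(text: str) -> str:
    """Join residue blocks, dropping position counters and a trailing '*'."""
    residues = []
    for line in text.splitlines():
        for token in line.split():
            if token.isdigit():
                continue  # running position counter at the end of a line
            residues.append(token.rstrip("*"))  # '*' marks the stop codon
    return "".join(residues)


def slice_domain(sequence: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]


# First two lines of the protein sequence, as an assumed sample input.
sample = """\
MNGQEDDFDV DQYLNPNYIG GAEDHEGDDN IGAMTGGDDQ AGGATRKNTR HTRQQIQELE  60
NFFNENPLPT EEQKEELGRM LNLETKQIRI WFQTRRVQAK KRHENEVLRQ EHDMLQGEHN  120
"""

prefix = parse_blocked_sequence(sample)
homeobox = slice_domain(prefix, 46, 101)  # Homeobox: Start 46, End 101
print(len(prefix))  # 120
print(homeobox)     # RKNTRHTRQQIQELENFFNENPLPTEEQKEELGRMLNLETKQIRIWFQTRRVQAKK
```

The extracted residues match the query line of the Homeobox alignment, which is a quick way to confirm that the coordinates are being read as 1-based and inclusive.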
Cis-element
Source       Link
PlantRegMap  Thhalv10024265m
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_006413300.2  0.0      homeobox-leucine zipper protein HDG6
TrEMBL  V4MP31          0.0      V4MP31_EUTSA; Uncharacterized protein
STRING  XP_006413300.1  0.0      (Eutrema salsugineum)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM1872833
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G25530.1  1e-160   FLOWERING WAGENINGEN