PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cotton_A_32389_BGI-A2_v1.0
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family HD-ZIP
Protein Properties Length: 840 aa    MW: 92081.3 Da    pI: 6.0988
Description HD-ZIP family protein
Gene Model
Gene Model ID               Type    Source  Coding Sequence
Cotton_A_32389_BGI-A2_v1.0  genome  BGI     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  66.5   3.7e-21  149    204  1          56
                                 TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
                    Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                                 ++k +++t+ q++eLe++F+++++p++++r eL+++lgL+ +q+k+WFqNrR+++k
  Cotton_A_32389_BGI-A2_v1.0 149 KKKYHRHTPRQIQELESFFKECPHPDEKQRMELSRRLGLEGKQIKFWFQNRRTQMK 204
                                 79999************************************************999 PP

2    START     170.2  1.4e-53  350    572  2          205
                                 HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT CS
                       START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdet 75 
                                 +a++a++el+k+a+ + p+W k      es+n +e++++f++  +     + +ea +a+g+v      lve l+d + +W e+
  Cotton_A_32389_BGI-A2_v1.0 350 VALSAMDELIKMAQMDNPLWIKGMgggmESLNVEEYRRNFSSCIGmkpssYATEATKATGLVYLRGLALVEALMDAN-RWVEM 431
                                 7899************************************98888********************************.***** PP

                                 -S....EEEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EE CS
                       START  76 la....kaetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRael 147
                                 ++    +a+t++v+ssg      ++lq+m ae+q+lsplvp R + f+R+++q+++g+w++vdvS+d+ q+ ++++ +  +++
  Cotton_A_32389_BGI-A2_v1.0 432 FPcmisRAATIDVLSSGtgvtrdNELQVMDAEFQVLSPLVPvRQVRFIRFCKQHSEGVWAVVDVSIDPSQDATNTQMFPNCRR 514
                                 *********************************************************************************** PP

                                 SSEEEEEEEECTCEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXX CS
                       START 148 lpSgiliepksnghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqce 205
                                 lpSg++i++++   sk+twveh +++++ +h ll++l++sg  +ga +w+atlqrqc 
  Cotton_A_32389_BGI-A2_v1.0 515 LPSGCVIQDVDTKCSKITWVEHSEYDDSAVHHLLQPLLSSGFGFGAHRWLATLQRQCD 572
                                 ********************************************************96 PP

Protein Features
3D Structure
Database         Entry ID          E-value    Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.60  7.1E-22    136    206  IPR009057    Homeodomain-like
SuperFamily      SSF46689          3.76E-20   137    206  IPR009057    Homeodomain-like
PROSITE profile  PS50071           17.54      146    206  IPR001356    Homeobox domain
SMART            SM00389           1.3E-18    148    210  IPR001356    Homeobox domain
Pfam             PF00046           9.0E-19    149    204  IPR001356    Homeobox domain
CDD              cd00086           2.19E-19   149    206  No hit       No description
PROSITE pattern  PS00027           0          181    204  IPR017970    Homeobox, conserved site
PROSITE profile  PS50848           37.59      340    576  IPR002913    START domain
SuperFamily      SSF55961          3.71E-30   342    572  No hit       No description
CDD              cd08875           3.11E-105  344    571  No hit       No description
SMART            SM00234           1.3E-34    349    573  IPR002913    START domain
Pfam             PF01852           3.4E-46    350    572  IPR002913    START domain
SuperFamily      SSF55961          4.67E-19   602    836  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 840 aa
MSFGGLINSS SSSDSGSSGA RVVVADFVPQ NNMLSYAIVE PPLLTQHIPI SMLSSPSLSL  60
SYVLAHLPLP LPPPLPPSLF LFLSLSRDLV LVQKRMDGHG EMGLIGENFD PGFVGRMKED  120
GYEIRSESDN FDVASGDDQD AAADGPSKKK KYHRHTPRQI QELESFFKEC PHPDEKQRME  180
LSRRLGLEGK QIKFWFQNRR TQMKTQLERH ENVILRQEND KLRAENDLLK QAMTTPICSS  240
CGGPAVPGEI SYEQHQLRIE NARLKDELTR ICALTNKFLG RPLSSSGSPI PPHGLNSNLE  300
LAVGRNGFGG LNNAGTSLPM GFEFGDGSMM SIVKPMVNEM QYDRSAFVDV ALSAMDELIK  360
MAQMDNPLWI KGMGGGMESL NVEEYRRNFS SCIGMKPSSY ATEATKATGL VYLRGLALVE  420
ALMDANRWVE MFPCMISRAA TIDVLSSGTG VTRDNELQVM DAEFQVLSPL VPVRQVRFIR  480
FCKQHSEGVW AVVDVSIDPS QDATNTQMFP NCRRLPSGCV IQDVDTKCSK ITWVEHSEYD  540
DSAVHHLLQP LLSSGFGFGA HRWLATLQRQ CDCMAILMSQ DIPGENNTGI TPAGRKSMIK  600
LAQRMTYNFC AGVCASSVHK WDKLSVGNVG EDVRVMTRKN INDPGEPHGV VLSAATSVWM  660
PVTQERLFDF LRDERMRSEW DILSHGGPMQ EMVHVAKGMG HGNCVSLLRG SAINANENNM  720
LILQETWSDA SGALVVYAPV DISSMSVVMN GGDSTYVALL PSGFAILPGI SPSYHGGRSE  780
SNGALVKPEI DGSIVSGCLL TVGFQILVNN VPTAKLTVES VETVNNLISC TIQKIKAALT
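The Start and End coordinates listed for the signature domains (149 to 204 for Homeobox, 350 to 572 for START) can be checked against the protein sequence above. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive, as the alignment rows suggest. The helper names `parse_blocked_sequence` and `domain_slice` are hypothetical and are not part of any PlantTFDB tool.

```python
# Protein sequence copied from this record, kept in its 10-residue block layout.
# The trailing numbers are position counters, not residues.
RAW = """
MSFGGLINSS SSSDSGSSGA RVVVADFVPQ NNMLSYAIVE PPLLTQHIPI SMLSSPSLSL  60
SYVLAHLPLP LPPPLPPSLF LFLSLSRDLV LVQKRMDGHG EMGLIGENFD PGFVGRMKED  120
GYEIRSESDN FDVASGDDQD AAADGPSKKK KYHRHTPRQI QELESFFKEC PHPDEKQRME  180
LSRRLGLEGK QIKFWFQNRR TQMKTQLERH ENVILRQEND KLRAENDLLK QAMTTPICSS  240
CGGPAVPGEI SYEQHQLRIE NARLKDELTR ICALTNKFLG RPLSSSGSPI PPHGLNSNLE  300
LAVGRNGFGG LNNAGTSLPM GFEFGDGSMM SIVKPMVNEM QYDRSAFVDV ALSAMDELIK  360
MAQMDNPLWI KGMGGGMESL NVEEYRRNFS SCIGMKPSSY ATEATKATGL VYLRGLALVE  420
ALMDANRWVE MFPCMISRAA TIDVLSSGTG VTRDNELQVM DAEFQVLSPL VPVRQVRFIR  480
FCKQHSEGVW AVVDVSIDPS QDATNTQMFP NCRRLPSGCV IQDVDTKCSK ITWVEHSEYD  540
DSAVHHLLQP LLSSGFGFGA HRWLATLQRQ CDCMAILMSQ DIPGENNTGI TPAGRKSMIK  600
LAQRMTYNFC AGVCASSVHK WDKLSVGNVG EDVRVMTRKN INDPGEPHGV VLSAATSVWM  660
PVTQERLFDF LRDERMRSEW DILSHGGPMQ EMVHVAKGMG HGNCVSLLRG SAINANENNM  720
LILQETWSDA SGALVVYAPV DISSMSVVMN GGDSTYVALL PSGFAILPGI SPSYHGGRSE  780
SNGALVKPEI DGSIVSGCLL TVGFQILVNN VPTAKLTVES VETVNNLISC TIQKIKAALT
"""


def parse_blocked_sequence(raw: str) -> str:
    """Keep only residue letters; drop spaces, newlines and position counters."""
    return "".join(ch for ch in raw if ch.isalpha())


def domain_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating both coordinates as 1-based inclusive."""
    return seq[start - 1:end]


seq = parse_blocked_sequence(RAW)
homeobox = domain_slice(seq, 149, 204)      # Homeobox coordinates from this record
start_domain = domain_slice(seq, 350, 572)  # START coordinates from this record

print(len(seq))
print(homeobox)
print(start_domain[:11], "...", start_domain[-8:])
```

If the coordinate convention assumed here holds, the Homeobox slice should match the query row of the Homeobox alignment, and the START slice should begin and end with the same residues as the first and last START alignment blocks.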
Functional Description
Source   Description
UniProt  Probable transcription factor involved in the regulation of the tissue-specific accumulation of anthocyanins and in cellular organization of the primary root. {ECO:0000269|PubMed:10402424}.
Annotation -- Protein
Source     Hit ID              E-value  Description
Refseq     XP_017646295.1      0.0      PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2-like isoform X1
Swissprot  Q0WV12              0.0      ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBL     A0A1U8NWL9          0.0      A0A1U8NWL9_GOSHI; homeobox-leucine zipper protein ANTHOCYANINLESS 2-like isoform X1
STRING     Gorai.004G120700.1  0.0      (Gossypium raimondii)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM1128              27           105
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G00730.1  0.0      HD-ZIP family protein
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]