PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cotton_A_25654_BGI-A2_v1.0
Common Name F383_18358
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family HD-ZIP
Protein Properties Length: 731 aa    MW: 79525.6 Da    pI: 5.3802
Description HD-ZIP family protein
Gene Model
Gene Model ID               Type    Source  Coding Sequence
Cotton_A_25654_BGI-A2_v1.0  genome  BGI     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  34.5   3.5e-11  64     106  1          43
                                 TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHH CS
                    Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterq 43 
                                 r++ +++t+ q++e+e+lF+++++p+ ++r++L+++lgL+  q
  Cotton_A_25654_BGI-A2_v1.0  64 RKRYHRHTQRQIQEMEALFKECPHPDDKQRKQLSRELGLDPLQ 106
                                 789999*********************************9766 PP

2    START     208    3.6e-65  251    473  1          206
                                 HHHHHHHHHHHHHHHHC-TT-EEEE...EXCCTTEEEEEEESSS.........SCEEEEEEEECCSCHHHHHHHHHCCCGGCT CS
                       START   1 elaeeaaqelvkkalaeepgWvkss...esengdevlqkfeeskv........dsgealrasgvvdmvlallveellddkeqW 72 
                                 ela +a++el+++a+++ep+Wv++    + +n+ e+l++f+++ +        +++ea+r+ +v +m++++lve+l+d++ qW
  Cotton_A_25654_BGI-A2_v1.0 251 ELAVTAMEELIRMAQSGEPLWVTDEnsiDVLNENEYLRIFPRGIGskpfanlgFRSEASREAAVIIMNPVNLVEILMDVN-QW 332
                                 57899*********************99**************999***********************************.** PP

                                 -TT-S....EEEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE CS
                       START  73 detla....kaetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvR 144
                                 ++ +     +a+tl+v+s+g      galq+m+ae+q++splvp R+ +f+Ry++++ +g w++vdvS+d+ ++ p    + R
  Cotton_A_25654_BGI-A2_v1.0 333 STVFCgivsRAMTLDVLSTGvagnynGALQVMTAEFQLPSPLVPtRENYFARYCKRHHDGIWAVVDVSLDNLRHAP----FTR 411
                                 ****99999*****************************************************************99....9** PP

                                 -EESSEEEEEEEECTCEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                       START 145 aellpSgiliepksnghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                                 ++++pSg+li++++ng+skv+wve+v++++r ++ +++ lv+++la+gak+wvatl+rqce+
  Cotton_A_25654_BGI-A2_v1.0 412 CRRRPSGCLIQELPNGYSKVIWVENVEVDDRGVSDIYKTLVNTSLAFGAKRWVATLDRQCER 473
                                 ************************************************************97 PP

Protein Features
Database         Entry ID           E-value    Start  End  InterPro ID  Description
SuperFamily      SSF46689           6.68E-10   48     108  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60   4.1E-13    51     106  IPR009057    Homeodomain-like
SMART            SM00389            0.0017     62     140  IPR001356    Homeobox domain
CDD              cd00086            1.90E-9    63     106  No hit       No description
Pfam             PF00046            8.1E-9     64     106  IPR001356    Homeobox domain
PROSITE profile  PS50848            44.819     242    476  IPR002913    START domain
SuperFamily      SSF55961           1.1E-33    243    475  No hit       No description
CDD              cd08875            2.14E-119  246    472  No hit       No description
SMART            SM00234            2.1E-66    251    473  IPR002913    START domain
Pfam             PF01852            1.4E-55    252    473  IPR002913    START domain
Gene3D           G3DSA:3.30.530.20  3.7E-6     349    473  IPR023393    START-like domain
SuperFamily      SSF55961           2.88E-22   495    722  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
GO:0008289  Molecular Function  lipid binding
Sequence
Protein Sequence    Length: 731 aa
MFNSDLYENP NMFDMFQRPS DSDQTERDDD NNDTKSGTEV DAPSADDDNQ GPASSGPSRR  60
RAKRKRYHRH TQRQIQEMEA LFKECPHPDD KQRKQLSREL GLDPLQAQTE RHENGLLKAE  120
NEKLRAENHR YKEALNNASC PTCGGPAALG EMSFEEQHLR LENARLREEI ERISGVTAKY  180
VGKPIGPSFS RFADRAPISF GTQPGFLGEY GGPGGAAGGP GVGAGGPGGG LGEVLRPVSV  240
TNEADKPLIV ELAVTAMEEL IRMAQSGEPL WVTDENSIDV LNENEYLRIF PRGIGSKPFA  300
NLGFRSEASR EAAVIIMNPV NLVEILMDVN QWSTVFCGIV SRAMTLDVLS TGVAGNYNGA  360
LQVMTAEFQL PSPLVPTREN YFARYCKRHH DGIWAVVDVS LDNLRHAPFT RCRRRPSGCL  420
IQELPNGYSK VIWVENVEVD DRGVSDIYKT LVNTSLAFGA KRWVATLDRQ CERLASAMAN  480
NIPAGDLGVL NSSDGRKSIL KLAERMVNSF CTGVGASTAH AWTTLTGSDE IRVMTRKSID  540
DPGRPPGIVL SAATSFWVAV PPRKAFNILR SEKFRSEWDI LSNGGVVDEM AHIANGRDPG  600
NCVSLLRVNG ANASQSNMLI LQESSNDATG SYVIYAPVDF AAMNIVLNGG DPDYVALLPS  660
GFAILPDREG PNRGIGITEI GSGGSLVTLA FQILVDSAPN SKISVGSVAT VNSLIKCTLE  720
RIRTAVMCND A
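
The numbered 10-residue blocks above can be joined into one string and sliced by the Start/End values from the Signature Domain table. The following is a minimal sketch, assuming those coordinates are 1-based and inclusive (the Homeobox alignment row is consistent with that reading). The helper names `parse_blocked_sequence` and `region` are illustrative and not part of PlantTFDB; only the first two sequence lines (residues 1-120) are embedded to keep the example short.

```python
import re

# First two lines of the record's blocked protein sequence (residues 1-120).
RAW = """\
MFNSDLYENP NMFDMFQRPS DSDQTERDDD NNDTKSGTEV DAPSADDDNQ GPASSGPSRR  60
RAKRKRYHRH TQRQIQEMEA LFKECPHPDD KQRKQLSREL GLDPLQAQTE RHENGLLKAE  120
"""


def parse_blocked_sequence(text: str) -> str:
    """Keep residue letters only; spaces, newlines and position counters are dropped."""
    return "".join(re.findall(r"[A-Z]", text.upper()))


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating both bounds as 1-based and inclusive (assumption)."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError(f"region {start}-{end} is outside 1-{len(seq)}")
    return seq[start - 1:end]


seq = parse_blocked_sequence(RAW)

# Homeobox domain, Start 64 / End 106 in the Signature Domain table.
homeobox = region(seq, 64, 106)
print(homeobox)
```

The printed segment can be compared by eye with the query line of the Homeobox alignment; the same slicing applies to the START domain (251-473) once the full 731-residue sequence is loaded.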
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    58     64   RRRAKRK
Functional Description
Source Description
UniProt  Probable transcription factor that binds to the L1 box DNA sequence 5'-TAAATG[CT]A-3'. Plays a role in maintaining the identity of L1 cells, possibly by interacting with their L1 box or other target-gene promoters. Functionally redundant to ATML1. {ECO:0000269|PubMed:12505995}.
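
The L1 box consensus 5'-TAAATG[CT]A-3' quoted above maps directly onto a regular expression. The sketch below scans both strands of a DNA string for it; the function names and the `promoter` test string are made up for illustration and do not come from any cotton or Arabidopsis promoter. Reported positions are 0-based offsets on the forward strand, which is a choice made for this example only.

```python
import re

# L1 box consensus from the functional description: TAAATG[CT]A (8 bp).
# The lookahead wrapper lets finditer report hits even if they overlap.
L1_BOX = re.compile(r"(?=(TAAATG[CT]A))")
MOTIF_LEN = 8

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Reverse complement of an upper-case A/C/G/T string."""
    return dna.translate(_COMPLEMENT)[::-1]


def find_l1_boxes(dna: str):
    """Return sorted (forward_start, strand, matched_motif) tuples for both strands."""
    dna = dna.upper()
    hits = [(m.start(1), "+", m.group(1)) for m in L1_BOX.finditer(dna)]
    rc = reverse_complement(dna)
    for m in L1_BOX.finditer(rc):
        # Convert the offset on the reverse strand to a forward-strand start.
        forward_start = len(dna) - (m.start(1) + MOTIF_LEN)
        hits.append((forward_start, "-", m.group(1)))
    return sorted(hits)


# Hypothetical test sequence: one L1 box on each strand.
promoter = "GGCTAAATGCACCGGTACATTTAGG"
for hit in find_l1_boxes(promoter):
    print(hit)
```

Scanning the reverse strand matters because a transcription factor binding site can sit in either orientation relative to the gene.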
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  JX615980  5e-96    JX615980.1 Gossypium hirsutum clone NBRI_GE60293 microsatellite sequence.
Annotation -- Protein
Source     Hit ID              E-value  Description
Refseq     XP_017647009.1      0.0      PREDICTED: homeobox-leucine zipper protein PROTODERMAL FACTOR 2-like
Swissprot  Q93V99              0.0      PDF2_ARATH; Homeobox-leucine zipper protein PROTODERMAL FACTOR 2
TrEMBL     A0A1U8L9J6          0.0      A0A1U8L9J6_GOSHI; homeobox-leucine zipper protein PROTODERMAL FACTOR 2-like
STRING     Gorai.011G016200.1  0.0      (Gossypium raimondii)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM2482434
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G04890.1  0.0      protodermal factor 2
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Ding Y, et al.
    Four distinct types of dehydration stress memory genes in Arabidopsis thaliana.
    BMC Plant Biol., 2013. 13: p. 229
    [PMID:24377444]