PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Carubv10028088m
Common NameCARUB_v10028088mg
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Capsella
Family HD-ZIP
Protein Properties Length: 825aa    MW: 91531.6 Da    PI: 5.1401
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Carubv10028088mgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox64.81.2e-20112167156
                      TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
         Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                      +++ +++t+ q++e+e+lF++n++p+ ++r++L+++lgL+ rqVk+WFqNrR+++k
  Carubv10028088m 112 KKRYHRHTNRQIQEMEALFKENPHPDDKQRKRLSAELGLKPRQVKFWFQNRRTQMK 167
                      688999***********************************************998 PP

2START141.49.1e-453205531206
                      HHHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS................SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S.. CS
            START   1 elaeeaaqelvkkalaeepgWvkssesengdevlqkfeeskv...............dsgealrasgvvdmvlallveellddkeqWdetla.. 77 
                      e+a ++ qel k+ + eep+W k  + ++g evl   ee+                 +  ea++a +vv+m++ +lv  +l+   +W+e++   
  Carubv10028088m 320 EIAVSCVQELTKMCETEEPLWIKKKSDKIGGEVLCLNEEEYMrlfpwpvenpnnkgdFGREASKANAVVIMNSITLVDAFLNAD-KWSEMFCsi 412
                      578899*****************9955555555544444333556666678999999999************************.******999 PP

                      ..EEEEEEEECTT.....EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE..TTS-EEEEEEEEE-..TTS--..-TTSEE-EESSEEEEEEEECT CS
            START  78 ..kaetlevissg.....galqlmvaelqalsplvp.RdfvfvRyirq.lgagdwvivdvSvds..eqkppe.sssvvRaellpSgiliepksn 159
                        +a+t++ issg     g l lm+aelq+lsplvp R+ +f+Ry +q  + g w+ivd  +ds  +q++p  +      ++ pSg++i++++n
  Carubv10028088m 413 vaRAKTVQIISSGvsgasGSLLLMYAELQVLSPLVPtREAYFLRYVEQnAETGNWAIVDFPIDSfhDQMQPPsTNTPHEYKRKPSGCIIQDMPN 506
                      99**********************************************99*********99885223333334444444558************ PP

                      CEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
            START 160 ghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                      g+s+v wvehv++++++ h+ +   vksg+a+ga++w+  lqrqce+
  Carubv10028088m 507 GYSQVKWVEHVEVDEKHLHETFADYVKSGMAFGANRWLDVLQRQCER 553
                      *********************************************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466896.27E-20101169IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.603.1E-21106174IPR009057Homeodomain-like
PROSITE profilePS5007117.394109169IPR001356Homeobox domain
SMARTSM003891.9E-18110173IPR001356Homeobox domain
CDDcd000864.10E-19112170No hitNo description
PfamPF000463.5E-18112167IPR001356Homeobox domain
PROSITE patternPS000270144167IPR017970Homeobox, conserved site
PROSITE profilePS5084843.299311556IPR002913START domain
SuperFamilySSF559612.06E-30312555No hitNo description
CDDcd088758.41E-107315552No hitNo description
SMARTSM002343.2E-27320553IPR002913START domain
PfamPF018522.4E-37321553IPR002913START domain
SuperFamilySSF559619.61E-15597800No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0048497Biological Processmaintenance of floral organ identity
GO:0005634Cellular Componentnucleus
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 825 aa     Download sequence    Send to blast
MLSMGDDNVM TSNNMRFASQ LLPSSSSPGT IQNPNFNFIP FNSFSSIIPK EEHGMMSMMM  60
MMGDGTVEEM MENGSAGGSF GSGSEQAEDP KFGNESDVNE LQDDEQPPPA KKKRYHRHTN  120
RQIQEMEALF KENPHPDDKQ RKRLSAELGL KPRQVKFWFQ NRRTQMKAHQ DRTENVLLRA  180
ENDSLKSENC HLQAELRCLS CPSCGGPTVL GEIPFSELHI ENCRLREELD RVCSITSRYN  240
GRPMQSMPSS QALITPSPTL PHHQPSLELD MSVYAGNFPE QSCADMMMLP PQDTTCFFPD  300
QTANNNNMLL ADEEKVIAME IAVSCVQELT KMCETEEPLW IKKKSDKIGG EVLCLNEEEY  360
MRLFPWPVEN PNNKGDFGRE ASKANAVVIM NSITLVDAFL NADKWSEMFC SIVARAKTVQ  420
IISSGVSGAS GSLLLMYAEL QVLSPLVPTR EAYFLRYVEQ NAETGNWAIV DFPIDSFHDQ  480
MQPPSTNTPH EYKRKPSGCI IQDMPNGYSQ VKWVEHVEVD EKHLHETFAD YVKSGMAFGA  540
NRWLDVLQRQ CERIASLMAR NITDLGVISS AEARRNIMRL SQRMVRTFCV NISTAYGQSW  600
TALSETTKDT VRITTRKMCE AGQPTGVVLC AVSTTWLPFS HHQVFDLIRD QHHQSLLEVL  660
FNGNSPHEVA HIANGSHPGN CISLLRINVA SNSWHNVELM LQESSIDNSG SLIVYSTVDV  720
DSVQLAMNGE DSSNIPILPL GFSIVPVNPP EGVSVNSNSP QSCLLTVAIQ VLASNVPTAK  780
PNLSTVTTIN NHLCATVNQI TSALTSTLTP AVASSDAVSK QEVS*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1137142DKQRKR
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor. {ECO:0000250}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapCarubv10028088m
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAB0133941e-179AB013394.1 Arabidopsis thaliana genomic DNA, chromosome 5, P1 clone:MQD22.
GenBankCP0026881e-179CP002688.1 Arabidopsis thaliana chromosome 5 sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006281890.10.0homeobox-leucine zipper protein HDG5
SwissprotQ9FJS20.0HDG5_ARATH; Homeobox-leucine zipper protein HDG5
TrEMBLR0GUB60.0R0GUB6_9BRAS; Uncharacterized protein
STRINGXP_006281890.10.0(Capsella rubella)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM43562548
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G46880.10.0homeobox-7
Publications ? help Back to Top
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Lung SC, et al.
    Arabidopsis ACYL-COA-BINDING PROTEIN1 interacts with STEROL C4-METHYL OXIDASE1-2 to modulate gene expression of homeodomain-leucine zipper IV transcription factors.
    New Phytol., 2018. 218(1): p. 183-200
    [PMID:29288621]