PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Csa20g057450.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family Trihelix
Protein Properties Length: 601 aa    MW: 68970.7 Da    pI: 7.188
Description Trihelix family protein
Gene Model
Gene Model ID   Type    Source  Coding Sequence
Csa20g057450.1  genome  CSGP    View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  49.4   1.2e-15  97     164  2          75
        trihelix   2 WtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtses 75 
                     W+ +evlaL+++r+ +e+++ +       We+ s+k++e gf+rsp++Ckek+e+ ++ry + ++++ +   ++
  Csa20g057450.1  97 WCSDEVLALLRFRSTVENWFPEF-----TWEHTSRKLAEVGFKRSPQECKEKFEEEERRYFNGNNNNTND-HQH 164
                     ********************998.....9*******************************9999999875.333 PP

2    trihelix  103.6  1.5e-32  445    539  1          86
        trihelix   1 rWtkqevlaLiearr..........emeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdql 85 
                     rW+k+evlaLi++rr          +++++l++++++ plWe++skkm e g++rs+k+Ckekwen+nk+++k+k+ +kkr + +s+tcpyf+ql
  Csa20g057450.1 445 RWPKDEVLALINIRRsissmndddhKDGNSLSSSSKAVPLWERISKKMLEVGYKRSAKRCKEKWENINKYFRKTKDVNKKR-PLDSRTCPYFHQL 538
                     8**************99999999888888999999*********************************************8.9***********9 PP

        trihelix  86 e 86 
                     +
  Csa20g057450.1 539 T 539
                     8 PP

Protein Features
3D Structure
Database         Entry ID  E-value   Start  End  InterPro ID  Description
Pfam             PF13837   6.6E-11   95     166  No hit       No description
PROSITE profile  PS50090   5.482     96     148  IPR017877    Myb-like domain
CDD              cd12203   1.27E-25  444    519  No hit       No description
Pfam             PF13837   1.1E-19   444    540  No hit       No description
PROSITE profile  PS50090   6.98      445    512  IPR017877    Myb-like domain
Sequence
Protein Sequence    Length: 601 aa
MFDGGVPEQI HRFIASPPPA SPLPPHQSAA ERSLPFPVSF ASYNTNHQAQ HILSLDSRKI  60
IHHHHHHHHH DIKDGGPTPA EWICHTDHDG DNHHHPWCSD EVLALLRFRS TVENWFPEFT  120
WEHTSRKLAE VGFKRSPQEC KEKFEEEERR YFNGNNNNTN DHQHISNYNN KGNSYRIFSE  180
VEEFYQHGHD DEHVSSEVGD NQNKRNNSLE RKRNVEETVQ DLMEEDKLRD QDQGQVEEAS  240
MGNKINSINV GKVGNVEDDA KSSSSSSLMM IMREKKKRKR KKEKERFGVL KGFCEGLVRN  300
MIAQQEEMHK KLLEDMVKKE EEKIAREEDW KKQEMERLNK EVEIRKQEQA MASDRNTNII  360
KFISKFTDHD LPTSAFQDPS SLALPQTQGR KKFQTSSSLL HQTLTPHNPL TNDNSLEPTS  420
TKTLKTKTQN PKPPKSDDKS DLGKRWPKDE VLALINIRRS ISSMNDDDHK DGNSLSSSSK  480
AVPLWERISK KMLEVGYKRS AKRCKEKWEN INKYFRKTKD VNKKRPLDSR TCPYFHQLTA  540
LYSQPSTGTT TTATATSAGD LQTRPEVGSE DPDIPAPMHV DADGAGDKSN VPFSGFDLEF  600
*
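The sequence above is printed in groups of ten residues, with a running position count at the end of each line and a trailing `*`. Most downstream tools expect one uninterrupted string, so the block usually has to be flattened first. The following is a minimal Python sketch that assumes only the layout shown in this record. The names `parse_numbered_block` and `to_fasta` are illustrative and not part of PlantTFDB, and only the first two lines of the sequence are used as sample input.

```python
import re
import textwrap

def parse_numbered_block(text):
    """Flatten a grouped, position-annotated protein block into one string."""
    groups = []
    for line in text.splitlines():
        # Letter runs are residues; position counts and the '*' marker are skipped.
        groups.extend(re.findall(r"[A-Za-z]+", line))
    return "".join(groups).upper()

def to_fasta(identifier, sequence, width=60):
    """Format a sequence as a FASTA record wrapped at `width` residues."""
    return ">" + identifier + "\n" + "\n".join(textwrap.wrap(sequence, width))

# First two lines of the protein sequence in this record.
SAMPLE = """\
MFDGGVPEQI HRFIASPPPA SPLPPHQSAA ERSLPFPVSF ASYNTNHQAQ HILSLDSRKI  60
IHHHHHHHHH DIKDGGPTPA EWICHTDHDG DNHHHPWCSD EVLALLRFRS TVENWFPEFT  120
"""

seq = parse_numbered_block(SAMPLE)
print(len(seq))      # 120
print(seq[96:110])   # residues 97-110 (1-based): WCSDEVLALLRFRS
print(to_fasta("Csa20g057450.1", seq))
```

Residues 97-110 recovered this way coincide with the start of the first trihelix hit in the Signature Domain alignment, which suggests that the domain Start and End values are 1-based.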
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    275    280  KKRKRK
2    275    281  KKRKRKK
3    276    284  KRKRKKEKE
4    277    281  RKRKK
5    277    282  RKRKKE
6    277    283  RKRKKEK
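Comparing the listed motifs with the protein sequence in this record suggests that the NLS Start and End values are counted from 0, unlike the domain coordinates, which line up with 1-based positions. The sketch below checks both conventions against residues 271-290 of the sequence. The helper names `residues` and `matching_rows` are illustrative, and the 0-based reading is an inference from this one record, not a documented PlantTFDB convention.

```python
# Residues 271-290 (1-based) copied from the protein sequence in this record.
FRAGMENT = "IMREKKKRKRKKEKERFGVL"
FRAGMENT_START = 271  # 1-based position of FRAGMENT[0]

# (start, end, motif) rows from the NLS table.
NLS_ROWS = [
    (275, 280, "KKRKRK"),
    (275, 281, "KKRKRKK"),
    (276, 284, "KRKRKKEKE"),
    (277, 281, "RKRKK"),
    (277, 282, "RKRKKE"),
    (277, 283, "RKRKKEK"),
]

def residues(first, last, one_based):
    """Return residues first..last (inclusive) under the chosen numbering."""
    # Under 0-based numbering, position q is the 1-based position q + 1.
    offset = FRAGMENT_START if one_based else FRAGMENT_START - 1
    return FRAGMENT[first - offset : last - offset + 1]

def matching_rows(one_based):
    """Count NLS rows whose motif equals the residues at the listed coordinates."""
    return sum(residues(s, e, one_based) == motif for s, e, motif in NLS_ROWS)

print(matching_rows(one_based=True))   # 0
print(matching_rows(one_based=False))  # 6
```

When extracting these motifs from the full sequence, a Python slice such as `seq[start:end + 1]` therefore uses the table values directly, whereas the domain coordinates need `seq[start - 1:end]`.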
Functional Description
Source   Description
UniProt  Probable transcription factor that binds specific DNA sequence. {ECO:0000250}.
Cis-element
Source       Link
PlantRegMap  Csa20g057450.1
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  AF386963  0.0      AF386963.1 Arabidopsis thaliana Unknown protein mRNA, complete cds.
GenBank  AY081484  0.0      AY081484.1 Arabidopsis thaliana unknown protein mRNA, complete cds.
GenBank  BT000461  0.0      BT000461.1 Arabidopsis thaliana Unknown protein mRNA, complete cds.
GenBank  BT008389  0.0      BT008389.1 Arabidopsis thaliana At5g28300 gene, complete cds.
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_010494089.1  0.0      PREDICTED: trihelix transcription factor GTL2
Swissprot  Q8H181          0.0      GTL2_ARATH; Trihelix transcription factor GTL2
TrEMBL     A0A178UB25      0.0      A0A178UB25_ARATH; GT2L
STRING     XP_010494089.1  0.0      (Camelina sativa)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM82682838
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G28300.1  0.0      Trihelix family protein
Publications
  1. Ding Y, et al.
    Four distinct types of dehydration stress memory genes in Arabidopsis thaliana.
    BMC Plant Biol., 2013. 13: p. 229
    [PMID:24377444]