PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cse_sc009404.1_g020.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae; Artemisiinae; Chrysanthemum
Family Trihelix
Protein Properties Length: 353aa    MW: 38518.5 Da    PI: 9.9328
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cse_sc009404.1_g020.1genomeKazusaView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix52.81e-1642125186
               trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkm....rergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdq 84 
                            +W++  v +L+ea+++++   +r+klk+++We+v+k++     ++++ ++++qCk+k+e+++kry+ + ++        +s++p+f++
  Cse_sc009404.1_g020.1  42 EWSEGAVSTLLEAYESKWILRNRAKLKGHDWEDVAKYVssraNSTKLPKTQTQCKNKIESMKKRYRSESASGD------VSSWPLFQR 123
                            5*************************************999889999*******************9887766......568****99 PP

               trihelix  85 le 86 
                            l+
  Cse_sc009404.1_g020.1 124 LD 125
                            97 PP

Sequence ? help Back to Top
Protein Sequence    Length: 353 aa     Download sequence    
MDKQQQQPND TPSPKNNPVS ASSGAGLLAS ASGGGDRLKR DEWSEGAVST LLEAYESKWI  60
LRNRAKLKGH DWEDVAKYVS SRANSTKLPK TQTQCKNKIE SMKKRYRSES ASGDVSSWPL  120
FQRLDLLLRG TGGASGGSQV VTVASACGGG DVLPVVSAVP VSTSMMMVQS SSSVPVIVQA  180
PEPAPAPVLV PGPVLSPGQQ VTAQNSVDSN GVDRDNKQED VVVTKVPDNQ PPDKMDIESD  240
CSTPALYSNE KEKLSSKNHR TKMPDNRKRR RKRENWDVAE SIRWLAKVMV KSEQAKAETM  300
RELEKMRADA DIRRSEIDLK RTEIIAHTQL EIAKLFAASL GKSVDPSLRI GRS
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1267271RKRRR
2267273RKRRRKR
3267274RKRRRKRE
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G54390.13e-54Trihelix family protein