PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cse_sc031829.1_g020.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae; Artemisiinae; Chrysanthemum
Family C3H
Protein Properties Length: 1433aa    MW: 159896 Da    PI: 6.2091
Description C3H family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cse_sc031829.1_g020.1genomeKazusaView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-CCCH19.32e-0614101433226
                             -S---SGGGGTS--TTTTT-SS-SS CS
                zf-CCCH    2 ktelCrffartGtCkyGdrCkFaHg 26  
                             +++ C+ f+++G Ck+G++C++ H+
  Cse_sc031829.1_g020.1 1410 GQRVCK-FYESGRCKRGASCNYWHP 1433
                             6799**.8899***********997 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1433 aa     Download sequence    
MDLWCWHICS ICQKAAHHMC YTCTYSLCKG CIKKTDYVCV RGDKGFCTVC LKTIMLIENN  60
GQGKDEKAQV DFDDKTSWEY LFKVYWVYLK GKLSLTLDEL TQAKNPPKVP ETTLAASPST  120
CVHNVDNDLK SIKTDTPPQV LEASESKRRK KDEQMVIPHK ETVSTKKLAT KESSSVVVRK  180
DWATKELLDF VAHMRNGNTS VISQFDVQEL MLEYIKRNNL RDPKKKSQII CDSRLKILFG  240
KPRVGHFEML KLLEFHFFMQ EDLSKSKINS IAKQVDPDWN SDNIVTMSNK KRKNHKKVED  300
RAPQNKLDEY AAIDIHNINL VYLRRKVMEN LIGDSEEFNK KVVGAIVRIR ITSKDQKHDI  360
YRLVRVVGTS KADAPYKVGD KSADVMLEVL NLDKKETVSI DAISNQDLSE DECRRLRQSI  420
KCGLVQRFTV GEIQDKAVSL QYARLNDILE AETLRLNNLR DRASEKGHKK RYPLMIDYLL  480
IFVVLSCIHL LLDTCISYKS YRFGEYVEKL QLLKTPEERE RRMREIPDVH SDPKMNPDYE  540
TDDAEEHSNK EDEQTKTKFT SHKNTSTPKK RAEFSNNVSS RSWKIEKNKQ LTIKSEEDRV  600
TTRKDAHGIK NNSGRPQNQV ESNGSTTTNW NHRTSGSSSF PNTASETATT SSMRNTPISN  660
DSEIDKVWYY RDPSGKVQGP FCMVQLQKWS TTGYFPTDMR IWINREDESL LLNDVLKEQL  720
QKKTATEKVG IRIEGLSVRT TNLLSDSQSS SGRITSSFAK SIESTGQFKM QEMASPTPIE  780
NSADNKPVSS VFAGNASLND LPSPTPNKIT SEGEKVENTI KEQSLPSSET LVRESENSDV  840
GRAHLPILSS ETLVPEAANT AGWSNGSSQD VGGAQLPKLA SEDTVRESGN PSSCNNDSDV  900
GRAHLSKLSS ETLAPEAGNT AGWSNGSSQD VGGSQLPKIA EEWSGYSPTE VKQEERHPGL  960
SQDIGGAQLP KQSSEKLVQD FGNAASWNNG SKQDVGGAQV PKSSSETLVQ VAGNETSWNN  1020
NSRQDVGGAQ VPKSSSETLV QVAGNETSWN NNLRQDVGGA QLSKTADEWS GYSPTQVKKQ  1080
EWHPEDDHVA TTTANTEQTI SSPPPSSQPE YNNYMPSWQG VRETIEFSTL AEESVSDLLA  1140
EVDAMESQNG FPSPTSRRNN FAEDIFNGSF EEFSPTPVHG ARSNGFSSNG IDIQLPFKWP  1200
EIARENITKD IQVTAHSSEV QPPSQDMIDV NNSIKSVEGE SQTKVGNVQI EEESVPIRPG  1260
HDASETSQDR EGCLNEHKES EFTQSASTKS EDPISEKVRA DHSDVMAVDV TLQSAKVDAV  1320
GPPQQGTLNN FAQPAPEITT RADTHLNPSQ GLTFGWDPQQ RYNNNSLSQR GYNVEDSGYR  1380
NGKPWVRQSL FGSGSDVNGG VGYSRPPPKG QRVCKFYESG RCKRGASCNY WHP
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1147171KRRKKDEQMVIPHKETVSTKKLATK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G16485.11e-175C3H family protein