PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_020696040.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; Asparagales; Orchidaceae; Epidendroideae; Malaxideae; Dendrobiinae; Dendrobium
Family C2H2
Protein Properties Length: 1421aa    MW: 158925 Da    PI: 8.687
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_020696040.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H210.90.001513361359223
                      EET..TTTEEESSHHHHHHHHHHT CS
         zf-C2H2    2 kCp..dCgksFsrksnLkrHirtH 23  
                      +Cp   Cgk F ++ +L +H ++H
  XP_020696040.1 1336 VCPekGCGKKFFSHKYLLQHRKVH 1359
                      69999*****************97 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1421 aa     Download sequence    
MAPDPSLSEV LPWLKTLPLA PEYRPTLVEF QDPIAYILKI EQEASRYGIC KIVPPVPAAP  60
KTTAVANLNL SFSSRSPSGD PTFTTRQQQI GFCPRRPRPV QKSVWQSGCR YTLSQFETKA  120
RSFERAHLRK SRKKNRSSTS LCPIEVESLF WKTAADKPSS VEYANDMPGS GFAELREGEM  180
DMTAAENVGE TAWNMRGVAR AKGSLLRFMR EEIPGVTSPM VYVAMLFSWF AWHVEDHDLH  240
SLNYLHMGAG KTWYGVPRDA ALAFEEVVRI HGYGGEVNPL ITFATLAEKT TLMSPEVLIS  300
SGIPCCRLVQ NAGEFVVTFP GSYHSGFSHG FNCGEASNIA TPEWLKVAKE AAIRRASTNC  360
PPMVSHYQLL YALALSLCSR NPKIVSTPKS SRLKHKRRCV GETMIKETFV QNVIHNNSLL  420
NILLDDGSSC LILPQRTFDK PLCSGLHLKL QMKVKPRIAL GLCHAEDKME GNSVHDFYIG  480
SRSSWPSSSG FCPAEENSNS VYYRKNLLAT KISSLGSSGS QILSSRLQNK EGQNNKYRTD  540
GVLDQGLLSC VTCGILSYAC VAVMQPSEAA AEYLLSSNCN SFSNQIVGFE ENCGLNNEAT  600
SNTLNSDVDV NVGHIEKIHS PVQVHFRNDG LLSNAKVERG ASALDLLAAA YGDLSDSEED  660
ALHDVSFCAD GDKLTHSSLM CIVDDQSASD KDSYRMHVFC LEHAVEVEKR LRVLGGVNMM  720
LLCHPEYPKI ESEAKLVAEE LGLDHQWKSV KYRDASEQDQ EKIRLAIEDE DVIPCNGDWA  780
VKLGINLYYS VSLSQSPLFR KQLPYNEVIY KSFFQKSPVE QISSGTISRR QKKIIFAGKW  840
CGKVWMSNQV HPFLADQNFP DDDDNIDVSL EANSYRENVN QLVLSRRNSL NSYTADRRNS  900
GKKRTKSEYL SSARKHAPAP ARLDKDCNAR YAEENSGYAG NADCDPNTDR SEAETDIEED  960
DEQAILGRRN STNNNAVARR KSDHKKYTVR SSTRKCTHTW LDKSSDARCA EENSGNADGE  1020
QDRDRSEADT DMEDDKQANG SRWNSTKNNA YARRKFYKRK KSAVISNALK HAHIKSDNFS  1080
DSRYAKENSA HAGNSACAPD ADRSQSNSDI DDDIQATIRK RHLTNSNGAP KRKSDEKRKC  1140
SVVRSRTRKA THTQLEDSSN AGCAEENTGN ARYADDDRHR DQSSEADTDS EGKKQASVPR  1200
KNLPNNNAAA RKKSGKKRKK LKDGCNRKAA QTRSGSSAKT GCAEDNFSMK CGGRFSCVSP  1260
SVKGTNQDQR SMQLRKRRLN SGEANVIVAP EKPNNRKKIK KALDSVPDKE NFSCDIEGCC  1320
MSFRTKHDLC LHKRNVCPEK GCGKKFFSHK YLLQHRKVHC DDRPLKCPWK GCKMTFKWAW  1380
ARTEHIRVHT GERPYSCREP GCDLTFRFVS DFSRHRRKTG H
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
112161220KKRKK
212161228KKRKKLKDGCNRK
312171229KKRKKLKDGCNRK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G48430.10.0C2H2 family protein