PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_022736427.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Helicteroideae; Durio
Family C2H2
Protein Properties Length: 1691aa    MW: 191554 Da    PI: 9.1533
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_022736427.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H212.80.0003616001622323
                      ET..TTTEEESSHHHHHHHHHHT CS
         zf-C2H2    3 Cp..dCgksFsrksnLkrHirtH 23  
                      Cp   Cgk F ++ +L++H r+H
  XP_022736427.1 1600 CPvkGCGKKFFSHKYLVQHRRVH 1622
                      9999*****************99 PP

2zf-C2H211.60.0008816581684123
                      EEET..TTTEEESSHHHHHHHHHH..T CS
         zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt..H 23  
                      y+C    Cg++F+  s++ rH r+  H
  XP_022736427.1 1658 YVCAeeGCGQTFRFVSDFSRHKRKtgH 1684
                      89********************99666 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1691 aa     Download sequence    
MAASSLSPEP SQEVFSWLKS LPLAPEYRPT LAEFQDPIAY IFKIEKEASL YGICKIIPPV  60
PPAPKKTAIG NLNRSLLARA AANASSDLKP APTFTTRQQQ IGFCPRKPRP VQKPVWQSGE  120
YYTFQEFETK AKSFEKTYLK KYSKKGSLSA LEVETLFWKA TVDKPFSVEY ANDMPGSAFV  180
PLNSKKSSGG GREAGEAVTV GETPWNMRAV SRAKGSLLRF MKEEIPGVTS PMVYIAMLFS  240
WFAWHVEDHD LHSLNYLHMG ASKTWYGVPR DAAVAFEEVV MVDGYGGEFN PLVTFSTLGE  300
KTTVMSPEVF VRAGIPCCRL VQNAGEFVIT FPRAYHSGFS HGFNFGEAAN IATPEWLTVA  360
RDAAIRRASI NYPPMVSHIQ LLYDLALEFC SRVPMSISAK PKSTRLKDKK KSEGETVVKK  420
LFVQSLKQNN DLLHILGKGS SVVLLPKSSD ISLCSDLRVA SQLRTNPRMS LGLFNYKEVV  480
KSPKDLATDE IMPGGNEETK GVKGFYSAKG KFVTMYEGNQ DSSFSGTDYL CRLPVRTLNM  540
SMERENAVQG DALSDQRLFS CVTCGILCFA CVAVLQPTEQ AARYLMSADC SFFNDWTVSS  600
GVTRNGFTAA QGDAITSEQN PFTRWMNKRA PNALYDVPVQ SVECKFRLVD QSIPVVEDTE  660
KGGDTSALGL LASTYGNSSD SEEDHVEPKA TVSGDETNPA KVSCERKFQY NESGFSPGDV  720
SGSHNPSLSR LDKEEAPVHV IDGYSEPGSR RVDVKNRSPQ TFDSTIEVET DNLASRRSNG  780
LEDKFRDPIT ASHANPSYSP ATHGTEKMRF GKAVVPMENA DIPYAPRSDE DSSRMHVFCL  840
EHAVEVEQQL RQIGGVHVFL LCHPEYPKIE AEAKSVAEEL GIDYTWSDIL FGDATKDDEE  900
RIHSALDSED AIPGNGDWAV KLGINLFYSA NLSRSTLYSK QMPYNCVIYS AFGRNSLVSS  960
PSKLNVYGRR SGKQKKVVAG KWCGKVWMSN QVHPFLAQRD PDEQEQERSF HAWATSDENL  1020
ERKPENVLKA ETTKVAKKFN RKRKMRAGIA PRKKVKYIEP EGAASDDSLD GSSLRQQQRF  1080
SRGKQPRLIE KEEAISYDSL EDDSLLQHRN LTRNKQAKFI EREDAESEDA EEDFTHQQHW  1140
RNLRGKQGKY IEEDDAVSGD SLDERSLKQY RRIPRSWRAK YLEREDALSD DEQEEISHQL  1200
HRRIHRGRQI KSFERNDAIS DDSLADNSLK QYKRMRKGKQ SKFFERDDAM PDDASDDDSQ  1260
HQLRRIPRGK QMKCIERDDA XADDASDDDS QHQLRRSPRG KQMKCIERDD AFSDDSLEDN  1320
PQQQHRRIPR NKVAKFTDRE AVVSFDSLKG NSHQQHRRVP RRQLTKFIER EDAVSSDSPD  1380
DSSLQHHRRI PRSKQSKILG REDGVSDDSQ DDTSLQQLRK TPRSRQGKFI ERENAVSYDS  1440
MDENYHQPNR RSLRSRKRKA QTPRQKKQDT PQNVKQGKRR TTKQVASQQV KQETLRNQNI  1500
KIDRSARRCN SYGEDEIEGG PSTRLRKRIQ KPLKVSETKP KENKQAGKKK VNNSSNLKAL  1560
AGHNTVKVRD EEAEYQCDME GCIMRFGSKQ ELIQHKRNIC PVKGCGKKFF SHKYLVQHRR  1620
VHMDDRPLKC PWKGCKMTFK WPWARTEHIR VHTGARPYVC AEEGCGQTFR FVSDFSRHKR  1680
KTGHSAKKGR G
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
110421054KRKMRAGIAPRKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G48430.10.0C2H2 family protein