PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID evm.model.AsparagusV1_04.590
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; Asparagales; Asparagaceae; Asparagoideae; Asparagus
Family C2H2
Protein Properties Length: 1146aa    MW: 127714 Da    PI: 8.451
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
evm.model.AsparagusV1_04.590genomePhytozomeView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H2140.0001510311053121
                                    EEET..TTTEEESSHHHHHHHHH CS
                       zf-C2H2    1 ykCp..dCgksFsrksnLkrHir 21  
                                    y+C+   C++sFs+k++L  H +
  evm.model.AsparagusV1_04.590 1031 YTCNieGCTMSFSTKHDLALHKK 1053
                                    99*******************87 PP

2zf-C2H212.80.0003610561078323
                                    ET..TTTEEESSHHHHHHHHHHT CS
                       zf-C2H2    3 Cp..dCgksFsrksnLkrHirtH 23  
                                    Cp   Cgk F ++ +L++H ++H
  evm.model.AsparagusV1_04.590 1056 CPeeGCGKKFFSHKYLVQHKKVH 1078
                                    9999*****************99 PP

3zf-C2H211.60.0008411141138123
                                    EEETTTTEEESSHHHHHHHHHH..T CS
                       zf-C2H2    1 ykCpdCgksFsrksnLkrHirt..H 23  
                                    y C++C+++F+  s++ rH r+  H
  evm.model.AsparagusV1_04.590 1114 YICSKCSQTFRFVSDFSRHKRKtgH 1138
                                    89******************99666 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1146 aa     Download sequence    
MKKFNKKTPF SPLEVEALFW RACADKPFSV EYANDMPGSG FVPIVKKWGC KEEEVGNVGE  60
SGWNMRGVSR AKGSLLRFMK EEIPGVTSPM IYVAMMFSWF AWHVEDHELH SLNYLHLGSG  120
KTWYGVPRDA AIAFEDVVRV HGYGGEVNPL VTFTILGEKT TVMSPEVLLG AGIPCCRLVQ  180
NPGEFVVTFP GAYHSGFNHG FNCAEAANIA TPGWLRVAKE AAVRRASINY PPMVSHFQLL  240
YALALSLCSR VPLIGVSEPR SSRLKDKMRG EGEVMVKEIF VQNVIETNDL LSALLNKGSS  300
CIVLPHNMHD SPLCSNSLLR SQLKMKPRLS LGLCSREEAL EASRVFPAND VILGRNAGIE  360
HSKASATSEK SGISAHNSSG SDSQHGENEN ESAVHSDGLL DHGLLSCVTC GILSFTCVAV  420
VRPKQAAARY LMSADCGFLN EQNIVSGENS NRDDKIHWRR STSDLLCDSE QLERHAQYRS  480
PSPVHVSDHN FEDNGSDAAC RGASALALLA SAYGDSSDLD EDGSPATSPR ADEDNDSSHV  540
TRCANINLPE SDCQNDSDNE YDEMNGSTPE FSSGDHPGML DNLEDNGEME TSSSSIKSIG  600
ETRSVDYEGP ENKYCTTGTA KICQSNVKME RIASGSVSTV MKPNSTRTSP SRNNDAIRQC  660
SVVSAIERSD KDSSRMHVFC LEHALEVEKL LHPLGGVNIM ILCHPDYPKI ETEAKLLAEE  720
LETDYEWKSI DFRGPTQKDL ESIRAAMEDE ESMPTSSDWA VKLGINLFYS ANLSKSPLYS  780
KQLPYNAVIY RAFGCKSPNN SPLTSKSSKR RPGRQRKIVV AGKWCGKVWM RNQVHPLLSD  840
RKDDQEQKHE RFYSKSNSEI KSEEVETEIK NEKIVSRKSS RSRKRKKRPL SRASAKKQKC  900
VTPQVMDKKA KVSDTSAAQT SSTKHSRVLR SCRKSDVKYE SEEEPTTVRS SRRLKAKSET  960
KTKPTITRST SKLHIEQETE EAPNTRLRTK PSKSKDTSAN PPINKQSRKK KPKTTPNPIK  1020
EEEEEEAAKD YTCNIEGCTM SFSTKHDLAL HKKDICPEEG CGKKFFSHKY LVQHKKVHLD  1080
DRPLACPWKG CKMRFKWAWA RTEHIRVHTG DRPYICSKCS QTFRFVSDFS RHKRKTGHLA  1140
KKGRR*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1883887RKRKK
2883911RKRKKRPLSRASAKKQKCVTPQVMDKKAK
3884912RKRKKRPLSRASAKKQKCVTPQVMDKKAK
4885913RKRKKRPLSRASAKKQKCVTPQVMDKKAK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G48430.11e-170C2H2 family protein