PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID TRIDC4BG004770.2
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Pooideae; Triticodae; Triticeae; Triticinae; Triticum
Family CPP
Protein Properties Length: 564aa    MW: 61226.2 Da    PI: 6.081
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
TRIDC4BG004770.2genomeEnsemblPlantsView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR50.93e-16465503240
               TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                       ++k+C+CkkskClk+YCeCfaag++Cse C+C++C Nk 
  TRIDC4BG004770.2 465 SCKRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKP 503
                       689**********************************96 PP

Sequence ? help Back to Top
Protein Sequence    Length: 564 aa     Download sequence    
ARAEDSPLFG FIDSLSPIEP LKSAYSGNGL QAYHQSLNAA SVSSIFTSPH HNAHKEGSKL  60
SKSSFGDYTE NEISMEDGTD KNKSPTSSTA VRLFACTSTI TRESHTMITC SVNEGIVDPP  120
KEPNDLPQPG RFDSGSPDHN TAPCHGVSVR SDLKQDKCPK LEAVQATNNT VEKRKCLFSS  180
DMQPQDGCQP AKENNEVMGC EWEDLVSVTS GELLAFDSSM DQHHTGVQLA VNNAESCGYL  240
LSKLAGGADI PDRTHPTTSS QAYYHEMVVG EDKTENGQLF PEDKKTILSE EIQDNINEEN  300
ACIPLGCKVE TQQRGVRRRC LVFEASGYSH RTVQKEYVGD LSFSTSKGKS SAQNHRNPGK  360
TPSPHVFRGI GLHLNALALT SKDKMACQDP LATALVPSLK TEQDVHGNLL SAGGNFVHSG  420
SGSLDLQMDN DDCSVGGFLG NDHNSSQSSS PPKKRRKSDN GDDDSCKRCS CKKSKCLKLY  480
CECFAAGVYC SEPCSCQGCL NKPIHEEIVL STRKQIEFRN PLAFAPKVIR MSEAGQETQV  540
GNFINKLIST SVGHYTLLCD AEIL
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1452458PKKRRKS
2453457KKRRK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G14770.13e-32CPP family protein