PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG84175.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Nin-like
Protein Properties Length: 1899aa    MW: 194789 Da    PI: 5.656
Description Nin-like family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG84175.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1RWP-RK901.9e-2811651215252
      RWP-RK    2 ekeisledlskyFslpikdAAkeLgvclTvLKriCRqyGIkRWPhRkiksl 52  
                  ek+i+l++l++yF++++kdAAk++gvc+T+LKriCRq+GI+RWP+Rki+++
  GBG84175.1 1165 EKTIDLSVLQQYFAGSLKDAAKQIGVCPTTLKRICRQHGIQRWPSRKINKV 1215
                  799**********************************************97 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1899 aa     Download sequence    
MAMHEAAMAT TGPGPIKLYS AQGQTLGTSH VLPSSPWSSA PLFVFEGQDV AACAPDYGSH  60
RCGDSQHSQT SLDFRRFLVG EGSVCGGGLL GPGGLVDAGG MLFGGSLRGG NAALLMANAI  120
TAAGGSGGGG GAVLASGSSG PNAGGVAAGA GGGVGEVNIC DISAAGECRE PADGAGVCQD  180
SGVAGRMAAA PTGGVSLWNS SVHGGDAFGH SLLYPLQISE LMELDPIEDE LSSESWLQHG  240
GEGSYHRGNG AATGGSQAEA SPAVNSPQQI QHGICSASAS ASASASTSAS PSAFTSASAS  300
ASAPSAANNN FTSKGGGGYP NCDTTFSPVH CSTSMPGNFA FAPSFPAAPT VGQAESMMMA  360
TASFPAVCGA PGGFCANGRI LWSTQLQQQL LAGSAAKMAR SMFVSPLSPA APGSASTCRP  420
SAQCAGIPTL YARSGEQSYE TMRRLWGQGG AGSGGGVLSC EAGGGGALDP GARRQDGDQG  480
GAVEGMGEAG RTVRTTGRLV AREGGATVGF RKRPREEEDR DREGEGEGGG GMCLHADDSG  540
SVHRPRHHHH HHEHDRVSQC GTRNEQQQRM QGERGGGERT DSAILRGEGG IQDEEAGGEC  600
DSARSPNGEG HYYLVGGNDW DEGEGGWGRV TEGGVGGGVS WRSPDAVIAI STKLTQRMMK  660
ALHCINQTRP DVLVQIWVPR SDHGRLVLTT KEAPFAVEQC SQQLARYRGV SEEYIFSGEE  720
ISQYSGLPGR VFKSGKPEWS PNVQFYSPME YLRANHAAEC DVRGSLAAPV FHPGLPRCVA  780
VIEVVMVGED VRFALQMNNI CRALQAPSLA HGLILAEISE VLTQVSQQHD LPIAQCWIPC  840
EREEQQEEKQ LCYIDEIEEG DASSDSKGCK KKKGSWSLST TEAPFYVSNP IMWGFRNACC  900
EHKLEEGQGT PGRAMMTCAP AYAEDVKRLS KLQYPLTHYA RMFGLGAAVS IRLRSSHSGM  960
ADYILEFFLP TSCAAHDQQQ LLLSGLSVSM QWACRTLRPV TKAELESERS NGIMDSGQEV  1020
FQGAVFKGAA MNPATQCNSL GPPAASAGGD GVIVQIIAED AEDDDEEDEE EDSGEGIDND  1080
MEAEGLCGHS RNIQAPEPVC DPEVPIEPEA SNVAAANRRG KGSTGGKAKG GGGGKAGAAA  1140
KGAPEAEGAT GALRRAERGR RGTMEKTIDL SVLQQYFAGS LKDAAKQIGV CPTTLKRICR  1200
QHGIQRWPSR KINKVSRSIK KLQGVINSVQ GKDGGLQINP LLGAAADDLS AIAALGALGA  1260
ATCAAVNNGP GRGSTLSTTA MGGPSSAPGP ATHPVLGVHG ADAGLSTPES AVTAAALGDW  1320
GVTWAAAGSA GGNPGVRGGG QQKGSGGTRG MMQQQALGMG GSPASGTDQT PPINVSAESD  1380
SMENKPLQRS FSSLHPAMGC AVQNQQQQQT GVMGGGGPVR VVDVDTTEEG GSNHGDEVRC  1440
VTERGSDLGN GSHHSGKSFE NRVMQGVSNC ETSCGAWGVK PGLRRGGKAT GWAAVSGMAQ  1500
GPGGYQCLES RGDLKGVPTW SVSGFGTITA AGGSMSGRRG TAVEGVHGQQ GTGVGSNCRA  1560
QEGEIACMSG RGGGDGGSSC GGQVFRWDGT GSPSSTGQLG HVTTMDLLST IESRVHGGDA  1620
ALAALRVING PGDVGGGATP GRDCDMQNTE PAGYGISYSG RRGGIGMTAG QSNTSAMGLT  1680
AAGGGGMTPR SPARPMRMLG NSLTPNLMTA SSPSNTSCST GMCSGKAGSP GVGAVGMVGA  1740
LGLSPQMLCS SMIDNLPQWP RGFRGGGRDS NAVTTSDVTV KVTFGQDTAR FRLPSGHCFD  1800
DLQQEVAQRL KLDTTTLVMK YLDDEGEYVL LSSNEDLMEC IDVSRNAGNN TIKLTARCEG  1860
GSGLQYGARA ACTGGCGVPN IGYYDAGMAS LSDMNSPQR
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
111251133GGKAKGGGG
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G20640.21e-68Nin-like family protein