PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cotton_A_12728_BGI-A2_v1.0
Common NameF383_30272
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family Trihelix
Protein Properties Length: 571aa    MW: 63952.3 Da    PI: 6.1822
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cotton_A_12728_BGI-A2_v1.0genomeBGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix99.82.3e-3139125187
                    trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfd 83 
                                 rW++qe+laL+++r++m++ +r++ lk+plWeevs+k++e g++rs+k+Ckek+en+ k++k++k+g+ ++++++++t+++fd
  Cotton_A_12728_BGI-A2_v1.0  39 RWPRQETLALLKIRSDMDSLFRDSTLKGPLWEEVSRKLAELGYHRSAKKCKEKFENVYKYHKRTKDGRTSKADGKTKTYRFFD 121
                                 8********************************************************************************** PP

                    trihelix  84 qlea 87 
                                 +lea
  Cotton_A_12728_BGI-A2_v1.0 122 ELEA 125
                                 **85 PP

2trihelix1041.1e-32375460187
                    trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfd 83 
                                 rW+k ev aLi++r++++ +++++  k+plWee+s++mr  g++rs+k+Ckekwen+nk++kk+ke++k r +e+s+tcpyf+
  Cotton_A_12728_BGI-A2_v1.0 375 RWPKVEVEALIKLRTNLDIKYQDNGPKGPLWEEISAAMRNLGYNRSAKRCKEKWENINKYFKKVKENNKTR-PEDSKTCPYFH 456
                                 8********************************************************************97.99********* PP

                    trihelix  84 qlea 87 
                                 ql+a
  Cotton_A_12728_BGI-A2_v1.0 457 QLDA 460
                                 **85 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM007170.0053698IPR001005SANT/Myb domain
CDDcd122031.08E-2538103No hitNo description
PfamPF138374.5E-2038125No hitNo description
PROSITE profilePS500906.9113896IPR017877Myb-like domain
SMARTSM007172.3E-4372434IPR001005SANT/Myb domain
SuperFamilySSF466895.74E-5374452IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.601.4E-4374431IPR009057Homeodomain-like
PROSITE profilePS500907.399374432IPR017877Myb-like domain
CDDcd122031.47E-26375439No hitNo description
PfamPF138371.2E-22375461No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 571 aa     Download sequence    Send to blast
MVEGSAEAAT TVAGILEGSE GEEDRGRVDE GDRSFGGNRW PRQETLALLK IRSDMDSLFR  60
DSTLKGPLWE EVSRKLAELG YHRSAKKCKE KFENVYKYHK RTKDGRTSKA DGKTKTYRFF  120
DELEAFQNLH SLQPLSPPKP QTPTPTSASV MNPTNVPQSH AAVPSINPTL STQPVPPLHS  180
INPCFINISS NLFSTSTSSS TTSNDDSYQG SSGKKRKWKE FFKRLTKEVI EKQEELQNKF  240
LQTIERCEQQ RLAREEAWRV QEMARINKEH ELLVQERSKA AAKDAAVFAF LQKVSGQQPN  300
TVQGNPQPQP QPPPPAQPML APLSTSLPPP PPPPVQVPQP KTHPPPTQAL NFDTSEMSNG  360
GNSAVSVSLS PSPSRWPKVE VEALIKLRTN LDIKYQDNGP KGPLWEEISA AMRNLGYNRS  420
AKRCKEKWEN INKYFKKVKE NNKTRPEDSK TCPYFHQLDA IYKDKISKNG NSLASSSPYG  480
VKPDSRATVP LMVLPEQQWP PPRQANNHQA ETVAMMEEEA DKGNVGHNNN HIQEEEEEEE  540
EEEEDGDTED EYEGNDFELV AKTAPIGSGG E
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor that binds specific DNA sequence. {ECO:0000250}.
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankJX9640914e-99JX964091.1 Gossypium hirsutum clone NBRI_TRANS-360 microsatellite sequence.
GenBankJX9644124e-99JX964412.1 Gossypium hirsutum clone NBRI_TRANS-681 microsatellite sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017642221.10.0PREDICTED: trihelix transcription factor GT-2-like
SwissprotQ391171e-142TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBLA0A0B0MU260.0A0A0B0MU26_GOSAR; Trihelix transcription factor GT-2-like protein
STRINGGorai.009G253900.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM48492553
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76880.11e-138Trihelix family protein