Tan0016949 (gene) Snake gourd v1

Overview
Name: Tan0016949
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Unknown protein
Location: LG11: 2624603 .. 2631292 (+)
RNA-Seq Expression: Tan0016949
Synteny: Tan0016949
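The Location field can be read programmatically. The sketch below is only an illustration: `parse_location` is a hypothetical helper, not part of any database API, and it assumes the coordinates are 1-based and inclusive (the GFF-style convention), which the page itself does not state.

```python
import re

def parse_location(text):
    """Parse a location string such as 'LG11: 2624603 .. 2631292 (+)'.

    Assumption: start and end are 1-based inclusive coordinates.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,  # inclusive span of the gene
    }

loc = parse_location("LG11: 2624603 .. 2631292 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the gene spans 6690 bases on the plus strand of LG11.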
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TCTCAGTGTTGAAGGTTCCAACTAAAAAAATGCCAAATTAAAAAAACAACAAAATAAAATTACCAACTTTGGAAAATACTATAATATATGTAGAAAAAATTTGATTTGCCTTTTTTTTTTTTGGTGTTTATAAATAATAATAATAAAAAAAACTGAGCCGGATCGGACTTCGTCGTCTCTCCGGACGTAACGAGGGCGAAGGAAAGAGACTGTAATTACAGAGTATATATATTATACACATCGAATTTCAGAAATTGTTCCGGTGGCCAAGCCTGCGTTTCTCTTTTCGTTGACTTTGACTTTTCCATTGACGACTTCTTTTTTCTTGTATTTTCCTCGTTGAATTGAAATCTTCAGCAATGTGGCGCGTTTCCGCCATCTCGATTTCATCGCGGTTCTGCTCGATTCGGGAGTCGTAGAAAGACGGCATATCTTTCGCGGCGATGAATCGTGATTTTGCGCTCGTATTCGTGTTCTTCGTTCTCATTCTCGCCTCTCGTGGATCCGATGCTTCTTTCTCGGACCGTATTTGGAATCTTCATCATCGGTTCGCGCTTTCGAACGATTCTCCTAAGGTTTGTTTGCTTTGGACTCTCGATTTTCTTAATTTTGGTGCAACGTTTGGATTGCGGTTGACTTTGAAATTTGTATTTCGCCTTTTCGTTTCTTTAGAGTATAGCTCCGGCCCCTGGTCCTAGCCCTGTTTCCAATGGTAAATTGAGTGGGGACGCCCCAAAAAGTTCTCCAACTCCTGCAATCCCGCCATTTTCCAGTTCAACTGATGGTTCTACTACAGAGAAGTGTAACTCGTCATACAACCCTTGCCACGACCTTAAGAATATGATTGCTTGCCTTATATTCGCAGAAAAAGGTAATAGCGTTTTTATTCCTTTTTAGTTTCTTTCCCCCAATACTATATCATGGAATGTTTTTCTTCTTTCGAACTTAGCTTTTCGTGCTGCCAACTGCCAATAAATTAAGCTTGTGATTTGAGAGGATTGTACCTCTTTCTAACATAGAAATGAACACTCTTCCCATGATGTTCTCTGCTTATTGTTAATGCTTACTGAAGTTTGTTTTCGTTTGCGAATGTTCTACTTGGCACTTGTTATATATTCTTTTGCTGTCTTTCCCAGATTGAATGCATTTCTACTATATGACCCACGATAGTATAGTAGTTCGTTATTTTCTTGATATTAAGTCAGCTTTTGTCAGAATTAGAAGCTTTGGGCTGTGAGGAATTATTTTCCAACTTTCAGTCAACAGTAAATGTCTGCATAAAGATCAGACAATGAAATGATTGAGTCAAAGCATTAAAACCATAGAACCCAAAGATGATGGGAGGAGAGGATGCAGTGAGAGGGGACAGGGCATGCTGTTATTTAGTTTCCAAACACCACTGCCAATCCTTGATTTTTTTTTTAGAAGAGAAACAATTTTCATTAATAAATGAAAGAAATGAAGCTTAGGCAAAATCCCAATACGAAAAGATGATTACAAGAAAGTTCTCCAATGGAAAATCAAAGTTGAAAGTCTACAGTCACCAAAAAGGTGTTTATTTTTATACCAATACGAAGAATGAAAAAGAATCAAATTCAAGAAACTATCAAAAGATGAAAAAGAGTCCTTAAAGATTCTTCTATTTCTTTCACAATATAAATACTAGGAGAAAACTCTATTGATAGCAAGCCAGAAGGTTTTTTGAACACCATGTAAAAGGATGTCCCACAAAAACCAAATCCAAAAGCTCACGGATGGTGTTGGGTAAGACCGTGGACCATCCAAAAGTATCAAGCATGGTTGCCCAATATCATGACACAAAGTTGCAATGAGCAAAGAGAAGACAAGAAAATTCTGAGCTGGAACAACACATCACACAATAGAGGGTTTTGTGAAAGAGGATGTCATTAATGGAGAATTTGCTGGCTGTAATTAGTCAATCCCTCAACACCATGCGGCAATCCTTTTGATTACTGGATGTTAGTTTGTCTGAATGGACA
TGAACAAACCACTTACAAGAGAACAACTGTGGTGGGAGAGGAGAGGATGACCACCTTCTCTTCCCCAAAACGAACTGAATAGATGGTGGTAACAAGTTGCCTATCAATATCCACGATATGTCCCTAGGATACTCCAATCAAACTACAATGTCATGGGTTCAAAAGATTCCAATCCTGAAGGTGATATATATGGTGAATAACCCTCCCCACGAGAAGACTCTTGCCAGAATTACAGTGCCCTGAGAAACCAAAATTTCCAATTGAAAGATGATTTGTCTATGCCCAACAATAAGGTCATCGTGGGAAACGTCTTGGACTGAACCAAGAATGTTGATTTATATGTGGGAATGATTGAGGTACATATGGTAAAGCTAGCAGTGTAGGTTGAGGATCAAATTACTTGGTGGAACCAACTGTATGGAAATTGTACATAACTTTAAACTATTGCATTCTTGGGACTCACATCCATAAGGCCACGAGGGAAAGGTTTCACTTGTGTATCACTTATAGTTACTATAGGATGCATATTGAGATAGTTAGCCGGATGTCTGTAATGTAGACACATAGTTGAAGAGTTCTATTGGCTTAATAGATGTAACAAACTTACTGATTCACAGTTCCTACAAGTTGCTTAGGATCATGTATTGTGAACAGTAATTCAGGAGTGTATGGACCTACACCTTGTTGGGCACTTAACAGAGGCACTGCAACTAGAGAAATTGACTGATCATTAGGTAGAGGTAGTTTGTTGTCACATTGATCTAGGAAGGTGACTGATCAGTGAACACGGGGAAATGCAAACAGATATAACAATTAACGATGGTAATGGAATAAGCCTGTCTAGTTCTCAAGGTAATATAAAGTGTAGAGGATAAAACTAAACCAGTGAAAAGCATGGATTCAAGTAGTAGATAACTTACAACCCTGATTCATGTACCAATAGTGCGAAGTGTTTCAAGTATATGAACACAAGGGATACTTTTTGAAGGTCCTTAGCAAGACTATTAGTAGCTTTGGTGGAATGCCCTGATATGGCACGATGCCTATTGTGACATGGACAAAGAAGTAGAGGGAGAAGTGACTTGTTCAAAGGGCAGGCGATATTAGTTGTGTAGTTCAAATATCACATTTGACCCCCAAGGAAACTCTCACCCTCTTAGCCACTCATTCTTATGAACTTGATGCATGGTAAGTAGTCAGGTTGGTAATATCATTATTGTGCTGGAACCAAGGAAAATTTCATTTCAAAGTCATTATTGGTTCGTGCATGGTTAGCTGGATCAAAAGAGGTGGAAAATTTTCCATGAGGAAACTCTACTGTTCCAGTTGGGAATGCTTATAAAGGCCCAAGTGATATGAATTGTGGTTGATACAAATTTCAGCCATACTCTATTTGGCTACCAATGGTTGTGACGTGCAAACTATCCCCATTAAACATCAATCAATATCATAGCTTTTTTGTGAAACATAGTTTCCCGCCAAGGACTTCTCAAATTGAATATTTGCGTAGAATAGTGGCTAAAGACTTCAAACCTTAAAGGTTGTTTATAGGAAATACGAAATGGACTACCGTGATTGCAAGACATGAAAAGTCGCATGGACTTGGTTCTAGATCTAGTGCTTTCAAACCTACCTCTCTTCAGAATGGATCCTAGCAAGTATGCACTCCAAAGACAAGTGCAAGAGAAGAGTTTTGATATCTATGCTGTAACTGGCCTCAAGGGTATAACTCGATCGGTTCTTCATGATTTTTATGAAAAGCAATGCTGAACTAACCATGATGACATGAAAAATTTTAAAACTAAGTTGTGATTATTTTTTATTTTATTATTTGTTTTTTCTGGTGGGGGTGGTGGGGAATCAACCAGCCTAATCAACAAAACCCAACTAAACTGACCAAAATCCAACTAACTGAAATTGACTTGACTTTTCTGTTTTAAAATTTGACTATAACTAACCTTCTTTGCACCTAATAAAATTGACAGTCTTGTCAGTTTTTC
GATTTTGGTTGGTTGTTGGTTTGGACATGGGTTTTGACCCATCTATTTGTCCTAGCTAGAACTTTGTGCAATGCCAACCTTTTCACTGCAAGATGACGGTATCTGGTATTTGTATCTGGGCAGCGGAGCAACAGACAAGATTGCAACGTAAGTATACTTTCTAATAACAAGGTTCAACATACGCATCAAAGGTCTGGCCCTACTCGAAATTACATTTTATCGGTAAACTATGGGGTTGGTAGACTTTTATTGGAAGTTTATTTGTGTTCCATTCCAGCACAATTTGCTCCAATGATAGGCTGCCTAAGGAAGAATTTGTTGCTTCTGTTGTGTCTTGGGTTGACTTCACCAAGTATCAGCAAGCCTTTTGATGAGTTTCGGGTGGATATCTCATGTAGTATTGAAGGCTGCTCATATACAATAGCACCAACAATCTACTGTCCTATGCTCTAGGTCAAGCATTCTACGAGCGGGAACATTATATAACGCACAAATAATTTATACCATTTAAGCATTTACCCAATGCCTTTCACCAAGATTCCAAAACTCGAAGGCCAATGGACAGACTTTCTTTGGAACTTCAACTTCGTGATCAAACATCAATCTTGAACTAATGAATTGAAAGTGCATTGAGAATAAAATATTATTGCTAACTTTGAAAGGAGAAGTCGTTGCTCTTGATTTGTTTGGAGGTTACTGTATGCTCATGAGTCACGATGATAATTTTTTAGTCATTTGGAAATCTTGTAATCAGCAGCTTGGTACTGACAATTTCCACATATTTGGGGAGTTTTCATTCAAGATAACCAACTTTGTCTACCAAGAACTTTGCTCAGGAGCAGTTTATTAACGACCTTCATTGTGGTGGTTTTGCTGCTCATTTTGTGACTGATAAGATATTAAACTTGATTTCAACGTGATTTTTCTGATCTAGCTGCAAAGGGGGATGTGGGTAATTTTGTGAGATGATGTGCAATCTGTCAAAATTAATAAAAGGGCCAGCTGCAAAATATAAACCTTCACATACCTTTGTTTATTCATTAAATTAGGCGATGTCTTTTTTGTCCTTTTCATTCATTCTACTAAGCCAAACGGCTTGGAAGTGGGAGGTCAAGCGTAAACGAAACTGGAAATGCACCACGATCGTAACTGTTGGAGACAAAACAAGTAGTTGTCTGCCTTTATGTCCTAAAACAGGTCCTAGTAGTCTTCATGTGCTAGTGTGGTCCATATTTTTACTTCAAACTGATTGTGTTTTCTATTTTGTTTCTGCTTACCCAGCAGACATACAAATAATCCCAGTGTTCTTTTATACCGAGGTTTGTGTTTCGATTAACTGGTGTTCGCCACCCACCCCCAGCCCTTTTAGCATCACCTCCATGCCATACCTGTCTTTCAAACACATTGCGATGATGTACTGTTTCAAACAATTTCCCTCGGGGGGAATTAATTCACAATTTATGTTAAGCTTCCAAGGCCAGTATTGCTAAAATATTGTTTTTTCACAGCAGCGGTGGAACAATATCTTTTGATCCAAAATGTTGGAGAGACTTCTCTGAAAGTAAATGTTACAATTTCCGATGCTAAATACAAGGAGATACAAGTTCCTGAGCATCAATCCAAAAAGGTACTCCTCGTTAATCTTTGCTGGATTTTCACGAGGATAGTTAACTTCTTCTTTAAGTAGGATAATAGTCAAATATTCACTACCGAAGGAAAACAAAGGTTTCTGAGACCGTCTCTTTGGTTTTAATTTTAAACTATGAAGATTAAATCTTTTTGAATTCTAAAGCGTGTACCTTGAGATTCATTTATGCTCCATATTCTATTTCAGGTTAATATCTCAGATGTTCCAGGAAATTCAATGATCATATTAGATGCTGGAAATGGGAAGTGTATGATTCACGTAGGATCACCAGCAAAAAATGGCAGCATTTTTAAGCAGATCTCTTCCTATGTAACCCATTTAAACCTCGTATCCGGATCCTACCTACTATTTT
CAATTGTTTTGATCATTGGAGGTGTCTGGGCATGCTGCAAGATGGGAACCAAGGAACGCCATGCTGATGGAATCCCATATCAGGAGCTTGAATTGGCAGAGCATGACTCTTCTCCAAACAACGATTTGGAAGCAGCCGAAGGATGGGATCAAGGCTGGGACGACGACTGGGACGAGCCGAAGCCACAAAATAAATCAAGTTCTCACACAAAGGCAAACGGATCATCAAACGGTATTAACTCTAGAACTTCTGATAGAGATGGATGGGGAAATGAATGGGACGATTGAGGTAAGAAACAGCTCAAAAGTTCTTCATACTCCTCAATGTTGAAAGATCAACAAACTTAGTGAACATAGAAAGTTTTAGCTACAGGGTAATGAATGATGAAAATCCTGCTTCCCAACCATGTCATGTGGTAAAAAGTTGTGGGCTAATTTTTTTACTGTTCAATTACAAATGAGAGAGCAAATGTTTGGTTATCACAATAGATAGATATTTTCTTTCTTTTGGCAAACAATAGGTCTCAGTTTTCCTCTTAGTAGAGGTGGAAATTGGAATTTGGAAAGAGCAATAATGTAACTTTTTAGTAAAGAGCATTGCAATGATAAGATTATTTGCTTTGTATTGTTAAGTGCCAGCGTGCTTACATAGATCAGATCAAGAGGGTATCATTTGGTCCCTCAAATAAAG

mRNA sequence

TCTCAGTGTTGAAGGTTCCAACTAAAAAAATGCCAAATTAAAAAAACAACAAAATAAAATTACCAACTTTGGAAAATACTATAATATATGTAGAAAAAATTTGATTTGCCTTTTTTTTTTTTGGTGTTTATAAATAATAATAATAAAAAAAACTGAGCCGGATCGGACTTCGTCGTCTCTCCGGACGTAACGAGGGCGAAGGAAAGAGACTGTAATTACAGAGTATATATATTATACACATCGAATTTCAGAAATTGTTCCGGTGGCCAAGCCTGCGTTTCTCTTTTCGTTGACTTTGACTTTTCCATTGACGACTTCTTTTTTCTTGTATTTTCCTCGTTGAATTGAAATCTTCAGCAATGTGGCGCGTTTCCGCCATCTCGATTTCATCGCGGTTCTGCTCGATTCGGGAGTCGTAGAAAGACGGCATATCTTTCGCGGCGATGAATCGTGATTTTGCGCTCGTATTCGTGTTCTTCGTTCTCATTCTCGCCTCTCGTGGATCCGATGCTTCTTTCTCGGACCGTATTTGGAATCTTCATCATCGGTTCGCGCTTTCGAACGATTCTCCTAAGAGTATAGCTCCGGCCCCTGGTCCTAGCCCTGTTTCCAATGGTAAATTGAGTGGGGACGCCCCAAAAAGTTCTCCAACTCCTGCAATCCCGCCATTTTCCAGTTCAACTGATGGTTCTACTACAGAGAAGTGTAACTCGTCATACAACCCTTGCCACGACCTTAAGAATATGATTGCTTGCCTTATATTCGCAGAAAAAGCAGCGGTGGAACAATATCTTTTGATCCAAAATGTTGGAGAGACTTCTCTGAAAGTAAATGTTACAATTTCCGATGCTAAATACAAGGAGATACAAGTTCCTGAGCATCAATCCAAAAAGGTTAATATCTCAGATGTTCCAGGAAATTCAATGATCATATTAGATGCTGGAAATGGGAAGTGTATGATTCACGTAGGATCACCAGCAAAAAATGGCAGCATTTTTAAGCAGATCTCTTCCTATGTAACCCATTTAAACCTCGTATCCGGATCCTACCTACTATTTTCAATTGTTTTGATCATTGGAGGTGTCTGGGCATGCTGCAAGATGGGAACCAAGGAACGCCATGCTGATGGAATCCCATATCAGGAGCTTGAATTGGCAGAGCATGACTCTTCTCCAAACAACGATTTGGAAGCAGCCGAAGGATGGGATCAAGGCTGGGACGACGACTGGGACGAGCCGAAGCCACAAAATAAATCAAGTTCTCACACAAAGGCAAACGGATCATCAAACGGTATTAACTCTAGAACTTCTGATAGAGATGGATGGGGAAATGAATGGGACGATTGAGGTAAGAAACAGCTCAAAAGTTCTTCATACTCCTCAATGTTGAAAGATCAACAAACTTAGTGAACATAGAAAGTTTTAGCTACAGGGTAATGAATGATGAAAATCCTGCTTCCCAACCATGTCATGTGGTAAAAAGTTGTGGGCTAATTTTTTTACTGTTCAATTACAAATGAGAGAGCAAATGTTTGGTTATCACAATAGATAGATATTTTCTTTCTTTTGGCAAACAATAGGTCTCAGTTTTCCTCTTAGTAGAGGTGGAAATTGGAATTTGGAAAGAGCAATAATGTAACTTTTTAGTAAAGAGCATTGCAATGATAAGATTATTTGCTTTGTATTGTTAAGTGCCAGCGTGCTTACATAGATCAGATCAAGAGGGTATCATTTGGTCCCTCAAATAAAG

Coding sequence (CDS)

ATGAATCGTGATTTTGCGCTCGTATTCGTGTTCTTCGTTCTCATTCTCGCCTCTCGTGGATCCGATGCTTCTTTCTCGGACCGTATTTGGAATCTTCATCATCGGTTCGCGCTTTCGAACGATTCTCCTAAGAGTATAGCTCCGGCCCCTGGTCCTAGCCCTGTTTCCAATGGTAAATTGAGTGGGGACGCCCCAAAAAGTTCTCCAACTCCTGCAATCCCGCCATTTTCCAGTTCAACTGATGGTTCTACTACAGAGAAGTGTAACTCGTCATACAACCCTTGCCACGACCTTAAGAATATGATTGCTTGCCTTATATTCGCAGAAAAAGCAGCGGTGGAACAATATCTTTTGATCCAAAATGTTGGAGAGACTTCTCTGAAAGTAAATGTTACAATTTCCGATGCTAAATACAAGGAGATACAAGTTCCTGAGCATCAATCCAAAAAGGTTAATATCTCAGATGTTCCAGGAAATTCAATGATCATATTAGATGCTGGAAATGGGAAGTGTATGATTCACGTAGGATCACCAGCAAAAAATGGCAGCATTTTTAAGCAGATCTCTTCCTATGTAACCCATTTAAACCTCGTATCCGGATCCTACCTACTATTTTCAATTGTTTTGATCATTGGAGGTGTCTGGGCATGCTGCAAGATGGGAACCAAGGAACGCCATGCTGATGGAATCCCATATCAGGAGCTTGAATTGGCAGAGCATGACTCTTCTCCAAACAACGATTTGGAAGCAGCCGAAGGATGGGATCAAGGCTGGGACGACGACTGGGACGAGCCGAAGCCACAAAATAAATCAAGTTCTCACACAAAGGCAAACGGATCATCAAACGGTATTAACTCTAGAACTTCTGATAGAGATGGATGGGGAAATGAATGGGACGATTGA

Protein sequence

MNRDFALVFVFFVLILASRGSDASFSDRIWNLHHRFALSNDSPKSIAPAPGPSPVSNGKLSGDAPKSSPTPAIPPFSSSTDGSTTEKCNSSYNPCHDLKNMIACLIFAEKAAVEQYLLIQNVGETSLKVNVTISDAKYKEIQVPEHQSKKVNISDVPGNSMIILDAGNGKCMIHVGSPAKNGSIFKQISSYVTHLNLVSGSYLLFSIVLIIGGVWACCKMGTKERHADGIPYQELELAEHDSSPNNDLEAAEGWDQGWDDDWDEPKPQNKSSSHTKANGSSNGINSRTSDRDGWGNEWDD
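The protein sequence above should follow from the CDS by codon-by-codon translation. The sketch below checks this on the first and last few codons of the CDS, assuming the standard genetic code (NCBI translation table 1); `translate`, `cds_start` and `cds_end` are illustrative names introduced here, and the two fragments are copied from the start and end of the CDS listed above.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

cds_start = "ATGAATCGTGATTTTGCGCTC"   # first 7 codons of the CDS
cds_end = "AATGAATGGGACGATTGA"        # last 6 codons of the CDS, including the stop

print(translate(cds_start))  # MNRDFAL
print(translate(cds_end))    # NEWDD*
```

The two fragments translate to the first residues (MNRDFAL) and the last residues (NEWDD) of the listed protein, followed by the TGA stop codon.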
Homology
BLAST of Tan0016949 vs. NCBI nr
Match: XP_038886197.1 (uncharacterized protein LOC120076442 [Benincasa hispida])

HSP 1 Score: 485.3 bits (1248), Expect = 3.7e-133
Identity = 246/301 (81.73%), Positives = 263/301 (87.38%), Query Frame = 0

Query: 1   MNRDFALVFVFFVLILASRGSDA-SFSDRIWNLHHRFALSNDSPKSIAPAPGPSPVSNGK 60
           MNRD ALVF+FF+ IL S GSDA SF  RIWNLH RFALS D P+S+APAPGPS V NGK
Sbjct: 1   MNRDLALVFIFFLFILLSPGSDASSFPYRIWNLHRRFALSKDPPQSVAPAPGPSSVINGK 60

Query: 61  LSGDAPKSSPTPAIPPFSSSTDGSTTEKCNSSYNPCHDLKNMIACLIFAEKAAVEQYLLI 120
           LS  APKSSPTP IPPF SSTDG TTEKC+SS   CHDLKNM ACL+ AE+A +EQYLLI
Sbjct: 61  LSRGAPKSSPTPVIPPFPSSTDGFTTEKCDSSSKTCHDLKNMTACLLLAEQAVMEQYLLI 120

Query: 121 QNVGETSLKVNVTISDAKYKEIQVPEHQSKKVNISDVPGNSMIILDAGNGKCMIHVGSPA 180
           QN GETSLKVNV +SDAKYKEIQVPEH +KKVNISD PGNSMIILDAGNGKC++HVG   
Sbjct: 121 QNDGETSLKVNVIVSDAKYKEIQVPEHHAKKVNISDFPGNSMIILDAGNGKCIVHVGLLT 180

Query: 181 KNGSIFKQISSYVTHLNLVSGSYLLFSIVLIIGGVWACCKMGTKERHADGIPYQELELAE 240
           K+GSIFKQISSYVTHLN+VSGSYLLFSIVLIIGGVWACCKM TKERHADGIPYQELELAE
Sbjct: 181 KSGSIFKQISSYVTHLNIVSGSYLLFSIVLIIGGVWACCKMRTKERHADGIPYQELELAE 240

Query: 241 HDSSPNNDLEAAEGWDQGWDDDWDEPKPQNKSSSHTKANGSSNGINSRTSDRDGWGNEWD 300
           HDSSP NDLEAAEGWDQGWDDDWDE KP NKS S  KANGSSNGINSRTSDR+GW N+WD
Sbjct: 241 HDSSPTNDLEAAEGWDQGWDDDWDESKPANKSHSDMKANGSSNGINSRTSDRNGWENDWD 300
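The Identity and Positives percentages in each HSP header are ratios over the alignment length (gap columns included). The sketch below reproduces the figures reported for this hit and counts identities on the first ten aligned columns; `percent` and `count_identities` are hypothetical helper names, not BLAST functions.

```python
def percent(count, alignment_length):
    """Return count / alignment_length as a percentage, rounded to 2 decimals."""
    return round(100.0 * count / alignment_length, 2)

def count_identities(query, subject):
    """Count aligned columns where both rows carry the same residue."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    return sum(1 for q, s in zip(query, subject) if q == s and q != "-")

# Values taken from the HSP header above: 246 identities and
# 263 positives over an alignment of 301 columns.
print(percent(246, 301))  # 81.73
print(percent(263, 301))  # 87.38

# First ten aligned columns of the Query and Sbjct rows above.
print(count_identities("MNRDFALVFV", "MNRDLALVFI"))  # 8
```

Positives additionally count conservative substitutions (the `+` columns on the middle line), which depends on the scoring matrix and is not recomputed here.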

BLAST of Tan0016949 vs. NCBI nr
Match: KAG6578483.1 (hypothetical protein SDJN03_22931, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 469.9 bits (1208), Expect = 1.6e-128
Identity = 240/300 (80.00%), Positives = 258/300 (86.00%), Query Frame = 0

Query: 1   MNRDFALVFVFFVLILASRGSDASFSDRIWNLHHRFALSNDSPKSIAPAPGPSPVSNGKL 60
           MNRD  LVFVFFVLIL S GSDAS SDRIWNLH RF+LS DSP+ IAPAPGPS V NGKL
Sbjct: 295 MNRDLVLVFVFFVLILVSPGSDASLSDRIWNLHLRFSLSKDSPERIAPAPGPSSVINGKL 354

Query: 61  SGDAPKSSPTPAIPPFSSSTDGSTTEKCNSSYNPCHDLKNMIACLIFAEKAAVEQYLLIQ 120
            G  P SSPTPAIPPF SSTDG T EKC+ +   CHDLK M ACL FAE+A VEQYLLIQ
Sbjct: 355 IGGVPISSPTPAIPPFPSSTDGFTLEKCDRN-KTCHDLKKMTACLQFAEQAMVEQYLLIQ 414

Query: 121 NVGETSLKVNVTISDAKYKEIQVPEHQSKKVNISDVPGNSMIILDAGNGKCMIHVGSPAK 180
           N GETSLKVNV +SDAKYKE+QVPEH +KKVN+SD+P  S IILDAGNGKC+IHVGSP K
Sbjct: 415 NDGETSLKVNVIVSDAKYKEVQVPEHHAKKVNVSDIPETSTIILDAGNGKCVIHVGSPTK 474

Query: 181 NGSIFKQISSYVTHLNLVSGSYLLFSIVLIIGGVWACCKMGTKERHADGIPYQELELAEH 240
           NGSI KQ SSYVTHLNL+SGSYLLFSI+LIIGGVWACCKM TKERHA+GIPYQELELAE+
Sbjct: 475 NGSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTKERHANGIPYQELELAEN 534

Query: 241 DSSPNNDLEAAEGWDQGWDDDWDEPKPQNKSSSHTKANGSSNGINSRTSDRDGWGNEWDD 300
           DSSP NDLEAAEGWDQGWDDDWDE KP NKSSS  K NG SNGINSRTS+R+GWGN+WDD
Sbjct: 535 DSSPTNDLEAAEGWDQGWDDDWDESKPANKSSSDMKENG-SNGINSRTSERNGWGNDWDD 592

BLAST of Tan0016949 vs. NCBI nr
Match: XP_022938773.1 (uncharacterized protein LOC111444889 [Cucurbita moschata] >KAG7016047.1 hypothetical protein SDJN02_21151 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 469.9 bits (1208), Expect = 1.6e-128
Identity = 240/300 (80.00%), Positives = 258/300 (86.00%), Query Frame = 0

Query: 1   MNRDFALVFVFFVLILASRGSDASFSDRIWNLHHRFALSNDSPKSIAPAPGPSPVSNGKL 60
           MNRD  LVFVFFVLIL S GSDAS SDRIWNLH RF+LS DSP+ IAPAPGPS V NGKL
Sbjct: 1   MNRDLVLVFVFFVLILVSPGSDASLSDRIWNLHLRFSLSKDSPERIAPAPGPSSVINGKL 60

Query: 61  SGDAPKSSPTPAIPPFSSSTDGSTTEKCNSSYNPCHDLKNMIACLIFAEKAAVEQYLLIQ 120
            G  P SSPTPAIPPF SSTDG T EKC+ +   CHDLK M ACL FAE+A VEQYLLIQ
Sbjct: 61  IGGVPISSPTPAIPPFPSSTDGFTLEKCDRN-KTCHDLKKMTACLQFAEQAMVEQYLLIQ 120

Query: 121 NVGETSLKVNVTISDAKYKEIQVPEHQSKKVNISDVPGNSMIILDAGNGKCMIHVGSPAK 180
           N GETSLKVNV +SDAKYKE+QVPEH +KKVN+SD+P  S IILDAGNGKC+IHVGSP K
Sbjct: 121 NDGETSLKVNVIVSDAKYKEVQVPEHHAKKVNVSDIPETSTIILDAGNGKCVIHVGSPTK 180

Query: 181 NGSIFKQISSYVTHLNLVSGSYLLFSIVLIIGGVWACCKMGTKERHADGIPYQELELAEH 240
           NGSI KQ SSYVTHLNL+SGSYLLFSI+LIIGGVWACCKM TKERHA+GIPYQELELAE+
Sbjct: 181 NGSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTKERHANGIPYQELELAEN 240

Query: 241 DSSPNNDLEAAEGWDQGWDDDWDEPKPQNKSSSHTKANGSSNGINSRTSDRDGWGNEWDD 300
           DSSP NDLEAAEGWDQGWDDDWDE KP NKSSS  K NG SNGINSRTS+R+GWGN+WDD
Sbjct: 241 DSSPTNDLEAAEGWDQGWDDDWDESKPANKSSSDMKENG-SNGINSRTSERNGWGNDWDD 298

BLAST of Tan0016949 vs. NCBI nr
Match: XP_022992931.1 (uncharacterized protein LOC111489115 [Cucurbita maxima])

HSP 1 Score: 468.4 bits (1204), Expect = 4.7e-128
Identity = 239/300 (79.67%), Positives = 259/300 (86.33%), Query Frame = 0

Query: 1   MNRDFALVFVFFVLILASRGSDASFSDRIWNLHHRFALSNDSPKSIAPAPGPSPVSNGKL 60
           MNRD  LVFVFFVLIL S GS AS SDRIWNL  RF+LS DSP+ IAPAPGPS V NGKL
Sbjct: 1   MNRDLVLVFVFFVLILVSPGSGASLSDRIWNLRLRFSLSKDSPERIAPAPGPSSVINGKL 60

Query: 61  SGDAPKSSPTPAIPPFSSSTDGSTTEKCNSSYNPCHDLKNMIACLIFAEKAAVEQYLLIQ 120
            G  P SSPTPAIPPF SSTDG T+EKC+ +   CHDLK M ACL FAE+A VE+YLLIQ
Sbjct: 61  IGGVPISSPTPAIPPFPSSTDGFTSEKCDRN-KTCHDLKKMTACLQFAEQAMVEKYLLIQ 120

Query: 121 NVGETSLKVNVTISDAKYKEIQVPEHQSKKVNISDVPGNSMIILDAGNGKCMIHVGSPAK 180
           N GETSLKVNV +SDAKYKE+QVPEH++KKVN+SD+P  S IILDAGNGKC+IHVGSP K
Sbjct: 121 NDGETSLKVNVIVSDAKYKEVQVPEHRAKKVNVSDIPETSTIILDAGNGKCVIHVGSPTK 180

Query: 181 NGSIFKQISSYVTHLNLVSGSYLLFSIVLIIGGVWACCKMGTKERHADGIPYQELELAEH 240
           NGSI KQ SSYVTHLNL+SGSYLLFSI+LIIGGVWACCKM TKERHA+GIPYQELELAEH
Sbjct: 181 NGSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTKERHANGIPYQELELAEH 240

Query: 241 DSSPNNDLEAAEGWDQGWDDDWDEPKPQNKSSSHTKANGSSNGINSRTSDRDGWGNEWDD 300
           DSSP NDLEAAEGWDQGWDDDWDE KP NKSSS  KANG SNGINSRTS+R+GWGN+WDD
Sbjct: 241 DSSPTNDLEAAEGWDQGWDDDWDESKPANKSSSDMKANG-SNGINSRTSERNGWGNDWDD 298

BLAST of Tan0016949 vs. NCBI nr
Match: XP_023549795.1 (uncharacterized protein LOC111808188 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 467.2 bits (1201), Expect = 1.0e-127
Identity = 237/300 (79.00%), Positives = 257/300 (85.67%), Query Frame = 0

Query: 1   MNRDFALVFVFFVLILASRGSDASFSDRIWNLHHRFALSNDSPKSIAPAPGPSPVSNGKL 60
           MNRDF LV VFFVLIL S GSDAS +DRIWNLH RF+L  DSP+ IAPAPGPS V NGKL
Sbjct: 1   MNRDFVLVIVFFVLILVSPGSDASLTDRIWNLHLRFSLLKDSPERIAPAPGPSSVINGKL 60

Query: 61  SGDAPKSSPTPAIPPFSSSTDGSTTEKCNSSYNPCHDLKNMIACLIFAEKAAVEQYLLIQ 120
            G  P SSPTPAIPPF SSTDG T EKC+ +   CHDLK M ACL FAE+A VEQYLLIQ
Sbjct: 61  IGGVPISSPTPAIPPFPSSTDGFTLEKCDRN-KTCHDLKKMTACLQFAEQAMVEQYLLIQ 120

Query: 121 NVGETSLKVNVTISDAKYKEIQVPEHQSKKVNISDVPGNSMIILDAGNGKCMIHVGSPAK 180
           N GETSLKVNV +SDAKYK++QVPEH +KKVN+SD+P  S IILDAGNGKC+IHVGSP K
Sbjct: 121 NDGETSLKVNVIVSDAKYKDVQVPEHHAKKVNVSDIPETSTIILDAGNGKCVIHVGSPTK 180

Query: 181 NGSIFKQISSYVTHLNLVSGSYLLFSIVLIIGGVWACCKMGTKERHADGIPYQELELAEH 240
           NGSI KQ SSYVTHLNL+SGSYLLFSI+LIIGGVWACCKM TKERHA+GIPYQELELAEH
Sbjct: 181 NGSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTKERHANGIPYQELELAEH 240

Query: 241 DSSPNNDLEAAEGWDQGWDDDWDEPKPQNKSSSHTKANGSSNGINSRTSDRDGWGNEWDD 300
           DSSP NDLEAAEGWDQGWDDDWDE KP NKS+S  K NG SNGINSRTS+R+GWGN+WDD
Sbjct: 241 DSSPTNDLEAAEGWDQGWDDDWDESKPANKSNSDIKGNG-SNGINSRTSERNGWGNDWDD 298

BLAST of Tan0016949 vs. ExPASy TrEMBL
Match: A0A6J1FJV4 (uncharacterized protein LOC111444889 OS=Cucurbita moschata OX=3662 GN=LOC111444889 PE=4 SV=1)

HSP 1 Score: 469.9 bits (1208), Expect = 7.8e-129
Identity = 240/300 (80.00%), Positives = 258/300 (86.00%), Query Frame = 0

Query: 1   MNRDFALVFVFFVLILASRGSDASFSDRIWNLHHRFALSNDSPKSIAPAPGPSPVSNGKL 60
           MNRD  LVFVFFVLIL S GSDAS SDRIWNLH RF+LS DSP+ IAPAPGPS V NGKL
Sbjct: 1   MNRDLVLVFVFFVLILVSPGSDASLSDRIWNLHLRFSLSKDSPERIAPAPGPSSVINGKL 60

Query: 61  SGDAPKSSPTPAIPPFSSSTDGSTTEKCNSSYNPCHDLKNMIACLIFAEKAAVEQYLLIQ 120
            G  P SSPTPAIPPF SSTDG T EKC+ +   CHDLK M ACL FAE+A VEQYLLIQ
Sbjct: 61  IGGVPISSPTPAIPPFPSSTDGFTLEKCDRN-KTCHDLKKMTACLQFAEQAMVEQYLLIQ 120

Query: 121 NVGETSLKVNVTISDAKYKEIQVPEHQSKKVNISDVPGNSMIILDAGNGKCMIHVGSPAK 180
           N GETSLKVNV +SDAKYKE+QVPEH +KKVN+SD+P  S IILDAGNGKC+IHVGSP K
Sbjct: 121 NDGETSLKVNVIVSDAKYKEVQVPEHHAKKVNVSDIPETSTIILDAGNGKCVIHVGSPTK 180

Query: 181 NGSIFKQISSYVTHLNLVSGSYLLFSIVLIIGGVWACCKMGTKERHADGIPYQELELAEH 240
           NGSI KQ SSYVTHLNL+SGSYLLFSI+LIIGGVWACCKM TKERHA+GIPYQELELAE+
Sbjct: 181 NGSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTKERHANGIPYQELELAEN 240

Query: 241 DSSPNNDLEAAEGWDQGWDDDWDEPKPQNKSSSHTKANGSSNGINSRTSDRDGWGNEWDD 300
           DSSP NDLEAAEGWDQGWDDDWDE KP NKSSS  K NG SNGINSRTS+R+GWGN+WDD
Sbjct: 241 DSSPTNDLEAAEGWDQGWDDDWDESKPANKSSSDMKENG-SNGINSRTSERNGWGNDWDD 298

BLAST of Tan0016949 vs. ExPASy TrEMBL
Match: A0A6J1JUX9 (uncharacterized protein LOC111489115 OS=Cucurbita maxima OX=3661 GN=LOC111489115 PE=4 SV=1)

HSP 1 Score: 468.4 bits (1204), Expect = 2.3e-128
Identity = 239/300 (79.67%), Positives = 259/300 (86.33%), Query Frame = 0

Query: 1   MNRDFALVFVFFVLILASRGSDASFSDRIWNLHHRFALSNDSPKSIAPAPGPSPVSNGKL 60
           MNRD  LVFVFFVLIL S GS AS SDRIWNL  RF+LS DSP+ IAPAPGPS V NGKL
Sbjct: 1   MNRDLVLVFVFFVLILVSPGSGASLSDRIWNLRLRFSLSKDSPERIAPAPGPSSVINGKL 60

Query: 61  SGDAPKSSPTPAIPPFSSSTDGSTTEKCNSSYNPCHDLKNMIACLIFAEKAAVEQYLLIQ 120
            G  P SSPTPAIPPF SSTDG T+EKC+ +   CHDLK M ACL FAE+A VE+YLLIQ
Sbjct: 61  IGGVPISSPTPAIPPFPSSTDGFTSEKCDRN-KTCHDLKKMTACLQFAEQAMVEKYLLIQ 120

Query: 121 NVGETSLKVNVTISDAKYKEIQVPEHQSKKVNISDVPGNSMIILDAGNGKCMIHVGSPAK 180
           N GETSLKVNV +SDAKYKE+QVPEH++KKVN+SD+P  S IILDAGNGKC+IHVGSP K
Sbjct: 121 NDGETSLKVNVIVSDAKYKEVQVPEHRAKKVNVSDIPETSTIILDAGNGKCVIHVGSPTK 180

Query: 181 NGSIFKQISSYVTHLNLVSGSYLLFSIVLIIGGVWACCKMGTKERHADGIPYQELELAEH 240
           NGSI KQ SSYVTHLNL+SGSYLLFSI+LIIGGVWACCKM TKERHA+GIPYQELELAEH
Sbjct: 181 NGSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTKERHANGIPYQELELAEH 240

Query: 241 DSSPNNDLEAAEGWDQGWDDDWDEPKPQNKSSSHTKANGSSNGINSRTSDRDGWGNEWDD 300
           DSSP NDLEAAEGWDQGWDDDWDE KP NKSSS  KANG SNGINSRTS+R+GWGN+WDD
Sbjct: 241 DSSPTNDLEAAEGWDQGWDDDWDESKPANKSSSDMKANG-SNGINSRTSERNGWGNDWDD 298

BLAST of Tan0016949 vs. ExPASy TrEMBL
Match: A0A6J1C206 (uncharacterized protein LOC111006692 OS=Momordica charantia OX=3673 GN=LOC111006692 PE=4 SV=1)

HSP 1 Score: 450.3 bits (1157), Expect = 6.4e-123
Identity = 231/302 (76.49%), Positives = 254/302 (84.11%), Query Frame = 0

Query: 1   MNRDFALVFVFFVLILASRGSDASFSDRIWNLHHRFALSNDSPKSIAPAPGPSP--VSNG 60
           MN    L+F FF+LI   RGSDASF D     H RFALS  SP+S APAPGP P  V+N 
Sbjct: 1   MNPHLPLLFAFFLLI---RGSDASFPDD----HFRFALSEVSPQSKAPAPGPGPSSVTNR 60

Query: 61  KLSGDAPKSSPTPAIPPFSSSTDGSTTEKCNSSYNPCHDLKNMIACLIFAEKAAVEQYLL 120
           K SG APKSSPTPAIPPF+S  DG TTEKCNSSYN CHDL+NM ACL+FAE A VEQYLL
Sbjct: 61  KFSGGAPKSSPTPAIPPFTSLVDGFTTEKCNSSYNTCHDLENMTACLLFAEHAVVEQYLL 120

Query: 121 IQNVGETSLKVNVTISDAKYKEIQVPEHQSKKVNISDVPGNSMIILDAGNGKCMIHVGSP 180
           IQN GETS+KVN+ IS+AKYKEI++PEH +KKVNISDVPGNSMI L+AGNGKCMIHVG  
Sbjct: 121 IQNDGETSMKVNIIISNAKYKEIKIPEHHAKKVNISDVPGNSMITLEAGNGKCMIHVGLL 180

Query: 181 AKNGSIFKQISSYVTHLNLVSGSYLLFSIVLIIGGVWACCKMGTKERHADGIPYQELELA 240
            K+GSI K+IS Y+ HLNLVSGSYLLF+IVLIIGGVWACC MGTKERHADG+PYQELELA
Sbjct: 181 TKSGSILKKISFYLNHLNLVSGSYLLFAIVLIIGGVWACCNMGTKERHADGVPYQELELA 240

Query: 241 EHDSSPNNDLEAAEGWDQGWDDDWDEPKPQNKSSSHTKANGSSNGINSRTSDRDGWGNEW 300
           EHDSSP NDLEAAEGWDQGWDDDWDE K  NKSS+  KANGSSNG+NS+TSDRDGWGN+W
Sbjct: 241 EHDSSPTNDLEAAEGWDQGWDDDWDESKSTNKSSAQMKANGSSNGLNSKTSDRDGWGNDW 295

BLAST of Tan0016949 vs. ExPASy TrEMBL
Match: A0A1S3C4Q3 (uncharacterized protein LOC103496622 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103496622 PE=4 SV=1)

HSP 1 Score: 450.3 bits (1157), Expect = 6.4e-123
Identity = 229/300 (76.33%), Positives = 253/300 (84.33%), Query Frame = 0

Query: 1   MNRDFALVFVFFVLILASRGSDASFSDRIWNLHHRFALSNDSPKSIAPAPGPSPVSNGKL 60
           MNRD A +F+F +LIL S GSDASF +  WNLH RFA+S DS +S+AP PGP+ V NGKL
Sbjct: 1   MNRDLAFLFLFSLLILFSPGSDASFPNHFWNLHLRFAVSKDSLQSVAPTPGPNSVVNGKL 60

Query: 61  SGDAPKSSPTPAIPPFSSSTDGSTTEKCNSSYNPCHDLKNMIACLIFAEKAAVEQYLLIQ 120
           S  A  SS TPAIPP  +STDG TTEKC+SSY  CHDLK++ ACL+ AE+A VEQYLLIQ
Sbjct: 61  SRGATTSSATPAIPPSPNSTDGFTTEKCDSSYKTCHDLKDLSACLLSAEQAEVEQYLLIQ 120

Query: 121 NVGETSLKVNVTISDAKYKEIQVPEHQSKKVNISDVPGNSMIILDAGNGKCMIHVGSPAK 180
           N GETSLKVNV +SD KYKEIQVPEH +KKVNISD PGNSMIILDAGNGKC++HV S  K
Sbjct: 121 NDGETSLKVNVIVSDTKYKEIQVPEHHAKKVNISDFPGNSMIILDAGNGKCIVHVRSLTK 180

Query: 181 NGSIFKQISSYVTHLNLVSGSYLLFSIVLIIGGVWACCKMGTKERHADGIPYQELELAEH 240
           NGSIFKQISSYVTHLNLVSGSYLLFSIV IIGG+WACCKM TKERHA+GIPYQELELAEH
Sbjct: 181 NGSIFKQISSYVTHLNLVSGSYLLFSIVFIIGGIWACCKMKTKERHANGIPYQELELAEH 240

Query: 241 DSSPNNDLEAAEGWDQGWDDDWDEPKPQNKSSSHTKANGSSNGINSRTSDRDGWGNEWDD 300
           DSSP NDLEAAEGWDQGWDDDWDE KP N+SSS  KA    NGINS+TSDR+GW N+WDD
Sbjct: 241 DSSPTNDLEAAEGWDQGWDDDWDESKPANRSSSDMKA----NGINSKTSDRNGWENDWDD 296

BLAST of Tan0016949 vs. ExPASy TrEMBL
Match: A0A6J1HET4 (uncharacterized protein LOC111463332 OS=Cucurbita moschata OX=3662 GN=LOC111463332 PE=4 SV=1)

HSP 1 Score: 433.3 bits (1113), Expect = 8.1e-118
Identity = 225/293 (76.79%), Positives = 251/293 (85.67%), Query Frame = 0

Query: 1   MNRDFALVFVFFVLILASRGSDASFSDRIWNLHHRFALSNDSPKSIAPAPGPSPVSNGKL 60
           MNRD ALVFVFFVL LASRGS+ASFSDRIWNLH RFALS DSP+SIAPAP P+P      
Sbjct: 1   MNRDLALVFVFFVLNLASRGSNASFSDRIWNLHLRFALSKDSPQSIAPAPAPAP------ 60

Query: 61  SGDAPKS-SPTPAIPPFSSSTDGSTTEKCNSSYNPCHDLKNMIACLIFAEKAAVEQYLLI 120
               P S SPTPAIPPF SSTDGS+ +KC+ SYN CHDL+NM ACL+FAEKA +EQYLLI
Sbjct: 61  ---GPSSVSPTPAIPPFPSSTDGSSMKKCSPSYNTCHDLENMTACLLFAEKATMEQYLLI 120

Query: 121 QNVGETSLKVNVTISDAKYKEIQVPEHQSKKVNISDVPGNSMIILDAGNGKCMIHVGSPA 180
           QN G TSLKVNV ISD K+KEIQVPEHQS+KVNISDV GNS I+LDAG+GKC I +GS  
Sbjct: 121 QNAGGTSLKVNVIISDTKFKEIQVPEHQSRKVNISDVLGNSKIVLDAGSGKCEIQLGSLM 180

Query: 181 KNGSIFKQISSYVTHLNLVSGSYLLFSIVLIIGGVWACCKMGTKERHADGIPYQELELAE 240
           K+G+ F+QISSYVTHLNLVSGSY+L SIVLIIGGVWACCK+GTKERHAD +PYQELELAE
Sbjct: 181 KSGTTFEQISSYVTHLNLVSGSYILLSIVLIIGGVWACCKIGTKERHADEVPYQELELAE 240

Query: 241 HDSSPNNDLEAAEGWDQGWDDDW-DEPKPQNKSSSHTKANGSSNGINSRTSDR 292
            DSSP NDLEA+EGWDQGWDDDW DEPKP N+S+S TKA+GSSNGI+SRTS +
Sbjct: 241 QDSSPTNDLEASEGWDQGWDDDWDDEPKPANESNSQTKASGSSNGISSRTSGK 284

BLAST of Tan0016949 vs. TAIR 10
Match: AT3G51580.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15 growth stages; Has 1768 Blast hits to 1607 proteins in 294 species: Archae - 2; Bacteria - 552; Metazoa - 381; Fungi - 236; Plants - 306; Viruses - 38; Other Eukaryotes - 253 (source: NCBI BLink). )

HSP 1 Score: 155.6 bits (392), Expect = 6.2e-38
Identity = 98/273 (35.90%), Positives = 150/273 (54.95%), Query Frame = 0

Query: 39  SNDSPKSIA-PAPGPSPVSNGKLSGDAPKSSPTPAIPPF-----SSSTDGSTTEKCNSSY 98
           S DS K  A  AP P  + +GK   +  K SP  A  P        S++ ++ + C    
Sbjct: 120 SQDSGKLPANMAPPPKSLESGKNETEPGKESPPLAKDPAKGKDDKGSSESASVDTCVGKS 179

Query: 99  NPCHDLKNMIACLIFAEKAAVEQYLLIQNVGETSLKVNVTISDAKYKEIQVPEHQSKKVN 158
           N C    +++AC +  +K A    +L+QN GETSLK  + +     +E+ +P+HQS+KVN
Sbjct: 180 NICRTENSLVACTLSIDKGAANWLILVQNEGETSLKAKIVLPVNALQELTLPKHQSQKVN 239

Query: 159 ISDVPGNSMIILDAGNGKCMIHVGSPAKNGSIFKQISSYVTHLNLVSGSYLLFSIVLIIG 218
           IS     + IILD G G+C +H+  P++  ++     SY   +  ++G+Y L   V+I G
Sbjct: 240 ISISGDTNKIILDTGKGQCALHM-YPSEESTLPFHFPSYEKLVTPINGAYFLIVSVIIFG 299

Query: 219 GVWACCKMGTKERHADGIPYQELELAE----HDSSPNNDLEAAEGWDQGWDDDWDEPKPQ 278
           G+WA C      R   G+PY+ELEL+      + S  +D+E A+ WD+GWDDDWDE    
Sbjct: 300 GIWAFCLCRKNRRAGSGVPYRELELSGGPGLENESGVHDVETAD-WDEGWDDDWDENNAV 359

Query: 279 NKSSSHTKA-NGSSNGINSRTSDRDGWGNEWDD 301
               S  K+ + S+NG+ +R  +RDGW ++WDD
Sbjct: 360 KSPGSAAKSVSISANGLTARAPNRDGWDHDWDD 390

BLAST of Tan0016949 vs. TAIR 10
Match: AT3G51580.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages. )

HSP 1 Score: 143.7 bits (361), Expect = 2.5e-34
Identity = 98/293 (33.45%), Positives = 150/293 (51.19%), Query Frame = 0

Query: 39  SNDSPKSIA-PAPGPSPVSNGKLSGDAPKSSPTPAIPPF-----SSSTDGSTTEKCNSSY 98
           S DS K  A  AP P  + +GK   +  K SP  A  P        S++ ++ + C    
Sbjct: 120 SQDSGKLPANMAPPPKSLESGKNETEPGKESPPLAKDPAKGKDDKGSSESASVDTCVGKS 179

Query: 99  NPCHDLKNMIACLIFAEK--------------------AAVEQYLLIQNVGETSLKVNVT 158
           N C    +++AC +  +K                     A    +L+QN GETSLK  + 
Sbjct: 180 NICRTENSLVACTLSIDKGYETFLDIIVIPQQFARSLLCAANWLILVQNEGETSLKAKIV 239

Query: 159 ISDAKYKEIQVPEHQSKKVNISDVPGNSMIILDAGNGKCMIHVGSPAKNGSIFKQISSYV 218
           +     +E+ +P+HQS+KVNIS     + IILD G G+C +H+  P++  ++     SY 
Sbjct: 240 LPVNALQELTLPKHQSQKVNISISGDTNKIILDTGKGQCALHM-YPSEESTLPFHFPSYE 299

Query: 219 THLNLVSGSYLLFSIVLIIGGVWACCKMGTKERHADGIPYQELELAE----HDSSPNNDL 278
             +  ++G+Y L   V+I GG+WA C      R   G+PY+ELEL+      + S  +D+
Sbjct: 300 KLVTPINGAYFLIVSVIIFGGIWAFCLCRKNRRAGSGVPYRELELSGGPGLENESGVHDV 359

Query: 279 EAAEGWDQGWDDDWDEPKPQNKSSSHTKA-NGSSNGINSRTSDRDGWGNEWDD 301
           E A+ WD+GWDDDWDE        S  K+ + S+NG+ +R  +RDGW ++WDD
Sbjct: 360 ETAD-WDEGWDDDWDENNAVKSPGSAAKSVSISANGLTARAPNRDGWDHDWDD 410

The following BLAST results are available for this feature:
NCBI nr
Match Name        E-value     Identity   Description
XP_038886197.1    3.7e-133    81.73      uncharacterized protein LOC120076442 [Benincasa hispida]
KAG6578483.1      1.6e-128    80.00      hypothetical protein SDJN03_22931, partial [Cucurbita argyrosperma subsp. sorori...
XP_022938773.1    1.6e-128    80.00      uncharacterized protein LOC111444889 [Cucurbita moschata] >KAG7016047.1 hypothet...
XP_022992931.1    4.7e-128    79.67      uncharacterized protein LOC111489115 [Cucurbita maxima]
XP_023549795.1    1.0e-127    79.00      uncharacterized protein LOC111808188 [Cucurbita pepo subsp. pepo]

ExPASy TrEMBL
Match Name        E-value     Identity   Description
A0A6J1FJV4        7.8e-129    80.00      uncharacterized protein LOC111444889 OS=Cucurbita moschata OX=3662 GN=LOC1114448...
A0A6J1JUX9        2.3e-128    79.67      uncharacterized protein LOC111489115 OS=Cucurbita maxima OX=3661 GN=LOC111489115...
A0A6J1C206        6.4e-123    76.49      uncharacterized protein LOC111006692 OS=Momordica charantia OX=3673 GN=LOC111006...
A0A1S3C4Q3        6.4e-123    76.33      uncharacterized protein LOC103496622 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1HET4        8.1e-118    76.79      uncharacterized protein LOC111463332 OS=Cucurbita moschata OX=3662 GN=LOC1114633...

TAIR 10
Match Name        E-value     Identity   Description
AT3G51580.1       6.2e-38     35.90      unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G51580.2       2.5e-34     33.45      unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description    Source        Source Term     Source Description                          Alignment
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction                         coord: 43..88
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction                         coord: 73..88
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction                         coord: 270..290
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction                         coord: 237..299
None       No IPR available   PANTHER       PTHR34200       DENTIN SIALOPHOSPHOPROTEIN-LIKE ISOFORM X1  coord: 79..299
None       No IPR available   PANTHER       PTHR34200:SF2   TRANSMEMBRANE PROTEIN                       coord: 79..299
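Several of the mobidb-lite disorder predictions above overlap one another. One way to summarize them is to merge overlapping coordinate ranges, as in the sketch below; `merge_intervals` is an illustrative helper, and the code assumes the `coord` values are 1-based inclusive residue positions on the 300-residue protein, which the table does not state explicitly.

```python
def merge_intervals(intervals):
    """Merge overlapping (start, end) ranges with inclusive bounds."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous range: extend it if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged

# mobidb-lite coordinates copied from the InterPro table above.
disorder = [(43, 88), (73, 88), (270, 290), (237, 299)]

merged = merge_intervals(disorder)
covered = sum(end - start + 1 for start, end in merged)

print(merged)   # [(43, 88), (237, 299)]
print(covered)  # 109
```

Under these assumptions the four predictions collapse into two regions, 43..88 and 237..299, covering 109 residues in total.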

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Tan0016949.1    Tan0016949.1    mRNA
Tan0016949.2    Tan0016949.2    mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
cellular_component   GO:0016021       integral component of membrane