CmoCh14G005120 (gene) Cucurbita moschata (Rifu) v1

Overview
NameCmoCh14G005120
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu) v1)
DescriptionFilamentous hemagglutinin
LocationCmo_Chr14: 2530231 .. 2535578 (-)
RNA-Seq ExpressionCmoCh14G005120
SyntenyCmoCh14G005120
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CAGAAGTGGAAGTGATGAAATGGGCATAACCCATCTTCCACGGCTCTTCATTTAAGTGCGCAATTAGTGCCATCAATTCCCTTTTTCAATTTCCTTGTTCCCCCATTTCTCTGTTTTTGGCGCTTCTTCTCCATTTCTTACCTTTTCCCCTTCTACTGTCTTGTCATCTTCTTCCATTTTCCCTTTCGATTTATAAATTTTGGTGAAGAAAACGGATAGATGGATGGTTGATGGATGAGTAATAAACCCCCCTCTCTATATCCACATTTACTTCGAAGACTGTGGCTGTCATTTGTAGTTGTTGCGCCAGTTAAATCAGACCGAATCGCCCCCAATTTCTCTCTCTCTCTCTCTCTCTCTCTCTCGAACCCTAGATTTTCTTTTTGAATTTGGGTTCTGGGTTTGCGGTGGAGAAGCTCCACGAGGTGGGATTTTGAGCTTTTGTGGTTCTGGGATCGATTTGCTTTGAGTTTGAGGTGTAATGGGGGGAGATAATTTGGACCCGATTGAGGGAGGTGGCAATGGCGGTTGTTAACCCACTTCTCATGCATTGCCTCCATGGGAAAGAGCGAGGAAGAACAGCCGCTGCCGGTTGGAGTGAGCTCCTCCGAGCTTTCTGATTGGACTGTGCAGAGTAGATGTGGCGGCGGTGGGTGCTTTGCGATTCGTAGACTGATTGCTGTGAGATGTGTCTTCTTCCTGTTACTGTCGGCGGCTGTGTTTCTTTCTGCTATTTTTTGGCTGCCGCCGTTCCTTTCATACGGAGATTGGCCGGATCAGGCGGCTGATTCTACTTATAGAGGTAGGATCCTTCGTCAATTTTCTTTTTCTTTTTCTTTCATGATTTTCCTGTTCAGGCTTCCTGGGTTTTGAGTGTTTGTAGACCTAGAATCATGTTGGTGGGTGTTGGCTGGTGATATCAAATTATCATCCCCTTTTTGTTGGCTGGTGATATCAACTTTGATTGTTTTGAAGAGTAGAATGTTATGGATTTTTGAATGTTTGTAGCTGTTCTTGTGTTTAGATAGCTCCTCGTGGAAGAATTATAGTCTTGTTTTGTTGGATTCGATGGGATTGCGAGTAAGGAAACTCACTCTTGATTGATTCTGGAAGATTTGATCGATTAAATGAGAAGTTTATGATGAAAACTGACTCACTTTTCCATGTATCCATTTAGTTTCCCTGGATTCATGTGAATTCTGTTCATCAAGTTGCTGTTCATATGCATTAGAACTTTCCATTTTGATAATGTTTATAGATTTGTGTTATATTGAAGAACCCTTTTGGCTATCCGAAGGCTTTATGAGCTTGATTGTTCGTTTTTTTCCTCTCTCGGTATACAAACTCGACATGTTTACGCCCATTGATGCATGAAATGTACTGCCTAAAAGCTCATAGTGCTCTGGTTTTAAACGGATCGCTCTACTTTTGTACATTGGAATTTGAAAGAGCGATACATATAAACGATATGATCGTATAACTTAACTGAAATTAATTGCGTCGTAATCAGATGGGTTTAGAAGTTGGGTGATATCTTATAAACTCAATGAGTACTCCATTTGCTTGTTGGGTGTGTCTAGATTTGTTTGATTGTGGTGAACTTTGTGTTAGAAATTGTGAAGTTTTATTGGATTTTATCGATATCAGTTCGTCTTGGTTGAATTATTGTAACGCTATCATATCATGCTAGCCATCCCTTTCGTATATTATATAATCGTTTCGCTGAAGTTATATGATTCTATCCCGTTTCCAGATCATGAAATCGTAGCGTGTTTTCGTGCTCGGAAGCCAGTTCCTTTTCTGAAAAACCATATTTTTGAGCTTGAAGACAACATTTTTGGAGAAATTCCTGTTCCTTTTGTCAAGGTATTCAATCCAGTTTCTCGATTCATCTAGCTTGACTAGTCACGTTTTCGGTCGTTTGATCTGATACTCTGTGAATCAGGTGGCCGTCCTCTCGCTACAATCATTAGGCGGATCGAACGTAACAGATATCATTTTCTCCGTAGATCCTGATGCCAAGTATTCAAAAATTCCACCAACTTCTCAAAGTTTAATCAAGGAAACGTTTGAAACATTGGTTATAAACGACCCTCCTCTCAGATTGAACGCATCGTTATTCGGCAATACTTCGTTGTTCGAGGTGTTGAAATTTCCTGGTGGGATAACTATTATTCCTCCTCAGAGTGCATTTCTTCTGCAGACAGCACAGATCTATTTCAATTTTACGTTGAATTATTCTATCTATCAAATTCAAGTGAATTTCGACGATCTTACCAGCCAGCTGAGGTCGGGATTACGTCTATCTCGTTATGAGGTTTGGCTGTCTTCCTTACTATCTTCTGATGGATATTGTTTTTAGCTGTATAGATTTCACTGACAGCATTTTGTATGATGCAGAATTTATATGTTAGCCTATCGAACGAACGAGGTTCGACAATGCAGGCTCCTACTATTGTTCAGTCGTCTGTTCTGATGGCTATTGGGACGAATTCATCGAATCAAAGACTCAAACAGTTGGCTCAAACCATCACGAATTCTCATTCGGGAAATCTTGGCCTGAACAACACTGTTTTTGGCAAGGTCAAGCAGGTGCGTCTTTCGTCGGTCCTAAACCACTCTCTTAGTGGCGGTCAAGCACGGTCACCTTCACCTGCTCCTCTGCCTCATTCTCACCCTCACCACCACCACCACCACCACCACCACCACCACCACCACNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCCACCACCATCAACACCACCATCACCATCACCATCACCATCACCACCACCACCACCACCACCACCACCACCACCAGGACGCTACATATTCACCGAGTCCTGGAACAGAGGAGCACAAATATGCACCGAAGAATGGGATCTCATCAGCTCCTGAAGCTGGTTCATCCCCAGTGGAAAGTCCAGCTTCAAAGAAACGAAACTATGAAGCAACTCCGCCTGGTTTTCGATACGGATATAAAGGGTTGTCAGCAAAAGTCAGAAAACGATCTCATTTAGGCTCTATTCTGTCTCCAAGCAGTCCTCCATCGTCGCCATACTTACGAGTAGGCCTACCAGCACCGGTCACGGTTTCTATATCTGCTTCAAGTCCACTGCAAGGGGTAGCTCTATCTAATGTACAGCCTCCAGAAAAAGGCGACAGAAGTGCCCCTTCAGTCTTGCCACCACAATTTTCTTGTGAGTATGCCATCCTTCATTCTCACATTGTTTCATAGCTCCATCAAGAAACTATTAGTAATGGTTCGAATATACCGACGCGTAATGGAACGGACTCTGATTAGAACCTACTAGAGGAATTTCTGCATCCGTTTTTAACAGATTCAGATAGGCGAAGAAGCCAGACTGGTTAGGGAGTTTGTAGTTTTTGAAAATTAAGTTTATAAGCCAGACTGATCGTGTTGTAATCCGATATGTTGATTTGCAGTTTCTGTAGGCGTTCGTGTTCATACAATTCGATGGACACTCGGGCTGTTTCTTGTTGTATGGCATGTATAACCAAGGAGATAGAACCTACATGCGTACTTCTGGGTAACAACAGGACTCGTAATCGATATCAGAGTTGTGATAGCGATAGCGAGACCGACGCAAAGGCGTGCTCCTGCTAGAGTTGATATTAATGTAAATATGAGATGAAGCAAGTTATTAGGAGATGCATTTTTCCAGGTCAAAGTCACAGAGGTGGCAGGCCTTGTGTTGTTTTTCTTCTGCAGAAAATGTAAAGTAGAGAAGAAATCAGCAAATGGATCTTGTTCTCAACTTCTTAATCAACCAATCACAATTTTTCACCCCCGTTTTTTTAATTTTAATATCCCCGTCTCATATTGTCGGTCTCTTTGATCCTTTGGCAACGTTCATCTGTGTCAACGTTCATCCTGT

mRNA sequence

CAGAAGTGGAAGTGATGAAATGGGCATAACCCATCTTCCACGGCTCTTCATTTAAGTGCGCAATTAGTGCCATCAATTCCCTTTTTCAATTTCCTTGTTCCCCCATTTCTCTGTTTTTGGCGCTTCTTCTCCATTTCTTACCTTTTCCCCTTCTACTGTCTTGTCATCTTCTTCCATTTTCCCTTTCGATTTATAAATTTTGGTGAAGAAAACGGATAGATGGATGGTTGATGGATGAGTAATAAACCCCCCTCTCTATATCCACATTTACTTCGAAGACTGTGGCTGTCATTTGTAGTTGTTGCGCCAGTTAAATCAGACCGAATCGCCCCCAATTTCTCTCTCTCTCTCTCTCTCTCTCTCTCGAACCCTAGATTTTCTTTTTGAATTTGGGTTCTGGGTTTGCGGTGGAGAAGCTCCACGAGGTGGGATTTTGAGCTTTTGTGGTTCTGGGATCGATTTGCTTTGAGTTTGAGGTGTAATGGGGGGAGATAATTTGGACCCGATTGAGGGAGGTGGCAATGGCGGTTGTTAACCCACTTCTCATGCATTGCCTCCATGGGAAAGAGCGAGGAAGAACAGCCGCTGCCGGTTGGAGTGAGCTCCTCCGAGCTTTCTGATTGGACTGTGCAGAGTAGATGTGGCGGCGGTGGGTGCTTTGCGATTCGTAGACTGATTGCTGTGAGATGTGTCTTCTTCCTGTTACTGTCGGCGGCTGTGTTTCTTTCTGCTATTTTTTGGCTGCCGCCGTTCCTTTCATACGGAGATTGGCCGGATCAGGCGGCTGATTCTACTTATAGAGATCATGAAATCGTAGCGTGTTTTCGTGCTCGGAAGCCAGTTCCTTTTCTGAAAAACCATATTTTTGAGCTTGAAGACAACATTTTTGGAGAAATTCCTGTTCCTTTTGTCAAGGTGGCCGTCCTCTCGCTACAATCATTAGGCGGATCGAACGTAACAGATATCATTTTCTCCGTAGATCCTGATGCCAAGTATTCAAAAATTCCACCAACTTCTCAAAGTTTAATCAAGGAAACGTTTGAAACATTGGTTATAAACGACCCTCCTCTCAGATTGAACGCATCGTTATTCGGCAATACTTCGTTGTTCGAGGTGTTGAAATTTCCTGGTGGGATAACTATTATTCCTCCTCAGAGTGCATTTCTTCTGCAGACAGCACAGATCTATTTCAATTTTACGTTGAATTATTCTATCTATCAAATTCAAGTGAATTTCGACGATCTTACCAGCCAGCTGAGGTCGGGATTACGTCTATCTCGTTATGAGAATTTATATGTTAGCCTATCGAACGAACGAGGTTCGACAATGCAGGCTCCTACTATTGTTCAGTCGTCTGTTCTGATGGCTATTGGGACGAATTCATCGAATCAAAGACTCAAACAGTTGGCTCAAACCATCACGAATTCTCATTCGGGAAATCTTGGCCTGAACAACACTGTTTTTGGCAAGGTCAAGCAGGACGCTACATATTCACCGAGTCCTGGAACAGAGGAGCACAAATATGCACCGAAGAATGGGATCTCATCAGCTCCTGAAGCTGGTTCATCCCCAGTGGAAAGTCCAGCTTCAAAGAAACGAAACTATGAAGCAACTCCGCCTGGTTTTCGATACGGATATAAAGGGTTGTCAGCAAAAGTCAGAAAACGATCTCATTTAGGCTCTATTCTGTCTCCAAGCAGTCCTCCATCGTCGCCATACTTACGAGTAGGCCTACCAGCACCGGTCACGGTTTCTATATCTGCTTCAAGTCCACTGCAAGGGGTAGCTCTATCTAATGTACAGCCTCCAGAAAAAGGCGACAGAAGTGCCCCTTCAGTCTTGCCACCACAATTTTCTTTTTCTGTAGGCGTTCGTGTTCATACAATTCGATGGACACTCGGGCTGTTTCTTGTTGTATGGCATGTATAACCAAGGAGATAGAACCTACATGCGTACTTCTGGGTAACAACAGGACTCGTAATCGATATCAGAGTTGTGATAGCGATAGCGAGACCGACGCAAAGGCGTGCTCCTGCTAGAGTTGATATTAATGTAAATATGAGATGAAGCAAGTTATTAGGAGATGCATTTTTCCAGGTCAAAGTCACAGAGGTGGCAGGCCTTGTGTTGTTTTTCTTCTGCAGAAAATGTAAAGTAGAGAAGAAATCAGCAAATGGATCTTGTTCTCAACTTCTTAATCAACCAATCACAATTTTTCACCCCCGTTTTTTTAATTTTAATATCCCCGTCTCATATTGTCGGTCTCTTTGATCCTTTGGCAACGTTCATCTGTGTCAACGTTCATCCTGT

Coding sequence (CDS)

ATGGGAAAGAGCGAGGAAGAACAGCCGCTGCCGGTTGGAGTGAGCTCCTCCGAGCTTTCTGATTGGACTGTGCAGAGTAGATGTGGCGGCGGTGGGTGCTTTGCGATTCGTAGACTGATTGCTGTGAGATGTGTCTTCTTCCTGTTACTGTCGGCGGCTGTGTTTCTTTCTGCTATTTTTTGGCTGCCGCCGTTCCTTTCATACGGAGATTGGCCGGATCAGGCGGCTGATTCTACTTATAGAGATCATGAAATCGTAGCGTGTTTTCGTGCTCGGAAGCCAGTTCCTTTTCTGAAAAACCATATTTTTGAGCTTGAAGACAACATTTTTGGAGAAATTCCTGTTCCTTTTGTCAAGGTGGCCGTCCTCTCGCTACAATCATTAGGCGGATCGAACGTAACAGATATCATTTTCTCCGTAGATCCTGATGCCAAGTATTCAAAAATTCCACCAACTTCTCAAAGTTTAATCAAGGAAACGTTTGAAACATTGGTTATAAACGACCCTCCTCTCAGATTGAACGCATCGTTATTCGGCAATACTTCGTTGTTCGAGGTGTTGAAATTTCCTGGTGGGATAACTATTATTCCTCCTCAGAGTGCATTTCTTCTGCAGACAGCACAGATCTATTTCAATTTTACGTTGAATTATTCTATCTATCAAATTCAAGTGAATTTCGACGATCTTACCAGCCAGCTGAGGTCGGGATTACGTCTATCTCGTTATGAGAATTTATATGTTAGCCTATCGAACGAACGAGGTTCGACAATGCAGGCTCCTACTATTGTTCAGTCGTCTGTTCTGATGGCTATTGGGACGAATTCATCGAATCAAAGACTCAAACAGTTGGCTCAAACCATCACGAATTCTCATTCGGGAAATCTTGGCCTGAACAACACTGTTTTTGGCAAGGTCAAGCAGGACGCTACATATTCACCGAGTCCTGGAACAGAGGAGCACAAATATGCACCGAAGAATGGGATCTCATCAGCTCCTGAAGCTGGTTCATCCCCAGTGGAAAGTCCAGCTTCAAAGAAACGAAACTATGAAGCAACTCCGCCTGGTTTTCGATACGGATATAAAGGGTTGTCAGCAAAAGTCAGAAAACGATCTCATTTAGGCTCTATTCTGTCTCCAAGCAGTCCTCCATCGTCGCCATACTTACGAGTAGGCCTACCAGCACCGGTCACGGTTTCTATATCTGCTTCAAGTCCACTGCAAGGGGTAGCTCTATCTAATGTACAGCCTCCAGAAAAAGGCGACAGAAGTGCCCCTTCAGTCTTGCCACCACAATTTTCTTTTTCTGTAGGCGTTCGTGTTCATACAATTCGATGGACACTCGGGCTGTTTCTTGTTGTATGGCATGTATAA

Protein sequence

MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGEIPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNTVFGKVKQDATYSPSPGTEEHKYAPKNGISSAPEAGSSPVESPASKKRNYEATPPGFRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQGVALSNVQPPEKGDRSAPSVLPPQFSFSVGVRVHTIRWTLGLFLVVWHV
Homology
BLAST of CmoCh14G005120 vs. ExPASy TrEMBL
Match: A0A6J1F409 (uncharacterized protein LOC111441963 OS=Cucurbita moschata OX=3662 GN=LOC111441963 PE=4 SV=1)

HSP 1 Score: 863.6 bits (2230), Expect = 3.7e-247
Identity = 456/521 (87.52%), Postives = 456/521 (87.52%), Query Frame = 0

Query: 1   MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIF 60
           MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIF
Sbjct: 1   MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIF 60

Query: 61  WLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGEIPVPFVKV 120
           WLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGEIPVPFVKV
Sbjct: 61  WLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGEIPVPFVKV 120

Query: 121 AVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGN 180
           AVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGN
Sbjct: 121 AVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGN 180

Query: 181 TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLS 240
           TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLS
Sbjct: 181 TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLS 240

Query: 241 RYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT 300
           RYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT
Sbjct: 241 RYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT 300

Query: 301 VFGKVK------------------------------------------------------ 360
           VFGKVK                                                      
Sbjct: 301 VFGKVKQVRLSSVLNHSLSGGQARSPSPAPLPHSHPHHHHHHHHHHHHHHQHHHHHHHHH 360

Query: 361 -----------QDATYSPSPGTEEHKYAPKNGISSAPEAGSSPVESPASKKRNYEATPPG 420
                      QDATYSPSPGTEEHKYAPKNGISSAPEAGSSPVESPASKKRNYEATPPG
Sbjct: 361 HHHHHHHHHHHQDATYSPSPGTEEHKYAPKNGISSAPEAGSSPVESPASKKRNYEATPPG 420

Query: 421 FRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQGVALSNVQ 457
           FRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQGVALSNVQ
Sbjct: 421 FRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQGVALSNVQ 480

BLAST of CmoCh14G005120 vs. ExPASy TrEMBL
Match: A0A6J1J074 (uncharacterized protein LOC111482272 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111482272 PE=4 SV=1)

HSP 1 Score: 845.9 bits (2184), Expect = 7.9e-242
Identity = 449/529 (84.88%), Postives = 451/529 (85.26%), Query Frame = 0

Query: 1   MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIF 60
           MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIF
Sbjct: 1   MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIF 60

Query: 61  WLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGEIPVPFVKV 120
           WLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGEIPVPFVKV
Sbjct: 61  WLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGEIPVPFVKV 120

Query: 121 AVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGN 180
           AVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGN
Sbjct: 121 AVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGN 180

Query: 181 TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLS 240
           TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNF+DLTSQLRSGLRLS
Sbjct: 181 TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFNDLTSQLRSGLRLS 240

Query: 241 RYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT 300
           RYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT
Sbjct: 241 RYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT 300

Query: 301 VFGKVK------------------------------------------------------ 360
           VFGKVK                                                      
Sbjct: 301 VFGKVKQVRLSSVLNHSLSGGQARSPSPAPLPHSHPHHPHHHHHHHHHHHQHHHQHHHHH 360

Query: 361 -------------------QDATYSPSPGTEEHKYAPKNGISSAPEAGSSPVESPASKKR 420
                              QDA YSPSPGTEEHK+APKNGISSAPEAGSSPVESPASKKR
Sbjct: 361 HHHHHHHHHHHHHHHQHHHQDAAYSPSPGTEEHKHAPKNGISSAPEAGSSPVESPASKKR 420

Query: 421 NYEATPPGFRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQ 457
           NYEATPPGFRYGYKGLS KVRKRSHLGSI SPSSPPSSPYLRVGLPAPVTVSISASSPL 
Sbjct: 421 NYEATPPGFRYGYKGLSTKVRKRSHLGSIPSPSSPPSSPYLRVGLPAPVTVSISASSPLP 480

BLAST of CmoCh14G005120 vs. ExPASy TrEMBL
Match: A0A6J1J390 (uncharacterized protein LOC111482272 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111482272 PE=4 SV=1)

HSP 1 Score: 835.1 bits (2156), Expect = 1.4e-238
Identity = 448/551 (81.31%), Postives = 450/551 (81.67%), Query Frame = 0

Query: 1   MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIF 60
           MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIF
Sbjct: 1   MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIF 60

Query: 61  WLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGEIPVPFVKV 120
           WLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGEIPVPFVKV
Sbjct: 61  WLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGEIPVPFVKV 120

Query: 121 AVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGN 180
           AVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGN
Sbjct: 121 AVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGN 180

Query: 181 TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLS 240
           TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNF+DLTSQLRSGLRLS
Sbjct: 181 TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFNDLTSQLRSGLRLS 240

Query: 241 RYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT 300
           RYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT
Sbjct: 241 RYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT 300

Query: 301 VFGKVKQ----------------------------------------------------- 360
           VFGKVKQ                                                     
Sbjct: 301 VFGKVKQVRLSSVLNHSLSGGQARSPSPAPLPHSHPHHPHHHHHHHHHHHQHHHQHHHHH 360

Query: 361 ------------------------------------------DATYSPSPGTEEHKYAPK 420
                                                      A YSPSPGTEEHK+APK
Sbjct: 361 HHHHHHHHHHHHHHHQHHHQDAAYSPSPGTEEHKHAPKNGISSAAYSPSPGTEEHKHAPK 420

Query: 421 NGISSAPEAGSSPVESPASKKRNYEATPPGFRYGYKGLSAKVRKRSHLGSILSPSSPPSS 457
           NGISSAPEAGSSPVESPASKKRNYEATPPGFRYGYKGLS KVRKRSHLGSI SPSSPPSS
Sbjct: 421 NGISSAPEAGSSPVESPASKKRNYEATPPGFRYGYKGLSTKVRKRSHLGSIPSPSSPPSS 480

BLAST of CmoCh14G005120 vs. ExPASy TrEMBL
Match: A0A0A0LHD1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G819880 PE=4 SV=1)

HSP 1 Score: 719.5 bits (1856), Expect = 8.6e-204
Identity = 385/511 (75.34%), Postives = 413/511 (80.82%), Query Frame = 0

Query: 1   MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIF 60
           MGKSEEEQPLPVG SSSELSD  V++RCGGGGC  IRRLIAVRCVFFLLLSAAVFLSAIF
Sbjct: 1   MGKSEEEQPLPVGGSSSELSDRNVENRCGGGGCSEIRRLIAVRCVFFLLLSAAVFLSAIF 60

Query: 61  WLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGEIPVPFVKV 120
           WLPPFLSYG+WPD+  DS YRDH+IVA F A KPVPFL+ HIFELEDNIFGEIP+P VKV
Sbjct: 61  WLPPFLSYGNWPDRPVDSAYRDHDIVASFHASKPVPFLQKHIFELEDNIFGEIPIPSVKV 120

Query: 121 AVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGN 180
           A+LSLQSLGG NVT I+F+VD DAKYSKIPPTSQSLIKETFETLVIN+PPLRLN SLFGN
Sbjct: 121 AILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGN 180

Query: 181 TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLS 240
           TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDL+SQLRSGLRLS
Sbjct: 181 TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLS 240

Query: 241 RYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTN--SSNQRLKQLAQTITNSHSGNLGLN 300
            YENLYVSLSNERGST+ APT+VQSSVLMAIGTN  SS QRLKQLA TITNSHSGNLGLN
Sbjct: 241 PYENLYVSLSNERGSTIDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGLN 300

Query: 301 NTVFGKVKQ----------------------------------------------DATYS 360
           NTVFGKVKQ                                              DA YS
Sbjct: 301 NTVFGKVKQVRLSFLNHSLGGGGNARSPSPAPLPHSHHHRHHHHHHHHHHHHHHRDAAYS 360

Query: 361 PSPGTEEHKYAPKNGISSAPEAGSSPVESPASKKRNYEATPPGFRYGYKGLSAKVRKRSH 420
           PSPGTEEHK+APKNG+SSAPEAGSSP+E P S+KRNYEATPP FRYGYK    K+RK  +
Sbjct: 361 PSPGTEEHKHAPKNGVSSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSLTKLRKH-N 420

Query: 421 LGSILSPSSPPSSPYLRVGLPAPVTVSISASSPLQGVALSNVQPPEKG-------DRSAP 457
           LG I SPSS PSSPYLRVG PAPV+ SISASSPL GV LSNVQPP  G       +RS+P
Sbjct: 421 LGPIPSPSSSPSSPYLRVGQPAPVSDSISASSPLSGVVLSNVQPPNTGSGHAENFERSSP 480

BLAST of CmoCh14G005120 vs. ExPASy TrEMBL
Match: A0A1S3B8E9 (uncharacterized protein LOC103487165 OS=Cucumis melo OX=3656 GN=LOC103487165 PE=4 SV=1)

HSP 1 Score: 713.8 bits (1841), Expect = 4.7e-202
Identity = 387/544 (71.14%), Postives = 414/544 (76.10%), Query Frame = 0

Query: 1   MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIF 60
           MGKSEEEQPLPVGVSSSELSD  V++RCGGGGC  IR+LIAVRCVFFLLLSAAVFLSAIF
Sbjct: 1   MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIF 60

Query: 61  WLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGEIPVPFVKV 120
           WLPPFLSYG+WPD+  DS YRDH+IVA F A KPVPFL+NHIFELEDNIFGEIP+P VKV
Sbjct: 61  WLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQNHIFELEDNIFGEIPIPSVKV 120

Query: 121 AVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGN 180
           A+LSLQSL G NVT I+F+VD DAKYSKIPPTSQSLIKETFETLVIN+PPLRLN SLFGN
Sbjct: 121 AILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGN 180

Query: 181 TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLS 240
           TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDL+SQLRSGLRLS
Sbjct: 181 TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLS 240

Query: 241 RYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTN--SSNQRLKQLAQTITNSHSGNLGLN 300
            YENLYVSLSNERGSTM APT+VQSSVLMAIGTN  SS QRLKQLA TITNSHSGNLGLN
Sbjct: 241 PYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGLN 300

Query: 301 NTVFGKVK---------------------------------------------------- 360
           NTVFGKVK                                                    
Sbjct: 301 NTVFGKVKQVRLSFLNHSLGGGGNAWSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHHHH 360

Query: 361 ---------------------------QDATYSPSPGTEEHKYAPKNGISSAPEAGSSPV 420
                                      Q A YSPSPGTEEHK+APKNG+SSAPEAGSSP+
Sbjct: 361 HHHHHHHHHHHHRHHHHHHHHHHHNHHQHAAYSPSPGTEEHKHAPKNGVSSAPEAGSSPM 420

Query: 421 ESPASKKRNYEATPPGFRYGYKGLSAKVRKRSHLGSILSPSSPPSSPYLRVGLPAPVTVS 457
           E P S+KRNYEATPP FRYGYK  S K+RK+ HLG I SPSS P SPYLRVGLPAPV+ S
Sbjct: 421 EGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLRVGLPAPVSDS 480

BLAST of CmoCh14G005120 vs. TAIR 10
Match: AT3G56590.2 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 326.2 bits (835), Expect = 4.1e-89
Identity = 217/482 (45.02%), Postives = 276/482 (57.26%), Query Frame = 0

Query: 1   MGKSE-EEQPLPVGVSSSELSDWTVQSRCGGGG-------CFAIRRLIAVRCVFFLLLSA 60
           MGK+  EEQ LPV       SD    +R  GGG       C  I    ++RCV  L  SA
Sbjct: 1   MGKNTVEEQNLPV-------SDGAASARNNGGGGISTCCCCDWISSYFSLRCVLILAFSA 60

Query: 61  AVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGE 120
           AVFLSA+FWLPPFL + D  D   D  ++DH IVA F   KP+ F+++++ +LE++I  E
Sbjct: 61  AVFLSALFWLPPFLGFADPGDLDLDPRFKDHRIVASFDVGKPISFMEDNLMQLENDITDE 120

Query: 121 IPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLR 180
           I  P  KV VL+L+ LG  N T +IF++DP+ + SKIP   +SLIK  FETLV      R
Sbjct: 121 ISFPMTKVVVLALERLGDLNRTMVIFAIDPEKENSKIPAEIESLIKAAFETLVQKQLSFR 180

Query: 181 LNASLFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQ 240
           L  SLFG    FEVLKFPGGIT+IPPQ  F LQ AQ+ FNFTLN+SIYQIQ NF++L SQ
Sbjct: 181 LTESLFGEPFFFEVLKFPGGITVIPPQPIFPLQKAQLLFNFTLNFSIYQIQSNFEELASQ 240

Query: 241 LRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHS 300
           L+ G+ L+ YENLY++LSN RGST+  PTIV SSVL+  G++S   RLKQLAQTIT+SHS
Sbjct: 241 LKKGINLASYENLYITLSNSRGSTVAPPTIVHSSVLLTFGSSS---RLKQLAQTITSSHS 300

Query: 301 GNLGLNNTVFGKVKQ-------------DATYSPSPGTEEHKY--------------APK 360
            NLGLN+TVFGKVKQ              +T SPSP  E H+Y              AP+
Sbjct: 301 KNLGLNHTVFGKVKQVRLSSILPHSPATSSTPSPSPQPETHQYPHHHPHHHHHHHELAPE 360

Query: 361 NGISSAPEAGSSPVESPASKKRNYEATP--PGFRYGYKGLSAKVRKRSHLGSILSPSSPP 420
             + S P  G +P  +P          P  P  +   KG SA     +H  +  +P+   
Sbjct: 361 PSL-SPPTKGFAPASAPTKHSPLPPRNPPCPYEQRRPKGNSA----LNHHTAPPTPAPHR 420

Query: 421 SSPYLRVGLPAPVT-VSISASSPLQGVALSNVQPPEKGD-------RSAPSVLPPQFSFS 438
           S P+     PAP    +I  SSPL  V  +++ PP K           +PS  P   S S
Sbjct: 421 SQPHPPAPNPAPPRHHAIPVSSPLPHVVFAHIPPPSKSSPESEPTGEKSPSPAPTPSSAS 467

BLAST of CmoCh14G005120 vs. TAIR 10
Match: AT3G56590.1 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 322.8 bits (826), Expect = 4.5e-88
Identity = 211/457 (46.17%), Postives = 268/457 (58.64%), Query Frame = 0

Query: 1   MGKSE-EEQPLPVGVSSSELSDWTVQSRCGGGG-------CFAIRRLIAVRCVFFLLLSA 60
           MGK+  EEQ LPV       SD    +R  GGG       C  I    ++RCV  L  SA
Sbjct: 1   MGKNTVEEQNLPV-------SDGAASARNNGGGGISTCCCCDWISSYFSLRCVLILAFSA 60

Query: 61  AVFLSAIFWLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGE 120
           AVFLSA+FWLPPFL + D  D   D  ++DH IVA F   KP+ F+++++ +LE++I  E
Sbjct: 61  AVFLSALFWLPPFLGFADPGDLDLDPRFKDHRIVASFDVGKPISFMEDNLMQLENDITDE 120

Query: 121 IPVPFVKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLR 180
           I  P  KV VL+L+ LG  N T +IF++DP+ + SKIP   +SLIK  FETLV      R
Sbjct: 121 ISFPMTKVVVLALERLGDLNRTMVIFAIDPEKENSKIPAEIESLIKAAFETLVQKQLSFR 180

Query: 181 LNASLFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQ 240
           L  SLFG    FEVLKFPGGIT+IPPQ  F LQ AQ+ FNFTLN+SIYQIQ NF++L SQ
Sbjct: 181 LTESLFGEPFFFEVLKFPGGITVIPPQPIFPLQKAQLLFNFTLNFSIYQIQSNFEELASQ 240

Query: 241 LRSGLRLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHS 300
           L+ G+ L+ YENLY++LSN RGST+  PTIV SSVL+  G++S   RLKQLAQTIT+SHS
Sbjct: 241 LKKGINLASYENLYITLSNSRGSTVAPPTIVHSSVLLTFGSSS---RLKQLAQTITSSHS 300

Query: 301 GNLGLNNTVFGKVKQ-------------DATYSPSPGTEEHKY--------------APK 360
            NLGLN+TVFGKVKQ              +T SPSP  E H+Y              AP+
Sbjct: 301 KNLGLNHTVFGKVKQVRLSSILPHSPATSSTPSPSPQPETHQYPHHHPHHHHHHHELAPE 360

Query: 361 NGISSAPEAGSSPVESPASKKRNYEATP--PGFRYGYKGLSAKVRKRSHLGSILSPSSPP 420
             + S P  G +P  +P          P  P  +   KG SA     +H  +  +P+   
Sbjct: 361 PSL-SPPTKGFAPASAPTKHSPLPPRNPPCPYEQRRPKGNSA----LNHHTAPPTPAPHR 420

BLAST of CmoCh14G005120 vs. TAIR 10
Match: AT3G10810.1 (zinc finger (C3HC4-type RING finger) family protein )

HSP 1 Score: 298.9 bits (764), Expect = 6.9e-81
Identity = 206/501 (41.12%), Postives = 277/501 (55.29%), Query Frame = 0

Query: 1   MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGCFAIRRLIAVRCVFFLLLSAAVFLSAIF 60
           MGK+E++  L V    +        +RC  G C  I   +  +C+F LLLS A+FLSA+F
Sbjct: 1   MGKTEDDVSLRVAGGEATGDSTVRNARC--GCCKWISSFVGFKCLFVLLLSVALFLSALF 60

Query: 61  WLPPFLSYGDWPDQAADSTYRDHEIVACFRARKPVPFLKNHIFELEDNIFGEIPVPFVKV 120
            L PF    D  D   D  +R H IVA F   +   FL  +  +L+++IF E+    +KV
Sbjct: 61  LLLPFPM--DREDSNLDPRFRGHAIVASFSINRSASFLNENTLQLQNDIFQEMSYISIKV 120

Query: 121 AVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASLFGN 180
            +L+++     N+T ++F +DPD  Y +I P S S IKE FE+++IN   L+L  SLFG 
Sbjct: 121 TILAVEPSDELNITKVVFGIDPDTGYREILPLSLSSIKEMFESVLINQSTLQLTKSLFGE 180

Query: 181 TSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGLRLS 240
           T LFEVLKFPGGIT+IPPQSAF LQ  +I FNFTLNYSI+QIQ+NF+ L SQL++GL L+
Sbjct: 181 TFLFEVLKFPGGITVIPPQSAFPLQKFKIVFNFTLNYSIHQIQINFNTLASQLKNGLNLA 240

Query: 241 RYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGLNNT 300
            YENLYVSLSN  GST+  PT V SSVL+ +GT++S+ RLKQL  TIT S S NLGLNNT
Sbjct: 241 PYENLYVSLSNSEGSTVSPPTTVHSSVLLRVGTSNSSPRLKQLTDTITGSRSKNLGLNNT 300

Query: 301 VFGKVKQ------------DATYSPSPGT---------------------EEHKYAPKNG 360
           +FGKVKQ             +T SPSP                         H +   + 
Sbjct: 301 IFGKVKQVRLSSFLPNSSDSSTKSPSPSPSPHSKHHHHHHHHHHHHHHHHHNHHHHHHHN 360

Query: 361 ISSAPEAGSSPVESPA---SKKRNYEATP---PGFRYGYKGLSAKVRKRSHLGSILSPSS 420
           +S       SPV SPA   S+KR   A P   PG R  +K       KR    S  +P+ 
Sbjct: 361 LSPKMAPEVSPVASPAPHRSRKRAPSAPPPCNPGNRVHFK------EKRVQFSSTPAPAP 420

Query: 421 PPSSPYLRVGLPAPVTVS----ISASSPLQGVALSN-VQPP--EKGDRSAPSVL--PPQF 454
              +P+ ++  PAP++ +    +  S+PL  V  ++  QPP  E  +  A  V    PQ 
Sbjct: 421 SAGAPHHQLHSPAPISAAKSHIVPISAPLPHVVFAHAAQPPITEPREPHANEVAHPQPQS 480

BLAST of CmoCh14G005120 vs. TAIR 10
Match: AT1G10790.1 (BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT3G56590.2); Has 78 Blast hits to 78 proteins in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 78; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 154.1 bits (388), Expect = 2.8e-37
Identity = 114/315 (36.19%), Postives = 164/315 (52.06%), Query Frame = 0

Query: 1   MGKSEEEQPLPVGVSSSELSDWTVQSRCGGGGC-FAIRRLIAVRCVFFLLLSAAVFLSAI 60
           M K  +E  L +   + +L +     R  G  C  A  RL+ +RC+  L+LS A+ LSAI
Sbjct: 1   MKKHSKENALALQQETLDLENPESSPRSSGRSCSSAFSRLVGLRCLIVLVLSCAILLSAI 60

Query: 61  FWLPPFLSYGDWPDQAADSTYR-DHEIVACFRARKPVPFLKNHIFELEDNIFGEIPVP-F 120
           FWL P  S  ++    AD T + +  + A FR +KPV  +  H  ++E +I   I +   
Sbjct: 61  FWLFPRRSVSEF---KADGTVKLNASVQASFRLQKPVSEVVRHKGKIEHDILRSIGLSNN 120

Query: 121 VKVAVLSLQSLGGSNVTDIIFSVDPDAKYSKIPPTSQSLIKETFETLVINDPPLRLNASL 180
            KV VLSL   G SN TD+ F+V P     +I   S SL++ +F  L      L+L  S 
Sbjct: 121 SKVTVLSLNQSGASNYTDVEFAVLPVPPDHEISKHSLSLLRSSFVKLFAKRSKLKLTTSG 180

Query: 181 FGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLTSQLRSGL 240
           FG  + F+VLKFPGGIT+ P + A +   A + F+ T+  SI  +Q   D L       L
Sbjct: 181 FGKPTSFQVLKFPGGITVDPLEPAPVSGVALVLFSVTIKTSISTVQDRLDLLNGLFEHML 240

Query: 241 RLSRYENLYVSLSNERGSTMQAPTIVQSSVLMAIGTNSSNQRLKQLAQTITNSHSGNLGL 300
            L  YE+++  L+N++GST+  P   Q  V   +     +QRL    Q I  S + NLGL
Sbjct: 241 SLEPYESVHFQLTNKQGSTISPPLTFQVYVAFTM-RKYLHQRLNHFTQIIQTSRAKNLGL 300

Query: 301 NNTVFGKVKQDATYS 313
           +  VFG+VK D T+S
Sbjct: 301 DEAVFGEVK-DITFS 310

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A6J1F4093.7e-24787.52uncharacterized protein LOC111441963 OS=Cucurbita moschata OX=3662 GN=LOC1114419... [more]
A0A6J1J0747.9e-24284.88uncharacterized protein LOC111482272 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1J3901.4e-23881.31uncharacterized protein LOC111482272 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A0A0LHD18.6e-20475.34Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G819880 PE=4 SV=1[more]
A0A1S3B8E94.7e-20271.14uncharacterized protein LOC103487165 OS=Cucumis melo OX=3656 GN=LOC103487165 PE=... [more]
Match NameE-valueIdentityDescription
AT3G56590.24.1e-8945.02hydroxyproline-rich glycoprotein family protein [more]
AT3G56590.14.5e-8846.17hydroxyproline-rich glycoprotein family protein [more]
AT3G10810.16.9e-8141.12zinc finger (C3HC4-type RING finger) family protein [more]
AT1G10790.12.8e-3736.19BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein fam... [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 307..352
NoneNo IPR availablePANTHERPTHR33826F20B24.21coord: 1..307
NoneNo IPR availablePANTHERPTHR33826:SF2HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILY PROTEINcoord: 1..307

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh14G005120.1CmoCh14G005120.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane