CsGy3G017170 (gene) Cucumber (Gy14) v2

Name: CsGy3G017170
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v2)
Description: Hydroxyproline-rich glycoprotein family protein isoform 1
Location: Chr3 : 13207549 .. 13210381 (-)
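The location value can be read as a sequence identifier, a start, an end, and a strand. The sketch below parses that string and reports the span length. It assumes the coordinates are 1-based and inclusive, which this record does not state; `Location` and `parse_location` are hypothetical names used only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Inclusive interval, so add 1 to the difference.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a value such as 'Chr3 : 13207549 .. 13210381 (-)'."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(m.group(1), int(m.group(2)), int(m.group(3)), m.group(4))


loc = parse_location("Chr3 : 13207549 .. 13210381 (-)")
print(loc.seqid, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption the feature spans 2833 bases on the minus strand of Chr3.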
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCAAGTATCAACAACAGCGTCGATACGGTTAATGCTGCCGCTACTGCCATCGTTTCTGCTGAGGCTCGAGTTCAGCCTACCACACCTCCGGTACGTTTATTTTTTTTTCTTTATTTGTTTTCTGGTTTGATTTTTATTTGGAACTGAGAAAAAGTGCGGTAATTTGATTGAGAAATGGAGTTTTTGATCTCGGTGTTGGTTTGCTGTTTTCAGTTTAGATGAGGAAAATCTCATCAAGGTTGATGATCTAAACGATTTTTTTTTTTAATTTTTAATTTGAGAAAACCGTGTTGGTTACGCGTTTTTGTTTCTTGATTCTAAGGCTTTGATTTGGTACTTTCTGCTTCGGTTGGTGGTGTTTTTTTTTTTTTTTTTTTTTTTGGTGTTGCGCTTTTGTGAAGACGTTGAATCTTAATTTCACAAAAAGTTGTTGCTTAATTAGTCTAGCTGGTAAACGGTTGAAGATAAAGCAACTTGAAAAGTTTCGATTAATCTGCTTTTAGAACAGAAAGGAAGGGGAACATAAGGTTGTTCGGTAATTAATTACAAGTGCGATTTGCTCGATTCTGTGCAATAACTTCCTTCGGAACTTGGATTCATTCGGTGTAAGATTTCTCATACAGTGGGACTTGAATCAATTTAGTTAGCGATCTTATGGACTTGTCATCAGATTTGTTGAATTAGTTGTTTTTGATAATGGATCTTGTAGCGCTGTACAGTGTGTCAATTTATTCGGCTCTTTTGGGCTTTAGTTTTCTTCTCTTAAGGTTGAGTTTGGTCTGGGTTGAGGATTCCTTTGATGGGGACAGACAAAAATTTGCGTCTCTTTAGAGGAAGATTCTATGCTCCTGAAGGTGTTCCGCTTGAGCTATCATACGCTCTTATCACAGAGCAATTGACTGCCAAAATCTTTAGTCTTAATCTTACGACCTCTTTAAATAGATCGATCATCGACCTCAACGGGTGGCTCAAAACTGAGATGCCGTACGGTAGTTAGCTTTCACTTGTAAAACTTTCAACTCTGTTTACCGCTTTATGCAAAAATTTCTGAAACCGTTTGGTTTTTTTTAGCATATGAACACAACATGAATTGTTTACTAACCAATTTAGCCAAGAAACATTAAAAGTTTGGATTTTGAATTGATGAAGAAGGGTTTTCTGTTCCCACTCTCTTCCGTTCTGTCTATTCTCTATTGCCTGACATGAGAGGTGAGGAAGTAACGTAATGTTTTGTTGTTAACACGACATAGTCGATGATTATGAGCTTCTTACCGGAGGTTTTACCTGCTGAAGTTCTTAGACTTCTGAATAGTTGTTATTAGTACTGCTCTGATGGCTTGAATAATGATTCTGTTTTAGCTAGATTGAAGTCTCTGTTCTGTTTCCTGTTGATCTTTTCATTTACATTTTCTTGTTGTCTTAACTACAGAAACGAAGATGGGGTAGCTGCTGGAGTCTGTACTGGTGCTTTGGCATTGGTTCACAGAAAAGCAATAAACGTATAGGTCATGCTGTACTAGTTCCTGAACCTGCAGTACCAGGAGCCGTTGCCCCTGCTGTTGAGCATCGAACACCTTCAACCACCATGGTATTACCTTTCATTGCCCCTCCATCTTCTCCTGCATCTTTCCTCCAGTCCGAACCTACATCAAATACTCAATCTCCTGCTGGATTACTATCCTTAACTGCTCTTTCAGTCAATAACTACTCCCCAAATGGACCTGCTTCCATTTTTGCAATAGGCCCTTATACATATGACACTCAGTTGGTCTCACCTCCAGTTTTTTCTGCCTTCACCACTGAACCATCAACCGCTCCTATTACTCCTCCTCCTGAGTCTGTTCAACTGACTACACCCTCATCTCCTGAAGTTCCATTTGCTAAATTGCTGACATCTTCTCTAAGCCATACTAATAAAAGTTTTGGGACTAACCAAAAGTTCACGCTATCACACTGTGATTTCCAGCCTTATCAACCCTACCCAGGAAGCCCTGGTG
CTCATCTTATATCACCTGGATCAGTAATTTCAAACTCTGGTACATCTTCTCCTTTTCCTGATAAACACCCCATTCTTGAGTTCCGCATGGCAGATGCACCGAAGCTCTTGGGTCTTGAACATTTTACAACTCGCAAATGGATCTCAAGAATGGGTTCTGGATCTTTGACTCCAGATGGTACCGGTTTATGTTCTAGGTTAGGTTCAGGAACTTTGACTCCTGATGGTATGGGAATGGGTTCTAGATTGGGATCTGGATCTGTTACCCCAAATGGTATGAGGCAAGATTCAAGATTGGGTTCTGGAACCTTGACGCCTGATGGTCTGGGCCATGGCTTGCAAGATAGTCCATTGTTGGACAACCAAATATCTGAGGTGGCTTCTCTTGCCAACTCAGAAACTGGATGCCAAAATGATGTGACAAATCATAGGGTGTCATTTGAGTTAACTGGTGAAGATGTTGCACGCTGTCTTGCAAATAAGTCATTGACATCCATTAGAACTGAATCTGAGTCTCCGAAGCAAACAAGCACAAGCAATCAAAACGAAAACAAAGAATCATCCAGAGAAGCTGAAACTTGCGAGTTCTTTGACATCAAGACTTCCGCAGCACCAGAAAAAACTCCAGGAGAGGATGATCAATGCTACCAAAATCAGCGAGCTGTAACTCTTGGTTCATTCAAAGAGTTCAACTTTGACCAAACTAAAGGAGAAATACACAACACAGCCTCCATCGGTGCAGAATGGTGGGCCAATGAGAAAGTGGGTGTGAAGGAAGCTAGTCCAGGTAACAACTGGACCTTCTTCCCATTGTTGCAACCTGGCGTCAGCTGA

mRNA sequence

ATGGCAAGTATCAACAACAGCGTCGATACGGTTAATGCTGCCGCTACTGCCATCGTTTCTGCTGAGGCTCGAGTTCAGCCTACCACACCTCCGAAACGAAGATGGGGTAGCTGCTGGAGTCTGTACTGGTGCTTTGGCATTGGTTCACAGAAAAGCAATAAACGTATAGGTCATGCTGTACTAGTTCCTGAACCTGCAGTACCAGGAGCCGTTGCCCCTGCTGTTGAGCATCGAACACCTTCAACCACCATGGTATTACCTTTCATTGCCCCTCCATCTTCTCCTGCATCTTTCCTCCAGTCCGAACCTACATCAAATACTCAATCTCCTGCTGGATTACTATCCTTAACTGCTCTTTCAGTCAATAACTACTCCCCAAATGGACCTGCTTCCATTTTTGCAATAGGCCCTTATACATATGACACTCAGTTGGTCTCACCTCCAGTTTTTTCTGCCTTCACCACTGAACCATCAACCGCTCCTATTACTCCTCCTCCTGAGTCTGTTCAACTGACTACACCCTCATCTCCTGAAGTTCCATTTGCTAAATTGCTGACATCTTCTCTAAGCCATACTAATAAAAGTTTTGGGACTAACCAAAAGTTCACGCTATCACACTGTGATTTCCAGCCTTATCAACCCTACCCAGGAAGCCCTGGTGCTCATCTTATATCACCTGGATCAGTAATTTCAAACTCTGGTACATCTTCTCCTTTTCCTGATAAACACCCCATTCTTGAGTTCCGCATGGCAGATGCACCGAAGCTCTTGGGTCTTGAACATTTTACAACTCGCAAATGGATCTCAAGAATGGGTTCTGGATCTTTGACTCCAGATGGTACCGGTTTATGTTCTAGGTTAGGTTCAGGAACTTTGACTCCTGATGGTATGGGAATGGGTTCTAGATTGGGATCTGGATCTGTTACCCCAAATGGTATGAGGCAAGATTCAAGATTGGGTTCTGGAACCTTGACGCCTGATGGTCTGGGCCATGGCTTGCAAGATAGTCCATTGTTGGACAACCAAATATCTGAGGTGGCTTCTCTTGCCAACTCAGAAACTGGATGCCAAAATGATGTGACAAATCATAGGGTGTCATTTGAGTTAACTGGTGAAGATGTTGCACGCTGTCTTGCAAATAAGTCATTGACATCCATTAGAACTGAATCTGAGTCTCCGAAGCAAACAAGCACAAGCAATCAAAACGAAAACAAAGAATCATCCAGAGAAGCTGAAACTTGCGAGTTCTTTGACATCAAGACTTCCGCAGCACCAGAAAAAACTCCAGGAGAGGATGATCAATGCTACCAAAATCAGCGAGCTGTAACTCTTGGTTCATTCAAAGAGTTCAACTTTGACCAAACTAAAGGAGAAATACACAACACAGCCTCCATCGGTGCAGAATGGTGGGCCAATGAGAAAGTGGGTGTGAAGGAAGCTAGTCCAGGTAACAACTGGACCTTCTTCCCATTGTTGCAACCTGGCGTCAGCTGA

Coding sequence (CDS)

ATGGCAAGTATCAACAACAGCGTCGATACGGTTAATGCTGCCGCTACTGCCATCGTTTCTGCTGAGGCTCGAGTTCAGCCTACCACACCTCCGAAACGAAGATGGGGTAGCTGCTGGAGTCTGTACTGGTGCTTTGGCATTGGTTCACAGAAAAGCAATAAACGTATAGGTCATGCTGTACTAGTTCCTGAACCTGCAGTACCAGGAGCCGTTGCCCCTGCTGTTGAGCATCGAACACCTTCAACCACCATGGTATTACCTTTCATTGCCCCTCCATCTTCTCCTGCATCTTTCCTCCAGTCCGAACCTACATCAAATACTCAATCTCCTGCTGGATTACTATCCTTAACTGCTCTTTCAGTCAATAACTACTCCCCAAATGGACCTGCTTCCATTTTTGCAATAGGCCCTTATACATATGACACTCAGTTGGTCTCACCTCCAGTTTTTTCTGCCTTCACCACTGAACCATCAACCGCTCCTATTACTCCTCCTCCTGAGTCTGTTCAACTGACTACACCCTCATCTCCTGAAGTTCCATTTGCTAAATTGCTGACATCTTCTCTAAGCCATACTAATAAAAGTTTTGGGACTAACCAAAAGTTCACGCTATCACACTGTGATTTCCAGCCTTATCAACCCTACCCAGGAAGCCCTGGTGCTCATCTTATATCACCTGGATCAGTAATTTCAAACTCTGGTACATCTTCTCCTTTTCCTGATAAACACCCCATTCTTGAGTTCCGCATGGCAGATGCACCGAAGCTCTTGGGTCTTGAACATTTTACAACTCGCAAATGGATCTCAAGAATGGGTTCTGGATCTTTGACTCCAGATGGTACCGGTTTATGTTCTAGGTTAGGTTCAGGAACTTTGACTCCTGATGGTATGGGAATGGGTTCTAGATTGGGATCTGGATCTGTTACCCCAAATGGTATGAGGCAAGATTCAAGATTGGGTTCTGGAACCTTGACGCCTGATGGTCTGGGCCATGGCTTGCAAGATAGTCCATTGTTGGACAACCAAATATCTGAGGTGGCTTCTCTTGCCAACTCAGAAACTGGATGCCAAAATGATGTGACAAATCATAGGGTGTCATTTGAGTTAACTGGTGAAGATGTTGCACGCTGTCTTGCAAATAAGTCATTGACATCCATTAGAACTGAATCTGAGTCTCCGAAGCAAACAAGCACAAGCAATCAAAACGAAAACAAAGAATCATCCAGAGAAGCTGAAACTTGCGAGTTCTTTGACATCAAGACTTCCGCAGCACCAGAAAAAACTCCAGGAGAGGATGATCAATGCTACCAAAATCAGCGAGCTGTAACTCTTGGTTCATTCAAAGAGTTCAACTTTGACCAAACTAAAGGAGAAATACACAACACAGCCTCCATCGGTGCAGAATGGTGGGCCAATGAGAAAGTGGGTGTGAAGGAAGCTAGTCCAGGTAACAACTGGACCTTCTTCCCATTGTTGCAACCTGGCGTCAGCTGA

Protein sequence

MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMGSRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETGCQNDVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSNQNENKESSREAETCEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS
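The protein sequence corresponds to a translation of the coding sequence. As a minimal sketch, assuming the standard genetic code, the function below translates a CDS codon by codon and stops at the first stop codon. `translate` is a hypothetical helper, and only the first 30 bases and the last 12 bases of the CDS listed above are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the CDS shown in this record.
print(translate("ATGGCAAGTATCAACAACAGCGTCGATACG"))
print(translate("GGCGTCAGCTGA"))
```

The two fragments translate to the start (`MASINNSVDT`) and the end (`GVS`, followed by a stop codon) of the protein sequence above.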
BLAST of CsGy3G017170 vs. NCBI nr
Match: XP_004140832.2 (PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101210841 [Cucumis sativus])

HSP 1 Score: 840.5 bits (2170), Expect = 3.0e-240
Identity = 495/497 (99.60%), Positives = 495/497 (99.60%), Query Frame = 0

Query: 1   MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV 60
           MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV
Sbjct: 1   MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV 60

Query: 61  LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALS 120
           LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALS
Sbjct: 61  LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALS 120

Query: 121 VNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVP 180
           VNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVP
Sbjct: 121 VNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVP 180

Query: 181 FAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP 240
           FAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP
Sbjct: 181 FAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP 240

Query: 241 DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLXXXXXXXXXXXXXXXXX 300
           DKHPILEFRMADAPKLLGLEHFTTRKWI RMGSGSLTPDGTGL XXXXXXXXXXXXXXXX
Sbjct: 241 DKHPILEFRMADAPKLLGLEHFTTRKWIXRMGSGSLTPDGTGLCXXXXXXXXXXXXXXXX 300

Query: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHGLQDSPLLDNQISEVASLANSETGCQNDV 360
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHGLQDSPLLDNQISEVASLANSETGCQNDV
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHGLQDSPLLDNQISEVASLANSETGCQNDV 360

Query: 361 TNHRVSFELTGEDVARCLANKSLTSIRTEXXXXXXXXXXXXXXXXXXXREAETCEFFDIK 420
           TNHRVSFELTGEDVARCLANKSLTSIRTEXXXXXXXXXXXXXXXXXXXREAETCEFFDIK
Sbjct: 361 TNHRVSFELTGEDVARCLANKSLTSIRTEXXXXXXXXXXXXXXXXXXXREAETCEFFDIK 420

Query: 421 TSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEA 480
           TSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEA
Sbjct: 421 TSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEA 480

Query: 481 SPGNNWTFFPLLQPGVS 498
           SPGNNWTFFPLLQPGVS
Sbjct: 481 SPGNNWTFFPLLQPGVS 497
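The percentages on each HSP summary line are the match counts divided by the alignment length. The sketch below, with a hypothetical `hsp_percentages` helper, extracts the counts from such a line and recomputes both values to two decimals. The regular expression is an assumption about the line layout seen in this record, and the sample line is taken from the Cucumis melo hit in this record.

```python
import re


def hsp_percentages(line: str) -> tuple[str, str]:
    """Recompute identity and positives percentages from an HSP summary line."""
    m = re.search(
        r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)", line
    )
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return (
        f"{100 * ident / ident_len:.2f}%",
        f"{100 * pos / pos_len:.2f}%",
    )


line = "Identity = 486/497 (97.79%), Positives = 488/497 (98.19%), Query Frame = 0"
print(hsp_percentages(line))
```

For this line the recomputed values agree with the percentages printed in parentheses.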

BLAST of CsGy3G017170 vs. NCBI nr
Match: XP_008439268.1 (PREDICTED: uncharacterized protein LOC103484098 [Cucumis melo])

HSP 1 Score: 832.8 bits (2150), Expect = 6.2e-238
Identity = 486/497 (97.79%), Positives = 488/497 (98.19%), Query Frame = 0

Query: 1   MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV 60
           M SINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQK+NKRIGHAV
Sbjct: 1   MGSINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAV 60

Query: 61  LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALS 120
           LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQS PTSNTQSPAGLLSLTALS
Sbjct: 61  LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSGPTSNTQSPAGLLSLTALS 120

Query: 121 VNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVP 180
           VNNYSPNGPASIFAIGPY YDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVP
Sbjct: 121 VNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVP 180

Query: 181 FAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP 240
           FAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP
Sbjct: 181 FAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP 240

Query: 241 DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLXXXXXXXXXXXXXXXXX 300
           DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL XXXXXXXXXXXXXXXX
Sbjct: 241 DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGXXXXXXXXXXXXXXXX 300

Query: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHGLQDSPLLDNQISEVASLANSETGCQNDV 360
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHGLQDSPLLDNQISEVASLANSETGCQNDV
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHGLQDSPLLDNQISEVASLANSETGCQNDV 360

Query: 361 TNHRVSFELTGEDVARCLANKSLTSIRTEXXXXXXXXXXXXXXXXXXXREAETCEFFDIK 420
           TNHRVSFELTGEDVARCLANKSLTSIRTE XXXXXXXXXXXXXXX   REAETCEFFDIK
Sbjct: 361 TNHRVSFELTGEDVARCLANKSLTSIRTESXXXXXXXXXXXXXXXELSREAETCEFFDIK 420

Query: 421 TSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEA 480
           TS APEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGE+HNTASIGAEWWANEKVGVKEA
Sbjct: 421 TSMAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEVHNTASIGAEWWANEKVGVKEA 480

Query: 481 SPGNNWTFFPLLQPGVS 498
           SPGNNWTFFPLLQPGVS
Sbjct: 481 SPGNNWTFFPLLQPGVS 497

BLAST of CsGy3G017170 vs. NCBI nr
Match: XP_022141198.1 (uncharacterized protein LOC111011654 [Momordica charantia])

HSP 1 Score: 779.2 bits (2011), Expect = 8.1e-222
Identity = 447/499 (89.58%), Positives = 458/499 (91.78%), Query Frame = 0

Query: 1   MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV 60
           M S+NNSVDTVNAAATAIVSAEARVQP TP KRRWG CWSLYWCFGIGSQK+NKRIGHAV
Sbjct: 1   MGSMNNSVDTVNAAATAIVSAEARVQPPTPSKRRWGCCWSLYWCFGIGSQKNNKRIGHAV 60

Query: 61  LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALS 120
           LVPEP VPG VAP VEHRTPSTTMVLPFIAPPSSPASFLQS+P+SN QSPAGLLSLTALS
Sbjct: 61  LVPEPVVPGTVAPVVEHRTPSTTMVLPFIAPPSSPASFLQSDPSSNAQSPAGLLSLTALS 120

Query: 121 VNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVP 180
           VNNYS NGPASIFAIGPY Y+TQLVSPPVFSAF TEPSTAP TPPPESVQLTTPSSPEVP
Sbjct: 121 VNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVP 180

Query: 181 FAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP 240
           FAKLLTSSLSHTNKSFGTNQKF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP
Sbjct: 181 FAKLLTSSLSHTNKSFGTNQKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP 240

Query: 241 DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLXXXXXXXXXXXXXXXXX 300
           DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL XXXXXXXXXXXXXXXX
Sbjct: 241 DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGXXXXXXXXXXXXXXXX 300

Query: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHGLQDSPLLDNQISEVASLANSETGCQNDV 360
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX+ LQD  LLDNQISEVASLANSE+GCQNDV
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNALQDGSLLDNQISEVASLANSESGCQNDV 360

Query: 361 TNHRVSFELTGEDVARCLANKSLTSIRTE-XXXXXXXXXXXXXXXXXXXREAETCEFFDI 420
           TNHRVSFELTGEDVARCLANKS+ SIRTE                    REAETCEFFDI
Sbjct: 361 TNHRVSFELTGEDVARCLANKSMASIRTESESSEQQTSSKYQSENKGSSREAETCEFFDI 420

Query: 421 KTSAAPEKTP-GEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVK 480
           KTS APEK+P GEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKV VK
Sbjct: 421 KTSTAPEKSPAGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVAVK 480

Query: 481 EASPGNNWTFFPLLQPGVS 498
           EA+PGNNWTFFP+LQPGVS
Sbjct: 481 EANPGNNWTFFPMLQPGVS 499

BLAST of CsGy3G017170 vs. NCBI nr
Match: XP_022984784.1 (uncharacterized protein LOC111482967 [Cucurbita maxima])

HSP 1 Score: 771.2 bits (1990), Expect = 2.2e-219
Identity = 450/498 (90.36%), Positives = 460/498 (92.37%), Query Frame = 0

Query: 1   MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV 60
           M S+NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFG GSQK+NKRIGHAV
Sbjct: 1   MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAV 60

Query: 61  LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALS 120
           LVPEPAV GAVAPAVEHRTPSTT+VLPFIAPPSSPASFLQSEP SN QSPAGLLSLTALS
Sbjct: 61  LVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQSEPPSNAQSPAGLLSLTALS 120

Query: 121 VNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVP 180
           VNNYSPNGPASIFAIGPY YDTQLVSPPVFSAF TEPSTAP TPPPESVQLTTPSSPEVP
Sbjct: 121 VNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVP 180

Query: 181 FAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP 240
           FAKLLTSSLSHTNKSFGTNQKF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP
Sbjct: 181 FAKLLTSSLSHTNKSFGTNQKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP 240

Query: 241 DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLXXXXXXXXXXXXXXXXX 300
           DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL XXXXXXXXXXXXXXXX
Sbjct: 241 DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGXXXXXXXXXXXXXXXX 300

Query: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHGLQDSPLLDNQISEVASLANSETGCQNDV 360
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXH LQD  LLD+QISEVASLANSETGCQNDV
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHALQDGLLLDSQISEVASLANSETGCQNDV 360

Query: 361 TNHRVSFELTGEDVARCLANKS-LTSIRTEXXXXXXXXXXXXXXXXXXXREAETCEFFDI 420
            NHRVSFELTGEDVARCLANKS  TS  ++            XXXXX  +EAE+CEFFDI
Sbjct: 361 ANHRVSFELTGEDVARCLANKSKQTSTNSQ------------XXXXXSSKEAESCEFFDI 420

Query: 421 KTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKE 480
           KTS APEKT  EDDQCYQNQRAV LGSFKEFNFDQTKGEIH+TASIGAEWWANEKV VKE
Sbjct: 421 KTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGEIHSTASIGAEWWANEKVAVKE 480

Query: 481 ASPGNNWTFFPLLQPGVS 498
           ASPGNNWTFFP+LQPGVS
Sbjct: 481 ASPGNNWTFFPMLQPGVS 486

BLAST of CsGy3G017170 vs. NCBI nr
Match: XP_022922938.1 (uncharacterized protein LOC111430767 [Cucurbita moschata])

HSP 1 Score: 770.0 bits (1987), Expect = 4.9e-219
Identity = 449/498 (90.16%), Positives = 460/498 (92.37%), Query Frame = 0

Query: 1   MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV 60
           M S+NNSVDTVNAAATAIVSAEARVQP TPPKRRWGSCWSLYWCFG GSQK+NKRIGHAV
Sbjct: 1   MGSMNNSVDTVNAAATAIVSAEARVQPPTPPKRRWGSCWSLYWCFGNGSQKNNKRIGHAV 60

Query: 61  LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALS 120
           LVPEPAV GAVAPAVEHRTPSTT+VLPFIAPPSSPASFLQSEP SN QSPAGLLSLTALS
Sbjct: 61  LVPEPAVSGAVAPAVEHRTPSTTVVLPFIAPPSSPASFLQSEPPSNAQSPAGLLSLTALS 120

Query: 121 VNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVP 180
           VNNYSPNGPASIFAIGPY YDTQLVSPPVFSAF TEPSTAP TPPPESVQLTTPSSPEVP
Sbjct: 121 VNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFPTEPSTAPFTPPPESVQLTTPSSPEVP 180

Query: 181 FAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP 240
           FAKLLTSSLSHTNKSFGTNQKF LSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP
Sbjct: 181 FAKLLTSSLSHTNKSFGTNQKFALSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP 240

Query: 241 DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLXXXXXXXXXXXXXXXXX 300
           DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL XXXXXXXXXXXXXXXX
Sbjct: 241 DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGXXXXXXXXXXXXXXXX 300

Query: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHGLQDSPLLDNQISEVASLANSETGCQNDV 360
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXH LQD  LLD+QISEVASLANSETGCQNDV
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHALQDGLLLDSQISEVASLANSETGCQNDV 360

Query: 361 TNHRVSFELTGEDVARCLANKS-LTSIRTEXXXXXXXXXXXXXXXXXXXREAETCEFFDI 420
            NHRVSFELTGEDVARCLANKS  TS  ++            XXXXX  +EAE+CEFFDI
Sbjct: 361 ANHRVSFELTGEDVARCLANKSKQTSTNSQ------------XXXXXSSKEAESCEFFDI 420

Query: 421 KTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKE 480
           KTS APEKT  EDDQCYQNQRAV LGSFKEFNFDQTKGE+H+TASIGAEWWANEKV VKE
Sbjct: 421 KTSTAPEKTSAEDDQCYQNQRAVNLGSFKEFNFDQTKGEMHSTASIGAEWWANEKVAVKE 480

Query: 481 ASPGNNWTFFPLLQPGVS 498
           ASPGNNWTFFP+LQPGVS
Sbjct: 481 ASPGNNWTFFPMLQPGVS 486

BLAST of CsGy3G017170 vs. TAIR10
Match: AT5G52430.1 (hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 391.3 bits (1004), Expect = 8.6e-109
Identity = 263/499 (52.71%), Positives = 312/499 (62.53%), Query Frame = 0

Query: 4   INNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVP 63
           +NNSV+TVNAAATAIV+AE+RVQP++  K RWG CWSLY CF  G+QK+NKRIG+AVLVP
Sbjct: 5   VNNSVETVNAAATAIVTAESRVQPSSSQKGRWGKCWSLYSCF--GTQKNNKRIGNAVLVP 64

Query: 64  EPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALSVNN 123
           EP   G     V++   STT+VLPFIAPPSSPASFLQS+P+S + SP G LSLT+   N 
Sbjct: 65  EPVTSGVPVVTVQNSATSTTVVLPFIAPPSSPASFLQSDPSSVSHSPVGPLSLTS---NT 124

Query: 124 YSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPE-SVQLTTPSSPEVPFA 183
           +SP  P S+F +GPY  +TQ V+PPVFSAF TEPSTAP TP PE SV +TTPSSPEVPFA
Sbjct: 125 FSPKEPQSVFTVGPYANETQPVTPPVFSAFITEPSTAPYTPXPESSVHITTPSSPEVPFA 184

Query: 184 KLLTSSLSHTNK--SFGTNQKFTLSHCDFQPYQPYPGSP-GAHLISPGSVISNSGTSSPF 243
           +LLTSSL  T +  + G NQKF+ SH +F+  Q  PGSP G +LISPGSVISNSGTSSP+
Sbjct: 185 QLLTSSLELTRRDSTSGMNQKFSSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPY 244

Query: 244 PDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLXXXXXXXXXXXXXXXX 303
           P K P++EFR+ + PK LG EHFT RKW SR GSGS+TP G G        XXXXXXXXX
Sbjct: 245 PGKSPMVEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPVGHGSGLASGALXXXXXXXXX 304

Query: 304 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHGLQDSPLLDNQISEVASLANSETGCQND 363
           XXXXXX                                 L NQISEVASLANS+ G +  
Sbjct: 305 XXXXXXNTTWP----------------------------LQNQISEVASLANSDHGSEVM 364

Query: 364 VTNHRVSFELTGEDVARCLANKSLTSIRTEXXXXXXXXXXXXXXXXXXXREAETCEFFDI 423
           V +HRVSFELTGEDVARCLA+K   S                        E E     DI
Sbjct: 365 VADHRVSFELTGEDVARCLASKLNRS--------------HDRMNNNDRIETEESSSTDI 424

Query: 424 KTSAAPEKTPGEDDQ-CYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVK 483
           + +        E++Q   Q   + ++GS KEF FD TK E              EKV   
Sbjct: 425 RRNIEKRSGDRENEQHRIQKLSSSSIGSSKEFKFDNTKDE------------NIEKVA-- 438

Query: 484 EASPGNNWTFFPLLQPGVS 498
               GN+W+FFP L+ GVS
Sbjct: 485 ----GNSWSFFPGLRSGVS 438

BLAST of CsGy3G017170 vs. TAIR10
Match: AT4G25620.1 (hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 355.9 bits (912), Expect = 4.0e-98
Identity = 270/508 (53.15%), Positives = 312/508 (61.42%), Query Frame = 0

Query: 1   MASINN-SVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHA 60
           M S+NN SVDTVNAAA+AIVSAE+R QP++  K+R GS WSLYWCF  GS+K+NKRIGHA
Sbjct: 1   MRSVNNSSVDTVNAAASAIVSAESRTQPSSVQKKR-GSWWSLYWCF--GSKKNNKRIGHA 60

Query: 61  VLVPEPAVPG-AVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEP--TSNTQSPAGLLSL 120
           VLVPEPA  G AVAP     + ST++ +PFIAPPSSPASFL S P   S+T  P  L SL
Sbjct: 61  VLVPEPAASGAAVAPVQNSSSNSTSIFMPFIAPPSSPASFLPSGPPSASHTPDPGLLCSL 120

Query: 121 TALSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSS 180
           T         N P S F IGPY ++TQ V+PPVFSAFTTEPSTAP    PES     PSS
Sbjct: 121 TV--------NEPPSAFTIGPYAHETQPVTPPVFSAFTTEPSTAPFXXXPES-----PSS 180

Query: 181 PEVPFAKLLTSSL--SHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSG 240
           PEVPFA+LLTSSL  +  N   G NQKF+ +H +F+  Q YPGSPG +LISPG     SG
Sbjct: 181 PEVPFAQLLTSSLERARRNSGGGMNQKFSAAHYEFKSCQVYPGSPGGNLISPG-----SG 240

Query: 241 TSSPFPDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLXXXXXXXXXXX 300
           TSSP+P K  I+EFR+ + PK LG EHFT RKW SR GSGS+       XXXXXXXXXXX
Sbjct: 241 TSSPYPGKCSIIEFRIGEPPKFLGFEHFTARKWGSRFGSGSIXXXXXXXXXXXXXXXXXX 300

Query: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHGLQDSPLLDNQISEVASLANSET 360
           XXXXXXXXXXXXX                           +  LLD+QISEVASLANS+ 
Sbjct: 301 XXXXXXXXXXXXXETVIRMSYGNLTPL-------------EGSLLDSQISEVASLANSDH 360

Query: 361 GC--QND---VTNHRVSFELTGEDVARCLANKSLTSIRTEXXXXXXXXXXXXXXXXXXXR 420
           G    ND   V  HRVSFELTGEDVARCLA+K   S   E                    
Sbjct: 361 GSSRHNDEALVVPHRVSFELTGEDVARCLASKLNRSGSHEKASGEHL------------- 420

Query: 421 EAETCEFFDIKTSAAPEKTPGE-DDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAE 480
               C            KT GE + +  Q  R+ + GS KEF FD T  E+     I +E
Sbjct: 421 RPNCC------------KTSGETESEQSQKLRSFSTGSNKEFKFDSTNEEM--IEKIRSE 447

Query: 481 WWANEKV-GVKEASPGNNWTFFPLLQPG 496
           WWANEKV G  + SP N+WTFFP+L+ G
Sbjct: 481 WWANEKVAGKGDHSPRNSWTFFPVLRSG 447

BLAST of CsGy3G017170 vs. TAIR10
Match: AT1G63720.1 (BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1))

HSP 1 Score: 204.1 bits (518), Expect = 1.9e-52
Identity = 137/266 (51.50%), Positives = 171/266 (64.29%), Query Frame = 0

Query: 2   ASINNSVDTVNAAATAIVSAEARVQPTTP--PKRRWGSCWSLYWCFGIGSQKSNKRIGHA 61
           A+ NN  DT+NAAA+AI S++ R+  ++P   KR+W + WSL  CF  GS +  KRIG++
Sbjct: 5   ANGNNVFDTINAAASAIASSDDRLHQSSPIHKKRKWWNRWSLLKCF--GSSRQRKRIGNS 64

Query: 62  VLVPEP-AVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTA 121
           VLVPEP ++  + +        S    LPFIAPPSSPASF QSEP S TQSP G+LS + 
Sbjct: 65  VLVPEPVSMSSSNSTTSNSGYRSVITTLPFIAPPSSPASFFQSEPPSATQSPVGILSFSP 124

Query: 122 LSVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQL----TTP 181
           L  NN       SIFAIGPY ++TQLVSPPVFS +TTEPS+APITPP +   +    TTP
Sbjct: 125 LPCNN-----RPSIFAIGPYAHETQLVSPPVFSTYTTEPSSAPITPPLDDSSIYLTTTTP 184

Query: 182 SSPEVPFAKLLTSSLSHTNKSFGTNQKFTLSHC-DFQPYQPYPGSPGAHLISPGSVISNS 241
           SSPEVPFA+L  S  +H   S+G   KF +S   +FQ YQ  PGSP   LISP      S
Sbjct: 185 SSPEVPFAQLFNS--NHQTGSYG--YKFPMSSSYEFQFYQLPPGSPLGQLISPS---PGS 244

Query: 242 GTSSPFPDKHPIL--EFRMADAPKLL 258
           G +SPFPD    L   F+++D PKLL
Sbjct: 245 GPTSPFPDGETSLFPHFQVSDPPKLL 256

BLAST of CsGy3G017170 vs. TAIR10
Match: AT1G76660.1 (FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 147.5 bits (371), Expect = 2.2e-35
Identity = 102/215 (47.44%), Positives = 123/215 (57.21%), Query Frame = 0

Query: 32  KRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRT------PSTTMV 91
           ++RWG C  ++ CF   SQK  KRI  A  +PE     A  P   H+        +  + 
Sbjct: 8   RKRWGGCLGVFSCF--KSQKGGKRIVPASRIPEGGNVSASQPNGAHQAGVLNNQAAGGIN 67

Query: 92  LPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALSVNNYSPNGP-ASIFAIGPYTYDTQL 151
           L  +APPSSPASF  S   S TQSP   LSL A      SP GP +S++A GPY ++TQL
Sbjct: 68  LSLLAPPSSPASFTNSALPSTTQSPNCYLSLAA-----NSPGGPSSSMYATGPYAHETQL 127

Query: 152 VSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFTL 211
           VSPPVFS FTTEPSTAP TPPPE  +LT PSSP+VP+A+ LTSS+   N   G       
Sbjct: 128 VSPPVFSTFTTEPSTAPFTPPPELARLTAPSSPDVPYARFLTSSMDLKNSGKG------- 187

Query: 212 SHCDFQ-PYQPYPGSPGAHLISPGSVISNSGTSSP 239
            + D Q  Y  YPGSP + L SP S  S  G  SP
Sbjct: 188 HYNDLQATYSLYPGSPASALRSPISRASGDGLLSP 208

BLAST of CsGy3G017170 vs. Swiss-Prot
Match: sp|Q9SRE5|Y1666_ARATH (Uncharacterized protein At1g76660 OS=Arabidopsis thaliana OX=3702 GN=At1g76660 PE=2 SV=1)

HSP 1 Score: 147.5 bits (371), Expect = 3.9e-34
Identity = 102/215 (47.44%), Positives = 123/215 (57.21%), Query Frame = 0

Query: 32  KRRWGSCWSLYWCFGIGSQKSNKRIGHAVLVPEPAVPGAVAPAVEHRT------PSTTMV 91
           ++RWG C  ++ CF   SQK  KRI  A  +PE     A  P   H+        +  + 
Sbjct: 8   RKRWGGCLGVFSCF--KSQKGGKRIVPASRIPEGGNVSASQPNGAHQAGVLNNQAAGGIN 67

Query: 92  LPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALSVNNYSPNGP-ASIFAIGPYTYDTQL 151
           L  +APPSSPASF  S   S TQSP   LSL A      SP GP +S++A GPY ++TQL
Sbjct: 68  LSLLAPPSSPASFTNSALPSTTQSPNCYLSLAA-----NSPGGPSSSMYATGPYAHETQL 127

Query: 152 VSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPFAKLLTSSLSHTNKSFGTNQKFTL 211
           VSPPVFS FTTEPSTAP TPPPE  +LT PSSP+VP+A+ LTSS+   N   G       
Sbjct: 128 VSPPVFSTFTTEPSTAPFTPPPELARLTAPSSPDVPYARFLTSSMDLKNSGKG------- 187

Query: 212 SHCDFQ-PYQPYPGSPGAHLISPGSVISNSGTSSP 239
            + D Q  Y  YPGSP + L SP S  S  G  SP
Sbjct: 188 HYNDLQATYSLYPGSPASALRSPISRASGDGLLSP 208

BLAST of CsGy3G017170 vs. TrEMBL
Match: tr|A0A1S3AYC5|A0A1S3AYC5_CUCME (uncharacterized protein LOC103484098 OS=Cucumis melo OX=3656 GN=LOC103484098 PE=4 SV=1)

HSP 1 Score: 832.8 bits (2150), Expect = 4.1e-238
Identity = 486/497 (97.79%), Positives = 488/497 (98.19%), Query Frame = 0

Query: 1   MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV 60
           M SINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQK+NKRIGHAV
Sbjct: 1   MGSINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKNNKRIGHAV 60

Query: 61  LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALS 120
           LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQS PTSNTQSPAGLLSLTALS
Sbjct: 61  LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSGPTSNTQSPAGLLSLTALS 120

Query: 121 VNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVP 180
           VNNYSPNGPASIFAIGPY YDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVP
Sbjct: 121 VNNYSPNGPASIFAIGPYAYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVP 180

Query: 181 FAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP 240
           FAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP
Sbjct: 181 FAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP 240

Query: 241 DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLXXXXXXXXXXXXXXXXX 300
           DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGL XXXXXXXXXXXXXXXX
Sbjct: 241 DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLGXXXXXXXXXXXXXXXX 300

Query: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHGLQDSPLLDNQISEVASLANSETGCQNDV 360
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHGLQDSPLLDNQISEVASLANSETGCQNDV
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHGLQDSPLLDNQISEVASLANSETGCQNDV 360

Query: 361 TNHRVSFELTGEDVARCLANKSLTSIRTEXXXXXXXXXXXXXXXXXXXREAETCEFFDIK 420
           TNHRVSFELTGEDVARCLANKSLTSIRTE XXXXXXXXXXXXXXX   REAETCEFFDIK
Sbjct: 361 TNHRVSFELTGEDVARCLANKSLTSIRTESXXXXXXXXXXXXXXXELSREAETCEFFDIK 420

Query: 421 TSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKEA 480
           TS APEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGE+HNTASIGAEWWANEKVGVKEA
Sbjct: 421 TSMAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEVHNTASIGAEWWANEKVGVKEA 480

Query: 481 SPGNNWTFFPLLQPGVS 498
           SPGNNWTFFPLLQPGVS
Sbjct: 481 SPGNNWTFFPLLQPGVS 497

BLAST of CsGy3G017170 vs. TrEMBL
Match: tr|A0A2P5B5V3|A0A2P5B5V3_9ROSA (Hydroxyproline-rich glycoprotein family protein OS=Trema orientalis OX=63057 GN=TorRG33x02_331420 PE=4 SV=1)

HSP 1 Score: 582.4 bits (1500), Expect = 9.6e-163
Identity = 356/511 (69.67%), Positives = 403/511 (78.86%), Query Frame = 0

Query: 1   MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV 60
           M S NNSVDT+NAAATAIVSAE R QPT+ PKRRWGSCWSLYWCF  GS K++KRIGHAV
Sbjct: 1   MRSANNSVDTINAAATAIVSAETRAQPTSVPKRRWGSCWSLYWCF--GSHKNSKRIGHAV 60

Query: 61  LVPEPAVPGAVAPAVE-HRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTAL 120
           LVPEP +PGA APA E H+ PST +VLPFIAPPSSPASFLQS+P S TQSPAGLLSLT+L
Sbjct: 61  LVPEPVLPGAAAPAAEHHQAPSTAVVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSL 120

Query: 121 SVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEV 180
           S+N YSP GPASIFAIGPY Y+TQLVSPPVFS FTTEPSTAP TPPPESVQLTTPSSPEV
Sbjct: 121 SINAYSPGGPASIFAIGPYAYETQLVSPPVFSTFTTEPSTAPFTPPPESVQLTTPSSPEV 180

Query: 181 PFAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPF 240
           PFA+LLTSSL  T ++ G +QKF+L+HC+FQPYQPYPGSPG HLISPGSV+SNSGTSSPF
Sbjct: 181 PFAQLLTSSLDRTRRNGGMHQKFSLTHCEFQPYQPYPGSPGGHLISPGSVVSNSGTSSPF 240

Query: 241 PDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLXXXXXXXXXXXXXXXX 300
           PD+HP+L FR+ +AP++LG EHFT RKW SR+GSGSLTPDG G XXXXXXXXXXXXXXXX
Sbjct: 241 PDRHPMLGFRIGEAPRILGFEHFTNRKWGSRLGSGSLTPDGVGXXXXXXXXXXXXXXXXX 300

Query: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHGLQ-DSPLLDNQISEVASLANSETGCQN 360
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX   +  ++ LL+N ISEVASLANSE GCQN
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLVAVSTETFLLENLISEVASLANSENGCQN 360

Query: 361 D--VTNHRVSFELTGEDVARCLANKSLTSIRTEXXXXXXXXXXXXXXXXXXXREAET--- 420
           D  V +HRVSFELTGEDVARCLA KS++S+                         +T   
Sbjct: 361 DGSVVDHRVSFELTGEDVARCLAKKSVSSVSRTASDSLEDSTPAECPTKKDVISIDTDNN 420

Query: 421 -------CEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIG 480
                  CE  +  T  +       +DQ YQ  R++TLGS KEFNFD TK ++    +IG
Sbjct: 421 NSSNQLCCE--ETSTEMSEHNCREGEDQSYQKHRSITLGSIKEFNFDNTKADVSVKPAIG 480

Query: 481 AEWWANEKVGVKEASPGNNWTFFPLLQPGVS 498
           +EWWANEKV  KE  PGN+W+FFP+LQPGVS
Sbjct: 481 SEWWANEKVAGKEPKPGNSWSFFPILQPGVS 507

BLAST of CsGy3G017170 vs. TrEMBL
Match: tr|A0A2P5AAZ8|A0A2P5AAZ8_PARAD (Hydroxyproline-rich glycoprotein family protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_350420 PE=4 SV=1)

HSP 1 Score: 577.0 bits (1486), Expect = 4.0e-161
Identity = 353/510 (69.22%), Positives = 398/510 (78.04%), Query Frame = 0

Query: 1   MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV 60
           M S NNSVDT+NAAATAIVSAE R QPT+ PKRRWGSCWSLYWCF  GS K++KRIGHAV
Sbjct: 1   MRSANNSVDTINAAATAIVSAETRAQPTSVPKRRWGSCWSLYWCF--GSHKNSKRIGHAV 60

Query: 61  LVPEPAVPGAVAPAVE-HRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTAL 120
           LVPE  +P A APA E H+ PST +VLPFIAPPSSPASFLQS+P S TQSPAGLLSLT+L
Sbjct: 61  LVPESVLPVAAAPAAEHHQAPSTALVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSL 120

Query: 121 SVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEV 180
           S+N YS  GPASIFAIGPY Y+TQLVSPPVFS FTTEPSTAP TPPPESVQLTTPSSPEV
Sbjct: 121 SINAYSSGGPASIFAIGPYAYETQLVSPPVFSTFTTEPSTAPFTPPPESVQLTTPSSPEV 180

Query: 181 PFAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPF 240
           PFA+LLTSSL  T ++ GT+QKF+L+HC+FQPYQPYPGSPG HLISPGSV+SNSGTSSPF
Sbjct: 181 PFAQLLTSSLDRTRRNGGTHQKFSLTHCEFQPYQPYPGSPGGHLISPGSVVSNSGTSSPF 240

Query: 241 PDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLXXXXXXXXXXXXXXXX 300
           PD+HPIL FR+ +AP++ G EHFT RKW SR+GSGSLTPDG GL   XXXXXXXXXXXXX
Sbjct: 241 PDRHPILGFRVGEAPRIWGFEHFTNRKWGSRLGSGSLTPDGVGL---XXXXXXXXXXXXX 300

Query: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHGLQDSPLLDNQISEVASLANSETGCQND 360
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX    ++ LL+N ISEVASL NSE GCQND
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXETFLLENLISEVASLVNSENGCQND 360

Query: 361 --VTNHRVSFELTGEDVARCLANKSLTSIRTEXXXXXXXXXXXXXXXXXXXREAET---- 420
             V +HRVSFELTGEDVARCLANKS++S+                         +T    
Sbjct: 361 GSVVDHRVSFELTGEDVARCLANKSVSSVSRTASDSLEDTTPAACPTKKDVISIDTDNNN 420

Query: 421 ------CEFFDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGA 480
                 CE  +  T  +       +DQ YQ  R++TLGS KEFNFD TK ++    +IG+
Sbjct: 421 SSNQLCCE--ETSTEVSEHNCREGEDQSYQKHRSITLGSIKEFNFDNTKADVSVKPAIGS 480

Query: 481 EWWANEKVGVKEASPGNNWTFFPLLQPGVS 498
           EWWANEKV  KE  PGN+W+FFP+LQPGVS
Sbjct: 481 EWWANEKVAGKEPKPGNSWSFFPILQPGVS 503

BLAST of CsGy3G017170 vs. TrEMBL
Match: tr|M5XMF7|M5XMF7_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_2G298300 PE=4 SV=1)

HSP 1 Score: 576.2 bits (1484), Expect = 6.9e-161
Identity = 353/501 (70.46%), Positives = 398/501 (79.44%), Query Frame = 0

Query: 1   MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV 60
           M S+N+SVDT+NAAATAIVSAEAR QPTT PKRRWGSCWSLYWCFG      NKRIGHAV
Sbjct: 1   MRSVNSSVDTINAAATAIVSAEARPQPTTVPKRRWGSCWSLYWCFG---PHKNKRIGHAV 60

Query: 61  LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALS 120
           LVPEP VPGA   A++++T ST +V+PFIAPPSSPASFL S+P S TQSPAG LSL +LS
Sbjct: 61  LVPEPVVPGAAVSAIDNQTTSTAIVVPFIAPPSSPASFLPSDPPSATQSPAGFLSLKSLS 120

Query: 121 VNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVP 180
            N YSP GPASIF+IGPY Y+TQLVSPPVFS F TEPSTAP TPPPESVQLTTPSSPEVP
Sbjct: 121 ANAYSPGGPASIFSIGPYAYETQLVSPPVFSTFNTEPSTAPFTPPPESVQLTTPSSPEVP 180

Query: 181 FAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFP 240
           FA+LLTSSL    ++ GTNQKF LSH +FQPYQ YPGSPG +LISPGS +SNSGTSSPFP
Sbjct: 181 FAQLLTSSLDRNRRNSGTNQKFALSHYEFQPYQQYPGSPGGNLISPGSAVSNSGTSSPFP 240

Query: 241 DKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLXXXXXXXXXXXXXXXXX 300
           D+HP+LEFRM +APKL G +HFTTRKW SR+GSGSLTPDG GLXXXXXXXXXXXXXXXXX
Sbjct: 241 DRHPVLEFRMGEAPKLFGFDHFTTRKWGSRIGSGSLTPDGVGLXXXXXXXXXXXXXXXXX 300

Query: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHGLQDSPLLDNQISEVASLANSETGCQ--N 360
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX    DS LL+NQISEVASLANSE+GCQ   
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDSFLLENQISEVASLANSESGCQTVE 360

Query: 361 DVTNHRVSFELTGEDVARCLANKSLTSIRTEXXXXXXXXXXXXXXXXXXXREAET-CEF- 420
            V +HRVSFELTGEDVA CLANK++ S RT                     ++   CEF 
Sbjct: 361 TVFDHRVSFELTGEDVACCLANKAVASNRTASGSSKVIASEYPSERDALSSDSSNHCEFS 420

Query: 421 FDIKTSAAPEKTPGE-DDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKV 480
            +  +S  PE   GE +DQ Y+  R++TLGS K+FNFD TK E+ N  +IG+EWWAN+ V
Sbjct: 421 VEESSSRIPENVSGEGEDQGYRKHRSITLGSTKDFNFDNTKAEVPNKPNIGSEWWANKNV 480

Query: 481 GVKEASPGNNWTFFPLLQPGV 497
             KE+ P N+WTFFP+LQPGV
Sbjct: 481 AAKESKPCNDWTFFPILQPGV 498
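
The summary lines that precede each alignment (bit score, raw score, E-value, identity and positives counts) follow a fixed pattern, so they can be pulled out and sanity-checked programmatically. The following is a minimal sketch in Python, not part of the database's own tooling; `parse_hsp` and the returned field names are hypothetical, and the sample text is copied from the Prunus persica hit above. It recomputes the percentages from the counts to confirm that they agree with the reported values.

```python
import re

# Hypothetical pattern for the two HSP summary lines shown in this record.
# "Pos\w+" tolerates spelling variants of the "Positives" label.
HSP_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+) bits \((?P<raw>\d+)\), Expect = (?P<evalue>\S+)\s+"
    r"Identity = (?P<ident>\d+)/(?P<length>\d+) \((?P<ident_pct>[\d.]+)%\), "
    r"Pos\w+ = (?P<pos>\d+)/(?P=length) \((?P<pos_pct>[\d.]+)%\)"
)


def parse_hsp(text):
    """Parse one HSP summary and recompute its percentages (illustrative helper)."""
    m = HSP_RE.search(text)
    if m is None:
        raise ValueError("no HSP summary found")
    ident, pos, length = int(m["ident"]), int(m["pos"]), int(m["length"])
    return {
        "bits": float(m["bits"]),
        "raw_score": int(m["raw"]),
        "evalue": float(m["evalue"]),
        "identities": ident,
        "positives": pos,
        "align_length": length,
        # Percentages as reported in the text
        "identity_pct": float(m["ident_pct"]),
        "positives_pct": float(m["pos_pct"]),
        # Percentages recomputed from the counts
        "identity_pct_recomputed": round(100 * ident / length, 2),
        "positives_pct_recomputed": round(100 * pos / length, 2),
    }


sample = (
    "HSP 1 Score: 576.2 bits (1484), Expect = 6.9e-161\n"
    "Identity = 353/501 (70.46%), Positives = 398/501 (79.44%), Query Frame = 0"
)
hsp = parse_hsp(sample)
print(hsp["identity_pct_recomputed"], hsp["positives_pct_recomputed"])
```

For the sample shown, the recomputed values match the reported 70.46% identity and 79.44% positives.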

BLAST of CsGy3G017170 vs. TrEMBL
Match: tr|W9S7Z6|W9S7Z6_9ROSA (Uncharacterized protein OS=Morus notabilis OX=981085 GN=L484_004326 PE=4 SV=1)

HSP 1 Score: 575.9 bits (1483), Expect = 9.0e-161
Identity = 364/523 (69.60%), Positives = 401/523 (76.67%), Query Frame = 0

Query: 1   MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV 60
           M ++NNSV+T+NAAATAIVSAEAR QP   PKRRWGSCWSLYWCF  GS K++KRIGHAV
Sbjct: 1   MRTVNNSVETINAAATAIVSAEARAQPAAVPKRRWGSCWSLYWCF--GSHKNSKRIGHAV 60

Query: 61  LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALS 120
           LVPEP +PGA APA E++ PST +VLPFIAPPSSPASFLQS+P S TQSPAGLLSLT+LS
Sbjct: 61  LVPEPVLPGAAAPAPENQAPSTAIVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLS 120

Query: 121 VNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVP 180
           +N YSP GP SIFAIGPY Y+TQLVSPPVFS FTTEPSTAP TPPPESVQLTTPSSPEVP
Sbjct: 121 INAYSPGGPTSIFAIGPYAYETQLVSPPVFSTFTTEPSTAPFTPPPESVQLTTPSSPEVP 180

Query: 181 FAKLLTSSLSHTNK-SFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPF 240
           FA+LLTSSL  T + S G NQKF+LSHC+FQPYQ YPGSPG +LISPGSV+SNSGTSSPF
Sbjct: 181 FAQLLTSSLDRTRRNSSGANQKFSLSHCEFQPYQLYPGSPGGNLISPGSVVSNSGTSSPF 240

Query: 241 PDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLXXXXXXXXXXXXXXXX 300
           PDKHPIL FRM +AP+LLG EHFTT KW SR+GSGSLTPDG GLXXXXXXXXXXXXXXXX
Sbjct: 241 PDKHPILGFRMGEAPRLLGFEHFTTWKWGSRLGSGSLTPDGVGLXXXXXXXXXXXXXXXX 300

Query: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHGLQ----------------DSPLLDNQI 360
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX                    DS LL+NQI
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFLVVSGDSFLLENQI 360

Query: 361 SEVASLANSETGCQND--VTNHRVSFELTGEDVARCLANKSLTSI-RTEXXXXXXXXXXX 420
           SEVASLANS+ GCQND  V +HRVSFELTGEDVARCLA+KS +S  RT            
Sbjct: 361 SEVASLANSDNGCQNDGSVVDHRVSFELTGEDVARCLASKSASSNGRTTSESLEDSPAEC 420

Query: 421 XXXXXXXXREAETCEFFDIKTSAAPEKTP------GEDDQCYQNQRAVTLGSFKEFNFDQ 480
                                     KTP      GEDD  YQ  R++TLGS KEFNFD 
Sbjct: 421 PTKKDGISANNVDSPNDQSCVEETSNKTPQSDCREGEDDHFYQKHRSITLGSIKEFNFDN 480

Query: 481 TKGEIHNTASIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS 498
           TK ++    +IG+EWWANEKV  KEA  GN+W+FFP+LQPGVS
Sbjct: 481 TKADVSVKPTIGSEWWANEKVAGKEAKAGNSWSFFPILQPGVS 521

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_004140832.2	3.0e-240	99.60	PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101210841 [Cucumis sa...
XP_008439268.1	6.2e-238	97.79	PREDICTED: uncharacterized protein LOC103484098 [Cucumis melo]
XP_022141198.1	8.1e-222	89.58	uncharacterized protein LOC111011654 [Momordica charantia]
XP_022984784.1	2.2e-219	90.36	uncharacterized protein LOC111482967 [Cucurbita maxima]
XP_022922938.1	4.9e-219	90.16	uncharacterized protein LOC111430767 [Cucurbita moschata]

Match Name	E-value	Identity	Description
AT5G52430.1	8.6e-109	52.71	hydroxyproline-rich glycoprotein family protein
AT4G25620.1	4.0e-98	53.15	hydroxyproline-rich glycoprotein family protein
AT1G63720.1	1.9e-52	51.50	BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein fam...
AT1G76660.1	2.2e-35	47.44	FUNCTIONS IN: molecular_function unknown

Match Name	E-value	Identity	Description
sp|Q9SRE5|Y1666_ARATH	3.9e-34	47.44	Uncharacterized protein At1g76660 OS=Arabidopsis thaliana OX=3702 GN=At1g76660 P...

Match Name	E-value	Identity	Description
tr|A0A1S3AYC5|A0A1S3AYC5_CUCME	4.1e-238	97.79	uncharacterized protein LOC103484098 OS=Cucumis melo OX=3656 GN=LOC103484098 PE=...
tr|A0A2P5B5V3|A0A2P5B5V3_9ROSA	9.6e-163	69.67	Hydroxyproline-rich glycoprotein family protein OS=Trema orientalis OX=63057 GN=...
tr|A0A2P5AAZ8|A0A2P5AAZ8_PARAD	4.0e-161	69.22	Hydroxyproline-rich glycoprotein family protein OS=Parasponia andersonii OX=3476...
tr|M5XMF7|M5XMF7_PRUPE	6.9e-161	70.46	Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_2G298300 PE=4 SV=1
tr|W9S7Z6|W9S7Z6_9ROSA	9.0e-161	69.60	Uncharacterized protein OS=Morus notabilis OX=981085 GN=L484_004326 PE=4 SV=1

GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0008150	biological_process
cellular_component	GO:0005575	cellular_component
molecular_function	GO:0003674	molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CsGy3G017170.1	CsGy3G017170.1	mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 386..413
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 386..404
None	No IPR available	PANTHER	PTHR31798:SF4	SUBFAMILY NOT NAMED	coord: 1..296; coord: 334..497
None	No IPR available	PANTHER	PTHR31798	FAMILY NOT NAMED	coord: 1..296; coord: 334..497
None	No IPR available	PANTHER	PTHR31798:SF4	SUBFAMILY NOT NAMED	coord: 300..329
None	No IPR available	PANTHER	PTHR31798	FAMILY NOT NAMED	coord: 300..329
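
The `coord: start..end` entries in the InterPro annotations above can be read as inclusive residue ranges on the protein. As a rough illustration, and assuming 1-based inclusive coordinates (an assumption, since the record does not state the convention), the Python sketch below extracts the ranges and counts how many distinct positions a set of matches covers. The helper names `parse_coords` and `covered_length` are hypothetical.

```python
import re


def parse_coords(text):
    """Extract (start, end) pairs from 'coord: start..end' entries."""
    return [(int(a), int(b)) for a, b in re.findall(r"coord:\s*(\d+)\.\.(\d+)", text)]


def covered_length(ranges):
    """Count distinct positions covered by inclusive ranges, merging overlaps."""
    total = 0
    last_end = 0  # highest position already counted
    for start, end in sorted(ranges):
        start = max(start, last_end + 1)  # skip positions counted earlier
        if end >= start:
            total += end - start + 1
            last_end = end
    return total


# Ranges reported above for the PANTHER family PTHR31798
panther = parse_coords("coord: 1..296 coord: 334..497 coord: 300..329")
print(covered_length(panther))  # 490

# The two MOBIDB_LITE disorder predictions overlap, so they are merged
mobidb = parse_coords("coord: 386..413 coord: 386..404")
print(covered_length(mobidb))  # 28
```

Merging before counting matters for the disorder predictions, where one range lies entirely inside the other.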

The following gene(s) are paralogous to this gene:

None