CsGy3G001620 (gene) Cucumber (Gy14) v2.1

Overview
Name: CsGy3G001620
Type: gene
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: Protein of Unknown Function (DUF239)
Location: Gy14Chr3: 1192518 .. 1198231 (+)
RNA-Seq Expression: CsGy3G001620
Synteny: CsGy3G001620
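
The Location field in the Overview gives a chromosome, a start and end coordinate, and a strand. A minimal sketch of how the genomic span could be derived from that string, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page); the function name `parse_location` is hypothetical:

```python
import re

def parse_location(text):
    """Split a 'Chrom: start .. end (strand)' string into its parts."""
    pattern = r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$"
    match = re.match(pattern, text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Gy14Chr3: 1192518 .. 1198231 (+)")

# Assumption: 1-based inclusive coordinates, so both endpoints count.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the feature spans 5,714 bp on the plus strand, which should correspond to the length of the gene sequence (with introns) listed under Sequences.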
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATTATTATTATTCTAAAGCAACATGTTTGGTGATAGTGTTCTTTGTTTGTTTCAATTGCAAATTCAATCATGCCTCTAACCCTAATCTTTCAAGAGAAGAAGACTTGGAGATTGAAAGACAACTCAAACTTCTCAATAAACCATTCATCAAAACATACAAGGTAAGCTATATATACCTACATACACTCTTAATTGAAATCTTGAATCGAGATTCGATAATTTCATTATACTTTATATATTAATTAAAATTCTTTTTGTTTTCGATTATGCACAAGTGAAGATTAATAATAAATAATTTAAGAGAAATGAACTACGATGTTTCTTGCTTAATAGACGAAGGAAGGAGATATCATTGATTGTGTCGACATCAACAAACAACCTGCCCTCGATCATCCTTTACTAAAGAATCACAAAGTTCAGGTACATACTCGTATTTGTGATATCTGTTATATTTCGTGTTTTTCTATTTTCAATTTCAACTTTCATTACATTTTAATTAGGAAAATTTCCAGAAGAAAAAAAAGAACAAAGAAAAATTTTGTTTACAATGTTTTTAGGGAAATTCTTATAGATAGAAAAAAAATTGAAACTATTTACCTAACATACCTATAAAAAACTAAAATGGGGTGAGAAGAATTTGATTGATTTTGGTGTGTTTTATAAATAGTTTTAATTTTTTTTGTTTTTTTTTCTATTTTTGAAAAAAAATCCTTTTAATATGTTTGGAGAGATCTTGAAATAGTTAAAGTCATTAATATTATATTAAAAAAAAATAAAACATGATTTTGATTATTTTAATTAAACAGTTAAATTAAATTTTGAAAAGTAAAAAGCATGATTTAGAATAATTTTTAATGGAGCAAGATTAAAAGTGATTTTTCAGAATTACTCTCAGGTATCAATCATGTATTCTCTATACAAATTAAAGAAATTGATTAACAATATTATTCAGTATTGACAAATAGAAAATGTAATTAGGTCACTAAAACCATCTTTTTAAAGTTAAAATCTTTATATATATATAATTGTTTAGACTATCATTTACGTTTTAGAAATGTTTTCAAAAACTAAACCAAAATTTTATAACTAGAAAAAGTAATTTATTGAAATTTGTTTTAATTTTTAAAATGGACTAAAAATTCAATTATTTTACTTAAGAAAAATGCAAAATATTGTAAGAAAATAGAGAGAAATTAGTTTTAATTTTTAAAAATCAAAGTCCTAGTTTTTTGTTTTAAAAAATTGTCTTTATTTTTAAAAAAATTACGTCTTTCTTAATGAGACATTTGAATTTTTAACCTAATATGAAAACAAAATTAAATTTTTGAAAAAAAATGTACATAAATTGTTAAAATATTGCTAGAAAATAGATAATAATAATAATAATAATAAAACTATAAGCATAGCAAATATTATGTCTTGATTTGCATTCAAATTATATTATGGTTTTTTTGTCTTATCATTTTTGTTTTGTGATGATAGACTCTGCCAAGTGAATTTGTATCTAAATTGTTCAAAGAAGATTCGTCTCAATCAAACAATGGGATACTCACAAGTAATAACAACAATGGAGAAGGTTGTCCTGTTGGATTTGTTCCTATTAGAAGAACATTGAAAGAAGATCTAATTAGGTTAAAATCTCTATCATCCAACAGCAAAAACCAACAATCATCGATGAATCCAGAAGATGATGATCTATCTGGTGATTCTTTTTACGACGCCGTCAGATTTCCTTACTATCAAAATGTAAGTATCCGGATCTTTTAATTTCTTGTATGATTTGTTTCTATTTTTTTGGCTAAATTTTATTTGTAAAGATACATACCTTTGAATTTTATTCATGTTTTAAAAGTTACAATATTATTCTTAATAATCCTTCATATGCTTTTTAAAATTACTATCCTTAAAACTAGATAATTGTTAGATATCGTCACTAAAAGTAAATATGTTTGAAAGTACGTACCATTCTTTTCAAACTTCACAACCCCATCAAGCG
TCTTAGTTACGTCACTTCCAAATCGTTCTAGGTTTGTATTTCAAGAATAGTTTTGAAACGTTGCAACTTCGGAAAGAATAATATCATTTTTTAAAAAAAGTAAAGAGTAAAGTTTAAGATATGTTGTTATAGGTTAAGTATGATGAAGGACTGTGGTTTCAACGGGTTATTATTCAAAACTAACCCTAATCATCGAAAAGAATAATGCATATATAGTATTTTGTTATATATATAGTTGCTAACATTTGTATAAATTGTGATTATTATGGAAAGGTTGTTTCTCATTCGTTGATAAAAGCACAATATTATCATGGAGCCAAAGCTCGAATTGCTGTGCACAATGTGAGTTTGAGTGACAATGGTCAATCTTCTTCGGCTAACATATGGGTTCTTGGTGGCTCTGATGATTCTCTTAATGTTCTTATGGCTGGTTGGCAGGTATACACATCTTTCATTTTCCATTTTAACTTTGTCAAATTCTCTTCTTCTTTTGTTCTCTCTATGCAATTCAATATTGGAACTTTTATGTGAATAAGTTAATTTATTTGATAATTATTGGGCGTTTTTTAAAGTTAAGTTTATGAATTAATACTGGTTATAATTAGATAAATTTAAAGTTTTCAATTTCAAAACTAATTGAATAACAAGAGAAAAGTTGTTTATATATATATATATTTTGTAGGAATCGGGGAAAAGGTGAGAAATTCAACTAAACATTTTTTAGATCATATATAAATAATTTGATAGGTGTAAGTATTTACTTTATTGTGAAATTTGAGTAATTTTTTTCAACAAATTTAGATTGTCGAGATTGGATGAATCATATAAAAAAAGTTAGTAAGAAGATTTTTATTAAAATCTCTTAGCAGTCATGATCGAAAGGAGTTTGCAAGAATAGCATAATGATATATGATAATAGAGTCCATGTCACATACAATTTTTAAATTGTAAAAATGACAAATTTAAAAGCAGATTACTGTATAAGATTATCGCCAATTATTAGTCATATTGACCAAATTTGCAATATGAAAAAAAAAATGTAATATTAACGGTTTCTTTTCTATTTTTTTTATCTGACAATTTTCCATTTATTTAAGTTAATTTACCTTTCAAACAAACAAAATGGGAAGTCAAATTTTAAAAATAAAAAACTAGAAACATGATTGTTATCAAAGGATGAAATAATTACAATAATTACAACTTTTTTATAACTTATACTACTAAAATTTAATGAGGATTTTCTTTTTTTTGCATTTTAGGTGAATCCAGCTGTAAATGGTGATAATCTCCCTCGAACCTTCGTGTATTGGACGGTAAGTGCACGAATATTTTTTTCATCATAGTTTCATTATTATTAGTCTAATCAAATTGTGACTATTATTGAACTAGAAATTTTGAACCAATTATCAATTGTCTTTGTTTATCAAATTAATGAGAGAGAAAATATAAATAATTGAGTTTGAGACTAATTAAAAGTTGGAATGTGTGAATACATAACCTAGCAAATTCTAACAACGCCTTGTGTTTTTATCATGATTATTTTGGTCAAATTAAAAATTATTTATTTGATCTAAAGTTCAATTAGACAAAAAAAATAAAAAATTATTAAACCAAAATGCAAAATGCTAATCATGGATCAACCAAGCCCAAAACCTTGAAACCCATTAGAGAGAATTTTAGAAATTTTTACTCCAAAAGGCTAGAGAGAATTCTTCCAACAACCATAAGATATTCTTCGACACATGTTTGCTCCCTCAAATCAAGTATACACATTTGAAAGTTTTCTTAAAACACAATTAATTACAAATTATATAATATACAACTTCTTTTTTTCTTTAGGTTTTGTTAGTCAAATTTTTGCTTCATCTCTCTAACTCTAAGTTTTGAGTTTAGTTTCTATTTGGTTACTAAATTTCAATAGTATATTACATTATTACTGTTGTAATAGGTTGACACAGGTGTTACAACAGGATGTTACAATATGCTATGTCAAGGATTCG
TGTTGGTAAATCCAAATATTCACGTAGGCAGTAGTATTCTTCCAGCCTCCATCTATCAAGGACAACAATATGACTATCAATTTAGTATCGTCCAGGTAATGCAATTATTTTCAAACGATAGCTCATACACATTCAACATATTTTCATTTACATTATTTGCATGAAAAATGTCGATATCAAGATCTAAATAGATGCTATGGAAATATGAAAGTAGTATAAAAAATTTGGAGATTGATGAATAGTTTCAAGAAATCATTAGAGAATTTATATATAAAAGAAAATTTTAAATTTAACAAAATTGTATTTCATCTTTATTAAAGTAATTAAGTTCAAATTTATCTATAATATTAAATTATAAATTTAGTTCATAAACTCAGGAGAGATTATATCTATTTGATTCTTAATTCTTTCTAAAAATTTTAAGAACAGTTAGATAATACAATTGAAAATTTATAGTTCTAATTAGACACTATATTCCAATTTATATCTAGGCTAATTAATTTAATTTATATCTTATCAGGCACAAAAAATATATGTTAAAAGTTAATAACCTATTATTTTAATCTAATATACATTATTAGGCTCGTATTTGTGATATTTTGTTCATATAGTCACTCATACGTTTAAATATCAATCCATTTGCTTCATTTTGGTGGATGTTCTTTCAAAATATTAAAATTTTGATCTTTTGAACTTTTAAGAATAATCATTTTTGAAAAAGATATATAAAAATAAAGGAATAAAGTTTTGAACCTTTTTATGGAAAATGAATTCATTTTTTTTACGTTTCTTCTGATCTCCTCTAGCTTCTCTCTTTAAATTAATCGTTTTTATGGAAAGAAATCTTTGGAGCTAGGGTGTGATTAGTGTATCAACTTAGTTGAAATATTTTGGTACATCTATTAACTCCTCACCTCCTACGTATCTTTCTCTAAAAGATAAACCAAAATAAAAATATAGATATTAAACCAATATTCAAATGATCTTAAATTCATTTTACGTTACCATAAATTTTCTATAATACAATTCAAATGTGTTGACACTATCTCTCTATATATTAATATAGAACCTATACAGACATATCGAAAGAAATTTAATATTGCAATCATACGTAAATTACATGTAGAGTAATTATAACTAGAGATAGCAATTTAAGGGATAATAATCAAGTGTAGAACAATATATATGTAAAAAAATTCTATAGCTGATAGACTACACCGATATACAAGTTTATCAACGAACAATATATGCTAATATTTTGATTCTAATTGTTGTATTTTGGTATTATTGTCGGTTACACATGTATTTGTTTGATATGTAGGCTATAGGGCATTGGTGGGTTCGAGTAGGTGATAACCAAGTGGGATTAGGATATTGGCCAAACGAGTTGTTTCCAAATCTACTTAGGGGGGCAGATCAAGTTGCATGGGGAGGCAGTGCACAGCCTACACTATATGGTGATGAAAGCCCTCCATTGGGAAGTGGGCACAAGCCAAATGGTAAACCCGATGAAGCCATTTTCGTTAGGAACATACAATACATAGCACCTAACTACATACTCTCAATACCCACTTTGAACAACACAATAAATTATGTGAGTAACTCTTCCTGTTATGATTTGATCTCTAATGAGAATTGTAGTTTTGATCCCTTCAAATATTGCTTCACTTTTGGAGGCCCAGGTGGGCATGGTTGTGAAGCCTCTACTACTTAA

mRNA sequence

ATGGATTATTATTATTCTAAAGCAACATGTTTGGTGATAGTGTTCTTTGTTTGTTTCAATTGCAAATTCAATCATGCCTCTAACCCTAATCTTTCAAGAGAAGAAGACTTGGAGATTGAAAGACAACTCAAACTTCTCAATAAACCATTCATCAAAACATACAAGACGAAGGAAGGAGATATCATTGATTGTGTCGACATCAACAAACAACCTGCCCTCGATCATCCTTTACTAAAGAATCACAAAGTTCAGACTCTGCCAAGTGAATTTGTATCTAAATTGTTCAAAGAAGATTCGTCTCAATCAAACAATGGGATACTCACAAGTAATAACAACAATGGAGAAGGTTGTCCTGTTGGATTTGTTCCTATTAGAAGAACATTGAAAGAAGATCTAATTAGGTTAAAATCTCTATCATCCAACAGCAAAAACCAACAATCATCGATGAATCCAGAAGATGATGATCTATCTGGTGATTCTTTTTACGACGCCGTCAGATTTCCTTACTATCAAAATGTTGTTTCTCATTCGTTGATAAAAGCACAATATTATCATGGAGCCAAAGCTCGAATTGCTGTGCACAATGTGAGTTTGAGTGACAATGGTCAATCTTCTTCGGCTAACATATGGGTTCTTGGTGGCTCTGATGATTCTCTTAATGTTCTTATGGCTGGTTGGCAGGTGAATCCAGCTGTAAATGGTGATAATCTCCCTCGAACCTTCGTGTATTGGACGGTTGACACAGGTGTTACAACAGGATGTTACAATATGCTATGTCAAGGATTCGTGTTGGTAAATCCAAATATTCACGTAGGCAGTAGTATTCTTCCAGCCTCCATCTATCAAGGACAACAATATGACTATCAATTTAGTATCGTCCAGGCTATAGGGCATTGGTGGGTTCGAGTAGGTGATAACCAAGTGGGATTAGGATATTGGCCAAACGAGTTGTTTCCAAATCTACTTAGGGGGGCAGATCAAGTTGCATGGGGAGGCAGTGCACAGCCTACACTATATGGTGATGAAAGCCCTCCATTGGGAAGTGGGCACAAGCCAAATGGTAAACCCGATGAAGCCATTTTCGTTAGGAACATACAATACATAGCACCTAACTACATACTCTCAATACCCACTTTGAACAACACAATAAATTATGTGAGTAACTCTTCCTGTTATGATTTGATCTCTAATGAGAATTGTAGTTTTGATCCCTTCAAATATTGCTTCACTTTTGGAGGCCCAGGTGGGCATGGTTGTGAAGCCTCTACTACTTAA

Coding sequence (CDS)

ATGGATTATTATTATTCTAAAGCAACATGTTTGGTGATAGTGTTCTTTGTTTGTTTCAATTGCAAATTCAATCATGCCTCTAACCCTAATCTTTCAAGAGAAGAAGACTTGGAGATTGAAAGACAACTCAAACTTCTCAATAAACCATTCATCAAAACATACAAGACGAAGGAAGGAGATATCATTGATTGTGTCGACATCAACAAACAACCTGCCCTCGATCATCCTTTACTAAAGAATCACAAAGTTCAGACTCTGCCAAGTGAATTTGTATCTAAATTGTTCAAAGAAGATTCGTCTCAATCAAACAATGGGATACTCACAAGTAATAACAACAATGGAGAAGGTTGTCCTGTTGGATTTGTTCCTATTAGAAGAACATTGAAAGAAGATCTAATTAGGTTAAAATCTCTATCATCCAACAGCAAAAACCAACAATCATCGATGAATCCAGAAGATGATGATCTATCTGGTGATTCTTTTTACGACGCCGTCAGATTTCCTTACTATCAAAATGTTGTTTCTCATTCGTTGATAAAAGCACAATATTATCATGGAGCCAAAGCTCGAATTGCTGTGCACAATGTGAGTTTGAGTGACAATGGTCAATCTTCTTCGGCTAACATATGGGTTCTTGGTGGCTCTGATGATTCTCTTAATGTTCTTATGGCTGGTTGGCAGGTGAATCCAGCTGTAAATGGTGATAATCTCCCTCGAACCTTCGTGTATTGGACGGTTGACACAGGTGTTACAACAGGATGTTACAATATGCTATGTCAAGGATTCGTGTTGGTAAATCCAAATATTCACGTAGGCAGTAGTATTCTTCCAGCCTCCATCTATCAAGGACAACAATATGACTATCAATTTAGTATCGTCCAGGCTATAGGGCATTGGTGGGTTCGAGTAGGTGATAACCAAGTGGGATTAGGATATTGGCCAAACGAGTTGTTTCCAAATCTACTTAGGGGGGCAGATCAAGTTGCATGGGGAGGCAGTGCACAGCCTACACTATATGGTGATGAAAGCCCTCCATTGGGAAGTGGGCACAAGCCAAATGGTAAACCCGATGAAGCCATTTTCGTTAGGAACATACAATACATAGCACCTAACTACATACTCTCAATACCCACTTTGAACAACACAATAAATTATGTGAGTAACTCTTCCTGTTATGATTTGATCTCTAATGAGAATTGTAGTTTTGATCCCTTCAAATATTGCTTCACTTTTGGAGGCCCAGGTGGGCATGGTTGTGAAGCCTCTACTACTTAA

Protein sequence

MDYYYSKATCLVIVFFVCFNCKFNHASNPNLSREEDLEIERQLKLLNKPFIKTYKTKEGDIIDCVDINKQPALDHPLLKNHKVQTLPSEFVSKLFKEDSSQSNNGILTSNNNNGEGCPVGFVPIRRTLKEDLIRLKSLSSNSKNQQSSMNPEDDDLSGDSFYDAVRFPYYQNVVSHSLIKAQYYHGAKARIAVHNVSLSDNGQSSSANIWVLGGSDDSLNVLMAGWQVNPAVNGDNLPRTFVYWTVDTGVTTGCYNMLCQGFVLVNPNIHVGSSILPASIYQGQQYDYQFSIVQAIGHWWVRVGDNQVGLGYWPNELFPNLLRGADQVAWGGSAQPTLYGDESPPLGSGHKPNGKPDEAIFVRNIQYIAPNYILSIPTLNNTINYVSNSSCYDLISNENCSFDPFKYCFTFGGPGGHGCEASTT*
Homology
BLAST of CsGy3G001620 vs. NCBI nr
Match: XP_031738649.1 (uncharacterized protein LOC116402744 [Cucumis sativus] >KAE8650030.1 hypothetical protein Csa_009857 [Cucumis sativus])

HSP 1 Score: 876 bits (2264), Expect = 0.0
Identity = 424/424 (100.00%), Positives = 424/424 (100.00%), Query Frame = 0

Query: 1   MDYYYSKATCLVIVFFVCFNCKFNHASNPNLSREEDLEIERQLKLLNKPFIKTYKTKEGD 60
           MDYYYSKATCLVIVFFVCFNCKFNHASNPNLSREEDLEIERQLKLLNKPFIKTYKTKEGD
Sbjct: 1   MDYYYSKATCLVIVFFVCFNCKFNHASNPNLSREEDLEIERQLKLLNKPFIKTYKTKEGD 60

Query: 61  IIDCVDINKQPALDHPLLKNHKVQTLPSEFVSKLFKEDSSQSNNGILTSNNNNGEGCPVG 120
           IIDCVDINKQPALDHPLLKNHKVQTLPSEFVSKLFKEDSSQSNNGILTSNNNNGEGCPVG
Sbjct: 61  IIDCVDINKQPALDHPLLKNHKVQTLPSEFVSKLFKEDSSQSNNGILTSNNNNGEGCPVG 120

Query: 121 FVPIRRTLKEDLIRLKSLSSNSKNQQSSMNPEDDDLSGDSFYDAVRFPYYQNVVSHSLIK 180
           FVPIRRTLKEDLIRLKSLSSNSKNQQSSMNPEDDDLSGDSFYDAVRFPYYQNVVSHSLIK
Sbjct: 121 FVPIRRTLKEDLIRLKSLSSNSKNQQSSMNPEDDDLSGDSFYDAVRFPYYQNVVSHSLIK 180

Query: 181 AQYYHGAKARIAVHNVSLSDNGQSSSANIWVLGGSDDSLNVLMAGWQVNPAVNGDNLPRT 240
           AQYYHGAKARIAVHNVSLSDNGQSSSANIWVLGGSDDSLNVLMAGWQVNPAVNGDNLPRT
Sbjct: 181 AQYYHGAKARIAVHNVSLSDNGQSSSANIWVLGGSDDSLNVLMAGWQVNPAVNGDNLPRT 240

Query: 241 FVYWTVDTGVTTGCYNMLCQGFVLVNPNIHVGSSILPASIYQGQQYDYQFSIVQAIGHWW 300
           FVYWTVDTGVTTGCYNMLCQGFVLVNPNIHVGSSILPASIYQGQQYDYQFSIVQAIGHWW
Sbjct: 241 FVYWTVDTGVTTGCYNMLCQGFVLVNPNIHVGSSILPASIYQGQQYDYQFSIVQAIGHWW 300

Query: 301 VRVGDNQVGLGYWPNELFPNLLRGADQVAWGGSAQPTLYGDESPPLGSGHKPNGKPDEAI 360
           VRVGDNQVGLGYWPNELFPNLLRGADQVAWGGSAQPTLYGDESPPLGSGHKPNGKPDEAI
Sbjct: 301 VRVGDNQVGLGYWPNELFPNLLRGADQVAWGGSAQPTLYGDESPPLGSGHKPNGKPDEAI 360

Query: 361 FVRNIQYIAPNYILSIPTLNNTINYVSNSSCYDLISNENCSFDPFKYCFTFGGPGGHGCE 420
           FVRNIQYIAPNYILSIPTLNNTINYVSNSSCYDLISNENCSFDPFKYCFTFGGPGGHGCE
Sbjct: 361 FVRNIQYIAPNYILSIPTLNNTINYVSNSSCYDLISNENCSFDPFKYCFTFGGPGGHGCE 420

Query: 421 ASTT 424
           ASTT
Sbjct: 421 ASTT 424
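
The percentages in each HSP summary line follow directly from the counts reported next to them (matching positions divided by alignment length). A small sketch, assuming the summary-line layout shown in this report; the helper name `hsp_percentages` is hypothetical and the sample line is taken from the TYK11502.1 hit:

```python
import re

def hsp_percentages(summary):
    """Recompute identity and positive percentages from an HSP summary line."""
    # 'Pos\w*' tolerates spelling variants of the 'Positives' label.
    pattern = r"Identity\s*=\s*(\d+)/(\d+).*?Pos\w*\s*=\s*(\d+)/(\d+)"
    match = re.search(pattern, summary)
    if match is None:
        raise ValueError("no identity/positives counts found")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return (round(100 * ident / ident_len, 2),
            round(100 * pos / pos_len, 2))

line = "Identity = 357/411 (86.86%), Positives = 380/411 (92.46%), Query Frame = 0"
identity_pct, positives_pct = hsp_percentages(line)
print(identity_pct, positives_pct)
```

For this hit the recomputed values agree with the reported 86.86% identity and 92.46% positives. Note that the denominator is the alignment length (411), not the full query length (424), so the percentages describe only the aligned region.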

BLAST of CsGy3G001620 vs. NCBI nr
Match: TYK11502.1 (neprosin 2 [Cucumis melo var. makuwa])

HSP 1 Score: 723 bits (1867), Expect = 6.19e-258
Identity = 357/411 (86.86%), Positives = 380/411 (92.46%), Query Frame = 0

Query: 14  VFFVCFNCKFNHASNPNLSREEDLEIERQLKLLNKPFIKTYKTKEGDIIDCVDINKQPAL 73
           +FFVCFNCKFNHASNPNLSREE+LEIERQLKLLNKPFIKTYKTKEGDIIDCVDINKQPAL
Sbjct: 211 LFFVCFNCKFNHASNPNLSREEELEIERQLKLLNKPFIKTYKTKEGDIIDCVDINKQPAL 270

Query: 74  DHPLLKNHKVQTLPSEFVSKLFKEDS-SQSNNGILTSNNNNGEGCPVGFVPIRRTLKEDL 133
           DHPLLKNHKVQTLPSEF+SKLFKEDS SQSNNGILTSNNNNGEGCP+GFVPIRRTLKEDL
Sbjct: 271 DHPLLKNHKVQTLPSEFISKLFKEDSISQSNNGILTSNNNNGEGCPIGFVPIRRTLKEDL 330

Query: 134 IRLKSLSSNSKNQQSSMNPEDDDLSGDSFYDAVRFPYYQNVVSHSLIKAQYYHGAKARIA 193
           IRLKSLSSN K Q+SSM P+DD  S D  +DAVRFPY QNVVSHSLIKA YYHGAKARIA
Sbjct: 331 IRLKSLSSNYKKQESSMKPQDDQ-SKDFSHDAVRFPYDQNVVSHSLIKATYYHGAKARIA 390

Query: 194 VHNVSLSDNGQSSSANIWVLGGSDDSLNVLMAGWQVNPAVNGDNLPRTFVYWTVDTGVTT 253
           V+NVSLSD  QSSSANIWV+GG D+SLNVLMA      AV+GD+LPRTFVYWT D G TT
Sbjct: 391 VYNVSLSDENQSSSANIWVVGGPDESLNVLMA------AVSGDSLPRTFVYWTTDRGATT 450

Query: 254 GCYNMLCQGFVLVNPNIHVGSSILPASIYQGQQYDYQFSIVQAIGHWWVRVGDNQVGLGY 313
           GCYNMLCQGFVLVNP+I VGSSILPASIYQG+QYDYQFSIVQAIGHWWVRVGD+QVGLGY
Sbjct: 451 GCYNMLCQGFVLVNPDIPVGSSILPASIYQGKQYDYQFSIVQAIGHWWVRVGDDQVGLGY 510

Query: 314 WPNELFPNLLRGADQVAWGGSAQPTLYGDESPPLGSGHKPNGKPDEAIFVRNIQYIAPNY 373
           WP+ELFPNLLRGA+QVAWGGSA+P+LY DESPPLGSGHKPNG+PDEA FVRNIQYIA NY
Sbjct: 511 WPSELFPNLLRGAEQVAWGGSAEPSLYSDESPPLGSGHKPNGRPDEACFVRNIQYIASNY 570

Query: 374 ILSIPTLNNTINYVSNSSCYDLISNENCSFDPFKYCFTFGGPGGHGCEAST 423
           ILSIPTL+NTINYVS+SSCYDLISNENC FDPFKYCFTFGGPGG  C A+T
Sbjct: 571 ILSIPTLDNTINYVSSSSCYDLISNENCDFDPFKYCFTFGGPGGQDCAATT 614

BLAST of CsGy3G001620 vs. NCBI nr
Match: KAE8650029.1 (hypothetical protein Csa_011504 [Cucumis sativus])

HSP 1 Score: 533 bits (1374), Expect = 7.30e-186
Identity = 283/423 (66.90%), Positives = 329/423 (77.78%), Query Frame = 0

Query: 22  KFNHASNPNLSREEDLEIERQLKLLNKPFIKTYK------------TKEGDIIDCVDINK 81
           K + ASN  LSREE+LEIE  LKLLNKP IKTYK            TKEGDIIDCVDINK
Sbjct: 12  KISEASNSKLSREEELEIEEHLKLLNKPSIKTYKSTSRYTRACSYQTKEGDIIDCVDINK 71

Query: 82  QPALDHPLLKNHKVQTLPSEFVSKLFKEDSSQSNNGILT--SNNNNGE-GCPVGFVPIRR 141
           QPALDHPLLKNHKVQTLPS +VSKLFK+DSSQ+NNGI T  SNNNNGE GCP GFVPIRR
Sbjct: 72  QPALDHPLLKNHKVQTLPSGYVSKLFKKDSSQANNGISTLPSNNNNGEEGCPNGFVPIRR 131

Query: 142 TLKEDLIRLKSLSSNSKNQQSSMNPEDDDLSGDSFYDAVRFPYYQNVVSHSLIKA-QYYH 201
           TLK+DLIRLKSLSSN+KNQQSSMNP+DD  S D F D+V+FPYYQNVVSHSL K  + Y+
Sbjct: 132 TLKKDLIRLKSLSSNNKNQQSSMNPQDDQ-SDDFFDDSVKFPYYQNVVSHSLEKGTEKYY 191

Query: 202 GAKARIAVHNVSLSDNGQSSSANIWVLGGSDDSLNVLMAGWQVNPAVNGDNLPRTFVYWT 261
           G K+ ++V+NVSLS + QSSS NIW++GG  DSL VLM GW VNP VNGD + R+FVYWT
Sbjct: 192 GTKSYMSVYNVSLSFD-QSSSTNIWIVGGPVDSLGVLMTGWLVNPEVNGDFVTRSFVYWT 251

Query: 262 VDTGVTTGCYNMLCQGFVLVNPNIHVGSSILPASIYQGQQYDYQFSIVQAIGHWWVRVGD 321
            D G TTGCYNM CQGFV VNP+ HVG+ +LP S Y+GQQYDYQF+I+Q  G+WWV VG+
Sbjct: 252 ADGGTTTGCYNMYCQGFVQVNPSHHVGAPLLPTSTYEGQQYDYQFTIIQIEGNWWVLVGE 311

Query: 322 NQVGLGYWPNELFPNLLRGADQVAWGGSAQPTLYGDESPPLGSGHKPN--GKPDEAIFVR 381
           N +GLGYWP EL  NL+ GADQ+AWGG AQP++ G  SP LGSGHKPN  G  +E  ++R
Sbjct: 312 N-LGLGYWPKELIQNLVDGADQIAWGGIAQPSIDG-VSPMLGSGHKPNENGDYNEGCYIR 371

Query: 382 NIQYIAP----NYILSIPTLNNTINYVSNSSCYDLISNENCSFDPFKYCFTFGGPGGHGC 422
           NIQ I+      Y+L  PT +NT++Y SN+SCYDL  N NC +D  +YCFTFGGPGG  C
Sbjct: 372 NIQIISGAATNTYVL--PTWDNTLSYSSNTSCYDLNPNVNCGYDMMEYCFTFGGPGGPNC 428

BLAST of CsGy3G001620 vs. NCBI nr
Match: XP_031738648.1 (uncharacterized protein LOC105435061 [Cucumis sativus])

HSP 1 Score: 377 bits (968), Expect = 1.42e-125
Identity = 210/406 (51.72%), Positives = 248/406 (61.08%), Query Frame = 0

Query: 24  NHASNPNLSREEDLEIERQLKLLNKPFIKTYKTKEGDIIDCVDINKQPALDHPLLKNHKV 83
           + ASN  LSREE+LEIE  LKLLNKP IKTYKTKEGDIIDCVDINKQPALDHPLLKNHKV
Sbjct: 30  SEASNSKLSREEELEIEEHLKLLNKPSIKTYKTKEGDIIDCVDINKQPALDHPLLKNHKV 89

Query: 84  QTLPSEFVSKLFKEDSSQSNNGILTSNNNNGEGCPVGFVPIRRTLKEDLIRLKSLSSNSK 143
           Q                                                           
Sbjct: 90  Q----------------------------------------------------------- 149

Query: 144 NQQSSMNPEDDDLSGDSFYDAVRFPYYQNVVSHSLIKA-QYYHGAKARIAVHNVSLSDNG 203
                                        VVSHSL K  + Y+G K+ ++V+NVSLS + 
Sbjct: 150 -----------------------------VVSHSLEKGTEKYYGTKSYMSVYNVSLSFD- 209

Query: 204 QSSSANIWVLGGSDDSLNVLMAGWQVNPAVNGDNLPRTFVYWTVDTGVTTGCYNMLCQGF 263
           QSSS NIW++GG  DSL VLM GW VNP VNGD + R+FVYWT D G TTGCYNM CQGF
Sbjct: 210 QSSSTNIWIVGGPVDSLGVLMTGWLVNPEVNGDFVTRSFVYWTADGGTTTGCYNMYCQGF 269

Query: 264 VLVNPNIHVGSSILPASIYQGQQYDYQFSIVQAIGHWWVRVGDNQVGLGYWPNELFPNLL 323
           V VNP+ HVG+ +LP S Y+GQQYDYQF+I+Q  G+WWV VG+N +GLGYWP EL  NL+
Sbjct: 270 VQVNPSHHVGAPLLPTSTYEGQQYDYQFTIIQIEGNWWVLVGEN-LGLGYWPKELIQNLV 329

Query: 324 RGADQVAWGGSAQPTLYGDESPPLGSGHKPN--GKPDEAIFVRNIQYIAP----NYILSI 383
            GADQ+AWGG AQP++ G  SP LGSGHKPN  G  +E  ++RNIQ I+      Y+L  
Sbjct: 330 DGADQIAWGGIAQPSIDG-VSPMLGSGHKPNENGDYNEGCYIRNIQIISGAATNTYVL-- 342

Query: 384 PTLNNTINYVSNSSCYDLISNENCSFDPFKYCFTFGGPGGHGCEAS 422
           PT +NT++Y SN+SCYDL  N NC +D  +YCFTFGGPGG  CEA+
Sbjct: 390 PTWDNTLSYSSNTSCYDLNPNVNCGYDMMEYCFTFGGPGGPNCEAT 342

BLAST of CsGy3G001620 vs. NCBI nr
Match: KAA0053047.1 (uncharacterized protein E6C27_scaffold344G001630 [Cucumis melo var. makuwa])

HSP 1 Score: 367 bits (941), Expect = 6.09e-120
Identity = 212/448 (47.32%), Positives = 238/448 (53.12%), Query Frame = 0

Query: 154 DDLSGDSFYDAVRFPYYQNVVSHSLIKA-QYYHGAKARIAVHNVSLSDNGQSSSANIWVL 213
           DD S D F D+V++P  QNVVSHSL K  + Y+G K+ ++V+NVSLS  GQSSS+NIW++
Sbjct: 5   DDQSDDFFDDSVKYPDNQNVVSHSLKKGPEKYYGTKSYMSVYNVSLSF-GQSSSSNIWIV 64

Query: 214 GGSDDSLNVLMAGWQVNPAVNGDNLPRTFVYWTVD------------------------- 273
           GG  +SL VLM GW VNP VNGD + R+FVYWT D                         
Sbjct: 65  GGPTNSLGVLMTGWLVNPEVNGDFITRSFVYWTADGGATTGCYNMYCQGFVQVNPSHHVG 124

Query: 274 ------------------------------------------------------------ 333
                                                                       
Sbjct: 125 APLHPTSTYQGQQYDYQFTIIQIAGNWWVLVGENLGLGYWPKELVQNLVDGAEQIAWGGI 184

Query: 334 ------------------------------------------------------------ 393
                                                                       
Sbjct: 185 AKPSIDGMSPMLGSGHKPNDNGDYNEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTS 244

Query: 394 --------------------------------TGVTTGCYNMLCQGFVLVNPNIHVGSSI 423
                                            G TTGCYNMLCQGFVLVNP+I VGSSI
Sbjct: 245 CYDLNPNVNCGDDMMEYCFTFGGPGGPNCETDRGATTGCYNMLCQGFVLVNPDIPVGSSI 304

BLAST of CsGy3G001620 vs. ExPASy TrEMBL
Match: A0A5D3CJM0 (Neprosin 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold139G001660 PE=4 SV=1)

HSP 1 Score: 723 bits (1867), Expect = 3.00e-258
Identity = 357/411 (86.86%), Positives = 380/411 (92.46%), Query Frame = 0

Query: 14  VFFVCFNCKFNHASNPNLSREEDLEIERQLKLLNKPFIKTYKTKEGDIIDCVDINKQPAL 73
           +FFVCFNCKFNHASNPNLSREE+LEIERQLKLLNKPFIKTYKTKEGDIIDCVDINKQPAL
Sbjct: 211 LFFVCFNCKFNHASNPNLSREEELEIERQLKLLNKPFIKTYKTKEGDIIDCVDINKQPAL 270

Query: 74  DHPLLKNHKVQTLPSEFVSKLFKEDS-SQSNNGILTSNNNNGEGCPVGFVPIRRTLKEDL 133
           DHPLLKNHKVQTLPSEF+SKLFKEDS SQSNNGILTSNNNNGEGCP+GFVPIRRTLKEDL
Sbjct: 271 DHPLLKNHKVQTLPSEFISKLFKEDSISQSNNGILTSNNNNGEGCPIGFVPIRRTLKEDL 330

Query: 134 IRLKSLSSNSKNQQSSMNPEDDDLSGDSFYDAVRFPYYQNVVSHSLIKAQYYHGAKARIA 193
           IRLKSLSSN K Q+SSM P+DD  S D  +DAVRFPY QNVVSHSLIKA YYHGAKARIA
Sbjct: 331 IRLKSLSSNYKKQESSMKPQDDQ-SKDFSHDAVRFPYDQNVVSHSLIKATYYHGAKARIA 390

Query: 194 VHNVSLSDNGQSSSANIWVLGGSDDSLNVLMAGWQVNPAVNGDNLPRTFVYWTVDTGVTT 253
           V+NVSLSD  QSSSANIWV+GG D+SLNVLMA      AV+GD+LPRTFVYWT D G TT
Sbjct: 391 VYNVSLSDENQSSSANIWVVGGPDESLNVLMA------AVSGDSLPRTFVYWTTDRGATT 450

Query: 254 GCYNMLCQGFVLVNPNIHVGSSILPASIYQGQQYDYQFSIVQAIGHWWVRVGDNQVGLGY 313
           GCYNMLCQGFVLVNP+I VGSSILPASIYQG+QYDYQFSIVQAIGHWWVRVGD+QVGLGY
Sbjct: 451 GCYNMLCQGFVLVNPDIPVGSSILPASIYQGKQYDYQFSIVQAIGHWWVRVGDDQVGLGY 510

Query: 314 WPNELFPNLLRGADQVAWGGSAQPTLYGDESPPLGSGHKPNGKPDEAIFVRNIQYIAPNY 373
           WP+ELFPNLLRGA+QVAWGGSA+P+LY DESPPLGSGHKPNG+PDEA FVRNIQYIA NY
Sbjct: 511 WPSELFPNLLRGAEQVAWGGSAEPSLYSDESPPLGSGHKPNGRPDEACFVRNIQYIASNY 570

Query: 374 ILSIPTLNNTINYVSNSSCYDLISNENCSFDPFKYCFTFGGPGGHGCEAST 423
           ILSIPTL+NTINYVS+SSCYDLISNENC FDPFKYCFTFGGPGG  C A+T
Sbjct: 571 ILSIPTLDNTINYVSSSSCYDLISNENCDFDPFKYCFTFGGPGGQDCAATT 614

BLAST of CsGy3G001620 vs. ExPASy TrEMBL
Match: A0A0A0L400 (Neprosin domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G011710 PE=4 SV=1)

HSP 1 Score: 580 bits (1495), Expect = 5.03e-207
Identity = 276/276 (100.00%), Positives = 276/276 (100.00%), Query Frame = 0

Query: 149 MNPEDDDLSGDSFYDAVRFPYYQNVVSHSLIKAQYYHGAKARIAVHNVSLSDNGQSSSAN 208
           MNPEDDDLSGDSFYDAVRFPYYQNVVSHSLIKAQYYHGAKARIAVHNVSLSDNGQSSSAN
Sbjct: 1   MNPEDDDLSGDSFYDAVRFPYYQNVVSHSLIKAQYYHGAKARIAVHNVSLSDNGQSSSAN 60

Query: 209 IWVLGGSDDSLNVLMAGWQVNPAVNGDNLPRTFVYWTVDTGVTTGCYNMLCQGFVLVNPN 268
           IWVLGGSDDSLNVLMAGWQVNPAVNGDNLPRTFVYWTVDTGVTTGCYNMLCQGFVLVNPN
Sbjct: 61  IWVLGGSDDSLNVLMAGWQVNPAVNGDNLPRTFVYWTVDTGVTTGCYNMLCQGFVLVNPN 120

Query: 269 IHVGSSILPASIYQGQQYDYQFSIVQAIGHWWVRVGDNQVGLGYWPNELFPNLLRGADQV 328
           IHVGSSILPASIYQGQQYDYQFSIVQAIGHWWVRVGDNQVGLGYWPNELFPNLLRGADQV
Sbjct: 121 IHVGSSILPASIYQGQQYDYQFSIVQAIGHWWVRVGDNQVGLGYWPNELFPNLLRGADQV 180

Query: 329 AWGGSAQPTLYGDESPPLGSGHKPNGKPDEAIFVRNIQYIAPNYILSIPTLNNTINYVSN 388
           AWGGSAQPTLYGDESPPLGSGHKPNGKPDEAIFVRNIQYIAPNYILSIPTLNNTINYVSN
Sbjct: 181 AWGGSAQPTLYGDESPPLGSGHKPNGKPDEAIFVRNIQYIAPNYILSIPTLNNTINYVSN 240

Query: 389 SSCYDLISNENCSFDPFKYCFTFGGPGGHGCEASTT 424
           SSCYDLISNENCSFDPFKYCFTFGGPGGHGCEASTT
Sbjct: 241 SSCYDLISNENCSFDPFKYCFTFGGPGGHGCEASTT 276

BLAST of CsGy3G001620 vs. ExPASy TrEMBL
Match: A0A5A7UEV4 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold344G001630 PE=4 SV=1)

HSP 1 Score: 367 bits (941), Expect = 2.95e-120
Identity = 212/448 (47.32%), Positives = 238/448 (53.12%), Query Frame = 0

Query: 154 DDLSGDSFYDAVRFPYYQNVVSHSLIKA-QYYHGAKARIAVHNVSLSDNGQSSSANIWVL 213
           DD S D F D+V++P  QNVVSHSL K  + Y+G K+ ++V+NVSLS  GQSSS+NIW++
Sbjct: 5   DDQSDDFFDDSVKYPDNQNVVSHSLKKGPEKYYGTKSYMSVYNVSLSF-GQSSSSNIWIV 64

Query: 214 GGSDDSLNVLMAGWQVNPAVNGDNLPRTFVYWTVD------------------------- 273
           GG  +SL VLM GW VNP VNGD + R+FVYWT D                         
Sbjct: 65  GGPTNSLGVLMTGWLVNPEVNGDFITRSFVYWTADGGATTGCYNMYCQGFVQVNPSHHVG 124

Query: 274 ------------------------------------------------------------ 333
                                                                       
Sbjct: 125 APLHPTSTYQGQQYDYQFTIIQIAGNWWVLVGENLGLGYWPKELVQNLVDGAEQIAWGGI 184

Query: 334 ------------------------------------------------------------ 393
                                                                       
Sbjct: 185 AKPSIDGMSPMLGSGHKPNDNGDYNEGCYIRNIQIISGAATNTYKLPTWDNTLSYSSNTS 244

Query: 394 --------------------------------TGVTTGCYNMLCQGFVLVNPNIHVGSSI 423
                                            G TTGCYNMLCQGFVLVNP+I VGSSI
Sbjct: 245 CYDLNPNVNCGDDMMEYCFTFGGPGGPNCETDRGATTGCYNMLCQGFVLVNPDIPVGSSI 304

BLAST of CsGy3G001620 vs. ExPASy TrEMBL
Match: A0A6J1CVJ6 (uncharacterized protein LOC111014777 OS=Momordica charantia OX=3673 GN=LOC111014777 PE=4 SV=1)

HSP 1 Score: 338 bits (868), Expect = 1.41e-110
Identity = 186/411 (45.26%), Positives = 247/411 (60.10%), Query Frame = 0

Query: 11  LVIVFFVCFNCKFNHASNPNLSREEDLEIERQLKLLNKPFIKTYKTKEGDIIDCVDINKQ 70
           L+IV  +  NCK + A + NLSREE+LE+E QLKLLN+PFI T++T+EGDIIDCVDINKQ
Sbjct: 9   LMIVLLLHLNCKGSLAFDSNLSREEELELEMQLKLLNRPFITTFQTEEGDIIDCVDINKQ 68

Query: 71  PALDHPLLKNHKVQTLPSEFVSKLFKEDSSQSNNGILTSNNNNGEGCPVGFVPIRRTLKE 130
           PALDHP LK+HK+QT PS +   L K+ SS  +   +   NNN   CP G+VPIRRT+K+
Sbjct: 69  PALDHPSLKSHKIQTRPSTYPFGLSKDSSSSRDKSFI---NNNNRACPAGYVPIRRTIKK 128

Query: 131 DLIRLKSLSSNSKNQQSSMNPEDDDLSGDSFYDAVRFPYYQNVVSHSLIKAQYYHGAKAR 190
           DLIR++SLSS       +           S    V FPY Q+VVS ++ K   Y+GA   
Sbjct: 129 DLIRIRSLSSKEPTGIKT-----------SIKGGVDFPYNQDVVSVAMKKGIKYYGASGS 188

Query: 191 IAVHNVSLSDNGQSSSANIWVLGGSDDSLNVLMAGWQVNPAVNGDNLPRTFVYWTVDTGV 250
           ++V+N+S++ + QSSS+NIW++GG   + NV++AGWQVNP +NGD+L R FVYWT     
Sbjct: 189 VSVYNLSVAQD-QSSSSNIWIIGGPPQAPNVILAGWQVNPMINGDSLTRMFVYWTD---- 248

Query: 251 TTGCYNMLCQGFVLVNPNIHVGSSILPASIYQGQQYDYQFSIVQAIGHWWVRVGDNQVGL 310
                                                      +  G+WW+ VG++   +
Sbjct: 249 -------------------------------------------RPTGNWWLAVGESHKTI 308

Query: 311 GYWPNELFPNLLRGADQVAWGGSAQPTLYGDESPPLGSGHKPN-GKPDEAIFVRNIQYIA 370
           GYWP ELF +L  G +QVAWGG A+P+  G  SPPLG+GHKPN  K D+A + R + Y+ 
Sbjct: 309 GYWPKELFGHLNDGTEQVAWGGIAKPSPNG-MSPPLGNGHKPNYSKYDDACYFRYMNYVD 356

Query: 371 PNYILSIPTLNNTINYVSNSSCYDLISNENCSFDPFKYCFTFGGPGGHGCE 420
            N     P   NT NY+SN+SCY L + E C  + F YC TFGGPGG+ C 
Sbjct: 369 ENNKGQFPANENTANYLSNNSCYALDNRETCGGEFFYYCITFGGPGGNNCS 356

BLAST of CsGy3G001620 vs. ExPASy TrEMBL
Match: A0A059CIH2 (Uncharacterized protein OS=Eucalyptus grandis OX=71139 GN=EUGRSUZ_D02193 PE=4 SV=1)

HSP 1 Score: 278 bits (712), Expect = 1.26e-86
Identity = 169/413 (40.92%), Positives = 236/413 (57.14%), Query Frame = 0

Query: 14  VFFVCFNCKFNHASNPNLSREEDLEIERQLKLLNKPFIKTYKTKEGDIIDCVDINKQPAL 73
           V FV      + +   N+S+++D+++E QLKLLNKP IKT+ T+EGDIIDC+DI+KQPA+
Sbjct: 6   VAFVLVLLSVSRSKATNISKDDDIDLEEQLKLLNKPPIKTFLTEEGDIIDCIDIDKQPAI 65

Query: 74  DHPLLKNHKVQTLPSEFVSKLFKEDSSQSNNGILTSNNNNGEGCPVGFVPIRRTLKEDLI 133
           DHPLLKNHK+Q  P   +S   K  S+       +      + CP+G VPI+R  KEDLI
Sbjct: 66  DHPLLKNHKIQRKPKLPLSNFSKTSSATKYIRFRSR-----KPCPIGTVPIQRIKKEDLI 125

Query: 134 RLKSLSSNSKNQQSSMNPEDDDLSGDSFYDAVRFPYYQNVVSHS---LIKAQYYHGAKAR 193
           R +S+            P+   ++     DA     ++  +SH    LIK    +GA   
Sbjct: 126 RTRSI------------PKMPSVNMVEIEDAPPSGQHRVFLSHDHTYLIK----YGASGY 185

Query: 194 IAVHNVSLSDNGQSSSANIWVLGGSDDSLNVLMAGWQVNPAVNGDNLPRTFVYWTVDTGV 253
           I+V+N+S + + Q SS NIW+  G  D +++++AGW+V+P +N D L R F YWT D G 
Sbjct: 186 ISVYNISTALD-QFSSHNIWIETGPPDHISMIVAGWRVDPLLNADGLTRLFTYWTGD-GF 245

Query: 254 TTGCYNMLCQGFVLVNPNIHVGSSILPASIYQGQQYDYQFSIVQ--AIGHWWVRVGDNQV 313
             GCYN  CQGFV V+  I     + P S Y G  Y+ +  + Q  A G+WW+RV D  +
Sbjct: 246 RDGCYNTFCQGFVQVDRVITPNYPLTPVSTYGGPIYELKIEVSQDIATGNWWLRVHDPPI 305

Query: 314 GLGYWPNELFPNLLRGADQVAWGGSAQPTLYGDESPPLGSGHKPNGKPDEAIFVRNIQYI 373
            +GYWP ELF NL  G+   AWGG A+    G   PP+G+GH P+   D+A + R +Q++
Sbjct: 306 NVGYWPKELFVNLRNGSLHAAWGGVAKEGANG-YCPPMGNGHMPDVYTDKAAYFRKVQWM 365

Query: 374 APNYILSIPTLNNTINYVSNSSCYDLISNENCSFDPFKYCFTFGGPGGHGCEA 421
             N   S     N    V   SCY+L+ N     DP+ Y FTFGGPGG+ C A
Sbjct: 366 NANGE-SFHPYKNLPKVVDTPSCYNLL-NLKLMRDPWGYLFTFGGPGGY-CRA 391

BLAST of CsGy3G001620 vs. TAIR 10
Match: AT1G55360.1 (Protein of Unknown Function (DUF239))

HSP 1 Score: 241.5 bits (615), Expect = 1.2e-63
Identity = 143/408 (35.05%), Positives = 220/408 (53.92%), Query Frame = 0

Query: 24  NHASNPNLSREEDLEIERQLKLLNKPFIKTYKTKEGDIIDCVDINKQPALDHPLLKNHKV 83
           ++A+   +S+++  E+++ L  LNKP +K+ ++ +GD+IDCV I+KQPA DHP LK+HK+
Sbjct: 29  SYAARSGVSKQK-FEVKKHLNRLNKPAVKSIQSSDGDVIDCVPISKQPAFDHPFLKDHKI 88

Query: 84  QTLPSEFVSKLFKEDSSQSNNGILTSNNNNGEG-----------CPVGFVPIRRTLKEDL 143
           Q  P+     LF       +N +    +N  EG           C  G +P+RRT ++D+
Sbjct: 89  QMKPNYHPEGLF------DDNKVSAPKSNEKEGHIPQLWHRYGKCSEGTIPMRRTKEDDV 148

Query: 144 IRLKSLSSNSKNQQSSMNPEDDDLSGDSFYDAVRFPYYQNVVSHSLIKAQYYHGAKARIA 203
           +R  S+    K ++ S+      L   +  D +    +Q+ +++  ++   Y+GAKA I 
Sbjct: 149 LRASSVKRYGKKKRRSV-----PLPKSAEPDLINQSGHQHAIAY--VEGDKYYGAKATIN 208

Query: 204 VHNVSLSDNGQSSSANIWVLGGS-DDSLNVLMAGWQVNPAVNGDNLPRTFVYWTVDTGVT 263
           V    +    + S + IW+LGGS    LN + AGWQV+P + GDN  R F YWT D    
Sbjct: 209 VWEPKIQQQNEFSLSQIWLLGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQA 268

Query: 264 TGCYNMLCQGFVLVNPNIHVGSSILPASIYQGQQYDYQFSIVQ--AIGHWWVRVGDNQVG 323
           TGCYN+LC GF+ +N +I +G+SI P S Y+  QYD    I +    GHWW++ G+  V 
Sbjct: 269 TGCYNLLCSGFIQINSDIAMGASISPVSGYRNSQYDISILIWKDPKEGHWWMQFGNGYV- 328

Query: 324 LGYWPNELFPNLLRGADQVAWGGSAQPTLYGDE--SPPLGSGHKPNGKPDEAIFVRNIQY 383
           LGYWP+ LF  L   A  + WGG    +    +  S  +GSG  P     +A + RNIQ 
Sbjct: 329 LGYWPSFLFSYLTESASMIEWGGEVVNSQSDGQHTSTQMGSGKFPEEGFSKASYFRNIQV 388

Query: 384 IAPNYILSIPTLNNTINYVSNSSCYDLISNENCSFDPFKYCFTFGGPG 416
           +  +  L  P    T  +   S+CYD+ +  N   D + + F +GGPG
Sbjct: 389 VDGSNNLKAPKGLGT--FTEQSNCYDVQTGSN---DDWGHYFYYGGPG 416

BLAST of CsGy3G001620 vs. TAIR 10
Match: AT5G56530.1 (Protein of Unknown Function (DUF239))

HSP 1 Score: 240.4 bits (612), Expect = 2.7e-63
Identity = 147/392 (37.50%), Positives = 214/392 (54.59%), Query Frame = 0

Query: 35  EDLEIERQLKLLNKPFIKTYKTKEGDIIDCVDINKQPALDHPLLKNHKVQTLPSEFVSKL 94
           ++ E+ + L  LNKP +K+ ++ +GDIIDCV I+KQPA DHP LK+HK+Q  PS     L
Sbjct: 38  QNFEVHKHLNRLNKPAVKSIQSPDGDIIDCVHISKQPAFDHPFLKDHKIQMGPSYTPESL 97

Query: 95  F-----KEDSSQSNNGILTSNNNNGEGCPVGFVPIRRTLKEDLIRLKSLSSNSKNQQSSM 154
           F      E   +S N I    + NG  C  G +P+RRT KED++R  S+    K +  S+
Sbjct: 98  FGESKVSEKPKESVNPITQLWHQNGV-CSEGTIPVRRTKKEDVLRASSVKRYGKKKHLSV 157

Query: 155 NPEDDDLSGDSFYDAVRFPYYQNVVSHSLIKAQYYHGAKARIAVHNVSLSDNGQSSSANI 214
                 L   +  D +    +Q+ +++  ++   ++GAKA I V    +  + + S + +
Sbjct: 158 -----PLPRSADPDLINQSGHQHAIAY--VEGGKFYGAKATINVWEPKVQSSNEFSLSQL 217

Query: 215 WVLGGS-DDSLNVLMAGWQVNPAVNGDNLPRTFVYWTVDTGVTTGCYNMLCQGFVLVNPN 274
           W+LGGS    LN + AGWQV+P + GDN  R F YWT D    TGCYN+LC GF+ +N  
Sbjct: 218 WILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSQ 277

Query: 275 IHVGSSILPASIYQGQQYDYQFSIVQ--AIGHWWVRVGDNQVGLGYWPNELFPNLLRGAD 334
           I +G+SI P S +   QYD   +I +    GHWW++ GD  V LGYWP+ LF  L   A 
Sbjct: 278 IAMGASISPVSGFHNPQYDISITIWKDPKEGHWWMQFGDGYV-LGYWPSFLFSYLADSAS 337

Query: 335 QVAWGGSAQPTLYGD---ESPPLGSGHKPNGKPDEAIFVRNIQYIAPNYILSIPTLNNTI 394
            V WGG     +  D    +  +GSG  P+    +A + RNIQ +  +  L  P   NT 
Sbjct: 338 IVEWGGEV-VNMEEDGHHTTTQMGSGQFPDEGFTKASYFRNIQVVDSSNNLKEPKGLNT- 397

Query: 395 NYVSNSSCYDLISNENCSFDPFKYCFTFGGPG 416
            +   S+CYD+   +N   D + + F +GGPG
Sbjct: 398 -FTEKSNCYDVEVGKN---DDWGHYFYYGGPG 414

BLAST of CsGy3G001620 vs. TAIR 10
Match: AT5G56530.2 (Protein of Unknown Function (DUF239))

HSP 1 Score: 240.4 bits (612), Expect = 2.7e-63
Identity = 147/392 (37.50%), Positives = 214/392 (54.59%), Query Frame = 0

Query: 35  EDLEIERQLKLLNKPFIKTYKTKEGDIIDCVDINKQPALDHPLLKNHKVQTLPSEFVSKL 94
           ++ E+ + L  LNKP +K+ ++ +GDIIDCV I+KQPA DHP LK+HK+Q  PS     L
Sbjct: 38  QNFEVHKHLNRLNKPAVKSIQSPDGDIIDCVHISKQPAFDHPFLKDHKIQMGPSYTPESL 97

Query: 95  F-----KEDSSQSNNGILTSNNNNGEGCPVGFVPIRRTLKEDLIRLKSLSSNSKNQQSSM 154
           F      E   +S N I    + NG  C  G +P+RRT KED++R  S+    K +  S+
Sbjct: 98  FGESKVSEKPKESVNPITQLWHQNGV-CSEGTIPVRRTKKEDVLRASSVKRYGKKKHLSV 157

Query: 155 NPEDDDLSGDSFYDAVRFPYYQNVVSHSLIKAQYYHGAKARIAVHNVSLSDNGQSSSANI 214
                 L   +  D +    +Q+ +++  ++   ++GAKA I V    +  + + S + +
Sbjct: 158 -----PLPRSADPDLINQSGHQHAIAY--VEGGKFYGAKATINVWEPKVQSSNEFSLSQL 217

Query: 215 WVLGGS-DDSLNVLMAGWQVNPAVNGDNLPRTFVYWTVDTGVTTGCYNMLCQGFVLVNPN 274
           W+LGGS    LN + AGWQV+P + GDN  R F YWT D    TGCYN+LC GF+ +N  
Sbjct: 218 WILGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTSDAYQATGCYNLLCSGFIQINSQ 277

Query: 275 IHVGSSILPASIYQGQQYDYQFSIVQ--AIGHWWVRVGDNQVGLGYWPNELFPNLLRGAD 334
           I +G+SI P S +   QYD   +I +    GHWW++ GD  V LGYWP+ LF  L   A 
Sbjct: 278 IAMGASISPVSGFHNPQYDISITIWKDPKEGHWWMQFGDGYV-LGYWPSFLFSYLADSAS 337

Query: 335 QVAWGGSAQPTLYGD---ESPPLGSGHKPNGKPDEAIFVRNIQYIAPNYILSIPTLNNTI 394
            V WGG     +  D    +  +GSG  P+    +A + RNIQ +  +  L  P   NT 
Sbjct: 338 IVEWGGEV-VNMEEDGHHTTTQMGSGQFPDEGFTKASYFRNIQVVDSSNNLKEPKGLNT- 397

Query: 395 NYVSNSSCYDLISNENCSFDPFKYCFTFGGPG 416
            +   S+CYD+   +N   D + + F +GGPG
Sbjct: 398 -FTEKSNCYDVEVGKN---DDWGHYFYYGGPG 414

BLAST of CsGy3G001620 vs. TAIR 10
Match: AT3G13510.1 (Protein of Unknown Function (DUF239))

HSP 1 Score: 236.5 bits (602), Expect = 3.9e-62
Identity = 146/413 (35.35%), Positives = 220/413 (53.27%), Query Frame = 0

Query: 15  FFVCF--NCKFNHASNPNLSREEDLEIERQLKLLNKPFIKTYKTKEGDIIDCVDINKQPA 74
           FFVC       + A+    S  +  E+++ L  LNKP +KT ++ +GDIIDC+ I+KQPA
Sbjct: 15  FFVCLWVMLSLSCAAASYGSSRQKFEVKKHLNRLNKPPVKTIQSPDGDIIDCIPISKQPA 74

Query: 75  LDHPLLKNHKVQTLPSEFVSKLFKEDSSQS-----NNGILTSNNNNGEGCPVGFVPIRRT 134
            DHP LK+HK+Q  PS     LF ++   +        I    +  G+ C  G +P+RRT
Sbjct: 75  FDHPFLKDHKIQMRPSYHPEGLFDDNKVSAEPKGKETHIPQLWHRYGK-CTEGTIPMRRT 134

Query: 135 LKEDLIRLKSLSSNSKNQQSSMNPEDDDLSGDSFYDAVRFPYYQNVVSHSLIKAQYYHGA 194
            ++D++R  S+    K +  S+      +   +  D +    +Q+ +++  ++   Y+GA
Sbjct: 135 REDDVLRASSVKRYGKKKHRSV-----PIPKSAEPDLINQNGHQHAIAY--VEGDKYYGA 194

Query: 195 KARIAVHNVSLSDNGQSSSANIWVLGGS-DDSLNVLMAGWQVNPAVNGDNLPRTFVYWTV 254
           KA + V    + +  + S + IW+LGGS    LN + AGWQV+P + GDN  R F YWT 
Sbjct: 195 KATLNVWEPKIQNTNEFSLSQIWLLGGSFGQDLNSIEAGWQVSPDLYGDNNTRLFTYWTS 254

Query: 255 DTGVTTGCYNMLCQGFVLVNPNIHVGSSILPASIYQGQQYDYQFSIVQ--AIGHWWVRVG 314
           D    TGCYN+LC GF+ +N +I +G+SI P S Y+  QYD    I +    GHWW++ G
Sbjct: 255 DAYQATGCYNLLCSGFIQINSDIAMGASISPVSGYRNSQYDISILIWKDPKEGHWWMQFG 314

Query: 315 DNQVGLGYWPNELFPNLLRGADQVAWGGS-AQPTLYGDES-PPLGSGHKPNGKPDEAIFV 374
           +  V LGYWP+ LF  L   A  + WGG        G  +   +GSGH P     +A + 
Sbjct: 315 NGYV-LGYWPSFLFSYLTESASMIEWGGEVVNSQSEGHHTWTQMGSGHFPEEGFSKASYF 374

Query: 375 RNIQYIAPNYILSIPTLNNTINYVSNSSCYDLISNENCSFDPFKYCFTFGGPG 416
           RNIQ +  +  L  P    T  +   S+CYD+ +  N   D + + F +GGPG
Sbjct: 375 RNIQVVDGSNNLKAPKGLGT--FTEKSNCYDVQTGSN---DDWGHYFYYGGPG 413

BLAST of CsGy3G001620 vs. TAIR 10
Match: AT5G18460.1 (Protein of Unknown Function (DUF239) )

HSP 1 Score: 230.7 bits (587), Expect = 2.2e-60
Identity = 136/387 (35.14%), Positives = 214/387 (55.30%), Query Frame = 0

Query: 39  IERQLKLLNKPFIKTYKTKEGDIIDCVDINKQPALDHPLLKNHKVQTLPSEFVSKLFKED 98
           I++ L  +NK  + T ++ +GD+IDCV   KQPALDHPLLK+HK+Q  P +      K+D
Sbjct: 50  IQKHLNKINKSPVFTIQSPDGDVIDCVPKRKQPALDHPLLKHHKIQKAPKKMPKMKGKDD 109

Query: 99  SSQSNNGILTSN----NNNGEGCPVGFVPIRRTLKEDLIRLKSLSSNSKNQQSSMNPEDD 158
             +    +L       + NG  CP G VPIRR    D++R KSL    K ++S    +  
Sbjct: 110 DVKEAENVLEGAWQMWHVNGTRCPKGTVPIRRNTMNDVLRAKSLFDFGKKRRSIYLDQRT 169

Query: 159 DLSGDSFYDAVRFPYYQNVVSHSLIKAQYYHGAKARIAVHNVSLSDNGQSSSANIWVLGG 218
           +       DA+    +++ ++++   ++ Y GAKA I V +  + +  + S + IW+L G
Sbjct: 170 EKP-----DALGTNGHEHAIAYTESSSEIY-GAKATINVWDPKIEEVNEFSLSQIWILSG 229

Query: 219 S--DDSLNVLMAGWQVNPAVNGDNLPRTFVYWTVDTGVTTGCYNMLCQGFVLVNPNIHVG 278
           S     LN + AGWQV+P + GDN PR F YWT D+   TGCYN+LC GF+  N  I +G
Sbjct: 230 SFVGPDLNSIEAGWQVSPELYGDNRPRLFTYWTSDSYQATGCYNLLCSGFIQTNNKIAIG 289

Query: 279 SSILPASIYQGQQYDYQFSIVQ--AIGHWWVRVGDNQVGLGYWPNELFPNLLRGADQVAW 338
           ++I P S ++G Q+D    I +   +G+WW+ +GD+ + +GYWP ELF +L   A  V W
Sbjct: 290 AAISPLSTFKGNQFDITILIWKDPKMGNWWMGLGDSTL-VGYWPAELFTHLADHATTVEW 349

Query: 339 GGSAQPTLYGDE--SPPLGSGHKPNGKPDEAIFVRNIQYIAPNYILSIPTLNNTINYVSN 398
           GG    T       +  +GSGH P+    +A + RN++ +  +   S+  +++      N
Sbjct: 350 GGEVVNTRASGRHTTTQMGSGHFPDEGFGKASYFRNLEVVDSDN--SLVPVHDVKILAEN 409

Query: 399 SSCYDLISNENCSFDPFKYCFTFGGPG 416
           + CYD+ S+ +  +  +   F +GGPG
Sbjct: 410 TECYDIKSSYSNEWGTY---FYYGGPG 424
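
The Identity and Positives percentages in the HSP headers above follow from the match counts and the alignment length. A minimal Python sketch, assuming each percentage is simply the count divided by the alignment length and shown with two decimals; the function name `percent` and the `hsps` dictionary are illustrative and not part of any BLAST tool:

```python
def percent(count: int, length: int) -> str:
    """Return count/length as a percentage string with two decimals."""
    return f"{100 * count / length:.2f}"


# Counts copied from the two TAIR 10 HSP headers above.
hsps = {
    "AT3G13510.1": {"identity": 146, "positives": 220, "length": 413},
    "AT5G18460.1": {"identity": 136, "positives": 214, "length": 387},
}

for name, h in hsps.items():
    ident = percent(h["identity"], h["length"])
    posit = percent(h["positives"], h["length"])
    print(f"{name}: identity {ident}%, positives {posit}%")
```

Run as written, this reproduces the reported values: 35.35% and 53.27% for AT3G13510.1, and 35.14% and 55.30% for AT5G18460.1.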

The following BLAST results are available for this feature:
Match Name       E-value     Identity   Description
XP_031738649.1   0.0         100.00     uncharacterized protein LOC116402744 [Cucumis sativus] >KAE8650030.1 hypothetica...
TYK11502.1       6.19e-258   86.86      neprosin 2 [Cucumis melo var. makuwa]
KAE8650029.1     7.30e-186   66.90      hypothetical protein Csa_011504 [Cucumis sativus]
XP_031738648.1   1.42e-125   51.72      uncharacterized protein LOC105435061 [Cucumis sativus]
KAA0053047.1     6.09e-120   47.32      uncharacterized protein E6C27_scaffold344G001630 [Cucumis melo var. makuwa]

Match Name   E-value     Identity   Description
A0A5D3CJM0   3.00e-258   86.86      Neprosin 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold139G001660 PE...
A0A0A0L400   5.03e-207   100.00     Neprosin domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G011710 PE...
A0A5A7UEV4   2.95e-120   47.32      Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A6J1CVJ6   1.41e-110   45.26      uncharacterized protein LOC111014777 OS=Momordica charantia OX=3673 GN=LOC111014...
A0A059CIH2   1.26e-86    40.92      Uncharacterized protein OS=Eucalyptus grandis OX=71139 GN=EUGRSUZ_D02193 PE=4 SV...

Match Name    E-value   Identity   Description
AT1G55360.1   1.2e-63   35.05      Protein of Unknown Function (DUF239)
AT5G56530.1   2.7e-63   37.50      Protein of Unknown Function (DUF239)
AT5G56530.2   2.7e-63   37.50      Protein of Unknown Function (DUF239)
AT3G13510.1   3.9e-62   35.35      Protein of Unknown Function (DUF239)
AT5G18460.1   2.2e-60   35.14      Protein of Unknown Function (DUF239)
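
The TAIR 10 hits listed above can be screened by E-value when only the strongest matches are of interest. The following Python sketch assumes the accessions, E-values, and identities exactly as listed; the function `filter_hits` and the cutoff of 1e-62 are hypothetical choices for illustration, not thresholds used by the database:

```python
# (accession, E-value as printed, percent identity) copied from the TAIR 10 hits above.
tair_hits = [
    ("AT1G55360.1", "1.2e-63", 35.05),
    ("AT5G56530.1", "2.7e-63", 37.50),
    ("AT5G56530.2", "2.7e-63", 37.50),
    ("AT3G13510.1", "3.9e-62", 35.35),
    ("AT5G18460.1", "2.2e-60", 35.14),
]


def filter_hits(hits, max_evalue=1e-62):
    """Keep hits whose E-value is at or below max_evalue (hypothetical cutoff)."""
    return [h for h in hits if float(h[1]) <= max_evalue]


for accession, evalue, identity in filter_hits(tair_hits):
    print(accession, evalue, identity)
```

With this cutoff the sketch keeps AT1G55360.1 and the two AT5G56530 isoforms; a looser cutoff would retain all five hits.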
InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR Term    IPR Description               Source    Source Term     Source Description                           Alignment
IPR025521   Neprosin activation peptide   PFAM      PF14365         Neprosin_AP                                  coord: 54..152; e-value: 3.1E-28; score: 98.6
IPR004314   Neprosin                      PFAM      PF03080         Neprosin                                     coord: 191..414; e-value: 2.0E-58; score: 197.4
None        No IPR available              GENE3D    3.90.1320.10    -                                            coord: 192..307; e-value: 1.2E-12; score: 49.5
None        No IPR available              PANTHER   PTHR31589:SF2   ASLB (DUF239)-RELATED                        coord: 20..416
None        No IPR available              PANTHER   PTHR31589       PROTEIN, PUTATIVE (DUF239)-RELATED-RELATED   coord: 20..416
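
The `coord` values in the InterPro annotations above give the span of each match on the protein. A short Python sketch of how a span length could be derived, assuming the coordinates are 1-based and inclusive residue positions (an assumption, since the page does not state the convention); the helper `span_length` and the `domains` mapping are illustrative names:

```python
def span_length(coord: str) -> int:
    """Length in residues of an inclusive 'start..end' coordinate string."""
    start, end = (int(x) for x in coord.split(".."))
    return end - start + 1


# Source terms and coordinates copied from the InterPro annotations above.
domains = {
    "PF14365 (Neprosin_AP)": "54..152",
    "PF03080 (Neprosin)": "191..414",
    "GENE3D 3.90.1320.10": "192..307",
    "PANTHER PTHR31589": "20..416",
}

for name, coord in domains.items():
    print(f"{name}: {coord} spans {span_length(coord)} residues")
```

Under that assumption the activation peptide match covers 99 residues and the Neprosin domain match covers 224.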

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CsGy3G001620.2   CsGy3G001620.2   mRNA