CsaV3_4G027310 (gene) Cucumber (Chinese Long) v3

Overview
Name: CsaV3_4G027310
Type: gene
Organism: Cucumis sativus L. var. sativus cv. Chinese Long (Cucumber (Chinese Long) v3)
Description: mucin-5AC
Location: chr4: 16438135 .. 16442864 (-)
RNA-Seq Expression: CsaV3_4G027310
Synteny: CsaV3_4G027310
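
The location given above can be turned into a feature length with one line of arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, as is usual for GFF3-style annotations; the record itself does not state the convention, and the function name `feature_length` is hypothetical.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field (chr4, minus strand).
# Strand does not affect the length, only the reading direction.
gene_length = feature_length(16438135, 16442864)
print(gene_length)
```

If the database used 0-based half-open coordinates instead, the `+ 1` would have to be dropped.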
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AGAAAGAAGAAAGAAGTAAAGGTTGAAAATGAAAAGTGTATTGTAAGAGAGAGGTTGAATATTATTATATAAATGAAGAAGAAGAAATAAAAAGTAATAATAGAAATAGAGTTAATAGTTTAGAGTGTGAGAGTGTACTATTGAGATCGAACCTCGGCTAAACCGGCCGTTCCCAGAACCCAAATATGGATCTCACGCTGACTCTGTCTCTTCCTTCTCTCTCTGTTTTCTTTCTCAAATATACAAACTTCTTCCATTACTGATTCCTTCTTCTCCGTCTTCCTTCTTCCTTCTTCCTTCATTCCCATTGCATTCTCCTTCAATTCCTCTCTCTATTCACCCCAACCTTCCCAATCATGAACCGCAACTGGCGTGAGCCTCTTTCTGGCTCCCGCAATGCTCCTCTCTTCTCACACCACCGTCGTGGTCATAGCTTCACCGGAATCTCTAGAGATTCCGATGAGAATCTGGATCTCTTCTCCAAGAATCGCCGTACTCTCTCCGTTACTGCCTCTGATGACTCCTCTGATGGTAACTAAATACGGATTTCTCTCCATTCATGCTAATGTACGCTGGATTTTTAGATTCTTCGATTGGGTTTTTTAAAATCTGGTGCTTTTGTGATTCGGTGGATGGATTAAAGCTGTCTGTTGTTCTTTACCTCTCTTCACTACTGTTTTTTTAAGTCTTTGATTTTCGTTTTAATGTTAGCGTCGGTGAAATTGGGGAGGCTTTCTGTTGGATCCGTGAAATTGGCTAAGAGTGGGATTGACGATCTGCTTTCGTCGACTGAGGGAGGAAAACACGATTACGATTGGTAATTTCTTTTGTTAAATTTATCAACGGATTCAATGATTTTCATCTTCTTTTCTTCTGTTTTTTTTTAAAAAAAAAAAAAAAAATCCTGCTTTGGAACTTACCCGCTTTTGAGGCTCTCTTTCATCAATTATCAATGAAGTAGTCCATACAACTATCTATGATTTCATCTGTATCCAAAGTATTTTATAGGATTATCAATTATGGAGATGGTGAAGAAGTTCAATAACATGTCAGCATTACTAAATTCTACCTGTCAAGGACCAATTAATTGAGTTTCCTAGACTACTATATGGATTTACTTGATCTATTTATGTGATTATTTGCGGGACCCTTCTCTGCATTCTTTTAACTTTGATGTTTCTCTTTTGGAGTGATAATGACAAAGAAGTTAAACATGTATTGCAATTAGAATGATGCAAGTACATGTTGATGATTCCTTAAAAATATGCATATTGACGCTATTTGAAGTTGATTCCAGTTGTTCTGATCTTACTTTTAACCGGTAAAAGTGCGAGCCACTTTCTTTTGAGTGAAAAAGAACAAAAAAAATGTAGATCTTGATTGGTGTTTGAAACTTTTAAGTATGATTTTTAGCTAAGTGGGATGCACTAAGTGAAGAATATGATTATTCTGCGGTTAAATTAAGCTTTGTGCTACAGGCTTCTCACCCCACCCGGTACTCCTCTTTTTCCTTCATCCTCTGAAAGTGAAATTCAATCTACGGTAGCAGCACCGAGAAGCAGCACCTTAGTCAGGTCATCTTCGACAACAAAAGCTTCGAGGGTATGGTTCTGCCTTCTTTGCTACCTATGATCTTCCTGAGTTGCATATATAGAGTGATTCAATATCTTCACTTAAGTTTTTTCCTTCTATTTTGTGTGGTTCTTTTAACATTACCGAATGTAACTCAGAAGTGCTTGCTAATGTTACAATTCAAAAGTGTTTGTTAAGTTGGGAGATCGAAATTCATGACCACTCTTGAGTTGTATGATCTAGGCTGTTATTAATTGATTTCCATGTGTTCCAGCTTTCAGTTTCACAATCAGAGAGCAACAATCCTTCAAGGCCAGTAAGGAGCAGTTCCGTGTCCCGGTCCTCTGTCTCCACTCCACAGTATAGTAGTTATTCCTCCAATAGGTCTGCTTCATCAATTCTTAACACAAGCTCAGCTTCGGTTTCCT
CTTACATTAGGCCTTCCTCCCCAAGTACACGCAGTGCATCCTCTGCAAGACCTTCTACTCCATCTTCGCGTTCAACACCATCGAGGTCCTCAACTCCTTCAAGAGCCCGTCCATCCCCCAACAGCCCCTCCATTGAAAAACCAAGGCCACTACAAAGTTCAAGGCCGTCTACTCCTAATTCTAGGCCTCAAATTCCTGCAAATTTGAGTTCTCCTGCAGCTCGGTCAAATTCCCGTCCATCTACACCTACTCGACGAAATTCTGCTCCTTCCCTCTCTTCTGTTGTTGGCACTCCATCTTCTACATCACGTGTTCTCTCAACAAATGGACGCAGTTCAACATCAACATCCCGACCAAGCTCCCCTAGTCCTCGGGTCCGGGCTGCACCTCAGCCAATTGTCCCCCCTGATTTTCCTCTCGATACCCCTCCAAACCTCCGAACAACATTGCCTGACCGGCCAATTTCTGCTGGTAGATCCCGCCCAACACCTGCTTCATCGGTTAGGGGAAGTCCAGAAACTACATCAACTGGTACCGTGCCTAGAAGAGCAGCATCACCTACCATAACAAGGGGAAGAATAACAGACGCCCCGGGAAGGGGCCGGTTGAATACAAATGGACACCTCAGTGACAGTCCTGAAACTAGGAGACTTTCAAGTTCTTCTGATTTGAGCGGAAGGAGACCTGTGAAGGCTTCTACAACTACAGCAGAAAGCAACGGATTTGGGAGGTCTATTTCGAAGAAATCACTCGATATGGCCATCAGACATATGGTATGCTCTAACTTCGTTCTGTTGATTATGTACAAACCTATCCTGCCATCGAATTCATTTTCGTTTTAGGATGTGTCTTTTTAACGAACATTCCAATCTGGTTGTGTGGTGTAGGATATAAGAAATGGCCCTGGAAGTGTGCGCTCAGGTTCAGGCAATACTCTATTTCCACACAGCATTCGATCAGCCACTTCGAAAACTCAATCCATTGCTTTGAGTAACTCAGAGGCTATTGATACTGACTACCAAATGAGCAGTAACAACAACATGGATAGAGGAAACCATTTCCATAGACCTTCTGCAACGATTGGAACCGAAGTAGGAGGAGGAGAGAATGGAAGATTTTCTGCAAGCTTGAATCATTTGGACATTTATGAAAGCTCACGTTATGATGCGATATTACTGAAAGAGGACTTGAAAAACACGAATTGGCTGCACAGCACCGACGATAAAACCGATTTGGCTTCCATTTTGGATAATGGATTTGAAGCTTTGCCTGAGCCTTTTGGCCTCTTATAACATCTAAGATGACGATGATGAGGTTTGTATTGTATTTTTTATTTTTCTTTTTTTTAAAAAAAATTAAATTATCATCTTCATCAGCATCATTATTATTATTTTTGGCATCATTTTGGTTTTGAAAATGATGGAGAATCTGTCCAAGAGAAGATAAAGATATGGTTGTATGTGATTTCTTAAGAAAAATTAGTTTGTAATGGTAGAGGCGGGGTGCCTGTTGTCAAAACTTTGGAGTATTTTTGTGTATCCCAAAGAATTTGTTCAGAAAATTTGTGTTTGAACTTTATTTTTGGGGTTAAAAAACACGTTTGGTCCTCTAGTTTCATTCAAGTAATAATTTAGTCTTGAAATGTTTAGCATGTAACAAAGTTTAGTCTTGGTAGTTTGAATTTTGTAATGATTTAGTTAATGGACTTTGTTATGTATATATAACCACTTAAGCTTTACCATGGACAACTCTATGATAATATAATATAATGGGTTTCTTCTTAAATTAGATCAATTATATTGTTCAAAGGTCAAATCTATGTTTTTAAACTAAAGGAATTTAGCTATTACATCTTTAAAGATTGTTCTACTTGCTGAAATTATAAGAAATTCTCATGTTTAACAATTGAATTTTTAAAATTTAAATCAGTTTCTAGACATTGTATTATTAAACTATTTATTCTTCCTTTTTTGAAAAAAAAAAAAAGTTTTGGACAAATT
TTGGGGTATGACAAATTTTGATTTGTACTTTCTTGTGTGTTTCGTTCTCTTCAACTTTTAGGTTCATCTTCGTTTTCATCACAACTATCCTTTACATATAAAATTTGTTTTATTTTTCTCAAAAGTATTGTACTAGATTGTGGAGTGGAAAAATCAAATGTTTATCTTCAATGCCCAAACAACTATCAATTATTCTCATGTTGGCATCAAATATGTATTTAATATCATCAAGTATTATTACCATTGTAAAATGCAGATTCTCTAATCATTTTATAGTTTTGGTCGAAGTTTCAGCATTTTTTTTTTCAATTAGTACTTATGGTTTAACTTTAATTTGATTTTATTTAGGTTTCATTTCATTTCAACTTTGTACTAACAAATGTTGAAAATTGGTTTATTTGGTTTGAAGTTAGGCTATGTTTACTGATTCCAATGAGAATGAATTAATGGTGGGTCTCATATGATTAAGTTTTGTGTTCTTTACTTGAGGTCTATGGATGATAGACTAAAAGTGGCTGACAATTTGATTGCATCTGAAAATTAGGTCTTATCGTTACTATTATTGTTACTCTTTTAGAGAAAGTTATGGACAAAATACACTTTTGATATTAAAATTTATGTTTATTTGGTCTCTAAAATTTGAAGATGAATATTTTATTTATGATAAACTTTGCTATATTTACAATTTTTTTAAATGTTATTATACTCTTTATTATTATTATTATTATTA

mRNA sequence

ATGAACCGCAACTGGCGTGAGCCTCTTTCTGGCTCCCGCAATGCTCCTCTCTTCTCACACCACCGTCGTGGTCATAGCTTCACCGGAATCTCTAGAGATTCCGATGAGAATCTGGATCTCTTCTCCAAGAATCGCCGTACTCTCTCCGTTACTGCCTCTGATGACTCCTCTGATGCGTCGGTGAAATTGGGGAGGCTTTCTGTTGGATCCGTGAAATTGGCTAAGAGTGGGATTGACGATCTGCTTTCGTCGACTGAGGGAGGAAAACACGATTACGATTGGCTTCTCACCCCACCCGGTACTCCTCTTTTTCCTTCATCCTCTGAAAGTGAAATTCAATCTACGGTAGCAGCACCGAGAAGCAGCACCTTAGTCAGGTCATCTTCGACAACAAAAGCTTCGAGGCTTTCAGTTTCACAATCAGAGAGCAACAATCCTTCAAGGCCAGTAAGGAGCAGTTCCGTGTCCCGGTCCTCTGTCTCCACTCCACAGTATAGTAGTTATTCCTCCAATAGGTCTGCTTCATCAATTCTTAACACAAGCTCAGCTTCGGTTTCCTCTTACATTAGGCCTTCCTCCCCAAGTACACGCAGTGCATCCTCTGCAAGACCTTCTACTCCATCTTCGCGTTCAACACCATCGAGGTCCTCAACTCCTTCAAGAGCCCGTCCATCCCCCAACAGCCCCTCCATTGAAAAACCAAGGCCACTACAAAGTTCAAGGCCGTCTACTCCTAATTCTAGGCCTCAAATTCCTGCAAATTTGAGTTCTCCTGCAGCTCGGTCAAATTCCCGTCCATCTACACCTACTCGACGAAATTCTGCTCCTTCCCTCTCTTCTGTTGTTGGCACTCCATCTTCTACATCACGTGTTCTCTCAACAAATGGACGCAGTTCAACATCAACATCCCGACCAAGCTCCCCTAGTCCTCGGGTCCGGGCTGCACCTCAGCCAATTGTCCCCCCTGATTTTCCTCTCGATACCCCTCCAAACCTCCGAACAACATTGCCTGACCGGCCAATTTCTGCTGGTAGATCCCGCCCAACACCTGCTTCATCGGTTAGGGGAAGTCCAGAAACTACATCAACTGGTACCGTGCCTAGAAGAGCAGCATCACCTACCATAACAAGGGGAAGAATAACAGACGCCCCGGGAAGGGGCCGGTTGAATACAAATGGACACCTCAGTGACAGTCCTGAAACTAGGAGACTTTCAAGTTCTTCTGATTTGAGCGGAAGGAGACCTGTGAAGGCTTCTACAACTACAGCAGAAAGCAACGGATTTGGGAGGTCTATTTCGAAGAAATCACTCGATATGGCCATCAGACATATGGATATAAGAAATGGCCCTGGAAGTGTGCGCTCAGGTTCAGGCAATACTCTATTTCCACACAGCATTCGATCAGCCACTTCGAAAACTCAATCCATTGCTTTGAGTAACTCAGAGGCTATTGATACTGACTACCAAATGAGCAGTAACAACAACATGGATAGAGGAAACCATTTCCATAGACCTTCTGCAACGATTGGAACCGAAGTAGGAGGAGGAGAGAATGGAAGATTTTCTGCAAGCTTGAATCATTTGGACATTTATGAAAGCTCACGTTATGATGCGATATTACTGAAAGAGGACTTGAAAAACACGAATTGGCTGCACAGCACCGACGATAAAACCGATTTGGCTTCCATTTTGGATAATGGATTTGAAGCTTTGCCTGAGCCTTTTGGCCTCTTATAA

Coding sequence (CDS)

ATGAACCGCAACTGGCGTGAGCCTCTTTCTGGCTCCCGCAATGCTCCTCTCTTCTCACACCACCGTCGTGGTCATAGCTTCACCGGAATCTCTAGAGATTCCGATGAGAATCTGGATCTCTTCTCCAAGAATCGCCGTACTCTCTCCGTTACTGCCTCTGATGACTCCTCTGATGCGTCGGTGAAATTGGGGAGGCTTTCTGTTGGATCCGTGAAATTGGCTAAGAGTGGGATTGACGATCTGCTTTCGTCGACTGAGGGAGGAAAACACGATTACGATTGGCTTCTCACCCCACCCGGTACTCCTCTTTTTCCTTCATCCTCTGAAAGTGAAATTCAATCTACGGTAGCAGCACCGAGAAGCAGCACCTTAGTCAGGTCATCTTCGACAACAAAAGCTTCGAGGCTTTCAGTTTCACAATCAGAGAGCAACAATCCTTCAAGGCCAGTAAGGAGCAGTTCCGTGTCCCGGTCCTCTGTCTCCACTCCACAGTATAGTAGTTATTCCTCCAATAGGTCTGCTTCATCAATTCTTAACACAAGCTCAGCTTCGGTTTCCTCTTACATTAGGCCTTCCTCCCCAAGTACACGCAGTGCATCCTCTGCAAGACCTTCTACTCCATCTTCGCGTTCAACACCATCGAGGTCCTCAACTCCTTCAAGAGCCCGTCCATCCCCCAACAGCCCCTCCATTGAAAAACCAAGGCCACTACAAAGTTCAAGGCCGTCTACTCCTAATTCTAGGCCTCAAATTCCTGCAAATTTGAGTTCTCCTGCAGCTCGGTCAAATTCCCGTCCATCTACACCTACTCGACGAAATTCTGCTCCTTCCCTCTCTTCTGTTGTTGGCACTCCATCTTCTACATCACGTGTTCTCTCAACAAATGGACGCAGTTCAACATCAACATCCCGACCAAGCTCCCCTAGTCCTCGGGTCCGGGCTGCACCTCAGCCAATTGTCCCCCCTGATTTTCCTCTCGATACCCCTCCAAACCTCCGAACAACATTGCCTGACCGGCCAATTTCTGCTGGTAGATCCCGCCCAACACCTGCTTCATCGGTTAGGGGAAGTCCAGAAACTACATCAACTGGTACCGTGCCTAGAAGAGCAGCATCACCTACCATAACAAGGGGAAGAATAACAGACGCCCCGGGAAGGGGCCGGTTGAATACAAATGGACACCTCAGTGACAGTCCTGAAACTAGGAGACTTTCAAGTTCTTCTGATTTGAGCGGAAGGAGACCTGTGAAGGCTTCTACAACTACAGCAGAAAGCAACGGATTTGGGAGGTCTATTTCGAAGAAATCACTCGATATGGCCATCAGACATATGGATATAAGAAATGGCCCTGGAAGTGTGCGCTCAGGTTCAGGCAATACTCTATTTCCACACAGCATTCGATCAGCCACTTCGAAAACTCAATCCATTGCTTTGAGTAACTCAGAGGCTATTGATACTGACTACCAAATGAGCAGTAACAACAACATGGATAGAGGAAACCATTTCCATAGACCTTCTGCAACGATTGGAACCGAAGTAGGAGGAGGAGAGAATGGAAGATTTTCTGCAAGCTTGAATCATTTGGACATTTATGAAAGCTCACGTTATGATGCGATATTACTGAAAGAGGACTTGAAAAACACGAATTGGCTGCACAGCACCGACGATAAAACCGATTTGGCTTCCATTTTGGATAATGGATTTGAAGCTTTGCCTGAGCCTTTTGGCCTCTTATAA

Protein sequence

MNRNWREPLSGSRNAPLFSHHRRGHSFTGISRDSDENLDLFSKNRRTLSVTASDDSSDASVKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESEIQSTVAAPRSSTLVRSSSTTKASRLSVSQSESNNPSRPVRSSSVSRSSVSTPQYSSYSSNRSASSILNTSSASVSSYIRPSSPSTRSASSARPSTPSSRSTPSRSSTPSRARPSPNSPSIEKPRPLQSSRPSTPNSRPQIPANLSSPAARSNSRPSTPTRRNSAPSLSSVVGTPSSTSRVLSTNGRSSTSTSRPSSPSPRVRAAPQPIVPPDFPLDTPPNLRTTLPDRPISAGRSRPTPASSVRGSPETTSTGTVPRRAASPTITRGRITDAPGRGRLNTNGHLSDSPETRRLSSSSDLSGRRPVKASTTTAESNGFGRSISKKSLDMAIRHMDIRNGPGSVRSGSGNTLFPHSIRSATSKTQSIALSNSEAIDTDYQMSSNNNMDRGNHFHRPSATIGTEVGGGENGRFSASLNHLDIYESSRYDAILLKEDLKNTNWLHSTDDKTDLASILDNGFEALPEPFGLL*
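
The relationship between the coding sequence and the protein sequence above can be checked by translating codons with the standard genetic code. The following is a minimal sketch, not the database's own pipeline; the names `CODON_TABLE` and `translate` are hypothetical, and only the first and last few codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First seven codons and last seven codons of the CDS listed above.
cds_head = "ATGAACCGCAACTGGCGTGAG"
cds_tail = "GAGCCTTTTGGCCTCTTATAA"
print(translate(cds_head))  # start of the protein
print(translate(cds_tail))  # end of the protein, including the stop
```

Pasting the full CDS into `translate` should reproduce the whole protein sequence, ending in `*` for the terminal TAA stop codon.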
Homology
BLAST of CsaV3_4G027310 vs. NCBI nr
Match: XP_011653729.1 (mucin-5AC [Cucumis sativus] >KGN54463.1 hypothetical protein Csa_012940 [Cucumis sativus])

HSP 1 Score: 1042.7 bits (2695), Expect = 1.2e-300
Identity = 578/578 (100.00%), Positives = 578/578 (100.00%), Query Frame = 0

Query: 1   MNRNWREPLSGSRNAPLFSHHRRGHSFTGISRDSDENLDLFSKNRRTLSVTASDDSSDAS 60
           MNRNWREPLSGSRNAPLFSHHRRGHSFTGISRDSDENLDLFSKNRRTLSVTASDDSSDAS
Sbjct: 1   MNRNWREPLSGSRNAPLFSHHRRGHSFTGISRDSDENLDLFSKNRRTLSVTASDDSSDAS 60

Query: 61  VKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESEIQSTVAAPR 120
           VKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESEIQSTVAAPR
Sbjct: 61  VKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESEIQSTVAAPR 120

Query: 121 SSTLVRSSSTTKASRLSVSQSESNNPSRPVRSSSVSRSSVSTPQYSSYSSNRSASSILNT 180
           SSTLVRSSSTTKASRLSVSQSESNNPSRPVRSSSVSRSSVSTPQYSSYSSNRSASSILNT
Sbjct: 121 SSTLVRSSSTTKASRLSVSQSESNNPSRPVRSSSVSRSSVSTPQYSSYSSNRSASSILNT 180

Query: 181 SSASVSSYIRPSSPSTRSASSARPSTPSSRSTPSRSSTPSRARPSPNSPSIEKPRPLQSS 240
           SSASVSSYIRPSSPSTRSASSARPSTPSSRSTPSRSSTPSRARPSPNSPSIEKPRPLQSS
Sbjct: 181 SSASVSSYIRPSSPSTRSASSARPSTPSSRSTPSRSSTPSRARPSPNSPSIEKPRPLQSS 240

Query: 241 RPSTPNSRPQIPANLSSPAARSNSRPSTPTRRNSAPSLSSVVGTPSSTSRVLSTNGRSST 300
           RPSTPNSRPQIPANLSSPAARSNSRPSTPTRRNSAPSLSSVVGTPSSTSRVLSTNGRSST
Sbjct: 241 RPSTPNSRPQIPANLSSPAARSNSRPSTPTRRNSAPSLSSVVGTPSSTSRVLSTNGRSST 300

Query: 301 STSRPSSPSPRVRAAPQPIVPPDFPLDTPPNLRTTLPDRPISAGRSRPTPASSVRGSPET 360
           STSRPSSPSPRVRAAPQPIVPPDFPLDTPPNLRTTLPDRPISAGRSRPTPASSVRGSPET
Sbjct: 301 STSRPSSPSPRVRAAPQPIVPPDFPLDTPPNLRTTLPDRPISAGRSRPTPASSVRGSPET 360

Query: 361 TSTGTVPRRAASPTITRGRITDAPGRGRLNTNGHLSDSPETRRLSSSSDLSGRRPVKAST 420
           TSTGTVPRRAASPTITRGRITDAPGRGRLNTNGHLSDSPETRRLSSSSDLSGRRPVKAST
Sbjct: 361 TSTGTVPRRAASPTITRGRITDAPGRGRLNTNGHLSDSPETRRLSSSSDLSGRRPVKAST 420

Query: 421 TTAESNGFGRSISKKSLDMAIRHMDIRNGPGSVRSGSGNTLFPHSIRSATSKTQSIALSN 480
           TTAESNGFGRSISKKSLDMAIRHMDIRNGPGSVRSGSGNTLFPHSIRSATSKTQSIALSN
Sbjct: 421 TTAESNGFGRSISKKSLDMAIRHMDIRNGPGSVRSGSGNTLFPHSIRSATSKTQSIALSN 480

Query: 481 SEAIDTDYQMSSNNNMDRGNHFHRPSATIGTEVGGGENGRFSASLNHLDIYESSRYDAIL 540
           SEAIDTDYQMSSNNNMDRGNHFHRPSATIGTEVGGGENGRFSASLNHLDIYESSRYDAIL
Sbjct: 481 SEAIDTDYQMSSNNNMDRGNHFHRPSATIGTEVGGGENGRFSASLNHLDIYESSRYDAIL 540

Query: 541 LKEDLKNTNWLHSTDDKTDLASILDNGFEALPEPFGLL 579
           LKEDLKNTNWLHSTDDKTDLASILDNGFEALPEPFGLL
Sbjct: 541 LKEDLKNTNWLHSTDDKTDLASILDNGFEALPEPFGLL 578
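
The Identity and Positives percentages in each HSP header are the match counts divided by the alignment length (the denominator, which includes gap columns). A small sketch of that arithmetic, using counts reported in this record; the helper name `percent` is hypothetical.

```python
def percent(matches: int, aligned_length: int) -> str:
    """Format matches / aligned_length as a percentage with two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return f"{100 * matches / aligned_length:.2f}"


# (matches, alignment length) pairs as reported in the HSP headers.
print(percent(578, 578))  # identity against the Cucumis sativus hit
print(percent(564, 578))  # identity against the Cucumis melo hit
print(percent(523, 580))  # identity against a hit with a gapped alignment
```

Note that a gapped alignment can be longer than the 578-residue query, which is why some hits report a denominator of 579 or 580.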

BLAST of CsaV3_4G027310 vs. NCBI nr
Match: XP_008463284.1 (PREDICTED: mucin-5AC [Cucumis melo] >TYK06228.1 mucin-5AC [Cucumis melo var. makuwa])

HSP 1 Score: 1013.8 bits (2620), Expect = 5.8e-292
Identity = 564/578 (97.58%), Positives = 570/578 (98.62%), Query Frame = 0

Query: 1   MNRNWREPLSGSRNAPLFSHHRRGHSFTGISRDSDENLDLFSKNRRTLSVTASDDSSDAS 60
           MNRNWREPLSGSRNAPL S HRRGHSFTGISRDSDENLDLFSKNRR+LSVTASDDSSDAS
Sbjct: 1   MNRNWREPLSGSRNAPLLSQHRRGHSFTGISRDSDENLDLFSKNRRSLSVTASDDSSDAS 60

Query: 61  VKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESEIQSTVAAPR 120
           VKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESE+QSTVAAPR
Sbjct: 61  VKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESEVQSTVAAPR 120

Query: 121 SSTLVRSSSTTKASRLSVSQSESNNPSRPVRSSSVSRSSVSTPQYSSYSSNRSASSILNT 180
           SSTLVRSSSTTKASRLSVSQSE NNPSRPVRSSSVSRSSVSTPQYSSYSSNRSASSILNT
Sbjct: 121 SSTLVRSSSTTKASRLSVSQSECNNPSRPVRSSSVSRSSVSTPQYSSYSSNRSASSILNT 180

Query: 181 SSASVSSYIRPSSPSTRSASSARPSTPSSRSTPSRSSTPSRARPSPNSPSIEKPRPLQSS 240
           SSASVSSYIRPSSPSTRSASSARPSTPSSRSTPSRSSTPSRARPSPNSPSIEKPRPLQSS
Sbjct: 181 SSASVSSYIRPSSPSTRSASSARPSTPSSRSTPSRSSTPSRARPSPNSPSIEKPRPLQSS 240

Query: 241 RPSTPNSRPQIPANLSSPAARSNSRPSTPTRRNSAPSLSSVVGTPSSTSRVLSTNGRSST 300
           RPSTPNSRPQIPANLSSPAARSNSRPSTPTRRNSAPSLSSVVGTPSSTSRVLSTNGRSST
Sbjct: 241 RPSTPNSRPQIPANLSSPAARSNSRPSTPTRRNSAPSLSSVVGTPSSTSRVLSTNGRSST 300

Query: 301 STSRPSSPSPRVRAAPQPIVPPDFPLDTPPNLRTTLPDRPISAGRSRPTPASSVRGSPET 360
           STSRPSSPSPRVRAAPQPIVPPDFPLDTPPNLRTTLPDRPISAGRSRPTP++SVRGSPET
Sbjct: 301 STSRPSSPSPRVRAAPQPIVPPDFPLDTPPNLRTTLPDRPISAGRSRPTPSASVRGSPET 360

Query: 361 TSTGTVPRRAASPTITRGRITDAPGRGRLNTNGHLSDSPETRRLSSSSDLSGRRPVKAST 420
           TST TVPRRAASPT+TRGRITD PGRGRLNTNGHLSDSPETRRLSSSSDLSGRRPVKAST
Sbjct: 361 TSTVTVPRRAASPTVTRGRITDTPGRGRLNTNGHLSDSPETRRLSSSSDLSGRRPVKAST 420

Query: 421 TTAESNGFGRSISKKSLDMAIRHMDIRNGPGSVRSGSGNTLFPHSIRSATSKTQSIALSN 480
           TTAESNGFGRSISKKSLDMAIRHMDIRNGPGSVRSGSGNTLFPHSIRSATSKTQSIALSN
Sbjct: 421 TTAESNGFGRSISKKSLDMAIRHMDIRNGPGSVRSGSGNTLFPHSIRSATSKTQSIALSN 480

Query: 481 SEAIDTDYQMSSNNNMDRGNHFHRPSATIGTEVGGGENGRFSASLNHLDIYESSRYDAIL 540
           SEA DTDYQMSSNNN+DRGNHFHRPSATIGTE GGGENGRFSASLNHLDIYESSRYDAIL
Sbjct: 481 SEATDTDYQMSSNNNVDRGNHFHRPSATIGTE-GGGENGRFSASLNHLDIYESSRYDAIL 540

Query: 541 LKEDLKNTNWLHSTDDKTDLASILDNGFEALPEPFGLL 579
           LKEDLKNTNWLHS DDKTDLASILDNGFEALPEPFGLL
Sbjct: 541 LKEDLKNTNWLHSADDKTDLASILDNGFEALPEPFGLL 577

BLAST of CsaV3_4G027310 vs. NCBI nr
Match: XP_038883319.1 (serine/arginine repetitive matrix protein 2 [Benincasa hispida])

HSP 1 Score: 987.3 bits (2551), Expect = 5.8e-284
Identity = 551/578 (95.33%), Positives = 564/578 (97.58%), Query Frame = 0

Query: 1   MNRNWREPLSGSRNAPLFSHHRRGHSFTGISRDSDENLDLFSKNRRTLSVTASDDSSDAS 60
           MNRNWREPLSG+RNAPL S HRRGHSFTGISRDSDENLDLFSKNRR+LSV ASDDSSDAS
Sbjct: 1   MNRNWREPLSGARNAPLLSQHRRGHSFTGISRDSDENLDLFSKNRRSLSVAASDDSSDAS 60

Query: 61  VKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESEIQSTVAAPR 120
           VKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESEIQSTV APR
Sbjct: 61  VKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESEIQSTVVAPR 120

Query: 121 SSTLVRSSSTTKASRLSVSQSESNNPSRPVRSSSVSRSSVSTPQYSSYSSNRSASSILNT 180
           SSTLVRSSSTTKASRLSVSQSES+NPSRP RSSSVSRSSVSTPQYSSYSS+RS SSILNT
Sbjct: 121 SSTLVRSSSTTKASRLSVSQSESHNPSRPARSSSVSRSSVSTPQYSSYSSSRSTSSILNT 180

Query: 181 SSASVSSYIRPSSPSTRSASSARPSTPSSRSTPSRSSTPSRARPSPNSPSIEKPRPLQSS 240
           SSASVSSYIRPSSPSTRS+SSARPSTPSSR+TPSRSSTPSRARPSP S SIEKPRPLQSS
Sbjct: 181 SSASVSSYIRPSSPSTRSSSSARPSTPSSRTTPSRSSTPSRARPSPTSSSIEKPRPLQSS 240

Query: 241 RPSTPNSRPQIPANLSSPAARSNSRPSTPTRRNSAPSLSSVVGTPSSTSRVLSTNGRSST 300
           RPSTPNSRPQIPANLSSPAARSNSRPSTPTRRNSAPSLSSVVGTPSSTSRVLSTNGRSST
Sbjct: 241 RPSTPNSRPQIPANLSSPAARSNSRPSTPTRRNSAPSLSSVVGTPSSTSRVLSTNGRSST 300

Query: 301 STSRPSSPSPRVRAAPQPIVPPDFPLDTPPNLRTTLPDRPISAGRSRPTPASSVRGSPET 360
           STSRPSSPSPRVRAAPQPIVPPDFPLDTPPNLRTTLPDRPISAGRSRP+PA+SVRGSPE 
Sbjct: 301 STSRPSSPSPRVRAAPQPIVPPDFPLDTPPNLRTTLPDRPISAGRSRPSPAASVRGSPEP 360

Query: 361 TSTGTVPRRAASPTITRGRITDAPGRGRLNTNGHLSDSPETRRLSSSSDLSGRRPVKAST 420
           +ST  VPRRAASPT++RGRITDAPGRGRLNTNGHLSDS ETRRLSSSSDLSGRRPVKAST
Sbjct: 361 SSTIAVPRRAASPTVSRGRITDAPGRGRLNTNGHLSDSHETRRLSSSSDLSGRRPVKAST 420

Query: 421 TTAESNGFGRSISKKSLDMAIRHMDIRNGPGSVRSGSGNTLFPHSIRSATSKTQSIALSN 480
           TTAESNGFGRSISKKSLDMAIRHMDIRNGPGSVRSGSGNTLFPHSIRSATSKTQSIALSN
Sbjct: 421 TTAESNGFGRSISKKSLDMAIRHMDIRNGPGSVRSGSGNTLFPHSIRSATSKTQSIALSN 480

Query: 481 SEAIDTDYQMSSNNNMDRGNHFHRPSATIGTEVGGGENGRFSASLNHLDIYESSRYDAIL 540
           SEAIDTD+QMSSNNNM+RGNHFHRPSATIGTE GGGENGRFSASLNHLDIYESSRYDAIL
Sbjct: 481 SEAIDTDFQMSSNNNMERGNHFHRPSATIGTE-GGGENGRFSASLNHLDIYESSRYDAIL 540

Query: 541 LKEDLKNTNWLHSTDDKTDLASILDNGFEALPEPFGLL 579
           LKEDLKNTNWLHS DDKTDLASILDNGFEALPEPFGLL
Sbjct: 541 LKEDLKNTNWLHSADDKTDLASILDNGFEALPEPFGLL 577

BLAST of CsaV3_4G027310 vs. NCBI nr
Match: XP_023543251.1 (uncharacterized protein YMR317W [Cucurbita pepo subsp. pepo] >XP_023543252.1 uncharacterized protein YMR317W [Cucurbita pepo subsp. pepo])

HSP 1 Score: 925.2 bits (2390), Expect = 2.7e-265
Identity = 523/580 (90.17%), Positives = 550/580 (94.83%), Query Frame = 0

Query: 1   MNRNWREPLSGSRNAPLFSHHRRGHSFTGISRDSDENLDLFSKNRRTLSVTASDDSSDAS 60
           MNRNWRE LSG RNAPL S HRRGHSFTGISRDSDENLDLFSKNRR+LSV ASD S+DA 
Sbjct: 1   MNRNWRESLSGGRNAPLLSQHRRGHSFTGISRDSDENLDLFSKNRRSLSVAASDGSTDAP 60

Query: 61  VKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESEIQSTVAAPR 120
           VKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESE+QSTVAAPR
Sbjct: 61  VKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESEVQSTVAAPR 120

Query: 121 SSTLVRSSSTTKASRLSVSQSESNNPSRPVRSSSVSRSSVSTPQYSSYSSNRSASSILNT 180
           +S+LVRSSSTTKASRLSVSQSESNNPSRP RSSSVSRSSVSTPQYSSYSSNRS SSILNT
Sbjct: 121 NSSLVRSSSTTKASRLSVSQSESNNPSRPARSSSVSRSSVSTPQYSSYSSNRS-SSILNT 180

Query: 181 SSASVSSYIRPSSPSTRSASSARPSTPSSRSTPSRSSTPSRARPSPNSPSIEKPRPLQSS 240
           SSASVSSYIRP+SPSTRSAS+ARPSTPSSRSTPSRSSTPSRARPSP S SI+KPR LQSS
Sbjct: 181 SSASVSSYIRPASPSTRSASTARPSTPSSRSTPSRSSTPSRARPSPTSSSIDKPRQLQSS 240

Query: 241 RPSTPNSRPQIPANLSSPAARSNSRPSTPTRRNSAPSLSSVVGTPSSTSRVLSTNGRSST 300
           RPSTP+SRPQIPANLSSPAARSNSRPSTPTRRNSAPSLSSVV TPSSTSRVLSTNGRSST
Sbjct: 241 RPSTPSSRPQIPANLSSPAARSNSRPSTPTRRNSAPSLSSVVSTPSSTSRVLSTNGRSST 300

Query: 301 STSRPSSPSPRVRAAPQPIVPPDFPLDTPPNLRTTLPDRPISAGRSRPTPASSVRGSPET 360
           STSRPSSPSPRVRAAPQPIV PDFPLDTPPNLRTTLPDRPISAGRSRP P +SVRGSPET
Sbjct: 301 STSRPSSPSPRVRAAPQPIVLPDFPLDTPPNLRTTLPDRPISAGRSRPAP-TSVRGSPET 360

Query: 361 TSTGTVPRRAASPTITRGRITDAPGRGRLNTNGHLSDSPETRRLSSSSDLSGRRPVKAST 420
           TST T+PRRA+SPT++RGR+TDAPGRGR+NTNGHLSDSPETRRLSSSSDL GRRPVK ST
Sbjct: 361 TSTVTMPRRASSPTVSRGRLTDAPGRGRVNTNGHLSDSPETRRLSSSSDLGGRRPVKPST 420

Query: 421 TTAESNGFGRSISKKSLDMAIRHMDIRNGPGSVRSGSGNTLFPHSIRSATS--KTQSIAL 480
           TTAESNGFGRSISKKSLD+AIR+MDIRN PG+VRSGSG+TLFPHSIR+A +  KTQSIA 
Sbjct: 421 TTAESNGFGRSISKKSLDVAIRNMDIRNSPGNVRSGSGSTLFPHSIRAAAASPKTQSIAS 480

Query: 481 SNSEAIDTDYQMSSNNNMDRGNHFHRPSATIGTEVGGGENGRFSASLNHLDIYESSRYDA 540
           SN +AIDTD+QMS NNNM+RGNHFHR SAT+GTE GGGENGRF ASLNH+DIYESSRYDA
Sbjct: 481 SNPDAIDTDFQMSINNNMERGNHFHRHSATMGTE-GGGENGRFCASLNHMDIYESSRYDA 540

Query: 541 ILLKEDLKNTNWLHSTDDKTDLASILDNGFEALPEPFGLL 579
           ILLKEDLKNTNWLHS DDKTDLASILDNGFEALPEPFGLL
Sbjct: 541 ILLKEDLKNTNWLHSADDKTDLASILDNGFEALPEPFGLL 577

BLAST of CsaV3_4G027310 vs. NCBI nr
Match: KAG7033708.1 (hypothetical protein SDJN02_03433 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 923.7 bits (2386), Expect = 7.8e-265
Identity = 524/579 (90.50%), Positives = 547/579 (94.47%), Query Frame = 0

Query: 1   MNRNWREPLSGSRNAPLFSHHRRGHSFTGISRDSDENLDLFSKNRRTLSVTASDDSSDAS 60
           MNRNWRE LSG RNAPL S HRRGHSFTGISRDSDENLDLFSKNRR+LSV ASD S+DA 
Sbjct: 1   MNRNWRESLSGGRNAPLLSQHRRGHSFTGISRDSDENLDLFSKNRRSLSVAASDGSTDAP 60

Query: 61  VKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESEIQSTVAAPR 120
           VKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESE+QSTVAAPR
Sbjct: 61  VKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESEVQSTVAAPR 120

Query: 121 SSTLVRSSSTTKASRLSVSQSESNNPSRPVRSSSVSRSSVSTPQYSSYSSNRSASSILNT 180
           +S+LVRSSSTTKASRLSVSQ ESNNPSR  RSSSVSRSSVSTPQYSSYSSNRS SSILNT
Sbjct: 121 NSSLVRSSSTTKASRLSVSQPESNNPSRSARSSSVSRSSVSTPQYSSYSSNRS-SSILNT 180

Query: 181 SSASVSSYIRPSSPSTRSASSARPSTPSSRSTPSRSSTPSRARPSPNSPSIEKPRPLQSS 240
           SSASVSSYIRP+SPSTRSAS+ARPSTPSSRSTPSRSSTPSRARPSP S SI+KPR LQSS
Sbjct: 181 SSASVSSYIRPASPSTRSASTARPSTPSSRSTPSRSSTPSRARPSPTSSSIDKPRQLQSS 240

Query: 241 RPSTPNSRPQIPANLSSPAARSNSRPSTPTRRNSAPSLSSVVGTPSSTSRVLSTNGRSST 300
           RPSTPNSRPQIPANLSSPAARSNSRPSTPTRRNSAPSLSSVV TPSSTSRVLSTNGRSST
Sbjct: 241 RPSTPNSRPQIPANLSSPAARSNSRPSTPTRRNSAPSLSSVVSTPSSTSRVLSTNGRSST 300

Query: 301 STSRPSSPSPRVRAAPQPIVPPDFPLDTPPNLRTTLPDRPISAGRSRPTPASSVRGSPET 360
           STSRPSSPSPRVRAAPQPIV PDFPLDTPPNLRTTLPDRPISAGRSRPTP +SVRGSPET
Sbjct: 301 STSRPSSPSPRVRAAPQPIVLPDFPLDTPPNLRTTLPDRPISAGRSRPTP-TSVRGSPET 360

Query: 361 TSTGTVPRRAASPTITRGRITDAPGRGRLNTNGHLSDSPETRRLSSSSDLSGRRPVKAST 420
           TST T+PRRA+SPT++RGR+TDAPGRGR+NTNGHLSDSPETRRLSSSSDL GRRPVK ST
Sbjct: 361 TSTVTMPRRASSPTVSRGRLTDAPGRGRVNTNGHLSDSPETRRLSSSSDLGGRRPVKPST 420

Query: 421 TTAESNGFGRSISKKSLDMAIRHMDIRNGPGSVRSGSGNTLFPHSIRSATS-KTQSIALS 480
           TTAESNGFGRSISKKSLD+AIR+MDIRN PG+VRSGSG+TLFPHSIR+A S KTQSIA S
Sbjct: 421 TTAESNGFGRSISKKSLDVAIRNMDIRNSPGNVRSGSGSTLFPHSIRAAASPKTQSIASS 480

Query: 481 NSEAIDTDYQMSSNNNMDRGNHFHRPSATIGTEVGGGENGRFSASLNHLDIYESSRYDAI 540
           N EAIDTD+QMS NNNM+RGNHFHR SAT+GTE  GGENGRF ASLNHLDIYESSRYDAI
Sbjct: 481 NPEAIDTDFQMSINNNMERGNHFHRHSATMGTE--GGENGRFCASLNHLDIYESSRYDAI 540

Query: 541 LLKEDLKNTNWLHSTDDKTDLASILDNGFEALPEPFGLL 579
           LLKEDLKNTNWLHS DDKTDL SILDNGFEALPEPFGLL
Sbjct: 541 LLKEDLKNTNWLHSADDKTDLGSILDNGFEALPEPFGLL 575

BLAST of CsaV3_4G027310 vs. ExPASy TrEMBL
Match: A0A0A0KY97 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G335230 PE=4 SV=1)

HSP 1 Score: 1042.7 bits (2695), Expect = 5.6e-301
Identity = 578/578 (100.00%), Positives = 578/578 (100.00%), Query Frame = 0

Query: 1   MNRNWREPLSGSRNAPLFSHHRRGHSFTGISRDSDENLDLFSKNRRTLSVTASDDSSDAS 60
           MNRNWREPLSGSRNAPLFSHHRRGHSFTGISRDSDENLDLFSKNRRTLSVTASDDSSDAS
Sbjct: 1   MNRNWREPLSGSRNAPLFSHHRRGHSFTGISRDSDENLDLFSKNRRTLSVTASDDSSDAS 60

Query: 61  VKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESEIQSTVAAPR 120
           VKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESEIQSTVAAPR
Sbjct: 61  VKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESEIQSTVAAPR 120

Query: 121 SSTLVRSSSTTKASRLSVSQSESNNPSRPVRSSSVSRSSVSTPQYSSYSSNRSASSILNT 180
           SSTLVRSSSTTKASRLSVSQSESNNPSRPVRSSSVSRSSVSTPQYSSYSSNRSASSILNT
Sbjct: 121 SSTLVRSSSTTKASRLSVSQSESNNPSRPVRSSSVSRSSVSTPQYSSYSSNRSASSILNT 180

Query: 181 SSASVSSYIRPSSPSTRSASSARPSTPSSRSTPSRSSTPSRARPSPNSPSIEKPRPLQSS 240
           SSASVSSYIRPSSPSTRSASSARPSTPSSRSTPSRSSTPSRARPSPNSPSIEKPRPLQSS
Sbjct: 181 SSASVSSYIRPSSPSTRSASSARPSTPSSRSTPSRSSTPSRARPSPNSPSIEKPRPLQSS 240

Query: 241 RPSTPNSRPQIPANLSSPAARSNSRPSTPTRRNSAPSLSSVVGTPSSTSRVLSTNGRSST 300
           RPSTPNSRPQIPANLSSPAARSNSRPSTPTRRNSAPSLSSVVGTPSSTSRVLSTNGRSST
Sbjct: 241 RPSTPNSRPQIPANLSSPAARSNSRPSTPTRRNSAPSLSSVVGTPSSTSRVLSTNGRSST 300

Query: 301 STSRPSSPSPRVRAAPQPIVPPDFPLDTPPNLRTTLPDRPISAGRSRPTPASSVRGSPET 360
           STSRPSSPSPRVRAAPQPIVPPDFPLDTPPNLRTTLPDRPISAGRSRPTPASSVRGSPET
Sbjct: 301 STSRPSSPSPRVRAAPQPIVPPDFPLDTPPNLRTTLPDRPISAGRSRPTPASSVRGSPET 360

Query: 361 TSTGTVPRRAASPTITRGRITDAPGRGRLNTNGHLSDSPETRRLSSSSDLSGRRPVKAST 420
           TSTGTVPRRAASPTITRGRITDAPGRGRLNTNGHLSDSPETRRLSSSSDLSGRRPVKAST
Sbjct: 361 TSTGTVPRRAASPTITRGRITDAPGRGRLNTNGHLSDSPETRRLSSSSDLSGRRPVKAST 420

Query: 421 TTAESNGFGRSISKKSLDMAIRHMDIRNGPGSVRSGSGNTLFPHSIRSATSKTQSIALSN 480
           TTAESNGFGRSISKKSLDMAIRHMDIRNGPGSVRSGSGNTLFPHSIRSATSKTQSIALSN
Sbjct: 421 TTAESNGFGRSISKKSLDMAIRHMDIRNGPGSVRSGSGNTLFPHSIRSATSKTQSIALSN 480

Query: 481 SEAIDTDYQMSSNNNMDRGNHFHRPSATIGTEVGGGENGRFSASLNHLDIYESSRYDAIL 540
           SEAIDTDYQMSSNNNMDRGNHFHRPSATIGTEVGGGENGRFSASLNHLDIYESSRYDAIL
Sbjct: 481 SEAIDTDYQMSSNNNMDRGNHFHRPSATIGTEVGGGENGRFSASLNHLDIYESSRYDAIL 540

Query: 541 LKEDLKNTNWLHSTDDKTDLASILDNGFEALPEPFGLL 579
           LKEDLKNTNWLHSTDDKTDLASILDNGFEALPEPFGLL
Sbjct: 541 LKEDLKNTNWLHSTDDKTDLASILDNGFEALPEPFGLL 578

BLAST of CsaV3_4G027310 vs. ExPASy TrEMBL
Match: A0A5D3C4U4 (Mucin-5AC OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold287G00960 PE=4 SV=1)

HSP 1 Score: 1013.8 bits (2620), Expect = 2.8e-292
Identity = 564/578 (97.58%), Positives = 570/578 (98.62%), Query Frame = 0

Query: 1   MNRNWREPLSGSRNAPLFSHHRRGHSFTGISRDSDENLDLFSKNRRTLSVTASDDSSDAS 60
           MNRNWREPLSGSRNAPL S HRRGHSFTGISRDSDENLDLFSKNRR+LSVTASDDSSDAS
Sbjct: 1   MNRNWREPLSGSRNAPLLSQHRRGHSFTGISRDSDENLDLFSKNRRSLSVTASDDSSDAS 60

Query: 61  VKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESEIQSTVAAPR 120
           VKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESE+QSTVAAPR
Sbjct: 61  VKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESEVQSTVAAPR 120

Query: 121 SSTLVRSSSTTKASRLSVSQSESNNPSRPVRSSSVSRSSVSTPQYSSYSSNRSASSILNT 180
           SSTLVRSSSTTKASRLSVSQSE NNPSRPVRSSSVSRSSVSTPQYSSYSSNRSASSILNT
Sbjct: 121 SSTLVRSSSTTKASRLSVSQSECNNPSRPVRSSSVSRSSVSTPQYSSYSSNRSASSILNT 180

Query: 181 SSASVSSYIRPSSPSTRSASSARPSTPSSRSTPSRSSTPSRARPSPNSPSIEKPRPLQSS 240
           SSASVSSYIRPSSPSTRSASSARPSTPSSRSTPSRSSTPSRARPSPNSPSIEKPRPLQSS
Sbjct: 181 SSASVSSYIRPSSPSTRSASSARPSTPSSRSTPSRSSTPSRARPSPNSPSIEKPRPLQSS 240

Query: 241 RPSTPNSRPQIPANLSSPAARSNSRPSTPTRRNSAPSLSSVVGTPSSTSRVLSTNGRSST 300
           RPSTPNSRPQIPANLSSPAARSNSRPSTPTRRNSAPSLSSVVGTPSSTSRVLSTNGRSST
Sbjct: 241 RPSTPNSRPQIPANLSSPAARSNSRPSTPTRRNSAPSLSSVVGTPSSTSRVLSTNGRSST 300

Query: 301 STSRPSSPSPRVRAAPQPIVPPDFPLDTPPNLRTTLPDRPISAGRSRPTPASSVRGSPET 360
           STSRPSSPSPRVRAAPQPIVPPDFPLDTPPNLRTTLPDRPISAGRSRPTP++SVRGSPET
Sbjct: 301 STSRPSSPSPRVRAAPQPIVPPDFPLDTPPNLRTTLPDRPISAGRSRPTPSASVRGSPET 360

Query: 361 TSTGTVPRRAASPTITRGRITDAPGRGRLNTNGHLSDSPETRRLSSSSDLSGRRPVKAST 420
           TST TVPRRAASPT+TRGRITD PGRGRLNTNGHLSDSPETRRLSSSSDLSGRRPVKAST
Sbjct: 361 TSTVTVPRRAASPTVTRGRITDTPGRGRLNTNGHLSDSPETRRLSSSSDLSGRRPVKAST 420

Query: 421 TTAESNGFGRSISKKSLDMAIRHMDIRNGPGSVRSGSGNTLFPHSIRSATSKTQSIALSN 480
           TTAESNGFGRSISKKSLDMAIRHMDIRNGPGSVRSGSGNTLFPHSIRSATSKTQSIALSN
Sbjct: 421 TTAESNGFGRSISKKSLDMAIRHMDIRNGPGSVRSGSGNTLFPHSIRSATSKTQSIALSN 480

Query: 481 SEAIDTDYQMSSNNNMDRGNHFHRPSATIGTEVGGGENGRFSASLNHLDIYESSRYDAIL 540
           SEA DTDYQMSSNNN+DRGNHFHRPSATIGTE GGGENGRFSASLNHLDIYESSRYDAIL
Sbjct: 481 SEATDTDYQMSSNNNVDRGNHFHRPSATIGTE-GGGENGRFSASLNHLDIYESSRYDAIL 540

Query: 541 LKEDLKNTNWLHSTDDKTDLASILDNGFEALPEPFGLL 579
           LKEDLKNTNWLHS DDKTDLASILDNGFEALPEPFGLL
Sbjct: 541 LKEDLKNTNWLHSADDKTDLASILDNGFEALPEPFGLL 577

BLAST of CsaV3_4G027310 vs. ExPASy TrEMBL
Match: A0A1S3CIW9 (mucin-5AC OS=Cucumis melo OX=3656 GN=LOC103501479 PE=4 SV=1)

HSP 1 Score: 1013.8 bits (2620), Expect = 2.8e-292
Identity = 564/578 (97.58%), Positives = 570/578 (98.62%), Query Frame = 0

Query: 1   MNRNWREPLSGSRNAPLFSHHRRGHSFTGISRDSDENLDLFSKNRRTLSVTASDDSSDAS 60
           MNRNWREPLSGSRNAPL S HRRGHSFTGISRDSDENLDLFSKNRR+LSVTASDDSSDAS
Sbjct: 1   MNRNWREPLSGSRNAPLLSQHRRGHSFTGISRDSDENLDLFSKNRRSLSVTASDDSSDAS 60

Query: 61  VKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESEIQSTVAAPR 120
           VKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESE+QSTVAAPR
Sbjct: 61  VKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESEVQSTVAAPR 120

Query: 121 SSTLVRSSSTTKASRLSVSQSESNNPSRPVRSSSVSRSSVSTPQYSSYSSNRSASSILNT 180
           SSTLVRSSSTTKASRLSVSQSE NNPSRPVRSSSVSRSSVSTPQYSSYSSNRSASSILNT
Sbjct: 121 SSTLVRSSSTTKASRLSVSQSECNNPSRPVRSSSVSRSSVSTPQYSSYSSNRSASSILNT 180

Query: 181 SSASVSSYIRPSSPSTRSASSARPSTPSSRSTPSRSSTPSRARPSPNSPSIEKPRPLQSS 240
           SSASVSSYIRPSSPSTRSASSARPSTPSSRSTPSRSSTPSRARPSPNSPSIEKPRPLQSS
Sbjct: 181 SSASVSSYIRPSSPSTRSASSARPSTPSSRSTPSRSSTPSRARPSPNSPSIEKPRPLQSS 240

Query: 241 RPSTPNSRPQIPANLSSPAARSNSRPSTPTRRNSAPSLSSVVGTPSSTSRVLSTNGRSST 300
           RPSTPNSRPQIPANLSSPAARSNSRPSTPTRRNSAPSLSSVVGTPSSTSRVLSTNGRSST
Sbjct: 241 RPSTPNSRPQIPANLSSPAARSNSRPSTPTRRNSAPSLSSVVGTPSSTSRVLSTNGRSST 300

Query: 301 STSRPSSPSPRVRAAPQPIVPPDFPLDTPPNLRTTLPDRPISAGRSRPTPASSVRGSPET 360
           STSRPSSPSPRVRAAPQPIVPPDFPLDTPPNLRTTLPDRPISAGRSRPTP++SVRGSPET
Sbjct: 301 STSRPSSPSPRVRAAPQPIVPPDFPLDTPPNLRTTLPDRPISAGRSRPTPSASVRGSPET 360

Query: 361 TSTGTVPRRAASPTITRGRITDAPGRGRLNTNGHLSDSPETRRLSSSSDLSGRRPVKAST 420
           TST TVPRRAASPT+TRGRITD PGRGRLNTNGHLSDSPETRRLSSSSDLSGRRPVKAST
Sbjct: 361 TSTVTVPRRAASPTVTRGRITDTPGRGRLNTNGHLSDSPETRRLSSSSDLSGRRPVKAST 420

Query: 421 TTAESNGFGRSISKKSLDMAIRHMDIRNGPGSVRSGSGNTLFPHSIRSATSKTQSIALSN 480
           TTAESNGFGRSISKKSLDMAIRHMDIRNGPGSVRSGSGNTLFPHSIRSATSKTQSIALSN
Sbjct: 421 TTAESNGFGRSISKKSLDMAIRHMDIRNGPGSVRSGSGNTLFPHSIRSATSKTQSIALSN 480

Query: 481 SEAIDTDYQMSSNNNMDRGNHFHRPSATIGTEVGGGENGRFSASLNHLDIYESSRYDAIL 540
           SEA DTDYQMSSNNN+DRGNHFHRPSATIGTE GGGENGRFSASLNHLDIYESSRYDAIL
Sbjct: 481 SEATDTDYQMSSNNNVDRGNHFHRPSATIGTE-GGGENGRFSASLNHLDIYESSRYDAIL 540

Query: 541 LKEDLKNTNWLHSTDDKTDLASILDNGFEALPEPFGLL 579
           LKEDLKNTNWLHS DDKTDLASILDNGFEALPEPFGLL
Sbjct: 541 LKEDLKNTNWLHSADDKTDLASILDNGFEALPEPFGLL 577

BLAST of CsaV3_4G027310 vs. ExPASy TrEMBL
Match: A0A6J1GEV2 (mucin-5AC OS=Cucurbita moschata OX=3662 GN=LOC111453316 PE=4 SV=1)

HSP 1 Score: 922.5 bits (2383), Expect = 8.5e-265
Identity = 523/579 (90.33%), Positives = 547/579 (94.47%), Query Frame = 0

Query: 1   MNRNWREPLSGSRNAPLFSHHRRGHSFTGISRDSDENLDLFSKNRRTLSVTASDDSSDAS 60
           MNRNWRE LSG RNAPL S HRRGHSFTGISRDSDENLDLFSKNRR+LSV ASD S+DA+
Sbjct: 1   MNRNWRESLSGGRNAPLLSQHRRGHSFTGISRDSDENLDLFSKNRRSLSVAASDGSTDAT 60

Query: 61  VKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESEIQSTVAAPR 120
           VKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESE+QSTVAAPR
Sbjct: 61  VKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESEVQSTVAAPR 120

Query: 121 SSTLVRSSSTTKASRLSVSQSESNNPSRPVRSSSVSRSSVSTPQYSSYSSNRSASSILNT 180
           +S+LVRSSSTTKASRLSVSQ ESNNPSR  RSSSVSRSSVSTPQYSSYSSNRS SSILNT
Sbjct: 121 NSSLVRSSSTTKASRLSVSQPESNNPSRSARSSSVSRSSVSTPQYSSYSSNRS-SSILNT 180

Query: 181 SSASVSSYIRPSSPSTRSASSARPSTPSSRSTPSRSSTPSRARPSPNSPSIEKPRPLQSS 240
           SSASVSSYIRP+SPSTRSAS+ARPSTPSSRSTPSRSSTPSRARPSP S SI+KPR LQSS
Sbjct: 181 SSASVSSYIRPASPSTRSASTARPSTPSSRSTPSRSSTPSRARPSPTSSSIDKPRQLQSS 240

Query: 241 RPSTPNSRPQIPANLSSPAARSNSRPSTPTRRNSAPSLSSVVGTPSSTSRVLSTNGRSST 300
           RPSTPNSRPQIPANLSSPAARSNSRPSTPTRRNSAPSLSSVV TPSSTSRVLSTNGRSST
Sbjct: 241 RPSTPNSRPQIPANLSSPAARSNSRPSTPTRRNSAPSLSSVVSTPSSTSRVLSTNGRSST 300

Query: 301 STSRPSSPSPRVRAAPQPIVPPDFPLDTPPNLRTTLPDRPISAGRSRPTPASSVRGSPET 360
           STSRPSSPSPRVRAAPQPIV PDFPLDTPPNLRTTLPDRPISAGRSRPTP +SVRGSPET
Sbjct: 301 STSRPSSPSPRVRAAPQPIVLPDFPLDTPPNLRTTLPDRPISAGRSRPTP-TSVRGSPET 360

Query: 361 TSTGTVPRRAASPTITRGRITDAPGRGRLNTNGHLSDSPETRRLSSSSDLSGRRPVKAST 420
           T T T+PRRA+SPT++RGR+TDAPGRGR+NTNGHLSDSPETRRLSSSSDL GRRPVK ST
Sbjct: 361 TPTVTMPRRASSPTVSRGRLTDAPGRGRVNTNGHLSDSPETRRLSSSSDLGGRRPVKPST 420

Query: 421 TTAESNGFGRSISKKSLDMAIRHMDIRNGPGSVRSGSGNTLFPHSIRSATS-KTQSIALS 480
           TTAESNGFGRSISKKSLD+AIR+MDIRN PG+VRSGSG+TLFPHSIR+A S KTQSIA S
Sbjct: 421 TTAESNGFGRSISKKSLDVAIRNMDIRNSPGNVRSGSGSTLFPHSIRAAASPKTQSIASS 480

Query: 481 NSEAIDTDYQMSSNNNMDRGNHFHRPSATIGTEVGGGENGRFSASLNHLDIYESSRYDAI 540
           N EAIDTD+QMS NNNM+RGNHFHR SAT+GTE  GGENGRF ASLNHLDIYESSRYDAI
Sbjct: 481 NPEAIDTDFQMSINNNMERGNHFHRHSATMGTE--GGENGRFCASLNHLDIYESSRYDAI 540

Query: 541 LLKEDLKNTNWLHSTDDKTDLASILDNGFEALPEPFGLL 579
           LLKEDLKNTNWLHS DDKTDL SILDNGFEALPEPFGLL
Sbjct: 541 LLKEDLKNTNWLHSADDKTDLGSILDNGFEALPEPFGLL 575

BLAST of CsaV3_4G027310 vs. ExPASy TrEMBL
Match: A0A6J1CRA9 (serine/arginine repetitive matrix protein 2 OS=Momordica charantia OX=3673 GN=LOC111013874 PE=4 SV=1)

HSP 1 Score: 915.2 bits (2364), Expect = 1.4e-262
Identity = 514/578 (88.93%), Positives = 538/578 (93.08%), Query Frame = 0

Query: 1   MNRNWREPLSGSRNAPLFSHHRRGHSFTGISRDSDENLDLFSKNRRTLSVTASDDSSDAS 60
           MNRNWREPLS  RNAPL SHHRRGHSFT ISRDSDENLDLFSKNRR+LSV ASDDSSDAS
Sbjct: 1   MNRNWREPLSAPRNAPLLSHHRRGHSFTAISRDSDENLDLFSKNRRSLSVAASDDSSDAS 60

Query: 61  VKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESEIQSTVAAPR 120
           VKLGRLSVGSVKLAK+GIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESE+QSTVAAPR
Sbjct: 61  VKLGRLSVGSVKLAKNGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESEVQSTVAAPR 120

Query: 121 SSTLVRSSSTTKASRLSVSQSESNNPSRPVRSSSVSRSSVSTPQYSSYSSNRSASSILNT 180
           SSTLVRSSSTTKASRLSVS SESNN SRP RSSSVSRSSVSTPQYS+YSSNRS SSILNT
Sbjct: 121 SSTLVRSSSTTKASRLSVSHSESNNSSRPARSSSVSRSSVSTPQYSNYSSNRS-SSILNT 180

Query: 181 SSASVSSYIRPSSPSTRSASSARPSTPSSRSTPSRSSTPSRARPSPNSPSIEKPRPLQSS 240
           SSASVSSYIRPSSPSTR++S+ RPSTPSSR T SRSSTPSRARPSP S SI+KPR +QSS
Sbjct: 181 SSASVSSYIRPSSPSTRNSSTTRPSTPSSRPTASRSSTPSRARPSPTSSSIDKPRQIQSS 240

Query: 241 RPSTPNSRPQIPANLSSPAARSNSRPSTPTRRNSAPSLSSVVGTPSSTSRVLSTNGRSST 300
           RPSTP+SRPQ+PANLSSPAARSNSRPSTPTRRNSAPSLSSVVG PS T RVLS NGRSST
Sbjct: 241 RPSTPSSRPQVPANLSSPAARSNSRPSTPTRRNSAPSLSSVVGAPSPTGRVLSINGRSST 300

Query: 301 STSRPSSPSPRVRAAPQPIVPPDFPLDTPPNLRTTLPDRPISAGRSRPTPASSVRGSPET 360
           STSRPSSPSPRVRA+PQPIVPPDFPLDTPPNLRTTLPDRPISAGRSRPTPA+SVRGSPE 
Sbjct: 301 STSRPSSPSPRVRASPQPIVPPDFPLDTPPNLRTTLPDRPISAGRSRPTPAASVRGSPEP 360

Query: 361 TSTGTVPRRAASPTITRGRITDAPGRGRLNTNGHLSDSPETRRLSSSSDLSGRRPVKAST 420
           TST T+PRR+ASPT++RGR+TD PGRGR+NTNGHLSD  E RRLSSSSDL GRRPVKAST
Sbjct: 361 TSTVTMPRRSASPTVSRGRLTDTPGRGRVNTNGHLSDVSEPRRLSSSSDLGGRRPVKAST 420

Query: 421 TTAESNGFGRSISKKSLDMAIRHMDIRNGPGSVRSGSGNTLFPHSIRSATSKTQSIALSN 480
           TTAESNGFGRSISKKSLDMAIRHMDIRNGPGS+RSGSGNTLFPHSIRSATSKTQSIA +N
Sbjct: 421 TTAESNGFGRSISKKSLDMAIRHMDIRNGPGSMRSGSGNTLFPHSIRSATSKTQSIASNN 480

Query: 481 SEAIDTDYQMSSNNNMDRGNHFHRPSATIGTEVGGGENGRFSASLNHLDIYESSRYDAIL 540
           SEAIDTD+Q SSN+ M+RGNH HR S       GGGENGRFSASLNHLDIYESSRYDAIL
Sbjct: 481 SEAIDTDFQTSSNSFMERGNHLHRSSE------GGGENGRFSASLNHLDIYESSRYDAIL 540

Query: 541 LKEDLKNTNWLHSTDDKTDLASILDNGFEALPEPFGLL 579
           LKEDLKNTNWLHS DDKTDL SILDNGFEALPEPFGLL
Sbjct: 541 LKEDLKNTNWLHSADDKTDLGSILDNGFEALPEPFGLL 571

BLAST of CsaV3_4G027310 vs. TAIR 10
Match: AT3G08670.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G51540.1); Has 48380 Blast hits to 29827 proteins in 1356 species: Archae - 46; Bacteria - 5589; Metazoa - 17361; Fungi - 13192; Plants - 2237; Viruses - 905; Other Eukaryotes - 9050 (source: NCBI BLink). )

HSP 1 Score: 505.4 bits (1300), Expect = 6.2e-143
Identity = 351/596 (58.89%), Positives = 417/596 (69.97%), Query Frame = 0

Query: 1   MNRNWREPLSGSRNAPLFSHHRRGH-------SFTGISRDSDENLDLFSKNRRTLSVTAS 60
           MNRN RE L+G RN P  S  RRG+       S  G SRDSDENLDLFSK RR+  + +S
Sbjct: 1   MNRNLRESLAGGRNIPAISQFRRGNNNNSNNISQNGFSRDSDENLDLFSKIRRSFPLASS 60

Query: 61  DDSSDASVKLGRLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPSSSESEIQ 120
           D+  D S KLGRLSVGS K+A  G DDLLSS EGGK+DYDWLLTPPGTPL      ++  
Sbjct: 61  DELPDVSAKLGRLSVGS-KIAPKGKDDLLSSAEGGKNDYDWLLTPPGTPL-----GNDSH 120

Query: 121 STVAAPRSSTLVRSSSTTKASRLSVSQSESN-NPSRPVRSSSVSRSSVSTPQYSSYSSNR 180
           S++AAP+ ++  R+SS +KASRLSVSQSES  + SRP RSSSV+R S+ST QYSS++S R
Sbjct: 121 SSLAAPKIASSARASSASKASRLSVSQSESGYHSSRPARSSSVTRPSISTSQYSSFTSGR 180

Query: 181 SASSILNTSSASVSSYIRPSSPSTRSASSARPSTPSSRSTPSRSSTPSRARPSPNSPSIE 240
           S SSILNTSSASVSSYIRPSSPS+RS+SSARPSTP+  S+ SRSSTPSR RP  +S S++
Sbjct: 181 SPSSILNTSSASVSSYIRPSSPSSRSSSSARPSTPTRTSSASRSSTPSRIRPGSSSSSMD 240

Query: 241 KPRPLQSSRPSTPNSRPQIPANLSSP---AARSNSRPSTPTRRN-SAPSLSSVVGTPSST 300
           K RP  SSRPSTP SRPQ+ A  SSP   A+R NSRPSTPTRR+ S+ SLS+  G   S 
Sbjct: 241 KARPSLSSRPSTPTSRPQLSA--SSPNIIASRPNSRPSTPTRRSPSSTSLSATSGPTISG 300

Query: 301 SRVLSTNGRSSTSTSRPSSPSPRVRAAP-QPIVPPDFPLDTPPNLRTTLPDRPISAGRSR 360
            R  S NGR+  S SRPSSP PRVR  P QPIV  DFPLDTPPNLRT+LPDRPISAGRSR
Sbjct: 301 GRAAS-NGRTGPSLSRPSSPGPRVRNTPQQPIVLADFPLDTPPNLRTSLPDRPISAGRSR 360

Query: 361 PTPASSV-RGSPETTSTGTVPRRAASPTITRGRITDAPGRGRLNTNG-HLSDSPETRRLS 420
           P   SS+ + SPE    G + RR +SP +TRGR+T+  G+GR   NG HL+D+PE RR+S
Sbjct: 361 PVGGSSMAKASPE--PKGPITRRNSSPIVTRGRLTETQGKGRFGGNGQHLTDAPEPRRIS 420

Query: 421 SSSDLSGRRPVKASTT-TAESNGFGRSISKKSLDMAIRHMDIRNGPGSVRSGSGNTLFPH 480
           + SD++ RR VK STT T  +NG GRS SK SLDMAIRHMDIRNG  +  + S  TLFP 
Sbjct: 421 NVSDITSRRTVKTSTTVTDNNNGLGRSFSKSSLDMAIRHMDIRNGKTNGCALSTTTLFPQ 480

Query: 481 SIRSATSKTQSIALSNSEAIDTDYQMSSNNNMDRGNHFHRPSATIGTEVGGGENGRFSAS 540
           SIR A+SK Q I   N+ +        S+N  + GN                E  R    
Sbjct: 481 SIRPASSKIQPIRSGNNHS-----DSISSNGTENGNE-------------ANEGRRLMGK 540

Query: 541 LNHLDIYESSRYDAILLKEDLKNTNWLHSTDDK-TDLASILDN-GFEALPEPFGLL 579
           L+ +D+YESSRYDA+LLKED+KNTNWLHS DD+ +D   + DN GFE LPEPF  L
Sbjct: 541 LSDMDMYESSRYDALLLKEDVKNTNWLHSIDDRSSDHGLMFDNGGFELLPEPFAPL 567

BLAST of CsaV3_4G027310 vs. TAIR 10
Match: AT2G40070.2 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 7 growth stages; BEST Arabidopsis thaliana protein match is: proline-rich family protein (TAIR:AT3G09000.1); Has 108635 Blast hits to 60786 proteins in 2176 species: Archae - 287; Bacteria - 15142; Metazoa - 39415; Fungi - 26849; Plants - 4416; Viruses - 2864; Other Eukaryotes - 19662 (source: NCBI BLink). )

HSP 1 Score: 115.9 bits (289), Expect = 1.1e-25
Identity = 170/533 (31.89%), Positives = 256/533 (48.03%), Query Frame = 0

Query: 66  LSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPS-----------------SS 125
           +S G+    K+  DD L+S EG K+DY+WLLTPPGTPLFPS                 S 
Sbjct: 36  ISSGAPPSRKAAPDDFLNS-EGDKNDYEWLLTPPGTPLFPSLEMESHRTMMSQTGDSKSR 95

Query: 126 ESEIQSTVAAPRSSTLVRSSSTTKASRLSVSQSESNNPSRPVRSSSVSRSSVSTPQYSS- 185
            + + S +A   + +  R+  T++    S   S S+  SR   SS    S  +TP   S 
Sbjct: 96  PATLTSRLANSSTESAARNHLTSRQQTSSPGLSSSSGASRRPSSSGGPGSRPATPTGRSS 155

Query: 186 --YSSNRSASSILNTSSASVSSYIRPSSPSTRSASSARPS-TPSSRSTPSRSSTPSRARP 245
              ++++S+     TS A+VSS  RPS  ++RS  SA    TP SRST   S + SR  P
Sbjct: 156 TLTANSKSSRPSTPTSRATVSSATRPSLTNSRSTVSATTKPTPMSRST---SLSSSRLTP 215

Query: 246 SPNSPSIEKPRPLQSSRPSTPNSRPQI--PANLSSPAARSNSRPSTPTRRNSAPSLSSVV 305
           + + P+    R   S   STP++  +   P+  ++P +RS +R STPT R + P   ++ 
Sbjct: 216 TASKPTTSTARSAGSVTRSTPSTTTKSAGPSRSTTPLSRSTARSSTPTSRPTLPPSKTIS 275

Query: 306 GTPSSTSRVLSTNGRSSTSTS------RPSSP-------------------SPRVRAAP- 365
            + + T R +++   ++T+ +      +PSSP                   SP VR+ P 
Sbjct: 276 RSSTPTRRPIASASAATTTANPTISQIKPSSPAPAKPMPTPSKNPALSRAASPTVRSRPW 335

Query: 366 QPIVPPDFPLDTPPNLRTTLPDRPISAGRSRPTPASSVRGSPE--------TTSTGTVPR 425
           +P   P F L+TPPNLRTTLP+RP+SA R RP   SS  GS E               P 
Sbjct: 336 KPSDMPGFSLETPPNLRTTLPERPLSATRGRPGAPSSRSGSVEPGGPPGGRPRRQSCSPS 395

Query: 426 RAASPTITRGRITDAPGRGRLNTNGHLS----DSPETRRLSSSSDLSGRRP--------- 485
           R  +P  + G    A  RG    + ++S     +    R+ +   L+  R          
Sbjct: 396 RGRAPMYSSGSSVPAVNRGYSKASDNVSPVMMGTKMVERVINMRKLAPPRSDDKGSPHGN 455

Query: 486 VKASTTTAESNGFGRSISKKSLDMAIRHMDIRNG-PGSVRSGSGN--TLFPHSIRSATSK 526
           + A +++ +S GFGR++SKKSLDMAIRHMDIR   PG++R    N      +S+RS  ++
Sbjct: 456 LSAKSSSPDSAGFGRTLSKKSLDMAIRHMDIRRTIPGNLRPLMTNIPASSMYSVRSGHTR 515

BLAST of CsaV3_4G027310 vs. TAIR 10
Match: AT2G40070.1 (BEST Arabidopsis thaliana protein match is: proline-rich family protein (TAIR:AT3G09000.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 114.4 bits (285), Expect = 3.1e-25
Identity = 185/611 (30.28%), Positives = 286/611 (46.81%), Query Frame = 0

Query: 1   MNRNWREPLSGSRNAPLFSHHRRGHSFTGISRDSDENLDLFSKNRRTL----SVTASDDS 60
           MNR++R   S   ++   +  +R      +  + DE L LF + RR      ++  +++ 
Sbjct: 1   MNRSFRAKESLLLDS---AERQRQQLRASMMAEKDEELSLFLEMRRREKEQDNLLLNNNP 60

Query: 61  SDASVKLG---------RLSVGSVKLAKSGIDDLLSSTEGGKHDYDWLLTPPGTPLFPS- 120
            +    LG          +S G+    K+  DD L+S EG K+DY+WLLTPPGTPLFPS 
Sbjct: 61  DEFETPLGSKHGTSPVFNISSGAPPSRKAAPDDFLNS-EGDKNDYEWLLTPPGTPLFPSL 120

Query: 121 ----------------SSESEIQSTVAAPRSSTLVRSSSTTKASRLSVSQSESNNPSRPV 180
                           S  + + S +A   + +  R+  T++    S   S S+  SR  
Sbjct: 121 EMESHRTMMSQTGDSKSRPATLTSRLANSSTESAARNHLTSRQQTSSPGLSSSSGASRRP 180

Query: 181 RSSSVSRSSVSTPQYSS---YSSNRSASSILNTSSASVSSYIRPSSPSTRSASSARPS-T 240
            SS    S  +TP   S    ++++S+     TS A+VSS  RPS  ++RS  SA    T
Sbjct: 181 SSSGGPGSRPATPTGRSSTLTANSKSSRPSTPTSRATVSSATRPSLTNSRSTVSATTKPT 240

Query: 241 PSSRSTPSRSSTPSRARPSPNSPSIEKPRPLQSSRPSTPNSRPQI--PANLSSPAARSNS 300
           P SRST   S + SR  P+ + P+    R   S   STP++  +   P+  ++P +RS +
Sbjct: 241 PMSRST---SLSSSRLTPTASKPTTSTARSAGSVTRSTPSTTTKSAGPSRSTTPLSRSTA 300

Query: 301 RPSTPTRRNSAPSLSSVVGTPSSTSRVLSTNGRSSTSTS------RPSSP---------- 360
           R STPT R + P   ++  + + T R +++   ++T+ +      +PSSP          
Sbjct: 301 RSSTPTSRPTLPPSKTISRSSTPTRRPIASASAATTTANPTISQIKPSSPAPAKPMPTPS 360

Query: 361 ---------SPRVRAAP-QPIVPPDFPLDTPPNLRTTLPDRPISAGRSRPTPASSVRGSP 420
                    SP VR+ P +P   P F L+TPPNLRTTLP+RP+SA R RP   SS  GS 
Sbjct: 361 KNPALSRAASPTVRSRPWKPSDMPGFSLETPPNLRTTLPERPLSATRGRPGAPSSRSGSV 420

Query: 421 E--------TTSTGTVPRRAASPTITRGRITDAPGRGRLNTNGHLS----DSPETRRLSS 480
           E               P R  +P  + G    A  RG    + ++S     +    R+ +
Sbjct: 421 EPGGPPGGRPRRQSCSPSRGRAPMYSSGSSVPAVNRGYSKASDNVSPVMMGTKMVERVIN 480

Query: 481 SSDLSGRRP---------VKASTTTAESNGFGRSISKKSLDMAIRHMDIRNG-PGSVRSG 526
              L+  R          + A +++ +S GFGR++SKKSLDMAIRHMDIR   PG++R  
Sbjct: 481 MRKLAPPRSDDKGSPHGNLSAKSSSPDSAGFGRTLSKKSLDMAIRHMDIRRTIPGNLRPL 540

BLAST of CsaV3_4G027310 vs. TAIR 10
Match: AT3G09000.1 (proline-rich family protein )

HSP 1 Score: 107.8 bits (268), Expect = 2.9e-23
Identity = 166/522 (31.80%), Positives = 243/522 (46.55%), Query Frame = 0

Query: 30  ISRDSDENLDLFSKNRRTLSVTASDD--SSDASVKLGRLSVGSVKLAKSGIDDLLSS--- 89
           ++ D DE L LF + RR      +D   +   +V +      +   A SG+ +  SS   
Sbjct: 2   LTHDRDEELSLFLEMRRREKEHRADSLLTGSDNVSINATLTAAAAAALSGVSETASSQRY 61

Query: 90  ------------TEGGKHDYDWLLTPPGTPLFPSSSESEIQSTVAAPRSSTLVRSSSTTK 149
                       +E  K DYDWLLTPPGTP F   S   + +   AP       S  T  
Sbjct: 62  PLRRTAAENFLYSENEKSDYDWLLTPPGTPQFEKESHRSVMNQHDAP------NSRPTVL 121

Query: 150 ASRLSVSQSE--SNNPSRPVRSSSVSRSSVSTPQYSSYSSNRSASSILNTSSASVSSYIR 209
            SRL   + +  S N ++P  SSS   S     + SS  S+RS S     +  S +    
Sbjct: 122 KSRLGNCREDIVSGNNNKPQTSSS---SVAGLRRPSSSGSSRSTSRPATPTRRSTTPTTS 181

Query: 210 PSSPSTRSASSARPSTPSSRST-PSRSSTPSRARPSPNSPSIEKPRPLQSSRPSTPNSRP 269
            S P T  AS++R STP+SR+T  +  +T S A P   + S    R   S+ P+  N RP
Sbjct: 182 TSRPVTTRASNSRSSTPTSRATLTAARATTSTAAPRTTTTSSGSAR---SATPTRSNPRP 241

Query: 270 QIPANLSSPAARSNSRPSTPTRRNSAPSLSSVVGT--PSSTSRVLSTNGRSSTSTSRPSS 329
                 S+ + +  SRP+TPTRR S P+  S+V +  PS  +    T    S + SR +S
Sbjct: 242 S-----SASSKKPVSRPATPTRRPSTPTGPSIVSSKAPSRGTSPSPTVNSLSKAPSRGTS 301

Query: 330 PSPRVRAAPQPIVPPDFP---LDTPPNLRTTLPDRPISAGRSRPTPASSV---------R 389
           PSP + ++ +P  PP+ P   L+ PPNLRTTL DRP+SA R RP  AS+           
Sbjct: 302 PSPTLNSS-RPWKPPEMPGFSLEAPPNLRTTLADRPVSASRGRPGVASAPGSRSGSIERG 361

Query: 390 GSPETTSTGTVPRRAASPT-------ITRGRITDAPGRGRLNTNGHLSD--------SPE 449
           G P +  +G   R++ SP+        T G +T   GR + +  G   D        +  
Sbjct: 362 GGPTSGGSGNARRQSCSPSRGRAPIGNTNGSLTGVRGRAKASNGGSGCDNLSPVAMGNKM 421

Query: 450 TRRLSSSSDL-------SGRRPVKASTTTAESNGFGRSISKKSLDMAIRHMDIRNG-PGS 486
             R+ +   L       +G R    S++   S G+GR++SK S+DMAIRHMDIR G  G+
Sbjct: 422 VERVVNMRKLGPPRLTENGGRGSGKSSSAFNSLGYGRNLSKSSIDMAIRHMDIRRGMTGN 481

BLAST of CsaV3_4G027310 vs. TAIR 10
Match: AT5G01280.1 (BEST Arabidopsis thaliana protein match is: proline-rich family protein (TAIR:AT3G09000.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 67.4 bits (163), Expect = 4.3e-11
Identity = 139/460 (30.22%), Positives = 219/460 (47.61%), Query Frame = 0

Query: 85  TEGGKHDYDWLLTPPGTPLFPSSSESEIQSTVAAPRSSTLVRSSSTTKASRL-SVSQSES 144
           ++G K DY+WL+TPPG+P         + + + AP  + +      T  SRL + S+ ES
Sbjct: 19  SDGEKSDYEWLVTPPGSP------SRNVTNHLNAPDDNLM------TLISRLENYSKEES 78

Query: 145 NNPSRPVRSSSVSRSSVSTPQYSSYSSNRSASSILNTSSASVSSYIRPSSPSTRSASSAR 204
            + +  + SSS S S +  P  SS SS+RS S     +  S +   RPS+P++R+ S+  
Sbjct: 79  EHQTTSLHSSS-SVSGIRRP--SSSSSSRSTSRPPTPTRKSKTPAKRPSTPTSRATSTTT 138

Query: 205 PSTPSSRSTPSRSSTPSRARPSPNSPSIEKPRPLQSSRPSTPNSRPQIPANLSSPAARSN 264
            +T +S ST   SST S +RPS +S +      L ++R + P +        S+ + RSN
Sbjct: 139 RATLTSSST--TSSTRSWSRPSSSSGTGTSRVTLTAARATRPTTSTDQQTTGSATSTRSN 198

Query: 265 SRPSTPTRRNSAPSLSSVVGTPSSTSRVLSTNGRSSTSTSRPSSP----------SPRVR 324
           +RP       SAP+      + + T R  + NG S+   S+P+ P          SP VR
Sbjct: 199 NRPM------SAPNSKPGSRSSTPTRRPSTPNGSSTVLRSKPTKPLSKPALSLEASPIVR 258

Query: 325 AAP-QPIVPPDFPLDTPPNLRTTLPDRPISAGRSRPTPASSVRGSPETTST--GTVPRRA 384
           + P +P   P F ++ P NLRTTLPDRP +A  SR T A     S  + ST      R++
Sbjct: 259 SRPWEPYEMPGFSVEAPSNLRTTLPDRPQTASSSR-TRAFDASSSSRSASTERDVAKRQS 318

Query: 385 ASPTITR-------GRITDAPGR--------GRLNTNGHLSDSP----------ETRRLS 444
            SP+ +R       G +    G+        GRL ++    +             T RL+
Sbjct: 319 CSPSRSRAPNGNVNGAVPSLRGQRAKTNNDDGRLISHAAKGNQKVEKVVNMRKLATPRLT 378

Query: 445 SSSDL-----SGRRPVKASTTTAESNGFGRSISKKSLDMAIRHMDIRNGPGSVRSGSGNT 497
            S         G      S++ +   GFGR++SK S+DMA+RHMD+R G  +       T
Sbjct: 379 ESGSRRLGGGGGDSSAGKSSSGSGGFGFGRNLSKSSIDMALRHMDVRKGSMAGNFRHSVT 438

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_011653729.1  1.2e-300  100.00    mucin-5AC [Cucumis sativus] >KGN54463.1 hypothetical protein Csa_012940 [Cucumis...
XP_008463284.1  5.8e-292  97.58     PREDICTED: mucin-5AC [Cucumis melo] >TYK06228.1 mucin-5AC [Cucumis melo var. mak...
XP_038883319.1  5.8e-284  95.33     serine/arginine repetitive matrix protein 2 [Benincasa hispida]
XP_023543251.1  2.7e-265  90.17     uncharacterized protein YMR317W [Cucurbita pepo subsp. pepo] >XP_023543252.1 unc...
KAG7033708.1    7.8e-265  90.50     hypothetical protein SDJN02_03433 [Cucurbita argyrosperma subsp. argyrosperma]

Match Name      E-value   Identity  Description
A0A0A0KY97      5.6e-301  100.00    Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G335230 PE=4 SV=1
A0A5D3C4U4      2.8e-292  97.58     Mucin-5AC OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold287G00960 PE=4...
A0A1S3CIW9      2.8e-292  97.58     mucin-5AC OS=Cucumis melo OX=3656 GN=LOC103501479 PE=4 SV=1
A0A6J1GEV2      8.5e-265  90.33     mucin-5AC OS=Cucurbita moschata OX=3662 GN=LOC111453316 PE=4 SV=1
A0A6J1CRA9      1.4e-262  88.93     serine/arginine repetitive matrix protein 2 OS=Momordica charantia OX=3673 GN=LO...

Match Name      E-value   Identity  Description
AT3G08670.1     6.2e-143  58.89     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G40070.2     1.1e-25   31.89     FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
AT2G40070.1     3.1e-25   30.28     BEST Arabidopsis thaliana protein match is: proline-rich family protein (TAIR:AT...
AT3G09000.1     2.9e-23   31.80     proline-rich family protein
AT5G01280.1     4.3e-11   30.22     BEST Arabidopsis thaliana protein match is: proline-rich family protein (TAIR:AT...
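
The Identity column in the tables above is the percentage reported on each HSP statistics line (for example, 514/578 gives 88.93 for A0A6J1CRA9). As a minimal sketch, assuming the statistics lines keep the layout shown in the alignments on this page, the percentages can be recomputed from the raw counts. The function name `parse_hsp_stats` and the returned keys are hypothetical and are not part of any BLAST or database tool.

```python
import re

# Matches the statistics line printed under each "HSP 1 Score" line.
# "Posi?tives" accepts both the spelling "Positives" and the
# "Postives" variant that appears in some exports of this page.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP statistics line."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: " + repr(line))
    identical, aligned, _, positives, pos_aligned, _ = match.groups()
    return {
        "identical": int(identical),
        "aligned": int(aligned),
        # Percent identity = identical positions / alignment length.
        "identity_pct": round(100 * int(identical) / int(aligned), 2),
        # Percent positives = identical or conservative substitutions.
        "positives_pct": round(100 * int(positives) / int(pos_aligned), 2),
    }


stats = parse_hsp_stats(
    "Identity = 514/578 (88.93%), Positives = 538/578 (93.08%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Recomputing from the counts rather than trusting the printed percentage is a cheap consistency check when these lines are copied between tools.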
InterPro
Analysis Name: InterPro Annotations of Cucumber (Chinese Long) v3
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term    Source Description         Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction        coord: 391..431
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction        coord: 1..30
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction        coord: 311..328
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction        coord: 104..431
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction        coord: 104..310
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction        coord: 346..370
None      No IPR available  PANTHER      PTHR31949:SF2  GASTRIC MUCIN-LIKE PROTEIN  coord: 20..578
None      No IPR available  PANTHER      PTHR31949      GASTRIC MUCIN-LIKE PROTEIN  coord: 20..578

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CsaV3_4G027310.1  CsaV3_4G027310.1  mRNA