Cp4.1LG01g05120 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG01g05120
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionWAS/WASL-interacting protein family member 2, putative isoform 1
LocationCp4.1LG01: 882647 .. 884926 (-)
RNA-Seq ExpressionCp4.1LG01g05120
SyntenyCp4.1LG01g05120
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTTCCGATTCCACTTTCTGGCTTCCCCCCCACTTTCTCTCCGATCACCACACCCTCCCCGGAAAACCCATCTCCGCTCTTCATTCTCCCCTTCAATCTGACCTTCTGAACGACGACGATCAGGAAGACTTTCTTGCCGGCTTGACGCAACGCCTTACTCACTCCACCCTTCGTGATCGTCACAAACTTGTCTCTCTCCACGAATTCGAAGTACCCATCTCTTTAATTTTTTGATTGGCTCGATTTTTGTGATGGGTTTCTGATTTTCCGGTGGCTCTGTTTTTTTTTTTTTTTTTTTTTTTTTTGGCGAAGACTGGATCTCCTCAGTCTACTCTTAGTGGGTTTGGGAGTTGGTCGGCTTGGAGTTCGGTGTCCAGCGACGGCAGCCCAAACGGGCCATCTCAGGCGCCGTCGCCGCCGGTAACCCCTTTTGGCAGCGATGACAACACTTGGGATCTTATCTATGCTGCTGCGAAACAAGTTGCTAGGCTCAAAGTGAACAGTAGCAGAGATGGGGTTATTGGTCCTTCTCAAAGTTCGTCAAATCTTGTTTCTTCTGTCAAAGACGTTGGATTTTACTCCCACGCTTCGCAGGTGAAGTTCATTAGCTTCGGTTTTTCACTCCCCTGTTCTCTGTTTTGCTGTGTTTTTTTATATGAAGTTGTTCGACTCTGTTTTTTGTTTTCCTCTGTTTCAGTTCGGAACAGAGCTTCCGATTCGTAAACCGGAGAGCAGTGTCAACTGGGGAAGACAAGTGAAGGCTGAGAAACAGCGGATCCATTGCGGAGGCGGAGATTTTCATCATGAAAATGGGAGAATTGTTGTTCGTCCTGTAGATTTTTCTCAATCCACATGGTCATCTATGCAACGCAATCACCAAAGAAACCCTTCTCAATCGTGTGCTCCAGCCGTGCCCTCCACCTTCCATGGAGGCGGATCTGCCCCCAAAAAGGAATGCGCCGGAACGGGTGTCTTCTTGCCTCGTCGATACGACAACAACCCACCACAACCTCGCAAAAGAGCAGGTACTTTTAACAATTTGCAAACTTGGGTTCCTCAAATTTGAACATGAATTTTAATATTTGCTCTCTGTTCCTTCAGATTATGGCTCGATTGCTATGTTACCAAGGAGAAACGTACAGGACATGAACCGATCTGTTCCCCAAATGACGTCGAATCGCCGGAGCCAGGGTGAGCTCGTTGAAACGATTCACCAATCTGGAGATCTGTTAAAAAGTAGTTTCCTAATAAGCTTTTAATTTTGGTTTCGCAGAAGCTATAATGGCTCAAAGAAACGCCATTTTCGCAGAGCAGAGGCTAAGCTATTCCCGGCCGGCTGAGAGAGGCAAAAGCTATGAGTTTCTTCTTCCTCAGGAGTGGACATACTGAATTAACACTTACGAAATCTGAATGTAGTTCAAGTGTTTCTTTAGATTAGTGAACTATAAAGAAATGAATTAAGGGTTGTAATTTAAGAAGGGAAAAGGGGATTTGTTTGCAGGACTTCAGTTACCAGTTAGTTTTTTATAGATGAAGAACAGAGTATGCAGTGTTCATCATTCATCAGACAGCGGGGAAAGAGAAAAGGATATTTATTGAACAATGGGTTTTAGGAAACGAGTGAAAGAGAGATTGTAATAATTTTTGTGGGTTTCTTAATTGAAATACATAGCTATAGCTATTAACCCTTCTAAGATTAATAAATTATTTGTTGATTTATCATTAGAAGCAGAGTTTATTCCACTGAATTTTCAATTTTAAGAACCTTAAACTTGAACATAATATATTCTTGACCAAGAGGTCGAAGATTCCGGCCTCCTCATCTCCATTTGCATAATCCATCAGATAAAACATATATTACTTTCAATAAGGTCAAAAGTTCCAACATTCTATAGCGTTAAAAGTGCAATCTTGATCATCACAATTCTTGAATTCCAAAAGAAAAAAGAACCATATCATTTTCACATCAGATATTCATGGAAGACTACATTACAAATTAAAGTCGAACACTATTCTAAAGATAGGGACTAGTAAAGATACACTCGTTTGTTAATCTACGCATAGTTCAGTGGTTAATGAGGCATAAACCCACGAGCCTCACGACTAATCTCAATGATACTAACTTCAAGTGAGTGTTCGAGATAATTTGTGCATACTTCTATCAATCTTAGATAATTGCCTAAGTTGCTAAGAAACTTGCAGAATATTAAATGTTAGGTAGGTGGTCATGCCCATACCATCTGTCCATGCCAACCGTATGTTTTATGAATAGTTTTT

mRNA sequence

ATGGCTTCCGATTCCACTTTCTGGCTTCCCCCCCACTTTCTCTCCGATCACCACACCCTCCCCGGAAAACCCATCTCCGCTCTTCATTCTCCCCTTCAATCTGACCTTCTGAACGACGACGATCAGGAAGACTTTCTTGCCGGCTTGACGCAACGCCTTACTCACTCCACCCTTCGTGATCGTCACAAACTTGTCTCTCTCCACGAATTCGAAACTGGATCTCCTCAGTCTACTCTTAGTGGGTTTGGGAGTTGGTCGGCTTGGAGTTCGGTGTCCAGCGACGGCAGCCCAAACGGGCCATCTCAGGCGCCGTCGCCGCCGGTAACCCCTTTTGGCAGCGATGACAACACTTGGGATCTTATCTATGCTGCTGCGAAACAAGTTGCTAGGCTCAAAGTGAACAGTAGCAGAGATGGGGTTATTGGTCCTTCTCAAAGTTCGTCAAATCTTGTTTCTTCTGTCAAAGACGTTGGATTTTACTCCCACGCTTCGCAGTTCGGAACAGAGCTTCCGATTCGTAAACCGGAGAGCAGTGTCAACTGGGGAAGACAAGTGAAGGCTGAGAAACAGCGGATCCATTGCGGAGGCGGAGATTTTCATCATGAAAATGGGAGAATTGTTGTTCGTCCTGTAGATTTTTCTCAATCCACATGGTCATCTATGCAACGCAATCACCAAAGAAACCCTTCTCAATCGTGTGCTCCAGCCGTGCCCTCCACCTTCCATGGAGGCGGATCTGCCCCCAAAAAGGAATGCGCCGGAACGGGTGTCTTCTTGCCTCGTCGATACGACAACAACCCACCACAACCTCGCAAAAGAGCAGATTATGGCTCGATTGCTATGTTACCAAGGAGAAACGTACAGGACATGAACCGATCTGTTCCCCAAATGACGTCGAATCGCCGGAGCCAGGAAGCTATAATGGCTCAAAGAAACGCCATTTTCGCAGAGCAGAGGCTAAGCTATTCCCGGCCGGCTGAGAGAGGCAAAAGCTATGAGTTTCTTCTTCCTCAGGAGTGGACATACTGAATTAACACTTACGAAATCTGAATGTAGTTCAAGTGTTTCTTTAGATTAGTGAACTATAAAGAAATGAATTAAGGGTTGTAATTTAAGAAGGGAAAAGGGGATTTGTTTGCAGGACTTCAGTTACCAGTTAGTTTTTTATAGATGAAGAACAGAGTATGCAGTGTTCATCATTCATCAGACAGCGGGGAAAGAGAAAAGGATATTTATTGAACAATGGGTTTTAGGAAACGAGTGAAAGAGAGATTGTAATAATTTTTGTGGGTTTCTTAATTGAAATACATAGCTATAGCTATTAACCCTTCTAAGATTAATAAATTATTTGTTGATTTATCATTAGAAGCAGAGTTTATTCCACTGAATTTTCAATTTTAAGAACCTTAAACTTGAACATAATATATTCTTGACCAAGAGGTCGAAGATTCCGGCCTCCTCATCTCCATTTGCATAATCCATCAGATAAAACATATATTACTTTCAATAAGGTCAAAAGTTCCAACATTCTATAGCGTTAAAAGTGCAATCTTGATCATCACAATTCTTGAATTCCAAAAGAAAAAAGAACCATATCATTTTCACATCAGATATTCATGGAAGACTACATTACAAATTAAAGTCGAACACTATTCTAAAGATAGGGACTAGTAAAGATACACTCGTTTGTTAATCTACGCATAGTTCAGTGGTTAATGAGGCATAAACCCACGAGCCTCACGACTAATCTCAATGATACTAACTTCAAGTGAGTGTTCGAGATAATTTGTGCATACTTCTATCAATCTTAGATAATTGCCTAAGTTGCTAAGAAACTTGCAGAATATTAAATGTTAGGTAGGTGGTCATGCCCATACCATCTGTCCATGCCAACCGTATGTTTTATGAATAGTTTTT

Coding sequence (CDS)

ATGGCTTCCGATTCCACTTTCTGGCTTCCCCCCCACTTTCTCTCCGATCACCACACCCTCCCCGGAAAACCCATCTCCGCTCTTCATTCTCCCCTTCAATCTGACCTTCTGAACGACGACGATCAGGAAGACTTTCTTGCCGGCTTGACGCAACGCCTTACTCACTCCACCCTTCGTGATCGTCACAAACTTGTCTCTCTCCACGAATTCGAAACTGGATCTCCTCAGTCTACTCTTAGTGGGTTTGGGAGTTGGTCGGCTTGGAGTTCGGTGTCCAGCGACGGCAGCCCAAACGGGCCATCTCAGGCGCCGTCGCCGCCGGTAACCCCTTTTGGCAGCGATGACAACACTTGGGATCTTATCTATGCTGCTGCGAAACAAGTTGCTAGGCTCAAAGTGAACAGTAGCAGAGATGGGGTTATTGGTCCTTCTCAAAGTTCGTCAAATCTTGTTTCTTCTGTCAAAGACGTTGGATTTTACTCCCACGCTTCGCAGTTCGGAACAGAGCTTCCGATTCGTAAACCGGAGAGCAGTGTCAACTGGGGAAGACAAGTGAAGGCTGAGAAACAGCGGATCCATTGCGGAGGCGGAGATTTTCATCATGAAAATGGGAGAATTGTTGTTCGTCCTGTAGATTTTTCTCAATCCACATGGTCATCTATGCAACGCAATCACCAAAGAAACCCTTCTCAATCGTGTGCTCCAGCCGTGCCCTCCACCTTCCATGGAGGCGGATCTGCCCCCAAAAAGGAATGCGCCGGAACGGGTGTCTTCTTGCCTCGTCGATACGACAACAACCCACCACAACCTCGCAAAAGAGCAGATTATGGCTCGATTGCTATGTTACCAAGGAGAAACGTACAGGACATGAACCGATCTGTTCCCCAAATGACGTCGAATCGCCGGAGCCAGGAAGCTATAATGGCTCAAAGAAACGCCATTTTCGCAGAGCAGAGGCTAAGCTATTCCCGGCCGGCTGAGAGAGGCAAAAGCTATGAGTTTCTTCTTCCTCAGGAGTGGACATACTGA

Protein sequence

MASDSTFWLPPHFLSDHHTLPGKPISALHSPLQSDLLNDDDQEDFLAGLTQRLTHSTLRDRHKLVSLHEFETGSPQSTLSGFGSWSAWSSVSSDGSPNGPSQAPSPPVTPFGSDDNTWDLIYAAAKQVARLKVNSSRDGVIGPSQSSSNLVSSVKDVGFYSHASQFGTELPIRKPESSVNWGRQVKAEKQRIHCGGGDFHHENGRIVVRPVDFSQSTWSSMQRNHQRNPSQSCAPAVPSTFHGGGSAPKKECAGTGVFLPRRYDNNPPQPRKRADYGSIAMLPRRNVQDMNRSVPQMTSNRRSQEAIMAQRNAIFAEQRLSYSRPAERGKSYEFLLPQEWTY
Homology
BLAST of Cp4.1LG01g05120 vs. NCBI nr
Match: XP_023541583.1 (uncharacterized protein LOC111801704 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 692 bits (1785), Expect = 3.21e-251
Identity = 342/344 (99.42%), Postives = 342/344 (99.42%), Query Frame = 0

Query: 1   MASDSTFWLPPHFLSDHHTLPGKPISALHSPLQSDLLNDDDQEDFLAGLTQRLTHSTLRD 60
           MASDSTFWLPPHFLSDHHTLPGKPISALHSPLQSDLLNDDDQEDFLAGLTQRLTHSTLRD
Sbjct: 1   MASDSTFWLPPHFLSDHHTLPGKPISALHSPLQSDLLNDDDQEDFLAGLTQRLTHSTLRD 60

Query: 61  RHKLVSLHEFE--TGSPQSTLSGFGSWSAWSSVSSDGSPNGPSQAPSPPVTPFGSDDNTW 120
           RHKLVSLHEFE  TGSPQSTLSGFGSWSAWSSVSSDGSPNGPSQAPSPPVTPFGSDDNTW
Sbjct: 61  RHKLVSLHEFEAKTGSPQSTLSGFGSWSAWSSVSSDGSPNGPSQAPSPPVTPFGSDDNTW 120

Query: 121 DLIYAAAKQVARLKVNSSRDGVIGPSQSSSNLVSSVKDVGFYSHASQFGTELPIRKPESS 180
           DLIYAAAKQVARLKVNSSRDGVIGPSQSSSNLVSSVKDVGFYSHASQFGTELPIRKPESS
Sbjct: 121 DLIYAAAKQVARLKVNSSRDGVIGPSQSSSNLVSSVKDVGFYSHASQFGTELPIRKPESS 180

Query: 181 VNWGRQVKAEKQRIHCGGGDFHHENGRIVVRPVDFSQSTWSSMQRNHQRNPSQSCAPAVP 240
           VNWGRQVKAEKQRIHCGGGDFHHENGRIVVRPVDFSQSTWSSMQRNHQRNPSQSCAPAVP
Sbjct: 181 VNWGRQVKAEKQRIHCGGGDFHHENGRIVVRPVDFSQSTWSSMQRNHQRNPSQSCAPAVP 240

Query: 241 STFHGGGSAPKKECAGTGVFLPRRYDNNPPQPRKRADYGSIAMLPRRNVQDMNRSVPQMT 300
           STFHGGGSAPKKECAGTGVFLPRRYDNNPPQPRKRADYGSIAMLPRRNVQDMNRSVPQMT
Sbjct: 241 STFHGGGSAPKKECAGTGVFLPRRYDNNPPQPRKRADYGSIAMLPRRNVQDMNRSVPQMT 300

Query: 301 SNRRSQEAIMAQRNAIFAEQRLSYSRPAERGKSYEFLLPQEWTY 342
           SNRRSQEAIMAQRNAIFAEQRLSYSRPAERGKSYEFLLPQEWTY
Sbjct: 301 SNRRSQEAIMAQRNAIFAEQRLSYSRPAERGKSYEFLLPQEWTY 344

BLAST of Cp4.1LG01g05120 vs. NCBI nr
Match: XP_022996098.1 (uncharacterized protein LOC111491411 [Cucurbita maxima])

HSP 1 Score: 675 bits (1742), Expect = 1.15e-244
Identity = 333/344 (96.80%), Postives = 338/344 (98.26%), Query Frame = 0

Query: 1   MASDSTFWLPPHFLSDHHTLPGKPISALHSPLQSDLLNDDDQEDFLAGLTQRLTHSTLRD 60
           MASDSTFWLPPHFLSDHHTLPGKPISALHSPLQSDLLNDDDQEDFLAGLTQRLTHSTLRD
Sbjct: 1   MASDSTFWLPPHFLSDHHTLPGKPISALHSPLQSDLLNDDDQEDFLAGLTQRLTHSTLRD 60

Query: 61  RHKLVSLHEFE--TGSPQSTLSGFGSWSAWSSVSSDGSPNGPSQAPSPPVTPFGSDDNTW 120
           RHKLVSLHEFE  TGSPQSTLSGFGSWSAWSSVSS+GSPNGPS APSPPVTP+GSDDNTW
Sbjct: 61  RHKLVSLHEFEAKTGSPQSTLSGFGSWSAWSSVSSEGSPNGPSLAPSPPVTPYGSDDNTW 120

Query: 121 DLIYAAAKQVARLKVNSSRDGVIGPSQSSSNLVSSVKDVGFYSHASQFGTELPIRKPESS 180
           DLIYAAAKQVARLKVN+SRDGVIGPSQ SSNLVSSVKDVGFYSHASQFGTELPIRKPESS
Sbjct: 121 DLIYAAAKQVARLKVNNSRDGVIGPSQISSNLVSSVKDVGFYSHASQFGTELPIRKPESS 180

Query: 181 VNWGRQVKAEKQRIHCGGGDFHHENGRIVVRPVDFSQSTWSSMQRNHQRNPSQSCAPAVP 240
           VNWGRQ+KAE QRIHCGGGDFHHENG IVVRPVDFSQSTWSSMQRNHQRNPSQSCAPAVP
Sbjct: 181 VNWGRQMKAENQRIHCGGGDFHHENGVIVVRPVDFSQSTWSSMQRNHQRNPSQSCAPAVP 240

Query: 241 STFHGGGSAPKKECAGTGVFLPRRYDNNPPQPRKRADYGSIAMLPRRNVQDMNRSVPQMT 300
           STFHGGGSAPKKECAGTGVFLPRRYDNNPPQPRKRADYGSIAMLPRRNVQDMNRSVPQMT
Sbjct: 241 STFHGGGSAPKKECAGTGVFLPRRYDNNPPQPRKRADYGSIAMLPRRNVQDMNRSVPQMT 300

Query: 301 SNRRSQEAIMAQRNAIFAEQRLSYSRPAERGKSYEFLLPQEWTY 342
           SNR+SQEAIMAQRNAIFAEQRLSYSRPAERGKSYEFLLPQEWTY
Sbjct: 301 SNRQSQEAIMAQRNAIFAEQRLSYSRPAERGKSYEFLLPQEWTY 344

BLAST of Cp4.1LG01g05120 vs. NCBI nr
Match: XP_022942517.1 (uncharacterized protein LOC111447530 [Cucurbita moschata])

HSP 1 Score: 674 bits (1738), Expect = 4.69e-244
Identity = 333/344 (96.80%), Postives = 336/344 (97.67%), Query Frame = 0

Query: 1   MASDSTFWLPPHFLSDHHTLPGKPISALHSPLQSDLLNDDDQEDFLAGLTQRLTHSTLRD 60
           MASDSTFWLPPHFLSDHHTLPGKPI+ALHSPLQS LLNDDDQEDFLAGLTQRLTHSTLRD
Sbjct: 1   MASDSTFWLPPHFLSDHHTLPGKPIAALHSPLQSVLLNDDDQEDFLAGLTQRLTHSTLRD 60

Query: 61  RHKLVSLHEFE--TGSPQSTLSGFGSWSAWSSVSSDGSPNGPSQAPSPPVTPFGSDDNTW 120
           RHKLVSLHEFE  T SPQSTLSGFGSWSAWSSVSSDGSPNGPSQAPSPPVTPFGSDDNTW
Sbjct: 61  RHKLVSLHEFEAKTRSPQSTLSGFGSWSAWSSVSSDGSPNGPSQAPSPPVTPFGSDDNTW 120

Query: 121 DLIYAAAKQVARLKVNSSRDGVIGPSQSSSNLVSSVKDVGFYSHASQFGTELPIRKPESS 180
           DLIYAAAKQVARLKVN+SRDGVIGPSQSSSNLVSSVKDVGFYSHA QFGTELPIRKPESS
Sbjct: 121 DLIYAAAKQVARLKVNNSRDGVIGPSQSSSNLVSSVKDVGFYSHAPQFGTELPIRKPESS 180

Query: 181 VNWGRQVKAEKQRIHCGGGDFHHENGRIVVRPVDFSQSTWSSMQRNHQRNPSQSCAPAVP 240
           VNWGRQVKAE QRIHCGGGDFHH+NGRIVVRPVDFSQSTWSSMQRNHQRNPSQSCAPAVP
Sbjct: 181 VNWGRQVKAENQRIHCGGGDFHHKNGRIVVRPVDFSQSTWSSMQRNHQRNPSQSCAPAVP 240

Query: 241 STFHGGGSAPKKECAGTGVFLPRRYDNNPPQPRKRADYGSIAMLPRRNVQDMNRSVPQMT 300
           STFHGGGSAPKKECAGTGVFLPRRYDNNPPQPRKRADYGSIAMLPRRNVQDM RSVPQMT
Sbjct: 241 STFHGGGSAPKKECAGTGVFLPRRYDNNPPQPRKRADYGSIAMLPRRNVQDMTRSVPQMT 300

Query: 301 SNRRSQEAIMAQRNAIFAEQRLSYSRPAERGKSYEFLLPQEWTY 342
           SNRRSQEAIM QRNAIFAEQRLSYSRPAERGKSYEFLLPQEWTY
Sbjct: 301 SNRRSQEAIMGQRNAIFAEQRLSYSRPAERGKSYEFLLPQEWTY 344

BLAST of Cp4.1LG01g05120 vs. NCBI nr
Match: KAG6600025.1 (hypothetical protein SDJN03_05258, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 672 bits (1733), Expect = 2.71e-243
Identity = 333/344 (96.80%), Postives = 335/344 (97.38%), Query Frame = 0

Query: 1   MASDSTFWLPPHFLSDHHTLPGKPISALHSPLQSDLLNDDDQEDFLAGLTQRLTHSTLRD 60
           MASDSTFWL  HFLSDHHTLPGKPI+ALHSPLQS LLNDDDQEDFLAGLTQRLTHSTLRD
Sbjct: 1   MASDSTFWLRHHFLSDHHTLPGKPIAALHSPLQSVLLNDDDQEDFLAGLTQRLTHSTLRD 60

Query: 61  RHKLVSLHEFE--TGSPQSTLSGFGSWSAWSSVSSDGSPNGPSQAPSPPVTPFGSDDNTW 120
           RHKLVSLHEFE  TGSPQSTL GFGSWSAWSSVSSDGSPNGPSQAPSPPVTPFGSDDNTW
Sbjct: 61  RHKLVSLHEFEAKTGSPQSTLCGFGSWSAWSSVSSDGSPNGPSQAPSPPVTPFGSDDNTW 120

Query: 121 DLIYAAAKQVARLKVNSSRDGVIGPSQSSSNLVSSVKDVGFYSHASQFGTELPIRKPESS 180
           DLIYAAAKQVARLKVN+SRDGVIGPSQSSSNLVSSVKDVGFYSHA QFGTELPIRKPESS
Sbjct: 121 DLIYAAAKQVARLKVNNSRDGVIGPSQSSSNLVSSVKDVGFYSHAPQFGTELPIRKPESS 180

Query: 181 VNWGRQVKAEKQRIHCGGGDFHHENGRIVVRPVDFSQSTWSSMQRNHQRNPSQSCAPAVP 240
           VNWGRQVKAE QRIHCGGGDFHHENGRIVVRPVDFSQSTWSSMQRNHQRNPSQSCAPAVP
Sbjct: 181 VNWGRQVKAENQRIHCGGGDFHHENGRIVVRPVDFSQSTWSSMQRNHQRNPSQSCAPAVP 240

Query: 241 STFHGGGSAPKKECAGTGVFLPRRYDNNPPQPRKRADYGSIAMLPRRNVQDMNRSVPQMT 300
           STFHGGGSAPKKECAGTGVFLPRRYDNNPPQPRKRADYGSIAMLPRRNVQDMNRSVPQMT
Sbjct: 241 STFHGGGSAPKKECAGTGVFLPRRYDNNPPQPRKRADYGSIAMLPRRNVQDMNRSVPQMT 300

Query: 301 SNRRSQEAIMAQRNAIFAEQRLSYSRPAERGKSYEFLLPQEWTY 342
           SNRRSQEAIM QRNAIFAEQRLSYSRPAERGKSYEFLLPQEWTY
Sbjct: 301 SNRRSQEAIMGQRNAIFAEQRLSYSRPAERGKSYEFLLPQEWTY 344

BLAST of Cp4.1LG01g05120 vs. NCBI nr
Match: KAG7030694.1 (hypothetical protein SDJN02_04731 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 663 bits (1710), Expect = 1.36e-239
Identity = 333/356 (93.54%), Postives = 335/356 (94.10%), Query Frame = 0

Query: 1   MASDSTFWLPPHFLSDHHTLPGKPISALHSPLQSDLLNDDDQEDFLAGLTQRLTHSTLRD 60
           MASDSTFWL  HFLSDHHTLPGKPI+ALHSPLQS LLNDDDQEDFLAGLTQRLTHSTLRD
Sbjct: 1   MASDSTFWLRHHFLSDHHTLPGKPIAALHSPLQSVLLNDDDQEDFLAGLTQRLTHSTLRD 60

Query: 61  RHKLVSLHEFE--TGSPQSTLSGFGSWSAWSSVSSDGSPNGPSQAPSPPVTPFGSDDNTW 120
           RHKLVSLHEFE  TGSPQSTL GFGSWSAWSSVSSDGSPNGPSQAPSPPVTPFGSDDNTW
Sbjct: 61  RHKLVSLHEFEAKTGSPQSTLCGFGSWSAWSSVSSDGSPNGPSQAPSPPVTPFGSDDNTW 120

Query: 121 DLIYAAAKQVARLKVNSSRDGVIGPSQSSSNLVSSVKDVGFYSHASQ------------F 180
           DLIYAAAKQVARLKVN+SRDGVIGPSQSSSNLVSSVKDVGFYSHA Q            F
Sbjct: 121 DLIYAAAKQVARLKVNNSRDGVIGPSQSSSNLVSSVKDVGFYSHAPQLFDSVFCFPLFQF 180

Query: 181 GTELPIRKPESSVNWGRQVKAEKQRIHCGGGDFHHENGRIVVRPVDFSQSTWSSMQRNHQ 240
           GTELPIRKPESSVNWGRQVKAE QRIHCGGGDFHHENGRIVVRPVDFSQSTWSSMQRNHQ
Sbjct: 181 GTELPIRKPESSVNWGRQVKAENQRIHCGGGDFHHENGRIVVRPVDFSQSTWSSMQRNHQ 240

Query: 241 RNPSQSCAPAVPSTFHGGGSAPKKECAGTGVFLPRRYDNNPPQPRKRADYGSIAMLPRRN 300
           RNPSQSCAPAVPSTFHGGGSAPKKECAGTGVFLPRRYDNNPPQPRKRADYGSIAMLPRRN
Sbjct: 241 RNPSQSCAPAVPSTFHGGGSAPKKECAGTGVFLPRRYDNNPPQPRKRADYGSIAMLPRRN 300

Query: 301 VQDMNRSVPQMTSNRRSQEAIMAQRNAIFAEQRLSYSRPAERGKSYEFLLPQEWTY 342
           VQDMNRSVPQMTSNRRSQEAIM QRNAIFAEQRLSYSRPAERGKSYEFLLPQEWTY
Sbjct: 301 VQDMNRSVPQMTSNRRSQEAIMGQRNAIFAEQRLSYSRPAERGKSYEFLLPQEWTY 356

BLAST of Cp4.1LG01g05120 vs. ExPASy TrEMBL
Match: A0A6J1K9W4 (uncharacterized protein LOC111491411 OS=Cucurbita maxima OX=3661 GN=LOC111491411 PE=4 SV=1)

HSP 1 Score: 675 bits (1742), Expect = 5.58e-245
Identity = 333/344 (96.80%), Postives = 338/344 (98.26%), Query Frame = 0

Query: 1   MASDSTFWLPPHFLSDHHTLPGKPISALHSPLQSDLLNDDDQEDFLAGLTQRLTHSTLRD 60
           MASDSTFWLPPHFLSDHHTLPGKPISALHSPLQSDLLNDDDQEDFLAGLTQRLTHSTLRD
Sbjct: 1   MASDSTFWLPPHFLSDHHTLPGKPISALHSPLQSDLLNDDDQEDFLAGLTQRLTHSTLRD 60

Query: 61  RHKLVSLHEFE--TGSPQSTLSGFGSWSAWSSVSSDGSPNGPSQAPSPPVTPFGSDDNTW 120
           RHKLVSLHEFE  TGSPQSTLSGFGSWSAWSSVSS+GSPNGPS APSPPVTP+GSDDNTW
Sbjct: 61  RHKLVSLHEFEAKTGSPQSTLSGFGSWSAWSSVSSEGSPNGPSLAPSPPVTPYGSDDNTW 120

Query: 121 DLIYAAAKQVARLKVNSSRDGVIGPSQSSSNLVSSVKDVGFYSHASQFGTELPIRKPESS 180
           DLIYAAAKQVARLKVN+SRDGVIGPSQ SSNLVSSVKDVGFYSHASQFGTELPIRKPESS
Sbjct: 121 DLIYAAAKQVARLKVNNSRDGVIGPSQISSNLVSSVKDVGFYSHASQFGTELPIRKPESS 180

Query: 181 VNWGRQVKAEKQRIHCGGGDFHHENGRIVVRPVDFSQSTWSSMQRNHQRNPSQSCAPAVP 240
           VNWGRQ+KAE QRIHCGGGDFHHENG IVVRPVDFSQSTWSSMQRNHQRNPSQSCAPAVP
Sbjct: 181 VNWGRQMKAENQRIHCGGGDFHHENGVIVVRPVDFSQSTWSSMQRNHQRNPSQSCAPAVP 240

Query: 241 STFHGGGSAPKKECAGTGVFLPRRYDNNPPQPRKRADYGSIAMLPRRNVQDMNRSVPQMT 300
           STFHGGGSAPKKECAGTGVFLPRRYDNNPPQPRKRADYGSIAMLPRRNVQDMNRSVPQMT
Sbjct: 241 STFHGGGSAPKKECAGTGVFLPRRYDNNPPQPRKRADYGSIAMLPRRNVQDMNRSVPQMT 300

Query: 301 SNRRSQEAIMAQRNAIFAEQRLSYSRPAERGKSYEFLLPQEWTY 342
           SNR+SQEAIMAQRNAIFAEQRLSYSRPAERGKSYEFLLPQEWTY
Sbjct: 301 SNRQSQEAIMAQRNAIFAEQRLSYSRPAERGKSYEFLLPQEWTY 344

BLAST of Cp4.1LG01g05120 vs. ExPASy TrEMBL
Match: A0A6J1FQG8 (uncharacterized protein LOC111447530 OS=Cucurbita moschata OX=3662 GN=LOC111447530 PE=4 SV=1)

HSP 1 Score: 674 bits (1738), Expect = 2.27e-244
Identity = 333/344 (96.80%), Postives = 336/344 (97.67%), Query Frame = 0

Query: 1   MASDSTFWLPPHFLSDHHTLPGKPISALHSPLQSDLLNDDDQEDFLAGLTQRLTHSTLRD 60
           MASDSTFWLPPHFLSDHHTLPGKPI+ALHSPLQS LLNDDDQEDFLAGLTQRLTHSTLRD
Sbjct: 1   MASDSTFWLPPHFLSDHHTLPGKPIAALHSPLQSVLLNDDDQEDFLAGLTQRLTHSTLRD 60

Query: 61  RHKLVSLHEFE--TGSPQSTLSGFGSWSAWSSVSSDGSPNGPSQAPSPPVTPFGSDDNTW 120
           RHKLVSLHEFE  T SPQSTLSGFGSWSAWSSVSSDGSPNGPSQAPSPPVTPFGSDDNTW
Sbjct: 61  RHKLVSLHEFEAKTRSPQSTLSGFGSWSAWSSVSSDGSPNGPSQAPSPPVTPFGSDDNTW 120

Query: 121 DLIYAAAKQVARLKVNSSRDGVIGPSQSSSNLVSSVKDVGFYSHASQFGTELPIRKPESS 180
           DLIYAAAKQVARLKVN+SRDGVIGPSQSSSNLVSSVKDVGFYSHA QFGTELPIRKPESS
Sbjct: 121 DLIYAAAKQVARLKVNNSRDGVIGPSQSSSNLVSSVKDVGFYSHAPQFGTELPIRKPESS 180

Query: 181 VNWGRQVKAEKQRIHCGGGDFHHENGRIVVRPVDFSQSTWSSMQRNHQRNPSQSCAPAVP 240
           VNWGRQVKAE QRIHCGGGDFHH+NGRIVVRPVDFSQSTWSSMQRNHQRNPSQSCAPAVP
Sbjct: 181 VNWGRQVKAENQRIHCGGGDFHHKNGRIVVRPVDFSQSTWSSMQRNHQRNPSQSCAPAVP 240

Query: 241 STFHGGGSAPKKECAGTGVFLPRRYDNNPPQPRKRADYGSIAMLPRRNVQDMNRSVPQMT 300
           STFHGGGSAPKKECAGTGVFLPRRYDNNPPQPRKRADYGSIAMLPRRNVQDM RSVPQMT
Sbjct: 241 STFHGGGSAPKKECAGTGVFLPRRYDNNPPQPRKRADYGSIAMLPRRNVQDMTRSVPQMT 300

Query: 301 SNRRSQEAIMAQRNAIFAEQRLSYSRPAERGKSYEFLLPQEWTY 342
           SNRRSQEAIM QRNAIFAEQRLSYSRPAERGKSYEFLLPQEWTY
Sbjct: 301 SNRRSQEAIMGQRNAIFAEQRLSYSRPAERGKSYEFLLPQEWTY 344

BLAST of Cp4.1LG01g05120 vs. ExPASy TrEMBL
Match: A0A6J1GY57 (uncharacterized protein LOC111458270 OS=Cucurbita moschata OX=3662 GN=LOC111458270 PE=4 SV=1)

HSP 1 Score: 480 bits (1235), Expect = 1.42e-167
Identity = 250/357 (70.03%), Postives = 280/357 (78.43%), Query Frame = 0

Query: 1   MASDSTFWLPPHFLSDHHTLPGK-----PISALHSPLQSDLLNDD--DQEDFLAGLTQRL 60
           MAS S F LPPHFLSDHH LP         S +HSPL S L +DD  D  DFLA LT RL
Sbjct: 1   MASASNFSLPPHFLSDHHNLPTDFPYHFNSSPVHSPLGSVLGDDDNDDDADFLAALTHRL 60

Query: 61  THSTLRDRHKLVSLHEFET-----GSPQSTLSGFGSWSAWSSVSSDGSPNGPSQAPSPPV 120
           TH+TLRD  K  ++H+ +       SPQSTLSGFGSWSAWSSVSSDGSPNGPSQAPSPP 
Sbjct: 61  THTTLRDSMKPAAVHKPQAKTAMASSPQSTLSGFGSWSAWSSVSSDGSPNGPSQAPSPPT 120

Query: 121 TPFGSDDNTWDLIYAAAKQVARLKVNSSRDGVIGPSQSSSNLVSSVKDVGFYSHASQFGT 180
           TPFG D++TWDLIYAAA QVARLK+N+ RDG+IGPSQSSS L SS+++ GFYSH SQFGT
Sbjct: 121 TPFGGDEDTWDLIYAAAGQVARLKMNTQRDGIIGPSQSSSKLASSMRNAGFYSHPSQFGT 180

Query: 181 ELPIRKPESSVNWGRQVKAEKQRIHCGGGDFHHENGRIVVRPVDFSQSTWSSMQRNHQRN 240
           E PI K ES +NWGRQVK E Q+I+C GGDF HE+G  V R VDF QS W S+  +H+RN
Sbjct: 181 EPPINKQESCLNWGRQVKVENQQIYCRGGDFPHEHGNFV-RAVDFPQSAWPSLHPHHRRN 240

Query: 241 PSQSCAPAVPSTFHGGGSAPKKECAGTGVFLPRRYDNNPPQPRKRADYGSIAMLPRRNVQ 300
           PSQ   PAV + +HG GSAPKKEC GTGVFLPRRYDNNPPQ RKRAD GSI +LP +NVQ
Sbjct: 241 PSQPSTPAVSAAYHGSGSAPKKECTGTGVFLPRRYDNNPPQSRKRADCGSITLLPGKNVQ 300

Query: 301 DMNRSVPQMTSNRR---SQEAIMAQRNAIFAEQRLSYSRPAERGKSYEFLLPQEWTY 342
           D NRSVPQMTSNRR     EA+MAQRNAIFA+QRLSYSRPAERG+S+EFLLPQEWTY
Sbjct: 301 DFNRSVPQMTSNRRLPPCYEALMAQRNAIFAQQRLSYSRPAERGQSHEFLLPQEWTY 356

BLAST of Cp4.1LG01g05120 vs. ExPASy TrEMBL
Match: A0A6J1IIE5 (uncharacterized protein LOC111477184 OS=Cucurbita maxima OX=3661 GN=LOC111477184 PE=4 SV=1)

HSP 1 Score: 479 bits (1234), Expect = 2.01e-167
Identity = 250/357 (70.03%), Postives = 280/357 (78.43%), Query Frame = 0

Query: 1   MASDSTFWLPPHFLSDHHTLPGK-----PISALHSPLQSDLLNDDDQED--FLAGLTQRL 60
           MAS S F LPPHFLSDHH LP         S +HSPL S L +DD+  D  FLA LT RL
Sbjct: 1   MASASNFSLPPHFLSDHHNLPTDFPYHFNSSPVHSPLGSVLADDDNDHDGDFLAALTHRL 60

Query: 61  THSTLRDRHKLVSLHEFET-----GSPQSTLSGFGSWSAWSSVSSDGSPNGPSQAPSPPV 120
           TH+TLRD  K  S+H+ +       SPQSTLSGFGSWSAWSSVSS+GSPNGPSQAPSPP 
Sbjct: 61  THTTLRDSLKPASVHKPQAKTPMASSPQSTLSGFGSWSAWSSVSSEGSPNGPSQAPSPPT 120

Query: 121 TPFGSDDNTWDLIYAAAKQVARLKVNSSRDGVIGPSQSSSNLVSSVKDVGFYSHASQFGT 180
           TPFG D++TWDLIYAAA QVARLK+N+ RDG+IGPSQSSS L SS+++ GFYSH SQFGT
Sbjct: 121 TPFGGDEDTWDLIYAAAGQVARLKMNTQRDGIIGPSQSSSKLASSMRNAGFYSHPSQFGT 180

Query: 181 ELPIRKPESSVNWGRQVKAEKQRIHCGGGDFHHENGRIVVRPVDFSQSTWSSMQRNHQRN 240
           E PI K ES +NWGRQVK E Q+I+C  GDFHHE+G  V R VDF QS W S+  +H+RN
Sbjct: 181 EPPINKQESCLNWGRQVKVENQQIYCRRGDFHHEHGNFV-RAVDFPQSAWPSLHPHHRRN 240

Query: 241 PSQSCAPAVPSTFHGGGSAPKKECAGTGVFLPRRYDNNPPQPRKRADYGSIAMLPRRNVQ 300
           PSQ   PAV + + GGGSAPKKEC GTGVFLPRRYDNNPPQ RKRAD GSI +LP +NVQ
Sbjct: 241 PSQPSTPAVSAAYQGGGSAPKKECTGTGVFLPRRYDNNPPQSRKRADCGSITLLPGKNVQ 300

Query: 301 DMNRSVPQMTSNRR---SQEAIMAQRNAIFAEQRLSYSRPAERGKSYEFLLPQEWTY 342
           D NRSVPQMTSNRR   S EA+MAQRNAIFA QRLSYSRPAERG+S+EFLLPQEWTY
Sbjct: 301 DFNRSVPQMTSNRRLPPSYEALMAQRNAIFAPQRLSYSRPAERGQSHEFLLPQEWTY 356

BLAST of Cp4.1LG01g05120 vs. ExPASy TrEMBL
Match: A0A5A7UBI2 (WAS/WASL-interacting protein family member 2, putative isoform 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold101G00180 PE=4 SV=1)

HSP 1 Score: 444 bits (1142), Expect = 2.68e-153
Identity = 236/368 (64.13%), Postives = 279/368 (75.82%), Query Frame = 0

Query: 1   MASDSTFWLPPHFLSDHHTLPGKPISA---------------LHSPLQSDLLNDD--DQE 60
           MASDSTF+LPPHFLSDH  LP KP S+               +HSP+ S L +DD  D++
Sbjct: 1   MASDSTFYLPPHFLSDHDNLPPKPTSSALFPTDFPYDFTSSSVHSPVDSVLGDDDNDDEQ 60

Query: 61  DFLAGLTQRLTHSTLRDRHKLVSLHEFET-----GSPQSTLSGFGSWSAWSSVSSDGSPN 120
           DFLA LTQRLT STLRD  KL S+H+ +      GSPQSTLSG GSWSAWSSVSSDGSPN
Sbjct: 61  DFLAALTQRLTQSTLRDSQKLPSVHKSQAKMAMAGSPQSTLSGVGSWSAWSSVSSDGSPN 120

Query: 121 GPSQAPSPPVTPFGSDDNTWDLIYAAAKQVARLKVNSSRDGVIGPSQSSSNLVSSVKDVG 180
           GPS APSPP TPFG ++NTWDLIYAAA QVARLK+N+ RDG+IGPSQSSSNLVSSV + G
Sbjct: 121 GPSLAPSPPTTPFGGENNTWDLIYAAAGQVARLKMNTHRDGIIGPSQSSSNLVSSVHNAG 180

Query: 181 FYSHASQFGTELPIRKPESSVNWGR-QVKAEKQRIHCGGGDFHHENGRIVVRPVDFSQST 240
            YSH SQFGT+ PI KPE+S +WGR QVK E Q+IH  G DF+HEN R + RP+D +QS 
Sbjct: 181 LYSHPSQFGTDPPIYKPENSSHWGRRQVKVENQQIHYRGQDFYHENERFL-RPLDITQSA 240

Query: 241 WSSMQRNHQRNPSQSCAPAVPSTFHGGGSAPKKECAGTGVFLPRRYDNNPPQPRKRADYG 300
           W S+  +H+  PSQ   PA  + +HG GSAPKKECAGTGVFLPRRYDNNPPQ R+RAD  
Sbjct: 241 WPSLHPHHRSYPSQPSTPAAHAAYHGVGSAPKKECAGTGVFLPRRYDNNPPQSRRRADSP 300

Query: 301 SIAMLPRRNVQDMNRSVPQMTSNRRSQ---EAIMAQRNAIFAEQRLSYSRPAERGKSYEF 342
           S+A++P +N+Q +N S+P   SNRR Q   +A++AQRN IFA+QRLSY R AER K++EF
Sbjct: 301 SVALVPAKNIQGLNGSIPP--SNRRLQPSYDALIAQRNTIFAQQRLSYPRLAERSKTHEF 360

BLAST of Cp4.1LG01g05120 vs. TAIR 10
Match: AT2G39870.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G55690.1); Has 73 Blast hits to 71 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 2; Fungi - 2; Plants - 69; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 78.6 bits (192), Expect = 1.1e-14
Identity = 104/325 (32.00%), Postives = 133/325 (40.92%), Query Frame = 0

Query: 38  NDDDQEDFLAGLTQRLTHSTLRDRHKLVSLHE---FETGSPQSTLSGFGSWSAWSSVSSD 97
           + DD+EDFLAGLT+RL  ST R    L    E       SPQSTLSG GS+S     S  
Sbjct: 61  SSDDEEDFLAGLTRRLAPSTQRLPSPLFKSEEKRQVAATSPQSTLSGLGSFSN----SGS 120

Query: 98  GSPNGPSQAPSPPVTPFGSDDNTWDLIYAAAKQVARLKVNSSRDGVIGPSQSSSNLVSSV 157
            SP  PS  P  P + F   DN WD+I AAA +VARLK+ S                   
Sbjct: 121 RSPILPS--PPAPTSSF-RRDNAWDVISAAAGEVARLKLGS------------------- 180

Query: 158 KDVGFYSHASQFGTELPIRKPESSV---NWGRQVKAEKQRI-----HCGG-GDFHHENGR 217
               +  H       LP++ PES +   N     + + QR+      C     F     R
Sbjct: 181 ----YEPH------HLPLQTPESLLRRQNAAIHAELQHQRLIEQMWLCSAQSRFKLSENR 240

Query: 218 IVVRPVD----FSQSTWSSMQRNHQRNPSQSCAPAVPSTFHGGGSAPKKECAGTGVFLPR 277
           I  R V+    F    +          P Q+ AP             K+  AGTGVFLPR
Sbjct: 241 IPRRVVNEEGLFENPRYVRRNNPTWLPPQQAAAPL------------KRPSAGTGVFLPR 300

Query: 278 RYDNNPPQPRKRADYGSIAML-PRRNVQDMN-RSVPQMTSNRRSQ--EAIMAQRNAIFAE 337
           RY +  P    +    + AML P+   Q++N      +   RRSQ     M  R+ + A 
Sbjct: 301 RYPSAAPSDSLKTPVNTPAMLQPKVKPQNLNFDEFTNIVGPRRSQFDYECMLARSTVLAR 330

Query: 338 QRLSYSRPAERGKSYEFLLPQEWTY 343
           Q     R    G      LPQ+W Y
Sbjct: 361 Q--GNFRAVSGGG-----LPQDWMY 330

BLAST of Cp4.1LG01g05120 vs. TAIR 10
Match: AT3G55690.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G39870.1); Has 76 Blast hits to 69 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 3; Plants - 69; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 60.8 bits (146), Expect = 2.4e-09
Identity = 67/245 (27.35%), Postives = 104/245 (42.45%), Query Frame = 0

Query: 40  DDQEDFLAGLTQRLTHSTLRDRHKLVSLHEFETGSPQSTLSGFGSWSAWSSVSSDGSPNG 99
           DD++DFLAGLT+RL  ST R     +S   F T   Q            S+ S  GSPNG
Sbjct: 59  DDEDDFLAGLTRRLALSTQR-----LSSPSFVTDKSQMKPK-----VTESTQSGLGSPNG 118

Query: 100 P-SQAPSPPVTPFGSDDNTWDLIYAAAKQVARL-KVNSSRDGVIGPSQSSSNLVSSVKDV 159
           P SQ PSPP +P   +D+   ++ AAA +VA++ K N     +  P+ + + L S  ++V
Sbjct: 119 PFSQVPSPPTSPSREEDSL-KVLSAAAGEVAKIKKANFDAKPISYPNPNPNYLTSFPQNV 178

Query: 160 GFYSHASQFGTELPIRKPESSVNWGRQVKAEKQRIHCGGGDFHHENGRIVVRPVDFSQST 219
            +Y                 +  W  +               H+   ++ + P     + 
Sbjct: 179 AYY-----------------NCYWLWEP--------------HYPQSQMGIVP-----NA 238

Query: 220 WSSMQRNHQRNPSQSCAPAVPSTFHGGGSAPKKECAGTGVFLPRRYDNNPPQPRKRADYG 279
           W                P+    F+   +A K    GTGVFLPR+Y N    P+K++  G
Sbjct: 239 W-------------HIPPSPVRAFYTLPTAVKSPSTGTGVFLPRKYSNPSDSPKKKSGDG 243

Query: 280 SIAML 283
            + ++
Sbjct: 299 CVKVV 243

BLAST of Cp4.1LG01g05120 vs. TAIR 10
Match: AT3G54000.1 (CONTAINS InterPro DOMAIN/s: Uncharacterised conserved protein UCP022260 (InterPro:IPR016802); Has 94 Blast hits to 94 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 94; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 58.5 bits (140), Expect = 1.2e-08
Identity = 91/369 (24.66%), Postives = 146/369 (39.57%), Query Frame = 0

Query: 1   MASDSTFWLPPHFLSDHHTLPGK------------PISALH------SPLQSDLLNDDDQ 60
           +  D+ FWLP  FL+D   L  K            P    H      S ++ + + +DD+
Sbjct: 7   VVDDAEFWLPTEFLTDDDFLVEKENNSVGIDDSLFPYEPRHGFGTFGSTVKPNTVKEDDE 66

Query: 61  EDFLAGLTQRLTHSTLRDRHK--LVSLHEFETGSPQSTLSGFGSWSAWSSVSSDG--SPN 120
           E FLAGLT+++  S+L+D     +   H F  G+            AW    S    +  
Sbjct: 67  ESFLAGLTRQMVMSSLKDDFSGGVCGNHAFPAGNDH---------KAWEMNRSPPCVAGT 126

Query: 121 GPSQAPSPPVTPFGSDDNTWDLIYAAAKQVARLKVNSSRDGVIG-PSQSSSNLVS-SVKD 180
           G             S  ++WDL  AA +     +   S  G++G P++ S+ + + S   
Sbjct: 127 GCCCLNQRFNQNLNSRVSSWDLYCAAERMSINDEPYHSGRGLLGSPAKLSATVKNHSNNG 186

Query: 181 VGFYSHASQFGTELPIRKPESSVNWGRQVKAEKQRIHCGGGDFHHENGRIVVRPVDFSQS 240
            G+Y++      +         +   + +K  +Q +    G     NG   V PVD S S
Sbjct: 187 TGYYNNHQSLQYQKLQAIQFQQLKQQQLMKHRRQLVRQNRG--VRVNGNKNVGPVDLSSS 246

Query: 241 TWSSMQRNHQRNPSQSCAPAVPSTFHGGGSAPKKECAGTGVFLPRRYDNNPPQPRKRADY 300
            WS+      + P +    AV    H G    K+   GTGVFLPR  ++      +    
Sbjct: 247 AWSN------QFPRRDVMRAVFIGDHTG----KRGSTGTGVFLPRSVNHTSRTETREKPT 306

Query: 301 GSIAMLPRRNVQDMNRSVPQ-MTSNRRSQEAIMAQR--NAIFAEQRLSYSRPAERGKSYE 343
            S  ++P R  Q +N ++ + + S     +    QR  N  F+ Q +   R  +     E
Sbjct: 307 ISTVLVPARLAQVLNLNLGEPVRSTATLNDVSWRQRSNNGGFSSQMVGGVRAEQ--SVQE 352

BLAST of Cp4.1LG01g05120 vs. TAIR 10
Match: AT3G54000.2 (unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 52.8 bits (125), Expect = 6.5e-07
Identity = 73/285 (25.61%), Postives = 114/285 (40.00%), Query Frame = 0

Query: 1   MASDSTFWLPPHFLSDHHTLPGK------------PISALH------SPLQSDLLNDDDQ 60
           +  D+ FWLP  FL+D   L  K            P    H      S ++ + + +DD+
Sbjct: 7   VVDDAEFWLPTEFLTDDDFLVEKENNSVGIDDSLFPYEPRHGFGTFGSTVKPNTVKEDDE 66

Query: 61  EDFLAGLTQRLTHSTLRDRHK--LVSLHEFETGSPQSTLSGFGSWSAWSSVSSDG--SPN 120
           E FLAGLT+++  S+L+D     +   H F  G+            AW    S    +  
Sbjct: 67  ESFLAGLTRQMVMSSLKDDFSGGVCGNHAFPAGNDH---------KAWEMNRSPPCVAGT 126

Query: 121 GPSQAPSPPVTPFGSDDNTWDLIYAAAKQVARLKVNSSRDGVIG-PSQSSSNLVS-SVKD 180
           G             S  ++WDL  AA +     +   S  G++G P++ S+ + + S   
Sbjct: 127 GCCCLNQRFNQNLNSRVSSWDLYCAAERMSINDEPYHSGRGLLGSPAKLSATVKNHSNNG 186

Query: 181 VGFYSHASQFGTELPIRKPESSVNWGRQVKAEKQRIHCGGGDFHHENGRIVVRPVDFSQS 240
            G+Y++      +         +   + +K  +Q +    G     NG   V PVD S S
Sbjct: 187 TGYYNNHQSLQYQKLQAIQFQQLKQQQLMKHRRQLVRQNRG--VRVNGNKNVGPVDLSSS 246

Query: 241 TWSSMQRNHQRNPSQSCAPAVPSTFHGGGSAPKKECAGTGVFLPR 262
            WS+      + P +    AV    H G    K+   GTGVFLPR
Sbjct: 247 AWSN------QFPRRDVMRAVFIGDHTG----KRGSTGTGVFLPR 270

BLAST of Cp4.1LG01g05120 vs. TAIR 10
Match: AT3G54000.3 (unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 52.8 bits (125), Expect = 6.5e-07
Identity = 73/285 (25.61%), Postives = 114/285 (40.00%), Query Frame = 0

Query: 1   MASDSTFWLPPHFLSDHHTLPGK------------PISALH------SPLQSDLLNDDDQ 60
           +  D+ FWLP  FL+D   L  K            P    H      S ++ + + +DD+
Sbjct: 7   VVDDAEFWLPTEFLTDDDFLVEKENNSVGIDDSLFPYEPRHGFGTFGSTVKPNTVKEDDE 66

Query: 61  EDFLAGLTQRLTHSTLRDRHK--LVSLHEFETGSPQSTLSGFGSWSAWSSVSSDG--SPN 120
           E FLAGLT+++  S+L+D     +   H F  G+            AW    S    +  
Sbjct: 67  ESFLAGLTRQMVMSSLKDDFSGGVCGNHAFPAGNDH---------KAWEMNRSPPCVAGT 126

Query: 121 GPSQAPSPPVTPFGSDDNTWDLIYAAAKQVARLKVNSSRDGVIG-PSQSSSNLVS-SVKD 180
           G             S  ++WDL  AA +     +   S  G++G P++ S+ + + S   
Sbjct: 127 GCCCLNQRFNQNLNSRVSSWDLYCAAERMSINDEPYHSGRGLLGSPAKLSATVKNHSNNG 186

Query: 181 VGFYSHASQFGTELPIRKPESSVNWGRQVKAEKQRIHCGGGDFHHENGRIVVRPVDFSQS 240
            G+Y++      +         +   + +K  +Q +    G     NG   V PVD S S
Sbjct: 187 TGYYNNHQSLQYQKLQAIQFQQLKQQQLMKHRRQLVRQNRG--VRVNGNKNVGPVDLSSS 246

Query: 241 TWSSMQRNHQRNPSQSCAPAVPSTFHGGGSAPKKECAGTGVFLPR 262
            WS+      + P +    AV    H G    K+   GTGVFLPR
Sbjct: 247 AWSN------QFPRRDVMRAVFIGDHTG----KRGSTGTGVFLPR 270

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_023541583.13.21e-25199.42uncharacterized protein LOC111801704 [Cucurbita pepo subsp. pepo][more]
XP_022996098.11.15e-24496.80uncharacterized protein LOC111491411 [Cucurbita maxima][more]
XP_022942517.14.69e-24496.80uncharacterized protein LOC111447530 [Cucurbita moschata][more]
KAG6600025.12.71e-24396.80hypothetical protein SDJN03_05258, partial [Cucurbita argyrosperma subsp. sorori... [more]
KAG7030694.11.36e-23993.54hypothetical protein SDJN02_04731 [Cucurbita argyrosperma subsp. argyrosperma][more]
Match NameE-valueIdentityDescription
A0A6J1K9W45.58e-24596.80uncharacterized protein LOC111491411 OS=Cucurbita maxima OX=3661 GN=LOC111491411... [more]
A0A6J1FQG82.27e-24496.80uncharacterized protein LOC111447530 OS=Cucurbita moschata OX=3662 GN=LOC1114475... [more]
A0A6J1GY571.42e-16770.03uncharacterized protein LOC111458270 OS=Cucurbita moschata OX=3662 GN=LOC1114582... [more]
A0A6J1IIE52.01e-16770.03uncharacterized protein LOC111477184 OS=Cucurbita maxima OX=3661 GN=LOC111477184... [more]
A0A5A7UBI22.68e-15364.13WAS/WASL-interacting protein family member 2, putative isoform 1 OS=Cucumis melo... [more]
Match NameE-valueIdentityDescription
AT2G39870.11.1e-1432.00unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT3G55690.12.4e-0927.35unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT3G54000.11.2e-0824.66CONTAINS InterPro DOMAIN/s: Uncharacterised conserved protein UCP022260 (InterPr... [more]
AT3G54000.26.5e-0725.61unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae ... [more]
AT3G54000.36.5e-0725.61unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae ... [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 222..253
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 222..237
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 90..112
NoneNo IPR availablePANTHERPTHR33356:SF17H/ACA RIBONUCLEOPROTEIN COMPLEX NON-CORE SUBUNIT NAF1-LIKEcoord: 2..342
NoneNo IPR availablePANTHERPTHR33356TIP41-LIKE PROTEINcoord: 2..342

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG01g05120.1Cp4.1LG01g05120.1mRNA