Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
AATTTGGGAGGGAAAAAAATCTGGACATGGAACATCGCTTCTCCAGCAGCGAGACCATATTTAGAACTGTAAAGCGAAAGCCCTAGCACTCGCTCAGACATGGCCATTTCGCCAATGGCGTTGAGTTCGTCTTTTATCCACCACTCTCCTCTCCATTCCAATAAAACTCACCGACTGTCTGCTATGCCCTCAATTCATACTCCCAAAATTGTTCTGAAAAACTCAAGGCGTTGCTTTTCTACAATCTCTTGCTCTGGTAGGAGGCCAATCCCGACGACGGAGGAAGAGGTTCTTCAAGCTGTGTTGGAATCCGATGAGAAGATTCTTCCTTGCGTTCGGACGTACGAGAATGATTTGTCTCGGCTTACTCTAGTCGGAGGCGTCGATTTCCGGCAGTCTGTTACGGCAGCTGCGGCTGACGGCGGTGAAGCGGCCTCTGAGCACCTTGATTCTGGTATGTCTGCAATGGTTGTGGAGACTGTATTTCCGGGAACTTCAGACGAGCACAGCACGGTCTCGACCCGGCTGGTGTGTTATTTACTAACCTGGCTGCGTAATTTGGTGCTCTATATGCGAATTACAACTATGAATACTCATTGGCATCACTATAGTTTCCATATATTGGAATGAATTCGAGTCTTGACGTGAATGCGTTGCCCCGAACGTTTGCACCTGGCACGTTAGTTTCAGGAGTCGCCACTTTTACTTGCATTTAAAAAAAAAAAACTTATTTTGATTTCCACCATCGAAACAATTAACTCCGTAATTGATGTATGTGATTGATATGGTAGCGTTCTCCATCTCTAGTTCTTTGCTATCTTTAATAGCTCCATGATTAATATGTCCATTTCAAGTATTGTTCGATGTTCGCTTACAATTATGTCCACTGTAGTATAATGTTCTCATATATGAAAATGTGAGTCCTGTCGTATAACACAACAACTTAACTATAATCTGTCACATGGTATCTGTTCCTCCCTCTTGCTATAATATCGTCAATTTGTTAGATAAAAATTATACCTCCGATTGTACTTGTCGTTCTTAAGTTAACTGTTTTAACAGTTTTTACCTGCCAGAGAAGTCAAGGAAAAGGCTAGGAATCTCAAAAAGTCTCTCGCTCAAGATTTTAATTTGAGTACCTCGTCCAAAAATATACTTGCTATGACATTTAGACAAGTAGTATTGCAACAGCTCTGGAGCTTTGAACTGGTTATCTTTATACCTGGATCTGAAAGGAATATGGAAGATCTTGAAAATCCAAGAGAGGTTTGTCATTTCGCCTTATTTTCCTCTATTTTTCGTGTTTTTTGATTTTTTGTTTGTTTGAAAAATTGAAACATCACAGTAGCATGTCTATGCTGCAGGTTTCGCATGATACTGATAGGTTCAGTTCTCTTATGCATTTCATATCGGCAGGTTCCAATGTCTTTCACTCTCAGTTCATCCGAGGAACGAGCCATCTCTGTACTTGCAGAAGCTGTTTGCATGTGCGCTCTTCGAAATACAGAAGGAAAATTTGTCAATGGTACAGCAAGTGGAACTTCAACTAGATTTTTTGATTGGTTCCGGAAGTCCACAATTGTTGCATCAAAGGATTCTTCAGTTATTATCTACAAGTTATTGGACAACGAGGTAGCTGATGCCAAAAGTTTATTACAAAAGTTTAATTCAAATAAGGAGAGCTGGAAACGTAGGAATTTCCAGTCGAAGAACTATTGGTGGATACCTTCTGAACTCTCTGAACTAGAAAAAATCGGTGGGGCTGAATTCAGTGCATGGGCTAGCGAGTATGTACCTTCTTACAGGCTACAAATTGATGCTCGTCAGTTCAATGGTTTAAAATTTGGAGGCTGGAGAGAATCTGCTGAGAATAGGTGGGAAGTCCTTTTGACCCACTCCCAAATGGTATTTTGATCTTTCCTTCTCAGCCGTTATCCTTCTGTATTATCTTAATGTGTCCAGTGATGGTATTTTCACTATTTAGCCTAGTTCAAGAGCTA
CCTTCTTGGAGTGCACCTTTCGCCGAGGGCAATAGTTATGTTATCATAAATATGTGTTAGACACACTCTTATTTCTCTACTTACTTATCGATCCAAACACAAGATATGGACACGAGAGTACATCTTTATAGATTCAGGCATGGTGAGTACTGAACAAATATTATCTTTATATATTTTGCACATTAACAATCTCAGATTTAATAATTTGTGTGGTGTATAGAATTAAGGATACAAACTTGTATTCACATCTATTCAATTTCAGGAGAAATTAAATATTAATGTATATAACTTCAAGAATGTAATATTTTTGAACTACGTCTTTAAATTTTATCCAGTAAAATTTAACATTATCATTATTATTTTAGTTTCTGCATGTTCTTGTGCTTTTGTTCATGCATATCTCAGTTCCTTGATTTAAATATTCTTCCATGTTAGTGTTGCATGTCTTCAATATGTTCGTATTGAATCCAAGATATATTACATAGGAATCAATCTAACACTTTAAGGTACGTTAGACATGCAGACACTGTTTATAACAGTGTTCATACTTCTTATTCTAAGACTATCTGACCATTCGTTATGCAGGTAGGATTGGCGAACATATTAGACATTTTCTATGAAGATGTATATTCTCTGCCCGATAAACAACTGCAATGTGGTGCAATCGTGCATTCTGCTAACTTGTTAAACAAAAAGGTATCTGCTTGTATGTACACACACATATGTAATTATAAATGTATGTGCATTGTGCTTACTTGTATGTATGTATTTGATTGTTTATTGGTTTCTTTGCAAACCATAAGGAAAGTTGTTTTCTTTTTTCTTCTGGGAACAAAAGTTGTTTGCTATTGAAGTATATGCTCCACATGTTGCAGAGAAACTATTCCTCCTGGGGGTTTCTATCCAAGACTTTAGCTGGTGGAATTTTTTTTGTTACTATTATTGCTGTTGGTCAATATTTTTTGCCTCGCGTTCATGTGTCTGGGAGGTATAATGTAGAACAGCCGGTTTCATCACTTTATGGAGTTAGCTCTGTGAAAAATCAGGCTACAGAAGCAGAAAAGGTATTCACCAGACTTCCCCTTTATTTTCCATTTTCTGACATGATCCATATTTTTATTGAGATGTAAAGTTTAATATATGAACCTTTTATTGACAGTTGGAAGAATACTGCATCTCAGTTGTAAATATTATAAAAGATGCCTTCGGTTGGCATGGTGATGTACACACAGATAAGAGAGTTGGTGCGTGGATTGGGGAAGCTCCTGATTACTTGAGGGTGGTTGAATCTGATACAGGTAGTGAAGATACTCCATCTGGTACGATAGAACAAGATAATGTTGATGGAGTGAAAGCTTCTGCTCAGGACATAGCTAGTTATCAGGTGATTTGAATTTTTATTTGTTTTTTCTTCAGTCTCTTAATTGGGACAATTCAGGGAATGACTTGCATAATTGCTTATTGCATTGTTTCACGTTTTGTTAAAAAGTGTGCAAAGACCTGAGCACTGTCTGCTCAAATGCAAGTAGTGAAACTTAAGCATGCCGATAAAACTTTCTTTCATCTCCTGGTAAGCGGTGTGGTAAACTGAATTTGGAAATATATTATTTCTGCAACGGGGTGTTTCTTATCCTCTGAAAAACGAATCGTTAAGGTCAAATTAGATTTTCTCTGAAGTTATATAAACTTGGCTGAGAGATTAAGGATCCTAGAGAAATTGAAACAAAGGAGAAGGAACTCCCACACTTCCTACTTCAAATAAGAACACAAGAAGACTAGATGACTTTGGGTTTCATCTGCATTTCTACTCAGCAGACTTCAGCACTATTAAGGTATTCATGAGAAACCAATTAACAGAAATATTTCACCTTTTTGAATTTTGATCCTTCTAACTTTCCATATTAATAGTGCTGAAATTTAGCTAACATAGATTTGGTATCCACGATCCTTGAAAACAGGGTCTTCACAGAGAGTTCACCTAATGATTCAAGTGTTCACCT
CCTTTTATCTTCACCATTTGTAGGGCCGAATCCGTTCAGCAACTACAAGAAATTGCGGAAGTCACGAAGTTTCACTAAATTCAAACTTCTTCGGGATAATATTTTCCAGGACTTGGTTTCCTCTTCCAAGCCTCGCTGAATCCCCTGTGAGGTTGTGACAGCTAAAGCAAGAAATTTCTGCATCTGTGCTTGCTCAACCTGCCATCCGCCCTTCCAAAGCCGGTTTATTACCATATTCTAGTCTCATGTGATATCATACCCTGGGATTTAAAATATTTACGTCAAGGACTGTAGCAGCCTCCCCTTTTTAGTTCTCCTGTAAACTACCCTTATCTCTCTTTTCCATCAGAAACACAATGATTCTATGTCAAAGTTCATTAGGTTAGCCGTGAATTTTTCATATCACTTTGCCATCAGCGCTAGGTTTCTCTGTCAAATTACCAGTACAGAAACCACACTGGATCAAAGGGACAAAAATAGTGTTCAAGCTGACCAGACAGCCAACTATTCCAATCATCTCATGACCCTATACAATCTCTCGTTCTATACAAATCGTACATTCTTCTTCCAAAATGGTGCCAACTGCAACCCCCTTTTATGGGACACATTCCTTCTATTTTCCAGTTTTCTCTTTCTGCCAGCTTCCAAGACTGAGTTTTTCAGCCATGCTTATCCTGTCAAAATAGAATTGTGTGATTTGCAAGGTGTGATGGACATTATCTTTATCAATCCTAGAGCCTTCTTTTTAACCCGCCTAGCCATATGATAGAGCATTTTACTTTGCATGTTTCTCAGTAGGATTAATGAGGGAAAAGTGTACCGCTTGGCTTAAACTTCTCCAGACTTCAAGGTAAAAAATATGAATCTAGACTTTCCGTTAAGAAGGATCAACAATCTTGGGTCTAAAATCTAAATGCATCTCTATCTTCAGTTTAACTGATTGGAACGAAAGATCTCTCTATAAAAGGGACTTTCATTCTACGGGAGTTGTAAGCTTTCTCCAGATCAAATTTGAAAATCCTACCACTTTCCTTCCTCATCCTTTCCACAACTACCTCATTATCTATCTCAATGGATTCCATTATTTGTGTTCCTCCAGTCAGTGCCATCTAGCTTGCAGAAACCGTAGATCAATTTTTTTAAAAAAATACATGCAACAAGCCCTTATCTTCTTTCATCAACTTAATTGGTCTGAAGTTTAAACCTTAATGGTCTTTTTATGATAATAACTTATTTCTTTTTGACATGCCAAATTATATTTTCTCTTAGGCAAGTATAAAGTGTCCATTCCTTAAATATTTGTGAAACTGCTACACAAAATTTTGGTTTGAAGATGTTTCAAGCTTTGATAAGGCTTTCAGCACCTTTGCCTGCAATATTGGTTAGTCTTAAATAGAATACTTCTCGAAGTCCAAGACTCTGCATAAGACCTGCTCAATATCAAAAGTTCTCTTACTAGAGTAAATTGACTATTAAGTACCTTTTGTTTACTTATTGGTTATATATGATGTGCTTGCTCACCTGCTCGTCAAAACTTAGGTGGTCTTGTCGACTGAAGGAAAGATTGTAGGATTCCAACCAACCAGTCGGGTTGCTGTTAACTATTGGGCTGCAAATCCTTTAGCAAAGCAATTATATGGTGGGAGGAACTTGTCCCCGGGTAAGCTAAATTCTTCTTGCGTGTAAATTCTCTCTCTAATAAATTGTCCTCAACTTGTGCATTTATCTTTCTGTGTATCAAGCAGGCCTTTTTGAATCTGGGTTAAGGATCAGACGCCCGAATGAGGTGATTGTGATAGAGTTGCTTATGTCGGTGAAAACTGATGCTTACTTTGCTTTGGCAAGGCCCACATATTAGTCAATACCCCGAGTTCGCAGGCAGCAGTATTTGATTTGCAGAAACGCCATTCTCAACGTAAAATTTGGATATTCAGATACTAATGCCGCTCCTGGATAATCGTTACAGGCTTCCAGCCCCTCCTAAAGAAGTTGCCATCCCA
ATCTCGCTCAGATCTCCTCAATGTCGTCAATATAGATAGTCATTCAGATAGCGTAGGAAATTTTGTATTAATGTTGTCCAGATTAGTTTTCCTGTTTAATAAATACAGATGCTACAAGCAACTGTATGTATTACTGTGACACAACTCATGTAAGACAGACAATTGTGTCCAGAGAAGCATGAGACATCTGCTAAAGATAGATTTAGCCTATTTGCAAATGAAACCTACAAATACCCTTTCCCTTGTTCTTCAACATTTTTTAACATTCAGCCCACTTGCATTAAGAGAAGGATGCATCAGGAAATGATGCATTAAATCTACATTTCTGATAATGGTTAACAAGGTTGTAACCTTCGAGCCATATACAAATTTCAATATCTTGAGAGATGACTGAAAAGATTACTCAAGATGTGTATGTTGACAATTGACATCAAATAAAATGTAAACTAGTATCATCCGCATACATAACATAGTGTAGGCGGATTACATTTACTCGAAATCCCGAGGACCTAAACCATTATTGGAGTTTGTATCACCAGCGGAAACAAATATGTATATCACTACTAGTCTTCTGCGGAGTTACAGTGACTAAGAATATTCCAGTTGCAAATTGTAACCCAAAATATCAGGGAAACAACCTGCCTTACCGTTTCCAGAAGTTACAAGCTCCATCATGCGGGTGTATCTGTTGCAGCTACCATGACATGAGTATCAGATGGTATACCACGAGACAAGAAAAAAAGAAAAAAAAAAGTTATTTATGTTCTTCGAGGCCAAAAAGAAAAACCGAATTCTGGAAGAATAAACCGTAAACTCGATTCCATCATATTCCGATTGAAACCATCAGCCTGCAATTCATTCAACAGCTGAGCTTCATAACTATGTAAGGTAATATTCGCTCAAATTCTCCACCAGGTAATCATATTGTGCATGAAGCATATAGTCATATGTTCCATTGGGTGGCATGACTCTATTTGCTGAGGGGTAAACATTTCCAATACCAACTTTTGTGTTAATTCTGGTCCTCCCCTTCACAAGGCGTTTATTATCAGGAAATTCAGCAGTCAGTCTCCCCATTTGCCTCTGTAAATCAGTAACTGAATCGTTGGTTACCTTTGCCTTGGAACCTTTTTTCCCATCGACAAGTTTCAGCTCCAATCGATACAAAGCACAGGCGTCGTACTTATTAATTTCATATTCATAAAGATGCCACTCAAAGTGGTTCAGTGGCTGAGGATCCATATAATAAGATCTCTTCAGCCCTCCAAACTCATTCATCACTTGCTTTCTTGATTCATGCCAACCACGTCCACTGTAATCAGGCAAAGACCTCTGCTTTCTATTCCCACTCTCAAATGCTCTACGAGGCTTATCAAAGAATAGCCACTCCCTAATCATCTCACCCTCCAGTATTGACAGATCAAAAAGCTC
mRNA sequence
AATTTGGGAGGGAAAAAAATCTGGACATGGAACATCGCTTCTCCAGCAGCGAGACCATATTTAGAACTGTAAAGCGAAAGCCCTAGCACTCGCTCAGACATGGCCATTTCGCCAATGGCGTTGAGTTCGTCTTTTATCCACCACTCTCCTCTCCATTCCAATAAAACTCACCGACTGTCTGCTATGCCCTCAATTCATACTCCCAAAATTGTTCTGAAAAACTCAAGGCGTTGCTTTTCTACAATCTCTTGCTCTGGTAGGAGGCCAATCCCGACGACGGAGGAAGAGGTTCTTCAAGCTGTGTTGGAATCCGATGAGAAGATTCTTCCTTGCGTTCGGACGTACGAGAATGATTTGTCTCGGCTTACTCTAGTCGGAGGCGTCGATTTCCGGCAGTCTGTTACGGCAGCTGCGGCTGACGGCGGTGAAGCGGCCTCTGAGCACCTTGATTCTGGTATGTCTGCAATGGTTGTGGAGACTGTATTTCCGGGAACTTCAGACGAGCACAGCACGGTCTCGACCCGGCTGTTTTTACCTGCCAGAGAAGTCAAGGAAAAGGCTAGGAATCTCAAAAAGTCTCTCGCTCAAGATTTTAATTTGAGTACCTCGTCCAAAAATATACTTGCTATGACATTTAGACAAGTAGTATTGCAACAGCTCTGGAGCTTTGAACTGGTTATCTTTATACCTGGATCTGAAAGGAATATGGAAGATCTTGAAAATCCAAGAGAGGTTCCAATGTCTTTCACTCTCAGTTCATCCGAGGAACGAGCCATCTCTGTACTTGCAGAAGCTGTTTGCATGTGCGCTCTTCGAAATACAGAAGGAAAATTTGTCAATGGTACAGCAAGTGGAACTTCAACTAGATTTTTTGATTGGTTCCGGAAGTCCACAATTGTTGCATCAAAGGATTCTTCAGTTATTATCTACAAGTTATTGGACAACGAGGTAGCTGATGCCAAAAGTTTATTACAAAAGTTTAATTCAAATAAGGAGAGCTGGAAACGTAGGAATTTCCAGTCGAAGAACTATTGGTGGATACCTTCTGAACTCTCTGAACTAGAAAAAATCGGTGGGGCTGAATTCAGTGCATGGGCTAGCGAGTATGTACCTTCTTACAGGCTACAAATTGATGCTCGTCAGTTCAATGGTTTAAAATTTGGAGGCTGGAGAGAATCTGCTGAGAATAGGTGGGAAGTCCTTTTGACCCACTCCCAAATGGTAGGATTGGCGAACATATTAGACATTTTCTATGAAGATGTATATTCTCTGCCCGATAAACAACTGCAATGTGGTGCAATCGTGCATTCTGCTAACTTGTTAAACAAAAAGAGAAACTATTCCTCCTGGGGGTTTCTATCCAAGACTTTAGCTGGTGGAATTTTTTTTGTTACTATTATTGCTGTTGGTCAATATTTTTTGCCTCGCGTTCATGTGTCTGGGAGGTATAATGTAGAACAGCCGGTTTCATCACTTTATGGAGTTAGCTCTGTGAAAAATCAGGCTACAGAAGCAGAAAAGTTGGAAGAATACTGCATCTCAGTTGTAAATATTATAAAAGATGCCTTCGGTTGGCATGGTGATGTACACACAGATAAGAGAGTTGGTGCGTGGATTGGGGAAGCTCCTGATTACTTGAGGGTGGTTGAATCTGATACAGGTAGTGAAGATACTCCATCTGGTACGATAGAACAAGATAATGTTGATGGAGTGAAAGCTTCTGCTCAGGACATAGCTAGTTATCAGGTGGTCTTGTCGACTGAAGGAAAGATTGTAGGATTCCAACCAACCAGTCGGGTTGCTGTTAACTATTGGGCTGCAAATCCTTTAGCAAAGCAATTATATGGTGGGAGGAACTTGTCCCCGGGCCTTTTTGAATCTGGGTTAAGGATCAGACGCCCGAATGAGGTGATTGTGATAGAGTTGCTTATGTCGGTGAAAACTGATGCTTACTTTGCTTTGGCAAGGCCCACATATTAGTCAATACCCCGAGTTCGCAG
GCAGCAGTATTTGATTTGCAGAAACGCCATTCTCAACGTAAAATTTGGATATTCAGATACTAATGCCGCTCCTGGATAATCGTTACAGGCTTCCAGCCCCTCCTAAAGAAGTTGCCATCCCAATCTCGCTCAGATCTCCTCAATGTCGTCAATATAGATAGTCATTCAGATAGCGTAGGAAATTTTGTATTAATGTTGTCCAGATTAGTTTTCCTGTTTAATAAATACAGATGCTACAAGCAACTGTATGTATTACTGTGACACAACTCATGTAAGACAGACAATTGTGTCCAGAGAAGCATGAGACATCTGCTAAAGATAGATTTAGCCTATTTGCAAATGAAACCTACAAATACCCTTTCCCTTGTTCTTCAACATTTTTTAACATTCAGCCCACTTGCATTAAGAGAAGGATGCATCAGGAAATGATGCATTAAATCTACATTTCTGATAATGGTTAACAAGGTTGTAACCTTCGAGCCATATACAAATTTCAATATCTTGAGAGATGACTGAAAAGATTACTCAAGATGTGTATGTTGACAATTGACATCAAATAAAATGTAAACTAGTATCATCCGCATACATAACATAGTGTAGGCGGATTACATTTACTCGAAATCCCGAGGACCTAAACCATTATTGGAGTTTGTATCACCAGCGGAAACAAATATGTATATCACTACTAGTCTTCTGCGGAGTTACAGTGACTAAGAATATTCCAGTTGCAAATTGTAACCCAAAATATCAGGGAAACAACCTGCCTTACCGTTTCCAGAAGTTACAAGCTCCATCATGCGGGTGTATCTGTTGCAGCTACCATGACATGAGTATCAGATGGTATACCACGAGACAAGAAAAAAAGAAAAAAAAAAGTTATTTATGTTCTTCGAGGCCAAAAAGAAAAACCGAATTCTGGAAGAATAAACCGTAAACTCGATTCCATCATATTCCGATTGAAACCATCAGCCTGCAATTCATTCAACAGCTGAGCTTCATAACTATGTAAGGTAATATTCGCTCAAATTCTCCACCAGGTAATCATATTGTGCATGAAGCATATAGTCATATGTTCCATTGGGTGGCATGACTCTATTTGCTGAGGGGTAAACATTTCCAATACCAACTTTTGTGTTAATTCTGGTCCTCCCCTTCACAAGGCGTTTATTATCAGGAAATTCAGCAGTCAGTCTCCCCATTTGCCTCTGTAAATCAGTAACTGAATCGTTGGTTACCTTTGCCTTGGAACCTTTTTTCCCATCGACAAGTTTCAGCTCCAATCGATACAAAGCACAGGCGTCGTACTTATTAATTTCATATTCATAAAGATGCCACTCAAAGTGGTTCAGTGGCTGAGGATCCATATAATAAGATCTCTTCAGCCCTCCAAACTCATTCATCACTTGCTTTCTTGATTCATGCCAACCACGTCCACTGTAATCAGGCAAAGACCTCTGCTTTCTATTCCCACTCTCAAATGCTCTACGAGGCTTATCAAAGAATAGCCACTCCCTAATCATCTCACCCTCCAGTATTGACAGATCAAAAAGCTC
Coding sequence (CDS)
ATGGCCATTTCGCCAATGGCGTTGAGTTCGTCTTTTATCCACCACTCTCCTCTCCATTCCAATAAAACTCACCGACTGTCTGCTATGCCCTCAATTCATACTCCCAAAATTGTTCTGAAAAACTCAAGGCGTTGCTTTTCTACAATCTCTTGCTCTGGTAGGAGGCCAATCCCGACGACGGAGGAAGAGGTTCTTCAAGCTGTGTTGGAATCCGATGAGAAGATTCTTCCTTGCGTTCGGACGTACGAGAATGATTTGTCTCGGCTTACTCTAGTCGGAGGCGTCGATTTCCGGCAGTCTGTTACGGCAGCTGCGGCTGACGGCGGTGAAGCGGCCTCTGAGCACCTTGATTCTGGTATGTCTGCAATGGTTGTGGAGACTGTATTTCCGGGAACTTCAGACGAGCACAGCACGGTCTCGACCCGGCTGTTTTTACCTGCCAGAGAAGTCAAGGAAAAGGCTAGGAATCTCAAAAAGTCTCTCGCTCAAGATTTTAATTTGAGTACCTCGTCCAAAAATATACTTGCTATGACATTTAGACAAGTAGTATTGCAACAGCTCTGGAGCTTTGAACTGGTTATCTTTATACCTGGATCTGAAAGGAATATGGAAGATCTTGAAAATCCAAGAGAGGTTCCAATGTCTTTCACTCTCAGTTCATCCGAGGAACGAGCCATCTCTGTACTTGCAGAAGCTGTTTGCATGTGCGCTCTTCGAAATACAGAAGGAAAATTTGTCAATGGTACAGCAAGTGGAACTTCAACTAGATTTTTTGATTGGTTCCGGAAGTCCACAATTGTTGCATCAAAGGATTCTTCAGTTATTATCTACAAGTTATTGGACAACGAGGTAGCTGATGCCAAAAGTTTATTACAAAAGTTTAATTCAAATAAGGAGAGCTGGAAACGTAGGAATTTCCAGTCGAAGAACTATTGGTGGATACCTTCTGAACTCTCTGAACTAGAAAAAATCGGTGGGGCTGAATTCAGTGCATGGGCTAGCGAGTATGTACCTTCTTACAGGCTACAAATTGATGCTCGTCAGTTCAATGGTTTAAAATTTGGAGGCTGGAGAGAATCTGCTGAGAATAGGTGGGAAGTCCTTTTGACCCACTCCCAAATGGTAGGATTGGCGAACATATTAGACATTTTCTATGAAGATGTATATTCTCTGCCCGATAAACAACTGCAATGTGGTGCAATCGTGCATTCTGCTAACTTGTTAAACAAAAAGAGAAACTATTCCTCCTGGGGGTTTCTATCCAAGACTTTAGCTGGTGGAATTTTTTTTGTTACTATTATTGCTGTTGGTCAATATTTTTTGCCTCGCGTTCATGTGTCTGGGAGGTATAATGTAGAACAGCCGGTTTCATCACTTTATGGAGTTAGCTCTGTGAAAAATCAGGCTACAGAAGCAGAAAAGTTGGAAGAATACTGCATCTCAGTTGTAAATATTATAAAAGATGCCTTCGGTTGGCATGGTGATGTACACACAGATAAGAGAGTTGGTGCGTGGATTGGGGAAGCTCCTGATTACTTGAGGGTGGTTGAATCTGATACAGGTAGTGAAGATACTCCATCTGGTACGATAGAACAAGATAATGTTGATGGAGTGAAAGCTTCTGCTCAGGACATAGCTAGTTATCAGGTGGTCTTGTCGACTGAAGGAAAGATTGTAGGATTCCAACCAACCAGTCGGGTTGCTGTTAACTATTGGGCTGCAAATCCTTTAGCAAAGCAATTATATGGTGGGAGGAACTTGTCCCCGGGCCTTTTTGAATCTGGGTTAAGGATCAGACGCCCGAATGAGGTGATTGTGATAGAGTTGCTTATGTCGGTGAAAACTGATGCTTACTTTGCTTTGGCAAGGCCCACATATTAG
Protein sequence
MAISPMALSSSFIHHSPLHSNKTHRLSAMPSIHTPKIVLKNSRRCFSTISCSGRRPIPTTEEEVLQAVLESDEKILPCVRTYENDLSRLTLVGGVDFRQSVTAAAADGGEAASEHLDSGMSAMVVETVFPGTSDEHSTVSTRLFLPAREVKEKARNLKKSLAQDFNLSTSSKNILAMTFRQVVLQQLWSFELVIFIPGSERNMEDLENPREVPMSFTLSSSEERAISVLAEAVCMCALRNTEGKFVNGTASGTSTRFFDWFRKSTIVASKDSSVIIYKLLDNEVADAKSLLQKFNSNKESWKRRNFQSKNYWWIPSELSELEKIGGAEFSAWASEYVPSYRLQIDARQFNGLKFGGWRESAENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKQLQCGAIVHSANLLNKKRNYSSWGFLSKTLAGGIFFVTIIAVGQYFLPRVHVSGRYNVEQPVSSLYGVSSVKNQATEAEKLEEYCISVVNIIKDAFGWHGDVHTDKRVGAWIGEAPDYLRVVESDTGSEDTPSGTIEQDNVDGVKASAQDIASYQVVLSTEGKIVGFQPTSRVAVNYWAANPLAKQLYGGRNLSPGLFESGLRIRRPNEVIVIELLMSVKTDAYFALARPTY
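The protein sequence above corresponds to the coding sequence read three bases at a time. The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and are not part of the database. It is shown here on the first 21 bases of the CDS only.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# listed in the conventional TCAG base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unrecognised codon
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the CDS listed above.
print(translate("ATGGCCATTTCGCCAATGGCG"))  # prints MAISPMA
```

Applied to the full CDS, the same function would be expected to reproduce the polypeptide listed under "Protein sequence", with the terminal TAG read as the stop codon.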
Homology
BLAST of Cp4.1LG10g05150.1 vs. NCBI nr
Match:
XP_023543337.1 (uncharacterized protein LOC111803246 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1236 bits (3197), Expect = 0.0
Identity = 626/626 (100.00%), Positives = 626/626 (100.00%), Query Frame = 0
Query: 1 MAISPMALSSSFIHHSPLHSNKTHRLSAMPSIHTPKIVLKNSRRCFSTISCSGRRPIPTT 60
MAISPMALSSSFIHHSPLHSNKTHRLSAMPSIHTPKIVLKNSRRCFSTISCSGRRPIPTT
Sbjct: 1 MAISPMALSSSFIHHSPLHSNKTHRLSAMPSIHTPKIVLKNSRRCFSTISCSGRRPIPTT 60
Query: 61 EEEVLQAVLESDEKILPCVRTYENDLSRLTLVGGVDFRQSVTAAAADGGEAASEHLDSGM 120
EEEVLQAVLESDEKILPCVRTYENDLSRLTLVGGVDFRQSVTAAAADGGEAASEHLDSGM
Sbjct: 61 EEEVLQAVLESDEKILPCVRTYENDLSRLTLVGGVDFRQSVTAAAADGGEAASEHLDSGM 120
Query: 121 SAMVVETVFPGTSDEHSTVSTRLFLPAREVKEKARNLKKSLAQDFNLSTSSKNILAMTFR 180
SAMVVETVFPGTSDEHSTVSTRLFLPAREVKEKARNLKKSLAQDFNLSTSSKNILAMTFR
Sbjct: 121 SAMVVETVFPGTSDEHSTVSTRLFLPAREVKEKARNLKKSLAQDFNLSTSSKNILAMTFR 180
Query: 181 QVVLQQLWSFELVIFIPGSERNMEDLENPREVPMSFTLSSSEERAISVLAEAVCMCALRN 240
QVVLQQLWSFELVIFIPGSERNMEDLENPREVPMSFTLSSSEERAISVLAEAVCMCALRN
Sbjct: 181 QVVLQQLWSFELVIFIPGSERNMEDLENPREVPMSFTLSSSEERAISVLAEAVCMCALRN 240
Query: 241 TEGKFVNGTASGTSTRFFDWFRKSTIVASKDSSVIIYKLLDNEVADAKSLLQKFNSNKES 300
TEGKFVNGTASGTSTRFFDWFRKSTIVASKDSSVIIYKLLDNEVADAKSLLQKFNSNKES
Sbjct: 241 TEGKFVNGTASGTSTRFFDWFRKSTIVASKDSSVIIYKLLDNEVADAKSLLQKFNSNKES 300
Query: 301 WKRRNFQSKNYWWIPSELSELEKIGGAEFSAWASEYVPSYRLQIDARQFNGLKFGGWRES 360
WKRRNFQSKNYWWIPSELSELEKIGGAEFSAWASEYVPSYRLQIDARQFNGLKFGGWRES
Sbjct: 301 WKRRNFQSKNYWWIPSELSELEKIGGAEFSAWASEYVPSYRLQIDARQFNGLKFGGWRES 360
Query: 361 AENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKQLQCGAIVHSANLLNKKRNYSSWGFL 420
AENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKQLQCGAIVHSANLLNKKRNYSSWGFL
Sbjct: 361 AENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKQLQCGAIVHSANLLNKKRNYSSWGFL 420
Query: 421 SKTLAGGIFFVTIIAVGQYFLPRVHVSGRYNVEQPVSSLYGVSSVKNQATEAEKLEEYCI 480
SKTLAGGIFFVTIIAVGQYFLPRVHVSGRYNVEQPVSSLYGVSSVKNQATEAEKLEEYCI
Sbjct: 421 SKTLAGGIFFVTIIAVGQYFLPRVHVSGRYNVEQPVSSLYGVSSVKNQATEAEKLEEYCI 480
Query: 481 SVVNIIKDAFGWHGDVHTDKRVGAWIGEAPDYLRVVESDTGSEDTPSGTIEQDNVDGVKA 540
SVVNIIKDAFGWHGDVHTDKRVGAWIGEAPDYLRVVESDTGSEDTPSGTIEQDNVDGVKA
Sbjct: 481 SVVNIIKDAFGWHGDVHTDKRVGAWIGEAPDYLRVVESDTGSEDTPSGTIEQDNVDGVKA 540
Query: 541 SAQDIASYQVVLSTEGKIVGFQPTSRVAVNYWAANPLAKQLYGGRNLSPGLFESGLRIRR 600
SAQDIASYQVVLSTEGKIVGFQPTSRVAVNYWAANPLAKQLYGGRNLSPGLFESGLRIRR
Sbjct: 541 SAQDIASYQVVLSTEGKIVGFQPTSRVAVNYWAANPLAKQLYGGRNLSPGLFESGLRIRR 600
Query: 601 PNEVIVIELLMSVKTDAYFALARPTY 626
PNEVIVIELLMSVKTDAYFALARPTY
Sbjct: 601 PNEVIVIELLMSVKTDAYFALARPTY 626
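The identity figure in each hit is the number of aligned positions with the same residue in Query and Sbjct, divided by the alignment length. The sketch below shows that calculation on two already-aligned segments; the function name `percent_identity` is hypothetical, and gap characters (`-`) are assumed to count toward the alignment length but never as matches. The second pair is the final alignment block of the Benincasa hispida hit listed further down.

```python
def percent_identity(query: str, subject: str) -> tuple[int, int, float]:
    """Return (identical positions, alignment length, percent identity)."""
    if len(query) != len(subject):
        raise ValueError("aligned sequences must have the same length")
    # A gap aligned to anything is never an identity.
    matches = sum(q == s and q != "-" for q, s in zip(query, subject))
    return matches, len(query), round(100 * matches / len(query), 2)


# Final alignment block (positions 601-626) of the first hit: identical.
print(percent_identity("PNEVIVIELLMSVKTDAYFALARPTY",
                       "PNEVIVIELLMSVKTDAYFALARPTY"))  # (26, 26, 100.0)

# Same query block against the Benincasa hispida subject: two substitutions.
print(percent_identity("PNEVIVIELLMSVKTDAYFALARPTY",
                       "PNEVIVIELLMSVKTDAFFALARPAY"))  # (24, 26, 92.31)
```

The "Positives" count additionally includes substitutions that score above zero in the scoring matrix (marked `+` in the middle line of each block), so it cannot be derived from the two sequences alone.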
BLAST of Cp4.1LG10g05150.1 vs. NCBI nr
Match:
XP_022925753.1 (uncharacterized protein LOC111433068 [Cucurbita moschata] >KAG7034621.1 hypothetical protein SDJN02_04351 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 1214 bits (3140), Expect = 0.0
Identity = 611/626 (97.60%), Positives = 619/626 (98.88%), Query Frame = 0
Query: 1 MAISPMALSSSFIHHSPLHSNKTHRLSAMPSIHTPKIVLKNSRRCFSTISCSGRRPIPTT 60
MA+SPMALSSSFIHHSP HSN+THRLSA+PSI TPKIVLKNSR CFSTISCSGRRPIPTT
Sbjct: 1 MAMSPMALSSSFIHHSPFHSNRTHRLSAVPSIRTPKIVLKNSRHCFSTISCSGRRPIPTT 60
Query: 61 EEEVLQAVLESDEKILPCVRTYENDLSRLTLVGGVDFRQSVTAAAADGGEAASEHLDSGM 120
EEEVLQAVLESDEKILPCVRTYENDLSRLTLVGGVDFRQS+TAAAADGGEAASEHLDSGM
Sbjct: 61 EEEVLQAVLESDEKILPCVRTYENDLSRLTLVGGVDFRQSITAAAADGGEAASEHLDSGM 120
Query: 121 SAMVVETVFPGTSDEHSTVSTRLFLPAREVKEKARNLKKSLAQDFNLSTSSKNILAMTFR 180
SAMVVETVFPGTSDEHSTVSTRLFLPAREVKEKARNLKKSLAQDFNLSTSSKNILAMTFR
Sbjct: 121 SAMVVETVFPGTSDEHSTVSTRLFLPAREVKEKARNLKKSLAQDFNLSTSSKNILAMTFR 180
Query: 181 QVVLQQLWSFELVIFIPGSERNMEDLENPREVPMSFTLSSSEERAISVLAEAVCMCALRN 240
QVVLQQLWSFELVIFIPGSERNMEDLENPREVPMSFTLSSSEER ISVLAEAVC+CALRN
Sbjct: 181 QVVLQQLWSFELVIFIPGSERNMEDLENPREVPMSFTLSSSEERVISVLAEAVCLCALRN 240
Query: 241 TEGKFVNGTASGTSTRFFDWFRKSTIVASKDSSVIIYKLLDNEVADAKSLLQKFNSNKES 300
TEGKFVN TASGTSTRFFDWFRKSTIVASKDSSVIIYKLLDNEVADAKSLLQKFNSNKES
Sbjct: 241 TEGKFVNSTASGTSTRFFDWFRKSTIVASKDSSVIIYKLLDNEVADAKSLLQKFNSNKES 300
Query: 301 WKRRNFQSKNYWWIPSELSELEKIGGAEFSAWASEYVPSYRLQIDARQFNGLKFGGWRES 360
WKRRNFQSKNYWW+PSELSELEK GGAEFSAWASEYVPSYRLQIDARQFNGLKFGGWRES
Sbjct: 301 WKRRNFQSKNYWWMPSELSELEKFGGAEFSAWASEYVPSYRLQIDARQFNGLKFGGWRES 360
Query: 361 AENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKQLQCGAIVHSANLLNKKRNYSSWGFL 420
AENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKQLQCGAIVHSANL+NKKRNYSSWGFL
Sbjct: 361 AENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKQLQCGAIVHSANLINKKRNYSSWGFL 420
Query: 421 SKTLAGGIFFVTIIAVGQYFLPRVHVSGRYNVEQPVSSLYGVSSVKNQATEAEKLEEYCI 480
SKTLAGGIFFVTI+AVGQYFLPRVHVSGRYNVEQPVSSLYGVSSVKNQA EAEKLEEYCI
Sbjct: 421 SKTLAGGIFFVTIVAVGQYFLPRVHVSGRYNVEQPVSSLYGVSSVKNQAIEAEKLEEYCI 480
Query: 481 SVVNIIKDAFGWHGDVHTDKRVGAWIGEAPDYLRVVESDTGSEDTPSGTIEQDNVDGVKA 540
SVVNIIKDAFGWHGDVHTDKRVGAWIGEAPDYLRVVESDTGSEDTPSGTIEQDNVDGVKA
Sbjct: 481 SVVNIIKDAFGWHGDVHTDKRVGAWIGEAPDYLRVVESDTGSEDTPSGTIEQDNVDGVKA 540
Query: 541 SAQDIASYQVVLSTEGKIVGFQPTSRVAVNYWAANPLAKQLYGGRNLSPGLFESGLRIRR 600
SAQDIASYQVVLSTEGKIVGFQPTSRVAVNYWAANPLAKQLYGGRNLSPGLFESGLRIRR
Sbjct: 541 SAQDIASYQVVLSTEGKIVGFQPTSRVAVNYWAANPLAKQLYGGRNLSPGLFESGLRIRR 600
Query: 601 PNEVIVIELLMSVKTDAYFALARPTY 626
PNEVIVIELLMSVKTDAYFALARPTY
Sbjct: 601 PNEVIVIELLMSVKTDAYFALARPTY 626
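For comparing hits programmatically, the summary line of each HSP can be read with a regular expression. This is a rough sketch that assumes the line layout shown on this page; `parse_identity_line` is a hypothetical helper, and the pattern deliberately accepts both the spelling "Positives" and the variant "Postives" that some exports of this report contain.

```python
import re

# Pattern for lines such as:
#   Identity = 611/626 (97.60%), Positives = 619/626 (98.88%), Query Frame = 0
IDENTITY_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_identity_line(line: str) -> dict:
    """Extract identity and positive counts from one HSP summary line."""
    m = IDENTITY_LINE.search(line)
    if m is None:
        raise ValueError("not an HSP identity line")
    return {
        "identities": int(m.group(1)),
        "length": int(m.group(2)),
        "identity_pct": float(m.group(3)),
        "positives": int(m.group(4)),
        "positive_pct": float(m.group(6)),
    }


stats = parse_identity_line(
    "Identity = 611/626 (97.60%), Positives = 619/626 (98.88%), Query Frame = 0"
)
print(stats["identities"], stats["length"], stats["identity_pct"])  # 611 626 97.6
```

The reported percentages can be cross-checked against the counts, for example `round(100 * 611 / 626, 2)` gives 97.6.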
BLAST of Cp4.1LG10g05150.1 vs. NCBI nr
Match:
KAG6581337.1 (hypothetical protein SDJN03_21339, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 1210 bits (3131), Expect = 0.0
Identity = 610/626 (97.44%), Positives = 617/626 (98.56%), Query Frame = 0
Query: 1 MAISPMALSSSFIHHSPLHSNKTHRLSAMPSIHTPKIVLKNSRRCFSTISCSGRRPIPTT 60
MA+SPMALSSSFIHHSP HSN+THRLSA+PSI TPKIVLKNSR CFSTISCSGRRPIPTT
Sbjct: 1 MAMSPMALSSSFIHHSPFHSNRTHRLSAVPSIRTPKIVLKNSRHCFSTISCSGRRPIPTT 60
Query: 61 EEEVLQAVLESDEKILPCVRTYENDLSRLTLVGGVDFRQSVTAAAADGGEAASEHLDSGM 120
EEEVLQAVLESDEKILPCVRTYENDLSRLTLVGGVDFRQS+TAAAADGGEAASEHLDSGM
Sbjct: 61 EEEVLQAVLESDEKILPCVRTYENDLSRLTLVGGVDFRQSITAAAADGGEAASEHLDSGM 120
Query: 121 SAMVVETVFPGTSDEHSTVSTRLFLPAREVKEKARNLKKSLAQDFNLSTSSKNILAMTFR 180
SAMVVETVFPGTSDEHSTVSTRLFLPAREVKEKARNLKKSLAQDFNLSTSSKNILAMTFR
Sbjct: 121 SAMVVETVFPGTSDEHSTVSTRLFLPAREVKEKARNLKKSLAQDFNLSTSSKNILAMTFR 180
Query: 181 QVVLQQLWSFELVIFIPGSERNMEDLENPREVPMSFTLSSSEERAISVLAEAVCMCALRN 240
QVVLQQLWSFELVIFIPGSERNMEDLENPREVPMSFTLSSSEER ISVLAEAVC+CALRN
Sbjct: 181 QVVLQQLWSFELVIFIPGSERNMEDLENPREVPMSFTLSSSEERVISVLAEAVCLCALRN 240
Query: 241 TEGKFVNGTASGTSTRFFDWFRKSTIVASKDSSVIIYKLLDNEVADAKSLLQKFNSNKES 300
TEGKFVN TASGTSTRFFDWFRKSTIVASKDSSVIIYKLLDNEVADAKSLLQKFNSNKES
Sbjct: 241 TEGKFVNSTASGTSTRFFDWFRKSTIVASKDSSVIIYKLLDNEVADAKSLLQKFNSNKES 300
Query: 301 WKRRNFQSKNYWWIPSELSELEKIGGAEFSAWASEYVPSYRLQIDARQFNGLKFGGWRES 360
WKRRNFQSKNYWW+PSEL ELEK GGAEFSAWASEYVPSYRLQIDARQFNGLKFGGWRES
Sbjct: 301 WKRRNFQSKNYWWMPSELPELEKFGGAEFSAWASEYVPSYRLQIDARQFNGLKFGGWRES 360
Query: 361 AENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKQLQCGAIVHSANLLNKKRNYSSWGFL 420
AENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKQLQCGAIVHSANLLNKKRNYSSWGFL
Sbjct: 361 AENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKQLQCGAIVHSANLLNKKRNYSSWGFL 420
Query: 421 SKTLAGGIFFVTIIAVGQYFLPRVHVSGRYNVEQPVSSLYGVSSVKNQATEAEKLEEYCI 480
SKTLAGGIFFVTI+AVGQYFLPRVHVSGRYNVEQPVSSLYGVSSVKNQA EAEKLEEYCI
Sbjct: 421 SKTLAGGIFFVTIVAVGQYFLPRVHVSGRYNVEQPVSSLYGVSSVKNQAIEAEKLEEYCI 480
Query: 481 SVVNIIKDAFGWHGDVHTDKRVGAWIGEAPDYLRVVESDTGSEDTPSGTIEQDNVDGVKA 540
SVVNIIKDA GWHGDVHTDKRVGAWIGEAPDYLRVVESDTGSEDTPSGTIEQDNVDGVKA
Sbjct: 481 SVVNIIKDALGWHGDVHTDKRVGAWIGEAPDYLRVVESDTGSEDTPSGTIEQDNVDGVKA 540
Query: 541 SAQDIASYQVVLSTEGKIVGFQPTSRVAVNYWAANPLAKQLYGGRNLSPGLFESGLRIRR 600
SAQDIASYQVVLSTEGKIVGFQPTSRVAVNYWAANPLAKQLYGGRNLSPGLFESGLRIRR
Sbjct: 541 SAQDIASYQVVLSTEGKIVGFQPTSRVAVNYWAANPLAKQLYGGRNLSPGLFESGLRIRR 600
Query: 601 PNEVIVIELLMSVKTDAYFALARPTY 626
PNEVIVIELLMSVKTDAYFALARPTY
Sbjct: 601 PNEVIVIELLMSVKTDAYFALARPTY 626
BLAST of Cp4.1LG10g05150.1 vs. NCBI nr
Match:
XP_022978666.1 (uncharacterized protein LOC111478564 isoform X1 [Cucurbita maxima])
HSP 1 Score: 1196 bits (3095), Expect = 0.0
Identity = 607/626 (96.96%), Positives = 617/626 (98.56%), Query Frame = 0
Query: 1 MAISPMALSSSFIHHSPLHSNKTHRLSAMPSIHTPKIVLKNSRRCFSTISCSGRRPIPTT 60
MAISPMALSSSFIH+SPLHSN+THRLSA+PSIHTPKIVLKNSR CFSTISCSGR+PIPTT
Sbjct: 1 MAISPMALSSSFIHYSPLHSNRTHRLSAVPSIHTPKIVLKNSRHCFSTISCSGRKPIPTT 60
Query: 61 EEEVLQAVLESDEKILPCVRTYENDLSRLTLVGGVDFRQSVTAAAADGGEAASEHLDSGM 120
EEEVLQAVLESDEKILPCVRTYENDLSRLTLVGGVDFRQSVTAAAADGGEAASEHLDSGM
Sbjct: 61 EEEVLQAVLESDEKILPCVRTYENDLSRLTLVGGVDFRQSVTAAAADGGEAASEHLDSGM 120
Query: 121 SAMVVETVFPGTSDEHSTVSTRLFLPAREVKEKARNLKKSLAQDFNLSTSSKNILAMTFR 180
SAMVVETVFPGTSDE STVSTRLFLPAREV+EKARNLKKSLAQDFNLSTSSKNILAMTFR
Sbjct: 121 SAMVVETVFPGTSDEQSTVSTRLFLPAREVEEKARNLKKSLAQDFNLSTSSKNILAMTFR 180
Query: 181 QVVLQQLWSFELVIFIPGSERNMEDLENPREVPMSFTLSSSEERAISVLAEAVCMCALRN 240
QVVLQQLWSFELVIFIPGSERNMEDLENPREVPMSFTLSSSEER ISVLAEAVC+CALRN
Sbjct: 181 QVVLQQLWSFELVIFIPGSERNMEDLENPREVPMSFTLSSSEERVISVLAEAVCLCALRN 240
Query: 241 TEGKFVNGTASGTSTRFFDWFRKSTIVASKDSSVIIYKLLDNEVADAKSLLQKFNSNKES 300
TEGKFVNGTASGTSTRFFDWFRKSTIVASKDSSVIIYKLLDNEVADAKSLLQKFNSNK+S
Sbjct: 241 TEGKFVNGTASGTSTRFFDWFRKSTIVASKDSSVIIYKLLDNEVADAKSLLQKFNSNKKS 300
Query: 301 WKRRNFQSKNYWWIPSELSELEKIGGAEFSAWASEYVPSYRLQIDARQFNGLKFGGWRES 360
WKRRNFQSKNYWW+PSELSELEKIGGAEFSAWASEYVPSYRLQIDARQFNGLKFGGWRES
Sbjct: 301 WKRRNFQSKNYWWMPSELSELEKIGGAEFSAWASEYVPSYRLQIDARQFNGLKFGGWRES 360
Query: 361 AENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKQLQCGAIVHSANLLNKKRNYSSWGFL 420
AENRWEVLLTHSQMVGLANILDIFYEDVYSLPDK LQCGAIVHSANLLNKKRNYSSWGFL
Sbjct: 361 AENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKLLQCGAIVHSANLLNKKRNYSSWGFL 420
Query: 421 SKTLAGGIFFVTIIAVGQYFLPRVHVSGRYNVEQPVSSLYGVSSVKNQATEAEKLEEYCI 480
SKTLAGGIFFVTI+AVGQYFLPRVHVSGRY VEQPVSSLYGVSSVKNQA EAEKLEEYCI
Sbjct: 421 SKTLAGGIFFVTIVAVGQYFLPRVHVSGRYTVEQPVSSLYGVSSVKNQAIEAEKLEEYCI 480
Query: 481 SVVNIIKDAFGWHGDVHTDKRVGAWIGEAPDYLRVVESDTGSEDTPSGTIEQDNVDGVKA 540
SVVNIIKDA GWHGDVHTDKRVGAWIGEAPDYLRVVESDTGSEDT SGTIEQDNV GVKA
Sbjct: 481 SVVNIIKDAVGWHGDVHTDKRVGAWIGEAPDYLRVVESDTGSEDTSSGTIEQDNV-GVKA 540
Query: 541 SAQDIASYQVVLSTEGKIVGFQPTSRVAVNYWAANPLAKQLYGGRNLSPGLFESGLRIRR 600
SAQDIASYQVVLSTEGKIVGFQPTSRVAVNYWAANPLAKQLYGGRNLSPGLFESGLRIRR
Sbjct: 541 SAQDIASYQVVLSTEGKIVGFQPTSRVAVNYWAANPLAKQLYGGRNLSPGLFESGLRIRR 600
Query: 601 PNEVIVIELLMSVKTDAYFALARPTY 626
PNEVIVIELL+SVKTDAYFALARPTY
Sbjct: 601 PNEVIVIELLLSVKTDAYFALARPTY 625
BLAST of Cp4.1LG10g05150.1 vs. NCBI nr
Match:
XP_038882562.1 (uncharacterized protein LOC120073791 isoform X2 [Benincasa hispida])
HSP 1 Score: 1104 bits (2856), Expect = 0.0
Identity = 557/626 (88.98%), Positives = 586/626 (93.61%), Query Frame = 0
Query: 1 MAISPMALSSSFIHHSPLHSNKTHRLSAMPSIHTPKIVLKNSRRCFSTISCSGRRPIPTT 60
M ISPMALS SFI +SPLH+ HRL +PSI+TPK+VLKNSR CFSTI+CS RPIPTT
Sbjct: 1 MPISPMALSLSFIQYSPLHAISAHRLFPIPSIYTPKVVLKNSRHCFSTITCSAGRPIPTT 60
Query: 61 EEEVLQAVLESDEKILPCVRTYENDLSRLTLVGGVDFRQSVTAAAADGGEAASEHLDSGM 120
EEEVLQAVLESDEKILPCVRTYENDLSRLTLVGGVDFRQSVTAAAADGGEAASEHLDSGM
Sbjct: 61 EEEVLQAVLESDEKILPCVRTYENDLSRLTLVGGVDFRQSVTAAAADGGEAASEHLDSGM 120
Query: 121 SAMVVETVFPGTSDEHSTVSTRLFLPAREVKEKARNLKKSLAQDFNLSTSSKNILAMTFR 180
SAMVVETVFPG SDEHSTVSTRLFLPAREVKEKAR LKKSLAQDF+ STSSKNILAMTFR
Sbjct: 121 SAMVVETVFPGNSDEHSTVSTRLFLPAREVKEKARKLKKSLAQDFHSSTSSKNILAMTFR 180
Query: 181 QVVLQQLWSFELVIFIPGSERNMEDLENPREVPMSFTLSSSEERAISVLAEAVCMCALRN 240
QVVLQQLW+FELV+FIPGSERNMEDLENPREVP+SFTLSSSEERAISVLAE VCMCAL+N
Sbjct: 181 QVVLQQLWNFELVVFIPGSERNMEDLENPREVPISFTLSSSEERAISVLAETVCMCALQN 240
Query: 241 TEGKFVNGTASGTSTRFFDWFRKSTIVASKDSSVIIYKLLDNEVADAKSLLQKFNSNKES 300
TEGKFVNGT+SGTSTRFFDWFRKSTIVASKDSSVIIYKL DNEVADAKSLLQKFNSNKES
Sbjct: 241 TEGKFVNGTSSGTSTRFFDWFRKSTIVASKDSSVIIYKLFDNEVADAKSLLQKFNSNKES 300
Query: 301 WKRRNFQSKNYWWIPSELSELEKIGGAEFSAWASEYVPSYRLQIDARQFNGLKFGGWRES 360
WKRRNF+S NYWW+PSEL++LEKIGGAEF AW SEYVPSYRLQIDA QFNGLKFGGWRES
Sbjct: 301 WKRRNFKSMNYWWMPSELTKLEKIGGAEFCAWVSEYVPSYRLQIDAYQFNGLKFGGWRES 360
Query: 361 AENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKQLQCGAIVHSANLLNKKRNYSSWGFL 420
AENRWEVLLTHSQMVGLANILDIFYEDVYSLPDK LQCGAIVHSA+LL+KKRNYSSWG L
Sbjct: 361 AENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKLLQCGAIVHSASLLSKKRNYSSWGLL 420
Query: 421 SKTLAGGIFFVTIIAVGQYFLPRVHVSGRYNVEQPVSSLYGVSSVKNQATEAEKLEEYCI 480
SKTLAGG+F V I AVGQ F+ RVHV GR +VE+P++SLYGVSSVK+QA EA KLEEYC
Sbjct: 421 SKTLAGGVFLVAIGAVGQRFMSRVHVPGRCSVERPITSLYGVSSVKDQAIEAAKLEEYCT 480
Query: 481 SVVNIIKDAFGWHGDVHTDKRVGAWIGEAPDYLRVVESDTGSEDTPSGTIEQDNVDGVKA 540
SVV IIKDAFGWHGDVHTDKRVGAWIGEAPDYL VVESD GSED PSGT EQ++ DGVKA
Sbjct: 481 SVVKIIKDAFGWHGDVHTDKRVGAWIGEAPDYLMVVESDIGSEDAPSGTTEQESTDGVKA 540
Query: 541 SAQDIASYQVVLSTEGKIVGFQPTSRVAVNYWAANPLAKQLYGGRNLSPGLFESGLRIRR 600
SAQDIASYQVVLSTEGKIVGFQPTSRVAVNYWAANPLAKQLYGGRNLSPGL E+GLRIRR
Sbjct: 541 SAQDIASYQVVLSTEGKIVGFQPTSRVAVNYWAANPLAKQLYGGRNLSPGLIETGLRIRR 600
Query: 601 PNEVIVIELLMSVKTDAYFALARPTY 626
PNEVIVIELLMSVKTDA+FALARP Y
Sbjct: 601 PNEVIVIELLMSVKTDAFFALARPAY 626
BLAST of Cp4.1LG10g05150.1 vs. ExPASy TrEMBL
Match:
A0A6J1EG51 (uncharacterized protein LOC111433068 OS=Cucurbita moschata OX=3662 GN=LOC111433068 PE=4 SV=1)
HSP 1 Score: 1214 bits (3140), Expect = 0.0
Identity = 611/626 (97.60%), Positives = 619/626 (98.88%), Query Frame = 0
Query: 1 MAISPMALSSSFIHHSPLHSNKTHRLSAMPSIHTPKIVLKNSRRCFSTISCSGRRPIPTT 60
MA+SPMALSSSFIHHSP HSN+THRLSA+PSI TPKIVLKNSR CFSTISCSGRRPIPTT
Sbjct: 1 MAMSPMALSSSFIHHSPFHSNRTHRLSAVPSIRTPKIVLKNSRHCFSTISCSGRRPIPTT 60
Query: 61 EEEVLQAVLESDEKILPCVRTYENDLSRLTLVGGVDFRQSVTAAAADGGEAASEHLDSGM 120
EEEVLQAVLESDEKILPCVRTYENDLSRLTLVGGVDFRQS+TAAAADGGEAASEHLDSGM
Sbjct: 61 EEEVLQAVLESDEKILPCVRTYENDLSRLTLVGGVDFRQSITAAAADGGEAASEHLDSGM 120
Query: 121 SAMVVETVFPGTSDEHSTVSTRLFLPAREVKEKARNLKKSLAQDFNLSTSSKNILAMTFR 180
SAMVVETVFPGTSDEHSTVSTRLFLPAREVKEKARNLKKSLAQDFNLSTSSKNILAMTFR
Sbjct: 121 SAMVVETVFPGTSDEHSTVSTRLFLPAREVKEKARNLKKSLAQDFNLSTSSKNILAMTFR 180
Query: 181 QVVLQQLWSFELVIFIPGSERNMEDLENPREVPMSFTLSSSEERAISVLAEAVCMCALRN 240
QVVLQQLWSFELVIFIPGSERNMEDLENPREVPMSFTLSSSEER ISVLAEAVC+CALRN
Sbjct: 181 QVVLQQLWSFELVIFIPGSERNMEDLENPREVPMSFTLSSSEERVISVLAEAVCLCALRN 240
Query: 241 TEGKFVNGTASGTSTRFFDWFRKSTIVASKDSSVIIYKLLDNEVADAKSLLQKFNSNKES 300
TEGKFVN TASGTSTRFFDWFRKSTIVASKDSSVIIYKLLDNEVADAKSLLQKFNSNKES
Sbjct: 241 TEGKFVNSTASGTSTRFFDWFRKSTIVASKDSSVIIYKLLDNEVADAKSLLQKFNSNKES 300
Query: 301 WKRRNFQSKNYWWIPSELSELEKIGGAEFSAWASEYVPSYRLQIDARQFNGLKFGGWRES 360
WKRRNFQSKNYWW+PSELSELEK GGAEFSAWASEYVPSYRLQIDARQFNGLKFGGWRES
Sbjct: 301 WKRRNFQSKNYWWMPSELSELEKFGGAEFSAWASEYVPSYRLQIDARQFNGLKFGGWRES 360
Query: 361 AENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKQLQCGAIVHSANLLNKKRNYSSWGFL 420
AENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKQLQCGAIVHSANL+NKKRNYSSWGFL
Sbjct: 361 AENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKQLQCGAIVHSANLINKKRNYSSWGFL 420
Query: 421 SKTLAGGIFFVTIIAVGQYFLPRVHVSGRYNVEQPVSSLYGVSSVKNQATEAEKLEEYCI 480
SKTLAGGIFFVTI+AVGQYFLPRVHVSGRYNVEQPVSSLYGVSSVKNQA EAEKLEEYCI
Sbjct: 421 SKTLAGGIFFVTIVAVGQYFLPRVHVSGRYNVEQPVSSLYGVSSVKNQAIEAEKLEEYCI 480
Query: 481 SVVNIIKDAFGWHGDVHTDKRVGAWIGEAPDYLRVVESDTGSEDTPSGTIEQDNVDGVKA 540
SVVNIIKDAFGWHGDVHTDKRVGAWIGEAPDYLRVVESDTGSEDTPSGTIEQDNVDGVKA
Sbjct: 481 SVVNIIKDAFGWHGDVHTDKRVGAWIGEAPDYLRVVESDTGSEDTPSGTIEQDNVDGVKA 540
Query: 541 SAQDIASYQVVLSTEGKIVGFQPTSRVAVNYWAANPLAKQLYGGRNLSPGLFESGLRIRR 600
SAQDIASYQVVLSTEGKIVGFQPTSRVAVNYWAANPLAKQLYGGRNLSPGLFESGLRIRR
Sbjct: 541 SAQDIASYQVVLSTEGKIVGFQPTSRVAVNYWAANPLAKQLYGGRNLSPGLFESGLRIRR 600
Query: 601 PNEVIVIELLMSVKTDAYFALARPTY 626
PNEVIVIELLMSVKTDAYFALARPTY
Sbjct: 601 PNEVIVIELLMSVKTDAYFALARPTY 626
BLAST of Cp4.1LG10g05150.1 vs. ExPASy TrEMBL
Match:
A0A6J1INI1 (uncharacterized protein LOC111478564 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111478564 PE=4 SV=1)
HSP 1 Score: 1196 bits (3095), Expect = 0.0
Identity = 607/626 (96.96%), Positives = 617/626 (98.56%), Query Frame = 0
Query: 1 MAISPMALSSSFIHHSPLHSNKTHRLSAMPSIHTPKIVLKNSRRCFSTISCSGRRPIPTT 60
MAISPMALSSSFIH+SPLHSN+THRLSA+PSIHTPKIVLKNSR CFSTISCSGR+PIPTT
Sbjct: 1 MAISPMALSSSFIHYSPLHSNRTHRLSAVPSIHTPKIVLKNSRHCFSTISCSGRKPIPTT 60
Query: 61 EEEVLQAVLESDEKILPCVRTYENDLSRLTLVGGVDFRQSVTAAAADGGEAASEHLDSGM 120
EEEVLQAVLESDEKILPCVRTYENDLSRLTLVGGVDFRQSVTAAAADGGEAASEHLDSGM
Sbjct: 61 EEEVLQAVLESDEKILPCVRTYENDLSRLTLVGGVDFRQSVTAAAADGGEAASEHLDSGM 120
Query: 121 SAMVVETVFPGTSDEHSTVSTRLFLPAREVKEKARNLKKSLAQDFNLSTSSKNILAMTFR 180
SAMVVETVFPGTSDE STVSTRLFLPAREV+EKARNLKKSLAQDFNLSTSSKNILAMTFR
Sbjct: 121 SAMVVETVFPGTSDEQSTVSTRLFLPAREVEEKARNLKKSLAQDFNLSTSSKNILAMTFR 180
Query: 181 QVVLQQLWSFELVIFIPGSERNMEDLENPREVPMSFTLSSSEERAISVLAEAVCMCALRN 240
QVVLQQLWSFELVIFIPGSERNMEDLENPREVPMSFTLSSSEER ISVLAEAVC+CALRN
Sbjct: 181 QVVLQQLWSFELVIFIPGSERNMEDLENPREVPMSFTLSSSEERVISVLAEAVCLCALRN 240
Query: 241 TEGKFVNGTASGTSTRFFDWFRKSTIVASKDSSVIIYKLLDNEVADAKSLLQKFNSNKES 300
TEGKFVNGTASGTSTRFFDWFRKSTIVASKDSSVIIYKLLDNEVADAKSLLQKFNSNK+S
Sbjct: 241 TEGKFVNGTASGTSTRFFDWFRKSTIVASKDSSVIIYKLLDNEVADAKSLLQKFNSNKKS 300
Query: 301 WKRRNFQSKNYWWIPSELSELEKIGGAEFSAWASEYVPSYRLQIDARQFNGLKFGGWRES 360
WKRRNFQSKNYWW+PSELSELEKIGGAEFSAWASEYVPSYRLQIDARQFNGLKFGGWRES
Sbjct: 301 WKRRNFQSKNYWWMPSELSELEKIGGAEFSAWASEYVPSYRLQIDARQFNGLKFGGWRES 360
Query: 361 AENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKQLQCGAIVHSANLLNKKRNYSSWGFL 420
AENRWEVLLTHSQMVGLANILDIFYEDVYSLPDK LQCGAIVHSANLLNKKRNYSSWGFL
Sbjct: 361 AENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKLLQCGAIVHSANLLNKKRNYSSWGFL 420
Query: 421 SKTLAGGIFFVTIIAVGQYFLPRVHVSGRYNVEQPVSSLYGVSSVKNQATEAEKLEEYCI 480
SKTLAGGIFFVTI+AVGQYFLPRVHVSGRY VEQPVSSLYGVSSVKNQA EAEKLEEYCI
Sbjct: 421 SKTLAGGIFFVTIVAVGQYFLPRVHVSGRYTVEQPVSSLYGVSSVKNQAIEAEKLEEYCI 480
Query: 481 SVVNIIKDAFGWHGDVHTDKRVGAWIGEAPDYLRVVESDTGSEDTPSGTIEQDNVDGVKA 540
SVVNIIKDA GWHGDVHTDKRVGAWIGEAPDYLRVVESDTGSEDT SGTIEQDNV GVKA
Sbjct: 481 SVVNIIKDAVGWHGDVHTDKRVGAWIGEAPDYLRVVESDTGSEDTSSGTIEQDNV-GVKA 540
Query: 541 SAQDIASYQVVLSTEGKIVGFQPTSRVAVNYWAANPLAKQLYGGRNLSPGLFESGLRIRR 600
SAQDIASYQVVLSTEGKIVGFQPTSRVAVNYWAANPLAKQLYGGRNLSPGLFESGLRIRR
Sbjct: 541 SAQDIASYQVVLSTEGKIVGFQPTSRVAVNYWAANPLAKQLYGGRNLSPGLFESGLRIRR 600
Query: 601 PNEVIVIELLMSVKTDAYFALARPTY 626
PNEVIVIELL+SVKTDAYFALARPTY
Sbjct: 601 PNEVIVIELLLSVKTDAYFALARPTY 625
BLAST of Cp4.1LG10g05150.1 vs. ExPASy TrEMBL
Match:
A0A1S3AZK4 (uncharacterized protein LOC103484484 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103484484 PE=4 SV=1)
HSP 1 Score: 1047 bits (2708), Expect = 0.0
Identity = 524/626 (83.71%), Positives = 569/626 (90.89%), Query Frame = 0
Query: 1 MAISPMALSSSFIHHSPLHSNKTHRLSAMPSIHTPKIVLKNSRRCFSTISCSGRRPIPTT 60
MAISPMALS SFIH+SP+ + RL A+P I+TPKIVLKNSR CFST SCS RPIPTT
Sbjct: 1 MAISPMALSLSFIHYSPIQAIPARRLFAIPLIYTPKIVLKNSRHCFSTFSCSAGRPIPTT 60
Query: 61 EEEVLQAVLESDEKILPCVRTYENDLSRLTLVGGVDFRQSVTAAAADGGEAASEHLDSGM 120
EEEVLQAVLESDEKILPCVRTYENDLSRL+LVGGVDFRQSVTAAAADGGE A+EHLDSGM
Sbjct: 61 EEEVLQAVLESDEKILPCVRTYENDLSRLSLVGGVDFRQSVTAAAADGGETATEHLDSGM 120
Query: 121 SAMVVETVFPGTSDEHSTVSTRLFLPAREVKEKARNLKKSLAQDFNLSTSSKNILAMTFR 180
AMVVETVFPG SDEHSTVSTRLFLPAREVKEKA L+KSLAQDF+ STSSKNILAMTFR
Sbjct: 121 PAMVVETVFPGISDEHSTVSTRLFLPAREVKEKATKLRKSLAQDFHSSTSSKNILAMTFR 180
Query: 181 QVVLQQLWSFELVIFIPGSERNMEDLENPREVPMSFTLSSSEERAISVLAEAVCMCALRN 240
QVVLQQLW+FELV+F PGSERNMEDLENPREVP+SFTLSSSEERAISVLAE VCMCAL+N
Sbjct: 181 QVVLQQLWNFELVVFTPGSERNMEDLENPREVPISFTLSSSEERAISVLAETVCMCALQN 240
Query: 241 TEGKFVNGTASGTSTRFFDWFRKSTIVASKDSSVIIYKLLDNEVADAKSLLQKFNSNKES 300
TEGKFVNGT+SGTSTR F WFRKSTIVAS+DSSV+I+KL DNEVAD KSLLQKFNSNKES
Sbjct: 241 TEGKFVNGTSSGTSTRLFGWFRKSTIVASEDSSVVIHKLFDNEVADPKSLLQKFNSNKES 300
Query: 301 WKRRNFQSKNYWWIPSELSELEKIGGAEFSAWASEYVPSYRLQIDARQFNGLKFGGWRES 360
WK RNF+S NYWW+PSEL++LEK GG+EF AW SE+VP+YRLQIDA QFN +K GGWRE
Sbjct: 301 WKHRNFKSMNYWWMPSELTKLEKFGGSEFCAWVSEHVPAYRLQIDAHQFNDIKLGGWREF 360
Query: 361 AENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKQLQCGAIVHSANLLNKKRNYSSWGFL 420
ENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKQLQCGA V SANLL+KKRNYSSWG L
Sbjct: 361 VENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKQLQCGANVLSANLLSKKRNYSSWGLL 420
Query: 421 SKTLAGGIFFVTIIAVGQYFLPRVHVSGRYNVEQPVSSLYGVSSVKNQATEAEKLEEYCI 480
SKTLAGG+FFV I A+GQ F+ RV + GRY+VEQP++SL G+SSVKNQA EA KLE+YCI
Sbjct: 421 SKTLAGGVFFVAIGAIGQRFMSRVRLPGRYSVEQPITSLDGLSSVKNQAMEAAKLEDYCI 480
Query: 481 SVVNIIKDAFGWHGDVHTDKRVGAWIGEAPDYLRVVESDTGSEDTPSGTIEQDNVDGVKA 540
SVV IIKDAFGWHGDVH DKRVGAWIGEAPDYL VVESD GSED PSG I ++N+D VKA
Sbjct: 481 SVVKIIKDAFGWHGDVHMDKRVGAWIGEAPDYLTVVESDIGSEDAPSGMIGEENIDEVKA 540
Query: 541 SAQDIASYQVVLSTEGKIVGFQPTSRVAVNYWAANPLAKQLYGGRNLSPGLFESGLRIRR 600
SAQDIASYQVVL+TEGKIVGFQPTSRVAVNYWAANPLAKQLYGG+NLSPGL E+GLRI+R
Sbjct: 541 SAQDIASYQVVLTTEGKIVGFQPTSRVAVNYWAANPLAKQLYGGKNLSPGLLETGLRIKR 600
Query: 601 PNEVIVIELLMSVKTDAYFALARPTY 626
PN+V+VIELLMSVKTD +FALARP Y
Sbjct: 601 PNDVVVIELLMSVKTDTFFALARPVY 626
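The Identity and Positives figures in each HSP header can be re-derived from the alignment rows themselves. The following is a minimal sketch, not the code used to generate this page: the helper names `hsp_stats` and `percent` are hypothetical, and it assumes the usual BLAST convention that a letter on the middle line marks an identical residue and a `+` marks a conservative substitution. Identities are counted from the Query and Sbjct rows rather than from the middle line, because leading blanks on the middle line may be lost when the page is copied as text.

```python
def hsp_stats(query: str, midline: str, sbjct: str) -> tuple[int, int, int]:
    """Return (identities, positives, alignment_length) for one alignment row.

    Assumes BLAST conventions: '-' is a gap, and '+' on the middle line
    marks a conservative (positive-scoring) substitution.
    """
    if len(query) != len(sbjct):
        raise ValueError("Query and Sbjct rows must have the same length")
    # An identity is a column where both rows carry the same residue (not a gap).
    identities = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    # Positives are identities plus the conservative substitutions.
    positives = identities + midline.count("+")
    return identities, positives, len(query)


def percent(count: int, length: int) -> str:
    """Format count/length as a percentage with two decimals, as in the HSP header."""
    return f"{100 * count / length:.2f}"


# Last row (positions 601-626) of the alignment against A0A1S3AZK4.
stats = hsp_stats(
    "PNEVIVIELLMSVKTDAYFALARPTY",
    "PN+V+VIELLMSVKTD +FALARP Y",
    "PNDVVVIELLMSVKTDTFFALARPVY",
)
print(stats)              # (21, 24, 26)

# Totals reported in the HSP header for the same hit.
print(percent(524, 626))  # 83.71
print(percent(569, 626))  # 90.89
```

Summing the per-row counts over all rows of an HSP should give the totals in its header, provided the rows are copied without losing characters.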
BLAST of Cp4.1LG10g05150.1 vs. ExPASy TrEMBL
Match:
A0A1S3AZL6 (uncharacterized protein LOC103484484 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103484484 PE=4 SV=1)
HSP 1 Score: 1043 bits (2696), Expect = 0.0
Identity = 524/627 (83.57%), Positives = 569/627 (90.75%), Query Frame = 0
Query: 1 MAISPMALSSSFIHHSPLHSNKTHRLSAMPSIHTPKIVLKNSRRCFSTISCSGRRPIPTT 60
MAISPMALS SFIH+SP+ + RL A+P I+TPKIVLKNSR CFST SCS RPIPTT
Sbjct: 1 MAISPMALSLSFIHYSPIQAIPARRLFAIPLIYTPKIVLKNSRHCFSTFSCSAGRPIPTT 60
Query: 61 EEEVLQAVLESDEKILPCVRTYENDLSRLTLVGGVDFRQSVTAAAADGGEAASEHLDSGM 120
EEEVLQAVLESDEKILPCVRTYENDLSRL+LVGGVDFRQSVTAAAADGGE A+EHLDSGM
Sbjct: 61 EEEVLQAVLESDEKILPCVRTYENDLSRLSLVGGVDFRQSVTAAAADGGETATEHLDSGM 120
Query: 121 SAMVVETVFPGTSDEHSTVSTRLFLPAREVKEKARNLKKSLAQDFNLSTSSKNILAMTFR 180
AMVVETVFPG SDEHSTVSTRLFLPAREVKEKA L+KSLAQDF+ STSSKNILAMTFR
Sbjct: 121 PAMVVETVFPGISDEHSTVSTRLFLPAREVKEKATKLRKSLAQDFHSSTSSKNILAMTFR 180
Query: 181 QVVLQQLWSFELVIFIPGSERNMEDLENPRE-VPMSFTLSSSEERAISVLAEAVCMCALR 240
QVVLQQLW+FELV+F PGSERNMEDLENPRE VP+SFTLSSSEERAISVLAE VCMCAL+
Sbjct: 181 QVVLQQLWNFELVVFTPGSERNMEDLENPREQVPISFTLSSSEERAISVLAETVCMCALQ 240
Query: 241 NTEGKFVNGTASGTSTRFFDWFRKSTIVASKDSSVIIYKLLDNEVADAKSLLQKFNSNKE 300
NTEGKFVNGT+SGTSTR F WFRKSTIVAS+DSSV+I+KL DNEVAD KSLLQKFNSNKE
Sbjct: 241 NTEGKFVNGTSSGTSTRLFGWFRKSTIVASEDSSVVIHKLFDNEVADPKSLLQKFNSNKE 300
Query: 301 SWKRRNFQSKNYWWIPSELSELEKIGGAEFSAWASEYVPSYRLQIDARQFNGLKFGGWRE 360
SWK RNF+S NYWW+PSEL++LEK GG+EF AW SE+VP+YRLQIDA QFN +K GGWRE
Sbjct: 301 SWKHRNFKSMNYWWMPSELTKLEKFGGSEFCAWVSEHVPAYRLQIDAHQFNDIKLGGWRE 360
Query: 361 SAENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKQLQCGAIVHSANLLNKKRNYSSWGF 420
ENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKQLQCGA V SANLL+KKRNYSSWG
Sbjct: 361 FVENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKQLQCGANVLSANLLSKKRNYSSWGL 420
Query: 421 LSKTLAGGIFFVTIIAVGQYFLPRVHVSGRYNVEQPVSSLYGVSSVKNQATEAEKLEEYC 480
LSKTLAGG+FFV I A+GQ F+ RV + GRY+VEQP++SL G+SSVKNQA EA KLE+YC
Sbjct: 421 LSKTLAGGVFFVAIGAIGQRFMSRVRLPGRYSVEQPITSLDGLSSVKNQAMEAAKLEDYC 480
Query: 481 ISVVNIIKDAFGWHGDVHTDKRVGAWIGEAPDYLRVVESDTGSEDTPSGTIEQDNVDGVK 540
ISVV IIKDAFGWHGDVH DKRVGAWIGEAPDYL VVESD GSED PSG I ++N+D VK
Sbjct: 481 ISVVKIIKDAFGWHGDVHMDKRVGAWIGEAPDYLTVVESDIGSEDAPSGMIGEENIDEVK 540
Query: 541 ASAQDIASYQVVLSTEGKIVGFQPTSRVAVNYWAANPLAKQLYGGRNLSPGLFESGLRIR 600
ASAQDIASYQVVL+TEGKIVGFQPTSRVAVNYWAANPLAKQLYGG+NLSPGL E+GLRI+
Sbjct: 541 ASAQDIASYQVVLTTEGKIVGFQPTSRVAVNYWAANPLAKQLYGGKNLSPGLLETGLRIK 600
Query: 601 RPNEVIVIELLMSVKTDAYFALARPTY 626
RPN+V+VIELLMSVKTD +FALARP Y
Sbjct: 601 RPNDVVVIELLMSVKTDTFFALARPVY 627
BLAST of Cp4.1LG10g05150.1 vs. ExPASy TrEMBL
Match:
A0A5A7UA75 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold120G002620 PE=4 SV=1)
HSP 1 Score: 1039 bits (2687), Expect = 0.0
Identity = 523/627 (83.41%), Positives = 568/627 (90.59%), Query Frame = 0
Query: 1 MAISPMALSSSFIHHSPLHSNKTHRLSAMPSIHTPKIVLKNSRRCFSTISCSGRRPIPTT 60
MAISPMALS SFIH+SP+ + +L A+P I+TPKIVLKNSR CFST SCS RPIPTT
Sbjct: 1 MAISPMALSLSFIHYSPIQAIPARQLFAIPLIYTPKIVLKNSRHCFSTFSCSAGRPIPTT 60
Query: 61 EEEVLQAVLESDEKILPCVRTYENDLSRLTLVGGVDFRQSVTAAAADGGEAASEHLDSGM 120
EEEVLQAVLESDEKILPCVRTYENDLSRL+LVGGVDFRQSVTAAAADGGE A+EHLDSGM
Sbjct: 61 EEEVLQAVLESDEKILPCVRTYENDLSRLSLVGGVDFRQSVTAAAADGGETATEHLDSGM 120
Query: 121 SAMVVETVFPGTSDEHSTVSTRLFLPAREVKEKARNLKKSLAQDFNLSTSSKNILAMTFR 180
AMVVETVFPG SDEHSTVSTRLFLPAREVKEKA L+KSLAQDF+ STSSKNILAMTFR
Sbjct: 121 PAMVVETVFPGISDEHSTVSTRLFLPAREVKEKATKLRKSLAQDFHSSTSSKNILAMTFR 180
Query: 181 QVVLQQLWSFELVIFIPGSERNMEDLENPRE-VPMSFTLSSSEERAISVLAEAVCMCALR 240
QVVLQQLW+FELV+F PGSERNMEDLENPRE VP+SFTLSSSEERAISVLAE VCMCAL+
Sbjct: 181 QVVLQQLWNFELVVFTPGSERNMEDLENPREQVPISFTLSSSEERAISVLAETVCMCALQ 240
Query: 241 NTEGKFVNGTASGTSTRFFDWFRKSTIVASKDSSVIIYKLLDNEVADAKSLLQKFNSNKE 300
NTEGKFVNGT+SGTSTR F WFRKSTIVAS+DSSV+I+KL DNEVAD KSLLQKFNSNKE
Sbjct: 241 NTEGKFVNGTSSGTSTRLFGWFRKSTIVASEDSSVVIHKLFDNEVADPKSLLQKFNSNKE 300
Query: 301 SWKRRNFQSKNYWWIPSELSELEKIGGAEFSAWASEYVPSYRLQIDARQFNGLKFGGWRE 360
SWK RNF+S NYWW+PSEL++LEK GG+EF AW SE+VP YRLQIDA QFN +K GGWRE
Sbjct: 301 SWKHRNFKSMNYWWMPSELTKLEKFGGSEFCAWVSEHVPVYRLQIDAYQFNDIKLGGWRE 360
Query: 361 SAENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKQLQCGAIVHSANLLNKKRNYSSWGF 420
ENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKQLQCGA V SANLL+KKRNYSSWG
Sbjct: 361 FVENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKQLQCGANVLSANLLSKKRNYSSWGL 420
Query: 421 LSKTLAGGIFFVTIIAVGQYFLPRVHVSGRYNVEQPVSSLYGVSSVKNQATEAEKLEEYC 480
LSKTLAGG+FFV I A+GQ F+ RV + GRY+VEQP++SL G+SSVKNQA EA KLE+YC
Sbjct: 421 LSKTLAGGVFFVAIGAIGQRFMSRVRLPGRYSVEQPITSLDGLSSVKNQAMEAAKLEDYC 480
Query: 481 ISVVNIIKDAFGWHGDVHTDKRVGAWIGEAPDYLRVVESDTGSEDTPSGTIEQDNVDGVK 540
ISVV IIKDAFGWHGDVH DKRVGAWIGEAPDYL VVESD GSED PSG I ++N+D VK
Sbjct: 481 ISVVKIIKDAFGWHGDVHMDKRVGAWIGEAPDYLTVVESDIGSEDAPSGMIGEENIDEVK 540
Query: 541 ASAQDIASYQVVLSTEGKIVGFQPTSRVAVNYWAANPLAKQLYGGRNLSPGLFESGLRIR 600
ASAQDIASYQVVL+TEGKIVGFQPTSRVAVNYWAANPLAKQLYGG+NLSPGL E+GLRI+
Sbjct: 541 ASAQDIASYQVVLTTEGKIVGFQPTSRVAVNYWAANPLAKQLYGGKNLSPGLLETGLRIK 600
Query: 601 RPNEVIVIELLMSVKTDAYFALARPTY 626
RPN+V+VIELLMSVKTD +FALARP Y
Sbjct: 601 RPNDVVVIELLMSVKTDTFFALARPVY 627
BLAST of Cp4.1LG10g05150.1 vs. TAIR 10
Match:
AT1G28530.2 (unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 19 plant structures; EXPRESSED DURING: 10 growth stages; Has 20 Blast hits to 20 proteins in 6 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 20; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 583.9 bits (1504), Expect = 1.5e-166
Identity = 310/594 (52.19%), Positives = 414/594 (69.70%), Query Frame = 0
Query: 38 VLKNSRRCFSTISCSGRRPIPTTEEEVLQAVLESDEKILPCVRTYENDLSRLTLVGGVDF 97
+L R S + C + +TEE++L+ V ESD K LPCVRTYEN+ +RL+LVG V F
Sbjct: 25 LLPQQRSSVSFVRCFSKN--SSTEEDILRFVAESDGKALPCVRTYENNSARLSLVGTVAF 84
Query: 98 RQSVTAAAADGGEAASEHLDSGMSAMVVETVFPGTSDEHSTVSTRLFLPAREVKEKARNL 157
Q++TAAAADGGEAA +HL + MVVETVFPG SD +TVSTRLFLP ++VKE+A+ L
Sbjct: 85 DQALTAAAADGGEAADDHLRENVPVMVVETVFPGGSDPKATVSTRLFLPTKKVKERAKRL 144
Query: 158 KKSLAQDFNLSTSSKNILAMTFRQVVLQQLWSFELVIFIPGSERNMEDLENPREVPMSFT 217
++SL++D + SKNILAMTFRQVVL+QLW+F+LV+F PG+ER M D ENPREV SFT
Sbjct: 145 RRSLSEDLSSGDLSKNILAMTFRQVVLRQLWNFQLVLFAPGAEREMGDFENPREVSTSFT 204
Query: 218 LSSSEERAISVLAEAVCMCALRNTEGKFVNGTASGTSTRFFDWFRKSTIVASKDSSVIIY 277
LSSS+ER ISV+AE +C+ AL++TE F++ F W K +AS+D SV+++
Sbjct: 205 LSSSDERVISVIAEVICISALQSTEKHFLDDYLGKAKFPFMKWLSKRRRIASRDCSVVLH 264
Query: 278 KLLDNEVADAKSLLQKFNSNKESWKRRNFQSKNYWWIPSELSELEKIGGAEFSAWASEYV 337
KL D+E + K LL+ + S KE++K + + ++ WW S S+LEKIGG FS+WASEY+
Sbjct: 265 KLFDDE-QNTKLLLEYYQSRKENFKLADTKQRSRWWDLSANSKLEKIGGPGFSSWASEYL 324
Query: 338 PSYRLQIDARQFNGLKFGGWRESAENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKQLQ 397
P+YRL++D+ LK GWR+S+EN+WEVLLTHSQMVGLA LDI++ED YSLP KQL
Sbjct: 325 PAYRLEMDSTILADLKLEGWRKSSENKWEVLLTHSQMVGLAEALDIYFEDTYSLPRKQLP 384
Query: 398 CGAIVHSANLLNKKRNYSSWGFLSKTLAGGIFFVTIIAVGQYFLP----RVHVSGRYNVE 457
C + ANL N+K+ S F+S T+A GIF + + A Q+ LP R + R +
Sbjct: 385 CDVPGNYANLPNEKKGLSLLKFISVTMASGIFLLAVSAAAQFCLPQKSERKYPGKRQEIL 444
Query: 458 QPVSSLYGVSSVKNQATEAEKLEEYCISVVNIIKDAFGWHGDVHTDKRVGAWIGEAPDYL 517
S L + +Q++++ +L+ +C +VN +KDA+ W G++ + +GAWIGE PDYL
Sbjct: 445 WSESEL-----LSHQSSDSSELDSFCGLLVNKLKDAYSWVGEITLESSIGAWIGEVPDYL 504
Query: 518 RVVESDTGSED---TPSGTIEQDNVDGVKASAQDIASYQVVLSTEGKIVGFQPTSRVAVN 577
+ ED T S +E N D KASAQDIA+YQVVLS+EGKI+GFQPTSRVAVN
Sbjct: 505 KETSRAKSVEDHIVTSSSLLEILNED-AKASAQDIATYQVVLSSEGKIIGFQPTSRVAVN 564
Query: 578 YWAANPLAKQLYGGRNLSPGLFESGLRIRRPNEVIVIELLMSVKTDAYFALARP 625
+WAANPLA++LYGG+ L PGL E GL+ P +V+V+ELLMSV +D FAL RP
Sbjct: 565 HWAANPLARELYGGKKLKPGLIEPGLKSHPPKKVVVLELLMSVNSDRPFALVRP 609
BLAST of Cp4.1LG10g05150.1 vs. TAIR 10
Match:
AT1G28530.1 (unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 19 plant structures; EXPRESSED DURING: 10 growth stages; Has 20 Blast hits to 20 proteins in 6 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 20; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 579.3 bits (1492), Expect = 3.7e-165
Identity = 310/595 (52.10%), Positives = 414/595 (69.58%), Query Frame = 0
Query: 38 VLKNSRRCFSTISCSGRRPIPTTEEEVLQAVLESDEKILPCVRTYENDLSRLTLVGGVDF 97
+L R S + C + +TEE++L+ V ESD K LPCVRTYEN+ +RL+LVG V F
Sbjct: 25 LLPQQRSSVSFVRCFSKN--SSTEEDILRFVAESDGKALPCVRTYENNSARLSLVGTVAF 84
Query: 98 RQSVTAAAADGGEAASEHLDSGMSAMVVETVFPGTSDEHSTVSTRLFLPAREVKEKARNL 157
Q++TAAAADGGEAA +HL + MVVETVFPG SD +TVSTRLFLP ++VKE+A+ L
Sbjct: 85 DQALTAAAADGGEAADDHLRENVPVMVVETVFPGGSDPKATVSTRLFLPTKKVKERAKRL 144
Query: 158 KKSLAQDFNLSTSSKNILAMTFRQVVLQQLWSFELVIFIPGSERNMEDLENPRE-VPMSF 217
++SL++D + SKNILAMTFRQVVL+QLW+F+LV+F PG+ER M D ENPRE V SF
Sbjct: 145 RRSLSEDLSSGDLSKNILAMTFRQVVLRQLWNFQLVLFAPGAEREMGDFENPREQVSTSF 204
Query: 218 TLSSSEERAISVLAEAVCMCALRNTEGKFVNGTASGTSTRFFDWFRKSTIVASKDSSVII 277
TLSSS+ER ISV+AE +C+ AL++TE F++ F W K +AS+D SV++
Sbjct: 205 TLSSSDERVISVIAEVICISALQSTEKHFLDDYLGKAKFPFMKWLSKRRRIASRDCSVVL 264
Query: 278 YKLLDNEVADAKSLLQKFNSNKESWKRRNFQSKNYWWIPSELSELEKIGGAEFSAWASEY 337
+KL D+E + K LL+ + S KE++K + + ++ WW S S+LEKIGG FS+WASEY
Sbjct: 265 HKLFDDE-QNTKLLLEYYQSRKENFKLADTKQRSRWWDLSANSKLEKIGGPGFSSWASEY 324
Query: 338 VPSYRLQIDARQFNGLKFGGWRESAENRWEVLLTHSQMVGLANILDIFYEDVYSLPDKQL 397
+P+YRL++D+ LK GWR+S+EN+WEVLLTHSQMVGLA LDI++ED YSLP KQL
Sbjct: 325 LPAYRLEMDSTILADLKLEGWRKSSENKWEVLLTHSQMVGLAEALDIYFEDTYSLPRKQL 384
Query: 398 QCGAIVHSANLLNKKRNYSSWGFLSKTLAGGIFFVTIIAVGQYFLP----RVHVSGRYNV 457
C + ANL N+K+ S F+S T+A GIF + + A Q+ LP R + R +
Sbjct: 385 PCDVPGNYANLPNEKKGLSLLKFISVTMASGIFLLAVSAAAQFCLPQKSERKYPGKRQEI 444
Query: 458 EQPVSSLYGVSSVKNQATEAEKLEEYCISVVNIIKDAFGWHGDVHTDKRVGAWIGEAPDY 517
S L + +Q++++ +L+ +C +VN +KDA+ W G++ + +GAWIGE PDY
Sbjct: 445 LWSESEL-----LSHQSSDSSELDSFCGLLVNKLKDAYSWVGEITLESSIGAWIGEVPDY 504
Query: 518 LRVVESDTGSED---TPSGTIEQDNVDGVKASAQDIASYQVVLSTEGKIVGFQPTSRVAV 577
L+ ED T S +E N D KASAQDIA+YQVVLS+EGKI+GFQPTSRVAV
Sbjct: 505 LKETSRAKSVEDHIVTSSSLLEILNED-AKASAQDIATYQVVLSSEGKIIGFQPTSRVAV 564
Query: 578 NYWAANPLAKQLYGGRNLSPGLFESGLRIRRPNEVIVIELLMSVKTDAYFALARP 625
N+WAANPLA++LYGG+ L PGL E GL+ P +V+V+ELLMSV +D FAL RP
Sbjct: 565 NHWAANPLARELYGGKKLKPGLIEPGLKSHPPKKVVVLELLMSVNSDRPFALVRP 610
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_023543337.1 | 0.0 | 100.00 | uncharacterized protein LOC111803246 [Cucurbita pepo subsp. pepo]
XP_022925753.1 | 0.0 | 97.60 | uncharacterized protein LOC111433068 [Cucurbita moschata] >KAG7034621.1 hypothet...
KAG6581337.1 | 0.0 | 97.44 | hypothetical protein SDJN03_21339, partial [Cucurbita argyrosperma subsp. sorori...
XP_022978666.1 | 0.0 | 96.96 | uncharacterized protein LOC111478564 isoform X1 [Cucurbita maxima]
XP_038882562.1 | 0.0 | 88.98 | uncharacterized protein LOC120073791 isoform X2 [Benincasa hispida]
Match Name | E-value | Identity (%) | Description
A0A6J1EG51 | 0.0 | 97.60 | uncharacterized protein LOC111433068 OS=Cucurbita moschata OX=3662 GN=LOC1114330...
A0A6J1INI1 | 0.0 | 96.96 | uncharacterized protein LOC111478564 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A1S3AZK4 | 0.0 | 83.71 | uncharacterized protein LOC103484484 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A1S3AZL6 | 0.0 | 83.57 | uncharacterized protein LOC103484484 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A5A7UA75 | 0.0 | 83.41 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
Match Name | E-value | Identity (%) | Description
AT1G28530.2 | 1.5e-166 | 52.19 | unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplas...
AT1G28530.1 | 3.7e-165 | 52.10 | unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplas...