Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: utr5, CDS, polypeptide, utr3
GGTGGACATAAGAGTTACTGTTTAATAGCGTTCTATAAAATCGTGGATTCTAGAGTTATAGTACGTATGTTCCAATAACAAAACACGATCTTAAATTTCATGGTTAGCTTTATCTTACAGCGTTACTAGCAACACCTCATATTTGATACATCTCTACTGTTTAGATTATTTTATCTGTTATCCAGGTTTTCTTCAGCGAAGTTGAACCTTAGAGTCCCAAAAGTATGATTTTTACTTGAGGAGGCTCCAAATGTATGATTGTAGCTGTTGCATTACATTTTTGTGCATGGTGTTTTGGAGAACTTTGTCGCACAGTTAGCTAGGTACTTAACTTTTTTCTTTTTTGTTTTTTCAGGTATTTGCGCCCGCTTATAATTCTCTATTGCTTTGGTATTGGCATGCTCTTCTGATTATAAGGTTCTATAGACAAGGTCAAGAATGGCTGTTGGATTTAAGTACTGGGATGATTGTGTTGATCCACAAGACATGGAAGCAATGTGGAGTTGCCCTCAAGTATGCGCTGAATGGTTAGATGCTGGAGAGTCCAAAGAACAGAAGGTTCACCTCTCAAGGGATCCAGATGGACAACCTTATTTGACTCAGACAGAGATGAAGGTCATACCCTTTCTATAGCGTTTATGGACATAAAATTAAAGGAAGCATTTAACTTTAATATTTTTATCAGAGCTTGAAGGGGTTATAAGTTTTCAGTATGCATGAGAACCCTACAAGATTTAAAAAATAACTCAATCTTGTCATAGTTTCAATTATTTATCCGCTTGAGGAAGATGAATATTTGCTTTCTGATAATTCAATTTCTTGTCCATTTCTGTCACAGAAAAAAAAATGAAAATTATTGATTTTCTTCCTTTTCTGATGCACGTTATTAGACAAGAAATGTTATATGCCCTTTAAAATTATTGTCTTGTAACATTATGTTATTCTCCAGGCAGTGGCAGATATCATTGTTCAGAGGCACTTTGATTCCAATATAGATTCTGTAAGTTATTTTTTATTTTATTTTTTTGGCTGTATGTTTGCCTCTTTATGTTTTAGATACTTAGAAGAAGAAGAAACCATCATGGGATGACCTTGTGGTTAGCAAGAACCAATGTAAATAGCAAAGGGCCTAGAGAGAATGGATTCAAACAATGATGTCCACCTACCTAAGATTTAATACCCTACAAGTTTCCTTTTCAATCAAATATAGCAAGGCTAGGTGGTTGACCAGTAATTAGTCGAGGTGCACGCAAGTTGGCTCGGACACACAAATATAAAAAAAAATAAATAAAAAAAAGACAATTAATTAAATACTTAGAAGAAGAATAGAATTAAGCCTGCTTCCATCTATGAAAACGGAAATATGGATGCAGATATTGTCATCAAATTCACATTTTAAATTCAAATATTCTATATCATCAGTTAAACAGTTACAGTATTTCATTGTGCTGATATTGATATTCAATTTATTTTTAAACAAAGAAGAGGGTTTCTTGGCTATCTTTTTACTTAGTTGCAAGAATAGTTGCTGTACTTAATTTTCAGTATGTAGTCATAATTCATAATATGTGTAGGAAATGATCTGTGCCATAGCGGAACTTGAGAGCGATAGACAACTTCTTGCAACACGTTATGACAAAAAAACCAAGGAGACAACATTAGGGATCATGCAACTAACACTTAAAACTGCTGAGTGGCTTATCAGGTATGAAGATGATGATTTGATATATTATACATTACATCCTGCATATTCAAATATATACCTAAACGTAGTTCATCAATTAAAACACAATTATAACATATACCTCAATAATATTGTTGGTCTTGCAGTGAATTGGGTTATCAATCTTATGTATTAGAAGGGAACCCAGATGTCCTGAAGAAGCCTTTTGTCAGTGTATATTTTGGGGCTGCTTATCTTAAATGGTTATCAAACTTTGAGCAGAAGTAAGTTATTGAACTCGCATTTCGTTAATCTATTAATTGTATGTTGTATTTGG
CAGCTTCTAGTTCATTGTATATACTTTTTGCTAGGTAGTTAATATCATTTACCCTCCAAATATATGGTTCCTGATATTAACATAGCAAACTCCTTTTCAGAGAGAGAAATGAAGAGTTTGTAGTTAGGGCTTATAGAGGTGGTACAAAGAAGGCGACTCACAAAACAACTCTACAACACTGGAAAAGATATCAATCAGTAAAAGAAAGTCTTCCATCCAGGTGCTCATATCCTATTGATGAATAGAGTTTGAACTGCAATAAGGTTTATTCTGTTCAGATGTTGGTTAATCCTGTTAACAGTTAAATGGATACATGAAAATCTAATGCACTAGAACAACTTGAGGATACGATTTGTTTGATAAGGGCATACATTTATGCTTAGATTTAGCTTAACCCATTATTCTCGCTGCATTAGTGAGTTTCATCGCAGAAACTATTTTCCTTCTTAATTTTTTCTTCAAAGCTTGATTGGTTTGTTTTTTACTTCATTAAAATTAGTCAAAAGTTACATCATCAAATATTTTAGTCTCTGAGCATGATTCGATCTTTTTGTGTCTTTTACCTTCAAACTACCATCAAACAAATTTGCTTCCGGATCTACATTTTTGGACTCCTTGTAGTTTCTTGATGAAATATATCCAATCTTCAATCATAACCATGTTTTTTGTATATTAAATTCCTTGACCCTTTACTGAAAACTGTTAATCAAAAGGTGTAAAGAAATGGCCTTCCTCTGCTTTTGAGATTTGACCTGAAATGTAGAGGTTGAGTAGTTTTATATGTGGCAGAACTCACACATATTGGCAGAAACTATGAACTTTTAATTCTCGGTAATATTTTTAAACCATCTTGATGCTTGCATTCAAAATCATGTAACAGATAGGTGCTGGTTTTGGTAATACAGTTGAAGGTTGTGTAAATAATCATATGTATAATTAAAAAGGGTTAACTGCTCATGAGTTATGAGAAGTAGAAAGAAAACATTCTTAATACAATGTTGCAACTTTTAAACTGATAATCATTATATGGATTATGTCTAGAAAACATATCAATGAAGGCATCGTGATGAGTGAGGCCTCCAGCTCTACTACATCCCCTCCGCCTGCTTCAGGGAATACAGGTTGCATTTCCTTACGCATCTCTTACATGTCATTCATTTGCTTCTGCATCATATTCTTGATATTAAGAATTTGGATAGCTGGATAANGGGGGGGGGGCGCTCTTCAAAGATTAGAAATAAGAGAAAAGGCTAAACAATTCAAAACAGGTAGCTCTTTTGGACTAAAAAGAAAGTACTAAGGGTTCTCTTTTTGTAAACGTGATGGAAGTAACGATTATTCAAAACCCTTCATTGTAATAAAGTAATTCTAACCCTTCATTACCACCAAAAAGATAAAGGGGGACATTTTAAATTAAAAAAACATTGCCATTATGCCGTATGCTCATATCATAGAATGATCCTCCCGAGAGAGAAGAATCAAATGATAATCGATGGGGAAAAAGAAAAGCAAGCAGCAAGAAGTTGAACTTAATAGAGGCACCCTATCATACATACATGATTAGTGTGGTTGATTCTTGGTACTTTGATCCCTGTTCATTACTCTTCTATGGTTCATCTCGCCAAGTGTATTTGCATTTCTTGGAGATGACAGTCCTTTTTCTCATCTTAAAGAGGGTGCAGCTATTATTTATACATTTTGGGATTCTCGAGCTACTCCTGAGGACATGGAAGAGATGTGGAATAATCCTGATGTTCTAAAAGAGTGGACTAAATCTGGAGAGAAAAAGGGCAACGTACGGTTTTCTCATGATGCAAAAAAGAGACCCTATCTTTCCCGAGTAGAATTGAAGGTATAAATTAATAAGACTATTACGCATCATAAGAGTACTTATTTGTTGTTCTGGTTTACTATATTTTTCATTCTGCCTACATTTGTGGGATAGTTGCTTTTATTTCTGTTCAGATGTGCAGTGATCTAGTCACCAATTACAACAAAAT
ATGTTGTAGTCACATTCCTCGTACAGTGATCTAATCATTAAATTTTCTTAATGAAGAGTAGAGACCTTGAAGTTTTCTTTCATAAGCTTGCTATCAACCAGTGAAATAATATGTGAATATTTTTGTGTTTATCTTTTACATTTGTGTATTTGTGCATCCTTTTATAGCTTGCATGCACCCGATTGCTTTAACTTCTTGTGTGAAATAAGCTAATGAGTGCTTTAAATTTGAAAAAAAAAAAAAATCCCCATTCTAATTGTCATTGTCTTGCAGGCCGTTGCTGAGATCATTCTTTCAAAACATTTCAGTACAAAAGGAGTTCAACCAGTAAGTCTCTAGTGGCCATTTCATCCTGTACTTTCCTTTGGACTGTGATGTCTTTTATAATACTTTTCTTTGTTTCATTTGATACTTATTCATCTTCTTTAGTGTTTGTCCACAAAATGAGGACGGGAGCTGGAAAAATTTATTATTTACACATCTATCGATTAAAAAAATGTCAATTGTAACTATCATAGGTTGACCTAGGGGTCAATAAGGGTCAATGAAATAATAATGAGTTTAGAGGAAATGAGTTCGATGACCACCTACTTAGGATTTAATATTCTACGAGTTTCCTTGATAGCTAGGGTCAGGTTGTCTTCTGAGATTAGTCGAGGTGTAAACAAGTTGGCTCGAAACACTCACGGATAAAAGTCACCCATATTTCAAACTAAACCAATGGATGAAAAACCAAACCCCTACTGAACAGACCTCACCCAAGCTCTCCAATTTGAATGTAGAATGGAAGGAGTATGATTAGAAACATATTTAGCAATGGGAGACCATGAAAAGCAATGAAACATGGCCAACATGAAGACTTTCCACCGAACGAAAACTCAATCATAATTGTTAATACACATGCTTTACTGTTGATAAGAAAATACATAATAACTCTGGATTTCCACGCACATTCAACGTCTGTCACTTAAATTATGCATGTACACCTGAAAGGAATATTGTGTTGATGAATCATTCATATAAAAGTTCAACACTAGTTTGAAAAATAATCCTGGAATGAGAGCTCTTGGAAGTATTGAACTTTCACAATTCTTTTCAATTTCCTCCTGGACCATGTAACTCGTTCTTCTGAACTTTTGTTCAATTTAACATTCTTCCCAAAATTATTTATTGTTTGAGTATTTATTGGTGCACAGACGGTTCTCTGTGCTCTGGCTGAGATAGTTAGCATGCGCTTCATTAATGGAGCTGGAGGACGTCCTGGAATAATGGGGATCGACTATTCAACTGCATTTTGGATTTATATGTATGTGAATGTTGAACCAATTTTTAAAAAACATTATCAGCATATGTTTACTTATTGTATTACTGTGGGCTAGAATGTTACATCATTATGTTAATCTATTGTTGGTGGGTAGGGAATTGAGCCACAGAGCGTATAGACTTGATTCTGCTGACGACTTAACCAAGCCATTTGTGTCCATGTATTTCGGTGCTGCCTACTTTGTTTGGTTATCTGAATATGAAGGGAGGTTGGTTTCTTCTATGATGAAAATCTCCACTCAAGTATGGAAACATTTTTCTTCCAAGTATTGCATCTAACATGGTAGTTGTGATCAGGGAACGAACTCGACAGTTTGTTTTTCAGGCTTACATATCTGGGCCGCAAAATGTGGATCTTCAAGAACCAGGCCCTCTTTGGCTCAAATTCGAGGAAGCACTGAGCAGCTACGAAGACACTAAAATGTCAGACCTTTCTGAACAACTCCCTCTATTTAATCTTTAGTTTTCTTCACTTTCATGACTAATACAGTAATGGTTATTTTACCTTAGTGGCACTCAAGGGAGCTGTTCCGTCATGTAAAGCAGACATTTACCTCTCCATCTGTTTTCATCTGTAAGTTGTTCGTCATCAGTTGATCATCGCGACAAGCAGAGGGTGGAAGCAGGATATTCATTCATCCACACGACTAAACTGCACTTGGAAGTTTTTTTTTTTG
TACAGTGCAGGAATATTACAAAACAAATCTGTTCCATAAATTCTTTTCATTCATTTGCCAACCACTATATTTCAGTGTCAGCACAGCAGTGCCTTTCTTTGGGTTTTCTGTACTGATGAAGTATAACTCTTGATAAAAAAAATTAGAAATTAGAGCCAAAGTTTTTACGTATGGGAATTTGCTCCCTTAGAGGTAGGTCTCTTAACGAGTTAACTATGTTCCGGTGGTTAAAGTAAAAATTTTTTAACCAGGAAATAAAAGAAAATTTGGCAATATGAATGTAGTGAAATTAAAGGAACACCCCGCTTTCATGATATGCAATATATTTTTCTTCCCCGTCACCGTTTCCTCTTTTTC
mRNA sequence
GGTGGACATAAGAGTTACTGTTTAATAGCGTTCTATAAAATCGTGGATTCTAGAGTTATAGTACGTATGTTCCAATAACAAAACACGATCTTAAATTTCATGGTTAGCTTTATCTTACAGCGTTACTAGCAACACCTCATATTTGATACATCTCTACTGTTTAGATTATTTTATCTGTTATCCAGGTTTTCTTCAGCGAAGTTGAACCTTAGAGTCCCAAAAGTATGATTTTTACTTGAGGAGGCTCCAAATGTATGATTGTAGCTGTTGCATTACATTTTTGTGCATGGTGTTTTGGAGAACTTTGTCGCACAGTTAGCTAGGTACTTAACTTTTTTCTTTTTTGTTTTTTCAGGTATTTGCGCCCGCTTATAATTCTCTATTGCTTTGGTATTGGCATGCTCTTCTGATTATAAGGTTCTATAGACAAGGTCAAGAATGGCTGTTGGATTTAAGTACTGGGATGATTGTGTTGATCCACAAGACATGGAAGCAATGTGGAGTTGCCCTCAAGTATGCGCTGAATGGTTAGATGCTGGAGAGTCCAAAGAACAGAAGGTTCACCTCTCAAGGGATCCAGATGGACAACCTTATTTGACTCAGACAGAGATGAAGGCAGTGGCAGATATCATTGTTCAGAGGCACTTTGATTCCAATATAGATTCTGAAATGATCTGTGCCATAGCGGAACTTGAGAGCGATAGACAACTTCTTGCAACACGTTATGACAAAAAAACCAAGGAGACAACATTAGGGATCATGCAACTAACACTTAAAACTGCTGAGTGGCTTATCAGTGAATTGGGTTATCAATCTTATGTATTAGAAGGGAACCCAGATGTCCTGAAGAAGCCTTTTGTCAGTGTATATTTTGGGGCTGCTTATCTTAAATGGTTATCAAACTTTGAGCAGAAAGAGAGAAATGAAGAGTTTGTAGTTAGGGCTTATAGAGGTGGTACAAAGAAGGCGACTCACAAAACAACTCTACAACACTGGAAAAGATATCAATCAGTAAAAGAAAGTCTTCCATCCAGAAAACATATCAATGAAGGCATCGTGATGAGTGAGGCCTCCAGCTCTACTACATCCCCTCCGCCTGCTTCAGGGAATACAGAGGGTGCAGCTATTATTTATACATTTTGGGATTCTCGAGCTACTCCTGAGGACATGGAAGAGATGTGGAATAATCCTGATGTTCTAAAAGAGTGGACTAAATCTGGAGAGAAAAAGGGCAACGTACGGTTTTCTCATGATGCAAAAAAGAGACCCTATCTTTCCCGAGTAGAATTGAAGGCCGTTGCTGAGATCATTCTTTCAAAACATTTCAGTACAAAAGGAGTTCAACCAACGGTTCTCTGTGCTCTGGCTGAGATAGTTAGCATGCGCTTCATTAATGGAGCTGGAGGACGTCCTGGAATAATGGGGATCGACTATTCAACTGCATTTTGGATTTATATGGAATTGAGCCACAGAGCGTATAGACTTGATTCTGCTGACGACTTAACCAAGCCATTTGTGTCCATGTATTTCGGTGCTGCCTACTTTGTTTGGTTATCTGAATATGAAGGGAGGGAACGAACTCGACAGTTTGTTTTTCAGGCTTACATATCTGGGCCGCAAAATGTGGATCTTCAAGAACCAGGCCCTCTTTGGCTCAAATTCGAGGAAGCACTGAGCAGCTACGAAGACACTAAAATTGGCACTCAAGGGAGCTGTTCCGTCATGTAAAGCAGACATTTACCTCTCCATCTGTTTTCATCTGTAAGTTGTTCGTCATCAGTTGATCATCGCGACAAGCAGAGGGTGGAAGCAGGATATTCATTCATCCACACGACTAAACTGCACTTGGAAGTTTTTTTTTTTGTACAGTGCAGGAATATTACAAAACAAATCTGTTCCATAAATTCTTTTCATTCATTTGCCAACCACTATATTTCAGTGTCAGCACAGCAGTGCCTTTCTTTGGGTTTTCTGTACTGATGAAGTATAACTCTTGATAA
AAAAAATTAGAAATTAGAGCCAAAGTTTTTACGTATGGGAATTTGCTCCCTTAGAGGTAGGTCTCTTAACGAGTTAACTATGTTCCGGTGGTTAAAGTAAAAATTTTTTAACCAGGAAATAAAAGAAAATTTGGCAATATGAATGTAGTGAAATTAAAGGAACACCCCGCTTTCATGATATGCAATATATTTTTCTTCCCCGTCACCGTTTCCTCTTTTTC
Coding sequence (CDS)
ATGGCTGTTGGATTTAAGTACTGGGATGATTGTGTTGATCCACAAGACATGGAAGCAATGTGGAGTTGCCCTCAAGTATGCGCTGAATGGTTAGATGCTGGAGAGTCCAAAGAACAGAAGGTTCACCTCTCAAGGGATCCAGATGGACAACCTTATTTGACTCAGACAGAGATGAAGGCAGTGGCAGATATCATTGTTCAGAGGCACTTTGATTCCAATATAGATTCTGAAATGATCTGTGCCATAGCGGAACTTGAGAGCGATAGACAACTTCTTGCAACACGTTATGACAAAAAAACCAAGGAGACAACATTAGGGATCATGCAACTAACACTTAAAACTGCTGAGTGGCTTATCAGTGAATTGGGTTATCAATCTTATGTATTAGAAGGGAACCCAGATGTCCTGAAGAAGCCTTTTGTCAGTGTATATTTTGGGGCTGCTTATCTTAAATGGTTATCAAACTTTGAGCAGAAAGAGAGAAATGAAGAGTTTGTAGTTAGGGCTTATAGAGGTGGTACAAAGAAGGCGACTCACAAAACAACTCTACAACACTGGAAAAGATATCAATCAGTAAAAGAAAGTCTTCCATCCAGAAAACATATCAATGAAGGCATCGTGATGAGTGAGGCCTCCAGCTCTACTACATCCCCTCCGCCTGCTTCAGGGAATACAGAGGGTGCAGCTATTATTTATACATTTTGGGATTCTCGAGCTACTCCTGAGGACATGGAAGAGATGTGGAATAATCCTGATGTTCTAAAAGAGTGGACTAAATCTGGAGAGAAAAAGGGCAACGTACGGTTTTCTCATGATGCAAAAAAGAGACCCTATCTTTCCCGAGTAGAATTGAAGGCCGTTGCTGAGATCATTCTTTCAAAACATTTCAGTACAAAAGGAGTTCAACCAACGGTTCTCTGTGCTCTGGCTGAGATAGTTAGCATGCGCTTCATTAATGGAGCTGGAGGACGTCCTGGAATAATGGGGATCGACTATTCAACTGCATTTTGGATTTATATGGAATTGAGCCACAGAGCGTATAGACTTGATTCTGCTGACGACTTAACCAAGCCATTTGTGTCCATGTATTTCGGTGCTGCCTACTTTGTTTGGTTATCTGAATATGAAGGGAGGGAACGAACTCGACAGTTTGTTTTTCAGGCTTACATATCTGGGCCGCAAAATGTGGATCTTCAAGAACCAGGCCCTCTTTGGCTCAAATTCGAGGAAGCACTGAGCAGCTACGAAGACACTAAAATTGGCACTCAAGGGAGCTGTTCCGTCATGTAA
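The legend distinguishes utr5, CDS, and utr3 regions. As a rough sketch, and assuming the CDS occurs as one contiguous substring of the mRNA sequence, the two UTRs could be recovered by locating the CDS inside the mRNA. The function name `split_utrs` and the toy sequences below are illustrative assumptions, not part of this record.

```python
def split_utrs(mrna: str, cds: str) -> tuple[str, str]:
    """Return (utr5, utr3) by locating the CDS inside the mRNA.

    Assumes the CDS appears as a single contiguous substring of the mRNA.
    """
    start = mrna.find(cds)
    if start == -1:
        raise ValueError("CDS not found in mRNA")
    end = start + len(cds)
    # Everything before the CDS is the 5' UTR, everything after is the 3' UTR.
    return mrna[:start], mrna[end:]


# Toy example (hypothetical sequences, not the record's data).
utr5, utr3 = split_utrs("GGTTATGGCTTAAAGC", "ATGGCTTAA")
print(len(utr5), len(utr3))
```

To apply this to the record, the full mRNA and CDS strings would be passed in place of the toy sequences, after joining any line breaks in the mRNA.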
Protein sequence
MAVGFKYWDDCVDPQDMEAMWSCPQVCAEWLDAGESKEQKVHLSRDPDGQPYLTQTEMKAVADIIVQRHFDSNIDSEMICAIAELESDRQLLATRYDKKTKETTLGIMQLTLKTAEWLISELGYQSYVLEGNPDVLKKPFVSVYFGAAYLKWLSNFEQKERNEEFVVRAYRGGTKKATHKTTLQHWKRYQSVKESLPSRKHINEGIVMSEASSSTTSPPPASGNTEGAAIIYTFWDSRATPEDMEEMWNNPDVLKEWTKSGEKKGNVRFSHDAKKRPYLSRVELKAVAEIILSKHFSTKGVQPTVLCALAEIVSMRFINGAGGRPGIMGIDYSTAFWIYMELSHRAYRLDSADDLTKPFVSMYFGAAYFVWLSEYEGRERTRQFVFQAYISGPQNVDLQEPGPLWLKFEEALSSYEDTKIGTQGSCSVM
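The protein sequence should correspond to a translation of the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1); the helper name `translate` is hypothetical. It translates codon by codon and stops at the first stop codon.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        codon = cds[i:i + 3].upper()
        # Codons with ambiguous bases (e.g. N) are reported as X.
        aa = CODON_TABLE.get(codon, "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGGCTGTTGGATTTAAGTACTGGGATGAT"))  # MAVGFKYWDD
```

The printed residues match the start of the protein sequence shown above (MAVGFKYWDD).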
Homology
BLAST of MC04g0022 vs. NCBI nr
Match:
XP_022148059.1 (uncharacterized protein LOC111016834 isoform X1 [Momordica charantia])
HSP 1 Score: 878 bits (2269), Expect = 0.0
Identity = 429/429 (100.00%), Positives = 429/429 (100.00%), Query Frame = 0
Query: 1 MAVGFKYWDDCVDPQDMEAMWSCPQVCAEWLDAGESKEQKVHLSRDPDGQPYLTQTEMKA 60
MAVGFKYWDDCVDPQDMEAMWSCPQVCAEWLDAGESKEQKVHLSRDPDGQPYLTQTEMKA
Sbjct: 28 MAVGFKYWDDCVDPQDMEAMWSCPQVCAEWLDAGESKEQKVHLSRDPDGQPYLTQTEMKA 87
Query: 61 VADIIVQRHFDSNIDSEMICAIAELESDRQLLATRYDKKTKETTLGIMQLTLKTAEWLIS 120
VADIIVQRHFDSNIDSEMICAIAELESDRQLLATRYDKKTKETTLGIMQLTLKTAEWLIS
Sbjct: 88 VADIIVQRHFDSNIDSEMICAIAELESDRQLLATRYDKKTKETTLGIMQLTLKTAEWLIS 147
Query: 121 ELGYQSYVLEGNPDVLKKPFVSVYFGAAYLKWLSNFEQKERNEEFVVRAYRGGTKKATHK 180
ELGYQSYVLEGNPDVLKKPFVSVYFGAAYLKWLSNFEQKERNEEFVVRAYRGGTKKATHK
Sbjct: 148 ELGYQSYVLEGNPDVLKKPFVSVYFGAAYLKWLSNFEQKERNEEFVVRAYRGGTKKATHK 207
Query: 181 TTLQHWKRYQSVKESLPSRKHINEGIVMSEASSSTTSPPPASGNTEGAAIIYTFWDSRAT 240
TTLQHWKRYQSVKESLPSRKHINEGIVMSEASSSTTSPPPASGNTEGAAIIYTFWDSRAT
Sbjct: 208 TTLQHWKRYQSVKESLPSRKHINEGIVMSEASSSTTSPPPASGNTEGAAIIYTFWDSRAT 267
Query: 241 PEDMEEMWNNPDVLKEWTKSGEKKGNVRFSHDAKKRPYLSRVELKAVAEIILSKHFSTKG 300
PEDMEEMWNNPDVLKEWTKSGEKKGNVRFSHDAKKRPYLSRVELKAVAEIILSKHFSTKG
Sbjct: 268 PEDMEEMWNNPDVLKEWTKSGEKKGNVRFSHDAKKRPYLSRVELKAVAEIILSKHFSTKG 327
Query: 301 VQPTVLCALAEIVSMRFINGAGGRPGIMGIDYSTAFWIYMELSHRAYRLDSADDLTKPFV 360
VQPTVLCALAEIVSMRFINGAGGRPGIMGIDYSTAFWIYMELSHRAYRLDSADDLTKPFV
Sbjct: 328 VQPTVLCALAEIVSMRFINGAGGRPGIMGIDYSTAFWIYMELSHRAYRLDSADDLTKPFV 387
Query: 361 SMYFGAAYFVWLSEYEGRERTRQFVFQAYISGPQNVDLQEPGPLWLKFEEALSSYEDTKI 420
SMYFGAAYFVWLSEYEGRERTRQFVFQAYISGPQNVDLQEPGPLWLKFEEALSSYEDTKI
Sbjct: 388 SMYFGAAYFVWLSEYEGRERTRQFVFQAYISGPQNVDLQEPGPLWLKFEEALSSYEDTKI 447
Query: 421 GTQGSCSVM 429
GTQGSCSVM
Sbjct: 448 GTQGSCSVM 456
BLAST of MC04g0022 vs. NCBI nr
Match:
XP_022148061.1 (uncharacterized protein LOC111016834 isoform X3 [Momordica charantia])
HSP 1 Score: 878 bits (2269), Expect = 0.0
Identity = 429/429 (100.00%), Positives = 429/429 (100.00%), Query Frame = 0
Query: 1 MAVGFKYWDDCVDPQDMEAMWSCPQVCAEWLDAGESKEQKVHLSRDPDGQPYLTQTEMKA 60
MAVGFKYWDDCVDPQDMEAMWSCPQVCAEWLDAGESKEQKVHLSRDPDGQPYLTQTEMKA
Sbjct: 1 MAVGFKYWDDCVDPQDMEAMWSCPQVCAEWLDAGESKEQKVHLSRDPDGQPYLTQTEMKA 60
Query: 61 VADIIVQRHFDSNIDSEMICAIAELESDRQLLATRYDKKTKETTLGIMQLTLKTAEWLIS 120
VADIIVQRHFDSNIDSEMICAIAELESDRQLLATRYDKKTKETTLGIMQLTLKTAEWLIS
Sbjct: 61 VADIIVQRHFDSNIDSEMICAIAELESDRQLLATRYDKKTKETTLGIMQLTLKTAEWLIS 120
Query: 121 ELGYQSYVLEGNPDVLKKPFVSVYFGAAYLKWLSNFEQKERNEEFVVRAYRGGTKKATHK 180
ELGYQSYVLEGNPDVLKKPFVSVYFGAAYLKWLSNFEQKERNEEFVVRAYRGGTKKATHK
Sbjct: 121 ELGYQSYVLEGNPDVLKKPFVSVYFGAAYLKWLSNFEQKERNEEFVVRAYRGGTKKATHK 180
Query: 181 TTLQHWKRYQSVKESLPSRKHINEGIVMSEASSSTTSPPPASGNTEGAAIIYTFWDSRAT 240
TTLQHWKRYQSVKESLPSRKHINEGIVMSEASSSTTSPPPASGNTEGAAIIYTFWDSRAT
Sbjct: 181 TTLQHWKRYQSVKESLPSRKHINEGIVMSEASSSTTSPPPASGNTEGAAIIYTFWDSRAT 240
Query: 241 PEDMEEMWNNPDVLKEWTKSGEKKGNVRFSHDAKKRPYLSRVELKAVAEIILSKHFSTKG 300
PEDMEEMWNNPDVLKEWTKSGEKKGNVRFSHDAKKRPYLSRVELKAVAEIILSKHFSTKG
Sbjct: 241 PEDMEEMWNNPDVLKEWTKSGEKKGNVRFSHDAKKRPYLSRVELKAVAEIILSKHFSTKG 300
Query: 301 VQPTVLCALAEIVSMRFINGAGGRPGIMGIDYSTAFWIYMELSHRAYRLDSADDLTKPFV 360
VQPTVLCALAEIVSMRFINGAGGRPGIMGIDYSTAFWIYMELSHRAYRLDSADDLTKPFV
Sbjct: 301 VQPTVLCALAEIVSMRFINGAGGRPGIMGIDYSTAFWIYMELSHRAYRLDSADDLTKPFV 360
Query: 361 SMYFGAAYFVWLSEYEGRERTRQFVFQAYISGPQNVDLQEPGPLWLKFEEALSSYEDTKI 420
SMYFGAAYFVWLSEYEGRERTRQFVFQAYISGPQNVDLQEPGPLWLKFEEALSSYEDTKI
Sbjct: 361 SMYFGAAYFVWLSEYEGRERTRQFVFQAYISGPQNVDLQEPGPLWLKFEEALSSYEDTKI 420
Query: 421 GTQGSCSVM 429
GTQGSCSVM
Sbjct: 421 GTQGSCSVM 429
BLAST of MC04g0022 vs. NCBI nr
Match:
XP_022148060.1 (uncharacterized protein LOC111016834 isoform X2 [Momordica charantia])
HSP 1 Score: 867 bits (2240), Expect = 0.0
Identity = 426/429 (99.30%), Positives = 426/429 (99.30%), Query Frame = 0
Query: 1 MAVGFKYWDDCVDPQDMEAMWSCPQVCAEWLDAGESKEQKVHLSRDPDGQPYLTQTEMKA 60
MAVGFKYWDDCVDPQDMEAMWSCPQVCAEWLDAGESKEQKVHLSRDPDGQPYLTQTEMKA
Sbjct: 28 MAVGFKYWDDCVDPQDMEAMWSCPQVCAEWLDAGESKEQKVHLSRDPDGQPYLTQTEMKA 87
Query: 61 VADIIVQRHFDSNIDSEMICAIAELESDRQLLATRYDKKTKETTLGIMQLTLKTAEWLIS 120
VADIIVQRHFDSNIDSEMICAIAELESDRQLLATRYDKKTKETTLGIMQLTLKTAEWLIS
Sbjct: 88 VADIIVQRHFDSNIDSEMICAIAELESDRQLLATRYDKKTKETTLGIMQLTLKTAEWLIS 147
Query: 121 ELGYQSYVLEGNPDVLKKPFVSVYFGAAYLKWLSNFEQKERNEEFVVRAYRGGTKKATHK 180
ELGYQSYVLEGNPDVLKKPFVSVYFGAAYLKWLSNFEQKERNEEFVVRAYRGGTKKATHK
Sbjct: 148 ELGYQSYVLEGNPDVLKKPFVSVYFGAAYLKWLSNFEQKERNEEFVVRAYRGGTKKATHK 207
Query: 181 TTLQHWKRYQSVKESLPSRKHINEGIVMSEASSSTTSPPPASGNTEGAAIIYTFWDSRAT 240
TTLQHWKRYQSVKESLPSRKHINEGIVMSEASSSTTSPPPASGNT AIIYTFWDSRAT
Sbjct: 208 TTLQHWKRYQSVKESLPSRKHINEGIVMSEASSSTTSPPPASGNT---AIIYTFWDSRAT 267
Query: 241 PEDMEEMWNNPDVLKEWTKSGEKKGNVRFSHDAKKRPYLSRVELKAVAEIILSKHFSTKG 300
PEDMEEMWNNPDVLKEWTKSGEKKGNVRFSHDAKKRPYLSRVELKAVAEIILSKHFSTKG
Sbjct: 268 PEDMEEMWNNPDVLKEWTKSGEKKGNVRFSHDAKKRPYLSRVELKAVAEIILSKHFSTKG 327
Query: 301 VQPTVLCALAEIVSMRFINGAGGRPGIMGIDYSTAFWIYMELSHRAYRLDSADDLTKPFV 360
VQPTVLCALAEIVSMRFINGAGGRPGIMGIDYSTAFWIYMELSHRAYRLDSADDLTKPFV
Sbjct: 328 VQPTVLCALAEIVSMRFINGAGGRPGIMGIDYSTAFWIYMELSHRAYRLDSADDLTKPFV 387
Query: 361 SMYFGAAYFVWLSEYEGRERTRQFVFQAYISGPQNVDLQEPGPLWLKFEEALSSYEDTKI 420
SMYFGAAYFVWLSEYEGRERTRQFVFQAYISGPQNVDLQEPGPLWLKFEEALSSYEDTKI
Sbjct: 388 SMYFGAAYFVWLSEYEGRERTRQFVFQAYISGPQNVDLQEPGPLWLKFEEALSSYEDTKI 447
Query: 421 GTQGSCSVM 429
GTQGSCSVM
Sbjct: 448 GTQGSCSVM 453
BLAST of MC04g0022 vs. NCBI nr
Match:
XP_038886952.1 (uncharacterized protein LOC120077127 [Benincasa hispida])
HSP 1 Score: 776 bits (2004), Expect = 8.75e-282
Identity = 377/429 (87.88%), Positives = 395/429 (92.07%), Query Frame = 0
Query: 1 MAVGFKYWDDCVDPQDMEAMWSCPQVCAEWLDAGESKEQKVHLSRDPDGQPYLTQTEMKA 60
MA+GFKYWDDCVDPQD+EAMWS PQVCAEWLDAGESK QKVHLSRDPDGQPYLTQTEMKA
Sbjct: 1 MAIGFKYWDDCVDPQDIEAMWSYPQVCAEWLDAGESKTQKVHLSRDPDGQPYLTQTEMKA 60
Query: 61 VADIIVQRHFDSNIDSEMICAIAELESDRQLLATRYDKKTKETTLGIMQLTLKTAEWLIS 120
VADII+QRHF S +DSEMICAIAELESDRQ LATRYDKKTKETTLGIMQ+TLKTA+WL+S
Sbjct: 61 VADIILQRHFVSKVDSEMICAIAELESDRQPLATRYDKKTKETTLGIMQITLKTAQWLVS 120
Query: 121 ELGYQSYVLEGNPDVLKKPFVSVYFGAAYLKWLSNFEQKERNEEFVVRAYRGGTKKATHK 180
ELGYQSY LEGNPDVL KPFV+VYFGAAYLKWLSNFEQKERNEEFVVRAYR GTKKATHK
Sbjct: 121 ELGYQSYGLEGNPDVLSKPFVNVYFGAAYLKWLSNFEQKERNEEFVVRAYRSGTKKATHK 180
Query: 181 TTLQHWKRYQSVKESLPSRKHINEGIVMSEASSSTTSPPPASGNTEGAAIIYTFWDSRAT 240
TTL +WKRY SVKESLPSRKHINE SSS TSPPPASGNTEGAAI YTFWD RAT
Sbjct: 181 TTLPYWKRYLSVKESLPSRKHINE------VSSSATSPPPASGNTEGAAITYTFWDCRAT 240
Query: 241 PEDMEEMWNNPDVLKEWTKSGEKKGNVRFSHDAKKRPYLSRVELKAVAEIILSKHFSTKG 300
PEDMEEMWNNPDVLKEWTKSGEKKGNVRFSHD KKRPY+SRVELKA+AEIILSKHFSTKG
Sbjct: 241 PEDMEEMWNNPDVLKEWTKSGEKKGNVRFSHDLKKRPYVSRVELKAIAEIILSKHFSTKG 300
Query: 301 VQPTVLCALAEIVSMRFINGAGGRPGIMGIDYSTAFWIYMELSHRAYRLDSADDLTKPFV 360
V+ TVLCALAE+VSMRFING G RPGIMGIDYSTA W+YMEL +RAYRLDS DDLTKPFV
Sbjct: 301 VKSTVLCALAEVVSMRFINGVGARPGIMGIDYSTASWLYMELRYRAYRLDSVDDLTKPFV 360
Query: 361 SMYFGAAYFVWLSEYEGRERTRQFVFQAYISGPQNVDLQEPGPLWLKFEEALSSYEDTKI 420
SMYFGAAY VWLSEYEGRERTRQFV QAYI+GPQNVDLQE PLWLKFEEALS+YED K
Sbjct: 361 SMYFGAAYLVWLSEYEGRERTRQFVVQAYIAGPQNVDLQETSPLWLKFEEALSNYEDNKS 420
Query: 421 GTQGSCSVM 429
G QGSCS+M
Sbjct: 421 GAQGSCSIM 423
BLAST of MC04g0022 vs. NCBI nr
Match:
XP_004139838.1 (uncharacterized protein LOC101215745 [Cucumis sativus] >XP_011659044.1 uncharacterized protein LOC101215745 [Cucumis sativus] >XP_011659045.1 uncharacterized protein LOC101215745 [Cucumis sativus] >XP_031744864.1 uncharacterized protein LOC101215745 [Cucumis sativus] >XP_031744865.1 uncharacterized protein LOC101215745 [Cucumis sativus] >KGN44290.1 hypothetical protein Csa_015969 [Cucumis sativus])
HSP 1 Score: 775 bits (2000), Expect = 3.56e-281
Identity = 374/429 (87.18%), Positives = 395/429 (92.07%), Query Frame = 0
Query: 1 MAVGFKYWDDCVDPQDMEAMWSCPQVCAEWLDAGESKEQKVHLSRDPDGQPYLTQTEMKA 60
MA+GFKYWDDCVDPQDMEAMWS PQVCAEWLDAGESK QKVHLSRDPDGQPYLTQTEMKA
Sbjct: 1 MAIGFKYWDDCVDPQDMEAMWSYPQVCAEWLDAGESKTQKVHLSRDPDGQPYLTQTEMKA 60
Query: 61 VADIIVQRHFDSNIDSEMICAIAELESDRQLLATRYDKKTKETTLGIMQLTLKTAEWLIS 120
VADI+V RHF SN+DS+MICA+AELESDRQ LATRYDKK KE+TLGIMQ+TLKTAEWL+S
Sbjct: 61 VADIVVHRHFGSNVDSDMICALAELESDRQPLATRYDKKNKESTLGIMQITLKTAEWLVS 120
Query: 121 ELGYQSYVLEGNPDVLKKPFVSVYFGAAYLKWLSNFEQKERNEEFVVRAYRGGTKKATHK 180
EL YQSY LEGNP+VL KPFVSVYFGAAYLKWLSNFEQKER+EEFVVRAYRGGTKKATHK
Sbjct: 121 ELRYQSYGLEGNPEVLSKPFVSVYFGAAYLKWLSNFEQKERSEEFVVRAYRGGTKKATHK 180
Query: 181 TTLQHWKRYQSVKESLPSRKHINEGIVMSEASSSTTSPPPASGNTEGAAIIYTFWDSRAT 240
TTL +WKRY SVKESLPSRKHINE S+STTSPP ASGNTEGAAI YTFWD RAT
Sbjct: 181 TTLPYWKRYLSVKESLPSRKHINE------VSTSTTSPPSASGNTEGAAITYTFWDCRAT 240
Query: 241 PEDMEEMWNNPDVLKEWTKSGEKKGNVRFSHDAKKRPYLSRVELKAVAEIILSKHFSTKG 300
PEDMEEMWNNPDV KEWTKSGEKKGNVRFSHD KKRPY+SRVELKA+AEIILSKHFSTKG
Sbjct: 241 PEDMEEMWNNPDVQKEWTKSGEKKGNVRFSHDLKKRPYVSRVELKAIAEIILSKHFSTKG 300
Query: 301 VQPTVLCALAEIVSMRFINGAGGRPGIMGIDYSTAFWIYMELSHRAYRLDSADDLTKPFV 360
VQPTVLCALAE+VSMRFING G RPGIMGIDYSTAFW+YMELS+RAYRLDS DDLTKPFV
Sbjct: 301 VQPTVLCALAEVVSMRFINGVGARPGIMGIDYSTAFWLYMELSYRAYRLDSTDDLTKPFV 360
Query: 361 SMYFGAAYFVWLSEYEGRERTRQFVFQAYISGPQNVDLQEPGPLWLKFEEALSSYEDTKI 420
SMYFGAAY WLS+YEGRERTRQFV QAYI+GPQNVDL E GPLWLKFEEALS+YED K
Sbjct: 361 SMYFGAAYLAWLSDYEGRERTRQFVVQAYIAGPQNVDLPETGPLWLKFEEALSNYEDNKS 420
Query: 421 GTQGSCSVM 429
G QGSCS+M
Sbjct: 421 GAQGSCSIM 423
BLAST of MC04g0022 vs. ExPASy TrEMBL
Match:
A0A6J1D485 (uncharacterized protein LOC111016834 isoform X3 OS=Momordica charantia OX=3673 GN=LOC111016834 PE=4 SV=1)
HSP 1 Score: 878 bits (2269), Expect = 0.0
Identity = 429/429 (100.00%), Positives = 429/429 (100.00%), Query Frame = 0
Query: 1 MAVGFKYWDDCVDPQDMEAMWSCPQVCAEWLDAGESKEQKVHLSRDPDGQPYLTQTEMKA 60
MAVGFKYWDDCVDPQDMEAMWSCPQVCAEWLDAGESKEQKVHLSRDPDGQPYLTQTEMKA
Sbjct: 1 MAVGFKYWDDCVDPQDMEAMWSCPQVCAEWLDAGESKEQKVHLSRDPDGQPYLTQTEMKA 60
Query: 61 VADIIVQRHFDSNIDSEMICAIAELESDRQLLATRYDKKTKETTLGIMQLTLKTAEWLIS 120
VADIIVQRHFDSNIDSEMICAIAELESDRQLLATRYDKKTKETTLGIMQLTLKTAEWLIS
Sbjct: 61 VADIIVQRHFDSNIDSEMICAIAELESDRQLLATRYDKKTKETTLGIMQLTLKTAEWLIS 120
Query: 121 ELGYQSYVLEGNPDVLKKPFVSVYFGAAYLKWLSNFEQKERNEEFVVRAYRGGTKKATHK 180
ELGYQSYVLEGNPDVLKKPFVSVYFGAAYLKWLSNFEQKERNEEFVVRAYRGGTKKATHK
Sbjct: 121 ELGYQSYVLEGNPDVLKKPFVSVYFGAAYLKWLSNFEQKERNEEFVVRAYRGGTKKATHK 180
Query: 181 TTLQHWKRYQSVKESLPSRKHINEGIVMSEASSSTTSPPPASGNTEGAAIIYTFWDSRAT 240
TTLQHWKRYQSVKESLPSRKHINEGIVMSEASSSTTSPPPASGNTEGAAIIYTFWDSRAT
Sbjct: 181 TTLQHWKRYQSVKESLPSRKHINEGIVMSEASSSTTSPPPASGNTEGAAIIYTFWDSRAT 240
Query: 241 PEDMEEMWNNPDVLKEWTKSGEKKGNVRFSHDAKKRPYLSRVELKAVAEIILSKHFSTKG 300
PEDMEEMWNNPDVLKEWTKSGEKKGNVRFSHDAKKRPYLSRVELKAVAEIILSKHFSTKG
Sbjct: 241 PEDMEEMWNNPDVLKEWTKSGEKKGNVRFSHDAKKRPYLSRVELKAVAEIILSKHFSTKG 300
Query: 301 VQPTVLCALAEIVSMRFINGAGGRPGIMGIDYSTAFWIYMELSHRAYRLDSADDLTKPFV 360
VQPTVLCALAEIVSMRFINGAGGRPGIMGIDYSTAFWIYMELSHRAYRLDSADDLTKPFV
Sbjct: 301 VQPTVLCALAEIVSMRFINGAGGRPGIMGIDYSTAFWIYMELSHRAYRLDSADDLTKPFV 360
Query: 361 SMYFGAAYFVWLSEYEGRERTRQFVFQAYISGPQNVDLQEPGPLWLKFEEALSSYEDTKI 420
SMYFGAAYFVWLSEYEGRERTRQFVFQAYISGPQNVDLQEPGPLWLKFEEALSSYEDTKI
Sbjct: 361 SMYFGAAYFVWLSEYEGRERTRQFVFQAYISGPQNVDLQEPGPLWLKFEEALSSYEDTKI 420
Query: 421 GTQGSCSVM 429
GTQGSCSVM
Sbjct: 421 GTQGSCSVM 429
BLAST of MC04g0022 vs. ExPASy TrEMBL
Match:
A0A6J1D2W1 (uncharacterized protein LOC111016834 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111016834 PE=4 SV=1)
HSP 1 Score: 878 bits (2269), Expect = 0.0
Identity = 429/429 (100.00%), Positives = 429/429 (100.00%), Query Frame = 0
Query: 1 MAVGFKYWDDCVDPQDMEAMWSCPQVCAEWLDAGESKEQKVHLSRDPDGQPYLTQTEMKA 60
MAVGFKYWDDCVDPQDMEAMWSCPQVCAEWLDAGESKEQKVHLSRDPDGQPYLTQTEMKA
Sbjct: 28 MAVGFKYWDDCVDPQDMEAMWSCPQVCAEWLDAGESKEQKVHLSRDPDGQPYLTQTEMKA 87
Query: 61 VADIIVQRHFDSNIDSEMICAIAELESDRQLLATRYDKKTKETTLGIMQLTLKTAEWLIS 120
VADIIVQRHFDSNIDSEMICAIAELESDRQLLATRYDKKTKETTLGIMQLTLKTAEWLIS
Sbjct: 88 VADIIVQRHFDSNIDSEMICAIAELESDRQLLATRYDKKTKETTLGIMQLTLKTAEWLIS 147
Query: 121 ELGYQSYVLEGNPDVLKKPFVSVYFGAAYLKWLSNFEQKERNEEFVVRAYRGGTKKATHK 180
ELGYQSYVLEGNPDVLKKPFVSVYFGAAYLKWLSNFEQKERNEEFVVRAYRGGTKKATHK
Sbjct: 148 ELGYQSYVLEGNPDVLKKPFVSVYFGAAYLKWLSNFEQKERNEEFVVRAYRGGTKKATHK 207
Query: 181 TTLQHWKRYQSVKESLPSRKHINEGIVMSEASSSTTSPPPASGNTEGAAIIYTFWDSRAT 240
TTLQHWKRYQSVKESLPSRKHINEGIVMSEASSSTTSPPPASGNTEGAAIIYTFWDSRAT
Sbjct: 208 TTLQHWKRYQSVKESLPSRKHINEGIVMSEASSSTTSPPPASGNTEGAAIIYTFWDSRAT 267
Query: 241 PEDMEEMWNNPDVLKEWTKSGEKKGNVRFSHDAKKRPYLSRVELKAVAEIILSKHFSTKG 300
PEDMEEMWNNPDVLKEWTKSGEKKGNVRFSHDAKKRPYLSRVELKAVAEIILSKHFSTKG
Sbjct: 268 PEDMEEMWNNPDVLKEWTKSGEKKGNVRFSHDAKKRPYLSRVELKAVAEIILSKHFSTKG 327
Query: 301 VQPTVLCALAEIVSMRFINGAGGRPGIMGIDYSTAFWIYMELSHRAYRLDSADDLTKPFV 360
VQPTVLCALAEIVSMRFINGAGGRPGIMGIDYSTAFWIYMELSHRAYRLDSADDLTKPFV
Sbjct: 328 VQPTVLCALAEIVSMRFINGAGGRPGIMGIDYSTAFWIYMELSHRAYRLDSADDLTKPFV 387
Query: 361 SMYFGAAYFVWLSEYEGRERTRQFVFQAYISGPQNVDLQEPGPLWLKFEEALSSYEDTKI 420
SMYFGAAYFVWLSEYEGRERTRQFVFQAYISGPQNVDLQEPGPLWLKFEEALSSYEDTKI
Sbjct: 388 SMYFGAAYFVWLSEYEGRERTRQFVFQAYISGPQNVDLQEPGPLWLKFEEALSSYEDTKI 447
Query: 421 GTQGSCSVM 429
GTQGSCSVM
Sbjct: 448 GTQGSCSVM 456
BLAST of MC04g0022 vs. ExPASy TrEMBL
Match:
A0A6J1D1V8 (uncharacterized protein LOC111016834 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111016834 PE=4 SV=1)
HSP 1 Score: 867 bits (2240), Expect = 0.0
Identity = 426/429 (99.30%), Positives = 426/429 (99.30%), Query Frame = 0
Query: 1 MAVGFKYWDDCVDPQDMEAMWSCPQVCAEWLDAGESKEQKVHLSRDPDGQPYLTQTEMKA 60
MAVGFKYWDDCVDPQDMEAMWSCPQVCAEWLDAGESKEQKVHLSRDPDGQPYLTQTEMKA
Sbjct: 28 MAVGFKYWDDCVDPQDMEAMWSCPQVCAEWLDAGESKEQKVHLSRDPDGQPYLTQTEMKA 87
Query: 61 VADIIVQRHFDSNIDSEMICAIAELESDRQLLATRYDKKTKETTLGIMQLTLKTAEWLIS 120
VADIIVQRHFDSNIDSEMICAIAELESDRQLLATRYDKKTKETTLGIMQLTLKTAEWLIS
Sbjct: 88 VADIIVQRHFDSNIDSEMICAIAELESDRQLLATRYDKKTKETTLGIMQLTLKTAEWLIS 147
Query: 121 ELGYQSYVLEGNPDVLKKPFVSVYFGAAYLKWLSNFEQKERNEEFVVRAYRGGTKKATHK 180
ELGYQSYVLEGNPDVLKKPFVSVYFGAAYLKWLSNFEQKERNEEFVVRAYRGGTKKATHK
Sbjct: 148 ELGYQSYVLEGNPDVLKKPFVSVYFGAAYLKWLSNFEQKERNEEFVVRAYRGGTKKATHK 207
Query: 181 TTLQHWKRYQSVKESLPSRKHINEGIVMSEASSSTTSPPPASGNTEGAAIIYTFWDSRAT 240
TTLQHWKRYQSVKESLPSRKHINEGIVMSEASSSTTSPPPASGNT AIIYTFWDSRAT
Sbjct: 208 TTLQHWKRYQSVKESLPSRKHINEGIVMSEASSSTTSPPPASGNT---AIIYTFWDSRAT 267
Query: 241 PEDMEEMWNNPDVLKEWTKSGEKKGNVRFSHDAKKRPYLSRVELKAVAEIILSKHFSTKG 300
PEDMEEMWNNPDVLKEWTKSGEKKGNVRFSHDAKKRPYLSRVELKAVAEIILSKHFSTKG
Sbjct: 268 PEDMEEMWNNPDVLKEWTKSGEKKGNVRFSHDAKKRPYLSRVELKAVAEIILSKHFSTKG 327
Query: 301 VQPTVLCALAEIVSMRFINGAGGRPGIMGIDYSTAFWIYMELSHRAYRLDSADDLTKPFV 360
VQPTVLCALAEIVSMRFINGAGGRPGIMGIDYSTAFWIYMELSHRAYRLDSADDLTKPFV
Sbjct: 328 VQPTVLCALAEIVSMRFINGAGGRPGIMGIDYSTAFWIYMELSHRAYRLDSADDLTKPFV 387
Query: 361 SMYFGAAYFVWLSEYEGRERTRQFVFQAYISGPQNVDLQEPGPLWLKFEEALSSYEDTKI 420
SMYFGAAYFVWLSEYEGRERTRQFVFQAYISGPQNVDLQEPGPLWLKFEEALSSYEDTKI
Sbjct: 388 SMYFGAAYFVWLSEYEGRERTRQFVFQAYISGPQNVDLQEPGPLWLKFEEALSSYEDTKI 447
Query: 421 GTQGSCSVM 429
GTQGSCSVM
Sbjct: 448 GTQGSCSVM 453
BLAST of MC04g0022 vs. ExPASy TrEMBL
Match:
A0A0A0K3L0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G239030 PE=4 SV=1)
HSP 1 Score: 775 bits (2000), Expect = 1.72e-281
Identity = 374/429 (87.18%), Positives = 395/429 (92.07%), Query Frame = 0
Query: 1 MAVGFKYWDDCVDPQDMEAMWSCPQVCAEWLDAGESKEQKVHLSRDPDGQPYLTQTEMKA 60
MA+GFKYWDDCVDPQDMEAMWS PQVCAEWLDAGESK QKVHLSRDPDGQPYLTQTEMKA
Sbjct: 1 MAIGFKYWDDCVDPQDMEAMWSYPQVCAEWLDAGESKTQKVHLSRDPDGQPYLTQTEMKA 60
Query: 61 VADIIVQRHFDSNIDSEMICAIAELESDRQLLATRYDKKTKETTLGIMQLTLKTAEWLIS 120
VADI+V RHF SN+DS+MICA+AELESDRQ LATRYDKK KE+TLGIMQ+TLKTAEWL+S
Sbjct: 61 VADIVVHRHFGSNVDSDMICALAELESDRQPLATRYDKKNKESTLGIMQITLKTAEWLVS 120
Query: 121 ELGYQSYVLEGNPDVLKKPFVSVYFGAAYLKWLSNFEQKERNEEFVVRAYRGGTKKATHK 180
EL YQSY LEGNP+VL KPFVSVYFGAAYLKWLSNFEQKER+EEFVVRAYRGGTKKATHK
Sbjct: 121 ELRYQSYGLEGNPEVLSKPFVSVYFGAAYLKWLSNFEQKERSEEFVVRAYRGGTKKATHK 180
Query: 181 TTLQHWKRYQSVKESLPSRKHINEGIVMSEASSSTTSPPPASGNTEGAAIIYTFWDSRAT 240
TTL +WKRY SVKESLPSRKHINE S+STTSPP ASGNTEGAAI YTFWD RAT
Sbjct: 181 TTLPYWKRYLSVKESLPSRKHINE------VSTSTTSPPSASGNTEGAAITYTFWDCRAT 240
Query: 241 PEDMEEMWNNPDVLKEWTKSGEKKGNVRFSHDAKKRPYLSRVELKAVAEIILSKHFSTKG 300
PEDMEEMWNNPDV KEWTKSGEKKGNVRFSHD KKRPY+SRVELKA+AEIILSKHFSTKG
Sbjct: 241 PEDMEEMWNNPDVQKEWTKSGEKKGNVRFSHDLKKRPYVSRVELKAIAEIILSKHFSTKG 300
Query: 301 VQPTVLCALAEIVSMRFINGAGGRPGIMGIDYSTAFWIYMELSHRAYRLDSADDLTKPFV 360
VQPTVLCALAE+VSMRFING G RPGIMGIDYSTAFW+YMELS+RAYRLDS DDLTKPFV
Sbjct: 301 VQPTVLCALAEVVSMRFINGVGARPGIMGIDYSTAFWLYMELSYRAYRLDSTDDLTKPFV 360
Query: 361 SMYFGAAYFVWLSEYEGRERTRQFVFQAYISGPQNVDLQEPGPLWLKFEEALSSYEDTKI 420
SMYFGAAY WLS+YEGRERTRQFV QAYI+GPQNVDL E GPLWLKFEEALS+YED K
Sbjct: 361 SMYFGAAYLAWLSDYEGRERTRQFVVQAYIAGPQNVDLPETGPLWLKFEEALSNYEDNKS 420
Query: 421 GTQGSCSVM 429
G QGSCS+M
Sbjct: 421 GAQGSCSIM 423
BLAST of MC04g0022 vs. ExPASy TrEMBL
Match:
A0A1S3BGL4 (uncharacterized protein LOC103489631 OS=Cucumis melo OX=3656 GN=LOC103489631 PE=4 SV=1)
HSP 1 Score: 770 bits (1989), Expect = 8.17e-280
Identity = 374/429 (87.18%), Positives = 392/429 (91.38%), Query Frame = 0
Query: 1 MAVGFKYWDDCVDPQDMEAMWSCPQVCAEWLDAGESKEQKVHLSRDPDGQPYLTQTEMKA 60
MA+GFKYWDDCVDPQDMEAMWS PQVCAEWLDAGESK QKVHLSRDPDGQPYLTQTEMKA
Sbjct: 1 MAIGFKYWDDCVDPQDMEAMWSYPQVCAEWLDAGESKTQKVHLSRDPDGQPYLTQTEMKA 60
Query: 61 VADIIVQRHFDSNIDSEMICAIAELESDRQLLATRYDKKTKETTLGIMQLTLKTAEWLIS 120
V DI+VQRHF S IDSEMICAIAELESDRQ LATRYDKKTKETTLGIMQ+TLKTAEWL+S
Sbjct: 61 VTDIVVQRHFGSKIDSEMICAIAELESDRQPLATRYDKKTKETTLGIMQITLKTAEWLVS 120
Query: 121 ELGYQSYVLEGNPDVLKKPFVSVYFGAAYLKWLSNFEQKERNEEFVVRAYRGGTKKATHK 180
ELGYQSY LEGNP+VL KPFVSVYFGAAYLKWLSNFEQKER+EEFVVRAYRGG KKATHK
Sbjct: 121 ELGYQSYGLEGNPEVLNKPFVSVYFGAAYLKWLSNFEQKERSEEFVVRAYRGGIKKATHK 180
Query: 181 TTLQHWKRYQSVKESLPSRKHINEGIVMSEASSSTTSPPPASGNTEGAAIIYTFWDSRAT 240
TTL +WKRY SVKESLPSRKHINE S+S SPPPASGNTE AAI YT WD RAT
Sbjct: 181 TTLPYWKRYLSVKESLPSRKHINE------VSTSAASPPPASGNTEDAAITYTSWDCRAT 240
Query: 241 PEDMEEMWNNPDVLKEWTKSGEKKGNVRFSHDAKKRPYLSRVELKAVAEIILSKHFSTKG 300
PEDMEEMWNNPDV KEWTKSGEKKG VRFSHD KKRPY+SRVELKA+AEIILSKHFSTKG
Sbjct: 241 PEDMEEMWNNPDVQKEWTKSGEKKGKVRFSHDLKKRPYVSRVELKAIAEIILSKHFSTKG 300
Query: 301 VQPTVLCALAEIVSMRFINGAGGRPGIMGIDYSTAFWIYMELSHRAYRLDSADDLTKPFV 360
V+PTVLCALAE+VSMRFING G RPGIMGIDYSTAFW+YMELS+RAYRLDS DDLTKPFV
Sbjct: 301 VKPTVLCALAEVVSMRFINGVGSRPGIMGIDYSTAFWLYMELSYRAYRLDSTDDLTKPFV 360
Query: 361 SMYFGAAYFVWLSEYEGRERTRQFVFQAYISGPQNVDLQEPGPLWLKFEEALSSYEDTKI 420
SMYFGAAY WLS+YEGRERTRQFV QAYI+GPQNVDLQE GPLWLKFEEALS+YED K
Sbjct: 361 SMYFGAAYLAWLSDYEGRERTRQFVVQAYIAGPQNVDLQETGPLWLKFEEALSNYEDNKS 420
Query: 421 GTQGSCSVM 429
G QGSCS+M
Sbjct: 421 GGQGSCSIM 423
BLAST of MC04g0022 vs. TAIR 10
Match:
AT1G16290.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast, vacuole; EXPRESSED IN: 19 plant structures; EXPRESSED DURING: 11 growth stages; CONTAINS InterPro DOMAIN/s: Lytic transglycosylase-like, catalytic (InterPro:IPR008258); Has 171 Blast hits to 155 proteins in 40 species: Archae - 0; Bacteria - 54; Metazoa - 0; Fungi - 0; Plants - 55; Viruses - 0; Other Eukaryotes - 62 (source: NCBI BLink). )
HSP 1 Score: 577.0 bits (1486), Expect = 1.2e-164
Identity = 274/429 (63.87%), Positives = 342/429 (79.72%), Query Frame = 0
Query: 1 MAVGFKYWDDCVDPQDMEAMWSCPQVCAEWLDAGESKEQKVHLSRDPDGQPYLTQTEMKA 60
MA F +W+DCV+P+D+E MW P V AEW+D GE+K QKVHLSRDPDGQPYLTQTEM+A
Sbjct: 1 MANSFTFWNDCVEPEDLEEMWMDPAVSAEWIDVGETKGQKVHLSRDPDGQPYLTQTEMRA 60
Query: 61 VADIIVQRHFDSNIDSEMICAIAELESDRQLLATRYDKKTKETTLGIMQLTLKTAEWLIS 120
V+DI V+RHFDS +DSEMICAIAELESDR+ L RY KKTKET LGI+Q+ KTA WL
Sbjct: 61 VSDITVRRHFDSILDSEMICAIAELESDRKPLIMRYSKKTKETGLGILQVFEKTAAWLAG 120
Query: 121 ELGYQSYVLEGNPDVLKKPFVSVYFGAAYLKWLSNFEQKERNEEFVVRAYRGGTKKATHK 180
GYQ+Y ++ NPD+L KPF++VYFGAAYLKWL++++ +R+EEFVVRAY GGTKKATHK
Sbjct: 121 GQGYQAYNVDDNPDLLHKPFINVYFGAAYLKWLTDYQNNQRSEEFVVRAYNGGTKKATHK 180
Query: 181 TTLQHWKRYQSVKESLPSRKHINEGIVMSEASSSTTSPPPASGNTEGAAIIYTFWDSRAT 240
+TL +WKRY +VKESLPSRKH + G +S T+P NT+ +T+WDSRA+
Sbjct: 181 STLPYWKRYLAVKESLPSRKHGDAG----PSSFRPTNPASPGSNTD-----FTYWDSRAS 240
Query: 241 PEDMEEMWNNPDVLKEWTKSGEKKGNVRFSHDAKKRPYLSRVELKAVAEIILSKHFSTKG 300
PEDME+MWN ++ KEWTKS E++G VRFS D +KRPYLSR ELKAVAEII+SK+FSTKG
Sbjct: 241 PEDMEDMWNQSEICKEWTKSKEERGKVRFSQDGEKRPYLSRGELKAVAEIIVSKYFSTKG 300
Query: 301 VQPTVLCALAEIVSMRFINGAGGRPGIMGIDYSTAFWIYMELSHRAYRLDSADDLTKPFV 360
++ ++CA+A+ V MRF+NG GI+G+DYSTA W+Y EL +RAYR+DSADDLTKPF+
Sbjct: 301 IRVPLVCAIADTVCMRFVNGIKKHVGILGVDYSTASWLYSELGYRAYRVDSADDLTKPFI 360
Query: 361 SMYFGAAYFVWLSEYEGRERTRQFVFQAYISGPQNVDLQEPGPLWLKFEEALSSYEDTKI 420
SMYFG AY VWLSEYEG +R+ QF+ QAY+ GP +VDL+E PLWLKFE+ALS YE++K
Sbjct: 361 SMYFGVAYLVWLSEYEGSQRSNQFIVQAYMKGPDHVDLEESCPLWLKFEQALSYYEESK- 419
Query: 421 GTQGSCSVM 430
GSC ++
Sbjct: 421 RDSGSCVIL 419
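The identity and positives percentages in the HSP header lines are the ratio of matching (or similar) positions to the alignment length, for example 377/429 = 87.88%. A minimal sketch for recomputing them from such a header line follows; the function name `hsp_percentages` is hypothetical, and the pattern deliberately tolerates the misspelling "Postives" that some exports of this report contain.

```python
import re

# Matches e.g. "Identity = 377/429 (87.88%), Positives = 395/429 (92.07%)".
# "Posi?tives" also accepts the misspelled form "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def hsp_percentages(line: str) -> tuple[float, float]:
    """Recompute (identity %, positives %) from an HSP header line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, pos, pos_len = (int(g) for g in match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


line = "Identity = 377/429 (87.88%), Positives = 395/429 (92.07%), Query Frame = 0"
print(hsp_percentages(line))
```

Comparing the recomputed values with the percentages printed in the report is a quick consistency check when the alignments are parsed programmatically.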
The following BLAST results are available for this feature:
BLAST of MC04g0022 vs. NCBI nr
Match Name | E-value | Identity (%) | Description
XP_022148059.1 | 0.0 | 100.00 | uncharacterized protein LOC111016834 isoform X1 [Momordica charantia]
XP_022148061.1 | 0.0 | 100.00 | uncharacterized protein LOC111016834 isoform X3 [Momordica charantia]
XP_022148060.1 | 0.0 | 99.30 | uncharacterized protein LOC111016834 isoform X2 [Momordica charantia]
XP_038886952.1 | 8.75e-282 | 87.88 | uncharacterized protein LOC120077127 [Benincasa hispida]
XP_004139838.1 | 3.56e-281 | 87.18 | uncharacterized protein LOC101215745 [Cucumis sativus] >XP_011659044.1 uncharact...
BLAST of MC04g0022 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1D485 | 0.0 | 100.00 | uncharacterized protein LOC111016834 isoform X3 OS=Momordica charantia OX=3673 G...
A0A6J1D2W1 | 0.0 | 100.00 | uncharacterized protein LOC111016834 isoform X1 OS=Momordica charantia OX=3673 G...
A0A6J1D1V8 | 0.0 | 99.30 | uncharacterized protein LOC111016834 isoform X2 OS=Momordica charantia OX=3673 G...
A0A0A0K3L0 | 1.72e-281 | 87.18 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G239030 PE=4 SV=1
A0A1S3BGL4 | 8.17e-280 | 87.18 | uncharacterized protein LOC103489631 OS=Cucumis melo OX=3656 GN=LOC103489631 PE=...
BLAST of MC04g0022 vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G16290.1 | 1.2e-164 | 63.87 | FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...