Moc08g00890 (gene) Bitter gourd (OHB3-1) v2

Overview
Name: Moc08g00890
Type: gene
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: NADH dehydrogenase [ubiquinone] flavoprotein 1, mitochondrial
Location: chr8: 535326 .. 549158 (+)
RNA-Seq Expression: Moc08g00890
Synteny: Moc08g00890
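
As a small illustration, the location string in the overview can be split into chromosome, start, end, and strand. This is a minimal sketch, not part of the database: `parse_location` is a hypothetical helper, and the span calculation assumes 1-based inclusive coordinates (the GFF convention), which this page does not state.

```python
import re

def parse_location(text):
    """Parse a string like 'chr8: 535326 .. 549158 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("chr8: 535326 .. 549158 (+)")
# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end - start + 1
print(chrom, strand, span)  # chr8 + 13833
```

Under a 0-based, half-open convention the span would instead be `end - start`, so the convention should be confirmed before comparing against the sequence length.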
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATAATGCCTCCTTCTCCAGCATTGAGGAGCTCTCCTGGTAGAGAGCTCAGAGGCAGCAATCATAAGCGAGGCCACAGCTTTGAGAGTGGCATGTGCATAAGAGAGAAAGATGACGATCTTGCTTTATTCAATGAAATGCAGACTCGAGAAAGAGAGAGTTTCCTGCTTCAGTCGGCAGAGGACTTGGAGGACTCATTTTGTAATTACGAGCTAACTCAATTTTATCATTTATATTTTTTTTTGCTGCCAAATATAAAAACACGACCATTTAATTAGGCTGTCAATGTTGCACTTCTCATGAACTTTTTGATAGTTTTCTGCAGCTACAAAGTTGAGGCACTTTTCCGATATCAAGCTTGGTATCTCCATTCCTGTTCGTGGAGAGAATAGTGAATTGCTTAATGTAGATGGGGAGAAAAATGACTATGACTGGTGAGTGGCCTTATTCCTGTTCCAAGTGACTTACTCTTAAATCACCAATTGCTTATAATTATTAGGAGTGTGTGTATATGTATGTGTGTGTTTGGTCCGTATAACTTCTTGCTAATTCATACTTGTGGCCGCTACCTATTTACTTTCTGCCGAAGTGGCTTGTGATAATTTTTGGTGGACTACTGTGCCGGTACTTGTTTATCATATGGTTGTTTGAAGCTGAATAATCAATCCCCTTGCTCTTGGCTCTCACAAAATGTTCTGTATTATCCATCTCAAGGTTGTTAACACCTCCCGACACCCCTCTTTTCCCTTCATTGGACAATGATCCGCCTCCAGTTACTCTTGCAAGCAGGGGGAGACCTCGTAGTCAACCCATTTCCATATCACGATCATCCACGGTATGTGATTAAACTGTTTATATAGAATATAATTGTGTTGCGATCATGTTGGATAAGTCTCTAAAAGTGTCTTTTTACAATTGAGCTATTTGATGTTCTGTACTTCTTTGTTGAAACACTGAAGAGTTTTGTTTTCATAGATGGAAAAAAGTCACAGAAGCAGTACGAGTAGGGGTAGTGCAAGTCCTAATCGTTTAAGCCCATCGCCTAGGTCAGCAAGCAGTGTGCCTCAAATGCGGGGAAGGCAACTGTCAGCCCCGCACTCTAGCCCAACTCCAAGTCTACGACATGCCACACCATCTAGAAGATCAACTACCCCTGCAAGAAGATCATCACCTCCTCCAAGTACACCATCAATATCTGTGACAAGGTCTTCCACCCCAACTCCCAGGAGGTTGAGCACAGGGTCGAGTGGAACTTCTACCACATCTGGGGCAAGGGGAACTTCACCTATAAAGGCAGTAAGAGGAAATTCTGCTTCACCTAAAATAAGAGCATGGCAAACTAATATTCCTGGTTTCTCTTCTGATGCTCCCCCTAACCTTCGAACATCCCTGGCTGATCGGCCAGCATCATATACGAGAGGATCTTCACCAGCTTCTCGAAACAGTATGGACCTTCAATACAAGTACAGTAGGCAATTGATGTCTCCAACTGCTCCTATCAGCTCATCCCATAGTCACGATCGAGATCGCTATAGCTCTTACAGTAGAGGTTCGAAGGCCTCATCTGGTGATGATGATTTAGACTCCCTGCAGTCAATTCCTACTAGCAGTTTGGATAATTCATTGTCAAAAGGAGGGAATACATTTTCAAATAACAAAGCTCTGACCATATCAAAGAAACACAGAATAGTGTCTTCTACTTCCACTCCCAAAAGATCCCTCGATTCCACTATTCGACAATTGGTATGGATTTTATTTTTTAGTTTTTGAATGATATGATATTTTGTACTTTGGAACTCACATTTTGGTTAACATAAACAAACTGTAAAGTTTGTTTTTTTTTCTCAGATTAGTTGTTACATTCACGTGACATACTTTTATTGGGACTGTTTGAATGTTAGTATTTAAAATTTAACATGGTTATTTTATGAACAATTTGAGAAACTGGAAAAAAGATTTTCAACTCTTTTAGTGACTTAGTGCGTAAGTTTTCTGCAAA
GTACAATTGATTGTTTGTTGATCTTGTTATTCTCCCGAGTAAGTTCATGTTAGTAAACAGTTTAATGATGTGGAATTTGGCAGAAAAAAATATTTCCATTTCAATGTACTTTGTGAGTCTGACTCATCTTAGATAATCACATTTCTTCTTTACTTGTGTCAGTCATGGTGTGCTTTTATAGTTTCACTGGAGTTTACATGACAACTAATGGTGCATATGCACTTCCTTATTTTTCTCCAGGATCGAAAGAGCCCGAATATGTTTAGGCCACTTTTATCAAGTGTCCCCAGTACCACCTTTTATACTGGCAAGGTGAGTTCTGCCCATCGTTCTCTGATTTCCAGGAACTCCTCAGTCACAACTAGCAGTAATGCAAGTTCTTATAATGGTGCAAGTATTGTACTTGATACAGAAGTGAGTGACCTAAATCAGGATGATATGGCAAATGAATGTGAGAAAGTGCCATATCATGACATTCATGAAGAGATATTCGCTTTCGATAAGATGGATATTGTTAATGAAAATCCCATTAATGATATCAAGTCACTTGATAGTGGCCCTGCACCTGGTTGTGATCCTGTTCTCACTGAAGATAGCAGTCACCAAACTATTATCCCAGAAATAAGTTCCACTTTTGACTCTTCTCGTGCCCAGGGGAATGCTTTTTCAGAGGTTGTTTGCCTTGATGATATAATTTTGTGTCCCAGATGTGGTTGTAGGTATTGTGTCATTGACACAGAGGAAAATTACATTAATCTTTGTCCAGAATGCAGTAGGAAAGAAAAATACCTTGGCATGACCCTTTTGGAAAATATGACTTCAGTTACTGAAAGCATATCAGGGTATTCAATAAAGTATGAAGCAGGTAAGCCTTTCAATAAGGTGGAGTCAGGGGTGATTTCGCTTGAATCTTCCCTAGCTACTGATTTGGGTGAATCTAGGATTTCCGAGTCTCTGGGCAATGTTGAGCAAGATCAAGCATCTTATCCTGAAGAAGGCTTGAGTTACCAGAAAGAAAACTTCCCCTCAGAAACACCGGTGAGCGAAAGTCAGCATAGCCTTATCAACCATTCAGAGATAGGCCAATTAGTTGTTAGTGGCAGTCAGTCCAACACTGAATCTGGATATCAGCAACCTCTTCATCATAACGACTATAAAGATTTGAGGTTTGATTCATCAGAAGGTGCAGGTATTTCTATATTGTTAAAGAGATCGAGCAGCAGTAAGGGCCCTATTGTCCAAGGAAGAACTTTTACTACAAGTACCATATCTTATGATGATCTGTCTTTTGCAAGAGATAGCATGAGCAGTTTGAGAAGTTCTGTTGGACACAGCAGTTTTTCTGCATCATCATCGGCTGATTTCAGCTCAGCCAGACAGATTGAAGCCCGAATACAACGTCAGGTAAGTTCAAGGAAAGGGGAATTAGAGAGCAAAAAGGGTGAAATTTGCGTGAAATCTCATATCTCTGAGGCAGCTTCTTCTGGAACACCTACCAATGCTCATCCTGTATTAGGCTTTGAAACTTGTGAGCAAGAGGAAAATTTGGATTTTACTGTGGCCAATTTAGAATGTTTTTCCAGTCAGGGAACTACTAATTCTTCTCAGAAACCTGAACTAGCTTCTGAAAATGCTGAATCAGATGACACCTCTTCAATTGTGGTTGCTGTTGTAGAGGAGGATAAATTTGAATGCGACAACCGTAGAATACTGGACACTTGTACCTCGGAATCGTCAAGGTATGCCCTCAGCAACAATTTTGATTTTTTTTTTGGGACATTCAATCCAAAATGGTTTATTATACTTAACATGTGAACAGGGAGGACTTATCAGGTGGTAGGAGTGTCTCAGATAAAGAAGCACCAGTTACAACTTCCGACTGTTCCAAATTGGAGGGACACAACATGCCGGATGTTAGTGCGTTTGAAGATGAACAACATCCCAATCATTTAATGACCACAATATCAGAAAAAGAAACGAAACAAATAGCTGAGGTGATA
GCACCTGGTTCACAGAGTGATTTATCAATAATATCAAAAAGTCTCCTGGAGGAGGAATCTATGGTTCCTAGTGGGCCTGACGAGGATTTAACACCACCTGTTATTAATACTGAAAAATCTTATGGTATCCTAGGTATGGAAGTTTCTTTGGTTGCGATTTATATGCTTTTCAGCTTTTGTTTAAAATCTCTCCTTGCATCTCTTTTAAGTTCCTAAATTTCTTTAATACTGTGATCTTGAGATTATTTGACTAATCCTGGGTTTTAATCATGTCCTCTTCTAAAATTACTTTAAACTTATTTTGATGAATTTTCACGTTGTCTCCGATCCCTGATAAAACTTTCTGTTTGGCAGAAGAATCAACAGTAATTGTCGATTACCAGGGCAGAAGAAAGGTGGTGAGAAGCTTGACACTTGAAGAGGCAACAGATACAATTCTTTTCTGCAGCTCTATTGTTCATGATATAGCCTATTCAGCTGCTTCCATAGCAATTGAAAAGGAAAACGAGGTTACATTGGAAGGCTCACGGCCGACGGTTACCATTTTGGGAAAATCTAACACTGACAGAAGCGATCTACGCAGTAGAACAGGCGGCAAACGGGTCATGAAATCTCAAAAACTGAGACAACGGCATGTGGAAATGAGTACAAAGCCTCCTGTTACGAAGACAGAGAATGACGAGAACACCGACGAGTCCACCATTCGAAATGTAGGTCTTCCTAACCAAGTGGACAGCATGAAACCTCTAAAGCTGGAATCCAAATGCAACTGCAGTATAATGTGACTCCGATGAAGACTGTCCTTGTTTGTTCATCCTTCCCAGCTCCTACGACAGATCTCGCTACCACATATTTAACAAAAAAGACCTCATGTTCTTGGGCCTGATAGTCACTTGGTTTCAACCGAATCCAGTTTAATGTATAAGAAATGCCACGTCGAATTGGTGACGTAAATGTAAAGTTATTTCGTCGTCTTTTTTTTTTTCTTTTGTTTTCTGATTGTTGTTTAGGGCTGATGATATTTACTTCGAGAGGCCGTAGGGATTTCATTTCTTCAGTTTGTTTGCTCCAGTTGCCACATTCTTGTTGATTATTCTTATTTGATTCAAGTGCTCCGAAGCAGTGTTGAATATAAAGCCTCTCATTCATATTCATTTTCCATATTGTAAAGTGTACCTCTATTCAAAAGTAACCTCTGATTTGGCTCTCTCTCTTCATCCCAATCTCATTTGATACTATATTATGTCACAAATAACACCAAAATTCTTCTGATGCTAACAATTTTTCTTTTCTTTTATTTATTTTTCTTCTCTCCCTGATATTTCAGGGTTCAATCATTGACGGAAGTTATGTACAGTTAGAGAGATATTTGACTCAGTCAAGCTGGTGCAACAGTACTACTATAGCCATTGAGCTGCAGTTCTGTCTGATTGCTGATGCTGCTGGCTGCTCCAACAGTTTGTAAAACTTGGTCAAGTATGTCTCATACTGTCGCAATTTCATTTATTAGACTAACCTAATTGTAGTACAAACAACCAATGTGAGCCCATCTTGGCCATGTGAAAAGGCAAAATAAAGAATAAAGGCAAAGTAGAGAAAAAAAAGGCCATCACTAAGAAACCCCACCTTCAATCTTAAACTGCCTAACATGCTGGCCACTTTTCCTTTTGAGAAAATAAAATTAGTAATGCCTCTATTTGGAAGTTTTGAACTGGAGTTAAGTATATAATATTTATGTCCATTCTTCTTGTTTTATCTTTATTTTTATTTTATATTTTTAAAGAAAGACGTATGAGTGCCTTCCCAAAGAGTTGATGTGAGCAATTGATTGGCTCTCACCATTTGCTTATTCGCTTTGATATCTGTGGAAAAAAAGGAAATTAAGCCTTTTCTTGTCAAGGATTCTTTTTTGACGATCCTGATCTGCATATCTCACTGCAATACCAAAACAAGCAGGTTGAGGAGAACAACTGTGACCTCTGGATATCTTAAGATATTGC
TCATCCATCATGTTAAATAAAGGTAACGAATCTAGGTGGCTTTTTTAGTGTTGAGTACGTCTGGTTTTTTGATATGCACACAAATATTATAATATTTCTAGTTTTCTAGCATTTTGAAACCATTATTTTCATTTTAGATCTGTCCATGTTTTATTATTATTATTTACCGTTTGAAGAGAATGCCAAGGTTTTTAAATGAAACGAATGCAGGTGTTCAATTGTTAATCATATTACTTGTACATTGTGCAATCCTGTGTCTTTGCAGTTCTTGCTTGGGTTGGAAGAACAAGGTTTTCCTTAGGCAAGATCTGGAAGGAACTTCTAAAAGTTCTTCAGGTAAGCGGATACTAATTTGGTTTTGGATGAAGGGTACTCTGTTATGTCATTCGTCAATATTCCCAGTATACTTAAGTAGGAAGAAGTTAGATATGCAGAAATTGATCATTGCACATTATTTTAATCTCCAGGAGGTAAGCTCAGTTTAATTTTCCGTTTATTCTCTTTAAGCACATTGAAGAGTATCAGTTGCTCCATTCATTCTCATCGAGCTTGTCTAGTAATGACTTGGGCTAGTCTTATTTCCGAAGTGTGCGAAGTGCTGGGGATGATCATAACTTAATTTGCATTTTTCTCGAATTTGGAGAATTTTAGTAAAAAAGTAGCCCCTCTTTATGGATTATTATATCATTTTTCCTTACTGAGTCTTCTTGATATAGTTCCAAGTCTTTGCCCTATCTTTGAAAACCCTTATGCTTCTCTTTTTAACCAGATATTCTAAGAGCAAGGGAAGAATATGCTACTAACACAATCTTCTACTTTTCCAAAAAAGCATTCAAATGATTCCCACACGAAACCTCCTTTGATGACCTCTCCCTACCCTGAAATACCTTTTGTTTCTCTCCAACCATGTATTCATATCTCCTGTAACAGAGATTGGCAATCATGGCCAGGTGCCAACACTGTACCCTAAACAATTGCAGGAACTTGAACTAGATCTTCGTGGTGTAAGGATAAAGTTAGACTATATATTCTAATCCTGACATTTTCAGAGGACGCATGAGAAAAAAGAACCAAAGTGGAAGAACTCTGGACAAAGTCGAAGGCGGTAACACCCCCCCCCCCCCCCCCCCCGGCAACAAGCTAAGGGAAGAACTTGATTTTCATCCTTATTTCAACCTTGGACAGATGATCAAAATGAGGAAGAGAACATTTGGAAGAAAGGGGATTTACAAGAAAACATCATATCTGATGTTCCCCGTACCCAAGATTAATTTAATTTGAAACTAAACCATCATGAGATGGTTCAAGTTGTCAGTGAGGGCCAATGTAAATAGCAAAAGACTTAGAGAGAATGAGTTCAAACCATGATGGCCACCTACCTAGGATTTAATATCCTACGAGTTACCTTATCAATCAAAACTCGTCAATCAAATGTAGTAGGGTCAGGCGGTTGGCCCGCGATTAGTCGAGGTGCGCACAAGCTGGTTCGGACACACGGATTTCAGAAAAAAAAAAAAATTAGTTTGAAACTATTGGAGGATAGATAAAAGAGAGGACAAGTTAACTGCTTCCATGTCTCCTATGAAAATAAAATTTAAAGGAGAGAACCCATGAAAGAAATCATAGAAGCGATAGGTTATACTGGCAAGGTTAAAAACATATCTAAGAGGATAAGTGCCAATAAAATAGTTTTCTTACTAGCATCTTCCTATCGGAAAGATATATGAGGATGGGGCTAAGGGAAGAGGCAATGAGGGTGAGTGTCTCAGTTTAGGGGTAAAAATCTTTATTTTTTGGTCTACTATTAATTATGTGGAATAATTAGCAAAGAGATGCATTCCAAATCCGTCCATTCCAAATATTTGGCCAAAGCTTTTTAAGTTCCGAAATAGCATCAAGAATAAGATCAATTGGAAATTAGTTATATGTTCTGTATGCAATAAAATAAAAAGAATTATAACAGATCTGACGCTTGTGAGAAACCTGGGCATGGATCTGCT
AGTTTGGTATATAGATAACCAAATCTCAAAATTGATACTCTGATGAGTTAAAAACTGGAAATCCTTTGGTGTTTTTGAAGTCCCAAGAGCCTGGCATTAGATCAAGCATGGTATTTAGTAGTCAGGGAGGGGTCCGATCTGGACACGTAAAGTTATTGTCTAAAAGTCTTTAACAAGTATCAGAATTTTACTGTGTTCAACACATGTTCGATGTGGAGTGTGGACATATTATTCAAACTAAATTGTTTTAGCTTCCTAGCTGTTTGTGCACCAAATGGTCTCAATCTAAGGATGTAAATTGTATTAGTTTCCTAGATATCTTTTAGATGTAGTTTATTAAATATTTTCTTTGCTTCTGTCGTCTTGCTAATATTCTGTCCATGCAGGGGATTTTCACATCGTGCAATGGGGCAAGACTAAATTAGCTATATTTTTGGAGTTTGGCTTCGATAATTTGATATGGTCTCCTTGGACAATGGCTTTGGATGTTTCCCTTGGAAAACATATTCTTTGGCATAAAGTTATAACCCTCAAATATGGTGTTCTTTCAGGAGGTTGGTGGGCACATCGTGGCATCATCCTCCTGCAAGGCAGTCGAAAGGTTTGGATTTCCATTATTCCATTATTTAGTTGCAAAAGATCGTAGTTGATTGATTGGCTTCCTTAGATATTTGAAGTGGTACAAGCATGAGATATTGGGAAGATGTTGCAGGGCAATTGGCGTTTGCATGGGGCTCTATAACATCCAGTAGCAAAAGCCTCTTCATAAACCATGTTTTGCATTTTCTCCCAAACCACATAGCACCCGATCCTAAGAAGGAATTGAAATGATGAACAGTTTCAAGAATGGCTTCATTACCTGACTCATTCTGATGATAAATGGCTTCGGGCTTGCGACATTTCTGGAAATTTTGTCAACATATCTCTCGATCTTCCCAGGAGCGGTATTATCGTTCCTGATTTGAGAGAAAATAGATAGTTGCTTGCCTGCAGGTCTGGTGGCAATGAACCATGAAGGGGAAGAATTATAGCATCCATGCAAGATCCTTCTAAATTGTAAACTCTTTCCTCCAACTTTGATACTGACTTCTCTATGTCATCCTTCAAGATTGAGTTGTGTTGATAAAGAGACAATGTGGTGAAAATAATGTCAGACAGATTCCAGATCCCCATCTGGTTGTTGGGTATGTATATCTAGCAAATAATTAATGCAACCTCATCAGTAGTTAAATTTGGAAAGTCTAAAATTGGGGATAGAAAGGATCAATAACATGAGAATTTTTATCTAAAAGTGGATCGATGATAGGAGAACTTTAGATCAACAGCCTTAAAATAGCTTTTCATCCGCTCTACAAATAATTAGAGACGTGGGGGATTGAACTCATAATCTTTTAGTTAGAGACTTGTGTCAATTATCATTGAGTTAAGCTTTACAACGGATAAATCAAACATTCTAATCATACCAATGATAGTTAATGCAACCTCATTAGTAGATATGTATTTGTGATTTTATCAAAATATGTCTTATTTTCTAAATTTTTGACGAATCTATTCAAAATAAATATAGATAAAATCGTCTTTCCATGGAATAATGGTTAAATTGCAAAATTAGTTGTCATGTTTTCTGATAAATTCTTTTTTACTCTTATAATTTTAAAAGTTTTTACTTAGATTTTATAGTTTAGTCGTATCTCGTAAATTATTCATTTTCCTTCTAAGTTGTTGCCTTAATAAATATGCGATTTGTTTTTTAATTTAAATTAATTAATTACATTAAAGTTTAACCAATTGATTTTAATGGAAAAATGTGAATTTAAGCCTAGGTCATATATGCCAAATTTATAACTTTGGGAACTAATTTAAATTTAAGTCTAGATGATAGAGTAAATTGGTAACTTAACCGTATATTTACCCTATCATAAAATTTATTATATTTATCTAATATATTACTAATTGAAGAAATTGCAAAAATAATGCCAGACAAACAGTATGTTGTAATT
ACATTTACCCATTTTCAAATGTACAACAACCAACGTTGTGCCTTATAAAAAAAACAATAATAATAAATAGTGTTGTGAAAAATGGTTAACCTGAATTTAGCTTAATTAGTTTAGGTATATATCCCTACTTAAGGAAACAATGAGCAGAGTGCGTGACAAGAGAGAACTTAAAATTTAGGTCATTTCAATTTTGATTTATTTTGGTACTTGTATTTACAAAATATTCGTTTTGGTTCCTTAATTTTATTTTTTAATCTCATTTTTTTCTATTACAATTTTAATACGACATGAAACTCAATATGTTTCTTGAAATATAATTATGACTTTATATTAAAAGGTTTTCATTGAATGTATAATATGTGTTAAAATATTGAAAATAGGAACCAAAATGATCACTTCATAAATGTACAAAAACGAAAACGGACATTTTGAAAATACAAAGACTAAAATGAGCTTAAGTTGAAATGAACAAAACCTGAAAGTATAGAAACAAAAATAGTATTTTAACCTAACATTTAATGTTTATTAGATTTGTGAATTTTAAAAAATAGTCGAATCACCCAATTAGACTAGATATGAGCGGACCCAAATTCGTTCGAGTTGGTCCACTCGAACCAACCCAAATAGATTCTCCCCTATATATAAATAAGACTTGAGACTATGACGCGAAGTTCAAATAATTCCACTTTTTGCTATTCTCAACAAAATACCTTGCTTATTTGAGTATTGTTGGAGCGTGTTCGCCTCGCAGTAGCACTACACCAATGTACAGATTTATGGTATATAGAGCTATCAATGTTGGATACTAAAATTAAATTCTTACCCTAACACTTATTATACAATTTTTACAATTTTTCAAAGTTCATGAATCTAAATGATAGACGGCTCAGTGTTGAAAGAGAAATTTCGGCTAATCACTAAACGCCACGTCACAAAATTAAAATTACATTTAGGACCAATGATAAAATTACACGTGTCCTAAATTAATTTTAATATCGACGAATCTAATTATGCCACGTGTCTCTCTTATGGCTGTCCGATGAAATCAACTAATTGGATCAAAACTATAAAGAGACTGGACCCAAGCCCATGGCCCATTAAGCCCATGGCCCATATAAGACTCACCAAAACTCTATAAATAGAGGGGAGAGCAAGAAAAAAGAAGAAGATTATATATTAGAGATTGTAACACCCACATACATAAATACAAATAAATACAAATACAAGTTCATAAATTCCAGTTTCTATTTGTTGAATTTTTTGTCGAACAATGAGATGGTAATTTTTCTTAATATATATTTTAATTTTTATATAGTTTTTATTTTGTTCGAGGAAAATTGTCTACTTAAACAGATTGAGCATTTCGATTTTTTTTTATTCCCTGAGCGAAACGAGTAGTCTTTTTTTGATTTTTTTCTCTTAGTGGGTTCTCTTGAAGGCCCGAATGCAGCTAGGTATTTCTACAAAATTCGGAACAAATTAAAATTACGCAAAGCAATCGTTTCGCTGATCTGAACTGAAGTAGAACTCCACAGGGTTCGTCGTTCTTTCTCTCTAATTTCCGGTAATTTTCGGATCTGTTCTGTCTTTTTTTCTTTCTTTGTTGGAATTAGGATTAAGACTAATAAATCTTCTCACGTGTATCTCTTAATCACCACGAAGCCCAGCTCAATCTCAACCGCTTGATGGCTGACTCCATCAAACACCATCGTGATCAATAAATCTGCGTAGTCTTCATCCTCCGTGCCTTCTCACTCCCTTCTTCATCTAAATTCGATTCTGCAAATCACCTTTAATCGCAGGCGTGTCACTGGGTTCAGCTCATCGGAAGCATTTCATACGATGGTAAGAAATATATATTTATTCTGTTCGATTTCTATTTTCTCATTACCTATTTGTCGTTATTTTTATTTCACATTTCATTTTTGAGTTGTTCCTATTTAGTTCAGCGTATGAATTCGAATGTTTCGAACTTGTTATGAAATTTAGACAATAATTTG
TTGTTTCGAGTGCTTTTGATTTCGCTATTTTAGTGTTACCTGTCAGCTGAGGGTTCGAACTGGAAAATTATGGGATCGTACGATGCAACCACTTTTGGGTTGTAGAAATCCTTCAAATTTTATTTCTGTTCGAGGGTTTTTACGTTCTTTAGCATCTGTATGAAAGTTTACAATAGGTCTCTGCATGCTAACCCTTTAGTGGTTTTTAAGTGTACCCCCTTCTTCCTGTTCTTGTTCTTGTTCTTCTTCTTCTTTTTTTTTATATATATTTATAATTTCCCCATTTGAACTGTTATCATTAACTGAGTTTACGTATGTTTGAATAGCCAAAAATTTATTAACTCCGTTGTATATTGATAATTCACTATTCTTCAGGCACCAGCGAGGGGCATCCTCCGTCTTCAACGAACAGCATTGGCAAGGAGTTATACTGATAGGTGGGGGATAGGGTTGAGAGCATTTAGCAATCAGGGTGCTGCAGCTACCAGCAGTCCTCAGCCTCCCTCACCTCCTCCACCTCCAGAGAGAACTCATTTTGGCGGCCTGAAGGATGAGGACCGAATCTTTACCAACTTGTACGGTTTAAATGATCCCTTTCTCAAAGGTGCCATGAAACGTGGAGATTGGTACAGAACGAAAGATTTAGTACTCAAGGGTGCTGATTGGATTGTCAATGAAGTCAAGAAGTCTGGTCTCCGAGGTCGTGGAGGTGCTGGATTTCCATCTGGACTCAAATGGTCATTCATGCCCAAGCTATCTGATGGTCGCCCTTCCTATCTTGTTGTCAATGCTGATGAAAGTGAACCGGGAACCTGTAAGGACAGGGAAATCATGCGGCATGATCCGCACAAACTCTTGGAAGGTTGTCTGATTGCTGGAGTAGGGATGAGGGCTACTGCAGCTTACATATACATCAGAGGTGAGTATGTAAATGAACGAAAGAACCTTGAAAGGGCCAGAAAAGAGGCTTATGAAGCTGGATTTTTGGGCAAGAATGCATGTGGATCTGGTTATGATTTTGATGTTCATATCCACTTTGGTGCTGGTGCTTACATTTGTGGCGAAGAAACAGCTCTATTGGAAAGCCTTGAAGGGAAACAAGGGAAGCCAAGATTGAAACCTCCTTTTCCTGCTAATGCAGGATTATATGGATGTCCTACCACCGTCACAAATGTGGAGACAGTGGCTGTTTCTCCCACCATTTTGAGGCGTGGACCAGAATGGTTTGCCAGTTTTGGAAGGAAGAACAACTCTGGTACAAAGTTGTTTTGTATATCAGGACATGTGAATAAGCCTTGCACTGTTGAAGAGGAAATGAGTATTTCACTGAAAGAACTAATTGAAAGGCACTGTGGAGGGGTTAGAGGTGGATGGGACAACTTGCTTGCTGTAATTCCTGGAGGTTCTTCCGTTCCGTTGCTTCCCAAGCACATTTGTGATGATGTGCTTATGGATTATGATGCACTAAAGGCTGTTCAGTCTGGATTGGGCACTGCAGCTGTAATTGTCATGGATAAATCGACTGATGTTGTCGATGCTATTGCTAGGCTCTCTTATTTTTACAAGCATGAAAGCTGTGGACAGTGCACTCCCTGCAGAGAAGGTACAGGATGGCTGTGGATGATAATGGAAAGAATGAAAGTTGGAAATGCAAAGCTGGAAGAGATCGACATGCTACAAGAGGTTACCAAGCAGATTGAAGGGCACACCATTTGTGCCCTTGGTGATGCTGCGGCTTGGCCTGTGCAGGGTCTTATAAGGCATTTTAGGCCTGAGCTTGAAAGAAGGATCAGAGAACGTGCAGAACGGGAGTTGATCGAGGCTGCTGCATAG

mRNA sequence

ATGATAATGCCTCCTTCTCCAGCATTGAGGAGCTCTCCTGGTAGAGAGCTCAGAGGCAGCAATCATAAGCGAGGCCACAGCTTTGAGAGTGGCATGTGCATAAGAGAGAAAGATGACGATCTTGCTTTATTCAATGAAATGCAGACTCGAGAAAGAGAGAGTTTCCTGCTTCAGTCGGCAGAGGACTTGGAGGACTCATTTTCTACAAAGTTGAGGCACTTTTCCGATATCAAGCTTGGTATCTCCATTCCTGTTCGTGGAGAGAATAGTGAATTGCTTAATGTAGATGGGGAGAAAAATGACTATGACTGGTTGTTAACACCTCCCGACACCCCTCTTTTCCCTTCATTGGACAATGATCCGCCTCCAGTTACTCTTGCAAGCAGGGGGAGACCTCGTAGTCAACCCATTTCCATATCACGATCATCCACGATGGAAAAAAGTCACAGAAGCAGTACGAGTAGGGGTAGTGCAAGTCCTAATCGTTTAAGCCCATCGCCTAGGTCAGCAAGCAGTGTGCCTCAAATGCGGGGAAGGCAACTGTCAGCCCCGCACTCTAGCCCAACTCCAAGTCTACGACATGCCACACCATCTAGAAGATCAACTACCCCTGCAAGAAGATCATCACCTCCTCCAAGTACACCATCAATATCTGTGACAAGGTCTTCCACCCCAACTCCCAGGAGGTTGAGCACAGGGTCGAGTGGAACTTCTACCACATCTGGGGCAAGGGGAACTTCACCTATAAAGGCAGTAAGAGGAAATTCTGCTTCACCTAAAATAAGAGCATGGCAAACTAATATTCCTGGTTTCTCTTCTGATGCTCCCCCTAACCTTCGAACATCCCTGGCTGATCGGCCAGCATCATATACGAGAGGATCTTCACCAGCTTCTCGAAACAGTATGGACCTTCAATACAAGTACAGTAGGCAATTGATGTCTCCAACTGCTCCTATCAGCTCATCCCATAGTCACGATCGAGATCGCTATAGCTCTTACAGTAGAGGTTCGAAGGCCTCATCTGGTGATGATGATTTAGACTCCCTGCAGTCAATTCCTACTAGCAGTTTGGATAATTCATTGTCAAAAGGAGGGAATACATTTTCAAATAACAAAGCTCTGACCATATCAAAGAAACACAGAATAGTGTCTTCTACTTCCACTCCCAAAAGATCCCTCGATTCCACTATTCGACAATTGGATCGAAAGAGCCCGAATATGTTTAGGCCACTTTTATCAAGTGTCCCCAGTACCACCTTTTATACTGGCAAGGTGAGTTCTGCCCATCGTTCTCTGATTTCCAGGAACTCCTCAGTCACAACTAGCAGTAATGCAAGTTCTTATAATGGTGCAAGTATTGTACTTGATACAGAAGTGAGTGACCTAAATCAGGATGATATGGCAAATGAATGTGAGAAAGTGCCATATCATGACATTCATGAAGAGATATTCGCTTTCGATAAGATGGATATTGTTAATGAAAATCCCATTAATGATATCAAGTCACTTGATAGTGGCCCTGCACCTGGTTGTGATCCTGTTCTCACTGAAGATAGCAGTCACCAAACTATTATCCCAGAAATAAGTTCCACTTTTGACTCTTCTCGTGCCCAGGGGAATGCTTTTTCAGAGGTTGTTTGCCTTGATGATATAATTTTGTGTCCCAGATGTGGTTGTAGGTATTGTGTCATTGACACAGAGGAAAATTACATTAATCTTTGTCCAGAATGCAGTAGGAAAGAAAAATACCTTGGCATGACCCTTTTGGAAAATATGACTTCAGTTACTGAAAGCATATCAGGGTATTCAATAAAGTATGAAGCAGGTAAGCCTTTCAATAAGGTGGAGTCAGGGGTGATTTCGCTTGAATCTTCCCTAGCTACTGATTTGGGTGAATCTAGGATTTCCGAGTCTCTGGGCAATGTTGAGCAAGATCAAGCATCTTATCCTGAAGAAGGCTTGAGTTACCAGAAAGAAAACTTCCCCTCAGAAACACCGGT
GAGCGAAAGTCAGCATAGCCTTATCAACCATTCAGAGATAGGCCAATTAGTTGTTAGTGGCAGTCAGTCCAACACTGAATCTGGATATCAGCAACCTCTTCATCATAACGACTATAAAGATTTGAGGTTTGATTCATCAGAAGGTGCAGGTATTTCTATATTGTTAAAGAGATCGAGCAGCAGTAAGGGCCCTATTGTCCAAGGAAGAACTTTTACTACAAGTACCATATCTTATGATGATCTGTCTTTTGCAAGAGATAGCATGAGCAGTTTGAGAAGTTCTGTTGGACACAGCAGTTTTTCTGCATCATCATCGGCTGATTTCAGCTCAGCCAGACAGATTGAAGCCCGAATACAACGTCAGGTAAGTTCAAGGAAAGGGGAATTAGAGAGCAAAAAGGGTGAAATTTGCGTGAAATCTCATATCTCTGAGGCAGCTTCTTCTGGAACACCTACCAATGCTCATCCTGTATTAGGCTTTGAAACTTGTGAGCAAGAGGAAAATTTGGATTTTACTGTGGCCAATTTAGAATGTTTTTCCAGTCAGGGAACTACTAATTCTTCTCAGAAACCTGAACTAGCTTCTGAAAATGCTGAATCAGATGACACCTCTTCAATTGTGGTTGCTGTTGTAGAGGAGGATAAATTTGAATGCGACAACCGTAGAATACTGGACACTTGTACCTCGGAATCGTCAAGGGAGGACTTATCAGGTGGTAGGAGTGTCTCAGATAAAGAAGCACCAGTTACAACTTCCGACTGTTCCAAATTGGAGGGACACAACATGCCGGATGTTAGTGCGTTTGAAGATGAACAACATCCCAATCATTTAATGACCACAATATCAGAAAAAGAAACGAAACAAATAGCTGAGGTGATAGCACCTGGTTCACAGAGTGATTTATCAATAATATCAAAAAGTCTCCTGGAGGAGGAATCTATGGTTCCTAGTGGGCCTGACGAGGATTTAACACCACCTGTTATTAATACTGAAAAATCTTATGGTATCCTAGAAGAATCAACAGTAATTGTCGATTACCAGGGCAGAAGAAAGGTGGTGAGAAGCTTGACACTTGAAGAGGCAACAGATACAATTCTTTTCTGCAGCTCTATTGTTCATGATATAGCCTATTCAGCTGCTTCCATAGCAATTGAAAAGGAAAACGAGGTTACATTGGAAGGCTCACGGCCGACGGTTACCATTTTGGGAAAATCTAACACTGACAGAAGCGATCTACGCAGTAGAACAGGCGGCAAACGGGTCATGAAATCTCAAAAACTGAGACAACGGCATGTGGAAATGAGTACAAAGCCTCCTGTTACGAAGACAGAGAATGACGAGAACACCGACGAGTCCACCATTCGAAATGGTTCAATCATTGACGGAAGTTATGTACAGTTAGAGAGATATTTGACTCAGTCAAGCTGGTGCAACAGTACTACTATAGCCATTGAGCTGCAGTTCTGTCTGATTGCTGATGCTGCTGGCTGCTCCAACAGTTTTTCTTGCTTGGGTTGGAAGAACAAGGTTTTCCTTAGGCAAGATCTGGAAGGAACTTCTAAAAGTTCTTCAGGGGATTTTCACATCGTGCAATGGGGCAAGACTAAATTAGCTATATTTTTGGAGTTTGGCTTCGATAATTTGATATGGTCTCCTTGGACAATGGCTTTGGATGTTTCCCTTGGAAAACATATTCTTTGGCATAAAGTTATAACCCTCAAATATGGTGTTCTTTCAGGAGGTTGGTGGGCACATCGTGGCATCATCCTCCTGCAAGGCAGTCGAAAGTTTCAAGAATGGCTTCATTACCTGACTCATTCTGATGATAAATGGCTTCGGGCTTGCGACATTTCTGGAAATTTTGTCAACATATCTCTCGATCTTCCCAGGAGCGGCCCGAATGCAGCTAGGCGTGTCACTGGGTTCAGCTCATCGGAAGCATTTCATACGATGGCACCAGCGAGGGGCATCCTCCGTCTTCAACGAACAGCATTGGCAA
GGAGTTATACTGATAGGTGGGGGATAGGGTTGAGAGCATTTAGCAATCAGGGTGCTGCAGCTACCAGCAGTCCTCAGCCTCCCTCACCTCCTCCACCTCCAGAGAGAACTCATTTTGGCGGCCTGAAGGATGAGGACCGAATCTTTACCAACTTGTACGGTTTAAATGATCCCTTTCTCAAAGGTGCCATGAAACGTGGAGATTGGTACAGAACGAAAGATTTAGTACTCAAGGGTGCTGATTGGATTGTCAATGAAGTCAAGAAGTCTGGTCTCCGAGGTCGTGGAGGTGCTGGATTTCCATCTGGACTCAAATGGTCATTCATGCCCAAGCTATCTGATGGTCGCCCTTCCTATCTTGTTGTCAATGCTGATGAAAGTGAACCGGGAACCTGTAAGGACAGGGAAATCATGCGGCATGATCCGCACAAACTCTTGGAAGGTTGTCTGATTGCTGGAGTAGGGATGAGGGCTACTGCAGCTTACATATACATCAGAGGTGAGTATGTAAATGAACGAAAGAACCTTGAAAGGGCCAGAAAAGAGGCTTATGAAGCTGGATTTTTGGGCAAGAATGCATGTGGATCTGGTTATGATTTTGATGTTCATATCCACTTTGGTGCTGGTGCTTACATTTGTGGCGAAGAAACAGCTCTATTGGAAAGCCTTGAAGGGAAACAAGGGAAGCCAAGATTGAAACCTCCTTTTCCTGCTAATGCAGGATTATATGGATGTCCTACCACCGTCACAAATGTGGAGACAGTGGCTGTTTCTCCCACCATTTTGAGGCGTGGACCAGAATGGTTTGCCAGTTTTGGAAGGAAGAACAACTCTGGTACAAAGTTGTTTTGTATATCAGGACATGTGAATAAGCCTTGCACTGTTGAAGAGGAAATGAGTATTTCACTGAAAGAACTAATTGAAAGGCACTGTGGAGGGGTTAGAGGTGGATGGGACAACTTGCTTGCTGTAATTCCTGGAGGTTCTTCCGTTCCGTTGCTTCCCAAGCACATTTGTGATGATGTGCTTATGGATTATGATGCACTAAAGGCTGTTCAGTCTGGATTGGGCACTGCAGCTGTAATTGTCATGGATAAATCGACTGATGTTGTCGATGCTATTGCTAGGCTCTCTTATTTTTACAAGCATGAAAGCTGTGGACAGTGCACTCCCTGCAGAGAAGGTACAGGATGGCTGTGGATGATAATGGAAAGAATGAAAGTTGGAAATGCAAAGCTGGAAGAGATCGACATGCTACAAGAGGTTACCAAGCAGATTGAAGGGCACACCATTTGTGCCCTTGGTGATGCTGCGGCTTGGCCTGTGCAGGGTCTTATAAGGCATTTTAGGCCTGAGCTTGAAAGAAGGATCAGAGAACGTGCAGAACGGGAGTTGATCGAGGCTGCTGCATAG

Coding sequence (CDS)

ATGATAATGCCTCCTTCTCCAGCATTGAGGAGCTCTCCTGGTAGAGAGCTCAGAGGCAGCAATCATAAGCGAGGCCACAGCTTTGAGAGTGGCATGTGCATAAGAGAGAAAGATGACGATCTTGCTTTATTCAATGAAATGCAGACTCGAGAAAGAGAGAGTTTCCTGCTTCAGTCGGCAGAGGACTTGGAGGACTCATTTTCTACAAAGTTGAGGCACTTTTCCGATATCAAGCTTGGTATCTCCATTCCTGTTCGTGGAGAGAATAGTGAATTGCTTAATGTAGATGGGGAGAAAAATGACTATGACTGGTTGTTAACACCTCCCGACACCCCTCTTTTCCCTTCATTGGACAATGATCCGCCTCCAGTTACTCTTGCAAGCAGGGGGAGACCTCGTAGTCAACCCATTTCCATATCACGATCATCCACGATGGAAAAAAGTCACAGAAGCAGTACGAGTAGGGGTAGTGCAAGTCCTAATCGTTTAAGCCCATCGCCTAGGTCAGCAAGCAGTGTGCCTCAAATGCGGGGAAGGCAACTGTCAGCCCCGCACTCTAGCCCAACTCCAAGTCTACGACATGCCACACCATCTAGAAGATCAACTACCCCTGCAAGAAGATCATCACCTCCTCCAAGTACACCATCAATATCTGTGACAAGGTCTTCCACCCCAACTCCCAGGAGGTTGAGCACAGGGTCGAGTGGAACTTCTACCACATCTGGGGCAAGGGGAACTTCACCTATAAAGGCAGTAAGAGGAAATTCTGCTTCACCTAAAATAAGAGCATGGCAAACTAATATTCCTGGTTTCTCTTCTGATGCTCCCCCTAACCTTCGAACATCCCTGGCTGATCGGCCAGCATCATATACGAGAGGATCTTCACCAGCTTCTCGAAACAGTATGGACCTTCAATACAAGTACAGTAGGCAATTGATGTCTCCAACTGCTCCTATCAGCTCATCCCATAGTCACGATCGAGATCGCTATAGCTCTTACAGTAGAGGTTCGAAGGCCTCATCTGGTGATGATGATTTAGACTCCCTGCAGTCAATTCCTACTAGCAGTTTGGATAATTCATTGTCAAAAGGAGGGAATACATTTTCAAATAACAAAGCTCTGACCATATCAAAGAAACACAGAATAGTGTCTTCTACTTCCACTCCCAAAAGATCCCTCGATTCCACTATTCGACAATTGGATCGAAAGAGCCCGAATATGTTTAGGCCACTTTTATCAAGTGTCCCCAGTACCACCTTTTATACTGGCAAGGTGAGTTCTGCCCATCGTTCTCTGATTTCCAGGAACTCCTCAGTCACAACTAGCAGTAATGCAAGTTCTTATAATGGTGCAAGTATTGTACTTGATACAGAAGTGAGTGACCTAAATCAGGATGATATGGCAAATGAATGTGAGAAAGTGCCATATCATGACATTCATGAAGAGATATTCGCTTTCGATAAGATGGATATTGTTAATGAAAATCCCATTAATGATATCAAGTCACTTGATAGTGGCCCTGCACCTGGTTGTGATCCTGTTCTCACTGAAGATAGCAGTCACCAAACTATTATCCCAGAAATAAGTTCCACTTTTGACTCTTCTCGTGCCCAGGGGAATGCTTTTTCAGAGGTTGTTTGCCTTGATGATATAATTTTGTGTCCCAGATGTGGTTGTAGGTATTGTGTCATTGACACAGAGGAAAATTACATTAATCTTTGTCCAGAATGCAGTAGGAAAGAAAAATACCTTGGCATGACCCTTTTGGAAAATATGACTTCAGTTACTGAAAGCATATCAGGGTATTCAATAAAGTATGAAGCAGGTAAGCCTTTCAATAAGGTGGAGTCAGGGGTGATTTCGCTTGAATCTTCCCTAGCTACTGATTTGGGTGAATCTAGGATTTCCGAGTCTCTGGGCAATGTTGAGCAAGATCAAGCATCTTATCCTGAAGAAGGCTTGAGTTACCAGAAAGAAAACTTCCCCTCAGAAACACCGGT
GAGCGAAAGTCAGCATAGCCTTATCAACCATTCAGAGATAGGCCAATTAGTTGTTAGTGGCAGTCAGTCCAACACTGAATCTGGATATCAGCAACCTCTTCATCATAACGACTATAAAGATTTGAGGTTTGATTCATCAGAAGGTGCAGGTATTTCTATATTGTTAAAGAGATCGAGCAGCAGTAAGGGCCCTATTGTCCAAGGAAGAACTTTTACTACAAGTACCATATCTTATGATGATCTGTCTTTTGCAAGAGATAGCATGAGCAGTTTGAGAAGTTCTGTTGGACACAGCAGTTTTTCTGCATCATCATCGGCTGATTTCAGCTCAGCCAGACAGATTGAAGCCCGAATACAACGTCAGGTAAGTTCAAGGAAAGGGGAATTAGAGAGCAAAAAGGGTGAAATTTGCGTGAAATCTCATATCTCTGAGGCAGCTTCTTCTGGAACACCTACCAATGCTCATCCTGTATTAGGCTTTGAAACTTGTGAGCAAGAGGAAAATTTGGATTTTACTGTGGCCAATTTAGAATGTTTTTCCAGTCAGGGAACTACTAATTCTTCTCAGAAACCTGAACTAGCTTCTGAAAATGCTGAATCAGATGACACCTCTTCAATTGTGGTTGCTGTTGTAGAGGAGGATAAATTTGAATGCGACAACCGTAGAATACTGGACACTTGTACCTCGGAATCGTCAAGGGAGGACTTATCAGGTGGTAGGAGTGTCTCAGATAAAGAAGCACCAGTTACAACTTCCGACTGTTCCAAATTGGAGGGACACAACATGCCGGATGTTAGTGCGTTTGAAGATGAACAACATCCCAATCATTTAATGACCACAATATCAGAAAAAGAAACGAAACAAATAGCTGAGGTGATAGCACCTGGTTCACAGAGTGATTTATCAATAATATCAAAAAGTCTCCTGGAGGAGGAATCTATGGTTCCTAGTGGGCCTGACGAGGATTTAACACCACCTGTTATTAATACTGAAAAATCTTATGGTATCCTAGAAGAATCAACAGTAATTGTCGATTACCAGGGCAGAAGAAAGGTGGTGAGAAGCTTGACACTTGAAGAGGCAACAGATACAATTCTTTTCTGCAGCTCTATTGTTCATGATATAGCCTATTCAGCTGCTTCCATAGCAATTGAAAAGGAAAACGAGGTTACATTGGAAGGCTCACGGCCGACGGTTACCATTTTGGGAAAATCTAACACTGACAGAAGCGATCTACGCAGTAGAACAGGCGGCAAACGGGTCATGAAATCTCAAAAACTGAGACAACGGCATGTGGAAATGAGTACAAAGCCTCCTGTTACGAAGACAGAGAATGACGAGAACACCGACGAGTCCACCATTCGAAATGGTTCAATCATTGACGGAAGTTATGTACAGTTAGAGAGATATTTGACTCAGTCAAGCTGGTGCAACAGTACTACTATAGCCATTGAGCTGCAGTTCTGTCTGATTGCTGATGCTGCTGGCTGCTCCAACAGTTTTTCTTGCTTGGGTTGGAAGAACAAGGTTTTCCTTAGGCAAGATCTGGAAGGAACTTCTAAAAGTTCTTCAGGGGATTTTCACATCGTGCAATGGGGCAAGACTAAATTAGCTATATTTTTGGAGTTTGGCTTCGATAATTTGATATGGTCTCCTTGGACAATGGCTTTGGATGTTTCCCTTGGAAAACATATTCTTTGGCATAAAGTTATAACCCTCAAATATGGTGTTCTTTCAGGAGGTTGGTGGGCACATCGTGGCATCATCCTCCTGCAAGGCAGTCGAAAGTTTCAAGAATGGCTTCATTACCTGACTCATTCTGATGATAAATGGCTTCGGGCTTGCGACATTTCTGGAAATTTTGTCAACATATCTCTCGATCTTCCCAGGAGCGGCCCGAATGCAGCTAGGCGTGTCACTGGGTTCAGCTCATCGGAAGCATTTCATACGATGGCACCAGCGAGGGGCATCCTCCGTCTTCAACGAACAGCATTGGCAA
GGAGTTATACTGATAGGTGGGGGATAGGGTTGAGAGCATTTAGCAATCAGGGTGCTGCAGCTACCAGCAGTCCTCAGCCTCCCTCACCTCCTCCACCTCCAGAGAGAACTCATTTTGGCGGCCTGAAGGATGAGGACCGAATCTTTACCAACTTGTACGGTTTAAATGATCCCTTTCTCAAAGGTGCCATGAAACGTGGAGATTGGTACAGAACGAAAGATTTAGTACTCAAGGGTGCTGATTGGATTGTCAATGAAGTCAAGAAGTCTGGTCTCCGAGGTCGTGGAGGTGCTGGATTTCCATCTGGACTCAAATGGTCATTCATGCCCAAGCTATCTGATGGTCGCCCTTCCTATCTTGTTGTCAATGCTGATGAAAGTGAACCGGGAACCTGTAAGGACAGGGAAATCATGCGGCATGATCCGCACAAACTCTTGGAAGGTTGTCTGATTGCTGGAGTAGGGATGAGGGCTACTGCAGCTTACATATACATCAGAGGTGAGTATGTAAATGAACGAAAGAACCTTGAAAGGGCCAGAAAAGAGGCTTATGAAGCTGGATTTTTGGGCAAGAATGCATGTGGATCTGGTTATGATTTTGATGTTCATATCCACTTTGGTGCTGGTGCTTACATTTGTGGCGAAGAAACAGCTCTATTGGAAAGCCTTGAAGGGAAACAAGGGAAGCCAAGATTGAAACCTCCTTTTCCTGCTAATGCAGGATTATATGGATGTCCTACCACCGTCACAAATGTGGAGACAGTGGCTGTTTCTCCCACCATTTTGAGGCGTGGACCAGAATGGTTTGCCAGTTTTGGAAGGAAGAACAACTCTGGTACAAAGTTGTTTTGTATATCAGGACATGTGAATAAGCCTTGCACTGTTGAAGAGGAAATGAGTATTTCACTGAAAGAACTAATTGAAAGGCACTGTGGAGGGGTTAGAGGTGGATGGGACAACTTGCTTGCTGTAATTCCTGGAGGTTCTTCCGTTCCGTTGCTTCCCAAGCACATTTGTGATGATGTGCTTATGGATTATGATGCACTAAAGGCTGTTCAGTCTGGATTGGGCACTGCAGCTGTAATTGTCATGGATAAATCGACTGATGTTGTCGATGCTATTGCTAGGCTCTCTTATTTTTACAAGCATGAAAGCTGTGGACAGTGCACTCCCTGCAGAGAAGGTACAGGATGGCTGTGGATGATAATGGAAAGAATGAAAGTTGGAAATGCAAAGCTGGAAGAGATCGACATGCTACAAGAGGTTACCAAGCAGATTGAAGGGCACACCATTTGTGCCCTTGGTGATGCTGCGGCTTGGCCTGTGCAGGGTCTTATAAGGCATTTTAGGCCTGAGCTTGAAAGAAGGATCAGAGAACGTGCAGAACGGGAGTTGATCGAGGCTGCTGCATAG

Protein sequence

MIMPPSPALRSSPGRELRGSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSAEDLEDSFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDNDPPPVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQLSAPHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTSGARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNSMDLQYKYSRQLMSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNSLSKGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLDRKSPNMFRPLLSSVPSTTFYTGKVSSAHRSLISRNSSVTTSSNASSYNGASIVLDTEVSDLNQDDMANECEKVPYHDIHEEIFAFDKMDIVNENPINDIKSLDSGPAPGCDPVLTEDSSHQTIIPEISSTFDSSRAQGNAFSEVVCLDDIILCPRCGCRYCVIDTEENYINLCPECSRKEKYLGMTLLENMTSVTESISGYSIKYEAGKPFNKVESGVISLESSLATDLGESRISESLGNVEQDQASYPEEGLSYQKENFPSETPVSESQHSLINHSEIGQLVVSGSQSNTESGYQQPLHHNDYKDLRFDSSEGAGISILLKRSSSSKGPIVQGRTFTTSTISYDDLSFARDSMSSLRSSVGHSSFSASSSADFSSARQIEARIQRQVSSRKGELESKKGEICVKSHISEAASSGTPTNAHPVLGFETCEQEENLDFTVANLECFSSQGTTNSSQKPELASENAESDDTSSIVVAVVEEDKFECDNRRILDTCTSESSREDLSGGRSVSDKEAPVTTSDCSKLEGHNMPDVSAFEDEQHPNHLMTTISEKETKQIAEVIAPGSQSDLSIISKSLLEEESMVPSGPDEDLTPPVINTEKSYGILEESTVIVDYQGRRKVVRSLTLEEATDTILFCSSIVHDIAYSAASIAIEKENEVTLEGSRPTVTILGKSNTDRSDLRSRTGGKRVMKSQKLRQRHVEMSTKPPVTKTENDENTDESTIRNGSIIDGSYVQLERYLTQSSWCNSTTIAIELQFCLIADAAGCSNSFSCLGWKNKVFLRQDLEGTSKSSSGDFHIVQWGKTKLAIFLEFGFDNLIWSPWTMALDVSLGKHILWHKVITLKYGVLSGGWWAHRGIILLQGSRKFQEWLHYLTHSDDKWLRACDISGNFVNISLDLPRSGPNAARRVTGFSSSEAFHTMAPARGILRLQRTALARSYTDRWGIGLRAFSNQGAAATSSPQPPSPPPPPERTHFGGLKDEDRIFTNLYGLNDPFLKGAMKRGDWYRTKDLVLKGADWIVNEVKKSGLRGRGGAGFPSGLKWSFMPKLSDGRPSYLVVNADESEPGTCKDREIMRHDPHKLLEGCLIAGVGMRATAAYIYIRGEYVNERKNLERARKEAYEAGFLGKNACGSGYDFDVHIHFGAGAYICGEETALLESLEGKQGKPRLKPPFPANAGLYGCPTTVTNVETVAVSPTILRRGPEWFASFGRKNNSGTKLFCISGHVNKPCTVEEEMSISLKELIERHCGGVRGGWDNLLAVIPGGSSVPLLPKHICDDVLMDYDALKAVQSGLGTAAVIVMDKSTDVVDAIARLSYFYKHESCGQCTPCREGTGWLWMIMERMKVGNAKLEEIDMLQEVTKQIEGHTICALGDAAAWPVQGLIRHFRPELERRIRERAERELIEAAA
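
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS codon by codon. The sketch below is an illustration only: it assumes the standard nuclear genetic code and reading frame 0, `translate` is a hypothetical helper, and only the first 21 and last 12 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a nucleotide string in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        for i in range(0, usable, 3)
    )

print(translate("ATGATAATGCCTCCTTCTCCA"))  # start of the CDS -> MIMPPSP
print(translate("GCTGCTGCATAG"))           # end of the CDS   -> AAA*
```

Both results agree with the start (MIMPPSP) and end (AAA, followed by the stop codon) of the protein sequence listed above.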
Homology
BLAST of Moc08g00890 vs. NCBI nr
Match: XP_022131331.1 (uncharacterized protein LOC111004588 isoform X1 [Momordica charantia] >XP_022131332.1 uncharacterized protein LOC111004588 isoform X1 [Momordica charantia])

HSP 1 Score: 2100.1 bits (5440), Expect = 0.0e+00
Identity = 1119/1123 (99.64%), Positives = 1119/1123 (99.64%), Query Frame = 0

Query: 1    MIMPPSPALRSSPGRELRGSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSA 60
            MIMPPSPALRSSPGREL GSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSA
Sbjct: 1    MIMPPSPALRSSPGRELXGSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSA 60

Query: 61   EDLEDSFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDND 120
            EDLEDSFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDND
Sbjct: 61   EDLEDSFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDND 120

Query: 121  PPPVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQ 180
            PPPVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQ
Sbjct: 121  PPPVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQ 180

Query: 181  LSAPHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTT 240
            LSAPHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTT
Sbjct: 181  LSAPHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTT 240

Query: 241  SGARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRN 300
            SGARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRN
Sbjct: 241  SGARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRN 300

Query: 301  SMDLQYKYSRQLMSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNS 360
            SMDLQYKYSRQ MSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNS
Sbjct: 301  SMDLQYKYSRQXMSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNS 360

Query: 361  LSKGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLDRKSPNMFRPLLSSVPSTTF 420
            LSKGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLDRKSPNMFRPLLSSVPSTTF
Sbjct: 361  LSKGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLDRKSPNMFRPLLSSVPSTTF 420

Query: 421  YTGKVSSAHRSLISRNSSVTTSSNASSYNGASIVLDTEVSDLNQDDMANECEKVPYHDIH 480
            YTGKVSSAHRSLISRNSSVTTSSNASSYNGASIVLDTEVSDLNQDDMANECEKVPYHDIH
Sbjct: 421  YTGKVSSAHRSLISRNSSVTTSSNASSYNGASIVLDTEVSDLNQDDMANECEKVPYHDIH 480

Query: 481  EEIFAFDKMDIVNENPINDIKSLDSGPAPGCDPVLTEDSSHQTIIPEISSTFDSSRAQGN 540
            EEIFAFDKMDIVNENPINDIKSLDSGPAPGCDPVLTEDSSHQTIIPEISSTFDSSRAQGN
Sbjct: 481  EEIFAFDKMDIVNENPINDIKSLDSGPAPGCDPVLTEDSSHQTIIPEISSTFDSSRAQGN 540

Query: 541  AFSEVVCLDDIILCPRCGCRYCVIDTEENYINLCPECSRKEKYLGMTLLENMTSVTESIS 600
            AFSEVVCLDDIILCPRCGCRYCVIDTEEN INLCPECSRKEKYLGMTLLENMT VTESIS
Sbjct: 541  AFSEVVCLDDIILCPRCGCRYCVIDTEENXINLCPECSRKEKYLGMTLLENMTXVTESIS 600

Query: 601  GYSIKYEAGKPFNKVESGVISLESSLATDLGESRISESLGNVEQDQASYPEEGLSYQKEN 660
            GYSIKYEAGKPFNKVESGVISLESSLATDLGESRISESLGNVEQDQASYPEEGLSYQKEN
Sbjct: 601  GYSIKYEAGKPFNKVESGVISLESSLATDLGESRISESLGNVEQDQASYPEEGLSYQKEN 660

Query: 661  FPSETPVSESQHSLINHSEIGQLVVSGSQSNTESGYQQPLHHNDYKDLRFDSSEGAGISI 720
            FPSETPVSESQHSLINHSEIGQLVVSGSQSNTESGYQQPLHHNDYKDLRFDSSEGAGISI
Sbjct: 661  FPSETPVSESQHSLINHSEIGQLVVSGSQSNTESGYQQPLHHNDYKDLRFDSSEGAGISI 720

Query: 721  LLKRSSSSKGPIVQGRTFTTSTISYDDLSFARDSMSSLRSSVGHSSFSASSSADFSSARQ 780
            LLKRSSSSKGPIVQGRTFTTSTISYDDLSFARDSMSSLRSSVGHSSFSASSSADFSSARQ
Sbjct: 721  LLKRSSSSKGPIVQGRTFTTSTISYDDLSFARDSMSSLRSSVGHSSFSASSSADFSSARQ 780

Query: 781  IEARIQRQVSSRKGELESKKGEICVKSHISEAASSGTPTNAHPVLGFETCEQEENLDFTV 840
            IEARIQRQVSSRKGELESKKGEICVKSHISEAASSGTPTNAHPVLGFETCEQEENLDFTV
Sbjct: 781  IEARIQRQVSSRKGELESKKGEICVKSHISEAASSGTPTNAHPVLGFETCEQEENLDFTV 840

Query: 841  ANLECFSSQGTTNSSQKPELASENAESDDTSSIVVAVVEEDKFECDNRRILDTCTSESSR 900
            ANLECFSSQGTTNSSQKPELASENAESDDTSSIVVAVVEEDKFECDNRRILDTCTSESSR
Sbjct: 841  ANLECFSSQGTTNSSQKPELASENAESDDTSSIVVAVVEEDKFECDNRRILDTCTSESSR 900

Query: 901  EDLSGGRSVSDKEAPVTTSDCSKLEGHNMPDVSAFEDEQHPNHLMTTISEKETKQIAEVI 960
            EDLSGGRSVSDKEAPVTTSDCSKLEGHNMPDVSAFEDEQHPNHLMTTISEKETKQIAEVI
Sbjct: 901  EDLSGGRSVSDKEAPVTTSDCSKLEGHNMPDVSAFEDEQHPNHLMTTISEKETKQIAEVI 960

Query: 961  APGSQSDLSIISKSLLEEESMVPSGPDEDLTPPVINTEKSYGILEESTVIVDYQGRRKVV 1020
            APGSQSDLSIISKSLLEEESMVPSGPDEDLTPPVINTEKSYGILEESTVIVDYQGRRKVV
Sbjct: 961  APGSQSDLSIISKSLLEEESMVPSGPDEDLTPPVINTEKSYGILEESTVIVDYQGRRKVV 1020

Query: 1021 RSLTLEEATDTILFCSSIVHDIAYSAASIAIEKENEVTLEGSRPTVTILGKSNTDRSDLR 1080
            RSLTLEEATDTILFCSSIVHDIAYSAASIAIEKENEVTLEGSRPTVTILGKSNTDRSDLR
Sbjct: 1021 RSLTLEEATDTILFCSSIVHDIAYSAASIAIEKENEVTLEGSRPTVTILGKSNTDRSDLR 1080

Query: 1081 SRTGGKRVMKSQKLRQRHVEMSTKPPVTKTENDENTDESTIRN 1124
            SRTGGKRVMKSQKLRQRHVEMSTKPPVTKTENDENTDESTIRN
Sbjct: 1081 SRTGGKRVMKSQKLRQRHVEMSTKPPVTKTENDENTDESTIRN 1123
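
The identity figure reported for this HSP can be recomputed from the counts in its header, and identities within an alignment block can be counted by comparing the Query and Sbjct rows position by position. This is a minimal sketch with hypothetical helper names (`parse_identity`, `count_identities`); it uses plain string handling rather than any BLAST parsing library.

```python
import re

def parse_identity(line):
    """Extract (matches, aligned_length, percent) from an 'Identity = a/b' field."""
    m = re.search(r"Identity\s*=\s*(\d+)/(\d+)", line)
    if m is None:
        raise ValueError("no Identity field found")
    matches, length = int(m.group(1)), int(m.group(2))
    return matches, length, round(100 * matches / length, 2)

def count_identities(query, sbjct):
    """Count positions where two equal-length aligned rows carry the same residue."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have equal length")
    return sum(q == s for q, s in zip(query, sbjct))

print(parse_identity("Identity = 1119/1123 (99.64%)"))  # (1119, 1123, 99.64)
# Excerpt of the first alignment block: the subject has X where the query has R.
print(count_identities("SSPGRELRGSNHKR", "SSPGRELXGSNHKR"))  # 13
```

The four non-identical positions in this HSP are the ones where the subject row shows X, which is why the middle line is blank at those columns.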

BLAST of Moc08g00890 vs. NCBI nr
Match: XP_022131333.1 (uncharacterized protein LOC111004588 isoform X2 [Momordica charantia])

HSP 1 Score: 2093.5 bits (5423), Expect = 0.0e+00
Identity = 1118/1123 (99.55%), Positives = 1118/1123 (99.55%), Query Frame = 0

Query: 1    MIMPPSPALRSSPGRELRGSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSA 60
            MIMPPSPALRSSPGREL GSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSA
Sbjct: 1    MIMPPSPALRSSPGRELXGSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSA 60

Query: 61   EDLEDSFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDND 120
            EDLEDSFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDND
Sbjct: 61   EDLEDSFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDND 120

Query: 121  PPPVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQ 180
            PPPVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQ
Sbjct: 121  PPPVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQ 180

Query: 181  LSAPHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTT 240
            LSAPHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTT
Sbjct: 181  LSAPHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTT 240

Query: 241  SGARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRN 300
            SGARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRN
Sbjct: 241  SGARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRN 300

Query: 301  SMDLQYKYSRQLMSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNS 360
            SMDLQYKYSRQ MSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNS
Sbjct: 301  SMDLQYKYSRQXMSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNS 360

Query: 361  LSKGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLDRKSPNMFRPLLSSVPSTTF 420
            LSKGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLDRKSPNMFRPLLSSVPSTTF
Sbjct: 361  LSKGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLDRKSPNMFRPLLSSVPSTTF 420

Query: 421  YTGKVSSAHRSLISRNSSVTTSSNASSYNGASIVLDTEVSDLNQDDMANECEKVPYHDIH 480
            YTGKVSSAHRSLISRNSSVTTSSNASSYNGASIVLDTEVSDLNQDDMANECEKVPYHDIH
Sbjct: 421  YTGKVSSAHRSLISRNSSVTTSSNASSYNGASIVLDTEVSDLNQDDMANECEKVPYHDIH 480

Query: 481  EEIFAFDKMDIVNENPINDIKSLDSGPAPGCDPVLTEDSSHQTIIPEISSTFDSSRAQGN 540
            EEIFAFDKMDIVNENPINDIKSLDSGPAPGCDPVLTEDSSHQTIIPEISSTFDSSRAQGN
Sbjct: 481  EEIFAFDKMDIVNENPINDIKSLDSGPAPGCDPVLTEDSSHQTIIPEISSTFDSSRAQGN 540

Query: 541  AFSEVVCLDDIILCPRCGCRYCVIDTEENYINLCPECSRKEKYLGMTLLENMTSVTESIS 600
            AFSEVVCLDDIILCPRCGCRYCVIDTEEN INLCPECSRKEKYLGMTLLENMT VTESIS
Sbjct: 541  AFSEVVCLDDIILCPRCGCRYCVIDTEENXINLCPECSRKEKYLGMTLLENMTXVTESIS 600

Query: 601  GYSIKYEAGKPFNKVESGVISLESSLATDLGESRISESLGNVEQDQASYPEEGLSYQKEN 660
            GYSIKYEAGKPFNKVESGVISLESSLATDLGESRISESLGNVEQDQASYPEEGLSYQKEN
Sbjct: 601  GYSIKYEAGKPFNKVESGVISLESSLATDLGESRISESLGNVEQDQASYPEEGLSYQKEN 660

Query: 661  FPSETPVSESQHSLINHSEIGQLVVSGSQSNTESGYQQPLHHNDYKDLRFDSSEGAGISI 720
            FPSETPVSESQHSLINHSEIGQLVVSGSQSNTESGYQQPLHHNDYKDLRFDSSEGAGISI
Sbjct: 661  FPSETPVSESQHSLINHSEIGQLVVSGSQSNTESGYQQPLHHNDYKDLRFDSSEGAGISI 720

Query: 721  LLKRSSSSKGPIVQGRTFTTSTISYDDLSFARDSMSSLRSSVGHSSFSASSSADFSSARQ 780
            LLKRSSSSKGPIVQGRTFTTSTISYDDLSFARDSMSSLRSSVGHSSFSASSSADFSSARQ
Sbjct: 721  LLKRSSSSKGPIVQGRTFTTSTISYDDLSFARDSMSSLRSSVGHSSFSASSSADFSSARQ 780

Query: 781  IEARIQRQVSSRKGELESKKGEICVKSHISEAASSGTPTNAHPVLGFETCEQEENLDFTV 840
            IEARIQRQVSSRKGELESKKGEICVKSHISEAASSGTPTNAHPVLGFETCEQEENLDFTV
Sbjct: 781  IEARIQRQVSSRKGELESKKGEICVKSHISEAASSGTPTNAHPVLGFETCEQEENLDFTV 840

Query: 841  ANLECFSSQGTTNSSQKPELASENAESDDTSSIVVAVVEEDKFECDNRRILDTCTSESSR 900
            ANLECFSSQGTTNSSQKPELASENAESDDTSSIVVAVVEEDKFECDNRRILDTCTSESSR
Sbjct: 841  ANLECFSSQGTTNSSQKPELASENAESDDTSSIVVAVVEEDKFECDNRRILDTCTSESSR 900

Query: 901  EDLSGGRSVSDKEAPVTTSDCSKLEGHNMPDVSAFEDEQHPNHLMTTISEKETKQIAEVI 960
            EDLSGGRSVSDKEAPVTTSDCSKLEGHNMPDVSAFEDEQHPNHLMTTISEKETKQIAEVI
Sbjct: 901  EDLSGGRSVSDKEAPVTTSDCSKLEGHNMPDVSAFEDEQHPNHLMTTISEKETKQIAEVI 960

Query: 961  APGSQSDLSIISKSLLEEESMVPSGPDEDLTPPVINTEKSYGILEESTVIVDYQGRRKVV 1020
            APGSQSDLSIISKSLLEEESMVPSGPDEDLTPPVINTEKSYGIL ESTVIVDYQGRRKVV
Sbjct: 961  APGSQSDLSIISKSLLEEESMVPSGPDEDLTPPVINTEKSYGIL-ESTVIVDYQGRRKVV 1020

Query: 1021 RSLTLEEATDTILFCSSIVHDIAYSAASIAIEKENEVTLEGSRPTVTILGKSNTDRSDLR 1080
            RSLTLEEATDTILFCSSIVHDIAYSAASIAIEKENEVTLEGSRPTVTILGKSNTDRSDLR
Sbjct: 1021 RSLTLEEATDTILFCSSIVHDIAYSAASIAIEKENEVTLEGSRPTVTILGKSNTDRSDLR 1080

Query: 1081 SRTGGKRVMKSQKLRQRHVEMSTKPPVTKTENDENTDESTIRN 1124
            SRTGGKRVMKSQKLRQRHVEMSTKPPVTKTENDENTDESTIRN
Sbjct: 1081 SRTGGKRVMKSQKLRQRHVEMSTKPPVTKTENDENTDESTIRN 1122

BLAST of Moc08g00890 vs. NCBI nr
Match: XP_022131334.1 (uncharacterized protein LOC111004588 isoform X3 [Momordica charantia])

HSP 1 Score: 1971.1 bits (5105), Expect = 0.0e+00
Identity = 1053/1058 (99.53%), Positives = 1054/1058 (99.62%), Query Frame = 0

Query: 66   SFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDNDPPPVT 125
            S +TKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDNDPPPVT
Sbjct: 32   SAATKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDNDPPPVT 91

Query: 126  LASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQLSAPH 185
            LASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQLSAPH
Sbjct: 92   LASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQLSAPH 151

Query: 186  SSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTSGARG 245
            SSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTSGARG
Sbjct: 152  SSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTSGARG 211

Query: 246  TSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNSMDLQ 305
            TSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNSMDLQ
Sbjct: 212  TSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNSMDLQ 271

Query: 306  YKYSRQLMSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNSLSKGG 365
            YKYSRQ MSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNSLSKGG
Sbjct: 272  YKYSRQXMSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNSLSKGG 331

Query: 366  NTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLDRKSPNMFRPLLSSVPSTTFYTGKV 425
            NTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLDRKSPNMFRPLLSSVPSTTFYTGKV
Sbjct: 332  NTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLDRKSPNMFRPLLSSVPSTTFYTGKV 391

Query: 426  SSAHRSLISRNSSVTTSSNASSYNGASIVLDTEVSDLNQDDMANECEKVPYHDIHEEIFA 485
            SSAHRSLISRNSSVTTSSNASSYNGASIVLDTEVSDLNQDDMANECEKVPYHDIHEEIFA
Sbjct: 392  SSAHRSLISRNSSVTTSSNASSYNGASIVLDTEVSDLNQDDMANECEKVPYHDIHEEIFA 451

Query: 486  FDKMDIVNENPINDIKSLDSGPAPGCDPVLTEDSSHQTIIPEISSTFDSSRAQGNAFSEV 545
            FDKMDIVNENPINDIKSLDSGPAPGCDPVLTEDSSHQTIIPEISSTFDSSRAQGNAFSEV
Sbjct: 452  FDKMDIVNENPINDIKSLDSGPAPGCDPVLTEDSSHQTIIPEISSTFDSSRAQGNAFSEV 511

Query: 546  VCLDDIILCPRCGCRYCVIDTEENYINLCPECSRKEKYLGMTLLENMTSVTESISGYSIK 605
            VCLDDIILCPRCGCRYCVIDTEEN INLCPECSRKEKYLGMTLLENMT VTESISGYSIK
Sbjct: 512  VCLDDIILCPRCGCRYCVIDTEENXINLCPECSRKEKYLGMTLLENMTXVTESISGYSIK 571

Query: 606  YEAGKPFNKVESGVISLESSLATDLGESRISESLGNVEQDQASYPEEGLSYQKENFPSET 665
            YEAGKPFNKVESGVISLESSLATDLGESRISESLGNVEQDQASYPEEGLSYQKENFPSET
Sbjct: 572  YEAGKPFNKVESGVISLESSLATDLGESRISESLGNVEQDQASYPEEGLSYQKENFPSET 631

Query: 666  PVSESQHSLINHSEIGQLVVSGSQSNTESGYQQPLHHNDYKDLRFDSSEGAGISILLKRS 725
            PVSESQHSLINHSEIGQLVVSGSQSNTESGYQQPLHHNDYKDLRFDSSEGAGISILLKRS
Sbjct: 632  PVSESQHSLINHSEIGQLVVSGSQSNTESGYQQPLHHNDYKDLRFDSSEGAGISILLKRS 691

Query: 726  SSSKGPIVQGRTFTTSTISYDDLSFARDSMSSLRSSVGHSSFSASSSADFSSARQIEARI 785
            SSSKGPIVQGRTFTTSTISYDDLSFARDSMSSLRSSVGHSSFSASSSADFSSARQIEARI
Sbjct: 692  SSSKGPIVQGRTFTTSTISYDDLSFARDSMSSLRSSVGHSSFSASSSADFSSARQIEARI 751

Query: 786  QRQVSSRKGELESKKGEICVKSHISEAASSGTPTNAHPVLGFETCEQEENLDFTVANLEC 845
            QRQVSSRKGELESKKGEICVKSHISEAASSGTPTNAHPVLGFETCEQEENLDFTVANLEC
Sbjct: 752  QRQVSSRKGELESKKGEICVKSHISEAASSGTPTNAHPVLGFETCEQEENLDFTVANLEC 811

Query: 846  FSSQGTTNSSQKPELASENAESDDTSSIVVAVVEEDKFECDNRRILDTCTSESSREDLSG 905
            FSSQGTTNSSQKPELASENAESDDTSSIVVAVVEEDKFECDNRRILDTCTSESSREDLSG
Sbjct: 812  FSSQGTTNSSQKPELASENAESDDTSSIVVAVVEEDKFECDNRRILDTCTSESSREDLSG 871

Query: 906  GRSVSDKEAPVTTSDCSKLEGHNMPDVSAFEDEQHPNHLMTTISEKETKQIAEVIAPGSQ 965
            GRSVSDKEAPVTTSDCSKLEGHNMPDVSAFEDEQHPNHLMTTISEKETKQIAEVIAPGSQ
Sbjct: 872  GRSVSDKEAPVTTSDCSKLEGHNMPDVSAFEDEQHPNHLMTTISEKETKQIAEVIAPGSQ 931

Query: 966  SDLSIISKSLLEEESMVPSGPDEDLTPPVINTEKSYGILEESTVIVDYQGRRKVVRSLTL 1025
            SDLSIISKSLLEEESMVPSGPDEDLTPPVINTEKSYGILEESTVIVDYQGRRKVVRSLTL
Sbjct: 932  SDLSIISKSLLEEESMVPSGPDEDLTPPVINTEKSYGILEESTVIVDYQGRRKVVRSLTL 991

Query: 1026 EEATDTILFCSSIVHDIAYSAASIAIEKENEVTLEGSRPTVTILGKSNTDRSDLRSRTGG 1085
            EEATDTILFCSSIVHDIAYSAASIAIEKENEVTLEGSRPTVTILGKSNTDRSDLRSRTGG
Sbjct: 992  EEATDTILFCSSIVHDIAYSAASIAIEKENEVTLEGSRPTVTILGKSNTDRSDLRSRTGG 1051

Query: 1086 KRVMKSQKLRQRHVEMSTKPPVTKTENDENTDESTIRN 1124
            KRVMKSQKLRQRHVEMSTKPPVTKTENDENTDESTIRN
Sbjct: 1052 KRVMKSQKLRQRHVEMSTKPPVTKTENDENTDESTIRN 1089

BLAST of Moc08g00890 vs. NCBI nr
Match: XP_031737323.1 (serine/arginine repetitive matrix protein 2 isoform X1 [Cucumis sativus] >KGN62363.1 hypothetical protein Csa_018751 [Cucumis sativus])

HSP 1 Score: 1655.2 bits (4285), Expect = 0.0e+00
Identity = 922/1139 (80.95%), Positives = 990/1139 (86.92%), Query Frame = 0

Query: 1    MIMPPSPALRSSPGRELRGSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSA 60
            MIMPPSPALRSSPGRE RGSNHKRGHSFES + IREKDDDLALFNEMQTRERE FLLQSA
Sbjct: 1    MIMPPSPALRSSPGRESRGSNHKRGHSFESAVRIREKDDDLALFNEMQTREREGFLLQSA 60

Query: 61   EDLEDSFSTKLRHFSDIKLGISIPVRGENSELL-NVDGEKNDYDWLLTPPDTPLFPSLDN 120
            EDLEDSFSTKLRHFSD+KLGISIPVRGENS+LL NV+ EKNDYDWLLTPPDTPLFPSLD+
Sbjct: 61   EDLEDSFSTKLRHFSDLKLGISIPVRGENSDLLNNVEAEKNDYDWLLTPPDTPLFPSLDD 120

Query: 121  DPPPVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGR 180
            +PP V +ASRGRPRSQPISISRSSTMEKSHRSSTSRGS SPNRLSPSPRSA+SVPQ+RGR
Sbjct: 121  EPPSVAIASRGRPRSQPISISRSSTMEKSHRSSTSRGSPSPNRLSPSPRSANSVPQLRGR 180

Query: 181  QLSAPHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTST 240
            QLSAPHSSPTPSLRHATPSRRSTTP RRS PPPSTPS SV RSSTPTPRRLSTGSSGT+ 
Sbjct: 181  QLSAPHSSPTPSLRHATPSRRSTTPTRRSPPPPSTPSTSVPRSSTPTPRRLSTGSSGTAG 240

Query: 241  TSGARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASR 300
             SGARGTSPIK+VRGNSASPKIRAWQTNIPGFSSD PPNLRTSL DRPASY RGSSPASR
Sbjct: 241  ISGARGTSPIKSVRGNSASPKIRAWQTNIPGFSSDPPPNLRTSLDDRPASYVRGSSPASR 300

Query: 301  NSMDLQYKYSRQLMSPTA--PISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSL 360
            NS DL +KY RQ MSPTA   ISSSHSHDRDRYSSYSRGS ASSGDDDLDSLQSIP SSL
Sbjct: 301  NSRDLAHKYGRQSMSPTASRSISSSHSHDRDRYSSYSRGSIASSGDDDLDSLQSIPISSL 360

Query: 361  DNSLSKGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLDRKSPNMFRPLLSSVPS 420
            DNSLSKGG +FSNNKAL  SKKHRIVSS S PKRSLDSTIR LDRKSPNMFRPLLSSVPS
Sbjct: 361  DNSLSKGGISFSNNKALAFSKKHRIVSS-SAPKRSLDSTIRHLDRKSPNMFRPLLSSVPS 420

Query: 421  TTFYTGKVSSAHRSLISRNSSVTTSSNASSYNGASIVLDTEVSDLNQDDMANECEKVPYH 480
            TTFYTGK SSAHRSLISRNSSVTTSSNASS +G  I LDTE SD NQDDM NECEK+ YH
Sbjct: 421  TTFYTGKASSAHRSLISRNSSVTTSSNASSDHGTCIALDTEGSDQNQDDMVNECEKIQYH 480

Query: 481  DIHEEIFAFDKMDIVNENPINDIKSLDSGPAPGCDPVLTEDSSHQTIIPEISSTFDSSRA 540
            + HEEIFAFDKMDIV+E+PI+DIKSLDSGPA GCDPV+T DSS++ ++P+ISST DSS  
Sbjct: 481  NSHEEIFAFDKMDIVDEDPIHDIKSLDSGPALGCDPVVTGDSSYEAVVPDISSTSDSSHV 540

Query: 541  QGNAFSEVVCLDDIILCPRCGCRYCVIDTEENYINLCPECSRKEKYLGMTLLENMTSVTE 600
            QG  FSE+VCL+D ++C RCGCRY V DTEEN  NLCPECSR+EK L + + ENMT+VTE
Sbjct: 541  QGADFSEIVCLEDTVVCSRCGCRYRVTDTEENDANLCPECSREEKCLSLAISENMTAVTE 600

Query: 601  SISGY-SIKYEAGKPFNKVESGVISLESSLATDLGESRISESLGNVEQDQASYPEEGLSY 660
            S+SG  S+KYE  KPF+KVE  VIS +S+LA DLGESRIS  +GNVEQDQASYPE+G SY
Sbjct: 601  SLSGLSSVKYE-DKPFDKVELVVISPDSALANDLGESRISMFVGNVEQDQASYPEQGPSY 660

Query: 661  QKENFPSETPVSESQHSLINHSEIGQLVVSGSQSNTESGYQQPLHHNDYKDLRFDSSEGA 720
              ENFP+ETP  ESQHSLINH EIGQ  VSG+Q +T SGYQQPL  NDY+ LRFDS EGA
Sbjct: 661  -VENFPAETPSEESQHSLINHLEIGQSAVSGNQPDTGSGYQQPLQRNDYQSLRFDSPEGA 720

Query: 721  GISILLKRSSSSKGPIVQGRTFTTSTISYDDLSFARDSMSSLRSSVGHSSFSASSSADFS 780
            GISILLKRSSSSKGP+VQGRTFT STISYDDLSFARDSMSSLRSS+GHSSFSASSSADFS
Sbjct: 721  GISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFS 780

Query: 781  SARQIEARIQRQ--VSSRKGELESKKGEICVKSHISEAASSGTPTNAHPVLGFETCEQEE 840
            SARQIEAR+QRQ  +SSRKGELE+KKGEI VKSH +E ASSG P +AHP+ GFETC+Q+E
Sbjct: 781  SARQIEARMQRQLSLSSRKGELENKKGEISVKSHCAEIASSGIPASAHPISGFETCKQDE 840

Query: 841  NLDFTVANLECFSSQGTTNSSQKPELASENAESDDTSSIVVAVVEEDKFECDNRRILDTC 900
            N+DF VANLEC S QGTT SSQK ELASEN +SDDTSSI VAVVEEDKFE D  RILDTC
Sbjct: 841  NVDFYVANLECSSCQGTTTSSQKAELASENGKSDDTSSISVAVVEEDKFEYDTCRILDTC 900

Query: 901  TSESSREDLSGGRSVSDKEAPVTTSDCSKLEGHNMPDVSAFEDEQH--PNHLMTTISEKE 960
            TSE SRED SGGRSVSDK+A VT SDCSKLEGHNM     FEDE+     H M TISE E
Sbjct: 901  TSELSREDSSGGRSVSDKDASVTNSDCSKLEGHNMLG-DVFEDERSEVSTHPMITISETE 960

Query: 961  TKQIAEVIAPGSQSDLSIISKSLLEEESMVPSGPDEDLTPPVINTEKSYGILEESTVIVD 1020
              QIAEV+A GSQ D+S IS   LEEES+V SGPD+DLTP +IN EKS GILEESTVIVD
Sbjct: 961  ATQIAEVVASGSQDDISTISMIPLEEESVVLSGPDQDLTPSIINAEKSDGILEESTVIVD 1020

Query: 1021 YQGRRKVVRSLTLEEATDTILFCSSIVHDIAYSAASIAI--------EKENEVTLEGSRP 1080
            YQG+ KVVRSLTLEEATDTILFCSSIVHD+AYSAA+IAI        EKENEVTLE SRP
Sbjct: 1021 YQGKTKVVRSLTLEEATDTILFCSSIVHDLAYSAATIAIEKEKEKEKEKENEVTLEASRP 1080

Query: 1081 TVTILGKSNTDRSDLRSRTGGKRVMKSQKLRQRHVEMSTKPPVTKTENDENTDESTIRN 1124
             VTILGKSNT+RSDLR RTGGKRVMKSQK RQR VEMSTKPP+  TENDENTDESTIRN
Sbjct: 1081 MVTILGKSNTNRSDLRHRTGGKRVMKSQKPRQRRVEMSTKPPIAYTENDENTDESTIRN 1135

BLAST of Moc08g00890 vs. NCBI nr
Match: XP_031737324.1 (serine/arginine repetitive matrix protein 2 isoform X2 [Cucumis sativus])

HSP 1 Score: 1648.6 bits (4268), Expect = 0.0e+00
Identity = 921/1139 (80.86%), Positives = 989/1139 (86.83%), Query Frame = 0

Query: 1    MIMPPSPALRSSPGRELRGSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSA 60
            MIMPPSPALRSSPGRE RGSNHKRGHSFES + IREKDDDLALFNEMQTRERE FLLQSA
Sbjct: 1    MIMPPSPALRSSPGRESRGSNHKRGHSFESAVRIREKDDDLALFNEMQTREREGFLLQSA 60

Query: 61   EDLEDSFSTKLRHFSDIKLGISIPVRGENSELL-NVDGEKNDYDWLLTPPDTPLFPSLDN 120
            EDLEDSFSTKLRHFSD+KLGISIPVRGENS+LL NV+ EKNDYDWLLTPPDTPLFPSLD+
Sbjct: 61   EDLEDSFSTKLRHFSDLKLGISIPVRGENSDLLNNVEAEKNDYDWLLTPPDTPLFPSLDD 120

Query: 121  DPPPVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGR 180
            +PP V +ASRGRPRSQPISISRSSTMEKSHRSSTSRGS SPNRLSPSPRSA+SVPQ+RGR
Sbjct: 121  EPPSVAIASRGRPRSQPISISRSSTMEKSHRSSTSRGSPSPNRLSPSPRSANSVPQLRGR 180

Query: 181  QLSAPHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTST 240
            QLSAPHSSPTPSLRHATPSRRSTTP RRS PPPSTPS SV RSSTPTPRRLSTGSSGT+ 
Sbjct: 181  QLSAPHSSPTPSLRHATPSRRSTTPTRRSPPPPSTPSTSVPRSSTPTPRRLSTGSSGTAG 240

Query: 241  TSGARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASR 300
             SGARGTSPIK+VRGNSASPKIRAWQTNIPGFSSD PPNLRTSL DRPASY RGSSPASR
Sbjct: 241  ISGARGTSPIKSVRGNSASPKIRAWQTNIPGFSSDPPPNLRTSLDDRPASYVRGSSPASR 300

Query: 301  NSMDLQYKYSRQLMSPTA--PISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSL 360
            NS DL +KY RQ MSPTA   ISSSHSHDRDRYSSYSRGS ASSGDDDLDSLQSIP SSL
Sbjct: 301  NSRDLAHKYGRQSMSPTASRSISSSHSHDRDRYSSYSRGSIASSGDDDLDSLQSIPISSL 360

Query: 361  DNSLSKGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLDRKSPNMFRPLLSSVPS 420
            DNSLSKGG +FSNNKAL  SKKHRIVSS S PKRSLDSTIR LDRKSPNMFRPLLSSVPS
Sbjct: 361  DNSLSKGGISFSNNKALAFSKKHRIVSS-SAPKRSLDSTIRHLDRKSPNMFRPLLSSVPS 420

Query: 421  TTFYTGKVSSAHRSLISRNSSVTTSSNASSYNGASIVLDTEVSDLNQDDMANECEKVPYH 480
            TTFYTGK SSAHRSLISRNSSVTTSSNASS +G  I LDTE SD NQDDM NECEK+ YH
Sbjct: 421  TTFYTGKASSAHRSLISRNSSVTTSSNASSDHGTCIALDTEGSDQNQDDMVNECEKIQYH 480

Query: 481  DIHEEIFAFDKMDIVNENPINDIKSLDSGPAPGCDPVLTEDSSHQTIIPEISSTFDSSRA 540
            + HEEIFAFDKMDIV+E+PI+DIKSLDSGPA GCDPV+T DSS++ ++P+ISST DSS  
Sbjct: 481  NSHEEIFAFDKMDIVDEDPIHDIKSLDSGPALGCDPVVTGDSSYEAVVPDISSTSDSSHV 540

Query: 541  QGNAFSEVVCLDDIILCPRCGCRYCVIDTEENYINLCPECSRKEKYLGMTLLENMTSVTE 600
            QG  FSE+VCL+D ++C RCGCRY V DTEEN  NLCPECSR+EK L + + ENMT+VTE
Sbjct: 541  QGADFSEIVCLEDTVVCSRCGCRYRVTDTEENDANLCPECSREEKCLSLAISENMTAVTE 600

Query: 601  SISGY-SIKYEAGKPFNKVESGVISLESSLATDLGESRISESLGNVEQDQASYPEEGLSY 660
            S+SG  S+KYE  KPF+KVE  VIS +S+LA DLGESRIS  +GNVEQDQASYPE+G SY
Sbjct: 601  SLSGLSSVKYE-DKPFDKVELVVISPDSALANDLGESRISMFVGNVEQDQASYPEQGPSY 660

Query: 661  QKENFPSETPVSESQHSLINHSEIGQLVVSGSQSNTESGYQQPLHHNDYKDLRFDSSEGA 720
              ENFP+ETP  ESQHSLINH EIGQ  VSG+Q +T SGYQQPL  NDY+ LRFDS EGA
Sbjct: 661  -VENFPAETPSEESQHSLINHLEIGQSAVSGNQPDTGSGYQQPLQRNDYQSLRFDSPEGA 720

Query: 721  GISILLKRSSSSKGPIVQGRTFTTSTISYDDLSFARDSMSSLRSSVGHSSFSASSSADFS 780
            GISILLKRSSSSKGP+VQGRTFT STISYDDLSFARDSMSSLRSS+GHSSFSASSSADFS
Sbjct: 721  GISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFS 780

Query: 781  SARQIEARIQRQ--VSSRKGELESKKGEICVKSHISEAASSGTPTNAHPVLGFETCEQEE 840
            SARQIEAR+QRQ  +SSRKGELE+KKGEI VKSH +E ASSG P +AHP+ GFETC+Q+E
Sbjct: 781  SARQIEARMQRQLSLSSRKGELENKKGEISVKSHCAEIASSGIPASAHPISGFETCKQDE 840

Query: 841  NLDFTVANLECFSSQGTTNSSQKPELASENAESDDTSSIVVAVVEEDKFECDNRRILDTC 900
            N+DF VANLEC S QGTT SSQK ELASEN +SDDTSSI VAVVEEDKFE D  RILDTC
Sbjct: 841  NVDFYVANLECSSCQGTTTSSQKAELASENGKSDDTSSISVAVVEEDKFEYDTCRILDTC 900

Query: 901  TSESSREDLSGGRSVSDKEAPVTTSDCSKLEGHNMPDVSAFEDEQH--PNHLMTTISEKE 960
            TSE SRED SGGRSVSDK+A VT SDCSKLEGHNM     FEDE+     H M TISE E
Sbjct: 901  TSELSREDSSGGRSVSDKDASVTNSDCSKLEGHNMLG-DVFEDERSEVSTHPMITISETE 960

Query: 961  TKQIAEVIAPGSQSDLSIISKSLLEEESMVPSGPDEDLTPPVINTEKSYGILEESTVIVD 1020
              QIAEV+A GSQ D+S IS   LEEES+V SGPD+DLTP +IN EKS GIL ESTVIVD
Sbjct: 961  ATQIAEVVASGSQDDISTISMIPLEEESVVLSGPDQDLTPSIINAEKSDGIL-ESTVIVD 1020

Query: 1021 YQGRRKVVRSLTLEEATDTILFCSSIVHDIAYSAASIAI--------EKENEVTLEGSRP 1080
            YQG+ KVVRSLTLEEATDTILFCSSIVHD+AYSAA+IAI        EKENEVTLE SRP
Sbjct: 1021 YQGKTKVVRSLTLEEATDTILFCSSIVHDLAYSAATIAIEKEKEKEKEKENEVTLEASRP 1080

Query: 1081 TVTILGKSNTDRSDLRSRTGGKRVMKSQKLRQRHVEMSTKPPVTKTENDENTDESTIRN 1124
             VTILGKSNT+RSDLR RTGGKRVMKSQK RQR VEMSTKPP+  TENDENTDESTIRN
Sbjct: 1081 MVTILGKSNTNRSDLRHRTGGKRVMKSQKPRQRRVEMSTKPPIAYTENDENTDESTIRN 1134

BLAST of Moc08g00890 vs. ExPASy Swiss-Prot
Match: Q9FNN5 (NADH dehydrogenase [ubiquinone] flavoprotein 1, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g08530 PE=1 SV=1)

HSP 1 Score: 923.3 bits (2385), Expect = 4.2e-267
Identity = 438/486 (90.12%), Positives = 464/486 (95.47%), Query Frame = 0

Query: 1318 MAPARGILRLQRTALARSYTDRWGIGLRAFSNQGAAATSSPQPPSPPPPPERTHFGGLKD 1377
            MAP RGIL LQR       ++R    LR+FS Q A+ +++PQPP PPPPPE+THFGGLKD
Sbjct: 1    MAPVRGILGLQRAVSIWKESNRLTPALRSFSTQAASTSTTPQPPPPPPPPEKTHFGGLKD 60

Query: 1378 EDRIFTNLYGLNDPFLKGAMKRGDWYRTKDLVLKGADWIVNEVKKSGLRGRGGAGFPSGL 1437
            EDRIFTNLYGL+DPFLKGAMKRGDW+RTKDLVLKG DWIVNE+KKSGLRGRGGAGFPSGL
Sbjct: 61   EDRIFTNLYGLHDPFLKGAMKRGDWHRTKDLVLKGTDWIVNEMKKSGLRGRGGAGFPSGL 120

Query: 1438 KWSFMPKLSDGRPSYLVVNADESEPGTCKDREIMRHDPHKLLEGCLIAGVGMRATAAYIY 1497
            KWSFMPK+SDGRPSYLVVNADESEPGTCKDREIMRHDPHKLLEGCLIAGVGMRA+AAYIY
Sbjct: 121  KWSFMPKVSDGRPSYLVVNADESEPGTCKDREIMRHDPHKLLEGCLIAGVGMRASAAYIY 180

Query: 1498 IRGEYVNERKNLERARKEAYEAGFLGKNACGSGYDFDVHIHFGAGAYICGEETALLESLE 1557
            IRGEYVNER NLE+AR+EAY AG LGKNACGSGYDF+V+IHFGAGAYICGEETALLESLE
Sbjct: 181  IRGEYVNERLNLEKARREAYAAGLLGKNACGSGYDFEVYIHFGAGAYICGEETALLESLE 240

Query: 1558 GKQGKPRLKPPFPANAGLYGCPTTVTNVETVAVSPTILRRGPEWFASFGRKNNSGTKLFC 1617
            GKQGKPRLKPPFPANAGLYGCPTTVTNVETVAVSPTILRRGPEWF+SFGRKNN+GTKLFC
Sbjct: 241  GKQGKPRLKPPFPANAGLYGCPTTVTNVETVAVSPTILRRGPEWFSSFGRKNNAGTKLFC 300

Query: 1618 ISGHVNKPCTVEEEMSISLKELIERHCGGVRGGWDNLLAVIPGGSSVPLLPKHICDDVLM 1677
            ISGHVNKPCTVEEEMSI LKELIERHCGGVRGGWDNLLA+IPGGSSVPL+PK+IC+DVLM
Sbjct: 301  ISGHVNKPCTVEEEMSIPLKELIERHCGGVRGGWDNLLAIIPGGSSVPLIPKNICEDVLM 360

Query: 1678 DYDALKAVQSGLGTAAVIVMDKSTDVVDAIARLSYFYKHESCGQCTPCREGTGWLWMIME 1737
            D+DALKAVQSGLGTAAVIVMDKSTDVVDAIARLSYFYKHESCGQCTPCREGTGWLWMIME
Sbjct: 361  DFDALKAVQSGLGTAAVIVMDKSTDVVDAIARLSYFYKHESCGQCTPCREGTGWLWMIME 420

Query: 1738 RMKVGNAKLEEIDMLQEVTKQIEGHTICALGDAAAWPVQGLIRHFRPELERRIRERAERE 1797
            RMKVGNAKLEEIDMLQEVTKQIEGHTICALGDAAAWPVQGLIRHFRPELERRIRERAERE
Sbjct: 421  RMKVGNAKLEEIDMLQEVTKQIEGHTICALGDAAAWPVQGLIRHFRPELERRIRERAERE 480

Query: 1798 LIEAAA 1804
            L++AAA
Sbjct: 481  LLQAAA 486

BLAST of Moc08g00890 vs. ExPASy Swiss-Prot
Match: Q54I90 (NADH dehydrogenase [ubiquinone] flavoprotein 1, mitochondrial OS=Dictyostelium discoideum OX=44689 GN=ndufv1 PE=2 SV=1)

HSP 1 Score: 699.9 bits (1805), Expect = 7.5e-200
Identity = 323/422 (76.54%), Positives = 372/422 (88.15%), Query Frame = 0

Query: 1372 FGGLKDEDRIFTNLYGLNDPFLKGAMKRGDWYRTKDLVLKGADWIVNEVKKSGLRGRGGA 1431
            +GGLKD+DRIFTNLYG +D +LKGA+ RGDWY+TK+++ KG DWI+ E+  SGLRGRGGA
Sbjct: 48   YGGLKDKDRIFTNLYGEHDVYLKGAIARGDWYKTKNIIDKGKDWILKEMMASGLRGRGGA 107

Query: 1432 GFPSGLKWSFMPK-LSDGRPSYLVVNADESEPGTCKDREIMRHDPHKLLEGCLIAGVGMR 1491
            GFPSGLKWSFMPK  S  RP YLV+NADE EPGTCKDREIMRHDPHKL+EGCL+AG  MR
Sbjct: 108  GFPSGLKWSFMPKTTSKDRPQYLVINADEGEPGTCKDREIMRHDPHKLIEGCLLAGFAMR 167

Query: 1492 ATAAYIYIRGEYVNERKNLERARKEAYEAGFLGKNACGSGYDFDVHIHFGAGAYICGEET 1551
            A AAYIYIRGE+  E K LE+A  EAY+AG +G+NACG+GY FDV++H GAGAYICGEET
Sbjct: 168  ACAAYIYIRGEFHYEAKVLEQAIDEAYKAGLIGENACGTGYKFDVYVHRGAGAYICGEET 227

Query: 1552 ALLESLEGKQGKPRLKPPFPANAGLYGCPTTVTNVETVAVSPTILRRGPEWFASFGRKNN 1611
            AL+ES+EGKQGKPRLKPPFPA AGLYGCPTTVTNVETVAV+PTILRRG  WFASFGR  N
Sbjct: 228  ALIESIEGKQGKPRLKPPFPAMAGLYGCPTTVTNVETVAVAPTILRRGGAWFASFGRPKN 287

Query: 1612 SGTKLFCISGHVNKPCTVEEEMSISLKELIERHCGGVRGGWDNLLAVIPGGSSVPLLPKH 1671
            +GTKLFCISGHVN PCTVEEEMSI L+ELI++HCGGV GGWDNL  VIPGGSSVP+LPK+
Sbjct: 288  AGTKLFCISGHVNNPCTVEEEMSIPLRELIDKHCGGVIGGWDNLKGVIPGGSSVPVLPKN 347

Query: 1672 ICDDVLMDYDALKAVQSGLGTAAVIVMDKSTDVVDAIARLSYFYKHESCGQCTPCREGTG 1731
            ICD+VLMD+D L+  +SGLGTAAVIVM+K TD++ AIARLS FYKHESCGQCTPCREG G
Sbjct: 348  ICDNVLMDFDDLRQHRSGLGTAAVIVMNKETDMIAAIARLSKFYKHESCGQCTPCREGVG 407

Query: 1732 WLWMIMERMKVGNAKLEEIDMLQEVTKQIEGHTICALGDAAAWPVQGLIRHFRPELERRI 1791
            WL+ I +R+  GNAK +EID L+E+++QIEGHTICALGDAAAWPVQGLIRHFRPE+E RI
Sbjct: 408  WLYDITDRLVTGNAKPDEIDSLEEISRQIEGHTICALGDAAAWPVQGLIRHFRPEIEDRI 467

Query: 1792 RE 1793
            ++
Sbjct: 468  KQ 469

BLAST of Moc08g00890 vs. ExPASy Swiss-Prot
Match: P25708 (NADH dehydrogenase [ubiquinone] flavoprotein 1, mitochondrial OS=Bos taurus OX=9913 GN=NDUFV1 PE=1 SV=2)

HSP 1 Score: 695.3 bits (1793), Expect = 1.9e-198
Identity = 326/437 (74.60%), Positives = 369/437 (84.44%), Query Frame = 0

Query: 1367 PERTHFGGLKDEDRIFTNLYGLNDPFLKGAMKRGDWYRTKDLVLKGADWIVNEVKKSGLR 1426
            P++T FG LKDEDRIFTNLYG +D  LKGA  RGDWY+TK+++LKG DWI+ EVK SGLR
Sbjct: 27   PKKTSFGSLKDEDRIFTNLYGRHDWRLKGAQSRGDWYKTKEILLKGPDWILGEVKTSGLR 86

Query: 1427 GRGGAGFPSGLKWSFMPKLSDGRPSYLVVNADESEPGTCKDREIMRHDPHKLLEGCLIAG 1486
            GRGGAGFP+GLKWSFM K SDGRP YLVVNADE EPGTCKDREI+RHDPHKL+EGCL+ G
Sbjct: 87   GRGGAGFPTGLKWSFMNKPSDGRPKYLVVNADEGEPGTCKDREIIRHDPHKLVEGCLVGG 146

Query: 1487 VGMRATAAYIYIRGEYVNERKNLERARKEAYEAGFLGKNACGSGYDFDVHIHFGAGAYIC 1546
              M A AAYIYIRGE+ NE  NL+ A +EAYEAG +GKNACGSGYDFDV +  GAGAYIC
Sbjct: 147  RAMGARAAYIYIRGEFYNEASNLQVAIREAYEAGLIGKNACGSGYDFDVFVVRGAGAYIC 206

Query: 1547 GEETALLESLEGKQGKPRLKPPFPANAGLYGCPTTVTNVETVAVSPTILRRGPEWFASFG 1606
            GEETAL+ES+EGKQGKPRLKPPFPA+ G++GCPTTV NVETVAVSPTI RRG  WFASFG
Sbjct: 207  GEETALIESIEGKQGKPRLKPPFPADVGVFGCPTTVANVETVAVSPTICRRGGAWFASFG 266

Query: 1607 RKNNSGTKLFCISGHVNKPCTVEEEMSISLKELIERHCGGVRGGWDNLLAVIPGGSSVPL 1666
            R+ NSGTKLF ISGHVN PCTVEEEMS+ LKELIE+H GGV GGWDNLLAVIPGGSS PL
Sbjct: 267  RERNSGTKLFNISGHVNNPCTVEEEMSVPLKELIEKHAGGVTGGWDNLLAVIPGGSSTPL 326

Query: 1667 LPKHICDDVLMDYDALKAVQSGLGTAAVIVMDKSTDVVDAIARLSYFYKHESCGQCTPCR 1726
            +PK +C+ VLMD+DAL   Q+GLGTAAVIVMD+STD+V AIARL  FYKHESCGQCTPCR
Sbjct: 327  IPKSVCETVLMDFDALIQAQTGLGTAAVIVMDRSTDIVKAIARLIEFYKHESCGQCTPCR 386

Query: 1727 EGTGWLWMIMERMKVGNAKLEEIDMLQEVTKQIEGHTICALGDAAAWPVQGLIRHFRPEL 1786
            EG  W+  +M R   G+A+  EID L E++KQIEGHTICALGD AAWPVQGLIRHFRPEL
Sbjct: 387  EGVDWMNKVMARFVRGDARPAEIDSLWEISKQIEGHTICALGDGAAWPVQGLIRHFRPEL 446

Query: 1787 ERRIRERAERELIEAAA 1804
            E R+++ A++     AA
Sbjct: 447  EERMQQFAQQHQARQAA 463

BLAST of Moc08g00890 vs. ExPASy Swiss-Prot
Match: P49821 (NADH dehydrogenase [ubiquinone] flavoprotein 1, mitochondrial OS=Homo sapiens OX=9606 GN=NDUFV1 PE=1 SV=4)

HSP 1 Score: 693.7 bits (1789), Expect = 5.4e-198
Identity = 324/437 (74.14%), Positives = 368/437 (84.21%), Query Frame = 0

Query: 1367 PERTHFGGLKDEDRIFTNLYGLNDPFLKGAMKRGDWYRTKDLVLKGADWIVNEVKKSGLR 1426
            P++T FG LKDEDRIFTNLYG +D  LKG++ RGDWY+TK+++LKG DWI+ E+K SGLR
Sbjct: 27   PKKTSFGSLKDEDRIFTNLYGRHDWRLKGSLSRGDWYKTKEILLKGPDWILGEIKTSGLR 86

Query: 1427 GRGGAGFPSGLKWSFMPKLSDGRPSYLVVNADESEPGTCKDREIMRHDPHKLLEGCLIAG 1486
            GRGGAGFP+GLKWSFM K SDGRP YLVVNADE EPGTCKDREI+RHDPHKLLEGCL+ G
Sbjct: 87   GRGGAGFPTGLKWSFMNKPSDGRPKYLVVNADEGEPGTCKDREILRHDPHKLLEGCLVGG 146

Query: 1487 VGMRATAAYIYIRGEYVNERKNLERARKEAYEAGFLGKNACGSGYDFDVHIHFGAGAYIC 1546
              M A AAYIYIRGE+ NE  NL+ A +EAYEAG +GKNACGSGYDFDV +  GAGAYIC
Sbjct: 147  RAMGARAAYIYIRGEFYNEASNLQVAIREAYEAGLIGKNACGSGYDFDVFVVRGAGAYIC 206

Query: 1547 GEETALLESLEGKQGKPRLKPPFPANAGLYGCPTTVTNVETVAVSPTILRRGPEWFASFG 1606
            GEETAL+ES+EGKQGKPRLKPPFPA+ G++GCPTTV NVETVAVSPTI RRG  WFA FG
Sbjct: 207  GEETALIESIEGKQGKPRLKPPFPADVGVFGCPTTVANVETVAVSPTICRRGGTWFAGFG 266

Query: 1607 RKNNSGTKLFCISGHVNKPCTVEEEMSISLKELIERHCGGVRGGWDNLLAVIPGGSSVPL 1666
            R+ NSGTKLF ISGHVN PCTVEEEMS+ LKELIE+H GGV GGWDNLLAVIPGGSS PL
Sbjct: 267  RERNSGTKLFNISGHVNHPCTVEEEMSVPLKELIEKHAGGVTGGWDNLLAVIPGGSSTPL 326

Query: 1667 LPKHICDDVLMDYDALKAVQSGLGTAAVIVMDKSTDVVDAIARLSYFYKHESCGQCTPCR 1726
            +PK +C+ VLMD+DAL   Q+GLGTAAVIVMD+STD+V AIARL  FYKHESCGQCTPCR
Sbjct: 327  IPKSVCETVLMDFDALVQAQTGLGTAAVIVMDRSTDIVKAIARLIEFYKHESCGQCTPCR 386

Query: 1727 EGTGWLWMIMERMKVGNAKLEEIDMLQEVTKQIEGHTICALGDAAAWPVQGLIRHFRPEL 1786
            EG  W+  +M R   G+A+  EID L E++KQIEGHTICALGD AAWPVQGLIRHFRPEL
Sbjct: 387  EGVDWMNKVMARFVRGDARPAEIDSLWEISKQIEGHTICALGDGAAWPVQGLIRHFRPEL 446

Query: 1787 ERRIRERAERELIEAAA 1804
            E R++  A++     AA
Sbjct: 447  EERMQRFAQQHQARQAA 463

BLAST of Moc08g00890 vs. ExPASy Swiss-Prot
Match: Q0MQI4 (NADH dehydrogenase [ubiquinone] flavoprotein 1, mitochondrial OS=Pongo pygmaeus OX=9600 GN=NDUFV1 PE=2 SV=1)

HSP 1 Score: 693.3 bits (1788), Expect = 7.0e-198
Identity = 324/437 (74.14%), Positives = 369/437 (84.44%), Query Frame = 0

Query: 1367 PERTHFGGLKDEDRIFTNLYGLNDPFLKGAMKRGDWYRTKDLVLKGADWIVNEVKKSGLR 1426
            P++T FG LKDEDRIFTNLYG +D  LKGA+ RGDWY+TK+++LKG DWI+ E+K SGLR
Sbjct: 27   PKKTSFGSLKDEDRIFTNLYGRHDWRLKGALSRGDWYKTKEILLKGPDWILGEIKTSGLR 86

Query: 1427 GRGGAGFPSGLKWSFMPKLSDGRPSYLVVNADESEPGTCKDREIMRHDPHKLLEGCLIAG 1486
            GRGGAGFP+GLKWSFM K SDGRP YLVVNADE EPGTCKDREI+RHDPHKL+EGCL+ G
Sbjct: 87   GRGGAGFPTGLKWSFMNKPSDGRPKYLVVNADEGEPGTCKDREIIRHDPHKLVEGCLVGG 146

Query: 1487 VGMRATAAYIYIRGEYVNERKNLERARKEAYEAGFLGKNACGSGYDFDVHIHFGAGAYIC 1546
              M A AAYIYIRGE+ NE  NL+ A +EAYEAG +GKNACGSGYDFDV +  GAGAYIC
Sbjct: 147  RAMGARAAYIYIRGEFYNEASNLQVAIREAYEAGLIGKNACGSGYDFDVFVVRGAGAYIC 206

Query: 1547 GEETALLESLEGKQGKPRLKPPFPANAGLYGCPTTVTNVETVAVSPTILRRGPEWFASFG 1606
            GEETAL+ES+EGKQGKPRLKPPFPA+ G++GCPTTV NVETVAVSPTI RRG  WFA FG
Sbjct: 207  GEETALIESIEGKQGKPRLKPPFPADVGVFGCPTTVANVETVAVSPTICRRGGTWFAGFG 266

Query: 1607 RKNNSGTKLFCISGHVNKPCTVEEEMSISLKELIERHCGGVRGGWDNLLAVIPGGSSVPL 1666
            R+ NSGTKLF ISGHVN PCTVEEEMS+ LKELIE+H GGV GGWDNLLAVIPGGSS PL
Sbjct: 267  RERNSGTKLFNISGHVNYPCTVEEEMSVPLKELIEKHAGGVTGGWDNLLAVIPGGSSTPL 326

Query: 1667 LPKHICDDVLMDYDALKAVQSGLGTAAVIVMDKSTDVVDAIARLSYFYKHESCGQCTPCR 1726
            +PK +C+ VLMD+DAL   Q+GLGTAAVIVMD+STD+V AIARL  FYKHESCGQCTPCR
Sbjct: 327  IPKSVCETVLMDFDALVQAQTGLGTAAVIVMDRSTDIVKAIARLIEFYKHESCGQCTPCR 386

Query: 1727 EGTGWLWMIMERMKVGNAKLEEIDMLQEVTKQIEGHTICALGDAAAWPVQGLIRHFRPEL 1786
            EG  W+  +M R   G+A+  EID L E++KQIEGHTICALGD AAWPVQGLIRHFRPEL
Sbjct: 387  EGVDWMNKVMARFVRGDAQPAEIDSLWEISKQIEGHTICALGDGAAWPVQGLIRHFRPEL 446

Query: 1787 ERRIRERAERELIEAAA 1804
            E R++  A++   + AA
Sbjct: 447  EERMQRFAQQHQAQQAA 463

BLAST of Moc08g00890 vs. ExPASy TrEMBL
Match: A0A6J1BQQ5 (uncharacterized protein LOC111004588 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111004588 PE=4 SV=1)

HSP 1 Score: 2100.1 bits (5440), Expect = 0.0e+00
Identity = 1119/1123 (99.64%), Positives = 1119/1123 (99.64%), Query Frame = 0

Query: 1    MIMPPSPALRSSPGRELRGSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSA 60
            MIMPPSPALRSSPGREL GSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSA
Sbjct: 1    MIMPPSPALRSSPGRELXGSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSA 60

Query: 61   EDLEDSFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDND 120
            EDLEDSFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDND
Sbjct: 61   EDLEDSFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDND 120

Query: 121  PPPVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQ 180
            PPPVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQ
Sbjct: 121  PPPVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQ 180

Query: 181  LSAPHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTT 240
            LSAPHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTT
Sbjct: 181  LSAPHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTT 240

Query: 241  SGARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRN 300
            SGARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRN
Sbjct: 241  SGARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRN 300

Query: 301  SMDLQYKYSRQLMSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNS 360
            SMDLQYKYSRQ MSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNS
Sbjct: 301  SMDLQYKYSRQXMSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNS 360

Query: 361  LSKGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLDRKSPNMFRPLLSSVPSTTF 420
            LSKGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLDRKSPNMFRPLLSSVPSTTF
Sbjct: 361  LSKGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLDRKSPNMFRPLLSSVPSTTF 420

Query: 421  YTGKVSSAHRSLISRNSSVTTSSNASSYNGASIVLDTEVSDLNQDDMANECEKVPYHDIH 480
            YTGKVSSAHRSLISRNSSVTTSSNASSYNGASIVLDTEVSDLNQDDMANECEKVPYHDIH
Sbjct: 421  YTGKVSSAHRSLISRNSSVTTSSNASSYNGASIVLDTEVSDLNQDDMANECEKVPYHDIH 480

Query: 481  EEIFAFDKMDIVNENPINDIKSLDSGPAPGCDPVLTEDSSHQTIIPEISSTFDSSRAQGN 540
            EEIFAFDKMDIVNENPINDIKSLDSGPAPGCDPVLTEDSSHQTIIPEISSTFDSSRAQGN
Sbjct: 481  EEIFAFDKMDIVNENPINDIKSLDSGPAPGCDPVLTEDSSHQTIIPEISSTFDSSRAQGN 540

Query: 541  AFSEVVCLDDIILCPRCGCRYCVIDTEENYINLCPECSRKEKYLGMTLLENMTSVTESIS 600
            AFSEVVCLDDIILCPRCGCRYCVIDTEEN INLCPECSRKEKYLGMTLLENMT VTESIS
Sbjct: 541  AFSEVVCLDDIILCPRCGCRYCVIDTEENXINLCPECSRKEKYLGMTLLENMTXVTESIS 600

Query: 601  GYSIKYEAGKPFNKVESGVISLESSLATDLGESRISESLGNVEQDQASYPEEGLSYQKEN 660
            GYSIKYEAGKPFNKVESGVISLESSLATDLGESRISESLGNVEQDQASYPEEGLSYQKEN
Sbjct: 601  GYSIKYEAGKPFNKVESGVISLESSLATDLGESRISESLGNVEQDQASYPEEGLSYQKEN 660

Query: 661  FPSETPVSESQHSLINHSEIGQLVVSGSQSNTESGYQQPLHHNDYKDLRFDSSEGAGISI 720
            FPSETPVSESQHSLINHSEIGQLVVSGSQSNTESGYQQPLHHNDYKDLRFDSSEGAGISI
Sbjct: 661  FPSETPVSESQHSLINHSEIGQLVVSGSQSNTESGYQQPLHHNDYKDLRFDSSEGAGISI 720

Query: 721  LLKRSSSSKGPIVQGRTFTTSTISYDDLSFARDSMSSLRSSVGHSSFSASSSADFSSARQ 780
            LLKRSSSSKGPIVQGRTFTTSTISYDDLSFARDSMSSLRSSVGHSSFSASSSADFSSARQ
Sbjct: 721  LLKRSSSSKGPIVQGRTFTTSTISYDDLSFARDSMSSLRSSVGHSSFSASSSADFSSARQ 780

Query: 781  IEARIQRQVSSRKGELESKKGEICVKSHISEAASSGTPTNAHPVLGFETCEQEENLDFTV 840
            IEARIQRQVSSRKGELESKKGEICVKSHISEAASSGTPTNAHPVLGFETCEQEENLDFTV
Sbjct: 781  IEARIQRQVSSRKGELESKKGEICVKSHISEAASSGTPTNAHPVLGFETCEQEENLDFTV 840

Query: 841  ANLECFSSQGTTNSSQKPELASENAESDDTSSIVVAVVEEDKFECDNRRILDTCTSESSR 900
            ANLECFSSQGTTNSSQKPELASENAESDDTSSIVVAVVEEDKFECDNRRILDTCTSESSR
Sbjct: 841  ANLECFSSQGTTNSSQKPELASENAESDDTSSIVVAVVEEDKFECDNRRILDTCTSESSR 900

Query: 901  EDLSGGRSVSDKEAPVTTSDCSKLEGHNMPDVSAFEDEQHPNHLMTTISEKETKQIAEVI 960
            EDLSGGRSVSDKEAPVTTSDCSKLEGHNMPDVSAFEDEQHPNHLMTTISEKETKQIAEVI
Sbjct: 901  EDLSGGRSVSDKEAPVTTSDCSKLEGHNMPDVSAFEDEQHPNHLMTTISEKETKQIAEVI 960

Query: 961  APGSQSDLSIISKSLLEEESMVPSGPDEDLTPPVINTEKSYGILEESTVIVDYQGRRKVV 1020
            APGSQSDLSIISKSLLEEESMVPSGPDEDLTPPVINTEKSYGILEESTVIVDYQGRRKVV
Sbjct: 961  APGSQSDLSIISKSLLEEESMVPSGPDEDLTPPVINTEKSYGILEESTVIVDYQGRRKVV 1020

Query: 1021 RSLTLEEATDTILFCSSIVHDIAYSAASIAIEKENEVTLEGSRPTVTILGKSNTDRSDLR 1080
            RSLTLEEATDTILFCSSIVHDIAYSAASIAIEKENEVTLEGSRPTVTILGKSNTDRSDLR
Sbjct: 1021 RSLTLEEATDTILFCSSIVHDIAYSAASIAIEKENEVTLEGSRPTVTILGKSNTDRSDLR 1080

Query: 1081 SRTGGKRVMKSQKLRQRHVEMSTKPPVTKTENDENTDESTIRN 1124
            SRTGGKRVMKSQKLRQRHVEMSTKPPVTKTENDENTDESTIRN
Sbjct: 1081 SRTGGKRVMKSQKLRQRHVEMSTKPPVTKTENDENTDESTIRN 1123

BLAST of Moc08g00890 vs. ExPASy TrEMBL
Match: A0A6J1BT22 (uncharacterized protein LOC111004588 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111004588 PE=4 SV=1)

HSP 1 Score: 2093.5 bits (5423), Expect = 0.0e+00
Identity = 1118/1123 (99.55%), Positives = 1118/1123 (99.55%), Query Frame = 0

Query: 1    MIMPPSPALRSSPGRELRGSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSA 60
            MIMPPSPALRSSPGREL GSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSA
Sbjct: 1    MIMPPSPALRSSPGRELXGSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSA 60

Query: 61   EDLEDSFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDND 120
            EDLEDSFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDND
Sbjct: 61   EDLEDSFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDND 120

Query: 121  PPPVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQ 180
            PPPVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQ
Sbjct: 121  PPPVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQ 180

Query: 181  LSAPHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTT 240
            LSAPHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTT
Sbjct: 181  LSAPHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTT 240

Query: 241  SGARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRN 300
            SGARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRN
Sbjct: 241  SGARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRN 300

Query: 301  SMDLQYKYSRQLMSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNS 360
            SMDLQYKYSRQ MSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNS
Sbjct: 301  SMDLQYKYSRQXMSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNS 360

Query: 361  LSKGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLDRKSPNMFRPLLSSVPSTTF 420
            LSKGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLDRKSPNMFRPLLSSVPSTTF
Sbjct: 361  LSKGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLDRKSPNMFRPLLSSVPSTTF 420

Query: 421  YTGKVSSAHRSLISRNSSVTTSSNASSYNGASIVLDTEVSDLNQDDMANECEKVPYHDIH 480
            YTGKVSSAHRSLISRNSSVTTSSNASSYNGASIVLDTEVSDLNQDDMANECEKVPYHDIH
Sbjct: 421  YTGKVSSAHRSLISRNSSVTTSSNASSYNGASIVLDTEVSDLNQDDMANECEKVPYHDIH 480

Query: 481  EEIFAFDKMDIVNENPINDIKSLDSGPAPGCDPVLTEDSSHQTIIPEISSTFDSSRAQGN 540
            EEIFAFDKMDIVNENPINDIKSLDSGPAPGCDPVLTEDSSHQTIIPEISSTFDSSRAQGN
Sbjct: 481  EEIFAFDKMDIVNENPINDIKSLDSGPAPGCDPVLTEDSSHQTIIPEISSTFDSSRAQGN 540

Query: 541  AFSEVVCLDDIILCPRCGCRYCVIDTEENYINLCPECSRKEKYLGMTLLENMTSVTESIS 600
            AFSEVVCLDDIILCPRCGCRYCVIDTEEN INLCPECSRKEKYLGMTLLENMT VTESIS
Sbjct: 541  AFSEVVCLDDIILCPRCGCRYCVIDTEENXINLCPECSRKEKYLGMTLLENMTXVTESIS 600

Query: 601  GYSIKYEAGKPFNKVESGVISLESSLATDLGESRISESLGNVEQDQASYPEEGLSYQKEN 660
            GYSIKYEAGKPFNKVESGVISLESSLATDLGESRISESLGNVEQDQASYPEEGLSYQKEN
Sbjct: 601  GYSIKYEAGKPFNKVESGVISLESSLATDLGESRISESLGNVEQDQASYPEEGLSYQKEN 660

Query: 661  FPSETPVSESQHSLINHSEIGQLVVSGSQSNTESGYQQPLHHNDYKDLRFDSSEGAGISI 720
            FPSETPVSESQHSLINHSEIGQLVVSGSQSNTESGYQQPLHHNDYKDLRFDSSEGAGISI
Sbjct: 661  FPSETPVSESQHSLINHSEIGQLVVSGSQSNTESGYQQPLHHNDYKDLRFDSSEGAGISI 720

Query: 721  LLKRSSSSKGPIVQGRTFTTSTISYDDLSFARDSMSSLRSSVGHSSFSASSSADFSSARQ 780
            LLKRSSSSKGPIVQGRTFTTSTISYDDLSFARDSMSSLRSSVGHSSFSASSSADFSSARQ
Sbjct: 721  LLKRSSSSKGPIVQGRTFTTSTISYDDLSFARDSMSSLRSSVGHSSFSASSSADFSSARQ 780

Query: 781  IEARIQRQVSSRKGELESKKGEICVKSHISEAASSGTPTNAHPVLGFETCEQEENLDFTV 840
            IEARIQRQVSSRKGELESKKGEICVKSHISEAASSGTPTNAHPVLGFETCEQEENLDFTV
Sbjct: 781  IEARIQRQVSSRKGELESKKGEICVKSHISEAASSGTPTNAHPVLGFETCEQEENLDFTV 840

Query: 841  ANLECFSSQGTTNSSQKPELASENAESDDTSSIVVAVVEEDKFECDNRRILDTCTSESSR 900
            ANLECFSSQGTTNSSQKPELASENAESDDTSSIVVAVVEEDKFECDNRRILDTCTSESSR
Sbjct: 841  ANLECFSSQGTTNSSQKPELASENAESDDTSSIVVAVVEEDKFECDNRRILDTCTSESSR 900

Query: 901  EDLSGGRSVSDKEAPVTTSDCSKLEGHNMPDVSAFEDEQHPNHLMTTISEKETKQIAEVI 960
            EDLSGGRSVSDKEAPVTTSDCSKLEGHNMPDVSAFEDEQHPNHLMTTISEKETKQIAEVI
Sbjct: 901  EDLSGGRSVSDKEAPVTTSDCSKLEGHNMPDVSAFEDEQHPNHLMTTISEKETKQIAEVI 960

Query: 961  APGSQSDLSIISKSLLEEESMVPSGPDEDLTPPVINTEKSYGILEESTVIVDYQGRRKVV 1020
            APGSQSDLSIISKSLLEEESMVPSGPDEDLTPPVINTEKSYGIL ESTVIVDYQGRRKVV
Sbjct: 961  APGSQSDLSIISKSLLEEESMVPSGPDEDLTPPVINTEKSYGIL-ESTVIVDYQGRRKVV 1020

Query: 1021 RSLTLEEATDTILFCSSIVHDIAYSAASIAIEKENEVTLEGSRPTVTILGKSNTDRSDLR 1080
            RSLTLEEATDTILFCSSIVHDIAYSAASIAIEKENEVTLEGSRPTVTILGKSNTDRSDLR
Sbjct: 1021 RSLTLEEATDTILFCSSIVHDIAYSAASIAIEKENEVTLEGSRPTVTILGKSNTDRSDLR 1080

Query: 1081 SRTGGKRVMKSQKLRQRHVEMSTKPPVTKTENDENTDESTIRN 1124
            SRTGGKRVMKSQKLRQRHVEMSTKPPVTKTENDENTDESTIRN
Sbjct: 1081 SRTGGKRVMKSQKLRQRHVEMSTKPPVTKTENDENTDESTIRN 1122

BLAST of Moc08g00890 vs. ExPASy TrEMBL
Match: A0A6J1BPZ7 (uncharacterized protein LOC111004588 isoform X3 OS=Momordica charantia OX=3673 GN=LOC111004588 PE=4 SV=1)

HSP 1 Score: 1971.1 bits (5105), Expect = 0.0e+00
Identity = 1053/1058 (99.53%), Positives = 1054/1058 (99.62%), Query Frame = 0

Query: 66   SFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDNDPPPVT 125
            S +TKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDNDPPPVT
Sbjct: 32   SAATKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDNDPPPVT 91

Query: 126  LASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQLSAPH 185
            LASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQLSAPH
Sbjct: 92   LASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQLSAPH 151

Query: 186  SSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTSGARG 245
            SSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTSGARG
Sbjct: 152  SSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTSGARG 211

Query: 246  TSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNSMDLQ 305
            TSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNSMDLQ
Sbjct: 212  TSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNSMDLQ 271

Query: 306  YKYSRQLMSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNSLSKGG 365
            YKYSRQ MSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNSLSKGG
Sbjct: 272  YKYSRQXMSPTAPISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNSLSKGG 331

Query: 366  NTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLDRKSPNMFRPLLSSVPSTTFYTGKV 425
            NTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLDRKSPNMFRPLLSSVPSTTFYTGKV
Sbjct: 332  NTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLDRKSPNMFRPLLSSVPSTTFYTGKV 391

Query: 426  SSAHRSLISRNSSVTTSSNASSYNGASIVLDTEVSDLNQDDMANECEKVPYHDIHEEIFA 485
            SSAHRSLISRNSSVTTSSNASSYNGASIVLDTEVSDLNQDDMANECEKVPYHDIHEEIFA
Sbjct: 392  SSAHRSLISRNSSVTTSSNASSYNGASIVLDTEVSDLNQDDMANECEKVPYHDIHEEIFA 451

Query: 486  FDKMDIVNENPINDIKSLDSGPAPGCDPVLTEDSSHQTIIPEISSTFDSSRAQGNAFSEV 545
            FDKMDIVNENPINDIKSLDSGPAPGCDPVLTEDSSHQTIIPEISSTFDSSRAQGNAFSEV
Sbjct: 452  FDKMDIVNENPINDIKSLDSGPAPGCDPVLTEDSSHQTIIPEISSTFDSSRAQGNAFSEV 511

Query: 546  VCLDDIILCPRCGCRYCVIDTEENYINLCPECSRKEKYLGMTLLENMTSVTESISGYSIK 605
            VCLDDIILCPRCGCRYCVIDTEEN INLCPECSRKEKYLGMTLLENMT VTESISGYSIK
Sbjct: 512  VCLDDIILCPRCGCRYCVIDTEENXINLCPECSRKEKYLGMTLLENMTXVTESISGYSIK 571

Query: 606  YEAGKPFNKVESGVISLESSLATDLGESRISESLGNVEQDQASYPEEGLSYQKENFPSET 665
            YEAGKPFNKVESGVISLESSLATDLGESRISESLGNVEQDQASYPEEGLSYQKENFPSET
Sbjct: 572  YEAGKPFNKVESGVISLESSLATDLGESRISESLGNVEQDQASYPEEGLSYQKENFPSET 631

Query: 666  PVSESQHSLINHSEIGQLVVSGSQSNTESGYQQPLHHNDYKDLRFDSSEGAGISILLKRS 725
            PVSESQHSLINHSEIGQLVVSGSQSNTESGYQQPLHHNDYKDLRFDSSEGAGISILLKRS
Sbjct: 632  PVSESQHSLINHSEIGQLVVSGSQSNTESGYQQPLHHNDYKDLRFDSSEGAGISILLKRS 691

Query: 726  SSSKGPIVQGRTFTTSTISYDDLSFARDSMSSLRSSVGHSSFSASSSADFSSARQIEARI 785
            SSSKGPIVQGRTFTTSTISYDDLSFARDSMSSLRSSVGHSSFSASSSADFSSARQIEARI
Sbjct: 692  SSSKGPIVQGRTFTTSTISYDDLSFARDSMSSLRSSVGHSSFSASSSADFSSARQIEARI 751

Query: 786  QRQVSSRKGELESKKGEICVKSHISEAASSGTPTNAHPVLGFETCEQEENLDFTVANLEC 845
            QRQVSSRKGELESKKGEICVKSHISEAASSGTPTNAHPVLGFETCEQEENLDFTVANLEC
Sbjct: 752  QRQVSSRKGELESKKGEICVKSHISEAASSGTPTNAHPVLGFETCEQEENLDFTVANLEC 811

Query: 846  FSSQGTTNSSQKPELASENAESDDTSSIVVAVVEEDKFECDNRRILDTCTSESSREDLSG 905
            FSSQGTTNSSQKPELASENAESDDTSSIVVAVVEEDKFECDNRRILDTCTSESSREDLSG
Sbjct: 812  FSSQGTTNSSQKPELASENAESDDTSSIVVAVVEEDKFECDNRRILDTCTSESSREDLSG 871

Query: 906  GRSVSDKEAPVTTSDCSKLEGHNMPDVSAFEDEQHPNHLMTTISEKETKQIAEVIAPGSQ 965
            GRSVSDKEAPVTTSDCSKLEGHNMPDVSAFEDEQHPNHLMTTISEKETKQIAEVIAPGSQ
Sbjct: 872  GRSVSDKEAPVTTSDCSKLEGHNMPDVSAFEDEQHPNHLMTTISEKETKQIAEVIAPGSQ 931

Query: 966  SDLSIISKSLLEEESMVPSGPDEDLTPPVINTEKSYGILEESTVIVDYQGRRKVVRSLTL 1025
            SDLSIISKSLLEEESMVPSGPDEDLTPPVINTEKSYGILEESTVIVDYQGRRKVVRSLTL
Sbjct: 932  SDLSIISKSLLEEESMVPSGPDEDLTPPVINTEKSYGILEESTVIVDYQGRRKVVRSLTL 991

Query: 1026 EEATDTILFCSSIVHDIAYSAASIAIEKENEVTLEGSRPTVTILGKSNTDRSDLRSRTGG 1085
            EEATDTILFCSSIVHDIAYSAASIAIEKENEVTLEGSRPTVTILGKSNTDRSDLRSRTGG
Sbjct: 992  EEATDTILFCSSIVHDIAYSAASIAIEKENEVTLEGSRPTVTILGKSNTDRSDLRSRTGG 1051

Query: 1086 KRVMKSQKLRQRHVEMSTKPPVTKTENDENTDESTIRN 1124
            KRVMKSQKLRQRHVEMSTKPPVTKTENDENTDESTIRN
Sbjct: 1052 KRVMKSQKLRQRHVEMSTKPPVTKTENDENTDESTIRN 1089

BLAST of Moc08g00890 vs. ExPASy TrEMBL
Match: A0A0A0LKP5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G350460 PE=4 SV=1)

HSP 1 Score: 1655.2 bits (4285), Expect = 0.0e+00
Identity = 922/1139 (80.95%), Positives = 990/1139 (86.92%), Query Frame = 0

Query: 1    MIMPPSPALRSSPGRELRGSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSA 60
            MIMPPSPALRSSPGRE RGSNHKRGHSFES + IREKDDDLALFNEMQTRERE FLLQSA
Sbjct: 1    MIMPPSPALRSSPGRESRGSNHKRGHSFESAVRIREKDDDLALFNEMQTREREGFLLQSA 60

Query: 61   EDLEDSFSTKLRHFSDIKLGISIPVRGENSELL-NVDGEKNDYDWLLTPPDTPLFPSLDN 120
            EDLEDSFSTKLRHFSD+KLGISIPVRGENS+LL NV+ EKNDYDWLLTPPDTPLFPSLD+
Sbjct: 61   EDLEDSFSTKLRHFSDLKLGISIPVRGENSDLLNNVEAEKNDYDWLLTPPDTPLFPSLDD 120

Query: 121  DPPPVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGR 180
            +PP V +ASRGRPRSQPISISRSSTMEKSHRSSTSRGS SPNRLSPSPRSA+SVPQ+RGR
Sbjct: 121  EPPSVAIASRGRPRSQPISISRSSTMEKSHRSSTSRGSPSPNRLSPSPRSANSVPQLRGR 180

Query: 181  QLSAPHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTST 240
            QLSAPHSSPTPSLRHATPSRRSTTP RRS PPPSTPS SV RSSTPTPRRLSTGSSGT+ 
Sbjct: 181  QLSAPHSSPTPSLRHATPSRRSTTPTRRSPPPPSTPSTSVPRSSTPTPRRLSTGSSGTAG 240

Query: 241  TSGARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASR 300
             SGARGTSPIK+VRGNSASPKIRAWQTNIPGFSSD PPNLRTSL DRPASY RGSSPASR
Sbjct: 241  ISGARGTSPIKSVRGNSASPKIRAWQTNIPGFSSDPPPNLRTSLDDRPASYVRGSSPASR 300

Query: 301  NSMDLQYKYSRQLMSPTA--PISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSL 360
            NS DL +KY RQ MSPTA   ISSSHSHDRDRYSSYSRGS ASSGDDDLDSLQSIP SSL
Sbjct: 301  NSRDLAHKYGRQSMSPTASRSISSSHSHDRDRYSSYSRGSIASSGDDDLDSLQSIPISSL 360

Query: 361  DNSLSKGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLDRKSPNMFRPLLSSVPS 420
            DNSLSKGG +FSNNKAL  SKKHRIVSS S PKRSLDSTIR LDRKSPNMFRPLLSSVPS
Sbjct: 361  DNSLSKGGISFSNNKALAFSKKHRIVSS-SAPKRSLDSTIRHLDRKSPNMFRPLLSSVPS 420

Query: 421  TTFYTGKVSSAHRSLISRNSSVTTSSNASSYNGASIVLDTEVSDLNQDDMANECEKVPYH 480
            TTFYTGK SSAHRSLISRNSSVTTSSNASS +G  I LDTE SD NQDDM NECEK+ YH
Sbjct: 421  TTFYTGKASSAHRSLISRNSSVTTSSNASSDHGTCIALDTEGSDQNQDDMVNECEKIQYH 480

Query: 481  DIHEEIFAFDKMDIVNENPINDIKSLDSGPAPGCDPVLTEDSSHQTIIPEISSTFDSSRA 540
            + HEEIFAFDKMDIV+E+PI+DIKSLDSGPA GCDPV+T DSS++ ++P+ISST DSS  
Sbjct: 481  NSHEEIFAFDKMDIVDEDPIHDIKSLDSGPALGCDPVVTGDSSYEAVVPDISSTSDSSHV 540

Query: 541  QGNAFSEVVCLDDIILCPRCGCRYCVIDTEENYINLCPECSRKEKYLGMTLLENMTSVTE 600
            QG  FSE+VCL+D ++C RCGCRY V DTEEN  NLCPECSR+EK L + + ENMT+VTE
Sbjct: 541  QGADFSEIVCLEDTVVCSRCGCRYRVTDTEENDANLCPECSREEKCLSLAISENMTAVTE 600

Query: 601  SISGY-SIKYEAGKPFNKVESGVISLESSLATDLGESRISESLGNVEQDQASYPEEGLSY 660
            S+SG  S+KYE  KPF+KVE  VIS +S+LA DLGESRIS  +GNVEQDQASYPE+G SY
Sbjct: 601  SLSGLSSVKYE-DKPFDKVELVVISPDSALANDLGESRISMFVGNVEQDQASYPEQGPSY 660

Query: 661  QKENFPSETPVSESQHSLINHSEIGQLVVSGSQSNTESGYQQPLHHNDYKDLRFDSSEGA 720
              ENFP+ETP  ESQHSLINH EIGQ  VSG+Q +T SGYQQPL  NDY+ LRFDS EGA
Sbjct: 661  -VENFPAETPSEESQHSLINHLEIGQSAVSGNQPDTGSGYQQPLQRNDYQSLRFDSPEGA 720

Query: 721  GISILLKRSSSSKGPIVQGRTFTTSTISYDDLSFARDSMSSLRSSVGHSSFSASSSADFS 780
            GISILLKRSSSSKGP+VQGRTFT STISYDDLSFARDSMSSLRSS+GHSSFSASSSADFS
Sbjct: 721  GISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFS 780

Query: 781  SARQIEARIQRQ--VSSRKGELESKKGEICVKSHISEAASSGTPTNAHPVLGFETCEQEE 840
            SARQIEAR+QRQ  +SSRKGELE+KKGEI VKSH +E ASSG P +AHP+ GFETC+Q+E
Sbjct: 781  SARQIEARMQRQLSLSSRKGELENKKGEISVKSHCAEIASSGIPASAHPISGFETCKQDE 840

Query: 841  NLDFTVANLECFSSQGTTNSSQKPELASENAESDDTSSIVVAVVEEDKFECDNRRILDTC 900
            N+DF VANLEC S QGTT SSQK ELASEN +SDDTSSI VAVVEEDKFE D  RILDTC
Sbjct: 841  NVDFYVANLECSSCQGTTTSSQKAELASENGKSDDTSSISVAVVEEDKFEYDTCRILDTC 900

Query: 901  TSESSREDLSGGRSVSDKEAPVTTSDCSKLEGHNMPDVSAFEDEQH--PNHLMTTISEKE 960
            TSE SRED SGGRSVSDK+A VT SDCSKLEGHNM     FEDE+     H M TISE E
Sbjct: 901  TSELSREDSSGGRSVSDKDASVTNSDCSKLEGHNMLG-DVFEDERSEVSTHPMITISETE 960

Query: 961  TKQIAEVIAPGSQSDLSIISKSLLEEESMVPSGPDEDLTPPVINTEKSYGILEESTVIVD 1020
              QIAEV+A GSQ D+S IS   LEEES+V SGPD+DLTP +IN EKS GILEESTVIVD
Sbjct: 961  ATQIAEVVASGSQDDISTISMIPLEEESVVLSGPDQDLTPSIINAEKSDGILEESTVIVD 1020

Query: 1021 YQGRRKVVRSLTLEEATDTILFCSSIVHDIAYSAASIAI--------EKENEVTLEGSRP 1080
            YQG+ KVVRSLTLEEATDTILFCSSIVHD+AYSAA+IAI        EKENEVTLE SRP
Sbjct: 1021 YQGKTKVVRSLTLEEATDTILFCSSIVHDLAYSAATIAIEKEKEKEKEKENEVTLEASRP 1080

Query: 1081 TVTILGKSNTDRSDLRSRTGGKRVMKSQKLRQRHVEMSTKPPVTKTENDENTDESTIRN 1124
             VTILGKSNT+RSDLR RTGGKRVMKSQK RQR VEMSTKPP+  TENDENTDESTIRN
Sbjct: 1081 MVTILGKSNTNRSDLRHRTGGKRVMKSQKPRQRRVEMSTKPPIAYTENDENTDESTIRN 1135
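
The percentages on each HSP statistics line are the matched-position count divided by the alignment length. The following is a minimal sketch of that arithmetic, using the figures reported for the A0A0A0LKP5 hit above; the function names `parse_hsp_fractions` and `percent` are hypothetical helpers for illustration and are not part of any BLAST tool.

```python
import re

# Matches fragments such as "Identity = 922/1139 (80.95%)".
# The label is captured generically, so any spelling of the label is accepted.
FRACTION = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_fractions(line):
    """Return {label: (matches, alignment_length, reported_percent)}."""
    return {
        label: (int(num), int(den), float(pct))
        for label, num, den, pct in FRACTION.findall(line)
    }


def percent(matches, alignment_length):
    """Recompute the percentage, rounded to two decimals as in the report."""
    return round(100.0 * matches / alignment_length, 2)


# Statistics line copied from the report for the Cucumis sativus hit.
line = "Identity = 922/1139 (80.95%), Positives = 990/1139 (86.92%), Query Frame = 0"
fractions = parse_hsp_fractions(line)

for label, (matches, length, reported) in fractions.items():
    # Compare the recomputed value with the value printed in the report.
    print(label, percent(matches, length), reported)
```

The alignment length (1139 here) includes gap columns, which is why it can exceed the length of either sequence.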

BLAST of Moc08g00890 vs. ExPASy TrEMBL
Match: A0A6J1HBJ8 (flocculation protein FLO11 OS=Cucurbita moschata OX=3662 GN=LOC111462594 PE=4 SV=1)

HSP 1 Score: 1648.3 bits (4267), Expect = 0.0e+00
Identity = 905/1133 (79.88%), Positives = 983/1133 (86.76%), Query Frame = 0

Query: 3    MPPSPALRSSPGRELRGSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSAED 62
            MPPSPALRSSPG E RGSNHKRGHSFESG  IREKDDDLALFNEMQTRER+ FLLQSAED
Sbjct: 1    MPPSPALRSSPGSEPRGSNHKRGHSFESGARIREKDDDLALFNEMQTRERDDFLLQSAED 60

Query: 63   LEDSFSTKLRHFSDIKLGISIPVRGENSE-LLNVDGEKNDYDWLLTPPDTPLFPSLDNDP 122
             EDSFSTKLRHF D+KLGIS+PVRGENS+ L+N + +KNDYDWLLTPPDTPLFPSLD++P
Sbjct: 61   FEDSFSTKLRHFPDLKLGISVPVRGENSDMLINAETDKNDYDWLLTPPDTPLFPSLDDEP 120

Query: 123  PPVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQL 182
            PPVT+ASRGRPRSQPISISRSSTMEKSHRSSTSRGS SPNRLSPSPRSA+SVPQ+RGRQL
Sbjct: 121  PPVTIASRGRPRSQPISISRSSTMEKSHRSSTSRGSPSPNRLSPSPRSANSVPQLRGRQL 180

Query: 183  SAPHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTS 242
            SAPHSSPTPSLRHATPSRRSTTP RRSSPPPS PS SV RSSTPTPRRLSTGSSG +  S
Sbjct: 181  SAPHSSPTPSLRHATPSRRSTTPTRRSSPPPSMPSTSVPRSSTPTPRRLSTGSSGAAVIS 240

Query: 243  GARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNS 302
            G RGTSP+K+VRGNSASPKIRAWQTNIPGFSS+ PPNLRTSLADRPASY RGSSPASRNS
Sbjct: 241  GTRGTSPVKSVRGNSASPKIRAWQTNIPGFSSEPPPNLRTSLADRPASYVRGSSPASRNS 300

Query: 303  MDLQYKYSRQLMSPTA--PISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDN 362
             DL +KY RQ MSPTA   I+S HSHDRD YSSYSRGS ASSGDDDLDSLQS+P S+LDN
Sbjct: 301  RDLAHKYGRQSMSPTASRSITSPHSHDRDHYSSYSRGSIASSGDDDLDSLQSMPISTLDN 360

Query: 363  SLSKGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLDRKSPNMFRPLLSSVPSTT 422
            SLSKGG + SNNKAL +SKKHRIVSS+S PKRSLDSTIRQLDRKSPNMFRPLLSSVPSTT
Sbjct: 361  SLSKGGISLSNNKALALSKKHRIVSSSSAPKRSLDSTIRQLDRKSPNMFRPLLSSVPSTT 420

Query: 423  FYTGKVSSAHRSLISRNSSVTTSSNASSYNGASIVLDTEVSDLNQDDMANECEKVPYHDI 482
            FYTGK SSAHR LISRNSSVTTSSNASS +G  I LDTE SD NQ+D  NECEK+PYHD 
Sbjct: 421  FYTGKASSAHR-LISRNSSVTTSSNASSDHGTCIALDTEGSDHNQNDTTNECEKMPYHDS 480

Query: 483  HEEIFAFDKMDIVNENPINDIKSLDS--GPAPGCDPVLTEDSSHQTIIPEISSTFDSSRA 542
            HEEIFAFDKMDIV+E+P + IKSLDS  GPA GCDPV+T DSS++ +IP+I ST DSS  
Sbjct: 481  HEEIFAFDKMDIVDEDPFHVIKSLDSGRGPALGCDPVVTGDSSYEAVIPDIISTSDSSHV 540

Query: 543  QGNAFSEVVCLDDIILCPRCGCRYCVIDTEENYINLCPECSRKEKYLGMTLLENMTSVTE 602
            QG  FSEVVCL+D  +C RCGCRY VID+EEN +N CPECSR+EK +GM +  N TSVTE
Sbjct: 541  QGGDFSEVVCLEDTFVCSRCGCRYRVIDSEENTLNCCPECSREEKDIGMAISNNTTSVTE 600

Query: 603  SISGY-SIKYEAGKPFNKVESGVISLESSLATDLGESRISESLGNVEQDQASYPEEGLSY 662
            S+SG  S+KYEA KPFN+V+S VIS +SSLATD GESRIS S+GN+EQDQAS+PE+G SY
Sbjct: 601  SLSGLSSVKYEADKPFNRVDSLVISPDSSLATDFGESRISMSVGNIEQDQASFPEQGPSY 660

Query: 663  QKENFPSETPVSESQHSLINHSEIGQLVVSGSQSNTESGYQQPLHHNDYKDLRFDSSEGA 722
             +ENFPSETPV ESQHSL NH E+GQL V+GSQ NTESG QQPL HNDY+ LRFDSSEGA
Sbjct: 661  LEENFPSETPVEESQHSLTNHLEMGQLAVNGSQPNTESGCQQPLQHNDYQTLRFDSSEGA 720

Query: 723  GISILLKRSSSSKGPIVQGRTFTTSTISYDDLSFARDSMSSLRSSVGHSSFSASSSADFS 782
            GISILLKRSSSSKGP+VQGRTFT STISYDDLSFARDSMSSLRSS+GHSSFSASSSADFS
Sbjct: 721  GISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFS 780

Query: 783  SARQIEARIQRQVSSRKGELESKKGEICVKSHISEAASSGTPTNAHPVLGFETCEQEENL 842
            S+RQIE R+QRQ+SSRKG+LE+KK E+ VKSH SE AS+GTP NAHP+  FETC+QEEN+
Sbjct: 781  SSRQIEGRMQRQLSSRKGDLENKKCEVSVKSHCSEVASTGTPANAHPISSFETCKQEENV 840

Query: 843  DFTVANLECFSSQGTTNSSQKPELASENAESDDTSSIVVAVVEEDKFECDNRRILDTCTS 902
            DF VA LECFSSQGTT SS KPELASENAESDD SSIV A VEEDK ECD  R LD CTS
Sbjct: 841  DFYVATLECFSSQGTTMSSHKPELASENAESDDASSIVAAAVEEDKLECDKCRRLDNCTS 900

Query: 903  ESSREDLSGGRSVSDKEAPVTTSDCSKLEGHNMPDVSAFEDE--QHPNHLMTTISEKETK 962
             SSRED SGGRSVSDK+A VTT DCS+LEGHN+ D   FEDE  + P H MTTISE E  
Sbjct: 901  GSSREDTSGGRSVSDKDASVTTFDCSRLEGHNILDGDVFEDEHTELPTHPMTTISETEAA 960

Query: 963  QIAEVIAPGSQSDLSIISKSLLEEESMVPSGPDEDLTPPVINTEKSYGILEESTVIVDYQ 1022
            QIAEVI PGSQ+DLSII  S+  EES VPSGPD+DL P VINTEKS GILE STVIVDYQ
Sbjct: 961  QIAEVIGPGSQNDLSII-PSIPLEESAVPSGPDQDLAPSVINTEKSDGILERSTVIVDYQ 1020

Query: 1023 GRRKVVRSLTLEEATDTILFCSSIVHDIAYSAASIAI----EKENEVTLEGSRPTVTILG 1082
            GR KV RSLTLEEATDTILFCSSIVHD+AYSAA+IAI    EKENEVTLE SRP VTILG
Sbjct: 1021 GRTKVGRSLTLEEATDTILFCSSIVHDLAYSAATIAIEKEKEKENEVTLEASRPMVTILG 1080

Query: 1083 KSNTDRSDLRSRTGGKRVMKSQKLRQRHVEMSTKPPVTKTENDENTDESTIRN 1124
            KS  +R DLR RTGGKRVMKSQK RQR VEMSTKPP+ KTENDENTDESTI+N
Sbjct: 1081 KSYPNRGDLRHRTGGKRVMKSQKPRQRRVEMSTKPPIAKTENDENTDESTIQN 1131

BLAST of Moc08g00890 vs. TAIR 10
Match: AT5G08530.1 (51 kDa subunit of complex I )

HSP 1 Score: 923.3 bits (2385), Expect = 3.0e-268
Identity = 438/486 (90.12%), Positives = 464/486 (95.47%), Query Frame = 0

Query: 1318 MAPARGILRLQRTALARSYTDRWGIGLRAFSNQGAAATSSPQPPSPPPPPERTHFGGLKD 1377
            MAP RGIL LQR       ++R    LR+FS Q A+ +++PQPP PPPPPE+THFGGLKD
Sbjct: 1    MAPVRGILGLQRAVSIWKESNRLTPALRSFSTQAASTSTTPQPPPPPPPPEKTHFGGLKD 60

Query: 1378 EDRIFTNLYGLNDPFLKGAMKRGDWYRTKDLVLKGADWIVNEVKKSGLRGRGGAGFPSGL 1437
            EDRIFTNLYGL+DPFLKGAMKRGDW+RTKDLVLKG DWIVNE+KKSGLRGRGGAGFPSGL
Sbjct: 61   EDRIFTNLYGLHDPFLKGAMKRGDWHRTKDLVLKGTDWIVNEMKKSGLRGRGGAGFPSGL 120

Query: 1438 KWSFMPKLSDGRPSYLVVNADESEPGTCKDREIMRHDPHKLLEGCLIAGVGMRATAAYIY 1497
            KWSFMPK+SDGRPSYLVVNADESEPGTCKDREIMRHDPHKLLEGCLIAGVGMRA+AAYIY
Sbjct: 121  KWSFMPKVSDGRPSYLVVNADESEPGTCKDREIMRHDPHKLLEGCLIAGVGMRASAAYIY 180

Query: 1498 IRGEYVNERKNLERARKEAYEAGFLGKNACGSGYDFDVHIHFGAGAYICGEETALLESLE 1557
            IRGEYVNER NLE+AR+EAY AG LGKNACGSGYDF+V+IHFGAGAYICGEETALLESLE
Sbjct: 181  IRGEYVNERLNLEKARREAYAAGLLGKNACGSGYDFEVYIHFGAGAYICGEETALLESLE 240

Query: 1558 GKQGKPRLKPPFPANAGLYGCPTTVTNVETVAVSPTILRRGPEWFASFGRKNNSGTKLFC 1617
            GKQGKPRLKPPFPANAGLYGCPTTVTNVETVAVSPTILRRGPEWF+SFGRKNN+GTKLFC
Sbjct: 241  GKQGKPRLKPPFPANAGLYGCPTTVTNVETVAVSPTILRRGPEWFSSFGRKNNAGTKLFC 300

Query: 1618 ISGHVNKPCTVEEEMSISLKELIERHCGGVRGGWDNLLAVIPGGSSVPLLPKHICDDVLM 1677
            ISGHVNKPCTVEEEMSI LKELIERHCGGVRGGWDNLLA+IPGGSSVPL+PK+IC+DVLM
Sbjct: 301  ISGHVNKPCTVEEEMSIPLKELIERHCGGVRGGWDNLLAIIPGGSSVPLIPKNICEDVLM 360

Query: 1678 DYDALKAVQSGLGTAAVIVMDKSTDVVDAIARLSYFYKHESCGQCTPCREGTGWLWMIME 1737
            D+DALKAVQSGLGTAAVIVMDKSTDVVDAIARLSYFYKHESCGQCTPCREGTGWLWMIME
Sbjct: 361  DFDALKAVQSGLGTAAVIVMDKSTDVVDAIARLSYFYKHESCGQCTPCREGTGWLWMIME 420

Query: 1738 RMKVGNAKLEEIDMLQEVTKQIEGHTICALGDAAAWPVQGLIRHFRPELERRIRERAERE 1797
            RMKVGNAKLEEIDMLQEVTKQIEGHTICALGDAAAWPVQGLIRHFRPELERRIRERAERE
Sbjct: 421  RMKVGNAKLEEIDMLQEVTKQIEGHTICALGDAAAWPVQGLIRHFRPELERRIRERAERE 480

Query: 1798 LIEAAA 1804
            L++AAA
Sbjct: 481  LLQAAA 486

BLAST of Moc08g00890 vs. TAIR 10
Match: AT1G27850.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G40070.1); Has 9215 Blast hits to 5316 proteins in 473 species: Archae - 6; Bacteria - 773; Metazoa - 3392; Fungi - 1710; Plants - 539; Viruses - 143; Other Eukaryotes - 2652 (source: NCBI BLink). )

HSP 1 Score: 607.1 bits (1564), Expect = 4.7e-173
Identity = 473/1176 (40.22%), Positives = 656/1176 (55.78%), Query Frame = 0

Query: 3    MPPSPALRSSPGRELRGSNHKRGHSFESGMCIREKDDDLALFNEMQTRERESFLLQSAED 62
            MPPSPALR SPGREL G  H+RGHS E G+  R+KDDDLALF+EMQ +ER+SFLLQS++D
Sbjct: 1    MPPSPALRCSPGRELPGKKHRRGHSIEYGILFRDKDDDLALFSEMQDKERDSFLLQSSDD 60

Query: 63   LEDSFSTKLRHFSDIKLGISIPVRGENSELLNVDGEKNDYDWLLTPPDTPLFPSLDNDPP 122
            LED FSTKL+HFS+     +IPV+GE+S LL  +G+KNDYDWLLTPPDTPLFPSLD+ PP
Sbjct: 61   LEDVFSTKLKHFSE----FTIPVQGESSRLLTAEGDKNDYDWLLTPPDTPLFPSLDDQPP 120

Query: 123  PVTLASRGRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSPRSASSVPQMRGRQLS 182
              ++  RGRP+SQ IS+SRSSTMEKS RS  S+GSASPNRLS SPR A ++ Q+RGR  S
Sbjct: 121  AASVVRRGRPQSQ-ISLSRSSTMEKSRRS--SKGSASPNRLSTSPR-ADNMQQIRGRPSS 180

Query: 183  APHSSPTPSLRHATPSRRSTTPARRSSPPPSTPSISVTRSSTPTPRRLSTGSSGTSTTSG 242
            A H SP          RRS TP RR SP P  PS  V+RS TPT RR+STGS+ T  +  
Sbjct: 181  ARHPSP-------ASGRRSGTPVRRISPTPGKPSGPVSRSPTPTSRRMSTGST-TMASPA 240

Query: 243  ARGTSPIKAVRGNSASPKIRAWQTNIPGFSSDAPPNLRTSLADRPASYTRGSSPASRNSM 302
             RGTSP+ + RGNS SPKI+ WQ+NIPGFS DAPPNLRTSL DRPASY RGSSPASRN  
Sbjct: 241  VRGTSPVSSSRGNSPSPKIKVWQSNIPGFSLDAPPNLRTSLGDRPASYVRGSSPASRNGR 300

Query: 303  DLQYKYSRQLMSPTA--PISSSHSHDRDRYSSYSRGSKASSGDDDLDSLQSIPTSSLDNS 362
            D     SR+ +SP+A   +SSSHSH+RDR+SS S+GS ASSGDDDL SLQSIP    + +
Sbjct: 301  DAVSTRSRKSVSPSASRSVSSSHSHERDRFSSQSKGSVASSGDDDLHSLQSIPVGGSERA 360

Query: 363  LSKGGNTFSNNKALTISKKHRIVSSTSTPKRSLDSTIRQLD--RKSPNMFRPLLSSVPST 422
            +SK  +   N++    S+  +++S  S P+R  +S +RQ++  +   +MFRPL SS+PST
Sbjct: 361  VSKRASLSPNSRT---SRSSKLLSPGSAPRRPFESALRQMEHPKSHHSMFRPLASSLPST 420

Query: 423  TFYTGKVSSAHRSLISRNSSVTTSSNASSYNGASIVLDTEVSDLNQDDMANECEKVPYHD 482
              Y+GK SS++  ++ R+S+ T  SN+SS      + D +  D       +E E + Y D
Sbjct: 421  GIYSGKGSSSYHHIMLRHSTATVGSNSSSGQVTGFMPDAKGMD-PVPVFQSEVENLAYPD 480

Query: 483  IHEEIFAFDKMDIVNENPIND---------IKSLDSGPAPGCDPVLTEDSSHQTIIPEIS 542
             HEE  AF  +++ NE+  ++         +  +D      C+    E+ SHQ    E S
Sbjct: 481  KHEESIAFGMVNLSNESSRHESHESSFSDQLGDMDQDYTVECESSANEEVSHQVFDVENS 540

Query: 543  STFDSSRAQGNAFSEVVCLDDIILCPRCGCRYCVIDTEENYINLCPECSRKEKYLGM-TL 602
            ST  S    GN F E V L+ + +C RCG  YC  +   + IN+CPEC  +  ++   + 
Sbjct: 541  STHGSLHV-GNEFLEGVALETMEVCGRCGSHYCATEATRSEINICPECREEHSFVETDSP 600

Query: 603  LENMTSVTESISGYSIKYEAGKPFNKVESGVISLESSLATDLGESRISESLGNVEQDQAS 662
              N   ++++I   +  Y    P       VI +  SL   + E  I E+   +EQ   S
Sbjct: 601  GTNSPKLSQTIFDENKLYFENIP-------VIDVLDSLPVVMVEEEILETPEKIEQCDNS 660

Query: 663  YPEEGLSYQKENFPSETPVSESQHSLINHSEIGQLVVSGSQSNTESG-------YQQPLH 722
            Y +E   Y          + E    ++N+ +       G+QS+T  G         Q   
Sbjct: 661  YEQE--QYHLYESSISRALEEQNVDMLNYKD-------GTQSSTGCGPLSIGTKDTQTQL 720

Query: 723  HNDYKDLRFDSSEGAGISILLKRSSSSKGPIVQGRTFTTSTISYDDLSFARDSMSSLRSS 782
             + + D+   S     + +++KRS S K P++Q    +  T SY+  S++RD   SLRSS
Sbjct: 721  SDKHHDVNIGSLGRGDVPLVIKRSVSMKSPVIQANNSSCFTRSYEGFSYSRDRSISLRSS 780

Query: 783  VGHSSFSASSSADFSSARQIEARIQRQVSSRKGELESKKGEICVKSHISEAASSGTPTNA 842
                + SASSS D+ S+ +  + I RQ S    +LE+ + +   KS  + ++SSG  ++ 
Sbjct: 781  T--ETASASSSWDYGSSIRKGSHI-RQRSGSTLDLETHRYDTNSKSLSTMSSSSGMSSHT 840

Query: 843  HPVLGFETCEQEENLDFTVANLECFSSQGTTNSSQKPELASENAESDDTSSIVVAVVEE- 902
               L       E++ +   A + C   +    S  +P    +N E  +T+ +    VE  
Sbjct: 841  FQAL---NVMPEDSFEMCAAQMTCTLDETHQESHTEP----QNLECKETNVMNADFVESV 900

Query: 903  ---------------------DKFECDN-RRILDTCTSE-SSREDLSGGRSVSDKEAPVT 962
                                 D   C+N   + +T  S+  +RE  +  RS SD  A   
Sbjct: 901  GLVRISANVLGDLAEHNPVVMDDECCENGNDVANTVISKGETRESPAHIRSTSDLGASPI 960

Query: 963  TSDCSKLEGHNMPDVSAFEDEQHPNHLMTTISEKETKQIAEVIAPG-------SQSDLSI 1022
            T DC   +   + +    E    P+ L TT + +   + +E   PG        +S+ ++
Sbjct: 961  TDDCPFNDHSRLQENDVNET---PHGLSTTTASEIEPESSEPEIPGLGVHDEIPESERNL 1020

Query: 1023 ISKSLLEEESMVPSGPDEDLTPPVINTEKSYGILEESTVIVDYQGRRKVVRSLTLEEATD 1082
             +     E+SMV +  D   +   +N      IL+ESTV+V+  G  K  RSLTLEEATD
Sbjct: 1021 NAVDDCSEKSMVHASVDHHSSSAPVNE-----ILDESTVLVECPG-GKEPRSLTLEEATD 1080

Query: 1083 TILFCSSIVHDIAYSAASIAIEKENEVTLEGS--RPTVTILGKSNTDRSDLRSRTG---G 1122
            TILFCSSIVHD+ Y AA+IA++K  +V  E     PTVT+LGKSN +R+      G    
Sbjct: 1081 TILFCSSIVHDLVYQAATIAMDKAKDVPAEEEMLHPTVTVLGKSNANRNSYGLGGGTKAK 1119

BLAST of Moc08g00890 vs. TAIR 10
Match: AT2G40070.1 (BEST Arabidopsis thaliana protein match is: proline-rich family protein (TAIR:AT3G09000.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 98.2 bits (243), Expect = 7.1e-20
Identity = 162/549 (29.51%), Positives = 243/549 (44.26%), Query Frame = 0

Query: 36  EKDDDLALFNEMQTRERES---FLLQSAEDLEDSFSTK--LRHFSDIKLGISIPVRGENS 95
           EKD++L+LF EM+ RE+E     L  + ++ E    +K       +I  G     +    
Sbjct: 30  EKDEELSLFLEMRRREKEQDNLLLNNNPDEFETPLGSKHGTSPVFNISSGAPPSRKAAPD 89

Query: 96  ELLNVDGEKNDYDWLLTPPDTPLFPSL-------------DNDPPPVTLASR-------- 155
           + LN +G+KNDY+WLLTPP TPLFPSL             D+   P TL SR        
Sbjct: 90  DFLNSEGDKNDYEWLLTPPGTPLFPSLEMESHRTMMSQTGDSKSRPATLTSRLANSSTES 149

Query: 156 ------------------------------GRPRSQPIS-ISRSSTMEKSHRSS-----T 215
                                         G P S+P +   RSST+  + +SS     T
Sbjct: 150 AARNHLTSRQQTSSPGLSSSSGASRRPSSSGGPGSRPATPTGRSSTLTANSKSSRPSTPT 209

Query: 216 SRGSAS-------PNRLSPSPRSASSVPQMRGRQLSAPHSSPTP------------SLRH 275
           SR + S        N  S    +    P  R   LS+   +PT             S+  
Sbjct: 210 SRATVSSATRPSLTNSRSTVSATTKPTPMSRSTSLSSSRLTPTASKPTTSTARSAGSVTR 269

Query: 276 ATPS--------RRSTTPARRSSPPPST--------PSISVTRSSTPTPRRLSTGSSGTS 335
           +TPS         RSTTP  RS+   ST        PS +++RSSTPT R +++ S+ T+
Sbjct: 270 STPSTTTKSAGPSRSTTPLSRSTARSSTPTSRPTLPPSKTISRSSTPTRRPIASASAATT 329

Query: 336 TT----SGARGTSPIKA----------VRGNSASPKIRA--WQ-TNIPGFSSDAPPNLRT 395
           T     S  + +SP  A              +ASP +R+  W+ +++PGFS + PPNLRT
Sbjct: 330 TANPTISQIKPSSPAPAKPMPTPSKNPALSRAASPTVRSRPWKPSDMPGFSLETPPNLRT 389

Query: 396 SLADRPASYTRG--SSPASRNSM-----DLQYKYSRQLMSPT---APISSSHSHDRDRYS 448
           +L +RP S TRG   +P+SR+           +  RQ  SP+   AP+ SS S       
Sbjct: 390 TLPERPLSATRGRPGAPSSRSGSVEPGGPPGGRPRRQSCSPSRGRAPMYSSGSSVPAVNR 449

BLAST of Moc08g00890 vs. TAIR 10
Match: AT3G09000.1 (proline-rich family protein )

HSP 1 Score: 86.3 bits (212), Expect = 2.8e-16
Identity = 157/536 (29.29%), Positives = 232/536 (43.28%), Query Frame = 0

Query: 32  MCIREKDDDLALFNEMQTRERE---SFLLQSAED------LEDSFSTKLRHFSDIKLGIS 91
           M   ++D++L+LF EM+ RE+E     LL  +++      L  + +  L   S+      
Sbjct: 1   MLTHDRDEELSLFLEMRRREKEHRADSLLTGSDNVSINATLTAAAAAALSGVSETASSQR 60

Query: 92  IPVRGENSE-LLNVDGEKNDYDWLLTPPDTPLFPS-------LDNDPP---PVTLASR-- 151
            P+R   +E  L  + EK+DYDWLLTPP TP F           +D P   P  L SR  
Sbjct: 61  YPLRRTAAENFLYSENEKSDYDWLLTPPGTPQFEKESHRSVMNQHDAPNSRPTVLKSRLG 120

Query: 152 -----------GRPRSQPISISRSSTMEKSHRSSTSRGSASPNRLSPSP----------- 211
                       +P++   S++       S  S ++   A+P R S +P           
Sbjct: 121 NCREDIVSGNNNKPQTSSSSVAGLRRPSSSGSSRSTSRPATPTRRSTTPTTSTSRPVTTR 180

Query: 212 --RSASSVPQMRGRQLSAPHSSPTPSLRHATP---SRRSTTPARRSSPPPSTPSIS--VT 271
              S SS P  R    +A  ++ T + R  T    S RS TP  RS+P PS+ S    V+
Sbjct: 181 ASNSRSSTPTSRATLTAARATTSTAAPRTTTTSSGSARSATPT-RSNPRPSSASSKKPVS 240

Query: 272 RSSTPTPR-RLSTGSSGTSTTSGARGTSPIKAV--------RGNSASPKI---RAWQ-TN 331
           R +TPT R    TG S  S+ + +RGTSP   V        RG S SP +   R W+   
Sbjct: 241 RPATPTRRPSTPTGPSIVSSKAPSRGTSPSPTVNSLSKAPSRGTSPSPTLNSSRPWKPPE 300

Query: 332 IPGFSSDAPPNLRTSLADRPASYTRG-----SSPASRNSMDLQYKYSRQLMSPTAPISSS 391
           +PGFS +APPNLRT+LADRP S +RG     S+P SR         S  +     P S  
Sbjct: 301 MPGFSLEAPPNLRTTLADRPVSASRGRPGVASAPGSR---------SGSIERGGGPTSGG 360

Query: 392 HSHDRDRYSSYSRG-----------------SKASSGDDDLDSLQSIPTSS-----LDNS 451
             + R +  S SRG                 +KAS+G    D+L  +   +     + N 
Sbjct: 361 SGNARRQSCSPSRGRAPIGNTNGSLTGVRGRAKASNGGSGCDNLSPVAMGNKMVERVVNM 420

Query: 452 LSKGGNTFSNNKALTISKKHRIVSS----TSTPKRSLDSTIRQLD--RKSPNMFRPLLSS 471
              G    + N      K     +S     +  K S+D  IR +D  R      RPL++ 
Sbjct: 421 RKLGPPRLTENGGRGSGKSSSAFNSLGYGRNLSKSSIDMAIRHMDIRRGMTGNLRPLVTK 480

BLAST of Moc08g00890 vs. TAIR 10
Match: AT2G40070.2 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 7 growth stages; BEST Arabidopsis thaliana protein match is: proline-rich family protein (TAIR:AT3G09000.1); Has 108635 Blast hits to 60786 proteins in 2176 species: Archae - 287; Bacteria - 15142; Metazoa - 39415; Fungi - 26849; Plants - 4416; Viruses - 2864; Other Eukaryotes - 19662 (source: NCBI BLink). )

HSP 1 Score: 85.1 bits (209), Expect = 6.2e-16
Identity = 156/536 (29.10%), Positives = 235/536 (43.84%), Query Frame = 0

Query: 50  RERESFLLQSAEDLEDSFSTKL--RHFS----DIKLGISIPVRGENSELLNVDGEKNDYD 109
           +E+++ LL +     D F T L  +H +    +I  G     +    + LN +G+KNDY+
Sbjct: 6   KEQDNLLLNNN---PDEFETPLGSKHGTSPVFNISSGAPPSRKAAPDDFLNSEGDKNDYE 65

Query: 110 WLLTPPDTPLFPSL-------------DNDPPPVTLASR--------------------- 169
           WLLTPP TPLFPSL             D+   P TL SR                     
Sbjct: 66  WLLTPPGTPLFPSLEMESHRTMMSQTGDSKSRPATLTSRLANSSTESAARNHLTSRQQTS 125

Query: 170 -----------------GRPRSQPIS-ISRSSTMEKSHRSS-----TSRGSAS------- 229
                            G P S+P +   RSST+  + +SS     TSR + S       
Sbjct: 126 SPGLSSSSGASRRPSSSGGPGSRPATPTGRSSTLTANSKSSRPSTPTSRATVSSATRPSL 185

Query: 230 PNRLSPSPRSASSVPQMRGRQLSAPHSSPTP------------SLRHATPS--------R 289
            N  S    +    P  R   LS+   +PT             S+  +TPS         
Sbjct: 186 TNSRSTVSATTKPTPMSRSTSLSSSRLTPTASKPTTSTARSAGSVTRSTPSTTTKSAGPS 245

Query: 290 RSTTPARRSSPPPST--------PSISVTRSSTPTPRRLSTGSSGTSTT----SGARGTS 349
           RSTTP  RS+   ST        PS +++RSSTPT R +++ S+ T+T     S  + +S
Sbjct: 246 RSTTPLSRSTARSSTPTSRPTLPPSKTISRSSTPTRRPIASASAATTTANPTISQIKPSS 305

Query: 350 PIKA----------VRGNSASPKIRA--WQ-TNIPGFSSDAPPNLRTSLADRPASYTRG- 409
           P  A              +ASP +R+  W+ +++PGFS + PPNLRT+L +RP S TRG 
Sbjct: 306 PAPAKPMPTPSKNPALSRAASPTVRSRPWKPSDMPGFSLETPPNLRTTLPERPLSATRGR 365

Query: 410 -SSPASRNSM-----DLQYKYSRQLMSPT---APISSSHSHDRDRYSSYSRGS------- 448
             +P+SR+           +  RQ  SP+   AP+ SS S        YS+ S       
Sbjct: 366 PGAPSSRSGSVEPGGPPGGRPRRQSCSPSRGRAPMYSSGSSVPAVNRGYSKASDNVSPVM 425

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_022131331.1 | 0.0e+00 | 99.64 | uncharacterized protein LOC111004588 isoform X1 [Momordica charantia] >XP_022131...
XP_022131333.1 | 0.0e+00 | 99.55 | uncharacterized protein LOC111004588 isoform X2 [Momordica charantia]
XP_022131334.1 | 0.0e+00 | 99.53 | uncharacterized protein LOC111004588 isoform X3 [Momordica charantia]
XP_031737323.1 | 0.0e+00 | 80.95 | serine/arginine repetitive matrix protein 2 isoform X1 [Cucumis sativus] >KGN623...
XP_031737324.1 | 0.0e+00 | 80.86 | serine/arginine repetitive matrix protein 2 isoform X2 [Cucumis sativus]

Match Name | E-value | Identity | Description
Q9FNN5 | 4.2e-267 | 90.12 | NADH dehydrogenase [ubiquinone] flavoprotein 1, mitochondrial OS=Arabidopsis tha...
Q54I90 | 7.5e-200 | 76.54 | NADH dehydrogenase [ubiquinone] flavoprotein 1, mitochondrial OS=Dictyostelium d...
P25708 | 1.9e-198 | 74.60 | NADH dehydrogenase [ubiquinone] flavoprotein 1, mitochondrial OS=Bos taurus OX=9...
P49821 | 5.4e-198 | 74.14 | NADH dehydrogenase [ubiquinone] flavoprotein 1, mitochondrial OS=Homo sapiens OX...
Q0MQI4 | 7.0e-198 | 74.14 | NADH dehydrogenase [ubiquinone] flavoprotein 1, mitochondrial OS=Pongo pygmaeus ...

Match Name | E-value | Identity | Description
A0A6J1BQQ5 | 0.0e+00 | 99.64 | uncharacterized protein LOC111004588 isoform X1 OS=Momordica charantia OX=3673 G...
A0A6J1BT22 | 0.0e+00 | 99.55 | uncharacterized protein LOC111004588 isoform X2 OS=Momordica charantia OX=3673 G...
A0A6J1BPZ7 | 0.0e+00 | 99.53 | uncharacterized protein LOC111004588 isoform X3 OS=Momordica charantia OX=3673 G...
A0A0A0LKP5 | 0.0e+00 | 80.95 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G350460 PE=4 SV=1
A0A6J1HBJ8 | 0.0e+00 | 79.88 | flocculation protein FLO11 OS=Cucurbita moschata OX=3662 GN=LOC111462594 PE=4 SV...

Match Name | E-value | Identity | Description
AT5G08530.1 | 3.0e-268 | 90.12 | 51 kDa subunit of complex I
AT1G27850.1 | 4.7e-173 | 40.22 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G40070.1 | 7.1e-20 | 29.51 | BEST Arabidopsis thaliana protein match is: proline-rich family protein (TAIR:AT...
AT3G09000.1 | 2.8e-16 | 29.29 | proline-rich family protein
AT2G40070.2 | 6.2e-16 | 29.10 | FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...

InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 1499..1519
None | No IPR available | GENE3D | 3.10.20.600 |  | coord: 1605..1701; e-value: 3.3E-35; score: 122.2
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 131..246
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..31
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 259..301
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 334..372
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 16..31
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1351..1374
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 315..372
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1097..1120
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 896..939
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 849..871
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 100..301
None | No IPR available | PANTHER | PTHR31949:SF3 | RUN/FYVE DOMAIN PROTEIN | coord: 3..1124
None | No IPR available | PANTHER | PTHR31949 | GASTRIC MUCIN-LIKE PROTEIN | coord: 3..1124
None | No IPR available | SUPERFAMILY | 142984 | Nqo1 middle domain-like | coord: 1614..1699
IPR019575 | NADH-ubiquinone oxidoreductase 51kDa subunit, iron-sulphur binding domain | SMART | SM00928 | NADH_4Fe_4S_2 | coord: 1704..1749; e-value: 8.3E-25; score: 98.4
IPR019575 | NADH-ubiquinone oxidoreductase 51kDa subunit, iron-sulphur binding domain | PFAM | PF10589 | NADH_4Fe-4S | coord: 1706..1788; e-value: 5.9E-28; score: 96.6
IPR037207 | NADH-ubiquinone oxidoreductase 51kDa subunit, iron-sulphur binding domain superfamily | GENE3D | 1.20.1440.230 |  | coord: 1702..1797; e-value: 1.3E-36; score: 126.4
IPR037207 | NADH-ubiquinone oxidoreductase 51kDa subunit, iron-sulphur binding domain superfamily | SUPERFAMILY | 140490 | Nqo1C-terminal domain-like | coord: 1700..1793
IPR011537 | NADH ubiquinone oxidoreductase, F subunit | TIGRFAM | TIGR01959 | TIGR01959 | coord: 1381..1790; e-value: 2.4E-190; score: 630.2
IPR011538 | NADH-ubiquinone oxidoreductase 51kDa subunit, FMN-binding domain | PFAM | PF01512 | Complex1_51K | coord: 1420..1589; e-value: 1.0E-45; score: 155.4
IPR037225 | NADH-ubiquinone oxidoreductase 51kDa subunit, FMN-binding domain superfamily | GENE3D | 3.40.50.11540 |  | coord: 1425..1604; e-value: 1.6E-83; score: 280.8
IPR037225 | NADH-ubiquinone oxidoreductase 51kDa subunit, FMN-binding domain superfamily | SUPERFAMILY | 142019 | Nqo1 FMN-binding domain-like | coord: 1378..1613
IPR019554 | Soluble ligand binding domain | PFAM | PF10531 | SLBB | coord: 1616..1665; e-value: 6.2E-7; score: 29.2
IPR001949 | NADH:ubiquinone oxidoreductase, 51kDa subunit, conserved site | PROSITE | PS00644 | COMPLEX1_51K_1 | coord: 1540..1555
IPR001949 | NADH:ubiquinone oxidoreductase, 51kDa subunit, conserved site | PROSITE | PS00645 | COMPLEX1_51K_2 | coord: 1717..1728
IPR000253 | Forkhead-associated (FHA) domain | PROSITE | PS50006 | FHA_DOMAIN | coord: 1067..1132; score: 7.6915
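
The alignment coordinates in the table above are protein positions written as `start..end`. A minimal Python sketch for turning a few of those rows into span lengths follows. It assumes both endpoints are inclusive (an assumption, not stated on this page), and the helpers `parse_coord` and `span_length` are hypothetical names introduced for illustration:

```python
import re
from typing import Tuple

def parse_coord(text: str) -> Tuple[int, int]:
    """Extract (start, end) from a string such as 'coord: 1381..1790'."""
    match = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"no coordinate range found in {text!r}")
    return int(match.group(1)), int(match.group(2))

def span_length(start: int, end: int) -> int:
    """Length of a range, assuming both endpoints are inclusive."""
    return end - start + 1

# A few rows copied from the InterPro table, keyed by source term.
rows = {
    "PTHR31949": "coord: 3..1124",
    "TIGR01959": "coord: 1381..1790",
    "PF01512": "coord: 1420..1589",
    "PF10589": "coord: 1706..1788",
}

lengths = {term: span_length(*parse_coord(coord)) for term, coord in rows.items()}
for term, length in lengths.items():
    print(f"{term}: {length} residues")
```

Listed this way, the PANTHER match covers the N-terminal region (3..1124), while the complex I 51 kDa subunit signatures all fall between positions 1378 and 1797.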

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Moc08g00890.1Moc08g00890.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0051211 | anisotropic cell growth
biological_process | GO:0022900 | electron transport chain
cellular_component | GO:0055028 | cortical microtubule
cellular_component | GO:0005743 | mitochondrial inner membrane
cellular_component | GO:0045271 | respiratory chain complex I
molecular_function | GO:0051539 | 4 iron, 4 sulfur cluster binding
molecular_function | GO:0010181 | FMN binding
molecular_function | GO:0046872 | metal ion binding
molecular_function | GO:0051287 | NAD binding
molecular_function | GO:0008137 | NADH dehydrogenase (ubiquinone) activity
molecular_function | GO:0005515 | protein binding