Moc03g21040 (gene) Bitter gourd (OHB3-1) v2

Overview
Name: Moc03g21040
Type: gene
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Actin-related protein 2/3 complex subunit 1B, putative isoform 1
Location: chr3: 14360952 .. 14370086 (-)
RNA-Seq Expression: Moc03g21040
Synteny: Moc03g21040
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATGGTCCTTTAGATTTCGAGTCAGAAGACCCGCTTCTCAGTTCCCCTGTCGCTCTCAAGAAGAGGTCTCAATATCTCAGAATCCTCTATTTCGTGTTTTTCCTTTCCAAACCTAAGCCCATATGGGGGCTTAGGACATATGTTTCAATTTTCTTCTTGCTCTACTTTCACTGGGGTTTGTCTGATTCGGTTTGTTCCAAGGATTGTTCCGTGAATATTTCTTGTAAATTGGAAGTTTCATATTATCTCTCTCTTATATTATTGTCTTGTTTCTCTACTGTTTTCCGTTCAAATCTGTGAACTGAAGCTTAAAAATCTGTAGCAGTTCGACAGTAGCATAAATTTTATATAAAGGGAGCTAAGAAGTAGCCATTGTTGTGGTTGTGCCTTCTTTGCAACTTTGATTTGTTAGTTTTAGTAAACGTGAAGTTCTACCTCTCTGAGTCCGAGTACAAATGCTTCTGGAACAAGTGGTTTGAGAGGCTGTTGACGTCTATTTTTTATACCAATCAGGAAAATGATTATCGGGTTAGATGATCTTCTGACTGATCACTACAAGGATAAATGCAAACTCGTTGAGAAAGAATCTAGACAGGCAAAGAAGAAAAAGAACTATAATTCGGATGATGATGATTTTGGCAAAGAGGCTGTGGTGTCCCAAGTTGTTGATGAATGCCAAAATAAGGTAAGAAATGCCTAGGAGCAGAATTTTTCTGGAGTTTTCTTAGCCTTCCTTCACTTATGATCTAAGAATGTTTTGTGTAGATGAGCCAACTAGGGGGTGAGGAAGATACATCCATATGGGGCTTAAAGGTTTTTGGAGAACAGGTTTGCAGTGTAACCTTACAGCTTCCGCAATGCGTTTGTCCTGAATGCCTTTATTCAAGAGGATTTCTCTGTGATTAACAAATTGGTTGCATTTCTGTGCATGTAAATTGCAGAAAACCCCACCAACTTTACAAAGTCCTGAACTCAAAAGTTGCCACCTTTTGCGAAATTTCCTAAATAATGAAGTTAACTCTTTGGTGAATCTCACCATGGAGAAAGGTTTGCTTTTCCTTCATGTTTGAATTCTCTGAATCTCCCTCATCATTAAGTAGTAATACTTTTATAGTTTTTCTTAATTGAGTTATGATATAACCTTTATTATTGATGATATTATGTAATGACTACTGAAGTTTCAAGTTGAAATTATTTTTACCCTAAACCTTATGAACTGTTGCTACATCAATGAAATTGTTTCTTTCACAAAAAAAAAAAACCTTATGAACTGTTGCATTGCTACATTTTTTTTAGTTTTTGCCTGGTTTTAAGTTTAATTGATTGACTTATGGCTACTCAATGGCAAAAATATGGCAATTTGAACAATAAATTTAAAGCTTGTTGTTTGCATATACTACATAATCATTAAAAAGAGCAATAAGAGAGATTAAAAAACGAGAATGAACAACACCATGCATTTTGAATTCTGTTTGTCTTTTCTCTTGAAATATAGGAAAATACTAATTCTCACTGAATCCAGTTGTCTGACATATGCCCCTATATTGTCTTTATCATAGAGAATAGGAAGGTGGGATGATGAAGATGCCAACCACTTGGTGAACTGGAAACTAGCTTCTTTAGCTCCCATTTGGGGGGGGGGGGGGGTGGGTTGTTGTTGTGGGGGAAGACCAAACTATCTATTTGGCACTCTATTGTTATTAAATAGATTATAGAGCTTGAATTGTTTGTTTGCGCAGAGTAGGAAGAAAAACGTGTATTAGTTGACTATTTCTGTATTCTGTCCTTACAACTTTCTGGAAGTGTAAGATCATATTTGTAAGAGATAGGTGATTTGGAGGTTCTGTCTTTAGCTAACTTTTACCCTACGTCCCTACCTTGTATGCTCTTTGCCAAGCTTAGCAGTCAGCTACCAAAGCCGTGTCTGATAGTTTCTTGCAGTGCTGAATTCCCATTTCTGGGAGGAAATTATAAGAGAATGGAAATTCAGGATTGTT
TGTTGCCTCTTTCTTATTTCTTCTTTAGATGGTATCATACTATTTTTATGGAATCAGAACTCTATATGTTGATGCCTCAGGCTCTGAGATACATTCACTTGTTTTATCTGAAAGCTCTTGCAACAAAGGATTCAAGGGCCTATATCGGTGTTCTCCTAAATTGGATGTGGAGGTTTAAAACTCCAATGAAGTTCCAATTCTTTGCGAGAGTTGTTTCTTAGGAGGCTGAATTCTATTAATAAATTTCAGAGAATGCACCCTTTCTTCTATCTCTATTATCATTTGTGCGTCCTTCGAGAGAAGGCAACTGTAAACTTGGACCATGTTTTTGTGCACTGCCCTCTCGTGAGGAAGTTGTGGGAGAATTTTTAGGAGATGCTTAGACTTTCTTGGGAGCCCTTCCTTTGGTTGAAAAAAATTCAGATGATATGAGGTCCTTTGTTTTTTGTCTAAAAAAAGATGATGTACAAGTCTCACCTTATGGGTGGTTTGGGACGTAGAACTTGAGGATATAGTGTGAGGCTTCAAATAAGGTCGGAGATAGAGTTATTGTTTTGCAGCAGTTATTCTTTTTAACACCCAACAGAGTTCTATAATTTATCAATAATTTTGTAATCCACCAGCTTATAAAGTTCTCTGTAATATTCTAGTCCCTTGGTGAGGTGGAACCCCTGTTCTTTTATCCTTATTTATACATTTTTGTAGCAATTCATCTATTTTTTCTTCCAGAAAAGGAGTAGGGAAAAGGAAAACATGAAGTCCACTTAAAAAAAAAAAAATAATGATAGCAGTGGTTCATTTATTTATGCATTTTAAAATGTTGATAAGTTAGTACCTCGAAGCTGATTTTGGTATCATGAATTTTCTAAAAATCTAAAACCTTTCGCAGACAAATTAAATAAGTTGTCAATATATTTATTTGATATGTTATTGCAGGTGATGCTTTCCTTGAAGGTTTATTGGTAAATGGCTGGCTGTCAACACTTGTTTCTTTGACAGGTCGTGTAGAAAAACCAATTGCCATATGGACCTTCAATTTAAGTGAGTACATCTATAATTCCAGTCTTGTTCATGGAACTCTTTACATTAGCCTTAATTCTTTTTTATGCGTTTATTTTCATATTTCAGTGTTATATTCATCAAGAGAAGAGCTAAGAACATCAGCTTTTGATTTCTGGAGGGATATCATGTTACCTAAACATGAGGTTTCTGTTTTTTCTTTGACCACATGTTTCTTTTTATTTCTTCAACGTTTATTAATTTTTCTTAAGGCTTATCTAGAATTCTTAACTAGGTTGAACAACAGCCTCTCCAAATTGATTGGTTTCCTAACCATGCTCAATTAGGAGAAGCTCTTGAGACTTATGGATTTAGATTTGAGTGCTCATTAAATCCTGGATTTGTCCACGCTGGTAAACTTCCTTGACATCAAACTACACATTCATGAGAAAATATATGCTATACTATGAAGGGTGATTTGTTCGCTGAAGGTTATTTGATTTTTAGATACGCAAACTCAAGGTTTCGCAAAAGTGTAAATCCTTTTAATAGTTATAGTTTAAATTCCCTTATATCCAACTAGAGAAGTTTTTTGTAACTTCATTGGATAGGCGCTTTTTTTTTTTTTTTTTTTTTTAATATATTTCACATTATCAATGAAATTTGATTCCTATCCCAAAAAAAAAAAAAAAAAAAACTCAAGGTTTCGCACAAGATTTAATTCTTCCTCTATAATATGAAATCATCATAAAGTGCTGTCTCTTTAAGCACGTAGCACGTGGCATGTGTTAAAAAATCTTTGGAAATTGGAATGATAAGGCCTTTTTGGGTTTCTGATGTATGACTTTCATTTTGCTGCATTGATATTGTGTTTTAATTTACATTATGAAAACATTTTGTTAGTATGATTTTTAATTGGAGATAAGAGATTGCCTTAGCTTTGAACTTTCAATTTCACCAGATTGGACTCTAAACTAGAATAAGTAGTACAATTAATACCT
CCTCATTCCTCAAAAGAATATAAGTTGTTTCTAAACTAACTAATAAATCACTCAAATGCTCAAAATTAAGCCATAAAAATACCTAAATCCAACCACACAAATATTCAAATCTTGCCTCAAAATCCTGCCAGGTTAAAAATGCTCAAACTCAATGCTGAAAATGTCCAAATCTAATTAAAAAGCACAAACACACCCTGAACAATCAATATCCACAAAAATAACCTCATCCAAGTACAATGATGCTCTTACATCACAGTTCTTTAAAATAAATAAAATTCAACTATTAGAGAGTATTAATTTCACTTAAGTTCTATGAATAAAAATTTTAAGACAAAATTTGGCTCACGATCTCAAAATGAACTTGTTCCTTCTTTGTATCAACCAATATTCATAATGGGTTTATTGGTTTAAATCATGAATGTTAATCCATTTGATTCACCACTACTATGCATTGACTTAGATGCAGCACCACATGGATGACAGCATCCAAATTTCTGAGTTGGGCTACAGAGGGGTGGATAGTAAATTATAGAGATTGATTATCCTTTTCAGTGATATCAGAGTAACTATGATCTGAAGTTTTTATAGAGTTCCTAATTTCTCCTACTCAATCTGTATCCAGTTTTTCCGACCACCAGCATACTCCTTGCCCTGTGTAAATCCTTGTTACTCATCATGACCTCAGCGTCCACGCCTCAACTGCATAACATCCTTGCATTCCTCATTTACCCTTTCCCTATTATTGTGTGAGTACATTTATGTAAGGTATCCCCTTACTTTTATTGATATTCAAGAACAAGAACAATATAATCAACCCAACCAATTTGAGAGGGCCCAAGTCTCCAAGAAAGCACTATGCTATTCTTCCTTACTAAGAAATTACCTACCTCCCCCAAGGGAGCATTCTCCCTTTTAACAAAACTACTCCATACGCCCAAATTGGGAGACGGAGTCCATTTATAAGGAGCTAATGCCCCATTACCTACCTCCCCCAAGGGAACATTCTCCCCTTTAACAAAACTATTCCATACGCCCAAATGGGCCCCACTCTTCTAACTAACTGGAAGTACCCCTTCTACCCCTCCTAGTATATGTCTTCACCACTGGGGTCTAACAACTTATGCTCGAGGGCCTATTGGATACCTTGTCCCAATGAAGTAACCTTATCCTCAAGGTGAGGTTTGGAAACTGAGTTTGTAAATCCTTAATAAGCTTCCATCATTTGAAGGCTAGCTGGGTTCGACTTATTTGATCAATTAGTCTACCTTAAAATTTTTTTATGATTGATACCTGATTCTAGTTAAAAGATGAAAAAAACAAGGGCTTCTTTCAAGACATTGCTCAAAAATAAACAAGGGCTTCTTTGGAACTTAGATATTAAATTAAAGAAACAAGGGCTTCTTTTAAGACATTGTTAAAAATTAAATACAATAAAGCTAGTTTCTTCTCATCCAAGAAAAGGGAAAAAAAAGAAGGCTTAAAATCATATGCAAGGTGAAGTGCCAGGGAGGGGTCATACTCAAGGGGTGCTTTTGTTCAATGCTTGATAATAAAAGTTATTTTTTATTGAAAATTGACTCTTGCGAAAATTGATGTTAAATTTTCGGAGCTCAAGAAACTCCATGAATTAATGTTTTGCTTATGAGTTCTGACCGTATGCACTCCAATGTTATGTGCTTGGAAAATGAGAAATTACTACTTTCTGAAATTAATTAGGGACTATGCTTATGAGAACAAGTCTTTCCATTCATTTGTTATTTTATGTATTTTGTTTTTTATTCCGCTTGTGTATGAGTTACTATATTTTAGAATTGTCATATAAAGTCTGACTTCAATAGCCATGTAGAAACCCCCCAAGCCCTTGCCATTTGAGTTAGTTGCTTTACTCTTTCTTACACAAGTTATGGAATATTGTTAGGTTCTGGACGTGGAGGGCCACCTCAGAATATAAGGGCTTGGATCAAATTTATTACTATTTGTTGTCAAACAAAGTAAGAAGT
CCTTTTTTCTTTTCTTTTGAAAAATGTTTGTGAGTTTAACTCTTTTGCCTTTGGCATTCGCTAATGTTTTCAATCTTGAAACTTAATGTGGCTACTGTAAAAGTAATTTTCTTCTCAACATATGATTTTACTTGCACAATACCAATAAAAAAACAAGTGTGGCTTCATGTTTTGTTTTTTTTTTTTTAAATCTTTGCAGGATTAAAAAGCATATATTTACATTCTCCGAAGTTGAGCAACTAGCTGAAGCCATTATTTGTTTATTTCTAGACCGCCAGTTTCAAGGCTTATCCATGCTCTTATGTGAATGCTTACAATCGCTTATCCATTACTTCACAGATGAAGAATGGAAGGCATGTTGTGAGAAAATAGCAAAATCTCTTGTTTGCAGGTATTGGACTCTCCCTAGTGATGTAAAGTGTATACATTATATAAATTAAAGGTGTAAGTAATTCGGTTCAGTATTTTTATTTTTATTTTTTTTGTTAAAACTGACATTGATTTTAAATGACCAAACTAATAAAATCTATCTAATCCAAACTGACTGTTTTCTAATTTTATACTTCAGAAAACTGAACTAACTGGCAAGTATAATTAAAAAAACTGAACTGAGTGACTTGGTTCTTTTAGTTTCGTCGATTTTATTTTATTTAATTAATTAATTAATTTTTTTTCTTAATTTTGGCTTGCACCCAACACCACTGATACAAATGTGATTAAACATATTTACAATCAATCATATCATGACTACTTTTTTTTTTTCCTGCCTCTGCTCTGCTATTGCAGGATTCCTAGGGATCTAAATTGCTTGCGAGCTGTGGAATGCATTTCTGGAGCTGATGCCCGTAGTAAATATCTGAGGAGTGCAGTTGCCTATCAAATTCTTCTTATGTGCTTTGAAACTGAGGTGGTCCTTTCCTTTTTTTTCGTTGGCTAAATATTCAGAGTTTTCTTTTGAAGCATAGCTTTGTTTACTGCTCCATAACACTCATCTTTCCTGCAAGATTATGTGTATCACATGCATATCATTTTCATATATGCTTGTTGAAATTTTTTAAGACAGTAGATTATTATCATATCCCATTTACAATATTCTGGTATGTTGTTTGAGGTGTCCTTCTATTCTGGAGAGGAATCATAGATTGTTCGGGGGATGGAGAAATCTGTAGATTATGTTTGGGATATCGTTAGATTTAACACTTCTTTATGGGGCTCGGTTTCTAAGCCCATTTGTAATTATTCGCTAGGCCTTATTCTTTTGGATTGGAGTCCTTTTCTACAGGGGATCTCCTCTTTCAGGGCTGTTTTTTTTTATGCCCTTTTGTAGGCTTTTTTGTAGCTTTTTTTTTGGATGCCCTTTTGTAGGCTTTTTGTAGCTCTGGTATCTTTTTCATTATCCTCATAAAAAAAACCCCTTCGAAGGGTTTTTCGTGTAGCTCCTATTTCCATATCTTGAACTCCCCGTCTCCTTCTTCCCCTTCGACCCCTTGATTTACCTCTCTTTGAAAGATTAAAATTCCGAAGAAGATCAAGTTCTTTGGGTGGCAAGTCTTACTTGGGAAAGTCAATACCATGGATTATATCCAGAGGATGTCTTCCTTTTGTCTGGGCCCGCTTTGGTGTGTTCTCTGTAGGAGTGCCTCGGTAGATCTCGACCATTTGTTGTGGACTTGTCAGTTCCCGCAGGATTTATGGTTTCATTTCTTTAGGTGTTTTGGGATGACCTGGGTTTGTACAAGGGATTGTAGGGCGATGATGAAGGATTTGCTGTTATTCCCACCTTTTCGTGACAGGCGTCGTTTTTTGTGGCAGACTTGTTTTCTAGCTATTTTATGGGGTATTTTCTAGCTATTTTATGGGGTATTTGGTTGGATAGGAATAATAGACTGTTTAGGGGATGGAGAAATCTGTGGACTATGTGTGGGAACTCATTAGATTTAATTCTTCGCTTTGGGGCTCGGTTTATAAGGCCTTTTGTAATTATCCGTTAGGCATCATTC
TTTTGGATTGGAGCCATTTTCTTTAGAAGATTCCTCTTTCAAGGTTGTCTTTTTGTAGGCCCTTTTTGTTCGTCTTGTATCTCTTTTCATTTTTCTCAATGAAAGCGTAGTTTCTTACGAAAAAAGAACAATAGCATGACTATTTGTATCACAGTCAAGGTTAGGTTGAAAGTGCATATGTTAATCTTGGAACAAATGTCAATTAGCCCGCTATTTATTTTTTGGTTCATACTTTTATTTTGTCTCAAATGTGGTTTAAATTTTATTTTTGTCTTTAAACTTTTTGTCTCTCAACTTCAAAGGTTATATTTTGGTTCCTTCACACATTTATTTTGTTCGAGTCTCTCAACTTTTAAATGTTACGTTTTAGTTTCTCAACTTTTATACTTCTGTTTTGTTTTAGCCTCACATCATGTTATGTTTTAATTCCTCAACTTTCTAATGATATGTTTGGCTCTAAACATTGCATAAAATTTTTTGTTAATCCTTACCTCAAATTTTCATCAATCATTTAACAAGAATTTAAGTTTGTTCCAAAACCTGATATTTAACCTACTATATATGGTTTTAAATATTTATCTATGTATACATTTATTGATCAGACAAAGAAACAAAAGATGTATCTATGATTCTACATTAAGTTTCTTTGAAATAATTTATAAAAGGCTTAAAGTTCAAGAACTGAATGAAATAATTATGAAAGTTGAGAGAATCAATATTATGTAAATCAACACAACAACCTTCTAACTTCACATTATCTTTCAGGCTACTAATGAGGAAGGGGTTTTGAGATTACTCACCTCAATCACAGTTAAAGACAAAAGCTGCGACCTATTTAAGCTGTATATTTACTTGGTGTTGACGGAGAACTGGCTTGTCGGTAGTCGAACGTTAGAAGGCAAGCCGCTAATTTGTGAAATGTGGGGTCTGTTTCTCAGAAACTGCTCCTGCCAAATCACCAGTACAGATTTGAGGTCCTACGCATCAAAGGTGTGAAAAACTCTTCACAATCTTTAACTTAGATTGATAATCCATCACTTGAGGGTGATGTTTCTCAAATCAAAACTCTCTCTCGTCACACTTTCAGGTTCGTAACAAAGCTTCATATATCCTGCAAAGCTCTTTTAACGAATAG

mRNA sequence

ATGGATGGTCCTTTAGATTTCGAGTCAGAAGACCCGCTTCTCAGTTCCCCTGTCGCTCTCAAGAAGAGGAAAATGATTATCGGGTTAGATGATCTTCTGACTGATCACTACAAGGATAAATGCAAACTCGTTGAGAAAGAATCTAGACAGGCAAAGAAGAAAAAGAACTATAATTCGGATGATGATGATTTTGGCAAAGAGGCTGTGGTGTCCCAAGTTGTTGATGAATGCCAAAATAAGATGAGCCAACTAGGGGGTGAGGAAGATACATCCATATGGGGCTTAAAGGTTTTTGGAGAACAGAAAACCCCACCAACTTTACAAAGTCCTGAACTCAAAAGTTGCCACCTTTTGCGAAATTTCCTAAATAATGAAGTTAACTCTTTGGTGAATCTCACCATGGAGAAAGGTGATGCTTTCCTTGAAGGTTTATTGGTAAATGGCTGGCTGTCAACACTTGTTTCTTTGACAGGTCGTGTAGAAAAACCAATTGCCATATGGACCTTCAATTTAATGTTATATTCATCAAGAGAAGAGCTAAGAACATCAGCTTTTGATTTCTGGAGGGATATCATGTTACCTAAACATGAGGTTGAACAACAGCCTCTCCAAATTGATTGGTTTCCTAACCATGCTCAATTAGGAGAAGCTCTTGAGACTTATGGATTTAGATTTGAGTGCTCATTAAATCCTGGATTTGTCCACGCTGGTTCTGGACGTGGAGGGCCACCTCAGAATATAAGGGCTTGGATCAAATTTATTACTATTTGTTGTCAAACAAAGATTAAAAAGCATATATTTACATTCTCCGAAGTTGAGCAACTAGCTGAAGCCATTATTTGTTTATTTCTAGACCGCCAGTTTCAAGGCTTATCCATGCTCTTATGTGAATGCTTACAATCGCTTATCCATTACTTCACAGATGAAGAATGGAAGGCATGTTGTGAGAAAATAGCAAAATCTCTTGTTTGCAGGATTCCTAGGGATCTAAATTGCTTGCGAGCTGTGGAATGCATTTCTGGAGCTGATGCCCGTAGTAAATATCTGAGGAGTGCAGTTGCCTATCAAATTCTTCTTATGTGCTTTGAAACTGAGGCTACTAATGAGGAAGGGGTTTTGAGATTACTCACCTCAATCACAGTTAAAGACAAAAGCTGCGACCTATTTAAGCTGTATATTTACTTGGTGTTGACGGAGAACTGGCTTGTCGGTAGTCGAACGTTAGAAGGCAAGCCGCTAATTTGTGAAATGTGGGGTCTGTTTCTCAGAAACTGCTCCTGCCAAATCACCAGTACAGATTTGAGGTCCTACGCATCAAAGGTTCGTAACAAAGCTTCATATATCCTGCAAAGCTCTTTTAACGAATAG

Coding sequence (CDS)

ATGGATGGTCCTTTAGATTTCGAGTCAGAAGACCCGCTTCTCAGTTCCCCTGTCGCTCTCAAGAAGAGGAAAATGATTATCGGGTTAGATGATCTTCTGACTGATCACTACAAGGATAAATGCAAACTCGTTGAGAAAGAATCTAGACAGGCAAAGAAGAAAAAGAACTATAATTCGGATGATGATGATTTTGGCAAAGAGGCTGTGGTGTCCCAAGTTGTTGATGAATGCCAAAATAAGATGAGCCAACTAGGGGGTGAGGAAGATACATCCATATGGGGCTTAAAGGTTTTTGGAGAACAGAAAACCCCACCAACTTTACAAAGTCCTGAACTCAAAAGTTGCCACCTTTTGCGAAATTTCCTAAATAATGAAGTTAACTCTTTGGTGAATCTCACCATGGAGAAAGGTGATGCTTTCCTTGAAGGTTTATTGGTAAATGGCTGGCTGTCAACACTTGTTTCTTTGACAGGTCGTGTAGAAAAACCAATTGCCATATGGACCTTCAATTTAATGTTATATTCATCAAGAGAAGAGCTAAGAACATCAGCTTTTGATTTCTGGAGGGATATCATGTTACCTAAACATGAGGTTGAACAACAGCCTCTCCAAATTGATTGGTTTCCTAACCATGCTCAATTAGGAGAAGCTCTTGAGACTTATGGATTTAGATTTGAGTGCTCATTAAATCCTGGATTTGTCCACGCTGGTTCTGGACGTGGAGGGCCACCTCAGAATATAAGGGCTTGGATCAAATTTATTACTATTTGTTGTCAAACAAAGATTAAAAAGCATATATTTACATTCTCCGAAGTTGAGCAACTAGCTGAAGCCATTATTTGTTTATTTCTAGACCGCCAGTTTCAAGGCTTATCCATGCTCTTATGTGAATGCTTACAATCGCTTATCCATTACTTCACAGATGAAGAATGGAAGGCATGTTGTGAGAAAATAGCAAAATCTCTTGTTTGCAGGATTCCTAGGGATCTAAATTGCTTGCGAGCTGTGGAATGCATTTCTGGAGCTGATGCCCGTAGTAAATATCTGAGGAGTGCAGTTGCCTATCAAATTCTTCTTATGTGCTTTGAAACTGAGGCTACTAATGAGGAAGGGGTTTTGAGATTACTCACCTCAATCACAGTTAAAGACAAAAGCTGCGACCTATTTAAGCTGTATATTTACTTGGTGTTGACGGAGAACTGGCTTGTCGGTAGTCGAACGTTAGAAGGCAAGCCGCTAATTTGTGAAATGTGGGGTCTGTTTCTCAGAAACTGCTCCTGCCAAATCACCAGTACAGATTTGAGGTCCTACGCATCAAAGGTTCGTAACAAAGCTTCATATATCCTGCAAAGCTCTTTTAACGAATAG

Protein sequence

MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSDDDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRNFLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREELRTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGRGGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQSLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLMCFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGLFLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE
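
As a quick consistency check between the CDS and the protein sequence above, the following minimal Python sketch translates the first 30 nucleotides of the CDS with the standard genetic code. The function name `translate` and the compact codon-table construction are illustrative assumptions and are not part of the database record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
# This helper is illustrative and not part of the database record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the CDS listed above.
cds_prefix = "ATGGATGGTCCTTTAGATTTCGAGTCAGAA"
print(translate(cds_prefix))  # MDGPLDFESE
```

The output matches the first ten residues of the protein sequence; passing the full CDS string to `translate` should give the complete polypeptide in the same way.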
Homology
BLAST of Moc03g21040 vs. NCBI nr
Match: XP_022152868.1 (uncharacterized protein LOC111020493 [Momordica charantia])

HSP 1 Score: 927.5 bits (2396), Expect = 4.3e-266
Identity = 455/455 (100.00%), Positives = 455/455 (100.00%), Query Frame = 0

Query: 1   MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 60
           MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD
Sbjct: 1   MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 60

Query: 61  DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN 120
           DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN
Sbjct: 61  DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN 120

Query: 121 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL 180
           FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL
Sbjct: 121 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL 180

Query: 181 RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR 240
           RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR
Sbjct: 181 RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR 240

Query: 241 GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ 300
           GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ
Sbjct: 241 GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ 300

Query: 301 SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM 360
           SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM
Sbjct: 301 SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM 360

Query: 361 CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL 420
           CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL
Sbjct: 361 CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL 420

Query: 421 FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 456
           FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE
Sbjct: 421 FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 455

BLAST of Moc03g21040 vs. NCBI nr
Match: XP_038895435.1 (uncharacterized protein LOC120083668 isoform X2 [Benincasa hispida])

HSP 1 Score: 817.0 bits (2109), Expect = 8.1e-233
Identity = 399/455 (87.69%), Positives = 422/455 (92.75%), Query Frame = 0

Query: 1   MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 60
           MDG LDFESEDPLLSSPV LKKRK IIGLDDLLTDHYKDKCKLVEKES+QAKK K Y SD
Sbjct: 3   MDGTLDFESEDPLLSSPVTLKKRKKIIGLDDLLTDHYKDKCKLVEKESKQAKKIKKYESD 62

Query: 61  DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN 120
           DDD GKEAVVSQVVDECQNKM+QLGGEEDTSIWGL VFGEQK  P LQSPEL+SC+ L+ 
Sbjct: 63  DDDLGKEAVVSQVVDECQNKMNQLGGEEDTSIWGLNVFGEQKPSPALQSPELESCNFLQT 122

Query: 121 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL 180
           FLNNEVNSLVNLT+E GDAFLEGLLVNGWLSTLV LTGR EK +AIWTFNLMLYSSRE+L
Sbjct: 123 FLNNEVNSLVNLTVETGDAFLEGLLVNGWLSTLVYLTGRAEKSLAIWTFNLMLYSSREDL 182

Query: 181 RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR 240
           RTSA DFWRDIML  +EVEQQ LQIDWFPN+AQLGEALETYG+RF+CSLNPGF+H GS R
Sbjct: 183 RTSAGDFWRDIMLTTNEVEQQHLQIDWFPNYAQLGEALETYGYRFQCSLNPGFIHTGSRR 242

Query: 241 GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ 300
           GGPPQNIRAWIKFITICCQTKIKK+IFT SEVEQLAEAIICLFLDRQFQG+++LLCECLQ
Sbjct: 243 GGPPQNIRAWIKFITICCQTKIKKNIFTSSEVEQLAEAIICLFLDRQFQGITVLLCECLQ 302

Query: 301 SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM 360
           SLIHYFTDEEW ACCEKIAKSLVCRIP DLNCLRAVECISG D RSKYLRS +AYQILL+
Sbjct: 303 SLIHYFTDEEWNACCEKIAKSLVCRIPMDLNCLRAVECISGVDPRSKYLRSTIAYQILLI 362

Query: 361 CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL 420
           CFE EA+NEE VLR+LTSITVKDKSCDLFKLYIYLVLTENWLVGS+TLEGKPLICEMWGL
Sbjct: 363 CFENEASNEEEVLRVLTSITVKDKSCDLFKLYIYLVLTENWLVGSQTLEGKPLICEMWGL 422

Query: 421 FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 456
           FLRNCSCQI STDLRSYASKVRNKASYILQS+F E
Sbjct: 423 FLRNCSCQIASTDLRSYASKVRNKASYILQSAFEE 457
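
The identity and positive-match percentages in the HSP summary for this hit can be reproduced from the raw counts. A minimal Python sketch, assuming each percentage is simply the match count divided by the alignment length and rounded to two decimals; `percent` is a hypothetical helper name, not a BLAST function.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 decimals."""
    return round(100.0 * matches / alignment_length, 2)

# Counts taken from the XP_038895435.1 HSP summary above.
print(percent(399, 455))  # 87.69 (identical residues)
print(percent(422, 455))  # 92.75 (identical or conservatively substituted residues)
```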

BLAST of Moc03g21040 vs. NCBI nr
Match: XP_011653682.1 (uncharacterized protein LOC101216339 isoform X1 [Cucumis sativus] >KGN64767.1 hypothetical protein Csa_013682 [Cucumis sativus])

HSP 1 Score: 812.8 bits (2098), Expect = 1.5e-231
Identity = 395/455 (86.81%), Positives = 423/455 (92.97%), Query Frame = 0

Query: 1   MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 60
           MD PLDFESEDPLLSSPVALKKRK IIGLDDLLTDHYKDKCKLVEKES+ AKK+KNY+SD
Sbjct: 3   MDAPLDFESEDPLLSSPVALKKRKKIIGLDDLLTDHYKDKCKLVEKESKLAKKRKNYDSD 62

Query: 61  DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN 120
           DDDFGKEAVVSQVVDECQNKM+QLGGEEDTSIWGL VFGEQK PP LQ+PEL+SC  L+ 
Sbjct: 63  DDDFGKEAVVSQVVDECQNKMNQLGGEEDTSIWGLNVFGEQKPPPALQTPELESCQFLQT 122

Query: 121 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL 180
           FLNNEVNSLVNLT+EKGD FLEGLLVNGWLSTLVSLTG VEK +AIWTFNLMLYSSRE L
Sbjct: 123 FLNNEVNSLVNLTVEKGDVFLEGLLVNGWLSTLVSLTGHVEKSLAIWTFNLMLYSSREGL 182

Query: 181 RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR 240
           RTSA DFW+DIML  +EVEQQ LQ+DWFP++AQLGEAL+TYG+RFECSLNPG +H GSGR
Sbjct: 183 RTSACDFWKDIMLTTNEVEQQHLQVDWFPSYAQLGEALDTYGYRFECSLNPGLIHTGSGR 242

Query: 241 GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ 300
           GGPPQNIRAWIKFITICCQTK+KK+IFT SEV +LAEAIICLFLDRQFQG+++LLCECLQ
Sbjct: 243 GGPPQNIRAWIKFITICCQTKVKKNIFTSSEVGRLAEAIICLFLDRQFQGITVLLCECLQ 302

Query: 301 SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM 360
           SLIHYFTDE+WKACCEKIAKSLVCRIP DLNCLRAVECISG D RSKYLRS VAYQILL+
Sbjct: 303 SLIHYFTDEDWKACCEKIAKSLVCRIPMDLNCLRAVECISGVDPRSKYLRSTVAYQILLI 362

Query: 361 CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL 420
           CF+ EATNEE VLR+LTSITVKDKSCDLFKLYIYLVLTENWLVGSR  EGKPL  EMWGL
Sbjct: 363 CFKNEATNEEEVLRVLTSITVKDKSCDLFKLYIYLVLTENWLVGSRMCEGKPLTREMWGL 422

Query: 421 FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 456
           FLRNCSCQI STDLRSYASKVRNKASYILQSSF+E
Sbjct: 423 FLRNCSCQIASTDLRSYASKVRNKASYILQSSFDE 457

BLAST of Moc03g21040 vs. NCBI nr
Match: XP_038895434.1 (uncharacterized protein LOC120083668 isoform X1 [Benincasa hispida])

HSP 1 Score: 808.9 bits (2088), Expect = 2.2e-230
Identity = 398/461 (86.33%), Positives = 422/461 (91.54%), Query Frame = 0

Query: 1   MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 60
           MDG LDFESEDPLLSSPV LKKRK IIGLDDLLTDHYKDKCKLVEKES+QAKK K Y SD
Sbjct: 3   MDGTLDFESEDPLLSSPVTLKKRKKIIGLDDLLTDHYKDKCKLVEKESKQAKKIKKYESD 62

Query: 61  DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN 120
           DDD GKEAVVSQVVDECQNKM+QLGGEEDTSIWGL VFGEQK  P LQSPEL+SC+ L+ 
Sbjct: 63  DDDLGKEAVVSQVVDECQNKMNQLGGEEDTSIWGLNVFGEQKPSPALQSPELESCNFLQT 122

Query: 121 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNL------MLY 180
           FLNNEVNSLVNLT+E GDAFLEGLLVNGWLSTLV LTGR EK +AIWTFNL      +LY
Sbjct: 123 FLNNEVNSLVNLTVETGDAFLEGLLVNGWLSTLVYLTGRAEKSLAIWTFNLIYFHIPVLY 182

Query: 181 SSREELRTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFV 240
           SSRE+LRTSA DFWRDIML  +EVEQQ LQIDWFPN+AQLGEALETYG+RF+CSLNPGF+
Sbjct: 183 SSREDLRTSAGDFWRDIMLTTNEVEQQHLQIDWFPNYAQLGEALETYGYRFQCSLNPGFI 242

Query: 241 HAGSGRGGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSML 300
           H GS RGGPPQNIRAWIKFITICCQTKIKK+IFT SEVEQLAEAIICLFLDRQFQG+++L
Sbjct: 243 HTGSRRGGPPQNIRAWIKFITICCQTKIKKNIFTSSEVEQLAEAIICLFLDRQFQGITVL 302

Query: 301 LCECLQSLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVA 360
           LCECLQSLIHYFTDEEW ACCEKIAKSLVCRIP DLNCLRAVECISG D RSKYLRS +A
Sbjct: 303 LCECLQSLIHYFTDEEWNACCEKIAKSLVCRIPMDLNCLRAVECISGVDPRSKYLRSTIA 362

Query: 361 YQILLMCFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLI 420
           YQILL+CFE EA+NEE VLR+LTSITVKDKSCDLFKLYIYLVLTENWLVGS+TLEGKPLI
Sbjct: 363 YQILLICFENEASNEEEVLRVLTSITVKDKSCDLFKLYIYLVLTENWLVGSQTLEGKPLI 422

Query: 421 CEMWGLFLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 456
           CEMWGLFLRNCSCQI STDLRSYASKVRNKASYILQS+F E
Sbjct: 423 CEMWGLFLRNCSCQIASTDLRSYASKVRNKASYILQSAFEE 463

BLAST of Moc03g21040 vs. NCBI nr
Match: XP_023520957.1 (uncharacterized protein LOC111784515 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023520958.1 uncharacterized protein LOC111784515 isoform X2 [Cucurbita pepo subsp. pepo] >XP_023520965.1 uncharacterized protein LOC111784527 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 805.4 bits (2079), Expect = 2.4e-229
Identity = 388/455 (85.27%), Positives = 424/455 (93.19%), Query Frame = 0

Query: 1   MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 60
           MDGPLDFESEDPLLSSPV+LKKRK IIGLDDLLTDHYKDKCKLVEKES+QAKKKK Y+SD
Sbjct: 3   MDGPLDFESEDPLLSSPVSLKKRKKIIGLDDLLTDHYKDKCKLVEKESKQAKKKKKYDSD 62

Query: 61  DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN 120
           D+DFGKEAVVSQVVDECQNKM+QLGGEEDTSIWGL VFGEQK PP LQSPELK+C+LL+ 
Sbjct: 63  DEDFGKEAVVSQVVDECQNKMNQLGGEEDTSIWGLNVFGEQKPPPALQSPELKNCYLLQT 122

Query: 121 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL 180
           FLNNEVNSLVNLT+EKG+AFLEGLLVNGWLSTLVS+T RVEKP+AIWTFNLMLYSSREEL
Sbjct: 123 FLNNEVNSLVNLTVEKGEAFLEGLLVNGWLSTLVSVTNRVEKPLAIWTFNLMLYSSREEL 182

Query: 181 RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR 240
            TSA DFWRDIML  +EV QQ L+I+WFP +AQLGEALETYG+RFECSL PGFVH GSGR
Sbjct: 183 STSACDFWRDIMLTTNEVNQQHLRIEWFPTYAQLGEALETYGYRFECSLKPGFVHNGSGR 242

Query: 241 GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ 300
           GGPPQNIRAWIKF+T+CCQTK++KHIFT SEVE+LAEA++CLFLDRQF+G+++LLCECLQ
Sbjct: 243 GGPPQNIRAWIKFVTVCCQTKMRKHIFTSSEVERLAEAVLCLFLDRQFRGVTVLLCECLQ 302

Query: 301 SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM 360
           SL+ YFTDE+WKACC+ IAK+LV RIP DLNCLRAVECISG D RSKYLRS VAYQILL+
Sbjct: 303 SLVRYFTDEDWKACCDNIAKTLVSRIPMDLNCLRAVECISGVDQRSKYLRSTVAYQILLI 362

Query: 361 CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL 420
            FE EATNEE VLR+LTS+TVKDKSCDLFKLYIYLVLTENWLVGSR L+GKPLICEMWGL
Sbjct: 363 FFENEATNEEEVLRVLTSMTVKDKSCDLFKLYIYLVLTENWLVGSRMLDGKPLICEMWGL 422

Query: 421 FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 456
           FLR CSCQI STDLRSYASKVRNKASYILQSSF E
Sbjct: 423 FLRKCSCQIASTDLRSYASKVRNKASYILQSSFEE 457

BLAST of Moc03g21040 vs. ExPASy TrEMBL
Match: A0A6J1DG18 (uncharacterized protein LOC111020493 OS=Momordica charantia OX=3673 GN=LOC111020493 PE=4 SV=1)

HSP 1 Score: 927.5 bits (2396), Expect = 2.1e-266
Identity = 455/455 (100.00%), Positives = 455/455 (100.00%), Query Frame = 0

Query: 1   MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 60
           MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD
Sbjct: 1   MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 60

Query: 61  DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN 120
           DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN
Sbjct: 61  DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN 120

Query: 121 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL 180
           FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL
Sbjct: 121 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL 180

Query: 181 RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR 240
           RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR
Sbjct: 181 RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR 240

Query: 241 GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ 300
           GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ
Sbjct: 241 GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ 300

Query: 301 SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM 360
           SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM
Sbjct: 301 SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM 360

Query: 361 CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL 420
           CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL
Sbjct: 361 CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL 420

Query: 421 FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 456
           FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE
Sbjct: 421 FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 455

BLAST of Moc03g21040 vs. ExPASy TrEMBL
Match: A0A0A0LVC0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G095530 PE=4 SV=1)

HSP 1 Score: 812.8 bits (2098), Expect = 7.4e-232
Identity = 395/455 (86.81%), Positives = 423/455 (92.97%), Query Frame = 0

Query: 1   MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 60
           MD PLDFESEDPLLSSPVALKKRK IIGLDDLLTDHYKDKCKLVEKES+ AKK+KNY+SD
Sbjct: 3   MDAPLDFESEDPLLSSPVALKKRKKIIGLDDLLTDHYKDKCKLVEKESKLAKKRKNYDSD 62

Query: 61  DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN 120
           DDDFGKEAVVSQVVDECQNKM+QLGGEEDTSIWGL VFGEQK PP LQ+PEL+SC  L+ 
Sbjct: 63  DDDFGKEAVVSQVVDECQNKMNQLGGEEDTSIWGLNVFGEQKPPPALQTPELESCQFLQT 122

Query: 121 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL 180
           FLNNEVNSLVNLT+EKGD FLEGLLVNGWLSTLVSLTG VEK +AIWTFNLMLYSSRE L
Sbjct: 123 FLNNEVNSLVNLTVEKGDVFLEGLLVNGWLSTLVSLTGHVEKSLAIWTFNLMLYSSREGL 182

Query: 181 RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR 240
           RTSA DFW+DIML  +EVEQQ LQ+DWFP++AQLGEAL+TYG+RFECSLNPG +H GSGR
Sbjct: 183 RTSACDFWKDIMLTTNEVEQQHLQVDWFPSYAQLGEALDTYGYRFECSLNPGLIHTGSGR 242

Query: 241 GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ 300
           GGPPQNIRAWIKFITICCQTK+KK+IFT SEV +LAEAIICLFLDRQFQG+++LLCECLQ
Sbjct: 243 GGPPQNIRAWIKFITICCQTKVKKNIFTSSEVGRLAEAIICLFLDRQFQGITVLLCECLQ 302

Query: 301 SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM 360
           SLIHYFTDE+WKACCEKIAKSLVCRIP DLNCLRAVECISG D RSKYLRS VAYQILL+
Sbjct: 303 SLIHYFTDEDWKACCEKIAKSLVCRIPMDLNCLRAVECISGVDPRSKYLRSTVAYQILLI 362

Query: 361 CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL 420
           CF+ EATNEE VLR+LTSITVKDKSCDLFKLYIYLVLTENWLVGSR  EGKPL  EMWGL
Sbjct: 363 CFKNEATNEEEVLRVLTSITVKDKSCDLFKLYIYLVLTENWLVGSRMCEGKPLTREMWGL 422

Query: 421 FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 456
           FLRNCSCQI STDLRSYASKVRNKASYILQSSF+E
Sbjct: 423 FLRNCSCQIASTDLRSYASKVRNKASYILQSSFDE 457

BLAST of Moc03g21040 vs. ExPASy TrEMBL
Match: A0A5D3BL50 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold2042G00060 PE=4 SV=1)

HSP 1 Score: 800.0 bits (2065), Expect = 5.0e-228
Identity = 392/455 (86.15%), Positives = 419/455 (92.09%), Query Frame = 0

Query: 1   MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 60
           MD PLDFESEDPLLSSPVALKKRK IIGLDDLLTDHYKDKCKLVEKE++ AKKKK Y+SD
Sbjct: 3   MDAPLDFESEDPLLSSPVALKKRKKIIGLDDLLTDHYKDKCKLVEKEAKLAKKKKKYDSD 62

Query: 61  DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN 120
           DDDFGKEAVV+QVVD+CQNKM+QLGGEEDTSIWGL VFGEQK PP L+SPEL+SC  L+ 
Sbjct: 63  DDDFGKEAVVTQVVDQCQNKMNQLGGEEDTSIWGLVVFGEQKPPPALESPELESCQFLQT 122

Query: 121 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL 180
           FLN+EVNSLVNLT+EKGDAFLEGLLVNGWLSTLVSLTGR EK +AIWTFNLMLYSSRE+L
Sbjct: 123 FLNSEVNSLVNLTVEKGDAFLEGLLVNGWLSTLVSLTGRAEKSLAIWTFNLMLYSSREDL 182

Query: 181 RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR 240
           RTSA DFW+DIML  +EV Q  LQIDWFPN+AQLGEALETYG+RFECSLNPG +H GSGR
Sbjct: 183 RTSACDFWKDIMLTTNEVGQH-LQIDWFPNYAQLGEALETYGYRFECSLNPGLIHTGSGR 242

Query: 241 GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ 300
           GGPPQNIRAWIKFITICCQTK KK+IFT SEVE+LAEAIICLFLDRQFQG++ LLCECLQ
Sbjct: 243 GGPPQNIRAWIKFITICCQTKNKKNIFTSSEVERLAEAIICLFLDRQFQGITELLCECLQ 302

Query: 301 SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM 360
           SLIHYFTDEEW ACCEKIAKSLVCRIP DLNCLRAVECISG D RSKYLRS VAYQILL 
Sbjct: 303 SLIHYFTDEEWNACCEKIAKSLVCRIPMDLNCLRAVECISGVDPRSKYLRSTVAYQILLF 362

Query: 361 CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL 420
           CF+ EATNEE VLR++TSITVKDK+CDLFKLYIYLVLTENWLVG RT EGKPL  EMWGL
Sbjct: 363 CFKKEATNEEEVLRVVTSITVKDKNCDLFKLYIYLVLTENWLVGGRTCEGKPLTREMWGL 422

Query: 421 FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 456
           FLRNCSCQI STDLRSYASKVRNKASYILQSSF+E
Sbjct: 423 FLRNCSCQIASTDLRSYASKVRNKASYILQSSFDE 456

BLAST of Moc03g21040 vs. ExPASy TrEMBL
Match: A0A1S3CJ26 (uncharacterized protein LOC103501522 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103501522 PE=4 SV=1)

HSP 1 Score: 800.0 bits (2065), Expect = 5.0e-228
Identity = 392/455 (86.15%), Positives = 419/455 (92.09%), Query Frame = 0

Query: 1   MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 60
           MD PLDFESEDPLLSSPVALKKRK IIGLDDLLTDHYKDKCKLVEKE++ AKKKK Y+SD
Sbjct: 3   MDAPLDFESEDPLLSSPVALKKRKKIIGLDDLLTDHYKDKCKLVEKEAKLAKKKKKYDSD 62

Query: 61  DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN 120
           DDDFGKEAVV+QVVD+CQNKM+QLGGEEDTSIWGL VFGEQK PP L+SPEL+SC  L+ 
Sbjct: 63  DDDFGKEAVVTQVVDQCQNKMNQLGGEEDTSIWGLVVFGEQKPPPALESPELESCQFLQT 122

Query: 121 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL 180
           FLN+EVNSLVNLT+EKGDAFLEGLLVNGWLSTLVSLTGR EK +AIWTFNLMLYSSRE+L
Sbjct: 123 FLNSEVNSLVNLTVEKGDAFLEGLLVNGWLSTLVSLTGRAEKSLAIWTFNLMLYSSREDL 182

Query: 181 RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR 240
           RTSA DFW+DIML  +EV Q  LQIDWFPN+AQLGEALETYG+RFECSLNPG +H GSGR
Sbjct: 183 RTSACDFWKDIMLTTNEVGQH-LQIDWFPNYAQLGEALETYGYRFECSLNPGLIHTGSGR 242

Query: 241 GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ 300
           GGPPQNIRAWIKFITICCQTK KK+IFT SEVE+LAEAIICLFLDRQFQG++ LLCECLQ
Sbjct: 243 GGPPQNIRAWIKFITICCQTKNKKNIFTSSEVERLAEAIICLFLDRQFQGITELLCECLQ 302

Query: 301 SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM 360
           SLIHYFTDEEW ACCEKIAKSLVCRIP DLNCLRAVECISG D RSKYLRS VAYQILL 
Sbjct: 303 SLIHYFTDEEWNACCEKIAKSLVCRIPMDLNCLRAVECISGVDPRSKYLRSTVAYQILLF 362

Query: 361 CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL 420
           CF+ EATNEE VLR++TSITVKDK+CDLFKLYIYLVLTENWLVG RT EGKPL  EMWGL
Sbjct: 363 CFKKEATNEEEVLRVVTSITVKDKNCDLFKLYIYLVLTENWLVGGRTCEGKPLTREMWGL 422

Query: 421 FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 456
           FLRNCSCQI STDLRSYASKVRNKASYILQSSF+E
Sbjct: 423 FLRNCSCQIASTDLRSYASKVRNKASYILQSSFDE 456

BLAST of Moc03g21040 vs. ExPASy TrEMBL
Match: A0A6J1HJZ9 (uncharacterized protein LOC111464803 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111464803 PE=4 SV=1)

HSP 1 Score: 797.7 bits (2059), Expect = 2.5e-227
Identity = 385/455 (84.62%), Positives = 422/455 (92.75%), Query Frame = 0

Query: 1   MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 60
           MDGPLDFESEDPLLSSPV+LKKRK IIGLDDLLTDHYKDKCKLVEKES+QAKKKK Y+SD
Sbjct: 3   MDGPLDFESEDPLLSSPVSLKKRKKIIGLDDLLTDHYKDKCKLVEKESKQAKKKKKYDSD 62

Query: 61  DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN 120
           D+DFGKEAVVSQVVDECQNKM+QLGGEEDTSIWGL VF EQK PP LQSPELK+C+LL+ 
Sbjct: 63  DEDFGKEAVVSQVVDECQNKMNQLGGEEDTSIWGLNVFREQKPPPALQSPELKNCYLLQA 122

Query: 121 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL 180
           FLNNEVNSLVNLT+EKG+AFLEGLLVNGWLSTLVS+T RVEKP+AIWTFNLMLYSSREEL
Sbjct: 123 FLNNEVNSLVNLTVEKGEAFLEGLLVNGWLSTLVSVTNRVEKPLAIWTFNLMLYSSREEL 182

Query: 181 RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR 240
            TSA DFWRDIML  ++V QQ L+I+WFP +AQLGEALETYG+RFECSL PGFVH GSGR
Sbjct: 183 STSACDFWRDIMLTTNKVNQQHLRIEWFPTYAQLGEALETYGYRFECSLKPGFVHNGSGR 242

Query: 241 GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ 300
           GGPPQNIRAWIKF+T+CCQTK+KK+IFT SEVE+LAEAI+CLFLDRQF+G+++LLCECLQ
Sbjct: 243 GGPPQNIRAWIKFVTVCCQTKMKKYIFTSSEVERLAEAILCLFLDRQFRGVTVLLCECLQ 302

Query: 301 SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM 360
           SL+HYFTDE+WKACC+ IAK+LV RIP DLNCLRAVECISG D RSKYLRS VAYQILL+
Sbjct: 303 SLVHYFTDEDWKACCDNIAKALVSRIPMDLNCLRAVECISGVDQRSKYLRSTVAYQILLI 362

Query: 361 CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL 420
            FE E TNEE VLR+LTS+TVKD+SCDLFKLYIYLVLTENWLVGSR L+GKPLICEMWGL
Sbjct: 363 FFENEGTNEEEVLRVLTSMTVKDRSCDLFKLYIYLVLTENWLVGSRMLDGKPLICEMWGL 422

Query: 421 FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 456
           FLR CSCQI STDLRSYASKVRNKASYILQS F E
Sbjct: 423 FLRKCSCQIASTDLRSYASKVRNKASYILQSCFEE 457

BLAST of Moc03g21040 vs. TAIR 10
Match: AT2G28130.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 13 growth stages; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 496.9 bits (1278), Expect = 1.7e-140
Identity = 241/452 (53.32%), Positives = 334/452 (73.89%), Query Frame = 0

Query: 1   MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 60
           +DGPLDFE+EDPL++ P  ++KRK +IGLDDLL+D YK+K K+++K +++ K  K Y+SD
Sbjct: 3   LDGPLDFENEDPLVNPPTIIEKRKKVIGLDDLLSDFYKEKSKVIDKVNKKRKVSKVYHSD 62

Query: 61  DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN 120
           DD+ G+   +SQ V ECQN+M+++  EE+   WGL +FG+QKTP      +L SC LL+ 
Sbjct: 63  DDEQGQVDKLSQCVVECQNQMNEIADEEENQEWGLSMFGDQKTPIPSLLVDLDSCCLLKE 122

Query: 121 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL 180
           F+NN++N +V LT+++G  F+EGLLVNGWL+ L+   GRVEK I  WT N++LYSS+E+L
Sbjct: 123 FMNNQLNLVVGLTVDEGTTFIEGLLVNGWLTRLIMTCGRVEKFICKWTLNILLYSSKEDL 182

Query: 181 RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPG--FVHAGS 240
           R+SA DFW  I+L +++V    ++I W PN+ +L EALE+YGFR   S +       A S
Sbjct: 183 RSSACDFWCSILLSQNKVNGASVEIYWLPNYQELKEALESYGFRISLSHSQDVELAEADS 242

Query: 241 GRGGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCEC 300
              GPPQNIRAW+  +T CCQ + KK IFT S+VEQ+AE ++ L LDR   GLS+LL EC
Sbjct: 243 ECQGPPQNIRAWLTLVTTCCQFRCKKPIFTTSQVEQIAEILVSLLLDRSLLGLSILLQEC 302

Query: 301 LQSLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQIL 360
           L S+I  F +EEW + C+KIA SL  R+P+D+NCLR VE +SG DARSK+LRS++A+Q+L
Sbjct: 303 LISVIGSFKEEEWISSCKKIANSLASRVPQDINCLRVVESVSGVDARSKHLRSSIAHQML 362

Query: 361 LMCFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMW 420
           ++  +    ++E +L  L SI VK++SC+LFK+YI+LVL ENWL  S  +E KP++ +MW
Sbjct: 363 VVLLD-HKDSDENLLSSLMSINVKERSCNLFKMYIFLVLAENWLFSSTLVEAKPVLRDMW 422

Query: 421 GLFLRNCSCQITSTDLRSYASKVRNKASYILQ 451
            +FLRNCSCQI STDLRSYASKVR +A+Y+LQ
Sbjct: 423 AVFLRNCSCQINSTDLRSYASKVRTRAAYLLQ 453
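
The Identity and Positives figures reported for each HSP can be recovered from the middle line of the alignment: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal sketch of that arithmetic, not the code used to produce this page; the function names `midline_counts` and `percent` are hypothetical, and the sample midline is the last segment of the AT2G28130.1 alignment above.

```python
def midline_counts(midline: str) -> tuple[int, int]:
    """Count identities and positives in one BLAST midline segment.

    Assumes the usual BLASTP convention: letters are identical residues,
    '+' is a positive-scoring substitution, spaces are mismatches or gaps.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")  # positives include identities
    return identities, positives


def percent(part: int, whole: int) -> float:
    """Percentage rounded to two decimals, as shown in the HSP header."""
    return round(100.0 * part / whole, 2)


# Last midline segment of the TAIR 10 alignment (32 aligned columns).
segment = " +FLRNCSCQI STDLRSYASKVR +A+Y+LQ"
ident, pos = midline_counts(segment)
print(ident, pos, len(segment))

# Whole-HSP totals quoted in the headers above.
print(percent(241, 452), percent(334, 452))
print(percent(385, 455), percent(422, 455))
```

Summing the per-segment counts over every midline of an HSP and dividing by the alignment length gives the percentages in the HSP header.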

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_022152868.1  4.3e-266  100.00    uncharacterized protein LOC111020493 [Momordica charantia]
XP_038895435.1  8.1e-233  87.69     uncharacterized protein LOC120083668 isoform X2 [Benincasa hispida]
XP_011653682.1  1.5e-231  86.81     uncharacterized protein LOC101216339 isoform X1 [Cucumis sativus] >KGN64767.1 hy...
XP_038895434.1  2.2e-230  86.33     uncharacterized protein LOC120083668 isoform X1 [Benincasa hispida]
XP_023520957.1  2.4e-229  85.27     uncharacterized protein LOC111784515 isoform X1 [Cucurbita pepo subsp. pepo] >XP...
Match Name  E-value   Identity  Description
A0A6J1DG18  2.1e-266  100.00    uncharacterized protein LOC111020493 OS=Momordica charantia OX=3673 GN=LOC111020...
A0A0A0LVC0  7.4e-232  86.81     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G095530 PE=4 SV=1
A0A5D3BL50  5.0e-228  86.15     Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3CJ26  5.0e-228  86.15     uncharacterized protein LOC103501522 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1HJZ9  2.5e-227  84.62     uncharacterized protein LOC111464803 isoform X1 OS=Cucurbita moschata OX=3662 GN...
Match Name   E-value   Identity  Description
AT2G28130.1  1.7e-140  53.32     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR Term  IPR Description   Source   Source Term  Source Description                               Alignment
None      No IPR available  PANTHER  PTHR37212    ACTIN PROTEIN 2/3 COMPLEX SUBUNIT-LIKE PROTEIN  coord: 1..454

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Moc03g21040.1  Moc03g21040.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006281 DNA repair
cellular_component GO:0005634 nucleus