MS021968 (gene) Bitter gourd (TR) v1

Overview
Name: MS021968
Type: gene
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Alpha-(1,6)-fucosyltransferase
Location: scaffold110: 660109 .. 665112 (+)
RNA-Seq Expression: MS021968
Synteny: MS021968
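The Location field above gives a scaffold name, a start and end coordinate, and a strand. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the genomic span of the feature can be computed as follows. The function names `parse_location` and `span_length` are hypothetical helpers introduced only for this illustration.

```python
import re


def parse_location(text):
    """Parse a location string such as 'scaffold110: 660109 .. 665112 (+)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    # Returns (sequence id, start, end, strand).
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1


seqid, start, end, strand = parse_location("scaffold110: 660109 .. 665112 (+)")
print(seqid, strand, span_length(start, end))
```

Under that coordinate assumption the feature spans 5004 bases on the plus strand of scaffold110. If the database used 0-based or half-open coordinates instead, the result would differ by one.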
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGGCACAGAATCAGAAGTCTTTGGAAAGAGTTGTTTCTCAGAAAGCTCTGCAACTGGGGAGTTCATTTCCTTGTCAAATTTGTGTTGTGGGGTTTCTCTGTGGAGTTTGTATTGCCTCTCTGTTTCTTGGTGCTTTCCCTTTGGGATTTGACTGGAGTTCTTTATCACCCAATTCAGTGACTTCTTCATTGTGGAACTCCACTTCTGAAAATATTAGTAAGTCATCATCTCTCTCTCTCTTAAATGGAATCATATGTATTGCTATGCTTGTTTGATATGAACAAATTCTGAAAGGTGAGTTGGCTGAATTTGACATTTAAGAATTTGGTAAATACAATTTGGATTGAATTCTAAGATATTTGAGAGTTAGCCAAAGATAAGCTCACCATGGGTTTGGCCCCATCATTGGGATCATGGTTCAAGTGGGCTCCTACTTAAGATATTAAATTTCTTACCACGATACCGGCCGTGGTAGGACTACTAGATTTTTATCTCATGAGATTAGTCGAGGTGTTTAGCCAAAGATAGTGTATTATGAGTTTTCTAATGCTTAATGTTATAGGGTCGAATGATTAGCGAATAAGCTAAGATTAGTCGAGATGTGCTGAACCAATCCTCAAAACTGTCAGGAGAAAGGGAAAAATTTGTAATTCTTGAAGCTTTGTTATGGAACTTTTGTTTGGATTGTGGAAAAATTAATTGGTATAAAGGTGTATGGGAATGAATGATTAAAAGGTGTAGTTTAAGGTACTTAAAGTTTAGAATAGCTTCTACTTTACCCAATGCATTTTGACTTCAATAGCTTAGTTGTATCAAGACTAGTAGATTCTTCACTATTTTCTTCTGGGAGAAGCTACTGGTTCCTACTTTTTTATTTAGTTGGCCATGAATAAAATCATCATTCATTTTTAATTAGGAAATATATATATATAAAAGCATGTAATCATTCAAGCAATTTCAATGAGATGTCATCTTGAAGGAACTATACAATATCTGATCTTTCATGAAAGCAATAAGGTAAATGATTCAGCCATTTTAAATTGCTCCCCAAGGGTGAATAATTTGTTGCTCATCGTTGATTAATCTCTATATTTTGACACTTTTTTCCAACCCCCCGAAAAAAATGTGAATCATTTGTTAGGCTGCAACTTTAGACCAAAGGAGGTTGAGAGACTGAAGGATTTCCAGAGAATTAAGGTAAACATCGCCAATGACGAGAAAGCATCCTTATTATATTCAGCTTGGAGTGTTTTGTTGATTGAACCGATCAGTGGAAAAAATGCTGTTCTGCGGGGTCTTGGATTGGACGACAAAGCTACAGTTCCGAAAGCACCCCATTTGGAAGATTGCAAGTTGAAAGCAGAAGCAAATAAACGTTTCGATGAGCGATCTGAAACTGATGGATTCCCTCCTTGGACAAGTTGGAAGGGGTTCTTAGACTTGCACCTGACTGATGCTGCAACTGAGAGATCGAGCTACCTTCGGCACCGGGAAATATCTAAAGGTTCCTACCCTCCTTGGGTATGTCTCTAAAATTTTTTAACCTTTGATATGTAAAGTGATATTGATGTTAGATTCTTTTTCCATATAATTTTTTTTTTTCTTTTTTTTTTTATGGTTAAGTTACAAATTTTGTCTATTTAGTTACTAGACTTTCAAAATTAAGATCATAAGCTTTGAAGTATAGTTCTATGAGTTCATTCCTGCCATTCATACCGTTAACTAAATAATGACGTGTTCAAGTTTCTTGATCTTTTTTACAAGTTGCTCGAGGAATAACTCTCAATTGTTTTCTTTGTGGGGGCATTTGGCTCGAGAGAAACAACAAAATTTGGGGGGTGGAGAGATCTTGGATCTTATTAGATTTATTAGATTTAATACTTATTTGTGGGATCATTTTTTAAGGAGTTATGTAATTATCTCTTAGGGGTTATTCTTTTGGAGTTATGTAATTATCTGTTAGGGGTTATTCTTTTGGATTGGAGCCCCTTCCTAT
AGGGGGTTTCCTCTTATCGGGCTGTTTTTTTATCCTCTAGTATATCTTTTCATATCTCTCAATGAAAGTTTGGTTTCTTATCTAGAGAAAAAAGGTTTTTTTTTTTATTAATTTTTTGCCTTAAGTCATGTTGCTTCTAATGATTTTGGGCCTCATGTTTAATTATTGTTTAGCACATTCCTAAAACAGCTTCGTATATCGTCCTACAAGCATAGGATAATCCCATAAAGTTAATAATCCGAAAATTTGATTAACCACGACAAATTGATACCAATACAATAGTTTCCAGGTTATCGGATCAGATGAAGAAAATTACCCCTTGACTCGAAAAGTACAACGGGATCTATGGATCCACCAGCATCCATCGAACTGTAGAGATCCAAATGTTCGGTTTCTTGTAGCTGATTGGGAGCGGTTACCTGGATTTGGGATCGGAGCTCAAATTGCAGGAATGTGTGGACTTCTTGCTATAGCGATCAATGAGAAGAGGATTCTTGTTACGAGTTATTATAATCGAGCTGACCATGATGGTTGTCAAGGTACAGATGATAATATGGATGATTAAAAAGAAAGATTAGGTATCGATATGCTATATGTATGAACTGTATTTAATATTTGCCATCTTTTTCCCTGTATCTCTTTTCTTCTGAATTTGAAGCTTGGGAAGGACTTATTTAAGAAAACGCAGCTAGGGAGCATTATTTTAGAGCAACTACTTTTCTGCTAGCATGTTCTTTTGAAAAGGCTCTTAACCTCTGTTTTTTGTTAGGTTCATCTCGGTCCAGTTGGTCGTGCTATTTCTTCCCTGAAACATCCCAAGAATGTCGGGACCATGCATTTGAACTTTTGGGGAACAATGAAGCTTGGAAGAGCAGAATTATAACGGCAAAAGAAAACTACAGCACCAAGGAAATATGGACTGGTCGGATTCCTAGGTGAGATATTGCCTTAGGGAATAAAAATATCCATCAACACGTCAATATTTCCAAGGATTTGACATTGATATCAAGACAATAGACAATATGCTGTATAAATATCAATGAAATATAAAAAATTGCACTAGTATAAATGTAAATACAGATCAAGTTTAGTGATTTCCATAATTTTTAAAGCATCGATTTGAGATCGATATCGATATCAAGATTTTCATTGTATTGCTTTAGTTAAGGTAAAGATTTTCTTATTAATGCAGGACATGGGGAAATCCATGGAGTTATTTGCAACCCACAACAGAAGTTAATGGAAGTCTACTTTTTCATCATCGCAAAATAGATAGAAGGTGGTGGAGAGCACAGGTAATCTTTTATGCTCTTCCAAAAAACTGGTGATCTCTATCCGTTTCATATACATATACATGCAAACGCACACACCACAAACGAACAAATTCAATTCAAGAATCGTTATCGAAGATTGCGATGACTCAATTGCTCCTGTAAATTGTTAGATTACAAACAATGTTTCTCCTTGCAATACAGGACATGTACCTCAGCACCGATACTATTTTGATAATCCCATTTGTTGGCTCATAAGAAGATTATTTTGCTTGAATAATAACAATTGATCCAACCAAAAAACGCTCCAATCGGAAATAAAACCTAAAAGATAATTATAGAAGTCTTTAGTTACAGACGCCCTGAAAGAGATTAAACCTCTTCCTAAGACACAAACCCTTTTAAATAGCTTACTAAAACAAAAGATGTGTTTGAATGTAATAATGAACATGAACTACCTCCTATAAACTTGTCTTACATTAGATTAGAAGAATTTGTGCTGTAAAGTAATATGGAGTGAAATTGTATTATACCAAAGAGAAAATACACTGAGCTGATGAAATTAATCCCCACAATATGCGCAGCTCCACCAAATCCTTTCGAACTCACACCCTAACAAACTCGTCGGATAATTACAAATATGCCACTACTAATATTACCACGACATCATTCCCACCCTACCCAAATCTTAAGAGGGCAGAATTTATCATGCTACAGAACACTGATATT
GCAAACTTTACCCTCAGGCAGTTCGGTACTTAATGAGATTTCAGACAGAATACACATGTGGCTTGATGAATGCTGCTCGCCATGCTGCATTCGGGAAGGAAGCTGCAGAAATGGTTCTCAAGAGTCTCCATGGTAAATGGCCAGAGGTATGTTGAAAACTAGTTTTACTTTTTTGGATTTTAGGATGGGATAAGACAAGAAAAAATTGGCTGCTGGATTCAAATATCAAATGGCTAAAGGAAATTCAATGATTGAACCAACAGTGTCTAATCATAATTTTTCAAGATAAGATGTAAAACAGTATGACACTGTTAGAAATAGAATCAGGTTGTCATTCTAAATCCGTAGTTGTACGGTAGGTATTTACAAATAGACTGGTCGATTATGATTTTCAGGGTCAGGTTTTTCTCTTATATAGAGCATCTTATTTCAGGGAGATTCACTGACATCAAAGCATGATATAGAAGATTTCGTATGGTCGAACCACAAAGCATGGATACCTAGGCCACTGTTAAGCATGCATGTAAGAATGGGAGATAAAGCCTGTGAAATGAAGGTTGTTGAATTTGAAGGATACATGGCCCTTGCCGAACGCATTAGAAGACGTTTTCCTAATCTTGATAGCATTTGGCTTTCAACCGAAATGCAGGTCTGTGTTCTAACCACTATATGAAATGTGTGTGATAGAGCAACATCTATAAGCTCTTCTCTCACTATGTCCATCTCAGGAAGTGATCGATAAAACAAGAAGCTACCCATCCTGGAGATTTTACTACACAGACGTGAAACGACAAGTAGGAAATCTTACGATGGCCACCTACGAAGCACAGCTTGGTAGGATAACCAGCACAAACTATCCCCTTGTGAACTTCCTGATGGCAACTGAGGCTGATTTTTTCGTTGGAGCATTGGGCTCAACATGGTGCTTTCTTATAGATGGAATGAGAAACACGGGGGGTAAAGTAATGGCCGGATACTTGAGTGTTAACAAGGATCGGTTTTGG

mRNA sequence

ATGGAGGCACAGAATCAGAAGTCTTTGGAAAGAGTTGTTTCTCAGAAAGCTCTGCAACTGGGGAGTTCATTTCCTTGTCAAATTTGTGTTGTGGGGTTTCTCTGTGGAGTTTGTATTGCCTCTCTGTTTCTTGGTGCTTTCCCTTTGGGATTTGACTGGAGTTCTTTATCACCCAATTCAGTGACTTCTTCATTGTGGAACTCCACTTCTGAAAATATTAGCTGCAACTTTAGACCAAAGGAGGTTGAGAGACTGAAGGATTTCCAGAGAATTAAGGTAAACATCGCCAATGACGAGAAAGCATCCTTATTATATTCAGCTTGGAGTGTTTTGTTGATTGAACCGATCAGTGGAAAAAATGCTGTTCTGCGGGGTCTTGGATTGGACGACAAAGCTACAGTTCCGAAAGCACCCCATTTGGAAGATTGCAAGTTGAAAGCAGAAGCAAATAAACGTTTCGATGAGCGATCTGAAACTGATGGATTCCCTCCTTGGACAAGTTGGAAGGGGTTCTTAGACTTGCACCTGACTGATGCTGCAACTGAGAGATCGAGCTACCTTCGGCACCGGGAAATATCTAAAGGTTCCTACCCTCCTTGGGTTATCGGATCAGATGAAGAAAATTACCCCTTGACTCGAAAAGTACAACGGGATCTATGGATCCACCAGCATCCATCGAACTGTAGAGATCCAAATGTTCGGTTTCTTGTAGCTGATTGGGAGCGGTTACCTGGATTTGGGATCGGAGCTCAAATTGCAGGAATGTGTGGACTTCTTGCTATAGCGATCAATGAGAAGAGGATTCTTGTTACGAGTTATTATAATCGAGCTGACCATGATGGTTGTCAAGGTTCATCTCGGTCCAGTTGGTCGTGCTATTTCTTCCCTGAAACATCCCAAGAATGTCGGGACCATGCATTTGAACTTTTGGGGAACAATGAAGCTTGGAAGAGCAGAATTATAACGGCAAAAGAAAACTACAGCACCAAGGAAATATGGACTGGTCGGATTCCTAGGACATGGGGAAATCCATGGAGTTATTTGCAACCCACAACAGAAGTTAATGGAAGTCTACTTTTTCATCATCGCAAAATAGATAGAAGGTGGTGGAGAGCACAGGCAGTTCGGTACTTAATGAGATTTCAGACAGAATACACATGTGGCTTGATGAATGCTGCTCGCCATGCTGCATTCGGGAAGGAAGCTGCAGAAATGGTTCTCAAGAGTCTCCATGGTAAATGGCCAGAGGGAGATTCACTGACATCAAAGCATGATATAGAAGATTTCGTATGGTCGAACCACAAAGCATGGATACCTAGGCCACTGTTAAGCATGCATGTAAGAATGGGAGATAAAGCCTGTGAAATGAAGGTTGTTGAATTTGAAGGATACATGGCCCTTGCCGAACGCATTAGAAGACGTTTTCCTAATCTTGATAGCATTTGGCTTTCAACCGAAATGCAGGAAGTGATCGATAAAACAAGAAGCTACCCATCCTGGAGATTTTACTACACAGACGTGAAACGACAAGTAGGAAATCTTACGATGGCCACCTACGAAGCACAGCTTGGTAGGATAACCAGCACAAACTATCCCCTTGTGAACTTCCTGATGGCAACTGAGGCTGATTTTTTCGTTGGAGCATTGGGCTCAACATGGTGCTTTCTTATAGATGGAATGAGAAACACGGGGGGTAAAGTAATGGCCGGATACTTGAGTGTTAACAAGGATCGGTTTTGG

Coding sequence (CDS)

ATGGAGGCACAGAATCAGAAGTCTTTGGAAAGAGTTGTTTCTCAGAAAGCTCTGCAACTGGGGAGTTCATTTCCTTGTCAAATTTGTGTTGTGGGGTTTCTCTGTGGAGTTTGTATTGCCTCTCTGTTTCTTGGTGCTTTCCCTTTGGGATTTGACTGGAGTTCTTTATCACCCAATTCAGTGACTTCTTCATTGTGGAACTCCACTTCTGAAAATATTAGCTGCAACTTTAGACCAAAGGAGGTTGAGAGACTGAAGGATTTCCAGAGAATTAAGGTAAACATCGCCAATGACGAGAAAGCATCCTTATTATATTCAGCTTGGAGTGTTTTGTTGATTGAACCGATCAGTGGAAAAAATGCTGTTCTGCGGGGTCTTGGATTGGACGACAAAGCTACAGTTCCGAAAGCACCCCATTTGGAAGATTGCAAGTTGAAAGCAGAAGCAAATAAACGTTTCGATGAGCGATCTGAAACTGATGGATTCCCTCCTTGGACAAGTTGGAAGGGGTTCTTAGACTTGCACCTGACTGATGCTGCAACTGAGAGATCGAGCTACCTTCGGCACCGGGAAATATCTAAAGGTTCCTACCCTCCTTGGGTTATCGGATCAGATGAAGAAAATTACCCCTTGACTCGAAAAGTACAACGGGATCTATGGATCCACCAGCATCCATCGAACTGTAGAGATCCAAATGTTCGGTTTCTTGTAGCTGATTGGGAGCGGTTACCTGGATTTGGGATCGGAGCTCAAATTGCAGGAATGTGTGGACTTCTTGCTATAGCGATCAATGAGAAGAGGATTCTTGTTACGAGTTATTATAATCGAGCTGACCATGATGGTTGTCAAGGTTCATCTCGGTCCAGTTGGTCGTGCTATTTCTTCCCTGAAACATCCCAAGAATGTCGGGACCATGCATTTGAACTTTTGGGGAACAATGAAGCTTGGAAGAGCAGAATTATAACGGCAAAAGAAAACTACAGCACCAAGGAAATATGGACTGGTCGGATTCCTAGGACATGGGGAAATCCATGGAGTTATTTGCAACCCACAACAGAAGTTAATGGAAGTCTACTTTTTCATCATCGCAAAATAGATAGAAGGTGGTGGAGAGCACAGGCAGTTCGGTACTTAATGAGATTTCAGACAGAATACACATGTGGCTTGATGAATGCTGCTCGCCATGCTGCATTCGGGAAGGAAGCTGCAGAAATGGTTCTCAAGAGTCTCCATGGTAAATGGCCAGAGGGAGATTCACTGACATCAAAGCATGATATAGAAGATTTCGTATGGTCGAACCACAAAGCATGGATACCTAGGCCACTGTTAAGCATGCATGTAAGAATGGGAGATAAAGCCTGTGAAATGAAGGTTGTTGAATTTGAAGGATACATGGCCCTTGCCGAACGCATTAGAAGACGTTTTCCTAATCTTGATAGCATTTGGCTTTCAACCGAAATGCAGGAAGTGATCGATAAAACAAGAAGCTACCCATCCTGGAGATTTTACTACACAGACGTGAAACGACAAGTAGGAAATCTTACGATGGCCACCTACGAAGCACAGCTTGGTAGGATAACCAGCACAAACTATCCCCTTGTGAACTTCCTGATGGCAACTGAGGCTGATTTTTTCGTTGGAGCATTGGGCTCAACATGGTGCTTTCTTATAGATGGAATGAGAAACACGGGGGGTAAAGTAATGGCCGGATACTTGAGTGTTAACAAGGATCGGTTTTGG

Protein sequence

MEAQNQKSLERVVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFPLGFDWSSLSPNSVTSSLWNSTSENISCNFRPKEVERLKDFQRIKVNIANDEKASLLYSAWSVLLIEPISGKNAVLRGLGLDDKATVPKAPHLEDCKLKAEANKRFDERSETDGFPPWTSWKGFLDLHLTDAATERSSYLRHREISKGSYPPWVIGSDEENYPLTRKVQRDLWIHQHPSNCRDPNVRFLVADWERLPGFGIGAQIAGMCGLLAIAINEKRILVTSYYNRADHDGCQGSSRSSWSCYFFPETSQECRDHAFELLGNNEAWKSRIITAKENYSTKEIWTGRIPRTWGNPWSYLQPTTEVNGSLLFHHRKIDRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGKEAAEMVLKSLHGKWPEGDSLTSKHDIEDFVWSNHKAWIPRPLLSMHVRMGDKACEMKVVEFEGYMALAERIRRRFPNLDSIWLSTEMQEVIDKTRSYPSWRFYYTDVKRQVGNLTMATYEAQLGRITSTNYPLVNFLMATEADFFVGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW
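
The protein sequence above appears to correspond to a direct translation of the coding sequence. The sketch below illustrates that relationship on the first and last few codons of the CDS, assuming the standard genetic code (NCBI translation table 1), which the page does not state explicitly. The `translate` function and `CODON_TABLE` name are hypothetical and written only for this example.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a DNA coding sequence codon by codon; unknown codons become 'X'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3))


# First 36 bases and last 12 bases of the CDS listed above.
print(translate("ATGGAGGCACAGAATCAGAAGTCTTTGGAAAGAGTT"))
print(translate("GATCGGTTTTGG"))
```

The two fragments translate to MEAQNQKSLERV and DRFW, which match the start and end of the listed protein. The CDS as shown ends in TGG (W) rather than a stop codon.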
Homology
BLAST of MS021968 vs. NCBI nr
Match: XP_022149877.1 (uncharacterized protein LOC111018192 isoform X1 [Momordica charantia] >XP_022149878.1 uncharacterized protein LOC111018192 isoform X1 [Momordica charantia] >XP_022149879.1 uncharacterized protein LOC111018192 isoform X1 [Momordica charantia] >XP_022149880.1 uncharacterized protein LOC111018192 isoform X1 [Momordica charantia])

HSP 1 Score: 1207.6 bits (3123), Expect = 0.0e+00
Identity = 574/580 (98.97%), Positives = 576/580 (99.31%), Query Frame = 0

Query: 1   MEAQNQKSLERVVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFPLGFDWSSLSPNS 60
           MEAQNQKSLERVVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFPLGFDWSS SPNS
Sbjct: 1   MEAQNQKSLERVVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFPLGFDWSSFSPNS 60

Query: 61  VTSSLWNSTSENISCNFRPKEVERLKDFQRIKVNIANDEKASLLYSAWSVLLIEPISGKN 120
           VTSSLWNSTSENISCNFRPKEVERLKDFQRIKVNIANDEKASLLYSAWSVLL EPISGKN
Sbjct: 61  VTSSLWNSTSENISCNFRPKEVERLKDFQRIKVNIANDEKASLLYSAWSVLLTEPISGKN 120

Query: 121 AVLRGLGLDDKATVPKAPHLEDCKLKAEANKRFDERSETDGFPPWTSWKGFLDLHLTDAA 180
           AV RGLGLDDKATVPKAPHLEDCKLKAEANKRFDERSETDGFPPWTSWKGFLDLHLTDAA
Sbjct: 121 AVPRGLGLDDKATVPKAPHLEDCKLKAEANKRFDERSETDGFPPWTSWKGFLDLHLTDAA 180

Query: 181 TERSSYLRHREISKGSYPPWVIGSDEENYPLTRKVQRDLWIHQHPSNCRDPNVRFLVADW 240
           TERSSYLRHREISKGSYPPWVIGSDEENYPLTRKVQR+LWIHQHPSNCRDPNVRFLVADW
Sbjct: 181 TERSSYLRHREISKGSYPPWVIGSDEENYPLTRKVQRELWIHQHPSNCRDPNVRFLVADW 240

Query: 241 ERLPGFGIGAQIAGMCGLLAIAINEKRILVTSYYNRADHDGCQGSSRSSWSCYFFPETSQ 300
           ERLPGFGIGAQIAGMCGLLAIAINEKRILVTSYYNRADHDGCQGSSRSSWSCYFFPETSQ
Sbjct: 241 ERLPGFGIGAQIAGMCGLLAIAINEKRILVTSYYNRADHDGCQGSSRSSWSCYFFPETSQ 300

Query: 301 ECRDHAFELLGNNEAWKSRIITAKENYSTKEIWTGRIPRTWGNPWSYLQPTTEVNGSLLF 360
           ECRD AFELLGNNEAWKSRIITAKENYSTKEIWTGRIPRTWGNPWSYLQPTTEVNGSLLF
Sbjct: 301 ECRDRAFELLGNNEAWKSRIITAKENYSTKEIWTGRIPRTWGNPWSYLQPTTEVNGSLLF 360

Query: 361 HHRKIDRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGKEAAEMVLKSLHGKWPEGDSL 420
           HHRK+DRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGKEAAEMVLKSLHGKWPEGDSL
Sbjct: 361 HHRKMDRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGKEAAEMVLKSLHGKWPEGDSL 420

Query: 421 TSKHDIEDFVWSNHKAWIPRPLLSMHVRMGDKACEMKVVEFEGYMALAERIRRRFPNLDS 480
           TSKHDIEDFVWSNHKAWIPRPLLSMHVRMGDKACEMKVVEFEGYMALAERIRRRFPNLDS
Sbjct: 421 TSKHDIEDFVWSNHKAWIPRPLLSMHVRMGDKACEMKVVEFEGYMALAERIRRRFPNLDS 480

Query: 481 IWLSTEMQEVIDKTRSYPSWRFYYTDVKRQVGNLTMATYEAQLGRITSTNYPLVNFLMAT 540
           IWLSTEMQEVIDKTRSYPSWRFYYTDVKRQVGNLTMATYEAQLGRITSTNYPLVNFLMAT
Sbjct: 481 IWLSTEMQEVIDKTRSYPSWRFYYTDVKRQVGNLTMATYEAQLGRITSTNYPLVNFLMAT 540

Query: 541 EADFFVGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 581
           EADFFVGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW
Sbjct: 541 EADFFVGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 580
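
The identity and positive-match percentages in each HSP summary line are simple ratios of matched positions to alignment length. As a rough sketch, the fractions can be pulled out of a summary line and recomputed as shown here. The function name `hsp_fractions` is hypothetical, and the regular expression only assumes the `label = a/b` layout seen on this page, not any official BLAST report grammar.

```python
import re


def hsp_fractions(stats_line):
    """Return {label: (matches, length, percent)} for each 'label = a/b' field."""
    out = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", stats_line):
        num, den = int(num), int(den)
        out[label] = (num, den, round(100 * num / den, 2))
    return out


line = "Identity = 574/580 (98.97%), Positives = 576/580 (99.31%), Query Frame = 0"
for label, (num, den, pct) in hsp_fractions(line).items():
    print(f"{label}: {num}/{den} = {pct}%")
```

For the alignment above this reproduces 98.97% identity (574 of 580 positions) and 99.31% positives (576 of 580). The labels are taken from the line as written, so the same function works for the other hits on this page.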

BLAST of MS021968 vs. NCBI nr
Match: XP_038889861.1 (uncharacterized protein LOC120079658 [Benincasa hispida])

HSP 1 Score: 1073.2 bits (2774), Expect = 8.9e-310
Identity = 514/585 (87.86%), Positives = 541/585 (92.48%), Query Frame = 0

Query: 1   MEAQNQKSLERVVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAF-----PLGFDWSS 60
           ME QNQKSLER+VSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAF     PLGF WSS
Sbjct: 1   METQNQKSLERIVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFTSFGSPLGFGWSS 60

Query: 61  LSPNSVTSSLWNSTSENISCNFRPKEVERLKDFQRIKVNIANDEKASLLYSAWSVLLIEP 120
            SPNS  +SLWNS+SENI+CN RPKE+E+LKDFQRIKVNI +DEK SLLYSAWS LL EP
Sbjct: 61  FSPNSQPASLWNSSSENINCNLRPKEIEKLKDFQRIKVNI-DDEKTSLLYSAWSSLLTEP 120

Query: 121 ISGKNAVLRGLGLDDKATVPKAPHLEDCKLKAEANKRFDERSETDGFPPWTSWKGFLDLH 180
           ISG+NA LR LGL DKA +P  PHLEDCK KA+ANK FDERS ++GFPPWTSWKGFLD+H
Sbjct: 121 ISGRNAFLRDLGL-DKAILPNPPHLEDCKSKAKANKHFDERSASNGFPPWTSWKGFLDMH 180

Query: 181 LTDAATERSSYLRHREISKGSYPPWVIGSDEENYPLTRKVQRDLWIHQHPSNCRDPNVRF 240
            T   TE SS L H+E+ +GSYPPWV GSDEENYPLTRKVQRDLWIHQHPSNCRDPNVRF
Sbjct: 181 PT-TTTEESSNLWHQEMLEGSYPPWVSGSDEENYPLTRKVQRDLWIHQHPSNCRDPNVRF 240

Query: 241 LVADWERLPGFGIGAQIAGMCGLLAIAINEKRILVTSYYNRADHDGCQGSSRSSWSCYFF 300
           LVADWERLPGFGIGAQIAGMCGLLAIAINEKR+LVT+YYNRADHDGCQG  RSSWSCYF 
Sbjct: 241 LVADWERLPGFGIGAQIAGMCGLLAIAINEKRVLVTNYYNRADHDGCQGLFRSSWSCYFL 300

Query: 301 PETSQECRDHAFELLGNNEAWKSRIITAKENYSTKEIWTGRIPRTWGNPWSYLQPTTEVN 360
           PETSQECRD AFELLGNNEAWKS IITAKENYSTKEIWTGRIPR+WGNPWSYLQPTTEVN
Sbjct: 301 PETSQECRDRAFELLGNNEAWKSGIITAKENYSTKEIWTGRIPRSWGNPWSYLQPTTEVN 360

Query: 361 GSLLFHHRKIDRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGKEAAEMVLKSLHGKWP 420
           GSLL  HRK+DRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGKEAAEMVLKSLHGKWP
Sbjct: 361 GSLLSKHRKMDRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGKEAAEMVLKSLHGKWP 420

Query: 421 EGDSLTSKHDIEDFVWSNHKAWIPRPLLSMHVRMGDKACEMKVVEFEGYMALAERIRRRF 480
           + DS+TSK+DIEDFVWSNHKAWIPRPLLSMHVRMGDKACEMKVVEFE YMALA+RIRRRF
Sbjct: 421 KKDSVTSKYDIEDFVWSNHKAWIPRPLLSMHVRMGDKACEMKVVEFEEYMALAKRIRRRF 480

Query: 481 PNLDSIWLSTEMQEVIDKTRSYPSWRFYYTDVKRQVGNLTMATYEAQLGRITSTNYPLVN 540
           PNLDSIWLSTEMQEVIDKT SYPSW+FYYT+VKRQVGNLTMATYEAQLGRITSTNYPLVN
Sbjct: 481 PNLDSIWLSTEMQEVIDKTTSYPSWKFYYTNVKRQVGNLTMATYEAQLGRITSTNYPLVN 540

Query: 541 FLMATEADFFVGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 581
           FLMAT+ADFFVGALGSTWCFLIDGMRNT GKVMAGYLSVNKDRFW
Sbjct: 541 FLMATDADFFVGALGSTWCFLIDGMRNTAGKVMAGYLSVNKDRFW 582

BLAST of MS021968 vs. NCBI nr
Match: XP_011649010.1 (uncharacterized protein LOC101206485 isoform X1 [Cucumis sativus] >XP_031737266.1 uncharacterized protein LOC101206485 isoform X1 [Cucumis sativus] >KGN61282.1 hypothetical protein Csa_006093 [Cucumis sativus])

HSP 1 Score: 1067.0 bits (2758), Expect = 5.7e-308
Identity = 513/585 (87.69%), Positives = 537/585 (91.79%), Query Frame = 0

Query: 1   MEAQNQKSLERVVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAF-----PLGFDWSS 60
           MEAQNQKSLER+VSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAF     PLGF WSS
Sbjct: 1   MEAQNQKSLERIVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFTSLGSPLGFGWSS 60

Query: 61  LSPNSVTSSLWNSTSENISCNFRPKEVERLKDFQRIKVNIANDEKASLLYSAWSVLLIEP 120
            SPNS  +SL NSTSENI+CNFRPKE+E L+DFQRIKVN  +DEK SLLYSAWS L+ EP
Sbjct: 61  FSPNSQPASLCNSTSENINCNFRPKEIEELRDFQRIKVN-NDDEKTSLLYSAWSSLMTEP 120

Query: 121 ISGKNAVLRGLGLDDKATVPKAPHLEDCKLKAEANKRFDERSETDGFPPWTSWKGFLDLH 180
           IS +NA LR LGL DKAT+P APHLE+CKLKAE NKRFDER +TDGFPPWTSWKG LD H
Sbjct: 121 ISSRNAFLRDLGL-DKATIPNAPHLENCKLKAETNKRFDERLQTDGFPPWTSWKGILDTH 180

Query: 181 LTDAATERSSYLRHREISKGSYPPWVIGSDEENYPLTRKVQRDLWIHQHPSNCRDPNVRF 240
            T A TE SSYLR +E+  GS+PPWV GSDEENYPLTRKVQRDLWIHQHP NC D NVRF
Sbjct: 181 PT-AMTEESSYLRRQEMFGGSFPPWVSGSDEENYPLTRKVQRDLWIHQHPLNCSDSNVRF 240

Query: 241 LVADWERLPGFGIGAQIAGMCGLLAIAINEKRILVTSYYNRADHDGCQGSSRSSWSCYFF 300
           LVADWERLPGFGIGAQIAGMCGLLAIAINEKR+LVT+YYNRADHDGCQGSSRSSWSCYF 
Sbjct: 241 LVADWERLPGFGIGAQIAGMCGLLAIAINEKRVLVTNYYNRADHDGCQGSSRSSWSCYFL 300

Query: 301 PETSQECRDHAFELLGNNEAWKSRIITAKENYSTKEIWTGRIPRTWGNPWSYLQPTTEVN 360
           PETSQECRD AFELLGNNEAWKS IITAKENYSTKEIWTGRIPRTWGNPWSYLQPTTEVN
Sbjct: 301 PETSQECRDRAFELLGNNEAWKSGIITAKENYSTKEIWTGRIPRTWGNPWSYLQPTTEVN 360

Query: 361 GSLLFHHRKIDRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGKEAAEMVLKSLHGKWP 420
           GSLL  HRK+DRRWWRAQAVRYLMRF+TEYTCGLMNAARHAAFGKEAAEM LKSL GKWP
Sbjct: 361 GSLLSKHRKMDRRWWRAQAVRYLMRFKTEYTCGLMNAARHAAFGKEAAEMALKSLDGKWP 420

Query: 421 EGDSLTSKHDIEDFVWSNHKAWIPRPLLSMHVRMGDKACEMKVVEFEGYMALAERIRRRF 480
           + DS TSKHDIEDFVWSNHKAWIPRPLLSMHVRMGDKACEMKVVEF  YMALA+RIRRRF
Sbjct: 421 KKDSTTSKHDIEDFVWSNHKAWIPRPLLSMHVRMGDKACEMKVVEFAEYMALAKRIRRRF 480

Query: 481 PNLDSIWLSTEMQEVIDKTRSYPSWRFYYTDVKRQVGNLTMATYEAQLGRITSTNYPLVN 540
           PNLD+IWLSTEMQEVIDKT SYPSW+FYYT+VKRQVGNLTMATYEAQLGRITSTNYPLVN
Sbjct: 481 PNLDNIWLSTEMQEVIDKTVSYPSWKFYYTNVKRQVGNLTMATYEAQLGRITSTNYPLVN 540

Query: 541 FLMATEADFFVGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 581
           FLMATEADFF+GALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW
Sbjct: 541 FLMATEADFFIGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 582

BLAST of MS021968 vs. NCBI nr
Match: XP_008441824.1 (PREDICTED: uncharacterized protein LOC103485874 [Cucumis melo] >XP_008441825.1 PREDICTED: uncharacterized protein LOC103485874 [Cucumis melo] >KAA0049954.1 uncharacterized protein E6C27_scaffold13G001180 [Cucumis melo var. makuwa] >TYK07686.1 uncharacterized protein E5676_scaffold105G001010 [Cucumis melo var. makuwa])

HSP 1 Score: 1060.8 bits (2742), Expect = 4.1e-306
Identity = 507/585 (86.67%), Positives = 533/585 (91.11%), Query Frame = 0

Query: 1   MEAQNQKSLERVVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAF-----PLGFDWSS 60
           MEAQNQKSLER+VSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAF     PLGF WSS
Sbjct: 1   MEAQNQKSLERIVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFTSLGSPLGFGWSS 60

Query: 61  LSPNSVTSSLWNSTSENISCNFRPKEVERLKDFQRIKVNIANDEKASLLYSAWSVLLIEP 120
            SPNS  +S WNSTSEN +CNFRPKE+E  KDFQR+KVNI +DEK SLLYSAWS LL EP
Sbjct: 61  FSPNSQPASSWNSTSENTNCNFRPKEIENPKDFQRVKVNIDDDEKTSLLYSAWSSLLTEP 120

Query: 121 ISGKNAVLRGLGLDDKATVPKAPHLEDCKLKAEANKRFDERSETDGFPPWTSWKGFLDLH 180
           +S +N  LR LGL DK T+P APHLE+C LKAEANKRFDERS TDGFP WTSWKGFLD H
Sbjct: 121 VSRRNTFLRDLGL-DKGTIPNAPHLENCMLKAEANKRFDERSATDGFPSWTSWKGFLDTH 180

Query: 181 LTDAATERSSYLRHREISKGSYPPWVIGSDEENYPLTRKVQRDLWIHQHPSNCRDPNVRF 240
            T A TE SS LR +E  +GSYPPWV GSDEENYPLTRKVQRDLWIHQHP NC D N+RF
Sbjct: 181 PT-AMTEESSNLRRQEKFEGSYPPWVSGSDEENYPLTRKVQRDLWIHQHPLNCSDSNIRF 240

Query: 241 LVADWERLPGFGIGAQIAGMCGLLAIAINEKRILVTSYYNRADHDGCQGSSRSSWSCYFF 300
           LVADWERLPGFGIGAQIAGMCGLLAIAINEKR+LVT+YYNRADHDGCQGSSRSSWSCYF 
Sbjct: 241 LVADWERLPGFGIGAQIAGMCGLLAIAINEKRVLVTNYYNRADHDGCQGSSRSSWSCYFL 300

Query: 301 PETSQECRDHAFELLGNNEAWKSRIITAKENYSTKEIWTGRIPRTWGNPWSYLQPTTEVN 360
           PETSQECRD AFELLGNNEAWKS IITAKENYSTKEIWTGRIPR WGNPWSYLQPTTEVN
Sbjct: 301 PETSQECRDRAFELLGNNEAWKSGIITAKENYSTKEIWTGRIPRAWGNPWSYLQPTTEVN 360

Query: 361 GSLLFHHRKIDRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGKEAAEMVLKSLHGKWP 420
           GSLL  HRK+DRRWWRAQAVRYLMRF+TEY CGLMNAARHAAFGKEAAEMVLKSL GKWP
Sbjct: 361 GSLLSKHRKMDRRWWRAQAVRYLMRFKTEYMCGLMNAARHAAFGKEAAEMVLKSLDGKWP 420

Query: 421 EGDSLTSKHDIEDFVWSNHKAWIPRPLLSMHVRMGDKACEMKVVEFEGYMALAERIRRRF 480
           + DS+TSK DIEDFVWS+HKAWIPRPLLSMHVRMGDKACEMKVVEFE YMALA+RIRRRF
Sbjct: 421 KKDSMTSKRDIEDFVWSDHKAWIPRPLLSMHVRMGDKACEMKVVEFEEYMALAKRIRRRF 480

Query: 481 PNLDSIWLSTEMQEVIDKTRSYPSWRFYYTDVKRQVGNLTMATYEAQLGRITSTNYPLVN 540
           PNLD+IWLSTEMQEVIDKT SYPSW+FYYT+VKRQ+GNLTMATYEAQLGRITSTNYPLVN
Sbjct: 481 PNLDNIWLSTEMQEVIDKTVSYPSWKFYYTNVKRQIGNLTMATYEAQLGRITSTNYPLVN 540

Query: 541 FLMATEADFFVGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 581
           FLMATEADFF+GALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW
Sbjct: 541 FLMATEADFFIGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 583

BLAST of MS021968 vs. NCBI nr
Match: KAG6586157.1 (hypothetical protein SDJN03_18890, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1055.0 bits (2727), Expect = 2.3e-304
Identity = 508/585 (86.84%), Positives = 537/585 (91.79%), Query Frame = 0

Query: 1   MEAQNQKSLERVVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAF-----PLGFDWSS 60
           MEAQNQKSLERVVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAF     PLGF WSS
Sbjct: 77  MEAQNQKSLERVVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFASFGAPLGFGWSS 136

Query: 61  LSPNSVTSSLWNSTSENISCNFRPKEVERLKDFQRIKVNIANDEKASLLYSAWSVLLIEP 120
           LS NS+ +S WNSTSE I+CN + +EVER KDF++I+V+  +++K SLLY+AWS  L EP
Sbjct: 137 LSSNSLPASFWNSTSEAINCNLKVEEVERPKDFRKIEVH-KDEDKVSLLYAAWSSSLTEP 196

Query: 121 ISGKNAVLRGLGLDDKATVPKAPHLEDCKLKAEANKRFDERSETDGFPPWTSWKGFLDLH 180
           I G NA  R LGL DKATVP APHLEDCKLK EANKRFDER  TDGF  WTSWKGFLD+H
Sbjct: 197 IRGNNAFSRYLGL-DKATVPNAPHLEDCKLKVEANKRFDER--TDGFLRWTSWKGFLDMH 256

Query: 181 LTDAATERSSYLRHREISKGSYPPWVIGSDEENYPLTRKVQRDLWIHQHPSNCRDPNVRF 240
            T A +E SSYLRH+E+SKGSYPPWV GSDEENYPLTRKVQRDLWIHQHP NCRDPNVRF
Sbjct: 257 PT-ATSEESSYLRHQEMSKGSYPPWVTGSDEENYPLTRKVQRDLWIHQHPLNCRDPNVRF 316

Query: 241 LVADWERLPGFGIGAQIAGMCGLLAIAINEKRILVTSYYNRADHDGCQGSSRSSWSCYFF 300
           LVADWERLPGFGIGAQIAGMCGLLAIAINEKR+LVT+YYNRADHDGCQGSSRSSWSCYFF
Sbjct: 317 LVADWERLPGFGIGAQIAGMCGLLAIAINEKRVLVTNYYNRADHDGCQGSSRSSWSCYFF 376

Query: 301 PETSQECRDHAFELLGNNEAWKSRIITAKENYSTKEIWTGRIPRTWGNPWSYLQPTTEVN 360
           PETSQECRD AFELLGNN AWKS IITAKENYSTKEIWTGR+PR WGNPWSY+QPTTEVN
Sbjct: 377 PETSQECRDRAFELLGNNTAWKSGIITAKENYSTKEIWTGRVPRIWGNPWSYMQPTTEVN 436

Query: 361 GSLLFHHRKIDRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGKEAAEMVLKSLHGKWP 420
           GSLL +HRK+DRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGKEAAEMVLKSL GKWP
Sbjct: 437 GSLLSNHRKMDRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGKEAAEMVLKSLDGKWP 496

Query: 421 EGDSLTSKHDIEDFVWSNHKAWIPRPLLSMHVRMGDKACEMKVVEFEGYMALAERIRRRF 480
           + DS+ SKHDIE+FVWSNHK WIPRPLLSMHVRMGDKACEMKVVEFE YMALA RIRRRF
Sbjct: 497 KNDSMVSKHDIEEFVWSNHKPWIPRPLLSMHVRMGDKACEMKVVEFEEYMALAGRIRRRF 556

Query: 481 PNLDSIWLSTEMQEVIDKTRSYPSWRFYYTDVKRQVGNLTMATYEAQLGRITSTNYPLVN 540
           PNLDSIWLSTEMQEVIDKTRSYPSW+FYYT+VKRQVGNLTMATYEAQLGRITSTNYPLVN
Sbjct: 557 PNLDSIWLSTEMQEVIDKTRSYPSWKFYYTNVKRQVGNLTMATYEAQLGRITSTNYPLVN 616

Query: 541 FLMATEADFFVGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 581
           FLMATEADFF+GALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW
Sbjct: 617 FLMATEADFFIGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 656

BLAST of MS021968 vs. ExPASy TrEMBL
Match: A0A6J1D6Z2 (uncharacterized protein LOC111018192 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111018192 PE=4 SV=1)

HSP 1 Score: 1207.6 bits (3123), Expect = 0.0e+00
Identity = 574/580 (98.97%), Positives = 576/580 (99.31%), Query Frame = 0

Query: 1   MEAQNQKSLERVVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFPLGFDWSSLSPNS 60
           MEAQNQKSLERVVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFPLGFDWSS SPNS
Sbjct: 1   MEAQNQKSLERVVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFPLGFDWSSFSPNS 60

Query: 61  VTSSLWNSTSENISCNFRPKEVERLKDFQRIKVNIANDEKASLLYSAWSVLLIEPISGKN 120
           VTSSLWNSTSENISCNFRPKEVERLKDFQRIKVNIANDEKASLLYSAWSVLL EPISGKN
Sbjct: 61  VTSSLWNSTSENISCNFRPKEVERLKDFQRIKVNIANDEKASLLYSAWSVLLTEPISGKN 120

Query: 121 AVLRGLGLDDKATVPKAPHLEDCKLKAEANKRFDERSETDGFPPWTSWKGFLDLHLTDAA 180
           AV RGLGLDDKATVPKAPHLEDCKLKAEANKRFDERSETDGFPPWTSWKGFLDLHLTDAA
Sbjct: 121 AVPRGLGLDDKATVPKAPHLEDCKLKAEANKRFDERSETDGFPPWTSWKGFLDLHLTDAA 180

Query: 181 TERSSYLRHREISKGSYPPWVIGSDEENYPLTRKVQRDLWIHQHPSNCRDPNVRFLVADW 240
           TERSSYLRHREISKGSYPPWVIGSDEENYPLTRKVQR+LWIHQHPSNCRDPNVRFLVADW
Sbjct: 181 TERSSYLRHREISKGSYPPWVIGSDEENYPLTRKVQRELWIHQHPSNCRDPNVRFLVADW 240

Query: 241 ERLPGFGIGAQIAGMCGLLAIAINEKRILVTSYYNRADHDGCQGSSRSSWSCYFFPETSQ 300
           ERLPGFGIGAQIAGMCGLLAIAINEKRILVTSYYNRADHDGCQGSSRSSWSCYFFPETSQ
Sbjct: 241 ERLPGFGIGAQIAGMCGLLAIAINEKRILVTSYYNRADHDGCQGSSRSSWSCYFFPETSQ 300

Query: 301 ECRDHAFELLGNNEAWKSRIITAKENYSTKEIWTGRIPRTWGNPWSYLQPTTEVNGSLLF 360
           ECRD AFELLGNNEAWKSRIITAKENYSTKEIWTGRIPRTWGNPWSYLQPTTEVNGSLLF
Sbjct: 301 ECRDRAFELLGNNEAWKSRIITAKENYSTKEIWTGRIPRTWGNPWSYLQPTTEVNGSLLF 360

Query: 361 HHRKIDRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGKEAAEMVLKSLHGKWPEGDSL 420
           HHRK+DRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGKEAAEMVLKSLHGKWPEGDSL
Sbjct: 361 HHRKMDRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGKEAAEMVLKSLHGKWPEGDSL 420

Query: 421 TSKHDIEDFVWSNHKAWIPRPLLSMHVRMGDKACEMKVVEFEGYMALAERIRRRFPNLDS 480
           TSKHDIEDFVWSNHKAWIPRPLLSMHVRMGDKACEMKVVEFEGYMALAERIRRRFPNLDS
Sbjct: 421 TSKHDIEDFVWSNHKAWIPRPLLSMHVRMGDKACEMKVVEFEGYMALAERIRRRFPNLDS 480

Query: 481 IWLSTEMQEVIDKTRSYPSWRFYYTDVKRQVGNLTMATYEAQLGRITSTNYPLVNFLMAT 540
           IWLSTEMQEVIDKTRSYPSWRFYYTDVKRQVGNLTMATYEAQLGRITSTNYPLVNFLMAT
Sbjct: 481 IWLSTEMQEVIDKTRSYPSWRFYYTDVKRQVGNLTMATYEAQLGRITSTNYPLVNFLMAT 540

Query: 541 EADFFVGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 581
           EADFFVGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW
Sbjct: 541 EADFFVGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 580

BLAST of MS021968 vs. ExPASy TrEMBL
Match: A0A0A0LKK1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G075410 PE=4 SV=1)

HSP 1 Score: 1067.0 bits (2758), Expect = 2.8e-308
Identity = 513/585 (87.69%), Positives = 537/585 (91.79%), Query Frame = 0

Query: 1   MEAQNQKSLERVVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAF-----PLGFDWSS 60
           MEAQNQKSLER+VSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAF     PLGF WSS
Sbjct: 1   MEAQNQKSLERIVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFTSLGSPLGFGWSS 60

Query: 61  LSPNSVTSSLWNSTSENISCNFRPKEVERLKDFQRIKVNIANDEKASLLYSAWSVLLIEP 120
            SPNS  +SL NSTSENI+CNFRPKE+E L+DFQRIKVN  +DEK SLLYSAWS L+ EP
Sbjct: 61  FSPNSQPASLCNSTSENINCNFRPKEIEELRDFQRIKVN-NDDEKTSLLYSAWSSLMTEP 120

Query: 121 ISGKNAVLRGLGLDDKATVPKAPHLEDCKLKAEANKRFDERSETDGFPPWTSWKGFLDLH 180
           IS +NA LR LGL DKAT+P APHLE+CKLKAE NKRFDER +TDGFPPWTSWKG LD H
Sbjct: 121 ISSRNAFLRDLGL-DKATIPNAPHLENCKLKAETNKRFDERLQTDGFPPWTSWKGILDTH 180

Query: 181 LTDAATERSSYLRHREISKGSYPPWVIGSDEENYPLTRKVQRDLWIHQHPSNCRDPNVRF 240
            T A TE SSYLR +E+  GS+PPWV GSDEENYPLTRKVQRDLWIHQHP NC D NVRF
Sbjct: 181 PT-AMTEESSYLRRQEMFGGSFPPWVSGSDEENYPLTRKVQRDLWIHQHPLNCSDSNVRF 240

Query: 241 LVADWERLPGFGIGAQIAGMCGLLAIAINEKRILVTSYYNRADHDGCQGSSRSSWSCYFF 300
           LVADWERLPGFGIGAQIAGMCGLLAIAINEKR+LVT+YYNRADHDGCQGSSRSSWSCYF 
Sbjct: 241 LVADWERLPGFGIGAQIAGMCGLLAIAINEKRVLVTNYYNRADHDGCQGSSRSSWSCYFL 300

Query: 301 PETSQECRDHAFELLGNNEAWKSRIITAKENYSTKEIWTGRIPRTWGNPWSYLQPTTEVN 360
           PETSQECRD AFELLGNNEAWKS IITAKENYSTKEIWTGRIPRTWGNPWSYLQPTTEVN
Sbjct: 301 PETSQECRDRAFELLGNNEAWKSGIITAKENYSTKEIWTGRIPRTWGNPWSYLQPTTEVN 360

Query: 361 GSLLFHHRKIDRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGKEAAEMVLKSLHGKWP 420
           GSLL  HRK+DRRWWRAQAVRYLMRF+TEYTCGLMNAARHAAFGKEAAEM LKSL GKWP
Sbjct: 361 GSLLSKHRKMDRRWWRAQAVRYLMRFKTEYTCGLMNAARHAAFGKEAAEMALKSLDGKWP 420

Query: 421 EGDSLTSKHDIEDFVWSNHKAWIPRPLLSMHVRMGDKACEMKVVEFEGYMALAERIRRRF 480
           + DS TSKHDIEDFVWSNHKAWIPRPLLSMHVRMGDKACEMKVVEF  YMALA+RIRRRF
Sbjct: 421 KKDSTTSKHDIEDFVWSNHKAWIPRPLLSMHVRMGDKACEMKVVEFAEYMALAKRIRRRF 480

Query: 481 PNLDSIWLSTEMQEVIDKTRSYPSWRFYYTDVKRQVGNLTMATYEAQLGRITSTNYPLVN 540
           PNLD+IWLSTEMQEVIDKT SYPSW+FYYT+VKRQVGNLTMATYEAQLGRITSTNYPLVN
Sbjct: 481 PNLDNIWLSTEMQEVIDKTVSYPSWKFYYTNVKRQVGNLTMATYEAQLGRITSTNYPLVN 540

Query: 541 FLMATEADFFVGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 581
           FLMATEADFF+GALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW
Sbjct: 541 FLMATEADFFIGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 582

BLAST of MS021968 vs. ExPASy TrEMBL
Match: A0A1S3B531 (uncharacterized protein LOC103485874 OS=Cucumis melo OX=3656 GN=LOC103485874 PE=4 SV=1)

HSP 1 Score: 1060.8 bits (2742), Expect = 2.0e-306
Identity = 507/585 (86.67%), Positives = 533/585 (91.11%), Query Frame = 0

Query: 1   MEAQNQKSLERVVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAF-----PLGFDWSS 60
           MEAQNQKSLER+VSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAF     PLGF WSS
Sbjct: 1   MEAQNQKSLERIVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFTSLGSPLGFGWSS 60

Query: 61  LSPNSVTSSLWNSTSENISCNFRPKEVERLKDFQRIKVNIANDEKASLLYSAWSVLLIEP 120
            SPNS  +S WNSTSEN +CNFRPKE+E  KDFQR+KVNI +DEK SLLYSAWS LL EP
Sbjct: 61  FSPNSQPASSWNSTSENTNCNFRPKEIENPKDFQRVKVNIDDDEKTSLLYSAWSSLLTEP 120

Query: 121 ISGKNAVLRGLGLDDKATVPKAPHLEDCKLKAEANKRFDERSETDGFPPWTSWKGFLDLH 180
           +S +N  LR LGL DK T+P APHLE+C LKAEANKRFDERS TDGFP WTSWKGFLD H
Sbjct: 121 VSRRNTFLRDLGL-DKGTIPNAPHLENCMLKAEANKRFDERSATDGFPSWTSWKGFLDTH 180

Query: 181 LTDAATERSSYLRHREISKGSYPPWVIGSDEENYPLTRKVQRDLWIHQHPSNCRDPNVRF 240
            T A TE SS LR +E  +GSYPPWV GSDEENYPLTRKVQRDLWIHQHP NC D N+RF
Sbjct: 181 PT-AMTEESSNLRRQEKFEGSYPPWVSGSDEENYPLTRKVQRDLWIHQHPLNCSDSNIRF 240

Query: 241 LVADWERLPGFGIGAQIAGMCGLLAIAINEKRILVTSYYNRADHDGCQGSSRSSWSCYFF 300
           LVADWERLPGFGIGAQIAGMCGLLAIAINEKR+LVT+YYNRADHDGCQGSSRSSWSCYF 
Sbjct: 241 LVADWERLPGFGIGAQIAGMCGLLAIAINEKRVLVTNYYNRADHDGCQGSSRSSWSCYFL 300

Query: 301 PETSQECRDHAFELLGNNEAWKSRIITAKENYSTKEIWTGRIPRTWGNPWSYLQPTTEVN 360
           PETSQECRD AFELLGNNEAWKS IITAKENYSTKEIWTGRIPR WGNPWSYLQPTTEVN
Sbjct: 301 PETSQECRDRAFELLGNNEAWKSGIITAKENYSTKEIWTGRIPRAWGNPWSYLQPTTEVN 360

Query: 361 GSLLFHHRKIDRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGKEAAEMVLKSLHGKWP 420
           GSLL  HRK+DRRWWRAQAVRYLMRF+TEY CGLMNAARHAAFGKEAAEMVLKSL GKWP
Sbjct: 361 GSLLSKHRKMDRRWWRAQAVRYLMRFKTEYMCGLMNAARHAAFGKEAAEMVLKSLDGKWP 420

Query: 421 EGDSLTSKHDIEDFVWSNHKAWIPRPLLSMHVRMGDKACEMKVVEFEGYMALAERIRRRF 480
           + DS+TSK DIEDFVWS+HKAWIPRPLLSMHVRMGDKACEMKVVEFE YMALA+RIRRRF
Sbjct: 421 KKDSMTSKRDIEDFVWSDHKAWIPRPLLSMHVRMGDKACEMKVVEFEEYMALAKRIRRRF 480

Query: 481 PNLDSIWLSTEMQEVIDKTRSYPSWRFYYTDVKRQVGNLTMATYEAQLGRITSTNYPLVN 540
           PNLD+IWLSTEMQEVIDKT SYPSW+FYYT+VKRQ+GNLTMATYEAQLGRITSTNYPLVN
Sbjct: 481 PNLDNIWLSTEMQEVIDKTVSYPSWKFYYTNVKRQIGNLTMATYEAQLGRITSTNYPLVN 540

Query: 541 FLMATEADFFVGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 581
           FLMATEADFF+GALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW
Sbjct: 541 FLMATEADFFIGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 583

BLAST of MS021968 vs. ExPASy TrEMBL
Match: A0A5D3CBF0 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold105G001010 PE=4 SV=1)

HSP 1 Score: 1060.8 bits (2742), Expect = 2.0e-306
Identity = 507/585 (86.67%), Positives = 533/585 (91.11%), Query Frame = 0

Query: 1   MEAQNQKSLERVVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAF-----PLGFDWSS 60
           MEAQNQKSLER+VSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAF     PLGF WSS
Sbjct: 1   MEAQNQKSLERIVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFTSLGSPLGFGWSS 60

Query: 61  LSPNSVTSSLWNSTSENISCNFRPKEVERLKDFQRIKVNIANDEKASLLYSAWSVLLIEP 120
            SPNS  +S WNSTSEN +CNFRPKE+E  KDFQR+KVNI +DEK SLLYSAWS LL EP
Sbjct: 61  FSPNSQPASSWNSTSENTNCNFRPKEIENPKDFQRVKVNIDDDEKTSLLYSAWSSLLTEP 120

Query: 121 ISGKNAVLRGLGLDDKATVPKAPHLEDCKLKAEANKRFDERSETDGFPPWTSWKGFLDLH 180
           +S +N  LR LGL DK T+P APHLE+C LKAEANKRFDERS TDGFP WTSWKGFLD H
Sbjct: 121 VSRRNTFLRDLGL-DKGTIPNAPHLENCMLKAEANKRFDERSATDGFPSWTSWKGFLDTH 180

Query: 181 LTDAATERSSYLRHREISKGSYPPWVIGSDEENYPLTRKVQRDLWIHQHPSNCRDPNVRF 240
            T A TE SS LR +E  +GSYPPWV GSDEENYPLTRKVQRDLWIHQHP NC D N+RF
Sbjct: 181 PT-AMTEESSNLRRQEKFEGSYPPWVSGSDEENYPLTRKVQRDLWIHQHPLNCSDSNIRF 240

Query: 241 LVADWERLPGFGIGAQIAGMCGLLAIAINEKRILVTSYYNRADHDGCQGSSRSSWSCYFF 300
           LVADWERLPGFGIGAQIAGMCGLLAIAINEKR+LVT+YYNRADHDGCQGSSRSSWSCYF 
Sbjct: 241 LVADWERLPGFGIGAQIAGMCGLLAIAINEKRVLVTNYYNRADHDGCQGSSRSSWSCYFL 300

Query: 301 PETSQECRDHAFELLGNNEAWKSRIITAKENYSTKEIWTGRIPRTWGNPWSYLQPTTEVN 360
           PETSQECRD AFELLGNNEAWKS IITAKENYSTKEIWTGRIPR WGNPWSYLQPTTEVN
Sbjct: 301 PETSQECRDRAFELLGNNEAWKSGIITAKENYSTKEIWTGRIPRAWGNPWSYLQPTTEVN 360

Query: 361 GSLLFHHRKIDRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGKEAAEMVLKSLHGKWP 420
           GSLL  HRK+DRRWWRAQAVRYLMRF+TEY CGLMNAARHAAFGKEAAEMVLKSL GKWP
Sbjct: 361 GSLLSKHRKMDRRWWRAQAVRYLMRFKTEYMCGLMNAARHAAFGKEAAEMVLKSLDGKWP 420

Query: 421 EGDSLTSKHDIEDFVWSNHKAWIPRPLLSMHVRMGDKACEMKVVEFEGYMALAERIRRRF 480
           + DS+TSK DIEDFVWS+HKAWIPRPLLSMHVRMGDKACEMKVVEFE YMALA+RIRRRF
Sbjct: 421 KKDSMTSKRDIEDFVWSDHKAWIPRPLLSMHVRMGDKACEMKVVEFEEYMALAKRIRRRF 480

Query: 481 PNLDSIWLSTEMQEVIDKTRSYPSWRFYYTDVKRQVGNLTMATYEAQLGRITSTNYPLVN 540
           PNLD+IWLSTEMQEVIDKT SYPSW+FYYT+VKRQ+GNLTMATYEAQLGRITSTNYPLVN
Sbjct: 481 PNLDNIWLSTEMQEVIDKTVSYPSWKFYYTNVKRQIGNLTMATYEAQLGRITSTNYPLVN 540

Query: 541 FLMATEADFFVGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 581
           FLMATEADFF+GALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW
Sbjct: 541 FLMATEADFFIGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 583
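
The percentages in an HSP summary line follow from the fractions printed next to them: the numerator is the count of identical (or positively scoring) positions and the denominator is the number of alignment columns, gaps included. A minimal Python sketch that recomputes them, assuming the summary line has the layout shown above; `parse_hsp_summary` is a hypothetical helper name, not part of any BLAST tool:

```python
import re

# Matches "Label = matches/columns (percent%)". The optional "i" tolerates
# the "Postives" spelling that some report generators emit.
FIELD = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def parse_hsp_summary(line):
    """Return {label: (matches, columns, recomputed_percent)} for one summary line."""
    result = {}
    for label, matches, columns, _reported in FIELD.findall(line):
        key = "Identity" if label == "Identity" else "Positives"
        pct = round(100 * int(matches) / int(columns), 2)
        result[key] = (int(matches), int(columns), pct)
    return result


summary = "Identity = 507/585 (86.67%), Positives = 533/585 (91.11%), Query Frame = 0"
stats = parse_hsp_summary(summary)
print(stats["Identity"])   # (507, 585, 86.67)
print(stats["Positives"])  # (533, 585, 91.11)
```

The recomputed values agree with the percentages reported for this HSP.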

BLAST of MS021968 vs. ExPASy TrEMBL
Match: A0A6J1FBJ6 (uncharacterized protein LOC111443937 OS=Cucurbita moschata OX=3662 GN=LOC111443937 PE=4 SV=1)

HSP 1 Score: 1053.9 bits (2724), Expect = 2.4e-304
Identity = 507/585 (86.67%), Positives = 537/585 (91.79%), Query Frame = 0

Query: 1   MEAQNQKSLERVVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAF-----PLGFDWSS 60
           MEAQNQKSLERVVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAF     PLGF WSS
Sbjct: 1   MEAQNQKSLERVVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAFASFGAPLGFGWSS 60

Query: 61  LSPNSVTSSLWNSTSENISCNFRPKEVERLKDFQRIKVNIANDEKASLLYSAWSVLLIEP 120
           LS NS+ +S WNSTSE I+CN + +EVER KDF++I+V+  +++K SLLY+AWS  L EP
Sbjct: 61  LSSNSLPASFWNSTSEAINCNLKVEEVERPKDFRKIEVH-KDEDKVSLLYAAWSSSLTEP 120

Query: 121 ISGKNAVLRGLGLDDKATVPKAPHLEDCKLKAEANKRFDERSETDGFPPWTSWKGFLDLH 180
           I G NA  R LGL DKATVP APHLEDCKLK EANKRFDER  TDGF  WTSWKGFLD+H
Sbjct: 121 IRGNNAFSRYLGL-DKATVPNAPHLEDCKLKVEANKRFDER--TDGFLRWTSWKGFLDMH 180

Query: 181 LTDAATERSSYLRHREISKGSYPPWVIGSDEENYPLTRKVQRDLWIHQHPSNCRDPNVRF 240
            T A +E SSYLRH+E+SKGSYPPWV GSDEENYPLTRKVQRDLWIHQHP NCRDPNVRF
Sbjct: 181 PT-ATSEESSYLRHQEMSKGSYPPWVTGSDEENYPLTRKVQRDLWIHQHPLNCRDPNVRF 240

Query: 241 LVADWERLPGFGIGAQIAGMCGLLAIAINEKRILVTSYYNRADHDGCQGSSRSSWSCYFF 300
           LVADWERLPGFGIGAQIAGMCGLLAIAINEKR+LVT+YYNRADHDGCQGSSRSSWSCYFF
Sbjct: 241 LVADWERLPGFGIGAQIAGMCGLLAIAINEKRVLVTNYYNRADHDGCQGSSRSSWSCYFF 300

Query: 301 PETSQECRDHAFELLGNNEAWKSRIITAKENYSTKEIWTGRIPRTWGNPWSYLQPTTEVN 360
           PETSQECRD AFELLGNN AWKS IITAKENYSTKEIWTGR+PR WGNPWSY+QPTTEVN
Sbjct: 301 PETSQECRDRAFELLGNNTAWKSGIITAKENYSTKEIWTGRVPRIWGNPWSYMQPTTEVN 360

Query: 361 GSLLFHHRKIDRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGKEAAEMVLKSLHGKWP 420
           GSLL +HRK+DRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGKEAAEMVLKS+ GKWP
Sbjct: 361 GSLLSNHRKMDRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGKEAAEMVLKSVDGKWP 420

Query: 421 EGDSLTSKHDIEDFVWSNHKAWIPRPLLSMHVRMGDKACEMKVVEFEGYMALAERIRRRF 480
           + DS+ SKHDIE+FVWSNHK WIPRPLLSMHVRMGDKACEMKVVEFE YMALA RIRRRF
Sbjct: 421 KNDSMVSKHDIEEFVWSNHKPWIPRPLLSMHVRMGDKACEMKVVEFEEYMALAGRIRRRF 480

Query: 481 PNLDSIWLSTEMQEVIDKTRSYPSWRFYYTDVKRQVGNLTMATYEAQLGRITSTNYPLVN 540
           PNLDSIWLSTEMQEVIDKTRSYPSW+FYYT+VKRQVGNLTMATYEAQLGRITSTNYPLVN
Sbjct: 481 PNLDSIWLSTEMQEVIDKTRSYPSWKFYYTNVKRQVGNLTMATYEAQLGRITSTNYPLVN 540

Query: 541 FLMATEADFFVGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 581
           FLMATEADFF+GALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW
Sbjct: 541 FLMATEADFFIGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 580

BLAST of MS021968 vs. TAIR 10
Match: AT5G28910.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G28960.1); Has 82 Blast hits to 80 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 1; Fungi - 0; Plants - 78; Viruses - 0; Other Eukaryotes - 3 (source: NCBI BLink). )

HSP 1 Score: 694.1 bits (1790), Expect = 9.4e-200
Identity = 343/580 (59.14%), Positives = 428/580 (73.79%), Query Frame = 0

Query: 5   NQKSLERVVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAF-PLG-FDWSSLSPNSVT 64
           + KSLERVVS++AL+LG+SFPCQICVVGFLCG+C+ SLFL A   LG F++++ S  S +
Sbjct: 2   SMKSLERVVSERALKLGNSFPCQICVVGFLCGICLTSLFLAALTSLGTFEFAAFSFTSSS 61

Query: 65  SSL--WNSTSENISCNFRPKEVERLKDFQRIKVNIANDEKASLLYSAWSVLLIEPISGKN 124
           S     NS++ +I  N       +LK   + KV I  +++  LL SAW  LL+     + 
Sbjct: 62  SVFPPCNSSTSHI-INMVASIDRKLK--WKNKVEIEEEDEVKLLVSAWDNLLL----NEE 121

Query: 125 AVLRGLGLDDKATVPKAPHLEDCKLKAEANKRFDERSETDGFPPWTSWKGFLDLHLTDAA 184
             L+ +G+ +K+ VP  PHLE+C+ KA   +R D R                        
Sbjct: 122 DFLKKVGI-NKSDVPNGPHLENCEEKARVRERLDTR------------------------ 181

Query: 185 TERSSYLRHREISKGSYPPWVIGSDEENYPLTRKVQRDLWIHQHPSNCRDPNVRFLVADW 244
                      ++  + PPW+ G DEENYPLTR+VQRD+WIHQHP +C + +++FLVADW
Sbjct: 182 -----------LANWTLPPWISGGDEENYPLTRRVQRDIWIHQHPLDCGNKSLKFLVADW 241

Query: 245 ERLPGFGIGAQIAGMCGLLAIAINEKRILVTSYYNRADHDGCQGSSRSSWSCYFFPETSQ 304
           E LPGFGIGAQIAGM GLLAIAINE R+LV +YYNRADHDGC+GS R +WSCYF  ETS+
Sbjct: 242 ETLPGFGIGAQIAGMTGLLAIAINENRVLVANYYNRADHDGCKGSFRGNWSCYFLQETSE 301

Query: 305 ECRDHAFELLGNNEAWKSRIITAKENYSTKEIWTGRIPRTWGNPWSYLQPTTEVNGSLLF 364
           ECR  AF ++   EAW+S I+T K+NYSTKEIW G IP+ WG PWSY++PTTE+NGSL+ 
Sbjct: 302 ECRKRAFAIVKKREAWESGIVTGKQNYSTKEIWAGAIPKQWGKPWSYMKPTTEINGSLIS 361

Query: 365 HHRKIDRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGKEAAEMVLKSLHGKWPEGDSL 424
           +HRK+DRRWWRAQAVRYLMR+QTEYTCGLMN AR++AFGKEAA++VL +  G W + +  
Sbjct: 362 NHRKMDRRWWRAQAVRYLMRYQTEYTCGLMNIARNSAFGKEAAKIVLSA--GDWRKKNK- 421

Query: 425 TSKHDIEDFVWSNHKAWIPRPLLSMHVRMGDKACEMKVVEFEGYMALAERIRRRFPNLDS 484
             + +IE+ VWS+HK W+PRP+LS+HVRMGDKACEM+V   E YM LA+RIR RFP L+ 
Sbjct: 422 KMRTEIEEQVWSDHKPWLPRPMLSVHVRMGDKACEMRVAALEEYMHLADRIRDRFPELNR 481

Query: 485 IWLSTEMQEVIDKTRSYPSWRFYYTDVKRQVGNLTMATYEAQLGRITSTNYPLVNFLMAT 544
           IWLSTEM+EV+D+++ Y  WRFYYT+V RQVGN +MA YEA LGR  STNYPLVNFLMA+
Sbjct: 482 IWLSTEMKEVVDRSKDYAHWRFYYTEVARQVGNKSMAEYEASLGREMSTNYPLVNFLMAS 535

Query: 545 EADFFVGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 581
           EADFFVGALGSTWCFLIDGMRNTGGKVM+GYLSVNKDRFW
Sbjct: 542 EADFFVGALGSTWCFLIDGMRNTGGKVMSGYLSVNKDRFW 535

BLAST of MS021968 vs. TAIR 10
Match: AT5G28960.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G28910.2); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 674.5 bits (1739), Expect = 7.7e-194
Identity = 333/580 (57.41%), Positives = 412/580 (71.03%), Query Frame = 0

Query: 5   NQKSLERVVSQKALQLGSSFPCQICVVGFLCGVCIASLFLGAF-PLG-FDWSSLSPNSVT 64
           + KSL R+VS+KAL+LG+SFPCQICVVG LCG+C  SLFL A   LG F+ ++ S ++ +
Sbjct: 2   SMKSLNRMVSEKALKLGNSFPCQICVVGILCGICFTSLFLAALTSLGTFELAAFSLSASS 61

Query: 65  SSL--WNSTSENISCNFRPKEVERLKDFQRIKVNIANDEKASLLYSAWSVLLIEPISGKN 124
           S+   +NSTS  I        ++R   +        ND++  LL SAW  L++       
Sbjct: 62  SAFLPYNSTSHII---HMVSSMDRKLKWTNKAEKGDNDDQVKLLVSAWDNLVL----NNE 121

Query: 125 AVLRGLGLDDKATVPKAPHLEDCKLKAEANKRFDERSETDGFPPWTSWKGFLDLHLTDAA 184
             L+ LG+ +K+ VP APHLE+C++ +   +R D R                        
Sbjct: 122 DFLKKLGM-NKSDVPNAPHLENCEVISRVRERLDTR------------------------ 181

Query: 185 TERSSYLRHREISKGSYPPWVIGSDEENYPLTRKVQRDLWIHQHPSNCRDPNVRFLVADW 244
                      I+  S+P W+ G DE+NYPLTR VQ ++WIHQHP +C +  V+FLV DW
Sbjct: 182 -----------IANQSFPTWITGGDEQNYPLTRIVQSEIWIHQHPLDCENETVKFLVVDW 241

Query: 245 ERLPGFGIGAQIAGMCGLLAIAINEKRILVTSYYNRADHDGCQGSSRSSWSCYFFPETSQ 304
           E LP +G GAQI  M GLLAIAINE R+LV ++YNRADHDGC+GSSR  WSCYF PETS+
Sbjct: 242 ETLPIYGSGAQITEMTGLLAIAINENRVLVANHYNRADHDGCRGSSRGRWSCYFLPETSE 301

Query: 305 ECRDHAFELLGNNEAWKSRIITAKENYSTKEIWTGRIPRTWGNPWSYLQPTTEVNGSLLF 364
           ECR  AF ++   EAW+S  +T K+NYS+K IW G IP+ WG PWSY++PTTE+NGSL+ 
Sbjct: 302 ECRKRAFAVVREKEAWESGTVTGKQNYSSKVIWAGPIPKLWGKPWSYMKPTTEINGSLIS 361

Query: 365 HHRKIDRRWWRAQAVRYLMRFQTEYTCGLMNAARHAAFGKEAAEMVLKSLHGKWPEGDSL 424
            HRK+DRRWWRAQAVRYLMRFQT YTCGLMNAAR+ AFGKEAAE+VL +  G W +    
Sbjct: 362 KHRKMDRRWWRAQAVRYLMRFQTAYTCGLMNAARNTAFGKEAAEIVLSA--GDWRKNKKK 421

Query: 425 TSKHDIEDFVWSNHKAWIPRPLLSMHVRMGDKACEMKVVEFEGYMALAERIRRRFPNLDS 484
             K +IE+ VWSNHK W+PRP+LS+HVRMGDKACEM+V   E YM LA+RI+ RFP L+ 
Sbjct: 422 KVKTEIEEQVWSNHKPWVPRPMLSVHVRMGDKACEMRVAALEEYMHLADRIKDRFPELNK 481

Query: 485 IWLSTEMQEVIDKTRSYPSWRFYYTDVKRQVGNLTMATYEAQLGRITSTNYPLVNFLMAT 544
           IWLSTEM+EV+DK++ Y  WRFYYT+V RQVGN ++A YEA LGR TSTNYPLVNFLMA+
Sbjct: 482 IWLSTEMKEVVDKSKDYDHWRFYYTEVARQVGNKSIAEYEASLGRETSTNYPLVNFLMAS 536

Query: 545 EADFFVGALGSTWCFLIDGMRNTGGKVMAGYLSVNKDRFW 581
           EADFFVGALGSTWCFLIDGMRNTGGKVM+GYLSVNKDRFW
Sbjct: 542 EADFFVGALGSTWCFLIDGMRNTGGKVMSGYLSVNKDRFW 536

BLAST of MS021968 vs. TAIR 10
Match: AT5G28910.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: mitochondrion; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G28960.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 601.7 bits (1550), Expect = 6.4e-172
Identity = 271/378 (71.69%), Positives = 322/378 (85.19%), Query Frame = 0

Query: 203 GSDEENYPLTRKVQRDLWIHQHPSNCRDPNVRFLVADWERLPGFGIGAQIAGMCGLLAIA 262
           G DEENYPLTR+VQRD+WIHQHP +C + +++FLVADWE LPGFGIGAQIAGM GLLAIA
Sbjct: 34  GGDEENYPLTRRVQRDIWIHQHPLDCGNKSLKFLVADWETLPGFGIGAQIAGMTGLLAIA 93

Query: 263 INEKRILVTSYYNRADHDGCQGSSRSSWSCYFFPETSQECRDHAFELLGNNEAWKSRIIT 322
           INE R+LV +YYNRADHDGC+GS R +WSCYF  ETS+ECR  AF ++   EAW+S I+T
Sbjct: 94  INENRVLVANYYNRADHDGCKGSFRGNWSCYFLQETSEECRKRAFAIVKKREAWESGIVT 153

Query: 323 AKENYSTKEIWTGRIPRTWGNPWSYLQPTTEVNGSLLFHHRKIDRRWWRAQAVRYLMRFQ 382
            K+NYSTKEIW G IP+ WG PWSY++PTTE+NGSL+ +HRK+DRRWWRAQAVRYLMR+Q
Sbjct: 154 GKQNYSTKEIWAGAIPKQWGKPWSYMKPTTEINGSLISNHRKMDRRWWRAQAVRYLMRYQ 213

Query: 383 TEYTCGLMNAARHAAFGKEAAEMVLKSLHGKWPEGDSLTSKHDIEDFVWSNHKAWIPRPL 442
           TEYTCGLMN AR++AFGKEAA++VL +  G W + +    + +IE+ VWS+HK W+PRP+
Sbjct: 214 TEYTCGLMNIARNSAFGKEAAKIVLSA--GDWRKKNK-KMRTEIEEQVWSDHKPWLPRPM 273

Query: 443 LSMHVRMGDKACEMKVVEFEGYMALAERIRRRFPNLDSIWLSTEMQEVIDKTRSYPSWRF 502
           LS+HVRMGDKACEM+V   E YM LA+RIR RFP L+ IWLSTEM+EV+D+++ Y  WRF
Sbjct: 274 LSVHVRMGDKACEMRVAALEEYMHLADRIRDRFPELNRIWLSTEMKEVVDRSKDYAHWRF 333

Query: 503 YYTDVKRQVGNLTMATYEAQLGRITSTNYPLVNFLMATEADFFVGALGSTWCFLIDGMRN 562
           YYT+V RQVGN +MA YEA LGR  STNYPLVNFLMA+EADFFVGALGSTWCFLIDGMRN
Sbjct: 334 YYTEVARQVGNKSMAEYEASLGREMSTNYPLVNFLMASEADFFVGALGSTWCFLIDGMRN 393

Query: 563 TGGKVMAGYLSVNKDRFW 581
           TGGKVM+GYLSVNKDRFW
Sbjct: 394 TGGKVMSGYLSVNKDRFW 408
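
The middle line of each alignment triple encodes the per-column result: a letter marks an identical residue, `+` marks a substitution with a positive score, and a space marks a mismatch or gap. A minimal Python sketch that tallies these for one segment, using the last segment of the alignment above as input; `midline_counts` is a hypothetical helper, and it assumes the midline string has kept its full width (trailing spaces not stripped):

```python
def midline_counts(midline):
    """Count identities (letters) and positives (letters plus '+') in a BLAST midline."""
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# Last segment of the alignment above (residues only, coordinates dropped).
query = "TGGKVMAGYLSVNKDRFW"
midline = "TGGKVM+GYLSVNKDRFW"
sbjct = "TGGKVMSGYLSVNKDRFW"

# All three rows must span the same number of columns.
assert len(query) == len(midline) == len(sbjct)

print(midline_counts(midline))  # (17, 18, 18)
```

Summing these counts over every segment of an HSP should reproduce the Identity and Positives fractions in its summary line.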

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_022149877.1 | 0.0e+00 | 98.97 | uncharacterized protein LOC111018192 isoform X1 [Momordica charantia] >XP_022149...
XP_038889861.1 | 8.9e-310 | 87.86 | uncharacterized protein LOC120079658 [Benincasa hispida]
XP_011649010.1 | 5.7e-308 | 87.69 | uncharacterized protein LOC101206485 isoform X1 [Cucumis sativus] >XP_031737266....
XP_008441824.1 | 4.1e-306 | 86.67 | PREDICTED: uncharacterized protein LOC103485874 [Cucumis melo] >XP_008441825.1 P...
KAG6586157.1 | 2.3e-304 | 86.84 | hypothetical protein SDJN03_18890, partial [Cucurbita argyrosperma subsp. sorori...

Match Name | E-value | Identity | Description
A0A6J1D6Z2 | 0.0e+00 | 98.97 | uncharacterized protein LOC111018192 isoform X1 OS=Momordica charantia OX=3673 G...
A0A0A0LKK1 | 2.8e-308 | 87.69 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G075410 PE=4 SV=1
A0A1S3B531 | 2.0e-306 | 86.67 | uncharacterized protein LOC103485874 OS=Cucumis melo OX=3656 GN=LOC103485874 PE=...
A0A5D3CBF0 | 2.0e-306 | 86.67 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A6J1FBJ6 | 2.4e-304 | 86.67 | uncharacterized protein LOC111443937 OS=Cucurbita moschata OX=3662 GN=LOC1114439...

Match Name | E-value | Identity | Description
AT5G28910.2 | 9.4e-200 | 59.14 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G28960.1 | 7.7e-194 | 57.41 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G28910.1 | 6.4e-172 | 71.69 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
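
Percent identity and E-value can rank hits differently when the alignments differ in length. Among the TAIR 10 hits, AT5G28910.1 has the highest identity (71.69%) but aligns over only 378 columns, while AT5G28910.2 aligns over 580 columns and has the smaller E-value. A minimal Python sketch of that comparison, with values copied from this page; the tuple layout and the helper names `rank_by_evalue` and `identical_positions` are assumptions made for illustration:

```python
# (accession, E-value as printed, percent identity, alignment columns),
# copied from the TAIR 10 hits reported on this page.
hits = [
    ("AT5G28910.1", "6.4e-172", 71.69, 378),
    ("AT5G28960.1", "7.7e-194", 57.41, 580),
    ("AT5G28910.2", "9.4e-200", 59.14, 580),
]


def rank_by_evalue(rows):
    """Sort hits from most to least significant (smallest E-value first)."""
    return sorted(rows, key=lambda row: float(row[1]))


def identical_positions(row):
    """Recover the identical-residue count from percent identity and column count."""
    return round(row[2] / 100 * row[3])


ranked = rank_by_evalue(hits)
for row in ranked:
    print(row[0], row[1], row[2], identical_positions(row))
```

Sorting by E-value puts AT5G28910.2 first, and the recovered counts (343, 333 and 271 identical positions) match the Identity fractions in the corresponding HSP summaries.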
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (TR) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | GENE3D | 3.40.50.11350 | | coord: 401..569; e-value: 2.8E-7; score: 32.4
None | No IPR available | PANTHER | PTHR13132 | ALPHA-(1,6)-FUCOSYLTRANSFERASE | coord: 1..580
None | No IPR available | PANTHER | PTHR13132:SF32 | COATOMER PROTEIN | coord: 1..580

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
MS021968.1 | MS021968.1 | mRNA