MS015236 (gene) Bitter gourd (TR) v1

Overview
Name: MS015236
Type: gene
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Unknown protein
Location: scaffold2: 1966807 .. 1968143 (+)
RNA-Seq Expression: MS015236
Synteny: MS015236
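
The Location field gives a start and end coordinate on scaffold2. As a minimal sketch, assuming these are 1-based inclusive coordinates (the usual GFF3 convention, which this record does not state explicitly), the genomic span of the gene can be computed as follows. The helper name `span_length` is hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field (scaffold2, + strand).
gene_length = span_length(1966807, 1968143)
print(gene_length)  # 1337
```

Under that assumption the gene spans 1337 bp, intron included.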
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCAGCGACGGGTAGAAGTGAAAGAAGTGAACATTCAATAGAACAGCCAGAAATAATAGACAGAAGTGATAGTAAAAATTCTGAAGGTACTGTTAAAGAAGAGTTTCCACTTTCCCCAACCAAACAACTGGGGTATGGGTCTTCATCGTCTTTAAATTTGTTGGAATTGTCAGATTTCGTTCAAGTTGACGCGCATATCAATCAATCTGAAGCAGGCACGATTTGTTTGGGTAATCCCGATCATGTTATTTACCAGCTTCAGTTTAAAGATGATGAGAGCGATGCTTCCAGCATCAGTTCTGCCAAATCAGAGAAGGGTAGCGGTTTTCAAGGTACATTTGCATCCCAAGTTTCTGACATTACCCCCGAGTCCGTTAGAAATGTTTTGTCCCCAACTCAATCCCCTCCTCTTCAGACGATGGATCGTGCGGGAGAATATGATCCATTTAGAATTCCGTCTGCAGTTTTCCAAAGAAGTAGATCAATCACTCCATTGGAATGGAGTATTGCCTCAAATGAATCGTTGTTTAGCATTAATGTCGGAAACAATAGTCTCTCCAGAGATCATGTGTTTAGGTTGAGTGATTTGGGTAAGCCTGACGAGTTGACAAAATCGGGCGAGTTGTTTGTGTTTAACCCACCACCTACAGTTATCACATCAAGAGAAACTGAAAAGAAGAGTGCTGAATTTGAAAAGGATACTAAAATGGAAGATACCGCAGAATATACCGTTAATGATAAAGAGGGTGTGATTACAAGAAGTCCAAGTGAGAGAAAGGGGACGCCTCCTGCAATATCATGGAATTCCTCTAACGTATCTCGGCACTCAGATAGAAGTCAAAGCAGCTCGAATTCCTTTGCTTTCCCGATGTAAGTATTTGCTCATACTTTATTGATAAAAACATCATTAATTGGATTTCTACTTCCCAACTTTGTGCTTGCTTGAGTATGTTTTACTAAGGACTTAGGATTCCATATACCCTTTCATAATGCCTTCCTAGCATTCTGCCTGATTAATAGGCATTCATACGGTTTCTTTTTAGACTACTATATTGTTTTCTTTTTTGTATCTTTTTTTTCTTTCACATCTAAGAGCACGAAGATAGATCCCATTATCTTGGCAAATGCTAATTAGAGTATAAGGGAATATGCGCTTTATGTGTTGCAGAAAGAAGAAGTGTGCATGCTCATCGTGCTACTGTTCCAACTGTAGTCAGGCGTTCTGCTACAAGCTATGGCCAAGTTGCTACTGTACATGGCCATGCTGCTGCTGTTGGAACTGTAGCAGGCCGTTCTGCTACTGTTGGAACTGTAGCCAAAAAGGATTGTCTTGTG

mRNA sequence

ATGGCAGCGACGGGTAGAAGTGAAAGAAGTGAACATTCAATAGAACAGCCAGAAATAATAGACAGAAGTGATAGTAAAAATTCTGAAGGTACTGTTAAAGAAGAGTTTCCACTTTCCCCAACCAAACAACTGGGGTATGGGTCTTCATCGTCTTTAAATTTGTTGGAATTGTCAGATTTCGTTCAAGTTGACGCGCATATCAATCAATCTGAAGCAGGCACGATTTGTTTGGGTAATCCCGATCATGTTATTTACCAGCTTCAGTTTAAAGATGATGAGAGCGATGCTTCCAGCATCAGTTCTGCCAAATCAGAGAAGGGTAGCGGTTTTCAAGGTACATTTGCATCCCAAGTTTCTGACATTACCCCCGAGTCCGTTAGAAATGTTTTGTCCCCAACTCAATCCCCTCCTCTTCAGACGATGGATCGTGCGGGAGAATATGATCCATTTAGAATTCCGTCTGCAGTTTTCCAAAGAAGTAGATCAATCACTCCATTGGAATGGAGTATTGCCTCAAATGAATCGTTGTTTAGCATTAATGTCGGAAACAATAGTCTCTCCAGAGATCATGTGTTTAGGTTGAGTGATTTGGGTAAGCCTGACGAGTTGACAAAATCGGGCGAGTTGTTTGTGTTTAACCCACCACCTACAGTTATCACATCAAGAGAAACTGAAAAGAAGAGTGCTGAATTTGAAAAGGATACTAAAATGGAAGATACCGCAGAATATACCGTTAATGATAAAGAGGGTGTGATTACAAGAAGTCCAAGTGAGAGAAAGGGGACGCCTCCTGCAATATCATGGAATTCCTCTAACGTATCTCGGCACTCAGATAGAAGTCAAAGCAGCTCGAATTCCTTTGCTTTCCCGATGCGTTCTGCTACAAGCTATGGCCAAGTTGCTACTGTACATGGCCATGCTGCTGCTGTTGGAACTGTAGCAGGCCGTTCTGCTACTGTTGGAACTGTAGCCAAAAAGGATTGTCTTGTG

Coding sequence (CDS)

ATGGCAGCGACGGGTAGAAGTGAAAGAAGTGAACATTCAATAGAACAGCCAGAAATAATAGACAGAAGTGATAGTAAAAATTCTGAAGGTACTGTTAAAGAAGAGTTTCCACTTTCCCCAACCAAACAACTGGGGTATGGGTCTTCATCGTCTTTAAATTTGTTGGAATTGTCAGATTTCGTTCAAGTTGACGCGCATATCAATCAATCTGAAGCAGGCACGATTTGTTTGGGTAATCCCGATCATGTTATTTACCAGCTTCAGTTTAAAGATGATGAGAGCGATGCTTCCAGCATCAGTTCTGCCAAATCAGAGAAGGGTAGCGGTTTTCAAGGTACATTTGCATCCCAAGTTTCTGACATTACCCCCGAGTCCGTTAGAAATGTTTTGTCCCCAACTCAATCCCCTCCTCTTCAGACGATGGATCGTGCGGGAGAATATGATCCATTTAGAATTCCGTCTGCAGTTTTCCAAAGAAGTAGATCAATCACTCCATTGGAATGGAGTATTGCCTCAAATGAATCGTTGTTTAGCATTAATGTCGGAAACAATAGTCTCTCCAGAGATCATGTGTTTAGGTTGAGTGATTTGGGTAAGCCTGACGAGTTGACAAAATCGGGCGAGTTGTTTGTGTTTAACCCACCACCTACAGTTATCACATCAAGAGAAACTGAAAAGAAGAGTGCTGAATTTGAAAAGGATACTAAAATGGAAGATACCGCAGAATATACCGTTAATGATAAAGAGGGTGTGATTACAAGAAGTCCAAGTGAGAGAAAGGGGACGCCTCCTGCAATATCATGGAATTCCTCTAACGTATCTCGGCACTCAGATAGAAGTCAAAGCAGCTCGAATTCCTTTGCTTTCCCGATGCGTTCTGCTACAAGCTATGGCCAAGTTGCTACTGTACATGGCCATGCTGCTGCTGTTGGAACTGTAGCAGGCCGTTCTGCTACTGTTGGAACTGTAGCCAAAAAGGATTGTCTTGTG

Protein sequence

MAATGRSERSEHSIEQPEIIDRSDSKNSEGTVKEEFPLSPTKQLGYGSSSSLNLLELSDFVQVDAHINQSEAGTICLGNPDHVIYQLQFKDDESDASSISSAKSEKGSGFQGTFASQVSDITPESVRNVLSPTQSPPLQTMDRAGEYDPFRIPSAVFQRSRSITPLEWSIASNESLFSINVGNNSLSRDHVFRLSDLGKPDELTKSGELFVFNPPPTVITSRETEKKSAEFEKDTKMEDTAEYTVNDKEGVITRSPSERKGTPPAISWNSSNVSRHSDRSQSSSNSFAFPMRSATSYGQVATVHGHAAAVGTVAGRSATVGTVAKKDCLV
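
The protein sequence above corresponds to a codon-by-codon translation of the CDS. The sketch below illustrates this on the first 21 and last 12 nucleotides of the CDS, assuming the standard genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are illustrative and not part of the database record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence into one-letter amino acids."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGGCAGCGACGGGTAGAAGT"  # first 21 nt of the CDS
cds_end = "GATTGTCTTGTG"             # last 12 nt of the CDS
print(translate(cds_start))  # MAATGRS
print(translate(cds_end))    # DCLV
```

Both fragments match the start (MAATGRS) and end (DCLV) of the protein sequence listed above. The listed CDS ends on a valine codon (GTG), not a stop codon.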
Homology
BLAST of MS015236 vs. NCBI nr
Match: XP_022149012.1 (uncharacterized protein LOC111017534 isoform X2 [Momordica charantia])

HSP 1 Score: 553.9 bits (1426), Expect = 9.3e-154
Identity = 289/292 (98.97%), Positives = 291/292 (99.66%), Query Frame = 0

Query: 1   MAATGRSERSEHSIEQPEIIDRSDSKNSEGTVKEEFPLSPTKQLGYGSSSSLNLLELSDF 60
           MAATGRSERSEHSIEQPEIIDRSDSKNSEGTVK EFPLSPTKQLGYGSSSSLNLLELSDF
Sbjct: 1   MAATGRSERSEHSIEQPEIIDRSDSKNSEGTVK-EFPLSPTKQLGYGSSSSLNLLELSDF 60

Query: 61  VQVDAHINQSEAGTICLGNPDHVIYQLQFKDDESDASSISSAKSEKGSGFQGTFASQVSD 120
           VQVDAHINQSEAGTICLGNPDHVIYQLQFKDDESDASSISSAKSEKGSGFQGTFASQVSD
Sbjct: 61  VQVDAHINQSEAGTICLGNPDHVIYQLQFKDDESDASSISSAKSEKGSGFQGTFASQVSD 120

Query: 121 ITPESVRNVLSPTQSPPLQTMDRAGEYDPFRIPSAVFQRSRSITPLEWSIASNESLFSIN 180
           ITPESVRNVLSPTQSPPLQTMDRAGEYDPFRIPSAVFQRSRSITPLEWSIASNESLFSIN
Sbjct: 121 ITPESVRNVLSPTQSPPLQTMDRAGEYDPFRIPSAVFQRSRSITPLEWSIASNESLFSIN 180

Query: 181 VGNNSLSRDHVFRLSDLGKPDELTKSGELFVFNPPPTVITSRETEKKSAEFEKDTKMEDT 240
           VGNNSLSRDHVFRLSDLGKPDELTKSGELFVFNPPPTVITSRETEKKSAEFEKDTKMEDT
Sbjct: 181 VGNNSLSRDHVFRLSDLGKPDELTKSGELFVFNPPPTVITSRETEKKSAEFEKDTKMEDT 240

Query: 241 AEYTVNDKEGVITRSPSERKGTPPAISWNSSNVSRHSDRSQSSSNSFAFPMR 293
           AEYTVNDKEGVITRSPSERKGTPPAISWNSSNVSRHSDRSQSSSNSFAFP++
Sbjct: 241 AEYTVNDKEGVITRSPSERKGTPPAISWNSSNVSRHSDRSQSSSNSFAFPIK 291
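
The percentages in each HSP header are the match counts divided by the alignment length. As a small sketch (the helper `percent` is a hypothetical name), the figures for this first hit can be reproduced from the counts 289/292 and 291/292:

```python
def percent(matches: int, aligned_length: int) -> float:
    """Share of alignment columns that match, as a percentage with 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts copied from the HSP header of the XP_022149012.1 hit.
identity = percent(289, 292)
positives = percent(291, 292)
print(identity, positives)  # 98.97 99.66
```

The denominator is the number of alignment columns, gaps included, so it can differ from the length of either sequence.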

BLAST of MS015236 vs. NCBI nr
Match: XP_022149010.1 (uncharacterized protein LOC111017534 isoform X1 [Momordica charantia] >XP_022149011.1 uncharacterized protein LOC111017534 isoform X1 [Momordica charantia])

HSP 1 Score: 552.4 bits (1422), Expect = 2.7e-153
Identity = 294/307 (95.77%), Positives = 296/307 (96.42%), Query Frame = 0

Query: 1   MAATGRSERSEHSIEQPEIIDRSDSKNSEGTVKEEFPLSPTKQLGYGSSSSLNLLELSDF 60
           MAATGRSERSEHSIEQPEIIDRSDSKNSEGTVK EFPLSPTKQLGYGSSSSLNLLELSDF
Sbjct: 1   MAATGRSERSEHSIEQPEIIDRSDSKNSEGTVK-EFPLSPTKQLGYGSSSSLNLLELSDF 60

Query: 61  VQVDAHINQSEAGTICLGNPDHVIYQLQFKDDESDASSISSAKSEKGSGFQGTFASQVSD 120
           VQVDAHINQSEAGTICLGNPDHVIYQLQFKDDESDASSISSAKSEKGSGFQGTFASQVSD
Sbjct: 61  VQVDAHINQSEAGTICLGNPDHVIYQLQFKDDESDASSISSAKSEKGSGFQGTFASQVSD 120

Query: 121 ITPESVRNVLSPTQSPPLQTMDRAGEYDPFRIPSAVFQRSRSITPLEWSIASNESLFSIN 180
           ITPESVRNVLSPTQSPPLQTMDRAGEYDPFRIPSAVFQRSRSITPLEWSIASNESLFSIN
Sbjct: 121 ITPESVRNVLSPTQSPPLQTMDRAGEYDPFRIPSAVFQRSRSITPLEWSIASNESLFSIN 180

Query: 181 VGNNSLSRDHVFRLSDLGKPDELTKSGELFVFNPPPTVITSRETEKKSAEFEKDTKMEDT 240
           VGNNSLSRDHVFRLSDLGKPDELTKSGELFVFNPPPTVITSRETEKKSAEFEKDTKMEDT
Sbjct: 181 VGNNSLSRDHVFRLSDLGKPDELTKSGELFVFNPPPTVITSRETEKKSAEFEKDTKMEDT 240

Query: 241 AEYTVNDKEGVITRSPSERKGTPPAISWNSSNVSRHSDRSQSSSNSFAFPMRSATSYGQV 300
           AEYTVNDKEGVITRSPSERKGTPPAISWNSSNVSRHSDRSQSSSNSFAFP+  A    Q 
Sbjct: 241 AEYTVNDKEGVITRSPSERKGTPPAISWNSSNVSRHSDRSQSSSNSFAFPI-LADEEAQA 300

Query: 301 ATVHGHA 308
            +V G A
Sbjct: 301 GSVPGDA 305

BLAST of MS015236 vs. NCBI nr
Match: XP_038892415.1 (uncharacterized protein LOC120081528 isoform X2 [Benincasa hispida])

HSP 1 Score: 369.8 bits (948), Expect = 2.5e-98
Identity = 213/329 (64.74%), Positives = 241/329 (73.25%), Query Frame = 0

Query: 1   MAATGRSERSEHSIEQPEIIDRSDSKNSEGTVKEEFPLSPTKQLGYGSSSSLNLLELSDF 60
           MAATGR ERS H  E P II+   SK+SEGT  E    SPT+QLGYGSSSS+NLLE+SD 
Sbjct: 1   MAATGRRERSSHLTEHPVIIELGSSKHSEGTGGEGVSFSPTEQLGYGSSSSINLLEISDL 60

Query: 61  VQVDAHINQSEAGTICLGNPDHVIYQLQFKDDESDASSISSAKSEKGSGFQG-TFASQVS 120
            QV +++++ EA TIC GNP   +YQLQFKDD+SD SS+ S+KSEK SG QG T ASQVS
Sbjct: 61  AQVGSNVDRPEADTICFGNPGRGMYQLQFKDDDSDVSSVCSSKSEKSSGSQGPTSASQVS 120

Query: 121 DITPESVRNVLSPTQSPPLQTMDRAG---EYDPFRIPSAVFQRSRSITPLEWSIASNESL 180
           D T ES  N +SPTQSPPLQ MDR G    YDPFRIPSAVFQRSRSITPLEWSIASNESL
Sbjct: 121 DFTLESGGNFMSPTQSPPLQMMDRVGGYESYDPFRIPSAVFQRSRSITPLEWSIASNESL 180

Query: 181 FSINVGNNSLSRDHVFRLSDLGKPDELTKSGE------LFVFNPPPTVITSRETEKKSAE 240
           FSI+VGNNS SRDH    S+L K DELTKSGE      LFVF+PPP VITSRETE KSAE
Sbjct: 181 FSIHVGNNSFSRDHALMSSELSKSDELTKSGELMKSDDLFVFSPPPAVITSRETEVKSAE 240

Query: 241 FEKDTKMEDTAEYTVNDKEGVITRSPSERKGTPPAISWNSSNVSRHSDRSQSSSNSFAFP 300
            E++ KM DT EY + DKEG+I    S+R   PPA+SWNSS+ SRHSDRSQSSS+SFAFP
Sbjct: 241 CEEEPKMADTIEYNIEDKEGLIAEDLSDRNLPPPAVSWNSSSKSRHSDRSQSSSDSFAFP 300

Query: 301 MRSATSYGQVATVHGHAAAVGTVAGRSAT 320
           +  A    Q  TV  HA   GT     +T
Sbjct: 301 I-LADKEAQGGTVPVHAKFQGTPVSSKST 328

BLAST of MS015236 vs. NCBI nr
Match: XP_038892414.1 (uncharacterized protein LOC120081528 isoform X1 [Benincasa hispida])

HSP 1 Score: 369.4 bits (947), Expect = 3.3e-98
Identity = 204/302 (67.55%), Positives = 232/302 (76.82%), Query Frame = 0

Query: 1   MAATGRSERSEHSIEQPEIIDRSDSKNSEGTVKEEFPLSPTKQLGYGSSSSLNLLELSDF 60
           MAATGR ERS H  E P II+   SK+SEGT  E    SPT+QLGYGSSSS+NLLE+SD 
Sbjct: 1   MAATGRRERSSHLTEHPVIIELGSSKHSEGTGGEGVSFSPTEQLGYGSSSSINLLEISDL 60

Query: 61  VQVDAHINQSEAGTICLGNPDHVIYQLQFKDDESDASSISSAKSEKGSGFQG-TFASQVS 120
            QV +++++ EA TIC GNP   +YQLQFKDD+SD SS+ S+KSEK SG QG T ASQVS
Sbjct: 61  AQVGSNVDRPEADTICFGNPGRGMYQLQFKDDDSDVSSVCSSKSEKSSGSQGPTSASQVS 120

Query: 121 DITPESVRNVLSPTQSPPLQTMDRAG---EYDPFRIPSAVFQRSRSITPLEWSIASNESL 180
           D T ES  N +SPTQSPPLQ MDR G    YDPFRIPSAVFQRSRSITPLEWSIASNESL
Sbjct: 121 DFTLESGGNFMSPTQSPPLQMMDRVGGYESYDPFRIPSAVFQRSRSITPLEWSIASNESL 180

Query: 181 FSINVGNNSLSRDHVFRLSDLGKPDELTKSGE------LFVFNPPPTVITSRETEKKSAE 240
           FSI+VGNNS SRDH    S+L K DELTKSGE      LFVF+PPP VITSRETE KSAE
Sbjct: 181 FSIHVGNNSFSRDHALMSSELSKSDELTKSGELMKSDDLFVFSPPPAVITSRETEVKSAE 240

Query: 241 FEKDTKMEDTAEYTVNDKEGVITRSPSERKGTPPAISWNSSNVSRHSDRSQSSSNSFAFP 293
            E++ KM DT EY + DKEG+I    S+R   PPA+SWNSS+ SRHSDRSQSSS+SFAFP
Sbjct: 241 CEEEPKMADTIEYNIEDKEGLIAEDLSDRNLPPPAVSWNSSSKSRHSDRSQSSSDSFAFP 300

BLAST of MS015236 vs. NCBI nr
Match: XP_038892416.1 (uncharacterized protein LOC120081528 isoform X3 [Benincasa hispida])

HSP 1 Score: 369.4 bits (947), Expect = 3.3e-98
Identity = 204/302 (67.55%), Positives = 232/302 (76.82%), Query Frame = 0

Query: 1   MAATGRSERSEHSIEQPEIIDRSDSKNSEGTVKEEFPLSPTKQLGYGSSSSLNLLELSDF 60
           MAATGR ERS H  E P II+   SK+SEGT  E    SPT+QLGYGSSSS+NLLE+SD 
Sbjct: 1   MAATGRRERSSHLTEHPVIIELGSSKHSEGTGGEGVSFSPTEQLGYGSSSSINLLEISDL 60

Query: 61  VQVDAHINQSEAGTICLGNPDHVIYQLQFKDDESDASSISSAKSEKGSGFQG-TFASQVS 120
            QV +++++ EA TIC GNP   +YQLQFKDD+SD SS+ S+KSEK SG QG T ASQVS
Sbjct: 61  AQVGSNVDRPEADTICFGNPGRGMYQLQFKDDDSDVSSVCSSKSEKSSGSQGPTSASQVS 120

Query: 121 DITPESVRNVLSPTQSPPLQTMDRAG---EYDPFRIPSAVFQRSRSITPLEWSIASNESL 180
           D T ES  N +SPTQSPPLQ MDR G    YDPFRIPSAVFQRSRSITPLEWSIASNESL
Sbjct: 121 DFTLESGGNFMSPTQSPPLQMMDRVGGYESYDPFRIPSAVFQRSRSITPLEWSIASNESL 180

Query: 181 FSINVGNNSLSRDHVFRLSDLGKPDELTKSGE------LFVFNPPPTVITSRETEKKSAE 240
           FSI+VGNNS SRDH    S+L K DELTKSGE      LFVF+PPP VITSRETE KSAE
Sbjct: 181 FSIHVGNNSFSRDHALMSSELSKSDELTKSGELMKSDDLFVFSPPPAVITSRETEVKSAE 240

Query: 241 FEKDTKMEDTAEYTVNDKEGVITRSPSERKGTPPAISWNSSNVSRHSDRSQSSSNSFAFP 293
            E++ KM DT EY + DKEG+I    S+R   PPA+SWNSS+ SRHSDRSQSSS+SFAFP
Sbjct: 241 CEEEPKMADTIEYNIEDKEGLIAEDLSDRNLPPPAVSWNSSSKSRHSDRSQSSSDSFAFP 300

BLAST of MS015236 vs. ExPASy TrEMBL
Match: A0A6J1D6N0 (uncharacterized protein LOC111017534 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111017534 PE=4 SV=1)

HSP 1 Score: 553.9 bits (1426), Expect = 4.5e-154
Identity = 289/292 (98.97%), Positives = 291/292 (99.66%), Query Frame = 0

Query: 1   MAATGRSERSEHSIEQPEIIDRSDSKNSEGTVKEEFPLSPTKQLGYGSSSSLNLLELSDF 60
           MAATGRSERSEHSIEQPEIIDRSDSKNSEGTVK EFPLSPTKQLGYGSSSSLNLLELSDF
Sbjct: 1   MAATGRSERSEHSIEQPEIIDRSDSKNSEGTVK-EFPLSPTKQLGYGSSSSLNLLELSDF 60

Query: 61  VQVDAHINQSEAGTICLGNPDHVIYQLQFKDDESDASSISSAKSEKGSGFQGTFASQVSD 120
           VQVDAHINQSEAGTICLGNPDHVIYQLQFKDDESDASSISSAKSEKGSGFQGTFASQVSD
Sbjct: 61  VQVDAHINQSEAGTICLGNPDHVIYQLQFKDDESDASSISSAKSEKGSGFQGTFASQVSD 120

Query: 121 ITPESVRNVLSPTQSPPLQTMDRAGEYDPFRIPSAVFQRSRSITPLEWSIASNESLFSIN 180
           ITPESVRNVLSPTQSPPLQTMDRAGEYDPFRIPSAVFQRSRSITPLEWSIASNESLFSIN
Sbjct: 121 ITPESVRNVLSPTQSPPLQTMDRAGEYDPFRIPSAVFQRSRSITPLEWSIASNESLFSIN 180

Query: 181 VGNNSLSRDHVFRLSDLGKPDELTKSGELFVFNPPPTVITSRETEKKSAEFEKDTKMEDT 240
           VGNNSLSRDHVFRLSDLGKPDELTKSGELFVFNPPPTVITSRETEKKSAEFEKDTKMEDT
Sbjct: 181 VGNNSLSRDHVFRLSDLGKPDELTKSGELFVFNPPPTVITSRETEKKSAEFEKDTKMEDT 240

Query: 241 AEYTVNDKEGVITRSPSERKGTPPAISWNSSNVSRHSDRSQSSSNSFAFPMR 293
           AEYTVNDKEGVITRSPSERKGTPPAISWNSSNVSRHSDRSQSSSNSFAFP++
Sbjct: 241 AEYTVNDKEGVITRSPSERKGTPPAISWNSSNVSRHSDRSQSSSNSFAFPIK 291

BLAST of MS015236 vs. ExPASy TrEMBL
Match: A0A6J1D736 (uncharacterized protein LOC111017534 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111017534 PE=4 SV=1)

HSP 1 Score: 552.4 bits (1422), Expect = 1.3e-153
Identity = 294/307 (95.77%), Positives = 296/307 (96.42%), Query Frame = 0

Query: 1   MAATGRSERSEHSIEQPEIIDRSDSKNSEGTVKEEFPLSPTKQLGYGSSSSLNLLELSDF 60
           MAATGRSERSEHSIEQPEIIDRSDSKNSEGTVK EFPLSPTKQLGYGSSSSLNLLELSDF
Sbjct: 1   MAATGRSERSEHSIEQPEIIDRSDSKNSEGTVK-EFPLSPTKQLGYGSSSSLNLLELSDF 60

Query: 61  VQVDAHINQSEAGTICLGNPDHVIYQLQFKDDESDASSISSAKSEKGSGFQGTFASQVSD 120
           VQVDAHINQSEAGTICLGNPDHVIYQLQFKDDESDASSISSAKSEKGSGFQGTFASQVSD
Sbjct: 61  VQVDAHINQSEAGTICLGNPDHVIYQLQFKDDESDASSISSAKSEKGSGFQGTFASQVSD 120

Query: 121 ITPESVRNVLSPTQSPPLQTMDRAGEYDPFRIPSAVFQRSRSITPLEWSIASNESLFSIN 180
           ITPESVRNVLSPTQSPPLQTMDRAGEYDPFRIPSAVFQRSRSITPLEWSIASNESLFSIN
Sbjct: 121 ITPESVRNVLSPTQSPPLQTMDRAGEYDPFRIPSAVFQRSRSITPLEWSIASNESLFSIN 180

Query: 181 VGNNSLSRDHVFRLSDLGKPDELTKSGELFVFNPPPTVITSRETEKKSAEFEKDTKMEDT 240
           VGNNSLSRDHVFRLSDLGKPDELTKSGELFVFNPPPTVITSRETEKKSAEFEKDTKMEDT
Sbjct: 181 VGNNSLSRDHVFRLSDLGKPDELTKSGELFVFNPPPTVITSRETEKKSAEFEKDTKMEDT 240

Query: 241 AEYTVNDKEGVITRSPSERKGTPPAISWNSSNVSRHSDRSQSSSNSFAFPMRSATSYGQV 300
           AEYTVNDKEGVITRSPSERKGTPPAISWNSSNVSRHSDRSQSSSNSFAFP+  A    Q 
Sbjct: 241 AEYTVNDKEGVITRSPSERKGTPPAISWNSSNVSRHSDRSQSSSNSFAFPI-LADEEAQA 300

Query: 301 ATVHGHA 308
            +V G A
Sbjct: 301 GSVPGDA 305

BLAST of MS015236 vs. ExPASy TrEMBL
Match: A0A5A7STD6 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold475G00130 PE=4 SV=1)

HSP 1 Score: 350.1 bits (897), Expect = 9.9e-93
Identity = 217/378 (57.41%), Positives = 249/378 (65.87%), Query Frame = 0

Query: 1   MAATGRSERSEHSIEQPEIIDRSDSKNSEGTVKEEFPLSPTKQLGYGSSSSLNLLELSDF 60
           MAA G  ERS H+  QP+II    S NS GT  +   L PTKQLG+GS+SS++LLE+SD 
Sbjct: 1   MAARGSRERSSHTTGQPDIIGLG-SINSGGTGGKGVSLPPTKQLGFGSASSIDLLEISDL 60

Query: 61  VQVDAHINQSEAGTICLGNPDHVIYQLQFKDDESDASSISSAKSEKGSGFQ-GTFASQVS 120
            QV A++N+ EA TICL NP   IYQLQFKDD+S+ SS  S+KSEK SG    T  SQVS
Sbjct: 61  AQVGANVNRPEADTICLANPGRGIYQLQFKDDDSNVSSNCSSKSEKSSGSPCPTSVSQVS 120

Query: 121 DITPESVRNVLSPTQSPPLQTMDRAG----EYDPFRIPSAVFQRSRSITPLEWSIASNES 180
            +T ES  NVLSPTQSP LQTMDR G     YDPFRIPSAVFQRS S+TP+EWSIASNES
Sbjct: 121 GLTHESGGNVLSPTQSPSLQTMDRMGGYDESYDPFRIPSAVFQRSSSVTPMEWSIASNES 180

Query: 181 LFSINVGNNSLSRDHVFRLSDLGKPDELTKSG------ELFVFNPPPTVITSRETEKKSA 240
           LFSI VGNNS SRDHV  LS+ GK  ELTKSG      E FVF+ PP VITSRE E KSA
Sbjct: 181 LFSIQVGNNSFSRDHVSMLSEFGKSGELTKSGKFKKADESFVFSQPPAVITSREAEMKSA 240

Query: 241 EFEKDTKMEDTAEYTVNDKEGVITRSP-SERKGTPPAISWNSSNVSRHSDRSQSSSNSFA 300
           E+E+  KM DT EY + DK G IT    S+R   PPA+SWNSS  SRHSD+SQSSS+SFA
Sbjct: 241 EYEEGPKMADTIEYNIKDKGGSITDDDLSDRNLPPPAVSWNSSTKSRHSDKSQSSSDSFA 300

Query: 301 FPM------------------------------------RSATSYGQVATVHGHAAAVGT 331
           FP+                                    RSAT+ G VAT+HG     GT
Sbjct: 301 FPITLNLLKPPLINKSSYGLLLERRSAHAHRVTVLTVVRRSATARGHVATLHGQTVVAGT 360

BLAST of MS015236 vs. ExPASy TrEMBL
Match: A0A6J1IL15 (uncharacterized protein LOC111478406 OS=Cucurbita maxima OX=3661 GN=LOC111478406 PE=4 SV=1)

HSP 1 Score: 341.3 bits (874), Expect = 4.6e-90
Identity = 194/295 (65.76%), Positives = 223/295 (75.59%), Query Frame = 0

Query: 1   MAATGRSERSEHSIEQPEIIDRSDSKNSEGTVKEEFPLSPTKQLGYGSSSSLNLLELSDF 60
           MAATGR E S HS +QP+II    S +SE T  E F LSPT QL YGSSS +NL+ELSD 
Sbjct: 1   MAATGRCETSSHSTKQPDIIGLG-STHSEDTAGEGFSLSPTGQLAYGSSSCINLMELSDL 60

Query: 61  VQVDAHINQSEAGTICLGNPDHVIYQLQFKDDESDASSISSAKSEKGSGFQG-TFASQVS 120
           VQVD ++N+ E GTICLGNPDH +YQLQFKDDESD SS+ SAKSEK SG QG T ASQVS
Sbjct: 61  VQVDENVNRPEGGTICLGNPDHDVYQLQFKDDESDVSSLCSAKSEKSSGSQGPTSASQVS 120

Query: 121 DITPESVRNVLSPTQSPPLQTMDRAG---EYDPFRIPSAVFQRSRSITPLEWSIASNESL 180
           DIT ES  NV+SPTQSPPLQTMDR G    YDP RIPSAVFQRSRS TPLEWSIASNESL
Sbjct: 121 DITHESGGNVMSPTQSPPLQTMDRVGGYEPYDPLRIPSAVFQRSRSTTPLEWSIASNESL 180

Query: 181 FSINVGNNSLSRDHVFRLSDLGKPDELTKSGELFVFNPPPTVITSRETEKKSAEFEKDTK 240
           FSI+ G++S S D++  LSDLGK D      EL VFN  P VITSRETE KS E+E++ K
Sbjct: 181 FSIHGGDDSFSGDNLSMLSDLGKSD------ELLVFNSAPAVITSRETEMKSVEYEEEPK 240

Query: 241 MEDTAEYTVNDKEGVITRSPSERKGTPPAISWNSSNVSRHSDRSQSSSNSFAFPM 292
           + DT EY + D+EG++    S R   PPA+S +SS+ SRHS+RSQ SSNSFAFP+
Sbjct: 241 VADTTEYNIEDREGLVAEDLSGRNVLPPALSSSSSSRSRHSERSQCSSNSFAFPI 288

BLAST of MS015236 vs. ExPASy TrEMBL
Match: A0A1S4DW31 (uncharacterized protein LOC103489558 isoform X5 OS=Cucumis melo OX=3656 GN=LOC103489558 PE=4 SV=1)

HSP 1 Score: 328.6 bits (841), Expect = 3.1e-86
Identity = 193/304 (63.49%), Positives = 220/304 (72.37%), Query Frame = 0

Query: 1   MAATGRSERSEHSIEQPEIIDRSDSKNSEGTVKEEFPLSPTKQLGYGSSSSLNLLELSDF 60
           MAA G  ERS H+  QP+II    S NS GT  +   L PTKQLG+GS+SS++LLE+SD 
Sbjct: 1   MAARGSRERSSHTTGQPDIIGLG-SINSGGTGGKGVSLPPTKQLGFGSASSIDLLEISDL 60

Query: 61  VQVDAHINQSEAGTICLGNPDHVIYQLQFKDDESDASSISSAKSEKGSGFQ-GTFASQVS 120
            QV A++N+ EA TICL NP   IYQLQFKDD+S+ SS  S+KSEK SG    T  SQVS
Sbjct: 61  AQVGANVNRPEADTICLANPGRGIYQLQFKDDDSNVSSNCSSKSEKSSGSPCPTSVSQVS 120

Query: 121 DITPESVRNVLSPTQSPPLQTMDRAG----EYDPFRIPSAVFQRSRSITPLEWSIASNES 180
            +T ES  NVLSPTQSP LQTMDR G     YDPFRIPSAVFQRS S+TP+EWSIASNES
Sbjct: 121 GLTHESGGNVLSPTQSPSLQTMDRMGGYDESYDPFRIPSAVFQRSSSVTPMEWSIASNES 180

Query: 181 LFSINVGNNSLSRDHVFRLSDLGKPDELTKSG------ELFVFNPPPTVITSRETEKKSA 240
           LFSI VGNNS SRDHV  LS+ GK  ELTKSG      E FVF+ PP VITSRE E KSA
Sbjct: 181 LFSIQVGNNSFSRDHVSMLSEFGKSGELTKSGKFKKADESFVFSQPPAVITSREAEMKSA 240

Query: 241 EFEKDTKMEDTAEYTVNDKEGVITRSP-SERKGTPPAISWNSSNVSRHSDRSQSSSNSFA 293
           E+E+  KM DT EY + DK G IT    S+R   PPA+SWNSS  SRHSD+SQSSS+SFA
Sbjct: 241 EYEEGPKMADTIEYNIKDKGGSITDDDLSDRNLPPPAVSWNSSTKSRHSDKSQSSSDSFA 300

BLAST of MS015236 vs. TAIR 10
Match: AT2G03630.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G74220.1); Has 126 Blast hits to 126 proteins in 28 species: Archae - 0; Bacteria - 6; Metazoa - 7; Fungi - 5; Plants - 87; Viruses - 0; Other Eukaryotes - 21 (source: NCBI BLink). )

HSP 1 Score: 113.2 bits (282), Expect = 3.9e-25
Identity = 90/235 (38.30%), Positives = 129/235 (54.89%), Query Frame = 0

Query: 71  EAGTICLGNPDHVIYQLQFKDDESDASSISSAKSEKGSGFQGTFASQVSDITPESVRNVL 130
           E  TI L   + VI   +     S +SS SS+ S        ++ S   D+  E ++  L
Sbjct: 33  EHKTIPLQETETVIINSESHSRLSSSSSSSSSSS--------SYLSPPKDLPEEVLKESL 92

Query: 131 SPTQ--SPPLQTMDR--AGEYDPFRIPSAVFQRSRSITPLEWSIASNESLFSINVGNNSL 190
           +  +  SPP+Q MDR   G+YDP RIPS+VF+RS+S  P EWS  SNESLFSI++GNNS 
Sbjct: 93  NDPEISSPPVQVMDRDNNGKYDPNRIPSSVFERSKSNVPAEWSCTSNESLFSIHLGNNSF 152

Query: 191 SRDHVFRLSDLGKPDELTKSGELFVFNP----PPTVITSRE---TEKKSAEFEKDTK--- 250
           +        DL K  EL KSGEL  ++P    PP   +  +    E K  E +K+ K   
Sbjct: 153 TGYG----GDLMKSGELYKSGELLAYSPGLPMPPVPGSEPKPVVEEPKVVESDKEEKVVL 212

Query: 251 MEDTAEYTVNDKEGVITRSPSERKGTPPAISWNSSNVSRHSDRSQSSSNSFAFPM 292
           +E+T   + +++EG   +  S  K   PA+SW +   S  S+RS +S++SF+FPM
Sbjct: 213 VEETHTSSSDEEEG---KRESHEKEQHPAVSWKTPTTSYRSNRSSNSAHSFSFPM 252

BLAST of MS015236 vs. TAIR 10
Match: AT2G03630.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: male gametophyte, pollen tube; EXPRESSED DURING: L mature pollen stage, M germinated pollen stage; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G74220.1); Has 87 Blast hits to 87 proteins in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 85; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 89.4 bits (220), Expect = 6.0e-18
Identity = 70/177 (39.55%), Positives = 95/177 (53.67%), Query Frame = 0

Query: 71  EAGTICLGNPDHVIYQLQFKDDESDASSISSAKSEKGSGFQGTFASQVSDITPESVRNVL 130
           E  TI L   + VI   +     S +SS SS+ S        ++ S   D+  E ++  L
Sbjct: 33  EHKTIPLQETETVIINSESHSRLSSSSSSSSSSS--------SYLSPPKDLPEEVLKESL 92

Query: 131 SPTQ--SPPLQTMDR--AGEYDPFRIPSAVFQRSRSITPLEWSIASNESLFSINVGNNSL 190
           +  +  SPP+Q MDR   G+YDP RIPS+VF+RS+S  P EWS  SNESLFSI++GNNS 
Sbjct: 93  NDPEISSPPVQVMDRDNNGKYDPNRIPSSVFERSKSNVPAEWSCTSNESLFSIHLGNNSF 152

Query: 191 SRDHVFRLSDLGKPDELTKSGELFVFNP----PPTVITSRE---TEKKSAEFEKDTK 237
           +        DL K  EL KSGEL  ++P    PP   +  +    E K  E +K+ K
Sbjct: 153 TGYG----GDLMKSGELYKSGELLAYSPGLPMPPVPGSEPKPVVEEPKVVESDKEEK 197

BLAST of MS015236 vs. TAIR 10
Match: AT1G69280.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G03630.1); Has 885 Blast hits to 474 proteins in 40 species: Archae - 0; Bacteria - 4; Metazoa - 731; Fungi - 9; Plants - 68; Viruses - 0; Other Eukaryotes - 73 (source: NCBI BLink). )

HSP 1 Score: 64.3 bits (155), Expect = 2.1e-10
Identity = 85/308 (27.60%), Positives = 138/308 (44.81%), Query Frame = 0

Query: 5   GRSERSEHSIEQPEIIDRSDSKNSEGTVKEEFPLSPTK-QLGYGSSSSLNLLELSDFVQV 64
           G++ ++ H +++    D + S     T+KE     P   QLG  +S  ++ + L +  ++
Sbjct: 8   GKNRKAYH-LKKHRKSDNNKSFEPRPTIKEVNDEKPVLFQLGSIASYGVSDVRLEEDPEI 67

Query: 65  DAHINQSEAGTICLGNPDHVIYQLQFKDDESDASSIS-------SAKSEKGSGFQGTFAS 124
               + S + +       H I     K + S  SS+        S+K  +  G   T  S
Sbjct: 68  TTRRSTSLSLSPTSSEDIHEINAKGRKSNLSFDSSLHNKELFSLSSKLFELQGSSDTPGS 127

Query: 125 QVSDITPESVRNVLSPTQSPPLQTMDRAGEYDPFRIPSAVFQRSRSITPLEWSIA--SNE 184
           QVS +T  SV  +L    SP +Q MDR G   P R          S+  LE +++  SN+
Sbjct: 128 QVSGVTNASVEPLL---LSPSIQMMDREGSDQPER---------NSLPTLEKNLSTLSND 187

Query: 185 SLFSINVGNNSLSRDHVFRLSDLGKPDELTKSGELFVFNPPPTVITSRETEKKSAEFEKD 244
           SLFS+++G+N+++RD +F   D  K  E+TKSGEL  F P      +      S++  K 
Sbjct: 188 SLFSLSIGDNTIARDELFSYRDF-KSGEITKSGELLSFCP------AIHGPADSSDLGKS 247

Query: 245 TKMEDTAEYTVNDKEGVITRSPSERKGTPPAISWNSSNVSRHSDRSQSSSNSFAFPMRSA 303
             MED A                E K +   +SW +     +SD + SS+ SF++P+   
Sbjct: 248 FDMEDKAS------------GECEDKSSNSNVSWRNIGDCNNSDETPSSTQSFSYPITKK 283

BLAST of MS015236 vs. TAIR 10
Match: AT1G74220.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: male gametophyte, flower, pollen tube; EXPRESSED DURING: L mature pollen stage, M germinated pollen stage, 4 anthesis; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G03630.1); Has 383 Blast hits to 347 proteins in 86 species: Archae - 0; Bacteria - 17; Metazoa - 87; Fungi - 65; Plants - 76; Viruses - 2; Other Eukaryotes - 136 (source: NCBI BLink). )

HSP 1 Score: 63.9 bits (154), Expect = 2.7e-10
Identity = 46/110 (41.82%), Positives = 56/110 (50.91%), Query Frame = 0

Query: 123 PESVRNVLSPTQSPPLQTMDRA-----------GEYDPFRIPSAVFQRSRSITPLEWSIA 182
           P+   N  +  QSPP Q M+R+               P+RIPS VF R+ S  P EWS  
Sbjct: 50  PDQNHNHNNVEQSPPTQVMERSTNNTTTTTTSTPNTPPYRIPSHVFARTTSTAP-EWSTI 109

Query: 183 SNESLFSINVGNNSLSRDHVFRLSDLGKPDELTKSGELFVFNPPPTVITS 222
           SNESLFSI++GNNS +    F            KSGEL  F  PP+ ITS
Sbjct: 110 SNESLFSIHMGNNSFTGGDYF------------KSGEL-TFPQPPSPITS 145

BLAST of MS015236 vs. TAIR 10
Match: AT3G02125.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: pollen tube; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G39200.1); Has 2247 Blast hits to 1434 proteins in 202 species: Archae - 4; Bacteria - 111; Metazoa - 942; Fungi - 239; Plants - 140; Viruses - 37; Other Eukaryotes - 774 (source: NCBI BLink). )

HSP 1 Score: 53.5 bits (127), Expect = 3.7e-07
Identity = 69/226 (30.53%), Positives = 106/226 (46.90%), Query Frame = 0

Query: 98  SISSAKSEKGSGFQGTFASQVSDITPESVRNVLSPTQSPPLQTMDRAGEYDPFRIPSAVF 157
           S+S+++++K       + S VS  T  S       + +     +D  G YDP RIPS+VF
Sbjct: 47  SLSASEAQKLRESHQAYNSSVSSYTSSSW------SSNHQNHLIDLPG-YDPTRIPSSVF 106

Query: 158 QRSRSITPLEWSIASNESLFSINVGNNSLSRDHVFRLSDL-------------------- 217
             S+     EWS+ASNESLFSI+ GN S+S     RL+++                    
Sbjct: 107 S-SKPGNSTEWSLASNESLFSIHDGNFSIST--ALRLAEIPRFEETVHVITEINSVPLPP 166

Query: 218 --GKPDELTK----SGELFVFNPPPTVITSRETEKKSAEFEKDTKMED------TAEYTV 277
              KP+E  K      E +      + I   E E+K +E E D + E+       AE  V
Sbjct: 167 PVKKPNEYEKETIAEKEPYQVENSNSDIEDNEEEEKMSEVESDDEHEEEQTDMIEAEALV 226

Query: 278 NDKEGVITRSPSERKGTPPAISWNSSNVSRHSDRSQSSSNSFAFPM 292
            +KE + T   ++ + +   +S +S ++S  SD S +S  SFAFP+
Sbjct: 227 -EKEVIETVKENKPEDSNSIVS-HSPSISCRSDTSNNSIGSFAFPL 260

The following BLAST results are available for this feature:

BLAST of MS015236 vs. NCBI nr
Match Name        E-value     Identity (%)   Description
XP_022149012.1    9.3e-154    98.97          uncharacterized protein LOC111017534 isoform X2 [Momordica charantia]
XP_022149010.1    2.7e-153    95.77          uncharacterized protein LOC111017534 isoform X1 [Momordica charantia] >XP_022149...
XP_038892415.1    2.5e-98     64.74          uncharacterized protein LOC120081528 isoform X2 [Benincasa hispida]
XP_038892414.1    3.3e-98     67.55          uncharacterized protein LOC120081528 isoform X1 [Benincasa hispida]
XP_038892416.1    3.3e-98     67.55          uncharacterized protein LOC120081528 isoform X3 [Benincasa hispida]

BLAST of MS015236 vs. ExPASy TrEMBL
Match Name        E-value     Identity (%)   Description
A0A6J1D6N0        4.5e-154    98.97          uncharacterized protein LOC111017534 isoform X2 OS=Momordica charantia OX=3673 G...
A0A6J1D736        1.3e-153    95.77          uncharacterized protein LOC111017534 isoform X1 OS=Momordica charantia OX=3673 G...
A0A5A7STD6        9.9e-93     57.41          Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A6J1IL15        4.6e-90     65.76          uncharacterized protein LOC111478406 OS=Cucurbita maxima OX=3661 GN=LOC111478406...
A0A1S4DW31        3.1e-86     63.49          uncharacterized protein LOC103489558 isoform X5 OS=Cucumis melo OX=3656 GN=LOC10...

BLAST of MS015236 vs. TAIR 10
Match Name        E-value     Identity (%)   Description
AT2G03630.1       3.9e-25     38.30          unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G03630.2       6.0e-18     39.55          unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G69280.1       2.1e-10     27.60          unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G74220.1       2.7e-10     41.82          unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G02125.1       3.7e-07     30.53          unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (TR) v1
Date Performed: 2021-10-25
IPR Term   IPR Description    Source        Source Term      Source Description              Alignment
None       No IPR available   MOBIDB_LITE   mobidb-lite      disorder_prediction             coord: 214..286
None       No IPR available   MOBIDB_LITE   mobidb-lite      disorder_prediction             coord: 1..45
None       No IPR available   MOBIDB_LITE   mobidb-lite      disorder_prediction             coord: 8..31
None       No IPR available   MOBIDB_LITE   mobidb-lite      disorder_prediction             coord: 260..286
None       No IPR available   MOBIDB_LITE   mobidb-lite      disorder_prediction             coord: 223..245
None       No IPR available   PANTHER       PTHR33673:SF27   SUBFAMILY NOT NAMED             coord: 7..293
None       No IPR available   PANTHER       PTHR33673        SUPPRESSOR SRP40-LIKE PROTEIN   coord: 7..293
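
Several of the mobidb-lite disorder predictions overlap one another. The sketch below, assuming the `coord` values are 1-based inclusive residue positions, merges the five reported ranges into non-overlapping regions and counts the residues they cover. The function name `merge_intervals` is illustrative, and treating directly adjacent ranges as one region is a design choice made here, not something the record specifies.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive (start, end) ranges."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous region: extend it.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(region) for region in merged]


# mobidb-lite coordinates copied from the InterPro table.
disorder = [(214, 286), (1, 45), (8, 31), (260, 286), (223, 245)]
regions = merge_intervals(disorder)
covered = sum(end - start + 1 for start, end in regions)
print(regions)  # [(1, 45), (214, 286)]
print(covered)  # 118
```

Under these assumptions the predictions collapse to two disordered regions, residues 1..45 and 214..286, covering 118 residues in total.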

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name   Type
MS015236.1     MS015236.1    mRNA