MS001255 (gene) Bitter gourd (TR) v1

Overview
Name: MS001255
Type: gene
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: SAGA-Tad1 domain-containing protein
Location: scaffold36: 2687404 .. 2688663 (-)
RNA-Seq Expression: MS001255
Synteny: MS001255
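The location above is a start and end coordinate on scaffold36. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome annotations, but not stated on this page), the span of the feature can be computed as follows. The function name `feature_length` is hypothetical.

```python
def feature_length(start: int, end: int) -> int:
    """Return the length of a feature from its coordinates.

    Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
    """
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field of this record.
length_bp = feature_length(2687404, 2688663)
print(length_bp)       # 1260
print(length_bp // 3)  # 420
```

Under that assumption the feature spans 1260 bp, which is 420 codons and matches the 420-residue query length reported in the BLAST alignments of this record.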
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCAACCTCAGCAGAGCTTGAGAATTGACTTGGGTGAATTGAAATCTCAGATAGTGAAGAAGCTTGGAACTGATCGGTCAAAACGGTATTTCTTTTACTTGAACAGGTTCTTGAGTCAGAAGCTGAGTAAGAATGAGTTTGATAAGTTATGTCGTCGGGTGCTCGGGAGAGACAATCTTTGGCTGCATAATCAGTTGATACAGTCAATTTTGAAGAATGCGTGCCAAGCTAAGGCTGCGCCGCCGTTACCTGTAGCAGGCTATCCGAAAACTTCAACACAATCTGCAAAAGTTTCCCCTGTTATAGAAGATGGGAATGGGGACACTGGAGCTGTTTATCCTACTTCTACTCAAAGTATTCCCATTTGGTCTAATGGAGGATTTCCAGCGTCCCCAAGAAAGAGCAGGTCTGGGATACGCGACCGCAAACTCAAGGACAGGCCGAGTCCACTGGGGCCAAATGGGAAGGTCGAATGTATCTCACATCAATCAGCAGGCAAGAAAGATGGAAGCTGTAAAATGATGATGGTTAATGGTGATGCAACTCTATGTGACTATCAGAGACCAGTGCAACATCTGCAAGGAGTCGCTGAACTGCCTGAAAACAATATCGAGGCTAGAATTCGACCAGCAGGAAAGCAAGTCCTAAACAATAAGATCCATGATGAAGGAACTAAGGTTGGAGACCGGGAAGAAGCGGGACACTCAATCCACTCGGGTTTGCTTCGAAGTCGCTTACTGGCACCCCTTGGGATTCCTTTCTGCTCAGCTAGTATCGGCGGGGCCCGCAAAGCGAGGCCTGCGGATTTTGGGGGTGATTTCGTTAGCTTTAGCGATATTGGTCATTTATCGGATACAGAGTCATTGAGACGACGCATGGAACAAATTGCTGCAGTACATGGCCTAGGCAGTGTTTCTGCAGACTCTGCTAATATTTTGAATAAGGTGTTGGATGTATATTTGAAGCAGTTAATAAGGTCTTGTGTTGGCTTGGTTGGAACATGGCCTATGCCATGTGAGCCCGAGAAGCCTCTTACCGATAAGCTACAGGTTCAGGGGAAGGTGATCAATGGTATGTTGCCGAATAATCAATTACACGGGCGACATAGCAATGGAAGCAGAGAAGTTATGCACGAGCACAGATTACGGTGCTCGGTATCGTTGCTTGATTTCAAAGTAGCAATGGAGCTTAACCCAAAGCAACTTGGGGAAGACTGGCCTTTGCTGCTGGAGAAAATTCGTATGCGTGCATTCGAGGAA

mRNA sequence

ATGCAACCTCAGCAGAGCTTGAGAATTGACTTGGGTGAATTGAAATCTCAGATAGTGAAGAAGCTTGGAACTGATCGGTCAAAACGGTATTTCTTTTACTTGAACAGGTTCTTGAGTCAGAAGCTGAGTAAGAATGAGTTTGATAAGTTATGTCGTCGGGTGCTCGGGAGAGACAATCTTTGGCTGCATAATCAGTTGATACAGTCAATTTTGAAGAATGCGTGCCAAGCTAAGGCTGCGCCGCCGTTACCTGTAGCAGGCTATCCGAAAACTTCAACACAATCTGCAAAAGTTTCCCCTGTTATAGAAGATGGGAATGGGGACACTGGAGCTGTTTATCCTACTTCTACTCAAAGTATTCCCATTTGGTCTAATGGAGGATTTCCAGCGTCCCCAAGAAAGAGCAGGTCTGGGATACGCGACCGCAAACTCAAGGACAGGCCGAGTCCACTGGGGCCAAATGGGAAGGTCGAATGTATCTCACATCAATCAGCAGGCAAGAAAGATGGAAGCTGTAAAATGATGATGGTTAATGGTGATGCAACTCTATGTGACTATCAGAGACCAGTGCAACATCTGCAAGGAGTCGCTGAACTGCCTGAAAACAATATCGAGGCTAGAATTCGACCAGCAGGAAAGCAAGTCCTAAACAATAAGATCCATGATGAAGGAACTAAGGTTGGAGACCGGGAAGAAGCGGGACACTCAATCCACTCGGGTTTGCTTCGAAGTCGCTTACTGGCACCCCTTGGGATTCCTTTCTGCTCAGCTAGTATCGGCGGGGCCCGCAAAGCGAGGCCTGCGGATTTTGGGGGTGATTTCGTTAGCTTTAGCGATATTGGTCATTTATCGGATACAGAGTCATTGAGACGACGCATGGAACAAATTGCTGCAGTACATGGCCTAGGCAGTGTTTCTGCAGACTCTGCTAATATTTTGAATAAGGTGTTGGATGTATATTTGAAGCAGTTAATAAGGTCTTGTGTTGGCTTGGTTGGAACATGGCCTATGCCATGTGAGCCCGAGAAGCCTCTTACCGATAAGCTACAGGTTCAGGGGAAGGTGATCAATGGTATGTTGCCGAATAATCAATTACACGGGCGACATAGCAATGGAAGCAGAGAAGTTATGCACGAGCACAGATTACGGTGCTCGGTATCGTTGCTTGATTTCAAAGTAGCAATGGAGCTTAACCCAAAGCAACTTGGGGAAGACTGGCCTTTGCTGCTGGAGAAAATTCGTATGCGTGCATTCGAGGAA

Coding sequence (CDS)

ATGCAACCTCAGCAGAGCTTGAGAATTGACTTGGGTGAATTGAAATCTCAGATAGTGAAGAAGCTTGGAACTGATCGGTCAAAACGGTATTTCTTTTACTTGAACAGGTTCTTGAGTCAGAAGCTGAGTAAGAATGAGTTTGATAAGTTATGTCGTCGGGTGCTCGGGAGAGACAATCTTTGGCTGCATAATCAGTTGATACAGTCAATTTTGAAGAATGCGTGCCAAGCTAAGGCTGCGCCGCCGTTACCTGTAGCAGGCTATCCGAAAACTTCAACACAATCTGCAAAAGTTTCCCCTGTTATAGAAGATGGGAATGGGGACACTGGAGCTGTTTATCCTACTTCTACTCAAAGTATTCCCATTTGGTCTAATGGAGGATTTCCAGCGTCCCCAAGAAAGAGCAGGTCTGGGATACGCGACCGCAAACTCAAGGACAGGCCGAGTCCACTGGGGCCAAATGGGAAGGTCGAATGTATCTCACATCAATCAGCAGGCAAGAAAGATGGAAGCTGTAAAATGATGATGGTTAATGGTGATGCAACTCTATGTGACTATCAGAGACCAGTGCAACATCTGCAAGGAGTCGCTGAACTGCCTGAAAACAATATCGAGGCTAGAATTCGACCAGCAGGAAAGCAAGTCCTAAACAATAAGATCCATGATGAAGGAACTAAGGTTGGAGACCGGGAAGAAGCGGGACACTCAATCCACTCGGGTTTGCTTCGAAGTCGCTTACTGGCACCCCTTGGGATTCCTTTCTGCTCAGCTAGTATCGGCGGGGCCCGCAAAGCGAGGCCTGCGGATTTTGGGGGTGATTTCGTTAGCTTTAGCGATATTGGTCATTTATCGGATACAGAGTCATTGAGACGACGCATGGAACAAATTGCTGCAGTACATGGCCTAGGCAGTGTTTCTGCAGACTCTGCTAATATTTTGAATAAGGTGTTGGATGTATATTTGAAGCAGTTAATAAGGTCTTGTGTTGGCTTGGTTGGAACATGGCCTATGCCATGTGAGCCCGAGAAGCCTCTTACCGATAAGCTACAGGTTCAGGGGAAGGTGATCAATGGTATGTTGCCGAATAATCAATTACACGGGCGACATAGCAATGGAAGCAGAGAAGTTATGCACGAGCACAGATTACGGTGCTCGGTATCGTTGCTTGATTTCAAAGTAGCAATGGAGCTTAACCCAAAGCAACTTGGGGAAGACTGGCCTTTGCTGCTGGAGAAAATTCGTATGCGTGCATTCGAGGAA

Protein sequence

MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSPVIEDGNGDTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELPENNIEARIRPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAVHGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFEE
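
The protein sequence corresponds to the coding sequence read codon by codon. The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are hypothetical, and only the first 30 nucleotides of the CDS above are used as input.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))


# First 30 nucleotides of the CDS listed above.
cds_prefix = "ATGCAACCTCAGCAGAGCTTGAGAATTGAC"
print(translate(cds_prefix))  # MQPQQSLRID
```

The result agrees with the first ten residues of the protein sequence shown above.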
Homology
BLAST of MS001255 vs. NCBI nr
Match: XP_022132327.1 (uncharacterized protein LOC111005206 [Momordica charantia])

HSP 1 Score: 839.0 bits (2166), Expect = 1.8e-239
Identity = 417/420 (99.29%), Positives = 418/420 (99.52%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNL 60
           MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNL
Sbjct: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNL 60

Query: 61  WLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSPVIEDGNGDTGAVYPTSTQSI 120
           WLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSPVIEDGN DTGAVYPTSTQSI
Sbjct: 61  WLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSPVIEDGNEDTGAVYPTSTQSI 120

Query: 121 PIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGD 180
           PIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGD
Sbjct: 121 PIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGD 180

Query: 181 ATLCDYQRPVQHLQGVAELPENNIEARIRPAGKQVLNNKIHDEGTKVGDREEAGHSIHSG 240
           ATLCDYQRPVQHLQGVAELPENNIEARIRPAGKQVLNNKIHDEGTKVGDREEAGHSIHSG
Sbjct: 181 ATLCDYQRPVQHLQGVAELPENNIEARIRPAGKQVLNNKIHDEGTKVGDREEAGHSIHSG 240

Query: 241 LLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAVH 300
           LL+SRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAVH
Sbjct: 241 LLQSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAVH 300

Query: 301 GLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGML 360
           GLGSVSADSANILNKVLDVYLKQLIRSCVGLVGT PMPCEPEKPLTDKLQVQGKVINGML
Sbjct: 301 GLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTCPMPCEPEKPLTDKLQVQGKVINGML 360

Query: 361 PNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFEE 420
           PNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFEE
Sbjct: 361 PNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFEE 420
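
The Identity and Positives percentages in each HSP header are the stated counts divided by the alignment length. As a small sketch, the figures for the hit above can be reproduced like this; the helper name `percent` is hypothetical, and rounding to two decimals is assumed from the way the values are printed.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Percentage of aligned positions that match, rounded to two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the HSP header of the XP_022132327.1 hit.
print(percent(417, 420))  # 99.29
print(percent(418, 420))  # 99.52
```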

BLAST of MS001255 vs. NCBI nr
Match: XP_023546134.1 (uncharacterized protein LOC111805335 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 674.5 bits (1739), Expect = 6.0e-190
Identity = 349/421 (82.90%), Positives = 369/421 (87.65%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNL 60
           MQPQQSLRIDL ELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLC RVLGR+NL
Sbjct: 1   MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60

Query: 61  WLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSPVIEDGNGDTGAVYPTSTQSI 120
           WLHNQLIQSILKNACQAKAAPP+P AGYPKTSTQ+AK+SPVIEDGN D GAV+PTSTQ I
Sbjct: 61  WLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISPVIEDGNEDGGAVFPTSTQGI 120

Query: 121 PIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGD 180
           PIWSN GFP SPRK RSGIRDRKLKDRPS L PN KVECIS QSA K+DGSC++M+ NG+
Sbjct: 121 PIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMLDNGN 180

Query: 181 ATLCDYQRPVQHLQGVAELPENNIEARI-RPAGKQVLNNKIHDEGTKVGDREEAGHSIHS 240
           AT CDYQRPVQHLQGV ELPENNIEAR+ RP+GKQVL  ++  EGTKV DREEA  S  S
Sbjct: 181 ATSCDYQRPVQHLQGVYELPENNIEARVQRPSGKQVLQMQV--EGTKVEDREEARQSNRS 240

Query: 241 GLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAV 300
            LLRSRLLAPLGIPFCSASIGGA K RP D GG+F SFSD+GHL DTESLRRRMEQIAAV
Sbjct: 241 SLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNF-SFSDMGHLLDTESLRRRMEQIAAV 300

Query: 301 HGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGM 360
            GLGSVSAD ANILNKVLDVYLKQLIRSCV LVG WP   EPEKPL    Q+QGKVINGM
Sbjct: 301 QGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWP-AFEPEKPLAHNQQIQGKVINGM 360

Query: 361 LPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFE 420
           LPNNQLH  HSNG+ EV+HE RL CS+SLLDFKVAMELNPKQLGEDWPLLLEKI MRAF 
Sbjct: 361 LPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLGEDWPLLLEKISMRAFT 417

BLAST of MS001255 vs. NCBI nr
Match: XP_022997521.1 (uncharacterized protein LOC111492414 [Cucurbita maxima])

HSP 1 Score: 672.2 bits (1733), Expect = 3.0e-189
Identity = 349/421 (82.90%), Positives = 369/421 (87.65%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNL 60
           MQPQQSLRIDL ELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLC RVLGR+NL
Sbjct: 1   MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60

Query: 61  WLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSPVIEDGNGDTGAVYPTSTQSI 120
           WLHNQLIQSILKNACQAKAAPP+P AGYPKTSTQ+AK+SPVIEDGN D GAV+ TSTQ I
Sbjct: 61  WLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISPVIEDGNEDGGAVFATSTQGI 120

Query: 121 PIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGD 180
           PIWSN GF  SPRK RSGIRDRKLKDRPS L PN KVECIS QSA K+DGSC++MM NG+
Sbjct: 121 PIWSNEGFSMSPRKCRSGIRDRKLKDRPSLLAPNLKVECISAQSACKEDGSCRIMMDNGN 180

Query: 181 ATLCDYQRPVQHLQGVAELPENNIEARI-RPAGKQVLNNKIHDEGTKVGDREEAGHSIHS 240
           AT CDYQRPVQHLQGV ELPENNIEAR+ RP+GKQVL  ++  EGTKV DREEA  S  S
Sbjct: 181 ATSCDYQRPVQHLQGVFELPENNIEARVQRPSGKQVLQMQV--EGTKVEDREEARQSNRS 240

Query: 241 GLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAV 300
            LLRSRLLAPLGIPFCSASIGGA K RP D GG+F SFSD+GHL DTESLRRRMEQIAAV
Sbjct: 241 SLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNF-SFSDMGHLLDTESLRRRMEQIAAV 300

Query: 301 HGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGM 360
            GLGSVSAD ANILNKVLDVYLKQLIRSCV LVG WP+  EPEKPL    Q+QGKVINGM
Sbjct: 301 QGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGPWPV-FEPEKPLAHNQQIQGKVINGM 360

Query: 361 LPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFE 420
           LPNNQLH  HSNG+REV+HE RL CS+SLLDFKVAMELNPKQLGEDWPLLLEKI MRAF 
Sbjct: 361 LPNNQLHRLHSNGNREVVHERRLHCSISLLDFKVAMELNPKQLGEDWPLLLEKISMRAFT 417

BLAST of MS001255 vs. NCBI nr
Match: XP_008445087.1 (PREDICTED: uncharacterized protein LOC103488231 [Cucumis melo] >KAA0064996.1 SAGA-Tad1 domain-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 670.6 bits (1729), Expect = 8.7e-189
Identity = 347/421 (82.42%), Positives = 365/421 (86.70%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNL 60
           MQPQQSLRIDLGELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK C RVLGR+NL
Sbjct: 1   MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENL 60

Query: 61  WLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSPVIEDGNGDTGAVYPTSTQSI 120
           WLHNQLIQSILKNACQAKAAPP+PVAGYPKTSTQSAK+SP++EDGN D GAV+PTSTQ+I
Sbjct: 61  WLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPLVEDGNEDGGAVFPTSTQNI 120

Query: 121 PIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGD 180
           P WSNG    SPRK RSGIRDRKLKDRPS LGPNGKVECISH SA          M NGD
Sbjct: 121 PGWSNG---VSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSAN---------MDNGD 180

Query: 181 ATLCDYQRPVQHLQGVAELPENNIEARI-RPAGKQVLNNKIHDEGTKVGDREEAGHSIHS 240
           ATLCDY+RPVQHLQGVAELPENNIE R+ +P+GKQVL+NKI  E TKV DREEAG S HS
Sbjct: 181 ATLCDYKRPVQHLQGVAELPENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHS 240

Query: 241 GLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAV 300
            LLRSRLLAPLGIPFCSAS GG  K RP D GGDF SF D+GHL DTESLRRRMEQIAAV
Sbjct: 241 SLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDF-SFGDVGHLLDTESLRRRMEQIAAV 300

Query: 301 HGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGM 360
            GLGSVSAD ANILNKVLDVYLKQLIRSCV LVG WP   EPEKPL  K Q+QGKVINGM
Sbjct: 301 QGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWP-AYEPEKPLAHKQQIQGKVINGM 360

Query: 361 LPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFE 420
           LPNNQLHGRHSNG+ EV+HEHRL+CS+SLLDFKVAMELNP QLGEDWPLLLEKI MRAF 
Sbjct: 361 LPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKICMRAFG 407

BLAST of MS001255 vs. NCBI nr
Match: KAG7029751.1 (hypothetical protein SDJN02_08093, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 666.4 bits (1718), Expect = 1.6e-187
Identity = 347/421 (82.42%), Positives = 364/421 (86.46%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNL 60
           MQPQQSLRIDL ELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLC RVLGR+NL
Sbjct: 1   MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60

Query: 61  WLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSPVIEDGNGDTGAVYPTSTQSI 120
           WLHNQLIQSILKNACQAKAAPP+P AGYPKTSTQ+AK+SPVIEDGN D GAV+PTSTQ I
Sbjct: 61  WLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISPVIEDGNEDGGAVFPTSTQGI 120

Query: 121 PIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGD 180
           PIWSN GFP SPRK RSGIRDRKLKDRPS L PN KVECIS QSA K+DGSC++MM NG+
Sbjct: 121 PIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGN 180

Query: 181 ATLCDYQRPVQHLQGVAELPENNIEARI-RPAGKQVLNNKIHDEGTKVGDREEAGHSIHS 240
           AT CDYQRPVQHLQGV ELPENNIEAR+ RPAGKQVL         +V DREEA  S  S
Sbjct: 181 ATSCDYQRPVQHLQGVFELPENNIEARVQRPAGKQVLQ-------MQVEDREEARQSNRS 240

Query: 241 GLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAV 300
            LLRSRLLAPLGIPFCSASIGGA K RP D GG+F SFSD+GHL DTESLRRRMEQIAAV
Sbjct: 241 SLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNF-SFSDMGHLLDTESLRRRMEQIAAV 300

Query: 301 HGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGM 360
            GLGSVSAD ANILNKVLDVYLKQLIRSCV LVG WP   EPEKPL    Q+QGKVINGM
Sbjct: 301 QGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWP-AFEPEKPLAHNQQIQGKVINGM 360

Query: 361 LPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFE 420
           LPNNQLH  HSNG+ EV+HE RL CS+SLLDFKVAMELNPKQLGEDWPLLLEKI MRAF 
Sbjct: 361 LPNNQLHRLHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLGEDWPLLLEKISMRAFT 412

BLAST of MS001255 vs. ExPASy TrEMBL
Match: A0A6J1BTJ5 (uncharacterized protein LOC111005206 OS=Momordica charantia OX=3673 GN=LOC111005206 PE=4 SV=1)

HSP 1 Score: 839.0 bits (2166), Expect = 8.9e-240
Identity = 417/420 (99.29%), Positives = 418/420 (99.52%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNL 60
           MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNL
Sbjct: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNL 60

Query: 61  WLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSPVIEDGNGDTGAVYPTSTQSI 120
           WLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSPVIEDGN DTGAVYPTSTQSI
Sbjct: 61  WLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSPVIEDGNEDTGAVYPTSTQSI 120

Query: 121 PIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGD 180
           PIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGD
Sbjct: 121 PIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGD 180

Query: 181 ATLCDYQRPVQHLQGVAELPENNIEARIRPAGKQVLNNKIHDEGTKVGDREEAGHSIHSG 240
           ATLCDYQRPVQHLQGVAELPENNIEARIRPAGKQVLNNKIHDEGTKVGDREEAGHSIHSG
Sbjct: 181 ATLCDYQRPVQHLQGVAELPENNIEARIRPAGKQVLNNKIHDEGTKVGDREEAGHSIHSG 240

Query: 241 LLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAVH 300
           LL+SRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAVH
Sbjct: 241 LLQSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAVH 300

Query: 301 GLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGML 360
           GLGSVSADSANILNKVLDVYLKQLIRSCVGLVGT PMPCEPEKPLTDKLQVQGKVINGML
Sbjct: 301 GLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTCPMPCEPEKPLTDKLQVQGKVINGML 360

Query: 361 PNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFEE 420
           PNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFEE
Sbjct: 361 PNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFEE 420

BLAST of MS001255 vs. ExPASy TrEMBL
Match: A0A6J1K7Q1 (uncharacterized protein LOC111492414 OS=Cucurbita maxima OX=3661 GN=LOC111492414 PE=4 SV=1)

HSP 1 Score: 672.2 bits (1733), Expect = 1.4e-189
Identity = 349/421 (82.90%), Positives = 369/421 (87.65%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNL 60
           MQPQQSLRIDL ELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLC RVLGR+NL
Sbjct: 1   MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60

Query: 61  WLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSPVIEDGNGDTGAVYPTSTQSI 120
           WLHNQLIQSILKNACQAKAAPP+P AGYPKTSTQ+AK+SPVIEDGN D GAV+ TSTQ I
Sbjct: 61  WLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISPVIEDGNEDGGAVFATSTQGI 120

Query: 121 PIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGD 180
           PIWSN GF  SPRK RSGIRDRKLKDRPS L PN KVECIS QSA K+DGSC++MM NG+
Sbjct: 121 PIWSNEGFSMSPRKCRSGIRDRKLKDRPSLLAPNLKVECISAQSACKEDGSCRIMMDNGN 180

Query: 181 ATLCDYQRPVQHLQGVAELPENNIEARI-RPAGKQVLNNKIHDEGTKVGDREEAGHSIHS 240
           AT CDYQRPVQHLQGV ELPENNIEAR+ RP+GKQVL  ++  EGTKV DREEA  S  S
Sbjct: 181 ATSCDYQRPVQHLQGVFELPENNIEARVQRPSGKQVLQMQV--EGTKVEDREEARQSNRS 240

Query: 241 GLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAV 300
            LLRSRLLAPLGIPFCSASIGGA K RP D GG+F SFSD+GHL DTESLRRRMEQIAAV
Sbjct: 241 SLLRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNF-SFSDMGHLLDTESLRRRMEQIAAV 300

Query: 301 HGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGM 360
            GLGSVSAD ANILNKVLDVYLKQLIRSCV LVG WP+  EPEKPL    Q+QGKVINGM
Sbjct: 301 QGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGPWPV-FEPEKPLAHNQQIQGKVINGM 360

Query: 361 LPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFE 420
           LPNNQLH  HSNG+REV+HE RL CS+SLLDFKVAMELNPKQLGEDWPLLLEKI MRAF 
Sbjct: 361 LPNNQLHRLHSNGNREVVHERRLHCSISLLDFKVAMELNPKQLGEDWPLLLEKISMRAFT 417

BLAST of MS001255 vs. ExPASy TrEMBL
Match: A0A5A7VF96 (SAGA-Tad1 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G003200 PE=4 SV=1)

HSP 1 Score: 670.6 bits (1729), Expect = 4.2e-189
Identity = 347/421 (82.42%), Positives = 365/421 (86.70%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNL 60
           MQPQQSLRIDLGELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK C RVLGR+NL
Sbjct: 1   MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENL 60

Query: 61  WLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSPVIEDGNGDTGAVYPTSTQSI 120
           WLHNQLIQSILKNACQAKAAPP+PVAGYPKTSTQSAK+SP++EDGN D GAV+PTSTQ+I
Sbjct: 61  WLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPLVEDGNEDGGAVFPTSTQNI 120

Query: 121 PIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGD 180
           P WSNG    SPRK RSGIRDRKLKDRPS LGPNGKVECISH SA          M NGD
Sbjct: 121 PGWSNG---VSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSAN---------MDNGD 180

Query: 181 ATLCDYQRPVQHLQGVAELPENNIEARI-RPAGKQVLNNKIHDEGTKVGDREEAGHSIHS 240
           ATLCDY+RPVQHLQGVAELPENNIE R+ +P+GKQVL+NKI  E TKV DREEAG S HS
Sbjct: 181 ATLCDYKRPVQHLQGVAELPENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHS 240

Query: 241 GLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAV 300
            LLRSRLLAPLGIPFCSAS GG  K RP D GGDF SF D+GHL DTESLRRRMEQIAAV
Sbjct: 241 SLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDF-SFGDVGHLLDTESLRRRMEQIAAV 300

Query: 301 HGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGM 360
            GLGSVSAD ANILNKVLDVYLKQLIRSCV LVG WP   EPEKPL  K Q+QGKVINGM
Sbjct: 301 QGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWP-AYEPEKPLAHKQQIQGKVINGM 360

Query: 361 LPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFE 420
           LPNNQLHGRHSNG+ EV+HEHRL+CS+SLLDFKVAMELNP QLGEDWPLLLEKI MRAF 
Sbjct: 361 LPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKICMRAFG 407

BLAST of MS001255 vs. ExPASy TrEMBL
Match: A0A1S3BCQ5 (uncharacterized protein LOC103488231 OS=Cucumis melo OX=3656 GN=LOC103488231 PE=4 SV=1)

HSP 1 Score: 670.6 bits (1729), Expect = 4.2e-189
Identity = 347/421 (82.42%), Positives = 365/421 (86.70%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNL 60
           MQPQQSLRIDLGELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK C RVLGR+NL
Sbjct: 1   MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENL 60

Query: 61  WLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSPVIEDGNGDTGAVYPTSTQSI 120
           WLHNQLIQSILKNACQAKAAPP+PVAGYPKTSTQSAK+SP++EDGN D GAV+PTSTQ+I
Sbjct: 61  WLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPLVEDGNEDGGAVFPTSTQNI 120

Query: 121 PIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGD 180
           P WSNG    SPRK RSGIRDRKLKDRPS LGPNGKVECISH SA          M NGD
Sbjct: 121 PGWSNG---VSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSAN---------MDNGD 180

Query: 181 ATLCDYQRPVQHLQGVAELPENNIEARI-RPAGKQVLNNKIHDEGTKVGDREEAGHSIHS 240
           ATLCDY+RPVQHLQGVAELPENNIE R+ +P+GKQVL+NKI  E TKV DREEAG S HS
Sbjct: 181 ATLCDYKRPVQHLQGVAELPENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHS 240

Query: 241 GLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAV 300
            LLRSRLLAPLGIPFCSAS GG  K RP D GGDF SF D+GHL DTESLRRRMEQIAAV
Sbjct: 241 SLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDF-SFGDVGHLLDTESLRRRMEQIAAV 300

Query: 301 HGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGM 360
            GLGSVSAD ANILNKVLDVYLKQLIRSCV LVG WP   EPEKPL  K Q+QGKVINGM
Sbjct: 301 QGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWP-AYEPEKPLAHKQQIQGKVINGM 360

Query: 361 LPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFE 420
           LPNNQLHGRHSNG+ EV+HEHRL+CS+SLLDFKVAMELNP QLGEDWPLLLEKI MRAF 
Sbjct: 361 LPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKICMRAFG 407

BLAST of MS001255 vs. ExPASy TrEMBL
Match: A0A0A0LM32 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G378510 PE=4 SV=1)

HSP 1 Score: 666.0 bits (1717), Expect = 1.0e-187
Identity = 346/421 (82.19%), Positives = 364/421 (86.46%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNL 60
           MQPQQSLRIDLGELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK C RVLGR+NL
Sbjct: 1   MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENL 60

Query: 61  WLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSPVIEDGNGDTGAVYPTSTQSI 120
           WLHNQLIQSILKNACQAK APP+PVAGYPKTSTQSAK+SP++EDGN D GAV+PTSTQ+I
Sbjct: 61  WLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISPLVEDGNEDGGAVFPTSTQNI 120

Query: 121 PIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGD 180
           P WSNG    SPRK RSGIRDRKLKDRPS LGPNGKVECISH SA          M NGD
Sbjct: 121 PGWSNG---VSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSAN---------MDNGD 180

Query: 181 ATLCDYQRPVQHLQGVAELPENNIEARI-RPAGKQVLNNKIHDEGTKVGDREEAGHSIHS 240
           ATLCDY+RPVQ+LQG+AELPENNIE R+ +P+GKQ L NKI  E TKV DREEAG S HS
Sbjct: 181 ATLCDYKRPVQNLQGIAELPENNIEVRVPQPSGKQDLQNKIQVEATKVEDREEAGQSNHS 240

Query: 241 GLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAV 300
            LLRSRLLAPLGIPFCSASIGGARK RP D GGDF S SD+GHL DTESLRRRMEQIAAV
Sbjct: 241 SLLRSRLLAPLGIPFCSASIGGARKTRPVDCGGDF-SLSDVGHLLDTESLRRRMEQIAAV 300

Query: 301 HGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGM 360
            GLGSVSAD ANILNKVLDVYLKQLIRSCV LVG WP   EPEKPL+ K Q QGKVINGM
Sbjct: 301 QGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWP-AYEPEKPLSHKQQFQGKVINGM 360

Query: 361 LPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFE 420
           LPNNQLHGRHSNGS EV+HEHRL+CS+SLLDFKVAMELNP QLGEDWPLLLEKI MR F 
Sbjct: 361 LPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKICMRTFG 407

BLAST of MS001255 vs. TAIR 10
Match: AT2G24530.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G31440.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 346.7 bits (888), Expect = 2.7e-95
Identity = 199/420 (47.38%), Positives = 255/420 (60.71%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNL 60
           MQ  Q  RI L ELK  IVKK G +RS+RYF+YL RFLSQKL+K+EFDK C R+LGR+NL
Sbjct: 1   MQRSQDQRISLCELKEHIVKKTGVERSRRYFYYLGRFLSQKLTKSEFDKTCLRLLGRENL 60

Query: 61  WLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSPVIEDGNGDTGAVYPTSTQSI 120
            LHNQLI+SIL+NA  AK+ PP   AG+    +  A       DG   +G + P  +Q  
Sbjct: 61  SLHNQLIRSILRNATVAKSPPPDHEAGH----STKANAFQSRGDGLEQSGTLIPNHSQHE 120

Query: 121 PIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGD 180
           P+WSNG  P SPRK RSG+++RK +DRPSPLG NGKVE + HQ   ++D    + M NG 
Sbjct: 121 PVWSNGVLPISPRKVRSGMQNRKSRDRPSPLGSNGKVEHMLHQPVCREDNRGSVGMENG- 180

Query: 181 ATLCDYQRPVQHLQGVAELPENNIEARIRPAGKQVLNNKIHDEGTKVGDREEAGHSIHSG 240
               DYQR  ++   VA+  +      +RP  K  + NK       + D +         
Sbjct: 181 ----DYQRSGRY---VADEKDGEF---LRPVEKPRIPNKEKIAAVSMRDDQNQEEQARVN 240

Query: 241 LLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAVH 300
           L  S L+APLGIPFCSAS+GG+ +  P     + +S  D G L D E LR+RME IA   
Sbjct: 241 LSMSPLIAPLGIPFCSASVGGSPRTIPVSTNAELISCYDSGGLPDIEMLRKRMENIAVAQ 300

Query: 301 GLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGML 360
           GL  VS + A  LN +LDVYLK+LI SC  LVG      +P K    K Q Q K++NG+ 
Sbjct: 301 GLEGVSMECAKTLNNMLDVYLKKLINSCFDLVGARSTNGDPGKQRIGKQQSQNKIVNGVW 360

Query: 361 PNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFEE 420
           P N L  +  NGS ++  +H    SVS+LDF+ AMELNP+QLGEDWP L E+I +R+FEE
Sbjct: 361 PTNSLKIQTPNGSSDIRQDHH---SVSMLDFRTAMELNPRQLGEDWPTLRERISLRSFEE 402

BLAST of MS001255 vs. TAIR 10
Match: AT4G31440.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G24530.1); Has 210 Blast hits to 209 proteins in 55 species: Archae - 0; Bacteria - 72; Metazoa - 2; Fungi - 6; Plants - 128; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 281.2 bits (718), Expect = 1.4e-75
Identity = 181/421 (42.99%), Positives = 240/421 (57.01%), Query Frame = 0

Query: 1   MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNL 60
           MQ  Q  RIDL ELK  IVKK+G +RS RYF+YL RFLSQKL+K+EFDK C R+LGR+NL
Sbjct: 1   MQRLQDPRIDLAELKVHIVKKVGVERSTRYFYYLGRFLSQKLTKSEFDKSCFRLLGRENL 60

Query: 61  WLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSPVIEDGNGDTGAVYPTSTQSI 120
            LHN+LI+SIL+NA  AK+ P +  +G+P  S    K     EDG  ++ ++ P   ++ 
Sbjct: 61  SLHNKLIRSILRNASLAKSPPSVHQSGHPGKSLVLGK-----EDGPEESRSLNPDHIRND 120

Query: 121 PIWSNGGFPASPRKSRSG-IRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNG 180
              SNG       K R G   DR ++D+P PLG NGKV                      
Sbjct: 121 LALSNGVL----AKVRPGTCDDRTIRDKPCPLGSNGKV---------------------- 180

Query: 181 DATLCDYQRPVQHLQGVAELPENNIEARIRPAGKQVLNNKIHDEGTKVGDREEAGHSIHS 240
                 Y RP ++       P+    A + PA ++ ++ K       +   +EA   I  
Sbjct: 181 -LGPFAYSRPGRY-------PDERDSAFLCPAEQKAVSGK-DQVAAPISRDDEAQVRI-- 240

Query: 241 GLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAV 300
            L    ++APLGIPFCSAS+GG R+  P       +S  D G LSDTE LR+RME IA  
Sbjct: 241 -LSTPPVMAPLGIPFCSASVGGDRRTVPVSTSAAAISCYDSGGLSDTEMLRKRMENIAVT 300

Query: 301 HGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGM 360
            GLG VSA+ + +LN +LD+YLK+L++SCV L G   M   P K   +K Q + +++NG+
Sbjct: 301 QGLGGVSAECSIVLNNMLDLYLKKLMKSCVDLAGARSMNGTPGKHSLEKQQSRDELVNGV 360

Query: 361 LPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFE 420
             NN  H + SN   ++  E     SVSLLDF+VAMELNP QLGEDWPLL E+I +  FE
Sbjct: 361 RTNNSFHIQTSNQPSDITREQH---SVSLLDFRVAMELNPHQLGEDWPLLRERISISLFE 375

BLAST of MS001255 vs. TAIR 10
Match: AT4G33890.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G14850.1); Has 133 Blast hits to 131 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 2; Plants - 129; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 161.4 bits (407), Expect = 1.6e-39
Identity = 132/426 (30.99%), Positives = 210/426 (49.30%), Query Frame = 0

Query: 4   QQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLH 63
           Q S R+D  E+K+ I +++G  R++ YF  L RF + K++K+EFDKLC + +GR N+ LH
Sbjct: 5   QGSSRLDTLEIKALIYREIGNQRAESYFNQLGRFFALKITKSEFDKLCIKTIGRQNIHLH 64

Query: 64  NQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSPVIEDGNGDTGAVYPTSTQSIPIW 123
           N+LI+SI+KNAC AK+ P +   G              +  GNGD+      ++Q  P+ 
Sbjct: 65  NRLIRSIIKNACIAKSPPFIKKGG------------SFVRFGNGDS----KKNSQIQPLH 124

Query: 124 SNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECI---SHQSAGKKDGSCKMMMVNGD 183
            +  F  S RK RS    RKL+DRPSPLGP GK   +   + +S  K   + +++ +   
Sbjct: 125 GDSAFSPSTRKCRS----RKLRDRPSPLGPLGKPHSLTTTNEESMSKAQSATELLSLG-- 184

Query: 184 ATLCDYQRPVQHLQGVAELPENNIEARIRPAGKQVLNNKIHDEGTKVGDREEAGHSIHSG 243
                  RP   +  V                         +EG +V        S+ S 
Sbjct: 185 ------SRPPVEVVSV-------------------------EEGEEVEQIAGGSPSVQS- 244

Query: 244 LLRSRLLAPLGIPFCSASI-GGARKARPADFGGDFVSFS-----DIGHLSDTESLRRRME 303
             R  L APLG+   S S+  GA +   ++      SF+     + G L DT +LR R+E
Sbjct: 245 --RCPLTAPLGV---SMSLRNGATRKSVSNVSMCSRSFNRETCQNNGELPDTRTLRSRLE 304

Query: 304 QIAAVHGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGK 363
           +   + GL  ++ DS ++LN  LDV++++LI  C+ L  T        +  TD+++    
Sbjct: 305 RRLEMEGL-KITMDSVSLLNSGLDVFMRRLIEPCLSLANT--------RCGTDRVR---- 342

Query: 364 VINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIR 421
                    +++ +++  SR + +       VS+ DF+  MELN + LGEDWP+ +EKI 
Sbjct: 365 ---------EMNYQYTQQSRRLSY-------VSMSDFRAGMELNTEILGEDWPMHMEKIC 342

BLAST of MS001255 vs. TAIR 10
Match: AT4G33890.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G14850.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 161.4 bits (407), Expect = 1.6e-39
Identity = 132/426 (30.99%), Positives = 210/426 (49.30%), Query Frame = 0

Query: 4   QQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLH 63
           Q S R+D  E+K+ I +++G  R++ YF  L RF + K++K+EFDKLC + +GR N+ LH
Sbjct: 5   QGSSRLDTLEIKALIYREIGNQRAESYFNQLGRFFALKITKSEFDKLCIKTIGRQNIHLH 64

Query: 64  NQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSPVIEDGNGDTGAVYPTSTQSIPIW 123
           N+LI+SI+KNAC AK+ P +   G              +  GNGD+      ++Q  P+ 
Sbjct: 65  NRLIRSIIKNACIAKSPPFIKKGG------------SFVRFGNGDS----KKNSQIQPLH 124

Query: 124 SNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECI---SHQSAGKKDGSCKMMMVNGD 183
            +  F  S RK RS    RKL+DRPSPLGP GK   +   + +S  K   + +++ +   
Sbjct: 125 GDSAFSPSTRKCRS----RKLRDRPSPLGPLGKPHSLTTTNEESMSKAQSATELLSLG-- 184

Query: 184 ATLCDYQRPVQHLQGVAELPENNIEARIRPAGKQVLNNKIHDEGTKVGDREEAGHSIHSG 243
                  RP   +  V                         +EG +V        S+ S 
Sbjct: 185 ------SRPPVEVVSV-------------------------EEGEEVEQIAGGSPSVQS- 244

Query: 244 LLRSRLLAPLGIPFCSASI-GGARKARPADFGGDFVSFS-----DIGHLSDTESLRRRME 303
             R  L APLG+   S S+  GA +   ++      SF+     + G L DT +LR R+E
Sbjct: 245 --RCPLTAPLGV---SMSLRNGATRKSVSNVSMCSRSFNRETCQNNGELPDTRTLRSRLE 304

Query: 304 QIAAVHGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGK 363
           +   + GL  ++ DS ++LN  LDV++++LI  C+ L  T        +  TD+++    
Sbjct: 305 RRLEMEGL-KITMDSVSLLNSGLDVFMRRLIEPCLSLANT--------RCGTDRVR---- 342

Query: 364 VINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIR 421
                    +++ +++  SR + +       VS+ DF+  MELN + LGEDWP+ +EKI 
Sbjct: 365 ---------EMNYQYTQQSRRLSY-------VSMSDFRAGMELNTEILGEDWPMHMEKIC 342

BLAST of MS001255 vs. TAIR 10
Match: AT2G14850.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G33890.2); Has 140 Blast hits to 132 proteins in 17 species: Archae - 0; Bacteria - 0; Metazoa - 1; Fungi - 2; Plants - 133; Viruses - 0; Other Eukaryotes - 4 (source: NCBI BLink). )

HSP 1 Score: 155.2 bits (391), Expect = 1.1e-37
Identity = 128/414 (30.92%), Positives = 189/414 (45.65%), Query Frame = 0

Query: 8   RIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLI 67
           R++  E+K+ I +K+G  R+  YF  L +FL+ ++SK+EFDKLC + +GR+N+ LHN+L+
Sbjct: 9   RLNSLEIKALIYQKIGHQRADTYFDQLGKFLTSRISKSEFDKLCSKTVGRENISLHNRLV 68

Query: 68  QSILKNACQAKAAPPLPVAGYPKTSTQSAKVSPVIEDGNGDTGAVYPTSTQSIPIWSNGG 127
           +SILKNA  AK+ PP     YPK S                             ++ +  
Sbjct: 69  RSILKNASVAKSPPP----RYPKKS-----------------------------LYGDPV 128

Query: 128 FPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQ 187
           FP SPRK RS    RK +DRPSPLGP GK + ++                  D ++   Q
Sbjct: 129 FPPSPRKCRS----RKFRDRPSPLGPLGKPQSLT---------------TTNDESMSKAQ 188

Query: 188 RPVQHLQGVAELPENNIEARIRPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLL 247
           R          LP   +                 ++G +V ++     S+ S   RS L 
Sbjct: 189 R----------LPMEVVSV---------------EDGEEV-EQMTGSPSVQS---RSPLT 248

Query: 248 APLGIPFCSASIGGARKARPADFGG-DFVSFSDIGHLSDTESLRRRMEQIAAVHGLGSVS 307
           APLG+ F   S     KAR + + G +  +    G L D  +LR R+E+   + G+  +S
Sbjct: 249 APLGVSFHLKS-----KARFSTYNGINRETCQSSGELPDMITLRARLEKKLEMEGI-KLS 291

Query: 308 ADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLH 367
            DSAN+LN+ L+ Y+++LI  C+ L                                   
Sbjct: 309 MDSANLLNRGLNAYMRRLIEPCLSLAS--------------------------------- 291

Query: 368 GRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFEE 421
                       + R   +VS+LDF  AME+NP+ LGE+WP+ LEKI  RA EE
Sbjct: 369 -----------QQKRAVSNVSMLDFHAAMEVNPRVLGEEWPIQLEKICCRASEE 291
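
The percentages in the HSP header of the alignment above follow directly from the raw counts (128 identical and 189 positive positions over an alignment length of 414). The following is a minimal Python sketch for recovering those statistics from the header text, assuming the header has the textual layout shown in this record. The function name `parse_hsp_header` is hypothetical and not part of any BLAST tool.

```python
import re

def parse_hsp_header(text):
    """Extract score, E-value and identity/positive statistics from an HSP header.

    Assumes the layout used in this record, e.g.
    'HSP 1 Score: 155.2 bits (391), Expect = 1.1e-37' followed by
    'Identity = 128/414 (30.92%), Positives = 189/414 (45.65%), ...'.
    """
    score = re.search(r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\)", text)
    expect = re.search(r"Expect\s*=\s*([\d.eE+-]+)", text)
    ident = re.search(r"Identity\s*=\s*(\d+)/(\d+)", text)
    # Accept both 'Positives' and the misspelling 'Postives'.
    posit = re.search(r"Posi?tives\s*=\s*(\d+)/(\d+)", text)
    if not (score and expect and ident and posit):
        raise ValueError("unrecognised HSP header layout")

    n_ident, aln_len = int(ident.group(1)), int(ident.group(2))
    n_posit = int(posit.group(1))
    return {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(expect.group(1)),
        "alignment_length": aln_len,
        # Percentages are recomputed from the counts, not read from the text.
        "identity_pct": round(100 * n_ident / aln_len, 2),
        "positives_pct": round(100 * n_posit / aln_len, 2),
    }

header = (
    "HSP 1 Score: 155.2 bits (391), Expect = 1.1e-37\n"
    "Identity = 128/414 (30.92%), Positives = 189/414 (45.65%), Query Frame = 0"
)
stats = parse_hsp_header(header)
print(stats["identity_pct"], stats["positives_pct"])
```

Recomputing the percentages from the counts, rather than trusting the printed values, gives a quick consistency check on each HSP.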

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_022132327.1 | 1.8e-239 | 99.29 | uncharacterized protein LOC111005206 [Momordica charantia]
XP_023546134.1 | 6.0e-190 | 82.90 | uncharacterized protein LOC111805335 [Cucurbita pepo subsp. pepo]
XP_022997521.1 | 3.0e-189 | 82.90 | uncharacterized protein LOC111492414 [Cucurbita maxima]
XP_008445087.1 | 8.7e-189 | 82.42 | PREDICTED: uncharacterized protein LOC103488231 [Cucumis melo] >KAA0064996.1 SAG...
KAG7029751.1 | 1.6e-187 | 82.42 | hypothetical protein SDJN02_08093, partial [Cucurbita argyrosperma subsp. argyro...
Match Name | E-value | Identity | Description
A0A6J1BTJ5 | 8.9e-240 | 99.29 | uncharacterized protein LOC111005206 OS=Momordica charantia OX=3673 GN=LOC111005...
A0A6J1K7Q1 | 1.4e-189 | 82.90 | uncharacterized protein LOC111492414 OS=Cucurbita maxima OX=3661 GN=LOC111492414...
A0A5A7VF96 | 4.2e-189 | 82.42 | SAGA-Tad1 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6...
A0A1S3BCQ5 | 4.2e-189 | 82.42 | uncharacterized protein LOC103488231 OS=Cucumis melo OX=3656 GN=LOC103488231 PE=...
A0A0A0LM32 | 1.0e-187 | 82.19 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G378510 PE=4 SV=1
Match Name | E-value | Identity | Description
AT2G24530.1 | 2.7e-95 | 47.38 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G31440.1 | 1.4e-75 | 42.99 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G33890.1 | 1.6e-39 | 30.99 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G33890.2 | 1.6e-39 | 30.99 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G14850.1 | 1.1e-37 | 30.92 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (TR) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR024738 | Transcriptional coactivator Hfi1/Transcriptional adapter 1 | PFAM | PF12767 | SAGA-Tad1 | coord: 5..332; e-value: 2.7E-61; score: 207.3
IPR024738 | Transcriptional coactivator Hfi1/Transcriptional adapter 1 | PANTHER | PTHR21277 | TRANSCRIPTIONAL ADAPTER 1 | coord: 1..420
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 123..161
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 135..149
None | No IPR available | PANTHER | PTHR21277:SF38 | TRANSCRIPTIONAL REGULATOR OF RNA POLII, SAGA, SUBUNIT | coord: 1..420
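
The `coord: start..end` fields in the InterPro rows above give residue ranges on the protein. As a minimal sketch, assuming those ranges are 1-based and inclusive (the record does not state the convention), the span lengths and nesting of features can be checked as follows. The helper names `parse_coord`, `span_length` and `contains` are hypothetical.

```python
import re

def parse_coord(field):
    """Return (start, end) from a field containing 'coord: start..end'."""
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", field)
    if m is None:
        raise ValueError(f"no coordinate range found in {field!r}")
    return int(m.group(1)), int(m.group(2))

def span_length(start, end):
    """Number of residues covered, assuming 1-based inclusive coordinates."""
    return end - start + 1

def contains(outer, inner):
    """True if the range 'inner' lies entirely within the range 'outer'."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]

pfam = parse_coord("coord: 5..332")        # PF12767 SAGA-Tad1 row
disorder_a = parse_coord("coord: 123..161")  # first mobidb-lite row
disorder_b = parse_coord("coord: 135..149")  # second mobidb-lite row

print(span_length(*pfam))                 # residues covered by the Pfam match
print(contains(disorder_a, disorder_b))   # is the shorter disorder range nested?
print(contains(pfam, disorder_a))         # does the Pfam match span the disorder?
```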

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
MS001255.1 | MS001255.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
cellular_component | GO:0070461 | SAGA-type complex